Zechu Li, Tao Chen, Zhang-Wei Hong, Anurag Ajay, Pulkit Agrawal · Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning · SlidesLive


Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning

Jul 24, 2023

Speakers

Zechu Li

Tao Chen

Zhang-Wei Hong

About

Reinforcement learning algorithms take a long time to learn policies on complex tasks because they need a large amount of training data. With recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up by thousands of times on a commodity GPU. Most prior work has used on-policy methods such as PPO to train policies because they are simple and easy to scale. Off-policy methods are usually more sample-efficient but harder to scale up, r…

Organizer

ICML 2023

