Jul 24, 2023
Speaker · 0 followers
Reinforcement learning algorithms require a long time to learn policies on complex tasks because they need large amounts of training data. With recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up thousands of times on a commodity GPU. Most prior work has used on-policy methods such as PPO to train policies because of their simplicity and scalability. Off-policy methods are usually more sample-efficient but harder to scale up, resulting in much longer wall-clock training time in practice. In this work, we present a novel Parallel Q-Learning (PQL) scheme that is substantially faster in wall-clock time than PPO while achieving better sample efficiency. The key enabling factor is parallelizing data collection, policy function learning, and value function learning. Unlike prior work on distributed off-policy learning, such as Ape-X, our scheme is designed specifically for massively parallel GPU-based simulation and optimized to run on a single workstation. We demonstrate that Q-learning methods can be scaled up to tens of thousands of parallel environments, and we investigate important factors that affect policy learning speed, including the number of parallel environments, exploration schemes, batch size, and GPU model.
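The abstract describes the ingredients only at a high level: batched simulation across many environments, a shared replay buffer, and off-policy Q-updates. The toy NumPy sketch below illustrates those ingredients under stated assumptions — the `VecEnv` point-mass task, the linear Q-function, and all names are hypothetical stand-ins, not the authors' implementation, and collection and learning run serially here, whereas PQL runs them in parallel on the GPU. The behavior policy is uniformly random to highlight the off-policy nature of Q-learning.

```python
import numpy as np

rng = np.random.default_rng(0)

class VecEnv:
    """Toy batched 1-D point-mass task: a hypothetical stand-in for a
    GPU simulator such as Isaac Gym. One call steps all N envs at once."""
    def __init__(self, num_envs):
        self.num_envs = num_envs
        self.state = rng.standard_normal(num_envs)

    def step(self, actions):
        self.state = self.state + 0.1 * actions          # vectorized dynamics
        reward = -np.abs(self.state)                     # reward: stay near origin
        reset = rng.random(self.num_envs) < 0.05         # random episode resets
        self.state = np.where(reset, rng.standard_normal(self.num_envs), self.state)
        return self.state.copy(), reward

ACTIONS = np.array([-1.0, 1.0])

def features(s, a):
    # Linear Q-function features [s, a, s*a, 1]; the s*a term lets the
    # greedy policy depend on the sign of the state.
    return np.stack([s, a, s * a, np.ones_like(s)], axis=1)

def q_all(s, w):
    # Q(s, a) for both discrete actions, shape (N, 2).
    return np.stack([features(s, np.full_like(s, a)) @ w for a in ACTIONS], axis=1)

num_envs, gamma, lr = 1024, 0.5, 0.1
env = VecEnv(num_envs)
w = np.zeros(4)
S, A, R, S2 = [], [], [], []                             # flat replay buffer

for it in range(300):
    s = env.state.copy()
    a = rng.choice(ACTIONS, size=num_envs)               # random behavior policy (off-policy)
    s2, r = env.step(a)
    for buf, x in zip((S, A, R, S2), (s, a, r, s2)):     # one insert stores N transitions
        buf.extend(x.tolist())
    # Q-learning update on a replayed minibatch. Here it runs after each
    # collection step; PQL instead overlaps collection and learning.
    idx = rng.integers(len(S), size=512)
    bs, ba, br, bs2 = (np.asarray(b)[idx] for b in (S, A, R, S2))
    target = br + gamma * q_all(bs2, w).max(axis=1)      # bootstrapped TD target
    td_err = target - features(bs, ba) @ w
    w += lr * (td_err[:, None] * features(bs, ba)).mean(axis=0)

def greedy_action(s):
    return ACTIONS[q_all(np.array([s]), w).argmax(axis=1)][0]
```

After training, the greedy policy pushes the state toward the origin: for a positive state it picks the negative action and vice versa. The sketch also shows why batched simulation matters for off-policy methods — each environment step contributes `num_envs` transitions to the replay buffer at the cost of one vectorized call.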