Zechu Li, Tao Chen, Zhang-Wei Hong, Anurag Ajay, Pulkit Agrawal · Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning · SlidesLive

Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning

Jul 24, 2023

Speakers

Zechu Li

Speaker · 0 followers

Tao Chen

Speaker · 0 followers

Zhang-Wei Hong

Speaker · 0 followers

About

Reinforcement learning algorithms take a long time to learn policies on complex tasks because they need a large amount of training data. With recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up thousands of times on a commodity GPU. Most prior works have used on-policy methods such as PPO to train policies because they are simple and easy to scale. Off-policy methods are usually more sample-efficient but more challenging to scale up, r…

Organizer

ICML 2023

Account · 657 followers
