Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc Bellemare · Reincarnating RL: Reusing Prior Computation to Accelerate Progress · SlidesLive

Reincarnating RL: Reusing Prior Computation to Accelerate Progress

Nov 28, 2022

Speakers

Rishabh Agarwal

Max Schwarzer

Pablo Samuel Castro

About

Learning tabula rasa, that is without any prior knowledge, is the prevalent workflow in reinforcement learning (RL) research. However, RL systems, when applied to large-scale settings, rarely operate tabula rasa. Such large-scale systems undergo multiple design or algorithmic changes during their development cycle and use ad hoc approaches for incorporating these changes without re-training from scratch, which would have been prohibitively expensive. Additionally, the inefficiency of deep RL typ…

Organizer

NeurIPS 2022
