Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen, Sarath Chandar · Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

Dec 2, 2022

Speakers

Ali Rahimi-Kalahroudi

Speaker · 0 followers

Janarthanan Rajendran

Speaker · 0 followers

Ida Momennejad

Speaker · 0 followers

About

One of the key behavioral characteristics used in neuroscience to determine whether the subject of study—be it a rodent or a human—exhibits model-based learning is effective adaptation to local changes in the environment. In reinforcement learning, however, recent work has shown that modern deep model-based reinforcement-learning (MBRL) methods adapt poorly to such changes. An explanation for this mismatch is that MBRL methods are typically designed with sample-efficiency on a single task in min…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Dimenison-Reduced Adaptive Gradient Method

05:30

Dimenison-Reduced Adaptive Gradient Method

Watch later

Favorite

Jingyang Li, …

NeurIPS 2022 2 years ago

Mildly Conservative Q-Learning for Offline Reinforcement Learning

04:47

Mildly Conservative Q-Learning for Offline Reinforcement Learning

Watch later

Favorite

Jiafei Lyu, …

NeurIPS 2022 2 years ago

General Cutting Planes for Bound-Propagation-Based Neural Network Verification

05:29

General Cutting Planes for Bound-Propagation-Based Neural Network Verification

Watch later

Favorite

Huan Zhang, …

NeurIPS 2022 2 years ago

Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments

05:12

Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments

Watch later

Favorite

Dolton Fernandes, …

NeurIPS 2022 2 years ago

Feasible Adversarial Robust Reinforcement Learning

05:03

Feasible Adversarial Robust Reinforcement Learning

Watch later

Favorite

NeurIPS 2022 2 years ago

Private Synthetic Data for Multitask Learning and Marginal Queries

05:00

Private Synthetic Data for Multitask Learning and Marginal Queries

Watch later

Favorite

Giuseppe Vietri, …

NeurIPS 2022 2 years ago