Max Sobol Mark, Ali Ghadirzadeh, Xi Chen, Chelsea Finn · Fine-tuning Offline Policies with Optimistic Action Selection · SlidesLive

Fine-tuning Offline Policies with Optimistic Action Selection

Dec 2, 2022

Speakers

Max Sobol Mark

Ali Ghadirzadeh

Xi Chen

About

Offline reinforcement learning algorithms can train performant policies for hard tasks using previously-collected datasets. However, the quality of the offline dataset often limits the levels of performance possible. We consider the problem of improving offline policies through online fine-tuning. Offline RL requires a pessimistic training objective to mitigate distributional shift between the trained policy and the offline behavior policy, which will make the trained policy averse to picking no…

Organizer

NeurIPS 2022
