Aviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine · Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes · SlidesLive

Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes

Dec 2, 2022

Speakers

Aviral Kumar

Rishabh Agarwal

Xinyang Geng

About

The potential of offline reinforcement learning (RL) is that high-capacity models trained on large, heterogeneous datasets can lead to agents that generalize broadly, analogously to similar advances in vision and NLP. However, recent works argue that offline RL methods encounter unique challenges to scaling up model capacity. Drawing on the learnings from these works, we re-examine previous design choices and find that with appropriate choices: ResNets, cross-entropy based distributional backups…

Organizer

NeurIPS 2022
