Keiran Paster, Sheila McIlraith, Jimmy Ba · You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments · SlidesLive

Nov 28, 2022

Speakers

Keiran Paster

Speaker · 0 followers

Sheila McIlraith

Speaker · 0 followers

Jimmy Ba

Speaker · 2 followers

About

Recently, methods such as Decision Transformer that reduce reinforcement learning to a prediction task and solve it via supervised learning (RvS) have become popular due to their simplicity, robustness to hyperparameters, and strong overall performance on offline RL tasks. However, simply conditioning a probabilistic model on a desired return and taking the predicted action can fail dramatically in stochastic environments since trajectories that result in a return may have only achieved that ret…
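
The failure described in the abstract can be illustrated with a toy sketch. It is not the authors' experiment: the one-step environment, its payoffs, and every function name below are assumptions made up for illustration. A tabular stand-in for a return-conditioned model is fit to offline data and then asked for the action associated with the highest observed return.

```python
import random
from collections import Counter

# Hypothetical one-step environment (an assumption for illustration only):
# "safe" always pays 0.8; "gamble" pays 1.0 with probability 0.1, else -1.0.
def step(action, rng):
    if action == "safe":
        return 0.8
    return 1.0 if rng.random() < 0.1 else -1.0

def collect(n, rng):
    """Offline dataset of (action, return) pairs from a uniform random behavior policy."""
    data = []
    for _ in range(n):
        action = rng.choice(["safe", "gamble"])
        data.append((action, step(action, rng)))
    return data

def return_conditioned_policy(data, target_return):
    """Empirical P(action | return == target_return): a tabular analogue of
    conditioning a learned model on a desired return."""
    counts = Counter(a for a, r in data if r == target_return)
    total = sum(counts.values())
    return {a: c / total for a, c in counts.items()}

def mean_return(data, action):
    """Average return actually obtained by an action in the dataset."""
    returns = [r for a, r in data if a == action]
    return sum(returns) / len(returns)

rng = random.Random(0)
dataset = collect(10_000, rng)

# Condition on the best return seen in the data and read off the action.
policy = return_conditioned_policy(dataset, target_return=1.0)
print("P(action | return = 1.0):", policy)
print("mean return of gamble:", mean_return(dataset, "gamble"))
print("mean return of safe:", mean_return(dataset, "safe"))
```

In this sketch only lucky "gamble" episodes ever reach a return of 1.0, so conditioning on that return selects "gamble" every time, even though its average return in the same dataset is well below that of "safe". The high return was a property of the environment's randomness, not of the action.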

Organizer

NeurIPS 2022

Account · 961 followers
