Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-004-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-004-alpha.b-cdn.net
      • sl-yoda-v2-stream-004-beta.b-cdn.net
      • 1685195716.rsc.cdn77.org
      • 1239898752.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments

            Nov 28, 2022

            Speakers

            KP

            Keiran Paster

            Řečník · 0 sledujících

            SM

            Sheila McIlraith

            Řečník · 0 sledujících

            JB

            Jimmy Ba

            Řečník · 2 sledující

            About

            Recently, methods such as Decision Transformer that reduce reinforcement learning to a prediction task and solve it via supervised learning (RvS) have become popular due to their simplicity, robustness to hyperparameters, and strong overall performance on offline RL tasks. However, simply conditioning a probabilistic model on a desired return and taking the predicted action can fail dramatically in stochastic environments since trajectories that result in a return may have only achieved that ret…

            Organizer

            N2
            N2

            NeurIPS 2022

            Účet · 961 sledujících

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Closing remarks: Memory in Artificial and Real Intelligence (MemARI)
            00:46

            Closing remarks: Memory in Artificial and Real Intelligence (MemARI)

            Mariya Toneva

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Parameter-free Regret in High Probability with Heavy Tails
            05:07

            Parameter-free Regret in High Probability with Heavy Tails

            Jiujia Zhang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Diversity Boosted Learning for Domain Generalization with A Large Number of Domains
            05:37

            Diversity Boosted Learning for Domain Generalization with A Large Number of Domains

            Xi Leng, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification
            04:57

            MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification

            Peirong Zhang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Contrastive Graph Structure Learning via Information Bottleneck for Recommendation
            04:59

            Contrastive Graph Structure Learning via Information Bottleneck for Recommendation

            Chunyu Wei, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models
            09:35

            Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models

            Kartik Sharma, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interested in talks like this? Follow NeurIPS 2022