Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Transformer-based World Models Are Happy With 100k Interactions
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Transformer-based World Models Are Happy With 100k Interactions
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Transformer-based World Models Are Happy With 100k Interactions

            Dec 2, 2022

            Speakers

            JR

            Jan Robine

            Speaker · 0 followers

            MH

            Marc Höftmann

            Speaker · 0 followers

            TU

            Tobias Uelwer

            Speaker · 0 followers

            About

            Deep neural networks have been successful in many reinforcement learning settings. However, compared to human learners they are overly data hungry. To build a sample-efficient world model, we apply a transformer to real-world episodes in an autoregressive manner: not only the compact latent states and the taken actions but also the experienced or predicted rewards are fed into the transformer, so that it can attend flexibly to all three modalities at different time steps. The transformer allows…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
            04:51

            Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference
            02:04

            Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference

            Hao Sun, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Introduction to Algorithmic Fairness
            53:14

            Introduction to Algorithmic Fairness

            Golnoosh Farnadi

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Few-Shot Calibration of Set Predictors via Meta-Learned Cross-Validation-Based Conformal Prediction
            05:00

            Few-Shot Calibration of Set Predictors via Meta-Learned Cross-Validation-Based Conformal Prediction

            Sangwoo Park, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
            04:44

            Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback

            Tiancheng Jin, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Variational Context Adjustment for Temporal Event Prediction under Distribution Shifts
            04:15

            Variational Context Adjustment for Temporal Event Prediction under Distribution Shifts

            Chenxiao Yang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022