Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: In-context Reinforcement Learning with Algorithm Distillation
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            In-context Reinforcement Learning with Algorithm Distillation
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            In-context Reinforcement Learning with Algorithm Distillation

            Dec 2, 2022

            Speakers

            ML

            Michael Laskin

            Speaker · 0 followers

            LW

            Luyu Wang

            Speaker · 0 followers

            JO

            Junhyuk Oh

            Speaker · 0 followers

            About

            We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transformer is trained by autoregressively predicting actions given their preceding learning histories as…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Beyond Spectral Gap: The role of topology in decentralized learning
            05:01

            Beyond Spectral Gap: The role of topology in decentralized learning

            Thijs Vogels, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
            12:08

            VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

            Jason Ma, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Graph Neural Networks with Adaptive Readouts
            00:34

            Graph Neural Networks with Adaptive Readouts

            David Buterez, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
            05:30

            CLUTR: Curriculum Learning via Unsupervised Task Representation Learning

            Abdus Salam Azad, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            LLM.int8():  8-bit Matrix Multiplication for Transformers at Scale
            04:30

            LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

            Mike Lewis, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Aligning individual brains with fused unbalanced Gromov Wasserstein
            04:56

            Aligning individual brains with fused unbalanced Gromov Wasserstein

            Alexis Thual, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022