Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: In-context Reinforcement Learning with Algorithm Distillation
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            In-context Reinforcement Learning with Algorithm Distillation
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            In-context Reinforcement Learning with Algorithm Distillation

            Dez 2, 2022

            Sprecher:innen

            ML

            Michael Laskin

            Řečník · 0 sledujících

            LW

            Luyu Wang

            Řečník · 0 sledujících

            JO

            Junhyuk Oh

            Řečník · 0 sledujících

            Über

            We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transformer is trained by autoregressively predicting actions given their preceding learning histories as…

            Organisator

            N2
            N2

            NeurIPS 2022

            Účet · 962 sledujících

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Lifelong Learning Machines Tutorial
            20:07

            Lifelong Learning Machines Tutorial

            Tyler Hayes, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 3 diváků, což je 0.3 %

            Towards Algorithmic Fairness in Space-Time: Filling in Black Holes
            04:28

            Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

            Subho Majumdar, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Chromatic Correlation Clustering, Revisited
            04:39

            Chromatic Correlation Clustering, Revisited

            Qing Xiu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Generative Visual Prompt: Unified Distributional Control of Pre-Trained Generative Vision Models
            04:07

            Generative Visual Prompt: Unified Distributional Control of Pre-Trained Generative Vision Models

            Chen Henry Wu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Potential Energy based Mixture Model for Noisy Label Learning
            05:15

            Potential Energy based Mixture Model for Noisy Label Learning

            Wenbin Yang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
            04:49

            Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

            Benjamin Fuhrer, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interessiert an Vorträgen wie diesem? NeurIPS 2022 folgen