Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning

            Dec 2, 2022

            Speakers

            CK

            Chaitanya Kharyal

            Speaker · 0 followers

            TKS

            Tanmay Kumar Sinha

            Speaker · 0 followers

            SKG

            Sai Krishna Gottipati

            Speaker · 0 followers

            About

            A long-running challenge in the reinforcement learning (RL) community has been to train a goal-conditioned agent in a sparse reward environment such that it could also generalize to other unseen goals. Empirical results in Fetch-Reach and a novel driving simulator demonstrate that our proposed algorithm, Multi-Teacher Asymmetric Self-Play, allows one agent (i.e., a teacher) to create a successful curriculum for another agent (i.e., the student). Surprisingly, results also show that training with…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
            04:49

            LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

            Daejin Jo, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Adjoint-aided inference of Gaussian process driven differential equations
            04:30

            Adjoint-aided inference of Gaussian process driven differential equations

            Paterne Gahungu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Information bottleneck theory of high-dimensional regression
            05:11

            Information bottleneck theory of high-dimensional regression

            Vudtiwat Ngampruetikorn, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Closing Remarks
            02:22

            Closing Remarks

            Roshan Rao

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Panel Discussion - What Role Should Empiricism Play in Building AI?
            50:38

            Panel Discussion - What Role Should Empiricism Play in Building AI?

            Samy Bengio, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            UniGAN: Reducing Mode Collapse in GANs using a Uniform Generator
            05:04

            UniGAN: Reducing Mode Collapse in GANs using a Uniform Generator

            Ziqi Pan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022