Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Converging to Unexploitable Policies in Continuous Control Adversarial Games
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-008-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-008-alpha.b-cdn.net
      • sl-yoda-v2-stream-008-beta.b-cdn.net
      • 1159783934.rsc.cdn77.org
      • 1511376917.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Converging to Unexploitable Policies in Continuous Control Adversarial Games
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Converging to Unexploitable Policies in Continuous Control Adversarial Games

            Dec 2, 2022

            Speakers

            MG

            Maxwell Goldstein

            Speaker · 0 followers

            NB

            Noam Brown

            Speaker · 0 followers

            About

            Fictitious Self-Play (FSP) is an iterative algorithm capable of learning approximate Nash equilibria in many types of two-player zero-sum games. In FSP, at each iteration, a best response is learned to the opponent's meta strategy. However, FSP can be slow to converge in continuous control games in which two embodied agents compete against one another. We propose Adaptive FSP (AdaptFSP), a deep reinforcement learning (RL) algorithm inspired by FSP. The main idea is that instead of training a bes…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Generalization Bounds for Gradient Methods via Discrete and Continuous Prior
            04:51

            Generalization Bounds for Gradient Methods via Discrete and Continuous Prior

            Xuanyuan Luo, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Alleviating “Posterior Collapse” in Deep Topic Models via Policy Gradient
            04:36

            Alleviating “Posterior Collapse” in Deep Topic Models via Policy Gradient

            Yewen Li, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            RegCLR: A Self-Supervised Framework for Tabular Representation Learning in the Wild
            09:37

            RegCLR: A Self-Supervised Framework for Tabular Representation Learning in the Wild

            Byung-Hak Kim

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation
            09:10

            Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation

            Raghul Parthipan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Influencing Long-Term Behavior in Multiagent Reinforcement Learning
            04:57

            Influencing Long-Term Behavior in Multiagent Reinforcement Learning

            Dong-Ki Kim, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Imagenary Patterns with Diffusion Models
            28:09

            Imagenary Patterns with Diffusion Models

            Mohammad Norouzi

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022