Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-004-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-004-alpha.b-cdn.net
      • sl-yoda-v2-stream-004-beta.b-cdn.net
      • 1685195716.rsc.cdn77.org
      • 1239898752.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

            Jul 24, 2023

            Speakers

            YZ

            Yulai Zhao

            Speaker · 0 followers

            ZY

            Zhuoran Yang

            Speaker · 2 followers

            ZW

            Zhaoran Wang

            Speaker · 1 follower

            About

            Policy optimization methods with function approximation are widely used in multi-agent reinforcement learning. However, it remains elusive how to design such algorithms with statistical guarantees. Leveraging a multi-agent performance difference lemma that characterizes the landscape of multi-agent policy optimization, we find that the localized action value function serves as an ideal descent direction for each local policy. Motivated by the observation, we present a multi-agent PPO algorithm i…

            Organizer

            I2
            I2

            ICML 2023

            Account · 657 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            OCD: Learning to Overfit with Conditional Diffusion Models
            05:41

            OCD: Learning to Overfit with Conditional Diffusion Models

            Shahar Lutati, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Information-Theoretic State Space Model for Multi-View Reinforcement Learning
            06:58

            Information-Theoretic State Space Model for Multi-View Reinforcement Learning

            HyeongJoo Hwang, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Pretraining Language Models with Human Preferences
            09:25

            Pretraining Language Models with Human Preferences

            Tomek Korbak, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
            05:09

            Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

            Zichang Liu, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Language Instructed RL for Human-AI Coordination
            05:12

            Language Instructed RL for Human-AI Coordination

            Hengyuan Hu, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy
            05:17

            Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy

            Xing Liu, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023