Yulai Zhao, Zhuoran Yang, Zhaoran Wang, Jason D. Lee · Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Jul 24, 2023

Speakers

Yulai Zhao

Speaker · 0 followers

Zhuoran Yang

Speaker · 2 followers

Zhaoran Wang

Speaker · 1 follower

About

Policy optimization methods with function approximation are widely used in multi-agent reinforcement learning. However, it remains elusive how to design such algorithms with statistical guarantees. Leveraging a multi-agent performance difference lemma that characterizes the landscape of multi-agent policy optimization, we find that the localized action value function serves as an ideal descent direction for each local policy. Motivated by the observation, we present a multi-agent PPO algorithm i…

Organizer

ICML 2023

Account · 657 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

OCD: Learning to Overfit with Conditional Diffusion Models

05:41

OCD: Learning to Overfit with Conditional Diffusion Models

Watch later

Favorite

Shahar Lutati, …

ICML 2023 2 years ago

Information-Theoretic State Space Model for Multi-View Reinforcement Learning

06:58

Information-Theoretic State Space Model for Multi-View Reinforcement Learning

Watch later

Favorite

HyeongJoo Hwang, …

ICML 2023 2 years ago

Pretraining Language Models with Human Preferences

09:25

Pretraining Language Models with Human Preferences

Watch later

Favorite

Tomek Korbak, …

ICML 2023 2 years ago

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

05:09

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

Watch later

Favorite

Zichang Liu, …

ICML 2023 2 years ago

Language Instructed RL for Human-AI Coordination

05:12

Language Instructed RL for Human-AI Coordination

Watch later

Favorite

Hengyuan Hu, …

ICML 2023 2 years ago

Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy

05:17

Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy

Watch later

Favorite

ICML 2023 2 years ago