Maxwell Goldstein, Noam Brown · Converging to Unexploitable Policies in Continuous Control Adversarial Games · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Converging to Unexploitable Policies in Continuous Control Adversarial Games

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-008-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-008-alpha.b-cdn.net
sl-yoda-v2-stream-008-beta.b-cdn.net
1159783934.rsc.cdn77.org
1511376917.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Converging to Unexploitable Policies in Continuous Control Adversarial Games

Converging to Unexploitable Policies in Continuous Control Adversarial Games

Dec 2, 2022

Speakers

Maxwell Goldstein

Speaker · 0 followers

Noam Brown

Speaker · 0 followers

About

Fictitious Self-Play (FSP) is an iterative algorithm capable of learning approximate Nash equilibria in many types of two-player zero-sum games. In FSP, at each iteration, a best response is learned to the opponent's meta strategy. However, FSP can be slow to converge in continuous control games in which two embodied agents compete against one another. We propose Adaptive FSP (AdaptFSP), a deep reinforcement learning (RL) algorithm inspired by FSP. The main idea is that instead of training a bes…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Generalization Bounds for Gradient Methods via Discrete and Continuous Prior

04:51

Generalization Bounds for Gradient Methods via Discrete and Continuous Prior

Watch later

Favorite

Xuanyuan Luo, …

NeurIPS 2022 2 years ago

Alleviating “Posterior Collapse” in Deep Topic Models via Policy Gradient

04:36

Alleviating “Posterior Collapse” in Deep Topic Models via Policy Gradient

Watch later

Favorite

NeurIPS 2022 2 years ago

RegCLR: A Self-Supervised Framework for Tabular Representation Learning in the Wild

09:37

RegCLR: A Self-Supervised Framework for Tabular Representation Learning in the Wild

Watch later

Favorite

NeurIPS 2022 2 years ago

Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation

09:10

Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation

Watch later

Favorite

Raghul Parthipan, …

NeurIPS 2022 2 years ago

Influencing Long-Term Behavior in Multiagent Reinforcement Learning

04:57

Influencing Long-Term Behavior in Multiagent Reinforcement Learning

Watch later

Favorite

Dong-Ki Kim, …

NeurIPS 2022 2 years ago

Imagenary Patterns with Diffusion Models

28:09

Imagenary Patterns with Diffusion Models

Watch later

Favorite

Mohammad Norouzi

NeurIPS 2022 2 years ago