Anton Bakhtin, David X. Wu, Adam Lerer, Jonathan Gray, Athul P. Jacob, Gabriele Farina, Alexander Miller, Noam Brown · Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-003-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-003-alpha.b-cdn.net
sl-yoda-v2-stream-003-beta.b-cdn.net
1544410162.rsc.cdn77.org
1005514182.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Dec 2, 2022

Speakers

Anton Bakhtin

Speaker · 0 followers

David X. Wu

Speaker · 0 followers

Adam Lerer

Speaker · 0 followers

About

No-press Diplomacy is a complex strategy game involving both cooperation and competition that has served as a benchmark for multi-agent AI research. While self-play reinforcement learning has resulted in numerous successes in purely adversarial games like chess, Go, and poker, self-play alone is insufficient for achieving optimal performance in domains involving cooperation with humans. We address this shortcoming by first introducing a planning algorithm we call DiL-piKL that regularizes a rewa…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Cross-Linked Unified Embedding for cross-modality representation learning

04:43

Cross-Linked Unified Embedding for cross-modality representation learning

Watch later

Favorite

Xinming Tu, …

NeurIPS 2022 2 years ago

Learning Semantics-Aware Locomotion Skills from Human Demonstrations

05:18

Learning Semantics-Aware Locomotion Skills from Human Demonstrations

Watch later

Favorite

Yuxiang Yang, …

NeurIPS 2022 2 years ago

Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding

06:04

Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding

Watch later

Favorite

Eslam Mohamed Bakr, …

NeurIPS 2022 2 years ago

The Gyro-Structure of Some Matrix Manifolds

04:55

The Gyro-Structure of Some Matrix Manifolds

Watch later

Favorite

Xuan Son Nguyen

NeurIPS 2022 2 years ago

Learning Optical Flow From Continuous Spike Streams

04:50

Learning Optical Flow From Continuous Spike Streams

Watch later

Favorite

NeurIPS 2022 2 years ago

Data Augmentation MCMC for Bayesian Inference from Privatized Data

01:03

Data Augmentation MCMC for Bayesian Inference from Privatized Data

Watch later

Favorite

Nianqiao Ju, …

NeurIPS 2022 2 years ago