Hengyuan Hu, David X. Wu, Adam Lerer, Jakob Foerster, Noam Brown · Human-AI Coordination via Human-Regularized Search and Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Human-AI Coordination via Human-Regularized Search and Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-002-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-002-alpha.b-cdn.net
sl-yoda-v2-stream-002-beta.b-cdn.net
1001562353.rsc.cdn77.org
1075090661.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Human-AI Coordination via Human-Regularized Search and Learning

Human-AI Coordination via Human-Regularized Search and Learning

Dec 2, 2022

Speakers

Hengyuan Hu

Speaker · 0 followers

David X. Wu

Speaker · 0 followers

Adam Lerer

Speaker · 0 followers

About

We consider the problem of making AI agents that collaborate well with humans in partially observable fully cooperative environments given datasets of human behavior. Inspired by piKL, a human-data-regularized search method that improves upon a behavioral cloning policy without diverging far away from it, we develop a three-step algorithm that achieve strong performance in coordinating with real humans in the Hanabi benchmark. We first use a regularized search algorithm and behavioral cloning to…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

SCONE: Surface Coverage Optimization in uNknown Environments by Volumetric Integration

05:01

SCONE: Surface Coverage Optimization in uNknown Environments by Volumetric Integration

Watch later

Favorite

Antoine Guédon, …

NeurIPS 2022 2 years ago

CausalIML Challenge: Causal Insights for Learning Paths in Education

12:51

CausalIML Challenge: Causal Insights for Learning Paths in Education

Watch later

Favorite

Wenbo Gong, …

NeurIPS 2022 2 years ago

Emotional Glossary of Creative Al

29:10

Emotional Glossary of Creative Al

Watch later

Favorite

Alexa Steinbrück

NeurIPS 2022 2 years ago

DreamShard: Generalizable Embedding Table Placement for Recommender Systems

05:13

DreamShard: Generalizable Embedding Table Placement for Recommender Systems

Watch later

Favorite

Daochen Zha, …

NeurIPS 2022 2 years ago

Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs

05:11

Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs

Watch later

Favorite

Andrea Tirinzoni, …

NeurIPS 2022 2 years ago

Provably Adversarially Robust Detection of Out-of-Distribution Data (Almost) for Free

04:56

Provably Adversarially Robust Detection of Out-of-Distribution Data (Almost) for Free

Watch later

Favorite

Alexander Meinke, …

NeurIPS 2022 2 years ago