            Human-AI Coordination via Human-Regularized Search and Learning

            Dec 2, 2022

            Speakers

            Hengyuan Hu

            David X. Wu

            Adam Lerer

            About

            We consider the problem of making AI agents that collaborate well with humans in partially observable, fully cooperative environments given datasets of human behavior. Inspired by piKL, a human-data-regularized search method that improves upon a behavioral cloning policy without diverging far from it, we develop a three-step algorithm that achieves strong performance in coordinating with real humans in the Hanabi benchmark. We first use a regularized search algorithm and behavioral cloning to…
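
The idea of improving on a behavioral cloning policy without moving far from it can be illustrated with a generic KL-regularized policy update. The sketch below is not the authors' implementation; it assumes the standard closed form that maximizes expected value minus `lam` times the KL divergence to the cloned policy, which gives pi(a) proportional to bc(a) * exp(Q(a) / lam). The function name `kl_regularized_policy`, the parameter `lam`, and the example numbers are all hypothetical.

```python
import math


def kl_regularized_policy(bc_probs, q_values, lam):
    """Return pi(a) proportional to bc_probs[a] * exp(q_values[a] / lam).

    bc_probs: action probabilities from a behavioral cloning policy (assumed input).
    q_values: estimated action values, e.g. from search (assumed input).
    lam: regularization strength; larger values keep pi closer to bc_probs.
    """
    if lam <= 0:
        raise ValueError("lam must be positive")
    # Work in log space; actions with zero prior probability stay at zero.
    logits = [
        (math.log(p) + q / lam) if p > 0 else float("-inf")
        for p, q in zip(bc_probs, q_values)
    ]
    # Subtract the maximum logit for numerical stability before exponentiating.
    top = max(logits)
    weights = [math.exp(logit - top) for logit in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical example: the cloned policy prefers action 0,
# but the value estimates favor action 1.
bc = [0.7, 0.2, 0.1]
q = [0.0, 1.0, 0.0]
near_human = kl_regularized_policy(bc, q, lam=1e6)   # stays close to bc
near_greedy = kl_regularized_policy(bc, q, lam=0.01)  # concentrates on action 1
```

With a large `lam` the result is nearly identical to the cloned policy, and with a small `lam` it approaches the greedy choice under the value estimates, so `lam` controls how far the improved policy may drift from human-like behavior.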

            Organizer

            NeurIPS 2022
