Chaitanya Kharyal, Tanmay Kumar Sinha, Sai Krishna Gottipati, Srijita Das, Matthew E. Taylor · Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning

Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning

Dec 2, 2022

Speakers

Chaitanya Kharyal

Speaker · 0 followers

Tanmay Kumar Sinha

Speaker · 0 followers

Sai Krishna Gottipati

Speaker · 0 followers

About

A long-running challenge in the reinforcement learning (RL) community has been to train a goal-conditioned agent in a sparse reward environment such that it could also generalize to other unseen goals. Empirical results in Fetch-Reach and a novel driving simulator demonstrate that our proposed algorithm, Multi-Teacher Asymmetric Self-Play, allows one agent (i.e., a teacher) to create a successful curriculum for another agent (i.e., the student). Surprisingly, results also show that training with…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

04:49

LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

Watch later

Favorite

NeurIPS 2022 2 years ago

Adjoint-aided inference of Gaussian process driven differential equations

04:30

Adjoint-aided inference of Gaussian process driven differential equations

Watch later

Favorite

Paterne Gahungu, …

NeurIPS 2022 2 years ago

Information bottleneck theory of high-dimensional regression

05:11

Information bottleneck theory of high-dimensional regression

Watch later

Favorite

Vudtiwat Ngampruetikorn, …

NeurIPS 2022 2 years ago

Closing Remarks

02:22

Closing Remarks

Watch later

Favorite

NeurIPS 2022 2 years ago

Panel Discussion - What Role Should Empiricism Play in Building AI?

50:38

Panel Discussion - What Role Should Empiricism Play in Building AI?

Watch later

Favorite

Samy Bengio, …

NeurIPS 2022 2 years ago

UniGAN: Reducing Mode Collapse in GANs using a Uniform Generator

05:04

UniGAN: Reducing Mode Collapse in GANs using a Uniform Generator

Watch later

Favorite

NeurIPS 2022 2 years ago