Eddy Hudson, Garrett Warnell, Ishan Durugkar, Peter Stone · ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-002-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-002-alpha.b-cdn.net
sl-yoda-v2-stream-002-beta.b-cdn.net
1001562353.rsc.cdn77.org
1075090661.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning

ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning

Dec 2, 2022

Speakers

Eddy Hudson

Speaker · 0 followers

Garrett Warnell

Speaker · 0 followers

Ishan Durugkar

Speaker · 0 followers

About

Given a dataset of interactions with an environment of interest, a viable method to extract an agent policy is to estimate the maximum likelihood policy indicated by this data. This approach is commonly referred to as behavioral cloning (BC). In this work, we describe a key disadvantage of BC that arises due to the maximum likelihood objective function; namely that BC is mean-seeking with respect to the state-conditional expert action distribution when the learner's policy is represented with a…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

RecursiveMix: Mixed Learning with History

01:05

RecursiveMix: Mixed Learning with History

Watch later

Favorite

Lingfeng Yang, …

NeurIPS 2022 2 years ago

Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models

09:35

Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models

Watch later

Favorite

Kartik Sharma, …

NeurIPS 2022 2 years ago

58:56

FedML + Panel

Watch later

Favorite

Chaoyang He, …

NeurIPS 2022 2 years ago

Boosting as Frank-Wolfe

04:03

Boosting as Frank-Wolfe

Watch later

Favorite

Ryotaro Mitsuboshi, …

NeurIPS 2022 2 years ago

Mitigating Health Data Poverty: Generative Approaches versus Resampling for Time-series Clinical Data

01:54

Mitigating Health Data Poverty: Generative Approaches versus Resampling for Time-series Clinical Data

Watch later

Favorite

Raffaele Marchesi, …

NeurIPS 2022 2 years ago

On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity

04:49

On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity

Watch later

Favorite

Vincent Szolnoky, …

NeurIPS 2022 2 years ago