Risto Vuorio, Johann Brehmer, Hanno Ackermann, Daniel Dijkman, Taco Cohen, Pim de Haan · Deconfounded Imitation Learning · SlidesLive

Deconfounded Imitation Learning

Dec 2, 2022

Speakers

Risto Vuorio

Johann Brehmer

Hanno Ackermann

About

Standard imitation learning can fail when the expert demonstrators have different sensory inputs than the imitating agent. This partial observability gives rise to hidden confounders in the causal graph, which lead to the failure to imitate. We break down the space of confounded imitation learning problems and identify three settings with different data requirements in which the correct imitation policy can be identified. We then introduce an algorithm for deconfounded imitation learning, which…

Organizer

NeurIPS 2022
