Div Garg, Shuvam Chakraborty, Chris Cundy, Jiaming Song, Stefano Ermon · IQ-Learn: Inverse soft-Q Learning for Imitation · SlidesLive

IQ-Learn: Inverse soft-Q Learning for Imitation

Dec 6, 2021

Speakers

Div Garg

Shuvam Chakraborty

Chris Cundy

About

In many sequential decision-making problems (e.g., robotics control, game playing, sequential prediction), human or expert data is available containing useful information about the task. However, imitation learning (IL) from a small amount of expert data can be challenging in high-dimensional environments with complex dynamics. Behavioral cloning is widely used because it is simple to implement and converges stably, but it doesn't utilize any information involving the…

Organizer

NeurIPS 2021

Categories

AI & Data Science

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
