Jason Yecheng Ma, Kausik Sivakumar, Jason Yan, Osbert Bastani, Dinesh Jayaraman · Policy Aware Model Learning via Transition Occupancy Matching · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Policy Aware Model Learning via Transition Occupancy Matching

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-010-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-010-alpha.b-cdn.net
sl-yoda-v2-stream-010-beta.b-cdn.net
1759419103.rsc.cdn77.org
1016618226.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Policy Aware Model Learning via Transition Occupancy Matching

Policy Aware Model Learning via Transition Occupancy Matching

Dec 2, 2022

Speakers

Jason Yecheng Ma

Speaker · 0 followers

Kausik Sivakumar

Speaker · 0 followers

Jason Yan

Speaker · 0 followers

About

Model-based reinforcement learning (MBRL) is an effective paradigm for sample-efficient policy learning. The pre-dominant MBRL strategy iteratively learns the dynamics model by performing maximum likelihood (MLE) on the entire replay buffer and trains the policy using fictitious transitions from the learned model. Given that not all transitions in the replay buffer are equally informative about the task or the policy's current progress, this MLE strategy cannot be optimal and bears no clear rela…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Sequential Information Design: Learning to Persuade in the Dark

04:50

Sequential Information Design: Learning to Persuade in the Dark

Watch later

Favorite

Martino Bernasconi, …

NeurIPS 2022 2 years ago

Deep Combinatorial Aggregation

00:52

Deep Combinatorial Aggregation

Watch later

Favorite

Yuesong Shen, …

NeurIPS 2022 2 years ago

Machine Learning for Predicting Climate Extremes

10:37

Machine Learning for Predicting Climate Extremes

Watch later

Favorite

Hritik Bansal, …

NeurIPS 2022 2 years ago

Teaching Algorithmic Reasoning via In-context Learning

05:58

Teaching Algorithmic Reasoning via In-context Learning

Watch later

Favorite

Hattie Zhou, …

NeurIPS 2022 2 years ago

Alternating Mirror Descent for Constrained Min-Max Games

04:23

Alternating Mirror Descent for Constrained Min-Max Games

Watch later

Favorite

Andre Wibisono, …

NeurIPS 2022 2 years ago

Bridging the Gap: Unifying the Training and Evaluation of Neural Network Binary Classifiers

05:00

Bridging the Gap: Unifying the Training and Evaluation of Neural Network Binary Classifiers

Watch later

Favorite

Nathan Tsoi, …

NeurIPS 2022 2 years ago