Tianying Ji, Yu Luo, Fuchun Sun, Mingxuan Jing, Fengxiang He, Wenbing Huang · When to Update Your Model: Constrained Model-based Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: When to Update Your Model: Constrained Model-based Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-010-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-010-alpha.b-cdn.net
sl-yoda-v2-stream-010-beta.b-cdn.net
1759419103.rsc.cdn77.org
1016618226.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

When to Update Your Model: Constrained Model-based Reinforcement Learning

When to Update Your Model: Constrained Model-based Reinforcement Learning

Nov 28, 2022

Speakers

Tianying Ji

Speaker · 0 followers

Yu Luo

Speaker · 0 followers

Fuchun Sun

Speaker · 0 followers

About

Designing and analyzing model-based RL (MBRL) algorithms with guaranteed monotonic improvement has been challenging, mainly due to the interdependence between policy optimization and model learning. Existing discrepancy bounds generally ignore the impacts of model shifts, and their corresponding algorithms are prone to degrade performance by drastic model updating. In this work, we first propose a novel and general theoretical scheme for a non-decreasing performance guarantee of MBRL. Our follow…

Organizer

NeurIPS 2022

Account · 962 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Performance and utility trade-off in interpretable sleep staging

03:14

Performance and utility trade-off in interpretable sleep staging

Watch later

Favorite

Irfan Al-Hussaini, …

NeurIPS 2022 2 years ago

The effects of gender bias in word embeddings on depression prediction

11:15

The effects of gender bias in word embeddings on depression prediction

Watch later

Favorite

Gizem Sogancioglu, …

NeurIPS 2022 2 years ago

Coresets for Relational Data and The Applications

04:58

Coresets for Relational Data and The Applications

Watch later

Favorite

Jiaxiang Chen, …

NeurIPS 2022 2 years ago

Panel RL Implementation

37:30

Panel RL Implementation

Watch later

Favorite

Alborz Geramifard, …

NeurIPS 2022 2 years ago

Building a Subspace of Policies for Scalable Continual Learning

05:07

Building a Subspace of Policies for Scalable Continual Learning

Watch later

Favorite

Jean-Baptiste Gaya, …

NeurIPS 2022 2 years ago

Geometric Order Learning for Rank Estimation

04:21

Geometric Order Learning for Rank Estimation

Watch later

Favorite

Seon-Ho Lee, …

NeurIPS 2022 2 years ago