Jiafei Lyu, Xiu Li, Zongqing Lu · Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-009-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-009-alpha.b-cdn.net
sl-yoda-v2-stream-009-beta.b-cdn.net
1766500541.rsc.cdn77.org
1441886916.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination

Nov 28, 2022

Speakers

Jiafei Lyu

Speaker · 0 followers

Xiu Li

Speaker · 0 followers

Zongqing Lu

Speaker · 0 followers

About

The learned policy of model-free offline reinforcement learning (RL) methods is often constrained to stay within the support of datasets to avoid possible dangerous out-of-distribution actions or states, making it challenging to handle out-of-support region. Model-based RL methods offer a richer dataset and benefit generalization by generating imaginary trajectories with either trained forward or reverse dynamics model. However, the imagined transitions may be inaccurate, thus downgrading the pe…

Organizer

NeurIPS 2022

Account · 952 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Are You Stealing My Model? Sample Correlation for Fingerprinting Deep Neural Networks

04:50

Are You Stealing My Model? Sample Correlation for Fingerprinting Deep Neural Networks

Watch later

Favorite

Jiyang Guan, …

NeurIPS 2022 2 years ago

CommsVAE: Learning the brain´s macroscale communication dynamics using coupled sequential VAEs

19:16

CommsVAE: Learning the brain´s macroscale communication dynamics using coupled sequential VAEs

Watch later

Favorite

Eloy Geenjaar, …

NeurIPS 2022 2 years ago

Lethal Dose Conjecture on Data Poisoning

05:01

Lethal Dose Conjecture on Data Poisoning

Watch later

Favorite

Wenxiao Wang, …

NeurIPS 2022 2 years ago

BEER: Fast O(1/T) Rate for Decentralized Nonconvex Optimization with Communication Compression

05:04

BEER: Fast O(1/T) Rate for Decentralized Nonconvex Optimization with Communication Compression

Watch later

Favorite

Haoyu Zhao, …

NeurIPS 2022 2 years ago

Adjusting the Gender Wage Gap with a Low-Dimensional Representation of Job History

02:48

Adjusting the Gender Wage Gap with a Low-Dimensional Representation of Job History

Watch later

Favorite

Keyon Vafa, …

NeurIPS 2022 2 years ago

Scalable and Communication-Efficient Vertical Federated Learning

22:09

Scalable and Communication-Efficient Vertical Federated Learning

Watch later

Favorite

Stacy Patterson, …

NeurIPS 2022 2 years ago