Gaon An, Seungyong Moon, Jang-Hyun Kim, Hyun Oh Song · Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Dec 6, 2021

Speakers

Gaon An

Speaker · 0 followers

Seungyong Moon

Speaker · 0 followers

Jang-Hyun Kim

Speaker · 0 followers

About

Offline reinforcement learning (offline RL), which aims to find an optimal policy from a previously collected static dataset, bears algorithmic difficulties due to function approximation errors from out-of-distribution (OOD) data points. To this end, offline RL algorithms adopt either a constraint or a penalty term that explicitly guides the policy to stay close to the given dataset. However, prior methods typically require accurate estimation of the behavior policy or sampling from OOD data poi…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Next-Generation Datasets for Safe Self-driving

27:33

Next-Generation Datasets for Safe Self-driving

Watch later

Favorite

NeurIPS 2021 3 years ago

Long-Short Transformer: Efficient Transformers for Language and Vision

11:44

Long-Short Transformer: Efficient Transformers for Language and Vision

Watch later

Favorite

NeurIPS 2021 3 years ago

Automatic Symmetry Discovery with Lie Algebra Convolutional Network

14:42

Automatic Symmetry Discovery with Lie Algebra Convolutional Network

Watch later

Favorite

Nima Dehmamy, …

NeurIPS 2021 3 years ago

AutoDC: Automated data-centric processing

01:54

AutoDC: Automated data-centric processing

Watch later

Favorite

Zac Yung-Chun Liu, …

NeurIPS 2021 3 years ago

Directed Spectral Measures Improve Latent Network Models Of Neural Populations

11:43

Directed Spectral Measures Improve Latent Network Models Of Neural Populations

Watch later

Favorite

Neil Gallagher, …

NeurIPS 2021 3 years ago

Continuous Mean-Covariance Bandits

11:33

Continuous Mean-Covariance Bandits

Watch later

Favorite

NeurIPS 2021 3 years ago