Dec 2, 2022
Efficient exploration in sparse-reward tasks is one of the biggest challenges in deep reinforcement learning. Common approaches introduce intrinsic rewards to motivate exploration. For example, visitation counts and prediction-based curiosity use measures of novelty to drive the agent toward novel states in the environment. However, in partially observable environments, these methods can easily be misled by relatively “novel” or noisy observations and get stuck around them. Motivated by how humans explore, looking around the environment to gather information and avoid unnecessary actions, we consider enlarging the agent’s view area for efficient acquisition of knowledge about the environment. In this work, we propose a novel intrinsic reward combining two components: a view-based bonus for ample view coverage and the classical count-based bonus for discovering novel observations. The resulting method, ViewX, achieves state-of-the-art performance on the 12 most challenging procedurally generated tasks on MiniGrid. Additionally, ViewX efficiently learns an exploration policy in the task-agnostic setting, and this policy generalizes well to unseen environments. When exploring new environments on MiniGrid and Habitat, our learned policy significantly outperforms the baselines in terms of scene coverage and extrinsic reward.
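The abstract describes the intrinsic reward as the sum of a view-coverage bonus and a classical count-based bonus. The sketch below is a minimal illustration of that combination for a grid-world setting; the class name, weights, normalization, and the 1/sqrt(N) count formula are assumptions for illustration, not the exact definitions used by ViewX.

```python
import math
from collections import defaultdict


class ViewCountBonus:
    """Illustrative intrinsic reward: view-coverage bonus + count-based bonus.

    Assumes each step yields (a) the set of grid cells currently inside the
    agent's view and (b) a hashable observation key. Weights and the
    1/sqrt(N) novelty term are common choices, not necessarily ViewX's.
    """

    def __init__(self, view_weight=1.0, count_weight=1.0):
        self.view_weight = view_weight
        self.count_weight = count_weight
        self.seen_cells = set()             # cells ever covered by the view
        self.obs_counts = defaultdict(int)  # visitation counts per observation

    def reset(self):
        # Reset per episode, as is common in procedurally generated tasks.
        self.seen_cells.clear()
        self.obs_counts.clear()

    def __call__(self, visible_cells, obs_key):
        # View-based bonus: reward cells newly brought into the agent's view,
        # normalized by the size of the current view.
        new_cells = set(visible_cells) - self.seen_cells
        self.seen_cells.update(new_cells)
        view_bonus = len(new_cells) / max(len(visible_cells), 1)

        # Count-based bonus: classic 1/sqrt(N) novelty term on the observation.
        self.obs_counts[obs_key] += 1
        count_bonus = 1.0 / math.sqrt(self.obs_counts[obs_key])

        return self.view_weight * view_bonus + self.count_weight * count_bonus
```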