Perturbed Quantile Regression for Distributional Reinforcement Learning

Dec 2, 2022

About

Distributional reinforcement learning aims to learn the distribution of returns in stochastic environments. Since the learned return distribution contains rich information about the stochasticity of the environment, previous studies have relied on descriptive statistics, such as the standard deviation, for optimism in the face of uncertainty. However, the uncertainty estimated from an empirical distribution can hinder convergence and performance when these methods explore with a fixed criterion that is one-sidedly biased toward risk. In this paper, we propose a novel distributional reinforcement learning method that explores by randomizing the risk criterion so as to reach a risk-neutral optimal policy. First, we provide a perturbed distributional Bellman optimality operator by distorting the risk measure used in action selection. Second, we prove the convergence and optimality of the proposed method using a weaker contraction property. Our theoretical results show that the proposed method does not fall into biased exploration and is guaranteed to converge to an optimal return distribution. Finally, we empirically demonstrate that our method outperforms other existing distribution-based algorithms in various environments, including 55 Atari games.
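
The paper itself is not reproduced on this page, so the following is only a minimal sketch of the idea the abstract describes: scoring actions on quantile estimates of the return distribution with a risk criterion that is re-randomized at every decision. The choice of CVaR as the distortion, the uniform sampling of the risk level alpha, and all function names here are illustrative assumptions, not the paper's actual operator.

```python
import numpy as np

def cvar_from_quantiles(quantiles, alpha):
    # Approximate CVaR_alpha as the mean of the lowest
    # alpha-fraction of the quantile estimates.
    q = np.sort(quantiles)
    k = max(1, int(np.ceil(alpha * q.size)))
    return q[:k].mean()

def select_action(quantiles_per_action, rng):
    # Randomized risk criterion (illustrative assumption): draw a
    # fresh CVaR level alpha for each decision instead of committing
    # to one fixed, one-sided risk measure. alpha = 1 recovers the
    # mean (risk-neutral); small alpha is strongly risk-averse.
    alpha = rng.uniform(1e-3, 1.0)
    scores = [cvar_from_quantiles(q, alpha) for q in quantiles_per_action]
    return int(np.argmax(scores))

# Toy usage: 3 actions, each with 8 quantile estimates of its
# return distribution (as a QR-DQN-style critic would output).
rng = np.random.default_rng(0)
quantiles = rng.normal(loc=[[0.0], [0.5], [0.2]], scale=1.0, size=(3, 8))
print(select_action(quantiles, rng))
```

Because the sampled risk level is sometimes pessimistic and sometimes near risk-neutral, no single one-sided criterion dominates exploration, which is the intuition behind the unbiased-exploration claim in the abstract.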
