Mastane Achab, Reda Alami, Yasser Abdelaziz Dahou Djilali, Kirill Fedyanin, Eric Moulines, Maxim Panov · Distributional deep Q-learning with CVaR regression · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Distributional deep Q-learning with CVaR regression

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-009-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-009-alpha.b-cdn.net
sl-yoda-v2-stream-009-beta.b-cdn.net
1766500541.rsc.cdn77.org
1441886916.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Distributional deep Q-learning with CVaR regression

Distributional deep Q-learning with CVaR regression

Dec 2, 2022

Speakers

About

Reinforcement learning (RL) allows an agent interacting sequentially with an environment to maximize its long-term return, in expectation. In distributional RL (DRL), the agent is also interested in the probability distribution of the return, not just its expected value. This so-called distributional perspective of RL has led to new algorithms with improved empirical performance. In this paper, we recall the atomic DRL (ADRL) framework based on atomic distributions projected via the Wasserstein-…

Organizer

NeurIPS 2022

Účet · 962 sledujících

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

04:54

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Zhlédnout později

Oblíbené

Yuanpei Chen, …

NeurIPS 2022 2 years ago

LOT: Layer-wise Orthogonal Training on Improving l2 Certified Robustness

01:04

LOT: Layer-wise Orthogonal Training on Improving l2 Certified Robustness

Zhlédnout později

Oblíbené

Xiaojun Xu, …

NeurIPS 2022 2 years ago

Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

04:58

Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

Zhlédnout později

Oblíbené

Simone Bombari, …

NeurIPS 2022 2 years ago

Polynomial time guarantees for the Burer-Monteiro method

04:59

Polynomial time guarantees for the Burer-Monteiro method

Zhlédnout později

Oblíbené

Diego Cifuentes, …

NeurIPS 2022 2 years ago

Adversarial training for high-stakes reliability

04:48

Adversarial training for high-stakes reliability

Zhlédnout později

Oblíbené

Daniel Ziegler, …

NeurIPS 2022 2 years ago

Controlled Sparsity via Constrained Optimization or: How I Learned to Stop Tuning Penalties and Love Constraints

05:17

Controlled Sparsity via Constrained Optimization or: How I Learned to Stop Tuning Penalties and Love Constraints

Zhlédnout později

Oblíbené

Jose Gallego-Posada, …

NeurIPS 2022 2 years ago