Mastane Achab, Reda Alami, Yasser Abdelaziz Dahou Djilali, Kirill Fedyanin, Eric Moulines, Maxim Panov · Distributional deep Q-learning with CVaR regression · SlidesLive

Distributional deep Q-learning with CVaR regression

December 2, 2022

Speakers

Mastane Achab

Reda Alami

Yasser Abdelaziz Dahou Djilali

About the presentation

Reinforcement learning (RL) allows an agent interacting sequentially with an environment to maximize its long-term return, in expectation. In distributional RL (DRL), the agent is also interested in the probability distribution of the return, not just its expected value. This so-called distributional perspective of RL has led to new algorithms with improved empirical performance. In this paper, we recall the atomic DRL (ADRL) framework based on atomic distributions projected via the Wasserstein-…
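
To illustrate why the distribution of the return can matter beyond its expected value, here is a minimal Python sketch. It is not the method from the paper. It compares two hypothetical sets of sampled returns that share the same mean, using an empirical lower-tail CVaR (conditional value-at-risk, the risk measure named in the title), taken here as the average of the worst alpha fraction of samples. The function name `empirical_cvar`, the sample values, and the level alpha = 0.25 are all assumptions made for illustration.

```python
import math
import statistics


def empirical_cvar(returns, alpha):
    """Average of the worst ceil(alpha * n) sampled returns.

    This is one common empirical estimate of lower-tail CVaR at level
    alpha; it is an illustrative assumption, not the paper's estimator.
    """
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    if not returns:
        raise ValueError("returns must be non-empty")
    ordered = sorted(returns)  # worst (lowest) returns first
    k = max(1, math.ceil(alpha * len(ordered)))
    return sum(ordered[:k]) / k


# Hypothetical sampled returns for two actions with the same mean.
safe_returns = [1.0] * 8
risky_returns = [-6.0] + [2.0] * 7

safe_mean = statistics.mean(safe_returns)
risky_mean = statistics.mean(risky_returns)

safe_cvar = empirical_cvar(safe_returns, 0.25)
risky_cvar = empirical_cvar(risky_returns, 0.25)

print(f"safe:  mean={safe_mean}, CVaR_0.25={safe_cvar}")
print(f"risky: mean={risky_mean}, CVaR_0.25={risky_cvar}")
```

An agent that looks only at the expected return cannot tell these two actions apart, whereas a criterion based on the lower tail of the return distribution ranks the first one higher.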

Organizer

NeurIPS 2022
