Mark Rowland, Yunhao Tang, Clare Lyle, Remi Munos, Marc G. Bellemare, Will Dabney · The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles
Playback rate
Quality

Settings
Debug information
Server
Subtitles size Medium

Bookmarks

Server

Subtitles

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation

Jul 24, 2023

Speakers

About

We study the problem of temporal-difference-based policy evaluation in reinforcement learning. In particular, we analyse the use of a distributional reinforcement learning algorithm, quantile temporal-difference learning (QTD), for this task. We reach the surprising conclusion that even if a practitioner has no interest in the return distribution beyond the mean, QTD (which learns predictions about the full distribution of returns) may offer performance superior to approaches such as classical T…

Organizer

ICML 2023

Account · 638 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

05:11

GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

Watch later

Favorite

Salah Ghamizi, …

ICML 2023 2 years ago

Deterministic equivalent and error universality of deep random features learning

05:15

Deterministic equivalent and error universality of deep random features learning

Watch later

Favorite

Dominik Schröder, …

ICML 2023 2 years ago

Machine Learning Models Learn Statistical Rules Inferred from Data

05:04

Machine Learning Models Learn Statistical Rules Inferred from Data

Watch later

Favorite

Aaditya Naik, …

ICML 2023 2 years ago

Conformal Prediction Sets for Graph Neural Networks

05:31

Conformal Prediction Sets for Graph Neural Networks

Watch later

Favorite

Soroush H. Zargarbashi, …

ICML 2023 2 years ago

Rethinking Backdoor Attacks

04:56

Rethinking Backdoor Attacks

Watch later

Favorite

Alaa Khaddaj, …

ICML 2023 2 years ago

Linear Causal Disentanglement via Interventions

05:16

Linear Causal Disentanglement via Interventions

Watch later

Favorite

Chandler Squires, …

ICML 2023 2 years ago