Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
        0:00 / 0:00
        • Report Issue
        • Settings
        • Playlists
        • Bookmarks
        • Subtitles
        • Playback rate
        • Quality
        • Settings
        • Debug information
        • Server
        • Subtitles size Medium
        • Bookmarks
        • Server
        • Subtitles
        • Playback rate
        • Quality
        • Subtitles size
        • Large
        • Medium
        • Small
        • Mode
        • Video Slideshow
        • Audio Slideshow
        • Slideshow
        • Video
        My playlists
          Bookmarks
            00:00:00
              • Settings
              • Sync diff
              • Quality
              • Settings
              • Server
              • Quality
              • Server

              The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation

              Jul 24, 2023

              Speakers

              About

              We study the problem of temporal-difference-based policy evaluation in reinforcement learning. In particular, we analyse the use of a distributional reinforcement learning algorithm, quantile temporal-difference learning (QTD), for this task. We reach the surprising conclusion that even if a practitioner has no interest in the return distribution beyond the mean, QTD (which learns predictions about the full distribution of returns) may offer performance superior to approaches such as classical T…

              Organizer

              I2
              I2

              ICML 2023

              Account · 638 followers

              Like the format? Trust SlidesLive to capture your next event!

              Professional recording and live streaming, delivered globally.

              Sharing

              Recommended Videos

              Presentations on similar topic, category or speaker

              GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks
              05:11

              GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

              Salah Ghamizi, …

              I2
              I2
              ICML 2023 2 years ago

              Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

              Deterministic equivalent and error universality of deep random features learning
              05:15

              Deterministic equivalent and error universality of deep random features learning

              Dominik Schröder, …

              I2
              I2
              ICML 2023 2 years ago

              Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

              Machine Learning Models Learn Statistical Rules Inferred from Data
              05:04

              Machine Learning Models Learn Statistical Rules Inferred from Data

              Aaditya Naik, …

              I2
              I2
              ICML 2023 2 years ago

              Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

              Conformal Prediction Sets for Graph Neural Networks
              05:31

              Conformal Prediction Sets for Graph Neural Networks

              Soroush H. Zargarbashi, …

              I2
              I2
              ICML 2023 2 years ago

              Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

              Rethinking Backdoor Attacks
              04:56

              Rethinking Backdoor Attacks

              Alaa Khaddaj, …

              I2
              I2
              ICML 2023 2 years ago

              Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

              Linear Causal Disentanglement via Interventions
              05:16

              Linear Causal Disentanglement via Interventions

              Chandler Squires, …

              I2
              I2
              ICML 2023 2 years ago

              Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

              Interested in talks like this? Follow ICML 2023