Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Distributional deep Q-learning with CVaR regression
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-009-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-009-alpha.b-cdn.net
      • sl-yoda-v2-stream-009-beta.b-cdn.net
      • 1766500541.rsc.cdn77.org
      • 1441886916.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Distributional deep Q-learning with CVaR regression
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Distributional deep Q-learning with CVaR regression

            Dec 2, 2022

            Speakers

            About

            Reinforcement learning (RL) allows an agent interacting sequentially with an environment to maximize its long-term return, in expectation. In distributional RL (DRL), the agent is also interested in the probability distribution of the return, not just its expected value. This so-called distributional perspective of RL has led to new algorithms with improved empirical performance. In this paper, we recall the atomic DRL (ADRL) framework based on atomic distributions projected via the Wasserstein-…

            Organizer

            N2
            N2

            NeurIPS 2022

            Účet · 962 sledujících

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
            04:54

            Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

            Yuanpei Chen, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            LOT: Layer-wise Orthogonal Training on Improving l2 Certified Robustness
            01:04

            LOT: Layer-wise Orthogonal Training on Improving l2 Certified Robustness

            Xiaojun Xu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization
            04:58

            Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

            Simone Bombari, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Polynomial time guarantees for the Burer-Monteiro method
            04:59

            Polynomial time guarantees for the Burer-Monteiro method

            Diego Cifuentes, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Adversarial training for high-stakes reliability
            04:48

            Adversarial training for high-stakes reliability

            Daniel Ziegler, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Controlled Sparsity via Constrained Optimization or: How I Learned to Stop Tuning Penalties and Love Constraints
            05:17

            Controlled Sparsity via Constrained Optimization or: How I Learned to Stop Tuning Penalties and Love Constraints

            Jose Gallego-Posada, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interested in talks like this? Follow NeurIPS 2022