Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-002-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-002-alpha.b-cdn.net
      • sl-yoda-v2-stream-002-beta.b-cdn.net
      • 1001562353.rsc.cdn77.org
      • 1075090661.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning

            Jul 24, 2023

            Speakers

            ZL

            Zechu Li

            Speaker · 0 followers

            TC

            Tao Chen

            Speaker · 0 followers

            ZH

            Zhang-Wei Hong

            Speaker · 0 followers

            About

            Reinforcement learning algorithms require a long time to learn policies on complex tasks due to the need for a large amount of training data. With the recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up thousands of times on a commodity GPU. Most prior works have used on-policy methods such as PPO to train policies due to their simplicity and easy-to-scale nature. Off-policy methods are usually more sample-efficient but more challenging to be scaled up, r…

            Organizer

            I2
            I2

            ICML 2023

            Account · 657 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Veridical Data Science with a case study to find genetic drivers of a heart disease
            32:09

            Veridical Data Science with a case study to find genetic drivers of a heart disease

            Bin Yu

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Equivariant Architectures for Learning in Deep Weight Spaces
            05:20

            Equivariant Architectures for Learning in Deep Weight Spaces

            Aviv Navon, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Counterfactuals, play and causal inference in young children and machines
            15:42

            Counterfactuals, play and causal inference in young children and machines

            Alison Gopnik

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Pruning via Sparsity-indexed ODE: A Continuous Sparsity Viewpoint
            05:22

            Pruning via Sparsity-indexed ODE: A Continuous Sparsity Viewpoint

            Zhanfeng Mo, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Conformal Prediction Sets for Graph Neural Networks
            05:31

            Conformal Prediction Sets for Graph Neural Networks

            Soroush H. Zargarbashi, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Dual Focal Loss for Calibration
            05:08

            Dual Focal Loss for Calibration

            Linwei Tao, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023