            Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning

            Jul 24, 2023

            Speakers

            Zechu Li

            Tao Chen

            Zhang-Wei Hong

            About

            Reinforcement learning algorithms require a long time to learn policies on complex tasks because they need a large amount of training data. With recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up thousands of times on a commodity GPU. Most prior work has used on-policy methods such as PPO to train policies because they are simple and easy to scale. Off-policy methods are usually more sample-efficient but more challenging to scale up, r…

            Organizer

            ICML 2023
