            Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning

            Jul 24, 2023

            Speakers

            Zechu Li

            Tao Chen

            Zhang-Wei Hong

            About

            Reinforcement learning algorithms take a long time to learn policies on complex tasks because they need a large amount of training data. With recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up thousands of times on a commodity GPU. Most prior works have used on-policy methods such as PPO to train policies because they are simple and easy to scale. Off-policy methods are usually more sample-efficient but more challenging to scale up, r…

            Organizer

            ICML 2023
