Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Reincarnating RL: Reusing Prior Computation to Accelerate Progress
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-008-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-008-alpha.b-cdn.net
      • sl-yoda-v2-stream-008-beta.b-cdn.net
      • 1159783934.rsc.cdn77.org
      • 1511376917.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Reincarnating RL: Reusing Prior Computation to Accelerate Progress
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Reincarnating RL: Reusing Prior Computation to Accelerate Progress

            Nov 28, 2022

            Speakers

            RA
            RA

            Rishabh Agarwal

            Speaker · 2 followers

            MS

            Max Schwarzer

            Speaker · 1 follower

            PSC

            Pablo Samuel Castro

            Speaker · 1 follower

            About

            Learning tabula rasa, that is without any prior knowledge, is the prevalent workflow in reinforcement learning (RL) research. However, RL systems, when applied to large-scale settings, rarely operate tabula rasa. Such large-scale systems undergo multiple design or algorithmic changes during their development cycle and use ad hoc approaches for incorporating these changes without re-training from scratch, which would have been prohibitively expensive. Additionally, the inefficiency of deep RL typ…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 954 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Alignment-guided Temporal Attention for Video Action Recognition
            04:14

            Alignment-guided Temporal Attention for Video Action Recognition

            Yizhou Zhao, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            BayesPCN: A Continually Learnable Predictive Coding Associative Memory
            04:58

            BayesPCN: A Continually Learnable Predictive Coding Associative Memory

            Jinsoo Yoo, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Temporal Graph Learning: Some Challenges and Recent Directions
            37:33

            Temporal Graph Learning: Some Challenges and Recent Directions

            Bryan Hooi

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Towards Reasoning-Aware Explainable VQA
            10:42

            Towards Reasoning-Aware Explainable VQA

            Rakesh Vaideeswaran, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
            13:59

            Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function

            Ruijie Zheng, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Learning to Scaffold: Optimizing Model Explanations for Teaching
            04:59

            Learning to Scaffold: Optimizing Model Explanations for Teaching

            Patrick Fernandes, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022