Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

            Dec 2, 2022

            Speakers

            AR

            Ali Rahimi-Kalahroudi

            Speaker · 0 followers

            JR

            Janarthanan Rajendran

            Speaker · 0 followers

            IM

            Ida Momennejad

            Speaker · 0 followers

            About

            One of the key behavioral characteristics used in neuroscience to determine whether the subject of study—be it a rodent or a human—exhibits model-based learning is effective adaptation to local changes in the environment. In reinforcement learning, however, recent work has shown that modern deep model-based reinforcement-learning (MBRL) methods adapt poorly to such changes. An explanation for this mismatch is that MBRL methods are typically designed with sample-efficiency on a single task in min…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Dimenison-Reduced Adaptive Gradient Method
            05:30

            Dimenison-Reduced Adaptive Gradient Method

            Jingyang Li, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Mildly Conservative Q-Learning for Offline Reinforcement Learning
            04:47

            Mildly Conservative Q-Learning for Offline Reinforcement Learning

            Jiafei Lyu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            General Cutting Planes for Bound-Propagation-Based Neural Network Verification
            05:29

            General Cutting Planes for Bound-Propagation-Based Neural Network Verification

            Huan Zhang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
            05:12

            Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments

            Dolton Fernandes, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Feasible Adversarial Robust Reinforcement Learning
            05:03

            Feasible Adversarial Robust Reinforcement Learning

            JB Lanier, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Private Synthetic Data for Multitask Learning and Marginal Queries
            05:00

            Private Synthetic Data for Multitask Learning and Marginal Queries

            Giuseppe Vietri, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022