Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: When to Update Your Model: Constrained Model-based Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            When to Update Your Model: Constrained Model-based Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            When to Update Your Model: Constrained Model-based Reinforcement Learning

            Nov 28, 2022

            Speakers

            TJ

            Tianying Ji

            Speaker · 0 followers

            YL

            Yu Luo

            Speaker · 0 followers

            FS

            Fuchun Sun

            Speaker · 0 followers

            About

            Designing and analyzing model-based RL (MBRL) algorithms with guaranteed monotonic improvement has been challenging, mainly due to the interdependence between policy optimization and model learning. Existing discrepancy bounds generally ignore the impacts of model shifts, and their corresponding algorithms are prone to degrade performance by drastic model updating. In this work, we first propose a novel and general theoretical scheme for a non-decreasing performance guarantee of MBRL. Our follow…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 962 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Performance and utility trade-off in interpretable sleep staging
            03:14

            Performance and utility trade-off in interpretable sleep staging

            Irfan Al-Hussaini, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            The effects of gender bias in word embeddings on depression prediction
            11:15

            The effects of gender bias in word embeddings on depression prediction

            Gizem Sogancioglu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Coresets for Relational Data and The Applications
            04:58

            Coresets for Relational Data and The Applications

            Jiaxiang Chen, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Panel RL Implementation
            37:30

            Panel RL Implementation

            Alborz Geramifard, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Building a Subspace of Policies for Scalable Continual Learning
            05:07

            Building a Subspace of Policies for Scalable Continual Learning

            Jean-Baptiste Gaya, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Geometric Order Learning for Rank Estimation
            04:21

            Geometric Order Learning for Rank Estimation

            Seon-Ho Lee, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022