Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Mishatched no More: Joint Model-Policy Optimization for Model-Based RL
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-003-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-003-alpha.b-cdn.net
      • sl-yoda-v2-stream-003-beta.b-cdn.net
      • 1544410162.rsc.cdn77.org
      • 1005514182.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Mishatched no More: Joint Model-Policy Optimization for Model-Based RL
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Mishatched no More: Joint Model-Policy Optimization for Model-Based RL

            Nov 28, 2022

            Speakers

            BE

            Benjamin Eysenbach

            Speaker · 0 followers

            AK

            Alexander Khazatsky

            Speaker · 0 followers

            SL

            Sergey Levine

            Speaker · 0 followers

            About

            Many model-based reinforcement learning (RL) methods follow a similar template: fit a model to previously observed data, and then use data from that model for RL or planning. However, models that achieve better training performance (e.g., lower MSE) are not necessarily better for control: an RL agent may seek out the small fraction of states where an accurate model makes mistakes, or it might act in ways that do not expose the errors of an inaccurate model. As noted in prior work, there is an ob…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 963 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Fair Bayes-Optimal Classifiers Under Predictive Parity
            04:55

            Fair Bayes-Optimal Classifiers Under Predictive Parity

            Xianli Zeng, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Supporting Food Security in Africa using Machine Learning and Earth Observations
            38:31

            Supporting Food Security in Africa using Machine Learning and Earth Observations

            Catherine Nakalembe, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            FedML + Panel
            58:56

            FedML + Panel

            Chaoyang He, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            GNM: A General Navigation Model to Driv Any Robot
            05:04

            GNM: A General Navigation Model to Driv Any Robot

            Dhruv Shah, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            BackdoorBench: A Comprehensive Benchmark of Backdoor Learning
            04:25

            BackdoorBench: A Comprehensive Benchmark of Backdoor Learning

            Baoyuan Wu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Private Stochastic Optimization Without Uniform Lipschitz Continuity: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses
            06:16

            Private Stochastic Optimization Without Uniform Lipschitz Continuity: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses

            Andrew Lowy, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022