Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning

            Nov 28, 2022

            Speakers

            SZ

            Shenao Zhang

            Speaker · 0 followers

            About

            Provably efficient Model-Based Reinforcement Learning (MBRL) based on optimism or posterior sampling (PSRL) is ensured to attain the global optimality asymptotically by introducing complexity measure of the model class. However, the complexity might grow exponentially for even the simplest nonlinear models, where global convergence is impossible within finite iterations. When the model suffers a large generalization error, which is quantitatively measured by the model complexity, the uncertainty…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 962 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            SoftTreeMax: Policy Gradient with Tree Search
            05:01

            SoftTreeMax: Policy Gradient with Tree Search

            Gal Dalal, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Deterministic Langevin Monte Carlo with Normalizing Flows for Bayesian Inference
            01:00

            Deterministic Langevin Monte Carlo with Normalizing Flows for Bayesian Inference

            Richard Grumitt, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Optimal Complexity in Non-Convex Decentralized Learning over Time-Varying Networks
            05:19

            Optimal Complexity in Non-Convex Decentralized Learning over Time-Varying Networks

            Xinmeng Huang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods
            04:32

            Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

            Kimon Antonakopoulos, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Statistical Learning and Inverse Problems: An Stochastic Gradient Approach
            05:25

            Statistical Learning and Inverse Problems: An Stochastic Gradient Approach

            Yuri Fonseca

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Vision-centric Autonomous Driving: from Perception to Prediction
            31:33

            Vision-centric Autonomous Driving: from Perception to Prediction

            Hang Zhao

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            Interested in talks like this? Follow NeurIPS 2022