Shenao Zhang · Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-007-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-007-alpha.b-cdn.net
sl-yoda-v2-stream-007-beta.b-cdn.net
1678031076.rsc.cdn77.org
1932936657.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning

Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning

Nov 28, 2022

Speakers

Shenao Zhang

Speaker · 0 followers

About

Provably efficient Model-Based Reinforcement Learning (MBRL) based on optimism or posterior sampling (PSRL) is ensured to attain the global optimality asymptotically by introducing complexity measure of the model class. However, the complexity might grow exponentially for even the simplest nonlinear models, where global convergence is impossible within finite iterations. When the model suffers a large generalization error, which is quantitatively measured by the model complexity, the uncertainty…

Organizer

NeurIPS 2022

Account · 962 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

SoftTreeMax: Policy Gradient with Tree Search

05:01

SoftTreeMax: Policy Gradient with Tree Search

Watch later

Favorite

NeurIPS 2022 2 years ago

Deterministic Langevin Monte Carlo with Normalizing Flows for Bayesian Inference

01:00

Deterministic Langevin Monte Carlo with Normalizing Flows for Bayesian Inference

Watch later

Favorite

Richard Grumitt, …

NeurIPS 2022 2 years ago

Optimal Complexity in Non-Convex Decentralized Learning over Time-Varying Networks

05:19

Optimal Complexity in Non-Convex Decentralized Learning over Time-Varying Networks

Watch later

Favorite

Xinmeng Huang, …

NeurIPS 2022 2 years ago

Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

04:32

Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

Watch later

Favorite

Kimon Antonakopoulos, …

NeurIPS 2022 2 years ago

Statistical Learning and Inverse Problems: An Stochastic Gradient Approach

05:25

Statistical Learning and Inverse Problems: An Stochastic Gradient Approach

Watch later

Favorite

NeurIPS 2022 2 years ago

Vision-centric Autonomous Driving: from Perception to Prediction

31:33

Vision-centric Autonomous Driving: from Perception to Prediction

Watch later

Favorite

NeurIPS 2022 2 years ago