Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo Jovanovic · Oral: Provably Efficient Safe Exploration via Primal-Dual Policy Optimization · SlidesLive

Categories

Arts, Design & Media

Category · 1.2k presentations

Business & Economics

Category · 3.8k presentations

Computer Science & IT

Category · 14.8k presentations

Engineering & Technology

Category · 491 presentations

Humanities & Social Sciences

Category · 1.3k presentations

Medicine & Health

Category · 529 presentations

Natural & Formal Sciences

Category · 3.3k presentations

Self Development & Lifestyle

Category · 599 presentations

EN

Log in Talk to sales

Player loading failed. Try again.
If this error persists report it to support@slideslive.com.

User-Agent: Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)

Oral: Provably Efficient Safe Exploration via Primal-Dual Policy Optimization

Apr 14, 2021

Speakers

Dongsheng Ding

Speaker · 0 followers

Xiaohan Wei

Speaker · 0 followers

Zhuoran Yang

Speaker · 2 followers

About

We study the Safe Reinforcement Learning (SRL) problem using the Constrained Markov Decision Process (CMDP) formulation in which an agent aims to maximize the expected total reward subject to a safety constraint on the expected total value of a utility function. We focus on an episodic setting with the function approximation where the Markov transition kernels have a linear structure but do not impose any additional assumptions on the sampling model. Designing SRL algorithms with provable comput…

Organizer

AISTATS 2021

Account · 12 followers

Categories

AI & Data Science

Category · 10.8k presentations

Mathematics

Category · 2.4k presentations

About AISTATS 2021

The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

02:52

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

Watch later

Favorite

Yi-fan Chen, …

AISTATS 2021 4 years ago

Differentiable Divergences Between Time Series

03:09

Differentiable Divergences Between Time Series

Watch later

Favorite

Mathieu Blondel, …

AISTATS 2021 4 years ago

Variational Autoencoder with Learned Latent Structure

03:00

Variational Autoencoder with Learned Latent Structure

Watch later

Favorite

Marissa Connor, …

AISTATS 2021 4 years ago

Simulation-Based Inference

1:16:04

Simulation-Based Inference

Watch later

Favorite

AISTATS 2021 4 years ago

Oral: Minimax optimality of Laplacian smoothing

11:49

Oral: Minimax optimality of Laplacian smoothing

Watch later

Favorite

Alden Green, …

AISTATS 2021 4 years ago

Rao-Blackwellised parallel MCMC

03:02

Rao-Blackwellised parallel MCMC

Watch later

Favorite

Tobias Schwedes, …

AISTATS 2021 4 years ago