Qiuhao Wang, Chin Pang Ho, Marek Petrik · Policy Gradient in Robust MDPs with Global Convergence Guarantee · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Policy Gradient in Robust MDPs with Global Convergence Guarantee

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Policy Gradient in Robust MDPs with Global Convergence Guarantee

Policy Gradient in Robust MDPs with Global Convergence Guarantee

Jul 24, 2023

Sprecher:innen

Qiuhao Wang

Řečník · 0 sledujících

Chin Pang Ho

Řečník · 0 sledujících

Marek Petrik

Řečník · 0 sledujících

Über

Robust Markov decision processes (RMDPs) represent a promising framework for computing reliable policies in the face of model errors. Many successful reinforcement learning algorithms build on variations of policy-gradient methods, but adapting these methods to RMDPs has been challenging. As a result, the applicability of RMDPs to large, practical domains remains limited. This paper proposes a new Double-Loop Robust Policy Gradient (DRPG), the first generic policy gradient method for RMDPs. In c…

Organisator

ICML 2023

Účet · 657 sledujících

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch

05:08

Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch

Zhlédnout později

Oblíbené

Xunyi Zhao, …

ICML 2023 2 years ago

Towards Reliable Neural Specifications

07:13

Towards Reliable Neural Specifications

Zhlédnout později

Oblíbené

Chuqin Geng, …

ICML 2023 2 years ago

Advances In Bits-Back Coding 2019-2023

40:55

Advances In Bits-Back Coding 2019-2023

Zhlédnout později

Oblíbené

ICML 2023 2 years ago

Generative Pre-training for Black-Box Optimization

05:03

Generative Pre-training for Black-Box Optimization

Zhlédnout později

Oblíbené

Satvik Mashkaria, …

ICML 2023 2 years ago

Restoration based Generative Models

04:46

Restoration based Generative Models

Zhlédnout později

Oblíbené

Jaemoo Choi, …

ICML 2023 2 years ago

What can online reinforcement learning with function approximation benefit from coverage conditions?

05:19

What can online reinforcement learning with function approximation benefit from coverage conditions?

Zhlédnout později

Oblíbené

Fanghui Liu, …

ICML 2023 2 years ago