Michal Moshkovitz, Lee Cohen, Yishay Mansour · Finding Safe Zones of Markov Decision Processes Policies · SlidesLive

Finding Safe Zones of Markov Decision Processes Policies

Dec 2, 2022

Speakers

Michal Moshkovitz

Lee Cohen

Yishay Mansour

About

Given a policy, we define a SafeZone as a subset of states such that most of the policy's trajectories are confined to this subset. The quality of a SafeZone is parameterized by the number of states and the escape probability, i.e., the probability that a random trajectory will leave the subset. SafeZones are especially interesting when they have a small number of states and low escape probability. We study the complexity of finding optimal SafeZones and show that in general, the problem is co…
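To make the definition concrete, the sketch below estimates the escape probability of a candidate subset from sampled trajectories. It is only an illustration under stated assumptions, not the authors' algorithm: the four-state chain, the horizon, and the helper names `sample_trajectory` and `escape_probability` are all hypothetical, and a trajectory is counted as escaping if it visits any state outside the subset.

```python
import random


def sample_trajectory(transitions, start, horizon, rng):
    """Roll out one trajectory of the Markov chain induced by a fixed policy."""
    state = start
    trajectory = [state]
    for _ in range(horizon):
        next_states, probs = zip(*transitions[state].items())
        state = rng.choices(next_states, weights=probs, k=1)[0]
        trajectory.append(state)
    return trajectory


def escape_probability(trajectories, safe_zone):
    """Fraction of trajectories that visit at least one state outside safe_zone."""
    escaped = sum(
        1 for traj in trajectories if any(s not in safe_zone for s in traj)
    )
    return escaped / len(trajectories)


# Hypothetical chain (assumption, not from the talk): the policy mostly moves
# between states 0 and 1 and only rarely drifts to states 2 and 3.
transitions = {
    0: {0: 0.5, 1: 0.45, 2: 0.05},
    1: {0: 0.9, 1: 0.1},
    2: {3: 1.0},
    3: {3: 1.0},
}

rng = random.Random(0)
trajectories = [sample_trajectory(transitions, 0, 5, rng) for _ in range(2000)]

# A small candidate SafeZone versus the trivial one containing every state.
small_zone_escape = escape_probability(trajectories, {0, 1})
full_zone_escape = escape_probability(trajectories, {0, 1, 2, 3})
print(small_zone_escape, full_zone_escape)
```

The two quantities in the abstract trade off against each other here: the full state set has escape probability zero but is as large as possible, while the two-state subset is smaller at the cost of a nonzero empirical escape rate.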

Organizer

NeurIPS 2022

