Kai-Chieh Hsu, Allen Z. Ren, Duy Phuong Nguyen, Anirudha Majumdar, Jaime F. Fisac · Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees · SlidesLive

Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees

December 2, 2022

Speakers

Kai-Chieh Hsu

Allen Z. Ren

Duy Phuong Nguyen

About the presentation

Safety is a critical component of autonomous systems and remains a challenge for learning-based policies to be utilized in the real world. In this paper, we propose Sim-to-Lab-to-Real to bridge the reality gap with a probabilistically guaranteed safety-aware policy distribution. To improve safety, we apply a dual policy setup where a performance policy is trained using the cumulative task reward and a backup (safety) policy is trained by solving the safety Bellman Equation based on Hamilton-Jac…

Organizer

NeurIPS 2022
