Dec 2, 2022
Safety is one of the crucial concerns for the real-world application of reinforcement learning (RL). Previous works treat the safe exploration problem as a Constrained Markov Decision Process (CMDP), where policies are optimized under constraints. However, when encountering potential danger, humans tend to stop immediately and rarely learn to behave safely while in danger. Moreover, humans learn off-policy, which enables high learning efficiency in risky tasks. Motivated by human learning, we introduce a Minimalist Off-Policy Approach (MOPA) to address the Safe RL problem. We first define the Early Terminated MDP (ET-MDP) as a special class of MDPs that has the same optimal value function as its CMDP counterpart. An off-policy learning algorithm, MOPA, based on recurrent models is then proposed to solve the ET-MDP, which in turn solves the corresponding CMDP. Experiments on various Safe RL tasks show a substantial improvement over previous methods that directly solve the CMDP, in terms of both higher asymptotic performance and better learning efficiency.
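To make the ET-MDP idea concrete, below is a minimal sketch of an early-termination environment wrapper: the episode ends as soon as the accumulated constraint cost exceeds a budget, rather than enforcing the constraint through CMDP optimization. This is an illustration only, not the authors' implementation; it assumes a gym-style environment that reports a per-step cost in info["cost"] (a Safety Gym convention), and the budget value and key name are hypothetical. MOPA's recurrent off-policy learner is not shown.

    import gym

    class EarlyTerminationWrapper(gym.Wrapper):
        """Sketch of ET-MDP-style early termination (illustrative, not the paper's code)."""

        def __init__(self, env, cost_budget=25.0, cost_key="cost"):
            super().__init__(env)
            self.cost_budget = cost_budget  # hypothetical per-episode cost budget
            self.cost_key = cost_key        # assumed Safety Gym-style cost key in info
            self._episode_cost = 0.0

        def reset(self, **kwargs):
            self._episode_cost = 0.0
            return self.env.reset(**kwargs)

        def step(self, action):
            obs, reward, done, info = self.env.step(action)
            self._episode_cost += info.get(self.cost_key, 0.0)
            if self._episode_cost > self.cost_budget:
                done = True  # terminate early once the cost budget is exceeded
                info["early_terminated"] = True
            return obs, reward, done, info

Any off-policy agent can then be trained on the wrapped environment as if it were an ordinary MDP.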