Yuping Luo, Tengyu Ma · Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations · SlidesLive

Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations

Dec 6, 2021

Speakers

Yuping Luo

Tengyu Ma

About

Training-time safety violations have been a major concern when we deploy reinforcement learning algorithms in the real world. This paper explores the possibility of safe RL algorithms with zero training-time safety violations in the challenging setting where we are only given a safe but trivial-reward initial policy without any prior knowledge of the dynamics and additional offline data. We propose an algorithm, Co-trained Barrier Certificate for Safe RL (CRABS), which iteratively learns barrier c…

Organizer

NeurIPS 2021

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
