            Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations

            Dec 6, 2021

            Speakers

            Yuping Luo

            Tengyu Ma

            About

            Training-time safety violations have been a major concern when we deploy reinforcement learning algorithms in the real world. This paper explores the possibility of safe RL algorithms with zero training-time safety violations in the challenging setting where we are only given a safe but trivial-reward initial policy without any prior knowledge of the dynamics and additional offline data. We propose an algorithm, Co-trained Barrier Certificate for Safe RL (CRABS), which iteratively learns barrier c…

            Organizer

            NeurIPS 2021

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
