            MOPA: a Minimalist Off-Policy Approach to Safe-RL

            Dec 2, 2022

            Speakers

            Hao Sun

            Ziping Xu

            Zhenghao Peng

            About

            Safety is one of the crucial concerns for the real-world application of reinforcement learning (RL). Previous works treat the safe exploration problem as a Constrained Markov Decision Process (CMDP), in which policies are optimized under constraints. However, when encountering any potential danger, humans tend to stop immediately and rarely learn to behave safely while in danger. Moreover, the off-policy learning nature of humans guarantees high learning efficiency in risky tasks. Motivated b…

            Organizer

            NeurIPS 2022
