            A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP

            Nov 28, 2022

            Speakers

            Fan Chen

            Junyu Zhang

            Zaiwen Wen

            About

            As an important framework for safe Reinforcement Learning, the Constrained Markov Decision Process (CMDP) has been extensively studied in the recent literature. However, despite the rich results under various on-policy learning settings, an essential understanding of offline CMDP problems is still lacking, in terms of both algorithm design and the information-theoretic sample complexity lower bound. In this paper, we focus on solving the CMDP problems where only offline data are avail…

            Organizer

            NeurIPS 2022
