Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

            Jul 24, 2023

            Speakers

            Yulian Wu

            Xingyu Zhou

            Sayak Ray Chowdhury

            About

            In this paper, we study the problem of (finite-horizon tabular) Markov decision processes (MDPs) with heavy-tailed rewards under the constraint of differential privacy (DP). Unlike previous studies of private reinforcement learning, which typically assume that rewards are sampled from bounded or sub-Gaussian distributions to ensure DP, we consider the setting where reward distributions have only finite (1+v)-th moments for some v ∈ (0,1]. By resorting to robust mean estimators for rewa…
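To make the idea of a robust, private mean estimate for heavy-tailed rewards concrete, here is a minimal Python sketch. It is not taken from the paper: it assumes a generic truncated-mean estimator combined with the standard Laplace mechanism, and the function name `private_truncated_mean`, the truncation threshold, and the privacy budget are all hypothetical choices made for illustration.

```python
import numpy as np


def private_truncated_mean(rewards, threshold, epsilon, rng):
    """Illustrative truncated mean with Laplace noise (not the paper's algorithm).

    Samples whose magnitude exceeds `threshold` are replaced by 0, so every
    sample contributes a value in [-threshold, threshold]. Replacing one
    sample then changes the mean by at most 2 * threshold / n, which is the
    sensitivity used to scale the Laplace noise.
    """
    rewards = np.asarray(rewards, dtype=float)
    n = rewards.size
    truncated = np.where(np.abs(rewards) <= threshold, rewards, 0.0)
    sensitivity = 2.0 * threshold / n
    noise = rng.laplace(loc=0.0, scale=sensitivity / epsilon)
    return truncated.mean() + noise


# Assumed demo data: Lomax (Pareto II) samples with shape 1.5 have a finite
# mean but infinite variance, i.e. only a finite (1+v)-th moment for v < 0.5.
rng = np.random.default_rng(0)
heavy_tailed_rewards = rng.pareto(1.5, size=10_000)
estimate = private_truncated_mean(
    heavy_tailed_rewards, threshold=50.0, epsilon=1.0, rng=rng
)
print(f"private truncated-mean estimate: {estimate:.3f}")
```

The threshold trades bias against noise: a larger value discards fewer large rewards but raises the sensitivity, and therefore the amount of noise needed for the same privacy budget.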

            Organizer

            ICML 2023
