Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-006-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-006-alpha.b-cdn.net
      • sl-yoda-v2-stream-006-beta.b-cdn.net
      • 1549480416.rsc.cdn77.org
      • 1102696603.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

            Jul 24, 2023

            Speakers

            YW

            Yulian Wu

            Speaker · 0 followers

            XZ

            Xingyu Zhou

            Speaker · 0 followers

            SRC

            Sayak Ray Chowdhury

            Speaker · 0 followers

            About

            In this paper we study the problem of (finite horizon tabular) Markov decision processes (MDPs) with heavy-tailed rewards under the constraint of differential privacy (DP). Compared with the previous studies for private reinforcement learning that typically assume rewards are sampled from some bounded or sub-Gaussian distributions to ensure DP, we consider the setting where reward distributions have only finite (1+v)-th moments with some v ∈ (0,1]. By resorting to robust mean estimators for rewa…

            Organizer

            I2
            I2

            ICML 2023

            Account · 657 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Multi-class Graph Clustering via Approximated Effective p-Resistance
            05:13

            Multi-class Graph Clustering via Approximated Effective p-Resistance

            Shota Saito, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Generalizing the Gumbel-Softmax with Stochastic Softmax Tricks
            25:13

            Generalizing the Gumbel-Softmax with Stochastic Softmax Tricks

            Max B. Paulus, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Structured Cooperative Learning with Graphical Model Priors
            05:19

            Structured Cooperative Learning with Graphical Model Priors

            Shuangtong Li, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Concept Learning Across Domains and Modalities
            31:40

            Concept Learning Across Domains and Modalities

            Jiajun Wu

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Learning Control by Iterative Inversion
            04:52

            Learning Control by Iterative Inversion

            Gal Leibovich, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            CLIPood: Generalizing CLIP to Out-of-Distributions
            04:49

            CLIPood: Generalizing CLIP to Out-of-Distributions

            Yang Shu, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023