Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-006-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-006-alpha.b-cdn.net
      • sl-yoda-v2-stream-006-beta.b-cdn.net
      • 1549480416.rsc.cdn77.org
      • 1102696603.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

            Jul 24, 2023

            Sprecher:innen

            YW

            Yulian Wu

            Sprecher:in · 0 Follower:innen

            XZ

            Xingyu Zhou

            Sprecher:in · 0 Follower:innen

            SRC

            Sayak Ray Chowdhury

            Sprecher:in · 0 Follower:innen

            Über

            In this paper we study the problem of (finite horizon tabular) Markov decision processes (MDPs) with heavy-tailed rewards under the constraint of differential privacy (DP). Compared with the previous studies for private reinforcement learning that typically assume rewards are sampled from some bounded or sub-Gaussian distributions to ensure DP, we consider the setting where reward distributions have only finite (1+v)-th moments with some v ∈ (0,1]. By resorting to robust mean estimators for rewa…

            Organisator

            I2
            I2

            ICML 2023

            Konto · 657 Follower:innen

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            CrossSplit: Mitigating Label Noise Memorization through Data Splitting
            05:17

            CrossSplit: Mitigating Label Noise Memorization through Data Splitting

            Jihye Kim, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks
            05:11

            GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

            Salah Ghamizi, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition
            04:25

            NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition

            Xinquan Huang, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            SpotEM: Efficient Video Search for Episodic Memory
            05:20

            SpotEM: Efficient Video Search for Episodic Memory

            Santhosh Kumar Ramakrishnan, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Reliable Measures of Spread in High Dimensional Latent Spaces
            05:19

            Reliable Measures of Spread in High Dimensional Latent Spaces

            Anna Marbut, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interessiert an Vorträgen wie diesem? ICML 2023 folgen