Yulian Wu, Xingyu Zhou, Sayak Ray Chowdhury, Di Wang · Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

Jul 24, 2023

Sprecher:innen

Yulian Wu

Sprecher:in · 0 Follower:innen

Xingyu Zhou

Sprecher:in · 0 Follower:innen

Sayak Ray Chowdhury

Sprecher:in · 0 Follower:innen

Über

In this paper we study the problem of (finite horizon tabular) Markov decision processes (MDPs) with heavy-tailed rewards under the constraint of differential privacy (DP). Compared with the previous studies for private reinforcement learning that typically assume rewards are sampled from some bounded or sub-Gaussian distributions to ensure DP, we consider the setting where reward distributions have only finite (1+v)-th moments with some v ∈ (0,1]. By resorting to robust mean estimators for rewa…

Organisator

ICML 2023

Konto · 657 Follower:innen

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster L-/L-Convex Function Minimization

04:37

Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster L-/L-Convex Function Minimization

Später ansehen

Favorit

Shinsaku Sakaue, …

ICML 2023 2 years ago

CrossSplit: Mitigating Label Noise Memorization through Data Splitting

05:17

CrossSplit: Mitigating Label Noise Memorization through Data Splitting

Später ansehen

Favorit

ICML 2023 2 years ago

GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

05:11

GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

Später ansehen

Favorit

Salah Ghamizi, …

ICML 2023 2 years ago

NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition

04:25

NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition

Später ansehen

Favorit

Xinquan Huang, …

ICML 2023 2 years ago

SpotEM: Efficient Video Search for Episodic Memory

05:20

SpotEM: Efficient Video Search for Episodic Memory

Später ansehen

Favorit

Santhosh Kumar Ramakrishnan, …

ICML 2023 2 years ago

Reliable Measures of Spread in High Dimensional Latent Spaces

05:19

Reliable Measures of Spread in High Dimensional Latent Spaces

Später ansehen

Favorit

Anna Marbut, …

ICML 2023 2 years ago