Vincent Zhuang, Yanan Sui · No-Regret Reinforcement Learning with Heavy-Tailed Rewards · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: No-Regret Reinforcement Learning with Heavy-Tailed Rewards

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-014-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-014-alpha.b-cdn.net
sl-yoda-v3-stream-014-beta.b-cdn.net
1978117156.rsc.cdn77.org
1243944885.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

No-Regret Reinforcement Learning with Heavy-Tailed Rewards

No-Regret Reinforcement Learning with Heavy-Tailed Rewards

Apr 14, 2021

Speakers

Vincent Zhuang

Speaker · 0 followers

Yanan Sui

Speaker · 0 followers

About

Reinforcement learning algorithms typically assume rewards to be sampled from light-tailed distributions, such as Gaussian or bounded. However, a wide variety of real-world systems generate rewards that follow heavy-tailed distributions. We consider such scenarios in the setting of undiscounted reinforcement learning. By constructing a lower bound, we show that the difficulty of learning heavy-tailed rewards asymptotically dominates the difficulty of learning transition probabilities. Leveraging…

Organizer

AISTATS 2021

Account · 63 followers

Categories

AI & Data Science

Category · 10.8k presentations

Mathematics

Category · 2.4k presentations

About AISTATS 2021

The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Communication Efficient Primal-Dual Algorithm for Nonconvex Nonsmooth Distributed Optimization

03:01

Communication Efficient Primal-Dual Algorithm for Nonconvex Nonsmooth Distributed Optimization

Watch later

Favorite

Congliang Chen, …

AISTATS 2021 4 years ago

On the Faster Alternating Least-Squares for CCA

02:55

On the Faster Alternating Least-Squares for CCA

Watch later

Favorite

Zhiqiang Xu, …

AISTATS 2021 4 years ago

On the Memory Mechanism of Tensor-Power Recurrent Models

03:04

On the Memory Mechanism of Tensor-Power Recurrent Models

Watch later

Favorite

AISTATS 2021 4 years ago

Latent Gaussian process with composite likelihoods and numerical quadrature

03:01

Latent Gaussian process with composite likelihoods and numerical quadrature

Watch later

Favorite

Siddharth Ramchandran, …

AISTATS 2021 4 years ago

ChEES-HMC: What to Do if Your GPU Is Allergic to NUTS

03:33

ChEES-HMC: What to Do if Your GPU Is Allergic to NUTS

Watch later

Favorite

Matthew Hoffman, …

AISTATS 2021 4 years ago

Context-Specific Likelihood Weighting

02:44

Context-Specific Likelihood Weighting

Watch later

Favorite

Nitesh Kumar, …

AISTATS 2021 4 years ago