Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: No-Regret Reinforcement Learning with Heavy-Tailed Rewards
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-014-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-014-alpha.b-cdn.net
      • sl-yoda-v3-stream-014-beta.b-cdn.net
      • 1978117156.rsc.cdn77.org
      • 1243944885.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            No-Regret Reinforcement Learning with Heavy-Tailed Rewards
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            No-Regret Reinforcement Learning with Heavy-Tailed Rewards

            Apr 14, 2021

            Speakers

            VZ

            Vincent Zhuang

            Speaker · 0 followers

            YS

            Yanan Sui

            Speaker · 0 followers

            About

            Reinforcement learning algorithms typically assume rewards to be sampled from light-tailed distributions, such as Gaussian or bounded. However, a wide variety of real-world systems generate rewards that follow heavy-tailed distributions. We consider such scenarios in the setting of undiscounted reinforcement learning. By constructing a lower bound, we show that the difficulty of learning heavy-tailed rewards asymptotically dominates the difficulty of learning transition probabilities. Leveraging…

            Organizer

            A2
            A2

            AISTATS 2021

            Account · 63 followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            Mathematics

            Category · 2.4k presentations

            About AISTATS 2021

            The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Communication Efficient Primal-Dual Algorithm for Nonconvex Nonsmooth Distributed Optimization
            03:01

            Communication Efficient Primal-Dual Algorithm for Nonconvex Nonsmooth Distributed Optimization

            Congliang Chen, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On the Faster Alternating Least-Squares for CCA
            02:55

            On the Faster Alternating Least-Squares for CCA

            Zhiqiang Xu, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On the Memory Mechanism of Tensor-Power Recurrent Models
            03:04

            On the Memory Mechanism of Tensor-Power Recurrent Models

            Hejia Qiu, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Latent Gaussian process with composite likelihoods and numerical quadrature
            03:01

            Latent Gaussian process with composite likelihoods and numerical quadrature

            Siddharth Ramchandran, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            ChEES-HMC: What to Do if Your GPU Is Allergic to NUTS
            03:33

            ChEES-HMC: What to Do if Your GPU Is Allergic to NUTS

            Matthew Hoffman, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Context-Specific Likelihood Weighting
            02:44

            Context-Specific Likelihood Weighting

            Nitesh Kumar, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow AISTATS 2021