Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Oral: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-012-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-012-alpha.b-cdn.net
      • sl-yoda-v3-stream-012-beta.b-cdn.net
      • 1338956956.rsc.cdn77.org
      • 1656830687.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Oral: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Oral: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning

            Apr 14, 2021

            Speakers

            MY

            Ming Yin

            Speaker · 0 followers

            YB

            Yu Bai

            Speaker · 0 followers

            YW

            Yu-Xiang Wang

            Speaker · 0 followers

            About

            The problem of \emph{Offline Policy Evaluation} (OPE) in Reinforcement Learning (RL) is a critical step towards applying RL in real life applications. Existing work on OPE mostly focus on evaluating a \emph{fixed} target policy $\pi$, which does not provide useful bounds for offline policy learning as $\pi$ will then be data-dependent. We address this problem by \emph{simultaneously} evaluating all policies in a policy class $\Pi$ --- uniform convergence in OPE --- and obtain nearly optimal erro…

            Organizer

            A2
            A2

            AISTATS 2021

            Account · 63 followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            About AISTATS 2021

            The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent
            02:48

            Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent

            Fred Kunstner, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
            03:07

            Provably Efficient Safe Exploration via Primal-Dual Policy Optimization

            Dongsheng Ding, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Projection-Free Optimization on Uniformly Convex Sets
            03:08

            Projection-Free Optimization on Uniformly Convex Sets

            Thomas Kerdreux, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Generalization of Quasi-Newton Methods: Application to Robust Symmetric Multisecant Updates
            03:28

            Generalization of Quasi-Newton Methods: Application to Robust Symmetric Multisecant Updates

            Damien Scieur, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Federated Multi-armed Bandits with Personalization
            02:52

            Federated Multi-armed Bandits with Personalization

            Chengshuai Shi, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data
            03:17

            Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data

            Yu Gong, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow AISTATS 2021