Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Oral: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-012-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-012-alpha.b-cdn.net
      • sl-yoda-v3-stream-012-beta.b-cdn.net
      • 1338956956.rsc.cdn77.org
      • 1656830687.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Oral: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Oral: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning

            Apr 14, 2021

            Speakers

            MY

            Ming Yin

            Speaker · 0 followers

            YB

            Yu Bai

            Speaker · 0 followers

            YW

            Yu-Xiang Wang

            Speaker · 0 followers

            About

            The problem of \emph{Offline Policy Evaluation} (OPE) in Reinforcement Learning (RL) is a critical step towards applying RL in real life applications. Existing work on OPE mostly focus on evaluating a \emph{fixed} target policy $\pi$, which does not provide useful bounds for offline policy learning as $\pi$ will then be data-dependent. We address this problem by \emph{simultaneously} evaluating all policies in a policy class $\Pi$ --- uniform convergence in OPE --- and obtain nearly optimal erro…

            Organizer

            A2
            A2

            AISTATS 2021

            Account · 63 followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            About AISTATS 2021

            The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Hogwild! over Distributed Local Data Sets with Linearly Increasing Mini-Batch Sizes
            03:13

            Hogwild! over Distributed Local Data Sets with Linearly Increasing Mini-Batch Sizes

            Nhuong V. Nguyen, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Animal pose estimation from video data with a hierarchical von Mises-Fisher-Gaussian model
            03:23

            Animal pose estimation from video data with a hierarchical von Mises-Fisher-Gaussian model

            Libby Zhang, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Approximate Message Passing with Spectral Initialization for Generalized Linear Models
            02:55

            Approximate Message Passing with Spectral Initialization for Generalized Linear Models

            Marco Mondelli, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Generalization Bounds for Stochastic Saddle Point Problems
            02:55

            Generalization Bounds for Stochastic Saddle Point Problems

            Junyu Zhang, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Shapley Flow: A Graph-based Approach to Interpreting Model Predictions
            03:01

            Shapley Flow: A Graph-based Approach to Interpreting Model Predictions

            Jiaxuan Wang, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Oral: Right Decisions from Wrong Predictions: A Mechanism Design Alternative to Individual Calibration
            16:19

            Oral: Right Decisions from Wrong Predictions: A Mechanism Design Alternative to Individual Calibration

            Shengjia Zhao, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow AISTATS 2021