Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-003-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-003-alpha.b-cdn.net
      • sl-yoda-v2-stream-003-beta.b-cdn.net
      • 1544410162.rsc.cdn77.org
      • 1005514182.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

            Nov 28, 2022

            Speakers

            JC

            Jinglin Chen

            Speaker · 0 followers

            AM

            Aditya Modi

            Speaker · 0 followers

            AK

            Akshay Krishnamurthy

            Speaker · 5 followers

            About

            We study reward-free reinforcement learning (RL) under general non-linear function approximation, and establish sample efficiency and hardness results under various standard structural assumptions. On the positive side, we propose the RFOLIVE (Reward-Free OLIVE) algorithm for sample-efficient reward-free exploration under minimal structural assumptions, which covers the previously studied settings of linear MDPs (Jin et al., 2020b), linear completeness (Zanette et al., 2020b) and low-rank MDPs w…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 952 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Recommender Forest for Efficient Retrieval
            04:37

            Recommender Forest for Efficient Retrieval

            Chao Feng, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            The Lakota AI Code Camp
            1:04:11

            The Lakota AI Code Camp

            Michael Running Wolf, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Personalized Online Federated Multi-Federated Learning with Multiple Kernels
            04:06

            Personalized Online Federated Multi-Federated Learning with Multiple Kernels

            Pouya M. Ghari, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Mutual Information Divergence: A Unified Metric for Multimodal Generative Models
            04:58

            Mutual Information Divergence: A Unified Metric for Multimodal Generative Models

            Jin-Hwa Kim, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Generating High Fidelity Synthetic Data via Coreset selection and Entropic Regularization
            02:22

            Generating High Fidelity Synthetic Data via Coreset selection and Entropic Regularization

            Omead Pooladzandi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Federated Submodel Optimization for Hot and Cold Data Features
            04:57

            Federated Submodel Optimization for Hot and Cold Data Features

            Yucheng Ding, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022