Jinglin Chen, Aditya Modi, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal · On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL · SlidesLive

On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

Nov 28, 2022

Speakers

Jinglin Chen

Aditya Modi

Akshay Krishnamurthy

About

We study reward-free reinforcement learning (RL) under general non-linear function approximation, and establish sample efficiency and hardness results under various standard structural assumptions. On the positive side, we propose the RFOLIVE (Reward-Free OLIVE) algorithm for sample-efficient reward-free exploration under minimal structural assumptions, which covers the previously studied settings of linear MDPs (Jin et al., 2020b), linear completeness (Zanette et al., 2020b) and low-rank MDPs w…

Organizer

NeurIPS 2022
