Philip Amortila, Nan Jiang, Dhruv Madeka, Dean Foster · A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

Nov 28, 2022

Speakers

Philip Amortila

Nan Jiang

Dhruv Madeka

About

The current paper studies sample-efficient Reinforcement Learning (RL) in settings where only the optimal value function is assumed to be linearly-realizable. It has recently been understood that, even under this seemingly strong assumption and access to a generative model, worst-case sample complexities can be prohibitively (i.e., exponentially) large. We investigate the setting where the learner additionally has access to interactive demonstrations from an expert policy, and we present a stati…

Organizer

NeurIPS 2022
