Alekh Agarwal, Tong Zhang · Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling · SlidesLive

Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling

Jul 2, 2022

Speakers

Alekh Agarwal

Tong Zhang

About

Provably sample-efficient Reinforcement Learning (RL) with rich observations and function approximation has witnessed tremendous recent progress, particularly when the underlying function approximators are linear. In this linear regime, computationally and statistically efficient methods exist where the potentially infinite state and action spaces can be captured through a known feature embedding, with the sample complexity scaling with the (intrinsic) dimension of these features. When the actio…

Organizer

COLT

About COLT

The conference has been held annually since 1988 and has become the leading conference on learning theory by maintaining a highly selective submission process. It is committed to high-quality articles in all theoretical aspects of machine learning and related topics.

Recommended Videos

Presentations on similar topic, category or speaker

Covariance-adapting algorithm for semi-bandits with application to sparse outcomes

13:29

Michal Valko, …

COLT 5 years ago

Optimal SQ Lower Bounds for Learning Halfspaces with Massart Noise

16:45

Rajai Nasser, …

COLT 3 years ago

Information Complexity of VC Learning

04:37

Lydia Zakynthinou, …

COLT 5 years ago

Active Local Learning

00:55

Arturs Backurs, …

COLT 5 years ago

A bounded noise mechanism for differential privacy

19:38

Yuval Dagan, …

COLT 3 years ago

Chasing Convex Bodies and Functions with Black-Box Advice

20:38

Nicolas Christianson, …

COLT 3 years ago