Niladri Chatterji, Aldo Pacchiano, Peter L. Bartlett, Michael I. Jordan · On the Theory of Reinforcement Learning with Once-per-Episode Feedback · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: On the Theory of Reinforcement Learning with Once-per-Episode Feedback

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-015-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-015-alpha.b-cdn.net
sl-yoda-v3-stream-015-beta.b-cdn.net
1963568160.rsc.cdn77.org
1940033649.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

On the Theory of Reinforcement Learning with Once-per-Episode Feedback

On the Theory of Reinforcement Learning with Once-per-Episode Feedback

Dec 6, 2021

Speakers

Niladri Chatterji

Speaker · 0 followers

Aldo Pacchiano

Speaker · 0 followers

Peter L. Bartlett

Speaker · 1 follower

About

We introduce a theory of reinforcement learning (RL) in which the learner receives feedback only once at the end of an episode. While this is an extreme test case for theory, it is also arguably more representative of real-world applications than the traditional requirement in RL practice that the learner receive feedback at every time step. Indeed, in many real-world applications of reinforcement learning, such as self driving cars and robotics, it is easier to evaluate whether a learner's comp…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Statistical Testing under Distributional Shifts

25:44

Statistical Testing under Distributional Shifts

Watch later

Favorite

NeurIPS 2021 3 years ago

The Limits of Optimal Pricing in the Dark

15:06

The Limits of Optimal Pricing in the Dark

Watch later

Favorite

Quinlan Dawkins, …

NeurIPS 2021 3 years ago

Modality-Agnostic Topology Aware Localization

11:06

Modality-Agnostic Topology Aware Localization

Watch later

Favorite

Farhad G. Zanjani, …

NeurIPS 2021 3 years ago

ResNet strikes back: An improved training procedure in timm

13:35

ResNet strikes back: An improved training procedure in timm

Watch later

Favorite

Ross Wightman, …

NeurIPS 2021 3 years ago

Model based Multi-agent Reinforcement Learning with Tensor Decompositions

05:12

Model based Multi-agent Reinforcement Learning with Tensor Decompositions

Watch later

Favorite

Pascal Van Der Vaart, …

NeurIPS 2021 3 years ago

Solving Soft Clustering Ensemble via k-Sparse Discrete Wasserstein Barycenter

12:11

Solving Soft Clustering Ensemble via k-Sparse Discrete Wasserstein Barycenter

Watch later

Favorite

Ruizhe Qin, …

NeurIPS 2021 3 years ago