Christoph Dann, Teodor Vanislavov Marinov, Mehryar Mohri, Julian Zimmert · Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v3-stream-016-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v3-stream-016-alpha.b-cdn.net
sl-yoda-v3-stream-016-beta.b-cdn.net
1504562137.rsc.cdn77.org
1896834465.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

6. prosince 2021

Řečníci

Christoph Dann

Řečník · 0 sledujících

Teodor Vanislavov Marinov

Řečník · 0 sledujících

Mehryar Mohri

Řečník · 4 sledující

O prezentaci

We provide improved gap-dependent regret bounds for reinforcement learning in finite episodic Markov decision processes. Compared to prior work, our bounds depend on alternative definitions of gaps. These definitions are based on the insight that, in order to achieve a favorable regret, an algorithm does not need to learn how to behave optimally in states that are not reached by an optimal policy. We prove tighter upper regret bounds for optimistic algorithms and accompany them with new informat…

Organizátor

NeurIPS 2021

Účet · 1,9k sledujících

O organizátorovi (NeurIPS 2021)

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Learning from Data through the Lens of Ocean Models, Surrogates, and their Derivatives

36:37

Learning from Data through the Lens of Ocean Models, Surrogates, and their Derivatives

Zhlédnout později

Oblíbené

Patrick Heimbach

NeurIPS 2021 3 years ago

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation

04:53

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation

Zhlédnout později

Oblíbené

NeurIPS 2021 3 years ago

LSH methods for data deduplication in a Wikipedia artificial dataset

02:39

LSH methods for data deduplication in a Wikipedia artificial dataset

Zhlédnout později

Oblíbené

NeurIPS 2021 3 years ago

Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

15:05

Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

Zhlédnout později

Oblíbené

Ferran Alet, …

NeurIPS 2021 3 years ago

Data-driven adversarial regularization for inverse problems

36:23

Data-driven adversarial regularization for inverse problems

Zhlédnout později

Oblíbené

Carole-Bibiane Schonlieb

NeurIPS 2021 3 years ago

PCA Retargeting: Encoding Linear Shape Models as Convolutional Mesh Autoencoders

19:50

PCA Retargeting: Encoding Linear Shape Models as Convolutional Mesh Autoencoders

Zhlédnout později

Oblíbené

Eimear O'Sullivan

NeurIPS 2021 3 years ago