Andrea Zanette, Alessandro Lazaric, Mykel Kochenderfer, Emma Brunskill · Learning Near Optimal Policies with Low Inherent Bellman Error · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Learning Near Optimal Policies with Low Inherent Bellman Error

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v3-stream-011-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v3-stream-011-alpha.b-cdn.net
sl-yoda-v3-stream-011-beta.b-cdn.net
1150868944.rsc.cdn77.org
1511650057.rsc.cdn77.org

Titulky
Off
en

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Learning Near Optimal Policies with Low Inherent Bellman Error

Learning Near Optimal Policies with Low Inherent Bellman Error

12. července 2020

Řečníci

Andrea Zanette

Řečník · 0 sledujících

Alessandro Lazaric

Řečník · 2 sledující

Mykel Kochenderfer

Řečník · 0 sledujících

O prezentaci

We study the exploration problem with approximate linear action-value functions in episodic reinforcement learning under the notion of low inherent Bellman error, a condition normally employed to show convergence of approximate value iteration. We relate this condition to other common frameworks and show that it is strictly more general than the low rank (or linear) MDP assumption of prior work. We provide an algorithm with a rate optimal regret bound for this setting. While computational tracta…

Organizátor

ICML 2020

Účet · 2,7k sledujících

Kategorie

Ekonomika a podnikání

Kategorie · 3,8k prezentací

O organizátorovi (ICML 2020)

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

The Sample Complexity of Best-k Items Selection from Pairwise Comparisons

13:16

The Sample Complexity of Best-k Items Selection from Pairwise Comparisons

Zhlédnout později

Oblíbené

ICML 2020 5 years ago

PolyGen: An Autoregressive Generative Model of 3D Meshes

12:20

PolyGen: An Autoregressive Generative Model of 3D Meshes

Zhlédnout později

Oblíbené

Charlie Nash, …

ICML 2020 5 years ago

Time-aware Large Kernel Convolutions

12:25

Time-aware Large Kernel Convolutions

Zhlédnout později

Oblíbené

Vasileios Lioutas, …

ICML 2020 5 years ago

Quantum Boosting

13:47

Quantum Boosting

Zhlédnout později

Oblíbené

Srinivasan Arunachalam, …

ICML 2020 5 years ago

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

15:18

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

Zhlédnout později

Oblíbené

YuSun, …

ICML 2020 5 years ago

Invited talk: A New RNN Algorithm Using the Computational Inductive Bias of Span Independence

50:03

Invited talk: A New RNN Algorithm Using the Computational Inductive Bias of Span Independence

Zhlédnout později

Oblíbené

ICML 2020 5 years ago