Andrew Wagenmaker, Aldo Pacchiano · Leveraging Offline Data in Online Reinforcement Learning · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Leveraging Offline Data in Online Reinforcement Learning

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Leveraging Offline Data in Online Reinforcement Learning

Leveraging Offline Data in Online Reinforcement Learning

24. července 2023

Řečníci

Andrew Wagenmaker

Sprecher:in · 0 Follower:innen

Aldo Pacchiano

Sprecher:in · 0 Follower:innen

O prezentaci

Two central paradigms have emerged in the reinforcement learning (RL) community: online RL and offline RL. In the online RL setting, the agent has no prior knowledge of the environment, and must interact with it in order to find an ϵ-optimal policy. In the offline RL setting, the learner instead has access to a fixed dataset to learn from, but is unable to otherwise interact with the environment, and must obtain the best policy it can from this offline data. Practical scenarios often motivate an…

Organizátor

ICML 2023

Konto · 657 Follower:innen

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Deep Latent State Space Models for Time-Series Generation

05:08

Deep Latent State Space Models for Time-Series Generation

Später ansehen

Favorit

Linqi Zhou, …

ICML 2023 2 years ago

Continual Vision-Language Representation Learning with Off-Diagonal Information

02:30

Continual Vision-Language Representation Learning with Off-Diagonal Information

Später ansehen

Favorit

ICML 2023 2 years ago

MEWL: Few-shot multimodal word learning with referential uncertainty

04:32

MEWL: Few-shot multimodal word learning with referential uncertainty

Später ansehen

Favorit

Guangyuan Jiang, …

ICML 2023 2 years ago

Intuition for the Data Types and Interactions in Euclidean Symmetry-Equivariant Neural Networks

38:48

Intuition for the Data Types and Interactions in Euclidean Symmetry-Equivariant Neural Networks

Später ansehen

Favorit

ICML 2023 2 years ago

Closing Remarks

04:52

Closing Remarks

Später ansehen

Favorit

ICML 2023 2 years ago

Neural Image Compression: Generalization, Robustness, and Spectral Bias

14:08

Neural Image Compression: Generalization, Robustness, and Spectral Bias

Später ansehen

Favorit

Kelsey Lieberman, …

ICML 2023 2 years ago