Keiran Paster, Silviu Pitis, Sheila McIlraith, Jimmy Ba · Return Augmentation gives Supervised RL Temporal Compositionality · SlidesLive

Return Augmentation gives Supervised RL Temporal Compositionality

December 2, 2022

Speakers

Keiran Paster

Silviu Pitis

Sheila McIlraith

About the presentation

Offline Reinforcement Learning (RL) methods that use supervised learning or sequence modeling (e.g., Decision Transformer) work by training a return-conditioned policy. A fundamental limitation of these approaches, as compared to value-based methods, is that they have trouble generalizing to behaviors that have a higher return than what was seen at training. Value-based offline-RL algorithms like CQL use bootstrapping to combine training data from multiple trajectories to learn strong behaviors…
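As a rough illustration of what "training a return-conditioned policy" involves, the sketch below relabels logged trajectories with their undiscounted return-to-go and turns them into supervised (state, target return) -> action pairs. It is not the authors' method or code: the function names `returns_to_go` and `build_dataset`, the (state, action, reward) tuple layout, and the choice of undiscounted returns are all assumptions made for this example.

```python
from typing import Any, List, Tuple

# Hypothetical layout: a trajectory is a list of (state, action, reward) steps.
Step = Tuple[Any, Any, float]


def returns_to_go(rewards: List[float]) -> List[float]:
    """Undiscounted return-to-go: rtg[t] is the sum of rewards[t:]."""
    rtg = [0.0] * len(rewards)
    running = 0.0
    # Accumulate from the last step backwards.
    for t in range(len(rewards) - 1, -1, -1):
        running += rewards[t]
        rtg[t] = running
    return rtg


def build_dataset(trajectories: List[List[Step]]) -> List[Tuple[Tuple[Any, float], Any]]:
    """Turn trajectories into ((state, return-to-go), action) training pairs."""
    pairs = []
    for traj in trajectories:
        rtg = returns_to_go([reward for _, _, reward in traj])
        for (state, action, _), g in zip(traj, rtg):
            # The policy input is the state plus the return actually achieved
            # from this step onward; the label is the logged action.
            pairs.append(((state, g), action))
    return pairs
```

Under these assumptions, every target return in the dataset is one that a single logged trajectory actually achieved, which is one way to see why conditioning on a return higher than any in the data asks the policy to extrapolate.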

Organizer

NeurIPS 2022
