Kenny Young, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber · The Benefits of Model-Based Generalization in Reinforcement Learning · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: The Benefits of Model-Based Generalization in Reinforcement Learning

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-007-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-007-alpha.b-cdn.net
sl-yoda-v2-stream-007-beta.b-cdn.net
1678031076.rsc.cdn77.org
1932936657.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

The Benefits of Model-Based Generalization in Reinforcement Learning

The Benefits of Model-Based Generalization in Reinforcement Learning

24. července 2023

Řečníci

Kenny Young

Sprecher:in · 0 Follower:innen

Aditya Ramesh

Sprecher:in · 1 Follower:in

Louis Kirsch

Sprecher:in · 0 Follower:innen

O prezentaci

Model-Based Reinforcement Learning (RL) is widely believed to have the potential to improve sample efficiency by allowing an agent to synthesize large amounts of imagined experience. Experience Replay (ER) can be considered a simple kind of model, which has proved effective at improving the stability and efficiency of deep RL. In principle, a learned parametric model could improve on ER by generalizing from real experience to augment the dataset with additional plausible experience. However, giv…

Organizátor

ICML 2023

Konto · 657 Follower:innen

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Data-Efficient Contrastive Self-supervised Learning

05:14

Data-Efficient Contrastive Self-supervised Learning

Später ansehen

Favorit

Siddharth Joshi, …

ICML 2023 2 years ago

Reliable Measures of Spread in High Dimensional Latent Spaces

05:19

Reliable Measures of Spread in High Dimensional Latent Spaces

Später ansehen

Favorit

Anna Marbut, …

ICML 2023 2 years ago

Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains

04:49

Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains

Später ansehen

Favorit

Vishwaraj Doshi, …

ICML 2023 2 years ago

SIM-CNN: Self-Supervised Individualized Multimodal Learning for Stress Prediction on Nurses Using Biosignals

09:54

SIM-CNN: Self-Supervised Individualized Multimodal Learning for Stress Prediction on Nurses Using Biosignals

Später ansehen

Favorit

Sunmin Eom, …

ICML 2023 2 years ago

Improving Open Language Models by Learning from Organic Interactions

40:42

Improving Open Language Models by Learning from Organic Interactions

Später ansehen

Favorit

Jason Weston, …

ICML 2023 2 years ago

Open-Vocabulary Universal Image Segmentation with MaskCLIP

05:16

Open-Vocabulary Universal Image Segmentation with MaskCLIP

Später ansehen

Favorit

Zheng Ding, …

ICML 2023 2 years ago