Mikael Henaff, Minqi Jiang, Roberta Raileanu · A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs · SlidesLive

July 25, 2023

Speakers

Mikael Henaff

Minqi Jiang

Roberta Raileanu

About the presentation

Exploration in environments which differ across episodes has received increasing attention in recent years. Current methods use some combination of global novelty bonuses, computed using the agent's entire training experience, and episodic novelty bonuses, computed using only experience from the current episode. However, the use of these two types of bonuses has been ad-hoc and poorly understood. In this work, we shed light on the behavior of these two types of bonuses through controlled experim…
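
To make the distinction between the two bonus types concrete, here is a minimal sketch. The abstract does not say how either bonus is computed, so this assumes a simple count-based bonus of 1/sqrt(N) over hashable states; the class name `CountBonuses` and its methods are hypothetical and are not taken from the talk or the paper. The only difference between the two bonuses is which visit counts they read: counts kept for the whole of training, or counts reset at the start of every episode.

```python
from collections import Counter
from math import sqrt


class CountBonuses:
    """Count-based novelty bonuses (illustrative assumption, not the authors' method)."""

    def __init__(self):
        self.global_counts = Counter()    # persists across all episodes
        self.episodic_counts = Counter()  # cleared at the start of each episode

    def start_episode(self):
        # Only the episodic memory is reset; global counts keep accumulating.
        self.episodic_counts.clear()

    def observe(self, state):
        """Record one visit and return (global_bonus, episodic_bonus)."""
        self.global_counts[state] += 1
        self.episodic_counts[state] += 1
        return (1.0 / sqrt(self.global_counts[state]),
                1.0 / sqrt(self.episodic_counts[state]))


bonuses = CountBonuses()
bonuses.start_episode()
first = bonuses.observe("s0")    # first visit ever: both bonuses are maximal
bonuses.observe("s0")            # second visit within the same episode
bonuses.start_episode()
revisit = bonuses.observe("s0")  # first visit in a new episode, third overall
```

In this sketch, `revisit` has a full episodic bonus but a reduced global bonus, because only the episodic counts were cleared between episodes.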

Organizer

ICML 2023
