Mikael Henaff, Minqi Jiang, Roberta Raileanu · A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-007-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-007-alpha.b-cdn.net
sl-yoda-v2-stream-007-beta.b-cdn.net
1678031076.rsc.cdn77.org
1932936657.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

Jul 25, 2023

Sprecher:innen

Mikael Henaff

Sprecher:in · 0 Follower:innen

Minqi Jiang

Sprecher:in · 0 Follower:innen

Roberta Raileanu

Sprecher:in · 0 Follower:innen

Über

Exploration in environments which differ across episodes has received increasing attention in recent years. Current methods use some combination of global novelty bonuses, computed using the agent's entire training experience, and episodic novelty bonuses, computed using only experience from the current episode. However, the use of these two types of bonuses has been ad-hoc and poorly understood. In this work, we shed light on the behavior of these two types of bonuses through controlled experim…

Organisator

ICML 2023

Konto · 657 Follower:innen

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

05:48

Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

Später ansehen

Favorit

ICML 2023 2 years ago

Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains

04:49

Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains

Später ansehen

Favorit

Vishwaraj Doshi, …

ICML 2023 2 years ago

Panel Discussion on Privacy

58:24

Panel Discussion on Privacy

Später ansehen

Favorit

Kristen Vaccaro, …

ICML 2023 2 years ago

Self-Supervised Learning in Vision: from Research Advances to Best Practices

1:52:07

Self-Supervised Learning in Vision: from Research Advances to Best Practices

Später ansehen

Favorit

Xinlei Chen, …

ICML 2023 2 years ago

Spatial Implicit Neural Representations for Global-Scale Species Mapping

05:15

Spatial Implicit Neural Representations for Global-Scale Species Mapping

Später ansehen

Favorit

Elijah Cole, …

ICML 2023 2 years ago

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

08:35

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

Später ansehen

Favorit

Ying Sheng, …

ICML 2023 2 years ago