Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov · Anti-Exploration by Random Network Distillation · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Anti-Exploration by Random Network Distillation

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Anti-Exploration by Random Network Distillation

Anti-Exploration by Random Network Distillation

24. července 2023

Řečníci

Alexander Nikulin

Sprecher:in · 0 Follower:innen

Vladislav Kurenkov

Sprecher:in · 0 Follower:innen

Denis Tarasov

Sprecher:in · 0 Follower:innen

O prezentaci

Despite the success of Random Network Distillation (RND) in various domains, it was shown as not discriminative enough to be used as an uncertainty estimator for penalizing out-of-distribution actions in offline reinforcement learning. In this paper, we revisit these results and show that, with a naive choice of conditioning for the RND prior, it becomes infeasible for the actor to effectively minimize the anti-exploration bonus and discriminativity is not an issue. We show that this limitation…

Organizátor

ICML 2023

Konto · 657 Follower:innen

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Optimal randomized multilevel Monte Carlo estimators for repeatedly nested expectations

05:07

Optimal randomized multilevel Monte Carlo estimators for repeatedly nested expectations

Später ansehen

Favorit

ICML 2023 2 years ago

Latent Traversals in Generative Models as Potential Flows

05:18

Latent Traversals in Generative Models as Potential Flows

Später ansehen

Favorit

ICML 2023 2 years ago

Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron

05:21

Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron

Später ansehen

Favorit

Jingfeng Wu, …

ICML 2023 2 years ago

BNN-DP: Robustness Certification of Bayesian Neural Networks via Dynamic Programming

05:13

BNN-DP: Robustness Certification of Bayesian Neural Networks via Dynamic Programming

Später ansehen

Favorit

Steven Adams, …

ICML 2023 2 years ago

Diffusion Models are Minimax Optimal Distribution Estimators

08:25

Diffusion Models are Minimax Optimal Distribution Estimators

Später ansehen

Favorit

Kazusato Oko, …

ICML 2023 2 years ago

A closer look at few-shot classification again

05:15

A closer look at few-shot classification again

Später ansehen

Favorit

ICML 2023 2 years ago