Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang · A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning · SlidesLive


A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning


November 28, 2022

Speakers

Bo Liu


Xidong Feng


Jie Ren


About the presentation

Gradient-based Meta-RL (GMRL) refers to methods that maintain two-level optimisation procedures wherein the outer-loop meta-learner guides the inner-loop gradient-based reinforcement learner to achieve fast adaptations. In this paper, we develop a unified framework that describes variations of GMRL algorithms and points out that existing stochastic meta-gradient estimators adopted by GMRL are actually biased. Such meta-gradient bias comes from two sources: 1) the compositional bias incurred by t…
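The abstract is cut off before it finishes describing the bias sources, so the sketch below does not reproduce the paper's estimators or analysis. It is only a toy illustration, under assumptions made here, of one generic way a two-level estimator can be biased: a noisy inner-loop gradient step is passed through a nonlinear outer objective. The names `inner_update`, `meta_gradient`, `phi`, `alpha`, and `sigma`, the quadratic inner loss, the quartic outer loss, and the two-point noise model are all hypothetical choices for illustration.

```python
def inner_update(phi, alpha, noise=0.0):
    """One inner-loop gradient step on the toy inner loss 0.5 * phi**2.

    The exact inner gradient is phi; `noise` models sampling error in a
    stochastic estimate of it (an assumption of this sketch).
    """
    grad_estimate = phi + noise
    return phi - alpha * grad_estimate


def meta_gradient(phi, alpha, noise=0.0):
    """Gradient of the toy outer loss 0.25 * theta**4 with respect to phi.

    theta = inner_update(phi, ...), and d(theta)/d(phi) = 1 - alpha because
    the noise is treated as independent of phi.
    """
    theta = inner_update(phi, alpha, noise)
    return (1.0 - alpha) * theta ** 3


def expected_stochastic_meta_gradient(phi, alpha, sigma):
    """Exact expectation of the noisy meta-gradient under two-point noise.

    The noise takes the values +sigma and -sigma with probability 0.5 each,
    so it has zero mean and the expectation is a plain average.
    """
    plus = meta_gradient(phi, alpha, sigma)
    minus = meta_gradient(phi, alpha, -sigma)
    return 0.5 * (plus + minus)


phi, alpha, sigma = 1.0, 0.5, 1.0

# Meta-gradient computed with the exact (noise-free) inner gradient.
exact = meta_gradient(phi, alpha)

# Average meta-gradient when the inner gradient is only estimated.
expected = expected_stochastic_meta_gradient(phi, alpha, sigma)

# The inner-loop noise has zero mean, yet the meta-gradient is off on average.
bias = expected - exact
print(f"exact={exact}, expected={expected}, bias={bias}")
```

In this toy setting the gap equals 3 (1 - alpha) alpha^2 sigma^2 theta, where theta is the noise-free inner update, so it vanishes only when the inner-loop noise does. With a quadratic outer loss the outer gradient would be linear in theta and the gap would be zero, which is why the sketch uses a quartic one.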

Organizer

NeurIPS 2022

