Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang · A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning · SlidesLive

Nov 28, 2022

Speakers

Bo Liu

Xidong Feng

Jie Ren

About

Gradient-based Meta-RL (GMRL) refers to methods that maintain two-level optimisation procedures wherein the outer-loop meta-learner guides the inner-loop gradient-based reinforcement learner to achieve fast adaptations. In this paper, we develop a unified framework that describes variations of GMRL algorithms and points out that existing stochastic meta-gradient estimators adopted by GMRL are actually biased. Such meta-gradient bias comes from two sources: 1) the compositional bias incurred by t…
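
The two-level structure described in the abstract can be illustrated with a minimal toy sketch. It is not the paper's algorithm or any of its estimators: scalar quadratic losses stand in for RL objectives, everything is deterministic (so no estimator bias appears), and the names `inner_adapt`, `meta_gradient`, `meta_train`, `alpha`, and `beta` are hypothetical. It only shows how an outer loop differentiates through an inner gradient step.

```python
def inner_adapt(theta, alpha, inner_target):
    """Inner loop: one gradient step on the toy loss 0.5 * (theta - inner_target)**2."""
    grad = theta - inner_target
    return theta - alpha * grad


def meta_gradient(theta, alpha, inner_target, outer_target):
    """Exact gradient of the outer loss 0.5 * (adapted - outer_target)**2
    with respect to the initial theta, taken through the inner step."""
    adapted = inner_adapt(theta, alpha, inner_target)
    d_outer_d_adapted = adapted - outer_target
    d_adapted_d_theta = 1.0 - alpha  # derivative of the inner step w.r.t. theta
    return d_outer_d_adapted * d_adapted_d_theta


def meta_train(theta, alpha=0.5, beta=0.1, inner_target=1.0,
               outer_target=2.0, steps=2000):
    """Outer loop: gradient descent on the initialisation theta (assumed meta-parameter)."""
    for _ in range(steps):
        theta -= beta * meta_gradient(theta, alpha, inner_target, outer_target)
    return theta


if __name__ == "__main__":
    theta = meta_train(0.0)
    print("meta-learned initialisation:", theta)
    print("parameter after one inner step:", inner_adapt(theta, 0.5, 1.0))
```

In this toy the meta-gradient is available in closed form. In GMRL the corresponding quantities must be estimated from sampled trajectories, which is where the bias discussed in the abstract arises.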

Organizer

NeurIPS 2022
