Fengdi Che, Xiru Zhu, Doina Precup, David Meger, Gregory Dudek · Bayesian Q-learning With Imperfect Expert Demonstrations · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Bayesian Q-learning With Imperfect Expert Demonstrations

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-007-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-007-alpha.b-cdn.net
sl-yoda-v2-stream-007-beta.b-cdn.net
1678031076.rsc.cdn77.org
1932936657.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Bayesian Q-learning With Imperfect Expert Demonstrations

Bayesian Q-learning With Imperfect Expert Demonstrations

Dec 2, 2022

Speakers

Fengdi Che

Sprecher:in · 0 Follower:innen

Xiru Zhu

Sprecher:in · 0 Follower:innen

Doina Precup

Sprecher:in · 17 Follower:innen

About

Guided exploration with expert demonstrations improves data efficiency for reinforcement learning, but current algorithms often overuse expert information. We propose a novel algorithm to speed up Q-learning with the help of a limited amount of imperfect expert demonstrations. The algorithm is based on a Bayesian framework to model suboptimal expert actions and derives Q-values' update rules by maximizing the posterior probability. It weighs expert information by the uncertainty of learnt Q-valu…

Organizer

NeurIPS 2022

Konto · 961 Follower:innen

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Positively Weighted Kernel Quadrature via Subsampling

05:02

Positively Weighted Kernel Quadrature via Subsampling

Später ansehen

Favorit

Satoshi Hayakawa, …

NeurIPS 2022 2 years ago

Behavioral Engagement and Manifold Representation in the Hippocampus: Evidence from the Mutual Information of Population Encoding and Location

03:05

Behavioral Engagement and Manifold Representation in the Hippocampus: Evidence from the Mutual Information of Population Encoding and Location

Später ansehen

Favorit

Shagesh Sridharan

NeurIPS 2022 2 years ago

Self-Supervised Fair Representation Learning without Demographics

04:48

Self-Supervised Fair Representation Learning without Demographics

Später ansehen

Favorit

Junyi Chai, …

NeurIPS 2022 2 years ago

The Benefits of Model-Based Generalization in Reinforcement Learning

04:22

The Benefits of Model-Based Generalization in Reinforcement Learning

Später ansehen

Favorit

Kenny Young, …

NeurIPS 2022 2 years ago

An Analysis of Social Biases Present in BERT Variants Across Multiple Languages

10:35

An Analysis of Social Biases Present in BERT Variants Across Multiple Languages

Später ansehen

Favorit

Parishad BehnamGhader, …

NeurIPS 2022 2 years ago

Closing remarks

04:05

Closing remarks

Später ansehen

Favorit

NeurIPS 2022 2 years ago