Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Bayesian Q-learning With Imperfect Expert Demonstrations
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Bayesian Q-learning With Imperfect Expert Demonstrations
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Bayesian Q-learning With Imperfect Expert Demonstrations

            Dec 2, 2022

            Speakers

            FC

            Fengdi Che

            Sprecher:in · 0 Follower:innen

            XZ

            Xiru Zhu

            Sprecher:in · 0 Follower:innen

            DP

            Doina Precup

            Sprecher:in · 17 Follower:innen

            About

            Guided exploration with expert demonstrations improves data efficiency for reinforcement learning, but current algorithms often overuse expert information. We propose a novel algorithm to speed up Q-learning with the help of a limited amount of imperfect expert demonstrations. The algorithm is based on a Bayesian framework to model suboptimal expert actions and derives Q-values' update rules by maximizing the posterior probability. It weighs expert information by the uncertainty of learnt Q-valu…

            Organizer

            N2
            N2

            NeurIPS 2022

            Konto · 961 Follower:innen

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Positively Weighted Kernel Quadrature via Subsampling
            05:02

            Positively Weighted Kernel Quadrature via Subsampling

            Satoshi Hayakawa, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Self-Supervised Fair Representation Learning without Demographics
            04:48

            Self-Supervised Fair Representation Learning without Demographics

            Junyi Chai, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            The Benefits of Model-Based Generalization in Reinforcement Learning
            04:22

            The Benefits of Model-Based Generalization in Reinforcement Learning

            Kenny Young, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            An Analysis of Social Biases Present in BERT Variants Across Multiple Languages
            10:35

            An Analysis of Social Biases Present in BERT Variants Across Multiple Languages

            Parishad BehnamGhader, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Closing remarks
            04:05

            Closing remarks

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interested in talks like this? Follow NeurIPS 2022