Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning

            Nov 28, 2022

            Sprecher:innen

            BL
            BL

            Bo Liu

            Řečník · 1 sledující

            XF

            Xidong Feng

            Řečník · 0 sledujících

            JR

            Jie Ren

            Řečník · 0 sledujících

            Über

            Gradient-based Meta-RL (GMRL) refers to methods that maintain two-level optimisation procedures wherein the outer-loop meta-learner guides the inner-loop gradient-based reinforcement learner to achieve fast adaptations. In this paper, we develop a unified framework that describes variations of GMRL algorithms and points out that existing stochastic meta-gradient estimators adopted by GMRL are actually biased. Such meta-gradient bias comes from two sources: 1) the compositional bias incurred by t…

            Organisator

            N2
            N2

            NeurIPS 2022

            Účet · 961 sledujících

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            A Finite-Particle Convergence Rate for Stein Variational Gradient Descent
            04:54

            A Finite-Particle Convergence Rate for Stein Variational Gradient Descent

            Jiaxin Shi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Score-based Generative Models and Their Applications
            29:07

            Score-based Generative Models and Their Applications

            Chenlin Meng

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure
            03:41

            Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure

            Paul Novello, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
            05:01

            Near-Optimal Randomized Exploration for Tabular Markov Decision Processes

            Zhihan Xiong, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Causal Analysis of the TOPCAT Trial: Spironolactone for Preserved Cardiac Function Heart Failure
            13:32

            Causal Analysis of the TOPCAT Trial: Spironolactone for Preserved Cardiac Function Heart Failure

            Francesca Raimondi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 1 diváků, což je 0.1 %

            Biologically plausible solutions for spiking networks with efficient coding
            05:06

            Biologically plausible solutions for spiking networks with efficient coding

            Veronika Koren, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interessiert an Vorträgen wie diesem? NeurIPS 2022 folgen