            A Framework for Predictable Actor-Critic Control

            Dec 2, 2022

            Speakers

            Josiah Coad

            James Ault

            Jeff Hykin

            About

            Reinforcement learning (RL) algorithms commonly provide a one-action plan per time step. This allows the RL agent to adapt and respond quickly to stochastic environments, yet it restricts the ability to predict the agent's future behavior. This paper proposes an actor-critic framework that predicts and follows an n-step plan. Committing to the next n actions presents a trade-off between behavior predictability and reduced performance. In order to balance this trade-off, a dynamic plan-follo…
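
The idea of committing to the next n actions can be illustrated with a toy sketch. This is not the authors' framework: the 1-D environment, the greedy `plan_actions` stand-in for an actor, the noise model, and all parameter names are assumptions made for illustration, and the dynamic criterion mentioned at the end of the abstract is truncated in the source and is not modeled here. The sketch only shows a fixed commitment: the agent announces n actions, executes them without looking at the state again, and replans once the plan is exhausted.

```python
import random


def plan_actions(position, n):
    """Hypothetical actor stand-in: plan n moves toward 0 under a noise-free model."""
    plan = []
    for _ in range(n):
        if position > 0:
            action = -1
        elif position < 0:
            action = 1
        else:
            action = 0
        plan.append(action)
        position += action  # imagined next state, ignoring disturbances
    return plan


def rollout(n, steps=20, start=5, noise=0.3, seed=0):
    """Run one episode in which the agent commits to n actions at a time."""
    rng = random.Random(seed)  # seeded so that runs are reproducible
    position = start
    plan = []
    replans = 0
    executed = []
    total_cost = 0
    for _ in range(steps):
        if not plan:
            # The current plan is exhausted: observe the state and plan again.
            plan = plan_actions(position, n)
            replans += 1
        action = plan.pop(0)
        executed.append(action)
        # Assumed stochastic environment: occasional random push of one cell.
        disturbance = rng.choice([-1, 1]) if rng.random() < noise else 0
        position += action + disturbance
        total_cost += abs(position)  # cost is the distance from the goal at 0
    return {"replans": replans, "executed": executed, "cost": total_cost}


if __name__ == "__main__":
    for n in (1, 3, 5):
        result = rollout(n)
        print(n, result["replans"], result["cost"])
```

With `n=1` the agent reacts to every disturbance but its next action is known only one step ahead. With larger `n` an observer knows the next several actions in advance, while the agent cannot correct for disturbances until the plan runs out, which is the trade-off the abstract describes.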

            Organizer

            NeurIPS 2022