Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: A Framework for Predictable Actor-Critic Control
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            A Framework for Predictable Actor-Critic Control
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            A Framework for Predictable Actor-Critic Control

            Dec 2, 2022

            Speakers

            JC

            Josiah Coad

            Speaker · 0 followers

            JA

            James Ault

            Speaker · 0 followers

            JH

            Jeff Hykin

            Speaker · 0 followers

            About

            Reinforcement learning (RL) algorithms commonly provide a one-action plan per time step. Doing this allows the RL agent to quickly adapt and respond to stochastic environments yet it restricts the ability to predict the agent's future behavior. This paper proposes an actor-critic framework that predicts and follows an n-step plan. Committing to the next n actions presents a trade-off between behavior predictability and reduced performance. In order to balance this trade-off, a dynamic plan-follo…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds
            04:31

            Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds

            Aritra Mitra, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Reinforcement Learning Approaches for Traffic Signal Control under Missing Data
            03:42

            Reinforcement Learning Approaches for Traffic Signal Control under Missing Data

            Hao Mei, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer
            04:48

            Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer

            Doyup Lee, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Lifelong Learning Machines:  What can be learned from biology?
            22:12

            Lifelong Learning Machines: What can be learned from biology?

            Dhireesha Kudithipudi

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Stability Analysis and Generalization Bounds of Adversarial Training
            01:03

            Stability Analysis and Generalization Bounds of Adversarial Training

            Jiancong Xiao, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Robust Forecasting for Robotic Control: A Game-Theoretic Approach
            11:54

            Robust Forecasting for Robotic Control: A Game-Theoretic Approach

            Shubhankar Agarwal, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022