Josiah Coad, James Ault, Jeff Hykin, Guni Sharon · A Framework for Predictable Actor-Critic Control · SlidesLive

Dec 2, 2022

Speakers

Josiah Coad

James Ault

Jeff Hykin

About

Reinforcement learning (RL) algorithms commonly provide a one-action plan per time step. This allows the RL agent to adapt and respond quickly to stochastic environments, yet it restricts the ability to predict the agent's future behavior. This paper proposes an actor-critic framework that predicts and follows an n-step plan. Committing to the next n actions presents a trade-off between behavior predictability and reduced performance. In order to balance this trade-off, a dynamic plan-follo…
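
The n-step commitment described above can be illustrated with a minimal sketch. This is not the authors' method: `plan_actions`, `run_episode`, and the toy integer environment are hypothetical names chosen for illustration, and the hand-written planner stands in for a learned actor. The sketch shows only the commitment pattern itself: observe the state once, emit n actions, execute them without re-observing, then replan.

```python
from typing import Callable, List, Tuple


def plan_actions(state: int, n: int) -> List[int]:
    """Hypothetical stand-in for a learned actor.

    Returns n actions in {-1, 0, +1} that move toward 0, using the
    planner's own prediction of the next state (state + action).
    """
    plan = []
    predicted = state
    for _ in range(n):
        if predicted > 0:
            action = -1
        elif predicted < 0:
            action = 1
        else:
            action = 0
        plan.append(action)
        predicted += action
    return plan


def run_episode(
    start: int,
    n: int,
    horizon: int,
    step: Callable[[int, int], int] = lambda s, a: s + a,
) -> Tuple[int, List[List[int]]]:
    """Follow committed n-step plans for `horizon` time steps.

    The state is observed only when a new plan is made; the actions
    inside a plan are executed open-loop. Returns the final state and
    the list of committed plans.
    """
    state = start
    committed = []
    t = 0
    while t < horizon:
        # Truncate the last plan so the episode ends at `horizon`.
        plan = plan_actions(state, n)[: horizon - t]
        committed.append(plan)
        for action in plan:
            state = step(state, action)
            t += 1
    return state, committed


if __name__ == "__main__":
    # Assumed toy disturbance: the environment drifts by +1 each step.
    drift = lambda s, a: s + a + 1
    print(run_episode(0, n=1, horizon=2, step=drift)[0])
    print(run_episode(0, n=2, horizon=2, step=drift)[0])
```

In this toy setting, the run with n=2 ends farther from the target than the run with n=1, because the second action was chosen before the drift was observed. That is one simple way to picture the trade-off between predictability and performance that the abstract mentions.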

Organizer

NeurIPS 2022
