Geraud Nangue Tasse, Devon Jarvis, Steven James, Benjamin Rosman · Skill Machines: Temporal Logic Composition in Reinforcement Learning · SlidesLive


Skill Machines: Temporal Logic Composition in Reinforcement Learning

Dec 2, 2022

Speakers

Geraud Nangue Tasse

Devon Jarvis

Steven James

About

A major challenge in reinforcement learning is specifying tasks in a manner that is both interpretable and verifiable. One common approach is to specify tasks through reward machines: finite state machines that encode the task to be solved. We introduce skill machines, a representation that can be learned directly from these reward machines and that encodes the solution to such tasks. We propose a framework where an agent first learns a set of base skills in a reward-free setting, and then combines th…
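
To make the idea of a reward machine concrete, here is a minimal sketch in Python of a finite state machine whose transitions emit rewards. It is an illustration only: the class name `RewardMachine`, the state names, and the example task ("reach coffee, then reach office") are assumptions chosen for this sketch and are not taken from the talk. It does not implement skill machines.

```python
from dataclasses import dataclass, field


@dataclass
class RewardMachine:
    """Hypothetical sketch: a finite state machine whose transitions emit rewards."""
    initial: str
    terminal: set
    # Maps (machine state, proposition) -> (next machine state, reward).
    transitions: dict = field(default_factory=dict)

    def step(self, state, props):
        """Advance one step given the set of propositions currently true."""
        for p in sorted(props):
            if (state, p) in self.transitions:
                return self.transitions[(state, p)]
        # No matching transition: stay in the same state with zero reward.
        return state, 0.0

    def run(self, trace):
        """Feed a sequence of proposition sets; return (final state, total reward)."""
        state, total = self.initial, 0.0
        for props in trace:
            if state in self.terminal:
                break
            state, reward = self.step(state, props)
            total += reward
        return state, total


# Assumed example task: first observe "coffee", then observe "office".
rm = RewardMachine(
    initial="u0",
    terminal={"u_done"},
    transitions={
        ("u0", "coffee"): ("u1", 0.0),
        ("u1", "office"): ("u_done", 1.0),
    },
)
```

Calling `rm.run([set(), {"coffee"}, set(), {"office"}])` walks the machine from `u0` through `u1` to `u_done`. Visiting the office before the coffee leaves the machine short of its terminal state, which is how the temporal ordering of the task is encoded.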

Organizer

NeurIPS 2022
