Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Skill Machines: Temporal Logic Composition in Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Skill Machines: Temporal Logic Composition in Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Skill Machines: Temporal Logic Composition in Reinforcement Learning

            Dez 2, 2022

            Sprecher:innen

            GNT

            Geraud Nangue Tasse

            Sprecher:in · 0 Follower:innen

            DJ

            Devon Jarvis

            Sprecher:in · 0 Follower:innen

            SJ

            Steven James

            Sprecher:in · 0 Follower:innen

            Über

            A major challenge in reinforcement learning is specifying tasks in a manner that is both interpretable and verifiable. One common approach is to specify tasks through reward machines—finite state machines that encode the task to be solved. We introduce skill machines, a representation that can be learned directly from these reward machines that encode the solution to such tasks. We propose a framework where an agent first learns a set of base skills in a reward-free setting, and then combines th…

            Organisator

            N2
            N2

            NeurIPS 2022

            Konto · 962 Follower:innen

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Constraining Gaussian Processes to Systems of Linear Ordinary Differential Equations
            16:26

            Constraining Gaussian Processes to Systems of Linear Ordinary Differential Equations

            Andreas Besginow, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            GRACE-C: Generalized Rate Agnostic Causal Estimation via Constraints
            10:46

            GRACE-C: Generalized Rate Agnostic Causal Estimation via Constraints

            Mohammadsajad Abavisani, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Privacy-Preserving Group Fairness in Cross-Device Federated Learning
            03:03

            Privacy-Preserving Group Fairness in Cross-Device Federated Learning

            Sikha Pentyala, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            A Simple and Optimal Policy Design for Online Learning with Safety against Heavy-tailed Risk
            04:42

            A Simple and Optimal Policy Design for Online Learning with Safety against Heavy-tailed Risk

            David Simchi-Levi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Bayesian Clustering of Neural Spiking Activity Using a Mixture of Dynamic Poisson Factor Analyzers
            04:06

            Bayesian Clustering of Neural Spiking Activity Using a Mixture of Dynamic Poisson Factor Analyzers

            Ganchao Wei, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            A Policy-Guided Imitation Approach for Offline Reinforcement Learning
            04:57

            A Policy-Guided Imitation Approach for Offline Reinforcement Learning

            Haoran Xu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interessiert an Vorträgen wie diesem? NeurIPS 2022 folgen