Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: FlowPG: Action-constrained Policy Gradient with Normalizing Flows
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            FlowPG: Action-constrained Policy Gradient with Normalizing Flows
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            FlowPG: Action-constrained Policy Gradient with Normalizing Flows

            Dez 10, 2023

            Sprecher:innen

            JCB

            Janaka Chathuranga Brahmanage

            Sprecher:in · 0 Follower:innen

            JL

            Jiajing Ling

            Sprecher:in · 0 Follower:innen

            AK

            Akshat Kumar

            Sprecher:in · 0 Follower:innen

            Über

            Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-critical and resource-allocation related decision making problems. However, one of the major challenges in solving ACRL is to find valid actions that satisfy the constraints in each RL step. While adding a projection layer on top of the original policy network is a commonly used approach, it involves solving a mathematical program, either during training or in action execution, or both, which can result in…

            Organisator

            N2
            N2

            NeurIPS 2023

            Konto · 648 Follower:innen

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning
            02:49

            TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning

            Shuo Sun, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            The Tunnel Effect: Building Data Representations in Deep Neural Networks
            04:41

            The Tunnel Effect: Building Data Representations in Deep Neural Networks

            Wojciech Masarczyk, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Self-supervised Learning: Towards Rich Representations?
            32:52

            Self-supervised Learning: Towards Rich Representations?

            Abhinav Gupta

            N2
            N2
            NeurIPS 2023 16 months ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials
            03:13

            TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials

            Guillem Simeon, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Uncovering Meanings of Embeddings via Partial Orthogonality
            04:35

            Uncovering Meanings of Embeddings via Partial Orthogonality

            Yibo Jiang, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Segment-then-Classify: Few-shot Instance Segmentation for Environmental Remote Sensing
            04:31

            Segment-then-Classify: Few-shot Instance Segmentation for Environmental Remote Sensing

            Yang Hu, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interessiert an Vorträgen wie diesem? NeurIPS 2023 folgen