            Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes

            Dec 2, 2022

            Speakers

            Min Zhang

            Hongyao Tang

            Jianye Hao

            About

            At the heart of intelligent decision-making systems lies a fundamental problem: how a policy is represented and optimized. The root challenge is the large scale and high complexity of the policy space, which makes policy learning harder, especially in real-world scenarios. As a step towards a desirable surrogate policy space, policy representation in a low-dimensional latent space has recently shown its potential for improving both the evaluation and optimization of policy…

            Organizer

            NeurIPS 2022
