Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-004-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-004-alpha.b-cdn.net
      • sl-yoda-v2-stream-004-beta.b-cdn.net
      • 1685195716.rsc.cdn77.org
      • 1239898752.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning

            Dec 2, 2022

            Speakers

            ZW

            Zhendong Wang

            Speaker · 0 followers

            JJH

            Jonathan J. Hunt

            Speaker · 0 followers

            MZ

            Mingyuan Zhou

            Speaker · 0 followers

            About

            Offline reinforcement learning (RL), which aims to learn an optimal policy using a previously collected static dataset, is an important paradigm of RL. Standard RL methods often perform poorly in this regime due to the function approximation errors on out-of-distribution actions. While a variety of regularization methods have been proposed to mitigate this issue, they are often constrained by policy classes with limited expressiveness that can lead to highly suboptimal solutions. In this paper,…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            A Data-efficient Multiobjective Machine Learning Method For 3D-printed Architected Materials Design
            07:47

            A Data-efficient Multiobjective Machine Learning Method For 3D-printed Architected Materials Design

            Ye Wei, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Conformal Prediction in 2022
            56:41

            Conformal Prediction in 2022

            Emmanuel Candés

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 3 viewers voted for saving the presentation to eternal vault which is 0.3%

            RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
            01:01

            RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

            Jian Wang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction
            05:16

            APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction

            Bencheng Yan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Domain Generalization without Excess Empirical Risk
            05:11

            Domain Generalization without Excess Empirical Risk

            Ozan Sener, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Semi-analytical Industrial Cooling System Model for Reinforcement Learning
            02:59

            Semi-analytical Industrial Cooling System Model for Reinforcement Learning

            Yuri Chervonyi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022