Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization

            Dec 2, 2022

            Speakers

            PB

            Payal Bawa

            Speaker · 0 followers

            RO

            Rafael Oliveira

            Speaker · 0 followers

            FR

            Fabio Ramos

            Speaker · 0 followers

            About

            Off-policy deep reinforcement learning algorithms like Soft Actor Critic (SAC) have achieved state-of-the-art results in several high dimensional continuous control tasks. Despite their success, they are prone to instability due to the deadly triad of off-policy training, function approximation, and bootstrapping. Unstable training of off-policy algorithms leads to sample inefficient and sub-optimal asymptotic performance, thus preventing their real-world deployment. To mitigate these issues, pr…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Beyond Not-Forgetting: Continual Learning with Backward Knowledge Transfer
            04:52

            Beyond Not-Forgetting: Continual Learning with Backward Knowledge Transfer

            Sen Lin, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Bias Amplification in Image Classification
            08:07

            Bias Amplification in Image Classification

            Melissa Hall, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Self-supervised Heterogeneous Graph Pre-training Based on Structural Clustering
            03:57

            Self-supervised Heterogeneous Graph Pre-training Based on Structural Clustering

            Yaming Yang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Cost-Sensitive Self-Training for Optimizing Non-Decomposable Metrics
            04:53

            Cost-Sensitive Self-Training for Optimizing Non-Decomposable Metrics

            Harsh Rangwani, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            The Phenomenon of Policy Churn
            05:09

            The Phenomenon of Policy Churn

            Tom Schaul, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Effectiveness of Vision Transformer for Fast and Accurate Single-Stage Pedestrian Detector
            03:53

            Effectiveness of Vision Transformer for Fast and Accurate Single-Stage Pedestrian Detector

            Jing Yuan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022