Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Truly Deterministic Policy Optimization
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Truly Deterministic Policy Optimization
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Truly Deterministic Policy Optimization

            Nov 28, 2022

            Speakers

            ES

            Ehsan Saleh

            Speaker · 0 followers

            SG

            Saba Ghaffari

            Speaker · 0 followers

            TB

            Timothy Bretl

            Speaker · 0 followers

            About

            In this paper, we present a policy gradient method that avoids exploratory noise injection and performs policy search over the deterministic landscape, with the goal of improving learning with long horizons and non-local rewards. By avoiding noise injection all sources of estimation variance can be eliminated in systems with deterministic dynamics (up to the initial state distribution). Since deterministic policy regularization is impossible using traditional non-metric measures such as the KL …

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 952 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Online Nonnegative CP-dictionary Learning for Markovian Data
            05:04

            Online Nonnegative CP-dictionary Learning for Markovian Data

            Hanbaek Lyu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Membership Inference Attacks via Adversarial Examples
            09:26

            Membership Inference Attacks via Adversarial Examples

            Hamid Jalalzai, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Free Probability for predicting the performance of feed-forward fully connected neural networks
            04:57

            Free Probability for predicting the performance of feed-forward fully connected neural networks

            Reda Chhaibi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Are GAN Biased? Evaluating GAN-Generated Facial Images via Crowdsourcing
            06:10

            Are GAN Biased? Evaluating GAN-Generated Facial Images via Crowdsourcing

            Hangzhi Guo, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            a-ReQ: Assessing Representation Quality in Self-Supervised Learning by measuring eigenspectrum decay
            05:40

            a-ReQ: Assessing Representation Quality in Self-Supervised Learning by measuring eigenspectrum decay

            Kumar Krishna Agrawal, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            SageMix: Saliency-Guided Mixup for Point Clouds
            04:50

            SageMix: Saliency-Guided Mixup for Point Clouds

            Sanghyeok Lee, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022