Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-002-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-002-alpha.b-cdn.net
      • sl-yoda-v2-stream-002-beta.b-cdn.net
      • 1001562353.rsc.cdn77.org
      • 1075090661.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient

            Jul 24, 2023

            Speakers

            KW

            Kaixin Wang

            Speaker · 0 followers

            DZ

            Daquan Zhou

            Speaker · 0 followers

            JF

            Jiashi Feng

            Speaker · 0 followers

            About

            In model-free reinforcement learning, recent methods based on a phasic policy gradient (PPG) framework have shown impressive improvements in sample efficiency and zero-shot generalization on the challenging Procgen benchmark. In PPG, two design choices are believed to be the key contributing factors to its superior performance over PPO: the high level of value sample reuse and the low frequency of feature distillation. However, through an extensive empirical study, we unveil that policy regulari…

            Organizer

            I2
            I2

            ICML 2023

            Account · 657 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Trapdoor Normalization with Irreversible Ownership Verification
            05:14

            Trapdoor Normalization with Irreversible Ownership Verification

            Hanwen Liu, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees
            05:35

            Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees

            Anastasiia Koloskova, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization
            05:23

            FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization

            Zhen Wang, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Learning Mixtures of Markov Chains and MDPs
            08:21

            Learning Mixtures of Markov Chains and MDPs

            Chinmaya Kausik, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Decoding Layer Saliency in Transformers
            04:36

            Decoding Layer Saliency in Transformers

            Elizabeth M. Hou, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Optimal LP Rounding and Linear-Time Approximation Algorithms for Clustering Edge-Colored Hypergraphs
            05:22

            Optimal LP Rounding and Linear-Time Approximation Algorithms for Clustering Edge-Colored Hypergraphs

            Nate Veldt

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023