Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Tactical Optimism and Pessimism for Deep Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-006-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-006-alpha.b-cdn.net
      • sl-yoda-v2-stream-006-beta.b-cdn.net
      • 1549480416.rsc.cdn77.org
      • 1102696603.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Tactical Optimism and Pessimism for Deep Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Tactical Optimism and Pessimism for Deep Reinforcement Learning

            Dec 6, 2021

            Speakers

            TM

            Ted Moskovitz

            Speaker · 0 followers

            JP

            Jack Parker-Holder

            Speaker · 1 follower

            AP

            Aldo Pacchiano

            Speaker · 0 followers

            About

            In recent years, deep off-policy actor-critic algorithms have become a dominant approach to reinforcement learning for continuous control. One of the primary drivers of this improved performance is the use of pessimistic value updates to address function approximation errors, which previously led to disappointing performance. However, a direct consequence of pessimism is reduced exploration, running counter to theoretical support for the efficacy of optimism in the face of uncertainty. So which…

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Panel Discussion
            37:59

            Panel Discussion

            Elias Bareinboim, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Improved Regret Bounds for Tracking Experts with Memory
            14:06

            Improved Regret Bounds for Tracking Experts with Memory

            James Robinson, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            f-Mutual Information Contrastive Learning
            10:54

            f-Mutual Information Contrastive Learning

            Guojun Zhang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
            05:07

            Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

            Litian Liang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose
            14:55

            Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

            Angtian Wang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Do Wider Neural Networks Really Help Adversarial Robustness?
            12:23

            Do Wider Neural Networks Really Help Adversarial Robustness?

            Boxi Wu, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2021