A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning

            Dec 2, 2022

            Speakers

            Benjamin Eysenbach

            Matthieu Geist

            Sergey Levine

            About

            As with any machine learning problem with limited data, effective offline RL algorithms require careful regularization to avoid overfitting. One-step methods perform regularization by doing just a single step of policy improvement, while critic regularization methods do many steps of policy improvement with a regularized objective. These methods appear distinct. One-step methods, such as advantage-weighted regression and conditional behavioral cloning, are simple and stable. Critic regularizatio…
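
            To make the idea of "a single step of policy improvement" concrete, here is a minimal sketch on a toy tabular MDP. It is an illustration under stated assumptions, not the speakers' implementation: the function names (`evaluate_behavior_q`, `one_step_improve`), the two-state MDP, the `temperature` parameter, and the use of a known transition model in place of an offline dataset are all hypothetical choices made for this example. The improvement step uses exponentiated advantage weights, in the spirit of advantage-weighted regression.

```python
import math


def evaluate_behavior_q(P, R, beta, gamma=0.9, iters=500):
    """Iterative policy evaluation of Q^beta on a tabular MDP.

    P[s][a] is a list of next-state probabilities, R[s][a] is a reward,
    and beta[s][a] is the behavior policy's probability of action a in s.
    """
    n_s, n_a = len(R), len(R[0])
    Q = [[0.0] * n_a for _ in range(n_s)]
    for _ in range(iters):
        # State values under the behavior policy.
        V = [sum(beta[s][a] * Q[s][a] for a in range(n_a)) for s in range(n_s)]
        # Bellman backup for the behavior policy (no max over actions).
        Q = [[R[s][a] + gamma * sum(P[s][a][t] * V[t] for t in range(n_s))
              for a in range(n_a)] for s in range(n_s)]
    return Q


def one_step_improve(beta, Q, temperature=1.0):
    """One improvement step: pi(a|s) proportional to beta(a|s) * exp(A(s,a) / temperature)."""
    pi = []
    for s in range(len(Q)):
        v = sum(b * q for b, q in zip(beta[s], Q[s]))
        weights = [b * math.exp((q - v) / temperature)
                   for b, q in zip(beta[s], Q[s])]
        total = sum(weights)
        pi.append([w / total for w in weights])
    return pi


# Hypothetical toy MDP: action 0 leads to state 0, action 1 leads to state 1,
# and action 1 always pays reward 1.
P = [[[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]]]
R = [[0.0, 1.0], [0.0, 1.0]]
beta = [[0.5, 0.5], [0.5, 0.5]]  # uniform behavior policy

Q = evaluate_behavior_q(P, R, beta)  # critic of the behavior policy only
pi = one_step_improve(beta, Q)       # improved once, then stop
```

            The critic here is fit to the behavior policy and never re-estimated for the improved policy, which is what limits the procedure to one step. A critic-regularized method would instead alternate evaluation and improvement many times while penalizing the critic's values; that variant is not shown.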

            Organizer

            NeurIPS 2022
