            Fine-tuning Offline Policies with Optimistic Action Selection

            Dec 2, 2022

            Speakers

            Max Sobol Mark

            Ali Ghadirzadeh

            Xi Chen

            About

            Offline reinforcement learning algorithms can train performant policies for hard tasks using previously collected datasets. However, the quality of the offline dataset often limits the achievable performance. We consider the problem of improving offline policies through online fine-tuning. Offline RL requires a pessimistic training objective to mitigate distributional shift between the trained policy and the offline behavior policy, which will make the trained policy averse to picking no…

            Organizer

            NeurIPS 2022
