            Optimistic Posterior Sampling for Model-based RL

            Nov 28, 2022

            Speakers

            Alekh Agarwal

            Tong Zhang

            About

            We propose a general framework for designing posterior sampling methods for model-based RL. We show that the proposed algorithms can be analyzed by reducing regret to Hellinger-distance-based conditional probability estimation. We further show that optimistic posterior sampling can control this Hellinger distance when we measure model error via data likelihood. This technique allows us to design and analyze unified posterior sampling algorithms with state-of-the-art sample complexity guarantees for…
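
To make the two quantities named in the abstract concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the algorithm from the talk: the finite model class, the function names `squared_hellinger` and `optimistic_posterior`, the optimism weight `gamma`, and the exact form of the reweighting are all hypothetical choices made for this example. The Hellinger distance uses the convention with a factor of 1/2, so values lie in [0, 1]; some texts omit that factor.

```python
import math


def squared_hellinger(p, q):
    """Squared Hellinger distance between two discrete distributions.

    Convention: H^2(p, q) = 0.5 * sum_i (sqrt(p_i) - sqrt(q_i))^2,
    which lies in [0, 1]. Some references drop the factor 0.5.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same support size")
    return 0.5 * sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q))


def optimistic_posterior(prior, log_likelihoods, values, gamma=1.0):
    """Toy posterior over a finite model class with an optimism bonus.

    Assumed (hypothetical) form for illustration only:
        weight_i  proportional to  prior_i * exp(log_lik_i + gamma * value_i)
    where log_lik_i is the data log-likelihood under model i and value_i
    is the value that model i predicts. Prior entries must be positive.
    """
    scores = [
        math.log(w) + ll + gamma * v
        for w, ll, v in zip(prior, log_likelihoods, values)
    ]
    top = max(scores)  # subtract the max for numerical stability
    unnormalized = [math.exp(s - top) for s in scores]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


if __name__ == "__main__":
    # Two candidate next-state distributions over three states.
    print(squared_hellinger([0.7, 0.2, 0.1], [0.6, 0.3, 0.1]))
    # Two models that fit the data equally well; the second predicts higher value.
    print(optimistic_posterior([0.5, 0.5], [-3.0, -3.0], [0.2, 0.9], gamma=2.0))
```

With `gamma=0` the second function reduces to an ordinary likelihood-weighted posterior; a positive `gamma` shifts mass toward models that predict higher value, which is the sense of "optimistic" used in this sketch.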

            Organizer

            NeurIPS 2022
