Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-012-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-012-alpha.b-cdn.net
      • sl-yoda-v3-stream-012-beta.b-cdn.net
      • 1338956956.rsc.cdn77.org
      • 1656830687.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

            Dec 6, 2021

            Speakers

            ZZ

            Zihan Zhang

            Speaker · 0 followers

            JY

            Jia-Qi Yang

            Speaker · 0 followers

            XJ

            Xiangyang Ji

            Speaker · 0 followers

            About

            This paper presents new variance-aware confidence sets for linear bandits and linear mixture Markov Decision Processes (MDPs).With the new confidence sets, we obtain the follow regret bounds:For linear bandits, we obtain an O(poly(d)√(1 + ∑_k=1^Kσ_k^2)) data-dependent regret bound, where d is the feature dimension, K is the number of rounds, and σ_k^2 is the unknown variance of the reward at the k-th round. This is the first regret bound that only scales with the variance and the dimension but …

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Role of recurrent computations in primate visual object recognition
            17:31

            Role of recurrent computations in primate visual object recognition

            Kohitij Kar

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Self-Supervised Learning Disentangled Group Representation as Feature
            16:04

            Self-Supervised Learning Disentangled Group Representation as Feature

            Tan Wang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Contextual Similarity Aggregation with Self-attention for Visual Re-ranking
            06:34

            Contextual Similarity Aggregation with Self-attention for Visual Re-ranking

            Jianbo Ouyang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            A First Look Towards One-Shot Object Detection with SPOT for Data-Efficient Learning
            02:08

            A First Look Towards One-Shot Object Detection with SPOT for Data-Efficient Learning

            Ria Chakraborty, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Proxy-Normalizing Activations to Match Batch Normalization while Removing Batch Dependence
            11:29

            Proxy-Normalizing Activations to Match Batch Normalization while Removing Batch Dependence

            Antoine Labatie, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            When False Positive is Intolerant: End-to-End  Optimization with Low FPR for Multipartite Ranking
            07:00

            When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking

            Peisong Wen, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2021