Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Regime Switching Bandits
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-013-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-013-alpha.b-cdn.net
      • sl-yoda-v3-stream-013-beta.b-cdn.net
      • 1668715672.rsc.cdn77.org
      • 1420896597.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Regime Switching Bandits
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Regime Switching Bandits

            Dec 6, 2021

            Speakers

            YX

            Yi Xiong

            Speaker · 0 followers

            XZ

            Xiang Zhou

            Speaker · 0 followers

            NC

            Ningyuan Chen

            Speaker · 0 followers

            About

            We study a multi-armed bandit problem where the rewards exhibit regime switching. Specifically, the distributions of the random rewards generated from all arms are modulated by a common underlying state modeled as a finite-state Markov chain. The agent does not observe the underlying state and has to learn the transition matrix and the reward distributions. We propose a learning algorithm for this problem, building on spectral method-of-moments estimations for hidden Markov models, belief error…

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Detecting Anomalous Event Sequences with Temporal Point Processes
            11:34

            Detecting Anomalous Event Sequences with Temporal Point Processes

            Oleksandr Shchur, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Discussion Panel and QA Session
            1:13:26

            Discussion Panel and QA Session

            Sebastian Gehrmann, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Nested Variational Inference
            07:46

            Nested Variational Inference

            Heiko Zimmermann, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Architecture Personalization in Resource-constrained Federated Learning
            12:46

            Architecture Personalization in Resource-constrained Federated Learning

            Mi Luo, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Charting and Navigating the Space of Solutions for Recurrent Neural Networks
            12:41

            Charting and Navigating the Space of Solutions for Recurrent Neural Networks

            Elia Turner, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
            05:17

            Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

            Todor Davchev, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2021