            The Best of Both Worlds: Stochastic and Adversarial Episodic MDPs with Unknown Transition

            Dec 6, 2021

            Speakers

            Tiancheng Jin

            Longbo Huang

            Haipeng Luo

            About

            We consider the best-of-both-worlds problem for learning an episodic Markov Decision Process through T episodes, with the goal of achieving 𝒪(√T) regret when the losses are adversarial and simultaneously 𝒪(log T) regret when the losses are (almost) stochastic. Recent work by [Jin and Luo, 2020] achieves this goal when the fixed transition is known, and leaves the case of unknown transition as a major open question. In this work, we resolve this open problem by using the same Follow-the-Regul…

            Organizer

            NeurIPS 2021

            Categories

            AI & Data Science

            Mathematics

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
