Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-006-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-006-alpha.b-cdn.net
      • sl-yoda-v2-stream-006-beta.b-cdn.net
      • 1549480416.rsc.cdn77.org
      • 1102696603.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Near-Optimal Randomized Exploration for Tabular Markov Decision Processes

            Nov 28, 2022

            Speakers

            ZX

            Zhihan Xiong

            Speaker · 0 followers

            RS

            Ruoqi Shen

            Speaker · 0 followers

            QC

            Qiwen Cui

            Speaker · 0 followers

            About

            We study algorithms using randomized value functions for exploration in reinforcement learning. This type of algorithms enjoys appealing empirical performance. We show that when we use 1) a single random seed in each episode, and 2) a Bernstein-type magnitude of noise, we obtain a worst-case O(H√(SAT)) regret bound for episodic time-inhomogeneous Markov Decision Process where S is the size of state space, A is the size of action space, H is the planning horizon and T is the number of interaction…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 953 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Score-based Generative Models and Their Applications
            29:07

            Score-based Generative Models and Their Applications

            Chenlin Meng

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Cross-Linked Unified Embedding for cross-modality representation learning
            04:43

            Cross-Linked Unified Embedding for cross-modality representation learning

            Xinming Tu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Maximum a posteriori natural scene reconstruction from retinal ganglion cells with deep denoiser priors
            04:55

            Maximum a posteriori natural scene reconstruction from retinal ganglion cells with deep denoiser priors

            Eric G. Wu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            HyperMiner: Topic Taxonomy Mining with Hyperbolic Embedding
            01:05

            HyperMiner: Topic Taxonomy Mining with Hyperbolic Embedding

            Yishi Xu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Approximate Value Equivalence
            05:02

            Approximate Value Equivalence

            Christopher Grimm, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Leveraging Maths to Understand Transformers
            30:32

            Leveraging Maths to Understand Transformers

            François Charton

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022