Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-013-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-013-alpha.b-cdn.net
      • sl-yoda-v3-stream-013-beta.b-cdn.net
      • 1668715672.rsc.cdn77.org
      • 1420896597.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation

            Dec 6, 2021

            Speakers

            YW

            Yue Wang

            Speaker · 1 follower

            SZ

            Shaofeng Zou

            Speaker · 0 followers

            YZ

            Yi Zhou

            Speaker · 0 followers

            About

            Temporal-difference learning with gradient correction (TDC) is a two time-scale algorithm for policy evaluation in reinforcement learning. This algorithm was initially proposed with linear function approximation, and was later extended to the one with general smooth function approximation. The asymptotic convergence for the on-policy setting with general smooth function approximation was established in [Bhatnagar et al., 2009], however, the non-asymptotic convergence analysis remains unsolved du…

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Gamifying Math Education using Object Detection
            04:45

            Gamifying Math Education using Object Detection

            Yueqiu Sun, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Diversity is All You Need to Improve Bayesian Model Averaging
            06:31

            Diversity is All You Need to Improve Bayesian Model Averaging

            Yashvir Singh Grewal, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Contrastive Learning of Global-Local Video Representations
            15:47

            Contrastive Learning of Global-Local Video Representations

            Shuang Ma, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Computer-Aided Design as Language
            15:08

            Computer-Aided Design as Language

            Yaroslav Ganin, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time Wildland Fire Smoke Detection
            05:05

            FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time Wildland Fire Smoke Detection

            Anshuman Dewangan, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery
            14:02

            Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery

            Liwei Jiang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            Interested in talks like this? Follow NeurIPS 2021