Yue Wang, Shaofeng Zou, Yi Zhou · Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation · SlidesLive


Dec 6, 2021

Speakers

Yue Wang

Shaofeng Zou

Yi Zhou

About

Temporal-difference learning with gradient correction (TDC) is a two time-scale algorithm for policy evaluation in reinforcement learning. The algorithm was initially proposed with linear function approximation and was later extended to general smooth function approximation. The asymptotic convergence for the on-policy setting with general smooth function approximation was established in [Bhatnagar et al., 2009]; however, the non-asymptotic convergence analysis remains unsolved du…
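
For readers unfamiliar with the two time-scale structure mentioned in the abstract, here is a minimal sketch of one TDC step in the linear special case only. It is not the general smooth function approximation variant analyzed in the talk, and it is not taken from the authors' code. The function name `tdc_update`, the feature vectors, and the step sizes `alpha` (slow, main parameters) and `beta` (fast, auxiliary parameters) are illustrative assumptions.

```python
from typing import List, Tuple

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def tdc_update(
    theta: Vector,      # main value-function weights (slow time-scale)
    w: Vector,          # auxiliary weights (fast time-scale)
    phi: Vector,        # features of the current state
    reward: float,
    phi_next: Vector,   # features of the next state
    gamma: float,       # discount factor
    alpha: float,       # slow step size (assumed smaller than beta)
    beta: float,        # fast step size
) -> Tuple[Vector, Vector]:
    """One on-policy linear TDC step; returns updated (theta, w)."""
    # TD error under the current linear value estimate.
    delta = reward + gamma * dot(theta, phi_next) - dot(theta, phi)
    # Auxiliary estimate used in the gradient-correction term.
    correction = dot(phi, w)
    # Slow update: TD step minus the gradient-correction term.
    new_theta = [
        t + alpha * (delta * f - gamma * correction * f_next)
        for t, f, f_next in zip(theta, phi, phi_next)
    ]
    # Fast update: w tracks the TD error projected onto the features.
    new_w = [wi + beta * (delta - correction) * f for wi, f in zip(w, phi)]
    return new_theta, new_w


if __name__ == "__main__":
    theta, w = [0.0, 0.0], [0.0, 0.0]
    # Hypothetical single transition with one-hot features.
    for _ in range(2):
        theta, w = tdc_update(theta, w, [1.0, 0.0], 1.0, [0.0, 1.0], 0.9, 0.1, 0.5)
    print(theta, w)
```

Both updates use the values of `theta` and `w` from before the step. Choosing `beta` larger than `alpha` is what puts the auxiliary weights on the faster time-scale.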

Organizer

NeurIPS 2021

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
