Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gutpa, Adam White, Martha White · Gradient Temporal-Difference Learning with Regularized Corrections · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Gradient Temporal-Difference Learning with Regularized Corrections

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-014-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-014-alpha.b-cdn.net
sl-yoda-v3-stream-014-beta.b-cdn.net
1978117156.rsc.cdn77.org
1243944885.rsc.cdn77.org

Subtitles
Off
en

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Gradient Temporal-Difference Learning with Regularized Corrections

Gradient Temporal-Difference Learning with Regularized Corrections

Jul 12, 2020

Sprecher:innen

Sina Ghiassian

Sprecher:in · 0 Follower:innen

Andrew Patterson

Sprecher:in · 0 Follower:innen

Shivam Garg

Sprecher:in · 0 Follower:innen

Über

Value function learning remains a critical component of many reinforcement learning systems. Many algorithms are based on temporal difference (TD) updates, which have well-documented divergence issues, even though potentially sound alternatives exist like Gradient TD. Unsound approaches like Q-learning and TD remain popular because divergence seems rare in practice and these algorithms typically perform well. However, recent work with large neural network learning systems reveals that instabilit…

Organisator

ICML 2020

Konto · 2,7k Follower:innen

Kategorien

KI und Datenwissenschaft

Kategorie · 10,8k Präsentationen

Über ICML 2020

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Opening Remarks

20:05

Opening Remarks

Später ansehen

Favorit

Petar Veličković, …

ICML 2020 5 years ago

A Benchmark of Medical Out of Distribution Detection

08:10

A Benchmark of Medical Out of Distribution Detection

Später ansehen

Favorit

Tianshi Cao, …

ICML 2020 5 years ago

Fast and Private Submodular and k-Submodular Functions Maximization with Matroid Constraints

15:00

Fast and Private Submodular and k-Submodular Functions Maximization with Matroid Constraints

Später ansehen

Favorit

Akbar Rafiey, …

ICML 2020 5 years ago

Invited Talk 3 - Q&A

Invited Talk 3 - Q&A

Später ansehen

Favorit

Sungjin Ahn, …

ICML 2020 5 years ago

Adversarial Mutual Information for Text Generation

12:37

Adversarial Mutual Information for Text Generation

Später ansehen

Favorit

Boyuan Pan, …

ICML 2020 5 years ago

On-Device Machine Learning with Apple

15:42

On-Device Machine Learning with Apple

Später ansehen

Favorit

ICML 2020 5 years ago