Che Wang, Yanqiu Wu, Quan Vuong, Keith Ross · Striving for simplicity and performance in off-policy DRL: Output Normalization and Non-Uniform Sampling · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Striving for simplicity and performance in off-policy DRL: Output Normalization and Non-Uniform Sampling

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-013-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-013-alpha.b-cdn.net
sl-yoda-v3-stream-013-beta.b-cdn.net
1668715672.rsc.cdn77.org
1420896597.rsc.cdn77.org

Subtitles
Off
en

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Striving for simplicity and performance in off-policy DRL: Output Normalization and Non-Uniform Sampling

Striving for simplicity and performance in off-policy DRL: Output Normalization and Non-Uniform Sampling

Jul 12, 2020

Sprecher:innen

Che Wang

Sprecher:in · 0 Follower:innen

Yanqiu Wu

Sprecher:in · 0 Follower:innen

Quan Vuong

Sprecher:in · 0 Follower:innen

Über

We aim to develop off-policy DRL algorithms that not only exceed state-of-the-art performance but are also simple and minimalistic. For standard continuous control benchmarks, Soft Actor Critic (SAC), which employs entropy maximization, currently provides state-of-the-art performance. We first demonstrate that the entropy term in SAC addresses action saturation due to the bounded nature of the action spaces. With this insight, we propose a streamlined algorithm with a simple normalization scheme…

Organisator

ICML 2020

Konto · 2,7k Follower:innen

Kategorien

Software und Programmierung

Kategorie · 1k Präsentationen

KI und Datenwissenschaft

Kategorie · 10,8k Präsentationen

Über ICML 2020

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Rapid policy updating in human physical construction

05:18

Rapid policy updating in human physical construction

Später ansehen

Favorit

Will McCarthy, …

ICML 2020 5 years ago

Adversarial Nonnegative Matrix Factorization

10:36

Adversarial Nonnegative Matrix Factorization

Später ansehen

Favorit

ICML 2020 5 years ago

Implicit Learning Dynamics in Stackelberg Games: Equilibria Characterization, Convergence Analysis, and Empirical Study

15:13

Implicit Learning Dynamics in Stackelberg Games: Equilibria Characterization, Convergence Analysis, and Empirical Study

Später ansehen

Favorit

Tanner Fiez, …

ICML 2020 5 years ago

Graphical Models based Solutions for Missing Data Problems

29:18

Graphical Models based Solutions for Missing Data Problems

Später ansehen

Favorit

ICML 2020 5 years ago

Poster #59

Später ansehen

Favorit

ICML 2020 5 years ago

Black-Box Methods for Restoring Monotonicity

15:40

Black-Box Methods for Restoring Monotonicity

Später ansehen

Favorit

Evangelia Gergatsouli, …

ICML 2020 5 years ago