Yanqiu Wu, Xinyue Chen, Che Wang, Yiming Zhang, Keith Ross · Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance

Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance

Dec 2, 2022

Speakers

Yanqiu Wu

Xinyue Chen

Che Wang

About

Recent advances in model-free deep reinforcement learning (DRL) show that simple model-free methods can be highly effective in challenging high-dimensional continuous control tasks. In particular, Truncated Quantile Critics (TQC) achieves state-of-the-art asymptotic training performance on the MuJoCo benchmark with a distributional representation of critics; and Randomized Ensemble Double Q-Learning (REDQ) achieves high sample efficiency that is competitive with state-of-the-art model-based meth…

Organizer

NeurIPS 2022
