Baturay Saglam, Furkan B. Mutlu, Doğan Can Çiçek, Suleyman Kozat · Actor Prioritized Experience Replay · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Actor Prioritized Experience Replay

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-007-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-007-alpha.b-cdn.net
sl-yoda-v2-stream-007-beta.b-cdn.net
1678031076.rsc.cdn77.org
1932936657.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Actor Prioritized Experience Replay

Actor Prioritized Experience Replay

Dec 2, 2022

Speakers

Baturay Saglam

Speaker · 0 followers

Furkan B. Mutlu

Speaker · 0 followers

Doğan Can Çiçek

Speaker · 0 followers

About

A widely-studied deep reinforcement learning (RL) technique known as Prioritized Experience Replay (PER) allows agents to learn from transitions sampled with non-uniform probability proportional to their temporal-difference (TD) error. Although it has been shown that PER is one of the most crucial components for the overall performance of deep RL methods in discrete action domains, many empirical studies indicate that it considerably underperforms actor-critic algorithms in continuous control. W…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Run-Time Monitoring for Safe Robot Autonomy

29:33

Run-Time Monitoring for Safe Robot Autonomy

Watch later

Favorite

NeurIPS 2022 2 years ago

Urban Heat Island Detection and Causal Inference Using Convolutional Neural Networks

04:09

Urban Heat Island Detection and Causal Inference Using Convolutional Neural Networks

Watch later

Favorite

Zach Calhoun, …

NeurIPS 2022 2 years ago

SALSA: Attacking Lattice Cryptography with Transformers

04:59

SALSA: Attacking Lattice Cryptography with Transformers

Watch later

Favorite

Emily Wenger, …

NeurIPS 2022 2 years ago

Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence

04:30

Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence

Watch later

Favorite

NeurIPS 2022 2 years ago

De Novo Protein Design

33:09

De Novo Protein Design

Watch later

Favorite

NeurIPS 2022 2 years ago

Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games

04:19

Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games

Watch later

Favorite

Brian Hu Zhang, …

NeurIPS 2022 2 years ago