Michal Nauman, Marek Cygan · On All-Action Policy Gradients · SlidesLive

On All-Action Policy Gradients

Dec 2, 2022

Speakers

Michal Nauman

Marek Cygan

About

In this paper, we analyze the variance of the stochastic policy gradient (SPG) with many action samples per state (all-action SPG). We decompose the variance of SPG and derive an optimality condition for all-action SPG. The optimality condition shows when all-action SPG should be preferred over its single-action counterpart and makes it possible to determine a variance-minimizing sampling scheme for SPG estimation. Furthermore, we propose the dynamics-all-action (DAA) module, an augmentation that allows for all-action sampli…
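To make the variance comparison in the abstract concrete, the following is a minimal toy sketch, not the paper's method or its DAA module. It assumes a single state, a softmax policy over four actions, and exactly known action values; the names `spg_estimate`, `exact_gradient`, and `total_variance`, as well as the numbers used, are hypothetical choices for illustration. It compares the empirical variance of a score-function gradient estimate built from one sampled action with one built from eight sampled actions in the same state.

```python
import numpy as np


def softmax(logits):
    # Numerically stable softmax over a 1-D array of logits.
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()


def spg_estimate(logits, q_values, n_actions_sampled, rng):
    """Score-function gradient estimate w.r.t. the logits,
    averaged over `n_actions_sampled` actions drawn from the policy."""
    probs = softmax(logits)
    actions = rng.choice(len(probs), size=n_actions_sampled, p=probs)
    # For a softmax policy, grad of log pi(a) w.r.t. logits is one_hot(a) - probs.
    scores = np.eye(len(probs))[actions] - probs
    return (scores * q_values[actions][:, None]).mean(axis=0)


def exact_gradient(logits, q_values):
    # Closed form of E_a[grad log pi(a) * Q(a)] for a softmax policy.
    probs = softmax(logits)
    return probs * (q_values - probs @ q_values)


def total_variance(logits, q_values, n_actions_sampled, n_trials=5000, seed=0):
    # Empirical variance (summed over gradient components) and empirical mean.
    rng = np.random.default_rng(seed)
    samples = np.stack(
        [spg_estimate(logits, q_values, n_actions_sampled, rng) for _ in range(n_trials)]
    )
    return samples.var(axis=0).sum(), samples.mean(axis=0)


# Assumed toy values, chosen only for illustration.
logits = np.array([0.5, -0.2, 0.1, 0.0])
q_values = np.array([1.0, 0.0, 2.0, -1.0])

var_single, mean_single = total_variance(logits, q_values, n_actions_sampled=1)
var_multi, mean_multi = total_variance(logits, q_values, n_actions_sampled=8)

print("single-action variance:", var_single)
print("eight-action variance: ", var_multi)
```

In this toy setting the extra action samples are free and the action values are exact, so averaging more actions can only reduce variance. The trade-off the abstract refers to arises when action values must be estimated and the sampling budget is shared between states and actions, which this sketch does not model.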

Organizer

NeurIPS 2022
