Ehsan Saleh, Saba Ghaffari, Timothy Bretl, Matthew West · Truly Deterministic Policy Optimization · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Truly Deterministic Policy Optimization

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Truly Deterministic Policy Optimization

Truly Deterministic Policy Optimization

Nov 28, 2022

Speakers

Ehsan Saleh

Speaker · 0 followers

Saba Ghaffari

Speaker · 0 followers

Timothy Bretl

Speaker · 0 followers

About

In this paper, we present a policy gradient method that avoids exploratory noise injection and performs policy search over the deterministic landscape, with the goal of improving learning with long horizons and non-local rewards. By avoiding noise injection all sources of estimation variance can be eliminated in systems with deterministic dynamics (up to the initial state distribution). Since deterministic policy regularization is impossible using traditional non-metric measures such as the KL …

Organizer

NeurIPS 2022

Account · 952 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Online Nonnegative CP-dictionary Learning for Markovian Data

05:04

Online Nonnegative CP-dictionary Learning for Markovian Data

Watch later

Favorite

Hanbaek Lyu, …

NeurIPS 2022 2 years ago

Membership Inference Attacks via Adversarial Examples

09:26

Membership Inference Attacks via Adversarial Examples

Watch later

Favorite

Hamid Jalalzai, …

NeurIPS 2022 2 years ago

Free Probability for predicting the performance of feed-forward fully connected neural networks

04:57

Free Probability for predicting the performance of feed-forward fully connected neural networks

Watch later

Favorite

Reda Chhaibi, …

NeurIPS 2022 2 years ago

Are GAN Biased? Evaluating GAN-Generated Facial Images via Crowdsourcing

06:10

Are GAN Biased? Evaluating GAN-Generated Facial Images via Crowdsourcing

Watch later

Favorite

Hangzhi Guo, …

NeurIPS 2022 2 years ago

a-ReQ: Assessing Representation Quality in Self-Supervised Learning by measuring eigenspectrum decay

05:40

a-ReQ: Assessing Representation Quality in Self-Supervised Learning by measuring eigenspectrum decay

Watch later

Favorite

Kumar Krishna Agrawal, …

NeurIPS 2022 2 years ago

SageMix: Saliency-Guided Mixup for Point Clouds

04:50

SageMix: Saliency-Guided Mixup for Point Clouds

Watch later

Favorite

Sanghyeok Lee, …

NeurIPS 2022 2 years ago