Benjamin Eysenbach, Matthieu Geist, Sergey Levine, Ruslan Salakhutinov · A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-008-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-008-alpha.b-cdn.net
sl-yoda-v2-stream-008-beta.b-cdn.net
1159783934.rsc.cdn77.org
1511376917.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning

A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning

Dec 2, 2022

Speakers

Benjamin Eysenbach

Sprecher:in · 0 Follower:innen

Matthieu Geist

Sprecher:in · 0 Follower:innen

Sergey Levine

Sprecher:in · 1 Follower:in

About

As with any machine learning problem with limited data, effective offline RL algorithms require careful regularization to avoid overfitting. One-step methods perform regularization by doing just a single step of policy improvement, while critic regularization methods do many steps of policy improvement with a regularized objective. These methods appear distinct. One-step methods, such as advantage-weighted regression and conditional behavioral cloning, are simple and stable. Critic regularizatio…

Organizer

NeurIPS 2022

Konto · 961 Follower:innen

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

SolarDK: A high-resolution urban solar panel image classification and localisation dataset

04:53

SolarDK: A high-resolution urban solar panel image classification and localisation dataset

Später ansehen

Favorit

Carl A. Schmidt, …

NeurIPS 2022 2 years ago

Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples

20:27

Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples

Später ansehen

Favorit

Josué Ortega Caro

NeurIPS 2022 2 years ago

Target-based Surrogates for Stochastic Optimization

04:20

Target-based Surrogates for Stochastic Optimization

Später ansehen

Favorit

Jonathan Wilder Lavington, …

NeurIPS 2022 2 years ago

A Brief Overview of AI Governance for Responsible Machine Learning Systems

10:35

A Brief Overview of AI Governance for Responsible Machine Learning Systems

Später ansehen

Favorit

Navdeep Gill, …

NeurIPS 2022 2 years ago

SALSA: Attacking Lattice Cryptography with Transformers

04:59

SALSA: Attacking Lattice Cryptography with Transformers

Später ansehen

Favorit

Emily Wenger, …

NeurIPS 2022 2 years ago

FlyView: a bio-inspired optical flow truth dataset for visual navigation using panoramic stereo vision

04:40

FlyView: a bio-inspired optical flow truth dataset for visual navigation using panoramic stereo vision

Später ansehen

Favorit

Alix Leroy, …

NeurIPS 2022 2 years ago