Benjamin Eysenbach, Matthieu Geist, Sergey Levine, Russ Salakhutdinov · A Connection between One-Step RL and Critic Regularization in Reinforcement Learning · SlidesLive

A Connection between One-Step RL and Critic Regularization in Reinforcement Learning

Jul 24, 2023

Speakers

Benjamin Eysenbach

Matthieu Geist

Sergey Levine

About

As with any machine learning problem with limited data, effective offline RL algorithms require careful regularization to avoid overfitting. One class of methods, known as one-step RL, performs just one step of policy improvement. These methods, which include advantage-weighted regression and conditional behavioral cloning, are thus simple and stable, but can have limited asymptotic performance. A second class of methods, known as critic regularization, performs many steps of policy improvement wi…

Organizer

ICML 2023
