Joan Bas-Serrano, Andreas Krause, Sebastian Curi, Gergely Neu · Logistic Q-Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Logistic Q-Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-015-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-015-alpha.b-cdn.net
sl-yoda-v3-stream-015-beta.b-cdn.net
1963568160.rsc.cdn77.org
1940033649.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Logistic Q-Learning

Logistic Q-Learning

Apr 14, 2021

Speakers

Joan Bas-Serrano

Speaker · 0 followers

Andreas Krause

Speaker · 6 followers

Sebastian Curi

Speaker · 0 followers

About

We propose a new reinforcement learning algorithm derived from a regularized linear-programming formulation of optimal control in MDPs. The method is closely related to the classic Relative Entropy Policy Search (REPS) algorithm of Peters et al. (2010), with the key difference that our method introduces a Q-function that enables efficient exact model-free implementation. The main feature of our algorithm (called Q-REPS) is a convex loss function for policy evaluation that serves as a theoretical…

Organizer

AISTATS 2021

Account · 63 followers

Categories

AI & Data Science

Category · 10.8k presentations

About AISTATS 2021

The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

An Analysis of LIME for Text Data

02:49

An Analysis of LIME for Text Data

Watch later

Favorite

Dina Mardaoui, …

AISTATS 2021 4 years ago

On the Privacy Properties of GAN-generated Samples

03:09

On the Privacy Properties of GAN-generated Samples

Watch later

Favorite

AISTATS 2021 4 years ago

Associative Convolutional Layers

03:09

Associative Convolutional Layers

Watch later

Favorite

Hamed Omidvar, …

AISTATS 2021 4 years ago

Momentum Improves Optimization on Riemannian Manifolds

03:15

Momentum Improves Optimization on Riemannian Manifolds

Watch later

Favorite

Foivos Alimisis, …

AISTATS 2021 4 years ago

Fair for All: Best-effort Guarantees for Fairness in Classification

02:50

Fair for All: Best-effort Guarantees for Fairness in Classification

Watch later

Favorite

Anilesh Krishnaswamy, …

AISTATS 2021 4 years ago

Distributionally Robust Optimization for Deep Kernel Multiple Instance Learning

02:58

Distributionally Robust Optimization for Deep Kernel Multiple Instance Learning

Watch later

Favorite

Hitesh Sapkota, …

AISTATS 2021 4 years ago