Yuta Saito, Qingyang Ren, Thorsten Joachims · Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Jul 24, 2023

Speakers

Yuta Saito

Speaker · 0 followers

Qingyang Ren

Speaker · 0 followers

Thorsten Joachims

Speaker · 2 followers

About

We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action spaces where conventional importance-weighting approaches suffer from excessive variance. To circumvent this variance issue, we propose a new estimator, called OffCEM, that is based on the conjunct effect model (CEM), a novel decomposition of the causal effect into a cluster effect and a residual effect. OffCEM applies importance weighting only to action clusters and addresses the residual causal effect…

Organizer

ICML 2023

Account · 657 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Universal Physics-Informed Neural Networks:

05:19

Universal Physics-Informed Neural Networks:

Watch later

Favorite

Lena Podina, …

ICML 2023 2 years ago

Welcoming remarks and introduction

07:14

Welcoming remarks and introduction

Watch later

Favorite

ICML 2023 2 years ago

Out-of-Domain Robustness via Targeted Augmentations

04:37

Out-of-Domain Robustness via Targeted Augmentations

Watch later

Favorite

ICML 2023 2 years ago

Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations

04:08

Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations

Watch later

Favorite

Jaehoon Cha, …

ICML 2023 2 years ago

Why does Throwing Away Data Improve Worst-Group Error?

07:21

Why does Throwing Away Data Improve Worst-Group Error?

Watch later

Favorite

Kamalika Chaudhuri, …

ICML 2023 2 years ago

Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling

05:07

Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling

Watch later

Favorite

Yosheb Getachew, …

ICML 2023 2 years ago