Christina Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum · SOPE: Spectrum of Off-Policy Estimators · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: SOPE: Spectrum of Off-Policy Estimators

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-016-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-016-alpha.b-cdn.net
sl-yoda-v3-stream-016-beta.b-cdn.net
1504562137.rsc.cdn77.org
1896834465.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

SOPE: Spectrum of Off-Policy Estimators

SOPE: Spectrum of Off-Policy Estimators

Dec 6, 2021

Speakers

Christina Yuan

Speaker · 0 followers

Yash Chandak

Speaker · 0 followers

Stephen Giguere

Speaker · 0 followers

About

Many sequential decision making problems are high-stakes and require off-policy evaluation (OPE) of a new policy using historical data collected using some other policy. One of the most common OPE technique that provides unbiased estimates is trajectory based importance sampling (IS). However, due to the high variance of trajectory IS estimates, importance sampling methods based on stationary distributions (SIS) have recently been adopted. Unfortunately, while SIS often provides lower variance e…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Differentially Private Stochastic Optimization: New Results in Convex and Non-Convex Settings

12:28

Differentially Private Stochastic Optimization: New Results in Convex and Non-Convex Settings

Watch later

Favorite

Raef Bassily, …

NeurIPS 2021 3 years ago

Generative models, inference and symmetries

21:59

Generative models, inference and symmetries

Watch later

Favorite

Danilo J. Rezende, …

NeurIPS 2021 3 years ago

Kernel Functional Optimisation

12:48

Kernel Functional Optimisation

Watch later

Favorite

Arun Kumar A V, …

NeurIPS 2021 3 years ago

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

04:57

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

Watch later

Favorite

Dushyant Rao, …

NeurIPS 2021 3 years ago

Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach

09:49

Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach

Watch later

Favorite

Ahmed Abbas, …

NeurIPS 2021 3 years ago

Opening remarks

09:23

Opening remarks

Watch later

Favorite

Yahav Bechavod, …

NeurIPS 2021 3 years ago