Yash Chandak, Scott Niekum, Bruno C. Da Silva, Erik Learned-Miller, Emma Brunskill, Philip S. Thomas · Universal Off-Policy Evaluation · SlidesLive

Categories

Arts, Design & Media

Category · 1.2k presentations

Business & Economics

Category · 3.8k presentations

Computer Science & IT

Category · 14.8k presentations

Engineering & Technology

Category · 491 presentations

Humanities & Social Sciences

Category · 1.3k presentations

Medicine & Health

Category · 529 presentations

Natural & Formal Sciences

Category · 3.3k presentations

Self Development & Lifestyle

Category · 599 presentations

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Universal Off-Policy Evaluation

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-013-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-013-alpha.b-cdn.net
sl-yoda-v3-stream-013-beta.b-cdn.net
1668715672.rsc.cdn77.org
1420896597.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Universal Off-Policy Evaluation

Universal Off-Policy Evaluation

Dec 6, 2021

Speakers

Yash Chandak

Speaker · 0 followers

Scott Niekum

Speaker · 1 follower

Bruno C. Da Silva

Speaker · 0 followers

About

When faced with sequential decision-making problems, it is often useful to be able to predict what would happen if decisions were made using a new policy. Those predictions must often be based on data collected under some previously used decision-making rule. Many previous methods enable such off-policy (or counterfactual) estimation of the _expected_ value of a performance measure called the return. In this paper, we take the first steps towards a 'universal off-policy estimator' (UnO)—one that…

Organizer

NeurIPS 2021

Account · 1.5k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

An Outcome Test of Discrimination for Ranked Lists

09:21

An Outcome Test of Discrimination for Ranked Lists

Watch later

Favorite

Jonathan Roth, …

NeurIPS 2021 3 years ago

Covariance-Aware Private Mean Estimation Without Private Covariance Estimation

14:33

Covariance-Aware Private Mean Estimation Without Private Covariance Estimation

Watch later

Favorite

Gavin Brown, …

NeurIPS 2021 3 years ago

Individual Privacy Accounting via a Rényi Filter

15:16

Individual Privacy Accounting via a Rényi Filter

Watch later

Favorite

Vitaly Feldman, …

NeurIPS 2021 3 years ago

You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection

04:20

You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection

Watch later

Favorite

Yuxin Fang, …

NeurIPS 2021 3 years ago

Invertible Tabular GANs: Killing Two Birds with One Stone for Tabular Data Synthesis

10:55

Invertible Tabular GANs: Killing Two Birds with One Stone for Tabular Data Synthesis

Watch later

Favorite

Jaehoon Lee, …

NeurIPS 2021 3 years ago

Analytic Insights into Structure and Rank of Neural Network Hessian Maps

15:08

Analytic Insights into Structure and Rank of Neural Network Hessian Maps

Watch later

Favorite

Sidak Pal Singh, …

NeurIPS 2021 3 years ago