Adish Singla, Alberto Maria Metelli, Ana Paiva, Borislav Mavrin, Byron Boots, Carles Gelada, Chelsea Finn, Ching-An Cheng, David C. Parkes, Anca Dragan, Ellis Ratner, Goran Radanovic, Hengshuai Yao, Jacob Buckman, Josiah P. Hanna, Kaiwen Wu, Kelvin Xu, Linglong Kong, Lorenzo Lupo, Marc Bellemare, Marcello Restelli, Matteo Papini, Matthieu Geist, Nathan Ratliff, Ofir Nachum, Olivier Pietquin, Paul Trichelair, Peter Stone, Rati Devidze, Remi Tachet des Combes, Romain Laroche, Saurabh Kumar, Scott Niekum, Sergey Levine, Shan Luo, Xinyan Yan, Yaoliang Yu, Alexis Jacq, Zhengyao Jiang · Reinforcement Learning Theory · SlidesLive

Reinforcement Learning Theory

Jun 11, 2019

Speakers

Adish Singla

Alberto Maria Metelli

Ana Paiva

About

Safe Policy Improvement with Baseline Bootstrapping

This paper considers Safe Policy Improvement (SPI) in Batch Reinforcement Learning (Batch RL): from a fixed dataset and without direct access to the true environment, train a policy that is guaranteed to perform at least as well as the baseline policy used to collect the data. Our approach, called SPI with Baseline Bootstrapping (SPIBB), is inspired by the knows-what-it-knows paradigm: it bootstraps the trained policy with the baseline when the…

Organizer

ICML 2019

Categories

AI & Data Science

About ICML 2019

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest-growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.
