Adish Singla, Alberto Maria Metelli, Ana Paiva, Borislav Mavrin, Byron Boots, Carles Gelada, Chelsea Finn, Ching-An Cheng, David C. Parkes, EECS Anca Dragan, Ellis Ratner, Goran Radanovic, Hengshuai Yao, Jacob Buckman, Josiah P. Hanna, Kaiwen Wu, Kelvin Xu, Linglong Kong, Lorenzo Lupo, Marc Bellemare, Marcello Restelli, Matteo Papini, Matthieu Geist, Nathan Ratliff, Ofir Nachum, Olivier Pietquin, Paul TRICHELAIR, Peter Stone, Rati Devidze, Remi Tachet des Combes, Romain Laroche, Saurabh Kumar, Scott Niekum, Sergey Levine, Shan Luo, Xinyan Yan, Yaoliang Yu, alexis jacq, zhengyao jiang · Reinforcement Learning Theory · SlidesLive

Categories

EN

Log in Talk to sales

Next

Reinforcement Learning Theory

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Reinforcement Learning Theory

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-006-alpha.b-cdn.net
sl-yoda-v3-stream-006-beta.b-cdn.net
1375548855.rsc.cdn77.org
1312734894.rsc.cdn77.org

Subtitles
Off
English (auto-generated)

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Reinforcement Learning Theory

Reinforcement Learning Theory

Jun 11, 2019

Speakers

Adish Singla

Speaker · 0 followers

Alberto Maria Metelli

Speaker · 0 followers

Ana Paiva

Speaker · 0 followers

About

Safe Policy Improvement with Baseline Bootstrapping This paper considers Safe Policy Improvement (SPI) in Batch Reinforcement Learning (Batch RL): from a fixed dataset and without direct access to the true environment, train a policy that is guaranteed to perform at least as well as the baseline policy used to collect the data. Our approach, called SPI with Baseline Bootstrapping (SPIBB), is inspired by the knows-what-it-knows paradigm: it bootstraps the trained policy with the baseline when the…

Organizer

ICML 2019

Account · 3.2k followers

Categories

AI & Data Science

Category · 10.8k presentations

About ICML 2019

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Making Efficient use of Musical Annotations

19:05

Making Efficient use of Musical Annotations

Watch later

Favorite

ICML 2019 6 years ago

Doubly Robust Off-Policy Evaluation with Shrinkage

32:25

Doubly Robust Off-Policy Evaluation with Shrinkage

Watch later

Favorite

ICML 2019 6 years ago

Contributed talks

15:49

Contributed talks

Watch later

Favorite

Krzysztof Jerzy Geras, …

ICML 2019 6 years ago

Robust Perception, Imitation, and Reinforcement Learning for Embodied Learning Machines

31:13

Robust Perception, Imitation, and Reinforcement Learning for Embodied Learning Machines

Watch later

Favorite

ICML 2019 6 years ago

We Need No Pixels: Video Manipulation Detection Using Stream Descriptors

14:05

We Need No Pixels: Video Manipulation Detection Using Stream Descriptors

Watch later

Favorite

ICML 2019 6 years ago

Adversarial Policies: Attacking Deep Reinforcement Learning

09:57

Adversarial Policies: Attacking Deep Reinforcement Learning

Watch later

Favorite

ICML 2019 6 years ago