Adish Singla, Alberto Maria Metelli, Ana Paiva, Borislav Mavrin, Byron Boots, Carles Gelada, Chelsea Finn, Ching-An Cheng, David C. Parkes, Anca Dragan, Ellis Ratner, Goran Radanovic, Hengshuai Yao, Jacob Buckman, Josiah P. Hanna, Kaiwen Wu, Kelvin Xu, Linglong Kong, Lorenzo Lupo, Marc Bellemare, Marcello Restelli, Matteo Papini, Matthieu Geist, Nathan Ratliff, Ofir Nachum, Olivier Pietquin, Paul Trichelair, Peter Stone, Rati Devidze, Remi Tachet des Combes, Romain Laroche, Saurabh Kumar, Scott Niekum, Sergey Levine, Shan Luo, Xinyan Yan, Yaoliang Yu, Alexis Jacq, Zhengyao Jiang · Reinforcement Learning Theory · SlidesLive

Reinforcement Learning Theory

Jun 11, 2019

Speakers

Adish Singla

Alberto Maria Metelli

Ana Paiva

About

Safe Policy Improvement with Baseline Bootstrapping

This paper considers Safe Policy Improvement (SPI) in Batch Reinforcement Learning (Batch RL): from a fixed dataset and without direct access to the true environment, train a policy that is guaranteed to perform at least as well as the baseline policy used to collect the data. Our approach, called SPI with Baseline Bootstrapping (SPIBB), is inspired by the knows-what-it-knows paradigm: it bootstraps the trained policy with the baseline when the…

Organizer

ICML 2019

Categories

AI & Data Science

About ICML 2019

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.
