Adrien Ali Taiga, Ahmed Touati, Andrea Zanette, Ann Nowé, Anna Harutyunyan, Ben London, Bruno Scherrer, Chen Tessler, Christoph Dann, Dale Schuurmans, David Abel, Doina Precup, Emma Brunskill, George Konidaris, Georgios Theocharous, James Kostas, Jee Won Park, Joelle Pineau, Joshua Romoff, Lihong Li, Marc Bellemare, Matthieu Geist, Nicolas Le Roux, Olivier Pietquin, Peter Henderson, Peter Vrancx, Philip S. Thomas, Philippe Hamel, Robert Dadashi, Scott M. Jordan, Shie Mannor, Ted Sandler, Wei Wei, Yann Ollivier, Yash Chandak, Yonathan Efroni, Yuu Jinnai · Reinforcement Learning Theory · SlidesLive

Categories

EN

Log in Talk to sales

Next

Deep Learning Theory

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Reinforcement Learning Theory

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English (auto-generated)

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Reinforcement Learning Theory

Reinforcement Learning Theory

Jun 11, 2019

Speakers

Adrien Ali Taiga

Speaker · 0 followers

Ahmed Touati

Speaker · 0 followers

Andrea Zanette

Speaker · 0 followers

About

Separable value functions across time-scales In many finite horizon episodic reinforcement learning (RL) settings, it is desirable to optimize for the undiscounted return - in settings like Atari, for instance, the goal is to collect the most points while staying alive in the long run. Yet, it may be difficult (or even intractable) mathematically to learn with this target. As such, temporal discounting is often applied to optimize over a shorter effective planning horizon. This comes at the cost…

Organizer

ICML 2019

Account · 3.2k followers

Categories

AI & Data Science

Category · 10.8k presentations

Mathematics

Category · 2.4k presentations

About ICML 2019

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Real-World Sequential Decision Making - Panel Discussion

42:45

Real-World Sequential Decision Making - Panel Discussion

Watch later

Favorite

Dawn Woodard, …

ICML 2019 6 years ago

1:12:54

Fairness

Watch later

Favorite

Aaron Roth, …

ICML 2019 6 years ago

06:59

AI Commons

Watch later

Favorite

ICML 2019 6 years ago

Limits of Deepfake Detection: A Robust Estimation Viewpoint

09:38

Limits of Deepfake Detection: A Robust Estimation Viewpoint

Watch later

Favorite

ICML 2019 6 years ago

Toward Robust AI Systems for Understanding and Reasoning Over Multimodal Data

30:08

Toward Robust AI Systems for Understanding and Reasoning Over Multimodal Data

Watch later

Favorite

Hanna Hajishirzi

ICML 2019 6 years ago

Exploiting redundancy for efficient processing of DNNs and beyond

31:24

Exploiting redundancy for efficient processing of DNNs and beyond

Watch later

Favorite

ICML 2019 6 years ago