Lior Shani, Yonathan Efroni, Aviv Rosenberg, Shie Mannor · Optimistic Policy Optimization with Bandit Feedback · SlidesLive

Optimistic Policy Optimization with Bandit Feedback

Jul 12, 2020

Speakers

Lior Shani

Yonathan Efroni

Aviv Rosenberg

About

Policy optimization methods are one of the most widely used classes of Reinforcement Learning (RL) algorithms. Yet, so far, such methods have been mostly analyzed from an optimization perspective, without addressing the problem of exploration, or by making strong assumptions on the interaction with the environment. In this paper we consider model-based RL in the tabular finite-horizon MDP setting with unknown transitions and bandit feedback. For this setting, we propose an optimistic trust regio…

Organizer

ICML 2020

Categories

AI & Data Science

About ICML 2020

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.
