Chen-Yu Wei, Haipeng Luo, Hiteshi Sharma, Rahul Jain, Mehdi Jafarnia-Jahromi · Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-012-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-012-alpha.b-cdn.net
sl-yoda-v3-stream-012-beta.b-cdn.net
1338956956.rsc.cdn77.org
1656830687.rsc.cdn77.org

Subtitles
Off
en

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs

Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs

Jul 12, 2020

Speakers

Chen-Yu Wei

Speaker · 0 followers

Haipeng Luo

Speaker · 1 follower

Hiteshi Sharma

Speaker · 0 followers

About

Model-free reinforcement learning is known to be memory and computation efficient and more amendable to large scale problems. In this paper, two model-free algorithms are introduced for learning infinite-horizon average-reward Markov Decision Processes (MDPs). The first algorithm reduces the problem to the discounted-reward version and achieves O(T^2/3) regret after T steps, under the minimal assumption of weakly communicating MDPs. The second algorithm makes use of recent advances in adaptive a…

Organizer

ICML 2020

Account · 2.6k followers

Categories

AI & Data Science

Category · 10.8k presentations

About ICML 2020

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Consistency Regularization for Variational Autoencoders

04:30

Consistency Regularization for Variational Autoencoders

Watch later

Favorite

Samarth Sinha, …

ICML 2020 5 years ago

Parameter-free Online Optimization - Part 3

44:00

Parameter-free Online Optimization - Part 3

Watch later

Favorite

Francesco Orabona, …

ICML 2020 5 years ago

Symbolic Network: Generalized Neural Policies for Relational MDPs

15:25

Symbolic Network: Generalized Neural Policies for Relational MDPs

Watch later

Favorite

Sankalp Garg, …

ICML 2020 5 years ago

European Privacy Law and Global Markets for Data

13:35

European Privacy Law and Global Markets for Data

Watch later

Favorite

Christian Peukert, …

ICML 2020 5 years ago

W-EDGE: Weight Updating in Directed Graph Ensembles to improve Classification

01:17

W-EDGE: Weight Updating in Directed Graph Ensembles to improve Classification

Watch later

Favorite

Xavier Fontes, …

ICML 2020 5 years ago

AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data

01:35

AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data

Watch later

Favorite

Nick Erickson, …

ICML 2020 5 years ago