Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-012-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-012-alpha.b-cdn.net
      • sl-yoda-v3-stream-012-beta.b-cdn.net
      • 1338956956.rsc.cdn77.org
      • 1656830687.rsc.cdn77.org
      • Subtitles
      • Off
      • en
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Model-free Reinforcement Learning in Infinite-horizon Average-reward MDPs

            Jul 12, 2020

            Speakers

            CW

            Chen-Yu Wei

            Speaker · 0 followers

            HL

            Haipeng Luo

            Speaker · 1 follower

            HS

            Hiteshi Sharma

            Speaker · 0 followers

            About

            Model-free reinforcement learning is known to be memory and computation efficient and more amendable to large scale problems. In this paper, two model-free algorithms are introduced for learning infinite-horizon average-reward Markov Decision Processes (MDPs). The first algorithm reduces the problem to the discounted-reward version and achieves O(T^2/3) regret after T steps, under the minimal assumption of weakly communicating MDPs. The second algorithm makes use of recent advances in adaptive a…

            Organizer

            I2
            I2

            ICML 2020

            Account · 2.6k followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            About ICML 2020

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Consistency Regularization for Variational Autoencoders
            04:30

            Consistency Regularization for Variational Autoencoders

            Samarth Sinha, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Parameter-free Online Optimization - Part 3
            44:00

            Parameter-free Online Optimization - Part 3

            Francesco Orabona, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Symbolic Network: Generalized Neural Policies for Relational MDPs
            15:25

            Symbolic Network: Generalized Neural Policies for Relational MDPs

            Sankalp Garg, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            European Privacy Law and Global Markets for Data
            13:35

            European Privacy Law and Global Markets for Data

            Christian Peukert, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            W-EDGE: Weight Updating in Directed Graph Ensembles to improve Classification
            01:17

            W-EDGE: Weight Updating in Directed Graph Ensembles to improve Classification

            Xavier Fontes, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
            01:35

            AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data

            Nick Erickson, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2020