Reinforcement Learning Theory

            Jun 11, 2019

            Speakers

            Adish Singla

            Alberto Maria Metelli

            Ana Paiva

            About

            Safe Policy Improvement with Baseline Bootstrapping

            This paper considers Safe Policy Improvement (SPI) in Batch Reinforcement Learning (Batch RL): from a fixed dataset and without direct access to the true environment, train a policy that is guaranteed to perform at least as well as the baseline policy used to collect the data. Our approach, called SPI with Baseline Bootstrapping (SPIBB), is inspired by the knows-what-it-knows paradigm: it bootstraps the trained policy with the baseline when the…
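
The abstract above is cut off before it says when the baseline takes over, so the following is only a rough sketch of the bootstrapping idea, not the authors' implementation. It assumes (this is an assumption, not stated in the text shown here) that a state-action pair counts as "uncertain" when it appears fewer than `min_count` times in the dataset. The function name `bootstrapped_greedy_step`, the threshold `min_count`, and the toy arrays are all hypothetical. On uncertain pairs the new policy copies the baseline's probabilities; the remaining probability mass in each state goes to the well-sampled action with the highest estimated value.

```python
import numpy as np


def bootstrapped_greedy_step(q, baseline, counts, min_count):
    """One greedy improvement step that defers to the baseline on rarely seen pairs.

    q, baseline, counts: array-likes of shape (n_states, n_actions).
    min_count: hypothetical threshold (an assumption); pairs observed fewer
    times than this are treated as uncertain.
    """
    q = np.asarray(q, dtype=float)
    baseline = np.asarray(baseline, dtype=float)
    uncertain = np.asarray(counts) < min_count

    # Copy the baseline's probabilities wherever the data is too sparse.
    policy = np.where(uncertain, baseline, 0.0)

    # Mass the baseline placed on well-sampled actions is free to reallocate.
    free_mass = 1.0 - policy.sum(axis=1)

    # Pick the best well-sampled action per state; uncertain ones are masked out.
    masked_q = np.where(uncertain, -np.inf, q)
    best = masked_q.argmax(axis=1)

    # States with no well-sampled action keep the baseline unchanged.
    rows = np.flatnonzero((~uncertain).any(axis=1))
    policy[rows, best[rows]] += free_mass[rows]
    return policy


# Toy example: two states, three actions.
q_values = [[1.0, 2.0, 3.0], [0.5, 0.1, 0.2]]
baseline_policy = [[0.2, 0.3, 0.5], [0.4, 0.3, 0.3]]
visit_counts = [[10, 10, 1], [0, 0, 0]]

policy = bootstrapped_greedy_step(q_values, baseline_policy, visit_counts, min_count=5)
print(policy)
```

In the toy example, the third action of the first state has the highest estimated value but only one sample, so it keeps its baseline probability and the freed mass goes to the second action. The second state has no data at all, so the baseline is returned unchanged there.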

            Organizer

            ICML 2019

            Categories

            AI & Data Science

            About ICML 2019

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.
