Reinforcement Learning Theory

            Jun 11, 2019

            Speakers

            Adish Singla

            Alberto Maria Metelli

            Ana Paiva

            About

            Safe Policy Improvement with Baseline Bootstrapping

            This paper considers Safe Policy Improvement (SPI) in Batch Reinforcement Learning (Batch RL): from a fixed dataset and without direct access to the true environment, train a policy that is guaranteed to perform at least as well as the baseline policy used to collect the data. Our approach, called SPI with Baseline Bootstrapping (SPIBB), is inspired by the knows-what-it-knows paradigm: it bootstraps the trained policy with the baseline when the…

            Organizer

            ICML 2019

            Categories

            AI & Data Science

            About ICML 2019

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest-growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

