Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: An Optimistic Perspective on Offline Deep Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Subtitles
      • Off
      • en
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            An Optimistic Perspective on Offline Deep Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            An Optimistic Perspective on Offline Deep Reinforcement Learning

            Jul 12, 2020

            Speakers

            RA
            RA

            Rishabh Agarwal

            Speaker · 2 followers

            DS

            Dale Schuurmans

            Speaker · 2 followers

            MN

            Mohammad Norouzi

            Speaker · 1 follower

            About

            Off-policy reinforcement learning (RL) using a fixed offline dataset of logged interactions is an important consideration in real world applications. This paper studies offline RL using the DQN replay dataset comprising the entire replay experience of a DQN agent on 60 Atari 2600 games. We demonstrate that recent off-policy deep RL algorithms, even when trained solely on this replay dataset, outperform the fully trained DQN agent. To enhance generalization in the offline setting, we present Rand…

            Organizer

            I2
            I2

            ICML 2020

            Account · 2.7k followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            About ICML 2020

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Sights and Sounds in 3D Spaces
            23:12

            Sights and Sounds in 3D Spaces

            Kristen Grauman

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Ready Policy One: World Building Through Active Learning
            15:31

            Ready Policy One: World Building Through Active Learning

            Jack Parker-Holder, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Amortised Learning by Wake-Sleep
            10:33

            Amortised Learning by Wake-Sleep

            Kevin Wenliang, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Reducing Sampling Error in Batch Temporal Difference Learning
            14:32

            Reducing Sampling Error in Batch Temporal Difference Learning

            Brahma Pavse, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Spectral Frank-Wolfe Algorithm: Strict Complementarity and Linear Convergence
            15:12

            Spectral Frank-Wolfe Algorithm: Strict Complementarity and Linear Convergence

            Lijun Ding, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Selective Dyna-style Planning Under Limited Model Capacity
            14:59

            Selective Dyna-style Planning Under Limited Model Capacity

            Muhammad Zaheer, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2020