Next
Deep Learning Theory
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Reinforcement Learning Theory
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English (auto-generated)
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Reinforcement Learning Theory
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Reinforcement Learning Theory

            Jun 11, 2019

            Speakers

            AAT

            Adrien Ali Taiga

            Speaker · 0 followers

            AT

            Ahmed Touati

            Speaker · 0 followers

            AZ

            Andrea Zanette

            Speaker · 0 followers

            About

            Separable value functions across time-scales In many finite horizon episodic reinforcement learning (RL) settings, it is desirable to optimize for the undiscounted return - in settings like Atari, for instance, the goal is to collect the most points while staying alive in the long run. Yet, it may be difficult (or even intractable) mathematically to learn with this target. As such, temporal discounting is often applied to optimize over a shorter effective planning horizon. This comes at the cost…

            Organizer

            I2
            I2

            ICML 2019

            Account · 3.2k followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            Mathematics

            Category · 2.4k presentations

            About ICML 2019

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Real-World Sequential Decision Making - Panel Discussion
            42:45

            Real-World Sequential Decision Making - Panel Discussion

            Dawn Woodard, …

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Fairness
            1:12:54

            Fairness

            Aaron Roth, …

            I2
            I2
            ICML 2019 6 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            AI Commons
            06:59

            AI Commons

            Yoshua Bengio

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Limits of Deepfake Detection: A Robust Estimation Viewpoint
            09:38

            Limits of Deepfake Detection: A Robust Estimation Viewpoint

            Kush Varshney

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Toward Robust AI Systems for Understanding and Reasoning Over Multimodal Data
            30:08

            Toward Robust AI Systems for Understanding and Reasoning Over Multimodal Data

            Hanna Hajishirzi

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Exploiting redundancy for efficient processing of DNNs and beyond
            31:24

            Exploiting redundancy for efficient processing of DNNs and beyond

            Vivienne Sze

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2019