Next
Deep Learning Theory
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Reinforcement Learning Theory
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English (auto-generated)
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Reinforcement Learning Theory
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Reinforcement Learning Theory

            Jun 11, 2019

            Speakers

            AAT

            Adrien Ali Taiga

            Speaker · 0 followers

            AT

            Ahmed Touati

            Speaker · 0 followers

            AZ

            Andrea Zanette

            Speaker · 0 followers

            About

            Separable value functions across time-scales In many finite horizon episodic reinforcement learning (RL) settings, it is desirable to optimize for the undiscounted return - in settings like Atari, for instance, the goal is to collect the most points while staying alive in the long run. Yet, it may be difficult (or even intractable) mathematically to learn with this target. As such, temporal discounting is often applied to optimize over a shorter effective planning horizon. This comes at the cost…

            Organizer

            I2
            I2

            ICML 2019

            Account · 3.2k followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            Mathematics

            Category · 2.4k presentations

            About ICML 2019

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Robust Statistics and Interpretability
            1:14:51

            Robust Statistics and Interpretability

            Alain Tapp, …

            I2
            I2
            ICML 2019 6 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            Gordian Biotechnology: Exploring the in vivo perturbome
            39:43

            Gordian Biotechnology: Exploring the in vivo perturbome

            Francisco LePort

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Panel discussion
            53:13

            Panel discussion

            Aviv Ovadya, …

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Visualizing and Understanding Self-attention based Music Tagging
            13:23

            Visualizing and Understanding Self-attention based Music Tagging

            Minz Won

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Convergence Properties of Neural Networks on Separable Data
            08:26

            Convergence Properties of Neural Networks on Separable Data

            Remi Tachet des Combes

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Random Search and Reproducibility for Neural Architecture Search
            22:37

            Random Search and Reproducibility for Neural Architecture Search

            Liam Li

            I2
            I2
            ICML 2019 6 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2019