Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Fast Adaptation via Policy-Dynamics Value Functions
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • en
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Fast Adaptation via Policy-Dynamics Value Functions
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Fast Adaptation via Policy-Dynamics Value Functions

            Jul 12, 2020

            Speakers

            RR

            Roberta Raileanu

            Speaker · 0 followers

            MG

            Max Goldstein

            Speaker · 0 followers

            AS

            Arthur Szlam

            Speaker · 0 followers

            About

            Standard RL algorithms assume fixed environment dynamics and require a significant amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value Functions (PD-VF), a novel approach for rapidly adapting to dynamics different from those previously seen in training. PD-VF explicitly estimates the cumulative reward in a space of policies and environments. An ensemble of conventional RL policies is used to gather experience on training environments, from which embeddings of b…

            Organizer

            I2
            I2

            ICML 2020

            Account · 2.7k followers

            Categories

            Software & Programming

            Category · 1k presentations

            AI & Data Science

            Category · 10.8k presentations

            About ICML 2020

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Keynote speakers: Q&A - 2 + MC
            12:48

            Keynote speakers: Q&A - 2 + MC

            Karla Caballero, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Closing Remarks
            13:06

            Closing Remarks

            Petar Veličković

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition
            14:17

            A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

            Anurag Kumar, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On Relativistic f-Divergences

            Alexia Jolicoeur-Martineau

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Off-Policy Actor-Critic with Shared Experience Replay
            14:38

            Off-Policy Actor-Critic with Shared Experience Replay

            Simon Schmitt, …

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Continual Learning from an Learning perspective
            36:16

            Continual Learning from an Learning perspective

            Razvan Pascanu

            I2
            I2
            ICML 2020 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2020