Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

            Jul 25, 2023

            Speakers

            MH

            Mikael Henaff

            Speaker · 0 followers

            MJ

            Minqi Jiang

            Speaker · 0 followers

            RR

            Roberta Raileanu

            Speaker · 0 followers

            About

            Exploration in environments which differ across episodes has received increasing attention in recent years. Current methods use some combination of global novelty bonuses, computed using the agent's entire training experience, and episodic novelty bonuses, computed using only experience from the current episode. However, the use of these two types of bonuses has been ad-hoc and poorly understood. In this work, we shed light on the behavior of these two types of bonuses through controlled experim…

            Organizer

            I2
            I2

            ICML 2023

            Account · 655 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior
            05:19

            MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior

            Jennifer J. Sun, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            RényiTester, a Variational Approach to Testing Differential Privacy
            35:43

            RényiTester, a Variational Approach to Testing Differential Privacy

            Mónica Ribero

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Causal Bounds in Quasi-Markovian Graphs
            05:32

            Causal Bounds in Quasi-Markovian Graphs

            Madhumitha Shridharan, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            DIVISION: Memory Efficient Training via Dual Activation Precision
            04:58

            DIVISION: Memory Efficient Training via Dual Activation Precision

            Guanchu Wang, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
            05:20

            Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning

            Mingqi Yuan, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders
            04:21

            Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders

            Oskar Kviman, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023