Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

            Jul 25, 2023

            Sprecher:innen

            MH

            Mikael Henaff

            Sprecher:in · 0 Follower:innen

            MJ

            Minqi Jiang

            Sprecher:in · 0 Follower:innen

            RR

            Roberta Raileanu

            Sprecher:in · 0 Follower:innen

            Über

            Exploration in environments which differ across episodes has received increasing attention in recent years. Current methods use some combination of global novelty bonuses, computed using the agent's entire training experience, and episodic novelty bonuses, computed using only experience from the current episode. However, the use of these two types of bonuses has been ad-hoc and poorly understood. In this work, we shed light on the behavior of these two types of bonuses through controlled experim…

            Organisator

            I2
            I2

            ICML 2023

            Konto · 657 Follower:innen

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
            05:48

            Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

            Sam Lobel, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains
            04:49

            Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains

            Vishwaraj Doshi, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Panel Discussion on Privacy
            58:24

            Panel Discussion on Privacy

            Kristen Vaccaro, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Self-Supervised Learning in Vision: from Research Advances to Best Practices
            1:52:07

            Self-Supervised Learning in Vision: from Research Advances to Best Practices

            Xinlei Chen, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Spatial Implicit Neural Representations for Global-Scale Species Mapping
            05:15

            Spatial Implicit Neural Representations for Global-Scale Species Mapping

            Elijah Cole, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
            08:35

            FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

            Ying Sheng, …

            I2
            I2
            ICML 2023 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interessiert an Vorträgen wie diesem? ICML 2023 folgen