            Learning Exploration Policies with View-based Intrinsic Rewards

            Dec 2, 2022

            Speakers

            Yijie Guo

            Yao Fu

            Run Peng

            About

            Efficient exploration in sparse-reward tasks is one of the biggest challenges in deep reinforcement learning. Common approaches introduce intrinsic rewards to motivate exploration. For example, visitation counts and prediction-based curiosity use measures of novelty to drive the agent to visit novel states in the environment. However, in partially observable environments, these methods can easily be misled by relatively “novel” or noisy observations and get stuck around them. Motivated b…

            Organizer

            NeurIPS 2022
