Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning

            Dec 2, 2022

            Speakers

            Mingqi Yuan

            Bo Li

            Xin Jin

            About

            Exploration is critical for deep reinforcement learning in complex environments with high-dimensional observations and sparse rewards. To address this problem, recent approaches leverage intrinsic rewards to improve exploration, for example through novelty-based or prediction-based exploration. However, many intrinsic reward modules require sophisticated structures and representation learning, resulting in prohibitive computational complexity and unstable performance. In this paper,…
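As a rough illustration of the "novelty-based exploration" mentioned in the abstract, the sketch below implements a generic count-based bonus that decays as a state is revisited. It is not the method proposed in this talk (the abstract is truncated before the method is described). The class name `CountNoveltyBonus`, the helper `shaped_reward`, the parameter `beta`, and the `beta / sqrt(count)` form are all assumptions chosen for illustration, and states are assumed to be hashable (for example, discretized grid positions).

```python
import math
from collections import defaultdict


class CountNoveltyBonus:
    """Hypothetical count-based intrinsic reward: beta / sqrt(N(s))."""

    def __init__(self, beta=1.0):
        self.beta = beta                 # scale of the bonus (assumed hyperparameter)
        self.counts = defaultdict(int)   # visit count per hashable state

    def __call__(self, state):
        # Record the visit, then return a bonus that shrinks with each revisit.
        self.counts[state] += 1
        return self.beta / math.sqrt(self.counts[state])


def shaped_reward(extrinsic, bonus, state):
    """Add the intrinsic bonus to the (possibly sparse) extrinsic reward."""
    return extrinsic + bonus(state)
```

A first visit to a state yields the full bonus `beta`, and later visits yield progressively less, so an agent trained on the shaped reward is nudged toward states it has seen rarely. Exact counts only work for small discrete state spaces; with high-dimensional observations some approximation would be needed.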

            Organizer

            NeurIPS 2022
