Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-012-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-012-alpha.b-cdn.net
      • sl-yoda-v3-stream-012-beta.b-cdn.net
      • 1338956956.rsc.cdn77.org
      • 1656830687.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning

            Dez 6, 2021

            Sprecher:innen

            JK

            Junsu Kim

            Sprecher:in · 0 Follower:innen

            YS

            Younggyo Seo

            Sprecher:in · 0 Follower:innen

            JS

            Jinwoo Shin

            Sprecher:in · 2 Follower:innen

            Über

            Goal-conditioned hierarchical reinforcement learning (HRL) has shown promising results for solving complex and long-horizon RL tasks. However, the action space of high-level policy in the goal-conditioned HRL is often large, so it results in poor exploration, leading to inefficiency in training. In this paper, we present HIerarchical reinforcement learning Guided by Landmarks (HIGL), a novel framework for training a high-level policy with a reduced action space guided by landmarks, i.e., promisi…

            Organisator

            N2
            N2

            NeurIPS 2021

            Konto · 1,9k Follower:innen

            Über NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Topic Modeling Revisited: A Document Graph-based Neural Network Perspective
            09:38

            Topic Modeling Revisited: A Document Graph-based Neural Network Perspective

            Dazhong Shen, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Short-term Solar Irradiance Forecasting from Sky Images
            05:06

            Short-term Solar Irradiance Forecasting from Sky Images

            Hoang Chuong Nguyen, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Modeling Advection on Directed Graphs using  Matérn Gaussian Processes for Traffic Flow
            05:19

            Modeling Advection on Directed Graphs using Matérn Gaussian Processes for Traffic Flow

            Nadim Saad, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Chest ImaGenome Dataset
            10:31

            Chest ImaGenome Dataset

            Joy Wu, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Automated Mechanism Design for Strategic Classification
            42:51

            Automated Mechanism Design for Strategic Classification

            Vincent Conitzer, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Contributed talks in Session 4
            24:44

            Contributed talks in Session 4

            Quanquan Gu, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interessiert an Vorträgen wie diesem? NeurIPS 2021 folgen