Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Temporary Goals for Exploration
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Temporary Goals for Exploration
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Temporary Goals for Exploration

            Dec 2, 2022

            Speakers

            HX

            Haoyang Xu

            Řečník · 0 sledujících

            JB

            Jimmy Ba

            Řečník · 2 sledující

            SP

            Silviu Pitis

            Řečník · 0 sledujících

            About

            Exploration has always been a crucial aspect of reinforcement learning. When facing long horizon sparse reward environments modern methods still struggle with effective exploration and generalize poorly. In the multi-goal reinforcement learning setting, out-of-distribution goals might appear similar to the achieved ones, but the agent can never accurately assess its ability to achieve them without attempting them. To enable faster exploration and improve generalization, we propose an exploration…

            Organizer

            N2
            N2

            NeurIPS 2022

            Účet · 961 sledujících

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty
            04:51

            Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty

            Jaehoon Oh, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Leveraging Inter-Layer Dependency for Post -Training Quantization
            03:58

            Leveraging Inter-Layer Dependency for Post -Training Quantization

            Changbao Wang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Welcome and Opening Remarks
            06:01

            Welcome and Opening Remarks

            Karl Popper

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
            05:02

            On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

            Tomasz Korbak, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Contrastive Language-Image Pre-Training with Knowledge Graphs
            05:04

            Contrastive Language-Image Pre-Training with Knowledge Graphs

            Xuran Pan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Antigen-Specific Antibody Design and Optimization with Diffusion-Based Generative Models for Protein Structures
            02:32

            Antigen-Specific Antibody Design and Optimization with Diffusion-Based Generative Models for Protein Structures

            Shitong Luo, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interested in talks like this? Follow NeurIPS 2022