Další
Živý přenos začne již brzy!
Živý přenos již skončil.
Prezentace ještě nebyla nahrána!
  • title: General policy mapping: online continual reinforcement learning inspired on the insect brain
      0:00 / 0:00
      • Nahlásit chybu
      • Nastavení
      • Playlisty
      • Záložky
      • Titulky Off
      • Rychlost přehrávání
      • Kvalita
      • Nastavení
      • Debug informace
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Velikost titulků Střední
      • Záložky
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Titulky
      • Off
      • English
      • Rychlost přehrávání
      • Kvalita
      • Velikost titulků
      • Velké
      • Střední
      • Malé
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      Moje playlisty
        Záložky
          00:00:00
            General policy mapping: online continual reinforcement learning inspired on the insect brain
            • Nastavení
            • Sync diff
            • Kvalita
            • Nastavení
            • Server
            • Kvalita
            • Server

            General policy mapping: online continual reinforcement learning inspired on the insect brain

            2. prosince 2022

            Řečníci

            AY

            Angel Yanguas-Gil

            Sprecher:in · 0 Follower:innen

            SM

            Sandeep Madireddy

            Sprecher:in · 0 Follower:innen

            O prezentaci

            We have developed a model for online continual reinforcement learning (RL) inspired on the insect brain. Our model leverages the offline training of a feature extraction and a common general policy layer to enable the convergence of RL algorithms in online settings. Sharing a common policy layer across tasks leads to positive backward transfer, where the agent continuously improved in older tasks sharing the same underlying general policy. Biologically inspired restrictions to the agent's networ…

            Organizátor

            N2
            N2

            NeurIPS 2022

            Konto · 961 Follower:innen

            Baví vás formát? Nechte SlidesLive zachytit svou akci!

            Profesionální natáčení a streamování po celém světě.

            Sdílení

            Doporučená videa

            Prezentace na podobné téma, kategorii nebo přednášejícího

            Provable General Function Class Representation Learning in Multitask Bandits and MDP
            04:17

            Provable General Function Class Representation Learning in Multitask Bandits and MDP

            Rui Lu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty
            04:51

            Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty

            Jaehoon Oh, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
            04:56

            A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

            Philip Amortila, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
            01:00

            Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

            Ziyue Jiang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Hypothesis Testing for Differentially Private Linear Regression
            05:07

            Hypothesis Testing for Differentially Private Linear Regression

            Daniel Alabi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Deep Hierarchical Planning from Pixels
            08:36

            Deep Hierarchical Planning from Pixels

            Danijar Hafner, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Zajímají Vás podobná videa? Sledujte NeurIPS 2022