Angel Yanguas-Gil, Sandeep Madireddy · General policy mapping: online continual reinforcement learning inspired on the insect brain · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: General policy mapping: online continual reinforcement learning inspired on the insect brain

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-010-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-010-alpha.b-cdn.net
sl-yoda-v2-stream-010-beta.b-cdn.net
1759419103.rsc.cdn77.org
1016618226.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

General policy mapping: online continual reinforcement learning inspired on the insect brain

General policy mapping: online continual reinforcement learning inspired on the insect brain

2. prosince 2022

Řečníci

Angel Yanguas-Gil

Sprecher:in · 0 Follower:innen

Sandeep Madireddy

Sprecher:in · 0 Follower:innen

O prezentaci

We have developed a model for online continual reinforcement learning (RL) inspired on the insect brain. Our model leverages the offline training of a feature extraction and a common general policy layer to enable the convergence of RL algorithms in online settings. Sharing a common policy layer across tasks leads to positive backward transfer, where the agent continuously improved in older tasks sharing the same underlying general policy. Biologically inspired restrictions to the agent's networ…

Organizátor

NeurIPS 2022

Konto · 961 Follower:innen

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Provable General Function Class Representation Learning in Multitask Bandits and MDP

04:17

Provable General Function Class Representation Learning in Multitask Bandits and MDP

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty

04:51

Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty

Später ansehen

Favorit

Jaehoon Oh, …

NeurIPS 2022 2 years ago

A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

04:56

A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

Später ansehen

Favorit

Philip Amortila, …

NeurIPS 2022 2 years ago

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

01:00

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

Später ansehen

Favorit

Ziyue Jiang, …

NeurIPS 2022 2 years ago

Hypothesis Testing for Differentially Private Linear Regression

05:07

Hypothesis Testing for Differentially Private Linear Regression

Später ansehen

Favorit

Daniel Alabi, …

NeurIPS 2022 2 years ago

Deep Hierarchical Planning from Pixels

08:36

Deep Hierarchical Planning from Pixels

Später ansehen

Favorit

Danijar Hafner, …

NeurIPS 2022 2 years ago