July 24, 2023
Aiming to produce reinforcement learning (RL) policies that are human-interpretable and can generalize better to novel scenarios, Trivedi et al. (2021) present a method (LEAPS) that first learns a program embedding space to continuously parameterize diverse programs from a pre-generated program dataset, and then searches this learned embedding space for a task-solving program when given a task. Despite encouraging results, the program policies that LEAPS can produce are limited by the distribution of the pre-generated program dataset. Furthermore, during the search, LEAPS evaluates each candidate program solely on its return, which fails to precisely reward the correct parts of a program and penalize the incorrect parts. To address these issues, we propose to learn a meta-policy that composes programs sampled from the learned program embedding space. By composing programs, our proposed method can produce program policies that describe out-of-distributionally complex behaviors and directly assign credit to the programs that induce desired behaviors. Experimental results in the Karel domain show that our proposed framework outperforms baselines. Ablation studies justify our design choices, including the RL algorithm used to learn the meta-policy and the dimensionality of the program embedding space.
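The key idea above can be illustrated with a minimal sketch: instead of searching for a single latent vector whose decoded program must solve the whole task, a meta-policy emits a sequence of latent vectors, each decoded into a sub-program that is executed and credited individually. Everything here (the toy environment, `decode_program`, `MetaPolicy`) is a hypothetical illustration, not the paper's actual implementation or API.

```python
import numpy as np

rng = np.random.default_rng(0)

def decode_program(z):
    """Toy stand-in for a learned decoder: map a latent vector to a program
    (here, a constant-action program determined by the latent's sign)."""
    return lambda state: float(np.sign(z.sum()))

def run_program(program, state, horizon=5):
    """Execute a decoded program in a toy 1-D environment; reward is higher
    the closer the final state is to the origin."""
    for _ in range(horizon):
        state += program(state)
    return state, -abs(state)

class MetaPolicy:
    """Hypothetical meta-policy: samples latent vectors from a Gaussian.
    In practice this would be a trained RL policy over the embedding space."""
    def __init__(self, dim=4):
        self.mean = np.zeros(dim)

    def act(self):
        return self.mean + rng.normal(size=self.mean.shape)

meta = MetaPolicy()
state = 3.0
credits = []  # one credit signal per composed sub-program
for step in range(3):
    z = meta.act()                    # sample a latent vector
    prog = decode_program(z)          # decode it into a sub-program
    state, r = run_program(prog, state)
    credits.append(r)                 # credit this sub-program directly,
                                      # rather than one return for the whole rollout
```

The contrast with latent-space search is in the `credits` list: each decoded sub-program receives its own reward, so a learning signal can favor useful sub-programs even when the overall episode fails.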