Hao Sun, Taiyi Wang · Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference

Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference

2. prosince 2022

Řečníci

Hao Sun

Řečník · 2 sledující

Taiyi Wang

Řečník · 0 sledujících

O prezentaci

Although it is well known that exploration plays a key role in Reinforcement Learning (RL), prevailing exploration strategies for continuous control tasks in RL are mainly based on naive isotropic Gaussian noise regardless of the causality relationship between action space and the task and consider all dimensions of actions equally important. In this work, we propose to conduct interventions on the primal action space to discover the causal relationship between the action space and the task rewa…

Organizátor

NeurIPS 2022

Účet · 961 sledujících

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Memory for narratives

40:33

Memory for narratives

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

QUARK: Controllable Text Generation with Reinforced UNlearning

04:40

QUARK: Controllable Text Generation with Reinforced UNlearning

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

Multivariate Time-Series Forecasting with Temporal Polynomial Graph Neural Networks

01:02

Multivariate Time-Series Forecasting with Temporal Polynomial Graph Neural Networks

Zhlédnout později

Oblíbené

Yijing Liu, …

NeurIPS 2022 2 years ago

Tikhonov Regularization is Optimal Transport Robust under Martingale Constraints

04:54

Tikhonov Regularization is Optimal Transport Robust under Martingale Constraints

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

Efficiently Minimizing the Maximum Loss

30:47

Efficiently Minimizing the Maximum Loss

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

Unsupervised Learning From Incomplete Measurements for Inverse Problems

03:55

Unsupervised Learning From Incomplete Measurements for Inverse Problems

Zhlédnout později

Oblíbené

Julián Tachella, …

NeurIPS 2022 2 years ago