Chang Yang, Ruiyu Wang, Xinrun Wang, Zhen Wang · A Game-Theoretic Perspective of Generalization in Reinforcement Learning · SlidesLive

A Game-Theoretic Perspective of Generalization in Reinforcement Learning

December 2, 2022

Speakers

Chang Yang

Ruiyu Wang

Xinrun Wang

About the presentation

Generalization in reinforcement learning (RL) is important for the real-world deployment of RL algorithms. Various schemes have been proposed to address generalization, including transfer learning, multi-task learning, meta-learning, and robust and adversarial reinforcement learning. However, there is neither a unified formulation of these schemes nor a comprehensive comparison of methods across them. In this work, we propose GiRL, a game-theoretic framework for generalization i…

Organizer

NeurIPS 2022
