Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu · Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

24. července 2023

Řečníci

Qiwei Di

Řečník · 0 sledujících

Jiafan He

Řečník · 0 sledujících

Dongruo Zhou

Řečník · 0 sledujících

O prezentaci

We study the Stochastic Shortest Path (SSP) problem with a linear mixture transition kernel, where an agent repeatedly interacts with a stochastic environment and seeks to reach certain goal state while minimizing the cumulative cost. Existing works often assume a strictly positive lower bound of the cost function or an upper bound of the expected length for the optimal policy. In this paper, we propose a new algorithm to eliminate these restrictive assumptions. Our algorithm is based on extende…

Organizátor

ICML 2023

Účet · 657 sledujících

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

On Learning Domain General Predictors

42:30

On Learning Domain General Predictors

Zhlédnout později

Oblíbené

Sanmi Koyejo, …

ICML 2023 2 years ago

Towards Reliable Neural Specifications

05:17

Towards Reliable Neural Specifications

Zhlédnout později

Oblíbené

Chuqin Geng, …

ICML 2023 2 years ago

Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning

09:18

Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning

Zhlédnout později

Oblíbené

Christopher Choquette-Choo, …

ICML 2023 2 years ago

Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations

08:22

Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations

Zhlédnout později

Oblíbené

ICML 2023 2 years ago

NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning

04:52

NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning

Zhlédnout později

Oblíbené

Tianxin Wei, …

ICML 2023 2 years ago

Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games

05:41

Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games

Zhlédnout později

Oblíbené

Dylan J. Foster, …

ICML 2023 2 years ago