Chinmaya Kausik, Kevin Tan, Ambuj Tewari · Learning Mixtures of Markov Chains and MDPs · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Learning Mixtures of Markov Chains and MDPs

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-007-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-007-alpha.b-cdn.net
sl-yoda-v2-stream-007-beta.b-cdn.net
1678031076.rsc.cdn77.org
1932936657.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Learning Mixtures of Markov Chains and MDPs

Learning Mixtures of Markov Chains and MDPs

Jul 25, 2023

Speakers

Chinmaya Kausik

Speaker · 0 followers

Kevin Tan

Speaker · 0 followers

Ambuj Tewari

Speaker · 0 followers

About

We present an algorithm for learning mixtures of Markov chains and Markov decision processes (MDPs) from short unlabeled trajectories. Specifically, our method handles mixtures of Markov chains with optional control input by going through a multi-step process, involving (1) a subspace estimation step, (2) spectral clustering of trajectories using "pairwise distance estimators," along with refinement using the EM algorithm, (3) a model estimation step, and (4) a classification step for predicting…

Organizer

ICML 2023

Account · 626 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks

05:08

Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks

Watch later

Favorite

Shikuang Deng, …

ICML 2023 2 years ago

Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations

04:08

Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations

Watch later

Favorite

Jaehoon Cha, …

ICML 2023 2 years ago

K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs

05:29

K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs

Watch later

Favorite

Andrea Coletta, …

ICML 2023 2 years ago

Towards Explaining Distribution Shifts

05:18

Towards Explaining Distribution Shifts

Watch later

Favorite

Sean Kulinski, …

ICML 2023 2 years ago

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

04:51

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

Watch later

Favorite

Naman Saxena, …

ICML 2023 2 years ago

Future-conditioned Unsupervised Pretraining for Decision Transformer

05:02

Future-conditioned Unsupervised Pretraining for Decision Transformer

Watch later

Favorite

Zhihui Xie, …

ICML 2023 2 years ago