Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Learning Mixtures of Markov Chains and MDPs
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Learning Mixtures of Markov Chains and MDPs
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Learning Mixtures of Markov Chains and MDPs

            Jul 25, 2023

            Speakers

            CK

            Chinmaya Kausik

            Speaker · 0 followers

            KT

            Kevin Tan

            Speaker · 0 followers

            AT

            Ambuj Tewari

            Speaker · 0 followers

            About

            We present an algorithm for learning mixtures of Markov chains and Markov decision processes (MDPs) from short unlabeled trajectories. Specifically, our method handles mixtures of Markov chains with optional control input by going through a multi-step process, involving (1) a subspace estimation step, (2) spectral clustering of trajectories using "pairwise distance estimators," along with refinement using the EM algorithm, (3) a model estimation step, and (4) a classification step for predicting…

            Organizer

            I2
            I2

            ICML 2023

            Account · 626 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks
            05:08

            Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks

            Shikuang Deng, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations
            04:08

            Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations

            Jaehoon Cha, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs
            05:29

            K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs

            Andrea Coletta, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Towards Explaining Distribution Shifts
            05:18

            Towards Explaining Distribution Shifts

            Sean Kulinski, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Off-Policy Average Reward Actor-Critic with Deterministic Policy Search
            04:51

            Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

            Naman Saxena, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Future-conditioned Unsupervised Pretraining for Decision Transformer
            05:02

            Future-conditioned Unsupervised Pretraining for Decision Transformer

            Zhihui Xie, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023