Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Online Policy Optimization for Robust MDP
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-008-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-008-alpha.b-cdn.net
      • sl-yoda-v2-stream-008-beta.b-cdn.net
      • 1159783934.rsc.cdn77.org
      • 1511376917.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Online Policy Optimization for Robust MDP
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Online Policy Optimization for Robust MDP

            Dec 2, 2022

            Speakers

            JD

            Jing Dong

            Speaker · 0 followers

            JL

            Jingwei Liang

            Speaker · 0 followers

            BW

            Baoxiang Wang

            Speaker · 0 followers

            About

            Reinforcement learning (RL) has exceeded human performance in many synthetic settings such as video games and Go. However, real-world deployment of end-to-end RL models is less common, as RL models can be very sensitive to slight perturbation of the environment. The robust Markov decision process (MDP) framework—in which the transition probabilities belong to an uncertainty set around a nominal model—provides one way to develop robust models. While previous analysis shows RL algorithms are effec…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 954 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning
            04:15

            Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning

            Chenyang Wu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits
            04:57

            Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits

            Tianyuan Jin, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization
            05:01

            First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization

            Siddharth Reddy, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Online Neural Sequence Detection with Hierarchical Dirichlet Point Process
            05:47

            Online Neural Sequence Detection with Hierarchical Dirichlet Point Process

            Weihan Li, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Exploring the Latent Space of Autoencoders with Interventional Assays
            04:49

            Exploring the Latent Space of Autoencoders with Interventional Assays

            Felix Leeb, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Score-based Generative Models and Their Applications
            29:07

            Score-based Generative Models and Their Applications

            Chenlin Meng

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022