Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Logistic Q-Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Logistic Q-Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Logistic Q-Learning

            Apr 14, 2021

            Speakers

            JB

            Joan Bas-Serrano

            Speaker · 0 followers

            AK

            Andreas Krause

            Speaker · 6 followers

            SC

            Sebastian Curi

            Speaker · 0 followers

            About

            We propose a new reinforcement learning algorithm derived from a regularized linear-programming formulation of optimal control in MDPs. The method is closely related to the classic Relative Entropy Policy Search (REPS) algorithm of Peters et al. (2010), with the key difference that our method introduces a Q-function that enables efficient exact model-free implementation. The main feature of our algorithm (called Q-REPS) is a convex loss function for policy evaluation that serves as a theoretical…

            Organizer

            A2
            A2

            AISTATS 2021

            Account · 63 followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            About AISTATS 2021

            The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            An Analysis of LIME for Text Data
            02:49

            An Analysis of LIME for Text Data

            Dina Mardaoui, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On the Privacy Properties of GAN-generated Samples
            03:09

            On the Privacy Properties of GAN-generated Samples

            Zinan Lin, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Associative Convolutional Layers
            03:09

            Associative Convolutional Layers

            Hamed Omidvar, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Momentum Improves Optimization on Riemannian Manifolds
            03:15

            Momentum Improves Optimization on Riemannian Manifolds

            Foivos Alimisis, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Fair for All: Best-effort Guarantees for Fairness in Classification
            02:50

            Fair for All: Best-effort Guarantees for Fairness in Classification

            Anilesh Krishnaswamy, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Distributionally Robust Optimization for Deep Kernel Multiple Instance Learning
            02:58

            Distributionally Robust Optimization for Deep Kernel Multiple Instance Learning

            Hitesh Sapkota, …

            A2
            A2
            AISTATS 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow AISTATS 2021