Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Subtitles
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling

            Jul 2, 2022

            Speakers

            AA

            Alekh Agarwal

            Speaker · 1 follower

            TZ

            Tong Zhang

            Speaker · 0 followers

            About

            Provably sample-efficient Reinforcement Learning (RL) with rich observations and function approximation has witnessed tremendous recent progress, particularly when the underlying function approximators are linear. In this linear regime, computationally and statistically efficient methods exist where the potentially infinite state and action spaces can be captured through a known feature embedding, with the sample complexity scaling with the (intrinsic) dimension of these features. When the actio…

            Organizer

            C
            C

            COLT

            Account · 20 followers

            About COLT

            The conference is held annually since 1988 and has become the leading conference on Learning theory by maintaining a highly selective process for submissions. It is committed in high-quality articles in all theoretical aspects of machine learning and related topics.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Covariance-adapting algorithm for semi-bandits with application to sparse outcomes
            13:29

            Covariance-adapting algorithm for semi-bandits with application to sparse outcomes

            Michal Valko, …

            C
            C
            COLT 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Optimal SQ Lower Bounds for Learning Halfspaces with Massart Noise
            16:45

            Optimal SQ Lower Bounds for Learning Halfspaces with Massart Noise

            Rajai Nasser, …

            C
            C
            COLT 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Information Complexity of VC Learning
            04:37

            Information Complexity of VC Learning

            Lydia Zakynthinou, …

            C
            C
            COLT 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Active Local Learning
            00:55

            Active Local Learning

            Arturs Backurs, …

            C
            C
            COLT 5 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            A bounded noise mechanism for differential privacy
            19:38

            A bounded noise mechanism for differential privacy

            Yuval Dagan, …

            C
            C
            COLT 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Chasing Convex Bodies and Functions with Black-Box Advice
            20:38

            Chasing Convex Bodies and Functions with Black-Box Advice

            Nicolas Christianson, …

            C
            C
            COLT 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow COLT