Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-009-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-009-alpha.b-cdn.net
      • sl-yoda-v2-stream-009-beta.b-cdn.net
      • 1766500541.rsc.cdn77.org
      • 1441886916.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

            Jul 24, 2023

            Speakers

            CY

            Chenlu Ye

            Speaker · 0 followers

            WX

            Wei Xiong

            Speaker · 0 followers

            QG

            Quanquan Gu

            Speaker · 5 followers

            About

            Despite the significant interest and progress in reinforcement learning (RL) problems with adversarial corruption, current works are either confined to the linear setting or lead to an undesired 𝒪̃(√(T)ζ) regret bound, where T is the number of rounds and ζ is the total amount of corruption. In this paper, we consider contextual bandits with general function approximation and propose a computationally efficient algorithm to achieve a regret of 𝒪̃(√(T)+ζ). The proposed algorithm relies on the re…

            Organizer

            I2
            I2

            ICML 2023

            Account · 636 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Mixture Proportion Estimation Beyond Irreducibility
            05:12

            Mixture Proportion Estimation Beyond Irreducibility

            Yilun Zhu, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Towards Trustworthy Explanation: On Causal Rationalization
            05:19

            Towards Trustworthy Explanation: On Causal Rationalization

            Wenbo Zhang, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Transformers Meet Directed Graphs
            04:51

            Transformers Meet Directed Graphs

            Simon Geilser, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems
            05:06

            Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems

            Atsushi Nitanda, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
            05:16

            Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills

            Seongun Kim, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Online Mechanism Design for Information Acquisition
            04:55

            Online Mechanism Design for Information Acquisition

            Federico Cacciamani, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023