Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang · Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-009-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-009-alpha.b-cdn.net
sl-yoda-v2-stream-009-beta.b-cdn.net
1766500541.rsc.cdn77.org
1441886916.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

Jul 24, 2023

Speakers

Chenlu Ye

Speaker · 0 followers

Wei Xiong

Speaker · 0 followers

Quanquan Gu

Speaker · 5 followers

About

Despite the significant interest and progress in reinforcement learning (RL) problems with adversarial corruption, current works are either confined to the linear setting or lead to an undesired 𝒪̃(√(T)ζ) regret bound, where T is the number of rounds and ζ is the total amount of corruption. In this paper, we consider contextual bandits with general function approximation and propose a computationally efficient algorithm to achieve a regret of 𝒪̃(√(T)+ζ). The proposed algorithm relies on the re…

Organizer

ICML 2023

Account · 636 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Mixture Proportion Estimation Beyond Irreducibility

05:12

Mixture Proportion Estimation Beyond Irreducibility

Watch later

Favorite

ICML 2023 2 years ago

Towards Trustworthy Explanation: On Causal Rationalization

05:19

Towards Trustworthy Explanation: On Causal Rationalization

Watch later

Favorite

Wenbo Zhang, …

ICML 2023 2 years ago

Transformers Meet Directed Graphs

04:51

Transformers Meet Directed Graphs

Watch later

Favorite

Simon Geilser, …

ICML 2023 2 years ago

Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems

05:06

Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems

Watch later

Favorite

Atsushi Nitanda, …

ICML 2023 2 years ago

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills

05:16

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills

Watch later

Favorite

Seongun Kim, …

ICML 2023 2 years ago

Online Mechanism Design for Information Acquisition

04:55

Online Mechanism Design for Information Acquisition

Watch later

Favorite

Federico Cacciamani, …

ICML 2023 2 years ago