Zihan Zhang, Jia-Qi Yang, Xiangyang Ji, Simon Shaolei Du · Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-012-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-012-alpha.b-cdn.net
sl-yoda-v3-stream-012-beta.b-cdn.net
1338956956.rsc.cdn77.org
1656830687.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

Dec 6, 2021

Speakers

Zihan Zhang

Speaker · 0 followers

Jia-Qi Yang

Speaker · 0 followers

Xiangyang Ji

Speaker · 0 followers

About

This paper presents new variance-aware confidence sets for linear bandits and linear mixture Markov Decision Processes (MDPs).With the new confidence sets, we obtain the follow regret bounds:For linear bandits, we obtain an O(poly(d)√(1 + ∑_k=1^Kσ_k^2)) data-dependent regret bound, where d is the feature dimension, K is the number of rounds, and σ_k^2 is the unknown variance of the reward at the k-th round. This is the first regret bound that only scales with the variance and the dimension but …

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Role of recurrent computations in primate visual object recognition

17:31

Role of recurrent computations in primate visual object recognition

Watch later

Favorite

NeurIPS 2021 3 years ago

Self-Supervised Learning Disentangled Group Representation as Feature

16:04

Self-Supervised Learning Disentangled Group Representation as Feature

Watch later

Favorite

NeurIPS 2021 3 years ago

Contextual Similarity Aggregation with Self-attention for Visual Re-ranking

06:34

Contextual Similarity Aggregation with Self-attention for Visual Re-ranking

Watch later

Favorite

Jianbo Ouyang, …

NeurIPS 2021 3 years ago

A First Look Towards One-Shot Object Detection with SPOT for Data-Efficient Learning

02:08

A First Look Towards One-Shot Object Detection with SPOT for Data-Efficient Learning

Watch later

Favorite

Ria Chakraborty, …

NeurIPS 2021 3 years ago

Proxy-Normalizing Activations to Match Batch Normalization while Removing Batch Dependence

11:29

Proxy-Normalizing Activations to Match Batch Normalization while Removing Batch Dependence

Watch later

Favorite

Antoine Labatie, …

NeurIPS 2021 3 years ago

When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking

07:00

When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking

Watch later

Favorite

Peisong Wen, …

NeurIPS 2021 3 years ago