Jul 24, 2023
We consider a contextual combinatorial bandit problem where in each round a learning agent selects a subset of arms and receives feedback on the selected arms according to their scores. The score of an arm is an unknown function of the arm's feature. Approximating this unknown score function with deep neural networks, we propose two algorithms: Combinatorial Neural UCB (CN-UCB) and Combinatorial Neural Thompson Sampling (CN-TS). We prove that CN-UCB achieves 𝒪̃(d̃√(T)) or 𝒪̃(√(d̃ T K)) regret, where d̃ is the effective dimension of a neural tangent kernel matrix, K is the size of a subset of arms, and T is the time horizon. For CN-TS, we adapt an optimistic sampling technique to ensure the optimism of the sampled combinatorial action, and establish a worst-case (frequentist) regret of 𝒪̃(d̃√(TK)). To the best of our knowledge, these are the first combinatorial neural bandit algorithms with regret performance guarantees. In particular, CN-TS is the first Thompson sampling algorithm with worst-case regret guarantees for the general contextual combinatorial bandit problem. Numerical experiments demonstrate the superior performance of our proposed algorithms.
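To make the selection rule concrete, below is a minimal Python/PyTorch sketch of a NeuralUCB-style top-K selection step in the spirit of CN-UCB: a neural network approximates the unknown score function, a confidence bonus is computed from the network's gradient features, and the agent picks the K arms with the largest optimistic scores. The network architecture, the constant beta, and the identity initialization of the gradient covariance Z_inv are illustrative assumptions, not details from the paper.

```python
# Sketch of a NeuralUCB-style top-K selection step (illustrative, not the paper's exact algorithm).
import torch
import torch.nn as nn

torch.manual_seed(0)

d = 8          # arm feature dimension (assumed)
n_arms = 20    # number of candidate arms in this round (assumed)
K = 4          # size of the selected subset, i.e., the super arm (assumed)
beta = 1.0     # exploration scale; in the analysis this would come from the confidence radius

# Small fully connected network approximating the unknown score function.
score_net = nn.Sequential(nn.Linear(d, 32), nn.ReLU(), nn.Linear(32, 1))
p = sum(param.numel() for param in score_net.parameters())

# Inverse of the regularized gradient covariance matrix Z = lambda*I + sum g g^T.
# Initialized to the identity here purely for illustration; in practice it is
# updated with the gradient features of the arms played in previous rounds.
Z_inv = torch.eye(p)

def grad_features(x):
    """Flattened gradient of the network output w.r.t. its parameters."""
    score_net.zero_grad()
    out = score_net(x)          # scalar output for a single arm feature vector
    out.backward()
    return torch.cat([param.grad.reshape(-1) for param in score_net.parameters()])

# Contexts for this round: one feature vector per candidate arm.
contexts = torch.randn(n_arms, d)

ucb_scores = torch.empty(n_arms)
for a in range(n_arms):
    x = contexts[a]
    g = grad_features(x)
    with torch.no_grad():
        mean = score_net(x).item()                      # estimated score
    bonus = beta * torch.sqrt(g @ Z_inv @ g).item()     # confidence width from gradient features
    ucb_scores[a] = mean + bonus

# Combinatorial action: the top-K arms by optimistic score.
super_arm = torch.topk(ucb_scores, K).indices.tolist()
print("Selected super arm:", super_arm)
```

After observing the feedback on the selected arms, one would update Z_inv with the gradient features of those arms and retrain the network on the accumulated data; the CN-TS variant would instead sample perturbed scores (optimistically, taking the maximum of several samples) rather than adding a deterministic bonus.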