Tiancheng Jin, Longbo Huang, Haipeng Luo · The Best of Both Worlds: Stochastic and Adversarial Episodic MDPs with Unknown Transition · SlidesLive

Dec 6, 2021

Speakers

Tiancheng Jin

Longbo Huang

Haipeng Luo

About

We consider the best-of-both-worlds problem for learning an episodic Markov Decision Process through T episodes, with the goal of achieving 𝒪(√(T)) regret when the losses are adversarial and simultaneously 𝒪(log T) regret when the losses are (almost) stochastic. Recent work by [Jin and Luo, 2020] achieves this goal when the fixed transition is known, and leaves the case of unknown transition as a major open question. In this work, we resolve this open problem by using the same Follow-the-Regul…

Organizer

NeurIPS 2021

Categories

AI & Data Science

Mathematics

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia, and oral and poster presentations of refereed papers. Workshops following the conference provide a less formal setting.
