Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian · Robust Situational Reinforcement Learning in Face of Context Disturbances · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Robust Situational Reinforcement Learning in Face of Context Disturbances

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-001-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-001-alpha.b-cdn.net
sl-yoda-v2-stream-001-beta.b-cdn.net
1824830694.rsc.cdn77.org
1979322955.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Robust Situational Reinforcement Learning in Face of Context Disturbances

Robust Situational Reinforcement Learning in Face of Context Disturbances

Jul 24, 2023

Speakers

Jinpeng Zhang

Speaker · 0 followers

Yufeng Zheng

Speaker · 0 followers

Chuheng Zhang

Speaker · 0 followers

About

In many real-world tasks, some parts of state features, called contexts, are independent of action signals, e.g., customer demand in inventory control, speed of lead car in autonomous driving, etc. One of the challenges of reinforcement learning in these applications is that the true context transitions can be easily exposed some unknown source of contamination, leading to a shift of context transitions between source domains and target domains, which could cause performance degradation for RL a…

Organizer

ICML 2023

Account · 657 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Leveraging Large Scale Models for Identifying and Fixing Deep Neural Networks Biases

21:03

Leveraging Large Scale Models for Identifying and Fixing Deep Neural Networks Biases

Watch later

Favorite

Polina Kirichenko, …

ICML 2023 2 years ago

GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection

10:46

GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection

Watch later

Favorite

Debesh Jha, …

ICML 2023 2 years ago

Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points

08:19

Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points

Watch later

Favorite

ICML 2023 2 years ago

Discover and Cure: Concept-aware Mitigation of Spurious Correlation

05:24

Discover and Cure: Concept-aware Mitigation of Spurious Correlation

Watch later

Favorite

Shirley Wu, …

ICML 2023 2 years ago

Learning Rate Schedules in the Presence of Distribution Shift

05:30

Learning Rate Schedules in the Presence of Distribution Shift

Watch later

Favorite

Matthew Fahrbach, …

ICML 2023 2 years ago

Policy Gradient in Robust MDPs with Global Convergence Guarantee

05:02

Policy Gradient in Robust MDPs with Global Convergence Guarantee

Watch later

Favorite

Qiuhao Wang, …

ICML 2023 2 years ago