Daesol Cho, Dongseok Shim, H. Jin Kim · S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

Nov 28, 2022

Speakers

Daesol Cho

Speaker · 0 followers

Dongseok Shim

Speaker · 0 followers

H. Jin Kim

Speaker · 0 followers

About

Offline reinforcement learning (Offline RL) suffers from the innate distributional shift as it cannot interact with the physical environment during training. To alleviate such limitation, state-based offline RL leverages a learned dynamics model from the logged experience and augments the predicted state transition to extend the data distribution. For exploiting such benefit also on the image-based RL, we firstly propose a generative model, S2P (State2Pixel), which synthesizes the raw pixel of t…

Organizer

NeurIPS 2022

Account · 958 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective

04:53

A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective

Watch later

Favorite

Chanwoo Park, …

NeurIPS 2022 2 years ago

Pitfalls of Epistemic Uncertainty Quantification through Loss Minimisation

05:23

Pitfalls of Epistemic Uncertainty Quantification through Loss Minimisation

Watch later

Favorite

Viktor Bengs, …

NeurIPS 2022 2 years ago

Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

04:58

Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

Watch later

Favorite

Simone Bombari, …

NeurIPS 2022 2 years ago

PAC Prediction Sets for Meta-Learning

05:04

PAC Prediction Sets for Meta-Learning

Watch later

Favorite

Sangdon Park, …

NeurIPS 2022 2 years ago

Frank-Wolfe-based Algorithms for Approximating Tyler's M-estimator

04:54

Frank-Wolfe-based Algorithms for Approximating Tyler's M-estimator

Watch later

Favorite

Lior Danon, …

NeurIPS 2022 2 years ago

Hard ImageNet: Segmentations for Objects with Strong Spurious Cues

05:06

Hard ImageNet: Segmentations for Objects with Strong Spurious Cues

Watch later

Favorite

Mazda Moayeri, …

NeurIPS 2022 2 years ago