Tom Dupuis, Jaonary Rabarisoa, Quoc Cuong Pham, David Filliat · Domain Invariant Q-Learning for Model-Free Robust Continuous Control under Visual Distractions · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Domain Invariant Q-Learning for Model-Free Robust Continuous Control under Visual Distractions

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-003-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-003-alpha.b-cdn.net
sl-yoda-v2-stream-003-beta.b-cdn.net
1544410162.rsc.cdn77.org
1005514182.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Domain Invariant Q-Learning for Model-Free Robust Continuous Control under Visual Distractions

Domain Invariant Q-Learning for Model-Free Robust Continuous Control under Visual Distractions

Dez 2, 2022

Sprecher:innen

Tom Dupuis

Sprecher:in · 0 Follower:innen

Jaonary Rabarisoa

Sprecher:in · 0 Follower:innen

Quoc Cuong Pham

Sprecher:in · 0 Follower:innen

Über

End-to-end reinforcement learning on images showed significant performance progress in the recent years, especially with regularization to value estimation brought by data augmentation <cit.>. At the same time, domain randomization and representation learning helped push the limits of these algorithms in visually diverse environments, full of distractors and spurious noise, making RL more robust to unrelated visual features. We present DIQL, a method that combines risk invariant regulariz…

Organisator

NeurIPS 2022

Konto · 961 Follower:innen

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

03:09

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Information-Theoretic Methods in the Study of the Lexicon

29:42

Information-Theoretic Methods in the Study of the Lexicon

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Multi-block-Single-probe Variance Reduced Estimator for Coupled Compositional Optimization

01:00

Multi-block-Single-probe Variance Reduced Estimator for Coupled Compositional Optimization

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop

01:03

Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop

Später ansehen

Favorit

Weixia Zhang, …

NeurIPS 2022 2 years ago

Contributed talk session 1

26:45

Contributed talk session 1

Später ansehen

Favorit

Gabriele Corso, …

NeurIPS 2022 2 years ago

Image-Based Soil Organic Carbon Estimation from Multispectral Satellite Images with Fourier Neural Operator and Structural Similarity

04:55

Image-Based Soil Organic Carbon Estimation from Multispectral Satellite Images with Fourier Neural Operator and Structural Similarity

Später ansehen

Favorit

Ken C. L. Wong, …

NeurIPS 2022 2 years ago