Hayato Watahiki, Ryo Iwase, Ryosuke Unno, Yoshimasa Tsuruoka · Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer · SlidesLive

Dec 2, 2022

Speakers

Hayato Watahiki

Ryo Iwase

Ryosuke Unno

About

The low transferability of learned policies is one of the most critical problems limiting the applicability of learning-based solutions to decision-making tasks. In this paper, we present a way to align latent representations of states and actions between different domains by optimizing an adversarial objective. We train two models, a policy and a domain discriminator, with unpaired trajectories of proxy tasks through behavioral cloning as well as adversarial training. After the latent represent…
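
The abstract describes training a policy by behavioral cloning while a domain discriminator is trained adversarially on latent representations. The following is a minimal sketch of how those two loss terms might be combined, not the authors' implementation. The function names, the mean-squared-error imitation loss, the binary cross-entropy discriminator loss, and the `adv_weight` coefficient are all assumptions chosen for illustration.

```python
import math


def bc_loss(pred_actions, expert_actions):
    """Behavioral cloning term (assumed MSE) between predicted and demonstrated actions."""
    n = len(pred_actions)
    return sum((p - e) ** 2 for p, e in zip(pred_actions, expert_actions)) / n


def discriminator_loss(probs, domain_labels):
    """Binary cross-entropy of a domain discriminator.

    probs[i] is the predicted probability that latent i came from domain 1;
    domain_labels[i] is the true domain (0 or 1).
    """
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for p, y in zip(probs, domain_labels):
        total -= y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return total / len(probs)


def policy_loss(pred_actions, expert_actions, probs, domain_labels, adv_weight=1.0):
    """Hypothetical policy/encoder objective: imitate the demonstrations while
    increasing the discriminator's loss, so latents from both domains look alike."""
    return bc_loss(pred_actions, expert_actions) - adv_weight * discriminator_loss(
        probs, domain_labels
    )


# A discriminator that outputs 0.5 everywhere cannot tell the domains apart,
# so its loss equals log(2).
print(discriminator_loss([0.5, 0.5], [0, 1]))
print(policy_loss([1.0, 2.0], [1.0, 4.0], [0.5, 0.5], [0, 1]))
```

In this sketch the discriminator would minimize `discriminator_loss` while the policy minimizes `policy_loss`, which is the usual alternating pattern in adversarial training; the paper's actual objective and architecture may differ.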

Organizer

NeurIPS 2022
