Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-002-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-002-alpha.b-cdn.net
      • sl-yoda-v2-stream-002-beta.b-cdn.net
      • 1001562353.rsc.cdn77.org
      • 1075090661.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer

            Dec 2, 2022

            Speakers

            HW

            Hayato Watahiki

            Speaker · 0 followers

            RI

            Ryo Iwase

            Speaker · 0 followers

            RU

            Ryosuke Unno

            Speaker · 0 followers

            About

            The low transferability of learned policies is one of the most critical problems limiting the applicability of learning-based solutions to decision-making tasks. In this paper, we present a way to align latent representations of states and actions between different domains by optimizing an adversarial objective. We train two models, a policy and a domain discriminator, with unpaired trajectories of proxy tasks through behavioral cloning as well as adversarial training. After the latent represent…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
            05:00

            Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning

            Zhixuan Lin, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            List-decodable Mean Estimation via Difference of Pairs
            04:30

            List-decodable Mean Estimation via Difference of Pairs

            Ilias Diakonikolas, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Tensor Program Optimization with Probabilistic Programs
            05:21

            Tensor Program Optimization with Probabilistic Programs

            Junru Shao, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation
            09:10

            Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation

            Raghul Parthipan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            How to Select Important Participants in Vertical Federated Learning, Efficiently and Securely?
            04:38

            How to Select Important Participants in Vertical Federated Learning, Efficiently and Securely?

            Jiawei Jiang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Gradient Knowledge Distillation for Pre-trained Language Models
            06:47

            Gradient Knowledge Distillation for Pre-trained Language Models

            Lean Wang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022