Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-002-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-002-alpha.b-cdn.net
      • sl-yoda-v2-stream-002-beta.b-cdn.net
      • 1001562353.rsc.cdn77.org
      • 1075090661.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning

            Dec 2, 2022

            Speakers

            EH

            Eddy Hudson

            Speaker · 0 followers

            GW

            Garrett Warnell

            Speaker · 0 followers

            ID

            Ishan Durugkar

            Speaker · 0 followers

            About

            Given a dataset of interactions with an environment of interest, a viable method to extract an agent policy is to estimate the maximum likelihood policy indicated by this data. This approach is commonly referred to as behavioral cloning (BC). In this work, we describe a key disadvantage of BC that arises due to the maximum likelihood objective function; namely that BC is mean-seeking with respect to the state-conditional expert action distribution when the learner's policy is represented with a…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            RecursiveMix: Mixed Learning with History
            01:05

            RecursiveMix: Mixed Learning with History

            Lingfeng Yang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models
            09:35

            Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models

            Kartik Sharma, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            FedML + Panel
            58:56

            FedML + Panel

            Chaoyang He, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Boosting as Frank-Wolfe
            04:03

            Boosting as Frank-Wolfe

            Ryotaro Mitsuboshi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Mitigating Health Data Poverty: Generative Approaches versus Resampling for Time-series Clinical Data
            01:54

            Mitigating Health Data Poverty: Generative Approaches versus Resampling for Time-series Clinical Data

            Raffaele Marchesi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity
            04:49

            On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity

            Vincent Szolnoky, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022