            Policy Aware Model Learning via Transition Occupancy Matching

            Dec 2, 2022

            Speakers

            Jason Yecheng Ma

            Kausik Sivakumar

            Jason Yan

            About

            Model-based reinforcement learning (MBRL) is an effective paradigm for sample-efficient policy learning. The predominant MBRL strategy iteratively learns the dynamics model by maximum likelihood estimation (MLE) on the entire replay buffer and trains the policy using fictitious transitions from the learned model. Given that not all transitions in the replay buffer are equally informative about the task or the policy's current progress, this MLE strategy cannot be optimal and bears no clear rela…
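
The baseline strategy described in the abstract (fit the dynamics model by MLE on the whole replay buffer, then generate fictitious transitions from it) can be illustrated with a minimal sketch. This is not the method proposed in the talk. It assumes a toy one-dimensional linear system with Gaussian noise, for which MLE reduces to ordinary least squares, and all function names and parameter values are hypothetical.

```python
import random


def collect_replay_buffer(n, a_true=0.9, b_true=0.5, noise_std=0.01, seed=0):
    """Roll out a toy system s' = a*s + b*u + noise under random actions.

    The system and its parameters are assumptions made for illustration.
    """
    rng = random.Random(seed)
    buffer, s = [], 0.0
    for _ in range(n):
        u = rng.uniform(-1.0, 1.0)
        s_next = a_true * s + b_true * u + rng.gauss(0.0, noise_std)
        buffer.append((s, u, s_next))
        s = s_next
    return buffer


def fit_dynamics_mle(buffer):
    """Gaussian MLE of (a, b) using every transition with equal weight.

    For a linear model with Gaussian noise this is ordinary least squares,
    solved here through the 2x2 normal equations.
    """
    sss = sum(s * s for s, u, y in buffer)
    suu = sum(u * u for s, u, y in buffer)
    ssu = sum(s * u for s, u, y in buffer)
    ssy = sum(s * y for s, u, y in buffer)
    suy = sum(u * y for s, u, y in buffer)
    det = sss * suu - ssu * ssu
    a = (ssy * suu - suy * ssu) / det
    b = (suy * sss - ssy * ssu) / det
    return a, b


def imagine_rollout(model, policy, s0, horizon):
    """Generate fictitious transitions from the learned model, not the real system."""
    a, b = model
    transitions, s = [], s0
    for _ in range(horizon):
        u = policy(s)
        s_next = a * s + b * u
        transitions.append((s, u, s_next))
        s = s_next
    return transitions


buffer = collect_replay_buffer(500)
model = fit_dynamics_mle(buffer)
rollout = imagine_rollout(model, policy=lambda s: -0.5 * s, s0=1.0, horizon=5)
```

Every transition in `buffer` contributes equally to the fit here, regardless of whether the current policy would ever visit it, which is the property of the MLE strategy that the abstract questions.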

            Organizer

            NeurIPS 2022
