Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-005-alpha.b-cdn.net
      • sl-yoda-v3-stream-005-beta.b-cdn.net
      • 1026534588.rsc.cdn77.org
      • 1776530814.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning

            Dec 15, 2023

            Speakers

            HZ

            Hao Zhang

            Speaker · 3 followers

            TY

            Tianpei Yang

            Speaker · 0 followers

            YZ

            Yan Zheng

            Speaker · 0 followers

            About

            Learning new skills through previous experience is common in human life, which is the core idea of Transfer Reinforcement Learning (TRL). This requires the agent to learn \emph{when} and \emph{which} source policy is the best to reuse as the target task's policy, and \emph{how} to reuse the source policy. Most TRL methods learn, transfer, and reuse black-box policies, which is hard to explain 1) when to reuse, 2) which source policy is effective, and 3) reduces transfer efficiency. In this paper…

            Organizer

            N2
            N2

            NeurIPS 2023

            Account · 615 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            On Mitigating Unconscious Bias through Bandits with Evolving Biased Feedback
            03:05

            On Mitigating Unconscious Bias through Bandits with Evolving Biased Feedback

            Matthew Faw, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            PaSS: Parallel Speculative Sampling
            06:05

            PaSS: Parallel Speculative Sampling

            Giovanni Monea, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
            04:49

            DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

            Weijia Wu, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews
            05:06

            Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews

            Wojciech Kusa, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Adversarial Robustness in Graph Neural Networks: A Hamiltonian Approach
            05:06

            Adversarial Robustness in Graph Neural Networks: A Hamiltonian Approach

            Kai Zhao, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery
            30:41

            TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery

            Jialin Chen, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2023