Hao Zhang, Tianpei Yang, Yan Zheng, Jianye Hao, Matthew E. Taylor · PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-005-alpha.b-cdn.net
sl-yoda-v3-stream-005-beta.b-cdn.net
1026534588.rsc.cdn77.org
1776530814.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning

PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning

Dec 15, 2023

Speakers

Hao Zhang

Speaker · 3 followers

Tianpei Yang

Speaker · 0 followers

Yan Zheng

Speaker · 0 followers

About

Learning new skills through previous experience is common in human life, which is the core idea of Transfer Reinforcement Learning (TRL). This requires the agent to learn \emph{when} and \emph{which} source policy is the best to reuse as the target task's policy, and \emph{how} to reuse the source policy. Most TRL methods learn, transfer, and reuse black-box policies, which is hard to explain 1) when to reuse, 2) which source policy is effective, and 3) reduces transfer efficiency. In this paper…

Organizer

NeurIPS 2023

Account · 615 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

On Mitigating Unconscious Bias through Bandits with Evolving Biased Feedback

03:05

On Mitigating Unconscious Bias through Bandits with Evolving Biased Feedback

Watch later

Favorite

Matthew Faw, …

NeurIPS 2023 15 months ago

PaSS: Parallel Speculative Sampling

06:05

PaSS: Parallel Speculative Sampling

Watch later

Favorite

Giovanni Monea, …

NeurIPS 2023 15 months ago

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

04:49

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

Watch later

Favorite

NeurIPS 2023 15 months ago

Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews

05:06

Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews

Watch later

Favorite

Wojciech Kusa, …

NeurIPS 2023 15 months ago

Adversarial Robustness in Graph Neural Networks: A Hamiltonian Approach

05:06

Adversarial Robustness in Graph Neural Networks: A Hamiltonian Approach

Watch later

Favorite

NeurIPS 2023 15 months ago

TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery

30:41

TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery

Watch later

Favorite

Jialin Chen, …

NeurIPS 2023 15 months ago