Yifu Yuan, Jianye Hao, Fei Ni, Mu Yao, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan · EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

Dez 2, 2022

Sprecher:innen

Yifu Yuan

Řečník · 0 sledujících

Jianye Hao

Řečník · 0 sledujících

Fei Ni

Řečník · 0 sledujících

Über

Unsupervised reinforcement learning (URL) poses a promising paradigm to learn useful behaviors in a task-agnostic environment without the guidance of extrinsic rewards to facilitate the fast adaptation of various downstream tasks. Previous works focused on the pre-training in a model-free manner while lacking the study of transition dynamics modeling that leaves a large space for the improvement of sample efficiency in downstream tasks. To this end, we propose an Efficient Unsupervised Reinforce…

Organisator

NeurIPS 2022

Účet · 961 sledujících

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Fast Learning of Multivariate Hawkes Processes via Frank-Wolfe

02:03

Fast Learning of Multivariate Hawkes Processes via Frank-Wolfe

Zhlédnout později

Oblíbené

Renbo Zhao, …

NeurIPS 2022 2 years ago

Quantum algorithms for sampling log-concave distributions and estimating normalizing constants

04:29

Quantum algorithms for sampling log-concave distributions and estimating normalizing constants

Zhlédnout později

Oblíbené

Andrew Childs, …

NeurIPS 2022 2 years ago

BYOL-Explore: Exploration by Bootstrapped Prediction

04:11

BYOL-Explore: Exploration by Bootstrapped Prediction

Zhlédnout později

Oblíbené

Zhaohan Guo, …

NeurIPS 2022 2 years ago

Reinforcement Learning Explainability via Model Transforms

03:16

Reinforcement Learning Explainability via Model Transforms

Zhlédnout později

Oblíbené

Mira Finkelstein, …

NeurIPS 2022 2 years ago

Learning Energy Networks with Generalized Fenchel-Young Losses

05:20

Learning Energy Networks with Generalized Fenchel-Young Losses

Zhlédnout později

Oblíbené

Mathieu Blondel, …

NeurIPS 2022 2 years ago

Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

05:00

Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

Zhlédnout později

Oblíbené

Byungchan Ko, …

NeurIPS 2022 2 years ago