Zichen Liu, Siyi Li, Wee Sun Lee, Shuicheng Yan, Zhongwen Xu · Efficient Offline Policy Optimization with a Learned Model · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Efficient Offline Policy Optimization with a Learned Model

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Efficient Offline Policy Optimization with a Learned Model

Efficient Offline Policy Optimization with a Learned Model

Dez 2, 2022

Sprecher:innen

Zichen Liu

Sprecher:in · 0 Follower:innen

Siyi Li

Sprecher:in · 0 Follower:innen

Wee Sun Lee

Sprecher:in · 1 Follower:in

Über

MuZero Unplugged presents a promising approach for offline policy learning from logged data. It conducts Monte-Carlo Tree Search (MCTS) with a learned model and leverages Reanalyze algorithm to learn purely from offline data. For good performance, MCTS requires accurate learned models and a large number of simulations, thus costing huge computing time. This paper investigates a few hypotheses where MuZero Unplugged may not work well under the offline RL settings, including 1) learning with limit…

Organisator

NeurIPS 2022

Konto · 961 Follower:innen

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Subsidiary Prototype Alignment for Universal Domain Adaptation

03:39

Subsidiary Prototype Alignment for Universal Domain Adaptation

Später ansehen

Favorit

Jogendra Nath Kundu, …

NeurIPS 2022 2 years ago

ForestBench: Equitable Benchmarks for Monitoring, Reporting, and Verification of Nature-Based Solutions with Machine Learning

04:40

ForestBench: Equitable Benchmarks for Monitoring, Reporting, and Verification of Nature-Based Solutions with Machine Learning

Später ansehen

Favorit

Lucas Czech, …

NeurIPS 2022 2 years ago

Accelerating Perturbed Stochastic Iterates in Asynchronous Lock-Free Optimization

04:33

Accelerating Perturbed Stochastic Iterates in Asynchronous Lock-Free Optimization

Später ansehen

Favorit

Kaiwen Zhou, …

NeurIPS 2022 2 years ago

04:28

Q A

Später ansehen

Favorit

Sayak Paul, …

NeurIPS 2022 2 years ago

Peripheral Vision Transformer

05:03

Peripheral Vision Transformer

Später ansehen

Favorit

Juhong Min, …

NeurIPS 2022 2 years ago

Rethinking Alignment in Video Super-Resolution Transformers

05:28

Rethinking Alignment in Video Super-Resolution Transformers

Später ansehen

Favorit

Shuwei Shi, …

NeurIPS 2022 2 years ago