Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, Dj Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih · In-context Reinforcement Learning with Algorithm Distillation · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: In-context Reinforcement Learning with Algorithm Distillation

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

In-context Reinforcement Learning with Algorithm Distillation

In-context Reinforcement Learning with Algorithm Distillation

Dez 2, 2022

Sprecher:innen

Michael Laskin

Řečník · 0 sledujících

Luyu Wang

Řečník · 0 sledujících

Junhyuk Oh

Řečník · 0 sledujících

Über

We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transformer is trained by autoregressively predicting actions given their preceding learning histories as…

Organisator

NeurIPS 2022

Účet · 962 sledujících

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Lifelong Learning Machines Tutorial

20:07

Lifelong Learning Machines Tutorial

Zhlédnout později

Oblíbené

Tyler Hayes, …

NeurIPS 2022 2 years ago

Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

04:28

Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

Zhlédnout později

Oblíbené

Subho Majumdar, …

NeurIPS 2022 2 years ago

Chromatic Correlation Clustering, Revisited

04:39

Chromatic Correlation Clustering, Revisited

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

Generative Visual Prompt: Unified Distributional Control of Pre-Trained Generative Vision Models

04:07

Generative Visual Prompt: Unified Distributional Control of Pre-Trained Generative Vision Models

Zhlédnout později

Oblíbené

Chen Henry Wu, …

NeurIPS 2022 2 years ago

Potential Energy based Mixture Model for Noisy Label Learning

05:15

Potential Energy based Mixture Model for Noisy Label Learning

Zhlédnout později

Oblíbené

Wenbin Yang, …

NeurIPS 2022 2 years ago

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

04:49

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Zhlédnout později

Oblíbené

Benjamin Fuhrer, …

NeurIPS 2022 2 years ago