Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, Dj Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih · In-context Reinforcement Learning with Algorithm Distillation · SlidesLive

Dec 2, 2022

Speakers

Michael Laskin

Luyu Wang

Junhyuk Oh

About

We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transformer is trained by autoregressively predicting actions given their preceding learning histories as…
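
The data pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the source RL algorithm here is assumed to be a simple epsilon-greedy bandit learner, each bandit pull is treated as a one-step episode so the context spans many episodes, observations are omitted because a bandit has none, and the names `generate_learning_history` and `make_prediction_examples` are hypothetical. The sketch only builds the (preceding learning history, next action) pairs that a causal sequence model would be trained on; it does not train a transformer.

```python
import random


def generate_learning_history(arm_means, num_steps, rng, epsilon=0.1):
    """Run an epsilon-greedy bandit learner (an assumed stand-in for the
    source RL algorithm) and record its whole learning history."""
    num_arms = len(arm_means)
    counts = [0] * num_arms
    value_estimates = [0.0] * num_arms
    history = []
    for _ in range(num_steps):
        if rng.random() < epsilon:
            action = rng.randrange(num_arms)  # explore
        else:
            action = max(range(num_arms), key=lambda a: value_estimates[a])  # exploit
        reward = 1.0 if rng.random() < arm_means[action] else 0.0
        counts[action] += 1
        # Incremental mean update of the chosen arm's value estimate.
        value_estimates[action] += (reward - value_estimates[action]) / counts[action]
        history.append((action, reward))
    return history


def make_prediction_examples(history, context_len):
    """Turn one learning history into (context, target_action) pairs.

    The context is the preceding (action, reward) steps, truncated to
    context_len; the target is the action the learner took next."""
    examples = []
    for t in range(1, len(history)):
        context = history[max(0, t - context_len):t]
        target_action = history[t][0]
        examples.append((context, target_action))
    return examples


rng = random.Random(0)
dataset = []
for _ in range(4):  # four hypothetical tasks, each with its own arm means
    arm_means = [rng.random() for _ in range(3)]
    history = generate_learning_history(arm_means, num_steps=50, rng=rng)
    dataset.extend(make_prediction_examples(history, context_len=20))
```

Because each context contains steps from both early and late in the source learner's training, a model fit to these pairs has to predict how the actions change as experience accumulates, rather than imitate a single fixed policy.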

Organizer

NeurIPS 2022
