Oren Neumann, Claudius Gros · Scaling Laws for a Multi-Agent Reinforcement Learning Model · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Scaling Laws for a Multi-Agent Reinforcement Learning Model

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Scaling Laws for a Multi-Agent Reinforcement Learning Model

Scaling Laws for a Multi-Agent Reinforcement Learning Model

Dez 2, 2022

Sprecher:innen

Oren Neumann

Sprecher:in · 0 Follower:innen

Claudius Gros

Sprecher:in · 0 Follower:innen

Über

The recent observation of neural power-law scaling relations has made a significant impact in the field of deep learning. A substantial amount of attention has been dedicated as a consequence to the description of scaling laws, although mostly for supervised learning and only to a reduced extent for reinforcement learning frameworks. In this paper we present an extensive study of performance scaling for a cornerstone reinforcement learning algorithm, AlphaZero. On the basis of a relationship bet…

Organisator

NeurIPS 2022

Konto · 961 Follower:innen

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Outracing Champion Gran Turismo Drivers with Deep Reinforcement Learning

32:51

Outracing Champion Gran Turismo Drivers with Deep Reinforcement Learning

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Peer Prediction for Learning Agents

00:57

Peer Prediction for Learning Agents

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Adversarial Task Up-sampling for Meta-learning

04:54

Adversarial Task Up-sampling for Meta-learning

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Unsupervised Cross-Task Generalization via Retrieval Augmentation

05:12

Unsupervised Cross-Task Generalization via Retrieval Augmentation

Später ansehen

Favorit

Bill Yuchen Lin, …

NeurIPS 2022 2 years ago

INRAS: Implicit Neural Representation for Audio Scenes

05:00

INRAS: Implicit Neural Representation for Audio Scenes

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Diffusion Visual Counterfactual Explanations

05:44

Diffusion Visual Counterfactual Explanations

Später ansehen

Favorit

Maximilian Augustin, …

NeurIPS 2022 2 years ago