Scaling Laws for a Multi-Agent Reinforcement Learning Model

            Dec 2, 2022

            Speakers

            Oren Neumann

            Speaker · 0 followers

            Claudius Gros

            Speaker · 0 followers

            About

            The recent observation of neural power-law scaling relations has had a significant impact on the field of deep learning. As a consequence, substantial attention has been devoted to describing scaling laws, although mostly for supervised learning and only to a lesser extent for reinforcement learning frameworks. In this paper we present an extensive study of performance scaling for a cornerstone reinforcement learning algorithm, AlphaZero. On the basis of a relationship bet…

            Organizer

            NeurIPS 2022

            Account · 961 followers
