
            Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition

            December 2, 2022

            Speakers

            Pascal Leroy

            Jonathan Pisane

            Damien Ernst

            About the presentation

            In this paper, we identify the best training scenario to train a team of agents to compete against multiple possible strategies of opposing teams. We restrict ourselves to the case of a symmetric two-team Markov game, which is a competition between two symmetric teams. We evaluate cooperative value-based methods in a mixed cooperative-competitive environment. We selected three training methods based on the centralised training and decentralised execution (CTDE) paradigm: QMIX, MAVEN and QVMix. To tra…

            Organizer

            NeurIPS 2022