Thomy Phan, Fabian Ritz, Lenz Belzner, Philipp Altmann, Thomas Gabor, Claudia Linnhoff-Popien · VAST: Value Function Factorization with Variable Agent Sub-Teams · SlidesLive

VAST: Value Function Factorization with Variable Agent Sub-Teams

Dec 6, 2021

Speakers

Thomy Phan

Fabian Ritz

Lenz Belzner

About

Value function factorization (VFF) is a popular approach to cooperative multi-agent reinforcement learning in order to learn local value functions from global rewards. However, state-of-the-art VFF is limited to a handful of agents in most domains. We hypothesize that this is due to the flat factorization scheme, where the VFF operator becomes a performance bottleneck with an increasing number of agents. Therefore, we propose VFF with variable agent sub-teams (VAST). VAST approximates a factoriz…
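
To make the contrast between a flat scheme and a sub-team scheme concrete, here is a minimal sketch. It assumes an additive (sum-based) factorization operator at both levels, which the abstract does not specify; the function names `flat_sum`, `sub_team_sum`, and `two_level_value`, as well as the fixed `assignment` list, are hypothetical and chosen only for illustration. With plain sums the two-level result equals the flat one, so the sketch shows only the grouping structure, not the method presented in the talk.

```python
from typing import Dict, List


def flat_sum(local_values: List[float]) -> float:
    """Flat additive factorization: the joint value is the sum of all local values."""
    return sum(local_values)


def sub_team_sum(local_values: List[float], assignment: List[int]) -> Dict[int, float]:
    """Sum local values within each sub-team.

    assignment[i] is the (hypothetical) sub-team index of agent i.
    """
    if len(local_values) != len(assignment):
        raise ValueError("local_values and assignment must have the same length")
    totals: Dict[int, float] = {}
    for value, team in zip(local_values, assignment):
        totals[team] = totals.get(team, 0.0) + value
    return totals


def two_level_value(local_values: List[float], assignment: List[int]) -> float:
    """Two-level factorization: agents -> sub-team values -> joint value."""
    team_values = sub_team_sum(local_values, assignment)
    # The top-level operator now sees one input per sub-team, not one per agent.
    return flat_sum(list(team_values.values()))


if __name__ == "__main__":
    q_local = [1.0, 2.0, 3.0, 4.0]   # one local value per agent (made-up numbers)
    teams = [0, 0, 1, 1]             # agents 0,1 in sub-team 0; agents 2,3 in sub-team 1
    print(sub_team_sum(q_local, teams))
    print(two_level_value(q_local, teams))
```

In this arrangement the top-level operator takes as many inputs as there are sub-teams, which can be far fewer than the number of agents.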

Organizer

NeurIPS 2021

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
