Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Multi-agent actor centralized-critic with communication
Publication

Publications

Multi-agent actor centralized-critic with communication

Title
Multi-agent actor centralized-critic with communication
Type
Article in International Scientific Journal
Year
2020
Authors
Simoes, D
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
lau, n
(Author)
Other
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Journal
Title: NeurocomputingImported from Authenticus Search for Journal Publications
Vol. 390
Pages: 40-56
ISSN: 0925-2312
Publisher: Elsevier
Other information
Authenticus ID: P-00R-PNN
Abstract (EN): Multiple real-world problems are naturally modeled as cooperative multi-agent systems, ranging from satellite formation to traffic monitoring. These systems require algorithms that can learn successful policies with independent agents that rely solely on local partial-observations of the environment. However, multi-agent environments are more complex, due to their partial-observability and non-stationarity from an agent's perspective, as well as the structural credit assignment problem and the curse of dimensionality, and achieving coordination in such systems remains a complex challenge. To this end, we propose a multi-agent actor-critic algorithm called Asynchronous Advantage Actor Centralized-Critic with Communication (A3C3). A3C3 uses a centralized critic to estimate a value function, decentralized actors to approximate each agent's policy function, and decentralized communication networks for each agent to share relevant information with its team. The critic can incorporate additional information, like the environment's global state, when available, and optimizes the actor networks. The actor networks of an agent's teammates optimize that agent's communication network, such that each agent learns to output information that is relevant to the policies of others. A3C3 supports a dynamic amount of agents, noisy communication mediums, and can be horizontally scaled to shorten its learning phase. We evaluate A3C3 in two partially-observable multi-agent suites where agents benefit from communicating local information to each other. A3C3 outperforms state-of-the-art multi-agent algorithms, independent approaches, and centralized controllers with access to all agents' observations.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 17
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

MULTI AGENT DEEP LEARNING WITH COOPERATIVE COMMUNICATION (2020)
Article in International Scientific Journal
Simoes, D; lau, n; reis, lp
Exploring communication protocols and centralized critics in multi-agent deep learning (2020)
Article in International Scientific Journal
Simoes, D; lau, n; reis, lp
Multi-agent Double Deep Q-Networks (2017)
Article in International Conference Proceedings Book
Simoes, D; lau, n; reis, lp
Learning Low-Level Behaviors and High-Level Strategies in Humanoid Soccer (2020)
Article in International Conference Proceedings Book
Simoes, D; Amaro, P; Maria Teresa Andrade; lau, n; reis, lp
Guided Deep Reinforcement Learning in the GeoFriends2 Environment (2018)
Article in International Conference Proceedings Book
Simoes, D; lau, n; reis, lp

See all (6)

Of the same journal

The vitality of pattern recognition and image analysis (2015)
Another Publication in an International Scientific Journal
Luisa Mico; Joao M Sanches; Jaime S Cardoso
ydata-profiling: Accelerating data-centric AI with high-quality data (2023)
Article in International Scientific Journal
Clemente, F; Ribeiro, GM; Quemy, A; Santos, MS; Pereira, RC; Barros, A
The vitality of pattern recognition and image analysis (2015)
Article in International Scientific Journal
Micó, L; Sanches, JM; Jaime S Cardoso
Pre-processing approaches for imbalanced distributions in regression (2019)
Article in International Scientific Journal
Branco, P; Torgo, L; Rita Ribeiro
Predicting satisfaction: perceived decision quality by decision-makers in Web-based group decision support systems (2019)
Article in International Scientific Journal
João Carneiro; Pedro Saraiva; Luís Conceição; Ricardo Santos; Goreti Marreiros; Paulo Novais

See all (22)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-10 at 18:55:34 | Privacy Policy | Personal Data Protection Policy | Whistleblowing