TY - GEN
T1 - Multi-Agent Pattern Formation
T2 - 2020 International Joint Conference on Neural Networks, IJCNN 2020
AU - Diallo, Elhadji Amadou Oury
AU - Sugawara, Toshiharu
N1 - Funding Information:
This research is partly supported by JSPS KAKENHI Grants No. 17KT0044 and 20H04245.
Publisher Copyright:
© 2020 IEEE.
PY - 2020/7
Y1 - 2020/7
N2 - In this paper, we investigate how a large-scale system of independently learning agents can collectively form acceptable two-dimensional patterns (pattern formation) from any initial configuration. We propose a decentralized multi-agent deep reinforcement learning architecture MAPF-DQN (Multi-Agent Pattern Formation DQN) in which a set of independent and distributed agents capture their local visual field and learn how to act so as to collectively form target shapes. Agents exploit their individual networks with a central replay memory and target networks that are used to store and update the representation of the environment as well as learning the dynamics of the other agents. We then show that agents trained on random patterns using MAPF-DQN can organize themselves into very complex shapes in large-scale environments. Our results suggest that the proposed framework achieves zero-shot generalization on most of the environments independently of the depth of view of agents.
AB - In this paper, we investigate how a large-scale system of independently learning agents can collectively form acceptable two-dimensional patterns (pattern formation) from any initial configuration. We propose a decentralized multi-agent deep reinforcement learning architecture MAPF-DQN (Multi-Agent Pattern Formation DQN) in which a set of independent and distributed agents capture their local visual field and learn how to act so as to collectively form target shapes. Agents exploit their individual networks with a central replay memory and target networks that are used to store and update the representation of the environment as well as learning the dynamics of the other agents. We then show that agents trained on random patterns using MAPF-DQN can organize themselves into very complex shapes in large-scale environments. Our results suggest that the proposed framework achieves zero-shot generalization on most of the environments independently of the depth of view of agents.
KW - deep reinforcement learning
KW - multi-agent systems
KW - pattern formation
KW - self-organization
KW - swarms
UR - http://www.scopus.com/inward/record.url?scp=85093861443&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093861443&partnerID=8YFLogxK
U2 - 10.1109/IJCNN48605.2020.9207657
DO - 10.1109/IJCNN48605.2020.9207657
M3 - Conference contribution
AN - SCOPUS:85093861443
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 19 July 2020 through 24 July 2020
ER -