Learning Efficient Coordination Strategy for Multi-step Tasks in Multi-agent Systems using Deep Reinforcement Learning

Zean Zhu, Elhadji Amadou Oury Diallo, Toshiharu Sugawara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We investigated whether a group of agents could learn the strategic policy with different sizes of input by deep Q-learning in a simulated takeout platform environment. Agents are often required to cooperate and/or coordinate with each other to achieve their goals, but making appropriate sequential decisions for coordinated behaviors based on dynamic and complex states is one of the challenging issues for the study of multi-agent systems. Although it is already investigated that intelligent agents could learn the coordinated strategies using deep Q-learning to efficiently execute simple one-step tasks, they are also expected to generate a certain coordination regime for more complex tasks, such as multi-step coordinated ones, in dynamic environments. To solve this problem, we introduced the deep reinforcement learning framework with two kinds of distributions of the neural networks, centralized and decentralized deep Q-networks (DQNs). We examined and compared the performances using these two DQN network distributions with various sizes of the agents’ views. The experimental results showed that these networks could learn coordinated policies to manage agents by using local view inputs, and thus, could improve their entire performance. However, we also showed that their behaviors of multiple agents seemed quite different depending on the network distributions.

Original languageEnglish
Title of host publicationICAART 2020 - Proceedings of the 12th International Conference on Agents and Artificial Intelligence
EditorsAna Rocha, Luc Steels, Jaap van den Herik
PublisherSciTePress
Pages287-294
Number of pages8
ISBN (Electronic)9789897583957
Publication statusPublished - 2020
Event12th International Conference on Agents and Artificial Intelligence, ICAART 2020 - Valletta, Malta
Duration: 2020 Feb 222020 Feb 24

Publication series

NameICAART 2020 - Proceedings of the 12th International Conference on Agents and Artificial Intelligence
Volume1

Conference

Conference12th International Conference on Agents and Artificial Intelligence, ICAART 2020
CountryMalta
CityValletta
Period20/2/2220/2/24

Keywords

  • Cooperation
  • Coordination
  • Deep Reinforcement Learning
  • Multi-agent System

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software

Fingerprint Dive into the research topics of 'Learning Efficient Coordination Strategy for Multi-step Tasks in Multi-agent Systems using Deep Reinforcement Learning'. Together they form a unique fingerprint.

  • Cite this

    Zhu, Z., Oury Diallo, E. A., & Sugawara, T. (2020). Learning Efficient Coordination Strategy for Multi-step Tasks in Multi-agent Systems using Deep Reinforcement Learning. In A. Rocha, L. Steels, & J. van den Herik (Eds.), ICAART 2020 - Proceedings of the 12th International Conference on Agents and Artificial Intelligence (pp. 287-294). (ICAART 2020 - Proceedings of the 12th International Conference on Agents and Artificial Intelligence; Vol. 1). SciTePress.