TY - GEN
T1 - MARL Based Cooperative Passive Beamforming Design for Multi-IRS Aided Networks
AU - Li, Yiding
AU - Pan, Zhenni
AU - Shimamoto, Shigeru
N1 - Funding Information:
This work was partly supported by the research fund of GMO Internet Foundation.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Intelligent Reflecting Surface (IRS) is a planar surface that allows the signal transmission to be artificially manipulated. Communication aided by multiple IRSs is proven to be more effective in aspects such as transmission rate and signal coverage, but it raises complex coordination in issues such as channel estimation, resource allocation, and active/passive beamforming design problems for the Base Station (BS), IRSs and User Equipments (UEs). In this paper, we discuss the cooperative passive beamforming problem for multiple IRSs. First, we develop a multi-user communication scenario, in which multiple discrete phase shift IRSs are deployed to assist the transmission. Then, we formulate the cooperative passive beam-forming problem to maximize the sum transmission rate within a cellular multi-user network. To address this problem, we discuss a Reinforcement Learning (RL) approach, i.e., to transform the problem into a Multi-agent Markov Decision Process (MA MDP) and raise a Q-learning-based solution. Three different designs of the reward function are tested. Simulation results regarding the performance under different IRS and RL settings are given, which show that the proposed algorithm is capable of reaching a favorable passive beamforming solution in limited iterations and is effective in both convergence and long-term performance.
AB - Intelligent Reflecting Surface (IRS) is a planar surface that allows the signal transmission to be artificially manipulated. Communication aided by multiple IRSs is proven to be more effective in aspects such as transmission rate and signal coverage, but it raises complex coordination in issues such as channel estimation, resource allocation, and active/passive beamforming design problems for the Base Station (BS), IRSs and User Equipments (UEs). In this paper, we discuss the cooperative passive beamforming problem for multiple IRSs. First, we develop a multi-user communication scenario, in which multiple discrete phase shift IRSs are deployed to assist the transmission. Then, we formulate the cooperative passive beam-forming problem to maximize the sum transmission rate within a cellular multi-user network. To address this problem, we discuss a Reinforcement Learning (RL) approach, i.e., to transform the problem into a Multi-agent Markov Decision Process (MA MDP) and raise a Q-learning-based solution. Three different designs of the reward function are tested. Simulation results regarding the performance under different IRS and RL settings are given, which show that the proposed algorithm is capable of reaching a favorable passive beamforming solution in limited iterations and is effective in both convergence and long-term performance.
KW - discrete value passive beamforming
KW - Intelligent reflecting surface
KW - multi-agent systems
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85145667776&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85145667776&partnerID=8YFLogxK
U2 - 10.1109/PIMRC54779.2022.9977473
DO - 10.1109/PIMRC54779.2022.9977473
M3 - Conference contribution
AN - SCOPUS:85145667776
T3 - IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC
SP - 397
EP - 402
BT - 2022 IEEE 33rd Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 33rd IEEE Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2022
Y2 - 12 September 2022 through 15 September 2022
ER -