Machine-Learning-Based Optimization for Multiple-IRS-Aided Communication System
Abstract
:1. Introduction
- To maintain a green wireless communication system, we formulate a minimization problem for the transmit power of the BS by jointly optimizing the beamforming of the BS and reflection phases of multiple IRSs under the QoS constraints of multiple UEs, while considering two scenarios: a static UE scenario and a dynamic UE scenario.
- Under the case of the static UE scenario, we propose a new GRNN-based algorithm, while, under the case of the dynamic UE scenario, as the new GRNN-based algorithm is a supervised-learning-based approach that needs to collect a large data set to reach an optimal regression surface to predict solutions based on static locations for UEs. We then propose a DDPG-based algorithm that learns a policy for the joint beamforming prediction based on the dynamic surrounding environmental states, which is robust in the wireless environment.
- Our numerical results demonstrate a power-saving system by adopting the proposed algorithms compared to the benchmark schemes. The time complexity of the algorithms is also presented, which shows the superiority of the proposed ones. Additionally, we show the effectiveness of increasing the number of reflecting elements in reducing the transmit power for the multiple-IRS-aided system.
2. System Model
2.1. Network Model
2.2. Formulation and Analysis of the Optimization Problem
3. Proposed Algorithm for Static UE Scenario
3.1. The Proposed New GRNN-Based Algorithm
3.1.1. Data Preprocessing
3.1.2. Data Processing
- The input layer, which has the same number of neurons as the number of inputs, receives channel instances of the learning samples of both the direct and indirect channel path coefficients after preprocessing, as described in the previous section.
- The pattern layer comprises radbas (Gaussian) neurons, with a total number of neurons equal to the number of training samples received by the input layer. Layer units will store the non-linear relationship constructed between the generated response and corresponding input, e.g., bypassing the input via each unit in the pattern layer. Weights of this layer are computed as the transpose of the input channel coefficient vectors, , i.e., . At the same time, biases are , where the spread represents the width of a radial basis activation function used, i.e., spread ≤ 1. Then, the i-th pattern neuron’s equivalent transfer function is given by
- The regression layer consists of linear neurons whose weights are set to the target values, denoted by T. Two types of summation operations are performed in this layer, which are symbolized as and , defined as
- In the end, the output layer uses a purlin linear activation function to formulate the final optimized output. For each output neuron, the predicted output in the model is calculated as
3.1.3. Data Representation
3.2. Model Training and Validation
4. Proposed Algorithm for Dynamic UE Scenario
4.1. MDP Formulation
- State: The state consists of all the current observed channel matrices of direct and indirect paths at time step t, i.e., and ; the previous transmission power from the source to each k-th active UE at time step , i.e., ; and the previously received powers at each k-th active UE at time step , i.e., . Here, the transmission power for each k-th UE is defined as , which leads to K inputs stacked to the state. As each k-th UE in the system can receive its desired signal and interference signals due to other UEs, the received power for each k-th UE is defined as . This induces K inputs for each UE; consequently, we have stacked in total to the state. Mathematically, the state at the time step t is given byNote that since the complex-valued features of channel inputs are treated as two separate features, the instant state dimension is .
- Action: The action at the time step t is constructed as the optimized variables of problem , i.e., the active beamforming, , and the passive beamforming vectors of all deployed IRSs, . Thus, the action can be defined asCorrespondingly, the instant action dimension is simply , where the complex-valued form is considered.
- Reward: Since the total transmit power minimization at the source is the objective function of the presented optimization problem, linking the objective function and the reward will permit the achievement of MDP’s goal of increasing long-term rewards. Accordingly, the instant reward feedback by the environment in this model is defined as the overall energy efficiency, i.e., the ratio between the total achieved rate and total transmitted power. Considering the SINR constraints, the SINR of each UE should be above the SINR threshold to satisfy the required QoS threshold. When the resulting UE’s SINR is below the threshold, a penalty is applied to help the agent to alter the inappropriate beamforming. In our designed reward function, we impose the penalty value as zero or can simply assume that an unsuitable action leads to the Inf value of the corresponding total transmit power. Hence, the instant reward at each time step t can be calculated as follows:
- Policy: The policy adopted in this work is a deterministic policy that learns to select the best possible actions of active and passive beamforming depending on different states, as will be discussed in the following section.
4.2. The Proposed DDPG-Based Algorithm
4.2.1. Description of the Proposed DDPG-Based Algorithm
Algorithm 1 The Proposed DDPG-Based Algorithm |
|
4.2.2. Structure of the Actor and Critic Networks
5. Simulation Results
- Without IRSs: no IRSs are deployed in the system, i.e., solving with .
- Random Phases: all IRS elements are set to be random phases and then we optimize by solving .
- Iterative Approach: iteratively solving by random phases and, through iterations, update with variables, leading to minimum transmit power until some criteria are achieved.
- AO Heuristic: is solved by optimized near-optimum solutions obtained through the AO approach [38].
5.1. Performance Analysis for the Static UE Scenario
5.2. Performance Analysis for the Dynamic UE Scenario
5.3. Complexity Analysis
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Saad, W.; Bennis, M.; Chen, M. A vision of 6G wireless systems: Applications, trends, technologies, and open research problems. IEEE Netw. 2019, 34, 134–142. [Google Scholar] [CrossRef] [Green Version]
- Gong, S.; Lu, X.; Hoang, D.T.; Niyato, D.; Shu, L.; Kim, D.I.; Liang, Y.C. Toward smart wireless communications via intelligent reflecting surfaces: A contemporary survey. IEEE Commun. Surv. Tutor. 2020, 22, 2283–2314. [Google Scholar] [CrossRef]
- Di Renzo, M.; Debbah, M.; Phan-Huy, D.T.; Zappone, A.; Alouini, M.S.; Yuen, C.; Sciancalepore, V.; Alexandropoulos, G.C.; Hoydis, J.; Gacanin, H.; et al. Smart radio environments empowered by reconfigurable AI meta-surfaces: An idea whose time has come. EURASIP J. Wirel. Commun. Netw. 2019, 2019, 1–20. [Google Scholar] [CrossRef] [Green Version]
- Hu, S.; Rusek, F.; Edfors, O. Beyond massive MIMO: The potential of data transmission with large intelligent surfaces. IEEE Trans. Signal Process. 2018, 66, 2746–2758. [Google Scholar] [CrossRef] [Green Version]
- Wu, Q.; Zhang, R. Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming. IEEE Trans. Wirel. Commun. 2019, 18, 5394–5409. [Google Scholar] [CrossRef] [Green Version]
- Fu, M.; Zhou, Y.; Shi, Y. Intelligent reflecting surface for downlink non-orthogonal multiple access networks. In Proceedings of the 2019 IEEE Globecom Workshops (GC Wkshps), Waikoloa, HI, USA, 9–13 December 2019; pp. 1–6. [Google Scholar]
- Huang, C.; Zappone, A.; Alexandropoulos, G.C.; Debbah, M.; Yuen, C. Reconfigurable intelligent surfaces for energy efficiency in wireless communication. IEEE Trans. Wirel. Commun. 2019, 18, 4157–4170. [Google Scholar] [CrossRef] [Green Version]
- Yu, X.; Xu, D.; Schober, R. MISO wireless communication systems via intelligent reflecting surfaces. In Proceedings of the 2019 IEEE/CIC International Conference on Communications in China (ICCC), Changchun, China, 11–13 August 2019; pp. 735–740. [Google Scholar]
- Han, H.; Zhao, J.; Niyato, D.; Di Renzo, M.; Pham, Q.V. Intelligent reflecting surface aided network: Power control for physical-layer broadcasting. In Proceedings of the ICC 2020-2020 IEEE International Conference on Communications (ICC), Dublin, Irleand, 7–11 June 2020; pp. 1–7. [Google Scholar]
- Guo, H.; Liang, Y.C.; Chen, J.; Larsson, E.G. Weighted sum-rate maximization for reconfigurable intelligent surface aided wireless networks. IEEE Trans. Wirel. Commun. 2020, 19, 3064–3076. [Google Scholar] [CrossRef] [Green Version]
- Zhang, Q.; Saad, W.; Bennis, M. Millimeter wave communications with an intelligent reflector: Performance optimization and distributional reinforcement learning. IEEE Trans. Wirel. Commun. 2021, 21, 1836–1850. [Google Scholar] [CrossRef]
- Wang, P.; Fang, J.; Yuan, X.; Chen, Z.; Li, H. Intelligent reflecting surface-assisted millimeter wave communications: Joint active and passive precoding design. IEEE Trans. Veh. Technol. 2020, 69, 14960–14973. [Google Scholar] [CrossRef]
- Li, Z.; Hua, M.; Wang, Q.; Song, Q. Weighted sum-rate maximization for multi-IRS aided cooperative transmission. IEEE Wirel. Commun. Lett. 2020, 9, 1620–1624. [Google Scholar] [CrossRef]
- Zhao, J. Optimizations with intelligent reflecting surfaces (IRSs) in 6G wireless networks: Power control, quality of service, max-min fair beamforming for unicast, broadcast, and multicast with multi-antenna mobile users and multiple IRSs. arXiv 2019, arXiv:1908.03965. [Google Scholar]
- Zappone, A.; Di Renzo, M.; Debbah, M. Wireless networks design in the era of deep learning: Model-based, AI-based, or both? IEEE Trans. Commun. 2019, 67, 7331–7376. [Google Scholar] [CrossRef] [Green Version]
- Liu, Y.; Liu, X.; Mu, X.; Hou, T.; Xu, J.; Di Renzo, M.; Al-Dhahir, N. Reconfigurable intelligent surfaces: Principles and opportunities. IEEE Commun. Surv. Tutorials 2021, 23, 1546–1577. [Google Scholar] [CrossRef]
- Elbir, A.M.; Papazafeiropoulos, A.; Kourtessis, P.; Chatzinotas, S. Deep channel learning for large intelligent surfaces aided mm-wave massive MIMO systems. IEEE Wirel. Commun. Lett. 2020, 9, 1447–1451. [Google Scholar] [CrossRef]
- Liu, S.; Gao, Z.; Zhang, J.; Di Renzo, M.; Alouini, M.S. Deep denoising neural network assisted compressive channel estimation for mmWave intelligent reflecting surfaces. IEEE Trans. Veh. Technol. 2020, 69, 9223–9228. [Google Scholar] [CrossRef]
- Huang, C.; Alexandropoulos, G.C.; Yuen, C.; Debbah, M. Indoor signal focusing with deep learning designed reconfigurable intelligent surfaces. In Proceedings of the 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Cannes, France, 2–5 July 2019; pp. 1–5. [Google Scholar]
- Gao, J.; Zhong, C.; Chen, X.; Lin, H.; Zhang, Z. Unsupervised learning for passive beamforming. IEEE Commun. Lett. 2020, 24, 1052–1056. [Google Scholar] [CrossRef] [Green Version]
- Taha, A.; Alrabeiah, M.; Alkhateeb, A. Enabling large intelligent surfaces with compressive sensing and deep learning. IEEE Access 2021, 9, 44304–44321. [Google Scholar] [CrossRef]
- Khan, S.; Khan, K.S.; Haider, N.; Shin, S.Y. Deep-learning-aided detection for reconfigurable intelligent surfaces. arXiv 2019, arXiv:1910.09136. [Google Scholar]
- Xu, G.; Zhang, N.; Xu, M.; Xu, Z.; Zhang, Q.; Song, Z. Outage Probability and Average BER of UAV-assisted Dual-hop FSO Communication with Amplify-and-Forward Relaying. IEEE Trans. Veh. Technol. 2023, 1–16. [Google Scholar] [CrossRef]
- Cui, P.F.; Zhang, J.A.; Lu, W.J.; Guo, Y.J.; Zhu, H. Statistical sparse channel modeling for measured and simulated wireless temporal channels. IEEE Trans. Wirel. Commun. 2019, 18, 5868–5881. [Google Scholar] [CrossRef]
- Taha, A.; Zhang, Y.; Mismar, F.B.; Alkhateeb, A. Deep reinforcement learning for intelligent reflecting surfaces: Towards standalone operation. In Proceedings of the 2020 IEEE 21st International Workshop on Signal ProcessingAdvances in Wireless Communications (SPAWC), Atlanta, GA, USA, 26–29 May 2020; pp. 1–5. [Google Scholar]
- Huang, C.; Mo, R.; Yuen, C. Reconfigurable intelligent surface assisted multiuser MISO systems exploiting deep reinforcement learning. IEEE J. Sel. Areas Commun. 2020, 38, 1839–1850. [Google Scholar] [CrossRef]
- Yang, H.; Xiong, Z.; Zhao, J.; Niyato, D.; Xiao, L.; Wu, Q. Deep reinforcement learning-based intelligent reflecting surface for secure wireless communications. IEEE Trans. Wirel. Commun. 2020, 20, 375–388. [Google Scholar] [CrossRef]
- Chiang, M.; Hande, P.; Lan, T.; Tan, C.W. Power control in wireless cellular networks. Found. Trends® Netw. 2008, 2, 381–533. [Google Scholar] [CrossRef]
- Pan, C.; Ren, H.; Wang, K.; Elkashlan, M.; Nallanathan, A.; Wang, J.; Hanzo, L. Intelligent reflecting surface aided MIMO broadcasting for simultaneous wireless information and power transfer. IEEE J. Sel. Areas Commun. 2020, 38, 1719–1734. [Google Scholar] [CrossRef]
- Subrt, L.; Pechac, P. Intelligent walls as autonomous parts of smart indoor environments. IET Commun. 2012, 6, 1004–1010. [Google Scholar] [CrossRef]
- Wiesel, A.; Eldar, Y.C.; Shamai, S. Linear precoding via conic optimization for fixed MIMO receivers. IEEE Trans. Signal Process. 2005, 54, 161–176. [Google Scholar] [CrossRef] [Green Version]
- CVX: Matlab Software for Disciplined Convex Programming, Version 2.2. 2020. Available online: http://cvxr.com/cvx/ (accessed on 30 November 2022).
- Wasserman, P.D. Advanced Methods in Neural Computing; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1993. [Google Scholar]
- Hastie, T.; Tibshirani, R.; Friedman, J.H.; Friedman, J.H. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: Berlin/Heidelberg, Germany, 2009; Volume 2. [Google Scholar]
- Sutton, R.S.; Barto, A.G. Introduction to Reinforcement Learning; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Sur, S.N.; Singh, A.K.; Kandar, D.; Silva, A.; Nguyen, N.D. Intelligent Reflecting Surface Assisted Localization: Opportunities and Challenges. Electronics 2022, 11, 1411. [Google Scholar] [CrossRef]
- Jiang, T.; Cheng, H.V.; Yu, W. Learning to reflect and to beamform for intelligent reflecting surface with implicit channel estimation. IEEE J. Sel. Areas Commun. 2021, 39, 1931–1945. [Google Scholar] [CrossRef]
- Fathy, M.; Abood, M.S.; Guo, J. A Generalized Neural Network-based Optimization for Multiple IRSs-aided Communication System. In Proceedings of the 2021 IEEE 21st International Conference on Communication Technology (ICCT), Tianjin, China, 13–16 October 2021; pp. 480–486. [Google Scholar]
Hyper-Parameter | Setting |
---|---|
Actor learning rate | 0.001 |
Critic learning rate | 0.001 |
Target smooth factor | 0.001 |
Discount factor | 0.95 |
Mini-batch size | 16 |
Experience replay buffer size C | 200,000 |
Target update frequency U | 1 |
Sampling time | 1 |
SF | ||||
---|---|---|---|---|
0.28 | 0.2308 | 2.5185e-6 | 0.0859 | 1.0752e-10 |
0.29 | 0.2305 | 7.6251e-6 | 0.0852 | 8.2084e-10 |
0.3 | 0.2303 | 2.1302e-5 | 0.0848 | 5.4173e-9 |
0.31 | 0.2303 | 5.5296e-5 | 0.0845 | 3.1258e-8 |
0.32 | 0.2303 | 1.3417e-4 | 0.0843 | 1.5922e-7 |
0.33 | 0.2304 | 3.0588e-4 | 0.0842 | 7.215e-7 |
0.34 | 0.2306 | 6.5791e-4 | 0.0842 | 2.9245e-6 |
SINR | −10 dB | 0 dB | 10 dB | |
---|---|---|---|---|
Approach | ||||
AO | 10.6562 | 10.9562 | 12.9007 | |
AO Heuristic | 3.8671 | 3.9377 | 4.5948 | |
GRNN-Based | 0.2237 | 0.2366 | 0.2392 | |
DDPG-Based | 0.0091 | 0.0092 | 0.0092 |
N 10 | 12 | 14 | 16 | 18 | 20 | ||
---|---|---|---|---|---|---|---|
Approach | |||||||
AO | 8.5039 | 9.2905 | 11.0527 | 13.2385 | 15.3093 | 19.8383 | |
AO Heuristic | 5.5492 | 6.5612 | 6.5886 | 8.1265 | 9.9036 | 12.7005 | |
DDPG-Based | 0.0092 | 0.0094 | 0.0095 | 0.0096 | 0.0097 | 0.0098 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fathy, M.; Fei, Z.; Guo, J.; Abood, M.S. Machine-Learning-Based Optimization for Multiple-IRS-Aided Communication System. Electronics 2023, 12, 1703. https://doi.org/10.3390/electronics12071703
Fathy M, Fei Z, Guo J, Abood MS. Machine-Learning-Based Optimization for Multiple-IRS-Aided Communication System. Electronics. 2023; 12(7):1703. https://doi.org/10.3390/electronics12071703
Chicago/Turabian StyleFathy, Maha, Zesong Fei, Jing Guo, and Mohamed Salah Abood. 2023. "Machine-Learning-Based Optimization for Multiple-IRS-Aided Communication System" Electronics 12, no. 7: 1703. https://doi.org/10.3390/electronics12071703
APA StyleFathy, M., Fei, Z., Guo, J., & Abood, M. S. (2023). Machine-Learning-Based Optimization for Multiple-IRS-Aided Communication System. Electronics, 12(7), 1703. https://doi.org/10.3390/electronics12071703