The Dynamic Response of Dual Cellular-Connected UAVs for Random Real-Time Communication Requests from Multiple Hotspots: A Deep Reinforcement Learning Approach
Abstract
:1. Introduction
- (1)
- Both the limitation of the UAV-UE’s energy and the service waiting time are considered with respect to the actual conditions. The UAV-UEs can fly back to the BS for battery charging if the residual energy is low and can continue to respond to service requests after being fully charged. Meanwhile, a service request must become invalid if there is a long wait time (i.e., longer than a given threshold).
- (2)
- By modeling the dual-UAV joint C&C problem of the continuous random services from multiple hotspots, a deep Q-Learning approach with a single agent at the BS is proposed. This method can address both the environmental complexity and infinite time that conventional mathematical methods cannot deal with.
- (3)
- The reward based on each completion of a service request is designed to overcome the delay reward difficulty, which can result in the final real-time decision for the UAVs in dynamic conditions. The performance of the proposed method is compared with the shortest distance priority algorithm and the shortest waiting time algorithm, demonstrating that the proposed algorithm can outperform these two counterparts.
2. System Model and Problem Formulation
2.1. System Model
2.1.1. Request Arrival
2.1.2. The UAV Flight Time and the Corresponding Energy Consumption
2.1.3. The On-Site UAV Video Service Time and the Corresponding Energy Consumption
2.1.4. Energy Consumption Model and Flight Safety
2.2. Problem Formulation
3. The Solution Approach of the Dual-UAV Dynamic Response Based on Deep Q-Learning
3.1. State
3.2. Action
3.3. Reward Based on Service Completion and Energy Consumption
3.4. Training and Updating by Deep Q-Learning
3.5. Baseline Methods
4. Results and Discussions
4.1. Simulation Results
4.2. Algorithm Comparison Results
4.3. Discussions
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Mozaffari, M.; Saad, W.; Bennis, M.; Nam, Y.; Debbah, M. A Tutorial on UAVs for Wireless Networks: Applications, Challenges, and Open Problems. IEEE Commun. Surv. Tutor. 2019, 21, 2334–2360. [Google Scholar] [CrossRef]
- Qi, F.; Zhu, X.; Mang, G.; Kadoch, M.; Li, W. UAV Network and IoT in the Sky for Future Smart Cities. IEEE Netw. 2019, 33, 96–101. [Google Scholar] [CrossRef]
- Wolfert, S.; Ge, L.; Verdouw, C.; Bogaardt, M.J. Big Data in Smart Farming—A review. Agric. Syst. 2017, 153, 69–80. [Google Scholar] [CrossRef]
- Gupta, L.; Jain, R.; Vaszkun, G. Survey of Important Issues in UAV Communication Networks. IEEE Commun. Surv. Tutor. 2016, 18, 1123–1152. [Google Scholar] [CrossRef]
- Motlagh, N.H.; Bagaa, M.; Taleb, T. UAV-Based IoT Platform: A Crowd Surveillance Use Case. IEEE Commun. Mag. 2017, 55, 128–134. [Google Scholar] [CrossRef]
- Mozaffari, M.; Taleb Zadeh Kasgari, A.; Saad, W.; Bennis, M.; Debbah, M. Beyond 5G With UAVs: Foundations of a 3D Wireless Cellular Network. IEEE Trans. Wirel. Commun. 2019, 18, 357–372. [Google Scholar] [CrossRef]
- Challita, U.; Saad, W.; Bettstetter, C. Interference Management for Cellular-Connected UAVs: A Deep Reinforcement Learning Approach. IEEE Trans. Wirel. Commun. 2019, 18, 2125–2140. [Google Scholar] [CrossRef]
- Gapeyenko, M.; Petrov, V.; Moltchanov, D.; Andreev, S.; Himayat, N.; Koucheryavy, Y. Flexible and Reliable UAV-Assisted Backhaul Operation in 5G mmWave Cellular Networks. IEEE J. Sel. Areas Commun. 2018, 36, 2486–2496. [Google Scholar] [CrossRef]
- Kumbhar, A.; Koohifar, F.; Güvenç, I.; Mueller, B. A Survey on Legacy and Emerging Technologies for Public Safety Communications. IEEE Commun. Surv. Tutor. 2017, 19, 97–124. [Google Scholar] [CrossRef]
- Huang, H.; Savkin, A.V. Aerial Surveillance in Cities: When UAVs Take Public Transportation Vehicles. IEEE Trans. Autom. Sci. Eng. 2023, 20, 1069–1080. [Google Scholar] [CrossRef]
- Wei, Z.; Zhu, M.; Zhang, N.; Wang, L.; Zou, Y.; Meng, Z.; Wu, H.; Feng, Z. UAV-Assisted Data Collection for Internet of Things: A Survey. IEEE Internet Things J. 2022, 9, 15460–15483. [Google Scholar] [CrossRef]
- Xu, H.; Wang, L.; Han, W.; Yang, Y.; Li, J.; Lu, Y.; Li, J. A Survey on UAV Applications in Smart City Management: Challenges, Advances, and Opportunities. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2023, 16, 8982–9010. [Google Scholar] [CrossRef]
- Zeng, Y.; Zhang, R.; Lim, T.J. Wireless communications with unmanned aerial vehicles: Opportunities and challenges. IEEE Commun. Mag. 2016, 54, 36–42. [Google Scholar] [CrossRef]
- Zeng, Y.; Zhang, R. Energy-Efficient UAV Communication with Trajectory Optimization. IEEE Trans. Wirel. Commun. 2017, 16, 3747–3760. [Google Scholar] [CrossRef]
- Wu, Q.; Zeng, Y.; Zhang, R. Joint Trajectory and Communication Design for Multi-UAV Enabled Wireless Networks. IEEE Trans. Wirel. Commun. 2018, 17, 2109–2121. [Google Scholar] [CrossRef]
- Zeng, Y.; Xu, X.; Zhang, R. Trajectory Design for Completion Time Minimization in UAV-Enabled Multicasting. IEEE Trans. Wirel. Commun. 2018, 17, 2233–2246. [Google Scholar] [CrossRef]
- Zhang, S.; Zhang, H.; Di, B.; Song, L. Joint Trajectory and Power Optimization for UAV Sensing Over Cellular Networks. IEEE Commun. Lett. 2018, 22, 2382–2385. [Google Scholar] [CrossRef]
- Mozaffari, M.; Saad, W.; Bennis, M.; Debbah, M. Mobile Internet of Things: Can UAVs Provide an Energy-Efficient Mobile Architecture? In Proceedings of the 2016 IEEE Global Communications Conference (GLOBECOM), Washington, DC, USA, 4–8 December 2016; pp. 1–6. [Google Scholar] [CrossRef]
- Koulali, S.; Sabir, E.; Taleb, T.; Azizi, M. A green strategic activity scheduling for UAV networks: A sub-modular game perspective. IEEE Commun. Mag. 2016, 54, 58–64. [Google Scholar] [CrossRef]
- Ramaithitima, R.; Whitzer, M.; Bhattacharya, S.; Kumar, V. Automated Creation of Topological Maps in Unknown Environments Using a Swarm of Resource-Constrained Robots. IEEE Robot. Autom. Lett. 2016, 1, 746–753. [Google Scholar] [CrossRef]
- Monwar, M.; Omid Semiari, W.S. Optimized Path Planning for Inspection by Unmanned Aerial Vehicles Swarm with Energy Constraints. In Proceedings of the 2018 IEEE Global Communications Conference (GLOBECOM), Abu Dhabi, United Arab Emirates, 9–13 December 2018. [Google Scholar]
- Zhang, S.; Zeng, Y.; Zhang, R. Cellular-Enabled UAV Communication: A Connectivity-Constrained Trajectory Optimization Perspective. IEEE Trans. Commun. 2019, 67, 2580–2604. [Google Scholar] [CrossRef]
- Lin, C.; Wang, Z.; Deng, J.; Wang, L.; Ren, J.; Wu, G. mTS: Temporal-and Spatial-Collaborative Charging for Wireless Rechargeable Sensor Networks with Multiple Vehicles. In Proceedings of the IEEE INFOCOM 2018—IEEE Conference on Computer Communications, Honolulu, HI, USA, 16–19 April 2018; pp. 99–107. [Google Scholar] [CrossRef]
- Kurunathan, H.; Huang, H.; Li, K.; Ni, W.; Hossain, E. Machine Learning-Aided Operations and Communications of Unmanned Aerial Vehicles: A Contemporary Survey. IEEE Commun. Surv. Tutor. 2024, 26, 496–533. [Google Scholar] [CrossRef]
- Wang, C.; Wang, J.; Shen, Y.; Zhang, X. Autonomous Navigation of UAVs in Large-Scale Complex Environments: A Deep Reinforcement Learning Approach. IEEE Trans. Veh. Technol. 2019, 68, 2124–2136. [Google Scholar] [CrossRef]
- Liu, X.; Chen, M.; Yin, C. Optimized Trajectory Design in UAV Based Cellular Networks for 3D Users: A Double Q-Learning Approach. arXiv 2019, arXiv:1902.06610. [Google Scholar] [CrossRef]
- Liu, C.H.; Chen, Z.; Tang, J.; Xu, J.; Piao, C. Energy-Efficient UAV Control for Effective and Fair Communication Coverage: A Deep Reinforcement Learning Approach. IEEE J. Sel. Areas Commun. 2018, 36, 2059–2070. [Google Scholar] [CrossRef]
- Sha, P.; Wang, Q. Autonomous Navigation of UAVs in Resource Limited Environment Using Deep Reinforcement Learning. In Proceedings of the 2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC), Beijing, China, 19–20 November 2022; pp. 36–41. [Google Scholar] [CrossRef]
- Cai, Y.; Zhang, E.; Qi, Y.; Lu, L. A Review of Research on the Application of Deep Reinforcement Learning in Unmanned Aerial Vehicle Resource Allocation and Trajectory Planning. In Proceedings of the 2022 4th International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), Shanghai, China, 28–30 October 2022; pp. 238–241. [Google Scholar] [CrossRef]
- Osborn, D.R.; Tseloni, A. The Distribution of Household Property Crimes. J. Quant. Criminol. 1998, 14, 307–330. [Google Scholar] [CrossRef]
- Renna, F.; Doyle, J.; Giotsas, V.; Andreopoulos, Y. Media Query Processing for the Internet-of-Things: Coupling of Device Energy Consumption and Cloud Infrastructure Billing. IEEE Trans. Multimed. 2016, 18, 2537–2552. [Google Scholar] [CrossRef]
- Stolaroff, J.K.; Samaras, C.; O’Neill, E.R.; Lubers, A.; Mitchell, A.S.; Ceperley, D. Energy use and life cycle greenhouse gas emissions of drones for commercial package delivery. Nat. Commun. 2018, 9, 409. [Google Scholar] [CrossRef]
- Mozaffari, M.; Saad, W.; Bennis, M.; Debbah, M. Wireless Communication Using Unmanned Aerial Vehicles (UAVs): Optimal Transport Theory for Hover Time Optimization. IEEE Trans. Wirel. Commun. 2017, 16, 8052–8066. [Google Scholar] [CrossRef]
- Lysenko, O.; Valuiskyi, S.; Kirchu, P.; Romaniuk, A. Optimal control of telecommunication aeroplatform in the area of emergency. Inf. Telecommun. Sci. 2013. [Google Scholar] [CrossRef]
- Tuong, V.D.; Noh, W.; Cho, S. Sparse CNN and Deep Reinforcement Learning-Based D2D Scheduling in UAV-Assisted Industrial IoT Networks. IEEE Trans. Ind. Inform. 2024, 20, 213–223. [Google Scholar] [CrossRef]
- 3GPP. Study on New Radio (NR) to Support Non Terrestrial Networks V15.4.0 (Release 15). 2020. Available online: https://portal.3gpp.org/Specifications.aspx?q=1&releases=190 (accessed on 4 May 2024).
Role | Parameter | Notation | Instruction |
---|---|---|---|
UAV i | Current location | Initial at , | |
Current energy | |||
Request k | Location | Uniform distribution | |
Workload | Initial with uniform distribution | ||
Waiting deadline | Declining with time | ||
Association mark | , : already served or overdue; : unserved; : being served. |
Parameter | Value | Unit |
---|---|---|
Video transmit power | 30 | dBm |
UAV number N | 2 | - |
Total bandwidth B | 3 | MHz |
Noise power | −174 | dBm/Hz |
Reference channel gain | - | |
Flight height h | 50 | m |
Power efficiency | 0.7 | - |
Flight speed v | 10 | m/s |
UAV mass m | 2 | kg |
Drag Force | 9.6998 | N |
Rotor number q | 4 | - |
Rotor diameter | 0.254 | m |
Density of air | 1.225 | kg/m3 |
Gravitational acceleration g | 9.8 | m/s2 |
Pitch angle | rad | |
Induced velocity | 4.9556 | m/s |
Request density in each time slot | 0.05, 0.1, 0.15, 0.2, 0.25 | /m2 |
Max waiting time | 30 | time slots |
Workload parameter | 500 | Mbits |
UAV moving step | 50 | m |
Potential hotspots M | 10 | - |
Radius of the target region | 250 | m |
Full energy capacity E | 100,000 | Joule |
Radius of a hotspot r | 50 | m |
Discount rate | 0.8 | - |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, S.; Zhou, J.; Meng, X. The Dynamic Response of Dual Cellular-Connected UAVs for Random Real-Time Communication Requests from Multiple Hotspots: A Deep Reinforcement Learning Approach. Electronics 2024, 13, 4181. https://doi.org/10.3390/electronics13214181
Yang S, Zhou J, Meng X. The Dynamic Response of Dual Cellular-Connected UAVs for Random Real-Time Communication Requests from Multiple Hotspots: A Deep Reinforcement Learning Approach. Electronics. 2024; 13(21):4181. https://doi.org/10.3390/electronics13214181
Chicago/Turabian StyleYang, Shengzhi, Jianming Zhou, and Xiao Meng. 2024. "The Dynamic Response of Dual Cellular-Connected UAVs for Random Real-Time Communication Requests from Multiple Hotspots: A Deep Reinforcement Learning Approach" Electronics 13, no. 21: 4181. https://doi.org/10.3390/electronics13214181
APA StyleYang, S., Zhou, J., & Meng, X. (2024). The Dynamic Response of Dual Cellular-Connected UAVs for Random Real-Time Communication Requests from Multiple Hotspots: A Deep Reinforcement Learning Approach. Electronics, 13(21), 4181. https://doi.org/10.3390/electronics13214181