A Novel Method for a Pursuit–Evasion Game Based on Fuzzy Q-Learning and Model-Predictive Control
Abstract
:1. Introduction
- (1)
- For the first time, the Apollonius circle is extended from 2D space to 3D space, and an analytical equation is provided. A novel learning algorithm based on the FQL framework is proposed, and a reward function based on the idea of artificial potential field is designed for the learning algorithm, which improves the convergence speed and learning performance of the algorithm.
- (2)
- Distinct from existing methods, with the strategy derived from the FQL algorithm as the reference signal, we model the trajectory tracking problem as an MPC optimization problem for decoupled quadrotor. We designed a state feedback controller for the quadrotor that takes into account control input constraints and obstacle avoidance constraints and analyzed the feasibility and stability of the solution.
- (3)
- We have verified the learning performance of FQL in 3D scenarios. The trajectory tracking capability of the designed state feedback algorithm has been validated through 3D trajectory tracking results. Finally, the combination of FQL and MPC algorithms is employed to control the quadrotor to achieve a PEG on the Gazebo platform.
2. Perception and Decision Based on Fuzzy Q-Learning
2.1. The Model of Pursuit–Evasion Game
2.2. Fuzzy Q-Learning Algorithm
3. Quadrotor Control Based on MPC
3.1. The Model of Quadrotor and Control Objective
3.2. Design of the State Feedback Controller Based on MPC
3.3. Feasibility Analysis of the State Feedback Control Law
3.4. Stability Analysis of the State Feedback Control Law
Algorithm 1 PEG Algorithm Based on FQL and MPC |
|
4. Simulation Results and Analysis
4.1. Simulation of Quadrotor PEG Based on FQL
4.2. Simulation of Quadrotor Control Based on MPC
4.3. Simulation of Quadrotor PEG Based on FQL and MPC
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Weintraub, I.E.; Pachter, M.; Garcia, E. An introduction to pursuit–evasion differential games. In Proceedings of the 2020 American Control Conference (ACC), Denver, CO, USA, 1–3 July 2020; pp. 1049–1066. [Google Scholar]
- Lee, E.S.; Shishika, D.; Loianno, G.; Kumar, V. Defending a perimeter from a ground intruder using an aerial defender: Theory and practice. In Proceedings of the 2021 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), New York City, NY, USA, 25–27 October 2021; pp. 184–189. [Google Scholar]
- Biediger, D.; Popov, L.; Becker, A.T. The pursuit and evasion of drones attacking an automated turret. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; pp. 9677–9682. [Google Scholar]
- Zhang, F.; Zha, W. Evasion strategies of a three-player lifeline game. Sci. China Inf. Sci. 2018, 61, 112206. [Google Scholar] [CrossRef]
- Yan, R.; Deng, R.; Lai, H.; Zhang, W.; Shi, Z.; Zhong, Y. Multiplayer homicidal chauffeur reach-avoid games via guaranteed winning strategies. arXiv 2021, arXiv:2107.04709. [Google Scholar]
- Liang, L.; Deng, F.; Peng, Z.; Li, X.; Zha, W. A differential game for cooperative target defense. Automatica 2019, 102, 58–71. [Google Scholar] [CrossRef]
- Ibragimov, G.; Ferrara, M.; Ruziboev, M.; Pansera, B.A. Linear evasion differential game of one evader and several pursuers with integral constraints. Int. J. Game Theory 2021, 50, 729–750. [Google Scholar] [CrossRef]
- Deng, Z.; Kong, Z. Multi-agent cooperative pursuit-defense strategy against one single attacker. IEEE Robot. Autom. Lett. 2020, 5, 5772–5778. [Google Scholar] [CrossRef]
- Garcia, E.; Casbeer, D.W.; Von Moll, A.; Pachter, M. Multiple pursuer multiple evader differential games. IEEE Trans. Autom. Control. 2020, 66, 2345–2350. [Google Scholar] [CrossRef]
- Yan, R.; Shi, Z.; Zhong, Y. Cooperative strategies for two-evader-one-pursuer reach-avoid differential games. Int. J. Syst. Sci. 2021, 52, 1894–1912. [Google Scholar] [CrossRef]
- Yan, R.; Shi, Z.; Zhong, Y. Task assignment for multiplayer reach–avoid games in convex domains via analytical barriers. IEEE Trans. Robot. 2019, 36, 107–124. [Google Scholar] [CrossRef]
- Lopez, V.G.; Lewis, F.L.; Wan, Y.; Sanchez, E.N.; Fan, L. Solutions for multiagent pursuit–evasion games on communication graphs: Finite-time capture and asymptotic behaviors. IEEE Trans. Autom. Control. 2019, 65, 1911–1923. [Google Scholar] [CrossRef]
- Sani, M.; Hably, A.; Robu, B.; Dumon, J. Real-time game-theoretic model predictive control for differential game of target defense. Asian J. Control. 2023, 25, 3343–3355. [Google Scholar] [CrossRef]
- Schwartz, H.M. Multi-Agent Machine Learning: A Reinforcement Approach; John Wiley and Sons: Hoboken, NJ, USA, 2014. [Google Scholar]
- Eklund, J.M.; Sprinkle, J.; Sastry, S.S. Switched and symmetric pursuit/evasion games using online model predictive control with application to autonomous aircraft. IEEE Trans. Control. Syst. Technol. 2011, 20, 604–620. [Google Scholar] [CrossRef]
- De Simone, D.; Scianca, N.; Ferrari, P.; Lanari, L.; Oriolo, G. MPC-based humanoid pursuit–evasion in the presence of obstacles. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 5245–5250. [Google Scholar]
- Sani, M.; Robu, B.; Hably, A. pursuit–evasion Games Based on Game-theoretic and Model Predictive Control Algorithms. In Proceedings of the 2021 International Conference on Control, Automation and Diagnosis (ICCAD), Grenoble, France, 3–5 November 2021; pp. 1–6. [Google Scholar]
- Sani, M.; Robu, B.; Hably, A. Limited information model predictive control for pursuit–evasion games. In Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, 14–17 December 2021; pp. 265–270. [Google Scholar]
- Manoharan, A.; Sujit, P.B. NMPC-Based Cooperative Strategy to Lure Two Attackers Into Collision by Two Targets. IEEE Control. Syst. Lett. 2022, 7, 496–501. [Google Scholar] [CrossRef]
- Peng, Y.; Mo, T.; Zheng, D.; Deng, Q.; Wang, J.; Qu, D.; Xie, Y. Model Predictive Control-based pursuit–evasion Games for Unmanned Surface Vessel; Springer: Singapore, 2023. [Google Scholar]
- Yu, C.; Dong, Y.; Li, Y.; Chen, Y. Distributed multi-agent deep reinforcement learning for cooperative multi-robot pursuit. J. Eng. 2020, 2020, 499–504. [Google Scholar] [CrossRef]
- Wang, Y.; Dong, L.; Sun, C. Cooperative control for multi-player pursuit–evasion games with reinforcement learning. Neurocomputing 2020, 412, 101–114. [Google Scholar] [CrossRef]
- Kartal, Y.; Subbarao, K.; Dogan, A.; Lewis, F. Optimal game theoretic solution of the pursuit-evasion intercept problem using on-policy reinforcement learning. Int. J. Robust Nonlinear Control. 2021, 31, 7886–7903. [Google Scholar] [CrossRef]
- Yang, B.; Liu, P.; Feng, J.; Li, S. Two-stage pursuit strategy for incomplete-information impulsive space pursuit–evasion mission using reinforcement learning. Aerospace 2021, 8, 299. [Google Scholar] [CrossRef]
- Zhang, R.; Zong, Q.; Zhang, X.; Dou, L.; Tian, B. Game of Drones: Multi-uav pursuit–evasion Game with Online Motion Planning by Deep Reinforcement Learning; IEEE Transactions on Neural Networks and Learning Systems: Piscataway, NJ, USA, 2022. [Google Scholar]
- Selvakumar, J.; Bakolas, E. Min–Max Q-learning for multi-player pursuit–evasion games. Neurocomputing 2022, 475, 1–14. [Google Scholar] [CrossRef]
- Camci, E.; Kayacan, E. Game of drones: UAV pursuit–evasion game with type-2 fuzzy logic controllers tuned by reinforcement learning. In Proceedings of the 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), Vancouver, BC, Canada, 24–29 July 2016; pp. 618–625. [Google Scholar]
- Awheda, M.D.; Schwartz, H.M. A decentralized fuzzy learning algorithm for pursuit–evasion differential games with superior evaders. J. Intell. Robot. Syst. 2016, 83, 35–53. [Google Scholar] [CrossRef]
- Wang, L.; Wang, M.; Yue, T. A fuzzy deterministic policy gradient algorithm for pursuit–evasion differential games. Neurocomputing 2019, 362, 106–117. [Google Scholar] [CrossRef]
- Liu, S.; Hu, X.; Dong, K. Adaptive Double Fuzzy Systems Based Q-Learning for pursuit–evasion Game. IFAC-PapersOnLine 2022, 55, 251–256. [Google Scholar] [CrossRef]
- Vidyasagar, M. Nonlinear Systems Analysis; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2002. [Google Scholar]
- Hu, P.; Pan, Q.; Zhao, C.; Guo, Y. Transfer reinforcement learning for multi-agent pursuit-evasion differential game with obstacles in a continuous environment. Asian J. Control. 2024, 26, 2125–2140. [Google Scholar] [CrossRef]
- Jafari, H.; Zareh, M.; Roshanian, J.; Nikkhah, A. An optimal guidance law applied to quadrotor using LQR method. Trans. Jpn. Soc. Aeronaut. Space Sci. 2010, 53, 32–39. [Google Scholar] [CrossRef]
- Fan, B.; Sun, J.; Yu, Y. A LQR controller for a quadrotor: Design and experiment. In Proceedings of the 2016 31st Youth Academic Annual Conference of Chinese Association of Automation, Wuhan, China, 11–13 November 2016. [Google Scholar]
- Ihnak, M.S.; Edardar, M.M. Comparing LQR and PID controllers for quadcopter control effectiveness and cost analysis. In Proceedings of the 2023 IEEE 11th International Conference on Systems and Control, Sousse, Tunisia, 18–20 December 2023. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hu, P.; Zhao, C.; Pan, Q. A Novel Method for a Pursuit–Evasion Game Based on Fuzzy Q-Learning and Model-Predictive Control. Drones 2024, 8, 509. https://doi.org/10.3390/drones8090509
Hu P, Zhao C, Pan Q. A Novel Method for a Pursuit–Evasion Game Based on Fuzzy Q-Learning and Model-Predictive Control. Drones. 2024; 8(9):509. https://doi.org/10.3390/drones8090509
Chicago/Turabian StyleHu, Penglin, Chunhui Zhao, and Quan Pan. 2024. "A Novel Method for a Pursuit–Evasion Game Based on Fuzzy Q-Learning and Model-Predictive Control" Drones 8, no. 9: 509. https://doi.org/10.3390/drones8090509
APA StyleHu, P., Zhao, C., & Pan, Q. (2024). A Novel Method for a Pursuit–Evasion Game Based on Fuzzy Q-Learning and Model-Predictive Control. Drones, 8(9), 509. https://doi.org/10.3390/drones8090509