Anti-Jamming Communication Using Slotted Cross Q Learning
Abstract
:1. Introduction
- Analyzing the influence of different sensing, learning, and transmission timeslot structure distribution on the time utilization and anti-jamming effect of the algorithm.
- Proposing a PQL algorithm to address low slot utilization using limited jamming change speed.
- Proposing a SCQL algorithm to mitigate high jamming collision rates.
2. System Components and Problem Formulation
2.1. System Components
- The communication subsystem contains a transmitter, a receiver, and a jammer. An optional channel is allocated between the transmitter and the receiver. The transmitter has no a priori knowledge about the jamming strategy, but can sense whether there is jamming in all channels at each timeslot using wide-band spectrum sensing technology;
- The jammer can sense the communication probability of each channel in each timeslot of the previous period, and carry out high-power and pressure jamming on the channels with the highest communication probability, and the communication on the jamming channel is bound to fail;
- In the SCQL algorithm, there are multiple Q-tables in the QL subsystem, and different Q-tables are updated alternately with the timeslot. The decision results obtained by learning are stored in the registers of the decision subsystem.
2.2. Problem Formulation
3. Scheme
3.1. Algorithm
- Firstly, through a period of observation, the jamming period , perception and learning period are calculated, and the required number of Q-tables N is obtained according to Equation (16).
- The jamming in this jamming period is sensed and the table is updated.
- For timeslot t, if , the optimal action in the next timeslot is obtained by using the table and according to the -greedy strategy, that is, the appropriate communication channel is selected. The -greedy strategy is as follows:
3.2. Complexity and Convergence Analysis
Algorithm 1 Parallel Q-learning (PQL) |
|
Algorithm 2 Slotted Cross Q-learning (SCQL) |
|
4. Simulation
4.1. Setting
4.2. Results
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Zhou, Q.; Niu, Y.; Xiang, P.; Li, Y. Intra-Domain Knowledge Reuse Assisted Reinforcement Learning for Fast Anti-Jamming Communication. IEEE Trans. Inf. Forensics Secur. 2023. [Google Scholar] [CrossRef]
- Pärlin, K.; Riihonen, T.; Turunen, M.; Nir, V.L.; Adrat, M. Known-Interference Cancellation in Cooperative Jamming: Experimental Evaluation and Benchmark Algorithm Performance. IEEE Wirel. Commun. Lett. 2023. [Google Scholar] [CrossRef]
- Kirk, B.H.; Martone, A.F.; Gallagher, K.A.; Narayanan, R.M.; Sherbondy, K.D. Mitigation of Clutter Modulation in Cognitive Radar for Spectrum Sharing Applications. IEEE Trans. Radar Syst. 2023. [Google Scholar] [CrossRef]
- Chen, X.; Guo, D.; Wu, W. A Robust Detection-Estimation-Navigation Framework with Deep Reinforcement Learning against GPS Jamming. IEEE Access 2020, 8, 131750–131763. [Google Scholar]
- Zhang, H.; Han, X.; Cheng, J.; Lu, Y. A Reconfigurable Bayesian Adaptive Interference Mitigation SIC Receiver With Reinforcement Learning. IEEE Access 2021, 9, 15420–15434. [Google Scholar]
- Shahriar, M.; Lichtman, M.; Poston, J.D.; Amuru, S. Intelligent Anti-jamming Method for LEO Satellite Communication System Based on Reinforcement Learning. AEU—Int. J. Electron. Commun. 2021, 138, 153567. [Google Scholar]
- Yang, T.; Chen, Z.; Cao, G. Q-Learning-Based Dynamic anti-Jamming Transmission Design in Device-to-Device Assisted Vehicular Ad Hoc Networks. IEEE Trans. Intell. Transp. Syst. 2021, 4, 2193–2202. [Google Scholar]
- Li, W.; Chen, H.; Shi, R.; Gao, J. Anti-jamming stochastic gradient descent-based optimization deployment algorithm for wireless sensor networks. IET Wirel. Sens. Syst. 2021, 6, 536–543. [Google Scholar]
- Liu, J.; Xue, Y.; Shen, X. Reinforcement Learning for Active Interference Cancelling with Power Control in mmWave Cellular Networks. IEEE J. Sel. Areas Commun. 2020, 9, 2015–2027. [Google Scholar]
- Luo, Y.; Shi, S.; Ye, F. Anti-interference access control method based on the adaptive Q-learning algorithm for the satellite-IoT network. Comput. Electr. Eng. 2021, 91, 107072. [Google Scholar]
- Bout, E.; Loscri, V.; Gallais, A. How Machine Learning Changes the Nature of Cyberattacks on IoT Networks: A Survey. IEEE Commun. Surv. Tutorials 2022, 1, 248–279. [Google Scholar] [CrossRef]
- Wang, T.; Li, Y.; Wu, Y.; Tony, Q. Secrecy driven Federated Learning via Cooperative Jamming: An Approach of Latency Minimization. IEEE Trans. Emerg. Top. Comput. 2022, 99, 1. [Google Scholar] [CrossRef]
- Wang, Q.; Sun, H.; Hu, R.Q.; Bhuyan, A. When Machine Learning Meets Spectrum Sharing Security: Methodologies and Challenges. IEEE Open J. Commun. Soc. 2022, 3, 176–208. [Google Scholar] [CrossRef]
- Slimane, H.; Benouadah, S.; Khoei, T.; Kaabouch, N. A Light Boosting-based ML Model for Detecting Deceptive Jamming Attacks on UAVs. In Proceedings of the 2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 26–29 January 2022. [Google Scholar]
- Zhang, G.; Li, Y.; Niu, Y.; Zhou, Q. Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q. Electronics 2022, 15, 2397. [Google Scholar] [CrossRef]
- Han, X.; Ma, L.; Wu, M.; Lu, Y.; Wang, S. A time-domain anti-random impulsive interference method based on reinforcement learning for wireless communication. IEEE Trans. Veh. Technol. 2018, 8, 7863–7877. [Google Scholar]
- Huang, L.; Ma, C.; Liang, J.; Xu, G. Parallel Q-learning Based Intelligence Anti-interference Method for Wireless Communication. J. Cent. South Univ. 2021, 4, 1286–1294. [Google Scholar]
- Zhang, Z.; Liu, J.; Xiao, P.; Liu, Y. Slot-cross Q-learning Algorithm for Intelligence Anti-jamming. J. Electron. Inf. Technol. 2015, 6, 1401–1405. [Google Scholar]
- Li, H.; Zhao, S.; Zhang, K.; Jin, D. Deep Reinforcement Learning for Anti-jamming in Cognitive Radio Networks. In Proceedings of the 2020 IEEE International Conference on Communications (ICC), Marrakech, Morocco, 12–14 December 2020; pp. 1–6. [Google Scholar]
- Wang, Q.; Chen, J.; Lin, F.; Meng, F. Reinforcement Learning-Based Anti-jamming Mechanism for Intelligent Transportation Systems. In Proceedings of the 2020 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Tempe, AZ, USA, 11–13 November 2020; pp. 1–6. [Google Scholar]
- Yao, F.; Li, Y.; Fan, T.; Sun, Y. A Q-Learning Based Anti-jamming Strategy for UAVs Communication System. In Proceedings of the 2018 International Wireless Communications and Mobile Computing Conference (IWCMC), Limassol, Cyprus, 25–29 June 2018; pp. 1294–1299. [Google Scholar]
- Su, H.; Pan, M.; Chen, H.; Liu, X. MDP-Based MAC Protocol for WBANs in Edge-Enabled eHealth Systems. Electronics 2023, 12, 947. [Google Scholar] [CrossRef]
- Xing, H.; Xing, Q.; Wang, K. Radar Anti-Jamming Countermeasures Intelligent Decision-Making: A Partially Observable Markov Decision Process Approach. Aerospace 2023, 10, 236. [Google Scholar] [CrossRef]
- Li, W.; Chen, J.; Liu, X.; Wang, X.; Li, Y.; Liu, D.; Xu, Y. Intelligent Dynamic Spectrum Anti-Jamming Communications: A Deep Reinforcement Learning Perspective. IEEE Wirel. Commun. 2022, 99, 1–7. [Google Scholar] [CrossRef]
- Machuzak, S.; Jayaweera, S.K. Reinforcement learning based anti-jamming with wideband autonomous cognitive radios. In Proceedings of the 2016 IEEE/CIC International Conference on Communications in China (ICCC), Chengdu, China, 27–29 July 2016. [Google Scholar]
- Morghare, G.; Bhadauria, S.S. An Effective Approach for Cognitive Radio Wideband Spectrum Compressive Sensing over AWGN/Rayleigh Channels. In Proceedings of the 2023 10th International Conference on Signal Processing and Integrated Networks (SPIN), Delhi, India, 23–24 March 2023; pp. 609–614. [Google Scholar]
- Ebrahimi, M.R.; Li, W.; Chai, Y.; Pacheco, J.; Chen, H. An Adversarial Reinforcement Learning Framework for Robust Machine Learning-based Malware Detection. In Proceedings of the 2022 IEEE International Conference on Data Mining Workshops (ICDMW), Orlando, FL, USA, 28 November–1 December 2022; pp. 567–576. [Google Scholar]
- Sayghe, A.; Anubi, O.M.; Konstantinou, C. Adversarial Examples on Power Systems State Estimation. In Proceedings of the 2020 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT), Washington, DC, USA, 17–20 February 2020; pp. 1–5. [Google Scholar]
- Vaccari, I.; Carlevaro, A.; Narteni, S.; Cambiaso, E.; Mongelli, M. eXplainable and Reliable Against Adversarial Machine Learning in Data Analytics. IEEE Access 2022, 10, 83949–83970. [Google Scholar] [CrossRef]
- Edwards, D.; Rawat, D.B. Quantum Adversarial Machine Learning: Status, Challenges and Perspectives. In Proceedings of the 2020 Second IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA), Atlanta, GA, USA, 28–31 October 2020; pp. 128–133. [Google Scholar]
Parameter | Value |
---|---|
Jamming timeslot length | 0.1/0.05 ms |
Transmission subslot length | 0.1/0.05 ms |
Sensing subslot length | 0.08 ms |
Learning subslot length | 0.02 ms |
The total number of transmission timeslots | 10,000 |
Number of available channels C | 10 |
Learning rate factor | 0.1 |
Discount factor | 0.5 |
Communication signal power P | 1 W |
Jamming power J | 5 W |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Niu, Y.; Zhou, Z.; Pu, Z.; Wan, B. Anti-Jamming Communication Using Slotted Cross Q Learning. Electronics 2023, 12, 2879. https://doi.org/10.3390/electronics12132879
Niu Y, Zhou Z, Pu Z, Wan B. Anti-Jamming Communication Using Slotted Cross Q Learning. Electronics. 2023; 12(13):2879. https://doi.org/10.3390/electronics12132879
Chicago/Turabian StyleNiu, Yingtao, Zhanyang Zhou, Ziming Pu, and Boyu Wan. 2023. "Anti-Jamming Communication Using Slotted Cross Q Learning" Electronics 12, no. 13: 2879. https://doi.org/10.3390/electronics12132879
APA StyleNiu, Y., Zhou, Z., Pu, Z., & Wan, B. (2023). Anti-Jamming Communication Using Slotted Cross Q Learning. Electronics, 12(13), 2879. https://doi.org/10.3390/electronics12132879