Prior to the advent of radio navigation during World War II [
1], pilots relied heavily on visual cues during the day (road maps) and on searchlights and bonfires at night for navigation. The introduction of radio navigation marked a significant advancement in aviation by allowing aircraft to determine their position more accurately. Ground stations used the very high-frequency (VHF) band to broadcast position-finding signals that aircraft latch on to determine their location. However, in areas with sparse ground station coverage, aircraft were forced to make deviations to flight plans to remain within the range of these broadcasts. At its peak, the United States had over 1000 ground stations dedicated to this purpose. The advent of GPS was truly revolutionary to aircraft navigation as it became fully operational and available for civil use by 1995. This satellite-based navigation system allows aircraft to obtain precise three-dimensional positioning information anywhere on Earth, eliminating the need for ground-based navigation aids and significantly enhancing the accuracy and reliability of aircraft navigation. GPS has become a cornerstone of modern aviation, with both civil and military sectors heavily dependent on it for navigation purposes [
2,
3,
4]. The widespread adoption of GPS is evident in the staggering number of receivers in use—by 2019, the United States alone reported over 900 million GPS devices [
5]. This extensive integration of GPS technology into aviation systems highlights its critical role in ensuring safe and efficient air travel while also underscoring the potential vulnerabilities that could arise from disruptions to GPS signals. The GPS is susceptible to malicious and unintentional interference because the signals from GPS satellites are extremely faint [
6].
According to the International Air Transport Association (IATA), the average number of GPS signal-loss events was 24 per 1000 flights before May 2023. This number spiked to 40 events per 1000 flights in August 2023 before dropping slightly to 34 per 1000 flights later that year. The majority of these incidents occurred in Eastern Europe and the Middle East due to conflicts in Ukraine and Syria, where GPS jamming and spoofing are used as defense mechanisms against drones [
7]. In August 2018, a civilian aircraft with passengers navigating in restricted visibility veered off course due to GPS interference and was saved at the last minute by the intervention of ATC, avoiding a mountain collision [
1]. In early 2022, Denver Airport experienced an interference event lasting over 33 h, affecting aircraft in an 8000 square-mile area around Denver at altitudes of up to 14,000 feet [
8]. Another episode at DFW lasted almost 48 h and caused an entire runway to shut down, with several flights diverted to other airports [
9]. Additionally, interference can significantly impact devices like Unmanned Aerial Vehicles (UAVs), affecting their GPS, compass, and other central control modules [
10,
11]. In December 2012, a passenger jet near Reno, Nevada, veered 10 miles off course due to military GPS jamming, necessitating air traffic control intervention to prevent a collision. The U.S. military regularly conducts GPS jamming tests, impacting civilian aviation [
1]. In 2017, 173 incidents of GPS interference were reported over six months, affecting various aircraft types in Southern California, Nevada, Utah, Arizona, and the Pacific Ocean. These incidents prompted the FAA to issue Notices to Airmen (NOTAM) to inform pilots of GPS disruptions during military tests, although this has caused anxiety among civilian aviation professionals [
12]. To address these issues, the FAA is upgrading to the Next-Generation Air Transportation System (NextGen), which will improve aviation safety by utilizing satellite-based navigation systems [
6]. The ADS-B is the cornerstone of this initiative.
1.1. ADS-B and ADS-B Databases
The ADS-B system works by periodically broadcasting aircraft state parameters without operator intervention by using other navigational systems to gather position, altitude, and velocity information [
13,
14,
15]. The data are available to anyone with appropriate receiving equipment and aim to enhance situational awareness for pilots and Air Traffic Controllers alike. The information transmitted by the ADS-B (segregated by message type) is outlined in
Table 1. ADS-B version 2 broadcasts state parameters and operational status messages that indicate the accuracy or quality of the transmitted positional information.
Operational status parameters include uncertainty metrics. The Navigation Integrity Category (NIC) indicates position accuracy, with higher values denoting greater precision. NIC values range from 0 to 11, reflecting the containment radius of the aircraft position [
16]. NIC superseded the earlier Navigational Uncertainty Category (NUCp) with the introduction of ADS-B Version 1. NIC values indicate the containment radius of the aircraft’s position. NIC values and containment radius are inversely proportional; a higher NIC indicates a lower containment radius, while a lower NIC indicates a higher containment radius. A value of 11 has the least containment radius of about 7.5 m, while a value of 0 indicates an infinite containment radius, indicating a complete loss of position. The FAA considers NIC values of 7 and above as reliable positions [
13,
17].
Since 2013, the OpenSky Network has operated as a non-profit, crowd-sourced initiative that collects aviation data globally using off-the-shelf ADS-B receivers. The network processes and stores this information in a central database. The collected data included aircraft positional details (both airborne and surface), identification, velocity, operational status, and uncertainty metrics. This information is transmitted by ADS-B-equipped aircraft when within range of the Opensky network’s volunteer-operated sensors. OpenSky Network data has been used in several applications like ADS-B error and fault diagnosis, aircraft performance evaluation, ADS-B data validation, aircraft position multilateration, security analysis, and air traffic modeling [
18,
19,
20,
21,
22]. The Open Sky Network’s historical database is available for research and non-commercial use by applying for an educational account at their website [
23]. For these reasons, the OpenSky Network is chosen as the ADS-B data source. The OpenSky Network uses Apache Impala, a popular Hadoop Distributed File System (HDFS) database that is capable of handling the large volumes of data generated by the ADS-B networks [
24].
1.2. Current-State-of-Research
Current research on detecting GPS interference can be grouped into data-driven approaches, satellite-based techniques, and receiver-based methods. Murrian et al. [
25] use Low Earth Orbiting (LEO) to detect the presence of interference from ground-based sources by using a Software Defined Radio (SDR) that listens for signals on the L1 and L2 GPS bands. The idea is that GPS signals originate from outer space and are faint. However, interference sources are much stronger and originate from the Earth. Such instances were recorded and localized using the Doppler shift. SWEPOS, a network of satellites, is used to monitor and detect Global Navigation Satellite Systems (GNSS) interference by analyzing the historic Signal-to-noise ratio (SNR) of different GNSS, including GPS, GLONASS, etc. The historic SNR characteristics of multiple satellites, in combination with statistical methods, help differentiate RFI sources. The detection capabilities in both simulated and real-world scenarios are shown by Abraha et al. [
26].
Methods also focus on using the GPS receiver to detect interference. O’Mahony et al. [
27] used received in-phase and quadrature samples and employed an ML-based approach to detect interference in Edge devices, using simulated SDR data and is useable by resource-constraint edge devices like the Raspberry Pi. Other methods, like Sun et al.’s [
28], use a re-arranged Wavelet–Hough transform to detect common interference signals like sweep and continuous waves. While previous approaches focused on GPS interference sources from Earth, Patil et al. [
29] investigated space-based GNSS interference with a network of 43 frequency receivers in the US and Europe and identified a power spike at 1268.52 MHz, which was traceable to satellites.
ADS-B data-driven approaches rely on data from ADS-B/ADS-B databases to analyze and find patterns indicative of GPS interference. Using a jammer and aircraft on the ground, Lukevs et al. [
30] recorded the ADS-B transmission of an aircraft and found that NACp dropped from an acceptable value of 9 to below 7. Liu et al. [
31] analyzed pilot reports of interrupted GNSS service and recorded ADS-B messages from a test flight during a GPS interference exercise conducted at Edwards Air Force Base. The main finding was similar to Lukevs et al.’s: a combination of low NIC values and ADS-B dropouts is typical of a GPS interference event. The authors also found that as the jammer was moved farther from the aircraft, the NACp gradually recovered and stabilized at a distance of 275 m (902.2 Feet). The main finding of this work was that the NACp values dropped, and there were gaps in the transmission of ADS-B messages when the aircraft’s GPS was affected by an RF interference source. Ala et al. [
32] builds on the finding that there is a gap in the continuous transmission of ADS-B messages due to GPS interference. A moving average of NACp is calculated and a threshold of 0.135 (moving average of NACp) is established. Readings that exceed this threshold suggest potential GPS jamming. The loss of messages is leveraged to find the likely location of the jammer by Jonavs et al. [
33]. First, they correlate and rule out other causes of interference, such as military testing, constellation or satellite failure, and space weather, using relevant data sources. Once these are ruled out, they assume RF interference as the cause and use the Friis transmission equation to estimate the jammer’s position, assuming that the start of the gap in message reception is the point closest to the Jammer. Lui et al. [
34] has created a real-time monitoring system capable of updating probabilities every 30 s and is capable of localizing the location of the jammer within 20 min of interference onset, using real-time ADS-B data. The authors divide the airspace into sections using the Bayesian updating algorithm, updating the probability of GNSS interference every 30 s based on NIC values. The jammer’s location and power are estimated by minimizing the difference between estimated and measured jamming power using Friis’s formula, refined interactively with the Gauss-Newton method. Research in this category is consistent in finding the pattern of GPS interference, which is a gap in time with a drop-in NIC around the gap.
A summary of the findings from the literature review is provided in
Table 2. Satellite and receiver-based techniques have shown effectiveness in localizing interference, leveraging signal characteristics, and Doppler shifts. Receiver-based techniques that leverage ML, and signal processing may be costly, invasive (require modification to existing equipment), and necessitate testing before they can be implemented in critical applications, which may not always be practical for widespread use. In contrast, data-driven approaches, particularly those using open-source databases, offer a more cost-effective and non-invasive alternative. The analysis of ADS-B data has resulted in the identification of a pattern, which is a gap in ADS-B data, with a drop in NIC from above seven to below seven on either side or both sides of the gap by multiple studies [
25,
30,
32,
33,
35]. Our proposed methodology focuses on the use of ML to identify the GPS interference pattern in ADS-B data. Existing methods use complex convolutional neural networks in addition to conventional methods like logistic regression to detect GPS interference using ML [
36]. Our methodology builds on these methods and uses simpler, conventional algorithms to detect GPS interference. We focus on doing so in an efficient manner that may be suitable for real-time application. This work will focus on answering the following research questions:
What are the characteristics of NIC during GPS interference events, and can the patterns described in the existing literature be observed in actual GPS interference incidents?
Which ML algorithm is able to detect the GPS interference pattern accurately?
Which algorithm is computationally inexpensive?
This paper is organized into the following sections: The method section explains how data was acquired and analyzed to understand the properties of NIC during a GPS interference event. It details the challenges faced during data acquisition, like server timeouts and large file sizes, and how they were overcome using a Python script. The creation of synthetic ADS-B data to reflect real-world conditions and the training process of ML models to detect GPS interference patterns, aiming for a balance between computational efficiency and prediction accuracy, is explained. Results: This section details the outcomes of the model training process and results, providing insights into model performance during the training phase, variance testing, real-world applicability, and computational efficiency. Conclusion: This section concludes the investigation into GPS interference through ADS-B data; it also documents future work.