1. Introduction
In recent years, obstacles between communication transmitters and receivers have posed significant challenges to their high-quality communication, which even results in the disconnecting of communication link. Unmanned Aerial Vehicles (UAVs) are widely utilized to alleviate the above problem due to the characteristics of flexible deployment, all-weather application, and low cost. The term UAV mainly refers to radio remote control technology and automatic program control technology that enables unmanned aircraft. Due to the limited technology and economic requirements, UAVs were initially only used in military applications. Recently, the manufacturing costs have gradually decreased, and UAVs have begun to enter people’s daily lives. UAV sensor networks are becoming increasingly important because of their low cost, comprehensive coverage, flexibility, and availability of small-scale sensors.
However, in addition to these effective applications that greatly benefit daily life, some safety hazards and social problems also need to be solved. For example, many civil UAV flights are “black flights”, which means they are flown without an airworthiness certificate from the civil aviation administration or a license issued by the relevant authorities. This makes the necessary communication subject to human-induced interference, resulting in harmful incidents such as interference with normal aerospace communication by non-cooperative UAVs. Therefore, avoiding such interference through reasonable and effective methods is a major topic that needs to be studied. The first step in this topic is to monitor as accurately as possible whether there is non-cooperative UAV interference and whether the interference affects the normal flight of cooperative UAV. In terms of communication, monitoring channel quality and non-cooperative UAV can be achieved by analyzing and comparing the received signals of receivers for evaluation.
Channel quality evaluation is a key technology for UAV wireless communication systems, and the signal-to-noise ratio (SNR) is a significant metric to assess the channel quality of UAV communication [
1,
2,
3,
4,
5,
6]. SNR evaluation plays an important role in many wireless communication systems, such as signal detection, power control, adaptive modulation and demodulation, etc. Obtaining SNR evaluation results accurately in real-time can improve system performance to some extent. In addition, SNR evaluation technology has also been widely used in fields such as electrostatics and the Internet. In the transmission of modulated signals, SNR generally refers to the ratio of the average power of the carrier signal at the output end of the channel, that is to say, the average power of the noise in the channel at the input end of the receiver. Because of its direct correspondence with the communication bit error rate, SNR becomes an effective evaluation standard for the reliability of the channel state and communication quality; it is necessary prior information for many signal processing algorithms or technologies. The real-time performance and accuracy of SNR evaluation directly affect the performance of communication systems [
7,
8,
9,
10,
11].
Generally, the currently available SNR evaluation algorithms can be divided into two categories, one is the data-aid SNR evaluation algorithm that requires some prior information as an auxiliary condition (e.g., auxiliary data, modulation method, etc.), and this type of algorithm will increase the overhead of the communication system to varying degrees; the other is the non-data-aid SNR evaluation algorithm that does not require the knowledge of auxiliary information. The non-data-aid evaluation algorithm ensures the stability of the evaluation results, and this type of algorithm generally employs some observation data [
12].
Specifically, the traditional SNR evaluation methods contain the maximum likelihood ratio-based SNR evaluation algorithm [
13,
14], Second-order and Four-order Moments (M2M4) algorithm [
15], etc. Deep learning (DL) has been applied in various scenario, and has provided significant developments for these fields [
16,
17,
18]. Recently, S. Zhang and Z. Bao proposed an adaptive spectrum sensing (ASS) algorithm [
19]; Li et al. proposed an algorithmic model that fused convolution neural network (CNN) and long short-term memory (LSTM) network [
20], which improved the performance of SNR evaluation relative to the traditional algorithm. However, the existing DL-based SNR evaluation methods are at the expense of computation complexity and give less attention to the two-dimensional features of signals. Meanwhile, the lack of open dataset of UAV control signals also increases the difficulty of model training. To sum up, it seems indispensable to build a two-dimensional dataset of UAV control signals, thus containing enough salient features and avoiding too much interference information. In addition, the corresponding model design is also a time-limited task.
Motivated by the information mentioned above, this paper proposes an intelligent SNR evaluation method based on a single-way convolution neural network, which generates UAV signal datasets by converting one-dimensional signals into two-dimensional ones, trains neural networks of different structures on them, and finally confirms an evaluation network model that can perform a smaller SNR error on the input signals, effectively enhancing the SNR assessment, allowing the neural network model to pay more attention to signal feature information [
21]. Specifically, the algorithm model is firstly presented and is applied in the field of SNR evaluation of UAV frequency hopping signals. In addition, this paper also proposes a more optimised two-way convolution neural network-based intelligent SNR evaluation method based on a classifiable model for better and more extraction of signal features. The main contributions of this paper are concluded as follows:
(1) A two-dimensional dataset of UAV remote control signal is constructed and expanded. Firstly, a one-dimensional frequency hopping signal is generated based on the UAV frequency hopping communication system, and then the one-dimensional signal data are converted to two-dimensional signal data based on the signal time-domain diagram method.
(2) The feature fusion method is considered to combine features of different levels, then the proposed model can further classify as many features as possible and reduce randomness.
(3) A two-path convolution neural network is proposed that fuses the features extracted from two neural networks with different structures, and the proposed model can further improve the accuracy of SNR evaluation.
2. Related Work
In this section, the works related to deep learning methods based on SNR evaluation are introduced, mainly including related works on SNR evaluation and DL-based schemes. Finally, we analyze the problems existing in the related works.
2.1. SNR Evaluation
Typical SNR evaluation methods include the maximum likelihood (ML) ratio-based SNR evaluation algorithm [
13], Second-order and Four-order Moments (M2M4) algorithm [
15], etc. In recent years, S. Zhang and Z. Bao proposed the Adaptive Spectrum Sensing (ASS) algorithm [
19]. Yuhang Sun et al. proposed an algorithmic model that fuses convolution neural network (CNN) and long short-term memory (LSTM) network to improve the SNR evaluation accuracy. The algorithms introduced above are described next, respectively.
In the case of accessible auxiliary data, the maximum likelihood estimation method is the most satisfactory. Its basic premise is to obtain the joint probability density function with the received channel based on the probability density function of the noise. It is a typical DA SNR estimation algorithm. The advantage of this estimation method is that the estimation is accurate. The result is close to the actual value under high SNR conditions. However, the disadvantages are: the computational effort is enormous, a large deviation occurs when the SNR is low, and carrier synchronization is necessary.
The M2M4 SNR estimation method is a self-adaptive algorithm that uses the relationship between the variance and peakedness of a signal to perform SNR evaluation. It is a moment-based SNR estimator proposed in modulated communication signals. This algorithm solves the problem of estimating the SNR of complex sinusoidal signals with deterministic but unknown phases under additive Gaussian noise interference and is an NDA estimation algorithm. Its advantages include simple calculation and insensitivity to carrier phase deviation. Since it is a cumulative volume algorithm, it has the feature that the evaluated results are better with increased data volume. However, the estimation error of this method for the SNR ratio is positively correlated with the SNR ratio and modulation order.
The ASS algorithm evaluates the SNR by estimating the noise and signal power separately. A significance probability level constant representing the fluctuating characteristics of additive Gaussian white noise is set. The frequency domain samples of the received signal are divided into low- and high-frequency bands. The band with lower average energy is selected, and the selected part is again divided into high- and low-frequency bands until the significance probability level of this band is less than the set constant. This method results in the estimated received signal power. The noise power is estimated oppositely, requiring the significance probability level of the last obtained band to be greater than the constant. Finally, the two are calculated using the SNR formula to obtain the estimated value.
Finally, the CNN-LSTM algorithm is analyzed. The segmented signal is extracted using a CNN-LSTM network to obtain feature vectors. Then the features are fused by a fully connected layer, and the SNR value is calculated. This method uses one-dimensional signal data for estimation and has superior performance, but the complexity is relatively high. The comparison of the related SNR evaluation methods are concluded in
Table 1.
2.2. DL-Based Scheme
As deep learning becomes more and more intelligent, its influence has been expanding, and it has a complete theoretical framework and some practical experience base. It has been applied very effectively in various fields, such as target detection, speech recognition, video recommendation, text data analysis, medical diagnosis, treatment, etc. The applications of deep learning methods in communication are the spectrum sensing of signals, classification of signal modulation methods, etc.
There are also a few experiments combined with deep learning in the study of SNR evaluation. The DL-based algorithm proposed by Yang K et al. [
22] indirectly performs SNR estimation by estimating the target signal amplitude using a neural network structure containing five convolution layers for one-dimensional signal data. First, the received signal is segmented according to the input dimension of the deep neural network. Then, the trained deep neural network is used to estimate the signal amplitude of each segment. Next, the estimated amplitude of the target signal is calculated based on the estimated amplitude of each segment. Finally, calculate the SNR of the received signal. This method can be applied to more modulation types, and the effective range of SNR evaluation is broader and more robust to adapt to phase and frequency shifts. The idea of deep learning also inspires research on this topic; two-dimensional images of the signal are used as the input of the neural network, then improve the neural network model to make the SNR ratio evaluation more accurate.
2.3. Existing Problem of Related Works
The existing SNR evaluation methods involving deep learning are relatively complex and pay little attention to the image features of the signal. For deep learning, in terms of datasets, because there are fewer algorithms for SNR evaluation, there is a lack of publicly available datasets for UAV communication RF signals, which increases the difficulty of model training work from objective conditions. As data are received in a one-dimensional sequence, the coupling of signals in the spatial dimension cannot be too firm when generating two-dimensional datasets. In terms of network models, there is no neural network that uses two-dimensional data of the signal as input for SNR evaluation. In summary, there is a need to find a method to generate a dataset that can extract the signal features while avoiding too much interference information and complete the construction of a targeted neural network model, which is the primary goal of this investigation.
3. SNR Evaluation Based on Proposed TP-CNN
In the section, the proposed SNR evaluation method based on DL is discussed. Firstly, a novel UAV remote control signal dataset is generated. Based on the constructed dataset, a two-path CNN model is proposed to train the dataset next.
3.1. The Generation of UAV Remote Signal Dataset
Before generating UAV remote signal dataset, the drone frequency hopping signal should first be generated. Frequency hopping communication system mainly includes a signal modulator, frequency synthesizer, and pseudo random (PN) code generator, where the PN code generator continuously and randomly generates PN code, and inputs it into a frequency synthesizer to control the frequency generation. The transmitted signal is obtained by mixing the frequency generated by a frequency synthesizer with the baseband modulated signal, and the final mixing output signal is frequency-hopping signal, which is shown in
Figure 1.
To obtain the frequency hopping signal of UAV, Matlab is used for simulation modeling. Firstly, the baseband signal of the system is obtained by MSK modulation after using the 01 discrete signal generated randomly as the source bit. At the same time, carrier signals of different frequencies required by the frequency hopping communication system are generated. The hop frequency is 49, and the baseband signal is modulated with different carrier signals in each frequency hopping period. Then the drone frequency hopping signal is obtained. Finally, The signal passes through the small-scale flat fading Rayleigh channel and is, respectively, superimposed with noise with different signal-to-noise ratios of dB. Here, the superimposed noise has a 1 dB interval with the signal-to-noise ratio of the effective signal, starting at −10 dB and ending at 10 dB. After down-sampling, the one-dimensional signal sequence dataset is obtained. The frequency hopping signal is denoted as:
where
N is the number of frequency points,
A denotes the amplitude of signal,
represents a rectangular window of width
,
is the period of a frequency hopping signal,
describes the set of frequency of drone hopping signal,
denotes the initial phase and
.
The expression of the signal after modulation is expressed as:
where
denotes the signal obtained after the original baseband signal data are modulated by the modulator,
expresses the signal sent out by the transmitter. The definition of SNR is defined as:
where
denotes the operation of calculating the average power and
signifies the variance of noise.
After obtaining one-dimensional signal dataset, one-dimensional data need to be converted into two-dimensional data before inputting into neural network for training. The existing conversion methods mainly include Toplitz matrix, frequency waveform diagram and time waveform diagram. The time waveform diagram is applied to generate two-dimensional data in the paper. The time domain waveform diagram of a signal represents the curve of the signal over time. The horizontal axis represents the order in time, and the vertical axis represents the amplitude of the signal at the current moment. Compared with frequency domain diagrams, the envelope of time domain diagrams can more easily show the energy or power characteristics of the signal. In addition, the Toplitz matrix method associates the unrelated data in a one-dimensional sequence spatially, increases the processing of irrelevant information in the dataset by the neural network model, and the accuracy of the final classification results is affected.
3.2. Feature Fusion
The idea of integration refers to the rationalization of different approaches to problem solving in order to achieve better results. Fusion includes, but is not limited to, the direct integration of results; it can also occur at various stages of problem solving, with the aim of gaining a stronger affirmation of the findings at that stage. It has been used in many problems, particularly in classification problems. Specific methods include voting mechanisms, weighted averages, etc.
To optimize the performance of the deep learning algorithm for signal-to-noise evaluation, further improvements to the model are made. In the experience of previous work, many neural network models have been used to influence the classification results by modifying the depth, parameters, and other information based on a single-way neural network similar to the one mentioned. Additionally, the comparative experiments presented show that none of the participating models performed optimally on all SNR, i.e., the same test image with different convolution kernels may yield inconsistent classification results. The difference in the convolution kernels means that the features are extracted differently and, therefore, the features are extracted differently.
Feature fusion is the combination of different levels or branches of features [
23] and is a ubiquitous part of modern network architectures, allowing the proposed model to use as many features as possible for further classification and reducing randomness. The fusion of multiple features is an effective method for visual and multimedia applications. Therefore, if one feature extraction method can be overlaid with another, the effect of numerous feature complementary can be achieved, avoiding the omission of small features. Therefore, we argue that using them together is better than using any one alone and that this complementary fusion of elements is formed at the fully connected level. A literature review shows that even simple fusion schemes can significantly improve results by adding or multiplying features as long as they are complementary [
24]. For the networks with deep layers (e.g., ResNet [
25]), deep features are often fused with shallow features to weigh the results better globally and locally. The idea of the Inception module in GoogleNet [
26] is also to collect multi-scale data information by stacking different networks.
The first method is to join the features to be fused, and the dimension of the fused vector is the sum of the dimensions of the original vector, which is shown in
Figure 2a. The second add method combines the feature vectors into a composite vector, and the size of the fused vector is the maximum number of dimensions of the original vector, which is shown in
Figure 2b. The second add method combines the feature vectors into a composite vector, and the size after fusion is the maximum number of dimensions in the initial vector.
3.3. Proposed TP-CNN Model
In the proposed model, the original signal data are fed into two separate convolution modules with different structures and parameters, and the two modules are parallel and uncorrelated. After extracting the features using different convolution kernels, the features are fused. The fused feature vectors are obtained by adding the feature vectors from the two neural networks using the concat method. Then, all the features are concatenated and fed into the fully connected layer for linear transformation. Based on the investigating of baseline methods, the performance of the single-way convolution neural network with a convolution kernel size of 7 × 7 is worse than that with a convolution kernel size of 3 × 3, the results are still acceptable and the F1 values are better than those with a convolution kernel size of 5 × 5. Therefore, the multi-features can be extracted as an aid to the original model to obtain better results. The above analysis shows the structure of the proposed Two-Pathway CNN(TP-CNN) in
Figure 3. The pseudo code of the proposed algorithm is shown in Algorithm 1.
Algorithm 1 Proposed SNR evaluation algorithm. |
Input: UAV remote control signal. Output: Classification result . - 1:
Obtain frequency hopping signal based on UAV frequency hopping communication system. - 2:
Transform the signal dimension from one-dimension to two-dimension based on signal time domain diagram method. - 3:
Pre-process the two-dimension data and obtain the dataset D. - 4:
Train the image dataset at batch-size B based on proposed TP-CNN model. - 5:
Calculate the MAE and MSE error according to predicted value and true value y.
|
Figure 4 shows the work process. Based on the location in the schematic, the model is divided into two layers, both of which use the same input and 128 × 128 data size, and use the maximum pooling method with a convolution kernel shift step of 1. The upper layer consists of a convolution layer and a pooling layer, with the convolution kernel size set to 7 × 7, which extracts larger features. The lower layer consists of a convolution layer and a pooling layer, with the convolution kernel size set to 3 × 3, which is mainly used for extracting smaller and more subtle features. The networks with different convolution kernels are fused by padding the edges of the image to ensure that the feature maps are of the same size.
Figure 5 shows an example of a 12 × 12 input size with edge fill. The blank in the center indicates the input size. In order to fuse the features, the input data needs to be filled. The blue part of the figure indicates the fill. In our experiments, we have chosen to fill with 0.