1. Introduction
With the rapid development of bridge construction, the management and maintenance of bridges have become a key issue. In the past, the health monitoring of bridges was mostly performed by manual inspection. With the increasing number of bridges being built and the increasing complexity of bridge structures, the traditional manual inspection exposes the shortcomings, such as a small monitoring range, large workload, and low inspection efficiency, which may also endanger the safety of inspectors under harsh environments. Therefore, a complete bridge structural health monitoring and early warning system [
1,
2] is needed. Such a system can realize real-time monitoring of field data, mitigate risk, and provide timely warning by promptly detecting early defects during the operation of bridges.
The current health monitoring of bridges is mostly based on overall monitoring. This method is mainly based on the monitoring of bridge service environment, operational loads, and structural response data to achieve bridge structural conditions and safety performance [
3]. However, compared with the overall condition, monitoring local structures such as cables and suspenders is also important for bridge operation and maintenance. Therefore, the local damage monitoring methods should be introduced in conjunction with overall bridge monitoring techniques. Acoustic emission (AE), as a non-destructive testing technique [
4], can be applied. This technique can record the signals generated by the target acoustic emission process in real time with deploying sensors to the target for data acquisition. By applying it to bridge cable monitoring, only a small number of sensors are required to achieve efficient detection of the broken wires, thus effectively assessing the damage level of bridge cables. Researchers in the field of acoustic emission monitoring have conducted numerous studies based on different types of cable damage. Such damage analyses include corrosion of the natural environment in which the bridge is located [
5,
6,
7,
8], fatigue loading from long-term vehicle passage [
9,
10,
11], and external tensile breaking action [
12,
13,
14,
15,
16,
17,
18].
Today, the wave of artificial intelligence is sweeping the world. Machine learning and deep learning models have become common research hotspots in the fields of artificial intelligence. They have been widely used to solve complex problems in engineering applications and scientific fields with theories and methods. For instance, Son et al., 2021 proposed a deep learning model to locate the damaged cables and conduct the severity assessment of cable-stayed bridges [
19]. A machine learning-based approach was developed to detect bridge cable damage subjected to stochastic effects caused by corrosion and fire [
20]. Han et al., 2019 applied a deep learning algorithm to distinguish the AE signal from damage signals [
21]. Wu and Li, 2022 implemented a method considering both qualitative and quantitative analysis [
22]. They employed AE rate process theorem and a machine learning algorithm to evaluate the damage of masonry. However, incorporating machine learning or deep learning algorithms in the signal recognition of broken wires of bridge cables has not been thoroughly investigated. It should be noted that it is not simple to apply machine learning and even deep learning models to analyze acoustic emission signals for bridge monitoring. Training is the first fundamental requirement for machine learning to build useful models to analyze data. However, in acoustic emission signal monitoring and detection of wire breaking, the available samples are often far from sufficient for proper training in machine learning models. This is because such samples are normally generated and measured from testbeds in a controlled laboratory environment. Considering the testbed setup and data measurement, the data generation process will be slow, expensive, and with limited supply. Due to this data issue, the transfer learning technique has been proposed to reduce the required samples for training [
23]. However, transfer learning can complicate the modelling process during the application. Therefore, building useful models with limited training data becomes a significant challenge for applying machine learning (deep learning) to the acoustic emission tests of bridges.
Another problem is the feature extraction for acoustic emission signals. We may analyze the acoustic emission signals from time, frequency, and time-frequency domains. Hence, there are many sets of features/parameters that can be extracted for analysis. So far, the feature selection for analysis is often by experience. Generally, only common acoustic emission parameters are analyzed. Some studies are only based on statistical analysis of parameters to distinguish damage. Surely, unsuitable features having been extracted for acoustic emission signals can have an impact on the accuracy of signal identification even when powerful machine learning (deep learning) models are used.
There are many machine learning models available for acoustic emission signal analysis. Some methods such as support vector machine (SVM) and decision trees are easy to be implemented but may only provide simple classification functions. Others such as long short-term memory (LSTM) are powerful with deep learning capability but the modelling process may be more complicated and needs a large number of training samples. Since the acoustic emission analysis may be required for the whole process (i.e., from cable breaking detection to health status estimation), there can be many factors affecting the analysis. Hence, the proposed model should strike a balance between multiple factors to ensure effectiveness and accuracy.
In this paper, we propose to use machine learning models for signal identification of wire breaking in bridge cables. According to the problems listed in the above paragraphs, we first describe our testbed for generating acoustic emission signals in
Section 2. Owing to the limited number of signal samples available, the synthetic data approach is proposed to solve the problem. An algorithm is developed to generate the simulated acoustic emission signals for training machine learning models. From the time, frequency, and time-frequency domains, 22 features being extractable from the acoustic emission signals are described in
Section 3. Such a comprehensive features list is used as input to machine learning models. In
Section 4, the structure of a deep learning LSTM model is described. LSTM is powerful for waveform analysis and needs a large number of training samples. It can also be used to demonstrate the usefulness of simulated acoustic emission signals being generated by the proposed algorithm. In
Section 5, the performance of LSTM and other machine learning models are compared. All machine learning models are trained with the simulated acoustic emission signals. As all machine learning models (including LSTM) have the desired performance, it demonstrates the simulated acoustic emission signals to be effective. Finally, conclusions are given in
Section 6.
3. Acoustic Emission Signal Feature Extraction
In order to create a comprehensive picture of the signals, this study performed feature extraction of the wire-break and non-wire-break signals from various perspectives, i.e., time domain, frequency domain, and time-frequency domain analysis. The features extracted in the time domain include the acoustic emission parameters of the signal and the statistical parameters of the signal waveform. They are shown in
Table 2.
The frequency domain feature extraction is based on Fast Fourier Transform (FFT). After transforming the discrete acoustic emission signals from the time domain to the frequency domain, frequency domain feature parameters can be easily obtained. The parameters of the frequency domain feature extraction are shown in
Table 3.
The time-frequency analysis method of acoustic emission signal uses continuous wavelet transform. The essence of continuous wavelet transform is the process of wavelet function to process the measured signal under different time domain and frequency domain windows corresponding to different scale factors. When the scale decreases, the time domain window becomes narrower and the frequency domain window becomes wider, so that the high frequency components of the signal can be extracted, while when the scale increases, the time domain window becomes wider and the frequency domain window becomes narrower, so that the low frequency components of the signal can be extracted. Different scales correspond to a filter set, which can realize the filtering of different frequency components, and different translation factors correspond to different positions of the time domain window, which can realize the processing of different time positions of the signal. Therefore, the continuous wavelet transform is an adaptive time-frequency analysis method, which can realize multi-resolution analysis of the signal.
The continuous wavelet transform of the signal is based on the Morlet function and the decomposition scale is 7. The wavelet coefficients of each scale obtained by the continuous wavelet transform are further calculated as the energy share of each scale, and the energy share is used as the wavelet characteristics of the signal. Using the broken wire signal as an example, let the total decomposition scale of the broken wire signal be
, and the wavelet coefficients of each scale decomposition be
, the further energy of the signal at each scale
can be calculated as:
where
denotes the signal length, the proportion of signal energy
per scale is:
After the computational analysis, the wavelet features of the broken wire signal and the non-broken wire signal differ significantly on the second, third, fourth, sixth, and seventh scales. These five scales are used as the features extracted from the signal time-frequency analysis.
Based on the above multi-angle feature extraction, a comprehensive feature vector characterizing the acoustic emission signal can be constructed, and this feature vector is the basis for generating samples to establish the broken wire signal recognition model.
4. Long Short-Term Memory (LSTM)
From the discussion in
Section 2, it is apparent that there are mainly two kinds of acoustic emission signals, i.e., broken wire signals and non-broken wire signals. Non-broken wire signals are the acoustic emission signals generated from the wires under certain pulling forces, but the wires are still non-broken. Apart from providing alarms by detecting the wire broken signals, a machine learning model may be possible to provide the health status of wires. Determining the health status of wires is similar to estimating the probability of a wire broken signal to happen within a predefined time period. By analyzing currently received signals, clearly, it will be much more complicated than only detecting the wire broken signals. Such applications require more powerful machine learning models and deep learning capability will be needed. In this study, long short-term memory (LSTM) was chosen as a typical powerful machine learning model which also demands a large number of training samples.
LSTM is an artificial neural network often used in the fields of artificial intelligence and deep learning [
27]. It is a chain structure containing a large number of repetitive neural network modules, three gates (input, forgetting, and output gates), and the same memory cells as the hidden state. The internal structure of a LSTM cell is shown in
Figure 12.
Let the number of cells in the hidden layer be
. Given a small batch of input samples
at time
, the number of samples
, the number of inputs
, and the hidden state
at the previous time
t − 1, the status of input gate, forgetting gate, and output gate at time
can be calculated as:
In the above equation,
,
,
,
,
and
are weights and
,
and
are biases. The candidate memory cell uses a different activation function than the three gates, and the tanh function it uses can obtain an output in the range of [−1, 1]. From the above figure, the candidate memory cell output for time
is:
In the above equation,
and
denote the weights and
denotes the bias parameter. The memory cell
at the current time
t carries the information of the memory cell at the previous time step and the candidate memory cell at the current time step,
can be calculated as:
Combined with the above analysis, the main role of the forgetting gate in the figure is to control whether the information in the memory cell of the previous time step can be passed to the memory cell of the current time step . The main role of the input gate is to control how the information from the input of the current time step t flows through the candidate memory cell to the memory cell of the current time step. If the output of the forgetting gate is kept as 1 and the output of the input gate is 0, the information in the past memory cells will be passed to the current time step over time, which is similar to a conveyor belt. Such a network design can cope with the gradient decay problem in RNN networks and better capture the dependencies in a time series where the time steps are far away from each other.
The working process of LSTM can be simply understood as follows: given the input value at the current time step, useful information will be filtered through the candidate memory cells under the control of the input gate for the current memory cell update, while the forgetting gate will control whether the information passed from the previous cell flows into the current memory cell, and the two parts of the retained valuable information, i.e., the updated memory, will be passed to the next LSTM cell module. The output gate controls whether the information in the memory cell is passed to the hidden state for use in the output layer, and is also connected to the next LSTM cell module. The interaction and control of the three gates achieves a longer-term memory of the input information.
The structure of the LSTM model built in this paper is shown in
Figure 13. For the previously extracted acoustic emission signal features, the main role of the input layer is to import the feature vector of each acoustic emission signal into the LSTM network. The LSTM hidden layer is responsible for further analysis of the feature vector of the input batch samples and passing the valuable information to the fully connected layer, which is mainly responsible for converting the dimension of the LSTM output vector into the dimension of the model label vector. The fully-connected layer is mainly responsible for converting the dimensionality of the LSTM output vector into the dimensionality of the model label vector, so that the loss function can be calculated. The final Softmax layer is mainly responsible for mapping the category scores’ output from the fully connected layer to a positive range, and then normalizing them to (0, 1) to obtain the probability of each category. The category to which each acoustic emission signal sample belongs is finally obtained from the output layer.
The deep learning model needs to set the relevant parameters of the model before training. According to the previously established LSTM structure, in order to avoid overfitting and increase the number of operations, the hidden layers of LSTM should not be stacked with multiple layers. The model structure parameters are as follows: the input layer has a total of 22 dimensions; the LSTM hidden layer contains 10 network module units, and the activation functions used include sigmoid and tanh functions; a fully connected layer includes 2 neurons for dimensional conversion; the output of the Softmax classification layer is connected to 2 signal categories. By tuning the model parameters, the LSTM model training parameters are shown in
Table 4.
5. Performance Evaluation
Three metrics (precision, recall rate, and F1-score) are used to evaluate the classification ability of the long short-term memory (LSTM) model using the simulated signals for training and measured signals for test set (
Table 5). Apart from the LSTM model we discussed in
Section 3, a set of machine learning models are used to demonstrate the feasibility of using the proposed synthetic data approach to solve the problem of insufficient training samples. They are support vector machine (SVM), particle swarm optimized support vector machine (PSO-SVM), multilayer perceptron, k-nearest neighbors (KNN), decision tree, and Naive Bayes models. The precision performance between machine learning models with simulated signals for training and measured signals for test sets are also compared. To further compare the performance between the machine learning models, they are used to classify the simulated signals with noise of SNR 30dB.
Precision (i.e., recognition accuracy) is the most common performance index in verifying machine learning models. It is defined as
and that of the non-broken wire signal is
The definition of
TP,
TN,
FP, and
FN are listed in
Table 5.
The recall rate, also known as the check-all rate, is a measure of coverage, and can measure how many broken/non-broken wire signals are identified accurately, using the broken wire signal as an example. It is calculated as follows:
and that of the non-broken wire signals is
The F-Score can be calculated by combining the precision
and recall rate
metrics as follows
When
= 1, it is the F1-score index. Owing to the limited number of measured signals (249 and 363 measured samples of broken and non-broken wire signals, respectively), the algorithm in
Figure 9 has been used to generate simulated broken and non-broken wire signals (each having 832 samples) to train the LSTM model described in
Section 3 and the measured signals are used for test set.
Table 6 shows the performance of LSTM on detecting both broken-wire and non-broken wire signals under such arrangement. One can observe that the LSTM model has rather good performance with the three metrics shown in
Table 6.
To further verify the effectiveness of the sample generation algorithm proposed in
Section 2, the performance differences between machine learning models were compared.
Table 7 shows the comparison of performance of several machine learning models using the simulated signals for training and measured signals for test set. From
Table 7, one can observe that all machine learning models have similar good performance under such arrangement. Hence, we arranged another experiment such that simulated signals with Gaussian white noise (SNR = 30 dB) were used as the test set.
Table 8 shows the performance of the machine learning models under such arrangement. From
Table 8, one can observe that some machine learning models such as SVM and decision trees will have large performance degradation in such a situation. It implies that more factors should be considered when choosing a machine learning model, e.g., the required number of training samples and performance under different situations.
Table 9 shows the F1-score performance of the machine learning models under both test set arrangement. One can observe similar phenomena. Nevertheless, all machine learning models will have good performance in the tests only if training samples are appropriate and their number is sufficiently large. It demonstrates the usefulness of the proposed simulated signal generation algorithm. A limitation of the investigated case is that cross validation is not performed due to a limited number of data samples. Future studies should incorporate cross validation when using the proposed approach. A larger number of data samples are encouraged for the variation.
6. Conclusions
Acoustic emission (AE) is a dynamic nondestructive testing method that is increasingly used in the local monitoring of bridge cables. In this paper, a testbed is described for generating the acoustic emission signals for signal identification testing with machine learning models. Owing to the limited number of measured signals being available, an algorithm is proposed to simulate acoustic emission signals for model training. A multi-angle feature extraction method was used to extract the acoustic emission signals and construct a comprehensive feature vector to characterize the acoustic emission signals. Seven ML models were trained with the simulated acoustic emission signals. As all machine learning models (including LSTM) provide desired performance, it shows the approach of simulated acoustic emission signals to be favorable. A limitation of the study is that the model has not been applied to real field data. Future studies are encouraged to test the model in practice.
Another limitation of the proposed simulated signal generation algorithm is that it relies too much on professional expertise. One example is the hand-crafted statistical features. Using powerful machine learning models such as LSTM may help us to extract overlooked features and consolidate the required model parameters. Consequently, it will reduce the model training time and improve the performance of machine learning models. To obtain the proper feature extraction, however, a sufficient large number of raw samples are required. As the raw sample generation is costly and only a limited number of samples (249 broken wire signals and 363 non-broken wire signals) have been recorded, it is difficult to currently carry out serious work on feature extraction. Nevertheless, we have planned to refine the proposed simulated signal generation algorithm.