1. Introduction
Ultrasonic bulk waves are commonly used to identify structural defects by applying “time of flight” measurement-based methods such as the pulse-echo method and the through-transmission method. They have been actively used in the industry because of its wide application and high precision at a relatively low cost. Mouritz et al. [
1] evaluated the fatigue damage of polymer-matrix composites used in ships by applying the pulse-echo method. Hao et al. [
2] employed a low frequency ultrasonic pulse-echo method to investigate aging of large generators by comparing and analyzing four types of stator bar insulation structures. Tian et al. [
3] performed the pulse-echo test to detect void defects in epoxy composite specimens. Lee et al. [
4] developed a rotational through-transmission ultrasonic imaging system to identify a cylindrical pressure vessel damage and obtained the clear scanned images.
In addition to detecting structural defects, the ultrasonic non-destructive testing methods also can be used to measure the thickness [
5,
6,
7] or elasticity [
8] of the material. Furthermore, ultrasonic flaw detection technique has been applied to food [
9] and medical fields [
10,
11,
12]. As described above, ultrasonic non-destructive inspection methods are capable of detecting defects on several structures and can be used for various purposes.
Since these methods can only inspect a local area, it is inconvenient and time-consuming to scan a region of interest with an ultrasonic probe. For this reason, using guided ultrasonic waves and applying array signal processing techniques have been studied to overcome this limitation. Guided waves in a thin structure have the advantage of having a long wave propagation distance with small attenuation [
13]. When a structural defect exists, it can be detected through the array signal processing technique by measuring the reflected wave from the ultrasonic transducer array.
However, guided ultrasonic waves such as Lamb waves have multi-modes with difference phase speed at a same excitation frequency and this mixed response due to the multi-modes makes it difficult to apply the array signal processing techniques. Giurgiutiu [
14] presented a mode-tuning technique by choosing an appropriate excitation frequency to excite a dominant single mode. Rose [
15] suggested a single mode-excitation method by proposing comb transducers which could be locked to a specific wavelength.
Yan et al [
16]. applied a delay-and-sum beamforming algorithm to identify the defect location in plates. Han et al. found structural defects in an aluminum panel by constructing time-frequency MUSIC beamforming power maps and structural damping was considered to improve the spatial resolution [
17]. These algorithms have a big advantage of excellent spatial resolution without the time-consuming scanning procedure, but they are difficult to apply to a real complex-shaped structure due to the numerous reflected waves and the complicated wave propagation characteristics.
On the other hand, according to the demand for the effective fault diagnosis that can be applied to real complex-shaped structure, research on deep learning-based fault diagnosis has been actively conducted [
18,
19]. Among deep learning artificial neural networks, CNN (Convolutional Neural Network) is one of the effective algorithms in representing and extracting spatial patterns. Because of the efficiency and high accuracy in image classification, the CNN algorithm is widely used in plant disease diagnosis [
20,
21,
22,
23,
24] and medical fields [
25,
26,
27,
28,
29], as well as fault diagnosis of mechanical systems [
30,
31,
32,
33,
34,
35,
36,
37,
38,
39,
40,
41,
42,
43]. Typically, bearings are the biggest cause of motor failures, so plenty of CNN-based fault diagnosis studies have been conducted for monitoring the bearing condition [
30,
31,
32,
33,
34,
35,
36,
37,
38]. Wang et al. conducted a study using three deep learning methods with data obtained by SCADA for fault diagnosis of power systems [
39]. Zhong et al. applied SVM (Support Vector Machine) to identify small faults in the gas turbine by training changes in performance parameters such as exhaust gas temperature and fan speed when a fault occurred in a gas turbine [
40]. In addition, researches have been conducted on detecting defects in composites using CNN training [
41], determining defects in weld joints [
42], and recognizing cracks in asphalt pavement [
43].
As CNN is being actively used, studies have also been conducted to solve overfitting problems. Zheng et al. achieved a high level of classification performance by obtaining an initial distribution of samples through a pre-training process and detecting outliers through an implicit regularization training process to solve overfitting [
44]. Ide et al. introduced sparseness to the input of rectified linear units to prevent unnecessary increase in the model parameters [
45]. This can reduce overfitting and improve generalization by preventing unnecessary output of ReLU. EISayed proposed a regularization method called SD-Reg to improve network intrusion detection systems used to detect unseen intrusion events and solve overfitting [
46]. This algorithm improved the performance over the existing L1 and L2 methods by using the standard deviation of the weight matrix. By solving overfitting of deep learning with these various methods, classification performance can be improved and deep learning can be comprehensively applied to various data.
In the case of the aforementioned beamforming algorithms, it is difficult to apply them when other sub-structures are attached to the specimen or when it has a complicated shape, since the reflected waves from the boundary can be mistaken for those from a defect. In order to solve this problem, in this study, a guided Lamb wave is excited on a plate with a defect and the measured wave reflected from the defect is 2D imaged. Then, CNN training is performed by labeling this image as the location of the defect. Even if an additional structure such as a stiffener is attached to the plate, as long as there is a difference in the measured data for each defect location, the defect location can be characterized by CNN feature extraction and it can be also applied to various mechanical systems.
As far as similar research, a study was conducted to apply the pattern of Lamb waves passing through defects on plates to the CNN algorithm [
47]. However, this method has limitations on the number and location of defects due to the constrains that defects should exist between the actuator and the sensor. In this study, there is no such limitation, and the sensor array is arranged in the order of “actuator-sensor array-defect” to obtain the defect-induced reflected wave. In order to improve the defect-detection performance, the proper excitation position and sensor locations are investigated. For the efficiency of the experiment, the structural defect is simulated by attaching a coin [
48], and difficulty in collecting a data set can be overcome since the simulated defect can be removed and attached easily.
In this study, an important factor that affects the accuracy of the results as much as the training data are the CNN architecture. Lecun et al. introduced the CNN in which the weights and biases of the convolution filter are automatically updated while reducing the error using the backpropagation method [
49]. This network is called LeNet-1. As research based on it is actively conducted, LeNet-5 with an improved performance has been proposed [
50]. In LeNet-5, the input data of size 32 × 32 passes through the convolution layer and the pooling layer twice, respectively, to create 16 feature maps of size 5 × 5. If this feature map is convolved with a 5 × 5 Kernel again, 120 feature maps of 1 × 1 size are created. All of them are connected to a fully-connected layer of size 84, and finally, when passing through an output layer of size 10, a training model with high performance can be obtained for the Modified National Institute of Standards and Technology database (MNIST) with class 10. Currently, numerous research on deeper and more effective neural networks such as AlexNet [
51], VGG 16, VGG 19 [
52], Goog-leNet [
53], ResNet 18, ResNet 50, ResNet 101 [
54], DenseNet 201 [
55] are in progress. In this study, we tried to implement a CNN network for defect classification by referring to LeNet-5, which has the simplest structure.
5. Result and Discussion
In order to evaluate the influence of elements in the designed network, the ablation test and comparison test are performed as shown in
Table 11 and
Table 12. For the purpose of investigation on the effect of the changes in the defect location and the excitation frequency, the classification accuracy for each location is calculated by verifying with data that do not participate in the training after 10 training sessions. To simplify the comparison, only six out of 16 defect cases are trained.
First, when training is conducted 10 times with 50 full data sets and verified with one datum, 100% accuracy is observed in all locations as shown in
Figure 23a. Next, in order to investigate the effect of the excitation frequency change on the training data set, training and verification are performed with only 17 data excited at 20 kHz, excluding data with the excitation frequencies of 19 kHz and 21 kHz. At this time, it is verified with data obtained from the defect existing at the standard position and trained with the remaining 16 images. As a result, as shown in
Figure 23b, a little lowered accuracy is observed at the positions C, E, and F. In addition, in order to look into the effect of changing the defect location on the training data set, training is conducted with only three data acquired from the standard location. Due to lack of training data, it is trained using 15 overlapped data and verified with data located 6.5 mm (i.e., about 1/2 of the defect radius) away from the standard position. The corresponding result is shown in
Figure 23c. Although the verification accuracy during training is 100%, when it is verified with new data, the accuracy becomes very low at most positions, indicating that the training model is overfitting.
Through these tests, it can be observed that the change in defect location has a greater effect on the training result than the change in the excitation frequency. Therefore, acquisition of the additional training data set while slightly moving the defect position is essential to obtain a robust training model, and it can be concluded that changing the location of the defect is more effective than changing the excitation frequency as a way to increase the data set.
As shown in
Figure 24a, the classification possibility is investigated for eight randomly selected positions on the aluminum panel. A total of 400 images ( = 50 × 8) obtained from experiments are labeled with eight classes, respectively, and 10 training models are derived by training 10 times. As a result of classifying the verification image which is excluded from training, the classification accuracy for each location is calculated as shown in
Figure 24a. The average classification accuracy for the eight locations is 87.5%.
The classification performance was successfully verified in the eight-defects case. As shown in
Figure 24b, the number of defect locations is expanded to 16. A total of 800 images are used as a data set by conducting the experiment under the same conditions. It is labeled as 16 classes, respectively, and 10 training models are obtained. Even though the number of defect candidates doubles, 12 out of 16 locations shows classification accuracy of 80% or more. The average classification accuracy for 16 locations is 78.1%, showing a decrease in accuracy of 9.4% compared to the previous case. It is observed that the classification accuracy of some positions decreases as the number of defect candidates increases. When DenseNet201 is applied through the F1 score summarized in
Table 10, it is expected that classification will be possible at the ‘J’ or ‘O’ position, which is difficult to classify in
Figure 24b.
Even if a stiffener is additionally attached to the test object, the presented algorithm in this paper is expected to be able to extract features with a small difference in the measured signals on the sensor array. In this case, non-destructive testing using the beamforming methods [
16,
17] becomes very difficult to apply since the steering vector, that is a kind of spatial transfer function, is disturbed by the additional structure. To experimentally verify this case, a 5 mm thick steel bar is attached to the panel while maintaining all conditions, and the experiment is conducted. For this case, in order to investigate the classification possibility of defects and the change in classification accuracy according to the attachment of stiffeners, as shown in
Figure 24a and
Figure 24b, experiments are conducted under the same transducer-placement conditions as without stiffeners. The steel bar is attached to the right side of the transducer array as shown in
Figure 24c. As a result of verification, the classification accuracy of over 80% is shown for 12 out of 16 locations.
Table 13 summarizes the average classification accuracies. The classification accuracy of the left side of the panel increases by 6.6% above the average compared to when there is no bar while the classification accuracy of the right side decreases by 8.8%.
Next, as shown in
Figure 24d, the steel bar is attached to the left side of the panel. When the bar is attached to the left side of the panel, the classification accuracy on the left side is significantly lowered to 52.5% while the classification accuracy on the right side increases by 17.5% compared to when there is no bar. This can be inferred from the fact that the location of the defect may not be accurately classified if there is an interfering structure such as a stiffener between the reflected wave and the sensor array.
In order to solve this problem, if there is an arbitrary structure in the test structure, a method of additionally placing sensors on both sides of the structure can be considered. In this study, we mainly dealt with the optimization of the sensor locations for a bare panel, and since there is a limitation to collect the data set through the experiment, only the classification possibility could be known. In the future, if a larger data set can be obtained using various methods, it is expected that the classifiable area can be improved with a high classification accuracy.