1. Introduction
The health assessment of structures has been gaining importance, as more structures are aging, deteriorating due to environmental factors, and becoming damaged due to extreme events such as earthquakes and storms. The timely detection and accurate localization of incurred damage is important to plan a retrofitting strategy to save the structure and potentially extend its service life. Most global assessment methods indicate the presence of damage and provide its approximate location. The local assessment methods, on the other hand, scan a small portion of the structure at a time and provide accurate details of the damage which include its presence, location, and severity. The presented study deals with the latter category of methods.
The conventional method for local assessment of damage is visual inspection. This method requires expert knowledge, is prone to human errors, applicable to only reachable areas of structure, and is discontinuous monitoring. Most importantly, only surface damage can be identified by this method, while the SSD remains undetected. SSD is a concerning issue and a major contributing factor to failure in a variety of fields, such as the failure of subsurface cracks due to rolling contact for bearing steel [
1]. Bearing damage due to subsurface cracks is a major contributor to wind turbine gearbox failure resulting in increased downtime [
2]. SSDs such as delamination and cracks impact aerospace components, causing their failure and reduced service life [
3,
4]. Concrete debonding with rebars, invisible voids, and cracks in concrete are a major concern for the deteriorating state of reinforced concrete bridges [
5,
6,
7]. SSD in steel members of steel truss bridges is responsible for causing economic loss due to repairs, replacement of bridge members, and the associated downtime [
8]. SSD is found to be a critical factor in the strength degradation of brittle crystal materials [
9] and ground structural ceramics [
10]. SSD is also detrimental to the performance and life of silicon chips, which is a major concern for electronic and computer industries [
11].
Several methods are already available in the literature for SSD detection. A ground-penetrating radar technique has been used to detect subsurface delamination in reinforced concrete pavements [
12]. Infrared thermography has been used to detect concrete delamination [
5] by studying the temperature difference in the sound and damaged part. A technology based on the reflection analysis of electromagnetic waves was developed to detect internal voids and debonding in concrete structures [
13]. Along similar lines, reflections of an elastic wave propagating in the ground were studied to detect subsurface cavities [
14]. Other technologies include extracting features from acoustic emission measurements to detect SSD in wind turbine bearings [
2]. Using short pulses of sound infuses and induces frictional heating at the delamination area to detect SSD using an infrared camera [
15]. The authors study the depolarization effect on the reflected light with respect to the incident light to detect SSD in ground silicon wafers in this study [
11]. In all these methods, features extracted from the response of the system are used to detect the damage. These features are selected by human judgment and understanding of the subject, which may often result in the selection of sub-optimal features.
Advancements in machine learning algorithms, especially deep learning methods, allow automatic feature extraction without any human judgment. With proper training and optimization techniques, optimal features are automatically extracted for damage characterization. A study [
16] provides a very nice review of computer vision and machine learning-based techniques for damage detection in concrete and steel structures. Several deep learning networks have been developed for crack detection in concrete structures [
17,
18,
19,
20], crack detection in pavement structures [
21,
22,
23,
24], spalling detection in concrete structures [
25,
26], and crack detection in steel structures [
27,
28,
29]. All of the works mentioned above detect surface defects in concrete and steel. Only a handful of works are present in the literature to detect SSD, which includes concrete delamination detection [
30,
31] and a deep learning network [
8] to detect SSD in steel members of steel truss using infrared thermography. There is a lot of scope to explore the potential of data-driven methods for SSD detection. Moreover, most of the works on SSD detection of concrete and steel structures are based on infrared thermography. Another physical quantity that correlates well with a damaged structure is the strain concentration. The location of damage corresponds to higher strains than its neighboring area, indicating the presence of damage. With the advancements in techniques to measure full-field strain maps such as S
4 [
32,
33,
34,
35] and Digital Image Correlation [
36], SSD detection methods based on strain measurement can be a viable option. There are a few studies on damage detection and localization using full-field strain information [
37,
38]. However, ref. [
37] focuses on the surface damage of metallic structures, and ref. [
38] uses strain mode shapes for localization on a global level which cannot provide a precise localization at the local level. There are no present formulations that use full-field strain information for the much more complex and difficult task of SSD localization.
In the present study, a Convolutional Neural Network (CNN) has been developed for SSD detection based on full-field surface strain data. This work tries to fill the void of deep learning networks for SSD detection and introduces strain as an indicator quantity. When working with full-field strain data, damage detection can be performed at several stages of refinement in the spatial domain using the deep CNN architecture. The basic stage is the classification of an image into a damaged or undamaged state. At this stage, no information about the shape, size, length, or number of damages is possible to retrieve. The next stage is segmenting the image into bounding boxes of damaged and undamaged regions, as shown in [
8]. At this stage, the presence of damage is indicated to be somewhere within the specified rectangular region. The next stage is a more refined version of the previous stage with damage classification at the pixel level; at this stage, damage is visible with clear boundaries. The present work deals with the pixel-level classification of the damage.
The organization of this paper is as follows, First, the adopted deep CNN architecture is described. Next, the development of the dataset for training and testing of the network is discussed. Following this, the optimization algorithm is discussed and the corresponding hyperparameter tuning is shown. In the end, the testing of the trained network is performed on aluminum, steel, triple damage, and experimental datasets.
3. Dataset and Data Augmentation
The full-field strain maps were obtained from a FE analysis of a rectangular bar using Ansys 2021 R1 (version 21.1) software; see
Figure 2a. The dimensions of the bar are 152.4 mm × 25.4 mm × 12.7 mm with the assigned material properties of aluminum. The damage to the bar was provided by removing material in the shape of a cylinder at the desired depth and location, as shown in
Figure 2b. Since the dimensions of subsurface damage are small as compared to the bar, adaptive meshing has been performed to generate a fine mesh around the SSD and a coarser mesh away from it, as shown in
Figure 2c. Additional details about the FEM modeling can be found in the
Appendix A. The bar is axially loaded and the axial strains developed at the top surface are recorded. Damage to the bar is restricted to an area within 25.4 mm on both sides of the center of the bar, and the strains are recorded within the same area. The nodal strain values obtained from the FE analysis are interpolated in a grid of 256 × 256 points. The FE analysis is performed based on the assumption that the bar remains linearly elastic; however, this may not be true if the applied force causes the bar to yield. It should also be noted that the damage is a subsurface and localized phenomenon. This means, although the redistribution of strain does take place after yielding, those effects have diminishing effects away from the damage location.
Figure 3b,c shows the surface strain of a bar obtained from linear analysis and non-linear analysis for an SSD (at a depth of 3.81 mm from the top) shown in
Figure 3a. In the first case, the force is kept small enough to keep the system linearly elastic (
Figure 3a), while in the other case, the force is large enough to cause plastic strains to develop (
Figure 3b). The magnitude of surface strains is significantly different in both cases, but the surface strain distribution looks very similar, indicating that subsurface yielding does not affect surface strains significantly. To further confirm this, another example case is taken in which a bar is subjected to incremental force until the surface strain exceeds the yielding strain.
Figure 4 shows the surface strain of the damaged bar (at a depth of 5.08 mm) subjected to the different magnitudes of force. The force is gradually increased such that the average strain roughly increases by one
. Except for
Figure 4b, all the following figures show surface strains when yielding has occurred in the bar. Maximum surface strains (
) in
Figure 4b are 24% of the yielding strain (
), while in
Figure 4e, the maximum surface strain is beyond the yielding strain. If the magnitude of strains is ignored in all the cases then they look very similar; it becomes difficult to differentiate the strain distribution based on the extent of yielding in the bar. Although a small variation just above the damage can be observed in
Figure 4e, the visual differences remain insignificant.
The strain variation observed on the surface of the bar is dependent on the depth at which damage has occurred and also on its orientation. As the damage occurs deeper in the bar, the spatial variation in the surface strain decreases in magnitude. Moreover, when the orientation of the damage aligns with the direction of the force, the variations show further decrements. A total of 1% Gaussian noise is added to the data for training since some amount of noise is always present in the real measurements. On adding noise, strain variations become completely lost after a certain depth, given that all the orientations of damage are allowed.
Figure 5 shows surface strains for various orientations of damage which have a radius of 0.76 mm at mid-depth of the bar. Each row in the figure includes the damage orientation, noise-free surface strain obtained from the finite element method, and surface strains with 1% added noise. In the first two figures when damage is oriented perpendicular to the direction of applied force, the spatial variation is significant and is noticeable in the presence of noise. However, as the damage starts to align with the direction of force, the spatial variation in surface strain starts to decrease and it becomes hard to differentiate it in the noisy data. In the last row when damage aligns perfectly with the direction of force, it becomes difficult to tell the strain pattern without a priori knowledge of the noise-free strain pattern. Therefore, for preparation of the training data, the minimum radius of the damage is restricted to 0.76 mm, and the depth of damage is restricted until the mid-thickness of the bar.
When damage occurs, it is possible that it may not remain as a single damage; rather, it branches into two, or there can be two independent damages altogether. To account for multiple damages, the data includes damage formed in three ways; see
Figure 6. The three types of damages are: (i) Type I—two independent damages oriented in random directions (see
Figure 6a), (ii) Type II—two damages having the same root but branched at some point in a random direction (see
Figure 6b), and (iii) Type III—two intersecting damages oriented in random directions (see
Figure 6c). A total of 903 numerical simulations were carried out, which included 290 examples of single damage and 613 examples (the rest) from all three types of multi-damage. In all the 903 example cases, the orientation of damage is randomly assigned. The depth of the damage is also randomly assigned between the mid-depth and one radius away from the top of the bar. The radius of the damage is selected randomly from a minimum value of 0.76 mm to a maximum value of 1.52 mm.
Data Augmentation
Although 903 examples are available to train the deep CNN network, it may still be a small number to train such a large network. Data augmentation is a powerful tool to overcome this problem which creates more training data points by augmenting the existing data. Five data augmentation strategies were adopted and applied in series, as stated below:
Random horizontal flip with probability = 0.5.
Random vertical flip with probability = 0.5.
Random rotation within 10 degrees with uniform probability.
Random shear within 10 degrees with uniform probability.
Random resized crop with an equal probability between a scale from 0.5 to 1.
Random rotation is restricted within 10 degrees to maintain uniqueness between the measured strain and the damage. The fifth augmentation helps add scale invariance to the network. In addition to data augmentation, data normalization is an essential and important operation in machine learning. The standard practice to normalize the data is to subtract the ensemble mean from the image and divide it by the ensemble standard deviation. One crucial thing to consider while using strain data is the huge variations that can occur in strain for the same structure. There could be a difference of an order of magnitude in strain values depending on the amount of force applied. This means if 10 N of force causes a strain of order 0.1 , then 100 N will cause 1 , while the strain pattern remains exactly the same. To make the network invariant of applied force, the individual image is normalized using an image mean and image standard deviation rather than using ensemble metrics.
5. Predictions
Using the optimum network and the optimum threshold value, damage prediction was conducted for the test set which was not observed by the network. The IoU score for the test set came out to be 0.794.
Figure 7 shows a few predictions on the test set along with the true damage and the input strain map.
Figure 7a is the type I damage where two damages start from one end and move further apart along the width of the bar. The presence and location of both damages can be inferred by just looking at the strain map. A red strip of high strain at the top portion of the strain map indicates that damages are close enough such that strains resulting from both damages superimpose. Then, they move apart and show two patches of strain.
Figure 7b is an example of type II damage, where one damage is close to vertical and the other close to horizontal. It is easy to predict the vertical damage from the strain map but not the horizontal damage. The orientation and connectivity of the two damages are difficult to predict. The red region indicating the vertical damage is only until the connection point of the two damages; beyond that, it is difficult to precisely locate the extent of the vertical damage.
Figure 7c shows slant damage along the width of the bar. There is a single line of high strain in the middle with two other lines of high strain on either side of the middle line. It gives the impression that two different damages are present, but it is hard to tell whether both damages are connected or not. Both in
Figure 7b,c, it is difficult to precisely find the boundary of the damages, but the trained network can accurately predict it.
Figure 7d–f are examples of type I, type II, and type III damages, respectively. It is difficult to predict the presence of second damage just by looking at all these strain maps. In each case, the strain pattern indicates single near-vertical damage and no obvious strain variations indicating the presence of other damage. In
Figure 7d, there is a slight greenish region at the top and left side of the dominant strain pattern, while in
Figure 7e, a little bulge at the location of the second damage can be observed. In
Figure 7f, on careful inspection, a small variation can be observed at the ends of the damage location. The observed variations in these three cases are subtle; to make any conclusion on the damage description will require expertise on strain patterns due to SSD. The trained network can pick up the subtle variations and correctly predict the damage location.
Figure 7g–j are examples where the network shows partially accurate damage prediction. In
Figure 7g, both damages are correctly picked but the gap between them at the bottom right corner is missed; the same can be observed in
Figure 7h. In
Figure 7i, the second horizontal damage is only partially predicted, while in
Figure 7j, a part of the longer damage is missed and is merged with the smaller damage, giving an impression of the single damage. The errors in
Figure 7g–j are minor, which could be attributed to the added noise to the data. It should be noted here that in cases where the network prediction has shown errors, none of those cases show any obvious strain patterns to describe the damage. It is difficult for a non-expert to accurately predict the damage by visually observing the strain map. The overall picture predicted by the network is close to the truth, showing the damaged location and shape with reasonable accuracy.
5.1. Prediction on Steel Dataset
The trained network on the aluminum dataset has proved to work well on aluminum test examples. Now, the performance of the network is tested on data derived from the numerical simulation of steel bars. Dimensions of the bar are kept the same as the aluminum examples. Nine example cases have been considered, each having two damages of random length, random orientation, and random diameter.
Figure 8 shows predictions for four example cases.
Figure 8a,d are damage cases where two damages cross each other, while
Figure 8b,c are cases of two independent damages. Except for
Figure 8a, there are no obvious indicators in strain maps to tell whether the two damages are crossing each other. In
Figure 8b,d, the orientation and location of one damage can be predicted from the strain map; however, features of the other damage are difficult to interpret. In
Figure 8c, it is very difficult to predict the orientation of both damages, as there are no clear features present that can help in predicting the correct orientation of the damage. The trained network on the other hand is able to accurately predict the location and orientation of both damages for all the example cases. It successfully identifies whether damages are crossing each other or not; in addition, the gap between both damages in
Figure 8b,c is accurately predicted. The prediction of the boundary of damages is not very neat and shows some visible errors. However, the error is present in some sections of the boundary only; overall, the prediction is of an acceptable accuracy.
The IoU score for the steel test set is found to be 0.793, interestingly, which is the same as the IoU score of the aluminum test set. In the steel test set, strains were obtained from a non-linear analysis of the steel bar. Enough force was applied to yield the bar to ensure plastic strains were formed. Surface strains from the analysis of the steel bar will have the same spatial characteristics as that of the aluminum bar, since only the stiffness value is different for both materials, given that yielding does not significantly change the strain pattern. The network works well even though the magnitude of stiffness between steel and aluminum differs in orders of magnitude because of the normalization of the strain map, which makes it independent of the absolute value of the stiffness. Therefore, as long as surface strains from the non-linear response of the system show similar characteristics as the linear response (as shown before in
Section 3), the trained network should work well in that case.
5.2. Prediction on Triple Damage Cases
The performance of the network is now checked on totally unseen damage cases. The network was trained on single and double damage cases and it did well on the aluminum test case and steel test case, which were also formed from single and double damage cases. It will be interesting to observe how well the network works on triple damage cases to obtain an estimate of its generalization potential. A total of 125 example cases were generated using a numerical simulation consisting of three damages of random length, random orientation, and random thickness. Predictions on four cases along with a true damage and strain map are shown in
Figure 9.
Figure 9a shows damage diverging into two damages at the bottom point and then the left leg splits again into two damages. The trained network is able to predict the correct damage pattern without any trouble.
Figure 9b,c shows three damages randomly crossing each other. In these two cases, the prediction of the damage pattern is correct with some distortion along the boundary of the damage. In
Figure 9d, the network predicts two of the three damages correctly but misses out on a part of the third damage. In none of the cases except
Figure 9a, the strain map showed patterns directly indicating all the damage locations. It is easy to predict one of the three damages by just looking at the strain map but after that, it becomes difficult to judge, and the probability of making human error increases. The trained network does a good job at it by getting an IoU score of 0.764 for the test set of 125 example cases.
6. Experimental Validation
The network trained on numerically simulated data is now tested on the strain map obtained from the experiment on an aluminum bar that is 152.4 mm long, 25.4 mm wide, and 6.35 mm thick. The bar has a SSD in the form of a cylindrical hole along its width. The damage is 8.47 mm long starting from the edge of the plate, which is one-third of the width of the plate. The damage is provided at the mid-depth of the bar. The experimental setup in
Figure 10 shows a bar fixed from both sides, except the displacement along the longitudinal direction is allowed on one side. The strain measurement is performed using S
4. The laser head shown in
Figure 10 shoots a laser at a point where the strain needs to be measured. The thickness of the laser is so small that the measured strains can be treated as point measurements. The laser head is then moved along a two-dimensional grid to obtain a full-field strain map. The measured strain map need not necessarily be in a grid of 256 × 256 points, which is the input size of our network. Therefore, the measured strains are mapped to a grid of 256 × 256 points by interpolation using cubic splines.
During the experiment, the bar was stretched beyond the yielding point to allow plastic strains to form. The bar was then unloaded and a strain map was generated which measured the residual strains formed during the process. Since the location of strain concentration during loading will be the same as the location of the residual strains, the prediction of SSD can be conducted using residual strains as well.
Figure 11 shows the predicted damage, true damage, and measured strain map from S
4 for the bar. The network is able to detect the correct damage location at the center bottom region. The width and length (region) of the true damage are 3.18 mm and 8.47 mm, respectively, while the estimated damage is 2.47 mm wide and 6.54 mm long. The CNN underestimates the width of damage by just 0.71 mm and length by 1.93 mm. Given the noise in the measured data and external factors such as imperfect loading and boundary conditions causing changes in the strain pattern as compared to the numerical simulations (with ideal conditions), the relative error of 22% in both width and length estimates can be considered to be of reasonable accuracy. The network’s ability to pick up damage indicates that training based on numerically simulated data is working on real data as well. A small blob of false damage detection can be observed at the bottom right corner of the prediction in
Figure 11. The reason for this is the eccentricity in the measured strains due to imperfect loading which causes strains on the right side of the damage to be slightly greater than the left side. It gives a false indication of a rather small damage somewhere on the right side. However, given the amount of noise and eccentricity in the measured strains, the majority of the damage is predicted in the region of the true damage. It is difficult to include such unpredictable conditions in the finite element modeling which assumes ideal conditions on which the network is trained. Real-life strain data for SSD is scarce; because of this, the training of the CNN was not possible on real data. However, the experiment validation shows that CNN trained on FEM data does work on real data with a possibility of a few anomalies. With the increasing availability of real-life data, transfer learning can be adopted in the future to update the proposed network with a considerably fewer number of data points to obtain a more robust network.
7. Conclusions
A deep CNN network based on surface strains is developed to detect SSD. The architecture of the network is adopted from U-Net with some modifications which can perform pixel-level segmentation tasks to classify them as damaged or undamaged pixels. A dataset containing 903 full-field strain data (256 × 256 points) was generated by numerical simulation of aluminum bars with diverse damage scenarios. It was shown that surface strains obtained from linear analysis had similar strain patterns with those from non-linear analysis, even when surface strains reached a near-yielding strain. Therefore, all the simulations were based on a linear analysis of the aluminum bar. Out of 903, only 452 strain data were used for training, which is a lot less than the total number of parameters in the network architecture. Therefore, extensive data augmentation was performed by random flips in the horizontal and vertical direction, random rotation, random shear, and random resized cropping. The optimum network was obtained by tuning four hyperparameters: the learning rate (1 × 10), the strength of regularization (1 × 10), the weight of positive samples (2), and the threshold value (0.2) for classification.
The physical principles in this study mainly pertain to the relationship between damage–stress–strain. The flow of information from damage to stress concentration to strain pattern is rather direct and highly correlated, which the trained neural network learns. Since the sub-surface damage causes complex surface strain patterns, the adopted convolutional neural network (CNN) learns the complex and nonlinear relationships between them and provides damage localization.
The trained network was able to segment damage in the test set for all damage types to an acceptable accuracy. The localization of damage was accurate with clear boundaries except for a few cases with minor errors. The IoU score came out to be 0.794 for the aluminum test set and 0.793 for the steel test set. The similarity in the performance even when material changes is because surface strains are less sensitive to the non-linearity introduced by subsurface yielding. As long as surface strains show similar strain patterns in the presence or absence of subsurface yielding around damages, the proposed network should work well for any material. The testing of the network on unseen triple damage cases achieved an IoU score of 0.764. This shows that the network has learned hidden and complex characteristic patterns from the surface strains and it is capable of localizing damage for unseen damage patterns. The network was also tested on experimental data where the strain was measured using Strain Sensing Smart Skin. The accuracy of prediction was reasonably good and it proved the efficacy of the network to be used on measured full-field strain data. For future improvements, updating the network using real-life/experimental data will enable it to identify the anomalies in the measurement data which should improve the robustness of the network.