This section presents the results obtained for the proposed automated defect detection procedure and its subsequent depth estimation, applying different tests on the image sets described in
Section 2.3. Sum-type [
34] and maxigram [
3] synthesis images were used for the tests. Sum-type images were chosen to evaluate the methodologies for defect detection on thermal information under low-contrast conditions. Maxigrams were chosen because they also allow estimating the depth of the detected anomalies. Although maxigrams can be obtained on image sequences without preprocessing, the temporal information concerning them is in the first instants of the sequence due to the energy generated by the PT experiment. This condition produces unreliable depth estimates with the chosen approach. Therefore, sum-type images were only used on raw images.
Section 3.1 presents the detection performance for defects in simulated sum-type images.
Section 3.2 shows performance results for maxigrams of simulated images, including depth estimation results for detected defects. Finally,
Section 3.3 presents results for defect detection in actual thermal image maxigrams and depth estimation for the detected defects.
3.1. Defect Detection in Simulated Thermal Sequence Sum-Type Synthetic Images
Row 1 in
Figure 3 shows sum-type synthetic images obtained for the CFRP material in simulated raw sequences. The CFRP sample contains internally defective regions of interest (ROI-d) with lateral sizes (
S) of 3 mm (column 1), 6 mm (column 2), 9 mm (column 3), 12 mm (column 4), and 15 mm (column 5) located at a depth (
D) of
mm. Row 2 shows the binary images resulting from the application of the HOG-based automated defect detection methodology [
26].
In all the sum-type images in
Figure 3 (row 1, columns 1 to 5) the effect of nonuniform heating can be seen, which manifests itself with a concentration of intensities greater in the center of the image and progressively decreasing towards the edges. This phenomenon, in addition to producing low contrast in the thermal images, prevents easy identification of the ROI-d, especially for smaller defects (e.g., columns 1 and 2). However, the HOG-based automated detection method shows acceptable results after evaluating different (
S) conditions (see
Figure 3, row 2). Supporting the results described above, the AUC performance values for depth (
D) values of
mm,
mm,
mm,
mm, and
mm are given in
Table 4. The AUC indicator is constructed by comparing the results obtained in the analyzed images with a template or binary reference image containing the correctly segmented defects. In our case, the AUC is used to evaluate the classification results (Equation (
13)). An ideal system will achieve a true-positive rate (TPR) of 1.0 and a false-positive rate (FPR) of 0; thus, the ideal AUC will be 1.0. However, in practice, a reliable system should preferably have AUC values greater than 0.5.
3.2. Automated Defect Detection and Depth Estimation in Simulated Sequence Maxigrams
Row 1 in
Figure 4 shows maxigrams of simulated CFRP sequences contrast-enhanced using the BTCOp method [
27]. The CFRP sample contains ROI-d located at the same depth (
D) and with lateral sizes (
S) equal to those used in the test described in
Section 3.1 (columns 1 to 5).
The improved information is seen with a dark and uniform intensity level that represents the background of the image, while the defective regions are easily distinguished with light intensity levels. Thus, the defects are observed for all the lateral size (
S) conditions evaluated. In row 2 of the same figure, the binary images resulting from applying the methodology for automated defect detection based on the combined use of BTCOp and local thresholding described in
Section 2.2 are shown.
Table 5 presents the AUC performance indexes for the depth (
D) and lateral size (
S) values used with the simulated thermal sequence maxigrams. In general, the performance indexes are slightly lower than those obtained with the sum-type images processed with the HOG method.
With the HOG-based methodology, overly segmented ROI-d are obtained (see
Figure 3, row 2). This result can be attributed to the geometry and resolution of the structures used to calculate the HOGs, as explained in [
26]. On the other hand, after compensating in the images for nonuniform heating with the BTCOp method [
27], more well-defined defects with more uniform sizes can be observed (see
Figure 4, row 2). The BTCOp method uses all available samples in the image for estimating the background model parameters and therefore has no resolution constraints.
Applying a linear regression method to Equation (
14) [
3], it is possible to calculate the coefficients
A and
n that allow automatic estimation of the depths (
) of the ROI-d detected. The values
and
signify the maximum contrast value, in our case located at the element (pixel) having the maximum intensity value within the detected ROI-d, and its corresponding time, respectively.
With the estimated depth values (
D) for the ROI-d, percentage error values relative to the actual depths (
D) of the defects are calculated (see Equation (
15)). In a maxigram of simulated sequences, all ROI-d are at the same depth.
Table 6 shows the calculated percentage errors for all evaluated length/depth ratios. No trends were identified in the results that relate the calculated error as a function of depth (
D) or lateral size (
S) of the defects. However, a maximum percentage error of
(
S = 9 mm y
D =
mm) and a minimum percentage error of
for (
S = 3 mm y
D =
mm) were observed.
Figure 5 presents the results of the ANOVA that statistically complements the information described in
Table 6. In this test, the null hypothesis (
) was that the mean values of estimated depth (
D) of the ROI-d are equal for the five lateral sizes (
S) used. On the other hand, the alternative hypothesis (
) was that these mean depth values are different. Before performing this statistical test, the datasets were tested for normality.
The ANOVA test was repeated with the same conditions for each of the five depth (
D) values evaluated (
mm,
mm,
mm,
mm, and
mm). The datasets for each ANOVA corresponded to estimated depth (
D) values for the eight ROI-d present in each maxigram relative to the five lateral size (
S) conditions (see
Figure 5a–e). Thus, each (
D) condition was represented by 39 degrees of freedom (eight
D values for five lateral size values).
From the
p-values obtained in the tests and with a certainty of
, it is possible to affirm that the defects with
S values of 6 mm, 9 mm, 12 mm, and 15 mm did not differ significantly in
D. However, they exhibited higher
D values than those found for a lateral size of 3 mm (see
Table 6). These differences could be explained by noise levels and nonuniform heating correction affecting small defects more than large ones. The above behavior was maintained for all depth (
D) values evaluated.
3.3. Automated Defect Detection and Depth Estimation in Actual Sequence Maxigrams
In this section, we present the detection of defective regions by the techniques proposed in our study (BTCOp and normalized BTCOp, BTCOpN) and using traditional thermal information processing techniques: normalized contrast (CN), DAC, BTCF, and normalized BTCF (BTCFN). For ROI-d segmentation with the traditional methods, the classical Canny edge detection algorithm was used. For the BTCOp and BTCOpN methods, the automated detection process proposed in this paper and described in
Section 2.2 was used.
Figure 6 shows in the first row the sequence maxigrams of the actual CFRP sample (see
Section 2.3.1) contrast-enhanced using the CN (column 1), DAC (column 2), BTCF (column 3), BTCFN (column 4), BTCOp (column 5), and BTCOpN (column 6) methods. In the second row are the corresponding binary images resulting from the detection process.
The binary images produced after analyzing the contrast-enhanced maxigrams show that the BTCOpN, BTCFN, and BTCOp methods correctly detected 19, 18, and 13 ROI-d and falsely detected 1, 1, and 5 ROI-d, respectively (
Figure 6, row 2, columns 6, 4 and 3). The BTCF, CN, and DAC methods underperformed, correctly detecting 11, 11, and 5 ROI-d, respectively, and falsely detecting 2, 2, and 5 ROI-d, respectively (
Figure 6, row 2, columns 3, 1, and 2).
With the normalized BTCOpN and BTCFN techniques, maxigrams with better contrast were obtained (in comparison with the maxigrams generated with the other methods, which were BTCF, BTCOp, CN, and DAC). The result described above is consistent with the number of correct detections obtained using these methods and additionally with the superior performance displayed by these techniques as a function of the signal-to-noise ratio (SNR) [
27]. The performance of the BTCOpN method is similar to the result obtained for the same actual CFRP sample recently analyzed with a deep learning algorithm [
23], where 17 correct detections and 4 false ROI-d detections were reported.
Table 7 shows the performance indicators that were calculated using the different processing techniques for the maxigrams in
Figure 6. Using the recall indicator, which represents the proportion of actual defects that were correctly identified, the best performance was exhibited by the BTCOpN technique (0.84), followed by the BTCFN (0.72), BTCOp (0.61), BTCF (0.56), CN (0.39), and DAC (0.18) methods.
In terms of accuracy, the BTCOpN and BTCOp methods showed values close to 0.56, representing a nonnegligible level of false detections. However, these were mostly located around the correctly detected ROI-d, which would facilitate the future implementation of a strategy to improve shape characterization in defects (ROI-d). This phenomenon, which can be interpreted as oversegmentation, can be attributed to the heat scattering effect that occurs at the boundaries between the sound and defective regions of the inspected material, the resolution that was defined for the regions, and the calculation of the local thresholds.
Following a procedure similar to that described in
Section 3.2,
Table 8 and
Table 9 present the estimated depth (
D) values and the calculated percentage errors between the actual depths (
D) of the defects and the estimated depths (
D) that were obtained using the BTCOp and BTCOpN contrast enhancement methods.
As with the simulated sequence maxigrams, no trends were identified in the results for the actual sequence maxigrams for the error values calculated as a function of defect depth (D) or lateral size (S). Using BTCOp for contrast enhancement, a maximum relative percentage error of for S = 7 mm and a minimum relative percentage error of for S = 5 mm were found. On the other hand, with the BTCOpN method, the maximum value was for S = 3 mm, and the minimum was for S = 15 mm.
Figure 7 presents ANOVA results to complement the statistical analysis of the information in
Table 8 and
Table 9. Similar to the hypotheses examined in
Section 3.2, the null hypothesis (H
) was that the mean estimated depth values (
D) of the detected ROI-d are equal in relation to the actual depth values (
D) (
mm,
mm,
mm,
mm, and
mm). The alternative hypothesis (H
) was that these mean depth values are different.
Based on the p-values obtained in the tests and with a certainty of , it is possible to state the following:
On the maxigram of the actual sequence compensated with the BTCOp method, in the detected ROI-d and using the values of (
D), statistically significant differences were found for the four actual depths corresponding to the defects (see the left side of
Figure 7). In this case, the numbers of defects that could be detected were five for
D =
mm, four for
D =
mm, two for
D =
mm, and two for
D =
mm.
On the maxigram of the same actual sequence, compensated with the BTCOpN method, in the detected ROI-d and taking into account the values of (
D), no significant differences were found between the actual depths (
D) of
mm and
mm or among the actual depths (
D) of
mm,
mm, and
mm. However, between these two groups, there were statistically significant differences (see
Figure 7, right).
Table 10 shows percentage values of the errors in estimated depth (
D) obtained for the ROI-d detected with the BTCOpN techniques and the BTCFN. These quantities were calculated for the defects having the greatest lateral size (
S =
mm) in the actual CFRP sample. For the comparative analysis, the BTCFN method was chosen since it detected a higher number of defects compared to the remainder of the traditional techniques evaluated. The BTCFN method showed a minimum absolute error of 1.5% for
S =
mm and a maximum of 7.0% for
S =
mm. The BTCOpN method had a minimum value of 0.2% for
S =
mm and 3.6% for
S =
and 0.8 mm. In general, the error values for the two methods did not show trends in relation to the lateral sizes (
S) of defective regions or their depths (
D). However, the BTCOpN method in most of the depth conditions evaluated exhibited lower error values than those obtained with the BTCFN method.
The differences in the performance results could have been due to how each method accomplishes the decoupling of the background information and the information corresponding to the defects. Possibly, the optimal estimation of the parameters describing the image background favors obtaining the nonuniform heating model, in contrast to the BTCFN method, which masks the defect regions using a median filter operation.