1. Introduction
Bronze wares are remains and monuments from human social activities with crucial historical, artistic, and scientific values, making them vital objects of study in archaeology [
1]. Due to natural and human causes, many unearthed bronze wares are often found to have mutilations, fractures, and foreign objects covering them, along with deteriorated important decorations and inscriptions. This not only seriously damages the information they convey but it also threatens the material survival of the bronze wares themselves, creating difficulties for later study and use. Identifying and extracting key historical information from bronze wares is a critical issue for archaeologists. In the realm of traditional cultural relics preservation, the cleaning and information extraction of damaged bronze wares is frequently approached through a method of gradual, in-depth peeling. However, this technique is not only time-consuming but also has the potential to cause damage to the bronze wares.
With the widespread expansion of scientific and technological testing applications, an increasing number of testing equipment and methods are being employed in the protection of bronze wares [
2]. This trend has led to enhanced operational security and precision. Non-destructive examination technology utilizes various non-destructive imaging techniques such as X-ray and computed tomography to conduct contactless scanning and testing of bronze wares. It has important practical significance for the research of bronze wares.
This paper focuses on the X-ray imaging of Han Dynasty bronze mirrors. During the X-ray imaging process, the varying transmittance of X-rays in distinct areas of the bronze mirrors, specifically the decoration and rim areas, results in differences in brightness and darkness in the final image. Additionally, the optimal diffraction energy needed varies between the center and rim of the mirror due to their differing thicknesses. To acquire more comprehensive information about the mirror, it is necessary to capture images of different areas using varying levels of diffraction energy. As a result, multiple X-ray images of a mirror are often produced, which poses a challenge for heritage workers in analyzing and extracting information about the mirror. To improve the efficiency of cultural relics protection work, multiple bronze mirror X-ray image fusion using image fusion technology can present the information of the bronze mirror more clearly and comprehensively.
Image fusion technology is widely utilized in cultural relic protection and research. Its primary principle is to enhance texture clarity, information reproduction, and detail richness of the target image by merging pixel data from the source image with the corresponding positions of the target image [
3]. This process generates a more precise and realistic fused image, making it an ideal fit for most bronze ware pixel-level image fusion applications.
Due to the unique nature of bronze wares, their digitized information is often not as extensive as that of other fields. As a result, traditional methods are predominantly used for the image fusion of bronze wares, with multi-scale transform (MST) being the most prevalent method in recent years [
4]. The MST fusion method decomposes the image into high and low-frequency subbands using a transform and applies a specific fusion rule to merge them, and, finally, obtains the fused image by inverse transformation. Commonly used MST methods includes Wavelet Transform (WT) [
5], Shearlet Transformation (ST) [
6], Discrete Wavelet Transform (DWT) [
5], Non-Subsampled Contourlet Transform (NSCT) [
7] and Non-Subsampled Shearlet Transform (NSST) [
8].
However, these methods are mostly applied in the fields of medical image fusion and infrared and visible image fusion. At present, there is limited research on bronze X-ray image fusion, with only the use of basic fusion frameworks. This paper draws on medical image fusion, as well as infrared and visible light image fusion techniques to process the bronze X-ray images.
To observe all the information present on bronze mirrors simultaneously, enhancing the expression of the bronze mirror decorations and highlighting the defective areas of the mirrors through image fusion techniques is necessary. However, achieving such an effect utilizing the traditional MST method proves challenging, so some scholars have improved the fusion rule. Zhu et al. [
9] proposed an NSCT-based fusion technique that incorporates the local Laplace operator and phase consistency for enhanced fusion performance, but it results in the loss of some fusion details. Liu et al. [
10] proposed an image fusion method based on multi-decomposition LatLRR. The method fuses images after decomposing them several times with LatLRR, which improves the brightness while preserving the detail information. However, it is worth noting that in some cases, the fused images may exhibit overly excessive brightness. Mei et al. [
11] proposed a method for image fusion that combines NSCT and adaptive PCNN. The fusion rules utilize the sum of directional and global gradients, leading to improved method timeliness. However, it is important to note that a potential disadvantage of this method is that some fused images may exhibit subtle blurring. Vanitha et al. [
12] proposed a spatial frequency excitation-based PA-PCNN fusion method in the NSST domain. The method utilizes a maxima strategy for the low-frequency subband coefficients. However, it is important to note that a potential drawback of this method is that the resulting image contrast may not be sufficient, leading to a loss of energy and detail information. Chinmaya et al. [
13] proposed the NSST-based PAULPCNN fusion method to fuse significant complementary details in grayscale images with pseudo-color images. This method can extract rich information and is effective in medical image fusion. However, these methods still have more defects in terms of image quality and detail preservation in bronze mirrors. In order to solve this problem, this paper undertakes the task of introducing a multi-scale morphological gradient operator combined with a WSEML operator in the NSST domain for high-frequency information fusion. The Laplace operator, a second-order differential operator, is highly sensitive to grayscale variations in images, making it ideal for accentuating areas with rapid changes. Building on this, the multi-scale morphological gradient operator adjusts weight coefficients to refine the geometric relationships among eight neighboring pixels, thereby enhancing edge detection accuracy and mitigating noise impact. For the low-frequency components, fusion is achieved through a locally topologically coupled neural P system that leverages nonlinear modulation and dynamic thresholds to preserve the image’s overall integrity and detail. The results demonstrate the optimal performance in the evaluation metrics of Average Gradient, Mutual Information, Gradient-based Fusion Performance, and visual information fidelity, with an average improvement of about 4%. Visual information fidelity is more prominent, increasing by 6.2%.
2. Materials and Methods
In this paper, a new MST-based fusion framework for X-ray images of bronze wares is proposed. As illustrated in
Figure 1, the proposed method entails three key steps: NSST decomposition, the fusion of high-pass and low-pass subbands, and NSST inversion.
A and B are properly aligned X-ray source images of bronze wares. These images undergo decomposition into high-pass subbands and low-pass subbands, using NSST to describe the detailed and structured information of the corresponding X-ray images. The high-pass and low-pass subbands are individually fused according to different fusion rules. High-pass subbands are fused using rules based on multi-scale morphological gradients, whereas low-pass subbands are fused using rules based on localized topological CNP systems. Finally, the fused image is obtained by inverse transforming the fused high-pass and low-pass subbands using NSST inverse transform.
2.1. Non-Subsampled Shearlet Transform
In 2005, Colonna et al. [
14] introduced a synthetic dilation affine system in their research. They utilized multi-resolution and geometric analysis to construct shearlet-wave transformations and developed the basic framework for continuous and discrete shearlet-wave transformations. On this basis, Easley et al. [
15] proposed he Non-Subsampled Shearlet Transform (NSST) method. NSST has numerous advantages in the field of image processing, making it one of the most frequently used methods. It not only inherits the flexibility of shearlet-wave but also offers a better choice of spatial orientation, compared to other methods. This implies that NSST has the capability to efficiently capture and describe the intricate details of an image, thereby enhancing the quality of the fused image and significantly minimizing the probability of the Gibbs effect. NSST achieves an optimal sparse representation of an image, which is highly important in image processing. This means that NSST can accurately represent image features and reduce redundancy by selectively representing crucial information in the image.
First, the high-frequency subbands and the low-frequency subbands of the image are obtained by multi-scale decomposition of the source image using a non-subsampled pyramid filter (NSPF). Then, the high frequency subbands of the image are decomposed in multiple directions using a shearlet filter (SF). Finally, the image is reconstructed by inverse NSST operation after completing the processing of the corresponding subbands. This process is shown in
Figure 2, which presents a schematic diagram of the three-level NSST decomposition.
2.2. MSMG-Based High-Pass Subband Fusion Rule
High-pass subband fusion is an image processing technique that extracts precise details such as edges and textures from the source image through spectral analysis and filtering. The primary objective of high-pass subband fusion is to preserve the intricate information of the source image and produce a fused image that is superior in clarity, richness, and depiction of the detailed attributes within the image. In the process of high-pass subband fusion, the essential factor is to optimize all the detailed characteristics of the source image. The multi-scale morphological gradient (MSMG) [
16] shares similar functionality with common edge detection operators (e.g., Prewitt, Sobel, Laplacian) in detecting image edges. However, common edge detection operators enhance the noise in the image and degrade the image quality when extracting the image edges [
17]. For the detection of boundaries, it is very important in the fusion of bronze mirror X-ray images [
18]. The MSMG is an operator that can extract gradient information from an image. It represents the contrast difference between a pixel and its neighboring pixels. Consequently, the MSMG is frequently utilized in scenarios such as edge detection and image segmentation [
17]. This method of transformation can maintain a low enhancement rate in noise while extracting image edges, effectively decreasing the noise’s impairment of the results of edge detection. To enhance the informational value of the high-pass subband image and create a fused image with detailed edges and textures, this study utilizes the multi-scale morphological gradient operator (MSMG) in combination with the eight-neighborhood-based weighted and modified Laplace operator (WSEML) [
9]. This approach significantly enhances the clarity of contour edges in the fused image, preserves detailed information, and leads to improved fusion of the high frequency subbands.
The single-scale morphological gradient is defined in Equation (1):
where
denotes the pixel position,
stands for the number of multi-scale levels,
represents the source image,
refers to the structural elements at the
level scale,
represents the structural information at the
level scale, and
and
denote the dilation and erosion operations, respectively.
In summary, the
MSMG is calculated as in Equation (2):
where
denotes the gradient weight of the
level, which can be expressed as in Equation (3):
The multi-scale morphological gradient of the high frequency subbands
and
in each layer are as follows:
In Equation (4), and are the multi-scale morphological gradient of the high frequency subbands of the layer of the bronze mirror X-ray images A and B, respectively.
is the metric used for extracting details and is defined by the following equation:
is the weighting matrix of
, where
is defined as follows:
where
is the high-frequency subband coefficient located at
in the
-layer and
directions.
Assuming that
and
are associated with the bronzoscopic X-ray images A, B, the fusion rules based on the
and
high-pass subbands are defined as follows:
where
and
are the high-frequency subband coefficients of the two bronze mirror X-ray images at position
in the
-layer direction
, respectively, and
is the high-frequency subband coefficient of the fused image
at position
in the
-layer direction
.
2.3. CNP-Based Low-Pass Subband Fusion Rule
In general, the fusion strategy for the low-pass subbands has a significant impact on the final fused image. These subbands contain a significant amount of energy from the source image, which in turn has a significant influence on the fusion result. These low-frequency data are essential during the fusion process to maintain the overall structure, detail, and color balance. Therefore, selecting a suitable low-pass subband fusion approach is essential to achieving superior fused images. In order to fully guarantee the energy and extract as much detail as possible, a rule based on a localized topological CNP system is used in this paper for the fusion of low-pass subband components in the NSST domain.
Inspired by the synchronized pulse bursts in mammalian visual cortex, H. Peng et al. [
19] proposed a similar neural P system called the coupled neural P system (CNP). The CNP system is a computational model consisting of multiple inter-coupled neurons linked together, each containing receptive fields, modulation, and output modules. These neurons form a directed graph structure among themselves, resembling a spiking neural P system, utilized for distributed parallel computation. The CNP system has unique characteristics. Firstly, it has a nonlinear coupling modulation property, which means that the interactions between neurons are nonlinear, and the coupling strength can be modulated. This feature allows the CNP system to exhibit enhanced computational abilities for complex tasks. And secondly, the CNP system also has a dynamic thresholding mechanism. The output of a neuron depends on its input, as well as on a dynamically adjusted threshold. By adjusting the thresholds dynamically, the CNP can adapt to various environments and task demands, thus enhancing computation’s flexibility and adaptability. In order to better handle the image fusion problem, B. Li et al. [
20] designed the CNP system as a neural array with local topology, which is a CNP system with local topology.
A CNP system with an m × n sublocal topology is defined as follows:
where
is an alphabet, and the object
is called a spike. In this model, the spikes play a neuronal role. Spikes can generate and propagate pulse signals, simulating action potentials in neurons. These pulsed signals are used to transfer and process information in the computational model. The connection and coupling between the spikes form the network structure for parallel computing through the interactions in the network.
is an array of coupled neurons of
in the specific form of
, where
.
where
is the neighborhood radius.
According to the spike mechanism, the state equation of neuron
is as follows:
where
is the spike received by neuron
from the neighboring neuron
,
is the corresponding local weight,
is the external stimulus, and
is the spike generated when neuron
is excited [
20].
If X-ray images of bronze mirrors A and B, respectively, equate to
and
, the low-pass subbands of two bronze mirror X-ray images are used externally for
and
. The two CNP systems operate from an initial state until the number of iterations reaches
. Let
and
denote the excitation matrices related to
and
, respectively, where
and
, and let
denote the excitation frequency of neuron
in the
. Then, the fusion rules for the low-pass subbands are as follows:
where
and
represent the low-frequency subband coefficients of the decomposition of the source image, and
represent the low-frequency coefficients of the fused image
located at
.
2.4. NSST Reconstruction
The fused image
was reconstructed through an inverse transformation method, using NSST, from
and
.
3. Results
To evaluate the effectiveness of the proposed fusion method, this study used four sets of eight registered Han Dynasty bronze mirror X-ray images for fusion performance testing. To ensure consistency, both the high- and low-energy X-ray images of the same bronze mirror must have the same size. The resolution of the images is set at 300 × 300. The image data provided for this study were obtained from the Shaanxi Provincial Institute of Cultural Relics Protection. The X-ray exposure of bronze mirrors was conducted using an ART-GIL350/6 fixed flaw detector manufactured by GILARDONI, Milan, Italy. This detector operates within a voltage range of 95 to 350 kV and has a maximum current of 5 mA.
Figure 3 illustrates the ART-GIL350/6 fixed flaw detector.
In this study, the proposed method is compared to eight other multi-scale image fusion techniques. This comparison is carried out using both subjective visual evaluation and the evaluation of six sets of objective metrics. These evaluations aim to authenticate the efficacy of the proposed method. The eight multi-scale fusion methods are LDR [
21], MDLatLRR [
22], F-PCNN [
23], NMP [
24], IVFusion [
25], PL-NSCT [
9], MMIF [
26], and IFE [
27], with the same parameters as in the papers on each of these eight methods. The objective evaluation metrics are selected as entropy (EN) [
28], average gradient (AG) [
28], mutual information (MI) [
29], gradient-based fusion performance (Q
AB/F) [
30], peak signal-to-noise ratio (PSNR) [
31], and visual information fidelity (VIF) [
32], totaling 6 metrics. The method uses the following parameters: the NSST decomposition consists of four layers, with 16, 16, 8, and 8 decomposition directions respectively, the MSMG operator has three scales, and
in the CNP system. The fusion performance tests were implemented on the Intel(R) Core(TM) i7-6700 CPU @ 3.40 GHz and 16.00 GB of RAM produced in Shanghai, China in the MATLAB 2018B environment, and the other compared fusion methods were implemented on the same platform using open-source code.
3.1. Qualitative Analysis
The Han Dynasty bronze mirrors are two-dimensional flat cultural relics with unique painting styles and artistic values, so it is important to subjectively analyze the effects of image fusion. In this study, a detailed subjective analysis of the fused images of the bronze mirrors is shown in
Figure 4,
Figure 5,
Figure 6,
Figure 7,
Figure 8,
Figure 9,
Figure 10,
Figure 11,
Figure 12 and
Figure 13. Specifically, in the fusion result of the bronze mirror images, blue and yellow rectangles represent crack areas with significant differences in the effectiveness of each method, while red rectangles represent texture areas with significant differences in the effectiveness of each method.
In
Figure 4 and
Figure 5, the method used in this study, as well as the other eight methods incorporate the edge information and the ornamentation and inscription information of the bronze mirror. However, the LRD, NMP, F-PCNN, and PL-NSCT aspects do not present the crack information of the upper part of the bronze mirror clearly. The fusion performance of the F-PCNN, LatLRR, IVFusion, PL-NSCT, and MMIF methods is found to be weak in terms of preserving the ornamentation information. These methods do not clearly present the ornamentation and inscriptions found on the bronze mirror. Additionally, the information appears blurred and cluttered in the fused images produced by these methods. On the other hand, the LRD, F-PCNN, LatLRR, PL-NSCT, MMIF, and IFE methods also struggled to clearly present the uneven rusting and breakage information on the mirror rim. This difficulty arose due to overexposure of the rim portion during the fusion process. Overall, the fusion quality of the LRD, NMP, and IFE methods is good, but some of the subtle lesions and ornamentation information are not completely preserved. In contrast to the methods, the method proposed in this paper achieves a clearer fusion of bronze mirrors. It particularly excels in preserving the detail information of features such as disease information on the rim of the mirrors, as well as breakage and inscriptions in the ornamentation area. These details are more prominently displayed in the fused images produced by the method, and
Figure 6,
Figure 7 and
Figure 8 show a comparison of specific details.
Because the third and fourth sets of bronze mirrors are more severely damaged, it is better to highlight the differences in the fusion methods. As shown in
Figure 9 and
Figure 10, the LRD, NMP, and IFE methods have better fusion quality in terms of ornamentation information. The pattern of the bronze mirror can be clearly seen using these methods. In contrast, the other methods exhibit poor fusion quality in terms of ornamentation information, with blurred ornamentation areas and insufficiently prominent internal cracks.
For the mirror edge region, the LRD, NMP, F-PCNN, LatLRR, PL-NSCT, and MMIF methods did not clearly reflect the rust disease information in the mirror edge portion. Furthermore, the IFE method showed a black shadow in the mirror edge region, covering the disease information in that area. As shown in
Figure 11, specific effect comparisons are presented.
The F-PCNN, LatLRR, IVFusion, PL-NSCT, and MMIF methods exhibit significant blurring in the fragmented portion of the bronze mirrors, failing to accurately represent the varying levels of fragmentation, as well as the transformation in thickness information, as shown in
Figure 12 and
Figure 13. The method used in this study demonstrates high contrast in the third and fourth sets of bronze mirror fusion. The decoration information is clearly visible, and the information regarding bronze mirror diseases, as well as the level of fragmentation in the bronze mirror fragments, is more prominently displayed. It is evident that the method utilized in this study enhances the visibility of the fused result in the fusion of bronze mirrors. This method enables clear visualization of detailed information, such as ornamentation, while effectively highlighting cracks, edge details, and the transformation of bronze mirror levels. As a result, the fused image of the bronze mirror exhibits improved information richness.
3.2. Quantitative Analysis
Table 1,
Table 2,
Table 3 and
Table 4, respectively, show the evaluation results of the first, second, third, and fourth sets of bronze mirrors on six objective evaluation indicators.
Table 5 displays the average values for six evaluation indices from 36 fusion result charts across four groups of eight bronze mirrors.
The method used in this paper outperforms the other comparative methods on four of these metrics, and the remaining two metrics are better than most of the comparative methods. EN responds to the richness of the image information: generally, the larger the EN value, the richer the information. The AG value indicates the sharpness of the fused image, with a larger value indicating a sharper image. MI represents how much information of the source image is acquired by the fused image, and a larger value indicates that the fused resultant map retains more information of the source image. QAB/F, on the other hand, reflects the quality of the visual information obtained from the source image by the fusion resultant map, with larger values indicating that more of the original information is retained after fusion. PSNR reflects whether the image is distorted or not, and ideally, a larger value indicates better image quality. VIF is indeed an evaluation index that incorporates the quality of human visual perception and has a strong correlation with human judgment of visual quality. In the context of methodology concerning cultural relics protection, the specific nature of this work emphasizes the significance of subjective human eye judgment. Hence, the introduction of VIF allows for a comprehensive assessment of visual quality, aligning with the aim of effectively preserving cultural relics.
The method employed in this study demonstrates the highest values in all four evaluation indexes, namely AG, MI, QAB\F, and VIF. Notably, it excels particularly in the VIF index, suggesting that this method is adept at capturing more intricate details from the source image during the fusion process of X-ray images of bronze mirrors. As a result, the fused image exhibits a richer texture, aligning more closely with human visual perception.
The method of this study is ranked at a medium level on the PSNR index, which indicates that some noise is present in the fusion process. However, due to the specificity of PSNR, the score cannot be completely aligned with the visual quality perceived by the human eye. The sensitivity of human vision to errors is not absolute, and its perception can be influenced by several factors and variations. In the context of the fusion of bronze mirrors, it is important to focus on the visual perception of the human eye. Therefore, the PSNR index ranking can be considered acceptable.
Figure 14 represents the visualization of six evaluation indicators for the first, second, third, and fourth sets of bronze mirrors in different fusion methods. From the figure, although the method can still be further improved to a certain extent, such as in improving the PSNR value, the method in this chapter has demonstrated significant advantages in bronze mirror image fusion processing. Notably, considering the large difference in some data, the values of Q
AB/F, PSNR, and VIF, shown in
Figure 14, are proportionally scaled for analysis.
In addition, this paper compares the computational costs of the proposed method and the comparison methods. In order to conduct this evaluation fairly, all methods are programmed by MATLAB 2018B and executed on a desktop equipped with an Intel(R) Core(TM) i7-6700 CPU @ 3.40 GHz and 16.00 GB of RAM. Then, the average running time of each method processing the entire bronze mirror images was statistically analyzed and listed in
Table 6.
This table shows that the proposed method is competitive with the recent image fusion methods in terms of computational efficiency. Specifically, the running time of the proposed method is comparable to that of the PL-NSCT method and significantly better than those of the LRD, NMP, F-PCNN, LatLRR, IVFusion, and MMIF methods. The IFE method results in the shortest execution time but does not yield competitive results.
In summary, the method employed in this paper effectively preserves both the decorative and disease information present in the source images of bronze mirrors. It achieves a visual quality that closely resembles human perception and surpasses the performance of the other eight methods in terms of overall fusion results. This significant outcome holds great potential for advancing research and conservation efforts concerning bronze mirrors.
4. Conclusions and Discussion
This study presents a method for fusing X-ray images of bronze wares using pixel-level-based image fusion technology, with a multi-scale morphology gradient in the NSST domain and a local topology CNP system. Indeed, the proposed method has been specifically developed to address the unique requirements of the X-ray imaging process applied to bronze wares. The method takes into consideration the distinct characteristics and requirements associated with cultural relics research and preservation efforts. By tailoring the fusion approach to suit the particularities of bronze wares, the method can effectively enhance the quality of X-ray images and assist in the analysis, research, and conservation of these valuable cultural relics.
In high-frequency information fusion, using MSMG to represent the nature of the contrast strength between the pixels in the image and their neighboring pixels can be combined with WSEML. This combination ensures that the detail information of the source image is fully retained. Notably, this approach is well-suited for bronze wares research and conservation work, as it enables the scientific and comprehensive analysis of bronze wares. For the fusion of low-frequency information, a local topology CNP system is employed. This system draws inspiration from the impulse discharge mechanism of coupled neurons. The rich complementary information within the system can stimulate more local neuron discharges, resulting in enhanced clarity in the corresponding regions of the fused image. In this study, a comparative analysis was conducted using four groups of eight Han Dynasty bronze mirror X-ray images and eight multi-scale fusion methods from the control group experimental data. According to the experiment, the AG, MI, QAB/F, and VIF indicators showed an average increase of 4%. The objective was to effectively retain the maximum degree of information related to the bronze mirror rim disease and mirror center decoration.
This method fulfills the observation and protection requirements for the detailed features and conditions of bronze mirror jewelry. It enhances the edge and detail information within the image, thereby fully demonstrating the research value of bronze mirrors. This method reduces the difficulty of analyzing non-destructive testing of X-ray images of bronze mirrors. It provides a solution to the problems existing in X-ray non-destructive testing of bronze mirrors, helping to study and preserve the characteristics of bronze mirrors in cultural relic research and protection.
As the research progresses, a meaningful direction for further research in the field has been identified. The image evaluation indicators used in this paper are universal evaluation indicators, and the differences in bronze mirror types and thicknesses currently studied are relatively small. To accommodate the analysis of more complex, three-dimensional large-scale bronze artifacts, it is necessary to add depth operators to the evaluation operators for analysis. In the future, through in-depth research in this direction, the methods proposed in this paper can provide more valuable assistance for the protection and repair of bronze mirrors and even large bronze artifacts.