1. Introduction
With the expansion of the power grid and increasingly serious environmental pollution, insulator contamination flashover has become the biggest threat to the safe and stable operation of the power system. It is a traditional measure for contamination flashover prevention to formulate a cleaning plan according to the distribution of contamination areas and to clean insulators regularly. This methodology can reduce the occurrence of contamination flashover to a certain extent and improve the reliability of the power grid. However, due to the lack of actual contamination grades for insulators, the formulation of cleaning plans is often inaccurate. Another disadvantage of this method is that due to the complexity of the climate and the environment, as well as on-site contamination accumulation, the cleaning effect is difficult to maintain for a long time, while contamination flashover caused by poor cleaning quality can also occur. In addition, the long power failure time and the risks of high-altitude operations also restrict extensive regular cleaning. Relying on regular cleaning to prevent contamination flashover requires high maintenance costs and significant human resources, which do not meet the requirements of an intelligent power grid. Advanced technical means can be used to monitor the contamination severity, flexibly arrange the cleaning work according to the contamination severity, and promote the transformation from planned maintenance to condition-based maintenance (CBM), which is the focus of contamination flashover prevention in the future. In order to accurately detect insulator contamination grades, researchers have carried out a lot of research. At present, the detection methods for assessing the contamination level of insulators mainly include evaluating the equivalent salt deposit density (ESDD), measuring the leakage current, using infrared temperature measurement methods, and so on [
1,
2,
3,
4,
5,
6,
7,
8]. The ESDD method is widely used today for contamination measurement. Since the acquisition of salt deposit density and non-soluble deposit density samples requires turning off the power, the process is complex and cannot facilitate efficient detection of contamination in the field. In the leakage current method, the leakage current on the surface of the insulator is analyzed to identify the contamination state or to issue a flashover warning. However, the leakage current is affected by the operating voltage, ambient temperature, humidity, type of insulator, and string length [
9,
10,
11]. These issues have hindered the practical application of the leakage current method. In the infrared temperature measurement method, the heat generated by the leakage current is used to identify the degree of contamination [
12]. An infrared imager is used to measure the surface temperature of the insulator, and the state of contamination is then identified by analyzing the temperature features. Since the surface temperature of an insulator in the field can be affected by several factors, including the heat dissipation condition, radiation from sunlight, and ambient temperature, many issues remain to be resolved for the infrared method. In theory, the detection of partial discharge intensity using ultraviolet images or ultrasonic signals can facilitate the monitoring of contamination severity. However, as the obvious discharge phenomenon can only be formed on the surface of contaminated insulator when the environmental humidity is high, the application of this method is restricted by environmental factors. Moreover, the relationship between partial discharge intensity, contamination grade, environmental humidity, and other factors is complex, and there are still many problems that need to be further studied. Compared to these methods, the method of identifying the contamination grade by analyzing a visible-light image is not affected by the ambient temperature and humidity, is cost effective, requires no power outage, and is non-contact. RGB and HSI are two widely used color space models for color images. For the RGB color model, red (R), green (G), and blue (B) components are utilized to represent color images [
13,
14]. for the HSI color model, hue (H), saturation (S), and intensity (I) components are utilized to represent color images. The two color models reflect the color differences of insulators with different contamination grades in the two color spaces. The comprehensive utilization of the information contained in the two color spaces can help detect the contamination state of insulators more accurately.
Compared to the conventional shallow learning method, a deep learning algorithm has a better capability of discovering and describing the complicated internal characteristics of a problem. Because it has a powerful learning ability, deep learning is able to obtain deep features from raw information directly [
15]. Deep belief network (DBN) is a kind of representative deep network composed of restricted Boltzmann machines (RBMs). It has the capability of discovering deep structure information and learning complicated nonlinear features from data.
In this work, a method for identifying the contamination state using multimodal deep learning of image color information is proposed. The mathematical morphology-improved optimal entropic threshold (OET) segmentation method is used to extract information from the image of an insulator disk and eliminate the influence of background. Feature calculations of visible-light images of insulators in the field are carried out separately in the RGB and HSI color spaces, and the original features from the two color spaces are fused with kernel principal component analysis (KPCA) to obtain the fused features. Finally, classifiers are constructed using a DBN to realize the accurate recognition of insulator contamination grades using the fused features. Compared with image recognition algorithms such as convolutional neural network (CNN) and graph convolutional network(GCN), the scheme proposed in this paper uses surface color information to realize contamination grade recognition, avoid the influence of background, edge, structure, and other parameters independent of contamination severity, and realize a method that is more suitable for the research objectives and application scenarios. The process of insulator contamination grade recognition is shown in
Figure 1.
5. Recognition of the Contamination Grade
After the photographing, the contamination sampling and testing of insulators are carried out. The on-site photographing and contamination sampling are shown in
Figure 7. In the laboratory, the contamination grade test is conducted with reference to GB/T 16434-1996 to determine the contamination grade of each insulator. The standard for the insulator contamination grades is shown in
Table 1.
In the contamination test, no grade 0 samples were found in the sampled insulators, therefore the follow-up research only focuses on the four contamination grades of I to IV.
For the insulator image taken on site, the mathematical morphology-improved OET segmentation method is used to extract the insulator from the background, and the extracted insulator area is selected as the region of analysis. Sixty-six features of the region—namely, the mean, median, maximum, minimum, common, range, variance, skewness, kurtosis, energy, and entropy of the six color components of R, G, B, H, S, and I—are used as the original feature quantities. The calculation formulas for several of these features are:
Entropy
:
where
is the value of a color component in the insulator disk area,
is its distribution probability, and
. In order to improve the comparability of data and the operation speed of the classifier, each group of features needs to be normalized. The normalization formula is as follows:
where
is the normalized value of the feature,
is the original value of the feature,
is the number of features,
is the sample serial number, and
and
are the minimum and maximum values of the
k-th feature, respectively.
Feature extraction is then performed on the 66 original features of each image using KPCA. The DBN is trained by the fused features from the KPCA to create a recognition model for the insulator contamination grades. The proposed method is validated using test samples. For the KPCA, the dimension of the input feature is 66 and the dimension of the output fused feature is 20. For the DBN, the input node number (i.e., the input feature dimension) is 20. The node numbers of the three hidden layers in the DBN are set to 40, 20, 10, respectively. The output node number is set to 4. For the remaining parameters, the learning ratio and the weight decay ratio are set to 0.1 and 0.0001, respectively. For the output state of the DBN, [0,0,0,1], [0,0,1,0], [0,1,0,0], and [1,0,0,0] represent the contamination grade from I to IV, respectively. For each contamination grade, 120 images are utilized as training samples for the training of the DBN. For each contamination grade, 40 images are utilized as test samples, and the trained classifier is used to identify them.
To verify the improvement of the contamination grade recognition derived from information fusion, contamination grade recognition was performed using the proposed algorithm on the 33 features of the R, G, and B color components and the 33 features of the H, S, and I color components of the same set of samples. For the KPCA, the dimension of the input feature is 33 and the dimension of the output fused feature is 10. For the DBN, the input node number (i.e., the input feature dimension) is 10. The node numbers of three hidden layers are 40, 20, and 10, respectively. The output node number is set to 4. All the other parameter values are the same as given previously. The recognition results are shown in
Table 2.
Results show that the accuracy of contamination grade recognition using the color features of RGB and HSI alone are 81.25% and 85%, respectively. In contrast, the recognition accuracy using the fused features obtained by the KPCA has been significantly improved, reaching 95.625%. Compared with the RGB and HSI color-space features before fusion, the fused feature makes full use of the information of the two color spaces to more comprehensively reflect the contamination state of insulators. It has better separability and has significantly improved the accuracy of contamination grade recognition.
Figure 8 shows the confusion matrices of the contamination grade recognition.
In order to verify the improvement of the recognition accuracy of the deep learning method compared with the shallow learning method, classifiers were constructed using BP and SVM to realize contamination grade recognition. The input of the two classifiers is consistent with the above DBN, which is a 20-dimensional fused feature extracted by the KPCA. The structure of the BP network is 20-16-4 and the learning rate is set to 0.1. The kernel function of the SVM adopts RBF and the penalty coefficient is set to 10. The training samples and test samples are consistent with the samples used by the DBN. After testing, the recognition accuracy of BP and SVM are 80.625% and 83.75% respectively. It can be seen that deep learning can make better use of feature information, and the recognition accuracy of the DBN is obviously better than the two classical shallow learning methods.
In order to study the impact of each feature on the performance of the algorithm, ablation analysis is used to remove one input feature at a time from the best performance, observe the change of the accuracy of the algorithm, and determine the features that have greater impacts on the performance.
Figure 9 shows the recognition accuracy after removing each RGB color feature. The ablation analysis diagrams of HSI color features are shown in
Figure 10.
It can be seen from
Figure 9 and
Figure 10 that 11 features, such as R-mean, R-maximum, G-minimum, G-skewness, B-mean, B-common, H-mean, H-median, H-common, H-skewness and I-skewness, have a greater impact on the performance of the algorithm, and the accuracy reduction caused by removing any feature alone is more than 10%.
The parameters of the computer used in this study are i5-9500 CPU, 3 GHz basic frequency and 16GB RAM. Running in the MATLAB 2019 environment, the disk segmentation of 480 test images takes 104.4 s, the feature calculation and feature fusion takes 115.2 s, the DBN network training takes 585.6 s, and the whole training process takes 805.2 s in total. The algorithm performs well in terms of time complexity. In the follow-up research, the operation time can be further reduced by using a commercial algorithm framework or hardware platform.
The current research is focused on brown porcelain external insulation. In the future, research can be carried out on white porcelain insulators, glass insulators, and composite insulators to build contamination grade classifiers for different materials and types of insulators. Furthermore, after further improvement, the contamination grade recognition method proposed in this paper can be combined with inspection platforms, such as robots and UAVs, to realize remote and unmanned detection of insulator contamination status.