Article

A Novel Transformers Fault Diagnosis Method Based on Probabilistic Neural Network and Bio-Inspired Optimizer

1
College of Information Engineering, Nanchang University, Nanchang 330031, China
2
College of Qianhu, Nanchang University, Nanchang 330031, China
*
Author to whom correspondence should be addressed.
Sensors 2021, 21(11), 3623; https://doi.org/10.3390/s21113623
Submission received: 16 April 2021 / Revised: 14 May 2021 / Accepted: 17 May 2021 / Published: 23 May 2021
(This article belongs to the Section Fault Diagnosis & Sensors)

Abstract

Traditional fault diagnosis methods based on dissolved gas analysis (DGA) struggle to meet today's engineering needs in terms of diagnostic accuracy and stability. This paper therefore proposes an artificial intelligence fault diagnosis method based on a probabilistic neural network (PNN) and a bio-inspired optimizer. The PNN serves as the basic classifier of the fault diagnosis model, and the bio-inspired optimizer, an improved salp swarm algorithm (ISSA), optimizes the smoothing factor of the PNN hidden layer, which stably improves the classification performance of the PNN. Compared with the traditional SSA, ISSA introduces the sine cosine algorithm (SCA) and a disruption operator, which effectively improve the exploration capability and convergence speed. To verify the engineering applicability of the proposed method, the ISSA-PNN model was developed and tested using sensor data provided by Jiangxi Province Power Supply Company. In addition, the method is compared with machine learning methods such as the support vector machine (SVM), back propagation neural network (BPNN), and multi-layer perceptron (MLP), and with traditional fault diagnosis methods such as the International Electrotechnical Commission (IEC) ratio method. The results show that the proposed method has a strong learning ability for complex fault data and has advantages in accuracy and robustness over the other methods.

1. Introduction

Oil-immersed power transformers are among the most expensive and essential pieces of equipment in power systems [1,2,3]. During operation, they are subjected to electrical, thermal, chemical, and mechanical stresses, which can lead to the aging and deterioration of their insulation. In addition, insulation degradation or moisture introduced by the external environment can accelerate the aging process, reduce the dielectric strength, and lower the partial discharge initiation voltage. Insulation defects are the most common cause of failure in excitation transformers and directly affect the reliability of the equipment [4,5]. With power demand growing steadily, a power transformer failure is likely to interrupt the power supply and cause significant economic losses. Therefore, quickly and accurately diagnosing the type of fault during transformer operation has become an important issue in promoting the smart grid.
Currently, the dissolved gas analysis (DGA) method is widely used in the fault diagnosis of oil-immersed transformers [6,7]. Mechanical, electrical, and thermal faults in an oil-immersed power transformer degrade the insulating oil and increase the concentration of certain gases. The causes of gas generation include corona (partial discharge), low-energy sparks, arcing, cellulose overheating, and insulation overheating. In this case, gases such as carbon monoxide (CO), carbon dioxide (CO₂), hydrogen (H₂), methane (CH₄), ethane (C₂H₆), ethylene (C₂H₄), and acetylene (C₂H₂) are dissolved in the oil in different proportions. Although the oil also contains oxygen (O₂) and nitrogen (N₂), these enter the transformer from outside and are not related to the degradation of the insulating oil [8]. Therefore, by using intelligent sensors to obtain and analyze dissolved gas data from the oil, we can monitor the operating condition of the transformer in real time and quickly and efficiently detect hidden internal problems and faults [9,10]. The classical fault diagnosis methods based on DGA data include Doernenburg [11], Rogers [12], IEC 599 [13], and IEC 60599 [14,15]. They examine the concentrations of hydrogen (H₂), methane (CH₄), ethane (C₂H₆), ethylene (C₂H₄), and acetylene (C₂H₂) and the relationships between them, and then evaluate the transformer operating condition according to pre-defined thresholds [16].
Although the methods above are easy to apply, they require a large amount of engineering experience, are not sufficiently generalized, and their fault detection accuracy is limited. Thus, they may not be reliable enough for predicting fault types [17]. To improve the fault diagnosis accuracy to meet the current industrial demand, scholars in related fields have combined traditional fault diagnosis methods with emerging artificial intelligence technologies to improve the accuracy of fault diagnosis significantly. A series of classical statistical learning methods such as hidden Markov models (HMM) [18], support vector machine (SVM) [19], k-nearest neighbor (KNN) [20], etc., are applied to process DGA data to improve the accuracy of fault diagnosis. With the continuous development of data-driven technologies, new machine learning methods with better performance are combined with traditional fault diagnosis methods to achieve excellent diagnostic results. Wang et al. [21] improved the traditional fault diagnosis method, i.e., the dielectric response method. Low-frequency dielectric parameters were first extracted using mixed-frequency excitation. Then, the extended Debye equivalent circuit parameters were determined using the cuckoo search (CS) optimization algorithm. Finally, the specific parameters were used for testing to establish a simulation model and obtain a recovery voltage curve. Compared with the traditional method, this method greatly reduces the test time. To cope with the high uncertainty and variability of DGA data, Kirkbas et al. [22] used the information-rich feature data set obtained by spectral techniques along with the common vector approach (CVA) for the fault mode identification of DGA data. The CVA-based method is well suited and has better fault diagnosis performance than the traditional SVM-PSO method, as demonstrated by examples. Jiang et al. [23] proposed lasso regression to build a multidimensional linear model of the selected features. 
The change point detection method based on lasso regression is based on the minimum days and standard deviation (SD) between the change point and fault time, which accurately reflects the location of the transformer fault in most cases. The method provides an effective method for dynamic fault prediction based on dissolved gas data with the advantages of robustness and no data training. The application of an artificial neural network (ANN) in the field of fault diagnosis of oil-immersed transformers [24] has led to a significant improvement in the reliability of diagnosis. Meanwhile, optimization schemes for a neural network are becoming more abundant and mature. Yang et al. [17] proposed a power transformer fault diagnosis system combining a polynomial logistic regression model and a back propagation neural network (BPNN) to determine the type of transformer faults by analyzing the dissolved gases in the transformer. The test results show that this intelligent fault diagnosis system’s recognition rate is about 10–30% higher than that of the single neural network or multi-neural network recognition system without a polynomial logistic regression model. Huang and Wang et al. [25] proposed a transformer fault diagnosis method based on the gray wolf optimization (GWO) algorithm to optimize the hybrid kernel function learning machine. The parameters of the hybrid kernel function can be optimized using the GWO algorithm. Simultaneously, the initial population parameters of the GWO algorithm are generated by using chaotic logistic mapping to avoid the adverse effects of overly fast convergence on the optimization results, which effectively improves the classification accuracy. Dai et al. [26] developed an oil-immersed fault diagnosis model based on a deep belief network (DBN) and compared the performance of the method with the support vector machine (SVM), back propagation neural network (BPNN), and ratio method. 
The results show that the method significantly improves the accuracy of power transformer fault diagnosis. Ou et al. [27] proposed a dynamic Adam and dropout-based deep neural network (DADDNN) for oil-immersed power transformer fault diagnosis. Ou et al. utilized the dropout technique to randomly reset some neurons to prevent overfitting and indirectly enhanced the information exchange between them.
It is clear that ANN has promising applications in fault diagnosis. There are many different types of ANN available for classification tasks. Probabilistic neural network (PNN), as a radial neural network, has a strong fault tolerance. PNN can converge to a Bayesian classifier as long as sufficient sample data are available, which is more efficient than other network models such as the multi-layer perceptron (MLP) and the back propagation neural network (BPNN). In addition, PNN has some advantages in solving multi-classification problems [28,29].
In this paper, PNN was chosen as the basis for the fault diagnosis classifier. The classification performance of PNN is easily affected by the smoothing factor (σ) of the hidden layer [30]: an unsuitable σ can make the network converge too quickly and lead to a significant decrease in classification accuracy. Therefore, we used the improved salp swarm algorithm (ISSA), which enhances SSA with the sine cosine algorithm (SCA) and a disruption operator ($D_{op}$), to select the smoothing factor (σ). SCA is introduced into the update mechanism of SSA, which enhances the exploration capability and reduces the possibility of becoming trapped in a local optimum. The disruption operator ($D_{op}$) is used to improve the population diversity and maintain the balance between the exploration and exploitation processes. To verify whether the ISSA-PNN model is applicable to oil-immersed transformer fault diagnosis, we compared it with traditional fault diagnosis methods and other intelligent algorithm models on the same set of DGA data. The experimental results show that ISSA performs better on the DGA experimental data set than the traditional SSA, and that the ISSA-PNN method outperforms the other fault diagnosis methods in terms of accuracy, diagnosis efficiency, and robustness.
The rest of the paper is organized as follows: Section 2 describes the proposed method. In Section 3, the transformer fault diagnosis model is described. Section 4 presents the experimental results, and Section 5 discusses them. Finally, the conclusion is drawn in Section 6.

2. The Proposed Method

In this section, we present the proposed fault diagnosis method for power transformers. We first introduce the salp swarm algorithm (SSA), the sine cosine algorithm (SCA), and then discuss the improved salp swarm algorithm (ISSA). Finally, the ISSA-based probabilistic neural network fault diagnosis model is described in detail.

2.1. Salp Swarm Algorithm

The salp swarm algorithm (SSA) is a novel intelligent optimization algorithm proposed by Seyedali Mirjalili et al. in 2017 [31]. The algorithm performs an optimization search process in the solution space by simulating the salps swarm’s predation behavior. In the deep sea, the salp group moves and feeds in a chain behavior. There are leaders and followers in the chain, and the leaders move towards the food and guide the followers to follow them. At each iteration, the leader performs global exploration while the followers fully explore locally. Compared with other algorithms, the iterative optimization-seeking mechanism of the SSA algorithm dramatically reduces the cases of falling into the local optimum.
The specific process of SSA can be divided into two steps, as follows.
Step 1: Initialization of SSA. Set the population size N, the spatial dimension d, and the maximum number of iterations T, and initialize the positions of the salp population by Equation (1):
$$X_{N,d} = \mathrm{rand}(N, d) \cdot (ub - lb) + lb \tag{1}$$
where $ub$ and $lb$ denote the upper and lower bounds of the search space, respectively, and the matrix $X_{N,d}$ stores the positions of the salp population.
Step 2: Position Update.
(1) The leader position is updated, as shown in Equation (2):
$$X_j^{leader} = \begin{cases} F_j + c_1\left((ub - lb)\,c_2 + lb\right), & c_3 \ge 0.5 \\ F_j - c_1\left((ub - lb)\,c_2 + lb\right), & c_3 < 0.5 \end{cases} \tag{2}$$
where $X_j^{leader}$ and $F_j$ in Equation (2) represent the leader and food positions in the jth dimension, respectively. Since the position of the food (the target) is not known during the actual iterative process, the position of the salp with the current best fitness value is used as the food position in each iteration. $c_1$, $c_2$, and $c_3$ are control parameters. $c_1$ is the convergence factor, the essential control parameter in SSA, which balances global search against local exploitation. Its expression is:
$$c_1 = 2 e^{-\left(\frac{4t}{T}\right)^2} \tag{3}$$
where $t$ is the current iteration number. The convergence factor therefore decreases from 2 towards 0 over the iterations. $c_2$ and $c_3$ are random numbers in [0,1], which add randomness to $X_j^{leader}$ and so improve the global search capability of the algorithm.
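Equations (1) to (3) can be sketched in Python as follows. This is a minimal illustration rather than the authors' MATLAB implementation: the function names are hypothetical, scalar bounds `lb` and `ub` are assumed to apply to every dimension, and clamping the leader back into the bounds is an added assumption that the text does not state.

```python
import math
import random


def init_population(n, d, lb, ub, rng):
    """Equation (1): uniform random positions in [lb, ub] for n salps in d dimensions."""
    return [[rng.random() * (ub - lb) + lb for _ in range(d)] for _ in range(n)]


def convergence_factor(t, T):
    """Equation (3): c1 = 2 * exp(-(4t/T)^2), which decays from 2 towards 0."""
    return 2.0 * math.exp(-((4.0 * t / T) ** 2))


def update_leader(food, lb, ub, c1, rng):
    """Equation (2): place the leader around the food position, one dimension at a time."""
    new_pos = []
    for f_j in food:
        c2, c3 = rng.random(), rng.random()
        step = c1 * ((ub - lb) * c2 + lb)
        x_j = f_j + step if c3 >= 0.5 else f_j - step
        # Clamping to the search bounds is an assumption, not part of Equation (2).
        new_pos.append(min(max(x_j, lb), ub))
    return new_pos


if __name__ == "__main__":
    rng = random.Random(42)
    population = init_population(n=5, d=1, lb=0.01, ub=1.0, rng=rng)
    c1 = convergence_factor(t=1, T=50)
    print(update_leader(population[0], 0.01, 1.0, c1, rng))
```

Because $c_1$ shrinks quickly, the leader takes large steps around the food position in early iterations and very small ones near the end.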
(2) The followers advance in a chain, each one influenced by the individuals immediately before and after it. Their displacement follows Newton's laws of motion and can be expressed as
$$X_j^i = \frac{1}{2} a t^2 + v_0 \Delta t \tag{4}$$
where $X_j^i$ is the position of the ith follower in the jth dimension, and $a$ is the acceleration, calculated as
$$a = \frac{v_{final} - v_0}{\Delta t} \tag{5}$$
where $v_{final} = (X_j^{i-1} - X_j^i)/\Delta t$, and $X_j^{i-1}$ is the position of the (i−1)th salp in the jth dimension. Since $v_0 = 0$ and time is measured in iterations, i.e., $\Delta t = 1$, Equation (4) can be expressed as
$$X_j^i = \frac{X_j^i + X_j^{i-1}}{2} \tag{6}$$
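The step from Equation (4) to Equation (6) can be made explicit. The sketch below assumes that Equation (4) gives the displacement added to the follower's current position during one iteration ($t = \Delta t = 1$). The symbol $\tilde{X}_j^i$ is introduced here only to distinguish the updated position from the current one.

```latex
\begin{aligned}
a &= \frac{v_{final} - v_0}{\Delta t} = X_j^{i-1} - X_j^i
  && (v_0 = 0,\ \Delta t = 1) \\
\tilde{X}_j^i &= X_j^i + \tfrac{1}{2} a t^2
  = X_j^i + \tfrac{1}{2}\left(X_j^{i-1} - X_j^i\right)
  = \tfrac{1}{2}\left(X_j^i + X_j^{i-1}\right)
\end{aligned}
```

For example, a follower at 0.2 whose predecessor sits at 0.6 moves to the midpoint, 0.4.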

2.2. The Sine and Cosine Algorithm

The sine cosine algorithm (SCA) is a stochastic optimization algorithm that is highly flexible, simple in principle, easy to implement, and readily applied to optimization problems in different fields [32]. Its optimization process can be divided into two phases. In the exploration phase, the algorithm quickly locates promising regions of the search space by combining random solutions with a high rate of change. In the exploitation phase, the random solutions change gradually, at a lower rate than in the exploration phase.
In the sine cosine algorithm, the candidate solution is first randomly initialized. Then, the current solution is updated in each dimension according to the sine or cosine function combined with a random factor. The specific update equation is:   
$$X_j^{t+1} = \begin{cases} X_j^t + r_1 \cdot \sin(r_2) \cdot \left| r_3 P_j^t - X_j^t \right|, & r_4 > 0.5 \\ X_j^t + r_1 \cdot \cos(r_2) \cdot \left| r_3 P_j^t - X_j^t \right|, & r_4 \le 0.5 \end{cases} \tag{7}$$
In Equation (7), $X_j^t$ is the position of the current individual in the jth dimension at the t-th generation, $r_2$ is a random number from 0 to $2\pi$, $r_3$ is a random number between 0 and 2, $r_4$ is a random number from 0 to 1, and $P_j^t$ denotes the position of the optimal individual in the jth dimension at iteration t. $r_1$ can be expressed as
$$r_1 = a - t\frac{a}{T} \tag{8}$$
where $a$ is a constant, $t$ is the current number of iterations, and $T$ is the maximum number of iterations. The value of $r_1$ decreases gradually over the iterations, balancing the algorithm's local exploitation and global search capability.
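Equations (7) and (8) can be sketched in Python as follows. The function names are hypothetical, and the default `a = 2.0` is an illustrative choice, since the text only says that a is a constant. The absolute value around `r3 * P - X` follows the usual SCA formulation and is assumed here.

```python
import math
import random


def sca_r1(t, T, a=2.0):
    """Equation (8): r1 = a - t * a / T, decreasing linearly from a to 0."""
    return a - t * a / T


def sca_update(x, best, r1, rng):
    """Equation (7): move each dimension of x relative to the best position P."""
    new_x = []
    for x_j, p_j in zip(x, best):
        r2 = rng.uniform(0.0, 2.0 * math.pi)
        r3 = rng.uniform(0.0, 2.0)
        r4 = rng.random()
        distance = abs(r3 * p_j - x_j)
        # Sine branch for r4 > 0.5, cosine branch otherwise, as in Equation (7).
        trig = math.sin(r2) if r4 > 0.5 else math.cos(r2)
        new_x.append(x_j + r1 * trig * distance)
    return new_x


if __name__ == "__main__":
    rng = random.Random(0)
    print(sca_update([0.3, -0.7], [0.0, 0.0], sca_r1(t=1, T=50), rng))
```

When $r_1$ reaches 0 at the final iteration, the update leaves the position unchanged, which is the exploitation limit of the schedule.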

2.3. Improved Salp Swarm Algorithm

To improve the exploration and exploitation capabilities of SSA, we used SCA instead of the traditional follower position update mechanism in SSA and introduced a disruption operator ($D_{op}$) to increase the diversity of the salp population [33]. Liu et al. [34] defined the disruption operator as follows:
$$D_{op} = \begin{cases} D_{i,j} \cdot \mathrm{rand}(-2, 2), & \text{if } D_{i,best} \ge 0.2 \\ 1 + D_{i,best} \cdot \mathrm{rand}\left(-\frac{1}{20}, \frac{1}{20}\right), & \text{otherwise} \end{cases} \tag{9}$$
In Equation (9), $D_{i,j}$ represents the distance between the ith solution and the jth nearest solution, and $D_{i,best}$ describes the distance between the ith solution and the best solution.
The initialization process of ISSA and the leader update mechanism are consistent with the traditional SSA. However, the update mechanism of followers is not the same as SSA. At this time, the update method of SCA is selected instead of the traditional SSA update method, that is, using Equation (7) for the position update of followers.
The disruption operator is applied after the position update. To reduce the computation time in this phase, it is applied only with a certain probability, as shown in Equation (10):
$$X = \begin{cases} X \cdot D_{op}, & \delta_0 > 0.5 \\ X, & \delta_0 \le 0.5 \end{cases} \tag{10}$$
where the left-hand $X$ represents the salp population updated by the disruption operator. It can be seen from Equation (10) that the disruption operator diversifies the salp population only when the random number $\delta_0$ is greater than 0.5. The pseudo-code of ISSA (SCA-SSA) is shown in Algorithm 1.
Algorithm 1 Improved salp swarm algorithm.
Initialize parameters: population size N, dimension d, maximum number of iterations T.
Generate the initial population X by Equation (1);
Calculate the fitness value of each individual;
while t <= T do
    Update $c_1$ by Equation (3) and $r_1$ by Equation (8);
    for i = 1 : N do
        if $X_i$ is a leader then
            Update random numbers $c_2$ and $c_3$;
            Update the position of the leader salp as in Equation (2);
        else
            Update random numbers $r_2$, $r_3$ and $r_4$;
            Update the position of the follower salp as in Equation (7);
        end if
        Calculate $D_{op}$ using Equation (9);
    end for
    if $\delta_0 > 0.5$ then
        $X = X \cdot D_{op}$;
    end if
    Set t = t + 1;
end while
Output: Best classification and prediction results.
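Algorithm 1 can be turned into a compact, runnable Python sketch, shown below on a toy objective. It is an illustration under several stated assumptions, not the authors' implementation. Distances are Euclidean, $D_{i,j}$ is taken as the distance to the nearest other salp, and the 0.2 threshold and the ±1/20 range are one reading of Equation (9). The better half of the sorted population acts as leaders, a = 2 is used in Equation (8), and positions are clamped to the bounds. The names `issa`, `disruption`, and `sphere` are hypothetical.

```python
import math
import random


def disruption(pop, best, rng):
    """Per-salp D_op following one reading of Equation (9)."""
    dops = []
    for i, x in enumerate(pop):
        d_near = min(math.dist(x, y) for k, y in enumerate(pop) if k != i)
        d_best = math.dist(x, best)
        if d_best >= 0.2:
            dops.append(d_near * rng.uniform(-2.0, 2.0))
        else:
            dops.append(1.0 + d_best * rng.uniform(-1.0 / 20, 1.0 / 20))
    return dops


def issa(fitness, n, d, lb, ub, T, seed=0):
    """Minimize `fitness` over [lb, ub]^d; requires n >= 2."""
    rng = random.Random(seed)
    clip = lambda v: min(max(v, lb), ub)
    pop = [[rng.uniform(lb, ub) for _ in range(d)] for _ in range(n)]  # Eq. (1)
    best = min(pop, key=fitness)[:]
    for t in range(1, T + 1):
        c1 = 2.0 * math.exp(-((4.0 * t / T) ** 2))  # Eq. (3)
        r1 = 2.0 - t * 2.0 / T                      # Eq. (8), a = 2 assumed
        pop.sort(key=fitness)                       # better half become leaders
        for i in range(n):
            new = []
            for j in range(d):
                if i < n // 2:                      # leader update, Eq. (2)
                    c2, c3 = rng.random(), rng.random()
                    step = c1 * ((ub - lb) * c2 + lb)
                    new.append(best[j] + step if c3 >= 0.5 else best[j] - step)
                else:                               # follower update, Eq. (7)
                    r2 = rng.uniform(0.0, 2.0 * math.pi)
                    r3 = rng.uniform(0.0, 2.0)
                    trig = math.sin(r2) if rng.random() > 0.5 else math.cos(r2)
                    new.append(pop[i][j] + r1 * trig * abs(r3 * best[j] - pop[i][j]))
            pop[i] = [clip(v) for v in new]
        if rng.random() > 0.5:                      # Eq. (10)
            dops = disruption(pop, best, rng)
            pop = [[clip(v * k) for v in x] for x, k in zip(pop, dops)]
        candidate = min(pop, key=fitness)
        if fitness(candidate) < fitness(best):      # keep the best solution seen so far
            best = candidate[:]
    return best, fitness(best)


def sphere(x):
    """Toy objective with its minimum at the origin."""
    return sum(v * v for v in x)


if __name__ == "__main__":
    print(issa(sphere, n=10, d=2, lb=-5.0, ub=5.0, T=30))
```

Keeping the best solution found so far in `best` means the returned fitness can never be worse than that of the best initial salp, even when the disruption step scatters the population.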

2.4. Probabilistic Neural Network

A probabilistic neural network (PNN) is a radial basis network of the feed-forward type. Its advantages include a simple learning process, fast training, accurate classification, and good fault tolerance. In essence, it is a supervised network classifier based on the Bayesian minimum risk criterion.
Probabilistic neural networks generally have four layers: the input layer, pattern layer, summation layer, and output layer. The pattern layer is connected to the input layer by connection weights. It calculates the degree of matching (similarity) between the input feature vector and each pattern in the training set and feeds the resulting distance into a Gaussian function to obtain the output of the pattern layer. The output of each pattern unit is as follows:
$$\Phi_{ij}(x) = \frac{1}{(2\pi)^{1/2} \sigma^d} \, e^{-\frac{(X - x_{ij})^T (X - x_{ij})}{\sigma^2}} \tag{11}$$
where $X = [x_1, x_2, \ldots, x_n]^T$, $n = 1, 2, \ldots, l$; $d$ is the input feature dimension; $l$ is the total number of training types; $x_{ij}$ is the jth training sample of the ith type; and $\sigma$ is the smoothing factor. The summation layer averages the outputs of the pattern-layer neurons that belong to the same type:
$$v_i = \frac{\sum_{j=1}^{L} \Phi_{ij}}{L} \tag{12}$$
where $v_i$ represents the output of type i, and $L$ represents the number of pattern neurons of type i. The output layer outputs the highest-scoring category in the summation layer:
$$Type(v_i) = \arg\max(v_i) \tag{13}$$
In the topology of PNN, the number of neurons in the input layer equals the number of sample features, the number of neurons in the pattern layer equals the number of input sample vectors, and the number of neurons in the summation layer equals the number of sample categories. Therefore, for a pattern recognition task with four categories of samples, a variable number of samples in each category, and three features per sample, the network structure can be drawn as shown in Figure 1.
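The forward pass described by Equations (11) to (13) can be sketched in plain Python as follows. The function name, the dictionary layout of the training data, and the class labels "A" and "B" are hypothetical. The normalizing constant follows Equation (11) as printed, and since it is identical for every class it does not affect the arg max.

```python
import math


def pnn_predict(x, train_by_class, sigma):
    """Return (predicted_class, class_scores) for one feature vector x.

    train_by_class maps a class label to the list of its training vectors.
    """
    d = len(x)
    norm = 1.0 / (math.sqrt(2.0 * math.pi) * sigma ** d)  # prefactor of Eq. (11)
    scores = {}
    for label, patterns in train_by_class.items():
        # Pattern layer: one Gaussian unit per training vector, Eq. (11).
        outputs = [
            norm * math.exp(-sum((a - b) ** 2 for a, b in zip(x, p)) / sigma ** 2)
            for p in patterns
        ]
        # Summation layer: average over the units of this class, Eq. (12).
        scores[label] = sum(outputs) / len(outputs)
    # Output layer: the class with the highest score, Eq. (13).
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    train = {"A": [[0.0, 0.0], [0.1, 0.0]], "B": [[1.0, 1.0], [0.9, 1.0]]}
    print(pnn_predict([0.05, 0.0], train, sigma=0.3))
```

There is no iterative weight training here: the training vectors themselves are the pattern-layer weights, so σ is the only free parameter, which is why the optimizer is aimed at it.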

2.5. The Proposed ISSA-PNN Model

The classification performance of a PNN is easily affected by the smoothing factor σ. An overly large or overly small value of σ makes the network converge too quickly and fail to find the optimal solution, so the diagnostic classification accuracy drops significantly. To this end, we used the hybrid algorithm ISSA to find the most suitable σ and thereby establish an ISSA-PNN fault diagnosis model with good practical performance, high accuracy, and reliability. The optimization process of ISSA-PNN is shown in Figure 2.
The main specific steps of the ISSA-PNN fault diagnosis model are shown below.
  • Step 1: The pre-processed DGA data are input into PNN randomly, and the parameters are initialized.
  • Step 2: The initial parameters of ISSA are set: population size N; dimension d; and the maximum number of iterations T. Moreover, the population position of ISSA is initialized by Equation (1), and each salp individual represents a set of smoothing factors σ .
  • Step 3: The fitness values of the salps in the population are calculated and ranked. In this paper, the mean square error is set as the fitness function, as shown in Equation (14).
    $$f(x) = \frac{1}{N} \sum_{i=1}^{N} (Y_i - O_i)^2 \tag{14}$$
  • Step 4: The salp with the best fitness is taken as the current food position. Among the remaining $N - 1$ salps, the half with the better fitness are treated as leaders, and the rest as followers.
  • Step 5: Update $c_1$ and $r_1$ according to Equation (3) and Equation (8), respectively.
  • Step 6: Update the leader position by Equation (2) and the follower position by Equation (7).
  • Step 7: Calculate $D_{op}$ by Equation (9) and generate a random number $\delta_0$. If $\delta_0$ is greater than 0.5, diversify the salp population according to Equation (10).
  • Step 8: If the current number of iterations reaches the maximum number of iterations, proceed to the next step; otherwise, return to Step 5.
  • Step 9: Input the smoothing factor optimized by ISSA into the PNN to obtain a better-performing PNN model, and then input the test set data into the PNN to obtain the diagnostic results.
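The fitness evaluation in Step 3 can be sketched as a small Python function. The text does not define $Y_i$ and $O_i$ in Equation (14), so this sketch assumes they are the true and predicted numeric class codes of sample i. The classifier is passed in as a callable, and `stub_predict` is a purely hypothetical stand-in for a PNN so that the example stays short.

```python
def mse_fitness(sigma, predict, samples):
    """Equation (14), with Y_i the true class code and O_i the predicted one (assumed).

    predict(x, sigma) must return a numeric class code for feature vector x.
    samples is a list of (feature_vector, true_class_code) pairs.
    """
    squared_errors = [(y - predict(x, sigma)) ** 2 for x, y in samples]
    return sum(squared_errors) / len(squared_errors)


def stub_predict(x, sigma):
    """Hypothetical one-parameter classifier used only to exercise mse_fitness."""
    return 1 if x[0] < sigma else 2


if __name__ == "__main__":
    data = [([0.1], 1), ([0.4], 1), ([0.9], 2)]
    for candidate in (0.2, 0.5):
        print(candidate, mse_fitness(candidate, stub_predict, data))
```

An optimizer such as ISSA would call a function of this kind once per salp and per iteration, keeping the σ with the lowest value as the food position.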

3. Implementation and Experiment Setup

3.1. Model Implementation

The implementation framework of the proposed ISSA-PNN model for power transformer fault diagnosis is shown in Figure 3. As can be seen from the figure, the implementation of the proposed model is divided into three parts: data collection and processing, the training of the neural network, and the testing and evaluation of the network model. Firstly, some dissolved gas content of transformer oil is collected from the smart sensors inside the oil-immersed transformer as DGA data, and then the collected DGA data are pre-processed and filtered using an IEC three-ratio method. A random selection of 80% of the pre-processed DGA data are input into the ISSA-PNN model for training and optimization. The remaining data are used for testing and evaluating the performance of the diagnostic model.
In this paper, we focused on four types of oil-immersed power transformer faults to train and test the diagnostic models, namely low-temperature overheating (LT) (<150 °C), low-temperature overheating (LT) (150–300 °C), partial discharge (PD), and arc discharge (AD). Table 1 shows some real data used for judging the fault types of oil-immersed power transformers by the DGA method, obtained from power supply companies (PSCs) in some provinces of China. In addition, since the proposed model is based on a probabilistic neural network, each fault type is coded in the form shown in Table 2.

3.2. Data Collection and Pre-Processing

To ensure the validity of the experiment and reduce the influence of temperature, humidity, transformer model, and other parameters on the experimental data, we collected several groups of sensor data on the gases dissolved in oil-immersed transformer oil from Jiangxi Power Supply Company and a transformer factory as experimental data samples. For dissolved gas analysis, we selected the volume fractions of five gases (C₂H₂, C₂H₄, CH₄, H₂, C₂H₆) dissolved in transformer oil as the primary basis for judging the transformer fault type.
After excluding some noisy data and incomplete data samples and processing them by the IEC three-ratio method, 555 valid characteristic gas data samples were obtained, including 361 sets of low-temperature overheating (LT) (<150 °C), 40 sets of low-temperature overheating (LT) (150–300 °C), 65 sets of partial discharge (PD), and 89 sets of arc discharge (AD). Among them, 444 sets of data were used as training samples, and 111 sets of data were used as test samples. Some of the data samples are shown in Table 3.
The distribution of dissolved gas data for the four fault types after pre-processing is shown in Figure 4. The distributions of the C₂H₂/C₂H₄, CH₄/H₂, and C₂H₄/C₂H₆ gas ratios show apparent differences. Although the low-temperature overheating (<150 °C) fault has the most data samples, its data distribution is the most complicated, which makes it a difficult case in fault diagnosis.
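The three gas ratios named above can be computed from the five measured concentrations as in the following Python sketch. The function name and the sample concentrations are hypothetical, and returning `None` for a zero denominator is an assumption, since the text does not say how such samples were handled.

```python
def three_ratios(h2, ch4, c2h6, c2h4, c2h2):
    """Return (C2H2/C2H4, CH4/H2, C2H4/C2H6); None where the denominator is zero."""

    def ratio(numerator, denominator):
        return numerator / denominator if denominator else None

    return ratio(c2h2, c2h4), ratio(ch4, h2), ratio(c2h4, c2h6)


if __name__ == "__main__":
    # Hypothetical concentrations, all in the same unit (for example, µL/L).
    print(three_ratios(h2=100.0, ch4=50.0, c2h6=20.0, c2h4=40.0, c2h2=10.0))
```

Using ratios instead of raw concentrations makes the features independent of the absolute gas level, so samples from transformers of different size can be compared directly.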

3.3. Performance Evaluation

To validate the performance of the proposed model in power transformer fault diagnosis, we first compared ISSA-PNN with the conventional PNN model and with PNN models optimized by particle swarm optimization (PSO), the seagull optimization algorithm (SOA), the bat algorithm (BA), the multi-verse optimizer (MVO), and the salp swarm algorithm (SSA). Secondly, we compared it with hybrid back propagation neural network (BP) models from similar research literature [35,36], including BA-BP, cuckoo search (CS)-BP, and genetic algorithm (GA)-BP. Finally, we compared it with some classical diagnostic methods, such as the IEC ratio method, the support vector machine (SVM), and the multi-layer perceptron (MLP). The parameter settings of the different optimization methods are detailed in Table 4.
To demonstrate the effectiveness of the proposed model, it is evaluated by accuracy and F1-score. The confusion matrix is an essential tool for evaluating a classification model. As shown in Table 5, it contains four values: true positive (TP), true negative (TN), false positive (FP), and false negative (FN).
Precision is the ratio of the number of positives correctly predicted to the number predicted to be positive, as shown in Equation (15):
$$Precision = \frac{TP}{TP + FP} \tag{15}$$
Recall is the ratio of the number of positives correctly predicted to the number of actual positive examples, as shown in Equation (16):
$$Recall = \frac{TP}{TP + FN} \tag{16}$$
The other two crucial evaluation metrics, accuracy and F-score, can also be obtained from Table 5 and are calculated as follows:
$$Accuracy = \frac{TP + TN}{TN + TP + FN + FP} \tag{17}$$
$$F\text{-}score = \left(1 + \beta^2\right) \frac{Precision \times Recall}{\beta^2 \cdot Precision + Recall} \tag{18}$$
When β is equal to 1, this evaluation metric is called the balanced F-score (F1-score), in which recall and precision are weighted equally. In this paper, β is 1.
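For a multi-class problem, Equations (15) to (18) are applied one class at a time, treating that class as positive and all others as negative. The Python sketch below assumes a confusion matrix in which `cm[i][j]` counts samples of true class i predicted as class j. This orientation is an assumption, the function name is hypothetical, and the 2×2 matrix in the usage example is made up for illustration.

```python
def per_class_metrics(cm):
    """Return ([(precision, recall, f1) per class], overall accuracy, macro F1).

    cm[i][j] is the number of samples of true class i predicted as class j.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    metrics = []
    for k in range(n):
        tp = cm[k][k]
        fp = sum(cm[i][k] for i in range(n)) - tp  # predicted k, actually another class
        fn = sum(cm[k]) - tp                       # actually k, predicted another class
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)      # Eq. (18) with beta = 1
        metrics.append((precision, recall, f1))
    accuracy = sum(cm[k][k] for k in range(n)) / total
    macro_f1 = sum(m[2] for m in metrics) / n
    return metrics, accuracy, macro_f1


if __name__ == "__main__":
    print(per_class_metrics([[8, 2], [1, 9]]))
```

The overall accuracy returned here is the fraction of all samples on the diagonal, which in general differs from the unweighted mean of the per-class accuracies.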

4. The Experimental Results

We performed simulation training on the MATLAB platform, using the same training set and test set for all methods. We compared ISSA-PNN with five other optimized PNN methods and the traditional PNN. The accuracy for each fault type and the average accuracy are shown in Table 6. It can be seen that the average accuracy of ISSA-PNN is 99.65%, which is higher than that of the other methods: SSA-PNN 97.37%; MVO-PNN 97.02%; BA-PNN 96.52%; SOA-PNN 95.80%; PSO-PNN 94.49%; and PNN 86.70%. In addition, the ISSA-PNN method makes errors only for LT (<150 °C), where its accuracy is 98.59%.
Moreover, to further prove the excellence of the proposed method, we compared it with the excellent methods proposed by other researchers and some classical methods. The comparison results are shown in Table 7. It can be seen that the average accuracy of ISSA-PNN is also superior to other methods, although ISSA-PNN (98.59%) is inferior to BA-BP (99.06%) and GA-BP (99.06%) methods in LT (<150 °C) faults. However, the remaining three fault types’ performance is much better than BA-BP and GA-BP, so the combined performance of ISSA-PNN is still the best, proving the excellent performance of the proposed method in a power transformer fault diagnosis.
Figure 5 shows the classification results of different methods on data samples after training, where subgraphs a, c, e, g, i, k, m are the classification results of training samples, and subgraphs b, d, f, h, j, l, n are the classification results of test samples.
The confusion matrix is one of the crucial tools for evaluating classification models. We plotted the confusion matrix of various methods to compare the performance of each method, as shown in Figure 6. The target classes 1, 2, 3, and 4 in the subgraphs refer to LT (<150 °C), LT (150–300 °C), PD, and AD, respectively. In addition, the rightmost column of the subgraph is the precision (or positive predictive value), and the bottom row is the recall (or true positive rate). It is worth noting that the lower right cell indicates the overall accuracy, which is different from the average accuracy in Table 6 and Table 7.
According to the confusion matrix in Figure 6, the F1-score of each method can be calculated. It is one of the critical indicators for evaluating classification models, and the results are shown in Table 8. It can be seen from Table 8 that the F1-scores of ISSA-PNN for the four fault types are 99.29%, 100.00%, 100.00%, and 97.44%, and its Macro F1-score is 99.18%, all higher than those of the other methods.
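The reported Macro F1-score is consistent with the four per-class values, assuming it is their unweighted mean:

```latex
\text{Macro } F_1 = \frac{99.29 + 100.00 + 100.00 + 97.44}{4}
                  = \frac{396.73}{4}
                  \approx 99.18\%
```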
The comparison of MSE for different methods is shown in Table 9. Figure 7 shows the change curve of the fitness of different algorithms in the optimization process for PNN, which can well show the optimization process of various algorithms for comparison.

5. Discussion

As can be seen from the accuracy comparison in Table 6 and Table 7, ISSA-PNN has the best overall performance across the four faults, compared both with the various optimized PNN methods and with the other improved neural network methods, and is only slightly inferior to BA-BP and GA-BP in LT (<150 °C) faults. Apart from the LT (<150 °C) fault, ISSA-PNN is clearly the best in the remaining three faults and in the final average accuracy. In particular, in AD fault diagnosis, where accuracy is generally low, ISSA-PNN still maintains 100% accuracy.
Figure 5 shows that the performance of other algorithms in training samples and test samples is not ideal, and there are generally multiple errors. Many algorithms perform well during training, and once they switch to test samples for model testing, there is a sudden increase in errors, which indicates that these algorithms are less robust to the point that they fall into overfitting during training. From Figure 5m,n, it can be seen that ISSA-PNN has no error in training samples, and the results are ideal. In the test samples, only errors are generated in the first type of faults, and the classification of the remaining faults is correct. This indicates that the proposed method not only has good accuracy but also has strong robustness and does not easily fall into overfitting during the training process which degrades the model performance.
In Table 8, the F1-score of the ISSA-PNN method is the highest, consistent with the accuracy assessment results, which supports the reliability of that assessment. It also shows that ISSA-PNN has the best comprehensive performance in power transformer fault diagnosis and can meet engineering needs in terms of both precision and recall.
In Table 9, ISSA-PNN performs best in training, with an MSE of 0, and it also has the lowest test MSE (0.08108). GA-BP behaves differently: its training MSE is also very low (0.00500), but it falls into overfitting, and its test MSE is unsatisfactory (0.19030). The result indicates that ISSA-PNN can learn the internal laws implied by the data more quickly and effectively, has excellent generalization ability and self-regulation ability, and can better cope with the interference of various random noises in practical applications.
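One simple way to quantify the overfitting argument is the gap between test and training MSE. The Python sketch below computes that gap for a subset of the Table 9 values. The gap is an illustrative diagnostic chosen here, not a metric used in the paper, and the variable names are assumptions.

```python
# (training MSE, test MSE) copied from Table 9
table9 = {
    "ISSA-PNN": (0.00000, 0.08108),
    "MVO-PNN": (0.00225, 0.17117),
    "GA-BP": (0.00500, 0.19030),
    "CS-BP": (0.00750, 0.15480),
    "PNN": (0.03703, 0.33333),
}

# Generalization gap: how much worse the model does on unseen data
gaps = {method: test - train for method, (train, test) in table9.items()}

# List methods from smallest to largest gap
for method in sorted(gaps, key=gaps.get):
    train, test = table9[method]
    print(f"{method:9s} train={train:.5f} test={test:.5f} gap={gaps[method]:.5f}")
```

In this subset GA-BP has a low training MSE but a gap of 0.18530, more than twice the 0.08108 of ISSA-PNN, which matches the overfitting reading given above.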
As can be seen in Figure 7, the fitness of ISSA-PNN starts decreasing at the fourth iteration, falls into a local optimum at the fifth iteration, jumps out of it at the sixth iteration to continue the search, and reaches the global optimum at the seventh iteration. In contrast, the other optimization algorithms take several iterations to escape a local optimum, and most of their per-iteration decreases in fitness are smaller than those of ISSA. Compared with the original SSA and the other algorithms used for PNN optimization, ISSA therefore has a robust global search capability: it escapes local optima quickly and finds the global optimum in few iterations. This rapid and efficient convergence shows the practicality of ISSA-PNN in engineering applications. It is worth noting that the initial fitness value of the ISSA model is smaller than that of the other models, which indicates that it is less susceptible to initial noise and has better stability and anti-interference ability.
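For orientation, the following Python sketch shows the basic salp swarm algorithm (SSA) loop that ISSA builds on, applied to a one-dimensional search such as the choice of a smoothing factor. It follows the standard leader and follower updates described by Mirjalili et al. [31]. The SCA update and the disruption operator of ISSA are not reproduced here, and the toy fitness function, the bounds, and the function names are assumptions for illustration only. The defaults of 6 salps and 50 iterations mirror the SSA-PNN settings in Table 4.

```python
import math
import random


def ssa_minimize(fitness, lb, ub, n_salps=6, n_iter=50, seed=0):
    """Basic SSA for one variable; returns best position, best fitness, history."""
    rng = random.Random(seed)
    salps = [rng.uniform(lb, ub) for _ in range(n_salps)]
    best_x = min(salps, key=fitness)      # food source: best position found so far
    best_f = fitness(best_x)
    history = [best_f]
    for it in range(1, n_iter + 1):
        # c1 decays over the iterations, shifting from exploration to exploitation
        c1 = 2.0 * math.exp(-((4.0 * it / n_iter) ** 2))
        for i in range(n_salps):
            if i == 0:
                # Leader moves around the food source
                step = c1 * ((ub - lb) * rng.random() + lb)
                x = best_x + step if rng.random() < 0.5 else best_x - step
            else:
                # Followers move to the midpoint with the salp ahead of them
                x = 0.5 * (salps[i] + salps[i - 1])
            salps[i] = min(max(x, lb), ub)  # keep positions inside the bounds
        for x in salps:
            f = fitness(x)
            if f < best_f:
                best_x, best_f = x, f
        history.append(best_f)
    return best_x, best_f, history


def toy_fitness(sigma):
    # Stand-in for the PNN classification error as a function of sigma (assumption)
    return (sigma - 0.3) ** 2


best_sigma, best_fit, history = ssa_minimize(toy_fitness, lb=0.01, ub=1.0)
print(best_sigma, best_fit)
```

The `history` list plays the role of a fitness curve like those in Figure 7: it never increases, and flat stretches correspond to iterations in which the swarm found no better position.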
These results all reflect that the ISSA-PNN method is superior to other methods with better diagnostic accuracy, robustness, and generalization capability. Therefore, the proposed method has high applicability, reliability, and practicality in the field of oil-immersed transformer fault diagnosis.

6. Conclusions

In this paper, the PNN and DGA methods were combined to establish a basic fault diagnosis model. Then, a bio-inspired optimization algorithm was introduced to optimize the smoothing factor, an important parameter of PNN, to improve the performance of the fault diagnosis model. We also improved the traditional SSA by introducing the SCA and the disruption operator ($D_{op}$), which enhance its search capability so that the solution space is searched sufficiently and the algorithm is less likely to fall into a local optimum. Furthermore, the search time was reduced to meet practical engineering requirements. We compared the proposed method with other classical and well-performing models using real data collected from sensors installed inside the transformers and evaluated them along multiple dimensions with multiple metrics. The experimental results show that the ISSA-PNN method has better diagnostic performance in power transformer fault diagnosis, can overcome some initial error interference, does not easily fall into overfitting, and has good robustness and accuracy.
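To make the role of the smoothing factor concrete, the sketch below implements a minimal PNN-style classifier in Python: each class score is the average Gaussian kernel response of its training samples, and σ sets the kernel width. The training vectors are a few three-ratio rows copied from Table 3. The function name, the short class labels, the query points, and the value σ = 0.1 are assumptions for illustration and do not reflect the trained ISSA-PNN model.

```python
import math


def pnn_predict(x, train_X, train_y, sigma):
    """Return the class whose mean Gaussian kernel response to x is largest."""
    totals, counts = {}, {}
    for xi, yi in zip(train_X, train_y):
        d2 = sum((a - b) ** 2 for a, b in zip(x, xi))  # squared Euclidean distance
        totals[yi] = totals.get(yi, 0.0) + math.exp(-d2 / (2.0 * sigma ** 2))
        counts[yi] = counts.get(yi, 0) + 1
    # Summation layer: average the pattern-layer outputs per class, then pick the max
    return max(totals, key=lambda c: totals[c] / counts[c])


# A few rows of Table 3, in the column order given there
train_X = [
    (0.05172, 0.85455, 0.04839), (0.0, 0.17529, 0.0), (0.0625, 0.15517, 0.1),
    (0.01899, 1.21828, 0.00885), (0.01613, 1.125, 0.01389), (0.01667, 1.08108, 0.01563),
    (0.05556, 0.07524, 0.05882), (0.0, 0.07059, 0.0), (0.06667, 0.06754, 0.21910),
    (0.375, 0.45882, 0.75), (0.4, 0.89361, 0.28571), (0.8, 0.33928, 1.0),
]
train_y = ["LT<150"] * 3 + ["LT150-300"] * 3 + ["PD"] * 3 + ["AD"] * 3

# Hypothetical query points placed near known samples
print(pnn_predict((0.38, 0.46, 0.74), train_X, train_y, sigma=0.1))
print(pnn_predict((0.017, 1.1, 0.014), train_X, train_y, sigma=0.1))
```

A very small σ makes each training sample act almost like a nearest-neighbor vote, while a very large σ blurs the classes together. This sensitivity is why the smoothing factor is worth tuning with an optimizer rather than fixing by hand.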

Author Contributions

Conceptualization, L.T.; methodology, L.T. and Y.Z.; software, L.T.; validation, X.Y. and L.T.; formal analysis, L.Y.; writing—original draft preparation, L.T.; writing—review and editing, X.Y.; visualization, L.T.; funding acquisition, X.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China (51765042, 61963026).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ANN: Artificial neural network
AD: Arc discharge
BA: Bat algorithm
BP: Back-propagation
BPNN: Back-propagation neural network
CS: Cuckoo search
CVA: Common vector approach
$D_{op}$: Disruption operator
DADDNN: Dynamic Adam and dropout-based deep neural network
DBN: Deep belief network
DGA: Dissolved gas analysis
GA: Genetic algorithm
GWO: Gray wolf optimization
IEC: International electrotechnical commission
ISSA: Improved salp swarm algorithm
KNN: K-nearest neighbor
LT: Low temperature and overheating
MLP: Multi-layer perceptron
MSE: Mean square error
MVO: Multi-verse optimizer
PD: Partial discharge
PNN: Probabilistic neural network
PSC: Power supply companies
PSO: Particle swarm optimization
SCA: Sine cosine algorithm
SD: Standard deviation
SOA: Seagull optimization algorithm
SSA: Salp swarm algorithm
SVM: Support vector machine
σ: Smoothing factor

References

  1. Faiz, J.; Soleimani, M. Assessment of computational intelligence and conventional dissolved gas analysis methods for transformer fault diagnosis. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 1798–1806.
  2. Wang, X.; Li, Q.; Li, C.; Yang, R.; Su, Q. Reliability assessment of the fault diagnosis methodologies for transformers and a new diagnostic scheme based on fault info integration. IEEE Trans. Dielectr. Electr. Insul. 2013, 20, 2292–2298.
  3. Xiang, C.; Huang, Z.; Li, J.; Zhou, Q.; Yao, W. Graphic approaches for faults diagnosis for Camellia insulating liquid filled transformers based on dissolved gas analysis. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 1897–1903.
  4. Wang, L.; Zhou, L.; Tang, H.; Wang, D.; Cui, Y. Numerical and experimental validation of variation of power transformers’ thermal time constants with load factor. Appl. Therm. Eng. 2017, 126, 939–948.
  5. Wang, D.; Zhou, L.; Wang, L.; Guo, L.; Liao, W. Modified expression of moisture diffusion factor for non-oil-immersed insulation paper. IEEE Access 2019, 7, 41315–41323.
  6. Bagheri, M.; Naderi, M.S.; Blackburn, T. Advanced transformer winding deformation diagnosis: Moving from off-line to on-line. IEEE Trans. Dielectr. Electr. Insul. 2012, 19, 1860–1870.
  7. Faiz, J.; Soleimani, M. Dissolved gas analysis evaluation in electric power transformers using conventional methods a review. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 1239–1248.
  8. Sun, H.C.; Huang, Y.C.; Huang, C.M. A review of dissolved gas analysis in power transformers. Energy Procedia 2012, 14, 1220–1225.
  9. Wang, T.; He, Y.; Li, B.; Shi, T. Transformer fault diagnosis using self-powered RFID sensor and deep learning approach. IEEE Sens. J. 2018, 18, 6399–6411.
  10. Li, A.; Yang, X.; Dong, H.; Xie, Z.; Yang, C. Machine learning-based sensor data modeling methods for power transformer PHM. Sensors 2018, 18, 4430.
  11. Souahlia, S.; Bacha, K.; Chaari, A. MLP neural network-based decision for power transformers fault diagnosis using an improved combination of Rogers and Doernenburg ratios DGA. Int. J. Electr. Power Energy Syst. 2012, 43, 1346–1353.
  12. Taha, I.B.; Hoballah, A.; Ghoneim, S.S. Optimal ratio limits of rogers’ four-ratios and IEC 60599 code methods using particle swarm optimization fuzzy-logic approach. IEEE Trans. Dielectr. Electr. Insul. 2020, 27, 222–230.
  13. Hung, C.P.; Wang, M.H. Diagnosis of incipient faults in power transformers using CMAC neural network approach. Electr. Power Syst. Res. 2004, 71, 235–244.
  14. Duval, M.; DePabla, A. Interpretation of gas-in-oil analysis using new IEC publication 60599 and IEC TC 10 databases. IEEE Electr. Insul. Mag. 2001, 17, 31–41.
  15. Irungu, G.; Akumu, A.; Munda, J. A new fault diagnostic technique in oil-filled electrical equipment; the dual of Duval triangle. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 3405–3410.
  16. Morais, D.R.; Rolim, J.G. A hybrid tool for detection of incipient faults in transformers based on the dissolved gas analysis of insulating oil. IEEE Trans. Power Deliv. 2006, 21, 673–680.
  17. Yang, M.T.; Hu, L.S. Intelligent fault types diagnostic system for dissolved gas analysis of oil-immersed power transformer. IEEE Trans. Dielectr. Electr. Insul. 2013, 20, 2317–2324.
  18. Jiang, J.; Chen, R.; Chen, M.; Wang, W.; Zhang, C. Dynamic fault prediction of power transformers based on hidden Markov model of dissolved gases analysis. IEEE Trans. Power Deliv. 2019, 34, 1393–1400.
  19. Li, J.; Zhang, Q.; Wang, K.; Wang, J.; Zhou, T.; Zhang, Y. Optimal dissolved gas ratios selected by genetic algorithm for power transformer fault diagnosis based on support vector machine. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1198–1206.
  20. Benmahamed, Y.; Teguar, M.; Boubakeur, A. Application of SVM and KNN to Duval Pentagon 1 for transformer oil diagnosis. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 3443–3451.
  21. Wang, D.; Zhou, L.; Yang, Z.X.; Cui, Y.; Wang, L.; Jiang, J.; Guo, L. A new testing method for the dielectric response of oil-immersed transformer. IEEE Trans. Ind. Electron. 2019, 67, 10833–10843.
  22. Kirkbas, A.; Demircali, A.; Koroglu, S.; Kizilkaya, A. Fault diagnosis of oil-immersed power transformers using common vector approach. Electr. Power Syst. Res. 2020, 184, 106346.
  23. Jiang, J.; Chen, R.; Zhang, C.; Chen, M.; Li, X.; Ma, G. Dynamic Fault Prediction of Power Transformers Based on Lasso Regression and Change Point Detection by Dissolved Gas Analysis. IEEE Trans. Dielectr. Electr. Insul. 2020, 27, 2130–2137.
  24. Ghoneim, S.S.; Taha, I.B.; Elkalashy, N.I. Integrated ANN-based proactive fault diagnostic scheme for power transformers using dissolved gas analysis. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1838–1845.
  25. Huang, X.; Wang, X.; Tian, Y. Research on Transformer Fault Diagnosis Method based on GWO Optimized Hybrid Kernel Extreme Learning Machine. In Proceedings of the 2018 Condition Monitoring and Diagnosis (CMD), Perth, Australia, 23–26 September 2018; pp. 1–5.
  26. Dai, J.; Song, H.; Sheng, G.; Jiang, X. Dissolved gas analysis of insulating oil for power transformer fault diagnosis with deep belief network. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2828–2835.
  27. Ou, M.; Wei, H.; Zhang, Y.; Tan, J. A dynamic adam based deep neural network for fault diagnosis of oil-immersed power transformers. Energies 2019, 12, 995.
  28. Yang, X.; Chen, W.; Li, A.; Yang, C.; Xie, Z.; Dong, H. BA-PNN-based methods for power transformer fault diagnosis. Adv. Eng. Inform. 2019, 39, 178–185.
  29. Seshadrinath, J.; Singh, B.; Panigrahi, B.K. Incipient interturn fault diagnosis in induction machines using an analytic wavelet-based optimized Bayesian inference. IEEE Trans. Neural Netw. Learn. Syst. 2013, 25, 990–1001.
  30. Kusy, M.; Zajdel, R. Application of reinforcement learning algorithms for the adaptive computation of the smoothing parameter for probabilistic neural network. IEEE Trans. Neural Netw. Learn. Syst. 2014, 26, 2163–2175.
  31. Mirjalili, S.; Gandomi, A.H.; Mirjalili, S.Z.; Saremi, S.; Faris, H.; Mirjalili, S.M. Salp Swarm Algorithm: A bio-inspired optimizer for engineering design problems. Adv. Eng. Softw. 2017, 114, 163–191.
  32. Mirjalili, S. SCA: A sine cosine algorithm for solving optimization problems. Knowl. Based Syst. 2016, 96, 120–133.
  33. Neggaz, N.; Ewees, A.A.; Abd Elaziz, M.; Mafarja, M. Boosting salp swarm algorithm by sine cosine algorithm and disrupt operator for feature selection. Expert Syst. Appl. 2020, 145, 113103.
  34. Liu, H.; Ding, G.; Wang, B. Bare-bones particle swarm optimization with disruption operator. Appl. Math. Comput. 2014, 238, 106–122.
  35. Yang, X.; Chen, W.; Li, A.; Yang, C. A Hybrid machine-learning method for oil-immersed power transformer fault diagnosis. IEEJ Trans. Electr. Electron. Eng. 2020, 15, 501–507.
  36. Zhang, W.; Yang, X.; Deng, Y.; Li, A. An Inspired Machine-Learning Algorithm with a Hybrid Whale Optimization for Power Transformer PHM. Energies 2020, 13, 3143.
Figure 1. Probabilistic neural network structure diagram.
Figure 2. The diagram of the proposed ISSA-based PNN for fault diagnostics.
Figure 3. The implemented framework of the power transformer fault diagnosis.
Figure 4. The dissolved gas data distribution of the three ratios for the four fault types. (a–d) In order: low temperature overheating (<150 ℃), low temperature overheating (150–300 ℃), partial discharge, and arc discharge.
Figure 5. The classification results of different methods.
Figure 6. Confusion matrix for different methods.
Figure 7. The fitness value curve of different optimization methods.
Table 1. Some real data from power supply companies in some provinces of China diagnosing power transformer fault types by DGA method.
Columns: Fault Type; Dissolved Gas (µL/L): CH4, C2H2, C2H4, C2H6, TH; Sources
LT (<150 ℃)8353131.2150.2Jiujiang PSC
LT (150–300 ℃)6.598161.5122Fuzhou PSC
LT (150–300 ℃)1931912816428Yingtan PSC
LT (150–300 ℃)1246111.870.8Nanchang PSC
LT (150–300 ℃)3.5318.2143.7Yichun PSC
AD613071056479Yingtan PSC
Table 2. Coding format for different fault types.
| Fault Type | LT (<150 ℃) | LT (150–300 ℃) | PD | AD |
|---|---|---|---|---|
| Coding format | 1000 | 0100 | 0010 | 0001 |
Table 3. Partial sample data.
| C2H2/C2H4 | CH4/H2 | C2H4/C2H6 | Fault Type |
|---|---|---|---|
| 0.05172 | 0.85455 | 0.04839 | LT (<150 ℃) |
| 0 | 0.17529 | 0 | LT (<150 ℃) |
| 0.0625 | 0.15517 | 0.1 | LT (<150 ℃) |
| 0.01899 | 1.21828 | 0.00885 | LT (150–300 ℃) |
| 0.01613 | 1.125 | 0.01389 | LT (150–300 ℃) |
| 0.01667 | 1.08108 | 0.01563 | LT (150–300 ℃) |
| 0.05556 | 0.07524 | 0.05882 | PD |
| 0 | 0.07059 | 0 | PD |
| 0.06667 | 0.06754 | 0.21910 | PD |
| 0.01613 | 1.12500 | 0.01389 | PD |
| 0.01667 | 1.08108 | 0.01563 | PD |
| 0.01786 | 1.23188 | 0.01923 | PD |
| 0.375 | 0.45882 | 0.75 | AD |
| 0.4 | 0.89361 | 0.28571 | AD |
| 0.8 | 0.33928 | 1 | AD |
| 0.25 | 0.32323 | 0.33333 | AD |
| 0.14844 | 0.07836 | 0.14394 | AD |
Table 4. Parameter setting of various methods.
| Methods | Parameter Settings |
|---|---|
| PSO-PNN | c1 = c2 = 1.49445 |
| SOA-PNN | NP = 10, T = 50 |
| BA-PNN | NP = 20, A = 0.5, r = 0.5 |
| MVO-PNN | NP = 10, T = 50 |
| SSA-PNN | NP = 6, T = 50 |
| ISSA-PNN | NP = 3, T = 10, δ = 0.5 |
| BA-BP | NP = 20, A = 0.5, r = 0.5 |
| CS-BP | NP = 20, Pa = 0.25 |
| GA-BP | NP = 20, Pm = 0.01, Px = 0.7 |
Table 5. Confusion matrix.
| Actual Class | Predicted Positive | Predicted Negative |
|---|---|---|
| Positive | True positive (TP) | False negative (FN) |
| Negative | False positive (FP) | True negative (TN) |
Table 6. Accuracy (%) comparison of different optimized PNN methods.
| Fault Type | ISSA-PNN | SSA-PNN | MVO-PNN | BA-PNN | SOA-PNN | PSO-PNN | PNN |
|---|---|---|---|---|---|---|---|
| LT (<150 ℃) | 98.59 | 100.00 | 98.59 | 98.59 | 98.59 | 98.59 | 95.77 |
| LT (150–300 ℃) | 100.00 | 100.00 | 100.00 | 100.00 | 84.62 | 84.62 | 61.54 |
| PD | 100.00 | 100.00 | 100.00 | 87.50 | 100.00 | 100.00 | 100.00 |
| AD | 100.00 | 89.47 | 89.47 | 100.00 | 100.00 | 94.74 | 89.47 |
| Average | 99.65 | 97.37 | 97.02 | 96.52 | 95.80 | 94.49 | 86.70 |
Table 7. Accuracy (%) comparison of different methods.
| Fault Type | ISSA-PNN | BA-BP | CS-BP | GA-BP | MLP | SVM | IEC |
|---|---|---|---|---|---|---|---|
| LT (<150 ℃) | 98.59 | 99.06 | 94.34 | 99.06 | 91.55 | 84.51 | 97.17 |
| LT (150–300 ℃) | 100.00 | 92.31 | 100.00 | 92.31 | 100.00 | 92.31 | 100.00 |
| PD | 100.00 | 100.00 | 100.00 | 100.00 | 62.50 | 75.00 | 7.14 |
| AD | 100.00 | 95.45 | 90.91 | 81.82 | 78.95 | 68.42 | 100.00 |
| Average | 99.65 | 96.71 | 96.31 | 93.30 | 83.25 | 80.06 | 76.08 |
Table 8. F1-score (%) comparison of different optimized PNN methods.
| Fault Type | ISSA-PNN | SSA-PNN | MVO-PNN | BA-PNN | SOA-PNN | PSO-PNN | PNN |
|---|---|---|---|---|---|---|---|
| LT (<150 ℃) | 99.29 | 98.61 | 98.59 | 98.61 | 97.90 | 97.22 | 93.79 |
| LT (150–300 ℃) | 100.00 | 100.00 | 100.00 | 100.00 | 91.67 | 91.67 | 76.19 |
| PD | 100.00 | 100.00 | 94.12 | 93.33 | 100.00 | 100.00 | 88.89 |
| AD | 97.44 | 94.44 | 91.89 | 97.44 | 97.44 | 94.74 | 89.47 |
| Macro F1-score | 99.18 | 98.26 | 96.15 | 97.35 | 96.75 | 95.91 | 87.09 |
Table 9. Comparison of sample errors.
| Methods | MSE of Training | MSE of Test |
|---|---|---|
| ISSA-PNN | 0.00000 | 0.08108 |
| SOA-PNN | 0.00901 | 0.10910 |
| BA-PNN | 0.00901 | 0.11712 |
| MVO-PNN | 0.00225 | 0.17117 |
| SSA-PNN | 0.00901 | 0.16216 |
| PSO-PNN | 0.02928 | 0.18018 |
| BA-BP | 0.02500 | 0.13100 |
| CS-BP | 0.00750 | 0.15480 |
| GA-BP | 0.00500 | 0.19030 |
| PNN | 0.03703 | 0.33333 |
| MLP | 0.04277 | 0.38013 |
| SVM | 0.04344 | 0.41231 |
| IEC | 0.05625 | 0.46770 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tao, L.; Yang, X.; Zhou, Y.; Yang, L. A Novel Transformers Fault Diagnosis Method Based on Probabilistic Neural Network and Bio-Inspired Optimizer. Sensors 2021, 21, 3623. https://doi.org/10.3390/s21113623
