1. Introduction
To ensure the secure and reliable operation of converter transformers, which serve as intermediate devices in AC-DC power transmission, it is essential to conduct research on fault diagnosis. Such research aims to enhance the accuracy and speed of fault identification, helping to detect internal defects promptly and prevent accidents from escalating. Unlike regular power transformers, converter transformers operate in a unique AC-DC environment, which produces a higher level of harmonic currents. These complex operating characteristics make it difficult to apply conventional fault diagnosis methods designed for traditional power transformers [1, 2].
During operation, the core and windings of the converter transformer vibrate under electromagnetic forces and other factors, and the resulting mechanical waves propagate through the transformer oil and rigid connections to the enclosure. The resulting vibration and voiceprint signals contain a large amount of state information, and monitoring methods based on vibration signals are widely used in the online monitoring of power equipment [3, 4]. However, vibration signal acquisition places strict requirements on sensor placement, and even small deviations interfere with the results. Noise detection, by contrast, is a non-contact measurement: its sensors are convenient to install, which solves the problem of high spatial sensitivity. In addition, voiceprint acquisition devices have a wide frequency range and can meet the monitoring requirements of transformers of different specifications [5, 6]. Numerous scholars have conducted research in this area and achieved promising results. In reference [7], four voiceprint feature spectra were constructed, and a lightweight fault diagnosis model was established to diagnose loose winding faults in transformers. Reference [8], based on the no-load operation of transformers, employed Mel-frequency cepstral coefficients (MFCC) for voiceprint feature extraction, introduced principal component analysis (PCA) to remove redundant features, and ultimately utilized the vector quantization (VQ) algorithm to identify loosened iron core faults. Reference [9] extracted features of on-load tap changers using Mel spectrograms and combined them with convolutional neural networks to recognize mechanical faults. However, the abovementioned voiceprint recognition techniques are based on traditional MFCC, which involves a cumbersome process of frame segmentation, windowing, and Fourier transformation to mitigate spectral leakage. Furthermore, owing to the inherent limitations of single-channel signal sources, fault diagnosis using voiceprint signals is mostly limited to single-fault diagnosis.
To address the issue of single-fault features, reference [10] utilized complete ensemble empirical mode decomposition (CEEMD) and the short-time Fourier transform (STFT) to obtain temporal and spectral information about the signals. Deep fault features were then extracted using a deep fused convolutional neural network (DFCNN). Similarly, reference [11] proposed a hybrid algorithm combining high-order singular value decomposition (HOSVD) and high-order alternating least squares (HOALS) to extract multi-dimensional features for pattern recognition. Reference [12] used a fusion multiscale convolutional neural network (F-MSCNN) to fuse sound and vibration features, leveraging multi-scale feature learning for subsequent classification. Reference [13] proposed a real-time fault diagnosis method for hydraulic systems using data collected from multiple sensors to compensate for the limited information contained in a single sensor. Reference [14] processed signals from multiple sensors, thereby expanding the number of samples to enhance diagnostic performance. However, most existing studies are based on single or homogeneous signals and focus on extracting multidimensional features from different angles. They do not start from different types of signal sources and ignore the correlation between different signals, which makes it difficult to extract deep fault information effectively.
Existing approaches to data-driven fault classification mostly rely on artificial intelligence algorithms to analyze historical data and extract fault features, and the selection of parameters during model training has a crucial impact on the accuracy and convergence speed of fault classifiers. Reference [15] proposed a novel expectation maximization-unscented particle filter-Wilcoxon rank sum test (EM-UPF-W) method for data-driven techniques, which adaptively estimates noise variables with the help of the EM algorithm. References [16, 17] used intelligent optimization algorithms to tune machine learning parameters adaptively, avoiding parameter selection based on human experience. However, existing intelligent optimization algorithms are prone to stagnating at local optima, which degrades the final convergence speed and accuracy of the model.
Given this context, this article divides the operation of converter transformers into intervals according to the current signal and combines the current with voiceprint signals to achieve fault diagnosis. This overcomes the inherent limitations of single signal sources and enables research on multi-fault diagnosis. An improved hunter–prey optimization (IHPO) method is proposed to address the local optimum problem and serves as the parameter optimization algorithm for the subsequent stages. Variational mode decomposition (VMD) is employed for noise reduction, while the S-transform is utilized as the time-frequency conversion method. An improved MFCC technique based on multiple strategies is employed for feature extraction. An improved temporal convolutional network (ITCN) is utilized for fault identification, offering a novel approach for fault diagnosis in converter transformer systems. Furthermore, a specific 800 kV converter station is taken as a case study to validate the effectiveness of this integrated model.
The main contributions of this article are summarized as follows:
- (1)
The traditional hunter–prey optimization algorithm easily falls into local optima, and its population initialization covers the search space poorly. To counteract these problems, the algorithm is improved by introducing SPM chaotic mapping and the Levy flight strategy. The improved algorithm is used for the adaptive selection of parameters in the fault diagnostic model, avoiding the interference of selection based on human experience.
- (2)
A multi-strategy improved MFCC is proposed for extracting features from the voiceprint signals of converter transformers. Compared with traditional voiceprint feature extraction methods, the proposed approach incorporates the characteristics specific to the voiceprint signals of electric power equipment. It overcomes the interference of redundant information and demonstrates enhanced feature extraction capabilities.
- (3)
Load signals are introduced to segment the operational intervals of converter transformers, fault diagnosis is realized through multiple types of signal sources, and the multi-strategy improved MFCC and IHPO-VMD-ITCN fault diagnostic model is proposed. The experimental results demonstrate that the proposed fault diagnostic method achieves significant improvements in both accuracy and calculation speed.
2. Analysis of Vibration Mechanism of Converter Transformer
As in traditional power transformers, the vibration of converter transformers is induced by the electromagnetic forces in the windings and by the expansion and contraction of the core due to magnetostriction. These vibrations propagate through the transformer oil and rigid connections to the enclosure. However, owing to the complex environment resulting from the combined impact of alternating and direct currents, the vibration excitations are often characterized by multiple harmonic frequencies, leading to intricate vibration patterns in different areas.
2.1. Winding Vibration Mechanism Analysis
In accordance with the principles of high-voltage transmission, the current in converter transformers is accompanied by harmonic currents in addition to the 50 Hz fundamental. Denoting the harmonic order by n, the current is given by Equation (1).
$$i(t) = \sum_{n} I_n \cos\left(n\omega t + \varphi_n\right) \tag{1}$$
where $I_n$ is the amplitude of each harmonic current, $\varphi_n$ is the phase angle of each harmonic, and $\omega$ is the angular frequency of the 50 Hz current.
The interaction between currents of varying frequencies and magnetic fields generates axial and radial electromagnetic forces, as expressed in Equation (2). The windings vibrate under the influence of these electromagnetic forces.
$$F_z = k_z\, i^2(t), \qquad F_r = k_r\, i^2(t) \tag{2}$$
where $i(t)$ is the current of Equation (1), $k_z$ and $k_r$ represent the axial and radial electromagnetic force coefficients, and $F_z$ and $F_r$ represent the winding axial and radial electromagnetic forces.
Based on the motion differential equation, the acceleration of winding vibration can be represented by Equation (3):
where
is the sum of multiplication of different harmonics,
and
are the axial and radial acceleration coefficients,
,
, and
are the calculation parameters,
are the number of harmonics, and
,
, and
are the acceleration phase angles.
From Equation (3), it can be observed that, because of the harmonic currents, the vibration of the converter transformer contains significant harmonic components in addition to the 100 Hz component. When the natural frequency of the windings is close to one of these components, resonance can easily occur, shifting the dominant vibration frequency away from 100 Hz.
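As a rough numerical illustration of why harmonic currents add vibration components beyond 100 Hz, the following sketch squares a current made of a 50 Hz fundamental plus one harmonic and lists the frequencies present in the result. It assumes, as is common in transformer vibration analysis, that the electromagnetic force is proportional to the square of the current. The harmonic order, amplitudes, threshold, and function names are illustrative choices, not values from this study.

```python
import numpy as np

def squared_current_spectrum(harmonics, fs=10_000, duration=1.0, f0=50.0):
    """Spectrum of i(t)**2 for a current built from cosine harmonics.

    harmonics maps harmonic order n -> (amplitude, phase in rad).
    Assumption: force is proportional to i(t)**2 (coefficient omitted).
    """
    t = np.arange(int(fs * duration)) / fs
    current = sum(a * np.cos(2 * np.pi * n * f0 * t + p)
                  for n, (a, p) in harmonics.items())
    force = current ** 2
    magnitude = np.abs(np.fft.rfft(force)) / len(t)
    freqs = np.fft.rfftfreq(len(t), 1 / fs)
    return freqs, magnitude

def significant_frequencies(freqs, magnitude, rel_threshold=0.01):
    """Non-DC frequencies whose magnitude exceeds a fraction of the largest peak."""
    peak = magnitude[1:].max()          # ignore the DC bin
    mask = (freqs > 0) & (magnitude > rel_threshold * peak)
    return [float(f) for f in freqs[mask]]

freqs, mag = squared_current_spectrum({1: (1.0, 0.0), 5: (0.2, 0.3)})
print(significant_frequencies(freqs, mag))
```

With a fundamental and a 5th harmonic, the squared current contains 100 Hz and 500 Hz (twice each component frequency) together with the difference and sum frequencies, 200 Hz and 300 Hz.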
2.2. Core Vibration Mechanism Analysis
The vibration of the core is primarily induced by magnetostriction. Furthermore, the excitation voltage of the converter transformer contains numerous harmonic components. Taking the influence of harmonic voltages into account, the vibration of the core can be represented by Equation (4):
Among them:
where
is the amplitude of each voltage harmonic,
is the magnetostrictive deformation of the silicon steel sheet, and
is the saturation flux coefficient.
From Equation (4), it can be observed that the dominant frequency of the core vibration is primarily 100 Hz. The harmonics introduce significant additional harmonic components. In addition, nonlinearities in the core and other factors may lead to deviations in vibration.
2.3. Fault Voiceprint Characterization of Converter Transformers
As with ordinary power transformers, converter transformers are mainly composed of an iron core, windings, and rigid connectors. Aging of the iron core, or mechanical stress during transportation and installation before operation, may loosen the core. If this loosening is not promptly addressed, it accumulates and ultimately leads to an iron core loosening fault. Such a fault decreases the fastening force between the silicon steel sheets of the core, thereby increasing the air gap between the laminations. This causes a significant rise in the amplitude of the core vibration acceleration, changes the natural frequency of vibration, and alters the voiceprint characteristics of the transformer. Similarly, during operation, the windings of the converter transformer are constantly subjected to electromagnetic forces. In the event of a short-circuit fault, the intensified electromagnetic forces can cause winding loosening faults. This aggravates axial vibration, significantly increases the vibration acceleration amplitude, and changes the vibration frequency distribution, again altering the voiceprint characteristics of the transformer. When the converter transformer runs under DC bias conditions, the current signal can be regarded as the superposition of a DC component and Equation (1); according to the core and winding vibration mechanism analysis in Section 2.1 and Section 2.2, the vibration frequency of the converter transformer changes significantly in this case.
In summary, when a fault occurs in the converter transformer, its core and winding vibration change significantly. The fault voiceprint signal generated under these conditions differs from that of normal operation. Therefore, the fault diagnosis of the converter transformer can be realized by adopting a machine learning algorithm for effective feature extraction of the voiceprint signal.
2.4. Characterization of Voiceprint Pattern Changes under Operating Conditions
The voiceprint signal and vibration signal, originating from the same source, exhibit a strong correlation. Based on the analysis in Section 2.1 and Section 2.2, this study delves into the vibration characteristics of converter transformers during operation.
This study focuses on 28 converter transformers in a specific 800 kV converter station. Among them, there are 12 transformers per pole and 4 transformers on standby. The parameters of certain converter transformers are presented in Table 1.
The voiceprint signal acquisition system for the converter transformers is illustrated in Figure 1, and on-site acquisition photos are presented in Figure 2. We employed HS14401 capacitive sound sensors with a sampling frequency of 16 kHz together with a DHDAS dynamic signal acquisition instrument. Each converter transformer is equipped with three voiceprint acquisition devices, positioned on both sides and at a 45-degree angle, 0.5 m away from the enclosure. The data were collected in the outdoor substation environment under normal operating conditions and may therefore include noise interference. The acquisition system was configured to collect voiceprint signals every 30 min, with each collection lasting 60 s. Electrical parameters within the converter station were also recorded every 30 min to ensure synchronization between the voiceprint signals and electrical parameters.
We selected 0.1 s voiceprint slices of in-service converter transformers as the object of study. The time-domain and frequency-domain characteristics are illustrated in Figure 3. The main frequency of the converter transformer is 400 Hz, accompanied by a significant number of harmonics. This is attributed to the proximity of the winding natural frequency to 400 Hz, which causes resonance with the corresponding harmonic component, so that the main frequency deviates from the 100 Hz typical of ordinary power transformers. This deviation is consistent with the theoretical analysis above.
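The notion of a "main frequency" used here can be made concrete with a short sketch. It takes one 0.1 s slice sampled at 16 kHz (1600 samples, hence a 10 Hz frequency resolution) and returns the frequency of the largest spectral peak. The Hann window and the synthetic test signal are assumptions made for illustration; the recorded data are not reproduced.

```python
import numpy as np

def dominant_frequency(voice_slice, fs=16_000):
    """Frequency (Hz) of the largest peak in the magnitude spectrum of one slice."""
    x = np.asarray(voice_slice, dtype=float)
    x = x - x.mean()                          # remove any DC offset
    windowed = x * np.hanning(len(x))         # Hann window limits spectral leakage
    magnitude = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(x), 1 / fs)
    return float(freqs[np.argmax(magnitude)])

# Synthetic 0.1 s slice: a strong 400 Hz component plus weaker 200 Hz and 100 Hz ones.
fs = 16_000
t = np.arange(int(0.1 * fs)) / fs
demo = (np.sin(2 * np.pi * 400 * t)
        + 0.5 * np.sin(2 * np.pi * 200 * t)
        + 0.3 * np.sin(2 * np.pi * 100 * t))
print(dominant_frequency(demo, fs))
```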
The vibration characteristics of converter transformers vary under different operating conditions. At no load, the core vibration becomes prominent, whereas under heavy load, the dominant vibration shifts to the windings [18, 19, 20]. To facilitate a more precise quantitative analysis, this article focuses on the high-end Y/D converter transformer of pole II and analyzes how the main frequency of the voiceprint changes with the magnitude of the current. The results are depicted in Figure 4. Under no load, the main frequency of the converter transformer is 200 Hz, indicating the core vibration stage. At the rated voltage, when the valve-side current is less than 0.2 $I_N$, the main frequency alternates between 200 Hz and 400 Hz, and the dominance alternates between core and winding vibration. However, when the current exceeds 0.23 $I_N$, the main frequency stabilizes at 400 Hz, signifying the dominance of winding vibration. Here $I_N$ denotes the rated current.
Based on the above analysis, a strong correlation exists between the electrical signals and voiceprint features of converter transformers. Dividing the operation of converter transformers into three interval states, as illustrated in Table 2, allows for a phased approach to fault diagnosis. This approach proved effective in overcoming the overlap between core faults and winding faults, ultimately enhancing the accuracy of fault identification.
3. Description of Fault Diagnosis Algorithms
3.1. Improved Hunter–Prey Optimization Algorithms
The hunter–prey optimization (HPO) algorithm is a new intelligent optimization algorithm proposed by Naruei et al. in 2021 [21]. In this algorithm, the hunter adjusts its position to obtain the best hunting position, while the prey moves to a safe position to avoid the hunter's attack, and the safest position of the prey is the optimal solution of the problem to be optimized. This article proposes an improvement of the HPO algorithm by introducing the Levy flight strategy and SPM chaotic mapping. The modifications are briefly described as follows.
- (1)
Initialization: The conventional HPO algorithm achieves population initialization using Equation (6), as described below:
$$x_i = \mathrm{rand}(1, d) \odot (ub - lb) + lb \tag{6}$$
wherein $x_i$ represents the positions of hunters or prey, d represents the problem dimensionality, and $ub$, $lb$ represent the upper and lower bounds of the problem.
We chose Strongly Perturbed Mix (SPM) chaotic mapping for initializing the population, as shown in Figure 5. In comparison to circle mapping, the SPM demonstrates enhanced randomness and ergodicity, effectively addressing the issue of local clustering of individual hunters and prey [22]. The expression for SPM chaotic mapping is given by Equation (7).
In Equation (7), the two map parameters are typically set to 0.4 and 0.3, respectively.
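The chaotic initialization can be sketched as follows. The piecewise map below is one form of the SPM map that appears in the optimization literature and is used here as an assumption; it may differ in detail from Equation (7). The parameter names `eta` and `mu`, their values 0.4 and 0.3, the starting value `x0`, and the function names are likewise illustrative.

```python
import numpy as np

def spm_sequence(length, eta=0.4, mu=0.3, x0=0.7, rng=None):
    """Generate a chaotic sequence in [0, 1) with an assumed piecewise SPM map."""
    rng = np.random.default_rng() if rng is None else rng
    seq = np.empty(length)
    seq[0] = x0
    for i in range(1, length):
        p = seq[i - 1]
        r = rng.random()                      # small random perturbation term
        if p < eta:
            v = p / eta + mu * np.sin(np.pi * p) + r
        elif p < 0.5:
            v = (p / eta) / (0.5 - eta) + mu * np.sin(np.pi * p) + r
        elif p < 1 - eta:
            v = ((1 - p) / eta) / (0.5 - eta) + mu * np.sin(np.pi * (1 - p)) + r
        else:
            v = (1 - p) / eta + mu * np.sin(np.pi * (1 - p)) + r
        seq[i] = v % 1.0                      # fold back into [0, 1)
    return seq

def chaotic_population(n_pop, dim, lb, ub, rng=None):
    """Scale the chaotic sequence into [lb, ub], replacing the uniform rand of Eq. (6)."""
    lb, ub = np.asarray(lb, float), np.asarray(ub, float)
    chaos = spm_sequence(n_pop * dim, rng=rng).reshape(n_pop, dim)
    return lb + chaos * (ub - lb)

population = chaotic_population(30, 2, lb=[-5.0, 0.0], ub=[5.0, 10.0],
                                rng=np.random.default_rng(0))
print(population.shape)
```

Only the source of the numbers in [0, 1) changes relative to uniform initialization; the scaling to the search bounds is the same.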
- (2)
Optimization strategy: Hunters select prey that are far away from the group as their search targets, while the prey continuously move to evade hunter attacks and maximize their chances of survival. The position update for hunters and prey can be described by Equations (8) and (9), respectively.
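$$x_{i,j}(t+1) = x_{i,j}(t) + 0.5\left[\left(2CZP_{pos(j)} - x_{i,j}(t)\right) + \left(2(1-C)Z\mu_{(j)} - x_{i,j}(t)\right)\right] \tag{8}$$
wherein $x_{i,j}(t+1)$ represents the position of the ith hunter in the jth dimension at the (t + 1)th iteration, $x_{i,j}(t)$ represents the position of the ith hunter at the tth iteration, $P_{pos(j)}$ represents the position of the prey in the jth dimension, $\mu_{(j)}$ is the mean of all positions in the jth dimension, C represents the balance parameter between exploration and exploitation, and Z is an adaptive parameter.
$$x_{i,j}(t+1) = T_{pos(j)} + CZ\cos\left(2\pi R_4\right)\left(T_{pos(j)} - x_{i,j}(t)\right) \tag{9}$$
wherein $T_{pos(j)}$ represents the global best position and $R_4$ represents a random number within the range of [−1, 1].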
It is challenging to overcome local optima solely by introducing SPM chaotic mapping. However, the utilization of the Levy flight strategy allows for a quick escape from local optima. The implementation approach is depicted in Equation (10).
wherein
and the value of
is set to 1.5.
In practical applications, the Mantegna method is commonly used to generate random step lengths following a Levy distribution, as described in Equations (11) and (12).
In the IHPO optimization algorithm, if the change in fitness values is continuously less than 0.001, the Levy flight strategy aids in escaping local optima. This generates the candidate solution for the next iteration, as shown in Equation (13).
In the equation, denotes element-wise multiplication, is a random number uniformly distributed in the range [0, 1], and is equal to 1.5.
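The Mantegna step generation and the stagnation trigger described above can be sketched as follows. The scale factor `sigma_u` follows the standard Mantegna formula with the stability index set to 1.5. The `patience` argument, which says how many consecutive small changes count as "continuously", is a hypothetical parameter introduced here, since the text gives only the 0.001 threshold.

```python
import math
import numpy as np

def mantegna_sigma(beta=1.5):
    """Standard deviation of the numerator variable in Mantegna's method."""
    numerator = math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
    denominator = math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2)
    return (numerator / denominator) ** (1 / beta)

def levy_steps(size, beta=1.5, rng=None):
    """Random step lengths that approximately follow a Levy distribution."""
    rng = np.random.default_rng() if rng is None else rng
    u = rng.normal(0.0, mantegna_sigma(beta), size)
    v = rng.normal(0.0, 1.0, size)
    return u / np.abs(v) ** (1 / beta)

def stagnated(curve, tol=1e-3, patience=3):
    """True when the last `patience` changes of the best fitness are all below tol."""
    if len(curve) <= patience:
        return False
    recent_changes = np.abs(np.diff(curve[-(patience + 1):]))
    return bool(np.all(recent_changes < tol))

print(round(mantegna_sigma(1.5), 4))
print(levy_steps(5, rng=np.random.default_rng(0)))
```

Most generated steps are small, but the heavy tail occasionally produces a large jump, which is what lets a stagnated search leave a local optimum.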
The pseudocode of the improved hunter–prey optimization algorithm is given in Algorithm 1:
Algorithm 1 Improved hunter–prey optimization
Input: HPO parameters
Output: TargetScore, Best pos, Convergence curve
1: Initialize HPpos
2: Evaluate fitness of each HPpos
3: Set Target as the best HPpos, TargetScore as its fitness
4: for t = 2 to Max_iteration do
5:   Update c
6:   Update kbest
7:   for i = 1 to N do
8:     Generate random numbers
9:     if rand < B then
10:      Calculate xi and dist
11:      Set SI as HPpos(idxsortdist(kbest))
12:      Update HPpos(i,:) using formula with levy, l, c, z, SI, xi
13:    else
14:      for j = 1 to dim do
15:        Calculate v and rr
16:        Update HPpos(i,j) using formula with z(j), rr, Target(j), HPpos(i,j)
17:      end for
18:    end if
19:    Clip HPpos(i,:) values to be within bounds of lb and ub
20:    Evaluate fitness of HPpos(i,:)
21:    if HPposFitness(i) < TargetScore then
22:      Update Target and TargetScore
23:    end if
24:  end for
25:  Store TargetScore in Convergence curve(t)
26: end for
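The loop structure of Algorithm 1 can be sketched in Python as below. This is a simplified version of the baseline hunter–prey loop only: it uses uniform initialization and leaves out the Levy flight term, whose exact form depends on Equation (13). The decay constant 0.98, the regulation parameter `B = 0.1`, and the way `z` and `rr` are built are assumptions taken from common HPO implementations, not details confirmed by this article.

```python
import numpy as np

def hpo(fitness, lb, ub, n_pop=30, max_iter=200, B=0.1, seed=0):
    """Minimal hunter-prey optimization loop (minimization), without Levy flight."""
    rng = np.random.default_rng(seed)
    lb, ub = np.asarray(lb, float), np.asarray(ub, float)
    dim = lb.size
    pos = lb + rng.random((n_pop, dim)) * (ub - lb)       # uniform initialization
    fit = np.array([fitness(p) for p in pos])
    best = int(np.argmin(fit))
    target, target_score = pos[best].copy(), float(fit[best])
    curve = [target_score]
    for t in range(2, max_iter + 1):
        c = 1 - t * (0.98 / max_iter)                     # balance parameter, decays with t
        kbest = max(1, round(n_pop * c))
        for i in range(n_pop):
            r1 = rng.random(dim) < c
            r2, r3 = rng.random(), rng.random(dim)
            z = np.where(r1, r3, r2)                      # adaptive parameter
            if rng.random() < B:
                # Hunter move: head towards a prey far from the population mean.
                xi = pos.mean(axis=0)
                dist = np.linalg.norm(pos - xi, axis=1)
                si = pos[np.argsort(dist)[kbest - 1]]
                pos[i] = pos[i] + 0.5 * ((2 * c * z * si - pos[i])
                                         + (2 * (1 - c) * z * xi - pos[i]))
            else:
                # Prey move: relocate around the best position found so far.
                rr = -1 + 2 * z
                pos[i] = 2 * z * np.cos(2 * np.pi * rr) * (target - pos[i]) + target
            pos[i] = np.clip(pos[i], lb, ub)
            f = float(fitness(pos[i]))
            if f < target_score:
                target, target_score = pos[i].copy(), f
        curve.append(target_score)
    return target, target_score, curve

best_pos, best_score, curve = hpo(lambda x: float(np.sum(x ** 2)),
                                  lb=[-10] * 5, ub=[10] * 5, n_pop=20, max_iter=50)
print(best_score)
```

In the improved variant, a chaotic sequence would replace the uniform initialization, and a Levy step would perturb the hunter update once the convergence curve stagnates.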
To validate the superiority of the IHPO algorithm, this article compares its performance with traditional optimization algorithms using the test function described in Equations (14) and (15). The results are depicted in
Figure 6.
According to
Figure 6a,b, it can be observed that the IHPO optimization algorithm converges to values of
and 0, respectively. The convergence speed of the IHPO algorithm is significantly higher than that of other traditional algorithms, achieving superior convergence values with the fewest number of iterations.
3.2. Variational Mode Decomposition
During the collection of converter transformer voiceprint signals, there is often a significant amount of noise interference. To ensure the accuracy of fault diagnosis, this article adopts the VMD algorithm for denoising, aiming to restore the original voiceprint signal as faithfully as possible.
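The VMD algorithm constructs a variational problem and solves it [23, 24]. Firstly, the original signal $f(t)$ is decomposed into k modal components, denoted as $u_k(t)$. The one-sided spectrum of each mode is obtained through Hilbert transformation. The sum of the modal components is made equal to $f(t)$ as a constraint condition, and the Lagrange multiplier $\lambda$ and penalty factor $\alpha$ are introduced to transform it into an unconstrained variational problem, as shown in Equation (16).
$$L\left(\{u_k\},\{\omega_k\},\lambda\right) = \alpha \sum_k \left\| \partial_t\left[\left(\delta(t) + \frac{j}{\pi t}\right) * u_k(t)\right] e^{-j\omega_k t} \right\|_2^2 + \left\| f(t) - \sum_k u_k(t) \right\|_2^2 + \left\langle \lambda(t),\, f(t) - \sum_k u_k(t) \right\rangle \tag{16}$$
In Equation (16), * represents the convolution operation, $u_k$ is the k-th modal component, $\omega_k$ is the central frequency, $\delta(t)$ is the impulse function, $\partial_t$ represents the partial derivative with respect to t, and $\langle \cdot, \cdot \rangle$ denotes the inner product.
The alternating direction method of multipliers is used to solve the variational problem to find the optimal values of $u_k$, $\omega_k$, which is realized in the following steps.
- (1)
Initialize the parameters $\hat{u}_k^1$, $\omega_k^1$, $\hat{\lambda}^1$, set the loop counter $n = 0$, and iteratively update the parameters according to Equations (17)–(19).
- (2)
Update each modal component:
$$\hat{u}_k^{n+1}(\omega) = \frac{\hat{f}(\omega) - \sum_{i \neq k} \hat{u}_i(\omega) + \hat{\lambda}^n(\omega)/2}{1 + 2\alpha\left(\omega - \omega_k^n\right)^2} \tag{17}$$
In Equation (17), $\hat{u}_k(\omega)$, $\hat{f}(\omega)$, $\hat{\lambda}(\omega)$ are the Fourier transforms corresponding to $u_k(t)$, $f(t)$, $\lambda(t)$.
- (3)
Update each central frequency:
$$\omega_k^{n+1} = \frac{\int_0^{\infty} \omega \left|\hat{u}_k^{n+1}(\omega)\right|^2 d\omega}{\int_0^{\infty} \left|\hat{u}_k^{n+1}(\omega)\right|^2 d\omega} \tag{18}$$
- (4)
Update the Lagrange multiplier, where $\tau$ is the update step:
$$\hat{\lambda}^{n+1}(\omega) = \hat{\lambda}^n(\omega) + \tau\left(\hat{f}(\omega) - \sum_k \hat{u}_k^{n+1}(\omega)\right) \tag{19}$$
- (5)
Determine convergence by setting the tolerance $\varepsilon$:
$$\sum_k \frac{\left\|\hat{u}_k^{n+1} - \hat{u}_k^n\right\|_2^2}{\left\|\hat{u}_k^n\right\|_2^2} < \varepsilon \tag{20}$$
- (6)
Determine whether the iteration condition is satisfied; if not, return to step (2).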
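The iteration in steps (1)–(6) can be sketched compactly with NumPy. This is a simplified version that works on the one-sided spectrum, skips the boundary mirroring used by full implementations, and sets the multiplier step `tau` to zero by default, so it is a sketch under stated assumptions and not a production decomposition. The function name, default tolerance, and initial center frequencies are illustrative.

```python
import numpy as np

def vmd(signal, K, alpha, tau=0.0, tol=1e-7, max_iter=500):
    """Simplified variational mode decomposition.

    Returns (modes, omega): modes has shape (K, len(signal)); omega holds the
    center frequencies in cycles per sample (multiply by fs to get Hz).
    """
    x = np.asarray(signal, dtype=float)
    f_hat = np.fft.rfft(x)                        # one-sided spectrum
    freqs = np.fft.rfftfreq(len(x))               # 0 ... 0.5 cycles/sample
    u_hat = np.zeros((K, f_hat.size), dtype=complex)
    omega = 0.5 * np.arange(K) / K                # evenly spaced initial centers
    lam = np.zeros(f_hat.size, dtype=complex)
    for _ in range(max_iter):
        u_prev = u_hat.copy()
        for k in range(K):
            others = u_hat.sum(axis=0) - u_hat[k]
            # Mode update: Wiener-like filter centered at omega[k]
            u_hat[k] = (f_hat - others + lam / 2) / (1 + 2 * alpha * (freqs - omega[k]) ** 2)
            # Center frequency update: power-weighted mean frequency
            power = np.abs(u_hat[k]) ** 2
            omega[k] = (freqs @ power) / (power.sum() + 1e-12)
        # Dual ascent on the reconstruction constraint
        lam = lam + tau * (f_hat - u_hat.sum(axis=0))
        change = np.sum(np.abs(u_hat - u_prev) ** 2) / (np.sum(np.abs(u_prev) ** 2) + 1e-12)
        if change < tol:
            break
    modes = np.fft.irfft(u_hat, n=len(x), axis=1)
    return modes, omega

fs = 1000
t = np.arange(fs) / fs
two_tone = np.cos(2 * np.pi * 50 * t) + np.cos(2 * np.pi * 200 * t)
modes, omega = vmd(two_tone, K=2, alpha=2000)
print(np.sort(omega * fs))
```

For this two-tone test signal the recovered center frequencies settle near 50 Hz and 200 Hz. A larger `alpha` narrows each mode's bandwidth, which is why the choice of the number of modes and the penalty factor affects the result so strongly.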
3.3. Multi-Strategy Improvement of MFCC for Dimensionality Reduction Extraction of Voiceprint Features
As a common speech feature extraction method, MFCC is widely used in the field of speech recognition [25]. Because spectral leakage is very likely to occur in the Fourier transform, the S-transform is used here as the time-frequency conversion method. In addition, because the energy of the converter transformer voiceprint signal is stationary, the frame-level features are aggregated over a medium-time window. Together, these changes yield the improved MFCC method used for voiceprint feature extraction.
3.3.1. S-Transform
The S-transform employs a Gaussian window function whose width adapts to the frequency, replacing the fixed window function of the short-time Fourier transform and the scale-parameter window function of the wavelet transform. This approach provides higher frequency resolution at low frequencies and effectively remedies the shortcomings of the Fourier transform [26].
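The result of signal $x(t)$ after S-transformation is shown in Equation (21).
$$S(\tau, f) = \int_{-\infty}^{\infty} x(t)\, w(\tau - t, f)\, e^{-j2\pi f t}\, dt \tag{21}$$
where f is the frequency, t is the time variable of $x(t)$, $\tau$ is the time component after S-transformation, and $w(\tau - t, f)$ is the Gaussian window function for adaptive adjustment, as shown in Equation (22):
$$w(t, f) = \frac{|f|}{\sqrt{2\pi}}\, e^{-\frac{t^2 f^2}{2}} \tag{22}$$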
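A discrete S-transform is commonly computed in the frequency domain, where the Gaussian window becomes a multiplication followed by an inverse FFT. The sketch below follows that standard approach. The function name and the test tone are illustrative, and the output is restricted to non-negative frequencies.

```python
import numpy as np

def s_transform(x):
    """Discrete S-transform of a real signal.

    Returns a complex array of shape (N // 2 + 1, N): rows are frequency bins,
    columns are time samples.
    """
    x = np.asarray(x, dtype=float)
    N = len(x)
    X = np.fft.fft(x)
    m = np.fft.fftfreq(N, d=1.0 / N)              # integer frequency shifts in FFT order
    S = np.zeros((N // 2 + 1, N), dtype=complex)
    S[0] = x.mean()                               # zero-frequency row is the signal mean
    for n in range(1, N // 2 + 1):
        # Fourier transform of the Gaussian window; its width scales with frequency n
        gauss = np.exp(-2 * np.pi ** 2 * m ** 2 / n ** 2)
        # np.roll(X, -n)[m] equals X[m + n]: shift the spectrum, then localize in time
        S[n] = np.fft.ifft(np.roll(X, -n) * gauss)
    return S

N = 128
tone = np.cos(2 * np.pi * 16 * np.arange(N) / N)
S = s_transform(tone)
print(S.shape, int(np.argmax(np.abs(S).mean(axis=1))))
```

Each frequency row needs its own inverse FFT, so the cost grows roughly as N² log N. That growth is why long recordings are usually split into short frames before the transform is applied.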
3.3.2. Multi-Strategy Improvement MFCC
In the field of audible sound recognition, given that the human ear exhibits varying sensitivities to each frequency band and its perception of the normal frequency scale is nonlinear, Mel filtering is typically employed to transform the spectral information of the voiceprint into a Mel spectrum on the Mel scale. The relationship between the normal frequency scale and the Mel frequency scale is expressed as in Equation (23):
$$k = 2595 \lg\left(1 + \frac{f}{700}\right) \tag{23}$$
where f is the frequency on the regular scale and k is the corresponding frequency on the Mel scale.
In the domain of power equipment fault diagnosis, low-frequency information within 1000 Hz frequently incorporates numerous fault characteristics. Consequently, the utilization of Mel filters can adjust voiceprint information to varying degrees, enhance low-frequency information, and filter high-frequency information and compress it. The equal-height Mel filter bank function is expressed in Equation (24):
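$$H_m(n) = \begin{cases} 0, & n < f(m-1) \\ \dfrac{n - f(m-1)}{f(m) - f(m-1)}, & f(m-1) \le n \le f(m) \\ \dfrac{f(m+1) - n}{f(m+1) - f(m)}, & f(m) < n \le f(m+1) \\ 0, & n > f(m+1) \end{cases} \tag{24}$$
where m is the filter bank number and n is the spectral bin index. The number of filters in this paper is set to 26; therefore, the range of m is $1 \le m \le 26$. $f(m)$ is the center frequency of the mth Mel filter. The formula for the calculation of $f(m)$ is:
$$f(m) = \frac{N}{f_s}\, F_{mel}^{-1}\left(F_{mel}(f_l) + m\,\frac{F_{mel}(f_h) - F_{mel}(f_l)}{M + 1}\right) \tag{25}$$
where $f_s$ is the sampling frequency, $f_l$, $f_h$ represent the frequency range of the Mel filter bank, N is the number of S-transform samples, M is the number of Mel filters, $F_{mel}$ is the frequency-to-Mel mapping of Equation (23), and $F_{mel}^{-1}$ is its inverse.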
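An equal-height triangular Mel filter bank of this kind can be sketched as follows. The conversion uses the widely used mapping mel = 2595 log10(1 + f/700). The number of spectral bins, the frequency range, and the function names are assumptions chosen to match a 16 kHz signal with 10 Hz bin spacing.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)

def mel_filter_bank(n_filters=26, n_bins=801, fs=16_000, f_low=0.0, f_high=None):
    """Equal-height triangular filters; returns an array of shape (n_filters, n_bins)."""
    f_high = fs / 2 if f_high is None else f_high
    # n_filters + 2 edge points, equally spaced on the Mel scale
    edges_hz = mel_to_hz(np.linspace(hz_to_mel(f_low), hz_to_mel(f_high), n_filters + 2))
    bin_freqs = np.linspace(0.0, fs / 2, n_bins)
    bank = np.zeros((n_filters, n_bins))
    for m in range(1, n_filters + 1):
        left, center, right = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        rising = (bin_freqs - left) / (center - left)
        falling = (right - bin_freqs) / (right - center)
        bank[m - 1] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

bank = mel_filter_bank()
print(bank.shape)
```

Because the edge points are equally spaced in Mel, the filters are narrow and dense below about 1000 Hz and progressively wider above it, which emphasizes the low-frequency band and compresses the high-frequency band.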
The improved MFCC feature extraction method is distinguished from MFCC by the simpler operations of frame splitting and window adding. The specific steps are as follows:
- (1)
Framing: the S-transform has a high time complexity, so in order to save time, the original signal is framed with a fixed frame length.
- (2)
S-transform: the S-transform is performed on each frame using Equation (21) to obtain the time-frequency matrix.
- (3)
The spectral information is sought, as shown in Equation (26).
where
is the time-frequency matrix,
t is the time corresponding to the S-transform matrix, and
f is the frequency.
- (4)
Bandpass filtering is performed, as in Equation (27).
where
is the Mel filter output and
is the filter bank.
- (5)
A discrete cosine transform is performed as in Equation (28) to obtain the first set of voiceprint characterization coefficients
.
- (6)
We perform first-order and second-order difference operations on these coefficients to obtain the second and third sets of parameters of the improved MFCC feature vector.
- (7)
We splice the three sets of parameters to form the feature vector.
Compared with the human speaking voice, the voiceprint signals of power equipment tend to be stationary, so the feature vector obtained above contains a large amount of redundant information between frames. Using the medium-time features shown in Equation (29) is therefore more in line with the stationary character of power equipment voiceprints: it reduces the interference of individual anomalous frames and generalizes better [27]. The multi-strategy improved MFCC flowchart is shown in Figure 7.
$$F_{mid} = \frac{1}{N}\sum_{i=1}^{N} F_i \tag{29}$$
where $F_i$ is the feature of the ith frame signal, N is the number of medium-time signal frames, and $F_{mid}$ is the medium-time feature vector.
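Steps (5)–(7) and the medium-time aggregation can be sketched as below, starting from the log Mel filter outputs of each frame. The number of cepstral coefficients (13), the use of central differences for the first- and second-order terms, and the plain average over frames are assumptions made for illustration.

```python
import numpy as np

def dct_coefficients(log_mel, n_coeffs=13):
    """DCT-II of each frame's log Mel energies; returns shape (frames, n_coeffs)."""
    n_filters = log_mel.shape[-1]
    idx = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_coeffs)[:, None] * (idx + 0.5) / n_filters)
    return log_mel @ basis.T

def frame_difference(features):
    """Frame-to-frame difference along the time axis (central differences)."""
    return np.gradient(features, axis=0)

def mid_term_feature(log_mel_frames, n_coeffs=13):
    """Static, first-difference, and second-difference coefficients averaged over frames."""
    static = dct_coefficients(np.asarray(log_mel_frames, dtype=float), n_coeffs)
    first = frame_difference(static)
    second = frame_difference(first)
    per_frame = np.hstack([static, first, second])   # splice the three parameter sets
    return per_frame.mean(axis=0)                    # one vector per medium-time window

rng = np.random.default_rng(0)
frames = rng.normal(size=(10, 26))                   # 10 frames x 26 Mel filter outputs
print(mid_term_feature(frames).shape)
```

For a perfectly stationary signal every frame is identical, so the two difference blocks vanish and the averaged vector carries the same information as any single frame.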
3.4. Improved Temporal Convolutional Neural Networks
Temporal convolutional networks (TCNs) have good sequence processing capabilities. In comparison to traditional architectures such as convolutional neural networks, a TCN achieves greater depth by incorporating the skip connections of residual blocks, effectively carrying shallow features into deeper layers for improved accuracy [28, 29]. To enlarge the receptive field without adding complexity to the network, dilated convolution is employed, and the causal dilated convolution is calculated as shown in Equation (30):
$$F(s) = (x *_d f)(s) = \sum_{i=0}^{k-1} f(i)\, x_{s - d \cdot i} \tag{30}$$
where x is the input sequence, d is the dilation factor, k is the convolution kernel size, and $f(i)$ is the ith element of the convolution kernel.
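A direct NumPy version of the causal dilated convolution makes the indexing explicit: each output sample depends only on the current input and on inputs `dilation`, `2 * dilation`, ... steps in the past. The zero padding on the left and the helper for the receptive field (one convolution per dilation level) are illustrative assumptions.

```python
import numpy as np

def causal_dilated_conv(x, kernel, dilation=1):
    """out[s] = sum_i kernel[i] * x[s - dilation * i], with zeros before the start."""
    x = np.asarray(x, dtype=float)
    k = len(kernel)
    pad = (k - 1) * dilation
    padded = np.concatenate([np.zeros(pad), x])      # left padding keeps the filter causal
    out = np.zeros_like(x)
    for i, weight in enumerate(kernel):
        start = pad - dilation * i
        out += weight * padded[start:start + len(x)]
    return out

def receptive_field(kernel_size, dilations):
    """Number of input samples seen by one output when stacking one conv per dilation."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)

print(causal_dilated_conv([1, 2, 3, 4, 5], kernel=[1, 1], dilation=2))
print(receptive_field(2, [1, 2, 4]))
```

With a kernel size of 2 and dilations 1, 2, and 4, three stacked layers already cover 8 input samples, which shows how exponentially growing dilations widen the receptive field without adding parameters.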
The traditional TCN residual module introduces nonlinearity through the ReLU activation function. However, when the input is negative, the gradient of ReLU is zero, and because its outputs are never negative, their mean is shifted away from zero (the bias shift phenomenon). This limits the learning efficiency and effectiveness of the TCN. Bringing the output mean of the activation function close to zero serves a dual purpose: it reduces the vanishing gradient problem and mitigates the impact of weight initialization. A near-zero-mean output also facilitates the propagation of information between the layers of the network, which helps the network learn complex features and representations more efficiently. Therefore, the Mish activation function is used to replace the traditional ReLU function, as in Equation (31):
$$\mathrm{Mish}(x) = x \tanh\left(\ln\left(1 + e^{x}\right)\right) \tag{31}$$
As depicted in Figure 8, although the Tanh function is exactly zero-mean, its output saturates within (−1, 1), which makes it prone to vanishing gradients. The Mish activation function offers a better trade-off between the zero-mean property and the vanishing gradient problem [30].
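The difference between the two activation functions for negative inputs can be checked numerically. The sketch below uses the standard definition of Mish, x · tanh(softplus(x)), with a numerically stable softplus; the sample inputs are arbitrary.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, np.asarray(x, dtype=float))

def mish(x):
    """Mish(x) = x * tanh(ln(1 + exp(x))), using a stable softplus."""
    x = np.asarray(x, dtype=float)
    softplus = np.logaddexp(0.0, x)       # ln(1 + e^x) without overflow for large x
    return x * np.tanh(softplus)

inputs = np.array([-3.0, -1.0, 0.0, 1.0, 3.0])
print(relu(inputs))
print(np.round(mish(inputs), 4))
```

ReLU maps every negative input to exactly zero, so those units pass no gradient. Mish returns small negative values there, which keeps a nonzero gradient and pulls the mean activation closer to zero.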
The improved TCN architecture is illustrated in Figure 9 (k = 2, d = 1, 2, 4), where each residual module contains two causal convolutional layers. The network's performance is enhanced through the incorporation of the Mish activation function, weight normalization, and dropout.
The improved TCN pseudocode is shown in Algorithm 2:
Algorithm 2 Improved temporal convolutional network
Input: Input sequence X with length T, Number of residual blocks K, Stack size S, Number of output channels C, Filter size f, Initial dilation value d0, Learning rate η
Output: Probability distribution over classes
1: Initialize all model parameters
2: Set learning rate to η
3: Set initial dilation value to d0
4: for k = 1 to K do
5:   for s = 1 to S do
6:     for c = 1 to C do
7:       Apply causal convolution to input sequence X with dilation d
8:       Apply activation function (e.g., Mish) to the output
9:       Apply weight normalization to the output
10:      Update output sequence O
11:    end for
12:  end for
13:  Stack the output sequence O with the input sequence X as the new input
14:  Increase the dilation value d exponentially
15: end for
16: Apply a fully connected layer to the final output sequence O
17: Apply softmax function to obtain probability distribution over classes
3.5. Multi-Strategy Improved MFCC-IHPO-VMD-ITCN Combined Fault Diagnosis Modeling
Converter transformer voiceprint signals are mainly concentrated in the low-frequency band. Considering the operating patterns of the converter transformer, a combined voiceprint–electric feature vector is adopted to overcome the problem of interference between core and winding vibrations. The accurate identification of converter transformer faults is achieved through a diagnostic process from denoising through feature extraction to pattern recognition. The diagnostic workflow is illustrated in
Figure 10.
The VMD is optimized by IHPO to obtain the denoised voiceprint signal of the transformer body. The selection of the decomposition number k and the penalty factor α has a significant impact on the decomposition result: an improper choice easily leads to over-decomposition or loss of band information. Therefore, the minimum envelope entropy shown in Equation (32) is selected as the fitness function, and IHPO is utilized to select the optimal [k, α] to overcome the inherent defects of VMD decomposition.
where
N is the number of Intrinsic Mode Function (IMF) components,
is the envelope entropy after Hilbert adjustment,
is the normalized form, and
is the envelope signal.
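A minimum-envelope-entropy fitness of this kind can be sketched as follows. The envelope is obtained from the analytic signal, which is built here with an FFT-based Hilbert transform. The base-10 logarithm, the helper names, and the choice to take the smallest entropy over all modes are assumptions, since Equation (32) itself is not reproduced in the code.

```python
import numpy as np

def envelope(x):
    """Magnitude of the analytic signal (FFT-based Hilbert transform)."""
    x = np.asarray(x, dtype=float)
    N = len(x)
    gain = np.zeros(N)
    gain[0] = 1.0
    if N % 2 == 0:
        gain[N // 2] = 1.0
        gain[1:N // 2] = 2.0          # double positive frequencies, drop negative ones
    else:
        gain[1:(N + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(x) * gain))

def envelope_entropy(x):
    """Shannon entropy (base 10) of the normalized envelope."""
    a = envelope(x)
    p = a / a.sum()
    p = p[p > 0]                       # skip exact zeros so the logarithm stays finite
    return float(-(p * np.log10(p)).sum())

def vmd_fitness(modes):
    """Fitness for a candidate (k, alpha): the smallest envelope entropy among the modes."""
    return min(envelope_entropy(mode) for mode in modes)

n = np.arange(1000)
steady_tone = np.sin(2 * np.pi * 50 * n / 1000)
impulse = np.zeros(1000)
impulse[500] = 1.0
print(envelope_entropy(steady_tone), envelope_entropy(impulse))
```

A steady tone has a flat envelope and therefore the maximum entropy, log10 of the number of samples, while an impulsive component concentrates its envelope in a few samples and scores lower. Minimizing the entropy thus favors decompositions whose modes are sparse and carry distinct features.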
The load signal is normalized and combined with the multi-strategy improved MFCC features to construct a joint voiceprint–electrical feature vector for the converter transformer, which enables multi-channel signal fault diagnosis.
Optimizing ITCN based on IHPO involves tuning key parameters such as the kernel size (k) and the dilation factor (d) of the dilated convolution, which are crucial in determining the receptive field size and training accuracy. Utilizing Equation (33) as the fitness function enables adaptive optimization of ITCN to find the values of k and d that maximize performance.
where
is the training set accuracy.
5. Conclusions
This paper proposes a fault diagnosis method that combines the multidimensional-improvement strategy of MFCC with adaptive VMD-ITCN and incorporates the influence of load signals. This method significantly enhances recognition accuracy and is applicable in the field of fault diagnosis for converter transformers. Our experimental results demonstrate that the application of IHPO for optimizing VMD and ITCN has significant benefits, such as improved convergence and the avoidance of parameter-related impacts on fault diagnosis models. The introduction of load signals divides the entire operational process of the converter transformer into three stages, diagnosing core faults in Stage I and winding faults in Stage III. The effectiveness of the proposed model was verified using a sample dataset from an 800 kV converter station. This model exhibits superior performance in terms of recognition accuracy and training speed, providing a new approach for maintenance personnel to promptly and accurately detect internal defects in converter transformers.
The fault diagnosis model proposed in this article is data-driven: it achieves fault classification through the analysis of historical data of converter transformers. Because it relies on such historical data, the number of fault categories and samples is relatively small. In future research, we will collect fault data of converter transformers in different scenarios and expand the types of faults. The idea of transfer learning, as described in references [32, 33], can also be introduced to further improve the generalization of diagnostic models. In addition, we will consider establishing an accurate mathematical model from a model-driven perspective to simulate fault signals and achieve fault diagnosis.