1. Introduction
Centrifugal compressors, steam turbines, flue gas turbines, expanders and other high-speed rotating machinery are widely used in petrochemical, coal chemical, metallurgical and other industrial fields. In the event of blade fracture, rotor imbalance, rubbing, surges and other faults [
1], minor faults may cause equipment failure and production interruption, and serious faults may cause machine damage and fatal accidents, leading to huge economic losses or social impact to enterprises [
2,
3]. Timely and automatic identification of equipment failure types, to take control and to take preventive measures, is of great significance for reducing or avoiding economic losses in enterprises and preventing catastrophic failures of rotating machinery [
4].
Oil-whirl faults caused by improper assembly-clearance and contact areas between tilting pad bearing and shaft is one of the most common faults in rotating equipment. Oil whirl failure refers to severe fluctuations or vibrations in the lubricating oil film, which usually occurs when the lubricating oil film cannot be stably maintained on the surface of mechanical parts. This failure may cause serious harm to mechanical equipment and systems. The specific hazards include: (1) Friction and wear increase: oil whirl will lead to the instability of the lubricating oil film, so that the contact area between the mechanical parts increases; friction and wear will increase accordingly. Long-term friction and wear can lead to damage and the shortened life of parts. (2) Energy loss: oil whirl will cause abnormal contact between mechanical parts, which will lead to energy loss, thus affecting the efficiency and performance of the mechanical system. (3) Vibration and noise: Oil whirl can cause the vibration of mechanical parts, and then produce noise. These vibrations and noises not only affect the normal operation of mechanical equipment, but also may affect the surrounding environment and the health of workers. (4) Heat accumulation: oil whirl may lead to local energy concentration, resulting in excessive heat accumulation. This may lead to overheating of lubricating oil and further aggravate the damage to mechanical parts. (5) System failure: If the problem caused by oil whirl is not solved in time, it may lead to the failure of the components of the mechanical system, which in turn affects the normal operation of the entire equipment. This may require expensive maintenance and downtime. In addition, due to the shaft misalignment, rotor imbalance, surges, rubbing and other faults of rotating equipment may also occur at the same time; it is challenging to accurately identify the early faults of tilting pad bearings.
The research on fault diagnosis methods based on artificial intelligence has achieved fruitful results. Zhong et al. [
5] proposed a rolling bearing fault diagnosis method based on a convolutional autoencoder and nearest-neighbor algorithm, which was verified experimentally by using the experimental data set published by CWRU under different working conditions. Mohiuddin et al. [
6] proposed an improved AlexNet-based intelligent fault diagnosis method for rolling bearings, which was verified experimentally using the data of different working conditions and a different signal-to-noise ratio of the experimental data set published by CWRU. Cui et al. [
7] proposed a method for fault diagnosis of rolling bearings under the condition of sample imbalance based on CNN, and used the conventional rolling bearing-fault data set collected in the laboratory for experimental verification. Zhang et al. [
8] proposed a CNN-based multi-channel data fusion neural network for rolling bearing fault diagnosis, using bearing data collected by eight vibration sensors on the SB25 aero-engine bearing bench test for experimental verification of the model. Shen et al. [
9] proposed an improved Gray Wolf optimizer algorithm based on a support vector machine and swarm intelligence optimization algorithm for rolling bearing fault diagnosis. The proposed algorithm was verified experimentally using the experimental data set published by CWRU and the data obtained from the mechanical transmission bearing life-cycle test platform independently developed by Nanjing Agricultural University. Huang et al. [
10] proposed a rolling bearing fault-detection method based on an improved Gray Wolf algorithm to optimize multi-stable stochastic resonance parameters, and conducted experimental verification using the published experimental data sets of CWRU and MFPT. Tian et al. [
11] proposed a CNN-LSTM bearing fault diagnosis model based on hybrid particle swarm optimization, and conducted experimental verification using the experimental data set disclosed by CWRU. However, most of the research objects of these research results are rolling bearings, and the model test data are almost all derived from laboratory bench test data. Moreover, there are few research results on the fault diagnosis of rotor systems composed of shafts, impellers or blades, couplings, and tilting pad bearings. There are still shortcomings in the operational risk evaluation of rotor systems.
The traditional fault diagnosis method for rotating machinery relies on the experience and knowledge of external experts, and relies on a spectrum analysis diagram, Bode diagram, Nyquist diagram and other analysis toolboxes in the condition-monitoring and analysis software to carry out a one-by-one manual analysis. This not only has a low efficiency of fault diagnosis and analysis, but also has a great lag, which often leads to untimely early fault-detection. Industrial Internet-enabled equipment management technology has developed rapidly in China. The accumulated equipment state-aware data has laid a foundation for intelligent fault diagnosis based on artificial intelligence and big data analysis. This data-driven, deep learning intelligent fault diagnosis method [
12,
13,
14] makes full use of the advantages of industrial big data and greatly reduces the dependence of the model on external experts. It has gradually become a development trend for equipment fault diagnosis technology in the industrial Internet environment [
15,
16].
During the service life-cycle of rotating equipment, the time of fault-free operation for equipment is far greater than that of fault operation, which determines that the data samples of normal-state perception for equipment are significantly more abundant than those of fault-state perception. The data samples are of a long-tail distribution type and have the characteristics of low value density [
17]. For specific rotating equipment, it is impossible to go through all the faults such as rotor imbalance, axis misalignment, rubbing, oil film whirl, surges and so on in the service life-cycle. Some equipment will not even have any kind of fault in the whole life-cycle. The lack of equipment-fault sample data is one of the challenging problems faced by fault diagnosis technology based on artificial intelligence and big data [
18].
When model-training samples are insufficient, a generative adversarial network is considered as one of the effective methods for solving the problem of data imbalance [
19]. Generative Adversarial Networks (GAN) [
20] are a deep learning model that is one of the most promising approaches to unsupervised learning over complex distributions in recent years. The model produces a fairly good output through game learning between (at least) two modules in the framework: the generative model and the discriminative model. GAN models generally use deep neural networks as G and D. A good GAN application needs to have a good training method; otherwise the output may not be ideal due to the freedom of the neural network model. To improve a GAN’s data generation capabilities and optimize the training process, a Deep Convolution Generative Adversarial Network (DCGAN) based on a deep Convolutional Neural Network (CNN) and generated high-resolution images have been proposed [
21]. However, as the training time of the model increases, some filters of the model will collapse and oscillate, resulting in mode collapse.
In order to solve the problem of GAN pattern collapse, a Wasserstein generative adversarial network (WGAN) model was constructed to overcome the problem by improving the stability of training [
22]. Aiming at the problem that WGAN adopts weight-clipping to solve the problem of Lipschitz constraints that can easily cause gradient disappearance or gradient explosion and slow model convergence, an improved Wasserstein GAN training method (WGAN-GP) has been proposed by Gulrajani et al. [
23]. By using a gradient penalty instead of weight-clipping to solve the problem of Lipschitz constraints, gradient disappearance or gradient explosion during model training can be avoided, and the problem of slow convergence of WGAN also can be solved. GAN, DCGAN, WGAN, and WGAN-GP are all unsupervised learning models that generate samples without category labels and cannot generate multiple types of samples using the same model.
In order to enhance the performance of GAN, an Auxiliary Classifier GAN (ACGAN) supervised learning model, which adds category labels to the generator and discriminator, as well as a classifier to the output part of the discriminator, have been proposed [
24]. This ACGAN model realizes that the generated samples all have a corresponding category label. The ACGAN model is improved based on DCGAN, so ACGAN still has the problem of model collapse. A Parallel Classification Wasserstein Generative Adversarial Network with Gradient Penalty (PCWGAN-GP) has been proposed by Yu et al. [
25]. By feeding healthy samples into the PCWGAN-GP model, the model will produce various failure samples of good quality, which can gradually expand the unbalanced data set until equilibrium is reached.
PCWGAN-GP is an unsupervised learning model, which needs to be constructed and trained independently for each fault type to obtain a balanced data set. This undoubtedly increases the workload of model construction and increases the time of model training. An ACWGAN-GP model based on a gradient penalty and auxiliary classifier has been built by Li et al. [
26], which can generate good-quality samples from an unbalanced training set, and has used the balanced data set for training Multilayer Perceptron (MLP), CNN, Support Vector Machine (SVM) and other classifiers for fault diagnosis. Cao et al. [
27] constructed a fault diagnosis model based on ACWGAN-GP and homogeneous superposition ensemble learning, which significantly improved the classification accuracy and stability of the model.
ACWGAN-GP combines the advantages of WGAN-GP and ACGAN, so that the model has the ability to generate multi-class label samples while overcoming the problems of pattern collapse and gradient disappearance. As a supervised learning model, ACWGAN-GP still needs a complete variety of fault label sample training data sets. Obviously, engineering application scenarios are not always able to meet such needs. Furthermore, the length of a single device state-aware data file is different, and the ACWGAN-GP model can only adapt to a single data file type, which cannot meet the needs of engineering applications. Therefore, the application of the ACWGAN-GP model for equipment full-fault diagnosis needs improvements in the model structure, so that it cannot only meet the data function of generating complete fault samples, but also automatically adapt to different equipment-state perception data.
Rotating equipment is generally composed of shafts, impellers or blades, comb seals, couplings, tilting pad bearings and other components. Among them, oil whirl faults caused by improper assembly-clearance and contact areas between tilting pad bearings and shafts are the most common. It is challenging to accurately identify the early faults of tilting pad bearings, because shaft misalignment, rotor imbalance, surges, rubbing and other faults of rotating equipment may also occur at the same time. Aiming at the engineering status of unbalanced data samples of rotating equipment, this paper studies an improved auxiliary classifier Wasserstein generative adversarial network with a gradient penalty for fault diagnosis of tilting pad bearings. The contributions of this paper are listed as follows:
(1) An improved auxiliary classifier Wasserstein generative adversarial network with gradient penalty is developed, in which the input data-length adaptive layer is added before the 2D convolution layer of the discriminator.
(2) A fault diagnosis method based on IACWGAN-GP for tilting pad bearings is proposed, which is able to accurately identify the early faults of tilting pad bearing oil whirl despite the interference of shaft misalignment, rotor imbalance, surges, rubbing and other faults that may occur simultaneously in rotating equipment.
(3) The application of an IACWGAN-GP-based fault diagnosis model in an industrial Internet environment via cloud-integrated prediction and health management systems, which includes a cyber-physical system layer, network layer and application layer, is proposed. The application layer consists of micro-service systems such as early fault warning, health evaluation and fault diagnosis.
5. Conclusions
Aiming at the engineering status of unbalanced data samples for rotating equipment, this paper studies an improved auxiliary classifier Wasserstein generative adversarial network with gradient penalty for fault diagnosis of tilting pad bearings. The work can be summarized as follows:
(1) An improved auxiliary classifier Wasserstein generative adversarial network with gradient penalty is developed, in which the input data length adaptive layer is added before the 2D convolution layer of the discriminator. It overcomes the limitation of neural networks on the length of input data and improves the applicability and generalization of neural networks to various types of data.
(2) A fault diagnosis method based on IACWGAN-GP for tilting pad bearings is proposed, which is able to accurately identify the early faults of tilting pad bearing oil whirl despite the interference of shaft misalignment, rotor imbalance, surges, rubbing and other faults that may occur simultaneously in rotating equipment. This method can identify oil whirl faults as they develop from weak to strong, and evaluate the grade of the fault. The engineering case-data verification results show that, with only normal data of the equipment, the model can achieve an accuracy of 98.7% in spotting upcoming faults. Train Multilayer Perceptron, CNN and Auxiliary Classifier GAN fault diagnosis models using full-fault virtual samples, and the accuracy of the models reach 92.7%, 97.7%, and 98.3%, separately. The proposed method and three comparison methods are tested by using cross-device and cross-condition engineering case data sets. The fault diagnosis accuracy of the proposed method and the three comparison methods are 98%, 60.8%, 31.8% and 77.7%, respectively, and the proposed method shows better robustness.
(3) The application of an IACWGAN-GP-based fault diagnosis model in an industrial Internet environment, via a cloud-integrated prediction and health management system, which includes cyber-physical system layer, network layer and application layer, is proposed. The application layer consists of micro-service systems such as early fault warning, health evaluation and fault diagnosis.
In this paper, the typical fault diagnosis of rotor systems is studied, and the proposed fault diagnosis method has a high fault diagnosis accuracy and robustness. However, the engineering case data involved in this paper only contains the data of a single type of fault. When the equipment has multiple faults at the same time, the proposed method can only draw a diagnosis conclusion for one of the faults. In addition, when a fault outside the fault category included in the training data set occurs, the proposed method will draw a similar diagnostic conclusion based on the similarity of fault characteristics between the unknown fault and the known fault, which may lead to incorrect diagnosis results.
Future studies will collect more complex fault engineering case data, use the virtual sample generation module in the proposed method to generate complex virtual fault samples based on the normal data of equipment and the fault characteristic frequency of complex faults, and use the engineering case data for experimental verification. In addition, the knowledge base of fault mechanisms except typical faults will be extended, and the range of fault categories in the training set of the model will be expanded to solve the fault diagnosis problem of unknown faults to a certain extent. The proposed method can also be applied to the fault diagnosis of various types of bearings, gears and other key components, providing support for fault prediction and health management of rotating equipment.