1. Introduction
Rolling element bearings play a vital role in various industrial fields such as aviation, energy, and metallurgy. They generally work under varying speed conditions, which brings great challenges to the health monitoring and performance degradation evaluation of bearings [
1]. For instance, the rotating speed of wind turbines changes because of time-varying wind speed and grid control over the long term. In such operation conditions, vibration signals always show non-stationary characteristics. Both amplitudes and frequency components of vibration signals change with time [
2,
3]. Traditional health monitoring methods generally assume that the change of vibration signals is just caused by faults of mechanical equipment [
4]. Therefore, these methods are unsuitable to health monitoring in the varying speed conditions. Aiming for the issue of incipient fault detection of rolling element bearings under varying speed conditions, new health monitoring methods should be developed. Basically, there are two crucial factors determining the accuracy of incipient fault detection under varying speed conditions: health indicator extraction and alarm trigger mechanism.
In the incipient fault detection process, a health indicator is always desired to effectively compress the large amount of vibration signals and reflect the degradation trend of rotating machinery. Many studies have been conducted on constructing such health indicators. Niu et al. [
5] exploited self-organizing map neural networks to extract a minimum quantization error fusion indicator for methane compressor health monitoring. Soualhi et al. [
6] used the Hilbert–Huang transform to extract Hilbert spectral density as a health indicator to track the degradation of the critical components of bearings. Li et al. [
7] extracted a compressed acoustic emission indicator based on the empirical model decomposition for gear fault detection. Guo et al. [
8] proposed a recurrent neural network based health indicator for remaining useful life prediction of bearings. Javed et al. [
9] extracted a novel indicator based on trigonometric functions and cumulative transformation to monitor the degradation trend of cutting tools and bearings. Qian et al. [
10] applied recurrence quantification analysis to extract recurrence plot entropy indicators for monitoring bearing performance degradation. Picot et al. [
11] proposed a statistic-based spectral health indicator to detect bearing faults. These indicators are able to reflect the degradation trend of rotating machinery, but they are hard to avoid being influenced by the operation condition change. On the contrary, indicators such as kurtosis are insensitive to the change of speed condition, but they are unable to reflect the degradation trend of rotating machinery. Generally speaking, a desirable monitoring indicator should predominantly satisfy the following two requirements: (1) being insensitive to the change of operation conditions; and (2) reflecting the degradation degree of fault severity quantitatively. Considering the shortcoming of existing indicators, it is necessary to extract new health indicators for the incipient fault detection under varying speed conditions.
After constructing health indicators, an alarm trigger mechanism is needed to judge whether incipient faults occur on rotating machinery or not. Various fault trigger mechanisms have been designed to deal with this problem. Ginart et al. [
12] presented a fault trigger mechanism based on constant alarm threshold for fault detection of rotating machinery. Li et al. [
7] detected gearbox faults using a probability trigger mechanism, i.e., if more than a given percentage of features are greater than a fixed fault detection threshold, then it is believed that the incipient fault happens. Loutas et al. [
13] employed an alarm trigger mechanism based on a heuristic threshold for health monitoring of gearboxes. Li et al. [
14] proposed an adaptive continuous trigger mechanism, which is regarded that the incipient fault has occurred when several continuous indicator values exceed the 3σ interval. Jin et al. [
15] employed a combined alarm trigger mechanism to successfully detect faults of cooling fans and motor bearings. These approaches have good performance in the fault detection. However, they still have several shortcomings. For instance, in the probability trigger mechanism, the percentage is calculated based on all of the historical measurements, which changes little when a new indicator value above the threshold appears. In the continuous trigger mechanism, several continuous indicator values above the threshold often occur in the significant faults stage, but seldom happen in the incipient fault stage.
To deal with the aforementioned problems, this paper proposes an incipient fault detection method for rolling element bearings under varying speed conditions. The major contributions of this method are: (1) relative residual (RR) features based on extreme learning machine (ELM) are proposed, which are insensitive to the varying speed conditions and are able to better reflect the degradation trend of bearings; (2) a health indicator, i.e., selected negative log-likelihood probability (SNLLP), is constructed to fuse multiple features; and (3) an alarm trigger mechanism is established to improve the accuracy of incipient fault detection based on the analysis of the fault typical behavior.
The rest of this paper is organized as follows.
Section 2 describes the basic theory of ELM and Gaussian mixture model (GMM), respectively. In
Section 3, the proposed incipient fault detection method is introduced in detail. In
Section 4, the health state of the industrial wind turbine is monitored to further verify the effectiveness of the proposed method. Some concluding remarks are drawn in
Section 5.
2. Brief Introduction to ELM and GMM
2.1. Extreme Learning Machine
ELM was proposed by Huang for single hidden layer feedforward neural networks (SLFNs) [
16]. The learning speed of the feedforward neural network is constrained by gradient-based iterative learning algorithms. Unlike many other popular learning algorithms, the learning speed of ELM is fast because input weights and hidden neuron biases are randomly generated [
17]. ELM has been widely used for regression and classification issues by virtue of few parameters tuning, fast learning speed and good generalization performance.
The output of SLFNs with
L hidden nodes can be expressed as:
where
is the weight vector connecting input nodes and the
ith hidden node,
is the bias of the
ith hidden node,
is the weight connecting the
ith hidden node to the output node and
is the output of the
ith hidden node with respect to the input
x. The architecture of SLFNs is shown in the
Figure 1.
The samples are used to train parameters of SLFNs, where is an input vector and is a target vector.
Then, Equation (1) can be rewritten as
where
The number of hidden nodes
is always less than the number of training samples, which means that the training error cannot be exactly zero but can approach zero. ELM randomly generates the input weight
and the hidden node bias
, so the output weight
is just needed to be solved, which is estimated as
where
is a Moore–Penrose generalized inverse of the
matrix.
For a given training sample set and L hidden nodes, the solution process of ELM parameters is listed as:
Step 1: Randomly generate the hidden layer parameters .
Step 2: Select an infinitely differentiable function as the activation function for hidden nodes and then calculate the output matrix of hidden layer .
Step 3: Calculate the output weight .
2.2. Gaussian Mixture Model
GMM has strong robustness and excellent computing performance to model the complicated internal distribution characteristic contained in the data by using several Gaussian mixture components [
18]. This characteristic makes it very suitable to smooth the probability distribution of arbitrary shape. Therefore, the GMM has been widely used in image background extraction and speech recognition in recent years [
19].
Theoretically, arbitrary data can be expressed by
M Gaussian probability density functions
where
is the weight of the
jth mixture component that satisfies the constraint
and
,
.
is a unimodal Gaussian probability density function, which is expressed as
where
is the
n-dimensional data vector,
is a
mean vector and
is a
covariance matrix.
To a dimensional training sample set the sample set is used to estimate the GMM parameters with the aid of the expectation–maximization (EM) algorithm. The solution process of the GMM parameters is listed as follows:
Step 1: The K-means clustering algorithm is first utilized to initialize GMM parameters .
Step 2: To each training data , estimate the probability produced by each unimodal Gaussian probability density function. The probability produced by the jth Gaussian mixture component is expressed as
Step 3: Update parameters .
Step 4: Repeatedly iterate Steps 2 and 3 to update the parameters until , where and are the probability density of kth and (k+1)th iteration, respectively.
Once the final parameters are determined, the GMM that is trained by the sample set has been constructed. Then, the trained GMM can be used to evaluate the membership degree of new data relative to the training sample set, which is able to be denoted by the .
3. Proposed Incipient Fault Detection Method
The flowchart of the proposed method is shown in
Figure 2. It is composed of three modules, i.e., RR extraction, health indicator construction and incipient fault detection.
In the RR features extraction module, real-time vibration signals acquired from rotating machinery are de-noised by singular value decomposition (SVD) to enhance weak incipient fault characteristics. Then, dimensional features are extracted from the de-noised signals. Since the dimensional features are easily influenced by speed variation, RR features are proposed to reduce the interference and better reflect the health degradation trend of bearings. In the health indicator construction module, a feature set is made up of RR features and non-dimensional features. Then, some features are selected from the feature set using degradation evaluation metrics, and the dimension of these selected features is further reduced by orthogonal locality preserving projections (OLPP). A novel health indicator called SNLLP is constructed based on GMM to comprehensively quantify performance degradation of bearings. In the incipient fault detection module, based on the SNLLP indicator, the incipient fault is detected by the proposed alarm trigger mechanism. More details about these three modules are presented as follows.
3.1. RR Features Extraction
In the RR features extraction module, at first, real-time vibration signals are acquired from sensors attached on rolling element bearings. Since bearing fault characteristics, especially in the incipient stage, are always submerged in strong background noise, SVD is employed to cancel the noise imbedded in vibration signals [
20]. Then, original features are extracted from the de-noised signals in time domain, frequency domain and time-frequency domain. Generally, these features are classified into two classes, i.e., dimensional features and non-dimensional features. Many dimensional features are always able to directly reflect the health degradation trend of bearings at a constant speed condition, but the amplitudes of these features are also susceptible to varying speed conditions. To explore the relationships between the speed and dimensional features, varying speed experiments of bearings are executed in a test rig.
The experimental system is shown in
Figure 3. The test rig includes two bearings, a 3-hp motor for driving bearing operation. The motor rotating speed is controlled by a speed controller, which allows the tested bearing to operate under various speeds. Accelerometers are fixed on end cap closing to the tested bearing. The sampling frequency is 25.6 kHz. Data acquisition equipment is utilized to collect the vibration signals of bearings in normal and fault states via increasing the motor speed from 20 Hz to 40 Hz. The speed keeps stable for a period after increasing each 0.5 Hz. Under each constant-speed period, 10 sets of vibration signals are acquired. Each set of vibration signal contains 76,800 data points, i.e., 3 s.
Figure 4 shows the variation of standard deviation of inverse hyperbolic cosine (Std of IHC) [
9] and root mean square(RMS) values with the increase of speed under normal and fault states. From
Figure 4, two obvious rules could be concluded:
There is an approximately linear relationship between the motor speed and the extracted features whether in the normal state or fault state.
The fitting-regression lines of the features under the fault state have a larger slope than those under a normal state, which means that the bearing in the fault state is more susceptible to speed than the bearing in the normal state.
In order to reduce the influence of the time-varying speed to dimensional features, the dimensional features are transformed to a set of RR features using ELM to reflect the degradation trend of rotating machinery under varying speed conditions. The process of RR feature extraction is shown in
Figure 5.
Step 1: The speed condition parameters in the normal state is used as the input of the trained model and the dimensional feature in the normal state is utilized as the output of the trained model to train the ELM empirical model of the normal behavior, where B is the number of training samples.
Step 2: The new captured operating speed is used as the input of the trained model. The ELM model estimation output value of can be obtained as .
Step 3: The RR feature is constructed as , where is the actual feature value extracted from the measured vibration signal.
Feature values in the fault state will deviate from the distribution of those in the normal state, leading to statistically abnormal changes in RR values. Therefore, faults can be detected by evaluating the extracted RR values.
To evaluate the effectiveness of the proposed RR features, experimental data are further analyzed. As described above, vibration signals of bearings in normal and fault states within the 20–40 Hz speed range are first de-noised by the SVD. Then, Std of IHC and RMS are selected from the dimensional features to demonstrate the performance of corresponding RR features. The Std of IHC and RMS values in the normal state and corresponding speed parameters are used to train the ELM empirical model. The performance of the extracted RR features feature in the normal and fault state is shown in
Figure 6. It is seen that the amplitudes of RR features in the normal state are close to 0 and stay stable. In the same fault degree, the amplitudes of RR features are greater than 0. In addition, they are still stable without being influenced by the speed variation obviously. The results indicate that RR features can well recognize the normal and fault state of bearings and also have a good resistance to speed variation.
3.2. Health Indicator Construction
In this module, GMM-based SNLLP is constructed to provide a fusion health indicator. When the feature set composed of RR features and non-dimensional features is extracted, an optimal degradation feature selection method is required to select the features with useful information about bearing degradation processes. Good condition monitoring features should be well correlated to bearing degradation progresses, monotonous increasing or decreasing, robustness to outliers caused by noise, etc. Therefore, correlation, monotonicity and robustness metrics are utilized here for optimal feature selection. An integrated metric about these three evaluation metrics is used [
21]:
where
is total degradation evaluation score whose values belong to the interval
,
is the correlation metric,
is the monotonicity metric,
is the robustness metric and
is the weight of each metric. Three degradation evaluation metrics are computed by virtue of the following equations:
where
is the degradation feature value at the time
,
is the trend value which can be obtained by common smoothness method,
is the residual random value,
is the number of total feature values and
is the unit step function.
In order to select the degradation features that have good performance, the features whose
have higher scores are primarily selected. The selected degradation features still have redundant information. Therefore, the dimension of these selected features is further reduced by OLPP. The OLPP is utilized here to retain the low-dimensional nonlinear manifold structure that is hidden in the data set and better dig out intrinsic fault information [
22]. The former principal components whose total cumulative contribution rate is above 90% are retained.
At last, the reduced principal components are fused using a GMM-based method [
23]. The GMM is trained by the values of the reduced principal components under the normal state. Then, the trained GMM is used to online fuse degradation features. For each new captured sample, GMM will generate
using Equation (6), and the SNLLP health indicator is calculated as follows:
3.3. Incipient Fault Detection
In this module, based on the constructed SNLLP health indicator, a new alarm trigger mechanism is designed to detect the incipient fault based on the analysis of the fault typical behavior. For example, more abnormal fluctuation would appear in the signals and the amplitude of signals may increase gradually. The proposed alarm trigger mechanism is composed of two criteria: (1) in a sliding time window, the number of indicator values above the adaptive threshold has exceeded a safety number; and (2) the indicator values in the sliding time-window present an increasing trend, which is evaluated by the Spearman coefficient.
At first, the adaptive threshold based on the Chebyshev’s inequality is established to identify outliers. For an SNLLP indicator
, the mean and Std of its index set at time
are
and
respectively. The Chebyshev’s inequality can be expressed as:
where
is threshold tolerance. For a given confidence level
, the bound interval is expressed as
. Equation (16) means that the probability of
is less than
[
24]. The former partial SNLLP indicators
are selected to calculate the initial bound:
As a new sample at time
is captured, judge whether the SNLLP value of the sample belongs to the bound interval or not. If the sample belongs to the bound interval, it is added into the index set, i.e.,
, the bound
is recalculated:
where
and
are the mean and standard deviation of the index set
. Otherwise, the sample is regarded as an outlier and the bound
is not updated.
Since the SNLLP indicator values always perform the gradually increasing trend during the whole life cycle of bearings, the upper bound is selected as the detection threshold. Compared with random fluctuation, more abnormal fluctuation would appear when incipient faults occur. Therefore, the first criterion is defined as follows: the number of indicator values above the adaptive threshold in the sliding time-window has exceeded a safety number. The safety number is determined by the threshold tolerance and window length, which can be calculated by the following equation:
where
is the safety number and
is the window length.
When incipient faults occur, the indicator always performs the gradually increasing trend. The Spearman coefficient between the indicator and time is an effective tool to evaluate the trend [
25]. First, indicator
and time
are converted to their ranks
and
, and then the Spearman coefficient
between
and
is computed using the following equality:
where
and
are the means of
and
, respectively. When
is a monotonous increasing or decreasing function of
, the Spearman coefficient
equals
or
. On the contrary, if
does not have a good monotonic relation with
, the
will be close to
.
Since the gradually increasing trend just performs after the incipient fault occurs, it is not a wise choice using the Spearman coefficient in all of the historical measurements. Application of the Spearman coefficient in the sliding time-window is able to increase the sensitivity of detecting the local increasing trends. Therefore, the second criterion can be defined as follows: the Spearman coefficient in the sliding time-window is higher than . When the two criteria above are satisfied simultaneously, it means that an incipient fault has occurred.
5. Conclusions
Aiming at the health monitoring of bearings under varying speed conditions, this paper proposes an accurate incipient fault detection method. Based on the dependence relation analysis between speed and dimensional features, RR features based on ELM are proposed to better reflect the degradation trend of bearings under varying speed conditions. Then, the SNLLP health indicator is constructed to fuse multi-parameter features. Finally, based on the constructed SNLLP health indicator, an incipient fault alarm is triggered by the proposed alarm trigger mechanism. The proposed method is demonstrated using vibration signals from test rigs of bearings and industrial WTs. The results show that the RR features are insensitive to the varying speed conditions and are able to quantitatively reflect the degradation trend of bearings. The proposed alarm trigger mechanism is more reliable and accurate for the detection of incipient faults compared with two other alarm trigger mechanisms.
Although this study has improved the accuracy of the incipient fault detection for bearings under varying speed conditions, the selection number of original features is still set subjectively. How to determine the selection number of original features adaptively is still a problem that needs to be researched in our future.