4.1.1. XJTU-SY Data Set Introduction
In this paper, the effectiveness of the proposed method is verified using accelerated life test data of XJTU-SY bearing, and the experimental bench consists of an AC motor, speed controller, rotating shaft, support bearing, and test bearing [
29]. The experimental data consisted of failure data of 15 bearings in total for three different operating conditions. The accelerated life experiment platform of rolling bearings of Xi’an Jiaotong University is shown in
Figure 6, and the bearing data information is shown in
Table 3.
In this paper, the experimental data of bearings 1, 2, 3, and 4 under working condition one and bearings 1, 3, 4, and 5 under working condition two are selected as the training set, with a total of 1969 sample data. The sample data of the training set is divided into two parts: validation and test. The ratio of validation samples to test samples is 8:2.
4.1.2. XJTU-SY Data Set Experimental Verification and Analysis
(1) Multi-scale features extraction and selection.
Taking bearing 1 under working condition one as an example, a multi-scale hybrid features set is extracted and constructed. First, its data set is decomposed to third-order using EWT. Then the original vibration signal and the third order IMF components are extracted from the time domain, frequency domain, and time-frequency domain with a total of 30 features. Since the test bearing has two kinds of original vibration signals, the horizontal original vibration signal and its third-order IMF components are extracted with a total of 120 features, and the vertical original vibration signal and its third-order IMF components are extracted with a total of 120 features, so a total of 240 multi-scale hybrid features are extracted after processing. The processing results are shown in
Table 4.
Secondly, the correlation coefficient is used to select features in the multi-scale mixed features vector set that can better characterize the fault degradation process of bearing 1 under operating condition one. In this paper, 60 multi-scale mixed features with high ratings are selected to form the optimal feature vector set. The multi-scale hybrid features selection and the comparison of superior difference features are shown in
Figure 7.
Finally, the processed multi-scale mixed features vector set is then fused and downscaled by KPCA to output the health status index of the bearing. The constructed rolling bearing health state index is shown in
Figure 8.
According to the constructed health status index, analysis of domestic and international literature related to the field of research equipment health identification reveals that equipment performance degradation can usually be classified into four states [
30,
31]. Therefore, combined with the fundamental operation of rolling bearings and expert experience, the rolling bearing health status is divided into four levels [
32], and each level and the corresponding health status index interval are shown in
Table 5.
(2) MSCCNN model training and validation.
The MSCCNN model and training parameters built in this paper are as follows: ① The 60 features are convolved by convolution of 3 channels and 3 scales, with a convolution kernel size of 3, 5, and 7, channel size of 32, and activation function of Relu activation function. ② The multi-scale hybrid features are stitched by the Add function and input to the pooling layer for pooling with a pooling step of 2. ③ The convolution operation is performed by 1 convolution layer with a convolution kernel size of 3 and a channel size of 64, and using the Relu activation function. ④ The features are input to the CBAM module for more advanced feature extraction, followed by convolutional operation of the features through one convolutional layer, and the convolved features are input to the pooling layer, where the convolutional kernel size is 3, the channel size is 32, the pooling step is 2, and the activation function is the Relu activation function. ⑤ The features after the convolution are input to the last convolutional layer for the convolution operation, where the convolution kernel size is 5, the channel size is 16, the pooling step size is 2, and the activation function is the Relu activation function. ⑥ Finally, the bearing health status is classified into four categories: health, good, deterioration and failure through the full connection layer and “Softmax” function. ⑦ The optimizer is Adam, the learning rate is 0.001, and the training times are 200.
During the experiment, the optimizer and loss function of the model have a significant impact on how well the model is trained. When the model is trained, firstly, the improved custom loss function is selected, and the optimizer is selected from SGD, RMSprop, and Adam for comparison. Through the experiment, the model reaches the optimum, and the model accuracy is the highest when the Adam optimizer is selected. The recognition accuracy results of different optimizers are shown in
Figure 9.
Then, after the optimizer of the model is determined, the loss functions “MAE,” “MSE,” and “Log cosh” are selected for comparison with the improved custom loss functions. Through experiments, the enhanced custom loss function (LM) is chosen to make the highest accuracy of the model in this paper. It is proved that the improved custom loss function in this paper outperforms the “MAE,” “MSE,” and “Log cosh” loss functions. The recognition accuracies of different loss functions are shown in
Figure 10.
In summary, the finalized optimizer of the MSCCNN model is Adam, and the improved custom loss function is selected as the loss function of the model in this paper. When the parameters of the MSCCNN model are optimal, the confusion matrix of the test results of the model for bearing health status identification in the validation sample is shown in
Figure 11.
The accuracy of the model in assessing the health status of the bearings as measured by the test samples is as high as 98.22%, and the accuracy of identification is good in each state.
To further validate the accuracy and superiority of the MSCCNN health state assessment model proposed in this paper, it is compared with support vector machine (SVM), CNN, CNN + CBAM, MSCNN, and MSCCNN + conventional features (60 features in the time domain, frequency domain and time-frequency domain) [
33], respectively, and the same training set is used to conduct the experiments. The confusion matrix results of each method for the accuracy of bearing health status identification are shown in
Figure 12, and the specific data are shown in
Table 6.
From the specific data in
Figure 12 and
Table 6, it can be seen that the multi-scale hybrid features and improved convolutional neural network (MSCCNN) proposed in this paper are more accurate for the bearing health status identification method, with 5.83%, 3.30%, 2.29%, 1.52%, and 4.31% improvement compared to SVM, CNN, CNN + CBAM, MSCNN, and MSCCNN + conventional features methods, respectively.