A Novel Multi-Task Self-Supervised Transfer Learning Framework for Cross-Machine Rolling Bearing Fault Diagnosis
Abstract
:1. Introduction
- A novel multi-task self-supervised transfer learning framework is proposed for rolling bearing fault diagnosis on cross-machine. The proposed method is trained on three self-supervised tasks and one fault diagnosis task, which can extract general transferable fault diagnosis features via multi-task learning. By multi-task learning, the ability of the model to resist overfitting has been improved. The effectiveness of the proposed method is evaluated on three datasets from different test-beds. The experimental results indicate that the proposed multi-task framework can complete cross-machine rolling bearing fault diagnosis effectively.
- Multi-scale self-supervised learning tasks are constructed to fully extract the intrinsic information from the original vibration data. Leveraging the label-free data feature extraction capability of self-supervised learning, the model is more capable of focusing on the data rather than the task level. Moreover, different self-supervised training tasks focus on different intrinsic features of the data, which enhances the diversity in the model’s feature extraction capabilities.
- A multi-perspective feature transfer method based on Wasserstein distance and cosine similarity is proposed to acquire transferable fault diagnosis knowledge, thereby completing fault diagnosis of rolling bearings on cross-machines. By integrating probability distribution metrics and geometric similarity metrics, the model pays attention to diverse transferable diagnostic knowledge from different levels, enhancing the model’s transfer diagnostic ability, and thus efficiently accomplishing the task of cross-machine bearing fault diagnosis.
2. Related Works
2.1. Self-Supervised Learning
2.2. Transfer Learning
2.3. Multi-Task Learning
3. The Proposed MTSTLF Method
3.1. Multi-Scale Self-Supervised Learning Tasks
3.2. Multi-Perspective Feature Transfer Method
3.3. Multi-Task Learning of the Proposed Method
3.4. The Procedure for the Proposed Method
4. Experiment Verification
4.1. Experiment 1: CWRU and Ottawa
4.1.1. Dataset Description
4.1.2. Experimental Result Analysis
- 4.
- TCA is a classic transfer learning method that uses MMD as the measurement method to map the cross-domain data to the same space for distribution difference calculation without combining in-depth learning. Through grid search technology, the weight coefficient of TCA regularization term is set to 0.7, and the subspace dimension is 12.
- 5.
- The basic structure of DDC is consistent with the proposed method, with the decoder removed and only MMD used as the migration distance metric.
- 6.
- The basic structure of the DAN is consistent with the proposed method, which removes the decoder. The DAN uses multi-kernel MMD as the transfer distance metric and calculates the distance of multi-layer output features.
- 7.
- The design of the DCC method is consistent with the original literature.
4.2. Experiment 2: CWRU and SEU
4.2.1. Data Description
4.2.2. Experimental Result Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Ma, S.; Chu, F.L. Ensemble deep learning-based fault diagnosis of rotor bearing systems. Comput. Ind. 2019, 105, 143–152. [Google Scholar] [CrossRef]
- Wu, C.M.; Zheng, S.P. Fault diagnosis method of rolling bearing based on MSCNN-LSTM. Comput. Mater. Contin. 2024, 79, 4395–4411. [Google Scholar] [CrossRef]
- Liu, G.Z.; Wu, L.F. Incremental bearing fault diagnosis method under imbalanced sample conditions. Comput. Ind. Eng. 2024, 192, 110203. [Google Scholar] [CrossRef]
- Dai, X.; Yi, K.; Wang, F.L.; Cai, C.; Tang, W. Bearing fault diagnosis based on POA-VMD with GADF-Swin Transformer transfer learning network. Measurement 2024, 238, 115328. [Google Scholar] [CrossRef]
- Lei, Y.G.; Yang, B.; Jiang, X.W.; Jia, F.; Li, N.; Nandi, A.K. Applications of machine learning to machine fault diagnosis: A review and roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
- Zheng, B.; Huang, J.H.; Ma, X.; Zhang, X.; Zhang, Q. An unsupervised transfer learning method based on SOCNN and FBNN and its application on bearing fault diagnosis. Mech. Syst. Signal Process. 2024, 208, 111047. [Google Scholar] [CrossRef]
- Kumar, P.; Raouf, I.; Kim, H.S. Transfer learning for servomotor bearing fault detection in the industrial robot. Adv. Eng. Softw. 2024, 194, 103672. [Google Scholar] [CrossRef]
- Vashishtha, G.; Kumar, R. Unsupervised Learning Model of Sparse Filtering Enhanced Using Wasserstein Distance for Intelligent Fault Diagnosis. J. Vib. Eng. Technol. 2023, 11, 2985–3002. [Google Scholar] [CrossRef]
- Wang, R.; Yan, F.C.; Yu, L.; Shen, C.; Hu, X. Joint Wasserstein distance matching under conditional probability distribution for cross-domain fault diagnosis of rotating machinery. Mech. Syst. Signal Process. 2024, 210, 111121. [Google Scholar] [CrossRef]
- Zhao, K.; Jia, F.; Shao, H. A novel conditional weighting transfer Wasserstein auto-encoder for rolling bearing fault diagnosis with multi-source domains. Knowl. Based Syst. 2023, 262, 110203. [Google Scholar] [CrossRef]
- Islam, M.; Zunair, H.; Mohammed, N. CosSIF: Cosine similarity-based image filtering to overcome low inter-class variation in synthetic medical image datasets. Comput. Biol. Med. 2024, 172, 108317. [Google Scholar] [CrossRef] [PubMed]
- Gao, C.; Li, W.F.; He, L.J.; Zhong, L. A distance and cosine similarity-based fitness evaluation mechanism for large-scale many-objective optimization. Eng. Appl. Artif. Intell. 2024, 133 Pt B, 108127. [Google Scholar] [CrossRef]
- Akram, F.; Ahmad, T.; Sadiq, M. An integrated fuzzy adjusted cosine similarity and TOPSIS based recommendation system for information system requirements selection. Decis. Anal. J. 2024, 11, 100443. [Google Scholar] [CrossRef]
- Lu, F.Y.; Tong, Q.B.; Jiang, X.D.; Feng, Z.; Xu, J.; Wang, X.; Huo, J. A deep targeted transfer network with clustering pseudo-label learning for fault diagnosis across different machines. Mech. Syst. Signal Process. 2024, 213, 111344. [Google Scholar] [CrossRef]
- Yang, B.; Lei, Y.; Jia, F.; Xing, S. A transfer learning method for intelligent fault diagnosis from laboratory machines to real-case machines. In Proceedings of the International Conference on Sensing, Diagnostics, Prognostics, and Control, Xi’an, China, 15–17 August 2018; pp. 35–40. [Google Scholar]
- Yang, B.; Lei, Y.; Jia, F.; Xing, S. An intelligent fault diagnosis approach based on transfer learning from laboratory bearings to locomotive bearings. Mech. Syst. Signal Process. 2019, 122, 692–706. [Google Scholar] [CrossRef]
- Guo, L.; Lei, Y.; Xing, S.; Yan, T.; Li, N. Deep convolutional transfer learning network: A new method for intelligent fault diagnosis of machines with unlabeled data. IEEE Trans. Ind. Electron. 2019, 66, 7316–7325. [Google Scholar] [CrossRef]
- Jia, S.; Li, Y.; Wang, X.; Sun, D.; Deng, Z. Deep causal factorization network: A novel domain generalization method for cross-machine bearing fault diagnosis. Mech. Syst. Signal Process. 2023, 192, 110228. [Google Scholar] [CrossRef]
- Jia, N.; Huang, W.G.; Ding, C.C.; Wang, J.; Zhu, Z. Physics-informed unsupervised domain adaptation framework for cross-machine bearing fault diagnosis. Adv. Eng. Inform. 2024, 62 Pt C, 102774. [Google Scholar] [CrossRef]
- Ding, P.; Xu, Y.; Sun, X.M. Multitask learning for aero-engine bearing fault diagnosis with limited data. IEEE Trans. Instrum. Meas. 2024, 73, 3520111. [Google Scholar] [CrossRef]
- Xie, Z.L.; Chen, J.L.; Feng, Y.; Zhang, K.; Zhou, Z. End to end multi-task learning with attention for multi-objective fault diagnosis under small sample. J. Manuf. Syst. 2022, 62, 301–316. [Google Scholar] [CrossRef]
- Kong, D.P.; Huang, W.D.; Zhao, L.B.; Ding, J.; Wu, H.; Yang, G. Mining knowledge from unlabeled data for fault diagnosis: A multi-task self-supervised approach. Mech. Syst. Signal Process. 2024, 211, 111189. [Google Scholar] [CrossRef]
- Sun, X.C.; Wang, Z.H.; Lu, Z.M.; Lu, Z. Self-supervised graph representations with generative adversarial learning. Neurocomputing 2024, 592, 127786. [Google Scholar] [CrossRef]
- Li, J.L.; Li, R.N.; Xu, L.H.; Liu, J. Self-supervised generative adversarial learning with conditional cyclical constraints towards missing traffic data imputation. Knowl. Based Syst. 2024, 284, 111233. [Google Scholar] [CrossRef]
- Zhang, X.Y.; Shi, H.C.; Zhu, X.B.; Li, P. Active semi-supervised learning based on self-expressive correlation with generative adversarial networks. Neurocomputing 2019, 345, 103–113. [Google Scholar] [CrossRef]
- Cao, R.L.; Li, Z.X.; Tang, Z.J.; Zhang, C.; Ma, H. Enhancing robust VQA via contrastive and self-supervised learning. Pattern Recognit. 2025, 159, 111129. [Google Scholar] [CrossRef]
- Yang, M.; Ling, J.; Chen, J.M.; Feng, M.; Yang, J. Discriminative semi-supervised learning via deep and dictionary representation for image classification. Pattern Recognit. 2023, 140, 109521. [Google Scholar] [CrossRef]
- Li, J.M.; Ye, Z.D.; Gao, J.; Meng, Z.; Tong, K.; Yu, S. Fault transfer diagnosis of rolling bearings across different devices via multi-domain information fusion and multi-kernel maximum mean discrepancy. Appl. Soft Comput. 2024, 159, 111620. [Google Scholar] [CrossRef]
- Jia, S.Y.; Deng, Y.F.; Lv, J.; Du, S.; Xie, Z. Joint distribution adaptation with diverse feature aggregation: A new transfer learning framework for bearing diagnosis across different machines. Measurement 2022, 187, 110332. [Google Scholar] [CrossRef]
- Qian, Q.; Wang, Y.; Zhang, T.S.; Qin, Y. Maximum mean square discrepancy: A new discrepancy representation metric for mechanical fault transfer diagnosis. Knowl. Based Syst. 2023, 276, 110748. [Google Scholar] [CrossRef]
- Wang, P.C.; Xiong, H.; He, H.X. Bearing fault diagnosis under various conditions using an incremental learning-based multi-task shared classifier. Knowl. Based Syst. 2023, 266, 110395. [Google Scholar] [CrossRef]
- Yu, J.B.; Yan, X.F. Multiscale intelligent fault detection system based on agglomerative hierarchical clustering using stacked denoising autoencoder with temporal information. Appl. Soft Comput. 2020, 95, 106525. [Google Scholar] [CrossRef]
- Yan, X.A.; Liu, Y.; Jia, M.P. Multiscale cascading deep belief network for fault identification of rotating machinery under various working conditions. Knowl. Based Syst. 2020, 193, 105484. [Google Scholar] [CrossRef]
- Sehri, M.; Dumond, P.; Bouchard, M. University of Ottawa constant load and speed rolling-element bearing vibration and acoustic fault signature datasets. Data Brief 2023, 49, 109327. [Google Scholar] [CrossRef] [PubMed]
- Li, G.; Kang, G.; Zhu, Y.; Wei, Y.; Yang, Y. Domain consensus clustering for universal domain adaptation. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 9752–9761. [Google Scholar]
- Li, B.T.; Pi, D.C.; Lin, Y.X. Learning ladder neural networks for semi-supervised node classification in social network. Expert Syst. Appl. 2021, 165, 113957. [Google Scholar] [CrossRef]
- Shao, S.; McAleer, S.; Yan, R.Q.; Baldi, P. Highly accurate machine fault diagnosis using deep transfer learning. IEEE Trans. Ind. Inform. 2019, 15, 2446–2455. [Google Scholar] [CrossRef]
Modules | Description of Layer | Parameter |
---|---|---|
Feature encoder | Kernel size of first convolutional layer | 1∗64 |
Number of units (input, output) in first convolutional layer | (1, 16) | |
Kernel size and stride of max-pooling layer | 2∗2/2 | |
Kernel size of second convolutional layer | 1∗3 | |
Number of units in second convolutional layer | (16, 16) | |
Kernel size and stride of max-pooling layer | 2∗2/2 | |
Kernel size of forth convolutional layer | 1∗3 | |
Number of units in third convolutional layer | (16, 16) | |
Kernel size of third convolutional layer | 1∗3 | |
Kernel size and stride of max-pooling layer | 2∗2/2 | |
Feature decoder | Kernel size of first deconvolutional layer | 1∗3 |
Number of units (input, output) in first deconvolutional layer | (16, 16) | |
Kernel size and stride of up max-pooling layer | 2∗2/2 | |
Kernel size of second deconvolutional layer | 1∗3 | |
Number of units (input, output) in second deconvolutional layer | (16, 16) | |
Kernel size and stride of up max-pooling layer | 2∗2/2 | |
Kernel size of third deconvolutional layer | 1∗3 | |
Number of units (input, output) in third deconvolutional layer | (16, 1) | |
Kernel size and stride of up max-pooling layer | 2∗2/2 | |
Classifier | Number of units (input, output) in first full-connected layer | 16∗120/512 |
Number of units (input, output) in second full-connected layer | 512/512 | |
Number of units (input, output) in third full-connected layer | 512/4 |
Task Number | Source Domain | Target Domain | Fault Types and Label | The Number of Samples from Each Health Condition |
---|---|---|---|---|
T0 | CWRU | Ottawa | Normal/1 Inner race fault/2 Outer race fault/3 Ball fault/4 | 40 |
T1 | Ottawa | CWRU | Normal/1 Inner race fault/2 Outer race fault/3 Ball fault/4 | 40 |
Method | Accuracy of T0 (%) | Accuracy of T1 (%) |
---|---|---|
TCA | 53.13 | 51.25 |
DDC | 59.38 | 55.63 |
DAN | 69.38 | 62.50 |
DCC | 80.00 | 81.25 |
MTSTLF (ours) | 91.87 | 91.25 |
Method | Training Time (s) | Testing Time (s) |
---|---|---|
TCA | / | 2.6 |
DDC | 1.80 | 0.38 |
DAN | 7.35 | 0.43 |
DCC | 2.70 | 0.88 |
MTSTLF (ours) | 6.48 | 0.83 |
Task Number | Source Domain | Target Domain | Fault Types and Label | The Number of Samples from Each Health Condition |
---|---|---|---|---|
T2 | CWRU | SEU | Normal/1 Inner race fault/2 Outer race fault/3 Ball fault/4 | 40 |
T3 | SEU | CWRU | Normal/1 Inner race fault/2 Outer race fault/3 Ball fault/4 | 40 |
Method | Accuracy of T2 (%) | Accuracy of T3 (%) |
---|---|---|
TCA | 56.25 | 53.75 |
DDC | 71.25 | 66.25 |
DAN | 74.37 | 77.50 |
DCC | 83.13 | 83.75 |
MTSTLF (ours) | 95.63 | 90.00 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhao, L.; He, Y.; Dai, D.; Wang, X.; Bai, H.; Huang, W. A Novel Multi-Task Self-Supervised Transfer Learning Framework for Cross-Machine Rolling Bearing Fault Diagnosis. Electronics 2024, 13, 4622. https://doi.org/10.3390/electronics13234622
Zhao L, He Y, Dai D, Wang X, Bai H, Huang W. A Novel Multi-Task Self-Supervised Transfer Learning Framework for Cross-Machine Rolling Bearing Fault Diagnosis. Electronics. 2024; 13(23):4622. https://doi.org/10.3390/electronics13234622
Chicago/Turabian StyleZhao, Lujia, Yuling He, Derui Dai, Xiaolong Wang, Honghua Bai, and Weiling Huang. 2024. "A Novel Multi-Task Self-Supervised Transfer Learning Framework for Cross-Machine Rolling Bearing Fault Diagnosis" Electronics 13, no. 23: 4622. https://doi.org/10.3390/electronics13234622
APA StyleZhao, L., He, Y., Dai, D., Wang, X., Bai, H., & Huang, W. (2024). A Novel Multi-Task Self-Supervised Transfer Learning Framework for Cross-Machine Rolling Bearing Fault Diagnosis. Electronics, 13(23), 4622. https://doi.org/10.3390/electronics13234622