Efficient DCNN-LSTM Model for Fault Diagnosis of Raw Vibration Signals: Applications to Variable Speed Rotating Machines and Diverse Fault Depths Datasets
Abstract
:1. Introduction
- Autoencoders: These are neural networks that have been trained to learn a compressed representation of input data. The network is initially trained to encode the input data into a lower-dimensional representation before decoding it back into its original format. Autoencoders are frequently used to extract features and reduce dimensionality.
- Deep belief networks: These are generative models with numerous layers of hidden units. Deep belief networks are trained through unsupervised learning and may be utilized for image and voice recognition.
- Deep Boltzmann machines: These are similar to deep belief networks, but they employ a different form of model known as a Boltzmann machine. Deep Boltzmann machines are also learned via unsupervised learning and may be utilized for tasks such as collaborative filtering and anomaly detection.
- Recurrent neural networks (RNNs) are neural networks that are designed to process sequential input, such as text or time series data. RNNs include loops in their network design that allow them to recall prior inputs and learn dependencies over time. RNNs are frequently used for language modeling and speech recognition.
- Convolutional neural networks (CNNs): These are neural networks that employ convolutional layers to learn spatial patterns in a picture or audio input. CNNs are frequently used for object identification and speech recognition.
- Variable-speed rotating machines generate sequential data where each time step is influenced by the preceding ones. LSTM’s recurrent connections allow for the capture of temporal dependencies in the data. By retaining information from previous time steps in its memory cell, LSTM can learn and exploit the patterns and relationships that exist across different speed regimes and time periods.
- Variable-speed machines produce data sequences of varying lengths depending on the duration of operation or the occurrence of faults. LSTM is designed to handle variable-length sequences as it processes data in a step-by-step manner, adapting to the varying time lengths. This flexibility makes LSTM well-suited for accommodating the dynamic nature of variable-speed rotating machines.
- LSTM’s ability to recognize and learn speed-dependent features is crucial for fault classification in variable-speed rotating machines. By training on historical data that include speed information, LSTM can capture the relationships between speed and fault characteristics. It can then leverage these learned associations to make accurate fault predictions and classifications when new data are presented, considering the specific speed regime of the machine.
- Faults in rotating machines can exhibit complex patterns that may be difficult to detect using traditional techniques. LSTM’s architecture allows it to learn and model complex relationships within the data. It can automatically extract relevant features, recognize subtle fault patterns, and capture the interactions between speed variations and fault signatures. This enables LSTM to provide accurate fault classifications even in challenging scenarios.
2. Materials and Methods
2.1. Structure of DCNN Model
2.2. Structure of LSTM
- Cell State : The cell state serves as the memory of the LSTM. It carries information across time steps, allowing the network to maintain long-term dependencies.
- Input Gate : The input gate controls the amount of new information that is added to the cell state at each time step. It decides which parts of the input are relevant and should be stored in the cell state.
- Forget Gate : The forget gate determines which parts of the cell state should be forgotten or discarded. It selectively removes information that is no longer relevant, preventing the cell state from being cluttered with unnecessary information.
- Output Gate : The output gate controls the amount of information that is output from the cell state to the next layer or as the final prediction. It determines which parts of the cell state are relevant for the current time step.
2.3. Proposed Models
2.3.1. Data Segmentation
- Reduce bias: When the data are ordered in a certain way, such as being sorted by class labels, it can introduce bias during training. Shuffling the data helps to ensure that the model sees a diverse range of samples from different classes, reducing the potential bias.
- Enhance generalization: If the data are not shuffled and there is a particular order or pattern in the dataset, the model may learn to rely on that pattern instead of learning the underlying relationships between features and labels. Shuffling the data helps to break any sequential patterns and encourages the model to learn more generalized representations.
- Improve gradient descent optimization: Optimization algorithms like stochastic gradient descent (SGD) work by updating the model’s parameters based on mini-batches of data. Shuffling the data ensures that each mini-batch contains a random sample of data, leading to more effective updates and faster convergence.
- Mitigate overfitting: Shuffling the data helps to prevent overfitting by introducing randomness in the training process. Overfitting occurs when a model becomes too specialized in the training data and fails to generalize well to new, unseen data. Shuffling the data helps to make the model more robust and less prone to overfitting.
2.3.2. DCNN-LSTM Model with SoftMax Classifier
2.3.3. ANN Model with SoftMax Classifier
- Layer 1 (Dense): This layer has 1024 neurons with ReLU activation function. The output shape of this layer is (None, 1024), where “None” indicates that the batch size can be variable. The number of parameters in this layer is 1,025,024, which is calculated as (input shape × number of neurons) + number of biases (1).
- Layer 2 (Dense): This layer has 512 neurons with ReLU activation function. The output shape of this layer is (None, 512), and it has 524,800 parameters.
- Layer 3 (Dense): This layer has 256 neurons with ReLU activation function. The output shape of this layer is (None, 256), and it has 131,328 parameters.
- Layer 4 (Dense): This layer has 128 neurons with ReLU activation function. The output shape of this layer is (None, 128), and it has 32,896 parameters.
- Layer 5 (Dense): This layer has 5 neurons with SoftMax activation function. The output shape of this layer is (None, 5), which corresponds to the number of classes in the classification task. The SoftMax function is used to convert the output of the previous layer into a probability distribution over the classes. This layer has 645 parameters.
2.4. Experimental Setups and Datasets
2.4.1. Variable Speed Dataset
2.4.2. Diverse Fault Depths Dataset
3. Experimental Results
3.1. Case I: Variable-Speed Vibration Dataset
3.2. Case II: Diverse Fault Depths Vibration Dataset
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wang, L.; Cai, G.; Wang, J.; Jiang, X.; Zhu, Z. Dual-Enhanced Sparse Decomposition for Wind Turbine Gearbox Fault Diagnosis. IEEE Trans. Instrum. Meas. 2019, 68, 450–461. [Google Scholar] [CrossRef]
- Jiang, G.; He, H.; Xie, P.; Tang, Y. Stacked Multilevel-Denoising Autoencoders: A New Representation Learning Approach for Wind Turbine Gearbox Fault Diagnosis. IEEE Trans. Instrum. Meas. 2017, 66, 2391–2402. [Google Scholar] [CrossRef]
- He, Q.; Zhao, J.; Jiang, G.; Xie, P. An Unsupervised Multiview Sparse Filtering Approach for Current-Based Wind Turbine Gearbox Fault Diagnosis. IEEE Trans. Instrum. Meas. 2020, 69, 5569–5578. [Google Scholar] [CrossRef]
- Yu, X.; Tang, B.; Zhang, K. Fault Diagnosis of Wind Turbine Gearbox Using a Novel Method of Fast Deep Graph Convolutional Networks. IEEE Trans. Instrum. Meas. 2021, 70, 6502714. [Google Scholar] [CrossRef]
- Ahsan, M.; Bismor, D. Early-Stage Fault Diagnosis for Rotating Element Bearing Using Improved Harmony Search Algorithm with Different Fitness Functions. IEEE Trans. Instrum. Meas. 2022, 71, 3519309. [Google Scholar] [CrossRef]
- Abboud, D.; Antoni, J.; Sieg-Zieba, S.; Eltabach, M. Envelope analysis of rotating machine vibrations in variable speed conditions: A comprehensive treatment. Mech. Syst. Signal Process. 2017, 84, 200–226. [Google Scholar] [CrossRef]
- Udmale, S.S.; Singh, S.K. Application of Spectral Kurtosis and Improved Extreme Learning Machine for Bearing Fault Classification. IEEE Trans. Instrum. Meas. 2019, 68, 4222–4233. [Google Scholar] [CrossRef]
- Hu, Y.; Bao, W.; Tu, X.; Li, F.; Li, K. An Adaptive Spectral Kurtosis Method and its Application to Fault Detection of Rolling Element Bearings. IEEE Trans. Instrum. Meas. 2020, 69, 739–750. [Google Scholar] [CrossRef]
- Jérôme, A. The spectral kurtosis: A useful tool for characterising non-stationary signals. Mech. Syst. Signal Process. 2006, 20, 282–307. [Google Scholar] [CrossRef]
- Dong, W.; Peter, W.T.; Kwok, L.T. An enhanced Kurtogram method for fault diagnosis of rolling element bearings. Mech. Syst. Signal Process. 2013, 35, 176–199. [Google Scholar] [CrossRef]
- Jérôme, A. Fast computation of the kurtogram for the detection of transient faults. Mech. Syst. Signal Process. 2007, 21, 108–124. [Google Scholar] [CrossRef]
- Rai, A.; Upadhyay, S.H. A review on signal processing techniques utilized in the fault diagnosis of rolling element bearings. Tribol. Int. 2016, 96, 289–306. [Google Scholar] [CrossRef]
- Robert, B.R.; Jérôme, A. Rolling element bearing diagnostics—A tutorial. Mech. Syst. Signal Process. 2011, 25, 485–520. [Google Scholar] [CrossRef]
- Soualhi, A.; Medjaher, K.; Zerhouni, N. Bearing Health Monitoring Based on Hilbert–Huang Transform, Support Vector Machine, and Regression. IEEE Trans. Instrum. Meas. 2015, 64, 52–62. [Google Scholar] [CrossRef] [Green Version]
- Huang, W.; Gao, G.; Li, N.; Jiang, X.; Zhu, Z. Time-Frequency Squeezing and Generalized Demodulation Combined for Variable Speed Bearing Fault Diagnosis. IEEE Trans. Instrum. Meas. 2019, 68, 2819–2829. [Google Scholar] [CrossRef]
- Yan, R.; Gao, R.X. Hilbert–Huang Transform-Based Vibration Signal Analysis for Machine Health Monitoring. IEEE Trans. Instrum. Meas. 2006, 55, 2320–2329. [Google Scholar] [CrossRef]
- He, Q.; Song, H.; Ding, X. Sparse Signal Reconstruction Based on Time-Frequency Manifold for Rolling Element Bearing Fault Signature Enhancement. IEEE Trans. Instrum. Meas. 2016, 65, 482–491. [Google Scholar] [CrossRef]
- Attoui, I.; Boutasseta, N.; Fergani, N. Novel Machinery Monitoring Strategy Based on Time–Frequency Domain Similarity Measurement with Limited Labeled Data. IEEE Trans. Instrum. Meas. 2021, 70, 3500708. [Google Scholar] [CrossRef]
- Qian, W.; Li, S.; Jiang, X. Deep transfer network for rotating machine fault analysis. Pattern Recognit. 2019, 96, 106993. [Google Scholar] [CrossRef]
- Li, W.; Huang, R.; Li, J.; Liao, Y.; Chen, Z.; He, G.; Yan, R.; Gryllias, K. A prospective survey on deep transfer learning for fault diagnosis in industrial scenarios: Theories, applications and challenges. Mech. Syst. Signal Process. 2022, 167, 108487. [Google Scholar] [CrossRef]
- Mohammed, H.; Abdoulhdi, A.B.O.; Ali, N.A.; Muhannad, A.W.; Abdallah, A. A systematic review of rolling bearing fault diagnoses based on deep learning and transfer learning: Taxonomy, overview, application, open challenges, weaknesses and recommendations. Ain Shams Eng. J. 2023, 14, 101945. [Google Scholar] [CrossRef]
- Samir, K.; Takehisa, Y. A review on the application of deep learning in system health management. Mech. Syst. Signal Process. 2018, 107, 241–265. [Google Scholar] [CrossRef]
- Zhang, W.; Li, C.; Peng, G.; Chen, Y.; Zhang, Z. A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load. Mech. Syst. Signal Process. 2018, 100, 439–453. [Google Scholar] [CrossRef]
- Li, X.; Zhang, W.; Xu, N.X.; Ding, Q. Deep Learning-Based Machinery Fault Diagnostics with Domain Adaptation Across Sensors at Different Places. IEEE Trans. Ind. Electron. 2020, 67, 6785–6794. [Google Scholar] [CrossRef]
- Zhou, Y.; Zhi, G.; Chen, W.; Qian, Q.; He, D.; Sun, B.; Sun, W. A new tool wear condition monitoring method based on deep learning under small samples. Measurement 2022, 189, 110622. [Google Scholar] [CrossRef]
- Zhao, R.; Yan, R.; Chen, Z.; Mao, K.; Wang, P.; Gao, R.X. Deep learning and its applications to machine health monitoring. Mech. Syst. Signal Process. 2019, 115, 213–237. [Google Scholar] [CrossRef]
- Lu, C.; Wang, Z.; Zhou, B. Intelligent fault diagnosis of rolling bearing using hierarchical convolutional network-based health state classification. Adv. Eng. Inform. 2017, 32, 139–151. [Google Scholar] [CrossRef]
- Saravanakumar, R.; Krishnaraj, N.; Venkatraman, S.; Sivakumar, B.; Prasanna, S.; Shankar, K. Hierarchical symbolic analysis and particle swarm optimization based fault diagnosis model for rotating machineries with deep neural networks. Measurement 2021, 171, 108771. [Google Scholar] [CrossRef]
- Yang, Y.; Zheng, H.; Li, Y.; Xu, M.; Chen, Y. A fault diagnosis scheme for rotating machinery using hierarchical symbolic analysis and convolutional neural network. ISA Trans. 2019, 91, 235–252. [Google Scholar] [CrossRef]
- Guo, J.; Liu, X.; Li, S.; Wang, Z. Bearing Intelligent Fault Diagnosis Based on Wavelet Transform and Convolutional Neural Network. Shock Vib. 2020, 2020, 6380486. [Google Scholar] [CrossRef]
- Chen, J.; Huang, R.; Zhao, K.; Wang, W.; Liu, L.; Li, W. Multiscale Convolutional Neural Network with Feature Alignment for Bearing Fault Diagnosis. IEEE Trans. Instrum. Meas. 2021, 70, 3517010. [Google Scholar] [CrossRef]
- Jiao, J.; Zhao, M.; Lin, J.; Zhao, J. A multivariate encoder information-based convolutional neural network for intelligent fault diagnosis of planetary gearboxes. Knowl.-Based Syst. 2018, 160, 237–250. [Google Scholar] [CrossRef]
- Feng, J.; Yaguo, L.; Na, L.; Saibo, X. Deep normalized convolutional neural network for imbalanced fault classification of machinery and its understanding via visualization. Mech. Syst. Signal Process. 2018, 110, 349–367. [Google Scholar] [CrossRef]
- Guo, Y.; Liu, Y.; Oerlemans, A.; Lao, S.; Wu, S.; Lew, M.S. Deep learning for visual understanding: A review. Neurocomputing 2016, 187, 27–48. [Google Scholar] [CrossRef]
- Wang, J.; Mo, Z.; Zhang, H.; Miao, Q. A Deep Learning Method for Bearing Fault Diagnosis Based on Time-Frequency Image. IEEE Access 2019, 7, 42373–42383. [Google Scholar] [CrossRef]
- Nassim, L.; Nida, S.O.; Sami, O. Support Vector Machines for Fault Detection in Wind Turbines. IFAC Proc. Vol. 2011, 44, 7067–7072. [Google Scholar] [CrossRef]
- Jürgen, S. Deep learning in neural networks: An overview. Neural Netw. 2015, 61, 85–117. [Google Scholar] [CrossRef] [Green Version]
- Muralidharan, V.; Sugumaran, V. A comparative study of Naïve Bayes classifier and Bayes net classifier for fault diagnosis of monoblock centrifugal pump using wavelet analysis. Appl. Soft Comput. 2012, 12, 2023–2029. [Google Scholar] [CrossRef]
- Liu, R.; Yang, B.; Zhang, X.; Wang, S.; Chen, X. Time-frequency atoms-driven support vector machine method for bearings incipient fault diagnosis. Mech. Syst. Signal Process. 2016, 75, 345–370. [Google Scholar] [CrossRef]
- Savita, A.; Amit, C. Hybrid CNN-SVM Classifier for Handwritten Digit Recognition. Procedia Comput. Sci. 2020, 167, 2554–2560. [Google Scholar] [CrossRef]
- Huan, H.; Natalie, B.; Ming, L. Bearing fault diagnosis under unknown time-varying rotational speed conditions via multiple time-frequency curve extraction. J. Sound Vib. 2018, 414, 43–60. [Google Scholar] [CrossRef]
- Huan, H.; Natalie, B. Bearing vibration data collected under time-varying rotational speed conditions. Data Brief 2018, 21, 1745–1749. [Google Scholar] [CrossRef] [PubMed]
- Geng, H.; Peng, Y.; Ye, L.; Guo, Y. Fault identification of rolling bearing with variable speed based on generalized broadband mode decomposition and distance evaluation technique. Digit. Signal Process. 2022, 129, 103662. [Google Scholar] [CrossRef]
- Lu, F.; Tong, Q.; Feng, Z.; Wan, Q.; Li, Y.; Wang, M.; Cao, J.; Guo, T. Explainable 1DCNN with demodulated frequency features method for fault diagnosis of rolling bearing under time-varying speed conditions. Meas. Sci. Technol. 2022, 33, 095022. [Google Scholar] [CrossRef]
- Jin, Z.; Xiao, Y.; He, D.; Wei, Z.; Sun, Y.; Yang, W. Fault diagnosis of bearing based on refined piecewise composite multivariate multiscale fuzzy entropy. Digit. Signal Process. 2023, 133, 103884. [Google Scholar] [CrossRef]
- Che, C.; Wang, H.; Xiong, M.; Ni, X. Few-shot fault diagnosis of rolling bearing under variable working conditions based on ensemble meta-learning. Digit. Signal Process. 2022, 131, 103777. [Google Scholar] [CrossRef]
- Chu, C.; Ge, Y.; Qian, Q.; Hua, B.; Guo, J. A novel multi-scale convolution model based on multi-dilation rates and multi-attention mechanism for mechanical fault diagnosis. Digit. Signal Process. 2022, 122, 103355. [Google Scholar] [CrossRef]
Layer (Type) | Activation Function | Output Shape | No. of Parameters |
---|---|---|---|
Layer 1 (Convolution) | ReLU | (None, 901, 64) | 6464 |
Layer 2 (Convolution) | ReLU | (None, 852, 32) | 102,432 |
Layer 3 (MaxPooling) | - | (None, 213, 32) | 0 |
Layer 4 (Flatten) | - | (None, 6816) | 0 |
Layer 5 (Reshape) | - | (None, 213, 32) | 0 |
Layer 6 (LSTM) | Sigmoid | (None, 213, 64) | 24,832 |
Layer 7 (LSTM) | Sigmoid | (None, 16) | 5184 |
Layer 8 (Dense) | ReLU | (None, 100) | 1700 |
Layer 9 (Dense) | ReLU | (None, 50) | 5050 |
Layer 10 (Dense) | SoftMax | (None, 5) | 255 |
Layer (Type) | Activation Function | Output Shape | No. of Parameters |
---|---|---|---|
Layer 1 (Dense) | ReLU | (None, 1024) | 1,025,024 |
Layer 2 (Dense) | ReLU | (None, 512) | 524,800 |
Layer 3 (Dense) | ReLU | (None, 256) | 131,328 |
Layer 4 (Dense) | ReLU | (None, 128) | 32,896 |
Layer 5 (Dense) | SoftMax | (None, 5) | 645 |
Description | Variable | Value |
---|---|---|
Number of balls | n | 9 |
Ball diameter | d | 7.94 mm |
Pitch diameter | D | 38.52 mm |
Bearing contact angle | 0 |
Fault Class | Rotating Speed |
---|---|
Healthy bearing | First increased then decreased |
Ball fault | First increased then decreased |
Inner race fault | First increased then decreased |
Outer race fault | First increased then decreased |
Combined fault | First increased then decreased |
Fault Class | Symbol | Fault Depth |
---|---|---|
Healthy bearing | N | - |
Ball fault | 007_BA | 0.007 inch |
Ball fault | 014_BA | 0.014 inch |
Ball fault | 021_BA | 0.021 inch |
Ball fault | 028_BA | 0.028 inch |
Inner race fault | 007_IR | 0.007 inch |
Inner race fault | 014_IR | 0.014 inch |
Inner race fault | 021_IR | 0.021 inch |
Inner race fault | 028_IR | 0.028 inch |
Outer race fault | 007_OR1 | 0.007 inch |
Outer race fault | 007_OR2 | 0.007 inch |
Outer race fault | 007_OR3 | 0.007 inch |
Outer race fault | 014_OR1 | 0.014 inch |
Outer race fault | 021_OR1 | 0.021 inch |
Outer race fault | 021_OR2 | 0.021 inch |
Outer race fault | 021_OR3 | 0.021 inch |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ahsan, M.; Salah, M.M. Efficient DCNN-LSTM Model for Fault Diagnosis of Raw Vibration Signals: Applications to Variable Speed Rotating Machines and Diverse Fault Depths Datasets. Symmetry 2023, 15, 1413. https://doi.org/10.3390/sym15071413
Ahsan M, Salah MM. Efficient DCNN-LSTM Model for Fault Diagnosis of Raw Vibration Signals: Applications to Variable Speed Rotating Machines and Diverse Fault Depths Datasets. Symmetry. 2023; 15(7):1413. https://doi.org/10.3390/sym15071413
Chicago/Turabian StyleAhsan, Muhammad, and Mostafa M. Salah. 2023. "Efficient DCNN-LSTM Model for Fault Diagnosis of Raw Vibration Signals: Applications to Variable Speed Rotating Machines and Diverse Fault Depths Datasets" Symmetry 15, no. 7: 1413. https://doi.org/10.3390/sym15071413
APA StyleAhsan, M., & Salah, M. M. (2023). Efficient DCNN-LSTM Model for Fault Diagnosis of Raw Vibration Signals: Applications to Variable Speed Rotating Machines and Diverse Fault Depths Datasets. Symmetry, 15(7), 1413. https://doi.org/10.3390/sym15071413