An Industrial Load Classification Method Based on a Two-Stage Feature Selection Strategy and an Improved MPA-KELM Classifier: A Chinese Cement Plant Case
Abstract
:1. Introduction
2. Methodology
2.1. Framework of the Proposed Method
2.2. Time–Frequency-Domain Features
2.3. ReliefF
2.4. Marine Predator Algorithm
- Initialization: construct the matrix and matrix. They contain the position vectors of variable random positions and repeated best fitness function in the proposed domain, respectively.
- Phase 1 [while ]: The first one-third of the iterations are dedicated to this phase, when the prey moves using the Brownian strategy and the predator is stationary. Equation (2) reflects this phase.
- Stage 2 [while ]: The predator uses Brownian movement and the prey uses Levy movement. The first half of the population was updated using Equation (3).
- Phase 3 [While ]: This occurs in the last third of the iteration. In this phase, the predator moves using Levy motion and the prey is updated using Equation (6).
- The effects of FADs: Environmental issues can also cause changes in the behavior of marine predators. One example is the effects of fish-aggregating devices (FADs), also known as eddy formation. The mathematical model of FAD’s effect is defined in Equation (7):
2.5. MPA Feature Selection
2.6. Kernel Extreme Learning Machine Optimized by the Improved Marine Predator Algorithm (IMPA-KELM)
2.6.1. Kernel Extreme Learning Machine (KELM)
2.6.2. Improved Marine Predator Algorithm
- a.
- Chaos initialization strategy
- b.
- Boundary mutation strategy
2.6.3. Steps for IMPA to Optimize KELM
- (1)
- The load features after two-stage feature selection are divided into the training and test samples at a ratio of 8:2.
- (2)
- The prey matrix is designed according to the chaotic optimization strategy (Equations (13) and (14)), while the IMPA parameters are initialized, the number of populations is set to 30, and the maximum number of iterations is 50. The upper and lower bounds for the KELM penalty coefficient and the kernel parameter are set to 0.001 and 1000, respectively, for the optimization search.
- (3)
- Calculate the fitness value of prey, and update the best fitness value.
- (4)
- Update the positions of predators and prey at different iteration periods according to Equations (2)–(6). Calculate and update the optimal fitness values again. The prey moves according to the FADs in Equation (7), thus changing the predator’s behavior. In this case, the position correction is performed using Equation (16) for prey that are beyond the search boundary.
- (5)
- Steps (3) and (4) are repeated until the maximum number of iterations is reached, and the predator position with the best adaptation is obtained and retained as the best and parameters for KELM.
- (6)
- KELM classification model is developed using the obtained parameters and . Implement load identification of the test set samples using IMPA-KELM.
2.7. Performance Metrics
3. Experiments and Results
4. Comparative Study
4.1. Comparison of Different Feature Extraction
4.2. Comparison of Feature Selection Methods
4.2.1. Comparison of First-Stage Filtered Feature Selection
4.2.2. Comparison of Second-Stage Heuristic Feature Selection
4.3. Comparison of Different Classifiers and Optimization Algorithms
5. Conclusions
- (1)
- The operating information of each load is extracted as much as possible by extracting the time-domain features and frequency-domain features of P and Q of electrical equipment. A rich set of combined features is generated.
- (2)
- This paper proposes a new method combining ReliefF and MPA feature selection. ReliefF is used as a pre-feature filter, and MPA is used to optimize the features again. Feature dimensionality and recognition accuracy are considered by setting the fitness function equation. This method reduces the original time–frequency-domain feature set of 58 features to 3, which is 5.17% of the original feature dimensionality. While ensuring high classification accuracy, features with redundant information are eliminated as much as possible, reducing the complexity of the classification process, the computational cost and storage requirements.
- (3)
- A new and improved marine predator algorithm is proposed. This algorithm introduces a chaotic initialization strategy and boundary variation operation to improve MPA’s convergence speed and global search capability. In this paper, the improved MPA is applied to the parameter optimization process of KELM to obtain the new classifier, IMPA-KELM, which achieves the optimal selection of KELM kernel parameters and penalty coefficients and thus improves the accuracy of its electric load identification.
- (4)
- A valuable reference for future research is provided. A large number of comparative experiments are set up to summarize and verify the determination of the initial feature set, the application effect of the dimensionality reduction methods, the classification performance of different classifiers and the impact of some optimization algorithms. The datasets are obtained through the actual cement plant’s record collection, which avoids the performance saturation of some methods and can effectively distinguish the effects in different ways.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Lamnatou, C.; Chemisana, D.; Cristofari, C. Smart grids and smart technologies in relation to photovoltaics, storage systems, buildings and the environment. Renew. Energy 2022, 185, 1376–1391. [Google Scholar] [CrossRef]
- Wu, X.; Han, X.; Liang, K.X. Event-based non-intrusive load identification algorithm for residential loads combined with underdetermined decomposition and characteristic filtering. IET Gener. Transm. Distrib. 2019, 13, 99–107. [Google Scholar] [CrossRef]
- Zator, S. Power scheduling scheme for DSM in smart homes with photovoltaic and energy storage. Energies 2021, 14, 8571. [Google Scholar] [CrossRef]
- Yu, J.Y.; Liu, W.N.; Wu, X. Noninvasive industrial power load monitoring based on collaboration of edge device and edge data center. In Proceedings of the 2020 IEEE International Conference on Edge Computing (EDGE), Beijing, China, 19–23 October 2020; pp. 23–30. [Google Scholar]
- Du, Y.; Du, L.; Lu, B.; Harley, R.; Habetler, T. A review of identification and monitoring methods for electric loads in commercial and residential buildings. In Proceedings of the 2010 IEEE Energy Conversion Congress and Exposition, Atlanta, GA, USA, 12–16 September 2010; pp. 4527–4533. [Google Scholar]
- Huang, Y.; Yang, H. A Method based on K-Means and fuzzy algorithm for industrial load identification. In Proceedings of the 2012 Asia-Pacific Power and Energy Engineering Conference, Shanghai, China, 27–29 March 2012; pp. 1–4. [Google Scholar]
- Hart, G.W.; Kern, E.C., Jr.; Schweppe, F.C. Non-Intrusive Appliance Monitor Apparatus. U.S. Patent 4,858,141, 15 August 1989. [Google Scholar]
- Heo, S.; Kim, H. Toward load identification based on the hilbert transform and sequence to sequence long short-term memory. IEEE Trans. Smart Grid 2021, 12, 3252–3264. [Google Scholar]
- Yu, M.; Wang, B.; Lu, L.; Bao, Z.; Qi, D. Non-intrusive adaptive load identification based on siamese network. IEEE Access 2022, 10, 11564–11573. [Google Scholar] [CrossRef]
- Bucci, G.; Ciancetta, F.; Fiorucci, E.; Mari, S.; Fioravanti, A. A non-intrusive load identification system based on frequency response analysis. In Proceedings of the 2021 IEEE International Workshop on Metrology for Industry 4.0&IoT (MetroInd4.0&IoT), Rome, Italy, 7–9 June 2021; pp. 254–258. [Google Scholar]
- Tao, P.; Liu, X.A.; Zhang, Y.; Li, C.; Ding, J. Multi-level non-intrusive load identification based on k-NN. In Proceedings of the 2019 IEEE 3rd Conference on Energy Internet and Energy System Integration (EI2), Changsha, China, 8–10 November 2019; pp. 1905–1910. [Google Scholar]
- Zhang, R.; Song, Y. Non-intrusive load identification method based on color encoding and improve R-FCN. Sustain. Energy Technol. Assess. 2022, 53, 102714. [Google Scholar] [CrossRef]
- Kim, J.; Le, T.T.; Kim, H. Nonintrusive load monitoring based on advanced deep learning and novel signature. Comput. Intell. Neurosci. 2017, 2017, 4216281. [Google Scholar] [CrossRef]
- Mukaroh, A.; Le, T.T.; Kim, H. Background load denoising across complex load based on generative adversarial network to enhance load identification. Sensors 2020, 20, 5674. [Google Scholar] [CrossRef]
- Yan, L.; Sheikholeslami, M.; Gong, W.; Tian, W.; Li, Z. Challenges for real-world applications of nonintrusive load monitoring and opportunities for machine learning approaches. Electr. J. 2022, 35, 107136. [Google Scholar] [CrossRef]
- Luan, W.; Yang, F.; Zhao, B.; Liu, B. Industrial load disaggregation based on hidden Markov models. Electr. Power Syst. Res. 2022, 210, 108086. [Google Scholar] [CrossRef]
- Wei, W.; Peng, T.; Ye, L.; Feng, X.; Jiang, Z.; Yu, H. A Feature Extraction Method for Non-intrusive Load Identification. J. Phys. Conf. Ser. 2022, 2195, 012008. [Google Scholar] [CrossRef]
- Huang, Z.; Yang, C.; Zhou, X.; Huang, T. A hybrid feature selection method based on binary state transition algorithm and ReliefF. IEEE J. Biomed. Health Inform. 2018, 23, 1888–1898. [Google Scholar] [CrossRef]
- Yan, C.; Liang, J.; Zhao, M.; Zhang, X.; Zhang, T.; Li, H. A novel hybrid feature selection strategy in quantitative analysis of laser-induced breakdown spectroscopy. Anal. Chim. Acta 2019, 1080, 35–42. [Google Scholar] [CrossRef]
- Cao, Y.; Sun, Y.; Xie, G.; Li, P. A sound-based fault diagnosis method for railway point machines based on two-stage feature selection strategy and ensemble classifier. IEEE Trans. Intell. Transp. Syst. 2021, 23, 12074–12083. [Google Scholar] [CrossRef]
- Wu, S.D.; Wu, P.H.; Wu, C.W.; Ding, J.J.; Wang, C.C. Bearing fault diagnosis based on multiscale permutation entropy and support vector machine. Entropy 2012, 14, 1343–1356. [Google Scholar] [CrossRef] [Green Version]
- Yun, Y.H.; Li, H.D.; Deng, B.C.; Cao, D.S. An overview of variable selection methods in multivariate analysis of near-infrared spectra. TrAC Trends Anal. Chem. 2019, 113, 102–115. [Google Scholar] [CrossRef]
- Sun, L.; Yin, T.; Ding, W.; Qian, Y.; Xu, J. Multilabel feature selection using ML-ReliefF and neighborhood mutual information for multilabel neighborhood decision systems. Inf. Sci. 2020, 537, 401–424. [Google Scholar] [CrossRef]
- Varzaneh, Z.A.; Hossein, S.; Mood, S.E.; Javidi, M.M. A new hybrid feature selection based on Improved Equilibrium Optimization. Chemom. Intell. Lab. Syst. 2022, 228, 104618. [Google Scholar] [CrossRef]
- Kılıç, F.; Kaya, Y.; Yildirim, S. A novel multi population based particle swarm optimization for feature selection. Knowl. Based Syst. 2021, 219, 106894. [Google Scholar] [CrossRef]
- Hu, P.; Pan, J.S.; Chu, S.C. Improved binary grey wolf optimizer and its application for feature selection. Knowl. Based Syst. 2020, 195, 105746. [Google Scholar] [CrossRef]
- Nadimi-Shahraki, M.H.; Zamani, H.; Mirjalili, S. Enhanced whale optimization algorithm for medical feature selection: A COVID-19 case study. Comput. Biol. Med. 2022, 148, 105858. [Google Scholar] [CrossRef] [PubMed]
- Faramarzi, A.; Heidarinejad, M.; Mirjalili, S.; Gandomi, A.H. Marine Predators Algorithm: A nature-inspired metaheuristic. Expert Syst. Appl. 2020, 152, 113377. [Google Scholar] [CrossRef]
- Wang, Z.; Zheng, L.; Wang, J.; Du, W. Research on novel bearing fault diagnosis method based on improved krill herd algorithm and kernel extreme learning machine. Complexity 2019, 2019, 4031795. [Google Scholar] [CrossRef] [Green Version]
- Eshtay, M.; Faris, H.; Obeid, N. Metaheuristic-based extreme learning machines: A review of design formulations and applications. Int. J. Mach. Learn. Cybern. 2019, 10, 1543–1561. [Google Scholar] [CrossRef]
- Wang, J.; Lu, S.; Wang, S.H.; Zhang, Y.D. A review on extreme learning machine. Multimed. Tools Appl. 2021, 81, 41611–41660. [Google Scholar] [CrossRef]
- Cai, Z.; Gu, J.; Luo, J.; Zhang, Q.; Chen, H.; Pan, Z.; Li, Y.; Li, C. Evolving an optimal kernel extreme learning machine by using an enhanced grey wolf optimization strategy. Expert Syst. Appl. 2019, 138, 112814. [Google Scholar] [CrossRef]
- Lu, H.; Du, B.; Liu, J.; Xia, H.; Yeap, W.K. A kernel extreme learning machine algorithm based on improved particle swam optimization. Memetic Comput. 2017, 9, 121–128. [Google Scholar] [CrossRef]
- Wang, M.; Chen, H.; Yang, B.; Zhao, X.; Hu, L.; Cai, Z.; Huang, H.; Tong, C. Toward an optimal kernel extreme learning machine using a chaotic moth-flame optimization strategy with applications in medical diagnoses. Neurocomputing 2017, 267, 69–84. [Google Scholar] [CrossRef]
- Zhu, W.; Ma, C.; Zhao, X.; Wang, M.; Heidari, A.A.; Chen, H.; Li, C. Evaluation of sino foreign cooperative education project using orthogonal sine cosine optimized kernel extreme learning machine. IEEE Access 2020, 8, 61107–61123. [Google Scholar] [CrossRef]
- Abdel-Basset, M.; El-Shahat, D.; Chakrabortty, R.K.; Ryan, M. Parameter estimation of photovoltaic models using an improved marine predators algorithm. Energy Convers. Manag. 2021, 227, 113491. [Google Scholar] [CrossRef]
- Wen, T.; Dong, D.; Chen, Q.; Chen, L.; Roberts, C. Maximal information coefficient-based two-stage feature selection method for railway condition monitoring. IEEE Trans. Intell. Transp. Syst. 2019, 20, 2681–2690. [Google Scholar] [CrossRef]
- Helmi, H.; Forouzantabar, A. Rolling bearing fault detection of electric motor using time domain and frequency domain features extraction and ANFIS. IET Electr. Power Appl. 2019, 13, 662–669. [Google Scholar] [CrossRef]
- Praveena, H.D.; Subhas, C.; Naidu, K.R. Retracted article: Automatic epileptic seizure recognition using relieff feature selection and long short term memory classifier. J. Ambient. Intell. Humaniz. Comput. 2021, 12, 6151–6167. [Google Scholar] [CrossRef]
- Zhang, B.; Li, Y.; Chai, Z. A novel random multi-subspace based ReliefF for feature selection. Knowl. Based Syst. 2022, 252, 109400. [Google Scholar] [CrossRef]
- Zhong, K.; Zhou, G.; Deng, W.; Zhou, Y.; Luo, Q. MOMPA: Multi-objective marine predator algorithm. Comput. Methods Appl. Mech. Eng. 2021, 385, 114029. [Google Scholar] [CrossRef]
- Gaspar, A.; Oliva, D.; Hinojosa, S.; Aranguren, I.; Zaldivar, D. An optimized Kernel Extreme Learning Machine for the classification of the autism spectrum disorder by using gaze tracking images. Appl. Soft Comput. 2022, 120, 108654. [Google Scholar] [CrossRef]
- Hu, W. An improved flower pollination algorithm for optimization of intelligent logistics distribution center. Adv. Prod. Eng. Manag. 2019, 14, 177–188. [Google Scholar] [CrossRef]
- Kumar, V.; Kumar, D. A systematic review on firefly algorithm: Past, present, and future. Arch. Comput. Methods Eng. 2021, 28, 3269–3291. [Google Scholar] [CrossRef]
- Liu, Y.; Heidari, A.A.; Ye, X.; Liang, G.; Chen, H.; He, C. Boosting slime mould algorithm for parameter identification of photovoltaic models. Energy 2021, 234, 121164. [Google Scholar] [CrossRef]
Feature | Equation | Feature | Equation |
---|---|---|---|
Mean value | Minimum value | ||
Root mean square | Peak-to-peak value | ||
Square mean root | Waveform index | ||
Absolute mean | Peak index | ||
Skewness | Pulse index | ||
Kurtosis | Margin index | ||
Variance | Skewness index | ||
Maximum value | Kurtosis index |
Feature | Equation | Feature | Equation |
---|---|---|---|
Mean | Kurtosis | ||
Variance of mean frequency | Root mean square ratio | ||
Skewness power spectrum | Frequency center | ||
Kurtosis power spectrum | Root mean square | ||
Root variance | Mean frequency that crosses the mean of the time-domain signal | ||
Coefficient of variability | Stabilisation factor | ||
Skewness |
No. | Device ID | Device Type | P/kW | Q/kVar | Sample Size | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Max | Min | Mean | Variance | Max | Min | Mean | Variance | ||||
1 | RMM | Raw material mill | 2.6467 × 103 | 0 | 1.1478 × 103 | 713.4977 | 1.0959 × 103 | 0 | 514.6201 | 329.7208 | 76 |
2 | KTHTF | kiln tail high temperature fan | 1.1358 × 103 | 0 | 953.7741 | 199.1667 | 333.5100 | 0 | 232.5849 | 48.6378 | 77 |
3 | EGTF | Exhaust gas treatment fan | 354.8300 | 0 | 173.9166 | 56.7118 | 84.2600 | 0 | 41.2736 | 12.1320 | 77 |
4 | CMF | Coal mill fan | 165.3700 | 0 | 133.2663 | 22.7862 | 40.6000 | 0 | 31.7787 | 5.7884 | 78 |
5 | CMM | Coal mill motor | 195.4000 | 0 | 121.2151 | 27.3095 | 148.4600 | 0 | 125.8172 | 14.7483 | 78 |
6 | KHIDF | Kiln head induced draft fan | 105.9500 | 49.4900 | 79.3093 | 7.6361 | 26.6500 | 12.0500 | 19.1434 | 1.9674 | 77 |
7 | FRP | Fixed roller press | 1.6523 × 103 | 0 | 1.0958 × 103 | 557.2647 | 549.7500 | 0 | 335.6737 | 169.0129 | 71 |
8 | DRP | Dynamic roller press | 1.6599 × 103 | 0 | 1.1245 × 103 | 561.6063 | 558.8900 | 0 | 351.2886 | 173.1773 | 73 |
Feature Set | Number of Features | Training Set | Test Set |
---|---|---|---|
Time-domain features | 32 | 0.8251 (401/486) | 0.7686 (93/121) |
Frequency-domain features | 26 | 0.8498 (413/486) | 0.7934 (96/121) |
Time–frequency-domain features | 58 | 0.8930 (434/486) | 0.8264 (100/121) |
Classification Model | Accuracy | Precision | Recall | F1-Score | Rank |
---|---|---|---|---|---|
SVM | 0.7769 | / | 0.7714 | / | 8 |
ELM | 0.7273 | 0.7432 | 0.7239 | 0.6995 | 6 |
KELM | 0.7355 | 0.7445 | 0.7342 | 0.7124 | 10 |
FA-SVM | 0.9008 | 0.8975 | 0.8975 | 0.8971 | 36 |
FA-ELM | 0.8347 | 0.8312 | 0.8306 | 0.8243 | 22 |
FA-KELM | 0.9256 | 0.9242 | 0.9225 | 0.9222 | 55 |
SMA-SVM | 0.9091 | 0.9069 | 0.9065 | 0.9056 | 43 |
SMA-ELM | 0.8182 | 0.8123 | 0.8145 | 0.8108 | 16 |
SMA-KELM | 0.8926 | 0.8886 | 0.8880 | 0.8874 | 32 |
MPA-SVM | 0.9091 | 0.9064 | 0.9053 | 0.9048 | 40 |
MPA-ELM | 0.8347 | 0.8372 | 0.8300 | 0.8186 | 21 |
MPA-KELM | 0.9256 | 0.9225 | 0.9220 | 0.9220 | 52 |
IMPA-SVM | 0.9174 | 0.9148 | 0.9148 | 0.9143 | 48 |
IMPA-ELM | 0.8430 | 0.8613 | 0.8436 | 0.8375 | 28 |
IMPA-KELM | 0.9339 | 0.9321 | 0.9315 | 0.9313 | 60 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhou, M.; Zhu, Z.; Hu, F.; Bian, K.; Lai, W. An Industrial Load Classification Method Based on a Two-Stage Feature Selection Strategy and an Improved MPA-KELM Classifier: A Chinese Cement Plant Case. Electronics 2023, 12, 3356. https://doi.org/10.3390/electronics12153356
Zhou M, Zhu Z, Hu F, Bian K, Lai W. An Industrial Load Classification Method Based on a Two-Stage Feature Selection Strategy and an Improved MPA-KELM Classifier: A Chinese Cement Plant Case. Electronics. 2023; 12(15):3356. https://doi.org/10.3390/electronics12153356
Chicago/Turabian StyleZhou, Mengran, Ziwei Zhu, Feng Hu, Kai Bian, and Wenhao Lai. 2023. "An Industrial Load Classification Method Based on a Two-Stage Feature Selection Strategy and an Improved MPA-KELM Classifier: A Chinese Cement Plant Case" Electronics 12, no. 15: 3356. https://doi.org/10.3390/electronics12153356
APA StyleZhou, M., Zhu, Z., Hu, F., Bian, K., & Lai, W. (2023). An Industrial Load Classification Method Based on a Two-Stage Feature Selection Strategy and an Improved MPA-KELM Classifier: A Chinese Cement Plant Case. Electronics, 12(15), 3356. https://doi.org/10.3390/electronics12153356