Coal and Gas Outburst Prediction Model Based on Miceforest Filling and PHHO–KELM
Abstract
:1. Introduction
2. Selection of Characteristic Variables and Data Preprocessing
2.1. Selection of Characteristic Variables
2.2. Multiple Filling of Chained Equations for Random Forests (Miceforest)
- Use the miceforest algorithm to fill missing data of the original dataset m (default m = 4) times to obtain multiple complete datasets.
- Perform mathematical statistical analyses such as the mean change rate for each complete dataset, and tabulate the resulting results.
- In accordance with the principle of taking the optimum, select statistical results from step 2 for integration of the same columns to obtain the final filled complete dataset.
2.3. Filling Missing Data Values with the Miceforest Algorithm
3. Coal and Gas Outburst Prediction Model
3.1. Kernel Extreme Learning Machine
3.2. Harris Hawk Optimization Algorithm
- Global search phase
- Transition phase
- Local development stage
3.3. Piecewise Chaotic Mapping
3.4. Construction of Coal and Gas Outburst Prediction Model Based on Miceforest–PHHO–KELM
- Establish before- and after-filling datasets of coal and gas outbursts, respectively. The missing parts of the pre-fill dataset are deleted directly and only the complete parts are kept. The post-fill dataset is used to fill the missing parts of the data using the miceforest algorithm to ensure that all data are complete.
- Initialize all the parameters of the KELM and HHO models. Piecewise chaotic mapping is used to optimize and initialize the population; the number of populations is set to N and the number of iterations is set to T.
- Use the train set as the input vector of KELM to train KELM.
- Calculate the fitness value of each solution and evaluate the fitness to select the best fitness value.
- Update the prey’s escape energy, probability of escape and random escape intensity according to the HHO algorithm formula.
- Update the search space selected by the hawks to implement the update adjustment to the individual positions, obtain the new search prey and selection area, and calculate and evaluate the adaptability of its corresponding deterministic solution.
- Find the optimal position and make a record of its corresponding fitness value, and change the number of iterations to t + 1.
- Determine whether the maximum number of iterations is reached. If it is satisfied, obtain the optimal penalty coefficient and threshold value; otherwise, go to step 4.
- Build the PHHO–KELM classification prediction model according to the optimal parameters of the final output, and import the test set to output the recognition results.
4. Experimental Results and Analysis
5. Conclusions
- The miceforest method was proposed to fill in missing data values, and gave optimal results in terms of RMSE and R2 evaluation compared to KNN, regression and RF. After miceforest filling, the accuracy of non-salient samples was improved to a certain extent, and the accuracy of salient samples and the overall accuracy were significantly higher than for the pre-filling dataset in each model. Miceforest filling improved the salient sample accuracy and overall accuracy by at least 8.2% and 3.79%, so it was an effective algorithm to fill in the missing values of the samples.
- Comparing the prediction effect of KELM and SVM, results showed that the salient sample accuracy, overall accuracy and Kappa coefficient of KELM before and after filling were significantly better than SVM, so the prediction effect of KELM was better than that of SVM.
- After miceforest filling, the optimal coal and gas outburst prediction miceforest–PHHO–KELM model was established by selecting PHHO to optimize the penalty coefficient and kernel function parameters of KELM. Compared with other models, miceforest–PHHO–KELM had higher prediction accuracy and precision, and its salient sample prediction accuracy, overall prediction accuracy and Kappa coefficient were 96.77%, 98.50% and 0.9698, respectively. These results verified that PHHO can effectively improve the prediction performance of KELM, and the miceforest–PHHO–KELM model had better prediction accuracy and recognition rate in the prediction of coal and gas outbursts.
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Zhang, C.L.; Wang, E.Y.; Wang, Y.B.; Zhou, X.F. Spatial and temporal distribution of coal and gas protrusion accidents in China in the past 20 years and suggestions for prevention and control. Coalf. Geol. Explor. 2021, 49, 134–141. [Google Scholar]
- Rudakov, D.; Sobolev, V. A mathematical model of gas flow during coal outburst initiation. Int. J. Min. Sci. Technol. 2019, 29, 791–796. [Google Scholar] [CrossRef]
- Black, D.J. Review of coal and gas outburst in Australian underground coal mines. Int. J. Min. Sci. Technol. 2019, 29, 815–824. [Google Scholar] [CrossRef]
- Kursunioglu, N.; Onder, M. Application of structural equation modeling to evaluate coal and gas outbursts. Tunn. Undergr. Space Technol. 2019, 88, 63–72. [Google Scholar] [CrossRef]
- Li, Y.J.; Yang, Y.G. Research on coal and gas prominence prediction based on improved PSO optimized LS-SVM parameters. Coal Technol. 2017, 36, 129–131. [Google Scholar] [CrossRef]
- Yan, Z.; Yao, K.; Yang, Y. A novel adaptive differential evolution SVM model for predicting coal and gas outbursts. J. Differ. Equ. Appl. 2016, 23, 238–248. [Google Scholar] [CrossRef]
- Li, J. IGSA-SVM Prediction Model for Coal and Gas Outburst and Its Application. Ph.D. Thesis, Taiyuan University of Technology, Taiyuan, China, 2016. [Google Scholar]
- Wu, Y.; Gao, R.; Yang, J. Prediction of coal and gas outburst: A method based on the BP neural network optimized by GASA. Process Saf. Environ. Prot. 2020, 133, 64–72. [Google Scholar] [CrossRef]
- Xu, Y.S.; Cheng, Y.W. Coal and gas protrusion hazard prediction based on SKPCA and NEAT algorithm. J. Saf. Environ. 2021, 21, 1427–1433. [Google Scholar]
- Fu, H.; Meng, T.R.; Yan, X.; Lu, W.J. Optimization of coal and gas protrusion prediction model for quantum gate line. Control Eng. 2021, 28, 1731–1737. [Google Scholar]
- Wang, Y.; Meng, Y.; Fu, H.; Tu, N.; Xu, Y. Optimized limit learning machine for coal and gas prominence prediction method. Control Eng. 2022, 29, 2131–2137. [Google Scholar]
- Lin, H.F.; Zhou, J.; Jin, H.W.; Yang, Z.Y.; Liu, S.H. Collaborative prediction method of coal and gas prominence hazard level based on feature selection and machine learning. J. Min. Saf. Eng. 2023, 40, 361–370. [Google Scholar]
- Ru, Y.D.; Lyu, X.F.; Guo, J.K.; Hongquan, Z.; Lijuan, C. Real-Time prediction model of coal and gas outburst. Math. Probl. Eng. 2020, 2020, 2432806. [Google Scholar]
- Zheng, X.L. Research on the key technology of coal and gas protrusion prediction based on gas content method. Ph.D. Thesis, Anhui University of Technology, Maanshan, China, 2018. [Google Scholar]
- Zheng, X.L.; Lai, W.H.; Xue, S. Application of MI and SVM in coal and gas outburst prediction. China Saf. Sci. J. 2021, 31, 75–80. [Google Scholar]
- Chen, L.-C.; Chen, J.-H. Research on the effect of coal and gas prominence prediction based on data filling-machine learning. China Sci. Technol. Saf. Prod. 2022, 18, 69–74. [Google Scholar]
- Shao, L.S.; Zhan, S.F. Coal and gas protrusion missForest-EGWO-SVM prediction model. J. Liaoning Univ. Eng. Technol. 2020, 39, 214–218. [Google Scholar]
- Wen, T.X.; Su, H.B. WOA-ELM coal and gas protrusion prediction model based on chain multiple interpolation. China Saf. Prod. Sci. Technol. 2022, 18, 68–74. [Google Scholar]
- Shi, H.Y.; Ma, N.J.; Xu, H.T. Exploring the mechanism of coal and gas protrusion based on energy theory. China Sci. Technol. Saf. Prod. 2019, 15, 88–92. [Google Scholar]
- Wang, G.; Wu, M.; Wang, H.; Huang, Q.; Zhong, Y. Sensitivity analysis of influencing factors of coal and gas outburst based on energy balance model. J. Rock Mech. Eng. 2015, 34, 238–248. [Google Scholar]
- Zhang, J.; Wang, X.; Lu, L.; Niu, P. A comparative analysis study of several new intelligent optimization algorithms. Comput. Sci. Explor. 2022, 16, 88–105. [Google Scholar]
- Huang, G.-B.; Zhou, H.; Ding, X.; Zhang, R. Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B 2011, 42, 513–529. [Google Scholar] [CrossRef]
- Heidari, A.A.; Mirjalili, S.; Faris, H.; Aljarah, I.; Mafarja, M.; Chen, H. Harris hawks optimization: Algorithm and applications. Future Gener. Comput. Syst. 2019, 97, 849–872. [Google Scholar] [CrossRef]
- Tang, A.D.; Han, T.; Xu, D.W. Chaotic elite Harris hawks optimization algorithm. J. Comput. Appl. 2021, 41, 2265–2272. [Google Scholar]
- Song, L.; Wan, J.Z. A comparative study of missing data interpolation methods. Stat. Decis. Mak. 2020, 36, 10–14. [Google Scholar]
- Yi, H.; Wang, Y.; Huang, J.; Song, W.; Li, L. Research on sparrow search algorithm based on hybrid optimization strategy. Mechatron. Eng. Technol. 2023, 52, 93–97+17. [Google Scholar]
Parameter | Gas Content | Gas Pressure | Porosity of Coal | Coefficient of Coal-Bed Solidity | Initial Velocity of Gas Dissipation |
---|---|---|---|---|---|
Groups | 62 | 62 | 48 | 47 | 51 |
Missing | 0 | 0 | 14 | 15 | 11 |
Maximum | 26.00 | 4.54 | 9.60 | 2.00 | 35.00 |
Minimum | 7.12 | 0.28 | 2.94 | 0.12 | 5.00 |
Mean | 12.15 | 1.86 | 5.70 | 0.55 | 9.90 |
Standard deviation | 4.01 | 1.04 | 1.68 | 0.35 | 4.74 |
Missing Value Parameter | Porosity of Coal | Coefficient of Coal-Bed Solidity | Initial Velocity of Gas Dissipation | |
---|---|---|---|---|
Missing | 14 | 15 | 11 | |
Mean | Original data | 5.70 | 0.55 | 9.90 |
Regression | 5.70 | 0.56 | 9.93 | |
KNN | 5.76 | 0.54 | 9.86 | |
RF | 5.75 | 0.53 | 9.67 | |
miceforest | 5.83 | 0.53 | 9.66 | |
Standard deviation | Original data | 1.68 | 0.35 | 4.74 |
Regression | 1.48 | 0.31 | 4.32 | |
KNN | 1.53 | 0.31 | 4.32 | |
RF | 1.52 | 0.31 | 4.37 | |
miceforest | 1.68 | 0.31 | 4.37 |
Evaluation Index | Regression | KNN | RF | Miceforest |
---|---|---|---|---|
RMSE | 2.264 | 2.227 | 2.235 | 2.136 |
R2 | 0.584 | 0.599 | 0.619 | 0.651 |
Model | SVM | KELM | PSO–KELM | HHO–KELM | PHHO–KELM | |
---|---|---|---|---|---|---|
Pre-fill | Salient sample prediction accuracy/% | 62.86 | 65.71 | 74.29 | 85.71 | 88.57 |
Non-salient sample prediction accuracy/% | 91.55 | 92.96 | 94.37 | 95.77 | 97.18 | |
Overall prediction accuracy/% | 82.08 | 83.96 | 87.74 | 92.45 | 94.34 | |
Kappa coefficient | 0.5732 | 0.6180 | 0.7124 | 0.8268 | 0.8702 | |
Post-fill | Salient sample prediction accuracy/% | 83.87 | 85.48 | 88.71 | 93.55 | 96.77 |
Non-salient sample prediction accuracy/% | 92.96 | 94.37 | 97.18 | 98.59 | 100 | |
Overall prediction accuracy/% | 88.72 | 90.23 | 93.23 | 96.24 | 98.50 | |
Kappa coefficient | 0.7722 | 0.8027 | 0.8633 | 0.9242 | 0.9698 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shao, L.; Chen, W. Coal and Gas Outburst Prediction Model Based on Miceforest Filling and PHHO–KELM. Processes 2023, 11, 2722. https://doi.org/10.3390/pr11092722
Shao L, Chen W. Coal and Gas Outburst Prediction Model Based on Miceforest Filling and PHHO–KELM. Processes. 2023; 11(9):2722. https://doi.org/10.3390/pr11092722
Chicago/Turabian StyleShao, Liangshan, and Wenjing Chen. 2023. "Coal and Gas Outburst Prediction Model Based on Miceforest Filling and PHHO–KELM" Processes 11, no. 9: 2722. https://doi.org/10.3390/pr11092722
APA StyleShao, L., & Chen, W. (2023). Coal and Gas Outburst Prediction Model Based on Miceforest Filling and PHHO–KELM. Processes, 11(9), 2722. https://doi.org/10.3390/pr11092722