A Powerful Prediction Framework of Fracture Parameters for Hydraulic Fracturing Incorporating eXtreme Gradient Boosting and Bayesian Optimization
Abstract
:1. Introduction
- In situ observation methods: This method focuses on observing and recording at the fracturing site to study the phenomena that occur during hydraulic fracturing. Representative field observation methods include seismic exploration, surface deformation monitoring, and groundwater level monitoring. American petroleum engineer John F. Schlumberger invented electronic logging, which can characterize rock fractures by measuring the speed and path of sound waves propagating through rock, in the 1930s; Hiroaki Niitsuma [2] used the seismic exploration method to determine the fracture and stress of underground rocks; and Zou [3] obtained three-dimensional images and internal fracture information of underground rocks using computed tomography.
- Laboratory experimental methods: This method mainly uses laboratory experiments to study the physical and mechanical properties during hydraulic fracturing. J. Groenenboom [4] used the velocity and path of ultrasonic wave propagation in rock to determine the fracture characteristics of rock; Liu [5] used a triaxial hydraulic fracturing experimental setup to study the effects of prefabricated fracture parameters, fracturing fluid viscosity and rock physical properties on hydraulic fracturing.
- Mathematical modeling method: This method mainly describes the physical and mechanical phenomena in the hydraulic fracturing process by establishing mathematical models. Axel KL Ng [6] proposed a numerical procedure based on the finite element method to predict hydraulic fractures in the core wall of earth and rock dams; Torres Sergio Andres Galindo [7] proposed a discrete element simulation method of hydraulic fracture process in oil wells considering the elastic properties of rocks and the Mohr–Coulomb fracture criterion; and Jeroen Groenenboom [8] proposed a finite difference model to characterize fractures using seismic waves generated by an active seismic source.
2. Research Methodology and Model Construction
2.1. Numerical Simulation
2.2. eXtreme Gradient Boosting Model
2.3. Hyperparameter Optimization
3. Practical Use of Model
3.1. Background of the Project
3.2. Methods of Construction and Use of the Dataset
3.3. Research Technology Route
3.4. Model Evaluation Metrics
4. Results and Discussion
4.1. Model Preference
4.2. Bayesian Optimization
4.3. Analysis of the Results of the Time-Series Fit to the Two-Dimensional Data
4.3.1. Analysis of Fracture Length Results for Two-Dimensional Data
4.3.2. Analysis of Fracture Area Results for Two-Dimensional Data
4.4. Feature Importance Analysis
- Tree model-based feature importance analysis methods: For example, in tree model-based algorithms such as random forest and GBDT, feature importance metrics (information gain, Gini index, etc.) can be used to assess the degree of impact of each feature on the performance of the model.
- Linear model-based feature importance analysis methods: In linear regression, logistic regression and other linear model-based algorithms, feature coefficients or normalization coefficients can be used to assess the degree of contribution of each feature to the model.
- Feature importance analysis method based on model adjustment: The impact of each feature on the performance of the model can be assessed by gradually removing certain features and then comparing the change in model performance. After obtaining the feature importance assessment results, feature selection or adjustment can be carried out based on the assessment results, and the improvement in model performance and efficiency can be achieved by retaining features with high importance, eliminating redundant features, or merging features.
4.5. Analysis of the Results of the 3D Data Time-Series Fitting
4.6. Actual Engineering Applications
4.6.1. Prediction of Actual Fracture Width in the Vertical Well Projects
4.6.2. Prediction of Actual Fracture Pressures in the Vertical Well Projects
4.6.3. Prediction of Actual Proppant Concentration in the Vertical Well Projects
4.6.4. Prediction of Actual Hydraulic Conductivity in the Vertical Well Projects
4.7. Computational Time Analysis
5. Conclusions
- For the analysis of the spatiotemporal fitting results of 2D data, it can be found that the model is able to predict the change rule of the length and the area of each fracture under different samples with relatively high accuracy under the initial time step after completing the training through the training set. However, with increasing time steps, the prediction accuracy decreases, and the change in geological conditions during the hydraulic fracturing injection process leads to the above change rule of the model prediction accuracy.
- The data obtained from the model predictions are visualized to generate three-dimensional images that can be directly observed. Comparing the prediction results of numerical simulation and machine learning models, it can be found that the numerical simulation software generates cracks by continuously adding “empty grids” with a value of 0 on both sides to simulate the increase in cracks in the length direction. Under three-dimensional conditions, the morphological changes during crack expansion can be visually depicted.
- The hyperparameters of the XGBoost-based fracturing simulation model were adaptively optimized using a Bayesian optimization algorithm. The predicted fracture width, fracturing pressure, proppant concentration and inflow capacity of a single fracture in an actual vertical well have R2 values above 0.94, which makes the optimization of the model somewhat universal.
- The study compares the time required to calculate the fracture morphology by different methods. Compared with the traditional numerical simulation methods, the average computation time of the prediction methods using machine learning models is improved by more than 95%, which greatly optimizes the computation process and reduces the required computational resources.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Mahmoud, A.; Gowida, A.; Aljawad, M.S.; Al-Ramadan, M.; Ibrahim, A.F. Advancement of Hydraulic Fracture Diagnostics in Unconventional Formations. Geofluids 2021, 2021, 4223858. [Google Scholar] [CrossRef]
- Niitsuma, H.; Fehler, M.; Jones, R.; Wilson, S.; Albright, J.; Green, A.; Baria, R.; Hayashi, K.; Kaieda, H.; Tezuka, K.; et al. Current status of seismic and borehole measurements for HDR/HWR development. Geothermics 1999, 28, 475–490. [Google Scholar] [CrossRef]
- Zou, Y.; Zhang, S.; Zhou, T.; Zhou, X.; Guo, T. Experimental Investigation into Hydraulic Fracture Network Propagation in Gas Shales Using CT Scanning Technology. Rock Mech. Rock Eng. 2016, 49, 33–45. [Google Scholar]
- Groenenboom, J.; Romijn, R. Acoustic monitoring of hydraulic fracture growth. First Break. 1997, 15, 295–303. [Google Scholar]
- Liu, Z.; Wang, S.; Ye, H.; Yang, L.; Feng, F.; Lian, H.; Yang, D. Experimental study on the effects of pre-cracks, fracturing fluid, and rock mechanical characteristics on directional hydraulic fracturing with axial pre-cracks. Geomech. Geophys. Geo-Energy Geo-Resour. 2021, 7, 29. [Google Scholar] [CrossRef]
- Axel, K.L.N.; John, C.S. A case study of hydraulic fracturing using finite element methods. Can. Geotech. J. 1999, 36, 861–875. [Google Scholar]
- Galindo, T.S.A.; Muñoz, C.J.D. Simulation of the hydraulic fracture process in two dimensions using a discrete element method. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 2007, 75 Pt 2, 066109. [Google Scholar] [CrossRef]
- Groenenboom, J.; Falk, J. Scattering by hydraulic fractures: Finite-difference modeling and laboratory data. Geophysics 2012, 65, 612–622. [Google Scholar] [CrossRef]
- Ma, H.; Zhao, W.; Zhao, Y.; He, Y. A Data-Driven Oil Production Prediction Method Based on the Gradient Boosting Decision Tree Regression. Comput. Model. Eng. Sci. 2022, 134, 1773–1790. [Google Scholar] [CrossRef]
- Song, H.; Zhu, J.; Wei, C.; Wang, J.; Du, S.; Xie, C. Data-driven physics-informed interpolation evolution combining historical-predicted knowledge for remaining oil distribution prediction. J. Pet. Sci. Eng. 2022, 217, 110795. [Google Scholar] [CrossRef]
- Chen, G.; Luo, X.; Jiao, J.J.; Xue, X. Data-driven evolutionary algorithm for oil reservoir well-placement and control optimization. Fuel 2022, 326, 125125. [Google Scholar] [CrossRef]
- Qin, S.; Liu, J.; Yang, X.; Li, Y.; Zhang, L.; Liu, Z. Predicting Heavy Oil Production by Hybrid Data-Driven Intelligent Models. Math. Probl. Eng. 2021, 2021, 5558623. [Google Scholar] [CrossRef]
- Du, S.; Wang, M.; Yang, J.; Zhao, Y.; Wang, J.; Yue, M.; Xie, C.; Song, H. An enhanced prediction framework for coalbed methane production incorporating deep learning and transfer learning. Energy 2023, 282, 128877. [Google Scholar] [CrossRef]
- Du, S.; Wang, J.; Wang, M.; Yang, J.; Zhang, C.; Zhao, Y.; Song, H. A systematic data-driven approach for production forecasting of coalbed methane incorporating deep learning and ensemble learning adapted to complex production patterns. Energy 2023, 263, 126121. [Google Scholar] [CrossRef]
- Song, H.; Du, S.; Yang, J.; Zhao, Y.; Yu, M. Evaluation of hydraulic fracturing effect on coalbed methane reservoir based on deep learning method considering physical constraints. J. Pet. Sci. Eng. 2022, 212, 110360. [Google Scholar] [CrossRef]
- Du, S.Y.; Zhao, X.G.; Xie, C.Y.; Zhu, J.W.; Wang, J.L.; Yang, J.S.; Song, H.Q. Data-driven production optimization using particle swarm algorithm based on the ensemble-learning proxy model. Pet. Sci. 2023, 20, 2951–2966. [Google Scholar] [CrossRef]
- Yang, R.; Qin, X.; Liu, W.; Huang, Z.; Shi, Y.; Pang, Z.; Zhang, Y.; Li, J.; Wang, T. A Physics-Constrained Data-Driven Workflow for Predicting Coalbed Methane Well Production Using Artificial Neural Network. SPE J. 2022, 27, 1531–1552. [Google Scholar] [CrossRef]
- Shen, Y.; Cao, D.; Ruddy, K.; de Moraes, L.F.T. Near Real-Time Hydraulic Fracturing Event Recognition Using Deep Learning Methods. SPE Drill. Complet. 2020, 35, 478–489. [Google Scholar] [CrossRef]
- Shi, J.; Shen, B. Approximation schemes of stresses on elements for the three-dimensional displacement discontinuity method. Eng. Anal. Bound. Elem. 2014, 48, 63–72. [Google Scholar] [CrossRef]
- Fath, H.A.; Madanifar, F.; Abbasi, M. Implementation of multilayer perceptron (MLP) and radial basis function (RBF) neural networks to predict solution gas-oil ratio of crude oil systems. Petroleum 2020, 6, 80–91. [Google Scholar] [CrossRef]
- Mohammadzaheri, M.; Tafreshi, R.; Khan, Z.; Ghodsi, M.; Franchek, M.; Grigoriadis, K. Modelling of petroleum multiphase flow in electrical submersible pumps with shallow artificial neural networks. Ships Offshore Struct. 2020, 15, 174–183. [Google Scholar] [CrossRef]
- Salehinia, S.; Salehinia, Y.; Alimadadi, F.; Sadati, S.H. Forecasting density, oil formation volume factor and bubble point pressure of crude oil systems based on nonlinear system identification approach. J. Pet. Sci. Eng. 2016, 147, 47–55. [Google Scholar] [CrossRef]
- Tahmasebi, P.; Hezarkhani, A. A fast and independent architecture of artificial neural network for permeability prediction. J. Pet. Sci. Eng. 2012, 86–87, 118–126. [Google Scholar] [CrossRef]
- Al-Bulushi, N.I.; King, P.R.; Blunt, M.J.; Kraaijveld, M. Artificial neural networks workflow and its application in the petroleum industry. Neural Comput. Appl. 2012, 21, 409–421. [Google Scholar] [CrossRef]
- Torabi, F.; Abedini, A.; Abedini, R. The Development of an Artificial Neural Network Model for Prediction of Crude Oil Viscosities. Pet. Sci. Technol. 2011, 29, 804–816. [Google Scholar] [CrossRef]
- Chen, X.; Tang, C.; Du, Z.; Tang, L.; Wei, J.; Ma, X. Numerical simulation on multi-stage fractured horizontal wells in shale gas reservoirs based on the finite volume method. Nat. Gas Ind. B 2018, 6, 347–356. [Google Scholar] [CrossRef]
- Zou, J.; Jiao, Y.Y.; Tan, F.; Lv, J.; Zhang, Q. Complex hydraulic-fracture-network propagation in a naturally fractured reservoir. Comput. Geotech. 2021, 135, 104165. [Google Scholar] [CrossRef]
- Ma, X.; Sun, Y.; Guo, W.; Jia, R.; Li, B. Numerical simulation of horizontal well hydraulic fracturing technology for gas production from hydrate reservoir. Appl. Ocean. Res. 2021, 112, 102674. [Google Scholar] [CrossRef]
- Peng, Z.; Desheng, Z. Study on fracture propagation and interaction mechanism during hydraulic fracturing. IOP Conf. Ser. Earth Environ. Sci. 2021, 621, 012134. [Google Scholar]
- Barba, D.F.; Picano, F. A Novel Approach for Direct Numerical Simulation of Hydraulic Fracture Problems. Flow Turbul. Combust. 2020, 105, 335–357. [Google Scholar] [CrossRef]
- Davarpanah, A.; Mirshekari, B. Numerical modelling of hydraulic fracturing procedure in hydrocarbon reservoirs. Model. Earth Syst. Environ. 2019, 5, 1297–1302. [Google Scholar] [CrossRef]
- Shi, Y.; Qi, H.; Qi, X.; Mu, X. An efficient hyper-parameter optimization method for supervised learning. Appl. Soft Comput. J. 2022, 126, 109226. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, J.; Wang, Y. A Novel Classification Model based on Ensemble Feature Selection and Hyper-parameter Optimation. J. Artif. Intell. Pract. 2022, 5, 11–17. [Google Scholar]
- Chukwuemeka, O.N.; MacGregor AO, J.; Chukwualuka, L.N.; Ferguson, O. Automated Machine Learning Models and State-Of-The-Art Effort in Mitigating Combined Algorithm Selection and Hyperparameter Optimization Problems: A Review. Mach. Learn. Res. 2022, 7, 1–7. [Google Scholar]
- Nametala, C.A.; Faria, W.R.; Pereira Júnior, B.R. On the performance of the Bayesian optimization algorithm with combined scenarios of search algorithms and scoring metrics. Genet. Program. Evolvable Mach. 2022, 23, 193–223. [Google Scholar] [CrossRef]
- Vahid, R.; Somayeh, M.; Erik, C. Using Bayesian optimization algorithm for model-based integration testing. Soft Comput. 2021, 26, 3503–3525. [Google Scholar]
- Dobreva, P.; Dyk, V.E.E.; Vorster, J.F. New approach to evaluating predictive models of photovoltaic systems. Sol. Energy 2020, 204, 134–143. [Google Scholar] [CrossRef]
- Martins, C.J.; Sousa, A.L. Bioelectronic Vision: Retina Models, Evaluation Metrics and System Design; World Scientific Publishing Company: Singapore, 2009. [Google Scholar]
- Wood, N.T.; Peoples, C.M. Pre-validation and performance prediction using pressure monitoring to evaluate HPLC method development changes. Chromatographia 2012, 75, 441–448. [Google Scholar] [CrossRef]
- Tibshirani, J.R.; Efron, B. Pre-validation and inference in microarrays. Stat. Appl. Genet. Mol. Biol. 2011, 1. [Google Scholar] [CrossRef]
- Anita, R.; Dávid, B.; Károly, H. Effect of Dataset Size and Train/Test Split Ratios in QSAR/QSPR Multiclass Classification. Molecules 2021, 26, 1111. [Google Scholar]
- Liu, D.; Lecampion, B.; Garagash, I.D. Propagation of a fluid-driven fracture with fracture length dependent apparent toughness. Eng. Fract. Mech. 2019, 220, 106616. [Google Scholar] [CrossRef]
- Wang, Z.; Li, W.; Qiao, L.; Liu, J.; Yang, J. Hydraulic properties of fractured rock mass with correlated fracture length and aperture in both radial and unidirectional flow configurations. Comput. Geotech. 2018, 104, 167–184. [Google Scholar] [CrossRef]
- Yang, L.; Wu, S.; Gao, K.; Shen, L. Simultaneous propagation of hydraulic fractures from multiple perforation clusters in layered tight reservoirs: Non-planar three-dimensional modelling. Energy 2022, 254, 124483. [Google Scholar] [CrossRef]
- Luo, T.; Liu, S.; Liu, S. A productivity model for vertical wells with horizontal multi-fractures. Int. J. Oil Gas Coal Technol. 2022, 31, 225–241. [Google Scholar] [CrossRef]
- Luo, T.; Liu, S.; Liu, S. Using XGBoost Regression to Analyze the Importance of Input Features Applied to an Artificial Intelligence Model for the Biomass Gasification System. Inventions 2022, 31, 225–241. [Google Scholar]
- Coffer, B.; Kubacki, M.; Wen, Y.; Zhang, T.; Barajas, C.A.; Gobbert, M.K. Machine Learning with Feature Importance Analysis for Tornado Prediction from Environmental Sounding Data. PAMM 2021, 20, e202000112. [Google Scholar] [CrossRef]
- Wu, M.H.; Cai, S.; Zeng, C.Y.; Wang, Z.F.; Zhao, N.; Zhu, L.; Wang, J. Big data analytics: An improved method for large-scale fabrics detection based on feature importance analysis from cascaded representation. Int. J. Grid Util. Comput. 2021, 12, 81–93. [Google Scholar] [CrossRef]
- Qiu, C.; Schmitt, M.; Mou, L.; Ghamisi, P.; Zhu, X.X. Feature Importance Analysis for Local Climate Zone Classification Using a Residual Convolutional Neural Network with Multi-Source Datasets. Remote Sens. 2018, 10, 1572. [Google Scholar] [CrossRef]
- Ma, D.; Xiao, J.; Li, Y.; Diao, Y.; Guo, Y.; Li, M. Feature importance analysis in guide strand identification of microRNAs. Comput. Biol. Chem. 2011, 35, 131–136. [Google Scholar] [CrossRef]
Model | R2 Score |
---|---|
eXtreme gradient boosting model | 0.9985 |
Bayesian ridge regression model | 0.8645 |
Support vector machine regression model | 0.9395 |
Multilayer perceptron regression model | 0.9736 |
Hyperparameters | Range | Optimal Parameter Results |
---|---|---|
Number of decision trees | 1–300 | 78 |
Maximum depth of the tree | 1–25 | 3 |
Minimum sample size for splitting | 2–10 | 2 |
Bootstrap sampling | True/False | True |
Number of Bayesian iterations | / | 40 |
k-fold cross-validation | / | 5 |
Model | R2 Score |
---|---|
Pre-optimization | 0.9533 |
Post-optimization | 0.9907 |
R2 Score | MAE | MSE | |
---|---|---|---|
The first fracture | 0.9667 | 0.0125 | 0.000376 |
The second fracture | 0.9726 | 0.0131 | 0.000414 |
The third fracture | 0.9614 | 0.0138 | 0.000463 |
The fourth fracture | 0.9692 | 0.0129 | 0.000406 |
R2 Score | MAE | MSE | |
---|---|---|---|
The first fracture | 0.9816 | 0.0091 | 0.000225 |
The second fracture | 0.9792 | 0.0116 | 0.000377 |
The third fracture | 0.9793 | 0.0111 | 0.000311 |
The fourth fracture | 0.9820 | 0.0093 | 0.000209 |
Time Step | R2 Score |
---|---|
10 min | 0.9976 |
20 min | 0.9803 |
30 min | 0.9782 |
Hyperparameters | Range | Optimal Parameter Results |
---|---|---|
Number of decision trees | 1–300 | 125 |
Maximum depth of the tree | 1–25 | 6 |
Minimum sample size for splitting | 2–10 | 3 |
Bootstrap sampling | True/False | True |
Number of Bayesian iterations | / | 40 |
k-fold cross-validation | / | 5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, Z.; Lei, Q.; Weng, D.; Yang, L.; Wang, X.; Wang, Z.; Fan, M.; Wang, J. A Powerful Prediction Framework of Fracture Parameters for Hydraulic Fracturing Incorporating eXtreme Gradient Boosting and Bayesian Optimization. Energies 2023, 16, 7890. https://doi.org/10.3390/en16237890
Liu Z, Lei Q, Weng D, Yang L, Wang X, Wang Z, Fan M, Wang J. A Powerful Prediction Framework of Fracture Parameters for Hydraulic Fracturing Incorporating eXtreme Gradient Boosting and Bayesian Optimization. Energies. 2023; 16(23):7890. https://doi.org/10.3390/en16237890
Chicago/Turabian StyleLiu, Zhe, Qun Lei, Dingwei Weng, Lifeng Yang, Xin Wang, Zhen Wang, Meng Fan, and Jiulong Wang. 2023. "A Powerful Prediction Framework of Fracture Parameters for Hydraulic Fracturing Incorporating eXtreme Gradient Boosting and Bayesian Optimization" Energies 16, no. 23: 7890. https://doi.org/10.3390/en16237890
APA StyleLiu, Z., Lei, Q., Weng, D., Yang, L., Wang, X., Wang, Z., Fan, M., & Wang, J. (2023). A Powerful Prediction Framework of Fracture Parameters for Hydraulic Fracturing Incorporating eXtreme Gradient Boosting and Bayesian Optimization. Energies, 16(23), 7890. https://doi.org/10.3390/en16237890