1. Introduction
Naturally fractured reservoirs represent a significant percentage of oil and gas reservoirs throughout the world [
1]. Hydraulic fracturing is a crucial technology for economically developing this type of reservoir. It was found that hydraulic fracturing of existing wells was more economic than infill drilling [
2]. In addition, the possibility of future discoveries of giant oil and gas reservoirs in the world is low. Hence, a robust and efficient method for choosing wells for fracturing through the utilization of the available reservoir data would lead to maximizing the recovery rates and efficiency of existing fields. Candidate well selection is the process of choosing wells that have the potential for higher production and better return on investment following the stimulation process [
3]. Over the last few decades, a number of studies have investigated the application of a range of decision support and artificial intelligence techniques and technologies in candidate well selection. These range from decision support systems using multivariate nonlinear regression [
4,
5,
6] to neural networks [
7,
8,
9,
10], analytical hierarchy process (AHP) [
11,
12,
13,
14,
15,
16] and fuzzy logic [
10,
17,
18,
19,
20]. Each of these approaches carries with it a set of advantages and limitations. Hence, it would seem natural that some attempts have been made to integrate them in order to obtain the best of all approaches [
21,
22,
23,
24,
25].
In order to choose the most appropriate wells for hydraulic fracturing, it is advantageous to be able to take into account all known relevant criteria. However, this increases the complexity of the process and may raise the number of parameters beyond the capacity of many conventional methods. Therefore, a way to assess all of the known criteria by combining methods would be very useful [
25,
26].
Many operators agree that the candidate well choosing process involves a high level of uncertainty and ambiguity [
7,
21]. Large scale field applications also show that one main factor leading to prominent dissatisfaction is the systems’ inability to handle uncertainty [
23,
27]. Each parameter or selection criterion has its own unique influence on identifying suitable candidate wells for hydraulic fracturing. Determining the relative importance of each one under varying conditions is complicated by confounding factors such as human bias, subjectivity and the complex interrelationships among the various criteria.
AHP [
15,
28,
29] provides a way to deal with these uncertainties by providing a framework for making pairwise comparisons among the criteria and assigning appropriate weights to each factor. However, it is sometimes difficult to make an accurate comparison due to incomplete information or uncertainty about the amount of difference there is between factors. In this situation, there is a need to augment the classical AHP method to be able to operate in the presence of fuzzy or incomplete comparisons [
14]. There is also some inherent statistical variation associated with the assessment provided by human decision makers. Hence, in order to reflect this stochastic behavior, it may be useful to apply a probability distribution to impute the values for certain variables.
Some researchers [
30,
31] highlight that geoscience disciplines, including candidate well selection decisions are subject to multiple sources of uncertainty and contain fuzzy issues. As a statistical technique, fuzzy logic has gained attention as for its ability to cope with uncertainty [
18,
32] and imprecise linguistic concepts or fuzzy terms [
33]. This is particularly true for the newly introduced type-2 fuzzy logic systems (type-2 FLS) [
34,
35,
36].
In addition to the issues relating to uncertainties, it is also evident that many operators lack the very high level of technical knowledge and analytical skills needed for selecting candidate wells for fracturing [
37]. McVey et al. [
2,
38,
39] argue that there is a need to provide a methodology that facilitates selecting the desired well/layer with minimum time and costs and that provides a framework that makes it easier to overcome the difficulties in conventional techniques.
In addition to the large number of objective and measurable factors that inform the selection of wells for fracturing, many oilfield operators possess a wealth of experience, intuition, and judgement that is invaluable for decision making. Therefore, operators’ judgement, intuition, and creativity should be an integral element of the candidate well selection process. However, there are no previous examples in the literature, where AHP, grey theory and fuzzy logic have been combined for the purpose of selecting wells for hydraulic fracturing. This paper aims to satisfy this need by implementing such a system and then to measure its performance on a set of wells in Western China.
The remainder of this paper is organised as follows. First, we present an overview of the hybridisation method, which combines human judgement, AHP, grey theory, and fuzzy logic. Each component is then described in more detail. This is followed by an evaluation of the method by testing it in a field application, and then the performance of the presented method is compared against a neural network application. Finally, some conclusions are drawn at the end of the paper.
2. A Hybrid Approach that Combines Human Judgment, AHP, Grey Theory, and Fuzzy Expert Systems for Candidate Well Selection
The problem of selecting the most appropriate well for fracturing is highly complex and involves several interacting variables. In this context, we use a combined method known as judgment incorporated intercommunication hybridisation for addressing the candidate well selection problem. In this method, the overall problem is divided into distinct jobs, with the roles of human decision-makers clarified and defined. These jobs are then assigned to relatively independent and distributed software components that share or exchange data, information and knowledge. The various tasks are carried out synchronously to produce solutions that incorporate human participants’ judgement for joint problem solving. The main motivation for designing such integration is to achieve a hybrid of diverse functions and the benefits of different tools in order to deliver improved decision support capabilities for candidate well selection.
The architecture of the hybrid approach is illustrated in
Figure 1, where different shapes are used to depict different system elements. The system consists of an AHP component that helps to determine individual strategic factors, a grey theory component that simulates and estimates the relative importance of the relevant factors as well as the fuzzy expert system element that performs intelligent approximate reasoning and recommends candidate wells. The central square-shaped symbol represents human judgment, which links all the elements in an intercommunication job-sharing method.
3. The AHP Component
The limitations of human cognition make it difficult to reliably obtain information about a complex system when there is a large volume of data in a range of disparate formats. AHP was developed to assist decision-making when there is a mix of qualitative, quantitative, and sometimes conflicting factors. It has previously been shown to be very effective in making complicated and often irreversible decisions [
29,
40]. The AHP component of the hybrid approach provides a framework to assist the operators to make reliable judgements when presented with verbal, graphical or numeric data [
41]. The results are then passed to the fuzzy expert system, which recommends wells for fracturing.
For the purposes of this study, the factors or criteria to be taken into consideration for selecting wells were obtained from actual field data and have been verified by experts [
3,
20,
42]. These are arranged into three high-level categories: ‘reserve capacity’, ‘deliverability’, and ‘fracturing efficiency’. These are then subdivided into several factors: ‘thickness’ (TH), ‘porosity’ (POR), ‘gas saturation’ (SGT), ‘natural gamma’ (GR), ‘neutron’ (NR), ‘density’ (DEN), ‘sonic’ (SON), ‘structure position’ (SPI), ‘lithology’ (LTG), ‘lateral resistivity depth difference’ (LRDD), ‘discharge rate’ (DCR), ‘sand ratio’ (SAR), ‘prepad ratio’ (PADR), and ‘sand intensity’ (SANDR). This arrangement is shown in
Table 1 and illustrated in
Figure 2.
4. The Grey Theory Component
After the formation of the decision hierarchy, the second stage of the well selection process is to assign weights to the criteria using what is known as grey theory. Grey system theory is an interdisciplinary scientific area that was first introduced by Deng [
43]. Its advantage over conventional statistical methods is that grey models only require a limited amount of data to estimate the behavior of unknown systems [
44] and obtain an unbiased and consistent point estimator. In grey systems, items such as operation, mechanism, structure, and behavior, are neither deterministic nor totally unknown, but are partially known. System behavior is explored using relational analysis and model construction [
44]. The method involves several steps, which are briefly discussed here.
4.1. Determine the Compared and Reference Sequence
In order to use grey theory, the first step is to generate an interval comparative series, which reflects the various criteria to be used for well selection. The decision factors can be represented by
, which are the evaluation object factors. Equation (1) shows a generic comparative series comprising
wells with
decision factors represented in the form of a matrix
.
where in
is the number of criteria for each well,
is the number of comparative series samples. The standard series is defined as an objective series that reflects the desired level of all the criteria. Here, this is made up of the ‘initial production’ (IP) of each well after hydraulic fracturing.
4.2. Dimensionless Processing
For better comparability between factors, it is essential to normalise the comparative series by performing a grey extreme difference transform. By correlating with IP, for the bigger is the better type indexes, the corresponding measured value of
is:
For the smaller is the better type, the corresponding measured value of
is:
where
represents the minimum value of the
jth criteria of all samples
,
represents the maximum value of the
jth feature of all samples
. Then we can obtain a normalized evaluation vector within the range [0, 1].
4.3. Calculation of Grey Relation Coefficient
The difference between the comparative and standard series is calculated using Equation (6) and is known as the grey relation coefficient:
where
,
is the value of the standard series
under dimensionless processing with Equations (3) and (4),
is the normalized evaluation value, and
is an identifier which only affects the relative value of risk without changing the priority, which can be set at 0.5 [
45].
4.4. Solution of the Correlation Coefficient
The essence of the correlation analysis is to compare the comparative series and standard series. The correlation between the two sequences will be the average of the two sequences associated with each moment coefficients, namely:
4.5. Calculation of the Weighting Factor
Each parameter is given a weight to reflect its overall importance in well selection. The weights are obtained as follows:
From this, we obtain vector . The degree of relation in the candidate well selection model denotes the relationship between the potential factors and the likelihood of choosing the candidate well. The higher the value obtained from Equation (8), the more influence it has on candidate well selection. Therefore, the increasing order of the degree of relation represents the importance ranking of the criterion.
7. Results and Discussion
In order to test the hybridising system in practice, real industrial databases from nineteen different Western Sichuan distinct fractured reservoir wells were acquired.
Table 2 shows the data for Well-1 thru Well-19 in terms of the variables chosen earlier namely: ‘thickness’ (TH), ‘porosity’ (POR), ‘gas saturation’ (SGT), ‘natural gamma’ (GR), ‘neutron’ (NR), ‘density’ (DEN), ‘sonic’ (SON), ‘structure position’ (SPI), ‘lithology’ (LTG), ‘lateral resistivity depth difference’ (LRDD), ‘discharge rate’ (DCR), ‘sand ratio’ (SAR), ‘prepad ratio’ (PADR), and ‘sand intensity’ (SANDR).
The objective is to predict the level of ‘initial production’ (IP) after fracturing. The model was estimated using data from wells 1 to 14 then the predictions for wells 15 to 19 were evaluated by comparing the results with the actual values.
7.1. Input Variable Analysis
In order to enable a clearer understanding of the input data, we first perform an analysis of the correlation coefficients between initial production (IP) and each of the input variables. In addition, cross plots are produced to illustrate the results of the statistical analysis. This is depicted in
Table 3,
Figure 3a,b. One should note that the input variable analysis indicates quite a low correlation with IP. This is because selecting a candidate well for stimulation in fractured reservoirs is a multi-criteria decision making process. Each parameter has its own unique influence on identifying suitable candidate wells and cannot provide all the information needed on its own. However this can be accomplished by the integration of geological and construction information [
26,
52].
An R-squared determination coefficient is used to determine whether two variables are related, and if so, how strongly. The coefficient ranges from +1, indicating a perfectly reliable relationship, to 0, indicating no relationship at all. Here we use the R-squared determination to show how each of the parameters is related to initial production. These are shown in
Table 3.
Another important parameter is the slope, which reflects how each parameter varies with initial production (IP). If the slope is positive, IP increases with the parameter. If it is negative, initial production decreases with the parameter. In this sense, we can see that parameters TH, POR, SGT, NR, SON, SPI, LTG, DCR, SAR, PADR, and SANDR are positively correlated with IP,
Figure 3a, whereas the remaining input variables: GR, DEN, and LRDD are negatively correlated,
Figure 3b.
7.2. Ranking the Input Variables
Our strategy for testing the hybrid model was to build a version using only data for wells 1–well 14. Then use the model to predict the initial output for wells 15–well 19. We divided each input variable into four review grades “I”, “II”, “III”, and “IV” with equal difference. For the variables that are positively correlated with IP, review grade “I” represents the larger value of the input variables under consideration. While for the input variables that are negatively correlated with IP, review grade “I” represents the smallest value of the input variables under consideration. This is illustrated in
Table 4 and
Table 5.
In this methodology, we utilise a grey theory tool that has proven to be very efficient in ranking the input parameters. In this specific study, we dealt with fourteen wells with fourteen drivers for each well. Based on the data listed in
Table 2, we used Equations (1)–(7) to calculate each criterion correlation coefficient, the results of which are shown in the fourth column of
Table 6. This is followed by applying Equation (8) to estimate the main criteria weights. For example, the sum of the weights for reservoir capacity is (4.2434). This was divided by the sum of all the values of the parameters (9.3188) to obtain the value of 0.4554. We find the main criteria weights (first-grade) for reserve capacity, deliverability, and hydraulic efficiency are 0.4554, 0.2416, and 0.3030 respectively. Also, we can obtain the sub-criteria weights (second-grade) for each of the factors listed in
Table 6.
By using the data from
Table 2 and the method described in Equations (1)–(8), the weights of the drivers in respect of their influence on IP were calculated. These are ranked on a scale from 0 to 1, shown in
Figure 4. In this example SPI is highest ranked driver, followed by LRDD, POR, SGT, SANDR, NR, SAR, LTG, SON, PADR, DCR, DEN, TH, and GR. The finding that SPI is the most important factor suggests that if a large portion of gas is produced from the wells located in a higher structure position this indicates that higher initial production is likely after fracturing. This is consistent with the structure-lithology of the region under investigation. In this type of reservoir, high IP implies plenty of natural fractures and the consequence is consistent with the real situation.
7.3. Single Criteria Assessment
A fuzzy model is used to reflect uncertainties presented as inexact intervals for a number of system parameters. Due to the prevalence of the normal distribution, supported by the central limit theorem and observation, it is the best distribution to use in most cases [
7]. Here we use Interval Type-2 Fuzzy Sets and Systems [
19,
48] to calculate the membership function. Taking the parameters of well-15 as an example, applying Equations (9)–(16) gives the results shown in
Table 7.
From these results, a single factor can be obtained corresponding to each factor fuzzy evaluation set. From the preceding analysis, there are fourteen terms of evaluation parameters and four level review grades. This is described in the following 14 × 4 fuzzy matrix
U:
7.4. Comprehensive Evaluation
A single factor fuzzy evaluation cannot reflect the combined effects of all the factors. Therefore, in order to obtain comprehensive results, the matrix multiplication summation algorithm, fuzzy evaluation matrix
U, and composite fuzzy weight vector
A single factor (
Figure 4) is used to consider the contribution of all the factors. This gives the fuzzy comprehensive evaluation vector
B:
where
bi corresponds to the membership grade I, II, III, and IV. For well-15, its membership of grades I, II, III, IV are 0.1348, 0.1188, 0.2849, and 0.2839 respectively. As grade III has the highest value, it can be said that well-15 belongs to grade III for candidate well selection. There is an initial production of 0.78 × 10
4 m
3/d after hydraulic fracturing, which is consistent with the estimate made using the hybrid approach. Similarly, the IP of the other four wells was calculated to verify the plausibility of the results shown in
Table 8. The wells with a higher grade are consistent with higher daily production after construction. This demonstrates that, despite the complexity of the problem, the proposed approach was able to capture the underlying relationship that exists between the input parameters and initial production (IP) after fracturing, based on all the available data. The actual field application of many wells using this technique has proven reliable and able to determine the most appropriate wells for hydraulic fracturing.
7.5. Comparison of the Presented Method with Other Approaches
In comparison with other approaches that are traditionally used for well selection, such as linear regression, nonlinear regression, AHP, conventional fuzzy logic, and a BP (backpropagation) neural network, the hybrid method has a number of advantages. Firstly, since the evaluation of candidate wells for fracturing involves a large number of parameters involving complex nonlinear relationships linear regression is clearly unsuitable. Non-linear regression approaches such as, for example, quadratic regression requires a much larger sample of data to be able to fit the function between the input parameters and IP. In addition, the functional form of the relationship must be specified in advance but we do not have a method of determining precisely what it is. Secondly, in the process of selecting a well for fracturing, it is necessary to specify a set of weights which indicate the extent of the influence each element has on the performance of the fractured well. While AHP is able to provide the weights, this is based solely on the conductors’ experience and subjective judgment, which, as we noted earlier, is subject to a range of biases and inconsistencies. Also, a neural network model trained by backpropagation only makes the inputs and outputs available for inspection while the internal workings are hidden. Finally, the conventional fuzzy logic algorithm can only assess a single criterion at a time, which precludes a comprehensive evaluation of all the parameters simultaneously.
The hybrid approach presented here combines the advantages of human judgment, AHP, grey theory, and fuzzy expert systems for well selection, which not only provides the final results but also reveals each parameter’s effect on the outcome.
In order to test the accuracy of the predictions provided by this method, the results are compared against those generated by a well-trained neural network. As this implements non-linear, non-parametric regression, it forms a general benchmark for evaluation.
7.5.1. BP Neural Network Optimization
Applying the BP (back propagation) neural network forecasting process to achieve a good prediction effect, we need to set the parameters for the network structure, specifically including the hidden layer nodes S, node transfer function TF, training function BTF, network learning function BLF, performance analysis function PF, input processing function ITF, output processing functions OPF etc. In addition, we also need to optimize the training times and rates repeatedly to achieve the best results.
In this test, we use the newff function to create the BP neural network through repeatedly changing each function parameter in the newff function. When the mean squared error of the BP neural network training results, compared against the actual value, reaches a minimum, this indicates the best parameters for the newff function.
A range of combinations of node transfer functions and training functions were tested, for example, we compared six different kinds of newff function composition. The other values of the remaining parameters are as follows: the number of training iterations is 100,000, the training rate is 0.01 and the learning accuracy is 0.000001.
Table 9 shows the value of different combinations of transfer functions and function prediction mean square difference after training. From
Table 9, we can see that the combination of different transfer functions TF and training functions
BTF give different average mean squared errors, the smaller the variance, the better the accuracy. As can be seen from the table: in combination [tansig, trainlm], the average value of the minimum mean square deviation obtained was (0.44), so we chose the [tansig, trainlm] combination. Also, the same neural network structure can also be obtained in other functions to eventually determine the best newff to create functions: newff (inputn, outputn, 5, {‘tansig’}, ‘trainlm’, ‘learngdm’, ‘mse’).
7.5.2. Comparison with BP Neural Networks
Table 10 presents a comprehensive evaluation of the results based on the four membership grades
V = {I, II, III, IV} of the presented model using fuzzy mathematical theory. When using the BP neural network model to train IP values on test samples 1–14, the actual IP values are almost the same as the ones obtained by prediction. However, the IP values of wells 15–19 greatly differ from the actual IP values. In particular, the relative error of the IP value for sample 16 reached 55%. In the application of the presented model to test the IP value of samples 1–14 samples, based on the principle of maximum membership grade, leads to more accurate comprehensive evaluation results and predict the IP of 15–19 prediction samples).
From
Table 10, we find that when predicting the IP values, the presented model outperforms the neural network model on some key measures. For example, the proposed method puts the best well (well 19) in grade II. Wells 15, 17, and 18 are mid-range and are all placed in grade III. The well with the lowest IP is in grade IV. All these results are consistent in terms of the ordering of well output after fracturing.
Meanwhile, the neural network correctly assigns positions 1 and 2 to wells 19 and 18 but it mixes up the order of the others. Well 16 in particular is ranked third by the neural network when it should be ranked fifth, with the lowest output. Hence, in terms of ordering the wells by their post fracturing output, which is critical for selecting the best wells for fracturing, the proposed method gives more reliable results than the neural network benchmark. It can also be noted that this is in an environment where the wells have a similar geology, which is where the neural network can be expected to perform well.
7.6. Discussion
The selection of a candidate well for hydraulic fracturing in fractured reservoirs is a complex multi-criterion decision process, which involves quantitative evaluation of parameters in isolation as well as their combined influence. In addition, error, uncertainty and fragile correlations between data-sets are intrinsic to this environment due to the challenge of designing and building sensors to measure complex formations in hostile environments [
7], and that uncertainty in data may be due to fuzziness rather than chance [
53]. For fracture treatments that are performed in fields where large databases are available, statistical multivariate techniques and neural networks are usually used to choose the candidate wells for hydraulic fracturing. Regression analysis also offers appealing solutions. However, their main drawback is the need to identify all the factors and then establish a linear or nonlinear model that best represents the interactions among them. This becomes unwieldy when the number of parameters is very large. In addition, regression analysis is poor at extrapolating and predicting values outside the range of the conditioning data-set [
54]. Neural networks are also promising because, unlike regression analysis techniques, it is not necessary to specify structural relationships between the input and output data. Neural networks have the ability to infer general rules, extract patterns from a set of examples and recognize input output mappings from complex multi-dimensional field data. These properties give the neural networks the ability to interpolate between typical patterns or data and generalize their learning in order to extrapolate to a region beyond their training domains. However, using neural networks for identification purposes is more useful when a large amount of data is available. In addition, neural network models cannot deal with uncertainty in data due to fuzziness [
55]. Furthermore, all the above mentioned methods rely on there being a statistical correlation between fracture performance and the field variables. In some cases, no such relationship exists and this results in a poor selection of the best candidate wells for fracturing [
56]. The Analytic Hierarchy Process (AHP) [
28] is a nonlinear framework, which is based on deductive and inductive reasoning. It makes it possible to consider several factors simultaneously and allow for interdependence and feedback among variables. However, one of the difficulties associated with the AHP method is that operators must assign a specific number, within a scale from 1 to 9, to the pair-wise comparisons in order to enable the priority vector to be computed. In AHP, when two factors are equally important, a scale value of 1 is assigned to the comparison. Weak to moderate intensities in the difference in importance of a factor, score from 2 to 4. More highly critical differences score from 5 to 8 while 9 would indicate a high level of importance of one alternative over the other. The corresponding reciprocals 1, 1/2, 1/3, ..., 1/9 are used to make a comparison in the opposite direction. However, this is restrictive in cases where the expert is unsure as to the magnitude or direction of the comparison or they do not have sufficient information to make a comparison. This makes classical AHP unsuitable for use when choices are fuzzy or there is incomplete information.
So far, all the above approaches make use of simplifying assumptions such as consideration of extreme or mean values, or the application of safety factors and cannot deal adequately with the uncertainties associated with vague or imprecise information in the objective and constraint functions [
57]. The process of selecting candidate wells is inherently error prone with uncertainty and unstable correlations between parameters. This is due in part to the challenge, in petro-physics, of designing and building sensors to measure complex formations in hostile environments [
7] and that uncertainty in data may be due to fuzziness rather than chance [
53]. Fuzzy logic has the ability to deal with human error and uncertainty in systems to a greater extent than any methods discussed above [
58]. This paper presented a fuzzy model for well selection, which is done by fusing the analytic hierarchy process (AHP) method, grey theory and an advanced version of fuzzy logic theory (FLT). It undertakes approximate reasoning and generates recommendations for candidate wells. In addition, these techniques and technologies are hybridized by using an intercommunication job-sharing method that integrates human judgment. The presented approach is able to reduce uncertainty caused by mathematical approximations and the prediction results will become more accurate as more construction data is collected. Field application shows that in the application of the proposed methodology, the best candidate wells are determined for stimulation and achieve promising production.