1. Introduction
Space heating for buildings is a major energy consumer in China. As of 2022, the heating area reached 16.7 billion m
2, consuming 217 million tons of standard coal and emitting 440 million tons of CO
2 [
1]. Fossil fuels dominate, accounting for nearly 88% of heating sources, primarily through combined heat and power (CHP) generation, and gas or coal boilers, while electric boilers, heat pumps, and industrial waste heat recovery contribute only 12% [
2]. This extensive reliance on fossil fuels leads to significant CO
2 emissions. On 22 September 2020, China announced its goal to peak CO
2 emissions by 2030 and achieve carbon neutrality by 2060. Consequently, adopting clean and low-carbon heating technologies is critical to meeting increasing demand [
3].
Renewable energy-based heating technologies include solar thermal systems [
4,
5] and electric-driven heat pumps, such as ground source (GSHPs) and air source (ASHPs) heat pumps [
6]. GSHPs are further classified into surface water heat pump systems (SWHPs), groundwater heat pump systems (GWHPs), and ground-coupled heat pump systems (GCHPs) [
7]. GCHPs are considered more reliable and efficient as the ground temperature remains stable during the heating season, making them widely adopted in recent decades [
8]. However, conventional GCHPs, with heat exchangers installed at depths below 200 m, face challenges including environmental and climatic sensitivities [
9]. Moreover, large-scale land requirements [
10] and thermal imbalance in the ground over time [
11,
12,
13,
14] limit their effectiveness. To address these limitations, GCHPs have been coupled with other renewable energy sources, such as ambient air [
15,
16], solar thermal energy [
17,
18,
19], and biomass energy [
20]. Nevertheless, these hybrid systems often increase land use and initial investment, restricting wider application.
A more direct solution involves mid-deep borehole heat exchangers (MDBHEs), which harness medium-depth geothermal energy from depths of 2 ~ 3 km, where temperatures range between 70 ~ 90 °C. Originally implemented in the U.S. and Europe, MDBHEs extract geothermal heat without withdrawing groundwater, as demonstrated by field tests (
Table 1) [
21,
22,
23,
24,
25,
26,
27].
Recently, this approach has been integrated with heat pump systems, termed mid-deep geothermal heat pump systems (MD-GHPs), for space heating in China. Field tests conducted by Deng revealed that for a single 2500 m DBHE with a ground-side water flow rate of 30 m
3/h and inlet water temperature of 5 °C, the heat extraction rate could reach about 500 kW under continuous operation mode [
28] and more than 600 kW under intermittent operation mode [
29]. The heat pump’s coefficient of performance (COP) reached 5.43, while the overall system efficiency was 4.58. This high-temperature heat source enables MD-GHPs to outperform conventional shallow-depth systems in both energy savings and CO
2 reduction [
28].
Research on MDBHEs has primarily focused on heat transfer performance, with numerical simulations identifying key influencing factors. Higher ground thermal conductivity and temperature, larger MDBHE depth and diameter, and larger thermal conductivity of the outer and lower thermal conductivity of the inner tubes enhance heat extraction [
30,
31]. Additionally, operational conditions, such as lower inlet water temperatures, higher flow rates, and intermittent operation could also optimize the heat extraction performance [
32]. Long-term studies suggest that although ground temperature decreases over time, the reduction is less than 4.0% after 10 years, indicating stable long-term operation [
33,
34]. Deng [
35] also studied the long-term performance of MDBHE arrays in single-line layout and revealed that with line space decreasing from 100 m to 10 m, the maximum accumulated heat extraction capacity per heating season decreased by 0.3% to 19.1% compared to a single MDBHE. Therefore, a line space larger than 30 m was recommended. Chen [
36] compared deep enhanced U-tube borehole heat exchangers (EUBHEs) with coaxial tubes, finding that EUBHE systems achieve up to 1.2 MW of heat extraction per season, outperforming two DBHEs with equivalent borehole length. Previous studies have primarily focused on analyzing the heat transfer performance of MDBHEs, highlighting important optimization directions. However, due to the distinct operational conditions and performance characteristics of MD-GHPs compared to conventional shallow-depth geothermal heat pump systems, applying the same design parameters, equipment selection, control methods, or evaluation approaches used for traditional systems can lead to suboptimal energy performance [
37]. As such, it is crucial to establish a comprehensive, multistage evaluation index system tailored to the unique attributes of MD-GHPs.
In the field of engineering, developing comprehensive evaluation systems has gained increasing attention. Researchers have applied various methods to construct comprehensive evaluation systems across various fields, supporting optimal decision-making and comprehensive benefit assessments. Among these methods, the analytic hierarchy process (AHP) method, proposed by Saaty and Vargas [
38] in the mid-1970s, has been widely adopted due to its high feasibility and effectiveness, which is a systematic and hierarchical analysis method combining qualitative and quantitative analysis. For example, Yang et al. [
39] developed an AHP-based evaluation system for ground source heat pumps and established grading standards for 15 evaluation indices. Meng et al. [
40] combined the decision-making AHP with fuzzy mathematics to create a fuzzy comprehensive evaluation model for the green retrofitting of existing buildings. Man and Zhang [
41] set up a green building evaluation system based on the AHP and grey clustering method. Yu et al. [
42] developed an assessment method for green store buildings in China, which used the expert group decision AHP to determine the weight distributions of evaluation aspects highlighting the importance of indoor environmental quality, energy efficiency, and operation management within store buildings. Huang et al. [
43] pointed out that in previous studies, an AHP was commonly used in planning fire station layouts considering critical factors affecting fire protection coverage. Additionally, they utilized a geographic information system to establish an optimal fire station layout. Ren et al. [
44] established different operational models of the conventional geothermal heating system coupled with energy storage based on time-of-use electricity prices. They adopted a genetic algorithm to find the optimal decision variables minimizing the levelized cost of heat. Therefore, this study employed the AHP and entropy weight method (EWM), considering expert thinking and minimizing the influence of sample quality, to determine the weight distributions of evaluation indices. Linear reorganization was employed to integrate the subjective and objective weights. Additionally, fuzzy mathematics theory was used to decide each evaluation index rating.
As shown in
Figure 1, this paper addresses the existing research gap by first introducing the field test methodology for MD-GHPs. Then, drawing upon a comprehensive review of the relevant literature, standards, and practical project experience, a hierarchical entropy weight method is applied to calculate the index weights. Subsequently, a fuzzy comprehensive evaluation model is constructed to integrate the results from multiple indices. A total of 26 evaluation indices are established, covering aspects such as the heat transfer performance of the MDBHEs, energy efficiency of the heat pump systems, terminal heat consumption, overall energy use, return on investment, carbon emissions, and other environmental impacts. The structural flow of the study is illustrated in
Figure 1. This systematic evaluation framework aids in the optimization of MD-GHPs, from system design and equipment selection to practical operation. Additionally, by identifying weaknesses within the system, this evaluation method provides valuable insights for operational improvements and investment decisions, serving as a robust tool for enhancing the long-term sustainability and efficiency of MD-GHPs.
3. Analytic Hierarchy Process–Fuzzy Comprehensive Evaluation
Selecting an appropriate weighting method for evaluating comprehensive benefits in engineering is critical. The analytic hierarchy process (AHP) is a multi-criteria decision-making technique that decomposes complex decisions into smaller, manageable components. AHP promotes expert consistency through rigorous testing, resulting in reasonable weighted outcomes. Conversely, the entropy weight method (EWM) assigns weights based on the inherent information content of criteria, minimizing subjective bias [
65,
66,
67,
68]. In this section, the EWM and the AHP are integrated based on the constructed index system, which comprehensively considers both subjective and objective analyses. Then, a fuzzy comprehensive evaluation model, along with a multi-level evaluation index system tailored for MD-GHPs, is established through the fuzzy relationship matrix and evaluation factor, as illustrated in
Figure 6.
3.1. Construction of the Entropy Weight Method
EWM provides a systematic approach to weight allocation when faced with numerous criteria, assigning weights based on the entropy of each criterion’s information. Entropy serves as a measure of uncertainty, and higher entropy indicates greater uncertainty. Consequently, weights are assigned inversely to entropy values, with lower entropy corresponding to higher importance. The effectiveness of EWM in facilitating objective decision-making is especially pronounced when prioritizing criteria based on their significance. The steps in the MD-GHP Multistage Evaluation Index System are as follows:
Constructing a Standardized Matrix: The MD-GHP data matrix, comprising n evaluation indices and m evaluation projects, is normalized to form a standardized matrix using Equations (25)–(27),
Then, for indices where smaller values are preferable:
While, for indices where larger values are preferable:
where
;
refers to the value of the
i-th indicator of the
j-th evaluation project;
is the maximum value in
; and
is the minimum value in
.
Entropy Calculation: The entropy value
Hi for the
i-th indicator is calculated using Equations (28)–(30),
Additionally, to prevent situations where
, a correction formula is introduced as shown in Equation (31),
Weight Calculation: after determining the entropy values, weights are computed using Equation (32),
where
,
, and
refers to the weight value of each evaluation indicator.
3.2. Construction of the Analytic Hierarchy Process
AHP is a decision-making methodology that integrates qualitative and quantitative approaches to address complex problems. It structures decision criteria hierarchically, using pairwise comparisons to form a judgment matrix, which determines priority weights through mathematical algorithms. These weights reflect a criterion’s relative importance in decision objectives. AHP emphasizes consistency evaluation to ensure reliable judgments and adjust for any inconsistencies. This systematic approach enhances clarity and confidence in navigating complex decisions.
Judgment Matrix Construction: This paper employs the 1-9 scale method for pairwise comparisons, quantifying the relative importance of each indicator in a hierarchical structure. The scale includes five basic categories: Equal (1), Weak (3), Moderate (5), Strong (7), and Absolute (9), with intermediate values (2, 4, 6, 8) providing finer gradations between them.
Table 9 illustrates the evaluation scale. Based on this scale, a judgment matrix A can be constructed for further analysis.
- 2.
Weight Analysis: Once the judgment matrix A is established, the
m-th root of each row’s product is computed to form an m-dimensional vector. The vector is then normalized to determine the weights of each indicator. The maximum eigenvalue
λmax is calculated using the weight matrix via Equations (33)–(35),
where
refers to the weight without normalization; and
refers to the normalized weights.
- 3.
Consistency Calculation: to ensure the validity of the assigned weights, the consistency index (CI) and consistency ratio (CR) are calculated using Equations (36) and (37),
where
refers to the consistency index;
refers to a random index;
refers to the consistency ratio; and
refers to the order of the judgment matrix. If
, this indicates that the constructed judgment matrix has passed the consistency check, and the assignment of weights to indices is reasonable; otherwise, if
, this indicates that the consistency check has not been passed, and it is necessary to readjust the judgment matrix and recalculate the weights.
3.3. Integration of the Hierarchical and Entropy Weights
Using either the EWM or the AHP independently can result in either overly objective or overly subjective weights. To address this limitation, this paper combines both methods to calculate the comprehensive weight,
, using Equation (38) [
66],
where
refers to the objective weight obtained from the EWM;
refers to the subjective weight obtained from the AHP; and
refers to the combination coefficient of the two weights, with a range from 0 to 1. Here, it is set as 0.5, indicating that both weights are considered equally important.
3.4. Construction of a Fuzzy Comprehensive Evaluation Model
A comprehensive fuzzy evaluation model is essential for assessing the feasibility and performance of MD-GHPs. This model integrates both objective and subjective inputs, offering a systematic framework for decision-making. When combining the analytic hierarchy process and entropy weight method, AHP assigns priority weights through pairwise comparisons, while EWM objectively allocates weights based on criteria entropy, capturing uncertainty. The combination of these methods addresses the complexity of evaluation and improves both analytical depth and practical relevance. Fuzzy logic principles account for uncertainties and imprecision, enhancing the model’s robustness in real-world applications. This integrated approach offers a comprehensive evaluation framework, guiding informed decisions on MD-GHP project development and deployment.
Evaluation Factor Set: The evaluation factor set is determined by selecting appropriate metrics. If there are n metrics, and the evaluation factor is u, the set is represented using Equation (39),
Evaluation Set: For metrics with multiple rating levels, a rating scale is established based on specific criteria. Assuming there are mmm levels, the evaluation set is represented using Equation (40),
Fuzzy Relationship Matrix: Given
n influencing factors and mmm levels, an
n ×
m fuzzy relationship matrix
Y is constructed. Each
yi factor forms part of the matrix. The membership degree, representing the association between each metric and rating level, can be calculated using triangular membership functions using Equations (41) and (42),
The triangular membership function is defined by Equation (43),
where
with
refers to the membership degree of the
n-th influencing factor to the
m-th evaluation level;
,
and
, respectively, represent the parameters of the triangular membership function, denoting the starting point, peak point, and ending point of the membership function.
Matrix calculation: In fuzzy comprehensive evaluation, factors hold varying importance. By applying fuzzy operators, the membership matrix can be combined with weight results. The weighted average operator is often used, highlighting key factors while considering all metrics. The final comprehensive membership degree is calculated using Equation (44),
where
refers to the result of the fuzzy matrix calculation;
refers to the membership degree matrix; and
refers to the comprehensive matrix calculated based on EWM and AHP.
Defuzzification and Final Evaluation Result: The final step is defuzzification, which translates the fuzzy evaluation result BBB into a crisp score or decision. This is typically accomplished using the weighted average method, which calculates a single score that represents the overall performance of the MD-GHPs. It is defined using Equation (45),
where
is the overall score of the system;
represents the membership degree of the system at the
i-th evaluation level; and
corresponds to the specific value assigned to the
i-th evaluation level in the set
.
This weighted average calculation combines the influence of all factors and their corresponding membership degrees, producing a clear, quantifiable result that reflects the system’s overall performance.
4. Evaluating the Performance of Practical Projects
4.1. Analysis of the Practical Operation Performance of MD-GHPs
To validate the multistage evaluation index system for MD-GHPs developed through the AHP–EWM and fuzzy comprehensive evaluation model, the research team adopted a two-pronged approach. First, field tests were conducted on four MD-GHP projects in Shaanxi Province, China, to gather data for an EWM-based analysis, which derived objective weights. Simultaneously, a weights questionnaire was distributed to senior experts for an AHP-based analysis to obtain subjective weights. By combining the comprehensive weights with the fuzzy comprehensive evaluation model, the team analyzed and assessed the projects’ performance, effectively verifying the system’s reliability and validity.
Table 10 displays the field test results for the corresponding criterion layer indices A to F for the four MD-GHP projects. The results indicate no significant differences in resource conditions among the projects. However, the heat exchange performance of Project MG-4 was notably higher in both instantaneous and cumulative heating power compared to the others, despite its relatively poorer long-term operational performance. In terms of energy performance, Project MG-4 also showed prominent results. All projects exhibited average performance regarding building thermal demand and grid responsiveness. Although MG-4 had the lowest annual operating costs, its investment payback period was the longest. Overall, the varying performances across different dimensions highlight the necessity for further rigorous analysis based on scientific methods.
4.2. Weight Calculation of MD-GHPs
4.2.1. EWM Weight Calculation of MD-GHPs
The EWM determines objective weights by quantifying the variability of each indicator, ensuring a data-driven approach to weighting. In this study, field test results from the MD-GHP projects were used to construct a decision matrix, following Equations (20)–(22) to ensure an accurate representation of each indicator’s performance. After constructing the matrix, a standardization process was applied to normalize the data and eliminate any scale inconsistencies. Using Equations (23)–(26), the entropy for each indicator was calculated to capture the degree of variation and uncertainty within the data. These calculations culminated in the entropy weight results shown in
Table 11, which provide objective insight into the relative importance of each indicator.
Key indices emerged with high weights, including A5 at 0.055, B3 at 0.054, and D1 at 0.056. These indices emphasize the critical role of factors such as soil thermal properties, long-term heat extraction efficiency, and heating demand in evaluating system performance. The objective weighting highlights how these factors contribute significantly to the overall performance of MD-GHP systems, reflecting their importance within the operational context.
4.2.2. AHP Weight Calculation of MD-GHPs
A questionnaire was distributed to 15 experts from universities and construction and design companies to allocate subjective weights for the criterion and index layers using the AHP. The experts’ responses were used to construct judgment matrices, followed by consistency checks.
The criterion layer judgment matrix constructed based on the responses to the questionnaire is as follows:
The independent judgment matrices for criterion layer A to criterion layer F constructed based on responses to the questionnaire are as follows:
Table 12 shows the calculated weights, and
Table 13 illustrates the consistency ratios, all of which are below 0.1, confirming the validity of the analysis.
Based on the calculated weights, experts subjectively determined that the F Layer (Energy Saving and Economic Benefits) is the most important in the criterion layer, with a weight of 0.444. Within this category, F5 (Investment Payback Period) was identified as the most critical sub-indicator, assigned a weight of 0.510. Conversely, the E Layer indicator (Grid Dynamic Response Capacity) received a weight of only 0.025, indicating a lower level of emphasis from stakeholders regarding this sub-indicator.
4.2.3. Comprehensive Weight Calculation for MD-GHPs
Combining the results of the EWM-based analysis and the AHP-based analysis, comprehensive weights were derived (Equation (34)), integrating both subjective and objective factors.
Figure 7 illustrates the comprehensive and sub-item weights for each index layer. As shown, the comprehensive weights of the B1 and F5 indices are significantly higher than those of the other indices, primarily due to the influence of subjective weighting, which elevates their perceived importance. It is evident that compared to the distribution of objective weights, the subjective weights display greater variability, likely due to differences in perception among experts from various industries.
4.3. Establishment of Evaluation Model for MD-GHPs
Building on the content of
Section 2, the scoring standards for each index layer are provided in
Table 14. For each indicator, after determining specific evaluation levels, reference is made to set values from existing standards or to quartiles derived from nationwide data distribution. Depending on the characteristics of each indicator, appropriate mathematical and statistical methods, such as equal spacing interpolation or equal proportion floating, are used to determine the remaining levels. These scoring standards form the basis for determining membership degrees using the triangular fuzzy function method.
Based on Equations (32) and (33), during the fuzzy composition process of the factor sets, the hierarchical weight of each index is determined using the AHP–EWM approach. The comprehensive membership degree of the factor set in the criterion layer represents the overall fuzzy evaluation result. After processing the data from the decision set, a comprehensive score is obtained, which reflects the actual performance of the system. This comprehensive score is a useful tool for identifying weak points in the operation of MD-GHP systems.
Figure 8 illustrates the evaluation results of the multistage index system for the four field test projects. From the figure, it is clear that the C and E layers are relatively weak, with most of the sub-indices falling in the mid-level range, and no sub-indicator achieving exceptional results.
4.4. Analysis of Evaluation Results for MD-GHPs
Based on the model and evaluation results, this section provides an in-depth analysis of the factors affecting performance scores across the four MD-GHP projects.
Figure 9 illustrates the comprehensive scores for each project: MG-1 scored 61.56, MG-2 scored 58.33, MG-3 scored 72.73, and MG-4 scored 78.41. In this figure, green represents sub-indices rated as “Very Good,” while red denotes “Poor” ratings. Across the evaluation categories, several key factors contribute to performance differences.
In terms of resource conditions (A-Layer), all projects performed well due to thorough planning and site selection, ensuring stability for effective system operation. For heat transfer performance of the MDBHEs (B-Layer), MG-3 and MG-4 demonstrate superior scores, particularly in index layer B3, attributed to factors such as lower inlet water temperature, higher flow rates, and greater DBHE depths. MG-4, in particular, benefits from its DBHEs reaching 2800 m in depth and an intermittent operational mode (11 h on, 13 h off), which promotes greater heat extraction and higher outlet water temperature. Through index layer B4, it can also be seen that all four projects have the potential for long-term sustainable operation. The energy performance of the MD-GHPs (C-Layer) also highlights MG-4 as an outlier, as it employs a specially designed, second-generation heat pump capable of handling large temperature variations (20 to 35 °C) and partial load ratios, unlike the conventional SD-GHPs used in MG-1 to MG-3. Additionally, MG-4’s transmission and distribution system benefits from an optimized design, enhancing temperature differentials and reducing water resistance. Regarding building space heating demand (D-Layer) and grid dynamic response capacity (E-Layer), all projects reveal improvement opportunities related to building design, heating pipe network enhancements, and operational strategies to align with grid demands. Lastly, energy-saving and economic benefits (F-Layer) show MG-4 outperforming the others due to its use as an office building, where separate heating systems in traditional offices generally lead to higher energy consumption. The use of MD-GHPs in office settings, therefore, demonstrates notable economic and energy-saving advantages.
In summary, these findings underscore the major factors impacting disparities in project performance, including system depth, operational modes, heat pump design, and tailored operational strategies. Future optimizations should focus on these elements to improve MD-GHP system performance across multiple metrics.
4.5. Analysis of Energy-Saving and Emission Reduction of Different Heating Sources
To better evaluate the energy saving and CO
2 emission reduction effect of MD-GHPs, the energy performance of conventional heat pumps and optimized heat pumps in MD-GHPs have been compared with heat pumps in ASHPs, SD-GHPs, and gas boilers.
Table 15 shows a comparison of the energy cost, primary energy consumption, and CO
2 emissions of these systems. For the analysis, the cumulative heating consumption and the energy efficiency of MD-GHPs in MG3 have been applied, which reach 17.48 GWh and 5.09, respectively.
For the initial cost of MD-GHPs, based on feedback from the project owner, the initial cost per MDBHE reached about CNY 2.0 million per MDBHE with a depth of 2500 m in Xi’an. Then, considering the device and installation costs of MDBHEs and heat pump systems, the total initial cost of MD-GHPs in MG3 reached CNY 26.05 million. It can be seen that the energy efficiency of MD-GHPs was superior to that of the other three types of heat pumps. Thus, the electric consumption reached only 3.43 GWh; while the electric consumption of ASHPs, SD-GHPs, and user-side water pumps in gas boiler systems reached 7.28 GWh, 6.22 GWh, and 0.35 GWh, respectively. Notably, the electric consumption of MD-GHPs is 52.8% and 44.8% lower than ASHPs and SD-GHPs, respectively. Simultaneously, gas boilers require a gas consumption of 1.84 million Nm3. In terms of primary energy consumption, the MD-GHPs produced energy saving rates that were 52.8%, 44.8%, and 58.3% higher than those of ASHPs, SD-GHPs, and gas boilers, respectively. In addition, the CO2 emission reduction rate of MD-GHPs was 52.8%, 44.8%, and 48.5% higher than those of ASHPs, SD-GHPs, and gas boilers, respectively.
From the perspective of operational energy cost, taking an electricity price of 0.8 CNY/kWh and a gas price of 3.6 CNY/Nm
3 in Xian, the operational energy cost of MD-GHPs reached CNY 2.75 million while the energy cost of ASHPs, SD-GHPs, and gas boilers reached CNY 5.83 million, CNY 4.98 million, and CNY 6.90 million, respectively. Thus, the energy cost of MD-GHPs was 52.8%, 44.8%, and 60.2% lower, respectively. To further evaluate the economic effect, the initial costs of different heat sources were assumed according to a previous study [
28] at CNY 26.05, 7.56, 13.61, and 6.35 million for MD-GHPs, ASHPs, SD-GHPs, and gas boilers, respectively. Although the initial costs of MD-GHPs were significantly higher, due to the higher energy efficiency and significant energy saving effects, the static incremental payback period of MD-GHPs was about 6.0, 5.6, and 4.7 years compared with ASHPs, SD-GHPs, and gas boilers, respectively. That is to say, MD-GHPs are superior to other heat sources in terms of energy cost, primary energy consumption, and CO
2 emissions, making them more suitable for buildings space heating from the perspective of long-term operation and development.