Benchmarking Evaluation of Building Energy Consumption Based on Data Mining
Abstract
:1. Introduction
- (1)
- The classification in the original campus building classification model is inconsistent with the reality. Influenced by the number of floors, degrees and other factors, and based on the data-driven idea, through the random forest mining of important features affecting building energy consumption and classification, with the building EUI and the original building classification label as targets, we build a random forest model and determine the importance ranking of each building feature.
- (2)
- Factor analysis is adopted to reduce the dimension of the studied features according to the importance of architectural features, and several common factors affecting architectural classification are extracted to reduce subsequent clustering errors. Then, K-means clustering method is used to cluster the extracted common factors of architectural features, and a new architectural classification is obtained. Compared with the original classification method, the new classification method has significant improvement in many indexes.
- (3)
- On the basis of the above methods, based on the study of building energy consumption factors by the second type of benchmark research method, we do not directly obtain specific energy consumption lines, but use the statistical method in the third type of energy consumption benchmark research to compare the target building with the same type of building, so as to delimit the low, medium and high energy consumption lines, and to make the energy consumption benchmark more practical. A more accurate and practical reference line for energy consumption of campus buildings is obtained by calculating the quartile line among the buildings of the same type.
2. Benchmarking Assessment of Building Energy Consumption
2.1. New Campus Building Benchmarking Assessment Method
2.2. Random Forest Algorithm to Determine the Importance of Features
2.3. Factor Analysis Is Used to Extract Common Factors of Building Features
- (1)
- Correlation investigation among variables. Factor analysis requires a strong correlation between original variables. Commonly used tests include the Kaiser–Meyer–Olkin (KMO) test for correlation coefficient and partial correlation coefficient between variables and Bartlett spherical test for independence. The calculation formula of KMO is as follows:
- (2)
- Principal component extraction. By solving the eigen value () of the correlation coefficient matrix of the original variable and the corresponding orthonormal eigenvectors (), and selecting the previous eigenvector with eigenvalues greater than 1 as the principal component of the original variable for analysis.
- (3)
- Factor load matrix calculation. The factor load matrix determined by each principal component is defined as shown in Equation (2), are the eigenvalue of the correlation coefficient matrix, () are the corresponding eigenvector.
- (4)
- Factor rotation. According to the elementary load matrix, the contribution rate of each common factor is calculated, and m principal factors are selected. By rotating the extracted factor load matrix, the matrix (where is the front m column of , and is the orthogonal matrix) is obtained, and the factor model is constructed.
- (5)
- Calculate the factor score. Using the regression method, the common factor and variables (..), make the regression, establish regression equation and then substitute variable values into the regression equation. The relationship between the factor and the original variable is shown in Equation (3). is each element of the load matrix, which is essentially the correlation coefficient between the common factor and the original variable. The factor score is obtained according to Equation (4).
2.4. Building Evaluation Cluster Analysis Based on K-Means
- (1)
- Randomly set K feature space points as the initial building clustering center;
- (2)
- Calculate the distance between the points corresponding to other buildings and K centers, and select the nearest cluster center point as the marker category for the unknown points;
- (3)
- Place the points corresponding to each building against the labeled cluster center, and recalculate the new center point of each cluster;
- (4)
- If the calculated new center point is the same as the original center point, the algorithm will be terminated; otherwise, return to the second step.
2.5. Evaluation of Clustering Effect
3. Methods
3.1. Data Preprocessing
3.2. Screen Building Features through Random Forest
3.3. Reduces the Dimension of the Remaining Building Features by Factor Extraction
3.4. Cluster the Extracted Common Factors by K-Means
3.5. New Building Energy Benchmarking
4. Results and Discussion
5. Conclusions
- (1)
- The random forest model was used to determine several main characteristics affecting building energy consumption and building classification. Considering building EUI and building the original classification label as double objectives, the random forest model was constructed successively to obtain the importance ranking of building features’ contribution to building EUI and building original classification, so that the buildings in the obtained clustering result not only adhere to mathematical laws of relation but also have the necessary similarities in practical work. In this way, the building features that have an important influence on both the EUI of buildings and the original classification of buildings can be obtained, which lays a foundation for further making improving a fine energy-saving benchmarking.
- (2)
- The dimensionality of the important building features obtained in the above steps is reduced to the building cluster type to eliminate errors by factor analysis. The K-means method is adopted for cluster analysis of the building set, and the common factors extracted from the campus buildings are clustered to remove the influence of each building feature on the energy consumption level in the classification. We aimed to solve the main problem, which was that the original building classification is not practical. Thus, the energy consumption reference line measured by the quartile method is more practical value. For each kind of building, three levels of low, medium and high energy consumption level are proposed, respectively. It makes the method of evaluating the energy use level and energy saving potential of campus buildings much more reasonable.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Acknowledgments
Conflicts of Interest
References
- Zhu, H.; Goh, H.H.; Zhang, D.; Ahmad, T.; Liu, H.; Wang, S.; Li, S.; Liu, T.; Dai, H.; Wu, T. Key technologies for smart energy systems: Recent developments, challenges, and research opportunities in the context of carbon neutrality. J. Clean. Prod. 2021, 331, 129809. [Google Scholar] [CrossRef]
- Lei, L.; Wu, B.; Fang, X.; Chen, L.; Wu, H.; Liu, W. A dynamic anomaly detection method of building energy consumption based on data mining technology. Energy 2023, 263, 125575. [Google Scholar] [CrossRef]
- Tong, P.; Zhao, C.; Wang, H. Research on the Survival and Sustainable Development of Small and Medium-Sized Enterprises in China under the Background of Low-Carbon Economy. Sustainability 2019, 11, 1221. [Google Scholar] [CrossRef] [Green Version]
- Zhang, D.; Li, H.; Zhu, H.; Zhang, H.; Goh, H.H.; Wong, M.C.; Wu, T. Impact of COVID-19 on Urban Energy Consumption of Commercial Tourism City. Sustain. Cities Soc. 2021, 73, 103133. [Google Scholar] [CrossRef]
- Amasyali, K.; El-Gohary, N.M. A review of data-driven building energy consumption prediction studies. Renew. Sustain. Energy Rev. 2018, 81, 1192–1205. [Google Scholar] [CrossRef]
- Pham, A.D.; Ngo, N.T.; Truong, T.T.H.; Huynh, N.T.; Truong, N.S. Predicting energy consumption in multiple buildings using machine learning for improving energy efficiency and sustainability. J. Clean. Prod. 2020, 260, 121082. [Google Scholar] [CrossRef]
- Xu, Z.; Kuishan, L. Sensitivity analysis of several energy-saving measures to residential building energy consumption in hot summer and cold winter area. HVAC 2008, 38, 6. [Google Scholar]
- Ma, J.; Cheng, J.C.P. Identifying the influential features on the regional energy use intensity of residential buildings based on Random Forests. Appl. Energy 2016, 183, 193–201. [Google Scholar] [CrossRef]
- Hsu, D. Identifying key variables and interactions in statistical models of building energy consumption using regularization. Energy 2015, 83, 144–155. [Google Scholar] [CrossRef] [Green Version]
- Robinson, C.; Dilkina, B.; Hubbs, J.; Zhang, W.; Guhathakurta, S.; Brown, M.A.; Pendyala, R.M. Machine learning approaches for estimating commercial building energy consumption. Appl. Energy 2017, 208, 889–904. [Google Scholar] [CrossRef]
- US Energy Star Portfolio Manager. Energy Star Score. Technical Reference. 2019. Available online: https://portfoliomanager.energystar.gov/pdf/reference/ENERGY%20STAR%20Score.pdf (accessed on 3 July 2020).
- Walker, S.; Khan, W.; Katic, K.; Maassen, W.; Zeiler, W. Accuracy of different machine learning algorithms and added-value of predicting aggre-gated-level energy performance of commercial buildings. Energy Build. 2020, 209, 109705. [Google Scholar] [CrossRef]
- Guo, Y.; Wang, J.; Chen, H.; Li, G.; Liu, J.; Xu, C.; Huang, R.; Huang, Y. Machine learning-based thermal response time ahead energy demand prediction for building heating systems. Appl. Energy 2018, 221, 16–27. [Google Scholar] [CrossRef]
- Ascione, F.; Bianco, N.; De Stasio, C.; Mauro, G.M.; Vanoli, G.P. Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: A novel approach. Energy 2017, 118, 999–1017. [Google Scholar] [CrossRef]
- Bui, D.K.; Nguyen, T.N.; Ngo, T.D.; Nguyen-Xuan, H. An artificial neural network (ANN) expert system enhanced with the electromag-netism-based firefly algorithm (EFA) for predicting the energy consumption in buildings. Energy 2020, 190, 116370. [Google Scholar] [CrossRef]
- Kim, M.; Jung, S.; Kang, J. Artificial neural network-based residential energy consumption prediction models considering resi-dential building information and user features in South Korea. Sustainability 2019, 12, 109. [Google Scholar] [CrossRef] [Green Version]
- Wei, Z.; Zou, Y.; Wang, H. Introduction on British public building energy consumption benchmark evaluation methods and en-lightenment for China. Build. Sci. 2011, 27, 7–12. [Google Scholar]
- Yong, C.; Zheng, W.; Hui, L.; Chong, M. The Inspiration of German VDI 3807 Standard to China’s Energy Consumption Quota. Constr. Sci. Technol. 2011, 22, 78–81. [Google Scholar]
- Santamouris, M.; Mihalakakou, G.; Patargias, P.; Gaitani, N.; Sfakianaki, K.; Papaglastra, M.; Pavlou, C.; Doukas, P.; Primikiri, E.; Geros, V.; et al. Using intelligent clustering techniques to classify the energy performance of school buildings. Energy Build. 2007, 39, 45–51. [Google Scholar] [CrossRef]
- Hernandez, P.; Burke, K.; Lewis, J.O. Development of energy performance benchmarks and building energy ratings for non-domestic buildings: An example for Irish primary schools. Energy Build. 2007, 40, 249–254. [Google Scholar] [CrossRef]
- Vaisi, S.; Varmazyari, P.; Esfandiari, M.; Sharbaf, S.A. Developing a multi-level energy benchmarking and certification system for office buildings in a cold climate region. Appl. Energy 2023, 336, 120824. [Google Scholar] [CrossRef]
- Marrone, P.; Gori, P.; Asdrubali, F.; Evangelisti, L.; Calcagnini, L.; Grazieschi, G. Energy benchmarking in educational buildings through cluster analysis of energy retrofitting. Energies 2018, 11, 649. [Google Scholar] [CrossRef] [Green Version]
- GB/T51161-2016; Ministry of Housing and Urban-Rural Development, PRC. Energy Consumption Standards for Civil Buildings. China Building and Building Press: Beijing, China, 2016.
- Zhao, J.; Xin, Y.; Tong, D. Energy consumption quota of public buildings based on statistical analysis. Energy Policy 2012, 43, 362–370. [Google Scholar] [CrossRef]
- Zhu, N.; Zhu, T.L.; Tong, D.D. Determination method of building energy consumption base line of campus degree days method in cold area. J. Chongqing Univ. 2016, 1, 105–112. [Google Scholar]
- Kuo CF, J.; Lin, C.H.; Lee, M.H. Analyze the the energy consumption characteristics and affecting factors of Taiwan’s convenience stores-using the big data mining approach. Energy Build. 2018, 168, 120–136. [Google Scholar]
- Deb, C.; Lee, S.E. Determining key variables influencing energy consumption in office buildings through cluster analysis of pre-and post-retrofit building data. Energy Build. 2018, 159, 228–245. [Google Scholar] [CrossRef]
- Li, S.; Chen, Y. Internal benchmarking of higher education buildings using the floor-area percentages of different space usages. Energy Build. 2020, 231, 110574. [Google Scholar] [CrossRef]
- Xie, H.; Jiang, M.; Zhang, D.; Goh, H.H.; Ahmad, T.; Liu, H.; Liu, T.; Wang, S.; Wu, T. IntelliSense technology in the new power systems. Renew. Sustain. Energy Rev. 2023, 177, 113229. [Google Scholar] [CrossRef]
- Liu, S.; Zhou, C.; Guo, H.; Shi, Q.; Song, T.E.; Schomer, I.; Liu, Y. Operational optimization of a building-level integrated energy system considering additional potential benefits of energy storage. Prot. Control. Mod. Power Syst. 2021, 6, 55–64. [Google Scholar] [CrossRef]
- Fan, G.F.; Zhang, L.Z.; Yu, M.; Hong, W.-C.; Dong, S.-Q. Applications of random forest in multivariable response surface for short-term load fore-casting. Int. J. Electr. Power Energy Syst. 2022, 139, 108073. [Google Scholar] [CrossRef]
- Ilbeigi, M.; Ghomeishi, M.; Dehghanbanadaki, A. Prediction and optimization of energy consumption in an office building using artificial neural network and a genetic algorithm. Sustain. Cities Soc. 2020, 61, 102325. [Google Scholar] [CrossRef]
- Kai, Y.; Yan, H.; Kang, L. Significance Scores of Random Forest Variables and Their Research Progress. Available online: https://www.paper.edu.cn (accessed on 12 July 2015).
- Deng, H.; Fannon, D.; Eckelman, M.J. Predictive modeling for US commercial building energy use: A comparison of existing statistical and machine learning algorithms using CBECS microdata. Energy Build. 2018, 163, 34–43. [Google Scholar] [CrossRef]
- Rodriguez-Galiano, V.; Sanchez-Castillo, M.; Chica-Olmo, M.; Chica-Rivas, M.J.O.G.R. Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geol. Rev. 2015, 71, 804–818. [Google Scholar] [CrossRef]
- Jang, S.; Elmqvist, N.; Ramani, K. MotionFlow: Visual Abstraction and Aggregation of Sequential Patterns in Human Motion Tracking Data. IEEE Trans. Vis. Comput. Graph. 2015, 22, 21–30. [Google Scholar] [CrossRef] [PubMed]
- Hong, T.; Yang, L.; Hill, D.; Feng, W. Data and analytics to inform energy retrofit of high performance buildings. Appl. Energy 2014, 126, 90–106. [Google Scholar] [CrossRef] [Green Version]
- US Energy Information Administration (EIA). Available online: https://www.eia.gov/ (accessed on 3 September 2022).
- Yang, J.; Ning, C.; Deb, C.; Zhang, F.; Cheong, D.; Lee, S.E.; Sekhar, C.; Tham, K.W. k-Shape clustering algorithm for building energy usage patterns analysis and forecasting model accuracy improvement. Energy Build. 2017, 146, 27–37. [Google Scholar] [CrossRef]
No. | Feature |
---|---|
1 | One activity in building |
2 | Area |
3 | Building shape |
4 | Exterior glass ratio |
5 | Number of floors |
6 | Year of construction |
7 | Month used |
8 | Working hours |
9 | Office equipment |
10 | CDD |
11 | HDD |
12 | Heat percent |
13 | Cool percent |
14 | Number of elevators |
15 | Number of computers |
Feature | Sample | ||
---|---|---|---|
1 | 2 | 3 | |
Single function | No | No | Yes |
First function | Office | Education | Office |
First percent | 60 | 60 | 100 |
Second function | Education | Auditorium | None |
Second percent | 40 | 40 | 0 |
No. | Feature |
---|---|
1 | Office equipment |
2 | CDD |
3 | Area |
4 | Exterior glass ratio |
5 | Number of floors |
6 | Working hours |
7 | Number of computers |
Index | Original Classification | New Classification |
---|---|---|
DBI | 1.647 | 0.645 |
CH | 81.263 | 355.391 |
Building Classification | EUI/(KW·h/m2) | ||
---|---|---|---|
Quartile 1 | Quartile 2 | Quartile 3 | |
Class 1 | 51.93 | 87.02 | 160.24 |
Class 2 | 115.79 | 145.54 | 186.90 |
Class 3 | 153.16 | 176.38 | 191.46 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wu, T.; Wang, B.; Zhang, D.; Zhao, Z.; Zhu, H. Benchmarking Evaluation of Building Energy Consumption Based on Data Mining. Sustainability 2023, 15, 5211. https://doi.org/10.3390/su15065211
Wu T, Wang B, Zhang D, Zhao Z, Zhu H. Benchmarking Evaluation of Building Energy Consumption Based on Data Mining. Sustainability. 2023; 15(6):5211. https://doi.org/10.3390/su15065211
Chicago/Turabian StyleWu, Thomas, Bo Wang, Dongdong Zhang, Ziwei Zhao, and Hongyu Zhu. 2023. "Benchmarking Evaluation of Building Energy Consumption Based on Data Mining" Sustainability 15, no. 6: 5211. https://doi.org/10.3390/su15065211
APA StyleWu, T., Wang, B., Zhang, D., Zhao, Z., & Zhu, H. (2023). Benchmarking Evaluation of Building Energy Consumption Based on Data Mining. Sustainability, 15(6), 5211. https://doi.org/10.3390/su15065211