Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model

Wu, Wenyu; Deng, Qinli; Shan, Xiaofang; Miao, Lei; Wang, Rui; Ren, Zhigang

doi:10.3390/buildings13112721

Open AccessArticle

Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model

by

Wenyu Wu

¹,

Qinli Deng

^1,2

,

Xiaofang Shan

^1,2,*,

Lei Miao

^1,2,*,

Rui Wang

³ and

Zhigang Ren

^1,2

¹

School of Civil Engineering and Architecture, Wuhan University of Technology, No. 122 Luoshi Road, Wuhan 430070, China

²

Sanya Science and Education Innovation Park, Wuhan University of Technology, No. 5 Chuangxin Road, Yazhou District, Sanya 572024, China

³

Logistics Support Office, Wuhan University of Technology, No. 122 Luoshi Road, Wuhan 430070, China

^*

Authors to whom correspondence should be addressed.

Buildings 2023, 13(11), 2721; https://doi.org/10.3390/buildings13112721

Submission received: 28 September 2023 / Revised: 17 October 2023 / Accepted: 24 October 2023 / Published: 28 October 2023

(This article belongs to the Section Building Energy, Physics, Environment, and Systems)

Download

Browse Figures

Versions Notes

Abstract

:

In the building field, campus buildings are a building group with great energy-saving potential due to a lack of reasonable energy management policies. The accurate prediction of power energy usage is the basis for energy management. To address this issue, this study proposes a novel combined forecasting model based on clustering results, which can achieve a short-time prediction of daily electricity based on a campus building’s electricity data over the past 15 days. Considering the diversity of campus buildings in energy consumption and functional aspects, the selected campus buildings are firstly classified into three categories using K-Means clustering in terms of their daily power consumption. Compared with the mainstream building energy consumption prediction models, i.e., LSTM and SVR, the results show that the combined forecast model is superior to other models. Furthermore, an average percentage fluctuation (APF) index is found to be close to the MAPE, which can reflect the prediction accuracy in advance.

Keywords:

time series prediction; campus buildings; electric consumption; combined forecasting method

1. Introduction

With the development and progress of society, people’s demands for a comfortable living, work, and living environment are becoming increasingly high, leading to a gradual increase in the proportion of building energy consumption in global energy. To lower the environmental and economic burden caused by the increasing building energy demand, improving the energy efficiency of buildings would be an effective solution [1]. Building energy prediction, also known as building energy estimating or forecasting, is critical for efficient energy use in buildings because it can help develop an energy-efficient building design, automate the functions of the building and its energy systems, and plan its energy distributions [2]. With the progress and development of science and technology, researchers have made significant progress toward improving building energy prediction. Various methods, including physical models and data-driven models, have been proposed and verified. Among these forecasting methods, data-driven models have recently received great attention due to their convenient modeling methods and accurate forecasting capabilities [3,4,5].

As the most populated country in the world, China’s building energy consumption accounts for a large global share. In 2021, there were around 4.82 million teachers and college students in China, accounting for 3.41% of the Chinese population [6]. However, most of the university buildings were built in the last century, some of which are historical buildings symbolizing the cultural characteristics of the campus. This has an important negative impact on the effective collection of building-related information. Based on limited historical datasets of poor quality, it is a significant challenge to obtain reasonable and reliable campus building energy consumption prediction results.

The occurrence of problems in data quality usually determines the accuracy of a prediction task. To solve problems in data quality, data preprocessing can be carried out to filter useful data and remove outliers. The clustering algorithm, a data preprocessing method, is used to classify the data in the preprocessing stage to improve the accuracy of the prediction model, and it has been proven to be an effective method [7]. Juan Sala used the clustering method to classify the historically similar days of household electricity consumption and used logical regression and a random forest (RF) algorithm to predict electricity consumption [8]. Yang used the k-shape algorithm to classify buildings according to their energy consumed per hour and per week and used the classification results for Support Vector Regression (SVR) model prediction. The experiment showed that this preprocessing significantly improves the accuracy of the prediction model [9].

General data-driven models can be divided into two categories: regression models and machining learning algorithms. The typical regression models include Linear Regression (LR), Auto-Regressive Moving Average (ARMA), and Auto-Regressive Integrated Moving Average (ARIMA). Widely used machine learning models include Long Short-term Memory (LSTM), Regression Tree (RT), and SVR [10,11,12]. Most of the existing research has improved the accuracy of building energy consumption prediction models by studying the optimization and update of traditional algorithms. For example, Karijadi used complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) to transform data into several components and then used RF and LSTM to predict a building’s energy [13]. Wang Ran improved a new integration model (stacking model) to solve complex multifactor engineering tasks [14]. Luo proposed an adaptive LSTM neural network that is better than the existing feedforward neural network and LSTM-based prediction models in accuracy and robustness [15]. Jin proposed a novel hybrid AI-empowered forecasting model that combines singular spectrum analysis (SSA) and parallel long short-term memory (PLSTM) neural networks [16]. Ding divided campus electricity consumption into two categories—“basic” and “variable”—and established a two-part building electricity forecasting model based on human behavior [11].

In terms of the timescale of the forecast, electricity consumption forecasting can be divided into four categories: long-term (a year or more) prediction, medium-term (between a week and a year) prediction, short-term (hours to a week) prediction, and very short-term (from minutes to an hour) prediction [17,18,19]. Long-term and mid-term forecasting, which need extensive historical data, can provide useful reference values for strategic planning. Different from long-term prediction, short-term and very short-term forecasting have implications for energy management that only needs energy consumption data from within a few days [20]. The focus of this study is to use small amounts of historical data to predict the electricity consumption of campus buildings on the following day. It has important reference value for campus building electricity management personnel.

For the short-term prediction of campus buildings, Luo found that the genetic algorithm–deep neural network (GA-DFNN) model can obtain the predicted electricity consumption in an hour or a week by inputting weather conditions, historical data, and time indicators. Compared with the DFNN model, the GA-DFNN model was proved to have better prediction performance due to the optimization ability of GA [21]. Reddy, A. performed short-term electricity consumption prediction using ensemble learning with a consideration of historical energy consumption values to forecast the energy consumption in the next 4 h [22]. However, most colleges and universities do not have long-term effective historical electricity consumption data, and comprehensive climate data can only be obtained from local climate stations.

Based on the above research, people have performed a variety of research on building energy prediction and have also accomplished great achievements in the variation of the prediction model. However, the energy management system of many Chinese universities is relatively backward, and it is difficult to collect relevant data such as building parameters, indoor human behavior, and weather. The clustering method can be used to improve prediction accuracy by clustering the existing historical data and then optimizing the prediction model according to the clustering results. Therefore, in this study, the K-means algorithm is used to cluster the daily electricity consumption of buildings, and a new combination forecasting model for short-term forecasting is proposed based on its clustering results, which is practically operable for logistics groups and provides useful guidance for energy system managers in advance.

2. Methodology

2.1. Overall Flowchart of the Research

To analyze the power energy usage law of different campus buildings, the studied buildings are first classified into different categories; then, the targeted energy management policy can be made. Moreover, the accurate prediction of power energy consumption is the foundation of energy management. In searching for the optimal prediction model, it is found that the ARMA and LR models have obvious opposite effects on the fluctuation of time series. By combining the two models, the prediction error can be reduced despite time series fluctuations. Therefore, a combined forecasting model based on the clustering results of the daily electricity consumption of campus buildings is proposed.

The overall framework of this study is presented in Figure 1. It mainly consists of three steps, i.e., data processing, time series prediction, and model evaluation:

Step 1: Filtering out abnormal data from the raw data and dividing 29 buildings into three categories using the K-means clustering method.

Step 2: Comparing the prediction results of ARMA and LR to propose an integrated model (ARMA-LR).

Step 3: Comparing the ARMA-LR models with the mainstream prediction algorithm and evaluating the predicted accuracy of these models based on metrics of MAE, RMSE, and MAPE.

This study collected the daily power consumption data of 29 buildings in a university in Wuhan from 1 January 2020 to 31 December 2021, and a total of 21,141 daily power consumption data was obtained. Due to the influence of COVID-19, students were studying at home from 1 January 2020 to 31 August 2020; therefore, this part of the data was excluded, and the outliers and abnormal data were ignored.

Through the above process of eliminating data, the first 100 days were used to divide the buildings into three categories by using the clustering method. The initial 15 days of training data was the beginning of the prediction and the last 150 days of validation data was the prediction reliability of the test model.

2.2. K-Means Clustering

Since the changing pattern of daily power energy consumption varies across building types, the relationship between power energy usage and building types needs to be clarified. Thus, appropriate energy management strategies can be formulated in terms of building categories. This paper adopts the K-means clustering method to classify 29 campus buildings into 3 categories according to daily electricity consumption, i.e., high-energy consumption buildings, medium-energy consumption buildings, and low-energy consumption buildings.

The principle of the K-means algorithm is to calculate the Euclidean distance between each point and the centroid based on the planning of the initial centroid in advance. After classifying according to the Euclidean distance, the centroid is calculated again, and the classification process is repeated until the classification result does not change [23,24,25,26].

As shown in Table 1, 29 buildings were clustered into three categories: high-energy consumption buildings, medium-energy consumption buildings, and low-energy consumption buildings. The three initial centroids are 1,000,300 and 500.

After iterative calculation, the three types of results of the three semester periods are different, as shown in Table 1. The primary median electricity consumption of the three types of buildings is 100, 500, 1400. The number of buildings of each type is the average, which is 10, 10, and 9, respectively. Meanwhile, high-energy-consuming buildings include 5 high-rise offices, 3 high-rise laboratories, and 2 high-rise teaching buildings, with a large number of administrative personnel, students, and teachers. Medium energy-consuming buildings mainly consist of 7 mid-level office buildings and 3 mid-level teaching buildings. Low-energy-consuming buildings consist of 9 small laboratories.

2.3. Fluctuation

After eliminating the outliers in the energy consumption time series of buildings, their time series still fluctuates frequently. Generally, the coefficient of variation (Cv) can be used to evaluate the fluctuation level of a time series. We propose a new indicator, average percentage fluctuation (APF), which is a dimensionless indicator similar to the Cv, and both indicators can represent the degree of the volatility of a sequence. Compared to Cv, APF is more relevant to time series because the calculation of Cv does not consider the sequence order, which is included in APF. Therefore, APF can better describe the volatility of time series compared to Cv.

C v = \frac{\sqrt{\frac{\sum {(h_{t} - \bar{h})}^{2}}{n - 1}}}{\bar{h}} * 100 %

(1)

A P F = \frac{1}{n} \sum |((h_{t + 1} - h_{t})) / h_{t}| * 100 %

(2)

where

h_{t}

is the daily electricity consumption of one day,

h_{t + 1}

is the daily electricity consumption of the next day.

\bar{h}

is the average value of the daily electricity consumption of the whole time series.

As shown in Figure 2, the APF of high-energy consumption buildings, medium-energy consumption buildings, and low-energy consumption buildings is 7.95%, 11.7%, and 12.74%, respectively. Meanwhile, the Cv of the three building categories is 12.80%, 14.06%, and 14.12%, respectively. The description trend of time series fluctuations for the three buildings, when using APF and Cv, is consistent: both decrease with an increase in building energy consumption. This also indicates that the higher the energy consumption of a building, the smaller its fluctuation.

2.4. Combined Forecasting Model

Although the ARMA model and LR model are both linear models, these two time series models are feasible in terms of data patterns. It was found in the experiment that the ARMA model is more sensitive to the fluctuation of the original time series, subsequently enlarging the effect of data fluctuation on the prediction results, which results in predictions that are too large and too small. Meanwhile, the regression model is less sensitive to the fluctuation of the original data and will respond slowly to the change and weaken the impact of fluctuation on the prediction results, which implies slightly smaller or larger prediction results. Therefore, these two models can complement each other in time series prediction. In this study, the two models are combined, i.e., the ARMA-LR model, as shown in Equation (3), where

ω_{1}

is the weight of the ARMA model,

y_{1}

is the prediction result of the ARMA model,

ω_{2}

is the weight of the LR model, and

y_{2}

is the prediction result of the LR model. Because of the energy consumption law of different building types, both

ω_{1}

and

ω_{2}

change with the building type. In the ARMA-LR model, the weights of the two models are trained based on the training effectiveness of the training set, using MAE as the evaluation indicator. In this study, the first 100 days of training for three types of buildings were used for training, and the weights of the model (

ω_{1}

and

ω_{2}

) were determined through training. The values of

ω_{1}

and

ω_{2}

are presented in Table 2.

y = ω_{1} y_{1} + ω_{2} y_{2}

(3)

3. Results and Evaluation

3.1. Prediction Accuracy Evaluation Index

Mean average error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) are the most commonly used indicators to evaluate the accuracy of prediction models. MAE and RMSE are usually used to compare the effectiveness of prediction models. MAPE is used to evaluate the accuracy of the prediction model. The smaller the MAPE, the higher the prediction accuracy of the evaluated model. The mathematical expressions of MAE, RMSE, and MAPE are expressed as shown in Equations (4)–(6), respectively.

M A E = \frac{1}{n} \sum_{t = 1}^{n} |{\hat{y}}_{i} - y_{i}|

(4)

R M S E = \sqrt{\frac{1}{n} \sum_{t = 1}^{n} {({\hat{y}}_{i} - y_{i})}^{2}}

(5)

M A P E = \frac{1}{n} \sum_{t = 1}^{n} |\frac{y_{i} - {\hat{y}}_{i}}{y_{i}}|

(6)

where t represents the time,

y_{i}

represents the forecast made for period t, and

{\hat{y}}_{i}

represents the actual observation at time t.

3.2. Comparison of Prediction Results among Three Models

The prediction results of two representative buildings in each building cluster are selected. The black line, red line, green line, and blue line represent the real daily electricity data and prediction results of the ARMA model, LR model, and ARMA-LR model, respectively.

As presented in Figure 3, the prediction results of the three prediction models of the first typical building within the high-energy consumption buildings category are presented. The average daily energy consumption of the building is 2482.19 kWh. Compared with real data, the prediction accuracy of the ARMA model is the highest among the three models during the first 50 days, with a long peak-to-trough span. The prediction accuracy of the LR model is the highest from 100 days to 150 days, with a short peak-to-trough span. Overall, the ARMA-LR model’s prediction result is not the best for each day, but its accuracy is the best over 150 days, and it reduces the impact of excessive errors in some of the forecast results of the ARMA and LR models.

As shown in Figure 4, the prediction results of the second typical building of the high-energy consumption buildings are presented. The average daily energy consumption of the building is 1782.49 kWh. Compared with the first typical building, the real data of this building do not have large seasonal trends such as long-lasting trends like those in the initial 50 days from the first typical building test. Compared with the first 50 days and the period from 100 days to 150 days, the peak-to-trough difference of 50 days to 100 days is large, and the LR model performs better than the ARMA model. The peak-to-trough span is stable over 150 days.

As illustrated in Figure 5, the prediction results of the first typical building of the medium-energy consumption buildings are presented. The average daily energy consumption of the building is 558.41 kWh. Compared with the other buildings, the prediction results of the three models are similar. Although the fluctuation amplitude is smaller than in other buildings, the fluctuation frequency is higher.

As shown in Figure 6, the prediction results of the second typical building of the medium-energy consumption buildings are presented. The average daily energy consumption of the building is 727.24 kWh/d. Compared with the data from 50 days to 150 days, the real data of the first 50 days is stable, and the peak-to-trough span is smaller. Thus, the LR model performs better than the ARMA model in the initial days. It was found that, from 50 days to 150 days, the ARMA model performs better than the LR model in detecting seasonal trends. The ARMA-LR model can neutralize the advantages and disadvantages of the two models and obtain a prediction result with a relatively stable error.

As indicated in Figure 7, the prediction results of the first typical building of the low-energy consumption buildings are presented. The average daily energy consumption of the building is 120.192 kWh/d. Compared with the high-energy consumption buildings and medium-energy consumption buildings, the peak-to-trough span of this building is consistently small. When the peak-to-trough difference is small, it was found that the ARMA model performs better than the LR model. But the LR performs better during the other days. Therefore, the ARMA-LR model performs better than the two models.

As shown in Figure 8, the prediction results of the second typical building of the low-energy consumption buildings are presented. The average daily energy consumption of the building is 108.59 kWh/d. Compared with the high-energy consumption buildings and medium-energy consumption buildings, both the two typical buildings of the low-energy consumption buildings have an obvious seasonal trend. Moreover, while some of the peak-to-trough spans are large, others are small. The reason for this is that the factors influencing building energy consumption have the largest influence on low-energy consumption buildings compared to others. Therefore, the ARMA-LR can solve the limitations of the ARMA model and the LR model.

3.3. Evaluation of the Combined Forecasting Model

The indicators of the above three models are shown in Table 3, Table 4 and Table 5. It can be found that the overall MAPE of the high-energy-consuming buildings is the lowest, but the overall MAE and RMSE are the highest. low-energy consumption buildings have the highest overall MAPE, but MAE and RMSE are the lowest. It can be seen from the above results that, when taking volatility as the evaluation standard, the prediction result of high-energy-consuming buildings is the best, and the prediction result of the corresponding low-energy-consuming buildings is the worst. If the specified value of the offset is used as the evaluation standard, the prediction result of high-energy-consuming buildings is the worst.

Comparing the overall indicators of the three models indicates that the ARMA-LR model obtains the best prediction result and outperforms the ARMA model and the LR model. In terms of high-energy consumption buildings, the ARMA model is better than the LR model, and, in terms of medium-energy consumption buildings and low-energy consumption buildings, the LR model is better than the ARMA model. This conclusion is also consistent with the previous conclusion from the prediction results and further confirms that the ARMA-LR model integrates the advantages of the two models. In order to verify the superiority of the ARMA-LR model, this paper also selects two classical prediction algorithms—SVR and LSTM—for comparison. SVR has good performance, but LSTM has poor performance. This also proves that the deep learning algorithm performs poorly in the case of fewer data. However, the ARMA-LR model proposed in this paper is superior to the other four algorithms in predicting the three different building types.

3.4. The Relationship between APF and MAPE

Figure 9, Figure 10 and Figure 11 present the comparison results of the APF and MAPE of the ARMA-LR prediction model for the three building clusters. Among them, the horizontal axis of the graph represents the statistical sequence of each building within 29 buildings. It is interesting to find that the APF result is very close to that of the MAPE. Within medium-energy consumption buildings and low-energy consumption buildings, the APF is a little larger than MAPE. For high-energy consumption buildings, the APF is generally smaller than MAPE.

The relationship between MAPE and APF for different types of buildings is obtained by using a univariate LR. Within three building clusters, the relationship between APF and MAPE is expressed in Equations (7)–(9), respectively.

R^{2}

represents the fitting degree of the LR equation (Equation (10)), the

R^{2}

of the high-energy building is 0.8671, the

R^{2}

of the medium-energy building is 0.9318, and the

R^{2}

of the low-energy building 0.7527.

M A P E = 0.8266 * A P F + 1.7753

(7)

M A P E = 0.7343 * A P F + 2.5297

(8)

M A P E = 0.6819 * A P F + 3.2782

(9)

R^{2} = \frac{\sum_{i = 1}^{n} \hat{y_{i}} - \bar{y}}{\sum_{i = 1}^{n} y_{i} - \bar{y}}

(10)

The LR results are shown in Figure 12, Figure 13 and Figure 14, and it can be seen in the results that the best result of the LR model in three categories is in the medium-energy consumption buildings. This is because the APF of medium-energy consumption buildings is high. The reason why the results of the LR model are not the best for low-energy consumption buildings is that the APF is more than 10%, and these buildings affect the result of LR.

Based on the relationship between MAPE and APF, it can be found that, as building energy consumption increases, the proportion of APF increases. It can be seen in Section 2.3 that the higher the building energy consumption, the smaller the APF. Therefore, the smaller the fluctuation of the time series of the building itself, the higher the prediction accuracy.

Based on the above results, we can conclude that the relationship between APF and MAPE is linear. The ARMA model and the regression model—being, in essence, linear time series prediction methods—can represent the trend of the original time series. APF represents the fluctuation change of the original time series, that is, the deviation of its time series trend. MAPE represents the difference between the predicted time series trend and the true value. In principle, these two indices are equal. But due to the noise of the original data, there are some discrepancies between the two values. Therefore, APF can be used as an indicator to evaluate the short-term linear prediction potential of building energy consumption, which can be used during the training phase to evaluate the prediction potential of each building and reduce the workload of building energy management personnel.

4. Discussion

Some universities in China lack efficient building energy management systems, resulting in limited building energy consumption information. Most of them only collect energy consumption through electricity meters and have not established a comprehensive building energy management system. At present, research on energy consumption in university buildings is mainly based on more comprehensive datasets with sufficient building information [11,13,14,15,16]. Therefore, based on the current situation of inadequate building energy management systems in Chinese universities, this study used the daily electricity consumption of campus buildings to achieve the short-term forecasting of building energy consumption. Twenty-nine university buildings measured using electricity meters in a certain university were selected as the sample, covering large, medium, and small office buildings, teaching buildings, experimental buildings, and other small, specialized buildings, which can represent all types of buildings in a university. Therefore, this study can help building managers in Chinese universities establish short-term prediction models for university buildings, use prediction data to conduct building energy consumption warnings, and formulate building energy consumption management policies.

When predicting building energy consumption, data preprocessing is essential to attaining final prediction accuracy. In data preprocessing, it is necessary to analyze the features of raw data, which can determine the necessity of subsequent predictions. The fluctuation of the time series of building energy consumption can reflect the potential of building energy consumption prediction. In general, it is often difficult to obtain accurate predictions for buildings with large fluctuations in time series. However, the traditional indices of standard deviation (SD) and Cv for describing sequence fluctuations cannot incorporate the temporal nature of the time series in the calculation process. To compensate for the limitation of SD and Cv, this study proposes a new indicator, APF, that includes the temporal sequence of building energy consumption. Compared with the traditional indicator Cv, APF includes a time series of building energy consumption, which can more significantly demonstrate the volatility of daily energy consumption changes in buildings. At the same time, when faced with the linear prediction of the time series of energy consumption in university buildings, this indicator has a direct linear relationship with the MAPE. Therefore, it can be used to evaluate the accuracy of short-term linear prediction models for the daily electricity consumption of buildings, which greatly increases the reliability of a building energy consumption prediction potential assessment.

5. Conclusions

This study collected the electricity consumption data of campus buildings from 1 January 2020 to 31 December 2021. Firstly, the processed building electricity consumption data were divided into three categories according to electricity consumption using K-means clustering; then, time series fluctuation analysis was carried out using Cv and the APF model proposed in this study. By comparing two time series forecasting models, i.e., LR and ARMA, this study proposes a combined model of LR and ARMA. By analyzing the fluctuations and predicted results of building energy consumption sequences, the findings are as follows:

A new indicator, i.e., APF, is proposed, which can describe the fluctuation of time series. Compared with the traditional indicator, Cv, it considers the temporal nature of a given time series. At the same time, the results prove that, for the short-term linear prediction of daily energy consumption in university buildings, the APF of the time series of building energy consumption has a linear relationship with the MAPE, and the values of the two indices are close. However, whether this indicator can be used for the nonlinear prediction of building energy consumption or for the prediction of other objects is uncertain. Therefore, future research can use this indicator for the nonlinear and long-term prediction of building energy consumption to tap into its potential use;
This study proposes a short-term linear prediction model, ARMA-LR, for assessing the daily energy consumption of university buildings. The weights of the ARMA-LR model need to be trained using MAE based on the training set. This study demonstrates that, in the face of incomplete building energy management systems, this model outperforms current mainstream prediction models in the short-term linear prediction of building energy consumption in universities. But, whether the model can perform well in nonlinear and long-term predictions is questionable. Future research can use this model for the long-term and nonlinear prediction of building energy consumption to further investigate its predictive potential.

Author Contributions

Methodology, X.S.; Software, W.W. and L.M.; Data curation, X.S.; Writing—original draft, W.W.; Writing—review & editing, X.S.; Supervision, Q.D. and L.M.; Project administration, R.W.; Funding acquisition, Z.R. All authors have read and agreed to the published version of the manuscript.

Funding

The research work was supported by Sanya Science and Education Innovation Park of Wuhan University of Technology (2021KF0002, 2021KF0004), Hainan Province Science and Technology Special Fund (ZDKJ2021024), Major R&D projects of China Metallurgical Group Corporation (2022 No. 14). The authors also acknowledge the support of the Logistics Support Office of Wuhan University of Technology.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

Sun, Y.; Haghighat, F.; Fung, B.C.M. A review of the state-of-the-art in data-driven approaches for building energy prediction. Energy Build. 2020, 221, 110022. [Google Scholar] [CrossRef]
Wang, Z.; Xia, L.; Yuan, H.; Srinivasan, R.S.; Song, X. Principles, research status, and prospects of feature engineering for data-driven building energy prediction: A comprehensive review. J. Build. Eng. 2022, 58, 105028. [Google Scholar] [CrossRef]
Wang, Z.; Srinivasan, R.S. A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models. Renew. Sustain. Energy Rev. 2017, 75, 796–808. [Google Scholar] [CrossRef]
Bourdeau, M.; Zhai, X.Q.; Nefzaoui, E.; Guo, X.; Chatellier, P. Modeling and forecasting building energy consumption: A review of data-driven techniques. Sustain. Cities Soc. 2019, 48, 101533. [Google Scholar] [CrossRef]
Seyedzadeh, S.; Pour Rahimian, F.; Rastogi, P.; Glesk, I. Tuning machine learning models for prediction of building energy loads. Sustain. Cities Soc. 2019, 47, 101484. [Google Scholar] [CrossRef]
Main Results of the 2021 National Education Statistics Report. Available online: https://www.ihzw.com.cn/jytj/21898.html (accessed on 1 March 2022).
Hsu, D. Comparison of integrated clustering methods for accurate and stable prediction of building energy consumption data. Appl. Energy 2015, 160, 153–163. [Google Scholar] [CrossRef]
Sala, J.; Li, R.; Christensen, M.H. Clustering and classification of energy meter data: A comparison analysis of data from individual homes and the aggregated data from multiple homes. Build. Simul. 2021, 14, 103–117. [Google Scholar] [CrossRef]
Yang, J.; Ning, C.; Deb, C.; Zhang, F.; Cheong, D.; Lee, S.E.; Sekhar, C.; Tham, K.W. K-shape clustering algorithm for building energy usage patterns analysis and forecasting model accuracy improvement. Energy Build. 2017, 146, 27–37. [Google Scholar] [CrossRef]
Li, Y.; Tong, Z.; Tong, S.; Westerdahl, D. A data-driven interval forecasting model for building energy prediction using attention-based lstm and fuzzy information granulation. Sustain. Cities Soc. 2022, 76, 103481. [Google Scholar] [CrossRef]
Kim, M.K.; Kim, Y.-S.; Srebric, J. Predictions of electricity consumption in a campus building using occupant rates and weather elements with sensitivity analysis: Artificial neural network vs. linear regression. Sustain. Cities Soc. 2020, 62, 102385. [Google Scholar] [CrossRef]
Shao, M.; Wang, X.; Bu, Z.; Chen, X.; Wang, Y. Prediction of energy consumption in hotel buildings via support vector machines. Sustain. Cities Soc. 2020, 57, 102128. [Google Scholar] [CrossRef]
Karijadi, I.; Chou, S.-Y. A hybrid rf-lstm based on ceemdan for improving the accuracy of building energy consumption prediction. Energy Build. 2022, 259, 111908. [Google Scholar] [CrossRef]
Wang, R.; Lu, S.; Feng, W. A novel improved model for building energy consumption prediction based on model integration. Appl. Energy 2020, 262, 114561. [Google Scholar] [CrossRef]
Luo, X.J.; Oyedele, L.O. Forecasting building energy consumption: Adaptive long-short term memory neural networks driven by genetic algorithm. Adv. Eng. Inform. 2021, 50, 101357. [Google Scholar] [CrossRef]
Jin, N.; Yang, F.; Mo, Y.; Zeng, Y.; Zhou, X.; Yan, K.; Ma, X. Highly accurate energy consumption forecasting model based on parallel lstm neural networks. Adv. Eng. Inform. 2022, 51, 101442. [Google Scholar] [CrossRef]
Pallonetto, F.; Jin, C.; Mangina, E. Forecast electricity demand in commercial building with machine learning models to enable demand response programs. Energy AI 2022, 7, 100121. [Google Scholar] [CrossRef]
Yan, D. An occupancy-based model for building electricity consumption prediction: A case study of three campus buildings in tianjin. Energy Build. 2019, 202, 109412. [Google Scholar]
Kuster, C.; Rezgui, Y.; Mourshed, M. Electrical load forecasting models: A critical systematic review. Sustain. Cities Soc. 2017, 35, 257–270. [Google Scholar] [CrossRef]
Liu, C.; Sun, B.; Zhang, C.; Li, F. A hybrid prediction model for residential electricity consumption using holt-winters and extreme learning machine. Appl. Energy 2020, 275, 115383. [Google Scholar] [CrossRef]
Luo, X.; Oyedele, L.O.; Ajayi, A.O.; Akinade, O.O.; Delgado, J.M.D.; Owolabi, H.A.; Ahmed, A. Genetic algorithm-determined deep feedforward neural network architecture for predicting electricity consumption in real buildings. Energy AI 2020, 2, 100015. [Google Scholar] [CrossRef]
Reddy, S.; Akashdeep, S.; Harshvardhan, R.; Kamath, S. Stacking deep learning and machine learning models for short-term energy consumption forecasting. Adv. Eng. Inform. 2022, 52, 101542. [Google Scholar] [CrossRef]
Fahim, A. K and starting means for k-means algorithm. J. Comput. Sci. 2021, 55, 101445. [Google Scholar] [CrossRef]
Ribeiro, M.; Grolinger, K.; ElYamany, H.F.; Higashino, W.A.; Capretz, M.A. Transfer learning with seasonal and trend adjustment for cross-building energy forecasting. Energy Build. 2018, 165, 352–363. [Google Scholar] [CrossRef]
Chen, L.; Shan, W.; Liu, P. Identification of concrete aggregates using k-means clustering and level set method. Structures 2021, 34, 2069–2076. [Google Scholar] [CrossRef]
Troccoli, E.B.; Cerqueira, A.G.; Lemos, J.B.; Holz, M. K-means clustering using principal component analysis to automate label organization in multi-attribute seismic facies analysis. J. Appl. Geophys. 2022, 198, 104555. [Google Scholar] [CrossRef]

Figure 1. The prediction framework of different campus building clusters.

Figure 2. The fluctuation of the three building clusters.

Figure 3. The prediction results of the first building within the high-energy consumption buildings category.

Figure 4. The prediction results of the second building within the high-energy consumption buildings category.

Figure 5. The prediction results of the first typical building within the medium-energy consumption buildings.

Figure 6. The prediction results of the second typical building within the medium-energy consumption buildings.

Figure 7. The prediction results of the first typical building within low-energy consumption buildings.

Figure 8. The prediction results of the second typical building within the low-energy consumption buildings.

Figure 9. A comparison of MAPE and APF within high-energy consumption buildings.

Figure 10. A comparison of MAPE and APF within medium-energy consumption buildings.

Figure 11. The comparison of MAPE and APF within low-energy consumption buildings.

Figure 12. The fitting result of APF and MAPE within high-energy consumption buildings.

Figure 13. The fitting result of APF and MAPE within medium-energy consumption buildings.

Figure 14. The fitting result of APF and MAPE within low-energy consumption buildings.

Table 1. The result of K-means clustering.

Clusters	k Centroids/kWh	Range/kWh	No.
High-energy consumption buildings	1376.33	799.95–2482.20	10
Medium-energy consumption buildings	483.78	185.87–789.43	10
Low-energy consumption buildings	113.95	49.99–175.07	9

Table 2. The

ω_{1}

and

ω_{2}

of the ARMA-LR model for different building clusters.

Table 2. The

ω_{1}

and

ω_{2}

of the ARMA-LR model for different building clusters.

Clusters	ω₁/%	ω₂/%	Total/%
High-energy consumption buildings	37	63	100
Medium-energy consumption buildings	60	40	100
Low-energy consumption buildings	74	26	100

Table 3. MAE of three prediction models.

Clusters	High-Energy Consumption Buildings	Medium-Energy Consumption Buildings	Low-Energy Consumption Buildings
ARMA	109.87	48.06	13.43
LR	117.49	48.71	13.29
SVR	117.98	49.84	13.24
LSTM	158.16	61.58	14.99
ARMA-LR	108.83	45.93	12.94

Table 4. RMSE of three prediction models.

Clusters	High-Energy Consumption Buildings	Medium-Energy Consumption Buildings	Low-Energy Consumption Buildings
ARMA	147.56	61.93	16.65
LR	154.56	62.72	16.56
SVR	158.61	64.17	16.44
LSTM	202.64	76.51	18.75
ARMA-LR	143.52	59.31	16.04

Table 5. MAPE of the prediction models.

Clusters	High-Energy Consumption Buildings	Medium-Energy Consumption Buildings	Low-Energy Consumption Buildings
ARMA	8.16	10.93	12.03
LR	8.70	11.09	11.85
SVR	8.70	11.23	11.79
LSTM	11.44	13.24	12.96
ARMA-LR	8.06	10.47	11.59

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wu, W.; Deng, Q.; Shan, X.; Miao, L.; Wang, R.; Ren, Z. Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model. Buildings 2023, 13, 2721. https://doi.org/10.3390/buildings13112721

AMA Style

Wu W, Deng Q, Shan X, Miao L, Wang R, Ren Z. Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model. Buildings. 2023; 13(11):2721. https://doi.org/10.3390/buildings13112721

Chicago/Turabian Style

Wu, Wenyu, Qinli Deng, Xiaofang Shan, Lei Miao, Rui Wang, and Zhigang Ren. 2023. "Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model" Buildings 13, no. 11: 2721. https://doi.org/10.3390/buildings13112721

APA Style

Wu, W., Deng, Q., Shan, X., Miao, L., Wang, R., & Ren, Z. (2023). Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model. Buildings, 13(11), 2721. https://doi.org/10.3390/buildings13112721

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Short-Term Forecasting of Daily Electricity of Different Campus Building Clusters Based on a Combined Forecasting Model

Abstract

1. Introduction

2. Methodology

2.1. Overall Flowchart of the Research

2.2. K-Means Clustering

2.3. Fluctuation

2.4. Combined Forecasting Model

3. Results and Evaluation

3.1. Prediction Accuracy Evaluation Index

3.2. Comparison of Prediction Results among Three Models

3.3. Evaluation of the Combined Forecasting Model

3.4. The Relationship between APF and MAPE

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI