Next Article in Journal
A Digital Twin of Hot Pumping Waxy Oil Through a Main Pipeline
Next Article in Special Issue
Investigation of Energy-Efficient Solutions for a Single-Family House Based on the 4E Idea in Poland
Previous Article in Journal
Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps
Previous Article in Special Issue
CFD and Statistical Analysis of the Impact of Surface Physical Parameters on the Thermal Resistance of Layered Partitions in ETICS Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

BIM-Based Machine Learning Application for Parametric Assessment of Building Energy Performance

by
Panagiotis Tsikas
,
Athanasios Chassiakos
*,
Vasileios Papadimitropoulos
and
Antonios Papamanolis
Department of Civil Engineering, University of Patras, 26500 Patras, Greece
*
Author to whom correspondence should be addressed.
Energies 2025, 18(1), 201; https://doi.org/10.3390/en18010201
Submission received: 22 November 2024 / Revised: 23 December 2024 / Accepted: 31 December 2024 / Published: 5 January 2025
(This article belongs to the Special Issue Building Energy Performance Modelling and Simulation)

Abstract

:
The energy performance of buildings has become a main concern globally in response to increased energy demand, the environmental impacts of energy production, and the reality of energy poverty. To improve energy efficiency, proper building design should be secured at the early design phase. Digital tools are currently available for performing energy assessment analyses and can efficiently handle complex and technically demanding buildings. However, alternative designs should be checked individually, and this makes the process time-consuming and prone to errors. Machine learning techniques can provide valuable assistance in developing decision support tools. In this paper, typical residential buildings are considered along with eleven factors that highly affect energy performance. A dataset of 337 instances of such parameters is developed. For each dataset, the building energy performance is estimated based on BIM analysis. Next, statistical and machine learning techniques are implemented to provide artificial models of energy performance. They include statistical regression modeling (SRM), decision trees (DTs), random forests (RFs), and artificial neural networks (ANNs). The analysis reveals the contribution of each factor and highlights the ANN as the best performing model. An easy-to-use interface tool has been developed for the instantaneous calculation of the energy performance based on the independent parameter values.

1. Introduction

The increase in population and human activities, followed by the expansion of the built environment, have intensified sustainability problems, among which is the need for significant amounts of energy in contemporary buildings. According to the Global Status Report for Buildings and Construction [1], buildings consume about 36% of annual global energy resources and are responsible for 60% of global electricity production, whilst they contribute 39% of global annual CO2 emissions.
To facilitate residents’ well-being, modern building design requires increased energy consumption. In addition, existing building structures are highly energy-intensive and environmentally unfriendly [2]. Previous studies and construction field experience have demonstrated that energy consumption in buildings can be effectively reduced through simple design solutions, such as the selection of appropriate building shapes and orientations. Thus, one of the challenges of the construction industry is the sustainable design of new buildings and adaptation of existing ones. In this direction, it is imperative to reassess the conventional design and evaluation procedures of structures, particularly through an integrated parametric approach that considers interdependences among leverage factors. In current practice, energy analyses are often conducted separately, as the planning and dimensioning processes are not aligned with the energy assessment process. In fact, most existing buildings were designed in the past with outdated regulations οr without any sort of provisions concerning their energy efficiency [3]. Additionally, structure complexity, multiple influential parameters and interdependencies, uncertainties, and limited data often result in assumptions and simplifications, which underestimate or distort the influence of these parameters, leading to inaccurate results and inadequate designs.
The energy efficiency of buildings is affected by several factors related to a building’s structure, location, and usage. These factors can be classified into two main classes: physical and human-influenced ones [4]. The first class encompasses technical and physical factors, such as the building envelope, local climate, heating, ventilation, and air conditioning systems. The second category includes human-related factors, such as occupant behavior and energy usage patterns, which exhibit significant variability and uncertainty in energy consumption. Finally, there are social factors, such as energy prices, that have social consequences on energy efficiency.
An effective way to estimate building energy efficiency is the utilization of building energy modeling (BEM) software [5]. However, partially using BEM software without synchronizing it with the global project design and construction process can result in ineffective decisions, which are partly due to discontinuous information flow among digital design models. In addition, the process is time-consuming, tedious, and susceptible to errors, as energy efficiency data must be entered into BEM software manually. The introduction of BIM technology allows for options that can overcome the limitations of conventional energy efficiency assessment processes through BEM, as currently, BEM can act as a subsystem of BIM. The use of BIM can result in accurate and systematic calculations and provide the ability to monitor and control energy use during the operation phase of buildings [6]. The application of BIM technology combined with simulations from the initial stages of project design allow for a wide exploration of alternative design solutions [7]. In fact, this technology provides the ground for understanding and quantifying the actual conditions of a building as well as simulating real-world user behavior, thus improving energy efficiency analyses.
While BIM represents the foundation for performing several types of analyses in building design, these analyses are usually multifactorial and require comparative evaluation of alternative solutions, with user involvement in setting these solutions up and comparing the results. The partial automation of such analyses results in increased time and effort requirements. Instead, one can employ a parametric analysis and use machine learning techniques to easily capture the influence of energy consumption factors and comparatively evaluate alternative building configurations in terms of energy efficiency. The present study first analyzes several factors that predominantly affect the energy efficiency of a typical residential building using BIM technologies. Based on this analysis, a dataset relating the factors influencing energy to their corresponding energy consumption is developed. Machine learning techniques are then developed and evaluated as an instrument for instantaneous energy assessment analyses that can assist in decisions on initial building designs or retrofitting.

2. Background

Existing research has dealt with several aspects of building energy analysis. In the direction of sustainable design, key architectural design variables (ADVs), which influence sustainability characteristics during early building design and integrate computational simulations and stakeholder insights, were examined [8]. This research underscores the importance of including stakeholder perspectives in developing design optimization tools, aligned with real-world needs for effective sustainable design.
The role of BIM in facilitating the LEED® certification process was explored in [9]. The findings underscore BIM’s potential to save time and resources, thus enhancing the feasibility of sustainable project management. Further, the implementation of national and international energy standards in low-rise residential buildings in Jordan utilizing BIM models was examined in [10]. The use of BIM tools and regression models to optimize energy performance in early-stage building design was demonstrated in [11]. By simulating various prototype building shapes, the study identified energy-efficient geometries, noting that triangular designs consumed the least energy. A framework combining BIM with a life cycle cost (LCC) analysis and orthogonal testing methods to identify optimal design parameters for energy efficiency in green buildings was introduced in [12]. The study revealed significant annual energy reductions, emphasizing enhancements in building envelope features such as insulation. A BIM-driven approach aiming at net-zero energy in tall buildings within the Malaysian construction sector was the focus of [13]. The results demonstrate significant correlations between BIM usage and factors such as early design integration and predictive energy performance.
Moving on to machine learning applications for energy consumption prediction, the effectiveness of deep learning (DL) methods for short-term cooling load prediction in buildings was explored in [14], emphasizing their potential to outperform traditional physic-based models. A comprehensive review of methods for predicting building energy demand, with a focus on data-driven techniques, like support vector machines (SVMs) and artificial neural networks (ANNs), was presented in [15]. The study highlights the strengths of these methods, including achieving a high predictive accuracy. It further notes that their effectiveness can be hindered by limited data availability. Another review of data-driven models for predicting building energy consumption, emphasizing their role in energy management, conservation, and planning, was performed in [16]. The study recommended focusing on big data analytics and incorporating occupant behavior insights to enhance model accuracy and reliability, paving the way for more tailored prediction models. An analysis of building energy prediction models was performed in [17], categorizing them into white-box, black-box, and grey-box types and highlighted that black-box models, which include machine learning techniques, like SVMs and ANNs, can handle complex data relationships. However, they face challenges related to human behavior and climate variability. A hybrid predictive model that combines empirical mode decomposition (CEEMDAN), deep learning, and ARIMA to enhance short-term heating load forecasts was introduced in [18]. The study demonstrated the potential for hybrid models to address complex data and improve forecast reliability, suggesting future research on dynamic weighting strategies to enhance adaptability. Deep transfer learning (DTL) strategies were evaluated in [19] to overcome data scarcity in cross-building energy prediction. The study concluded that this approach is effective for improving prediction accuracy across various building types and highlights the need for further explorations into diverse application contexts. The use of ANNs for national-level hourly heat demand prediction, focusing on simplifying input requirements while maintaining accuracy, was explored in [20]. The study identifies temperature and radiation as critical factors, allowing for reduced input complexity.
Focusing more closely on optimized and hybrid ML approaches, a comprehensive comparison of various machine learning (ML) techniques to predict annual energy consumption in residential buildings was provided in [21], with a particular focus on aiding early design decisions to enhance energy efficiency. The authors advocate for the use of DNN models for early-design-stage energy consumption forecasting and recommend further studies incorporating larger datasets and exploring ensemble algorithms to enhance predictive reliability. A hybrid machine learning model designed to enhance the accuracy of heating energy consumption predictions in residential buildings was presented in [22]. The study analyzed the support vector regression (SVR) and six meta-heuristic algorithms, including battle royale optimization (BRO) and particle swarm optimization (PSO). The authors recommend extending this method to various climates and building types to verify its broader utility. Energy demand forecasting across seven sectors in Iran was explored in [23], employing machine learning algorithms (ANN, LSTM, and ARIMA models), enhanced with optimization techniques (PSO and Grey Wolf Optimizer). The work concludes that this framework provides a foundation for strategic energy planning and management. An AutoML framework for predicting residential heating and cooling loads was introduced in [24], which automates the feature engineering and model optimization process. The study concludes that the use of SHapley Additive exPlanations (SHAP) values in model explainability enhances the framework’s practical utility for early-stage building design. Finally, the integration of evolutionary algorithms with conventional neural networks for residential energy consumption forecasting was proposed in [25], leading to the recommendation of using hybrid techniques for future optimization.
To relate energy analyses and socio-economic factors, artificial neural networks (ANNs) have been utilized as tools for policymakers to identify and assist vulnerable populations, as part of an energy poverty study in Greece, with targeted energy efficiency initiatives and financial aid [26]. The analysis incorporates various socio-economic and geographical factors, including house age and ownership status, to predict seven energy poverty indicators.
The existing literature has dealt with several aspects of the energy efficiency of buildings. However, the analyses mainly focus on fragments of the whole process while the outcomes remain within a rather scientific framework without the appropriate connection to engineering practice and design tools. There are also several research works using ML methods for assessing energy consumption. According to [16], 84% of such studies focus on short-term energy consumption prediction because of its direct relation to the day-to-day operations of buildings, while only 12% of the studies focus on long-term (yearly) energy consumption prediction. The first class refers to energy efficiency improvements, in terms of temperature or lighting, following certain interventions, such as cell insulation upgrades. An indicative study of this kind [27] focuses on an energy simulation of an academic building and aims at evaluating the BIM’s potential to improve the building’s energy efficiency through the simulation and optimization of design parameters, such as building orientation or insulation characteristics, individually or collectively. A long-term energy efficiency goal refers to a multi-parametric evaluation of energy-efficient building design in a structure’s life cycle [3]. Within the latter class, the present work combines BIM modeling and machine learning techniques to evaluate the influencing factors of building energy performance and provides a practical and easy to use tool for energy efficiency analyses of typical residential buildings. The adoption of BIM provides an unlimited capability to simulate the structure and develop energy models for any type and size of building, design characteristics, environmental factors, etc.

3. Methodology

The traditional methods of building energy performance assessment, via analytical calculations, are quite tedious and of low accuracy. This is because they rely on simplified assumptions and limited data that do not fully reflect the linkage of various factors with energy consumption. To address these limitations, a more comprehensive and integrated approach is needed, which considers all relevant factors with their interrelationships. The advancement of building information modeling (BIM) can significantly contribute to energy performance assessment upgrades, the optimal design of new buildings, and the retrofitting of existing ones.
While BIM tools can effectively carry out the desired analysis, the process of comparatively assessing alternative building designs requires the development of each individual BIM design. There are many of these designs and they can be quite different in terms of their characteristics, making the whole process arduous and time-consuming. On the other hand, the existence of a ready-to-use digital solution that could instantly provide an estimate of the building energy performance, based on some prevailing characteristics, without the need to develop each BIM model, could provide a great decision support in designing high-energy-performance buildings. To obtain such a solution, a two-step process is required, which includes the identification of the prevailing parameters and their interrelationships, as well as the energy performance assessment based on those parameters.
In the present study, a set of parameters that influence the building energy performance are developed based on research findings and practical experience. Then, an experiment is performed with alternative sets of these parameters, and the corresponding energy performance is assessed via a BIM application. The input–output set of this process is used to develop artificial prediction models, employing alternative methods of machine learning. A typical group of residential buildings is examined. The energy performance of buildings depends on several parameters with varying levels of contributions. In this work, the ones that have been highlighted in the literature and in practice as having a significant impact on the energy performance of structures are considered. These factors are generally related to the architectural design, the characteristics of the building shell, and the climatic conditions in the area and are in agreement with previous studies [28,29,30]. The analysis involves eleven main parameters that considerably influence the energy performance of such buildings. These parameters are codified in Table 1 and schematically illustrated in Figure 1. The location parameter corresponds to four different climatic zones, in terms of the average temperature, as is further described in the case study section.

Energy Efficiency Analysis of Buildings Utilizing BIM

This section describes the methodology to assess the energy efficiency of buildings using building information modeling (BIM) technology. This assessment is based on simulated data that have come from full building modeling. The energy efficiency assessment is performed via the quantitative parameter of energy consumption. The methodology can be used in the early design process of buildings for providing decision support for selecting appropriate features for a design.
The BIM design of a building starts with architectural modeling, in which the building elements with their characteristics (geometric dimensions, materials, supplier info, etc.) are entered in the 3D digital model. Then, the building structural characteristics are entered in the model and the building/shell design is created. Finally, the HVAC (heating, ventilation, and air conditioning) design is integrated. The whole building design process is dynamic and subject to continuous revisions and improvements. The BIM model, due to its high interoperability, is continually updated and provides a powerful model and a vivid picture of the alternative designs to all stakeholders. Figure 2 illustrates the BIM modules and their involvement in building energy modeling. Specifically, 3D-BIM is used for modeling the design characteristics of the building (Figure 2a). The solar path and the sunlight radiance components, which are shown in Figure 2b and Figure 2c, respectively, are tools that determine the sun orbit and the intensity and distribution of natural lighting at the building’s location as well as their effect on energy consumption. Finally, the energy model is a BIM tool that calculates the energy consumption of a building with specific characteristics (Figure 2d).
BIM technology provides significant benefits in interpreting and quantifying the actual condition of a building, while it further allows for the simulation of user behavior. By incorporating energy simulations into the BIM model, alternative designs can be explored, and energy-efficient solutions can be identified. In this direction, BIM-based building energy assessment is crucial in promoting sustainable construction.
In the following analysis, different pieces of terminology are used in terms of the aim and the output variable of the model. “Energy efficiency” refers broadly to the capability of the building to provide comfortable temperature conditions for inhabitants at relatively low energy consumption rates. “Energy consumption” represents the quantitative assessment of the required energy to attain acceptable levels of comfort and represents the model output variable. “Energy estimation or prediction” highlights the process of calculating the energy consumption level of an existing building or forecasting the energy consumption of a newly designed or reconstructed building.

4. Model Development

The energy consumption of a building is affected by several parameters associated with the characteristics of the building and its surroundings. To obtain a generic energy assessment model, the actual building design and energy performance should be analyzed on a parametric basis. This objective can be fulfilled if a representative set of input parameters are developed, and the corresponding energy consumption outputs are computed. Next, a machine learning technique can be employed to develop the corresponding artificial model.
Two general approaches for developing the artificial energy prediction model are examined in this paper. The first includes statistical methods in the form of curve fitting by means of statistical regression analysis (SRM model). The second class includes machine learning techniques, namely decision trees (DT model), random forest (RF model) and artificial neural networks (ANN model). All these methods are particularized as alternatives to energy performance assessments based on BIM modeling and aimed to involve low-effort calculations and provide wide applications without restrictions (Figure 3).

4.1. Multiple Nonlinear Regression Model

The approximation of the relationship between energy consumption and the factors that leverage it on a statistical ground can be elaborated through regression. The use of multiple nonlinear regression requires the selection (or assumption) of the nonlinear relationship type between input and output data. In many cases, however, the use of multiple nonlinear regression is not sufficient (for strong nonlinear problems), as the relationship type is confined within the available software libraries. Stata software allows for switching among different parameter combinations and exponents (Equation (1)), thus facilitating the determination of functional relationships. Equation (1) describes the general form of the SRM model [31], where Y is the output parameter (e.g., in kWh/m2 or EUR/m2 per year), X is the vector of the independent variables, and Bi and Pi are the model coefficients and exponents.
Y = B 0 + B 1 X p 1 + B 2 X p 2 +

4.2. Machine Learning Model

4.2.1. Decision Trees (DTs)

In terms of machine learning techniques, the first method employs decision trees (Figure 4). Decision trees (DTs) provide a graphical representation of problem solving through gradual decisions. They can be used in problems with numerical and classification data, being more effective in resolving problems of the latter type. A decision tree algorithm is a way of creating a tree-like model which makes decisions based on the features of the data. The tree starts with a single node known as the root node, which represents the entire dataset. The root node is then split into two or more child nodes based on the values of one of the features in the dataset. This process is repeated for each child node retrospectively, creating a hierarchical structure of nodes that represents the decision process. The algorithm selects the feature that best separates the data into subsets with similar target values to split the data at each node. This is accomplished by gauging the information gain of each feature. The information gain is a measure of how well a feature separates the data into subsets with similar target values. Eventually, the feature with the highest information gain is chosen to split the data at that node. The process of splitting the data continues recursively until it reaches a stopping criterion, i.e., a set of rules that determine when the algorithm should stop creating new nodes. This could be the maximum tree depth, the minimum number of samples per leaf, or the minimum information gain. Once the tree is built, it can be used to make predictions. Each leaf node of the tree represents a prediction for the target variable. The structure of the tree allows one to represent the decision process and the decision boundary in a readable format.
In the current application, the parameters related to the architectural design, the building frame characteristics, and the climate parameters are set up in a question form and the energy efficiency is set up as a decision output (assessment). These components are combined through a learning process to create a decision tree that can be used to estimate energy costs. However, in many cases, if the decision-making process consists of several steps, the decision trees can grow extensively. Still, due to the ease and supervision they offer, DTs are popular and practical tools in decision-making processes.

4.2.2. Random Forests (RFs)

A combination of DT algorithms is often developed to form a random forest algorithm (RF). Random forests (RFs) use a series of different decision trees combining the individual predictions to enhance the accuracy of the output. Figure 5 indicates such a structure with N decision trees. The RF algorithm is powerful and accurate for both problem classes, i.e., regression and classification.
An RF algorithm builds multiple decision trees (also known as “forests”) and combines their individual predictions to a final decision. A random sample is selected from the dataset for training each decision tree. A random subset is selected for each decision tree for splitting the data at each node. Then each decision tree is trained on its sample of data and selected features. The predictions from all decision trees are finally combined by taking, for instance, the majority of votes for classification problems, or averaging the predictions for regression problems. An advantage of random forests compared to decision trees is the reduced overfitting outcome.

4.2.3. Artificial Neural Network (ANN)

Artificial neural network (ANN) methods are particularly useful in cases in which the cost of implementing an analytical computational process is high or if the way that influencing factors interact is not well known. The ANN structure consists of several input nodes (which refer to the input parameters) and intermediate neurons arranged in layers to process the input data, as well as the output node (Figure 6). The efficiency of an artificial neural network is determined by its architecture and the learning process. The selection of the number of neurons, their layers, and the way that they are connected depend on the problem’s nature and characteristics. ANN learning is the process of acquiring knowledge through training examples, which is then recalled, providing an output in similar problems. Neural network operation is based on the transmission of a signal from the initial neurons (input layer) to the final neuron (output layer). Depending on the neuron connection characteristics, the signal is attenuated or amplified during its propagation, so that the physical quantity in the last neuron is calculated.
When developing machine learning models, a balance between model and actual problem complexities should be retained. In this sense, it is not always effective to use large-scale (several input parameters) or complex (in architecture) models. The employment of such a model does not necessarily guarantee more precise predictions. This is because the increase in size and complexity makes the model hypersensitive [32] to such an extent that it cannot work for out-of-training data due to overfitting. On the contrary, a simple model may exhibit high deviation between simulated and actual output data due to underfitting. The above inefficiencies can be regulated by the two complementary error types, variance and bias, which indicate overfitting and underfitting conditions, respectively. An effective model should retain balance between the two errors and be obtained by a trial-and-error development process.
In summary, four compact models are developed based on non-linear multivariate regression, decision trees, random forests, and artificial neural networks. More specifically, the statistical regression modeling (SRM) approach is employed, utilizing Stata 18 software to develop a non-linear and multivariable regression model. The decision tree model consists of 7 levels and 24 terminal nodes and has been trained using the C4.5 algorithm (https://en.wikipedia.org/wiki/C4.5_algorithm (accessed on 10 December 2024)). The random forest model includes five trees (with the same form as that in DT model) and has also been trained using the C4.5 algorithm. The root mean square error (RMSE) statistic is used as the objective (loss) function. The number of epochs in both models (DT and RF) is in the order of 1000.
A multi-layer perceptron (MLP) architecture is implemented for neural network development. The MLP structure consists of 11 input neurons corresponding to the input variables, a hidden layer of 11 neurons, and an output neuron. An activation function of a sigmoid form is employed for the hidden and the output layers. The specific design efficiently serves as a trade-off between model accuracy and simplicity, following experimentation on alternative designs to prevent overfitting and underfitting issues. The training process has been carried out using the Levenberg–Marquardt algorithm, which is a variant of the back-propagation method. The number of epochs is in the order of 5000.

5. Data Management and Development Software

The model’s effectiveness highly depends on the size and quality of the available data. In this work, the necessary data have been developed from simulation and parametric analysis of buildings through BIM application. The input parameter selection was made in a way that data are rather representative of all eleven design parameters in Table 1 within a reasonable range of values. The generated sample includes 337 arrays of input–output data. The dataset is randomly divided into training, validation, and testing subsets in proportions of 70%, 15%, and 15%, respectively.
According to [33], the training sample size in such type of analysis should be above an approximate level of fifty plus eight times the number of independent parameters. Further, the work in [34] proposes that 10 to 15 observations per parameter are required to avoid overfitting in statistical and machine learning models. In the present work, the training sample size is roughly twice as high as these levels. As such, it can assure that the model complexity (overfitting) is controllable, and the estimates are statistically reliable and representative of the population, thus avoiding underfitting.
The energy consumption assessment is performed using Autodesk Revit and Green Building Studio (GBS) software (https://gbs.autodesk.com/gbs). Revit has been used to develop the building model while GBS provides energy analysis and consumption optimization. The framework for the evaluation of energy assessment and decision making is illustrated in Figure 7.

6. Case Study

6.1. Input Data

A typical residential building is analyzed as part of the case study. The structure consists of two or four apartments, as per the architectural form in Figure 8, over one or two floors. Building type A includes two apartments each with an area of 212 m2 that are placed either side by side (single floor) or on top of each other (two-floor configuration). Building type B includes four apartments each with an area of 212 m2 that are placed either side by side (single floor) or on top of each other (two-floor arrangement). The exact building formations are shown in Figure 9.
Each apartment is a typical four-person family flat. Rooms are cooled and heated via an HVAC system. Rooms are classified into three common typical thermal zones. The first includes the kitchen and the living room, the second comprises the bedrooms, and the third the corridors and bathrooms. Two thermostats—one for heating and one for cooling—are considered. The set points for heating and cooling are at 24 °C and at 21 °C, respectively. The electric loads (for lighting and equipment) are determined based on the number of residents and defined as Watt (W) per square meter (m2).
Regarding the location parameter (X7), four different climatic zones are considered, based on ambient conditions in Greece and represented by four cities, as shown in Figure 10. These cities are Chania, Patras, Ioannina, and Florina with average annual temperatures of 17.2 °C, 15.5 °C, 12.8 °C, and 10.4 °C, respectively (https://en.climate-data.org).
To determine the impact of each influencing factor, a default parameter setting is considered. Each factor is then altered to another value, and the effect on energy consumption cost is computed. A bar graph that represents the effect of each factor on the annual energy cost is estimated for each building type (Figure 11). The graph displays the percentage change in energy consumption per each factor when shifting from the default to a new value. The default parameter values are shown in the graph legend. Positive values indicate an increase and negative values indicate a reduction in energy consumption. The annual cost estimate is based on a unit energy cost of 0.12 EUR/KWh.
The main observations from Figure 11 are as follows:
  • The overall view of the two diagrams indicates that individual factors play a similar role in terms of energy consumption or savings in both building types. A rather flat proportion of 1.85 to 1.00 in terms of total energy consumption is observed in these two cases. This is as expected, as type B buildings are more environment-proof, being more condensed than type A buildings. This outcome may highlight an energy performance pattern for buildings of larger sizes.
  • The highest influence is observed in terms of the location and the corresponding climatic conditions (parameter X7). For instance, moving from zone 2 (Patras) to zone 4 (Florina), the energy consumption is increased by 13.37% and 12.36% for building types A and B, respectively.
  • The presence of adjacent buildings has a positive effect on energy efficiency. This is observed by comparing building configurations 2 and 4 in Figure 9, with the energy savings in the latter case calculated to be 8.80%.
  • The energy consumption is increased in one-story compared to two-story structures (parameter X11) by 7.23% and 6.91% for building types A and B, respectively.
  • The existence of a roof (parameter X9) within the basic scenario results in energy savings of 6.24% and 6.23% for building types A and B, respectively.
  • The window-to-wall ratio (parameter X5) increases from 15% (in the basic scenario) to 30%, leading to increases in energy consumption of 5.08% and 5.25% for building types A and B, respectively.
  • Likewise, the thickness variation in the external wall insulation (parameter X2) from one level to another has an influence of about 3% in all cases. The wall thickness itself plays a marginal role in all cases.
  • All other parameters have a contribution of less than 2% to energy consumption.

6.2. Models of Energy Performance Prediction

Equation (2) provides the best-fitting relationship for estimating annual energy consumption as a function of the eleven independent factors, as derived from the fractional polynomials in the statistical regression model (SRM).
y = −0.12 X1 − 0.52 X2 + 0.05 X3 − 0.16 X4 + 3.72 X5 − 0.31 X6
+1.06 X7 − 0.99 X8 + 0.98 X9 − 0.10 X10 − 1.10 X11 + 21.30
Figure 12 presents an illustrative part (due to size limitations) of the developed decision tree (DT) model in a self-explanatory graphical form. The random forest (RF) and the neural network (NN) cannot be easily rendered in a graphical way due to their size. The RF consists of multiple decision trees, which makes it difficult to visualize the entire model. The NN structure is rather standardized and globally known. Nevertheless, typical structures of these models are presented earlier in Figure 5 and Figure 6, respectively. The DT model includes 7 levels and 24 end nodes, with part of it being displayed in Figure 12. The RF model consists of 5 individual decision trees. The ANN model is structured upon 11 input nodes, 11 intermediate nodes, and 168 variables in total.
Table 2 presents the energy consumption estimates, which are derived from the four models for indicative datasets, and the corresponding reference values, which are derived from the BIM model. The results indicate that the model output values do not greatly deviate from each other or from the corresponding reference values. To obtain some quantitative measures, the deviations from the associated reference values are in the ranges of 0.51% to 10.6%, 0.56% to 2.15%, 0.21% to 4.88%, and 0.14% to 1.65% for the SRM, DT, RF, and NN models, respectively. The results highlight the ML model as the best-performing one.

6.3. Model Performance Assesment

Statistical indicators (RMSE, PMRE, and correlation coefficient) are used to evaluate the dependencies between machine learning model output and actual observations. Further, the F-test is a statistical criterion to determine whether model results fit the observed ones. The corresponding mathematical relationships are given in Equations (3)–(5).
RMSE = 1 n 1 n x i y i 2
PMRE = 100 n 1 n x i y i x i
R 2 = 1 n x i x m e a n y i y m e a n 1 n x i x m e a n 2 1 n y i y m e a n 2 2
where xi is the observed value, yi is the predicted value, xmean and ymean are the corresponding mean values, and n is the sample size.
For in-depth performance appraisal of the developed models (SRM, DT, RF, and ANN), appropriate statistical analysis indexes are computed (Table 3). They include the root mean square error (RMSE), the percent mean relative error (PMRE), and the correlation coefficient (R2). The F-test is further used for contrasting the goodness of fit between the observed and the estimated values. The results indicate that the decision tree (DT), random forest (RF), and artificial neural network (ANN) models exhibit improved performance compared to the statistical regression model (SRM), as indicated by the error indices (RMSE and PMRE) and the correlation coefficient (R2). Among pure ML methods, the ANN model presents the best performance and satisfies the F-test in every dataset. The other two methods (DT and RF) partially satisfy the F-test only in the validation and testing datasets.
More specifically, the RMSE of the ANN model output ranges from 0.1278 to 0.2759 for individual data subsets and is 0.1840 for the full dataset. On the other hand, the SRM model presents much higher RMSE values in the order of 0.7514 and 0.8525 in the data subsets and 0.8329 in the full dataset. Similarly, the PMRE values for the ANN model are between 0.54% and 1.23%, while for the SRM, they are from 3.63% to 3.78%. Finally, the F-test is fully satisfied in all tests of the ANN model, partly satisfied in the cases of the DT and RF models (in validation and training subsets), and not satisfied in any dataset of the SRM model. Interestingly, the performance in the validation and testing subsets occasionally appears to be higher than that resulting from the training subset. This effect may be attributed to the randomness of data segmentation in the three subsets. The observed peculiarity is rather insignificant as the model characteristics are not considerably affected. Additionally, this is an indication that the model does not overfit. The superior performance of ML models is due to their inherent ability to model nonlinear relationships and complex interactions among input parameters. Further, ML models hold the ability to adapt to data of different types and sources, which makes them suitable for use in problems with high complexity.
The model performance comparison can be further visualized in graphical form as in Figure 13, Figure 14, Figure 15 and Figure 16. These figures illustrate the comparison of the model output with the reference (target) values that have been produced by BIM simulation. The main diagrams show the range and dispersion of the dataset output and target points in relation to the diagram diagonal line, which indicates full convergence. The histograms above and to the right show the distribution of target and model values within their respective ranges. In an ideal convergence case, the two histograms would retain the same distribution. The multi-faceted presentation facilitates a better understanding and interpretation of the results, highlighting the accuracy and effectiveness of the proposed models. These diagrams reveal that, in general, there is an acceptable level of convergence between the observed and predicted values with all models. In a more detailed assessment, the ANN (mainly) and the RF present a stronger relationship between the two set values. Finally, the error distributions of the four prediction methods are shown in Figure 17. The ANN model presents the best performance with zero average values and low standard deviation. Instead, other method distributions shift to non-zero average values and present higher standard deviation values.
The research development and the case study results indicate that the proposed methodology can provide energy cost estimates easily and effectively at the very early design stage of typical residential buildings. Additionally, development and testing for specific building types and characteristics have been performed. The experiment shows that this research, in its current form, can have notable potential for scalability and can be extended to different building structures, places, and environments. More importantly, the adoption of BIM provides unlimited capability to develop energy models for any type and size of buildings, design characteristics, environmental factors, etc. Another important aspect is that building energy performance is highly affected by human-related factors, such as occupant number and behavior in relation to energy usage patterns, or operational and maintenance practices, which exhibit significant variability and uncertainty in energy consumption. Although such parameters may be marginally known at the early design phase, they may be incorporated in the modeling in a parametric or stochastic way. The above issues can be investigated as part of future research.
As a final note, the results of this work are qualitatively consistent with those in the literature. However, a comparative quantitative analysis that would enable meaningful comparisons with and evaluations using previous models and research works is not feasible, as the individual studies’ data structures are not fully comparable. Nevertheless, the evaluation of ML model performance is universally performed through the training, testing, and validation processes, which also depend on the size and representativeness of the data used.
In summary, the proposed methodology integrates a BIM simulation, parametric evaluation, and automated estimation of building energy consumption with a simple and effective GUI. The outcome of this effort is the development of ML models that can assess the energy consumption of buildings with distinct characteristics with various levels of performance. These models can effectively assist building designers and researchers in making decisions. The proposed methodology has the advantage of developing energy performance models without the need for the existence or collection of actual data. Instead, it relies on simulated data that can be extracted by consistent BIM-based simulations. Although the current development is confined to a (common) class of residential buildings, the above capability can highly support the repeatability, scalability, and extensibility aims of model development within a practical range of application. In the absence of such models, each individual building design should undergo an energy analysis of several potential solutions based on a multi-parameter evaluation. Another contribution is the development of a simple-to-use GUI (which is described in the next section) for obtaining energy consumption estimates at the early design stage, without the need for comprehensive building modeling and wide energy analyses of alternative building design configurations.

7. Graphical User Interface

The study has developed a method for estimating the long-term energy performance of buildings. BIM technology has been elaborated for the simulation of energy consumption analysis and machine learning techniques to predict building energy efficiency. A drawback of machine learning techniques is their unsuitability for real-life problems. To overcome this obstacle, a user-interface tool has been developed to facilitate the model application (Figure 18). In this application, the user simply inserts the appropriate design parameter values and presses the “Calculation” button. An instant energy consumption calculation is provided based on the selected parameters without any other manual input or calculation.
The integration of the trained artificial neural network (ANN) into the graphical user interface (GUI) has been performed using the synaptic weights of the developed model. The ANN model includes 168 parameters, representing the network weight and bias factors. The model has been implemented in Visual Basic programming language, which is suitable for interface development.

8. Conclusions

The energy performance of buildings has become a main concern globally in view of increased energy demand, the environmental impacts of energy production, and the reality of energy poverty in many areas. To increase building energy efficiency, proper building design should be implemented starting at the early design phase or when retrofitting decisions regarding existing buildings need to be made. Digital tools are currently available for performing energy assessment analyses in the form of building information modeling (BIM) and building energy modeling (BEM), and they can efficiently handle complex and technically demanding buildings. However, alternative designs should be subject to individual analyses, and this makes the process time-consuming and prone to errors. Machine learning techniques, on the other hand, can ameliorate such deficiencies and provide effective and easy-to-use tools for such analyses.
In this paper, representative residential buildings are considered along with eleven factors that affect energy performance. A dataset of 337 instances of such parameters is developed. For each dataset, the building energy performance is estimated based on a BIM simulation. Next, statistical and machine learning techniques are implemented to provide artificial models of energy performance. They include statistical regression modeling (SRM), decision trees (DTs), random forests (RFs), and artificial neural networks (ANNs). The above models are appropriately trained, tested, and validated using the dataset.
The case study results reveal that building energy consumption is mainly affected by the local ambient conditions, the number of building floors (one or two), the existence of adjacent buildings, the presence of a roof, and the window-to-wall ratio. Regarding the prediction models, the evaluation results from the case study indicate that all artificial models present satisfactory (for practical use at the early design phase) levels of convergence to the observed values. The comparative evaluation highlights the ANN as the best-performing model, with high statistical performance indicator values. Calculating the modeling energy efficiency of buildings via machine learning techniques appears to be an attractive approach to overcome the computational cost of creating energy efficiency simulations. Further, to enhance the model’s applicability, a user interface tool has been developed into which the design parameters of the eleven influencing factors are inserted and the energy cost is instantly calculated.
The experiment in the case study indicates that the proposed models can be directly extensible and scalable, at least within a reasonable parameter extrapolation range. For out-of-range cases, the process of developing the required dataset and training machine learning models can be presumably extended to any type and size of building, along with the rest of the influencing parameters. In a similar path, the development can incorporate human-related factors, which highly affect energy consumption.
The proposed methodology integrates BIM simulations, parametric evaluations, and automated estimations of building energy consumption with a simple and effective GUI. In particular, it provides a useful and handy tool for energy consumption estimations of buildings with varying characteristics without the need for acquiring and using actual data, as it develops these data via a BIM simulation. Another contribution is the development of a simple and easy-to-use GUI for calculating the energy consumption and comparatively assessing the energy performance of alternative building design configurations.

Author Contributions

Conceptualization, P.T. and A.C.; methodology, P.T. and A.C.; software, P.T.; validation, P.T., V.P. and A.P.; formal analysis, P.T., A.C., V.P. and A.P.; investigation, P.T., V.P. and A.P.; resources, A.C.; data curation, P.T., V.P. and A.P.; writing—original draft preparation, P.T., V.P. and A.P.; writing—review and editing, P.T., A.C., V.P. and A.P.; visualization, P.T., A.C., V.P. and A.P.; supervision, A.C.; project administration, P.T. and A.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data that were developed and used in this study will be made available by the authors upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. International Energy Agency. 2019 Global Status Report for Buildings and Construction; International Energy Agency: Paris, France, 2019; ISBN 9789280736861. [Google Scholar]
  2. Mauro, G.M.; Menna, C.; Vitiello, U.; Asprone, D.; Ascione, F.; Bianco, N.; Prota, A.; Vanoli, G.P. A Multi-Step Approach to Assess the Lifecycle Economic Impact of Seismic Risk on Optimal Energy Retrofit. Sustainability 2017, 9, 989. [Google Scholar] [CrossRef]
  3. Pohoryles, D.A.; Maduta, C.; Bournas, D.A.; Kouris, L.A. Energy Performance of Existing Residential Buildings in Europe: A Novel Approach Combining Energy with Seismic Retrofitting. Energy Build. 2020, 223, 110024. [Google Scholar] [CrossRef]
  4. Yoshino, H.; Hong, T.; Nord, N. IEA EBC Annex 53: Total Energy Use in Buildings—Analysis and Evaluation Methods. Energy Build. 2017, 152, 124–136. [Google Scholar] [CrossRef]
  5. Gao, H.; Koch, C.; Wu, Y. Building Information Modelling Based Building Energy Modelling: A Review. Appl. Energy 2019, 238, 320–343. [Google Scholar] [CrossRef]
  6. Reeves, T.; Olbina, S.; Issa, R.R.A. Guidelines for Using Building Information Modeling for Energy Analysis of Buildings. Buildings 2015, 5, 1361–1388. [Google Scholar] [CrossRef]
  7. Habibi, S. The Promise of BIM for Improving Building Performance. Energy Build. 2017, 153, 525–548. [Google Scholar] [CrossRef]
  8. Wang, X.; Teigland, R.; Hollberg, A. Identifying Influential Architectural Design Variables for Early-Stage Building Sustainability Optimization. Build. Environ. 2024, 252, 111295. [Google Scholar] [CrossRef]
  9. Azhar, S.; Carlton, W.A.; Olsen, D.; Ahmad, I. Building Information Modeling for Sustainable Design and LEED® Rating Analysis. Autom. Constr. 2011, 20, 217–224. [Google Scholar] [CrossRef]
  10. Albdour, M.S.; Shalby, M.; Salah, A.A.; Alhomaidat, F. Evaluating and Enhancing the Energy Efficiency of Representative Residential Buildings by Applying National and International Standards Using BIM. Energies 2022, 15, 7763. [Google Scholar] [CrossRef]
  11. Tahmasebinia, F.; Jiang, R.; Sepasgozar, S.; Wei, J.; Ding, Y.; Ma, H. Using Regression Model to Develop Green Building Energy Simulation by BIM Tools. Sustainability 2022, 14, 6262. [Google Scholar] [CrossRef]
  12. Li, X.; Lin, M.; Jiang, M.; Jim, C.Y.; Liu, K.; Tserng, H. Bim and Orthogonal Test Methods To Optimize the Energy Consumption of Green Buildings. J. Civ. Eng. Manag. 2024, 30, 670–690. [Google Scholar] [CrossRef]
  13. Sajjad, M.; Hu, A.; Alshehri, A.M.; Waqar, A.; Khan, A.M.; Bageis, A.S.; Elaraki, Y.G.; Shohan, A.A.A.; Benjeddou, O. BIM-Driven Energy Simulation and Optimization for Net-Zero Tall Buildings: Sustainable Construction Management. Front. Built Environ. 2024, 10, 1296817. [Google Scholar] [CrossRef]
  14. Fan, C.; Xiao, F.; Zhao, Y. A Short-Term Building Cooling Load Prediction Method Using Deep Learning Algorithms. Appl. Energy 2017, 195, 222–233. [Google Scholar] [CrossRef]
  15. Ahmad, T.; Chen, H.; Guo, Y.; Wang, J. A Comprehensive Overview on the Data Driven and Large Scale Based Approaches for Forecasting of Building Energy Demand: A Review. Energy Build. 2018, 165, 301–320. [Google Scholar] [CrossRef]
  16. Amasyali, K.; El-Gohary, N.M. A Review of Data-Driven Building Energy Consumption Prediction Studies. Renew. Sustain. Energy Rev. 2018, 81, 1192–1205. [Google Scholar] [CrossRef]
  17. Yu, J.; Chang, W.S.; Dong, Y. Building Energy Prediction Models and Related Uncertainties: A Review. Buildings 2022, 12, 1284. [Google Scholar] [CrossRef]
  18. Zhou, Y.; Wang, L.; Qian, J. Application of Combined Models Based on Empirical Mode Decomposition, Deep Learning, and Autoregressive Integrated Moving Average Model for Short-Term Heating Load Predictions. Sustainability 2022, 14, 7349. [Google Scholar] [CrossRef]
  19. Li, G.; Wu, Y.; Liu, J.; Fang, X.; Wang, Z. Performance Evaluation of Short-Term Cross-Building Energy Predictions Using Deep Transfer Learning Strategies. Energy Build. 2022, 275, 112461. [Google Scholar] [CrossRef]
  20. Zhang, M.; Millar, M.-A.; Chen, S.; Ren, Y.; Yu, Z.; Yu, J. Enhancing Hourly Heat Demand Prediction through Artificial Neural Networks: A National Level Case Study. Energy AI 2024, 15, 100315. [Google Scholar] [CrossRef]
  21. Olu-Ajayi, R.; Alaka, H. Building Energy Consumption Prediction Using Deep Learning. Environ. Des. Manag. Conf. 2021, 202, 1. [Google Scholar]
  22. Khajavi, H.; Rastgoo, A. Improving the Prediction of Heating Energy Consumed at Residential Buildings Using a Combination of Support Vector Regression and Meta-Heuristic Algorithms. Energy 2023, 272, 127069. [Google Scholar] [CrossRef]
  23. Javanmard, M.E.; Ghaderi, S.F. Energy Demand Forecasting in Seven Sectors by an Optimization Model Based on Machine Learning Algorithms. Sustain. Cities Soc. 2023, 95, 104623. [Google Scholar] [CrossRef]
  24. Lu, C.; Li, S.; Penaka, S.R.; Olofsson, T. Automated Machine Learning-Based Framework of Heating and Cooling Load Prediction for Quick Residential Building Design. Energy 2023, 274, 127334. [Google Scholar] [CrossRef]
  25. Wang, G.; Mukhtar, A.; Moayedi, H.; Khalilpoor, N.; Tt, Q. Application and Evaluation of the Evolutionary Algorithms Combined with Conventional Neural Network to Determine the Building Energy Consumption of the Residential Sector. Energy 2024, 298, 131312. [Google Scholar] [CrossRef]
  26. Papada, L.; Kaliampakos, D. Artificial Neural Networks as a Tool to Understand Complex Energy Poverty Relationships: The Case of Greece. Energies 2024, 17, 3163. [Google Scholar] [CrossRef]
  27. Maglad, A.M.; Houda, M.; Alrowais, R.; Khan, A.M.; Jameel, M.; Rehman, S.K.U.; Khan, H.; Javed, M.F.; Rehman, M.F. Bim-Based Energy Analysis and Optimization Using Insight 360 (Case Study). Case Stud. Constr. Mater. 2023, 18, e01755. [Google Scholar] [CrossRef]
  28. Tsanas, A.; Xifara, A. Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools. Energy Build. 2012, 49, 560–567. [Google Scholar] [CrossRef]
  29. Ibrahim, M.; Ghaddar, N.; Ghali, K. Optimal Location and Thickness of Insulation Layers for Minimizing Building Energy Consumption. J. Build. Perform. Simul. 2012, 5, 384–398. [Google Scholar] [CrossRef]
  30. Banihashemi, S.; Ding, G.; Wang, J. Developing a Hybrid Model of Prediction and Classification Algorithms for Building Energy Consumption. Energy Procedia 2017, 110, 371–376. [Google Scholar] [CrossRef]
  31. StataCorp. Stata User Guide; Muthén & Muthén: Los Angeles, CA, USA, 2021; ISBN 9781597181709. [Google Scholar]
  32. Huang, W.; Martin, P.; Zhuang, H.L. Machine-Learning Phase Prediction of High-Entropy Alloys. Acta Mater. 2019, 169, 225–236. [Google Scholar] [CrossRef]
  33. Green, S.B. How Many Subjects Does It Take To Do A Regression Analysis. Multivar. Behav. Res. 1991, 26, 499–510. [Google Scholar] [CrossRef] [PubMed]
  34. Babyak, M.A. What You See May Not Be What You Get: A Brief, Nontechnical Introduction to Overfitting in Regression-Type Models. Psychosom. Med. 2004, 66, 411–421. [Google Scholar]
Figure 1. Schematic representation of design parameters.
Figure 1. Schematic representation of design parameters.
Energies 18 00201 g001
Figure 2. Digital 3D/BIM and energy models of a two-story residential building. (a) 3D/BIM model; (b) Solar path; (c) Sunlight radiance; (d) Energy model.
Figure 2. Digital 3D/BIM and energy models of a two-story residential building. (a) 3D/BIM model; (b) Solar path; (c) Sunlight radiance; (d) Energy model.
Energies 18 00201 g002
Figure 3. Flowchart of machine-learning-based energy performance assessment.
Figure 3. Flowchart of machine-learning-based energy performance assessment.
Energies 18 00201 g003
Figure 4. Schematic illustration of a decision tree algorithm.
Figure 4. Schematic illustration of a decision tree algorithm.
Energies 18 00201 g004
Figure 5. Schematic illustration of random forest algorithm.
Figure 5. Schematic illustration of random forest algorithm.
Energies 18 00201 g005
Figure 6. Schematic representation of neural network algorithm.
Figure 6. Schematic representation of neural network algorithm.
Energies 18 00201 g006
Figure 7. Process flow diagram for data generation.
Figure 7. Process flow diagram for data generation.
Energies 18 00201 g007
Figure 8. BIM architectural model of a single apartment.
Figure 8. BIM architectural model of a single apartment.
Energies 18 00201 g008
Figure 9. (a) Digital 3D/BIM model of type A buildings with one or two floors. (b) Digital 3D/BIM model of type B buildings with one or two floors.
Figure 9. (a) Digital 3D/BIM model of type A buildings with one or two floors. (b) Digital 3D/BIM model of type B buildings with one or two floors.
Energies 18 00201 g009
Figure 10. Climatic zones of representative cities in Greece.
Figure 10. Climatic zones of representative cities in Greece.
Energies 18 00201 g010
Figure 11. (a) Effect of design parameters for building type A. (b) Effect of design parameters for building type B.
Figure 11. (a) Effect of design parameters for building type A. (b) Effect of design parameters for building type B.
Energies 18 00201 g011
Figure 12. Partial graphic illustration of the DT model.
Figure 12. Partial graphic illustration of the DT model.
Energies 18 00201 g012
Figure 13. Convergence of observed and predicted values by the SRM.
Figure 13. Convergence of observed and predicted values by the SRM.
Energies 18 00201 g013
Figure 14. Convergence of observed and predicted values by the DT model.
Figure 14. Convergence of observed and predicted values by the DT model.
Energies 18 00201 g014
Figure 15. Convergence of observed and predicted values by the RF model.
Figure 15. Convergence of observed and predicted values by the RF model.
Energies 18 00201 g015
Figure 16. Convergence of observed and predicted values by the ANN model.
Figure 16. Convergence of observed and predicted values by the ANN model.
Energies 18 00201 g016
Figure 17. Error distribution between target values from BIM simulation and output from developed models.
Figure 17. Error distribution between target values from BIM simulation and output from developed models.
Energies 18 00201 g017
Figure 18. Graphical user interface for energy consumption estimation.
Figure 18. Graphical user interface for energy consumption estimation.
Energies 18 00201 g018
Table 1. Description of design factors and corresponding values.
Table 1. Description of design factors and corresponding values.
DescriptionParameter ValuesModel Values
X1Masonry thickness15, 20, 25 cm15, 20, 25
X2Wall insulation thickness2, 3, 5 cm2, 3, 5
X3Slab thickness15, 20 cm15, 20
X4Slab insulation thickness2, 3, 5 cm2, 3, 5
X5Window-to-wall ratio15%, 20%, 30%15, 20, 30
X6Window glass typeSingle, double1, 2
X7Climate zone at building locationZone 1, 2, 3, 41, 2, 3, 4
X8Building orientationSouth, North, East, West1, 2, 3, 4
X9Roof existenceYes, No1, 2
X10Basement existenceYes, No1, 2
X11Building size1 floor, 2 floors1, 2
Table 2. Comparison of results of alternative methods.
Table 2. Comparison of results of alternative methods.
Annual Energy Consumption (EUR/m2)
Model-SRMModel-DTModel-RFModel-ANNBIM
119.0018.5218.3918.1218.15
215.0214.4914.3014.3114.33
317.4517.2617.8417.1417.01
416.5716.1215.9216.0415.78
517.8417.3917.2817.5217.75
616.7716.1215.8815.9216.03
720.5022.7822.1923.3022.93
817.7517.3717.3217.4817.27
913.2613.7214.0213.3713.44
1019.2018.5218.7118.7618.79
Table 3. Model performance indicators.
Table 3. Model performance indicators.
ModelRoot Mean Square Error (RMSE)Percent Mean Relative Error (PMRE)Correlation Coefficient
(R2)
F-Test
All dataset F 2, 335, 0.05 = 3.03
SRM0.83293.74%0.8810171.55
DT0.49302.17%0.94607.06
RF0.27221.15%0.98365.46
ANN0.18400.75%0.99252.71
Training F 2, 233, 0.05 = 3.04
SRM0.85253.76%0.8797119.56
DT0.48222.02%0.94986.70
RF0.23020.96%0.98885.51
ANN0.12780.54%0.99651.60
Validation F 2, 49, 0.05 = 3.19
SRM0.81903.78%0.819023.43
DT0.54552.63%0.93411.15
RF0.37441.74%0.96880.20
ANN0.27591.19%0.98201.01
Testing F 2, 49, 0.05 = 3.19
SRM0.75143.63%0.751428.22
DT0.48712.40%0.94002.34
RF0.32411.43%0.97271.54
ANN0.26921.23%0.98200.14
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tsikas, P.; Chassiakos, A.; Papadimitropoulos, V.; Papamanolis, A. BIM-Based Machine Learning Application for Parametric Assessment of Building Energy Performance. Energies 2025, 18, 201. https://doi.org/10.3390/en18010201

AMA Style

Tsikas P, Chassiakos A, Papadimitropoulos V, Papamanolis A. BIM-Based Machine Learning Application for Parametric Assessment of Building Energy Performance. Energies. 2025; 18(1):201. https://doi.org/10.3390/en18010201

Chicago/Turabian Style

Tsikas, Panagiotis, Athanasios Chassiakos, Vasileios Papadimitropoulos, and Antonios Papamanolis. 2025. "BIM-Based Machine Learning Application for Parametric Assessment of Building Energy Performance" Energies 18, no. 1: 201. https://doi.org/10.3390/en18010201

APA Style

Tsikas, P., Chassiakos, A., Papadimitropoulos, V., & Papamanolis, A. (2025). BIM-Based Machine Learning Application for Parametric Assessment of Building Energy Performance. Energies, 18(1), 201. https://doi.org/10.3390/en18010201

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop