Enhancing Zero-Energy Building Operations for ESG: Accurate Solar Power Prediction through Automatic Machine Learning
Abstract
:1. Introduction
1.1. Renewable Energy Usage in Zero-Energy Buildings Confirmed
1.2. Studies Related to Predicting the Output of a Solar Power System
1.3. Structure and Aim of this Study
2. Data Set
2.1. Information about the Demonstration Site
2.2. Weather Data from the Meteorological Administration
2.3. PV Data of the Demonstration Site
2.4. Pre-Processing for Data Set
3. Methods—Creation of the Models via Automatic Machine Learning
3.1. Automatic Machine Learning (AML)
3.2. Process of Creating Models via AML
- The period of the obtained dataset is from ‘22.01 to ‘23.03. This period is divided into four seasonal intervals as follows:
- Interval A [’22.03 to ’22.05]
- Interval B [’22.06 to ’22.08]
- Interval C [’22.09 to ’22.11]
- Interval D [’22.12 to ’23.02]
- 2.
- Each data interval is divided into training data and validation data randomly in a 9:1 ratio.
- 3.
- The available algorithms are utilized using the training data to create solar power generation models.
- 4.
- The generated models are evaluated using the validation data to derive their performance and compare them to select the best model.
- 5.
- If the best model’s performance falls below a certain threshold for certain metrics, the process is repeated from step 2.
- 6.
- If there is a model that meets all criteria, the algorithms and performance metrics of the prediction models generated concurrently with that model are also checked.
- 7.
- The best prediction models for each interval are derived by executing steps 3 to 6 for all intervals.
4. Methods—Improving the Accuracy of the Model
4.1. Relation of Data to Improve Accuracy
- It can be observed that ‘10’ is closely related to ‘0’ and ‘1’. For example, when it is ‘0’ at sunrise, ‘10’ increases with time, and when it approaches ‘0’ at sunset, ‘10’ decreases. During this process, if ‘1’, the sky status becomes ‘cloudy’, and the fluctuation range of ‘10’ decreases.
- It can be observed that ‘11’ is closely related to ‘10 and 14’. For example, when the value of ‘10’ increases, ‘11’ increases proportionally and remains constant. Conversely, when the value of ‘10’ decreases, ‘11’ decreases proportionally and remains constant. During this process, if the state of ‘14’ is ‘Off’, the value of ‘11’ is fixed at zero.
- It can be observed that ‘12’ is closely related to ‘10’ and ‘11’. ‘12’ is a value that can be derived through the multiplication of ‘10’ and ‘11’. This derived value is affected by the values ‘1 to 7’, ‘8 to 9’, and ‘13 to 14’ and can, therefore, vary accordingly. Through the first condition among the three conditions, it is possible to create a model for predicting ‘10’ using the dataset composed of information that can be obtained in advance. Therefore, it is possible to obtain predicted values for ‘10’ that are similar to the actual values and construct the dataset by replacing the actual values with the predicted values.
4.2. Process to Improve the Accuracy of the Model
- Check the “Data Set 01”, which includes all data.
- Create a model to predict ‘10’ by excluding ‘11 and 12’ from the original dataset. Obtain the predicted ‘10’ for a specific period.
- Replace ‘10’ in the “Data Set 01” with the predicted values obtained in step (2) to create the “Data Set 02”.
- Create a model to predict ‘11’ by excluding ‘12’ from the “Data Set 02”. Obtain the predicted ‘11’ for a specific period.
- Replace ‘11’ in “Data Set 02” with the predicted values obtained in step (4) to create “Data Set 03”.
- Create a model to predict ‘12’ using the “Data Set 03”. Obtain the predicted values of ‘12’ for a specific period. This is the final prediction for solar power generation.
- Utilize the models generated in steps (2), (4), and (6) as the proposed models, and evaluate their performance using the model obtained in step (6) as the main performance indicator.
5. Methods—Application on an Actual System
5.1. Create a Model through AML with Increased Accuracy
5.2. Predict Value via AML with Increased Accuracy
6. Results
6.1. Performance of Each Model
6.2. Prediction Accuracy of Each Model
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Ahmed, A.; Ge, T.; Peng, J.; Yan, W.-C.; Tee, B.T.; You, S. Assessment of the renewable energy generation towards net-zero energy buildings: A review. Energy Build. 2022, 256, 111755. [Google Scholar] [CrossRef]
- Vares, S.; Häkkinen, T.; Ketomäki, J.; Shemeikka, J.; Jung, N. Impact of renewable energy technologies on the embodied and operational GHG emissions of a nearly zero energy building. J. Build. Eng. 2019, 22, 439–450. [Google Scholar] [CrossRef]
- Park, S.; Lee, S.; Park, S.; Park, S. AI-based physical and virtual platform with 5-layered architecture for sustainable smart energy city development. Sustainability 2019, 11, 4479. [Google Scholar] [CrossRef] [Green Version]
- Li, X.; Lin, A.; Young, C.-H.; Dai, Y.; Wang, C.-H. Energetic and economic evaluation of hybrid solar energy systems in a residential net-zero energy building. Appl. Energy 2019, 254, 113709. [Google Scholar] [CrossRef]
- Park, S.; Cho, K.; Kim, S.; Yoon, G.; Choi, M.-I.; Park, S.; Park, S. Distributed energy IoT-based real-time virtual energy prosumer business model for distributed power resource. Sensors 2021, 21, 4533. [Google Scholar] [CrossRef]
- Sharafi, M.; ElMekkawy, T.Y.; Bibeau, E.L. Optimal design of hybrid renewable energy systems in buildings with low to high renewable energy ratio. Renew. Energy 2015, 83, 1026–1042. [Google Scholar] [CrossRef]
- Derkenbaeva, E.; Vega, S.H.; Hofstede, G.J.; Van Leeuwen, E. Positive energy districts: Mainstreaming energy transition in urban areas. Renew. Sustain. Energy Rev. 2022, 153, 111782. [Google Scholar] [CrossRef]
- Calise, F.; Cappiello, F.L.; d’Accadia, M.D.; Vicidomini, M. Dynamic modelling and thermoeconomic analysis of micro wind turbines and building integrated photovoltaic panels. Renew. Energy 2020, 160, 633–652. [Google Scholar] [CrossRef]
- Tamašauskas, R.; Šadauskienė, J.; Bruzgevičius, P.; Krawczyk, D.A. Investigation and Evaluation of Primary Energy from Wind Turbines for a Nearly Zero Energy Building (nZEB). Energies 2019, 12, 2145. [Google Scholar] [CrossRef] [Green Version]
- Li, M.; Cao, S.; Zhu, X.; Xu, Y. Techno-economic analysis of the transition towards the large-scale hybrid wind-tidal supported coastal zero-energy communities. Appl. Energy 2022, 316, 119118. [Google Scholar] [CrossRef]
- Brecl, K.; Topič, M. Photovoltaics (PV) system energy forecast on the basis of the local weather forecast: Problems, uncertainties and solutions. Energies 2018, 11, 1143. [Google Scholar] [CrossRef] [Green Version]
- World Bank. Off-Grid Solar Market Trends Report 2016; An Innovation of the World Bank Group in Cooperation with Global Off-Grid Lighting Association; World Bank: Washington, DC, USA, 2016. [Google Scholar]
- Park, S.; Park, S.; Yun, S.-P.; Lee, K.; Kang, B.; Choi, M.-I.; Jang, H.; Park, S. Design and Implementation of a Futuristic EV Energy Trading System (FEETS) Connected with Buildings, PV, and ESS for a Carbon-Neutral Society. Buildings 2023, 13, 829. [Google Scholar] [CrossRef]
- Magrini, A.; Lentini, G.; Cuman, S.; Bodrato, A.; Marenco, L. From nearly zero energy buildings (NZEB) to positive energy buildings (PEB): The next challenge-The most recent European trends with some notes on the energy analysis of a forerunner PEB example. Dev. Built Environ. 2020, 3, 100019. [Google Scholar] [CrossRef]
- Zero-Energy Building Information Site in Korea Energy Agency. Available online: https://zeb.energy.or.kr/BC/BC00/BC00_01_001.do (accessed on 1 June 2023).
- Antonanzas, J.; Osorio, N.; Escobar, R.; Urraca, R.; Martinez-de-Pison, F.J.; Antonanzas-Torres, F. Review of photovoltaic power forecasting. Sol. Energy 2016, 136, 78–111. [Google Scholar] [CrossRef]
- Mussard, M.; Amara, M. Performance of solar photovoltaic modules under arid climatic conditions: A review. Sol. Energy 2018, 174, 409–421. [Google Scholar] [CrossRef]
- Memiche, M.; Bouzian, C.; Benzahia, A.; Moussi, A. Effects of dust, soiling, aging, and weather conditions on photovoltaic system performances in a Saharan environment—Case study in Algeria. Glob. Energy Interconnect. 2020, 3, 60–67. [Google Scholar] [CrossRef]
- Li, T.-T.; Wang, K.; Sueyoshi, T.; Wang, D.D. ESG: Research progress and future prospects. Sustainability 2021, 13, 11663. [Google Scholar] [CrossRef]
- Schram, W.L.; AlSkaif, T.; Lampropoulos, I.; Henein, S.; Van Sark, W.G. On the trade-off between environmental and economic objectives in community energy storage operational optimization. IEEE Trans. Sustain. Energy 2020, 11, 2653–2661. [Google Scholar] [CrossRef]
- Shi, J.; Lee, W.-J.; Liu, Y.; Yang, Y.; Wang, P. Forecasting power output of photovoltaic systems based on weather classification and support vector machines. IEEE Trans. Ind. Appl. 2012, 48, 1064–1069. [Google Scholar] [CrossRef]
- Lee, E.S.; Gehbauer, C.; Coffey, B.E.; McNeil, A.; Stadler, M.; Marnay, C. Integrated control of dynamic facades and distributed energy resources for energy cost minimization in commercial buildings. Sol. Energy 2015, 122, 1384–1397. [Google Scholar] [CrossRef] [Green Version]
- Kim, S.-G.; Jung, J.-Y.; Sim, M.K. A two-step approach to solar power generation prediction based on weather data using machine learning. Sustainability 2019, 11, 1501. [Google Scholar] [CrossRef] [Green Version]
- Trabelsi, M.; Massaoudi, M.; Chihi, I.; Sidhom, L.; Refaat, S.S.; Huang, T.; Oueslati, F.S. An effective hybrid symbolic regression–deep multilayer perceptron technique for PV power forecasting. Energies 2022, 15, 9008. [Google Scholar] [CrossRef]
- Khandakar, A.; EH Chowdhury, M.; Khoda Kazi, M.; Benhmed, K.; Touati, F.; Al-Hitmi, M.; SP Gonzales, A., Jr. Machine learning based photovoltaics (PV) power prediction using different environmental parameters of Qatar. Energies 2019, 12, 2782. [Google Scholar] [CrossRef] [Green Version]
- Park, S.; Park, S.; Choi, M.-I.; Lee, S.; Lee, T.; Kim, S.; Cho, K.; Park, S. Reinforcement learning-based bems architecture for energy usage optimization. Sensors 2020, 20, 4918. [Google Scholar] [CrossRef] [PubMed]
- Cruz, J.; Mamani, W.; Romero, C.; Pineda, F. Selection of Characteristics by Hybrid Method: RFE, Ridge, Lasso, and Bayesian for the Power Forecast for a Photovoltaic System. SN Comput. Sci. 2021, 2, 202. [Google Scholar] [CrossRef]
- Shin, D.; Ha, E.; Kim, T.; Kim, C. Short-term photovoltaic power generation predicting by input/output structure of weather forecast using deep learning. Soft Comput. 2021, 25, 771–783. [Google Scholar] [CrossRef]
- Durrani, S.P.; Balluff, S.; Wurzer, L.; Krauter, S. Photovoltaic yield prediction using an irradiance forecast model based on multiple neural networks. J. Mod. Power Syst. Clean Energy 2018, 6, 255–267. [Google Scholar] [CrossRef]
- De Leone, R.; Pietrini, M.; Giovannelli, A. Photovoltaic energy production forecast using support vector regression. Neural Comput. Appl. 2015, 26, 1955–1962. [Google Scholar] [CrossRef]
- Gao, Y.; Li, S.; Dong, W. A learning-based load, PV and energy storage system control for nearly zero energy building. In Proceedings of the 2020 IEEE Power & Energy Society General Meeting (PESGM), Montreal, QC, Canada, 2–6 August 2020; pp. 1–5. [Google Scholar]
- Bataineh, K.; Dalalah, D. Optimal configuration for design of stand-alone PV system. Smart Grid Renew. Energy 2012, 3, 139. [Google Scholar] [CrossRef] [Green Version]
- Zamo, M.; Mestre, O.; Arbogast, P.; Pannekoucke, O. A benchmark of statistical regression methods for short-term forecasting of photovoltaic electricity production, part I: Deterministic forecast of hourly production. Sol. Energy 2014, 105, 792–803. [Google Scholar] [CrossRef]
- Hasan, K.; Yousuf, S.B.; Tushar, M.S.H.K.; Das, B.K.; Das, P.; Islam, M.S. Effects of different environmental and operational factors on the PV performance: A comprehensive review. Energy Sci. Eng. 2022, 10, 656–675. [Google Scholar] [CrossRef]
- Lu, C.; Li, S.; Penaka, S.R.; Olofsson, T. Automated machine learning-based framework of heating and cooling load prediction for quick residential building design. Energy 2023, 274, 127334. [Google Scholar] [CrossRef]
- Olsavszky, V.; Dosius, M.; Vladescu, C.; Benecke, J. Time series analysis and forecasting with automated machine learning on a national ICD-10 database. Int. J. Environ. Res. Public Health 2020, 17, 4979. [Google Scholar] [CrossRef] [PubMed]
- Mahjoubi, S.; Barhemat, R.; Guo, P.; Meng, W.; Bao, Y. Prediction and multi-objective optimization of mechanical, economical, and environmental properties for strain-hardening cementitious composites (SHCC) based on automated machine learning and metaheuristic algorithms. J. Clean. Prod. 2021, 329, 129665. [Google Scholar] [CrossRef]
- Open Weather Data Portal Site in Korea Meteorological Administration. Available online: https://data.kma.go.kr/cmmn/main.do (accessed on 1 June 2023).
- Abdul-Rahman, S.; Bakar, A.A.; Mohamed-Hussein, Z.-A. An intelligent data pre-processing of complex datasets. Intell. Data Anal. 2012, 16, 305–325. [Google Scholar] [CrossRef]
- Berthold, M.R.; Borgelt, C.; Höppner, F.; Klawonn, F. Guide to Intelligent Data Analysis: How to Intelligently Make Sense of Real Data; Springer Science & Business Media: London, UK, 2010. [Google Scholar]
- Hutter, F.; Kotthoff, L.; Vanschoren, J. Automated Machine Learning: Methods, Systems, Challenges; Springer Nature: Cham, Switzerland, 2019. [Google Scholar]
- Çetin, V.; Yildiz, O. A comprehensive review on data preprocessing techniques in data analysis. Pamukkale Üniversitesi Mühendislik Bilim. Derg. 2022, 28, 299–312. [Google Scholar] [CrossRef]
- Hohman, F.; Wongsuphasawat, K.; Kery, M.B.; Patel, K. Understanding and visualizing data iteration in machine learning. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, Honolulu, HI, USA, 25–30 April 2020; pp. 1–13. [Google Scholar]
- Zhao, W.; Zhang, H.; Zheng, J.; Dai, Y.; Huang, L.; Shang, W.; Liang, Y. A point prediction method based automatic machine learning for day-ahead power output of multi-region photovoltaic plants. Energy 2021, 223, 120026. [Google Scholar] [CrossRef]
- Chepurko, N.; Marcus, R.; Zgraggen, E.; Fernandez, R.C.; Kraska, T.; Karger, D. ARDA: Automatic relational data augmentation for machine learning. arXiv 2020, arXiv:2003.09758. [Google Scholar] [CrossRef]
Grade of Zero-Energy Building | Energy-Independence Rate |
---|---|
1st grade | More than 100% |
2nd grade | More than 80%, below 100% |
3rd grade | More than 60%, below 80% |
4th grade | More than 40%, below 60% |
5th grade | More than 20%, below 40% |
Type | Data | Value [Unit] | Variable Names Used in the Model |
---|---|---|---|
TIME | Meterdate | YYYY-DD-MM hh:mm:ss | MeterDate |
WEATHER | Status of Rain | 0: None 1: Rain 2: Rain/Snow 4: Rain Shower | RAIN_STATUS |
Humidity | 0~100 [%] | HUMI | |
Precipitation of Rain | 0~1000 [mm] | RAIN_PRECIP | |
Status of Sky | 1: Clean, 3: Cloudy, 4: Dark Cloudy | SKY_STATUS | |
Temperature | −99~100 [°C] | TEMP | |
Direction of Wind | 0~359 [°] | WIND_DIRECTION | |
Speed of Wind | 0~1000 [m/s] | WIND_SPEED |
Type | Data | Value [Unit] | Variable Names Used in the Model |
---|---|---|---|
TIME | Meterdate | YYYY-DD-MM hh:mm:ss | MeterDate |
PV_SENSOR | Temperature of heatsink | 0~100 [°C] | CH#_SINK_TEMP |
Temperature of panel’s surface | 0~100 [°C] | CH#_IN_TEMP | |
PV_ENERGY | Voltage measured at PCS | 0~1000 [V] | INPUT_VOL |
Current measured at PCS | 0~1000 [A] | INPUT_CUR | |
Power measured at PCS | 0~1000 [kW] | INPUT_PWR | |
PV_STATUS | Mode of PCS | 0: Manual Mode 1: Safety Mode 2: Schedule Mode | PCS_MODE |
Status of PCS | 0: Off 1: On | PCS_STATUS |
Algorithm | Abbreviation |
---|---|
Linear Regression | ‘lr’ |
Lasso Regression | ‘lasso’ |
Ridge Regression | ‘ridge’ |
Elastic Net | ‘en’ |
Least Angle Regression | ‘lar’ |
Lasso Least Angle Regression | ‘llar’ |
Orthogonal Matching Pursuit | ‘omp’ |
Bayesian Ridge | ‘br’ |
Automatic Relevance Determination | ‘ard’ |
Passive Aggressive Regressor | ‘par’ |
Random Sample Consensus | ‘ransac’ |
Theil-Sen Regressor | ‘tr |
Huber Regressor | ‘huber’ |
Kernel Ridge | ‘kr’ |
Support Vector Regression | ‘svm’ |
K Neighbors Regressor | ‘knn’ |
Decision Tree Regressor | ‘dt’ |
Extra Trees Regressor | ‘et’ |
AdaBoost Regressor | ‘ada’ |
Gradient Boosting Regressor | ‘gbr’ |
MLP Regressor | ‘mlp’ |
Extreme Gradient Boosting | ‘xgboost’ |
Light Gradient Boosting Machine | ‘lightgbm’ |
CatBoost Regressor | ‘catboost’ |
Model | Target | Algorithm | MAE | R2 | Training Time [s] |
---|---|---|---|---|---|
IDEAL MODEL | INPUT_PWR | Bayesian Ridge | 0.207 | 0.997 | 0.137 |
PROPOSED MODEL | INPUT_VOL | Random Forest Regressor | 14.495 | 0.976 | 0.281 |
INPUT_CUR | Extra Tree Regressor | 1.280 | 0.934 | 0.239 | |
INPUT_PWR | Bayesian Ridge | 0.375 | 0.974 | 0.134 | |
COMPARISON MODEL | INPUT_PWR | Bayesian Ridge | 1.744 | 0.845 | - |
Model | Target | Algorithm | MAE | R2 | Training Time [s] |
---|---|---|---|---|---|
IDEAL MODEL | INPUT_PWR | Random Forest Regressor | 0.05 | 0.999 | 0.103 |
PROPOSED MODEL | INPUT_VOL | Random Forest Regressor | 14.435 | 0.976 | 0.167 |
INPUT_CUR | Random Forest Regressor | 1.131 | 0.925 | 0.213 | |
INPUT_PWR | Random Forest Regressor | 0.324 | 0.968 | 0.223 | |
COMPARISON MODEL | INPUT_PWR | Random Forest Regressor | 0.577 | 0.908 | - |
Model | Target | Algorithm | MAE | R2 | Training Time [s] |
---|---|---|---|---|---|
IDEAL MODEL | INPUT_PWR | Bayesian Ridge | 0.007 | 0.999 | 0.107 |
PROPOSED MODEL | INPUT_VOL | Random Forest Regressor | 20.520 | 0.970 | 0.168 |
INPUT_CUR | Extra Tree Regressor | 1.864 | 0.929 | 0.123 | |
INPUT_PWR | Random Forest Regressor | 0.737 | 0.977 | 0.105 | |
COMPARISON MODEL | INPUT_PWR | Bayesian Ridge | 0.972 | 0.920 | - |
Model | Average of Predicted Value by the Model | Average of Solar Power Generation Value by PV1 | Average of Errors |
---|---|---|---|
IDEAL MODEL | 20.5092 | 20.5341 | 0.3849 |
PROPOSED MODEL | 19.1675 | 3.2467 | |
COMPARISON MODEL | 16.9806 | 5.3285 |
Model | Average of Predicted INPUT_PWR Value by the Model | Average of Solar Power Generation Value by PV2 | Average of Errors |
---|---|---|---|
IDEAL MODEL | 18.7010 | 19.1666 | 0.5888 |
PROPOSED MODEL | 21.2819 | 4.2394 | |
COMPARISON MODEL | 17.7638 | 5.6152 |
Model | Average of Predicted INPUT_PWR Value by the Model | Average of Solar Power Generation Value by PV3 | Average of Errors |
---|---|---|---|
IDEAL MODEL | 24.3856 | 24.4906 | 0.3447 |
PROPOSED MODEL | 23.6446 | 4.5075 | |
COMPARISON MODEL | 25.0363 | 5.5748 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Lee, S.; Park, S.; Kang, B.; Choi, M.-i.; Jang, H.; Shmilovitz, D.; Park, S. Enhancing Zero-Energy Building Operations for ESG: Accurate Solar Power Prediction through Automatic Machine Learning. Buildings 2023, 13, 2050. https://doi.org/10.3390/buildings13082050
Lee S, Park S, Kang B, Choi M-i, Jang H, Shmilovitz D, Park S. Enhancing Zero-Energy Building Operations for ESG: Accurate Solar Power Prediction through Automatic Machine Learning. Buildings. 2023; 13(8):2050. https://doi.org/10.3390/buildings13082050
Chicago/Turabian StyleLee, Sanghoon, Sangmin Park, Byeongkwan Kang, Myeong-in Choi, Hyeonwoo Jang, Doron Shmilovitz, and Sehyun Park. 2023. "Enhancing Zero-Energy Building Operations for ESG: Accurate Solar Power Prediction through Automatic Machine Learning" Buildings 13, no. 8: 2050. https://doi.org/10.3390/buildings13082050
APA StyleLee, S., Park, S., Kang, B., Choi, M. -i., Jang, H., Shmilovitz, D., & Park, S. (2023). Enhancing Zero-Energy Building Operations for ESG: Accurate Solar Power Prediction through Automatic Machine Learning. Buildings, 13(8), 2050. https://doi.org/10.3390/buildings13082050