1. Introduction
Nowadays, the practice of forecasting is increasingly prevalent across various sectors, including the national economy, business domains, and applied research [
1]. For example, it is employed in predicting electricity consumption, financial market prices, technical device failures, weather conditions, and many other factors [
2]. The concept of forecasting is derived from the Greek word “πρόγνωση”, meaning “foresight” or “prediction,”, which refers to a scientifically informed assessment of the future state of a studied system considering the present moment and the combined impact of internal and external factors. The statistical data and historical models are examined using software tools and data analysis systems to generate a forecast capable of handling extensive datasets gathered over an extended period. The primary objective is to mitigate risks associated with financial, marketing, and operational decision making by anticipating future demand and trends [
3,
4]. The accuracy (or reliability) of a forecast serves as a measure of the analysis’s quality. It shows the degree to which the predicted outcomes align with actual values; thus, the accuracy is directly influenced by the forecasting system and the methods employed [
5]. Enhancing reliability, reducing risks, and minimizing potential losses are achieved by employing a variety of resources in the forecasting process. For instance, methods such as scenarios, constructing a “goal tree,” and morphological analysis have gained widespread usage when predicting outcomes in fundamental research, system analysis, and synthesis. Conversely, statistical approaches are frequently constrained due to the absence or scarcity of initial data and, also, the challenges in determining the nature of the predicted phenomenon [
6].
The task of forecasting electricity consumption patterns by managing load profiles in the electrical grid and effectively overseeing modern energy power systems (EPS) holds tremendous importance from an energy perspective [
7,
8]. The planning and management of energy power systems have gained particular significance in light of the increasing integration of renewable energy sources (RES) and their inherent stochastic nature of energy generation. Accurately predicting electricity generation from wind and solar power plants (providing estimates of expected electricity output) has become crucial for grid operators. This is especially essential for maintaining a balance between energy supply and demand, which requires facilitating the successful integration of distributed renewable energy sources (DRES), optimizing the component sizes in autonomous hybrid systems, and enhancing the reliability, efficiency, and safety of isolated systems [
9]. Furthermore, the global trend of expanding network capacity through the integration of hybrid distributed systems that utilize renewable energy is emerging as an option for sustainable energy supply for the foreseeable future [
10].
To address the aforementioned challenges, there is a need for high-precision forecasting systems for renewable energy generation, enabling the effective planning of electricity production for the upcoming day [
11]. Typically, solar or wind power facilities cannot guarantee a precise output of electrical power at a given time, making the forecast of generated energy and its supply to the grid highly relevant [
12,
13].
Solar energy represents a dynamic sector of the energy industry characterized by significant regional disparities in its development [
14]. The task of forecasting solar power output presents greater intricacies and complexities compared to other energy sources due to the influence of meteorological factors on the performance of photovoltaic arrays. These factors exhibit stochastic behavior, making it challenging to predict accurately the changes in power generation. Consequently, the absorbed solar radiation can be modeled as a correlation between the maximum power output of the photovoltaic modules and solar radiation levels. The intensity of solar radiation directly depends on geographical location, time, and the orientation of the solar panel in relation to both the sun and the sky [
15]. In such a case, quite complex mathematical models are used, for instance, deep neural networks [
16] and recurrent neural networks [
17].
The primary objective of this study is to develop a model for predicting solar insolation by employing adaptive neural network techniques. This model is designed to forecast hourly solar radiation patterns. The hidden-layer neurons utilize the rectified linear unit (ReLU) and sigmoid activation functions in the proposed model. A comparative analysis is conducted between two learning methods: the conventional stochastic gradient descent (SGD) and the adapted lead scenting algorithm (Adam). To evaluate the performance of this model, testing is conducted for an isolated hybrid power system located in the Gorno-Badakhshan Autonomous Oblast (GBAO) considering data for a winter month. In this region, the unpredictability of the energy generation from renewable and alternative sources (including small mountain river-based hydropower plants (HPPs), solar power plants (SPPs), and wind power plants (WPPs) combined with energy storage devices) poses a significant challenge to maintaining a stable electric energy balance. Therefore, the problem addressed in this study holds substantial relevance [
18,
19]. The problem of the control of the autonomous power system of GBAO is described in [
18]. The problem of the short-term forecasting of wind speed, which is necessary for forecasting generation at WPPs, is considered in [
19].
Recently, the global solar energy industry has begun to predict the output of renewable energy in the short-term horizon, which ranges from an hour to a week. It is important to predict the solar insolation for different horizons since this allows better planning and management of renewable energy systems. For example, if the forecast predicts low insolation for the next few hours, energy storage systems can be charged in advance to ensure a continuous power supply. Similarly, if high insolation is predicted, excess energy can be sold back to the grid or stored for later use. Additionally, accurate short-term forecasting can support grid operators to balance the supply and demand of electricity in real time, leading to a more reliable and efficient power system. However, the currently used forecasting method lacks free and useful software frameworks, which could be used by small energy companies. Therefore, they are forced to build their own predictive models.
Over the past few years, numerous methods have been proposed to forecast solar radiation values. All existing methods for predicting the generation of electric energy by solar power plants (SPPs) can be categorized into three main groups [
20]:
Physical models: These models rely on the relationship between weather patterns and solar radiation. They utilize numerical weather forecasting and electricity generation data from stations. Various factors are taken into account, for instance, location, historical orientation data, meteorological variables, characteristics of the photovoltaic installation, and predicted weather variables, like global horizontal illumination. However, the accuracy of forecasts based on physical models largely depends on changes in meteorological elements [
21]. An example of a popular physical model is the numerical weather-predicting (NWP) model, which uses a set of mathematical equations to describe the physical state and motion of the atmosphere. Studies like [
22] propose methodologies to assess the influence of clouds, ambient temperature, and other factors on the hourly output power of photovoltaic systems. Additionally, research in [
21] proposes an approach to forecasting solar and photovoltaic energy through the post-processing of a global ecological multiscale numerical weather-forecasting model. The descriptions of various methods for numerical weather prediction, including methods for using satellite-based techniques, are reviewed in [
23].
Statistical models: These models describe the relationship between solar radiation flux density, obtained using numerical weather forecasting, and the generation of electrical energy at a SPP through the statistical analysis of time series. They can utilize the following techniques: autoregressive (AR), autoregressive moving average (ARMA), autoregressive integrated moving average (ARIMA), autoregressive integrated moving average with exogenous inputs (ARIMMAX), or other forecast models based on artificial intelligence (AI) [
24]. The accuracy of these forecasts depends on the quantity and quality of historical input data [
25]. For instance, the study [
26] presents a two-stage method that involves the statistical normalization of solar energy using a clear-sky model in the first stage before forecasting normalized solar energy using adaptive linear time-series models. The authors apply the ARMA model for the short-term forecasting of solar potential [
27]. Seasonal ARIMA (SARIMA) models are proposed in this study [
28], based on the insolation data from NASA’s POWER (Prediction of Worldwide Energy Resources) data archive.
Machine learning models: These models employ artificial intelligence systems to establish the relationship between forecasted weather conditions and the output power of a power plant. Artificial neural networks (ANNs) are particularly powerful tools for solving problems that require difficult-to-obtain knowledge and they have been extensively used [
29,
30]. ANNs have been proposed and successfully applied for solar radiation forecasting. They provide accurate simulation results by leveraging the self-learning and adaptive capabilities of neural architectures, thus reducing the need for human intervention [
31]. For example, ref. [
32] proposes models using radial basis functions (RBF) and multilayer perceptron for estimating solar radiation based on the preliminary estimation of the clarity index; meanwhile, ref. [
33] investigates the use of backpropagation artificial neural networks (BAN) for estimating global solar radiation.
Deep neural network models have been introduced to model and predict solar radiation data using meteorological and geographical parameters without requiring knowledge of the data generation process [
34]. Developing a reliable model for estimating or forecasting solar radiation data typically requires a significant amount of long-term data [
35]. In addition to neural networks, other machine learning methods are applied when considering issues in solar insolation forecasting, among which, ensemble methods show high accuracy, for instance, XGBoost [
36], gradient boosting, and random forest [
37].
When using adaptive models for forecasting, it is crucial to consider their properties and operational principles. One significant distinction of adaptive models is their ability to exhibit the qualities of a series and continuously account for the evolving characteristics of the studied processes. The primary objective of adaptive methods is to construct self-correcting models that can capture the changing conditions over time, consider the informational significance of all potential elements in the time series, and provide reliable estimates for the future elements of the given series. Consequently, these models directly serve the purpose of forecasting [
38].
One approach in adaptive methods is the piecewise linear approximation method, which entails a reduction in the model’s “memory”. This leads to the “forgetting” of past data and the construction of a regression line based on a limited amount of information. An important advantage of this method is its rapid adaptation when the parameters of the process are changed. However, a drawback of the adaptive method is its susceptibility to interference, small fluctuations, or deviations that can distort the results.
In many instances, hybrid models are formed by combining physical and statistical models. These models aimed to reduce computational complexity and the time required for online applications in forecasting [
39]. For example, authors in [
40] propose a hybrid technique that merges stochastic learning, earth image processing using remote sensing, and ground telemetry. They claim that such hybrid models offer significant benefits in terms of precision and reliability, leading to highly accurate predictions of solar activity.
This study presents the findings of an investigation of the impact of hyperparameters in feed-forward neural networks on the accuracy of the short-term and medium-term forecasting of solar insolation. The contribution can be formulated in two points. Firstly, it is shown that for both different seasons of the year and forecasting horizons, even for compact models, careful tuning of the hyperparameters is required. Secondly, the possible accuracy of operation and the short-term solar insolation forecasting being conducted for the first time for the Gorno-Badakhshan Autonomous Oblast of the Republic of Tajikistan highlights the importance of the sustainable development of the energy sector in this region and improving the quality of people’s lives.
This paper is structured as follows:
Section 2 provides a description of this study’s focus, which is the Gorno-Badakhshan Autonomous Oblast;
Section 3 details the development of the model employed for forecasting hourly solar radiation, along with the obtained results; finally, a summary of the outcomes is provided.
3. Results and Discussion
The metric of accuracy employed in this study is the mean absolute percentage error (MAPE). The dataset was divided into an 80% training set and a 20% validation/test set.
Figure 3,
Figure 4,
Figure 5 and
Figure 6 illustrate examples of the learning process. It can be seen that at the learning rate of 10
−4, the process is too smooth and slow.
The outcomes of the computational experiments for the first variant (employing previous hours that align with the forecast hour within the same month) are presented in
Table 3,
Table 4,
Table 5,
Table 6,
Table 7,
Table 8,
Table 9 and
Table 10. A dashed line in
Table 3,
Table 4,
Table 5,
Table 6,
Table 7,
Table 8,
Table 9 and
Table 10 shows that the learning process was terminated before the specified number of learning stages was reached.
Table 4,
Table 6,
Table 8 and
Table 10 illustrate the process of model training for the best option, selected from the experiments for choosing the best hyperparameters given in
Table 3,
Table 5,
Table 7 and
Table 9. These experiments were conducted for all seasons of the year.
Table 11,
Table 12,
Table 13 and
Table 14 showcase the best results achieved for the combinations of Z + S, Z + A, R + S, and R + A across all seasons when utilizing the first variant. The results demonstrate that the neural network’s hyperparameters greatly influence one another. Certain factors can be determined beforehand, such as the lower learning rate required for SGD compared to the Adam (10
−4 vs. 10
−3, respectively). However, the optimal number of neurons needs to be adjusted accordingly. Furthermore, the ideal model configuration and forecast accuracy exhibit strong dependence on the season. The utilization of the sigmoidal activation function in conjunction with the Adam resulted in the longest process duration for three out of the four seasons while also producing the best results. This highlights the importance of conducting experimental analysis when selecting the activation function following many popular software machine learning frameworks, such as Keras and PyTorch, using the ReLU function by default. Moreover, the obtained results indicate that meteorological conditions (seasonal) exert a significant influence on the performance of even a simple neural network. For each combination of learning methods and hidden-layer activation functions, different hyperparameter values (such as the number of neurons and the learning rate) yield the best results in different seasons.
Considering the deviation between the test and validation sets (a measure of overfitting), it can be concluded that a compact network with no more than 15 neurons should be chosen for this problem.
To provide a clearer understanding of how the hyperparameter values presented in
Table 11,
Table 12,
Table 13 and
Table 14 were determined, the experimental results in
Table 15 and
Table 16 are provided below. These experiments were conducted across all activation functions, training methods, numbers of neurons, and seasons. The total number of tested parameter combinations for each season amounted to 84, calculated as 2 (Adam, SGD) × 2 (ReLU, Sigmoid) × 7 (3, 6, … 21 neurons) × 3 (10
−2, 10
−3, 10
−4 learning rate values).
For the second variant (employing all previous hours during the week, excluding nighttime hours), the best option was determined to be Z + A with a learning rate of a4.
Table 8 illustrates the influence of the count of hidden-layer neurons. The forecasting results, both for one hour ahead (
Table 17) and one day ahead (
Table 18), are highly dependent on the season and necessitate the selection of the optimal number of neurons.
Figure 7 and
Figure 8 depict the learning process.
One intriguing finding suggests that the accuracy of one-hour-ahead forecasts during the summer period is significantly higher compared to the other seasons. However, when predicting weather conditions 24 h in advance, the accuracy diminishes. This suggests that the summer weather in the specific region is relatively stable in the immediate future but becomes less predictable in the short term.
Figure 9,
Figure 10,
Figure 11 and
Figure 12 depict the alignment of the forecast with the actual solar insolation patterns. The curve of solar radiation closely resembles the desired shape for the majority of the days throughout the year. This can be attributed to two key factors. Firstly, the solar radiation data were collected from a highly reliable and precise weather station, ensuring the accuracy of the measurements. Secondly, mountainous areas of the Pamirs region (the high-altitude area) experience minimal cloud cover and a scarcity of cloudy days. This notion is substantiated by statistical records indicating that the region enjoys over three-hundred sunny days annually.
Research findings indicate that the accuracy of short-term forecasting is significantly influenced by the determination of the optimal combination of these factors. Despite its compact nature, the Adam training method has demonstrated a considerable improvement over the traditional SGD in terms of performance. While the ReLU activation function is commonly employed as the default, it has exhibited lower effectiveness compared to the sigmoid function when paired with the Adam method; but, it has shown greater effectiveness when used in conjunction with SGD. In general, for the problem of predicting solar insolation for the examined region, the best results were obtained using the ReLU + Adam combination.
The decline in accuracy during winter can be attributed to a higher frequency of cloudy days (the proportions of sunny days: winter—49%, spring—66%, summer—97%, autumn—80%) and reduced solar insolation levels (the average value of solar insolation in winter is 1.85 times lower than in the rest of the year).
4. Conclusions
To enhance the integration of solar power plants into the conventional energy grid, it is crucial to address vulnerabilities that arise in the network due to the unpredictable nature of these resources. Fluctuations and sudden spikes in power output pose significant challenges for system operators, leading to an impact on system balancing, reserve management, planning, and the availability of generating plants. Consequently, researchers have been focused on developing advanced forecasting methods to predict solar radiation across various timeframes and locations.
Presently, the research in this field primarily revolves around complex neural network models. However, even the simplest models can achieve significantly improved accuracy through appropriate adjustments. Conducted experiments for tuning the hyperparameters of neural networks have underscored the importance of careful selection, even for compact models with minimal layers. Additionally, this study confirms the necessity of developing distinct models for different seasons and weather conditions. The research focuses on shallow neural network models that consist of only one hidden layer to demonstrate the impact of hyperparameters and operational condition choices, even for the most basic machine learning models. Notably, even these simple models exhibit relatively low error rates after tuning.
For the short-term forecasting of solar insolation, the mean absolute percentage error for predicting one hour ahead ranges from 0.6% to 1.7% based on the validation set. This indicates a high level of forecast accuracy, considering the collective research experience on this issue. For one-day-ahead forecasts, the error ranges from 2.6% to 7.2%. The simplicity of these models makes them highly accessible for implementation, including the microcontrollers of individual solar panels, enabling generation prediction based on insolation.
The obtained results of this study have the potential to significantly enhance the efficiency of the planning and operational management of solar power plants in the Republic of Tajikistan due to more precise generation forecasting.