2. Literature Review
This section briefly reviews PV power forecasting approaches based on computational intelligence applied to model-based predictive control (MBPC) and scheduling energy management applications. Performance metrics and results of pertinent studies that may be an object of comparison are also highlighted.
Like airlines, consumer package goods, and oil and gas industries, the electric power industry needs forecasts of supply, demand and price, the so-called energy forecasts, to plan and operate the grid. As electricity cannot be massively stored using today’s technologies, electricity must be generated and delivered as soon as it is consumed. In other words, utilities must balance the supply and demand at every moment.
The type of solar resource to be forecast depends on the installed technology. For concentrating photovoltaics (CPV), direct normal incident irradiance (DNI) must be forecasted. Because of non-linear dependence of concentrating solar-thermal efficiency on DNI and the controllability of power generation through thermal energy storage (if available), DNI forecasts are especially crucial for the management and operation of CPV power plants. DNI is impacted by phenomena that are very difficult to forecast, such as cirrus clouds, wildfires, dust storms, and episodic air pollution events, reducing DNI by up to 30% on otherwise cloud-free days. For non-concentrating systems (i.e., most PV systems), primarily global irradiance (GI = diffuse + direct) on a tilted surface is required, which is less sensitive to errors than DNI since a reduction in clear-sky DNI usually increases diffuse irradiance. For higher accuracy, forecasts of PV-panel temperature are needed to account for the (weak) dependence of solar-conversion efficiency on PV-panel temperature [
6]. This section does not aim to describe the physics behind solar energy PV power generation, as they are well known and widely available sources in the literature. The interested reader can find details in [
7,
8,
9].
Forecasting models for solar irradiance can be broadly classified as [
10]: statistical, cloud-based, and numerical weather prediction (NWP) models.
Statistical models can be further subdivided into linear models (persistent forecasts, using the clearness or clear sky indexes, or Fourier series expansions, AutoRegressive–Moving-Average—ARMA, AutoRegressive Integrated Moving Average—ARIMA or Classification And Regression Trees—CART models) and non-linear, typically computational intelligence-based models (neural networks, wavelet, fuzzy or evolutionary). Cloud-based models use weather satellite images or ground-based sky (GBS) images [
11] to improve solar irradiance forecast. Basically, by processing previous sky images, clouds can be detected, and their motion extrapolated using motion vector fields, or sky cover indices can be obtained, and their evolution forecasted. GBS images have been shown to improve statistical models’ performance for forecasts up to 6 h in advance. NWP models are used to forecast the state of the atmosphere up to 15 days ahead. The main limitation of NWP forecasting is its coarse resolution. Spatial resolution can be as low as 1 Km, but the minimum temporal resolution is typically 3 h, which does not allow its use for any patterns less than this value.
The choice of solar-forecasting method depends strongly on the timescales involved, which can vary from horizons of a few seconds or minutes (intra-hour or very short forecasts, for control and adjustment actions), a few hours (intra-day or short/medium, for energy resource planning and scheduling as well as for the electricity market), to a few days ahead (intra-week or long, for unit commitment and maintenance schedules) [
12]. The ability to forecast solar irradiance in near-real-time, or nowcasting, is crucial for network operators to guarantee power grid stability with specific reference to power plant operations, grid balancing, real-time unit dispatching, automatic generation control and trading, and for home energy management systems. As this work aims to incorporate the PV power forecasts in HEMS, the authors are interested in real-time, intra-hour and intra-day forecasts.
The accuracy of solar radiation forecasting has direct financial impacts, at various levels. For example, in [
13] the effect of DNI forecast accuracy on concentration solar thermal (CST) plants’ financial value, incorporating energy storage is examined. When the RMSE of a 48-h DNI forecast is between 325 and 400 W/m
2, a 1 W/m
2 improvement increases the financial value by
$400–1300 per six months’ operation for a CST plant. Excellent reviews of solar irradiance forecasting are available, such as [
10,
14,
15,
16,
17,
18].
Focusing now on PV power generation forecasting, it is, in a simplistic definition, the application of techniques that use registered historical data to make informed prediction/estimation of the future value that PV power will assume in a defined prediction horizon. These forecasts can be obtained directly, i.e., a model is developed using inputs lags of the endogenous variable and, possibly, exogenous ones. Alternatively, forecasting models of those exogenous variables are designed first, and their outputs are fed to a static model which generates the forecasts of the PV power. In [
19], the correlations between PV power output and several input variables are explained and computed, concluding that the solar irradiance’s correlation achieves large values such as 0.988, followed by the panel temperature and the ambient temperature.
The same categories of models pointed out for solar irradiance forecasting apply to PV generation forecasting [
12,
20,
21,
22,
23].
Throughout the years, the use of computational intelligence models for forecasting applications has been steadily increasing and, this way, are the focus of this synthetic survey presented in the next sub-section. Computational learning is a mathematical and theoretical field of artificial intelligence that analyses machine learning algorithms’ design and their learnability capacity while improving the necessary functions’ accuracy and efficiency. These models are based on estimating a function deduced only from samples of training data describing a specific system’s behavior and are well suited when physical features are not known [
20]. As these are data-based models, the lack of accurate data can become an issue for computational learning methods [
20], as the accuracy is strongly dependent on the quality and amount of the available data.
The most used computational learning architectures used for PV power forecasting are shallow and deep artificial neural networks, support vector machine and fuzzy models.
For solar power forecasting, ANNs are among the most employed machine learning techniques. Artificial neural networks are inspired in the biologic neuron and networks. A neural network’s fundamental processing element is a neuron, which performs a simple computation on its inputs. Neurons are typically grouped into layers. The network usually consists of an input layer, some hidden layers, and an output layer. ANNs can somehow be classified by the neuron model employed, number of layers, neurons interconnection and the training mechanism. The forecasting performance of an ANN model depends not only on the above items, but also on the design data, and the inputs employed.
The ANN model used in this work is a radial basis function (RBF) Neural Network (NN). The hidden neurons employ a radial type of function, typically a Gaussian, their outputs being linear combined afterwards. Thus, the output of an RBF model is given by:
In (1),
y[
k] denotes the output, at instant
k,
i[
k] is the
jth input at that instant,
w represents the vector of linear weights,
C(
j) represents the vector (extracted from the
C matrix) of the centers associated with hidden neuron
j,
σj is its spread, and ||
2 denotes the Euclidean distance. RBFs, as well as multilayers perceptrons (MLP), B-spline neural networks and others, are nowadays called shallow ANNs [
24], in contrast with deep architectures, employing a larger number of layers, such as deep neural networks, deep belief networks, long-short term memory neural networks (LTSM) and convolutional neural networks (CNN).
Yang and co-workers [
25] proposed a hybrid scheme, involving classification, training, and forecasting stages. In the classification stage, self-organizing maps and learning vector quantization networks classify the collected historical data of PV power output. The training stage employed the support vector regression to train the input/output data sets for temperature, probability of precipitation, and solar irradiance of defined similar hours. In the forecasting stage, the fuzzy inference method was used to select an adequately trained model for an accurate forecast. This scheme is used for one-day ahead hourly forecasting of PV output.
Leva and co-workers [
12] used an MLP to forecast the hourly day-ahead PV power, based on weather forecasts, and power and irradiance measurements. Having available a 72-h weather forecast, in each day x an hourly forecast for day x + 1 was produced, between sunrise and sunset.
In [
26], a 72 h deterministic and probabilistic forecast of PV power output was developed using MLPs combined with an analog ensemble, using as inputs forecasts of global irradiance, cloud cover and atmospheric temperature, obtained from a numerical weather prediction model, as well as solar azimuth and elevation. Values of RMSE/C (C is the rated power of the PV system) in the range of 8 to 8.7% were obtained.
Rana and co-workers [
27] provided twelve different forecasts, from 5 to 60 min ahead, obtained with 12 different prediction models. The authors employed MLPs, arranged in different ways: using single and ensemble models, and using only the past PV power measured values (univariate) and combined with solar irradiance, atmospheric temperature and relative humidly, and wind speed (multivariate). A correlation-based feature selection algorithm was employed for each model, to select the inputs from a set of lags until one-week before, i.e., 12 × 24 × 7 = 840 lags for a univariate model, and 5 × 840 = 4200 lags for multivariate models. In [
28], PV power was forecasted for a solar power plant located at the Applied Science Private University, in Jordan. One day-ahead PH was considered, using the measured irradiance values of the five previous days, obtained at the same hour. Employing MLPs, R-squared values for the training, testing and validation sets of 0.97 were achieved.
Addressing now deep architectures, in [
29] one day-ahead forecasting was obtained using generative adversarial networks and convolution neural networks to classify weather types, and with trained forecasting PV power models for each type of weather. Unfortunately, no information about the quality of the forecasts was supplied. Zhou et al. [
30] employed two long-short-term memory neural networks for PV power output forecasting and atmospheric air temperature, obtaining forecasts for PHs from 7.5 min up to 1 h, using 7.5 min sampling intervals.
Wang and co-workers [
31], developed three forecasting models: convolutional neural network, long-short-term memory neural network and an hybrid model (combining the other two models). They used the PV power output itself, the atmospheric air temperature, and the global solar irradiance. MAPE values varying from 2.2 to 11.2% were obtained for a one-step-ahead (5 min) prediction.
Li and co-workers [
32] propose a hybrid deep learning approach based on CNN and LSTM for PV output power forecasting. The CNN model is intended to discover the non-linear features and invariant structures exhibited in the previous output power data, thereby facilitating PV power prediction. The LSTM is used to model the temporal changes in the latest PV data and predict the next time step’s PV power. Then, the prediction results in the two models are comprehensively considered to obtain the expected output power.
Hossain and Mahmood [
33] employ an LSTM neural network, using as exogeneous variable a synthetic weather forecast. A synthetic weather forecast is created for the targeted PV plant location by integrating the statistical knowledge of historical solar irradiance data with the publicly available sky forecast of the host city. They compared their proposal with recurrent neural networks, generalized regression neural networks and extreme learning machines, for different intraday horizon lengths in different seasons.
The primary purpose of improving the accuracy of solar power forecasts is to reduce the uncertainties related to this type of intermittent energy source, resulting in safer and easier energy management. As solar penetration increases in the energy portfolio, the impact of incorrect forecasts in the grid and HEMS can be larger [
1].
A particular model’s performance and accuracy can be assessed via several metrics, which allow the comparison between different models and locations. Each one focus on a certain aspect of point distribution. Thus, there is no unique metric valid for all situations; instead, each one adds some information about the model’s accuracy. In the bibliography, several metrics can be found [
34], although there is a group that is more commonly used, such as mean absolute error (MAE) (2), mean relative error (MRE) (3), normalized MAE (NMAE) (4), mean absolute percentage error (MAPE) (5), root mean square error (RMSE) (6) and coefficient of determination (R
2) (7). For this reason, these performance criteria will be used in this work, in Case Study 2.
In the previous equations, n is the number of samples, yt is the measured tth value, is the predicted value, C is the rated power of the PV system, and r is the range of the measured variable.
Independently of the metrics used to assess the proposed models’ performance, some other factors hamper comparisons among studies. According to [
1], they are climatic variability, day/night values and normalization, sample aggregation, testing period, and specific system attributes.
Table 1 presents the summary of pertinent comparable studies in terms of the method used, the prediction horizon, sampling time, input variables, and the model’s accuracy.
Although the works described above are general techniques for short-time and intra-day PV power forecasting, the majority cannot be applied for model-based predictive control and HEMS scheduling. The reason is that MBPC uses predictive models, that should output the modelled variable’s forecasts for each step-ahead within the PH considered, i.e., provide multi-step-ahead forecasting. This type of forecasts can be achieved in a direct mode, by having several one-step-ahead forecasting models, each providing the prediction of each-step ahead within PH. This is the approach followed in [
27,
32]. An alternative, which is the one followed in this work, is to use a recursive version. In this case, only one model is necessary, and for each step within the PH, the inputs change, eventually employing predictions obtained in previous steps. This way, the RBFs models described in (1) are used as NAR (nonlinear autoregressive) models or NAR models with eXogenous inputs (NARX). Denoting as y the modelled variable, and considering only one exogenous input, v, for the sake of simplicity, the estimation (
), at instant
k, can be given as (8):
In (8), f(.) represents the RBF model (1), which means that its arguments (the delays of
y and v) represent the network input vector,
i[
k]. As the objective is to determine the evolution of the forecasts over a prediction horizon, (8) is iterated over that horizon. For
k + 1, we shall have:
Depending on the indices of the delays, and the steps within the prediction horizon, no measured values may exist for one or more terms in the argument (8). These must be obtained using previous predictions. This way, the computation of the predictions over a prediction horizon PH may require PH executions of the model (8).
Please note that, if multi-step-ahead forecasting is considered (as is the case of this work), the metrics (2–7) can be computed for each step within the prediction horizon. Longer PHs tend to increase forecasting errors, especially under abnormal weather conditions [
6], and with an increased interval between samples and with an increasing number of steps. This is more problematic for recursive multi-steps-ahead forecasting methods, as the errors propagate through the steps.
5. Discussion
For assessing the quality of the forecasting results obtained with the proposed approach, they should be compared with the results achieved with the works referenced in
Table 1. It should be noted that a completely fair comparison is not possible, as data used in these works are all different, the prediction horizons are not the same, and the majority of the papers only supply one-step-ahead predictions.
The only work that achieves multi-step forecasts is [
27], employing a direct mode. This work uses 12 models that output forecasts with a time-step of 5 min, supplying forecasts between 5 min and 1 h. The MRE reported ranged from 4.2 to 9.3%. The MRE values obtained by our approach are shown in
Figure 13.
The MRE obtained 1 h ahead in [
27] is 9.3%. Our approach achieves a value of 1.3% for the same prediction horizon, which is seven times smaller.
All the other works only supply one-step-ahead forecasts. A comparison of the results obtained with the multi-step-ahead forecast is not fair to our approach, especially when one-step-ahead forecasts correspond to many steps-ahead of the multi-step forecasts. However, this is what will be done as there is no other alternative.
The work presented in [
25] achieves an MRE of 3.3% for a PH of 1 day. In our work, the 48-steps forecasts correspond only to 12 h, with a value of 1.5%. Assuming that a 96 steps prediction would follow the same trend of
Figure 12, our approach would obtain a smaller MRE.
The work presented in [
12] achieves NMAE values between 5.2 to 13.2% for a Prediction Horizon between 25 and 48 h ahead. Considering a rated power of 6 kW, this work’s approach obtains NMAE evolutions shown in
Figure 14, this is, below 1.5%.
Again, speculating that a higher number of steps prediction would follow the same trend of
Figure 10, our approach would obtain a smaller NMAE.
The work presented in [
30] supplies forecasts of up to 1 h, with MAPE values between 24.7% and 37.8%. The MAPE values obtained with our approach are shown in
Figure 15. For a 1-h forecast (four-steps-ahead) the MAPE is 8.0%.
In [
31], the MAPE criterion is also employed. For a 5 min forecast, MAPE ranges between 2.2% and 11.2%. In our approach, the MAPE for a 15 min forecast is 8.0%.
The work presented in [
33] also employs the MAPE criterion. For summer months, for 6, 12, and 24 h the values obtained were 28.6%, 38.5% and 37.8%. In our approach, 9.1% and 9.4% were obtained for 6 and 12 h ahead.
Work [
28] shows their results in terms of the R-squared (R
2) criterion. For a one-day forecast, the value obtained is 0.97.
Figure 16 depicts the evolution of the coefficient of determination within the PH. Speculating again that a higher number of steps prediction would follow the same trend of
Figure 16, our approach would obtain a much higher R
2 value.
Work [
32] uses a direct mode to supply multi-step ahead PV forecasts with a prediction horizon of 3 h, with steps of 15 min. Different models are designed for each season. For spring, summer, fall and winter, the coefficient of determination ranges from 0.998 to 0.918, 0.999 to 0.964, 0.996 to 0.907 and 0.997 to 0.905, respectively, between 15 min and 180 min ahead. Our second case uses data from spring and summer, with considerable better results.
To conclude the discussion, the authors would like to highlight that this predictive model is only one of the models necessary for our HEMS. Models of electric consumption were developed by the authors using the same methodology [
30,
33,
44] and will be integrated into a MBPC HEMS scheme as the one described in [
54]. Furthermore, in future studies the effect of dust on the PV panels will also to be considered, following the guidelines of [
55], where an experimental analysis was developed for different dust types to evaluate their impact on the power output of the modules.