Next Article in Journal
Medium-Term Probabilistic Forecasting of Extremely Low Prices in Electricity Markets: Application to the Spanish Case
Previous Article in Journal
Measuring the Dynamic Characteristics of a Low Specific Speed Pump—Turbine Model
Previous Article in Special Issue
Wind Speed Prediction Using a Univariate ARIMA Model and a Multivariate NARX Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Solar Radiation Forecasting, Accounting for Daily Variability

1
Department of Industrial and Information Engineering, Second University of Naples via Roma n. 29, Aversa 81031, Italy
2
Department of Electrical Engineering and Information Technology, University of Naples Federico II, Via Claudio, n. 21, Napoli 80125, Italy
*
Author to whom correspondence should be addressed.
Energies 2016, 9(3), 200; https://doi.org/10.3390/en9030200
Submission received: 30 October 2015 / Revised: 14 February 2016 / Accepted: 7 March 2016 / Published: 15 March 2016

Abstract

:
Radiation forecast accounting for daily and instantaneous variability was pursued by means of a new bi-parametric statistical model that builds on a model previously proposed by the same authors. The statistical model is developed with direct reference to the Liu-Jordan clear sky theoretical expression but is not bound by a specific clear sky model; it accounts separately for the mean daily variability and for the variation of solar irradiance during the day by means of two corrective parameters. This new proposal allows for a better understanding of the physical phenomena and improves the effectiveness of statistical characterization and subsequent simulation of the introduced parameters to generate a synthetic solar irradiance time series. Furthermore, the analysis of the experimental distributions of the two parameters’ data was developed, obtaining opportune fittings by means of parametric analytical distributions or mixtures of more than one distribution. Finally, the model was further improved toward the inclusion of weather prediction information in the solar irradiance forecasting stage, from the perspective of overcoming the limitations of purely statistical approaches and implementing a new tool in the frame of solar irradiance prediction accounting for weather predictions over different time horizons.

1. Introduction

Many photovoltaic (PV) applications, such as the sizing of stand-alone microgrids or sizing of energy storage systems for their inclusion in standalone (or grid connected) systems or day-ahead market offering, require forecasting PV production variability along different time scenarios, often also short and very short terms. For this purpose, it is necessary to forecast solar radiation on these time scenarios to account for weather variations during the day and even within each hour of the day.
Significant research is currently being devoted to the development of models to predict solar radiation and PV power. Common operational approaches to solar radiation forecasting include: (1) numerical weather prediction (NWP) models that infer local cloud information through the dynamic modeling of the atmosphere up to several days ahead [1]; (2) models using satellite remote sensing or ground-based sky measurements to infer the motion of clouds and project their impact in the future; (3) statistical time series models based on measured irradiance data applied for very short term forecasting in the range of minutes to hours [2,3,4,5,6,7,8,9,10,11,12,13].
In this context, daily variability of solar irradiance and its statistical characterization (and forecasting) is important. This has been treated in the relevant technical literature in [3,4,5,6,7,8,9,10,11,12,13,14,15,16], where the clear sky index and the clearness index (or the normalized clearness index [12]) are typically used to describe solar irradiance and radiation.
In particular, in [14,15,16] the variability of solar irradiance is captured by observing the changes in clear sky index for a considered time interval. The primary usefulness of the clear sky index is the removal of diurnal and seasonal signals from a given set of radiation data to compute fluctuation power content ([17]).
Papers [3,4,5,6,7,8,9,10,11,12,13] refer to the clearness index and emphasize that the probability density distributions of the clearness index on short-term intervals, unlike those considered in longer intervals, present a multimodal nature [3,12]. In [13] it is stated that the instantaneous solar irradiance (in terms of clearness index) has bimodal probability distributions with peaks corresponding to clear and cloudy conditions. Regarding the statistical characterization of solar quantities, several attempts were proposed to identify the distribution functions that best fit solar data on a short time basis, some based on the Boltzmann statistics ([3,4]), others referring to the mixture of two normal distributions ([5,6]) or two bi-exponential probability density functions ([7,8]). These aspects make such methods mainly suitable for very short time forecasting with no exogenous inputs, as shown in [2], where five different techniques were assessed for one hour ahead and two hours ahead power forecasting of PV plants.
A common characteristic of the aforementioned models is that they refer to a unique parameter accounting for solar irradiance variations.
In [18] a statistical model referring to the clear sky index was proposed to characterize solar radiation, taking into account the daily variability in terms of two proper parameters. More specifically, solar irradiance was described as the sum of two components, characterized by specific quantities that take into account mean daily weather conditions and instantaneous variations of the solar irradiance, respectively. It was demonstrated that experimental characterization of the distributions of the two introduced quantities was suitable for a statistical characterization and for generating synthetic time series characterized by both daily and instantaneous variability starting from experimental probability distributions.
This paper introduces an evolution of the model proposed in [18]. The statistical model is developed with direct reference to the Liu-Jordan clear sky theoretical expression but is not bound by specific clear sky models; it accounts separately for the mean daily variability and for variation of solar irradiance during the day by means of two corrective parameters. The first parameter accounts for the mean daily variability, while the second accounts for the variation in solar irradiance during the day. This new proposal allows for a better understanding of the physical phenomena and improves the effectiveness of statistical characterization and subsequent simulation of the introduced parameters to generate synthetic solar irradiance time series, also starting from parametric analytical distributions. The analysis of the parametric analytical distributions that best fit the experimental distributions of the two-parameter data was also developed with reference to field measurements [19], obtaining opportune single parametric distributions or mixtures of more than one distribution. Moreover, the model’s ability to combine with weather predictions was investigated, starting from the consideration that changing the variability time scale (e.g., introducing two or more sub-parts of the day) seems to allow for using it for both long-term and short-term scenarios. Thus, a further model improvement was developed by including weather prediction information in the forecasting stage, thus overcoming the limitation of a purely statistical approach. In the application proposed in this paper, the model was tested only for day-ahead forecasting, even if the prospect of utilizing it in other time scenarios, with various forecasting techniques, seems promising.
The remainder of the paper is organized as follows. Section 2 includes a description of the model; Section 3 describes the statistical analysis stage. Simulation and forecasting are developed in Section 4 and Section 5, while Section 6 reports the results of field measurement analyses. Finally, conclusions are drawn in Section 7.

2. Proposed Model

The proposed model is an evolution of that proposed in [18], introducing a new expression decomposing the measured solar irradiance at time n t (i.e., the nth sample of the jth day of the mth month), R j , m ( n t ) , in two components:
R j , m ( n t ) = S LJ j , m ( n t ) · [ C m · k j , m + ε j , m * ( n t ) ] = S LJ j , m ( n t ) · C m · [ k j , m + ε j , m * ( n t ) ]
where:
-
S LJ j , m ( n t ) is the “clear sky” theoretical model developed by Liu-Jordan at ground level [20,21];
-
Cm is the monthly clear sky index for the mth month;
-
k j , m , mean daily variability for the jth day of the mth month, is a daily parameter, which plays the role of correction factor of C m accounting for specific daily weather conditions;
-
ε j , m * ( n t ) , instantaneous variability for the nth time interval of the jth day of the mth month, is an additive “noise” component accounting for variations of the solar irradiance during a day;
-
ε j , m * ( n t ) is equal to ε j , m * ( n t ) / C m .
It is worth noting that in Equation (1) any clear sky model (other than that proposed by Liu & Jordan) can be used without compromising the validity of the proposed model.
The second expression in Equation (1) is a useful alternative to the first and allows for establishing correspondences with popular simplified forms as it will appear clearly in the following sections (e.g., in Section 4, Equation (14)).
It is worth noting that the following constraints apply for k j , m and ε j , m * ( n t ) values:
0 k j , m 1 C m
C m · k j , m ε j , m * ( n t ) 1 C m · k j , m
The parameter C m is calculated for the mth month as:
C m = W o m W LJ m
where W o m is the historical mth month average value of the daily solar radiation issued by meteorological services [19,22]; W LJ m is the mth month average value of the daily theoretical Liu-Jordan radiation, that is:
W LJ m = t N m · j = 1 N m [ n = 1 N n S LJ j , m ( n t ) ]
being N m the number of days of the mth month and N n the number of ∆t of a day.
The formulation corresponding to Equation (1) given in [18] is:
R j , m ( n t ) = S W j , m ( n t ) · k j , m + ε j , m ( n t )
and is based on:
-
S W j , m ( n t ) , which is the historically expected irradiance at ground level for nth sample of the jth day of the mth month, in absence of daily variability and instantaneous weather variability respect to historical behavior, that is:
S W j , m ( n t ) = S LJ j , m ( n t ) · C m
-
k j , m as in Equation (1),
-
ε j , m ( n t ) given by:
ε j , m ( n t ) = S LJ j , m ( n t ) · [ ε j , m * ( n t ) ]
Figure 1a,b show, for the 11th and 12th of August, respectively, of 2004 for a site in the south of Italy, the profiles of the measures [19] of solar irradiance, R j , m ( n t ) , of its components k j , m · S W j , m ( n t ) and ε j , m ( n t ) and of the expected irradiance at ground level, S W j , m ( n t ) according to Equation (6). Of course, a similar interpretation of the same figure can be effected in terms of the components given by Equation (1).
In the figure, the regularity of the red and dotted black curves can be observed; they have the same shape as the theoretical Liu-Jordan profile multiplied by Cm; the red curve derives from the product of the black curve and k j , m . The measured solar irradiance, R j , m ( n t ) , (blue curve), is affected by irregularities deriving from the additional “noise” ε j , m ( n t ) (green curve).
The 11th of August was characterized by a daily radiation (area of the red curve) almost equal to that expected (area of the black curve); the irregularities appear concentrated in some specific parts of the day. The 12th of August was characterized by a daily radiation (area of the red curve) sensibly lower than that expected (area of the black curve); the irregularities appear to be scattered along the day with morning hours better than those of the afternoon.
The application of the proposed model includes two stages: (1) statistical analysis of irradiance quantities summarized in the flow chart of Figure 2 and detailed in Section 3; (2) simulation and forecast of the solar irradiance summarized in the flow chart of Figure 3 and detailed in Section 4.

3. Statistical Analysis

The statistical characterization of the parameters k j , m and ε j , m * ( n t ) , which refer to the jth day of the mth month, is performed by identifying their distributions at each month m. This stage includes the derivation of the distributions of the parameters and their statistical characterization.
Inputs are the historical data of solar irradiance measured at the specific location where the procedure is applied.

3.1. Experimental Distributions

Starting from the measurements of solar irradiance R j , m ( n t ) , the k j , m parameter can be evaluated from Equation (1) as follows:
k j , m = 1 N n R j , m ( n t ) 1 N n ε j , m * ( n t ) · S LJ j , m ( n t ) C m 1 N n S LJ j , m ( n t ) = 1 N n R j , m ( n t ) C m 1 N n S LJ j , m ( n t )
with N n number of samples in a day and 1 N n ε j , m * ( n t ) · S LJ j , m ( n t ) = 0 .
The parameter k j , m —which is different from the clearness index typically used for characterizing the solar radiation—assumes values lower than 1 on cloudy days, when the total energy registered is lower than the average energy expected in that month according to historical data, and assumes values greater than 1 on sunny days. Its monthly expected value, for a huge number of observations, is equal to 1.
With reference to ε j , m * ( n t ) , the following relation can be derived from Equation (1):
ε j , m * ( n t ) = R j , m ( n t ) S LJ j , m ( n t ) · C m · k j , m S LJ j , m ( n t )
Note that the values of all the Nn + 1 parameters for each day can be evaluated only at the end of the day—that is, when the measurements R j , m ( n t ) for all the Nn samples of the day have been carried out. Once the data for k j , m and ε j , m * ( n t ) have been collected, a statistical analysis can be performed with reference to the mth month of the year, obtaining the experimental joint distribution f K m , E m * or the experimental marginal distributions f K m and f E m * . Details on the statistical behavior of these quantities are provided in Section 3.2.
In order to statistically characterize the above parameters, the procedure was tested in [18] to derive the daily joint probability density function (jpdf) f k j , m ,   ε j , m or, as an alternative, the marginal pdfs ( f K m , f E m ). Considering the marginal is preferable due to the scarcity of available data and to practical reasons without relevant effects on the accuracy. For both the parameters, k j , m and ε j , m ( k t ) , the analysis was limited to the case of experimental probability mass functions (pmf) as obtainable from the data observed.

3.2. Parametric Distributions

We focused our attention on the analysis of the parametric pdfs that best fit the distributions of the data experimentally obtained. This approach, in fact, has the advantage of simply referring to specific values of the parameters of the distributions of the random variables K m and E m * , thus making the data storage very efficient and the simulation/forecast procedure extremely straightforward.
In the following subsections, the parametrical characterizations of the distributions of k j , m and ε j , m * ( k t ) are shown separately.

3.2.1. Daily Variability Parameter

From the observation of the pmfs of K m at each month of the year, it emerged that they always exhibited a multimodal nature. It also emerged that the different seasons were typically characterized by very different behaviors of the pmfs. Some months of the year, for example, are characterized by scattered distributions in correspondence of low values of k j , m which appear almost uniformly distributed, whereas an increased concentration of the occurrences can be observed for higher values of k j , m , with the majority of occurrences around a specific value, analogous to Gaussian behavior. In other cases, the multimodality is characterized by shapes that are more similar to either Gaussian or Weibull distributions, with modes appearing for both the low and high values of k j , m . In the most general case, the heterogeneity of the distributions of k j , m at each month could be represented by the mixture of two or more distributions that can be expressed as weighted sum of pdfs:
f ( x ) = i = 1 N w i · f i ( x )
where f i ( x ) are proper pdfs, each characterized by specific values of its parameters and w i are opportune weighting factors.
In our analysis, several options were considered for fitting the distributions of data with probability density curves. For each of them, the root mean square error (RMSE) was evaluated between f(x) and the experimental pmf in order to choose the most suitable parametric distribution to reproduce the selected data. In all the cases, it emerged that a mixture of two pdfs could be an acceptable trade-off between accuracy and simplicity.
The distributions used in this paper to compose the mixtures were Gaussian, Weibull, and uniform, whose analytical expressions are reported in Table 1.

3.2.2. Instantaneous Variability Parameter

A mono-modal nature of the distribution of the instantaneous variability parameter, ε j , m * ( n t ) , was observed for all months of the year. The behavior was found to be very well approximated by using a T-location scale distribution:
Γ ( p 3 + 1 2 ) p 2 p 3 π Γ ( p 3 2 ) [ p 3 + ( x p 1 p 2 ) 2 p 3 ] ( p 3 + 1 2 )
where p 1 is the location parameter, p 2 is the scale parameter, and p 3 is the degree of freedom.
The Gaussian distribution was also tested but the results always appeared to be worse than those obtained with the T-location scale.

4. Simulation of Solar Radiation

Simulation results are illustrated for the purely statistical case.
Starting from the pdfs of the two variability parameters, k j , m and ε j , m * ( n t ) , it is possible to simulate the solar irradiance with reference to each n t of the jth day of the mth month as:
R ^ j , m ( n t ) = S LJ j , m ( n t ) · [ C m · k j , m + ε j , m * ( n t ) ]
where k j , m and ε j , m * ( n t ) assume the values obtained by means of opportune extractions from the marginal parametric distributions f K m and f E m * ; C m and S LJ j , m ( n t ) assume values deriving from their definitions. R ^ j , m ( n t ) , evaluated according to Equation (13), is able to reproduce synthetic time series of solar irradiance whose variability is similar to that observed in the building stage of f K m and f E m * .
It is useful to recall that the expression typically used in the design stage to evaluate the power producibility is based on the simple expression:
R ^ j , m ( n t ) = S LJ j , m ( n t ) · C m
able to reproduce the expected value of the irradiance without taking into account the phenomena of daily and instantaneous variability.

5. Forecasting Solar Radiation Including Weather Predictions

Forecasting solar radiation including weather predictions requires a more complex procedure than the simple use of Equation (13), as illustrated in Figure 4.
Here reference is made to day-ahead forecasting, but the procedure can be implemented in different time scenarios depending on the specific weather prediction model used. For this purpose, it has to be noted that in principle any weather prediction model can be used; of course, the critical aspect is to provide an interface procedure to convert the information coming from weather prediction into corresponding proper conditioning of the statistical distributions of k j , m and ε j , m * that our statistical approach utilizes.
In what follows, the aforementioned critical aspect is considered to be already solved and the availability of properly conditioned pdfs is assumed. It is worth noting that, in this framework, the statistical approach proposed in this paper could play the role of representing the variability characteristic observed in the past, improving, as a statistical post-processing, the outputs of NWP models.
First of all, k j , m values should be extracted according to day-ahead weather predictions with reference to the conditioned pdfs expressed, for instance, as:
f ( K m | α j < k ˜ j , m < β j )
where k ˜ j , m is the weather prediction for k j , m and α j and β j the edges of the interval of confidence of k ˜ j , m . The conditioned pdfs can be easily derived from the original unconditioned f K m .
More complex is the inclusion of weather prediction for ε j , m * ( n t ) values. A possible procedure can be:
  • division of the daylight hours in Ni subintervals;
  • expression of weather prediction for each ith subinterval of duration ∆i as an opportune value of k ˜ i , j , m , α i , j and β i , j ;
  • evaluation of k ˜ j , m as opportune weighted mean of k ˜ i , j , m :
    k ˜ j , m = i = 1 N i k ˜ i , j , m · i i = 1 N i i
  • evaluation of opportune intervals of variability for ε j , m * ( n t ) for each ith subinterval, as:
    γ i , j , m ( C m k ˜ i , j , m ) ε i , j , m * ( n t ) δ i , j , m ( 1 C m k ˜ i , j , m )
    with γ i , j , m and δ i , j , m assuming values minor or equal to 1 deriving from weather prediction;
  • Extraction of the ki,j,m values for each of the Ni subintervals of the jth day according to the day-ahead weather prediction from conditioned distributions f ( k m | α i , j < k ˜ i , j , m < β i , j ) analogously to Equation (15);
  • Extraction of the ε i , j , m * ( n t ) values according to day-ahead weather predictions, for instance, with reference to the conditioned pdfs expressed as:
    f ( E m * | G i , j , m < ε ˜ i , j , m * ( n t ) < D i , j , m )
    with G i , j , m = γ i , j , m ( C m k ˜ i , j , m ) and D i , j , m = δ i , j , m ( 1 C m k ˜ i , j , m ) .
For the ith subinterval the forecasted irradiance is given by:
R ^ i , j , m ( n t ) = S LJ j , m ( n t ) · [ C m · k i , j , m + ε i , j , m * ( n t ) ]

6. Field Measurement Analysis

In order to test the proposed model, the data, related to six-year observations (years 2004–2009) from a location in the south of Italy whose latitude and longitude are 40°48.8′, 14°20.3′ [19], were analyzed. The years 2004–2008 were used to calibrate the model (identification period) while the year 2009 was used to validate the model (validation period). The data used represent the global horizontal solar irradiance and are available on a 10 min basis, already filtered in order to clean values deriving from measurements errors or absence of measurements. Due to the characteristics of the proposed method, the data were utilized without any ad hoc time series pre-processing finalized to extract trends or seasonality. In the following subsections, the results of the fitting of the aforementioned data (i.e., K m and E m * ) are reported first. Then, the results of the application of the model for forecasting purposes are shown together with an evaluation of the method’s performance.

6.1. Parametric Distribution Fitting

The results are reported separately for mean daily variability and instantaneous variability.

6.1.1. Mean Daily Variability

Table 2 reports the parameters of the distributions and weights of their mixture chosen to fit the K m data with reference to all months of the year. The values of the corresponding RMSEs are also reported in the table. The analysis was effected for all months of the year, each characterized by a proper combination of distributions. Several attempts were made to find the best combination of the above parametric distributions. The choice fell on those characterized by the minor value of the RMSE.
In order to provide a graphical example, Figure 5 shows the pmfs and their fitting parametric distributions for the months of January, April, July, and October (representatives of the four seasons) where the experimental pmfs were fitted by the mixture of (1) Uniform and Gaussian distribution in the case of January; (2) two Gaussian distributions in the case of April; and (3) Weibull and Gaussian distributions in the cases of both July and October.
From the analysis of the figure, the bimodal nature of f K m clearly appears. It can also be observed how the selected mixtures of parametric distributions quite accurately approximate the pmfs. Figure 5 also reveals that, in some months, more than two modes could be identified for the pmfs. The results of the analysis performed, however, revealed that mixing more than two parametric distributions doesn’t significantly improve the statistical characterization of K m .

6.1.2. Instantaneous Daily Variability

The analysis performed for E m * , for all months of the year, allowed us to identify the parameter values associated with the t location-scale distribution of each month. Table 3 synthetically reports the results of the analysis.
The pmfs and their fitting parametric distributions for the months of January, April, July, and October are reported in Figure 6, showing the suitability of this distribution for E m * .

6.2. Forecasting Application

Starting from the parametric pdfs of K m and E m * , the forecasting procedure described in Section 4 was applied to several tests performed with reference to all months of the year. For the sake of brevity, in what follows, only the results referring to the day-ahead forecasts for the months of January, April, July, and October are reported.
The results of the application of Parametric Marginal distribution functions (PARAMETRIC) have been compared with those obtained by means of Experimental Marginal distribution functions (EXPERIMENTAL) [18] and the Persistence method (PERS, one of the most commonly used reference methods [23]).
In order to compare the forecast methods performances, the mean average % error, MAEj,m, the mean bias % error, MBEj,m, and the root mean square % error, RMSEj,m, have been calculated for each day of the mth month:
MAE j , m = 1 N n n = 1 N n | R ^ j , m ( n t ) R j , m ( n t ) | MR j , m · 100
MBE j , m = 1 N n n = 1 N n ( R ^ j , m ( n t ) R j , m ( n t ) ) MR j , m · 100
RMSE j , m = 1 N n n = 1 N n ( R ^ j , m ( n t ) R j , m ( n t ) ) 2 MR j , m · 100
with MRj,m being the daily mean value of the measured solar irradiance evaluated over the Ns,j,m samples of the day with solar irradiance different from zero:
MR j . m = n = 1 N s , j , m R j , m ( n t ) N s , j , m
Skill Scores, evaluated with reference to the Persistence method of MAE and RMSE, have been also evaluated by means of the following expressions where METH refers either to EXPERIMENTAL or PARAMETRIC forecasting methods:
SS MAE , j . m METH = 1 MAE j , m METH MAE j , m PERS  
SS RMSE , j . m METH = 1 RMSE j , m METH RMSE j , m PERS
Figure 7 reports MAEj,m values for the two different forecast methods with reference to each day of: (a) January, (b) April, (c) July, and (d) October.
It is possible to observe that:
  • For all the months considered, the daily error varies in a quite wide range (e.g., in October from less than 10% on the 20th to almost 50% on the 13th);
  • The performances of the methods are very close to each other.
Monthly mean values of the normalized MAE, normalized MBE, and normalized RMSE for the two methods are reported in Table 4, together with the corresponding values for the Persistence method, for the same months considered in Figure 7.
Table 5 reports Skill Scores obtained using the Persistence method as reference of the monthly mean of MAEj,m and of RMSEj,m for the months given in Figure 7. EXPERIMENTAL refers to the use of the experimental marginal distribution functions proposed in [18] and PARAMETRIC refers to the use of the parametric distribution functions proposed in this paper.
It is possible to observe that:
  • Both methods presented by the authors always have better performance than the Persistence method (Table 4);
  • The use of parametric distributions gives almost the same performance as the experimental distributions (Table 4);
  • Skill Scores for both MAE and RMSE quantify the improvement of the performance with respect to the Persistence method (Table 5);
  • The performance of the proposed methods is also better for the month of July, when more stable weather conditions allow the Persistence method to have good performance (Table 5).
Going back to Figure 7, it is possible to observe that only some days of the month show MAE values appreciably higher than the rest of the month. In fact, looking at the monthly mode values of the MAE metric reported in Table 6 for the four months analyzed, it is evident that the mode values are appreciably lower than the mean values (see Table 4).
In order to understand the reason why high error values appear in some specific days (see Figure 7), the solar irradiance on such days has been further analyzed.
As an example, Figure 8a reports, with reference to the 22nd of October (that is, one of the days characterized by a high error value), the measured solar irradiance, R22,10, the forecasted solar irradiance, R ^ 22 , 10 , the expected clear sky theoretical solar irradiance, SLJ22,10, and the product of the factor, k22,10 by SW22,10 versus the time. It is evident that the daily radiation is lower than that expected from the historical data available for the specific site (k22,10 is lower than 1); furthermore, 2/3 of daylight hours are characterized by very low irradiance values while the remaining 1/3 of the time is characterized by high values. So this particular day is characterized by a very strong correlation between the k and ε values. This highlights an intrinsic limitation of purely statistical forecasting, which is able to follow the average behavior of the solar irradiance but randomly generates statistically independent epsilon values through the whole daylight period.
It is worth noting that the behavior of the analyzed 22nd of October helps us understand the reason why the values of the monthly modes are lower than those of the monthly means (Table 4). In fact, looking at Figure 7, it is possible to note that the majority of the days for all months are characterized by low error (resulting in a low value of the monthly mode) while only some days of all the months are characterized by very high error, resulting in an increased value of the monthly mean.
Results with a sensibly higher level of accuracy were obtained by applying the procedures described in Section 5, which is able to take into account weather predictions.
Figure 8b reports the results referring to the 22nd of October obtained from the study of weather predictions able to predict the two-time behavior of the day, with a level of approximation of k ˜ of ±10%.
The figure gives an immediate visual idea of the improvement in forecasting solar irradiance through the hours of the day; the corresponding MAE22,10 results reduced to 10% from the initial value 47% of Figure 8d). As for the other two metrics introduced, MBE22,10 remains almost constant around 5% while RMSE22,10 results reduced to 13% from 58%. By analysis of the figure, the strength of the procedures shown in Section 5 clearly appears to be predicting the different behavior during the day, not only in terms of radiation level but also in terms of variability of instantaneous solar irradiance. The first part of the day, in fact, is characterized by low values of mean solar irradiance (low k ˜ 1,22,10, see Equation (16)) and a significant variability value. The second part of the day, instead, is characterized by a regular profile of the solar irradiance with a high value of mean solar irradiance (high k ˜ 2,22,10 , see Equation (16)). This behavior is quite well captured by the forecasting procedure, of which the accuracy is evident, especially in case of accurate weather predictions.
This shows the difference between a purely statistical method (without taking into account any weather predictions) and the procedure completed with information on weather conditions. The former seems to be a practical solution for simulating synthetic time series in the long run (e.g., for isolated microgrids design purposes). The last seems to be a practical solution especially for those applications where information on instantaneous variations of solar irradiance is required (e.g., for day-ahead forecasts).
For comparative purposes, the results of the proposed model have been compared with the evaluations performed in [24], which refer to a benchmarking exercise organized within the framework of the European project “WIRE” [25] with the purpose of evaluating the performance of state-of-the-art models for short-term renewable energy forecasting. More specifically, 10 different solar power forecasting methods were applied to historical data for 2010 and 2011 and with reference to the suburbs of the city of Milan in Northern Italy and to the suburban area of Catania in Southern Italy. The location analyzed in this paper (Portici, Italy) is geographically situated between these two realities. Hence, though based on different data, this non-customized comparison can give an idea of the performance of the proposed model. For the evaluation in [24], among the metrics adopted, the MAE, normalized by the mean power (MP) measured during the test period, was reported. Table 7 reports the forecasting methods used by the participants in the project [25] and their error ranges in terms of MAE (%MP) for Milano (IT) and Catania (IT). Error ranges refer to minimum and maximum values regardless of the location of the best scores reported in Figure 7 and Figure 8 of [24]. Mean normalized MAE (%MR) for the methods reported in Table 4 (evaluated for the city of Portici, IT) are reported in Table 8. The commensurateness of the proposed method with all of the analyzed approaches clearly appears.

7. Conclusions

Radiation forecasting, accounting for daily and instantaneous variability, was pursued by means of a new bi-parametric model that builds on a model previously proposed by the same authors. The statistical model is developed with direct reference to the Liu-Jordan clear sky theoretical expression but is not bound by specific clear sky models; it accounts separately for the mean daily variability and for the variation of solar irradiance during the day by means of two corrective parameters. This new proposal allows for a better understanding of the physical phenomena and improves the effectiveness of statistical characterization and subsequent simulation of the introduced parameters to generate a synthetic solar irradiance time series. Furthermore, the analysis of the parametric distributions that best fit the experimental distributions of the two-parameter data was developed by obtaining opportune parametric distributions or mixtures of more than one distribution. Finally, the model was further improved by including weather prediction information in the simulation and forecasting stage, thereby overcoming the limitations of a purely statistical approach.
The main outcomes are:
-
The introduction of single parametric distributions and of mixtures of parametric distributions seems to offer, with reference to specific geographical areas, general models easy to handle in both data acquisition and subsequent simulation stages.
-
The model is suitable for inclusion of weather prediction in the solar radiation forecast stage.

Acknowledgments

The author gratefully acknowledges financial support from Italian Ministry of University and Research under Grant PON03PE 00178 1.

Author Contributions

The authors contributed equally to this work.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

m = 1 , 2 , , 12 month of the year
j = 1 , 2 , , N m day of the mth month
N m number of days of the mth month
N n number of samples in a day
N s , j number of samples of the day with solar irradiance different from zero
S LJ j , m ( n t ) Liu-Jordan theoretical solar irradiance at ground level at time n t of the jth day of the mth month (W/m2)
W LJ m average daily solar radiation (or solar irradiation) of the theoretical Liu-Jordan model during the mth month (Wh/m2)
S W j , m ( n t ) expected irradiance at ground level based on historical data at time n t of the jth day of the mth month (W/m2)
C m monthly clear sky index for the mth month based on historical data
W o m expected daily solar radiation during the mth month (Wh/m2)
R j , m ( n t ) measured solar irradiance at time n t of the jth day of the mth month (W/m2)
k j , m ratio between the average daily R j , m and the average daily S W j , m
ε j , m ( n t ) absolute deviation of R j , m ( n t ) from S LJ j , m ( n t ) * C m * k j , m
ε j , m * ( n t ) deviation of R j , m ( n t ) from S LJ j , m ( n t ) * C m * k j , m divided by S LJ j , m ( n t ) (W/m2)
ε j , m * ( n t ) deviation of R j , m ( n t ) from S LJ j , m ( n t ) * C m * k j , m divided by C m * S LJ j , m ( n t ) (W/m2)
K m random variable whose determinations are k j , m
E m random variable whose determinations are ε j , m ( n t )
E m * random variable whose determinations are ε j , m * ( n t )
R ^ j , m ( n t ) simulated solar irradiance at time n t of the jth day of the mth month (W/m2)

References

  1. Remund, J.; Perez, R.; Lorenz, E. Comparison of solar radiation forecasts for the USA. In Proceedings of the 23rd European Photovoltaic Solar Energy Conference, Valencia, Spain, 1–5 September 2008.
  2. Pedro, H.T.C.; Coimbra, C.F.M. Assessment of forecasting techniques for solar power production with no exogenous inputs. Sol. Energy 2012, 86, 2017–2028. [Google Scholar] [CrossRef]
  3. Tovar, J.; Olmo, F.J.; Alados-Arboledas, L. One-minute global irradiance probability density distributions conditioned to the optical air mass. Sol. Energy 1998, 62, 387–393. [Google Scholar] [CrossRef]
  4. Tovar, J.; Olmo, F.J.; Batlles, F.J.; Alados-Arboledas, L. Dependence of one-minute global irradiance probability density distributions on hourly irradiation. Energy 2001, 26, 659–668. [Google Scholar] [CrossRef]
  5. Sim, M.L.; Kung, F.W.L.; Ghassemlooy, Z. Statistical analysis and modelling of one-minute global solar irradiance for a tropical country. In Proceedings of the 2nd International Symposium on Environment Friendly Energies and Applications (EFEA), Newcastle upon Tyne, UK, 25–27 June 2012; pp. 243–248.
  6. Jurado, M.; Caridad, J.M.; Ruiz, V. Statistical distribution of the clearness index with radiation data integrated over five minute intervals. Sol. Energy 1995, 55, 469–473. [Google Scholar] [CrossRef]
  7. Ibáñez, M.; Rosell, J.I.; Beckman, W.A. A bi-variable probability density function for the daily clearness index. Sol. Energy 2003, 75, 73–80. [Google Scholar] [CrossRef]
  8. Ibañez, M.; Beckman, W.A.; Klein, S.A. Frequency Distributions for Hourly and Daily Clearness Indices. J. Sol. Energy Eng. 2001, 124, 28–33. [Google Scholar] [CrossRef]
  9. Skartveit, A.; Olseth, J.A. The probability density and autocorrelation of short term global and beam irradiance. Sol. Energy 1992, 49, 447–487. [Google Scholar] [CrossRef]
  10. Woyte, A.; Belmans, R.; Nijs, J. Fluctuations in instantaneous clearness index: Analysis and statistics. Sol. Energy 2007, 81, 195–206. [Google Scholar] [CrossRef]
  11. Suehrcke, H.; McCormick, P.G. The Frequency Distribution of Instantaneous Insolation Values. Sol. Energy 1988, 40, 413–422. [Google Scholar] [CrossRef]
  12. Hollands, K.G.T.; Suehrcke, H. A three-state model for the probability distribution of instantaneous solar radiation, with applications. Sol. Energy 2013, 96, 103–112. [Google Scholar] [CrossRef]
  13. Widén, J.; Carpman, N.; Castellucci, V.; Lingfors, D.; Olauson, J.; Remouit, F.; Bergkvist, M.; Grabbe, M.; Waters, R. Variability assessment and forecasting of renewables: A review for solar, wind, wave and tidal resources. Renew. Sustain. Energy Rev. 2015, 44, 356–375. [Google Scholar] [CrossRef]
  14. Hoff, T.E.; Perez, R. Modeling PV fleet output variability. Sol. Energy 2012, 86, 2177–2189. [Google Scholar] [CrossRef]
  15. Perez, R.; Hoff, T.E. Solar Resource Variability. In Solar Energy Forecasting and Resource Assessment; Elsevier: Philadelphia, PA, USA, 2013; pp. 133–148. [Google Scholar]
  16. David, M.; Andriamasomanana, F.H.R.; Liandrat, O. Spatial and Temporal Variability of PV Output in an Insular Grid: Case of Reunion Island. Energy Procedia 2014, 57, 1275–1282. [Google Scholar] [CrossRef]
  17. Engerer, N.A.; Mills, F.P. KPV: A clear-sky index for photovoltaics. Sol. Energy 2014, 105, 679–693. [Google Scholar] [CrossRef]
  18. Langella, R.; Proto, D.; Testa, A. A statistical model of solar radiation daily variability. In Proceedings of the International Conference on Probabilistic Methods Applied to Power Systems (PMAPS), Durham, UK, 7–10 July 2014.
  19. ENEA Fonti Rinnovabili: Atlante Italiano della Radiazione Solare. Available online: http://www.solaritaly.enea.it/CalcComune/Calcola.php (accessed on 10 March 2016).
  20. Liu, B.Y.H.; Jordan, R.C. The interrelationship and characteristics distribution of direct, diffuse, and total solar radiation. Sol. Energy 1960, 4, 1–19. [Google Scholar] [CrossRef]
  21. Liu, B.Y.H.; Jordan, R.C. A rational procedure for predicting the long term average performance of flat plate solar energy collectors. Sol. Energy 1965, 7, 53–74. [Google Scholar] [CrossRef]
  22. European Solar Radiation Atlas. Available online: http://www.soda-is.com/esra/reference.html (accessed on 10 March 2016).
  23. Jolliffe, I.T.; Stephenson, D.B. Forecast Verification: A Practitioner’s Guide in Atmospheric Science; John Wiley & Sons, Ltd.: London, UK, 2011. [Google Scholar]
  24. Sperati, S.; Alessandrini, S.; Pinson, P.; Kariniotakis, G. The “Weather Intelligence for Renewable Energies” Benchmarking Exercise on Short-Term Forecasting of Wind and Solar Power Generation. Energies 2015, 8, 9594–9619. [Google Scholar] [CrossRef] [Green Version]
  25. COST Action ES1002, WIRE: Weather Intelligence for Renewable Energies Official Website. Available online: http://www.wire1002.ch/ (accessed on 10 March 2016).
Figure 1. Profile of solar irradiance and its components, according to Equation (6), on (a) 11 August 2004; and (b) 12 August 2004.
Figure 1. Profile of solar irradiance and its components, according to Equation (6), on (a) 11 August 2004; and (b) 12 August 2004.
Energies 09 00200 g001
Figure 2. Flow chart of the “Statistical Analysis” stage.
Figure 2. Flow chart of the “Statistical Analysis” stage.
Energies 09 00200 g002
Figure 3. Flow chart of the “Simulation of solar irradiance” stage.
Figure 3. Flow chart of the “Simulation of solar irradiance” stage.
Energies 09 00200 g003
Figure 4. Flow chart of the “Forecasting of solar irradiance including weather predictions” stage.
Figure 4. Flow chart of the “Forecasting of solar irradiance including weather predictions” stage.
Energies 09 00200 g004
Figure 5. Discrete probability distributions of K m and their fitting probability density functions (pdfs) with reference to the months of (a) January; (b) April; (c) July; and (d) October.
Figure 5. Discrete probability distributions of K m and their fitting probability density functions (pdfs) with reference to the months of (a) January; (b) April; (c) July; and (d) October.
Energies 09 00200 g005aEnergies 09 00200 g005b
Figure 6. Discrete probability distributions of E m * , and their fitting pdfs with reference to the months of (a) January; (b) April; (c) July; and (d) October.
Figure 6. Discrete probability distributions of E m * , and their fitting pdfs with reference to the months of (a) January; (b) April; (c) July; and (d) October.
Energies 09 00200 g006
Figure 7. MAEj,m values versus the day of the month for the two different forecast methods: EXPERIMENTAL and PARAMETRIC. (a) January, (b) April, (c) July, and (d) October.
Figure 7. MAEj,m values versus the day of the month for the two different forecast methods: EXPERIMENTAL and PARAMETRIC. (a) January, (b) April, (c) July, and (d) October.
Energies 09 00200 g007aEnergies 09 00200 g007b
Figure 8. 22nd October: Measured solar irradiance, R22,10, forecasted solar irradiance, R ^ 22 , 10 , clear sky theoretical solar irradiance, SLJ22,10, and product of the factor , k22,10 by SW22,10 versus the time: (a) pure statistical forecasting; and (b) forecasting accounting for weather predictions.
Figure 8. 22nd October: Measured solar irradiance, R22,10, forecasted solar irradiance, R ^ 22 , 10 , clear sky theoretical solar irradiance, SLJ22,10, and product of the factor , k22,10 by SW22,10 versus the time: (a) pure statistical forecasting; and (b) forecasting accounting for weather predictions.
Energies 09 00200 g008
Table 1. Density functions of the distributions chosen for the analysis.
Table 1. Density functions of the distributions chosen for the analysis.
NameDensity FunctionParameters
Weibull y = p 2   p 1 p 2   x p 2 1   e ( x p 1 ) p 2   x [ 0 , ) p1 scale parameter
p2 shape parameter
Gaussian y = 1 p 2 2 π   e ( x p 1 ) 2 2 p 2 2   x ( , + ) p1 mean
p2 standard deviation
Uniform y = 1 p 2 p 1   x [ p 1 , p 2 ] p1 lower endpoint (minimum)
p2 upper endpoint (maximum)
Table 2. Mixtures of distributions resulting from the analysis of K m (p1, p2, p1 and p2 are defined in Table 1 and w1 and w2 in Equation (11)). Root mean square error: RMSE.
Table 2. Mixtures of distributions resulting from the analysis of K m (p1, p2, p1 and p2 are defined in Table 1 and w1 and w2 in Equation (11)). Root mean square error: RMSE.
Month1st Distributionp′1p′2Weight (w1)2st Distributionp″1p″2Weight (w2)Norm. RMSE Mean (%)
JanuaryUniform0.09771.3710.7800Gaussian1.3100.06880.22005.7%
FebruaryUniform0.00071.3910.7572Gaussian1.3080.08320.24284.3%
MarchGaussian0.62190.31220.4671Weibull1.25569.7580.53294.8%
AprilGaussian0.82790.29430.5740Gaussian1.2830.06520.42606.6%
MayGaussian0.90530.23870.5626Gaussian1.2380.04130.43747.7%
JuneWeibull0.91275.1530.3136Weibull1.17325.810.68648.5%
JulyWeibull1.14123.330.9119Gaussian0.85530.20410.08819.7%
AugustGaussian0.86640.19140.2526Gaussian1.1590.03990.74748.8%
SeptemberWeibull 0.89783.9420.3323Gaussian1.2130.05870.66775.1%
OctoberWeibull0.89383.6060.4896Gaussian1.2150.06640.51046.9%
NovemberGaussian0.66050.29990.6448Gaussian1.2770.08040.35525.6%
DecemberWeibull0.73762.1870.5712Weibull1.32214.170.42884.1%
Table 3. Parameters of the t location-scale distribution resulting from the analysis of E m * .
Table 3. Parameters of the t location-scale distribution resulting from the analysis of E m * .
Montht Location-Scale Parameters
Location (p1)Scale (p2)Degree of Freedom (p3)
January−0.001816720.1397262.29907
February−0.003677670.1520672.82915
March−0.002599970.1678673.68524
April−0.00594130.1598163.43902
May0.002581890.1443552.82872
June0.01207930.08477741.42587
July0.01376820.04482221.06117
August0.009110460.04837261.11224
September0.009578220.08365221.36912
October0.0005834340.1248822.08324
November−0.007605120.1495132.27951
December−0.008824190.1440972.15243
Table 4. Monthly mean values of MAEj,m, MBEj,m and of RMSEj,m for the months in Figure 7.
Table 4. Monthly mean values of MAEj,m, MBEj,m and of RMSEj,m for the months in Figure 7.
MonthEXPERIMENTALPARAMETRICPERSISTENCE
MAE (%MR)MBE (%MR)RMSE (%MR)MAE (%MR)MBE (%MR)RMSE (%MR)MAE (%MR)MBE (%MR)RMSE (%MR)
January2803628035722994
April35−14435045682190
July12−1161201515221
October21−12722−12738952
Table 5. Skill Scores, referring to the Persistence method, of the monthly mean of MAEj,m, and of RMSEj,m for the months in Figure 7.
Table 5. Skill Scores, referring to the Persistence method, of the monthly mean of MAEj,m, and of RMSEj,m for the months in Figure 7.
MonthEXPERIMENTALPARAMETRIC
SSMAE (pu)SSRMSE (pu)SSMAE (pu)SSRMSE (pu)
January0.60.60.60.6
April0.50.50.50.5
July0.20.20.20.3
October0.40.40.50.5
Table 6. Monthly mode values of MAEj,m for the months in Figure 7.
Table 6. Monthly mode values of MAEj,m for the months in Figure 7.
MonthEXPERIMENTALPARAMETRIC
MAE (%MR)MAE (%MR)
January68
April88
July66
October88
Table 7. Normalized MAE (%MP) ranges for the methods applied in [24] to Milano (IT) and Catania (IT).
Table 7. Normalized MAE (%MP) ranges for the methods applied in [24] to Milano (IT) and Catania (IT).
idMethodNormalized MAE (%MP) Range
Min.Max.
1WIRE model data + linear regression (random forest), (Milano and Catania)20%32%
2Own meteorological model + output correction using tendency of past production (Milano and Catania)23%40%
3GFS + Model Output Statistics + conversion to power (Milano) WIRE data + conversion to power (Catania)29%38%
4WIRE data + Support Vector Machines (Milano and Catania)19%65%
5Linear regression between GHI and solar power (Milano and Catania)27%58%
6WIRE data + ANN (Multilayer Perceptron with Standard Back Propagation and Logistic Functions), (Milano and Catania)42%56%
7WIRE data + quantile regression to estimate clear sky production, irradiation and medium temperature + linear regression to explain the rate of clear sky production observed (Milano and Catania)15%30%
8WIRE data + linear regression model (Milano and Catania)23%42%
9Combination of WIRE data and WRF ARW model version 2.2.1 using initial and boundary conditions from NCEP GFS + Gaussian Generalized Linear Model (Catania)17%-
Table 8. Mean normalized MAE (%MR) for the methods reported in Table 4 applied to Portici (IT).
Table 8. Mean normalized MAE (%MR) for the methods reported in Table 4 applied to Portici (IT).
MethodNormalized MAE (%MR)
EXPERIMENTAL24%
PARAMETRIC25%
PERSISTENCE48%

Share and Cite

MDPI and ACS Style

Langella, R.; Proto, D.; Testa, A. Solar Radiation Forecasting, Accounting for Daily Variability. Energies 2016, 9, 200. https://doi.org/10.3390/en9030200

AMA Style

Langella R, Proto D, Testa A. Solar Radiation Forecasting, Accounting for Daily Variability. Energies. 2016; 9(3):200. https://doi.org/10.3390/en9030200

Chicago/Turabian Style

Langella, Roberto, Daniela Proto, and Alfredo Testa. 2016. "Solar Radiation Forecasting, Accounting for Daily Variability" Energies 9, no. 3: 200. https://doi.org/10.3390/en9030200

APA Style

Langella, R., Proto, D., & Testa, A. (2016). Solar Radiation Forecasting, Accounting for Daily Variability. Energies, 9(3), 200. https://doi.org/10.3390/en9030200

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop