1. Introduction
The variation of annual performance in solar energy systems is the essential factor considered in a project’s economic feasibility [
1]. The risk evaluation of P50/P90 is mainly used to evaluate the economic feasibility of wind farm projects. Since solar irradiance is a more predictable resource than wind velocity, it can be applied to risk evaluations of photovoltaic (PV) or concentrated solar thermal power projects [
2,
3].
P50 means that the predicted solar resource/energy yield may either be exceeded or not be exceeded, with a 50% probability of either occurring. The P90 value is expected to be exceeded in 90% of the cases. Thus, P90 is less than P50. For example, in the PV power generation system, a P50 value of 30,000 kWh means the system output may exceed 30,000 kWh with a probability of 50%. Similarly, a P90 value of 30,000 kWh would mean that the system is likely to generate over 30,000 kWh 90% of the time.
Figure 1 shows the concept of P50 and P90 as a graph.
For PV system, which is widely used as a simulation program of solar energy systems [
4], annual solar irradiance is assumed to follow the normal distribution in the P50/P90 calculation [
5]. With the system advisor model (SAM) algorithm, two methods are used. Depending on whether or not the solar irradiance data distribution follows the normal distribution, P50/P90 is calculated from the cumulative distribution function (CDF) of the normal distribution if the solar irradiance data follow the normal distribution. Otherwise, P50/P90 is calculated from the empirical CDF, based on the assumption that all data occurrence probabilities are the same, by sorting the data in ascending order [
3]. Furthermore, the results of a study performed in Spain, which exhibited the characteristics of annual global horizontal irradiance (GHI) in 13 locations in the USA, Europe, etc., reported that they followed the normal distribution [
6].
However, real solar irradiance is a variable natural energy source that does not always follow the normal distribution, as shown in
Figure 2. This graph shows the probability density function (PDF) of the normal distribution and probability histogram of the GHI on Mokpo over a 27-year period (1991 to 2017). Moreover, existing studies have also reported that the actual solar irradiance follow the asymmetric distribution rather than the normal distribution [
1].
This study calculated the P50/P90 PV power potential from the skew-normal distribution, whereby skewness was applied to the normal distribution rather than assuming that the probability was the same, after sorting the data in ascending order if the solar irradiance simply did not follow the normal distribution.
In previous studies, the skew-normal distribution was found to be ideal for presenting appropriate models of real data that did not follow the normal distribution. Thus, a large number of studies on the application of the skew-normal distribution have been conducted [
7,
8,
9]. In particular, the authors of Reference [
7] improved the predictability of probabilistic solar irradiance using Bayesian model averaging (BMA), to which the skew-normal PDF was applied using the ensemble technique in Singapore. Moreover, when producing a daily solar irradiance using the CLImate GENerator (CLIGEN) model for regions in China, the model’s performance was improved by adding a skew coefficient to the normal distribution [
9]. In addition, a study on daily solar irradiance modeling was also conducted based on various distributions in a tropical climate in France and Nigeria [
10,
11,
12].
In this paper, four cities in Korea, which is located in the mid-latitude temperate zone and has four distinct seasons (spring, summer, autumn, and winter), were targeted to apply the normal distribution, skew-normal distribution, and empirical cumulative distribution, from which the P50/P90 PV power potential was calculated and the results were compared. The Jarque–Bera test was used as the goodness-of-fit test to determine whether solar irradiance data followed the normal distribution, and the AICc was used to determine which method produced a better result between normal and skew-normal distributions.
5. Conclusions
This study determined the best goodness-of-fit distribution when applying the normal, skew-normal, and empirical distributions based on the long-term solar irradiance database in four major cities in Korea, and compared the results after calculating the P50/P90 PV power potentials. In PVsyst, which has been widely used as a performance simulation program in existing solar energy systems, the normal distribution was assumed for annual solar irradiance data, and the P50/P90 values were calculated from the empirical CDF, whose data were sorted in ascending order based on the assumption that the probability of occurrence of all data was the same when the solar irradiance data did not follow the normal distribution in the existing SAM algorithm. However, this study presented a distribution, which was closer to the actual solar irradiance distribution mathematically, by applying the skew-normal distribution when the solar irradiance data did not follow the normal distribution. The Jarque–Bera test was used as the goodness-of-fit test to determine whether the solar irradiance data followed the normal distribution, while the AICc was used to determine which model produced a better quality result between the normal and skew-normal distributions.
For Seoul, the result of AICc was obtained that the solar irradiance distribution was appropriate to the normal distribution. In contrast, for Daejeon, Mokpo, and Jeju Island, the skew-normal distribution was more appropriate during May and November, and July and August. The above results show that the appropriate distribution shape may differ depending on the region and season.
Considering that the relative likelihood of the annual AICc was at least 0.3 or larger in the four regions, the quality of any single distribution model was not considered to be much better than the others. Information loss occurred upon selecting a single model, which showed that any one distribution would be appropriate for all. For the purposes of this study, only 27 samples were used, but at least 30 samples would be needed for model fitting [
25]. Thus, the larger the number of samples, the lower the uncertainty will be [
26]. Therefore, it is necessary to secure more annual solar irradiance data than the current study because this will make possible a more accurate model fitting.
The results of the comparison of the P90 and P50 values when applying three distributions showed a larger difference in the P90 values than in the P50 values. The greatest difference between distributions was obtained in Daejeon, where the difference between the normal and empirical distributions was around 4.37%, followed by Mokpo, where the difference between the skew-normal and empirical distributions was around 2.14%.
Based on the PV power potential, according to the seasonal analysis of the four cities, Jeju Island produced the lowest generation in winter, unlike the other three cities. In addition, in the previous study, when the PV power generation ramp analysis was conducted on 1450 PV power plants in Korea, the average ramp rate of Jeju Island was 11.5%, which was the most variable region in Korea [
27]. Therefore, when the PV penetration level increased, especially in Jeju Island, it was necessary to thoroughly prepare for the worst case and take complementary measures when estimating backup facility capacity and reserve.
Projects that utilize solar energy, such as photovoltaics and concentrated solar thermal power, entail a certain degree of uncertainty, with the proportion attributable to solar resource uncertainty being around 5% to 17% [
28]. This uncertainty could increase the project risk. As such, we hope that the results of this study could be used as the main data in an effort to reduce the degree of uncertainty in PV projects in Korea, where the proportion of PV power is being steadily increased by identifying the difference, although the difference was found to be just under 5% after applying a more realistic distribution to Korea, which has a continental climate.