1. Introduction
Given the increasingly serious problems of fossil energy shortage and greenhouse gas emissions, the need for sustainable and low-carbon energy technologies is on the rise [
1]. As one of the most promising emerging renewable energy technologies, solar photovoltaic (PV) power generation has developed faster than anticipated [
2]. The global installed PV capacity has grown from 0.96 GW in 1998 to 227 GW in 2015 [
3]. There is no doubt that PV industry will play a more significant role in the future energy production. In particular, China has become the leader in the PV module manufacturing since 2009. At the end of 2013, China accounted for 67% of the total PV production [
4]. According to the International Energy Agency (IEA), China leads the world in cumulative installed PV capacity, with over 110 GW by 2015, and solar power could potentially provide one-third of the global energy demand after 2060 [
5,
6].
However, there are still several problems that need to be solved due to the inherent properties of solar energy, such as its variability and uncertainty, particularly the sharp ramps under specific meteorological events and different outputs under variable weather conditions [
7]. In order to maintain a consistent balance of supply and demand, power system operation is generally based on an understanding of the random variations in the load and the controllable and dispatchable conventional stand-by power generation plants [
8]. Therefore, the predictability of the solar PV power generation and its accuracy are very important, both for the transmission and distribution sides. For the transmission side, non-dispatchable central large-scale solar PV plants with tempestuous and fast variable output may cause great issues to the active power balance and economic operation of regional grids. From the distribution side, roof-top PV, building integrated photovoltaic (BIPV) and other small-scale PV systems can be equivalent to negative electricity demand during the daytime with solar irradiance, which significantly reshape the traditional load curves by providing electricity directly to the load behind the meter. This effect will result in difficulties in load forecasting under different weather conditions, on which the dispatch operation depends [
9]. Numerical weather prediction (NWP)-based multiple temporal and spatial scales solar PV power forecasting is a good measure to facilitate more economic decisions for the dispatch operation of power system. With this knowledge, the solar PV generation can be added into the grid operation more precisely like the economic dispatch considering the uncertainty of deep penetration of wind power, which will make it possible to schedule and adjust the dispatchable generators’ output to coordinate the fluctuation of solar PV plants as well as the wind farms and the electricity load from users’ demand [
10].
In addition to the material of PV modules, the output power produced by PV modules (
P) mainly depends on the amount of surface solar radiance flux on module plane (
GT) and the PV cell temperature (
Tc) [
11], which can be approximately analyzed using Equation (1) [
6]:
where
P is PV power output, η
r is the reference module efficiency,
S is the aperture surface area of PV module, β is temperature coefficients of PV modules, γ is solar irradiance coefficient of PV modules,
Tc and
T0 are PV cell temperature and reference values of temperature respectively, and finally,
GT is surface solar radiance flux on module plane.
The γ, solar irradiance coefficient of PV modules, usually can be neglected because of its small value. Then Equation (1) is rewritten as Equation (2):
Reference [
12] indicates that the temperature is almost uniform on the panel. Thus, an average PV module temperature (
Tm) is used to replace the PV cell temperature. Therefore, Equation (2) can be rewritten as Equation (3):
The above formulation clearly demonstrates the impact of
Tm on the total power production with the given value of other parameters including
GT. Previous studies have shown a reduction in electrical efficiency for increase in
Tm exceeding a certain limit [
13]. As a result of this relationship,
Tm is now regularly included in PV power prediction models [
14,
15,
16]. This sufficiently highlights the importance of
Tm as a key factor in modeling and assessing the performance of PV modules.
As a characteristic parameter of PV module, the PV module temperature is influenced by many impact factors, and the primary drivers of
Tm have been determined to be ambient temperature (
Ta), solar irradiance (
GT), wind speed (
VWS) and relative humidity, etc. [
17,
18,
19], which reflect the complex ongoing energy balance and heat transfer processes occurring in PV module environments [
20]. Reference [
11] preliminarily discussed the heat transfer and energy balance of a PV module. Besides being a function of the weather variables, PV module temperature also depends on PV material, parameters and module encapsulation materials [
21]. To analyze meteorological impact factors (MIFs) and material or system-dependent properties for PV module temperature, reference [
22] summarized a number of formulas of physical expression of
Tm. Considering exchange of PV module temperature, apart from the heat transfer and energy balance, power conversion efficiency of PV module affecting by PV effect must take into account. Different weather classifications, namely, sunny, cloudy, or shower as well as heavy rain and so on, could lead to different heat dissipation conditions, which also obviously affect heat exchange between the PV module and the external environment, affecting the PV module temperature [
23].
The concern of this research is how to compute the quantitative metrics for the specific impacts of MIFs on PV module temperature. Although the prior research and existing models illustrate basic knowledge about MIFs of PV module temperature, however, most of them only focus on the calculation of PV module temperature using related meteorological parameters, or observation from curves of actual data and experimental proof through qualitative analysis from the theoretical and empirical perspective based on thermodynamic and electrical theories, which hardly provided any corresponding information with respect to the quantitative impact of MIFs on PV module temperature and other MIFs. Therefore, here we introduce two mathematical methods to quantitatively describe the correlative relations between different MIFs and PV module temperature associate with the quality analysis based on heat transfer theory. The results of this research can help classification modeling to consider the specific influences of multiple MIFs more clearly and precisely under different weather conditions.
References [
24,
25] adopted autocorrelation (AC) for the tasks of data preprocessing before forecasting. As a measurement of correlation between feature-to-output variables, correlation-based selection (CFS) is more suitable for identifying relevancy between variables. CFS based on correlation coefficient analysis possesses the ability that can accurately capture the main features of the relationships and express the measurement of each variable influence on the relationship [
26]. However, CFS is only able to detect linear correlations. In other words, it will not work well while extracting the nonlinear relations between variables in many real applications. Fortunately, mutual information (MI) theory has been widely used to explore nonlinear correlations between multiple variables [
27,
28,
29] in these cases. MI is utilized to extract the most informative feature with a maximum relevancy and minimum redundancy for wind power forecasting [
30]. As the impact factors of other forecasting objections, the impact factors of PV module temperature also are complex and may bring more redundancy to the results because of its tight coupling relations within each other.
The rest of this paper is organized as follows.
Section 2 analyzes the electrical and thermal processes of solar PV cells including PV effect, energy balance and heat transfer from the perspective of physics to determine the primary MIFs of solar PV module temperature.
Section 3 introduces the mathematical foundation of this research.
Section 4 is the case study using actual data of a grid-connect PV plant to illustrate the quantitative analyses based on CFS and MI on the specific influence degree of the determined impact factors under the cases of four different weather statuses. Finally, conclusions were drawn in
Section 5.
2. Physical Description of Photovoltaic Module Temperature
The impacts on PV module temperature include internal and external aspects [
31]. The internal aspect refers to the PV module physical characteristics related factors including material category, parameters and system-dependent properties [
32], which are fixed and unique to those individual PV plants that already put into operation. The external aspect mostly refers to those meteorological factors that impact the PV module temperature during its operating duration. In particular, the heat transfer process and thermal energy balance caused by radiation and convection [
33], which is directly related to the real-time environmental conditions of different weather statuses should be illustrated at first.
Equations (4)–(20) are utilized to express the overall module efficiency. The energy balance of PV module can be divided into thermal and electrical performance [
34]. We begin with the energy balance analysis from Equation (4) [
23]:
where τ is the transmittance of the cover system for irradiance, τ
GT is the part of
GT crossing the glass, α is the absorption coefficient of PV cells, ατ
GT is the part of
GT absorbed by PV modules, η is conversion efficiency of PV module,
QS respects the thermal energy losses through radiation and convection heat transfer from modules to surrounding [
35].
The schematic of the heat transfer process of the PV cell is shown in
Figure 1, where
Tg and
Ts, are the temperature of ground and the temperature of sky, respectively. Here,
Tg and
Ts are all assumed to equal to
Ta.
According to Newton’s law of cooling, the convective heat and irradiative transfer exchange from a surface to the surrounding fluid can be expressed in Equation (5):
where
h is heat transfer coefficient,
hr and
hc are heat transfer coefficient of radiation and heat transfer coefficient of convection, respectively. They are calculated according to Equations (6) and (7):
where ε is emissivity of materials, σ is Stefan-Boltzmann constant,
d is air thermal conductivity,
l is board length and
Nu is Nusselt number.
For free cooling, if predominantly laminar flow is assumed, an approximation of
Nu given by Holman can be expressed as Equation (8) [
35]:
If it is the turbulent flow, the formula of
Nu can be expressed as Equation (9) [
35]:
where
Pr is Prandtl number, and
Re is the Reynolds number, which is used to characterize the flow of fluid, it is defined by Equation (10):
where
VWS is the wind speed and ν is kinematic viscosity.
If we substitute Equations (6)–(10) into Equation (5),
QS will be obtained as Equation (11) in the situation of laminar flow:
or as Equation (12) in the situation of turbulent flow:
Referring back to Equation (4),
QS can be obtained through the above and we now turn to the module efficiency. Here we adopt the method given by Notton in 2005 [
12]:
where η
r is the reference value of PV module efficiency.
Finally, on the basis of
σ being much smaller than
Re,
Tm is obtained in situation of laminar flow as Equation (14):
or in situation of turbulent flow as Equation (15):
From Equations (14) and (15), there are 11 impact factors of Tm:
(1) Five material/system-dependent factors: l, α, τ, ηr, and β.
Except for the structure parameter, l, the remaining four factors, α, τ, ηr, and β are performance parameters of the PV module, which can be obtained from the specifications of the PV module. These PV module parameters, which are constants for an already built PV plants, are dependent on the PV module technologies and encapsulation materials.
Three performance parameters of PV module can be decided by PV module technologies: α, η
r, and β. PV power plants usually use different PV module technologies, such as monocrystalline silicon (mc-Si), poly-silicon (p-Si), amorphous silicon (a-Si), and other thin film technologies such as copper indium diselenide (CIS), etc. [
36]. In general, due to the advantages of crystalline silicon in the balance of energy conversion and the cost, the PV power plants applying this technology account for the largest proportion around the world, and dominate the PV market with around 90% share in 2014 [
37,
38]. When we refer to crystalline silicon, p-Si PV modules are normally cheaper, while mc-Si PV modules are more efficient, which means lager values of α and η
r.
The encapsulation materials are key characteristics of PV modules to determine the transmittance, τ, which is important in both the immediate and long-term power production of modules. Appropriate encapsulation materials can improve the optical flux transmittance, as well as protect the PV cell from the surroundings. The materials include ethylene vinyl acetate (EVA), polyvinyl butyral (PVB), poly dimethyl siloxane (PDMS), polyolefins, ionomers, and thermoplastic polyurethane (TPU) and so on [
39]. Many encapsulation materials were found to discolor, with the resulting reduction in transmittance compromising PV module performance, so the transmittance of a PV module is influenced by thickness and durability of the encapsulation materials. However, there is no evidence for different encapsulation materials exhibiting much more different influence on transmittance in terms of same technology [
39].
(2) Three surrounding-dependent factors: ν, Pr, k.
These three impact factors depend on the surroundings, especially ambient temperature. According to “Properties of Air at Atmospheric Pressure” in Heat Transfer written by Holman ([
35], p. 643), the fitting relations between three factors and ambient temperature can be calculated as follows:
It is obvious that at atmospheric pressure, the changes of the three factors only rely on ambient temperature, which also enhances the effect of ambient temperature on PV module temperature.
(3) Three MIFs: Ta, GT, and VWS.
Due to the characteristics of the above nine impact factors, when it comes to an already built PV plants, the changes of PV module temperature mainly rely on three MIFs: ambient temperature, solar irradiance and wind speed. According to Equations (14) and (15), the effects of ambient temperature and solar irradiance are almost proportional to the PV module temperature, while the effect of wind speed is in the form of fractional exponent power, which means the effect of wind speed is qualitatively weaker than the former two. Due to the fact the surroundings-dependent factors only rely on ambient temperature, from the angle of qualitative analysis, ambient temperature should be the most influential factor on PV module temperature, followed by solar irradiance, and wind speed is the weakest one.
The PV module temperature mainly relies on MIFs, which means the PV module temperature process will be different when the regulations of MIFs change. It is apparent that there are varying weather conditions, such as sunny days, cloudy days and so on [
40], whose impacts on the PV module temperature can be grouped in two aspects: heat dissipation conditions and power generation performance. For example, on sunny days, solar irradiance and ambient temperature are stronger than on cloudy days, which will reduce the heat dissipation but enhance power generation, and finally, increase the PV module temperature. Thus, when we analyze the PV module temperature, the weather conditions should be classified to several types based on the distinction of three MIFs.
On the other hand, ambient temperature, solar irradiance and wind speed as MIFs of PV module temperature, are affected by each other. From a meteorological view, solar irradiance, as the main factor measuring the solar power, has a dominant impact on the ambient temperature changes on long-term time scale conditions or between different seasons other than a single day. It is the complexity and tight coupling between MIFs, which will bring redundancy to the researches and make quantitative analyses inaccurate, so mathematical methods relating PV module temperature and MIFs require further research.
4. Quantitative Degree of Influence of Meteorological Impact Factors on Photovoltaic Module Temperature
4.1. Data
The dataset used to carry out this research, covering the time range from January 2012 to December 2012, comes from a 500 kW grid-connected PV plant in China connecting the grid through a voltage level of 6 kV, which includes the variables of solar irradiance (
GT), ambient temperature (
Ta), and wind speed (
VWS) with the time interval of 30 min. The annual total available records cover 310 days during the whole year. The manufacturer of the PV module installed in this plant is JinKo Solar Company (Shangrao, Jiangxi, China) and the assembly model is JKMS300P-72 adopting p-Si technology, which can be seen in
Figure 3. The total number of PV cells in each module is 72 (6 × 12). The size of each cell is 156 mm × 156 mm. The parameters of the JKM300P-72 PV module are listed in
Table 1, where
NOCT means normal operating cell temperature. The models and parameters of the sensors deployed in this plant are listed in
Table 2.
The analysis work is conducted only during the daytime period with sunlight because the purpose of this research is to try to help improve the solar PV power forecasting through making clear the influence of MIFs on PV module temperature. More importantly, due to heat transfer, PV module temperature is appropriately the same as the ambient temperature, which means the MIFs are different during the day and night. Therefore, although the temperature data is continuous in a whole day, we select the data from 7:00 a.m. to 5:00 p.m. to conduct the mathematical analysis, i.e., 20 data points per day are selected for the case study.
As mentioned above in
Section 1 and
Section 2 of this paper, weather conditions have significant impacts on the heat dissipation conditions of PV modules. “GB/T 22164-2008 Public Climate Service—Weather Graphic Symbols” released by China Meteorological Administration defined 33 types of weather status [
47]. In order to balance the accuracy and complexity of the analysis of PV module temperature, the numbers of weather types should be reasonable and the summative weather statuses should be typical and representative. Summarizing the most vital characteristics of all climates, the weather statuses can generally be divided into four typical different classes: sunny day, cloudy day, shower day and heavy rainy day [
32]. Furthermore, the other weather statuses are assigned to one of these four classes according to the degree of closeness described by a correlation coefficient, and then four general weather classes (GWCs) named A, B, C, and D are constituted [
40]. After removing the invalid data, the distribution of four different GWCs are 24, 119, 148, and 19 days, respectively.
Figure 4 is the actual data of PV module temperature and its MIFs on a certain day under four GWCs. The dates selected to represent the rules of four GWCs are 10 June, 18 June, 24 June, and 6 June respectively. It is shown that the changes of PV module temperature are the common interaction among all three MIFs. For example, in weather class A, although the solar irradiance is much stronger, the ambient temperature is lower and the wind speed is bigger than in other classes, so the PV module temperature is not very high.
Furthermore, we take the averages of the PV module temperature and its MIFs on each day to draw
Figure 5 to make further observation on the correlation relations.
Figure 6 is the actual data of PV module temperature in the whole year under four GWCs. The PV module temperature data show a strip distribution, which is caused by the temperature difference of about 20 °C per day, but as a whole, it is in accord with the seasonal characteristics.
4.2. Quantitative Correlation Analysis by Correlation-Based Feature Selection
Figure 5 shows that the positive relation between ambient temperature and PV module temperature is more obvious than the other two impact factors, but whether the change trend is similar to the trend of impact degrees merits quantitative study. CFS provides a possibility for quantitative analysis for PV module temperature. The quantitative relations in four GWCs between MIFs and PV module temperature measured by correlation coefficients are shown in
Table 3.
Furthermore, the ratios of quantitative influence degrees of these three MIFs under four GWCs measured by correlation coefficients are shown as
Table 4.
It shows that in four GWCs, the ambient temperature has the strongest correlation with PV module temperature, followed by solar irradiance, and wind speed is the weakest influential factor on PV module temperature. The quantitative ratio of influence degrees of these three MIFs measured by correlation coefficients are 50:40:10, 45:42:13, 50:38:12 and 52:29:19, respectively, under four GWCs. When we focus on only one factor, such as solar irradiance, the ratios of four GWCs are 40%, 42%, 38% and 29%, which are different in different weather classes. The reason is that the classification of weather types is based on the differences of the meteorological factors, which in turn meteorological factors perform differently in different weather classifications. For heavy rainy days (weather class D), solar irradiance is the lowest and the changes are very small, which is the reason why the effect of solar irradiance becomes less in weather class D.
The values of
rcz are calculated based on the correlation coefficients, and the results are shown in
Table 5. According to the analysis above, the correlation coefficients between ambient temperature and PV module temperature are the highest under four GWCs, so we put
Ta in the C set firstly, and add one another impact feature at one time.
Besides putting only one factor or all three factors as the study object, the combination of two variables also has the opportunity to become the most influential subset. The process of selection by
rsz is presented in
Figure 7 with the example in weather class A: firstly, determining
C = {
Ta}, secondly, adding
GT and
VWS into the
C set, comparing the
rcz values of {
Ta,
GT} and {
Ta,
VWS} with original
rcz value, and choosing the new set with increased
rcz values: {
Ta,
GT}, thirdly, adding
VWS into {
Ta,
GT}, and repeating the previous step. The final result shows that in weather class A, the combination of
GT and
Ta is the most relevant subset with
Tm.
Table 6 shows the other most relevant subsets with
Tm in four GWCs.
It is evident that Ta is the most influential factor in all kinds of weather classifications, and GT is a primary impact factor, except in weather class D, while VWS is not primarily relevant to Tm. Like in the analysis above, in weather class D, solar irradiance is lower and changes little, which makes it have little effect in this GWCs.
The result analyzed by CFS verifies the physical analysis results. However, the correlation coefficient values cannot express the degree of relevance exactly because the correlation coefficient is a linear value while the relationship between Tm and each impact factor is nonlinear. More importantly, the relevant subsets contain a certain degree of redundant information due to the coupling relation between MIFs. Thus, the nonlinear method, MI is worthy of further research.
4.3. Quantitative Correlation Analysis by Mutual Information Theory
The MI of two random variables is a measure of the mutual dependence between the two variables. More specifically, it quantifies the “amount of information” obtained about one random variable, through the other random variable. According to Equation (22), k as the number of data segments, determines the intricacy of the study. If k is larger, the data is divided more detailed, and each part of data range is smaller, which will lead the sample distribution more discrete, so in this case, the entropy information and MI are larger generally. If k is smaller, the entropy information and MI are smaller generally. Thus, what is the premise of accurate reflection of nonlinear coupling relation is a reasonable k value.
In order to achieve a scientific and reasonable
k value, accuracy and complexity need to be considered. Due to the fact there is no mathematical method to determine
k values, a method based on actual data is applied in this paper. Firstly, a range values set of
k are determined. Secondly, the tendency of the average value of
H is observed to find the reasonable
k value after the
H values of all variables are calculated. Let’s take the weather class A as an example. As shown in
Figure 8a, the
H values of four variables are calculated with
k between 13 and 32 in weather class A. The slope values of the
H values are shown in
Figure 8b, in which the tendency is more clear. It is obvious that when the
k value reaches 20, the slope values tend to be constant, which is the
k value that should be selected. As a result, the
k values in weather class A is determined as 20.
Figure 8a,b both indicates that the
H values of different variables are similar in the same weather classes, so we choose one of four variables to study
k values in the other three GWCs. The
H values of
GT in four GWCs with different
k values are shown in
Figure 8c, and
Figure 8d is the slope value of the above data. What is noteworthy is that the set of
k values in different weather classifications is different, which means that
ki reflects different values in different weather classification. The value ranges of
k value in weather class B, C and D are between 33 and 52, 37 and 56, 11 and 30, respectively. Similarly, according to our definition, the
k values in weather classes B, C and D are 40, 44 and 18, respectively.
With the identified
k values, according to Equations (21) and (23), entropy information (
H(
X)) and joint entropy (
H(
X,
Y)) can be calculated. The results are shown in
Table 7 and
Table 8.
According to Equation (28), the MI (
I(
X,
Y)) is obtained on the basic of entropy information and joint entropy.
Table 9 is the MI between three MIFs and PV module temperature. MI and correlation coefficient are two methods to measure the relevance between two variables. When we compare
Table 9 with
Table 3, the correlation coefficient values are larger than the values of MI, shown as
Figure 9.
The reason is that CFS is a linear method, which will neglect the coupling relation between MIFs, so the degree of influence of each MIF on PV module temperature is enhanced, which contains redundancy between MIFs, while MI takes the interaction with other factors into consideration, which will reduce the effect of this single impact factor. Thus, MI is more precise in quantitative research on MIFs of PV module temperature.
Table 10 shows the ratios of MI between MIFs and PV module temperature under four GWCs and the ratios of relevancy of these three MIFs measured by MI are 34:45:21, 35:50:15, 29:59:12, and 23:55:22, respectively, under four GWCs. Furthermore, the results measured by two mathematical methods are compared with each other, shown as
Figure 10.
The comparative results show several commonalities. Firstly, the monotone increasing order of the degrees of influence of the three MIFs in four GWCs is: Ta, GT, and VWS. Secondly, the relation between solar irradiance and PV module temperature (GT-Tm) measured by MI is smaller than the value measured by CFS, while the relation between wind speed and PV module temperature (VWS-Tm) is similar or larger. The decreasing ratios of solar irradiance are mainly due to the increasing redundancy between ambient temperature and solar irradiance and MI can eliminate the influence of redundancy, while CFS cannot.
Furthermore, how much redundancy existed between each MIF is also addressed. According to Equations (30) and (31), the values of II can measure the redundancy between two variables when they act on the same object, which is shown in
Table 11.
The ratio of quantitative redundancy degrees is shown in
Table 12. It is worth noticing that:
and:
The ratios of quantitative redundancy degrees between Ta and GT, Ta and VWS, GT and VWS are 61:28:11, 57:30:13, 62:26:12, and 62:33:5, respectively, under four GWCs. The redundancy between solar irradiance and ambient temperature is highest in three variables, which is caused by the significant effect of solar irradiance to ambient temperature. Wind speed also influences the ambient temperature by accelerating heat transfer, which reflects in the higher values of I(Ta; VWS; Tm). Wind speed and solar irradiance have little redundancy relation.