2.1. The Antarctic Climate
The climate of the Antarctic continent is strongly influenced by astronomical factors, its geographical position, the average altitude and its ice cover. Since the Earth, during the winter season of the southern hemisphere, is at the maximum distance from the Sun, this season is longer than in the northern hemisphere. Therefore, this long polar night and the Sun’s low inclination angle makes Antarctica a less heated continent even in the summer. In addition, the surface is almost completely covered by ice or by snow, which reflects most of the incident solar energy. Moreover, the average altitude of the continent exceeds 2000 m, contributing to a reduction in the thickness of the atmospheric troposphere, which normally acts as a heat source.
All these factors make Antarctica the coldest place on Earth, with a very short summer and very long winter, where temperatures reach winter minimum values even in autumn. In winter, the average temperatures range between −20 °C in the coastal strip and −70 °C in the continent, while in the summer the average temperatures range between 0 °C along the coast and −35 °C in the continent.
The orography of the continent presents a vast central elevated landform, the plateau, with a slope towards the ocean. MZS, located on the coast of the Ross Sea, is separated from the plateau by the Transantarctic Chain. The heat loss caused by surface albedo acts in the formation of an intense thermal inversion near the ground, with the consequent formation of very cold air that moves from the plateau by gravity towards the coastal areas, forming the katabatic winds. These winds, which often exceed 100 km/h, give rise to snowstorms that can persist for long periods; once they reach the coastal strip, they encounter more humid and warm air masses generating violent disturbances. Precipitation, in Antarctica, is generally scarce and more concentrated along the coastal strip, so that we may consider the Antarctic continent a cold desert. The area surrounding MZS is in fact considered one of the windiest places on Earth.
The trend of average monthly temperatures shows a maximum corresponding to the months of December and January, when temperatures are close to 0 °C. A sudden drop in temperatures occurs between February and March, reaching average values of around −20 °C in April. The temperatures continue to reach minimal values until July and August with the lowest temperatures recorded throughout the year, reaching −22 °C in average values with extreme peaks of −40 °C. In September the temperatures start to increase, which continues until December. The predominant winds measured at MSZ are oriented in a west and north-west direction. Those wind directions are the most intense winds coming from the glacial valleys of the Reeves and Priestley glaciers, the area that connects the Antarctic Plateau with the Ocean. In general, in this area, eastern and southern winds are rare. Days with no wind are very frequent [
3].
2.3. Data Analysis and Model Development
The solar-metric instrumentation control unit was programmed to perform a measurement of the three solar components, DNI, GHI and Diff.HI, each minute, while the data acquisition was recorded with a frequency of one record every five minutes. In particular, every five minutes, for each of the three irradiation components, the average and the mean standard deviation values (MSD) of the five consecutive measures in the interval were recorded. In the model construction of the DNI estimation starting from the GHI, only the averages value were used, while the MSD were used to check (and potentially discard) those average values that seemed far-fetched. Throughout 65 days of constant observations, 18,709 specific data elements were recorded which corresponds to 288 records per day.
18,545 of the recorded data elements from the entire data set are related to diurnal intervals that can be used for the estimation of the DNI. The reason the records of daily observations with respect to the total observations are so preponderant is that beyond the parallels of the polar circles, the day duration increases considerably in the summer season. The Sun in this area remains uninterruptedly beneath the horizon (the polar night). At the latitude where the Mario Zucchelli base is located, sunrise and sunset are present until 3rd November; then, from 4th November until 7th February, the Sun remains above the horizon for the entire 24 h period.
Conversely, it should be noted that although the days are long, in reality the Sun rises relatively little above the horizon during the day and this effect is much more pronounced as we move towards latitudes close to the poles. The phenomenon can be shown by calculating the Sun’s maximum elevation during the day [
6], for all the days of the year (
Figure 3).
The three observed components by solar-metric instrument are linked by the following relationship:
where,
I is global horizontal irradiance,
Ibn is the direct normal irradiance on the normal plane, θ
z is the solar zenith angle and
Id the diffuse solar irradiance on the horizontal plane. The availability of the three observed solar components therefore makes it possible to validate the individual measurements. The solar zenith angle is the angle that the direction of the solar beams forms with the normal to the horizontal plane at a fixed location. The zenith angle is calculated according to the latitude and longitude, day number of the year and the time of day. The exact Sun position and the moment of acquisition are essential to Equation (1). The product of the Ibn with the cosine of the zenith angle is the direct irradiance referred not to the normal, but rather to the horizontal plane, for which the symbol
Ib will be used from this point in the text:
The Equation (2), translated into the following relationship between the three irradiance components (GHI, DNI and Diff.HI), where all are referred to the horizontal arrangement is the sequent:
The dispersion graph covering the 65 days of the observed and calculated DNI values from Equation (3) is presented in
Figure 4. Following the relationship in Equation (3), we could validate our observations and calculate one of the needed values if measurements for the other two are available.
To begin with, all the observed data set must be validated. The good planning and execution of the observations certainly enables the elimination of the majority of systematic errors in the dataset. In the dataset, random errors inevitably occur, which are intimately connected to the observational process itself and can never be completely eliminated or prevented. The validation of the acquired dataset therefore has the purpose of identifying those values that are incorrect and excluding them from subsequent processing.
The validation tests that are used to check the accuracy of the measurement data in particular identifies inaccurate data, which do not satisfy the validation tests (
Figure 5), and which are declared incorrect and excluded from processing. Nevertheless, exact data may be inaccurate where the measured data still can be affected by a lack of precision the magnitude of which depends on the (in)sensitivity of the instrument, random disturbances, and so on. In theory, indicated with Δ, the allowable inaccuracy interval, a measure
m is acceptable if the value falls in the amplitude interval Δ centered on the true value ν:
However, since the true value is unknown, it will be a question of transforming the previous criteria, from time to time, in a suitable manner.
Figure 5 graphically illustrates the criteria described above.
The intervals of imprecision for the horizontal global and diffuse irradiance are: I ± ΔI/2 and Id ± ΔId/2 respectively. With reference to the possible reciprocal positions, there are therefore four cases, of which only the first is incompatible with the criteria Id ≤ I; the other three are collectively translated into the condition .
In the night-time irradiance measurements, which theoretically should have zero values, they were found to have values almost always between the range ±10 W·m
−2. Therefore, it is reasonable to follow the inaccuracy interval as:
The same inaccuracy interval is followed for all three radiations components. Consequently, the summary of the validated data following the above criteria is (from now on) shown in
Table 1:
2.4. Fine-Tuning Model
The 10,641 triplet (GHI, DNI, Diff.HI) observed daily and validated records, related to a period of 65 consecutive days, cannot be considered sufficient to extrapolate an estimate of the radiation in Antarctica. Indeed, the period that extends for the entire calendar year of 365 days, cannot be considered sufficient for the multi-year characterization of Sun exposure on the ground in Antarctica. However, the observed period can provide an adequate sample if the purpose is to model the radiative phenomenon through the correlations between the three forms of radiation. In particular, the relationship between DNI and GHI is of particular interest in this study, where:
The quality of the relationship between the radiation components (3) lies in the fact that numerous expeditions in Antarctica have enabled the acquisition of a multi-decade historical series of hourly GHI, both at Eneide station and at the weather stations scattered around the Mario Zucchelli base. The availability of GHI data allows the derivation of the DNI through the previous relationship and consequently to fully characterize the area of the Mario Zucchelli base.
In the literature, numerous correlations between the irradiance components are attested, established in different geographical areas for different time intervals (hourly, daily and so on), as in References [
7,
8,
9,
10,
11] and others. Historically, the most studied correlation is the dependence of the diffused component
k on the global atmospheric transmission coefficient
KT:
The global atmospheric transmission coefficient (
KT) is defined as the ratio between the global irradiance observed on the ground (
I) and the extraterrestrial irradiance (
I0), both referred to the horizontal plane:
while the fraction of the horizontal diffused radiation (
k) with respect to the global irradiance is given by the ratio:
where
Id is diffuse horizontal irradiance (Diff.HI), Thus, the transmission coefficient for normal direct radiation (
Kbn) is defined as:
where
Ion is the extraterrestrial normal irradiance.
Kbn is primary relevant for the calculation of DNI, where could be derived from
KT and
k through the relationship:
An excellent correlation
k =
f(
KT) established by Reference [
12] is as follows:
where, α and β are the Boland–Ridley parameters defined for certain specific geographic locations [
12]. Therefore, the relationship between DNI and GHI is derived from:
There are two non-negligible contraindications which need to be satisfied in order to obtain a direct correlation between the two parameter pairs; (KT, k) and (KT, Kbn):
(1) The dispersion graph of the parameters pair (
KT,
k) in
Figure 6 presents the excessive variability of the data around the hypothetical correlation curve. This phenomenon is inevitable considering that the radiative attenuation in the atmosphere depends on many other meteorological variables, which are all unpredictable and difficult to measure or estimate, such as: humidity, wind, temperature, aerosol content, and various others. Usually, the above-mentioned correlation is mostly studied through average hourly data, for which the variability is obviously less accentuated than the measures adopted in this study, carried out at a frequency of every 5 min.
The same consideration also applies to the dispersion graph that considers the second pair of parameters (
KT,
Kbn) in
Figure 7.
(2) Usually the estimation of the regression parameters is carried out using the least squared method, which is computationally easy if the proposed correlation is linear. Therefore, until regression analysis can be applied to find the parameters α and β, it is necessary to transform Equation (11) in its linear expression, which contains the needed parameters that now can be estimated. Following the above, we obtain:
The following parameters pair (
Kbn,
k) in
Figure 8 better represent the measurements with the approximate fitting second order polynomial curve than the other two parameters pairs. On the graph we observe that the measurement points are gathered more strongly and continuously on the fitting curve.
In this case, the candidate second order polynomial is:
In conditions of zero direct radiation, Kbn = 0, and the global radiation becomes entirely composed of the diffused component; consequently the diffused fraction must be unitary, k = 1. Therefore, in Equation (14) the constant c is 1. When these assumptions are taken into account and the least squares method is applied, the regression on the parabola (14) produces the following estimates (a = 0.994, b = −1.93, R2 = 0.98).
To interpret the parabolic correlation of Equation (14) in the Boland–Ridley formulas, it is necessary to report on a Cartesian reference of the abscissa
KT and the ordinate
k, both the dependence
k =
f(
KT) obtained by the polynomial expression (14) and also the dependence found in expression (11) of Boland–Ridley with different parameters pairs (α, β). Afterwards, we are able to estimate the final coefficients by regression. Proceeding in this way we obtain:
The two final coefficients (15) are estimated from Equation (11) considering the 65 days of observed data from the Antarctic region.
Figure 9 and
Figure 10 show the final dispersion graphs between the three coefficients (
KT;
k,
Kbn) and the estimated interpolation curves. The Boland–Ridley model uses the α and β coefficients for the European (blue) and Australian (violet) regions [
13], while the model developed in this study uses the calculated coefficients (15) for the Antarctic region (ENEA).
2.5. Model Validation
The models, in general, when developed according to the acquired observational data set, need to be compared with the corresponding estimates, and whether the deviation magnitudes do not compromise the quality of the proposed model needs to be evaluated. In order to quantify the quality of the proposed model, some statistical indicators are useful. The first of these indicators, called the Mean Bias Error (MBE), is simply the difference between the mean estimate values and the observation values. In our case, the calculated value is:
In the above expression,
N = 10,641, it is the total number of observation pairs in the data; the subscript “i” refers to the i-th pair, while “o” and “c” refer to the observations and calculated values, respectively. The estimates are calculated using Equation (12) with the values given by Equation (15). The low value of the MBE means that the measurements and estimates are offset on the average value. Even though, a low (or, at least, zero) MBE value does not exclude the possibility that there are some significant gaps between the measurements and the estimates, even potentially in a non-negligible number. In order to underline the variance index that takes the above fact into account, it is useful to use the Mean Absolute Error (MAE), which in our case gives:
or, in relative terms:
where,
is indicated as the average of the DNI measurements, which results in:
Another widely used quality deviation index is the Root Mean Squared Error (RMSE) which, compared to the previous evaluated MAE, considers the squares of the discard rather than the absolute values, in order to attribute a greater weight to higher deviations in the summation, such as:
The final quality index considered here can be obtained by comparing the average DNI values of all the measurements and the corresponding estimated values. Since the average value of the DNI estimate is calculated as:
we finally get:
The summary of statistical measures of presented model is compared with Boland–Ridley coefficients for Australia and Europe taken from Reference [
13] are presented in
Table 2. Within the new calculated α and β coefficients for Antarctic region from observed data we reduced the model error from 12.9% considering the known coefficients for Australia to 2.8%.
The values of the reported quality indicators are affected by the complexity of the phenomenology of solar radiation and its components that reach the ground. They depend on numerous meteorological factors which are contingent and difficult to establish for the purpose of fine-tuning models. In this study, we tried to develop a fine-tuning model which is established on the basis of high acquisition frequency data (every 5 min). Usually the models are developed with reference to hourly data, in which the variability inherent in the phenomenon is established thanks to the much longer measurement intervals that are considered. Nevertheless, the values of the above obtained quality indices are in line with similar studies based on hourly data [
7,
10,
11,
12].