1. Introduction
Rainfall is the most important meteorological and climatological parameter for natural ecosystems and human life on earth, as it affects the enrichment of lakes and underground aquifers, river flow regime, and many natural hazards (floods, drought, landslides, etc.). Accurate knowledge of the spatial and seasonal variations of long-term rainfall time series is required for rural and forest development and planning, sustainable development, as well as infrastructure work scheduling.
The Mediterranean basin has a wide range of climatic conditions [
1]. In particular, the rainfall regime in Greece presents a highly irregular behavior, both on spatial and temporal scales, namely in rainfall amount and rainfall distribution [
1,
2]. It is well accepted that the main physical and physico-geographical factors controlling the spatial distribution of rainfall over Greece are the following: the atmospheric circulation, the mountains in the west and east, the Mediterranean Sea-surface temperature distribution, the dehumidification of the air masses crossing the Aegean Sea, and land and sea interactions [
3]. Furthermore, the highest rainfall totals for western Greece were found to be related to the atmospheric circulation associated with the Mediterranean Sea-surface temperature distribution and the complex topography of the region, as imposed by the orography of the Pindus Mountains in northwestern and central Greece and the mountains of Olympus and Crete [
4].
Mountainous areas are of great interest, because runoff is generated and supplies lowlands (through catchments) with water. Moreover, the plain areas receive the eroded material deposited by mountainous catchments, due to intense rainfall. Variability is considered particularly higher in a mountainous environment, because the rainfall pattern is influenced by complex terrain conditions [
5,
6,
7].
The assessment of climate variability is a common issue that should be treated by hydrologists; in particular, the total rainfall in an ungauged site over an area (e.g., catchment) should be evaluated. However, hydrologists face a crucial challenge when it comes to mountainous terrains, since data from only a few meteorological stations are usually available.
To overcome the lack of rainfall data, interpolation methods have been developed over the last few years for rainfall modeling and mapping. These methods are based on the similarity and the topological relationship between nearby sample points and on the value of the variable to be measured [
8]. Interpolation can be achieved using simple methods (splines, inverse distance weighting, Thiessen polygons, etc.) or advanced geostatistical methods (e.g., kriging). Geostatistical interpolation has become the most appropriate downscale technique in applied climatology and for areas with complex terrain, since it is based on the spatial variability of the variables of interest and allows the quantification of the estimation uncertainty [
9,
10,
11].
In recent decades, the interest in climate variability and climate change has augmented. Climate change has emerged as a key issue facing environmental and economic aspects, as it affects floods [
12], soil erosion [
13], drought phenomena [
14], agriculture [
15], tourism [
16], groundwater aquifers [
17], and forest fires [
18].
According to IPCC reports [
19], the Mediterranean basin is expected to become warmer and drier due to the anthropogenic increase of greenhouse gas (GHG) emissions (CO
2, CH
4, N
2O, and F-gases), until the end of the 21st century [
20,
21]. Moreover, in Mediterranean regions, future warming is expected to be greater than the global mean, accompanied by a significant decrease in rainfall [
22]. Based on the above, researchers are orienting their work to investigate trends in rainfall conditions [
23,
24,
25,
26,
27,
28] and to estimate future rainfalls [
29,
30] within Greece. Research results highlighted the decreasing trend of rainfalls recorded from long-term time series analysis, whereas this reduction is expected to be higher in the future, based on regional climate models (RCMs) that have been proposed. Even though much research has been conducted in Greece on trend analysis and spatial mapping of rainfall, only limited research efforts concern mountainous areas with consideration given to long-term time series using a dense network of stations. The identification and recording of seasonal trends can improve water resources management through the selection of appropriate management practices.
The main object of this study was to detect annual and seasonal variation and trends in rainfall time series based on data from rain gauge stations located in mountainous areas. Furthermore, variation and uncertainty in the small-scale rainfall interpolation in mountainous catchments were also evaluated.
2. Materials and Methods
2.1. Study Area
The study was conducted over the central Pindus mountain range, in Central Greece. The area is considered highly important from a hydrological point of view because it is located in the mountainous area of two hydrological basins (Pinios and Acheloos), which supply Central Greece with water, and where many hydropower dams have been constructed. For this purpose, a dense network of meteorological stations (compared to other regions in Greece) has been established, in mountainous terrain. The characteristics of the meteorological stations used in this study are given in
Table 1.
Observations of monthly rainfall totals for a period of 55 years of rainfall (1961–2016) were used from all nine stations of the wider region (see
Figure 1). These stations are equipped with pluviometer and Hellmann-type rain gauges (Fuess Meteorologische Instrumente KG, Königs Wusterhausen, Germany) with a precision of 0.1 mm. The data series are complete, that is, they have no missing values. Moreover, the instruments and observing practices were common among all stations used, and they remained the same during this study’s research. The double mass method and two parametric statistical tests (Student’s
t-test and chi-squared test) were applied to adjust any heterogeneity of the rainfall data, and the details regarding these methods can be obtained from the WMO [
31]. The latter tests demonstrated that the precipitation data were indeed homogeneous and ready to be entered into the subsequent procedures of the study.
These stations are operated by the Ministry of Environment & Energy (Agiofylo, Chrysomilia, Elati, Katafyto, and Malakasi), the Public Power Corporation (Mesochora, Polyneri, and Stournareika) and the University Forest Administration and Management Fund (Pertouli).
The study area is an area of increased importance, because it is located in the mountainous area of two hydrological basins (Pinios and Acheloos), which supply Central Greece with fresh water. The mountainous catchments examined within this study are (1) Klinovitikos, (2) Aspropotamos, (3) Korpos, and (4) Portaikos, as showed in
Figure 1. Additionally, the basic morphometrical and hydrographical characteristics are given in
Table 2.
The study area is characterized as mountainous, whereas the relief is rather intense. Regarding geology, the main rocks are flysch and limestones, quite vulnerable to landslides and weathering phenomena. The forest cover is high and distributed to the mountainous catchment as follows: (1) Klinovitikos, 66%; (2) Aspropotamos, 73%; (3) Korpos, 72%; and (4) Portaikos, 44%. The dominant forest species in the study area are Abies borisii-regis, Quercus frainetto, Quercus petraea, Pinus nigra, and Fagus Sylvatica. Moreover, the study region is of great environmental importance, belonging to the European nature conservation network Natura 2000 according to the criteria of Directive 92/43/EEC.
2.2. Trend Analysis
Time series of annual and seasonal rainfall were subjected to the Mann–Kendall test to detect possible trends over the period of 1961–2016. It is the most widely used test for trend analysis in climatological time series [
32].
The Mann–Kendall test is a non-parametric statistical test to detect the presence of a monotonic increasing or decreasing trend within a time series [
33,
34]. The advantage of the non-parametric tests over the parametric tests is that they are robust and more suitable for non-normally distributed data with missing and extreme values, frequently encountered in environmental time series [
35].
The Mann–Kendall test statistics
S is calculated as:
where
n is the number of data points,
xi and
xk are the data values in the time series
j and
k (
j >
k), respectively, and sign (
xj −
xi) is the sign function as follows:
The variance is computed as:
where
n is the number of data points,
P is the number of tied groups, the summary sign (
P) indicates the summation over all tied groups, and
ti is the number of data values in the
Pth group. In case of no tied groups, this summary process can be ignored. A tied group is a set of sample data having the same value. In the case where the sample size
n > 30, the standard normal test statistic
Z is estimated by:
Positive values of Z indicate increasing trends, whereas negative Z values indicate decreasing trends. Trend testing is done at a specific significance level. When |Z| > Z1−a/2, the null hypothesis is rejected and a significant trend exists in the time series. The value of Z1−a/2 is obtained from the standard normal distribution table. In this study, the significance level a = 0.05 was used. At the 5% significance level, the null hypothesis of no trend is rejected if |Z| > 1.96.
Furthermore, a change-point analysis approach was applied, using the Change-Point Analyzer (CPA) [
36]. This method iteratively uses a combination of cumulative sum charts (CUSUM) and bootstrapping to detect whether a change in the mean of the rainfall time series has taken place. A sudden change in the direction of the CUSUM indicates a sudden shift or change in the average. Additionally, trend magnitudes were computed by employing the Theil–Sen approach (TSA) [
37,
38], which is based on slope β, often referred to as Sen’s slope [
38]. It is preferable to linear regression, because it limits the influence of outliers on the slope [
39].
2.3. Spatial Mapping of Rainfall
Initially, the relationship between altitude and rainfall height (mm) for both an annual and a seasonal basis was evaluated using different types of trendlines (linear, logarithmic, polynomial, power, and exponential). Moreover, the spatial distribution was determined, applying multiple regression equation and taking into account not only the altitude but also the longitude and latitude. The multiple linear regression equation has the following form:
where
P represents the rainfall (mm),
a is constant,
b1…
b3 are coefficients obtained for each independent variable,
X1 is longitude (°),
X2 is latitude (°), and
X3 is altitude (m).
Furthermore, the geostatistical interpolation method of ordinary kriging (spherical variogram) was employed, using the ArcGIS 10.2 software. At this point, it should be noted that geostatistical methods are more valid for increasing sample size. To this end, automatic points were generated in a 1 km × 1 km grid resolution within the catchments, using the Fishnet command of the ArcGIS 10.2 software’s Data Management toolbar. Therefore, rainfall height was calculated for each point and all seasons, based on the multiple regression equation described above and the calculation of the individual variables for each point.
Finally, cross-validation was performed, in order to compare results of rainfall spatial interpolation derived from ordinary kriging with other spatial interpolation methods, for example, inverse distance weighting (IDW), radial basis function (RBF), and universal kriging (UK), and a combination of variograms (spherical, exponential), using the Geostatistical Wizard tool of ArcGIS [
40].
Cross-validation is any of various similar model validation techniques for assessing how the results of a statistical analysis will generalize to an independent dataset. It is mainly used in settings where the goal is prediction, and where one wants to estimate how accurately a predictive model will perform in practice. In a prediction problem, a model is usually given a dataset of known data on which training is run (training dataset), and a dataset of unknown data (or first seen data) against which the model is tested. The goal of cross-validation is to test the model’s ability to predict new data that were not used in estimating it, in order to flag problems like overfitting and to give an insight on how the model will generalize to an independent dataset.
The root mean square error (RMSE) and mean error were used as evaluation indexes in this case study. The mathematical description of these indexes is given below:
where
n is the number of observations, and
xi and
yi are the observed and interpolated rainfall values, respectively, for
i = 1, 2 … n. The RMSE is considered one of the most reliable indexes because it depicts the deviation from the truth rather than the mean value, as in the case of standard deviation. The RMSE gives the weighted variations (residuals) in errors between the estimated and observed values, whereas mean error measures the weighted average magnitude of the errors. Mean error is the most natural and unambiguous measure of average error magnitude [
41,
42]. RMSE, on the other hand, is one of the most widely used error measures [
43].
3. Results
The rainfall pattern in the case study area demonstrated certain particularities and varied greatly in both space and time, in line with the main characteristics of the climate type in the Mediterranean basin. The seasonal distribution of rainfall based on the examined meteorological stations’ data is shown in
Figure 2. As depicted, 35% of the annual rainfall occurs during winter, 32% in autumn, 24% in spring, and only 9% in summer.
The results of the Mann–Kendall statistics test indicated that most of the meteorological stations (around 67%) recorded a downward trend in annual rainfall, which could be considered as statistically significant for the Katafyto station. In addition, decreasing trends of rainfall time series were recorded in winter and autumn for most of the stations. During spring, half of the stations revealed a decreasing trend, whereas the other half revealed an increasing one. Finally, summer was the only time season when rainfall trends were recorded increasing in most of the stations.
Detailed results of the application of the Mann–Kendall test are given in
Table 3. The upward arrow ↑ indicates an increasing trend, whereas the downward arrow ↓ indicates a decreasing one. Furthermore, the light grey cell color shows that the trend is not statistically significant (for a significance level of a = 0.05), whereas the dark grey color shows a statistically significant trend. In addition, the number within the parenthesis indicates the time of occurrence for changes in the mean of the rainfall time series.
As shown in
Table 3, the rainfall non-stationarity starts to occur in the middle of 1960s for the annual, autumn, spring, and summer rainfalls and the early 1970s for the winter rainfall in most of the stations.
Moreover, Sen’s slope was used to compute the trend magnitude per decade, which ranged from approximately −5.3% to +1.5% (average −1.9%) in annual rainfalls, from −14.5% to +7.5% (average −3.2%) in winter, from −4.7% to +5.5% (average +0.7%) in autumn, from −4.2% to +3.6% (average +0.2%) in spring, and from −0.3% to +4.8% (average +2.4%) in summer. Detailed results for each station are given in the next table (see
Table 4).
The investigation of the relationships between the variation of rainfall and altitude showed that the derived coefficients of determination are rather low. This indicates that only a small percentage (19–25%) of rainfall variation in the study area was due to the change in altitude. Furthermore, it was observed that the power trendline performed the best fit in all cases. In the equations below (see
Table 5),
y is the rainfall (mm) and
x is altitude (m).
For that reason, the spatial variability of both seasonal and annual rainfall was assessed using a multiple regression analysis. All the necessary factors affecting rainfalls were included into the multiple regression procedure, including longitude, latitude, and altitude. The coefficient obtained for each factor, based on a regression analysis, is given in
Table 6.
It is noteworthy that the multiple regression model can explain 62.2% of the spatial variability of the annual rainfall, 58.9% of variability in winter, 75.9% of variability in autumn, 55.1% of variability in spring, and 32.2% of variability in summer. In order to evaluate the statistical significance of the examined factors,
p-values were estimated (see
Table 7). In cases where the significant level of the examined factor is less than 95% (
p > 0.05), the factor should be eliminated from the model and the multi-linear regression must be performed again. In this study, the
p-values for all factors were less than 0.05, which means a strong presumption against null hypothesis.
Furthermore, regarding the results of cross validation amongst different spatial interpolation methods it was revealed that better results were achieved by Ordinary Kriging combined with spherical semivariogram (
Table 8).
The semivariogram/covariance cloud tool shows the empirical semivariogram and covariance values for all pairs of locations within a dataset and plots them as a function of the distance that separates the two locations. It can be used to examine the local characteristics of spatial autocorrelation within a dataset and look for local outliers. The selection of lag size has an important effect on the semivariogram. If the lag size is too large, the short-range autocorrelation may be masked, whereas if the lag size is too small, there may be many empty bins. A rule of thumb is to multiply the lag size times the number of lags, which should be about half of the largest distance among all points.
Important characteristics of the semivariogram are also the nugget and the partial sill. The nugget is a parameter of covariance or semivariogram model that represents independent error, and a microscale variation at spatial scales that are too fine to detect. As for the partial sill, it is a parameter that represents the variance of a spatially autocorrelated process without any nugget effect. These parameters for the spherical variogram that was used in this study are given in the following table (see
Table 9).
Regarding the semivariograms (see
Figure 3), it can be assumed that the phenomenon to estimate is smooth (i.e., rainfall values change gradually with the distance). The semivariogram represents the continuity structure quite well also. Additionally, the semivariogram diagrams showed that the samples did not show autocorrelation in any direction.
The rainfall spatial distribution maps over the mountainous catchment of the study area were produced using ordinary kriging and are given in the following figure (see
Figure 4).
The major range of rainfall conditions within the mountainous catchment of the study area is shown in
Figure 2. The rainfall range (mm) in each catchment and every season is given in
Table 10.