1. Introduction
Statistical precipitation downscaling is the process of making a link between a set of large-scale atmospheric variables (i.e., mean sea level pressure, vorticity, and geopotential height) and predictand (i.e., local precipitation). The large-scale predictors are essential for climate change research, but they do not actually provide a truthful presentation of the climate in a small basin. Generally, they have a spatial resolution coarser than 2 by 2 degrees in latitude and longitude, whereas hydrologists are more concerned with the catchment scale which is usually up to a few hundred square kilometers. This leads to a need for downscaling large-scale predictors to local precipitation. The NCEP reanalysis data set is a continually updated globally gridded data set that represents the state of the Earth’s atmosphere, incorporating observations and numerical weather prediction model output from 1948 to present. The NCEP reanalysis data is commonly used to develop a statistical relationship between large-scale climate factors with local rainfall for building (or training) downscaling models. The GCM outputs under climate change scenarios are then used as the inputs of downscaling models to project future precipitations for studying climate-change impacts [
1]. The current study used the NCEP reanalysis data for building the proposed downscaling approach.
To date, there are many methods proposed for statistical downscaling using different techniques such as stochastic weather generators [
2,
3,
4,
5], weather typing method [
6,
7,
8], resampling methods [
9,
10,
11], and regression methods. The regression methods are attracting more attention and preferred to apply due to their flexibility and straightforwardness. There are numerous variant approaches of regression-based downscaling techniques such as logistic regression model [
12], local polynomial regression [
13], linear and non-linear regression [
14], canonical correlation analysis [
15], principal components [
16], artificial neural network [
17,
18], support vector machine (SVM) [
19,
20,
21,
22], and beta regression [
23].
Among these statistical downscaling methods, SVM shows its elegant and remarkable advantages comparing to the other methods. There are several studies which proved that SVM and variants of SVM are superior to ANN [
19,
24], multivariate analysis, and the Statistical DownScaling Model (SDSM) [
24]. For instance, SVM performed better than ANN in predicting groundwater levels [
25], runoff and sediment yield simulation [
26], flood stage prediction [
27], rainfall–runoff modeling [
28], river flow forecasting [
29,
30], long-term discharge prediction [
31], and modeling discharge-suspended sediment relationship [
32]. SVM is also superior to multiple linear regression (MLR) in streamflow forecasting [
33], autoregressive moving average (ARMA) in discharge prediction [
31,
34], autoregressive integrated moving average (ARIMA) in streamflow prediction [
35], neural networks (NN), and MLR in daily water demand and inflow forecasting [
36] and prediction of reservoir inflows [
37]. In addition, SVM performed better than NN and empirical models in modeling daily reference evapotranspiration [
38], neuro fuzzy inference system (ANFIS) in river flow forecasting [
29] and daily forecasting of dam water levels [
39], and genetic programming (GP) in forecasting monthly discharge time series [
34].
However, many researches for downscaling precipitation at the catchment scale using SVM [
19,
24,
40,
41,
42,
43] conclude that the downscaling methods based on SVM performed well for normal rainfall but unsatisfactorily for extreme rainfall (i.e., underestimated extreme-rainfall amount). Tripathi et al. [
19] detected that monthly precipitation downscaling by SVM could not reproduce the high rainfall observed in the historical records since the regression-based statistical downscaling models regularly cannot explain entire variance of the downscaled variable. They suggested that investigation of more large-scale predictor variables and a much longer validation period might likely provide more insight into this problem. A similar finding about the inability of SVM to mimic high rainfall has also been reported by Anandhi et al. [
40].
In Taiwan, the downscaling methods based on SVM have been proposed by Chen et al. [
24] and Yang et al. [
41] for Shih-Men Reservoir catchment in northern Taiwan. The main structure of their proposed downscaling approach comprises the rainfall-state classification and the regression for rainfall amount. Chen et al. [
24] used support vector classification (SVC) and linear discriminant analysis (LDA) for rainfall-state classification, while Yang et al. [
41] only used LDA. Both the studies use the support vector regression (SVR) for the rainfall-amount prediction for wet days. Chen et al. [
24] compared the performance of SVM to linear multiple regression and SDSM. The downscaled results showed that the SVM produced more accurate daily precipitation than SDSM and linear multiple regression. Yang et al. [
41] found that the proposed downscaling model performed well in capturing the magnitude and variation of daily precipitations below 50 mm/day but underestimated the extreme rainfalls.
The aforementioned weakness of SVM in downscaling extreme rainfall inspires the current study to propose a modified statistical downscaling approach based on the methods developed by Chen et al. [
24] and Yang et al. [
41] for improving the extreme rainfall downscaling. The main structure of the proposed downscaling approach comprises the rainfall-state classification and the regression for rainfall-amount prediction. Three classification methods, including LDA, random forest (RF) and SVC, were adopted for rainfall-state classification and their performances were compared. The least square support vector regression (LS-SVR) was used for the rainfall-amount prediction for different rainfall states. Two rainfall states (i.e., dry day and wet day) and three rainfall states (dry day, non-extreme-rainfall day, and extreme-rainfall day) were defined and compared for judging their downscaling performances. Through the above comparisons, the optimal classification method with proper rainfall-state delineation can be found and linked with the rainfall-amount prediction method for improving the extreme rainfall downscaling.
The remaining part of this paper is organized as follows.
Section 2 “Study Area and Data Set” provides a summary description of the study area and the data set including local rainfall and large-scale predictors.
Section 3 “Methodology” describes three types of the proposed approach (i.e., Approach Type-I, Approach Type-II and Approach Type-III) and briefly introduces LDA, RF, and LS-SVR.
Section 4 “Results and Discussion” describes the analysis results of rainfall-states classification and regression for rainfall-amount prediction by different classification methods and types of approach. Comparison of different classification methods (i.e., LDA, RF, and SVC) and different types of approach were made. Finally,
Section 5 "Conclusions and Future Work” concludes the paper.
2. Study Area and Data Set
Shih-Men Reservoir, located in the Danshuei River basin in northern Taiwan, was completed in 1964 as a multifunction reservoir for water supply, agriculture, hydropower generation, and flood control. The Shih-Men Reservoir is a major reservoir with a storage capacity of around 3 × 10
8 m
3. Its upstream catchment (
Figure 1) has an area of 763 km
2, and the basin ground elevation varies from 209 to 2609 meters. The average annual rainfall of the catchment is around 2250 mm.
Taiwan’s climate is governed by the East Asian Monsoon, which is divided into the summer and winter monsoons. Therefore, the Water Resources Bureau in Taiwan divided a year into the wet season (May–October) and the dry season (November–April) based on the summer and winter monsoons, respectively. The proportion of rainfall during the wet and dry seasons is about 7:3. The long-term daily rainfall from 1964 to 2013 at 10 rain gauges in the study area were collected to serve as the dataset (
Table 1). The daily areal rainfalls in Shih-Men Reservoir catchment were calculated by using the Thiessen polygons method which determined the weights of all the stations listed in
Table 1.
The daily data of 28 climate variables at the nearest grid point (i.e., Grid #2 at 122.5° E, 25° N in
Figure 1) from 1964 to 2013 are obtained from the re-analysis data of National Centre for Environmental Prediction (NCEP)/National Centre for Atmospheric Research (NCAR) as listed in
Table 2. These climate variables were used as the candidates of model predictors. The areal rainfalls and the NCEP reanalysis data during 1964–1999 (calibration period) and 2000–2013 (validation period) were used for building statistical downscaling models and for examining and comparing downscaling results, respectively.
4. Results and Discussion
4.1. Rainfall-State Classification
For Approach Type-I, the calibration data (including the NCEP reanalysis data and local rainfalls) were separated into two groups (i.e., wet-day group and dry-day groups) according to daily local rainfalls in both dry and wet seasons. The two-sample Kolmogorov–Smirnov test was then performed to choose suitable predictors of the NCEP reanalysis data. This study used the two-sample Kolmogorov–Smirnov test to select predictors of the NCEP reanalysis data that are distinguishable between the dry-day group and the wet-day group. The predictors which showed a significant difference between two groups (with a significance level of 0.01) were considered as the suitable predictors for classification models. In the current study, the test was performed by the “kstest2” function provided by MathWorks. The selected predictors after testing are mean sea level pressure (mslp), vorticity (p_z, p5_z, and p8_z), geopotential height (p300, p500, and p850), relative humidity (r500, r850, and rhum), zonal wind speed (ua_700 and ua_850), meridional wind speed (vas and va_925), and temperature (ta_700, ta_850, and ta_925). The above selected predictors for Approach Type-I were also used for Approach Type-II and Approach Type-III.
For Approach Type-II, after conducting the same aforementioned process of Approach Type-I, the given wet days were further classified into non-extreme-rainfall-day group and extreme-rainfall-day group. For Approach Type-III, the calibration data (including the NCEP reanalysis data and local rainfalls) were separated into dry-day, non-extreme-rainfall-day, and extreme-rainfall-day groups according to the daily local rainfalls. Because there are only few extreme rainfalls during the dry season, the classification of non-extreme-day and extreme-day was only conducted during the wet season.
The accuracies of (1) the dry-day/wet-day classification for Approach Type-I, (2) the non-extreme-rainfall-day/extreme-rainfall-day classification for Approach Type-II and (3) the dry-day/non-extreme-rainfall-day/extreme-rainfall-day classification for Approach Type-III can be estimated respectively as
where
D is the number of dry days, W is the number of wet days, D|D indicates the number of days that a dry day is correctly classified as a dry day, W|W indicates the number of wet days that a wet day correctly classified as a wet day, N is the number of non-extreme-rainfall days, E is the number of extreme-rainfall days, N|N indicates the number of days that a non-extreme-rainfall day is correctly classified as a non-extreme-rainfall day, and E|E indicates the number of extreme-rainfall days that an extreme-rainfall day correctly classified as an extreme-rainfall day.
Since the formulas for calculating the classification accuracies for Approach Type-I (Equation (10)), Approach Type-II (Equations (10) and (11)) and Approach Type-III (Equation (12)) are different, the results in
Table 3 were only used for comparing the classification performances by different methods (LDA, SVC, and RF) in each approach, not for judging which type of approach is the best in the classification step.
The performances of (1) the dry-day/wet-day classification for Approach Type-I, (2) the non-extreme-rainfall-day/extreme-rainfall-day classification for Approach Type-II and (3) the dry-day/non-extreme-rainfall-day/extreme-rainfall-day classification for Approach Type-III are shown in
Table 3. There are three methods (i.e., LDA, RF, and SVC) which were used for classifying rainfall states in both wet and dry season. The accuracies of dry-day/wet-day classification are generally higher than 72%. The performance of the dry-day/wet-day classification models in the wet season are better than those in the dry season for all three methods. The accuracies of dry-day/non-extreme-rainfall-day/extreme-rainfall-day classification are generally higher than 66%. The accuracies of non-extreme-rainfall-day/extreme-rainfall-day classification in Step 2 of Approach Type-II are generally higher than 93%.
The proportions of individual states (dry day, non-extreme-rainfall day, and extreme-rainfall day) during the wet season in the calibration period are 33.83%, 62.72%, and 3.45%, respectively. In the validation period, the proportions of individual states (dry day, non-extreme-rainfall day, and extreme-rainfall day) during the wet season are 34.07%, 61.54%, and 4.39%, respectively. Improvement of extreme rainfall downscaling is the main concern of the current study. For emphasizing the classification accuracy for extreme-rainfall-day state, the classification accuracies (%) of extreme-rainfall day during wet season were presented in
Table 4. The dry season was not taken into account because most of extreme-rainfall-day occurred during wet season.
By comparing the performances of the three classification methods, it is found that RF outperforms LDA and SVC by the largest classification accuracy (%) of dry/wet day and extreme-rainfall/non-extreme-rainfall day in
Table 3, and the largest classification accuracy of extreme-rainfall-day state during the wet season in
Table 4. Therefore, the outputs of RF classification models were selected as inputs for the regression models to simulate rainfall amounts.
4.2. Regression for Rainfall-Amount
Before establishing the regression models, the principal component analysis (PCA) was used to transform the predictors (i.e., the 28 climate variables of the NCEP reanalysis data) to new matrices as the input matrices for LS-SVR models. The purposes of PCA are to eliminate the multicollinearity and reduce the dimension of a large data set. In the current study, PCA was carried out with the NCEP reanalysis data obtained from the nearest grid point of the study area. Nine principal components were selected based on the eigen-values which are greater than 1.0, which can explain more than 85% of the variance of the data set (i.e., the NCEP reanalysis data). The transformed variables by the nine principal components were used as the predictors of the LS-SVR models for different types of approach. Based on the transformed variables by PCA, the LS-SVR models were developed for the wet and dry seasons separately. PCA reduced the dimension of the large data set from a sample size of 143,080 corresponding to 28 predictors to a smaller sample size of 45,990 corresponding to nine principle components, which considerably reduces the computational consumption. The local rainfall and the NCEP reanalysis data during the calibration period were used to tune the two hyper-parameters of each LS-SVR model.
Table 5 lists the tuned parameters of the LS-SVR models. Since most of extreme rainfalls occur during the wet season, the observed data were separated into two groups (i.e., non-extreme-rainfall group and extreme-rainfall group) for Approach Type-II and Approach Type-III. As there are too few extreme rainfalls during the dry season, only Approach Type-I approach was used for this season. The rainfalls calculated by the LS-SVR models are normalized values which should be converted to their original scale.
The data of 1964–1999 were used to train the classification and regression models. During the validation period (2000–2013), the 2990 wet days were extracted for construction and evaluation of the LS-SVR models. In order to demonstrate the accuracy of the proposed approach objectively and evidently, three statistical measures (i.e., Mean, standard deviation (SD) and Skewness) are employed for examining whether the downscaling rainfalls by the proposed approach conserves the statistical characteristics of the observed rainfalls.
Table 6 and
Table 7 list these above measures for comparing the performances of the three types of approach. From the tables, the output of Approach Type-II is slightly better than that of Approach Type-III. The simulated values of Mean, SD, and skewness in Approach Type-II are closer to the observed values than those in Approach Type-III except for SD during the calibration period (
Table 6). In general, the Mean and SD of simulated rainfalls from the three types of approach tend to underestimate the observed rainfalls. However, Approach Type-II and Approach Type-III conserve the Mean and SD of observed rainfalls significantly more than Approach Type-I.
To compare the simulated performances for each type of approach,
Figure 5 shows the RMSE of individual months for three types of approach during the wet season in the validation period. Since most of extreme rainfalls occur during the wet season and the efficiency of Approach Type-II and Approach Type-III strongly represents during this season, only the RMSE of individual months during the wet season is presented in the figure. In
Figure 5, Approach Type-III and Approach Type-II have the RMSE smaller than that of Approach Type-I in most of months except for the month of July. This is because that the classification models of Approach Type-II and Approach Type-III only have the accuracy around 50% (correctly classified 9 extreme rainfalls in a total of 18 extreme rainfalls) in July. While the accuracy in August and September are 64.29% and 79.16%, respectively, for Approach Type-III, which is much better than Approach Type-I. This implies that the accuracy of extreme rainfall classification has a significant impact on the efficiency of the proposed approach. The classification of the non-extreme-day/extreme-day showed that the performance in August and September are better when compared to July, which might be attributed to the number of heavy rainfalls in August and September.
In general, Approach Type-II and Approach Type-III show that their performances in terms of Mean, SD, and skewness are better than the performance of Approach Type-I. Approach Type-II shows its SD significantly better than that of Approach Type-I and Approach Type-III. Approach Type-II is slightly better than Approach Type-III in terms of Mean and skewness. It is apparent that both Approach Type-II and Approach Type-III outperform Approach Type-I in term of generation of extreme rainfalls during both calibration and validation periods (
Figure 6 and
Figure 7). Approach Type-II and Approach Type-III are quite similar in reproducing extreme rainfalls.
Figure 6 and
Figure 7 shows the daily downscaling rainfalls for the three types of approach in the form of quantile–quantile (Q–Q) plots. It reveals that Approach Type-II and Approach Type-III significantly outperform the Approach Type-I when rainfalls are larger than around 50 mm/day. This results are consistent with the comparison results of statistical characteristics for both Approach Type-II and Approach Type-III with a better skewness estimate than that of Approach Type-I. Overall, both Approach Type-II and Approach Type-III models perform better than Approach Type-I in downscaling extreme rainfall amounts. It is worth noting that there are three very extreme rainfalls greater than 450 mm/day during the validation period (
Figure 7) and the three very extreme rainfalls were still significantly underestimated. This is because there are too few data of very extreme rainfalls for training the models.
4.3. Discussion
Negative output values from the LS-SVR models were set to zero in the current study. The proportion of negative values among the total number of wet days are 3.59% in Approach Type-I, 1.45% in Approach Type-II, and 1.96% in Approach Type-III during the calibration period. Those are 3.89% in Approach Type-I, 2.68% in Approach Type-II, and 3.65 in Approach Type-III during the validation period. It is obvious that Approach Type-II and Approach Type-III have less negative output values than Approach Type-I. Separation of wet days into non-extreme-rainfall-day and extreme-rainfall-day data groups can get the benefit in terms of gaining less negative output values from LS-SVR models. The reason might be that the separation supports the LS-SVR models in Approach Type-II and Approach Type-III to gain more suitable parameters for each data groups (i.e., non-extreme-rainfall day and extreme-rainfall day), while the LS-SVR model in Approach Type-I only tunes one set of parameters for only a wet-day data group.
The poor skill of downscaling in capturing extreme events is attributed to two reasons. First, the standardization may reduce the bias in the mean and variance of the predictor variable, but it is much harder to accommodate the bias in large-scale patterns of atmospheric circulation or unrealistic intervariable relationships between predictor variables. The other reason may be that the NCEP reanalysis data are not able to reproduce the extreme value as many extreme events occur at a much smaller scale.
Even though the poor skill of GCM downscaling in capturing extreme events, it is found that the proposed downscaling approach with three rainfall states classification (i.e., Approach Type-II and Approach Type-III) can improve the extreme-rainfall downscaling by Approach Type-I. These two types of approach (i.e., Approach Type-II and Approach Type-III) can conserve the statistical characteristics (e.g., standard deviation and skewness) of observation data, which is a big challenge of many downscaling models. It is noted that Approach Type-II and Approach Type-III performed the extreme-rainfall downscaling better than Approach Type-I during the wet season.
5. Conclusions and Future Work
The current study proposes a statistical downscaling approach for improving daily extreme rainfall simulation at Shih-Men Reservoir catchment in northern Taiwan, which comprises rainfall-state classification and regression for rainfall-amount prediction. Three classification methods (i.e., LDA, RF, and SVC) were adopted for rainfall-state classification and the LS-SVR was used for the rainfall-amount prediction for different rainfall states. Two rainfall states (i.e., dry day and wet day) and three rainfall states (dry day, non-extreme-rainfall day, and extreme-rainfall day) were defined and compared for judging their downscaling performances.
Three types of approach (i.e., Approach Type-I, Approach Type-II and Approach Type-III) have been developed and tested for rainfall downscaling in the study area. Approach Type-I adopts two rainfall states for rainfall-state classification. Approach Type-II and Approach Type-III adopt three rainfall states for two-steps and one-step rainfall-state classification, respectively. The results reveal that RF outperforms LDA and SVC for the rainfall-state classification for all three types of approach. Approach Type-II and Approach Type-III, which use RF for three-rainfall-states classification and LS-SVR for rainfall-amount prediction, have better extreme rainfall simulation than Approach Type I. Future work can apply the two types of approach for the areas with more extreme-rainfall data to validate the performances for extreme-rainfall downscaling.
Adopting a proper threshold of daily extreme rainfall is essential for extreme/non-extreme-rainfall-day classification. The threshold of extreme rainfall strongly influences the rainfall-state classification performance. The current study adopted 50 mm/day as the threshold of extreme rainfall which is defined by the Central Weather Bureau of Taiwan. Using the thresholds less than 50 mm/day (i.e., 30 mm/day and 10 mm/day) for getting more extreme events (i.e., larger sample size) was also tested and had no improvement for rainfall-state classification in the study case. Therefore, using an inappropriate threshold of extreme rainfall may result in a failure of extreme rainfall classification. Selection of a proper threshold of extreme rainfall should be further investigated scientifically and carefully. The future work may apply the detrended fluctuation analysis (DFA) to choose an appropriate threshold of extreme rainfall for a catchment [
59].
The choice of a certain reanalysis dataset is often motivated by either ease of access (availability of the dataset at the institution), ease of use (availability of code to read it), or by the preference for the local provider [
60]. In the current study, the NCEP reanalysis data were used for ease of access and ease of use. The other available reanalysis data (e.g., European Centre for Medium-Range Weather Forecasts, ECMWF) with a much better spatial resolution data can be the alternatives for future work.