Next Article in Journal
Soil Nutrient Status and Morphometric Responses of Guava under Drip Irrigation and High-Tech Horticultural Techniques for Sustainable Farming
Previous Article in Journal
Streamflow Analysis in Data-Scarce Kabompo River Basin, Southern Africa, for the Potential of Small Hydropower Projects under Changing Climate
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction at Ungauged Catchments through Parameter Optimization and Uncertainty Estimation to Quantify the Regional Water Balance of the Ethiopian Rift Valley Lake Basin

1
Department of Water Resources and Irrigation Engineering, Institute of Technology, Hawassa University, Hawassa P.O. Box 05, Ethiopia
2
Institute of Groundwater Management, Dresden University of Technology, 01069 Dresden, Germany
3
Faculty of Environment and Natural Resources, University of Freiburg, 79098 Freiburg, Germany
*
Author to whom correspondence should be addressed.
Hydrology 2022, 9(8), 150; https://doi.org/10.3390/hydrology9080150
Submission received: 25 June 2022 / Revised: 12 August 2022 / Accepted: 14 August 2022 / Published: 19 August 2022

Abstract

:
Quantifying uncertainties in water resource prediction in data-scarce regions is essential for resource development. We use globally available datasets of precipitation and potential evapotranspiration for the regionalization of model parameters in the data-scarce regions of Ethiopia. A regional model was developed based on 14 gauged catchments. Three possible parameter sets were tested for regionalization: (1) the best calibration parameters, (2) the best validation parameter set derived from behavioral parameters during the validation period, and (3) the stable parameter sets. Weighted multiple linear regression was applied by assigning more weight to identifiable parameters, using a novel leave-one-out cross-validation technique for evaluation and uncertainty quantification. The regionalized parameter sets were applied to the remaining 35 ungauged catchments in the Ethiopian Rift Valley Lake Basin (RVLB) to provide regional water balance estimations. The monthly calibration of the gauged catchments resulted in Nash Sutcliffe Efficiencies (NSE) ranging from 0.53 to 0.86. The regionalization approach provides acceptable regional model performances with a median NSE of 0.63. The results showed that, other than the commonly used best-calibrated parameters, the stable parameter sets provide the most robust estimates of regionalized parameters. As this approach is model-independent and the input data used are available globally, it can be applied to any other data-scarce region.

1. Introduction

Global freshwater is particularly stressed by rapidly growing human populations and all the negative consequences of environmental change. Hydrological quantification of freshwater resources is crucial to managing and mitigating these impacts and thus promoting benefits from these resources. As such, there is a growing need for the accurate monitoring and simulation of water balance components to support and maximize water resource management practices. In Ethiopia, most of the freshwater lakes are located in the Great East African Rift Valley, which was formed by volcanic depressions and cracks during the Pliocene [1]. The region is known for its scarce and limited hydro-climatic data, which have limited regional understanding of water balance processes. In many developing regions of the world, hydro-climatic stations are not sufficiently established due to limited economic and technological development [2]. In the Ethiopian Rift Valley Lake Basin (RVLB), even the data from the few available gauging networks are of poor quality, contain gaps, and are subjected to human disturbances. Consequently, the region has remained one of the least studied in Ethiopia. Due to data scarcity, there is a lack of hydrologic simulations using available rainfall-runoff models in poorly managed catchments around the world. New approaches that use global datasets and quantify the uncertainties of regionalization would therefore be helpful for these data-scarce regions.
The Prediction for Ungauged Basins (PUB) initiative aims in particular at developing strategies for better understanding and reduced uncertainty in data-scarce regions [3,4]. Studies have started to focus on ungauged catchment predictions during the PUB decade [4,5] and in the post-PUB decade [6], including those comparing existing regionalization approaches for large sample studies [7]. Two general approaches have been used for predictions in ungauged basins: the first estimates model parameters from calibrated model parameters based on selected objective functions [8,9,10,11]; the second is a model-independent approach, which uses streamflow signatures to establish constraints that can describe the physical and climatic characteristics of watersheds [12]. The latter has been shown to reduce uncertainties that can emerge from the model structural error [13,14,15,16,17]. This approach has shown skill in predicting the expected streamflow in ungauged catchments, especially when going along with the quantification of uncertainties emerging from observed streamflow data [18]. However, recent work also showed that the information content of streamflow signatures is limited [19]. A recent study also analyzed the calibration of the relationship between model parameters and catchment properties, rather than regionalizing the best-calibrated parameter [20]. So far, however, most of these approaches have only been applied in data-rich regions of the world. The credibility and validity of such strategies have yet to be well tested in poorly recorded climatic data regions such as ours.
Rainfall-runoff models are used to represent the typical physical and climatic properties of a catchment. The physical representation of the available rainfall-runoff models may range from parsimonious–spatially lumped to complex physical–spatially distributed models. The common problem with most rainfall-runoff models is that they require some sort of parameter estimation to provide robust predictions. Most model parameters are not directly measurable or linkable to the physical properties of the given catchment because of model simplifications or disagreements between the model scale and the observation scale (incommensurability) [21,22,23]. However, an inherent correlation between the model parameters and catchment properties can often be assumed [5,7,24,25]. Model parameters represent the characteristics of the complex catchment system that are difficult to measure on a small scale. The accurate representation of catchment properties by model parameters should be evaluated, to some degree, by the selected objective function, which measures the fit between observed discharge and simulated discharge. However, most of the catchments are ungauged and their parameter estimations will be subject to uncertainties. Despite the aforementioned limitations in data-scarce regions, there are some studies that have developed strategies to derive model parameters at the global scale using various regionalization approaches [26,27,28]. In addition, different hydrological models have been employed for the regionalization studies [6]. Yet, the obtained simulations often show a lack of precision due to errors emerging from global input data quality and regionalization uncertainty. Uncertainties would be particularly propagated due to the regionalization method itself and the human interference in the catchments [28]. Li and Zhang [29] have also found huge uncertainty from the grid parameter regionalization for regions with rare gauging sites. Another approach estimates regionalized parameter sets on a global scale by using data from catchments distributed mainly in temperate regions [30]; their implementation and validity in the tropics remain questionable because their regionalization procedure does not include many catchments from this climatic region.
In this study, the applicability of high-resolution global climate data was tested for deriving parameters of the ungauged catchments through regionalization and demonstrating a novel leave-one-out spatial cross-validation procedure to quantify regionalization uncertainties for the region in the RVLB with scarce precipitation data. To reduce the regionalization uncertainties, the weighted least square regression was applied that accounts for the errors introduced by unidentifiable model parameters [31]. This brings more advantages to the regression model by increasing the representation of catchments with identifiable parameters. Other than the typical approach of using the best-calibrated parameters of the gauged catchments, e.g., Wagener and Wheater [10]; Singh et al. [32]; Lane et al. [33], the idea of using multiple similar parameter sets for regionalization was extended. Previous studies already considered multiple similar parameter sets for regionalization [34] and calibrated the relationship between the model parameters and catchment properties [20]. However, to our knowledge, the differences between using the best-calibrated parameter, the best parameter set in the validation period, and the most stable parameter set considering their performance in the calibration and validation period have not yet been explored. A spatial split-sample test was used to evaluate the best-calibrated parameter, the best parameter set in the validation period, and the most stable parameter set considering their performance in the calibration and validation period for their adequateness for regionalization. We presented this new approach to quantify uncertainty in regional model development that can be adapted for data-scared regions. We have also demonstrated the applicability of global climate datasets for deriving the regional model. Rather than using a common best-calibrated parameter set to derive the regional model, we compared three different approaches to find the best regional parameter sets. Although our approach uses a low number of catchments, we demonstrate acceptable regional simulation performance. Thus, the objective of this study is to identify the most reliable regional model that can be derived from the three different best parameter sets. In addition, the study hypothesized that a parameter set that is stable over time (based on its performance during calibration and validation) could lead to a more stable spatial extrapolation of the parameters among the three approaches for deriving the best parameter sets. Due to the limited number of observations of streamflow data, this approach is implemented to a relatively low number of 14 gauged catchments with reliable streamflow data to estimate the water balances of 35 ungauged catchments in the RVLB. Repeating the leave-one-out spatial split-sample test multiple times, the uncertainty was quantified that goes along when regionalizing parameter sets from a low number of catchments. That way, this study provides useful directions for regional modeling and uncertainty quantification in under-represented and data-scarce regions such as the RVLB, where assessments of the impacts of climate variability and climate changes are most urgently needed. The remainder of the paper is outlined as follows: A description of the study area and the datasets used in Section 2; A model description and the overall parameter estimation and evaluation procedure in Section 3; A comparison of regionalization results in Section 4; Discussion in Section 5; The summary of conclusions in Section 6.

2. The Study Area

The RVLB is located in the southern part of the Main Ethiopian Rift Valley and covers an area of 53,035 km2, providing water supply to a population of more than 15 million people who live mainly on subsistence agriculture (Figure 1). The RVLB is 84 km wide, and adjacent to it there are large, discontinuous Miocene-aged normal faults [35,36]. Within the Main Ethiopian Rift, there are a series of right-stepping, quaternary rift basins, which have faulted magmatic segments, extending about 20 km wide and 60 km long, that are embryonic oceanic spreading centers. The central part of the RVLB is formed by a Pliocene-aged, faulted caldera, caused by a fractured volcano [1]. Existing faults and repeated ground cracks at the floor of the caldera have increased the permeability of the rock. Within the basin, several small-to-medium-sized catchments drain into eight freshwater lakes. For most of these catchments, there is a lack of hydro-climatic data; what data are available contain gaps and are subject to human interventions. In recent decades, the RVLB has experienced major droughts and extreme flooding due to rainfall variability, making prediction difficult [37]. This has resulted in an uncertain analysis of high and low flows, factors that are important for quantifying the hydrological water balance components in this region.
The climate is characterized by the range from semi-humid to semi-arid subtropical climate in the central and southern part of RVLB with a mean annual temperature of 20 °C [38]. In the same region, the long-term analysis of annual average precipitation shows a variation in the range of 951 mm to 1653 mm, and such variations are attributed to the topographic differences in the region [38]. However, catchments in the northern part of RVLB are characterized as sub-humid and humid climates, with average annual precipitation of 1050 mm and temperatures of around 15 °C [39,40].

Data and Catchment Properties

The regionalization procedure used precipitation products of Multi-Source Weighted-Ensemble Precipitation (MSWEP) version 2 and evapotranspiration from the Global Land Evaporation Amsterdam Model (GLEAM V3) (Table 1). MSWEP products have recently developed precipitation datasets at a finer scale (0.1°), which are constructed using different sets of precipitation data from gauges, monthly satellite data, and re-analysis data on the global scale. The global MSWEP data have been evaluated and showed better performance for data-scarce regions including Ethiopia [41]. The data product has been widely applied for the evaluation of gridded satellite products and regionalization studies in data-scarce regions [42]. For the discharge data due to monitoring and organization issues, high-quality streamflow data are only available from 1995 to 2007. The data for the recent period (2008–2015) are mostly not available for the user, and the available parts are of poor quality. We, therefore, collect the daily streamflow data for the period from 1995 to 2007 from the Ministry of Water Irrigation and Energy of Ethiopia (MOWIE) for 14 catchments in the RVLB. The 14 catchments provide a sufficient length of data (>10 years) for these simulation periods. The area of 14 catchments ranges from 144.2 to 4528.2 km2.
Table 1. Variables showing the climatic and physiographic data and their resolutions and periods.
Table 1. Variables showing the climatic and physiographic data and their resolutions and periods.
VariableSpatial ResolutionTime PeriodTemporal ResolutionSourceReference
Precipitation0.1°1995–2007DailyMSWEP V2Beck et al. [41]
Potential evapotranspiration0.25°1995–2007DailyGLEAM v3Martens et al. [43]
HBV-parameters0.5°--www.gloh2o.org (access date 12 January 2020)Beck et al. [30]
Elevation30 m--SRMT V2.1https://earthexplorer.usgs.gov (access date 28 January 2020)
Wetness index (P/PE)Point scale1995–2007DailyMSWEP V2 and GLEAM v3Beck et al. [41]; Martens et al. [43]
Streamflow Pont scale 1995–2007 Daily MOWIE -
Catchment properties are common descriptors of the hydrological process and are frequently applied to estimate model parameters in ungauged catchments. However, there is no general rule to select suitable properties. The available option should be a selection of as many catchment descriptors as possible while reducing correlated catchment properties so as to exclude redundant information and obtain independent variables. In this regard, the main criteria to select catchment properties is that they are model-independent and can be used for model predictions to be hydrologically realistic in both gauged and ungauged catchments [12]. For reliable regionalization, a sufficient number of catchment properties should be selected. Their selection, in turn, depends on data availability, hydrologic relevance, and the suitability of the properties. With these considerations, nine catchment properties were derived from the physical and/or climatic information in both gauged and ungauged catchments from the RVLB. Due to the lack of local land cover and soil property data, the catchment properties that need to be derived from those data are not included in the analysis. Physical catchment properties such as drainage area, drainage density, mean catchment slope, mean elevation, and catchment index were extracted from the Digital Elevation Model (DEM). Climatic properties of potential evapotranspiration and precipitation that usually affect the rainfall-runoff process were extracted from the global data sets (Table 2 and Table S2). Other physical properties where local information is not available, such as permeability and porosity, were extracted from the global datasets prepared by Huscroft et al. [44]. These dominant catchment properties are also widely applied in the previous regionalization studies using the HBV model structure [7,45,46].
Each parameter in the HBV model has a certain physical interpretation that can be linked with the selected catchment properties. Previous study has also shown that a sufficiently large correlation coefficient between catchment property and model parameter can be a good indicator of the predictive power of the selected catchment properties, provided there is no collinearity between them [47]. Considering this, nine catchment properties were carefully selected that will be linked with the physical interpretation of the calibrated HBV model parameters (see Section 3.1). By undertaking this, catchment properties that have redundant information and may not be suitable for regionalization were also removed.

3. Methods

The concept of the regionalization procedure is shown in Figure 2. Global climatic forcings of precipitation and potential evapotranspiration were applied for parameter estimation in the gauged catchments using the HBV hydrologic model. The precipitation products of the Multi-Source Weighted-Ensemble Precipitation (MSWEP) version 2 and evapotranspiration of the Global Land Evaporation Amsterdam Model (GLEAM V3) were used for the study. Parameter sets that perform best during calibration and validation and ones that are stable between calibration and validation (stable parameter sets) were derived. Using the best parameter sets from calibration, a correlation analysis was conducted with the physical and climatic properties of gauged catchments, which forms the basis for the regression. By applying the best parameters from the calibration, validation, and stable sets, three regression models were derived in the leave-one-out procedure and the one that performed best was selected. The parameters of the ungauged catchment were estimated by applying the ungauged catchment properties to the regression models (Figure 2). In addition, the regionalization uncertainty was quantified through the spatial cross-validation procedure that applies the leave-one-out method (see Section 3.4). Consequently, the parameters derived from the regression were evaluated by comparing the discharge observations and simulations of the gauged catchments on their calibration (1995–2002) and validation periods (2003–2007). Comparing their performances during the calibration and validation period enables the selection of the best regional models derived from the parameters estimated from calibration, validation, and stable sets.

3.1. Hydrological Model

In this study, a lumped HBV hydrologic model was used [48,49], which has been applied in a wide range of climatic and physiographic conditions [50,51]. The HBV model has been tested in various parts of the world and frequently applied in several regionalization studies due to the simplicity and flexibility of its model structure [24,25,45,52,53,54,55].
This study applied the HBV model modified by Zhang and Lindström [51] and Beck et al. [30]. The model was run at a daily time scale using daily inputs of precipitation and potential evapotranspiration. The lumped HBV model uses effective/average precipitation in the catchment domain as input. Since the aim of this study is to explore the possibility of using global precipitation products in data-scare regions, the gridded MSWEP V2 precipitation data were used. The data have been evaluated with separate gauge-radar data and have undergone bias correction so that the influence of elevation was indirectly considered while producing the gridded precipitation data. Due to the absence of snow processes in the RVLB, the model consisted of routines of soil moisture accounting, runoff response, and a channel routing procedure, which are controlled by nine model parameters (Table 3). Three calibrated parameters, β, FC, and LP, control the soil moisture dynamics. β controls the contribution (dQ) to the runoff response routing and the increase (dP-dQ) in soil moisture storage (Ssm) and FC is the maximum soil moisture storage in the model as shown by Equation (1). LP is the value of the soil moisture above which evapotranspiration (Ea) reaches its potential level (Ep). The actual evaporation from the soil moisture zone equals the potential evaporation if Ssm/FC is above LP * FC as shown by Equation (2).
d Q d P = ( S s m F C ) β
where dP and dQ are precipitation and runoff [mm d−1].
E a = E P . m i n ( S s m F C . L P , 1 )
The runoff response function transforms excess water from the soil. This routine consists of upper and lower reservoirs that distribute the generated runoff over time. The lower reservoir is a simple linear reservoir representing a contribution to baseflow. It also includes the effects of direct precipitation and evaporation over open water bodies in the basin. The lower reservoir storage, SLZ [mm], is filled by percolation from the upper reservoir (PMAX), and the outflow from this lower reservoir (Q2) is controlled by the recession coefficient K2 [d−1]. However, the upper reservoir storage SUZ [mm] is drained by two recession coefficients, K0 [d−1] and K1 [d−1], draining the quick flow Q0 [mm d−1] and slow flow component Q1 [mm d−1] separated by a threshold VUZL [mm] (Equations (3)–(5)).
Q 0 = K 0 S U Z V U Z L
Q 1 = K 1 S U Z
Q 2 = K 2 S L Z
If the yield dQ [mm d−1] from the soil moisture routine exceeds the capacity, the upper reservoir will start to fill. This reservoir models the response during flood periods. The parameters calibrated from the runoff response function are PMAX, K0, K1, K2, and VUZL. Finally, the runoff is computed independently for each sub-basin by adding the contributions from the upper and the lower reservoir. To account for the damping of the runoff pulse in the river before reaching the basin outlet, a simple routing transformation is performed. This filter has a triangular distribution of weights with the base length and is expressed by the parameter MMAXBAS [d]. A detailed description of the model is shown by Bergstrom [56] and Seibert and Vis [49]. The ranges of the nine model parameters are derived from prior knowledge, provided through a global set of HBV model parameters [30].

3.2. Parameter Estimation in the Gauged Catchments

The study used a uniform random sampling approach to produce simulated streamflow ensembles with the HBV model. To obtain reasonable parameter sets, 20,000 combinations of the nine HBV parameters were generated from a uniform random Monte-Carlo sampling procedure. There is a possibility to use other parameter optimization options. However, the aim of this study is not only to obtain the best parameter but also a set of large behavioral parameters that are later used to derive weights in the weighted regression procedure using the inverse of their coefficient of variation. The random sampling procedure has been effectively used in obtaining behavioral parameters and deriving parameter uncertainties for the previous regionalization studies [10,25]. Other global optimization procedures such as Dynamically Dimensioned Search and Shuffled Complex Evolution can find the best parameter sets [57]; however, the purpose of this study is to find a range of suitable behavioral parameters in addition to the best parameters. The choice of a uniform random sample is relatively computationally efficient and easy to implement. Therefore, the Monte Carlo method was selected to produce a large number of suitable behavioral parameters from the reduced parameter samples using a minimum 0.5 NSE threshold.
A split sample test [58] is used by splitting the simulation period into the calibration period (1995–2002) and the validation period (2003–2007) and calculating the Nash Sutcliffe Efficiency (NSE) [59] for both periods (Equation (6)):
N S E = 1 i = 1 n   Q o b s Q s i m 2 i = 1 n   Q o b s Q ¯ 0 b s 2
where Qobs and Qsim are monthly averages of observed and simulated discharges [m³ month−1], respectively, while Q ¯ 0 b s is the mean observed discharge over the calibration or validation periods. Using monthly averages, it is focused on the seasonal, long-term behavior instead of daily, short-term fluctuations. In order to remove unrealistic parameter combinations, only parameter sets that produced NSE ≥ 0.5 in the calibration period were kept. Consequently, different catchments can result in a different number of behavioral parameter sets. A pre-analysis using NSE on a monthly time scale showed that the model performs well for all of the catchments. Comparing the mean and variability of model performance for the remaining catchments during calibration and validation allows to assess the predictive performance and uncertainty of the selected parameter sets. To prepare for regionalization, (1) the variability of each model parameter in the reduced parameter sample after NSE ≥ 0.5 (expressed by their coefficient of variation, CV), and (2) the best parameter set (largest NSE) of the calibration, NSECAL, and the validation period, NSEVAL, for each of the catchments were extracted. We derived the behavioral parameter ranges (parRANGE) during calibration with NSE ≥ 0.5. Using the behavioral parameter sets, we ran the model for the validation period (2003–2007), and we selected the best validation parameter sets among the behavioral parameters. Among the behavioral parameter sets, there may be a best parameter set that performs differently from the best-calibrated parameter sets in some catchments. In addition, the most stable parameter set for each catchment was identified, i.e., the parameter set that showed the smallest difference in NSE values between calibration and validation periods, NSEDIFF. Again, it is considered that the parameter set with the highest NSE is selected. In this procedure, after sorting the NSEDIFF in ascending order, only 5% of the parameters with the lowest NSEDIFF were extracted, from which the parameter set with the highest NSE was selected. In the selection of the stable parameter sets, parameter sets of the calibration period with the highest NSE that show a minimum difference with the NSE of the validation period were preferred.

3.3. Parameter Estimation in the Ungauged Catchments

To estimate model parameters for the 35 ungauged catchments, a parameter regionalization procedure was developed using weighted linear regressions (Equations (7)–(9)), also known as weighted least squares. This will allow linking the catchment properties (Table 2) and the estimated model parameters of the gauged catchments (see Section 3.2). Compared to the ordinary least squares, the weighted linear regression introduces a weight matrix to account for the unequal variances of observations [10,31]. This brings advantages in regionalization since the identifiability of a model parameter can vary significantly between catchments. To obtain a robust regression model, more weights can be put to the catchments with identifiable parameters. The weighted linear regression is described as follows:
Y = X β + ε
W = w 1 0 0 0 w 2 0 0 0 0 w n
β ^ = a r g m i n β i = 1 n ε i 2 = X T W X 1 X T W Y
where Y and X are, respectively, the response variable (estimated model parameters in this study) and the independent variable (catchment properties in this study). ε is the error vector and W is a diagonal matrix containing weights. β represents the regression coefficients vector and is estimated by β ^ , which minimizes the weighted sum of errors. The coefficient of variation (CV), which is the standard deviation divided by the mean of behavioral parameter sets of a catchment, represents the variability of a parameter. The smaller the CV, the less variable and more identifiable the parameter. Therefore, the reciprocal CV of the parameter of interest was used as weights for each catchment. Normally the weights in a weighted least squares regression are 1/variance. However, using 1/CV to calculate the weights brings the advantages of comparing the weights of a catchment for different parameters, because using 1/CV removes the influence of magnitude and units of a parameter. Additionally, using a weight 1/CV would result in a similar result as 1/variance since a parameter introducing a constant (the mean) into the regression will not change the relative weights to each catchment.
A correlation analysis was performed between the model parameters and catchment properties to select the independent variables. With this regard, the linear correlation, Spearman’s rank correlation [60], and the correlation on the log-transformed scale were applied. For the weighted linear regression, the catchment properties with the strongest correlation with the model parameter were chosen as the independent variable as shown in Table S1 (Supplementary Material). The normal scale linear regression model was selected over the log-transformed ones due to the better performance of the normal correlation than the log-transformed correlation. There is a possibility to do linear regression on the log-transformed scale; however, the correlation coefficients are superior on the normal scale to the log-transformed ones which can provide a better regression model on the normal scale. Choosing a log transformation reduces the weight of extreme values in predictor and response variables for large sample studies [15]; however, this may not be the case for the low catchment numbers. To increase the representation of more identifiable catchments, a weighted regression was applied on the normal scale. Parameters poorly correlated with the catchment property and remain unidentifiable throughout the catchments are rejected from the regression model, and their median value was considered as the best model instead.

3.4. Evaluation and Uncertainty Estimation of the Regionalization Procedure

A leave-one-out cross-validation method [61] was used for parameter estimation and for evaluating the prediction skill of the regressions. Leave-one-out is a simple cross-validation procedure: each regression model is created by taking all the catchments except one, the evaluation catchment. For large sample training data, this method is more resilient to irreducible errors [62]. Recently, this technique is applied for regionalization studies involving the prediction of discharge signatures [15,18,63] and model parameters in the ungauged catchments [7,8,64]. Thus, for 14 catchments, there are 14 different regression models and 14 different evaluation catchments. For each of the 14 iterations, the method produces one regionalized model parameter set for the left-out catchment using the regression model derived from the remaining 13 catchments and parameter sets. Since it is not known which parameter sets provide the most stable regionalization method, the procedure was repeated three times using the best parameter sets of (1) the calibration period, (2) the validation period, and (3) the most stable parameter sets between the calibration and validation periods. Thus, the quality of the regionalization procedure was evaluated three times. In order to choose the best of them for the following analyses, the simulations of the left-out catchments were evaluated for both the calibration (1995–2002) and the validation (2003–2007) period. In order to quantify the uncertainty of the regionalization procedure, all the 14 regionalized models were applied that were created for the leave-one-out evaluation to the ungauged catchments. Hence, an ensemble of 14 predicted streamflow time series is produced for each ungauged catchment to reflect the regionalization uncertainty.

3.5. Estimation of Regional Resilience of Streamflow to Precipitation Variability

In addition to the simulations at the 14 gauged catchments (and their respective uncertainty), the regionalization procedure produces simulations, and uncertainty estimates, of the 35 ungauged catchments, altogether covering a majority of the RVLB area (60.81%). The regional simulation tool was used to estimate the region’s streamflow resilience to precipitation variability, which is quantified through streamflow elasticity [65]. The elasticity of streamflow quantifies the sensitivity of a catchment’s streamflow response to the precipitation changes at the annual scale. Resilient catchments show low streamflow variability in response to precipitation changes. Elasticity is defined as the ratio between the change in the annual aggregated discharge (dQ) and the change in the annual aggregated precipitation (dP) shown by Equation (10):
ε P = m e d i a n d Q P d P Q
The elasticity values were calculated for each year using the entire ensemble of simulated daily discharge time series at all gauged and ungauged catchments. From this, the median, wettest, and driest year elasticities were extracted for each catchment, including their uncertainty expressed by the respective CV resulting from all parameter sets with NSE ≥ 0.5 for the gauged catchments and the 14 parameter sets obtained through the leave-one-out cross-validation procedure for the ungauged catchments. The wettest year elasticity is calculated for the transition from the normal year to the wettest year, and the driest year elasticity is calculated for the transition from the normal year to the driest year. Using the median, wettest, and driest year elasticities, it is possible to learn how resilient a hydrological system is against extreme climatic conditions such that one can evaluate the corresponding security of the water supply. For instance, if a catchment has a large streamflow elasticity, there will be a large system change in response to a big change in precipitation (e.g., the driest year); such a system is not resilient as there would be a huge change in the water available, thus affecting the security of the water supply.

4. Results

4.1. Estimated Parameters in the Gauged Catchments

The model calibration was performed using a split-sample test for the period 1995–2002, with a threshold of 0.5 NSE used to determine the behavioral parameter sets. Consequently, the model was run using the behavioral parameter sets for the validation period (2003–2007). Using the 0.5 NSE threshold, a wide range of behavioral parameter sets were obtained in all gauged catchments during calibration. The threshold-based separation of behavioral and non-behavioral parameter sets was used as a threshold-based approach, which helps to explicitly state under which minimum performance requirements, i.e., NSE ≥ 0.5, regionalization by the CV-weighted regression was conducted. Figure 3 shows the monthly NSE values for the calibration and validation periods, as well as their uncertainty, using behavioral parameter sets. For the monthly calibration, the model performance results in NSE 0.53 to 0.86 in the 14 gauged catchments. The maximum NSE values were highly reduced from the calibration to the validation periods for catchments #07 and #09. Catchments #01 and #03 show a small decrease in the monthly NSE values from the calibration to the validation period (Figure 3). In most cases, the monthly model calibration and validation show a uniform distribution of NSE values, indicating the stability of model parameters. The NSE values for the stable parameter set show the maximum value (0.80) for catchments #08 and #13, demonstrating good predictive skills in these catchments. Catchments #04 and #06 show lower parameter stability. The monthly NSE of the validation period was the highest for most catchments compared to the calibration period. This comes from the possibility to choose the best parameter from the confined parameter ranges using a threshold of 0.5 NSE throughout the catchments. The standard deviation of the monthly NSE values shows the ranges of uncertainty provided by the simulation for the selected parameter ranges (Table S3). In this regard, catchments #04, #05, and #06 result in the lowest simulation uncertainty during calibration.
In addition to the uncertainties, Figure 4 shows the identifiability of the model parameters in the 14 catchments using the Cumulative Distribution Function (CDF). More deviation from the 1-1 line shows higher identifiability by the parameters. The catchments with highly identified parameters are shown by the dark gray color on the CDF. Figure S1 shows the variability of the confined model parameters obtained by the calibration, which is later applied to derive the weights of the regional regression model. The parameters that are well-identified are highly represented in the regional model by their weight (1/CV). In this regard, parameters such as β, FC, K2, and LP were well identified in most of the catchments. Parameter β was highly identified towards the lower values as shown by the dark gray color (Figure 4) or by the narrow parameter range in catchment #6 (Figure S1). FC shows identifiability towards higher values in their parameter range for catchment #2. On the other hand, K2 showed higher sensitivity towards the lower values for catchment #7. The confined ranges of model parameters (K0, and VUZL) result in a relatively uniform distribution in their median values for all catchments. These parameters (K0 and VUZL) remained insensitive and were later excluded from the regression procedure due to less information contained in their parameter identifiability. Taking median values for unidentifiable parameters would be a better model for the regional model [10] (Figure 4, Figures S2 and S3).

4.2. Performance of the Regionalization Procedure

The transferability of the model parameters derived from weighted regression was evaluated using a leave-one-out cross-validation technique for each of the 14 gauged catchments. Figure 5 shows the results of evaluations for the three regionalized models on both calibration (1995–2002) and validation periods (2003–2007). Figure 5a,b shows the relationships between the performances of the monthly NSE of the best parameters estimated from calibration, validation, and the most stable sets and the corresponding regionalized parameter while evaluated on the calibration and validation periods. The regional parameter sets derived from the most stable parameters show the best performance in predicting discharge in the left-out catchments (NSE REGstable in Figure 5c) while evaluated during the validation period. The results show that 13 out of 14 catchments exceed a 0.25 NSE value and 10 out of 14 catchments exceed a 0.5 NSE value for the regression model derived from the most stable parameters during evaluation in the calibration period (1995–2002) (Figure 5a). Furthermore, the median NSE value of the 14 catchments for the regression model derived from the most stable parameters (NSE REGstable) is 0.63, compared to 0.50 for the best-calibration ones (NSE REGcal) and 0.57 for the best-validated parameter sets (NSE REGval) (Figure 5c) for the evaluation during the validation period (2003–2007). To assess the influence of low flow and high flow scales on the regionalization performance, the NSE was calculated on the Log transformed scale (logNSE) of observed and simulated discharge. Table S4 shows the logNSE values during the evaluation of the regression model on both the calibration and validation periods. The median NSE values were not improved on the log scale compared to the normal scale evaluation (Table S4).
Figure 6 shows the relationships between the best model parameter set obtained from the most stable parameter and the regression for the 14 catchments. However, these results should be carefully interpreted for data-scarce regions due to the use of available global data sets (precipitation and potential evapotranspiration). Parameters K0 and VUZL were unidentifiable during calibration, and their median values were taken for the regionalized model. The weighted regression shows acceptable performance in reproducing most of the parameters. For instance, the weighted regression reproduces well the parameters β, FC, and K2, whereas the remaining parameters are less reproducible by the regression model. Table S5 shows the performance of the weighted regression model during the leave-one-out cross-validation using a coefficient of determination (R2). It is seen that the weighted regression procedure does not always produce model parameters in their predefined range (Table 3). For example, the regressed FC of catchment #6 is above the maximum threshold of 700. In such cases, the outlier model parameter is assigned to its maximum value. Among the remaining parameters, most of them show acceptable correlations. However, some of them, such as K1 and PMAX, are poorly reflected through the regional regression in a few catchments. For instance, K1 in catchment #11 is poorly modeled. Therefore, parameter K1 is not identified in catchment #11. The regression model using parameter PMAX is poorly represented in four catchments (#09, #10, #13, and #14). This shows that for catchments #09, #10, #13, and #14, the parameter PMAX is not identifiable for the carefully selected catchment properties. Furthermore, for catchment #09, model parameters FC, and PMAX were poorly identified. However, the weighted regression procedure sufficiently represents the parameters in the remaining catchments. The best performing (#08) and most poorly performing (#09) catchments were also shown by the black and red colors, respectively. Catchment #08 primarily shows a stable prediction for all parameter-sampling procedures during calibration, validation, and stable relationships. Identifiable parameters in this catchment are also reproduced well from the regression model, whereas in catchment #09 (red scatter), most parameters were poorly identified and poorly reproduced by the regression model.
The reliability of this approach is further evaluated by comparing the observed discharge with the uncertainty interval of the regionalized model while the model is running for the validation period (Figure 7). The observed discharge for the best-performing catchment (#08) is enveloped by the prediction interval during the low-flow and high-flow periods. Furthermore, it corresponds highly with the mean of the 14 regionalized simulations (Figure 7a), whereas the observed discharge for the most poorly performing catchment (#09) is not well captured by the prediction interval nor by the mean of the 14 regionalized simulations (Figure 7b).

4.3. Estimation of Regional Resilience of Streamflow to Precipitation Variability

With the acceptable performance of the regional regression model, it has been applied for the ungauged catchments over the entire RVLB region. Figure 8 shows the median, wettest, and driest year elasticities computed from both gauged (all parameter combinations with NSE ≥ 0.5) and ungauged catchments (all 14 regionalized parameter sets from the leave-one-out cross-validation). The median elasticity values are fairly distributed in the basin. However, the highest elasticity is shown in the western part of RVLB (Figure 8a). For the wettest year, the highest elasticity results from the catchments in the south and the northeast show less resilience to extreme precipitation (Figure 8b). Relatively low elasticities result from the driest year, except for a few catchments in the south. This variation is because catchments in the south, except for some outliers, are mainly dry and receive a comparatively low amount of precipitation.
The uncertainty of the 14 ensembles derived from the regression was shown using the CV. It is seen that the median values, as well as the wettest and driest year elasticities, show low uncertainty in the gauged catchments, indicated by low CVs (Figure 8d–f) compared to the ungauged regions, whereas the highest CV values are shown for the median, wettest, and driest years for the ungauged catchments in the southern region. However, most of the catchments in the central and northern parts show less variability, which shows low uncertainties for these catchments. The highest and lowest precipitation will eventually result in different values of elasticities. The differences in the catchment properties would result in the variability of the CV of the wettest and driest year for the neighboring catchments (Figure 8e,f). In areas such as the RVLB, the highest yearly and seasonal precipitation amounts could be interrupted by the seasonal or yearly dry spells [37]. The response to runoff is low in the driest year, as shown by the low elasticity values in most of the ungauged catchments (Figure 8c). Comparatively, the driest year CV in the southern part shows relatively lower values than the wettest year.

5. Discussion

5.1. Reliability of the Regionalization Approach

The study demonstrates the applicability of a process-based hydrological model at a regional scale despite data scarcity. The reliability of the entire approach is shown through a three-step parameter estimation and model evaluation procedure, which enables one to identify the most reliable setting in the regional model development procedure. Overall, the stability of the parameters for the calibration and validation periods remained acceptable. Uncertainty was quantified by parameters sampled from the Monte Carlo random sampling procedure. The split sample test was applied to select the best parameter from the calibration and validation periods. Furthermore, the most stable parameter sets were calculated from the calibration and validation periods, which have also been performed in previous studies aiming at stable predictions [66]. The temporally stable model parameters indicate a relatively uniform system response during both calibration and validation and provide strong justification for the regional extrapolation of model parameters.
Parameters were estimated for the ungauged regions through the weighted regression procedure. Like previous studies using a similar approach [10], the weighting procedure increases the representation of identifiable parameters from the catchment in the regression model. More weight (reciprocal of CV) is assigned to the more identifiable parameters in a given catchment. Consequently, three regional models were derived using the best parameter sets obtained from the calibration, validation, and most stable parameter sets between calibration and validation for the 14 catchments. Figure 6 presents the nine parameters reproduced by using the most stable parameters, while the remaining two regression models (using best calibration and validation) are shown in Figure S2 and Figure S3, respectively. Comparing the performance (NSE) of these three regression models (Figure 5c), a regional model derived from using the most stable parameter has shown superiority over the other two. These procedures increase the reliability of the approach for the model development. Uncertainty was also quantified through the leave-one-out spatial cross-validation of parameters. In this approach, every catchment is considered as an evaluation catchment, while a prediction is formed using the remaining catchments. Therefore, the 14 catchments produced a 14-regression model in the leave-one-out cross-validation method that quantifies prediction uncertainty in the ungauged catchments. The leave-one-out spatial cross-validations also show better performance in the regionalization studies that use discharge signatures [15,18] and for ungauged catchments parameter prediction [7,8,65]. This method is more stable and more resilient to irreducible errors for large sample studies [63]. The leave-one-out cross-validation approach also has a uniformly low bias and root mean squared error (RMSE) [62]. Therefore, the approach combines three steps of uncertainty quantification from (1) the parameter sampling, (2) the best parameter set identification, and (3) the leave-one-out cross-validation in the data-scarce regions.
A scatter plot of monthly NSE values between the parameters estimated (from the model calibration, validation, and stable sets) and the parameters regionalized from the regression equations shows the evaluation and reliability check. Figure 5b indicates that the median NSE of regression from the most stable parameters outperforms the NSE of parameters obtained from calibration and validation for leave-one-out cross-validation. Furthermore, it shows most of the scatter points are above the 0.5 NSE limit, indicating a better reproduction of parameters while evaluated during the calibration period (1995–2002). In this regard, the parameters in 71% of the catchments result in an NSE of ≥ 0.5, and in 93% of the catchments, an NSE of > 0.25, for the regression model derived from the most stable parameters. The median value of the NSE for the 14 catchments is 0.63 during evaluation during the validation period (2003–2007); this is a sufficient performance in regionalization despite the few catchment numbers applied (Figure 5c). Studies showed difficulties in the regionalization performances for both the model parameters and discharge signatures [6,15,30,67]. With few training datasets used, the regionalization performances are acceptable which are derived from the monthly calibrated and validated parameters. The study focused on the challenges in regionalization in data-limited conditions showing the applicability of global forcing data for the regionalization of data-scarce regions considering the resulting uncertainties. This approach provides a basis for regional model estimation and uncertainty quantification for low catchment numbers in the data-scarce regions. The regionalization of the model parameters with the use of global data products in the data-scarce region can be expected to result in considerable uncertainty. However, this approach demonstrated that an acceptable median NSE value can be obtained despite low catchment numbers. In order to check the influence of low and high flow scales on the interpretation of the NSE, a more balanced logNSE to the regionalization performance was applied. Zhang and Post [68] show the influence of log transformation on the low-flow and high-flow scales. Table S4 shows the evaluation of the regression model on both the calibration and validation periods for the log-transformed observed and regionalized discharge. In this regard, the median NSE values have not improved the results any more than the normal scale NSE evaluation.
Figure 6 shows that more identifiable parameters for the gauged catchments (Figure 4) are also reproduced well from the regression model. The parameters of catchment #08 (black scatter) are highly identifiable during parameter estimation and the resulting parameters from the regression model are highly reproduced, whereas parameters in catchment #09 (red scatter) were poorly estimated and poorly reproduced by the regression model. Studies also show the selection of a more identifiable catchment as a donor for parameter regionalization depends on the score during the parameter calibration and validation [30].
In general, the evaluation indicates that despite ignoring parameter interactions [23,45,69], the regionalization procedure produces useful predictions of the model parameters in the ungauged catchments. Plotting the simulated (monthly) time series of the best and the most poorly performing regional catchments (Figure 7), the study showed that the uncertainty estimates derived from the leave-one-out cross-validation procedure capture well the simulation uncertainty for the best-performing catchment #08. On the other hand, the prediction interval of catchment #09 shows less agreement with observed discharge, particularly for low flow and high flow, but with the estimated ranges of uncertainty much closer to the observations. Previous regionalization studies have already shown that some outlier catchments cannot be captured if they behave very differently from the general trend [25], as is most probably the case for catchment #09. However, since most regionalized catchments are with an NSE of ≥ 0.5, there is sufficient reason to believe that the parameters of most ungauged catchments in the RVLB can be approximated acceptably.

5.2. Parameter Sensitivity and Spatial Variability

It is found that the model parameters have different sensitivities in the gauged catchments (Figure 4). Since the hydrogeology of the region is very heterogeneous, the parameters controlling the underground water flow show high variabilities among the catchments. Volume-controlling parameters (β, FC, and LP) and the recession coefficient in the lower reservoir (K2) are readily identifiable in most catchments. However, the insensitivity of the parameters in some catchments may be due to interactions with other parameters. Abebe et al. [70] show the interaction between parameter K2 and the percolation rate (PMAX), where the increment of K2 beyond the optimum rate of percolation may not show any sensitivity. In addition, the insensitivity of model parameters is related to the poor identifiability of model parameters in the catchments. For instance, parameters such as K0 and VUZL were poorly identified and remained insensitive for any parameter value throughout the catchment (Figure 4).
The most identifiable parameters in a catchment result in the highest sensitivity towards any parameter value. However, parameter insensitivity is due to variations in the values of catchment properties. This is illustrated by parameter K2, which is a recession coefficient in the lower reservoir, and it is well identified in catchments #07 towards the lower parameter value. Catchment #07 is characterized by sloppy catchment with a high drainage channel slope (catchment index) (Table 2) which facilitates the drainage of available water by surface runoff. Thus, for the little remaining water in the lower reservoir, lower values of K2 will become more sensitive. Parameter K1, which is a recession coefficient from the upper reservoir, is well identified in catchment #05. A lower value of the recession coefficient results in a relatively low drainage density in catchment #05 (Table 2), which facilitates a relatively higher percolation (PMAX) in the catchment.
The interactions among model parameters may not be the only reason for insensitivity as it varies in different catchments and their properties. For instance, sloppy catchments in a small drainage area (#07 and #09) facilitate the conversion of precipitation into runoff, resulting in less soil moisture in the upper reservoirs in the HBV model. In such cases, adjustment to a parameter (K1) controlling the water flow might not affect the outflow conditions. This is also shown by the negative correlation of the slope with K1 (Table S1). Furthermore, the insensitivity of the parameters in the upper reservoir can be affected by low precipitation amounts. In low precipitation conditions (such as in catchments #08, #10, and #13) the resulting soil moisture from the soil profile and the upper reservoir will be much less, resulting in less runoff. The adjustment of the runoff-controlling parameters (K0 and VUZL) might not have any influence (remain insensitive) on the resulting runoff. Parameter K0 only functions when the cumulative precipitation exceeds the threshold of the VUZL value. Other than climatic properties, the insensitivity of the parameters can result from the interaction of the parameters, and this influence will be more pronounced for complex models. Other studies also showed that the insensitivities of the model parameters could result from the mismatch between the model complexity and the available data used to parameterize the model [71,72,73,74].

5.3. Estimation of Regional Resilience of Streamflow to Precipitation Variability

In this study, elasticity values for the median, wettest, and driest years and their respective CVs were calculated (Figure 8). In the regions with higher elasticities (Figure 8a–c), the catchments respond faster to any change in precipitation promoting fast flow components. However, for low elasticity values, the streamflow responds slowly to precipitation change.
The CV indicates the uncertainty of prediction for the gauged and ungauged catchments (Figure 8d–f). Most of the catchments located in the southern part of the basin show a higher CV value in combination with a low resilience of streamflow to precipitation variability. Prediction variability is higher in the southern part compared to the north, and there are also few gauged catchments. Therefore, the higher uncertainty in the south may be attributable to the mixed effects of higher precipitation variability and the remoteness of the gauged catchments used to establish the regional regression.
The variability in areas of gauged and ungauged catchments used for parameter estimation and prediction, respectively, reduces with the strength of correlation between calibrated parameters and catchment properties (Table S1). Other studies also show that runoff from smaller catchments can have a stronger relationship with local climate and catchment properties than larger catchments [24,45,75]. In addition, the wettest year shows higher uncertainty compared to the driest year. However, most of the ungauged catchments that are located in the northern parts show lower uncertainty, more or less similar to the gauged catchments. This is because these ungauged catchments are much closer to the gauged catchments and are hence better represented by the regression model than the remote ungauged catchments. This is supported by the study of Qi et al. [76] that compared different regionalization techniques and found the strongest performance using the spatial proximity under different climate regions. Moreover, most of the streamflow in the north remains more resilient to precipitation change as it shows lower elasticity values, which indicate a higher resilience to streamflow in the dry years (Figure 8c). Furthermore, the CV for the driest year shows a relatively low uncertainty in the northern and eastern parts for both gauged and ungauged catchments.
The application of this approach to the RVLB shows that the predicted elasticities are characterized by a wide range of uncertainty for the ungauged catchments in the southern part. The reasons for such variation in uncertainty could be the fact that the ungauged catchments in the southern part are mainly dry and receive a comparatively low amount of rainfall (Table S2). This is different from the wetter, northern part, where most of the gauged catchments are located. This accords with studies showing a decreasing regionalization performance from smaller and more arid catchments [77]. Previous work has also shown that the spatial variability of precipitation can interact with catchment properties to alter hydrological processes [78]. Thus, higher precipitation variability in the gauged and ungauged catchments introduces more uncertainty to parameter regionalization. Such variability in prediction also results from the relative location of an ungauged catchment to the gauged one where the regional model is developed [79].
This approach shows variability in the resilience of gauged and ungauged regions, which emerges from parameter uncertainty and climate variability. Over the RVLB, lakes are particularly stressed by growing water demand, climate variability, and drought [39]. The reliability of open water resources in low-resilient catchments remains uncertain. Coupled with the significant reduction in lake sizes and water levels [80], this will negatively affect water resource availability and ecosystem stability in the future.

5.4. Transferability of the Approach to Other Catchments and Models

This study provides a methodology that accounts for uncertainty throughout all steps of the regionalization approach. It translates data limitations into the remaining uncertainties that were found in the regional simulations (expressed, for instance, by the CV values of the elasticities). The approach is independent of the model and the number of parameters, but it is expected that a more complex model (with more parameters) would struggle more through data limitations and generate larger uncertainties when applied regionally.
In this study, the identifiability of the model parameters varied across the 14 catchments. This could be from the variation in the catchment properties used for the regionalization. Van Esse et al. [81] show the performance of conceptual hydrological models to vary depending on the size of a catchment and the soil moisture state of a catchment. Yang et al. [73] also showed the regionalized model to be dependent on the complexity of hydrological models in different climatic regions. In this approach, the variation is accounted for through regionalization uncertainties by applying the leave-one-out spatial cross-validation. Thereby, one can have different levels of parameter identifiability in the transferability of a model parameter from one model component to another. By choosing stable parameter sets for the regional model, one can guarantee the transferability to another region by the spatially evaluated robust regional model within their uncertainty range. Nevertheless, model parameter transferability does not always result in success. The difference in model parameters could derive from the complex relationship between different model components within the model structures. The transferability of insensitive parameters (K0, and VUZL) could add bias to the regionalized models, thus taking their median values from the estimated parameters can be a better model [10]. The stability and resilience of the model parameters from the regionalization procedure would minimize the error and bias in parameter transfer within the model component [62]. However, success in the parameter transfers will be influenced by the dominant catchment properties that are identified regionally [32]. Despite this, a sufficient correlation between the catchment property and model parameters can be a good indicator of the predictive power of the selected catchment properties.

6. Conclusions

The study demonstrated the use of global data products for the regionalization of model parameters using a small sample of gauged catchments despite data scarcity. The study applied multiple options for parameter estimation and model evaluation procedures. Three steps of uncertainty quantification were combined from the parameter sampling, best parameter sets identification, and leave-one-out cross-validation. The study demonstrated the validity and reliability of this approach at 14 test catchments with varying catchment properties. The parameter estimates from the leave-one-out cross-validation using the most stable parameters have outperformed the parameter estimates from best-calibrated and best-validated parameter sets.
The study incorporated uncertainties from the leave-one-out cross-validation that can provide a robust way of uncertainty quantification by generating 14 estimates of plausible streamflow ensembles and simulation uncertainties. This approach shows variability in the resilience of gauged and ungauged regions, which emerges from parameter uncertainty and climate variability. The study showed the uncertainties of elasticities in the gauged catchments obtained from simulation to be less than that of the uncertainties of elasticities in the ungauged catchments obtained from regionalization. The study further enables the quantification of the wettest and driest year elasticities and their uncertainties throughout the catchments, which provides a basis for the integrated water resource management in the region. Linking this approach with more observations of catchment properties on a larger scale can provide a good basis for large-scale water resource management. This approach can be extended to simulate and quantify the resilience of gauged and ungauged regions under climate change by quantifying the additional uncertainties emerging from climate projections.
Overall, this approach provided directions for uncertainty reduction by combining global input data with local discharge measurements, which result in a refinement of estimated model parameters for both gauged and ungauged catchments. The acceptable monthly simulation results obtained in the gauged catchments and the acceptable results of the spatial split-sample test (with median Nash Sutcliffe efficiencies of 0.63) indicate that global products can be used as model inputs to provide reasonable simulations in data-scarce regions. Small-scale studies that use simple models (few parameter numbers) show the transferability of the model parameters to ungauged basins [10]. In extension to this, this approach provides the possibility of identifying parameters of ungauged basins in data-scarce regions, including a thorough evaluation and uncertainty quantification procedure. Using well-identified parameters, more reliable regional relationships can be obtained from the most dominant catchment attributes. In either large or low samples of catchments, a leave-one-out procedure should always be possible. By applying a specific hydrological model (HBV) and using a general framework, it can be also easily adapted to other hydrologic models. This approach demonstrated a way for robust parameter regionalization through leave-one-out cross-validation that enables the transfer of parameters across regions. It is proposed for further research that this approach can be applied for parameter transfer with model structures other than HBV to elaborate its general applicability in data-scarce regions. As this approach is model-independent and the input data used are available globally, it can be applied to any other data-scarce region where predictions of regional water availability are required. Despite the results, caution should always be taken when considering low catchment numbers as poorly assessed catchment properties or imprecise forcing data for some of them can greatly bias regional estimates.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/hydrology9080150/s1. Table S1 Correlation coefficients (CC) between catchment properties and model parameters. Table S2 Catchment properties that are derived for streamflow prediction in the ungauged catchments using the 14 parameter sets obtained by the weighted regression. Table S3 The maximum NSE on the monthly scale for the best parameters derived from calibration, validation, and stable sets, and standard deviations (std) of monthly NSE for calibration and validation. Table S4 Evaluation of the three different regression models from calibration, validation, and stable sets on both calibration (1995–2002) and validation periods (2003–2007). NSE was calculated on the Log transformed scale of observed and simulated discharge. Table S5 R2 of weighted regression performance for the nine HBV parameters during the leave-one-out cross-validation. Figure S1 Confined ranges of model parameters derived during model calibration for each catchment. Figure S2 Scatter plots between best parameters estimated from calibration and parameters derived from the regression model. The black circle (catchment #8) represents best performing catchment from parameter estimation and its corresponding parameters from regression. The red circle (catchment #9) represents the most poorly performing catchment by the regression model. For unidentifiable parameters (K0 and VUZL) their median value was taken for the regionalized model. Figure S3 Scatter plots between best parameters estimated from validation and parameters derived from the regression model. The black circle (catchment #8) represents best performing catchment from parameter estimation and its corresponding parameters from regression. The red circle (catchment #9) represents the most poorly performing catchment by the regression model. For unidentifiable parameters (K0 and VUZL) their median value was taken for the regionalized model.

Author Contributions

T.A. conceptualized and wrote the paper, developed and applied the regional model, and analyzed and visualized the results. Y.L., S.T. and A.H. provided supervision and advice throughout the development and application of this study and provided support in developing the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

The German Academic Exchange Service DAAD and Hawassa University under the EECBP Home Grown Ph.D. Scholarship Programme, 2019 (57472170) supported Tesfalem Abraham. Yan Liu and Andreas Hartmann were supported by the Emmy-Noether-Programme of the German Research Foundation (DFG, grant number: HA 8113/1-1; project “Global Assessment of Water Stress in Karst Regions in a Changing World”).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The climatic forcing data are publicly available and can be obtained via the link http://www.gloh2o.org/mswep for MSWEP (access date: 14 January 2020) and from https://www.gleam.eu for GLEAM (access date: 20 January 2020). The simulation of the hydrological model was conducted using a freely available lumped HBV model code implemented by MATLAB that is available at www.gloh2o.org (access date: 12 January 2020). The streamflow data, with its prediction interval and the uncertainties of parameters for the ungauged catchments produced during the current study, are available in the repository at https://doi.org/10.5281/ZENODO.5806358 (access date: 27 December 2021). Detailed descriptions of the datasets are shown by Abraham et al. [82]. The physical properties data such as permeability and porosity were extracted from the global datasets prepared by Huscroft et al. [44]. The catchment properties data extracted for gauged and ungauged catchments are shown in the manuscript in Table 2 and Table S2, respectively.

Acknowledgments

The authors would like to thank the German Academic Exchange Service DAAD and Hawassa University for the grant of this Ph.D. scholarship. We are thankful for all the structural facilities provided by the chair of hydrological modeling and water resources at the University of Freiburg. The Ministry of Water Irrigation and Electricity of Ethiopia provides the streamflow data used in this study. The authors would like to acknowledge the work of Beck et al. [83] and Beck et al. [30] who provided the MSWEP precipitation and HBV parameters, respectively, for the global scale. We are also thankful for the provision of the global scale potential evapotranspiration (GLEAM) datasets by Martens et al. [43].

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Woldegabriel, G.; Aronson, J.L.; Walter, R. Geology, geochronology, and rift basin development in the central sector of the Main Ethiopia Rift. GSA Bull. 1990, 102, 439–458. [Google Scholar] [CrossRef]
  2. Stokstad, E. Scarcity of Rain, Stream Gages Threatens Forecasts. Science 1999, 285, 1199–1200. [Google Scholar] [CrossRef]
  3. Blöschl, G.; Sivapalan, M.; Wagener, T.; Viglione, A.; Savenije, H. Runoff Prediction in Ungauged Basins: Synthesis across Processes, Places and Scales; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
  4. Sivapalan, M.; Takeuchi, K.; Franks, S.; Gupta, V.; Karambiri, H.; Lakshmi, V.; Liang, X.; Mcdonnell, J.; Mendiondo, E.; Connell, O.; et al. IAHS decade on Predictions in Ungauged Basins (PUB), 2003–2012: Shaping an exciting future for the hydrological sciences Catchments as space–time filters—A joint spatio-temporal geostatistical analysis of runoff and precipitation. Hydrol. Sci. J. 2003, 48, 857–880. [Google Scholar] [CrossRef]
  5. Wagener, T.; Wheater, H.S.; Gupta, H.V. Rainfall-Runoff Modelling in Gauged and Ungauged Catchments; Imperial College Press: London, UK, 2004. [Google Scholar]
  6. Guo, Y.; Zhang, Y.; Zhang, L.; Wang, Z. Regionalization of hydrological modeling for predicting streamflow in ungauged catchments: A comprehensive review. WIREs Water 2020, 8, e1487. [Google Scholar] [CrossRef]
  7. Pool, S.; Vis, M.; Seibert, J. Regionalization for Ungauged Catchments—Lessons Learned From a Comparative Large-Sample Study. Water Resour. Res. 2021, 57, e2021WR030437. [Google Scholar] [CrossRef]
  8. Arsenault, R.; Breton-Dufour, M.; Poulin, A.; Dallaire, G.; Romero-Lopez, R. Streamflow prediction in ungauged basins: Analysis of regionalization methods in a hydrologically heterogeneous region of Mexico. Hydrol. Sci. J. 2019, 64, 1297–1311. [Google Scholar] [CrossRef]
  9. Arsenault, R.; Brissette, F. Analysis of continuous streamflow regionalization methods within a virtual setting. Hydrol. Sci. J. 2016, 61, 2680–2693. [Google Scholar] [CrossRef]
  10. Wagener, T.; Wheater, H.S. Parameter estimation and regionalization for continuous rainfall-runoff models including uncertainty. J. Hydrol. 2006, 320, 132–154. [Google Scholar] [CrossRef]
  11. Yang, X.; Magnusson, J.; Rizzi, J.; Xu, C.-Y. Runoff prediction in ungauged catchments in Norway: Comparison of regionalization approaches. Water Policy 2017, 49, 487–505. [Google Scholar] [CrossRef]
  12. Wagener, T.; Montanari, A. Convergence of approaches toward reducing uncertainty in predictions in ungauged basins. Water Resour. Res. 2011, 47, W06301. [Google Scholar] [CrossRef]
  13. Kapangaziwiri, E.; Hughes, D.; Wagener, T. Incorporating uncertainty in hydrological predictions for gauged and ungauged basins in southern Africa. Hydrol. Sci. J. 2012, 57, 1000–1019. [Google Scholar] [CrossRef]
  14. Yadav, M.; Wagener, T.; Gupta, H. Regionalization of constraints on expected watershed response behavior for improved predictions in ungauged basins. Adv. Water Resour. 2007, 30, 1756–1774. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Chiew, F.H.S.; Li, M.; Post, D. Predicting Runoff Signatures Using Regression and Hydrological Modeling Approaches. Water Resour. Res. 2018, 54, 7859–7878. [Google Scholar] [CrossRef]
  16. Zhang, Z.; Wagener, T.; Reed, P.; Bhushan, R. Reducing uncertainty in predictions in ungauged basins by combining hydrologic indices regionalization and multiobjective optimization. Water Resour. Res. 2008, 44, W00B04. [Google Scholar] [CrossRef]
  17. Nega, H.; Seleshi, Y. Regionalization of mean annual flow for ungauged catchments in case of Abbay River Basin, Ethiopia. Model. Earth Syst. Environ. 2020, 7, 341–350. [Google Scholar] [CrossRef]
  18. Westerberg, I.K.; Wagener, T.; Coxon, G.; McMillan, H.K.; Castellarin, A.; Montanari, A.; Freer, J. Uncertainty in hydrological signatures for gauged and ungauged catchments. Water Resour. Res. 2016, 52, 1847–1865. [Google Scholar] [CrossRef]
  19. Addor, N.; Nearing, G.; Prieto, C.; Newman, A.J.; Le Vine, N.; Clark, M.P. A Ranking of Hydrological Signatures Based on Their Predictability in Space. Water Resour. Res. 2018, 54, 8792–8812. [Google Scholar] [CrossRef]
  20. Merz, R.; Tarasova, L.; Basso, S. Parameter’s Controls of Distributed Catchment Models—How Much Information is in Conventional Catchment Descriptors? Water Resour Res. 2020, 56, 1–18. [Google Scholar] [CrossRef]
  21. Beven, K. Environmental Modelling: An Uncertain Future? 1st ed.; CRC Press: London, UK, 2018; pp. 1–310. [Google Scholar]
  22. Beven, K. A manifesto for the equifinality thesis. J. Hydrol. 2006, 320, 18–36. [Google Scholar] [CrossRef]
  23. Gibbs, M.S.; Maier, H.R.; Dandy, G.C. A generic framework for regression regionalization in ungauged catchments. Environ. Model. Softw. 2012, 27–28, 1–14. [Google Scholar] [CrossRef]
  24. Merz, R.; Blöschl, G. Regionalisation of catchment model parameters. J. Hydrol. 2004, 287, 95–123. [Google Scholar] [CrossRef]
  25. Seibert, J. Regionalisation of parameters for a conceptual rainfall-runoff model. Agric. For. Meteorol. 1999, 98-99, 279–293. [Google Scholar] [CrossRef]
  26. Döll, P.; Kaspar, F.; Lehner, B. A global hydrological model for deriving water availability indicators: Model tuning and validation. J. Hydrol. 2003, 270, 105–134. [Google Scholar] [CrossRef]
  27. Nijssen, B.; O’Donnell, G.M.; Lettenmaier, D.P.; Lohmann, D.; Wood, E. Predicting the Discharge of Global Rivers. J. Clim. 2001, 14, 3307–3323. [Google Scholar] [CrossRef]
  28. Widén-Nilsson, E.; Halldin, S.; Xu, C.-Y. Global water-balance modelling with WASMOD-M: Parameter estimation and regionalisation. J. Hydrol. 2007, 340, 105–118. [Google Scholar] [CrossRef]
  29. Li, H.; Zhang, Y. Regionalising rainfall-runoff modelling for predicting daily runoff: Comparing gridded spatial proximity and gridded integrated similarity approaches against their lumped counterparts. J. Hydrol. 2017, 550, 279–293. [Google Scholar] [CrossRef]
  30. Beck, H.E.; van Dijk, A.I.J.M.; de Roo, A.; Miralles, D.G.; McVicar, T.R.; Schellekens, J.; Bruijnzeel, L.A. Global-scale regionalization of hydrologic model parameters. Water Resour. Res. 2016, 52, 3599–3622. [Google Scholar] [CrossRef]
  31. Tasker, G.D. Hydrologic regression with weighted least squares. Water Resour. Res. 1980, 16, 1107–1113. [Google Scholar] [CrossRef]
  32. Singh, R.; Archfield, S.; Wagener, T. Identifying dominant controls on hydrologic parameter transfer from gauged to ungauged catchments—A comparative hydrology approach. J. Hydrol. 2014, 517, 985–996. [Google Scholar] [CrossRef]
  33. Lane, R.A.; Freer, J.E.; Coxon, G.; Wagener, T. Incorporating Uncertainty Into Multiscale Parameter Regionalization to Evaluate the Performance of Nationally Consistent Parameter Fields for a Hydrological Model. Water Resour. Res. 2021, 57, e2020WR028393. [Google Scholar] [CrossRef]
  34. Livneh, B.; Lettenmaier, D.P. Regional parameter estimation for the unified land model. Water Resour. Res. 2013, 49, 100–114. [Google Scholar] [CrossRef]
  35. Wolfenden, E.; Ebinger, C.; Yirgu, G.; Deino, A.; Ayalew, D. Evolution of the northern Main Ethiopian rift: Birth of a triple junction. Earth Planet. Sci. Lett. 2004, 224, 213–228. [Google Scholar] [CrossRef]
  36. Abebe, T.; Manetti, P.; Bonini, M.; Corti, G.; Innocenti, F.; Mazzarini, F.; Peckzay, Z. Geological map (scale 1:200,000) of the northern Main Ethiopian Rift and its implications for the volcano-tectonic evolution of the rift. Geol. Soc. Am. Map Chart Ser. 2005, MCH094, 1–20. [Google Scholar] [CrossRef]
  37. Segele, Z.T.; Lamb, P.J. Characterization and variability of Kiremt rainy season over Ethiopia. Meteorol. Atmos. Phys. 2005, 89, 153–180. [Google Scholar] [CrossRef]
  38. Molla, D.D.; Tegaye, T.A.; Fletcher, C.G. Simulated surface and shallow groundwater resources in the Abaya-Chamo Lake basin, Ethiopia using a spatially-distributed water balance model. J. Hydrol. Reg. Stud. 2019, 24, 100615. [Google Scholar] [CrossRef]
  39. Seyoum, W.M.; Milewski, A.M.; Durham, M.C. Understanding the relative impacts of natural processes and human activities on the hydrology of the Central Rift Valley lakes, East Africa. Hydrol. Process. 2015, 29, 4312–4324. [Google Scholar] [CrossRef]
  40. Tesfalem, A.; Abraham, W.; Alemayehu, M.; Brook, A. Hydrological Responses of Climate Change on Lake Ziway Catchment, Central Rift Valley of Ethiopia. J. Earth Sci. Clim. Chang. 2018, 9. [Google Scholar] [CrossRef]
  41. Beck, H.E.; Wood, E.F.; Pan, M.; Fisher, C.K.; Miralles, D.G.; Van Dijk, A.I.J.M.; McVicar, T.R.; Adler, R.F. MSWep v2 Global 3-hourly 0.1° precipitation: Methodology and quantitative assessment. Bull. Am. Meteorol. Soc. 2019, 100, 473–500. [Google Scholar] [CrossRef]
  42. Baez-Villanueva, O.M.; Zambrano-Bigiarini, M.; Mendoza, P.A.; McNamara, I.; Beck, H.E.; Thurner, J.; Nauditt, A.; Ribbe, L.; Thinh, N.X. On the selection of precipitation products for the regionalisation of hydrological model parameters. Hydrol. Earth Syst. Sci. 2021, 25, 5805–5837. [Google Scholar] [CrossRef]
  43. Martens, B.; Gonzalez Miralles, D.; Lievens, H.; Van Der Schalie, R.; De Jeu, R.A.M.; Fernández-Prieto, D.; Beck, H.E.; Dorigo, W.A.; Verhoest, N.E.C. GLEAM v3: Satellite-based land evaporation and root-zone soil moisture. Geosci. Model Dev. 2017, 10, 1903–1925. [Google Scholar] [CrossRef]
  44. Huscroft, J.; Gleeson, T.; Hartmann, J.; Börker, J. Compiling and Mapping Global Permeability of the Unconsolidated and Consolidated Earth: GLobal HYdrogeology MaPS 2.0 (GLHYMPS 2.0). Geophys. Res. Lett. 2018, 45, 1897–1904. [Google Scholar] [CrossRef]
  45. Bárdossy, A. Calibration of hydrological model parameters for ungauged catchments. Hydrol. Earth Syst. Sci. 2007, 11, 703–710. [Google Scholar] [CrossRef]
  46. Razavi, T.; Coulibaly, P. Improving streamflow estimation in ungauged basins using a multi-modelling approach. Hydrol. Sci. J. 2016, 61, 2668–2679. [Google Scholar] [CrossRef]
  47. Parajka, J.; Merz, R.; Blöschl, G. A comparison of regionalisation methods for catchment model parameters. Hydrol. Earth Syst. Sci. 2005, 9, 157–171. [Google Scholar] [CrossRef]
  48. Bergström, S. Development and Application of a Conceptual Runoff Model for Scandinavian Catchments; SMHI Rep. RHO 7; Swedish Meteorological and Hydrological Institute: Lund, Sweden, 1976; Volume 134. [Google Scholar]
  49. Seibert, J.; Vis, M.J.P. Teaching hydrological modeling with a user-friendly catchment-runoff-model software package. Hydrol. Earth Syst. Sci. 2012, 16, 3315–3325. [Google Scholar] [CrossRef]
  50. Te Linde, A.; Hurkmans, R.; Eberle, M. Comparing model performance of two rainfall-runoff models in the Rhine basin using different atmospheric forcing data sets. Hydrol. Earth Syst Sci. 2008, 12, 943–957. [Google Scholar] [CrossRef]
  51. Zhang, X.; Lindström, G. A Comparative Study of a Swedish and a Chinese Hydrological Model. JAWRA J. Am. Water Resour. Assoc. 2007, 32, 985–994. [Google Scholar] [CrossRef]
  52. Hundecha, Y.; Bárdossy, A. Modeling of the effect of land use changes on the runoff generation of a river basin through parameter regionalization of a watershed model. J. Hydrol. 2004, 292, 281–295. [Google Scholar] [CrossRef]
  53. Jin, X.; Xu, C.-Y.; Zhang, Q.; Chen, Y.D. Regionalization study of a conceptual hydrological model in Dongjiang basin, south China. Quat. Int. 2009, 208, 129–137. [Google Scholar] [CrossRef]
  54. Masih, I.; Uhlenbrook, S.; Maskey, S.; Ahmad, M.-U. Regionalization of a conceptual rainfall–runoff model based on similarity of the flow duration curve: A case study from the semi-arid Karkheh basin, Iran. J. Hydrol. 2010, 391, 188–201. [Google Scholar] [CrossRef]
  55. Parajka, J.; Bloschl, G.; Merz, R. Regional calibration of catchment models: Potential for ungauged catchments. Water Resour. Res. 2007, 43, 1–16. [Google Scholar] [CrossRef]
  56. Bergström, S. The HBV Model—Its Structure and Applications; Swedish Meteorological and Hydrological Institute: Norrköping, Sweden, 1992; Volume 4, pp. 1–33. [Google Scholar]
  57. Tolson, B.A.; Shoemaker, C.A. Dynamically dimensioned search algorithm for computationally efficient watershed model calibration. Water Resour. Res. 2007, 43. [Google Scholar] [CrossRef]
  58. Klemeš, V. Operational testing of hydrological simulation models. Hydrol. Sci. J. 1986, 31, 13–24. [Google Scholar] [CrossRef]
  59. Nash, J.E.; Sutcliffe, J.V. River flow forecasting through conceptual models part I—A discussion of principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
  60. Spearman, C. The Proof and Measurement of Association between Two Things. Am. J. Psychol. 1904, 15, 72–101. [Google Scholar] [CrossRef]
  61. Breiman, L.; Spector, P. Submodel Selection and Evaluation in Regression. The X-Random Case. Int. Stat. Rev. 1992, 60, 291. [Google Scholar] [CrossRef]
  62. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference and Prediction Probability Theory: The Logic of Science The Fundamentals of Risk Measurement Mathematicians, pure and applied, think there is something weirdly different about. Math Intell. 2005, 27, 83–85. [Google Scholar]
  63. Worland, S.C.; Farmer, W.; Kiang, J.E. Improving predictions of hydrological low-flow indices in ungaged basins using machine learning. Environ. Model. Softw. 2018, 101, 169–182. [Google Scholar] [CrossRef]
  64. Estacio, A.B.S.; Costa, A.C.; Filho, F.A.S.; Rocha, R.V. Uncertainty analysis in parameter regionalization for streamflow prediction in ungauged semi-arid catchments. Hydrol. Sci. J. 2021, 66, 1132–1150. [Google Scholar] [CrossRef]
  65. Sankarasubramanian, A.; Vogel, R.M.; Limbrunner, J.F. Climate elasticity of streamflow in the United States. Water Resour Res. 2001, 37, 1771–1781. [Google Scholar] [CrossRef]
  66. Hartmann, A.; Kobler, J.; Kralik, M.; Dirnböck, T.; Humer, F.; Weiler, M. Model-aided quantification of dissolved carbon and nitrogen release after windthrow disturbance in an Austrian karst system. Biogeosciences 2016, 13, 159–174. [Google Scholar] [CrossRef]
  67. Prieto, C.; Le Vine, N.; Kavetski, D.; García, E.; Medina, R. Flow Prediction in Ungauged Catchments Using Probabilistic Random Forests Regionalization and New Statistical Adequacy Tests. Water Resour. Res. 2019, 55, 4364–4392. [Google Scholar] [CrossRef]
  68. Zhang, Y.; Post, D. How good are hydrological models for gap-filling streamflow data? Hydrol. Earth Syst. Sci. 2018, 22, 4593–4604. [Google Scholar] [CrossRef]
  69. Brunner, M.I.; Furrer, R.; Sikorska, A.E.; Viviroli, D.; Seibert, J.; Favre, A.-C. Synthetic design hydrographs for ungauged catchments: A comparison of regionalization methods. Stoch. Hydrol. Hydraul. 2018, 32, 1993–2023. [Google Scholar] [CrossRef]
  70. Abebe, N.A.; Ogden, F.L.; Pradhan, N.R. Sensitivity and uncertainty analysis of the conceptual HBV rainfall–runoff model: Implications for parameter estimation. J. Hydrol. 2010, 389, 301–310. [Google Scholar] [CrossRef]
  71. Gupta, H.V.; Clark, M.P.; Vrugt, J.A.; Abramowitz, G.Y.M. Towards a comprehensive assessment of model structural adequacy. Water Resour Res. 2012, 48, 1–16. [Google Scholar] [CrossRef]
  72. Song, X.; Zhang, J.; Zhan, C.; Xuan, Y.; Ye, M.; Xu, C. Global sensitivity analysis in hydrological modeling: Review of concepts, methods, theoretical framework, and applications. J. Hydrol. 2015, 523, 739–757. [Google Scholar] [CrossRef]
  73. Yang, X.; Magnusson, J.; Huang, S.; Beldring, S.; Xu, C.-Y. Dependence of regionalization methods on the complexity of hydrological models in multiple climatic regions. J. Hydrol. 2019, 582, 124357. [Google Scholar] [CrossRef]
  74. Zhang, D.; Zhang, L.; Guan, Y.; Chen, X.; Chen, X. Sensitivity analysis of Xinanjiang rainfall-runoff model parameters: A case study in Lianghui, Zhejiang province, China. Hydrol. Res. 2012, 43, 123–134. [Google Scholar] [CrossRef]
  75. Kokkonen, T.S.; Jakeman, A.J.; Young, P.C.; Koivusalo, H.J. Predicting daily flows in ungauged catchments: Model regionalization from catchment descriptors at the Coweeta Hydrologic Laboratory, North Carolina. Hydrol. Process. 2003, 17, 2219–2238. [Google Scholar] [CrossRef]
  76. Qi, W.; Chen, J.; Li, L.; Xu, C.Y.; Xiang Y heng Zhang S bo Wang, H.M. Impact of the number of donor catchments and the efficiency threshold on regionalization performance of hydrological models. J. Hydrol. 2021, 601, 126680. [Google Scholar] [CrossRef]
  77. Parajka, J.; Viglione, A.; Rogger, M.; Salinas, J.L.; Sivapalan, M.; Blöschl, G. Comparative assessment of predictions in ungauged basins—Part 1: Runoff-hydrograph studies. Hydrol. Earth Syst. Sci. 2013, 17, 1783–1795. [Google Scholar] [CrossRef]
  78. Zhao, F.; Zhang, L.; Chiew, F.H.; Vaze, J.; Cheng, L. The effect of spatial rainfall variability on water balance modelling for south-eastern Australian catchments. J. Hydrol. 2013, 493, 16–29. [Google Scholar] [CrossRef]
  79. Patil, S.; Stieglitz, M. Controls on hydrologic similarity: Role of nearby gauged catchments for prediction at an ungauged catchment. Hydrol. Earth Syst. Sci. 2012, 16, 551–562. [Google Scholar] [CrossRef]
  80. Ayenew, T.; Becht, R. Comparative assessment of the water balance and hydrology of selected Ethiopian and Kenyan Rift Lakes. Lakes Reserv. Sci. Policy Manag. Sustain. Use 2008, 13, 181–196. [Google Scholar] [CrossRef]
  81. Van Esse, W.R.; Perrin, C.; Booij, M.J.; Augustijn, D.C.M.; Fenicia, F.; Kavetski, D.; Lobligeois, F. The influence of conceptual model structure on model performance: A comparative study for 237 French catchments. Hydrol. Earth Syst. Sci. 2013, 17, 4227–4239. [Google Scholar] [CrossRef]
  82. Abraham, T.; Liu, Y.; Tekleab, S.; Hartmann, A. Uncertainties of Monthly Discharge Data and Parameters in the Ungauged Catchments of the Ethiopian (RVLB). [Data Set]. Zenodo. 2021. Available online: https://doi.org/10.5281/zenodo.5806358 (accessed on 27 December 2021).
  83. Beck, H.E.; Vergopolan, N.; Pan, M.; Levizzani, V.; Van Dijk, A.I.J.M.; Weedon, G.P.; Brocca, L.; Pappenberger, F.; Huffman, G.J.; Wood, E.F. Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling. Hydrol. Earth Syst. Sci. 2017, 21, 6201–6217. [Google Scholar] [CrossRef]
Figure 1. The study basin showing 14 gauged catchments for the regional model development and 35 ungauged catchments that are draining to the respective lakes through the river networks.
Figure 1. The study basin showing 14 gauged catchments for the regional model development and 35 ungauged catchments that are draining to the respective lakes through the river networks.
Hydrology 09 00150 g001
Figure 2. Schematic diagram showing the entire procedure applied in this study.
Figure 2. Schematic diagram showing the entire procedure applied in this study.
Hydrology 09 00150 g002
Figure 3. The ranges of monthly NSE values derived from the confined parameter sets during calibration and the corresponding NSE ranges during the validation period derived from the confined parameter sets; and the monthly NSE derived from parameter sets of best calibration, best validation, and most stable parameter sets for each catchment.
Figure 3. The ranges of monthly NSE values derived from the confined parameter sets during calibration and the corresponding NSE ranges during the validation period derived from the confined parameter sets; and the monthly NSE derived from parameter sets of best calibration, best validation, and most stable parameter sets for each catchment.
Hydrology 09 00150 g003
Figure 4. Cumulative distributions of parameters in the 14 catchments that show the probability of identifiability for each parameter on their ranges. The ranges between the most identifiable and poorly identified parameters are indicated through dark gray to light gray colors. The respective identifiability is also shown by the weight (1/CV) of parameter distribution. Where the dark gray color shows the catchment number with the most identifiable parameter. Parameters such as K0 and VUZL are not identified through their parameter ranges, therefore the median of the parameter was used for the regression model.
Figure 4. Cumulative distributions of parameters in the 14 catchments that show the probability of identifiability for each parameter on their ranges. The ranges between the most identifiable and poorly identified parameters are indicated through dark gray to light gray colors. The respective identifiability is also shown by the weight (1/CV) of parameter distribution. Where the dark gray color shows the catchment number with the most identifiable parameter. Parameters such as K0 and VUZL are not identified through their parameter ranges, therefore the median of the parameter was used for the regression model.
Hydrology 09 00150 g004
Figure 5. Evaluation of the regression procedure on the calibration (1995–2002) and validation period (2003–2007), showing (a,b) the relationship of the NSE of the best-estimated parameters (from calibration, validation, and stable parameters set) and the parameters from the regression. The black and red scatter indicate the best and most poorly performing catchments, respectively. (c) Shows the performances of best-estimated parameters (from calibration, validation, and stable sets) and the performance of the three regression models developed using the best parameters of calibration (REGcal), validation (REGval), and stable sets (REGstable) while evaluated during the calibration and validation period.
Figure 5. Evaluation of the regression procedure on the calibration (1995–2002) and validation period (2003–2007), showing (a,b) the relationship of the NSE of the best-estimated parameters (from calibration, validation, and stable parameters set) and the parameters from the regression. The black and red scatter indicate the best and most poorly performing catchments, respectively. (c) Shows the performances of best-estimated parameters (from calibration, validation, and stable sets) and the performance of the three regression models developed using the best parameters of calibration (REGcal), validation (REGval), and stable sets (REGstable) while evaluated during the calibration and validation period.
Hydrology 09 00150 g005
Figure 6. Relationships between best parameters estimated from stable sets and parameters derived from the regression model. The black scatters (catchment #8) represent the catchment with most parameters that are highly identified during parameter estimation and its corresponding parameters during regression. The red scatters (catchment #9) represent a catchment with most parameters that are poorly identified during parameter estimation and its corresponding parameters during regression. For unidentifiable parameters (K0 and VUZL), their median value was taken for the regionalized model.
Figure 6. Relationships between best parameters estimated from stable sets and parameters derived from the regression model. The black scatters (catchment #8) represent the catchment with most parameters that are highly identified during parameter estimation and its corresponding parameters during regression. The red scatters (catchment #9) represent a catchment with most parameters that are poorly identified during parameter estimation and its corresponding parameters during regression. For unidentifiable parameters (K0 and VUZL), their median value was taken for the regionalized model.
Hydrology 09 00150 g006
Figure 7. Prediction interval derived from the 14 regression models using the best parameter sets from stable parameter sets and uncertainty interval; (a) for best−performing catchment (#08) and mean of the 14 simulations, and (b) uncertainty interval for most poorly performing catchment (#09) and mean of the 14 simulations.
Figure 7. Prediction interval derived from the 14 regression models using the best parameter sets from stable parameter sets and uncertainty interval; (a) for best−performing catchment (#08) and mean of the 14 simulations, and (b) uncertainty interval for most poorly performing catchment (#09) and mean of the 14 simulations.
Hydrology 09 00150 g007
Figure 8. (ac) Elasticity calculated for the median, wettest, and driest years for both gauged and ungauged catchments and (df) their corresponding simulation uncertainty expressed by the coefficient of variation CV.
Figure 8. (ac) Elasticity calculated for the median, wettest, and driest years for both gauged and ungauged catchments and (df) their corresponding simulation uncertainty expressed by the coefficient of variation CV.
Hydrology 09 00150 g008
Table 2. Descriptions and values of properties for gauged catchments in the RVLB that were used for the development of the regional model.
Table 2. Descriptions and values of properties for gauged catchments in the RVLB that were used for the development of the regional model.
Cat No and Names at the Gauge LocationCatchment PropertiesDrainage Area [km2]Drainage Density [km km−2]Mean Slope [%]Mean Elevation [m]Catchment Index [m km−1]Permeability [log10 m2]Porosity [-]Wi [-]P [mm]
Description of propertiesIndex of catchment areaThe ratio of catchment stream length to the drainage areaMean of the percentage slope for each terrain unitIndex describing the mean of catchment elevationMean of all inter-nodal slopes in a catchmentIndex describing the nature of water flow in the shallow aquiferThe fraction of the volume of voids in the shallow aquiferWetness index (Wi) as the ratio of precipitation (P) to potential evapotranspiration (PE)Annual average precipitation (1995–2007)
#01-Bilate@Tena3821.20.07516.222037.110.07−12.1940.070.85923.6
#02-Gelana@Tore bridge506.40.12424.172084.510.39−12.50.091.171309.1
#03-Gidabo@Measso25900.113201805.414.54−12.2480.0970.85942.07
#04-Gedemso@Langano241.50.6718.12759.328.11−12.50.090.88919.19
#05-Woito@Bridge4528.20.0729.091439.510.64−11.4330.0281.341319.6
#06-Hamassa@Wajifo534.40.3415.861655.515.22−12.3060.0760.941208
#07-Hare196.50.3633.082343.177.67−12.20.0760.8971107.5
#08-Katar@Abura3241.10.1158.72601.919.99−12.0340.0640.69779.8
#9-Kulfo@Arbaminch397.20.22636.392249.976.55−12.2830.081.521617.8
#10-Meki@Meki village2033.10.11119.272124.411.12−12.1550.0680.58667.2
#11-Gidabo@Bedesa144.20.34130.182149.756.74−12.50.091.291397.7
#12-Katar@Fete1940.90.11714.492668.917.78−12.1710.0750.87991.2
#13-Katar@Timela205.20.6518.292953.540.87−12.3710.0840.7785.6
#14-Gidabo@Aposto 491.8 0.327 21.49 2012.6 26.76 −12.483 0.089 1.18 1300.1
Table 3. HBV parameter ranges for the RVLB and their descriptions, derived from Beck et al. [30].
Table 3. HBV parameter ranges for the RVLB and their descriptions, derived from Beck et al. [30].
ParameterDescriptionGlobal Range (Min to Max)
β [-]Shape coefficient of recharge function1–6
FC [mm]Maximum water storage in unsaturated-zone store50–700
K0 [d−1]Additional recession coefficient of upper groundwater store0.05–0.99
K1 [d−1]Recession coefficient of upper groundwater store0.01–0.8
K2 [d−1]Recession coefficient of lower groundwater store0.001–0.15
LP [-]Soil moisture value above which actual evaporation reaches potential evaporation0.3–1
MMAXBAS [d]Length of equilateral triangular weighting function1–3
PMAX [mm d−1]Maximum percolation to lower zone0–6
VUZL [mm]Threshold parameter for extra outflow from upper zone0–100
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Abraham, T.; Liu, Y.; Tekleab, S.; Hartmann, A. Prediction at Ungauged Catchments through Parameter Optimization and Uncertainty Estimation to Quantify the Regional Water Balance of the Ethiopian Rift Valley Lake Basin. Hydrology 2022, 9, 150. https://doi.org/10.3390/hydrology9080150

AMA Style

Abraham T, Liu Y, Tekleab S, Hartmann A. Prediction at Ungauged Catchments through Parameter Optimization and Uncertainty Estimation to Quantify the Regional Water Balance of the Ethiopian Rift Valley Lake Basin. Hydrology. 2022; 9(8):150. https://doi.org/10.3390/hydrology9080150

Chicago/Turabian Style

Abraham, Tesfalem, Yan Liu, Sirak Tekleab, and Andreas Hartmann. 2022. "Prediction at Ungauged Catchments through Parameter Optimization and Uncertainty Estimation to Quantify the Regional Water Balance of the Ethiopian Rift Valley Lake Basin" Hydrology 9, no. 8: 150. https://doi.org/10.3390/hydrology9080150

APA Style

Abraham, T., Liu, Y., Tekleab, S., & Hartmann, A. (2022). Prediction at Ungauged Catchments through Parameter Optimization and Uncertainty Estimation to Quantify the Regional Water Balance of the Ethiopian Rift Valley Lake Basin. Hydrology, 9(8), 150. https://doi.org/10.3390/hydrology9080150

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop