1. Introduction
The renewable contribution of the electric energy matrix in Uruguay has been increasing steadily during the last decades, with hydroelectric, wind and solar components that have different inherent variability and predictability. This poses both a challenge and an opportunity to optimize planning at different embedded timescales and, ultimately, dispatch. The interconnected Electric System Simulator (SimSEE [
1]) is used for these purposes [
2], from the management of the spot market to long-term analysis of the evolution of the generation capacity, with intermediate seasonal and nested weekly planning. Particularly, in the case of the hydroelectric generation, we used a coupled hydrological and electric system modeling approach in order to generate and process a hydrological ensemble forecast for the largest reservoir of the system [
3]. The ability to forecast the hydrological inflow contributes to the optimal use of each energy source, with the corresponding economic and environmental benefits.
In most applications that use operational hydrological models, the spatial and temporal variability of precipitation constitutes one of the dominant factors and with greater associated uncertainty. In this context, remote sensing products are ideal instruments to be used in real-time hydrological modeling, embedded in operational systems for flood warnings, drought monitoring and water resource management [
4,
5]. These products are especially useful in data-sparse, ungauged or large-scale catchments. Several satellite-based precipitation estimates, with high spatial and temporal resolution, are currently available in near-real-time (NRT): the Climate Prediction Center (CPC) MORPHing algorithm (CMORPH) [
6], the CPC Quick MORPHing technique (QMORPH) [
7], the Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) 3B42 [
8], the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks and a Cloud Classification System (PERSIANN-CCS) [
9], the Global Satellite Mapping of Precipitation (GSMaP) [
10,
11] and the Integrated Multi-satellite Retrievals for Global Precipitation Measurement (IMERG-GPM) [
12]. Information from a large number of rain gauges, which are more accurate but generally sparsely distributed, is already assimilated as part of these global/regional satellite algorithms. However, since they are indirect estimates derived by algorithms from satellite images, they require validation in each region and climate and eventually calibration against local ground-based precipitation measurements. Additionally, it is well known that the NRT versions of these products are less accurate (as compared with the final or research-quality products) but provide quick precipitation estimates suitable for NRT monitoring and operational modeling activities [
13].
Several validation studies evaluated the utility of the satellite precipitation products as input for hydrologic models [
4,
14,
15,
16,
17,
18,
19]. Particularly, Jiang et al. (2019) [
5] provided a comprehensive review about the role of satellite-based remote sensing products in improving simulated streamflow. They highlighted that, in general, the capability and feasibility of satellite rainfall estimates in driving hydrological models vary widely due to differences in topography, season, climate, basin scale, selected hydrological model and satellite product type. Moreover, prior to their implementation in the hydrological model, the estimates require thorough validation; bias correction based on rain gauge data is commonly needed. Nerini et al. (2015) [
20] and Beck et al. (2017) [
16] emphasized that careful data merging can exploit the complementary strengths of each source of precipitation data. Considering this, several merging methods have been developed based on the early efforts of the radar research community: mean-field bias correction, inverse-error-weighted averaging methods, interpolations by inverse distance weighting, double-kernel smoothing, nearest neighbor method, correction through regression analysis, correction using probability distributions and geostatistical methods (cokriging, kriging with external drift, regression kriging, Bayesian combination, among others) [
20,
21]. In general, identifying the spatial correlation in the error (residual) structure model is the most important step in the merging process.
Previous experience with this type of products (CMORPH) in the study area, the Rio Negro basin in northeastern Uruguay, confirms the need to implement a bias removal scheme based on available surface observations prior to any application [
22,
23]. Their results indicated that quantile matching method produces an unbiased estimate whose skill, as measured by the probability of detection (POD), is better than that obtained from surface observations for average distances among stations larger than approximately 50 km. Adjustment of satellite estimates using spatial interpolation of CMORPH residuals evaluated at nearby points eliminates biases to a large degree. Moreover, it shows higher skill than using only surface data for the entire range of distances and daily precipitation thresholds considered and for both seasons (cold and warm).
In this study, we present the operational implementation of a methodology for the combination of rain gauge observations and satellite-rainfall estimates at daily time step to improve the rainfall monitoring in NRT over the Rio Negro Basin. The interpolated precipitation field obtained with the proposed methodology is then used as initial condition for the hydrological modeling of the basin, coupled with an electric system modeling, in order to obtain the optimal dispatch of the system for the following seven days [
3]. The performance of the proposed merged precipitation estimate was statistically evaluated through comparison with an independent historic rain gauge dataset. Furthermore, a hydrological application was implemented using the GR4J model [
24] at daily time step and compared to the estimated “theoretical” inflow to the hydroelectric reservoir. We expect that this work will contribute to the understanding of the reliability of the latest NRT satellite-based precipitation products and provide a reference for their applications in operational hydrological simulation and water resource management.
2. Study Area
The upper Rio Negro basin, in northeastern Uruguay, has a surface area of about 40,000 km
2 (
Table 1), taking Gabriel Terra hydroelectric plant (G. Terra) as its closure point.
Downstream G. Terra in the Rio Negro, we find Baygorria and Constitución hydroelectric plants. The binational (Argentina-Uruguay) hydroelectric plant Salto Grande, in the Uruguay River, completes the total hydroelectric capacity that collectively represents a third of the current installed power in the country’s electric system and contribute with more than the 50% of the mean total generated electricity [
25] with large interannual variability. In the Uruguayan system, the main storable resource is the water in the hydropower reservoirs, particularly in G. Terra, since it has the highest storage capacity (
Table 2). Considering the growing pressure for water demand from both agricultural and forestry expansion, together with the continuous increase of electric energy demand, this highlights the need for adequate tools for the management of water resources in the Rio Negro basin [
26].
Figure 1 shows the location of existing hydroelectric plants (black triangle) and the delimitation of G. Terra basin. We also included the location of the rain gauges available at NRT (red square), used for the operational implementation of the merging approach and the historic rain gauge data (blue dot) used for validation purposes. Both datasets are presented in
Section 3.
4. Methodology
4.1. Merging Approach
The satellite-rain gauge data merging technique considered is based on the universal model of spatial variation [
31,
32]. As one of the hybrid geostatistical models,
Regression Kriging (RK) is a spatial interpolation technique that combines a deterministic model (regression) with a statistical model (Ordinary Kriging of the regression residuals). It uses a deterministic model to estimate a value of the variable (precipitation) by using actual ground measurements to calibrate a model for the satellite estimates and then refines the estimate analyzing the residuals for spatial correlation; finally, it combines the statistical fitting and deterministic modeling [
33].
The 3-step proposed model is summarized as follows (
Figure 4).
Regression of the station data on the satellite data using a Generalized Linear Model (GLM). A GLM model is implemented in order to fit the satellite estimates to the rainfall observations at station locations. For the GLM, we use a spatially correlated residual structure that is fit to the available data. For each day, we calculate both exponential and spherical spatial correlations and choose the one with the highest Akaike information criterion (AIC). Several regressors alternatives were tested, including both satellite products and each one separately. Based on the performance statistics obtained (not shown), we decided to use, for each day the individual satellite product, either IMERG or GSMaP, with the highest Pearson correlation between the rain gauge observations and the collocated satellite values.
Interpolation of the regression residuals at station locations to the entire grid using Ordinary Kriging. Once the regressed satellite estimation is obtained, we calculate the error (residual) between it and the observations at the station locations. Then, the interpolation of the regression residuals to the entire grid is done through Ordinary Kriging [
34], which exploits the spatial correlation in the residuals and this is added to the regressed satellite estimation in order to obtain the “unmasked” merged product.
Application of a rain/no rain mask (RNR mask). We apply an RNR mask to the merged product to prevent overestimation of the occurrence of rainfall in the interpolated field. The mask is obtained using the same merged precipitation estimate (RK) technique but switching the target observations to binary rain/no rain observations. Satellite estimates are used as regressors to forecast this binary field with the same RK technique described in 1 and 2. We use a threshold of 0.3 in the output of RK, a continuous field, to delimitate rainy region for the mask. Finally, the unmasked product is multiplied by the RNR mask to obtain the final masked merged product.
Figure 5 shows an example of the application of the RNR mask for a given day (6 July 2017). The middle column corresponds to the unmasked OK and RK products while the rightmost column shows the masked versions. As can be seen, there is a large purple region of slightly positive values (zeros are transparent) in the unmasked product, while in the masked products this region is forced to zero.
The merging algorithm in this study was written in R and is available on GitHub [
35,
36].
4.2. Rainfall-Runoff Model
The G. Terra basin (39,500 km
2) was discretized into 17 sub-basins with areas smaller than 7000 km
2.
Figure 6 presents the delimited catchments and their characteristics including: basin area (km
2), slope (%), concentration-time (Tc) (h) and SC (mm).
To simulate the hydrological inflows to G. Terra reservoir we use a daily hydrological model (GR4J) coupled with a hydrological transit model (Muskingum). The GR4J model is a daily lumped four-parameter rainfall-runoff model developed by Perrin et al. (2003) [
24]. The Muskingum model [
37] is a two-parameter hydrologic flood routing method, based on the storage continuity equation.
In a previous study, Narbondo et al. (2020) [
38] present a successful application of the GR4J daily rainfall-runoff model at 13 watersheds of Uruguay. They proposed an improved regionalization approach to predict runoff time series in ungauged catchments at country scale. Particularly, they found the optimal set of parameters of the GR4J model and, in addition, they found the relationships between them and watershed-physiographic factors.
Table 5 shows the description of the “GR4J-Muskingum” model parameters and the values adopted in each case following these recommendations.
4.3. Goodness-of-Fit Indicators
The performance of the merged precipitation estimate (RK) was statistically evaluated through comparison with an independent rain gauge network, relatively dense and uniformly distributed (referred to as “historic data” in
Figure 1). We also included in the evaluation the estimation based on the Ordinary Kriging interpolation from NRT rain gauge observations (OK) at the same grid as the satellite data (0.1° × 0.1°), which serves as the baseline for comparison with the merged product. Both estimates (RK and OK) were compared with the rain gauge observations belonging to the historic reference dataset. The performance statistics used for the comparison are the mean error (ME), the RMSE, the frequency bias (FBS), the POD and the false alarm ratio (FAR) for a precipitation threshold of 5 mm [
39,
40].
Furthermore, several verification indices were used to quantitatively assess the hydrological utility of the precipitation estimates based on the estimated “theoretical” inflow to G. Terra reservoir (
Figure 3), including the difference of total accumulated inflow (∆V), the Nash–Sutcliffe efficiency (NSE), the Kling-Gupta efficiency (KGE), the coefficient of determination (R
2) and RMSE [
41]. Additionally, we also conducted a first-level catchment water balance using the runoff ratio (RR), defined as the ratio of the precipitation that contributes to runoff [
20]. The RR values calculated using the different outputs from both estimates (RK and OK) were compared to known values from the literature [
42].
In all cases, the period analyzed is from 1 February 2017 to 31 May 2020.
4.4. Operational Implementation
The 5-step operational implementation of the coupled hydrological and electric system modeling approach is presented next (
Figure 7).
Data download and daily accumulation. The required precipitation input data are adequately collected: records of NRT stations, GSMaP-NRT, IMERG-NRT Late Run and GEFS ensemble forecast. Daily rainfall totals are accumulated at 1000 UTC.
Data quality control. Prior to the merging algorithm, a data quality control from both NRT rain gauges and satellite estimates is performed based on the Climate Data Tools (CDT-IRI) [
43]. Data quality control focuses on outlier detection for the purpose of elimination of data contamination, including the implementation of spatial-plausibility checks based on Scherrer et al. (2011) [
27]. The threshold values used in the controls were adjusted manually, looking to eliminate the most obvious suspicious values in the available historical data set.
Merging technique. The satellite-rain gauge data merging technique is implemented in order to obtain the RK precipitation estimate over the Rio Negro basin.
Hydrological modeling. Based on the RK estimate and the GEFS precipitation ensemble forecast, the GR4J rainfall-runoff model is implemented at the 17 sub-catchments of the G. Terra basin. The runoff output is then routed along the river network using the Muskingum model to simulate the daily inflow ensemble to G. Terra reservoir.
Electricity-system simulation. The simulated inflow ensemble is integrated to the existing model of the interconnected electric system (SimSEE), particularly into the synthesizer model (CEGH), through biases and noise attenuators per time step adjusted through maximum likelihood [
44].
The implemented model was integrated into SimSEE’s on June 2020 and has since run under the responsibility of the Electricity Market Administration (ADME) of Uruguay. The application (called VATES) is continuously updating and executing a SimSEE Room with the representation of the Uruguayan generation system, in order to obtain the dispatch of the following seven days with hourly detail. The results and information relevant to the operation are published automatically on ADME’s website [
45]. They also provide the required statistical information for the design of exchange offers with neighboring countries and the energy spot market.
6. Summary and Conclusions
In this study, we developed and implemented a methodology that combines rain gauge observations and satellite-rainfall estimates at daily time step to improve the rainfall monitoring in NRT. The proposed methodology involves 3 steps: (1) regression of station data on the satellite estimate using a Generalized Linear Model, (2) interpolation of the regression residuals at station locations to the entire grid using Ordinary Kriging and (3) an application of a rain/no rain mask. The merged precipitation field thus obtained is then used in a hydrological modeling of the Rio Negro basin whose output is, in turn, coupled with an electric system modeling that guides planning and dispatch decisions for the following seven days.
The performance of the proposed merged precipitation estimate was statistically evaluated through comparison with an independent historic rain gauge dataset. The incorporation of satellite information enhances the representation of spatial variability, particularly in data-sparse regions with reductions in RMSE of up to 20%, although the overall improvement is statistically marginal.
As far as the operation of the energy system is concerned, it is the input to the reservoirs that most directly affect the electric system simulations and, in turn, management optimization. The GR4J hydrological model, with a daily time step, was implemented at 17 sub-catchments of the G. Terra basin with routing up to the reservoir. Model performance was assessed comparing model output to the estimated “theoretical” inflow to G. Terra computed from a mass budget to the reservoir and rendered satisfactory statistics: 0.60 < R2 ≤ 0.75 and 0.50 < NSE ≤ 0.70. The estimation that incorporates the satellite information in addition to the surface observations has a higher performance, for all statistics considered, compared to the one that only incorporates the rain gauge data.
In an operational setting, simplicity and robustness of the implementation are as important as accuracy. All steps are currently implemented and run on a daily basis at the Electricity Market Administration (ADME): data download and quality control, merging algorithm, hydrological modeling and electric system simulation. The presented implementation improves the estimation of the precipitation field and carries that information all the way to the decision-making stage, with its corresponding socio-economic and environmental benefits.