Next Article in Journal
Impact of Meteorological Conditions on Overhead Transmission Line Outages in Lithuania
Previous Article in Journal
An Innovative TOPSIS–Mahalanobis Distance Approach to Comprehensive Spatial Prioritization Based on Multi-Dimensional Drought Indicators
Previous Article in Special Issue
River Flashiness in Great Britain: A Spatio-Temporal Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hydrological Response to Climate Change: McGAN for Multi-Site Scenario Weather Series Generation and LSTM for Streamflow Modeling

1
Tianjin Key Laboratory of Water Resources and Environment, Tianjin Normal University, Tianjin 300387, China
2
Hangzhou Jiayuan Environmental Technology Co., Ltd., Hangzhou 310005, China
*
Author to whom correspondence should be addressed.
Atmosphere 2024, 15(11), 1348; https://doi.org/10.3390/atmos15111348
Submission received: 3 October 2024 / Revised: 3 November 2024 / Accepted: 7 November 2024 / Published: 9 November 2024
(This article belongs to the Special Issue Impacts of Climate Change on Basin Hydrology)

Abstract

:
This study focuses on the impacts of climate change on hydrological processes in watersheds and proposes an integrated approach combining a weather generator with a multi-site conditional generative adversarial network (McGAN) model. The weather generator incorporates ensemble GCM predictions to generate regional average synthetic weather series, while McGAN transforms these regional averages into spatially consistent multi-site data. By addressing the spatial consistency problem in generating multi-site synthetic weather series, this approach tackles a key challenge in site-scale climate change impact assessment. Applied to the Jinghe River Basin in west-central China, the approach generated synthetic daily temperature and precipitation data for four stations under different shared socioeconomic pathways (SSP1-26, SSP2-45, SSP3-70, SSP5-85) up to 2100. These data were then used with a long short-term memory (LSTM) network, trained on historical data, to simulate daily river flow from 2021 to 2100. The results show that (1) the approach effectively addresses the spatial correlation problem in multi-site weather data generation; (2) future climate change is likely to increase river flow, particularly under high-emission scenarios; and (3) while the frequency of extreme events may increase, proactive climate policies can mitigate flood and drought risks. This approach offers a new tool for hydrologic–climatic impact assessment in climate change studies.

1. Introduction

The effects of global climate change on the water cycle are of significance [1,2]. Understanding how these changes impact hydrological processes is a crucial issue in current hydrometeorological research [3,4,5]. As the fundamental component of the hydrological cycle, watershed processes are particularly sensitive to climate change [6,7,8]. Quantitatively assessing the impacts of future climate change on watershed streamflow is crucial [9,10]. This not only helps us understand possible changes in the global water cycle but also provides support information for the development of regional adaptive water management strategies [11,12]. The main approach to this assessment involves constructing relationships between meteorological data and streamflow and then estimating watershed hydrological responses to future climate change scenarios [13,14,15]. The techniques for constructing these relationships vary, including physically based distributed hydrological models [16,17], data-driven statistical models [18,19], and semi-empirical models that combine both approaches [20]. Recently, machine learning techniques, particularly deep learning approaches such as recurrent neural networks (RNNs), convolutional neural networks (CNNs), and long short-term memory networks (LSTMs), have shown potential and advantages in modeling watershed streamflow using time series weather data [21,22,23]. However, regardless of the method used, a key challenge in assessing future climate change responses is obtaining projected future climate data that can meet the requirements of the relational model assessment.
In climate change impact studies, global climate models (GCMs) are the primary tools for modeling future climates under different scenarios [24,25]. The latest shared socio-economic pathways (SSPs) scenarios presented by the Coupled Model Intercomparison Program Phase 6 (CMIP6) provide a common framework for inter-country scientists to use different GCMs and/or their outputs in an equivalent way for climate change studies [26]. However, GCM data have several major limitations. GCM outputs are often systematically biased and require correction. They also have very low spatial resolution (typically ~100-km), which is much lower than the accuracy required for watershed-scale or site-scale studies [27]. Additionally, simulations from different GCMs are often significantly different, and using a single GCM output increases the uncertainty of climate change impact assessments. To overcome these limitations and obtain highly accurate time-series data with short time steps, researchers have developed various bias correction and downscaling methods [28,29]. Kim et al. [30] proposed and implemented a new multivariate bias correction method to improve the accuracy of regional climate models. Islam et al. [31] developed a simple polynomial bias correction method for correcting GCM-derived forecast runoff biases, further demonstrating the importance of bias correction for hydrologic modeling. Elisabeth Vogel et al. [32] emphasized the importance of downscaling and bias correction methods in improving the spatial and temporal resolution of climate model outputs and removing systematic biases. These improvements are essential for accurately assessing the impacts of climate change on hydrological processes. Wamahiu et al. [33] highlighted the need for bias correction of GCMs prior to dynamic downscaling processes, which can significantly impact the simulation of precipitation and temperature for current and future climates. The study by David E. Robertson [34] examines how precipitation data output from GCM downscaling can be processed to improve hydrologic simulation results, focusing on the importance of data processing at different time scales. Preprocessing GCM data is common and necessary based on specific problems and application requirements prior to use, with weather generators being particularly effective in addressing time series related issues.
Weather generators (WGs), as commonly used statistical downscaling tools, are capable of generating long series of synthetic meteorological data, which have significant advantages especially for the analysis of small-area or site-scale future meteorological data problems. Sylvie Parey et al. [35] proposed a bivariate generator of daily temperature and rainfall based on Hidden Markov Models, which can be used not only for analyzing current climatic conditions but also for predicting future climates, providing a new way of thinking about the generation of multivariate and multisite weather series. Li et al. [36] evaluated the application of the multisite weather generator MulGETS in the Yangtze River Basin, and the results showed that the model can effectively generate spatially correlated precipitation series while maintaining the spatial and temporal distribution characteristics of precipitation. This provides a new tool for precipitation simulation and climate change impact assessment in large-scale basins. Evin et al. [37] proposed an important extension to the Wilks multisite daily precipitation model, which integrates the heavy-tailed distribution, spatial tail dependence, and temporal dependence using decomposition methods, effectively reproducing the statistical characteristics of extreme rare events across various temporal and spatial scales and demonstrating a clear advantage in simulating and assessing large-area extreme precipitation events. However, existing downscaling methods still face challenges in dealing with multi-site data, especially in modeling extreme events and maintaining consistency of precipitation occurrence among sites. Additionally, effectively integrating outputs from multiple GCMs while maintaining the spatial consistency of the downscaled data remains a critical issue [38].
In recent years, artificial intelligence and deep learning techniques have shown great potential in reanalysis, downscaling, and the generation of synthetic meteorological data [39,40]. Among them, generative adversarial networks (GANs) are increasingly being applied to this field due to their advantages in generating complex distributed data [41,42]. Sha et al. [43] proposed a conditional deep convolution generative adversarial nets (cDCGAN)-based downscaling method that employs image super-resolution techniques to generate high-resolution daily temperature data from low-resolution GCM output. While this method performs well in maintaining spatial distribution features, it primarily targets univariate and raster data. Hong Kang Ji et al. [44] highlight that GANs are effective at capturing the spatial correlations of extreme events and accurately reflecting the original data’s spatial and temporal distributions, rather than simply replicating their statistical features. This ability makes GANs particularly effective in modeling extreme rainfall events and reflecting real spatial correlation patterns. The GAN-based weather generators utilize semi-supervised learning mechanisms to better capture potential spatial distribution features of historical data, generating new data that are similar to, but not identical to, historical data, avoiding the rigidity of traditional regression methods. Meanwhile, by introducing noise inputs, GANs can effectively simulate the inherent uncertainty of meteorological data, especially excelling in reflecting the stochastic nature of precipitation distributions, thus generating simulation results more aligned with the characteristics of real meteorological distributions. However, previous studies have mainly focused on applying GANs to single-site weather generation or spatial downscaling of gridded data, while the application of GANs to multi-site weather generation, especially maintaining both temporal and spatial consistency among multiple stations, remains largely unexplored.
Accordingly, this study proposes an innovative technical framework for assessing the impacts of climate change on watershed hydrological processes. The framework consists of three key components: (1) a future scenario weather generator (FS-WG), established and validated with historical site observations, that adjusts probability density function parameters based on monthly-scale climate change information from multiple GCM ensemble predictions to generate regional average synthetic daily weather series; (2) a multi-site conditional generative adversarial network (McGAN) model that captures spatial correlation characteristics from historical data to transform regional average weather data into multi-site daily scale data while maintaining spatial consistency; and (3) a long short-term memory (LSTM) network model for establishing nonlinear relationships between multi-site meteorological data and watershed outlet streamflow. Using the Jinghe River Watershed in west-central China as a case study, this framework evaluates the impacts of future climate change on watershed hydrological processes under four shared socioeconomic pathways (SSPs) scenarios. Unlike traditional spatial downscaling methods, this approach focuses on generating site-scale weather sequences with temporal continuity and spatial consistency, particularly suitable for watershed hydrological impact assessments that require continuous daily-scale station data. The overall technical framework is illustrated in Figure 1.

2. Materials and Methods

2.1. Study Area and Data

This study is carried out in the Jing River watershed (Figure 2), located in central-western China, at the junction of Shaanxi, Gansu, and Ningxia provinces. The Jing River, a secondary tributary of the Yellow River, joins the Wei River before eventually flowing into the Yellow River. In this study, the Tao-yuan hydrological station, situated upstream of the confluence of the Jing and Wei Rivers, is used as the watershed outlet. The total catchment area is 45,442 km². The average elevation of the Jing River Basin is 1387 m, with the highest point reaching 2906 m and the lowest at 352 m above sea level. Cropland and grassland are the primary land use types within the watershed, with forested areas dominating in the headwater regions. There are four national meteorological stations within the watershed, with detailed information provided in Table 1. The average annual temperature is 9.2 °C, and the average annual precipitation is 524.5 mm. Details as to the data sources are provided in Table 2.

2.2. Hydrological Long Short-Term Memory (LSTM) Design

In this study, a long short-term memory (LSTM) network model was developed to simulate daily streamflow at the watershed outlet. LSTM networks are particularly effective for processing time series data, making them suitable for capturing the relationships between meteorological conditions and watershed discharge. The model uses daily precipitation and temperature data from four meteorological stations within the watershed to predict the discharge at the outlet. The training and testing datasets were created using daily records from 2006 to 2020, including both meteorological data and discharge measurements. All data were normalized to a range of 0 to 1 using the min–max scaler method to ensure numerical stability during training. A 30-day window period was established, where meteorological data within this window serve as input features and the discharge at the end of the window serves as the prediction target, enabling the model to learn streamflow patterns based on recent meteorological variations.
The model architecture consists of four LSTM layers, each with 64 hidden units, and incorporates dropout to prevent overfitting. A fully connected layer follows the LSTM layers to produce the final predictions. This design captures both short-term and long-term dependencies in the time series data. The model was trained using mean squared error (MSE) as the loss function and the Adam optimizer, with L2 regularization implemented for better generalization. Following the temporal nature of the data, the dataset was chronologically divided, with the first 80% used for training and the remaining 20% for testing. This split ensures that only past data are used to predict future streamflow. Model performance was evaluated using both the mean absolute percentage error (MAPE) and Nash–Sutcliffe efficiency coefficient (NSE). The trained LSTM model provides an effective tool for assessing climate change impacts on watershed hydrology. By inputting synthetic weather data generated under different climate scenarios, the model can simulate potential future streamflow patterns. The methods for generating these synthetic weather series are detailed in Section 2.3 and Section 2.4.

2.3. Future Scenario Weather Generator (FS-WG) Design

The future scenario weather generator (FS-WG) developed in this study treats the target study area as a whole, representing the average conditions of the four meteorological stations within the area. The FS-WG model distinguishes between dry days (no precipitation) and wet days (with precipitation) to generate two key meteorological variables: daily average temperature and daily precipitation. Assuming regional meteorological characteristics remain consistent within each month, separate model parameter sets are established for each of the 12 months. A two-state first-order Markov chain model is employed to generate sequences of dry and wet weather status.
The model parameters are calibrated using historical data from 1957 to 2020. Specifically, if the daily average precipitation from the four meteorological stations on any given day is greater than zero (i.e., if at least one station records precipitation), that day is classified as wet. Two sets of transition probabilities are calculated for each month: dry-to-wet and wet-to-dry. The first day’s status is determined by sampling based on historical precipitation probability, with subsequent days determined using these transition probabilities. The FS-WG model generates daily precipitation and temperature data separately for dry and wet days using probability density functions established through kernel density estimation (KDE) with a Gaussian distribution kernel. On dry days, precipitation is set to zero and temperature is sampled from the dry-day temperature distribution. On wet days, both variables are sampled from their respective wet-day distributions.
For future climate change scenarios, this study utilizes four shared socioeconomic pathways (SSPs: 1-26, 2-45, 3-70, and 5-85) proposed by CMIP6, focusing on four time periods (2030s, 2050s, 2070s, and 2090s). To minimize uncertainties, an ensemble of GCM results was employed using the WorldClim version 2.1 database [45]. This dataset offers global raster data of monthly temperature and precipitation projections for the four SSPs across the four future time periods, as predicted by different GCMs. By extracting the raster data for the target study area, monthly change rates in precipitation and temperature were calculated by comparing future projections with baseline data for each scenario and period. These rates were used to adjust the KDE probability density functions for generating scenario-based synthetic weather series. The FS-WG outputs, representing regional averages, are further processed by the McGAN model to generate spatially consistent multi-site data, which then serve as inputs for the LSTM model to assess climate change impacts on streamflow.
This study initially identified 11 GCMs from WorldClim version 2.1 that provide complete data sets for each of the four SSPs (SSP1-26, 2-45, 3-70, 5-85) and corresponding time periods (2030s, 2050s, 2070s, 2090s). These models include ACCESS-CM2, BCC-CSM2-MR, CMCC-ESM2, EC-Earth3-Veg, GISS-E2-1-G, INM-CM5-0, IPSL-CM6A-LR, MIROC6, MPI-ESM1-2-HR, MRI-ESM2-0, and UKESM1-0-LL. To evaluate their performance in representing local climate conditions, we used the original CMIP6 outputs (100 km × 100 km resolution) from these GCMs. For each GCM, historical monthly precipitation and temperature data at the four meteorological station locations were extracted from their CMIP6 simulations for the period 1957–2015. These values were compared with observed historical data from the four meteorological stations to estimate the performance of each GCM in simulating historical climate conditions for the study area. Historical accuracy was estimated using metrics such as mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), the coefficient of determination (R²), and the Nash–Sutcliffe efficiency coefficient (NSE). K-means clustering analysis was performed based on the suite of estimation metrics, and the optimal number of clusters was determined using the Davies–Bouldin index (DBI). The GCMs in the cluster with the highest historical accuracy were selected as the final models for assessing future climate changes.
For these selected GCMs, we then used their WorldClim data (18.5 × 18.5 km resolution), which provides downscaled and bias-corrected monthly values for both historical period and future periods under different SSP scenarios. Monthly change rates were calculated by comparing the historical data with future projections for each combination of time periods (2030s, 2050s, 2070s, 2090s) and SSP scenarios (SSP1-26, 2-45, 3-70, 5-85). Using consistent resolution data for both historical and future periods avoided potential biases from resolution differences. The ensemble average of these change rates across the selected GCMs was used as the adjustment factor for the KDE in our weather generator, with different adjustment factors applied for each SSP scenario and future period. This ensemble approach helps reduce uncertainties associated with individual GCM projections while maintaining the temporal and spatial consistency of climate change signals across different scenarios. Using these updated parameters, the FS-WG model, which was originally developed and validated with historical data, was applied to generate synthetic daily weather series for each combination of SSP scenarios and future time periods (2030s, 2050s, 2070s, and 2090s). For each scenario-time period combination, a 20-year daily temperature and precipitation series was generated. Since the output of the FS-WG model represents the average values for the four meteorological stations in the study area, a multi-conditional generative adversarial network (McGANs) model was further employed to generate daily temperature and precipitation data for each of the four stations, based on the daily average temperature and precipitation values. The design and training process of the McGANs model are detailed in Section 2.4.

2.4. Multi-Conditional Generative Adversarial Networks (McGANs) Design

In this study, a multi-conditional generative adversarial networks (McGANs) model was developed to generate sets of temperature and precipitation data for four stations within the study area based on the regional average temperature and precipitation. The model architecture is specifically designed for transforming regional average values into 2 × 2 grid data representing meteorological values for four target stations (Figure 3).
The generator employs a deconvolutional–convolutional neural network architecture. As shown in Figure 3a, an initial random 512 × 1 × 1 noise vector is transformed into a 128 × 16 × 16 noise tensor through four layers of deconvolution. Concurrently, a 1 × 2 × 2 conditional tensor is constructed where all four values are identical and equal to the target regional average. This conditional tensor undergoes three layers of deconvolution to form a 128 × 16 × 16 conditional tensor. The noise tensor and conditional tensor are merged along their dimensions to form a 256 × 16 × 16 coupled tensor, which is further processed through three convolutional layers to produce a 1 × 2 × 2 output tensor. Each layer is followed by BatchNorm2d and uses LeakyReLU activation function, while the final output layer employs a Sigmoid activation function to ensure the outputs are within the range of 0–1. A bias correction ensures the mean of the generated 2 × 2 grid equals the input conditional number, while the spatial distribution pattern is determined by the network based on learned historical characteristics.
The discriminator is constructed based on a conditional convolutional neural network architecture. As shown in Figure 3b, both the input 2 × 2 grid tensor and its corresponding condition tensor are processed through convolutional layers, each yielding intermediate tensors of size 32 × 2 × 2. These tensors are then concatenated along the first dimension to form a 64 × 2 × 2 tensor, which is further processed through two additional convolutional layers to produce a final 1 × 1 × 1 tensor output. Each convolutional layer is followed by BatchNorm2d and uses LeakyReLU activation function, with the final layer employing a Sigmoid activation function to ensure the output value is constrained between 0 and 1, representing the probability that the discriminator estimates the input tensor as real.
The training process follows the standard GAN framework with several stability enhancements: (1) initial 100 epochs for discriminator-only training; (2) dynamic learning rate adjustment for the generator; and (3) early stopping mechanism based on Nash equilibrium state detection. Training stops when the discriminator’s accuracy for both real and fake samples stabilizes around 0.5 for 50 continuous epochs after the 500th epoch.
More details about the model architectures and implementation procedures described in Section 2.2, Section 2.3 and Section 2.4, including LSTM configuration, FS-WG parameter calibration processes, and McGAN training strategies, along with all model codes used in this study, are available in the Supplementary Materials.

3. Results

3.1. LSTM Results for Hydrological Modelling

Using an LSTM model, the daily river streamflow at the watershed outlet was simulated based on 30 days of prior daily temperature and precipitation data from four meteorological stations, as illustrated in Figure 4. The results indicate that the mean absolute percentage error (MAPE) between the simulated and observed streamflow during the test period is 81.10%, while the Nash–Sutcliffe efficiency (NSE) is 0.79. The test dataset was entirely independent from the training dataset, having never been used during the LSTM training process. This suggests that the LSTM model demonstrates good extrapolation capabilities and is well-trained.
The main hyperparameters used for training the LSTM included setting the model with four LSTM layers, a hidden layer dimension of 64, a learning rate of 0.0005, and a dropout ratio of 0.5. Using the LSTM model developed in this study, it is possible to analyze daily river streamflow at the watershed outlet based on daily weather data from the four stations. By using synthetic daily weather data series that represent future climate change scenarios as input into the LSTM model, future river streamflow can be modeled and predicted, allowing for an assessment of the potential impacts of future climate change on the watershed’s hydrological processes. Detailed discussions about the results and reliability of the LSTM model can be found in Section 4.1.

3.2. Future Scenario Weather Generation

3.2.1. GCM Selection and Projected Future Climate Change

The simulation accuracy of the 11 alternative GCMs for historical monthly precipitation and average daily temperature in the target study area is shown in Table 3. Based on the accuracy indicators, a K-means clustering analysis was performed, and the Davies–Bouldin index (DBI) was used to determine the optimal number of clusters. The DBI assesses the quality of clustering by comparing the compactness within each cluster and the separation between clusters; a lower DBI value indicates better clustering effectiveness. The DBI results for the clustering analysis of the historical data fitting accuracy of the 11 GCMs are shown in Figure 5. The analysis indicated that clustering into six groups was a locally optimal choice, with the effectiveness improving with more categories. When the number of categories reached 10, the DBI value was at its lowest, suggesting optimal internal compactness and separation between groups at this classification. However, since there are only 11 datasets in total in this study, dividing into 10 groups clearly contradicts the purpose of conducting clustering analysis and does not meet the application demands. Therefore, this study adopted a locally lower DBI value and performed a K-means clustering analysis of the 11 GCMs into six groups, selecting the group with the highest result accuracy to be the chosen GCMs.
Six GCMs that have historically performed well in the study area were selected, including ACCESS-CM2, EC-Earth3-Veg, IPSL-CM6A-LR, MPI-ESM1-2-HR, MRI-ESM2-0, and UKESM1-0-LL. These GCMs have a coefficient of determination (R2) above 0.84 and a Nash–Sutcliffe efficiency coefficient (NSE) above 0.69 for historic data, indicating that their simulation results significantly outperform the average of actual measurements, proving to be both positive and effective. The extracted data include monthly precipitation and average daily temperature from these six GCMs across four future time periods (2030s, 2050s, 2070s, and 2090s) and four climate scenarios (SSP 1-26, SSP 2-45, SSP 3-70, and SSP 5-85). These values are compared with current baseline levels, and the mean of the change rates from the six GCMs is used as the ensemble outputs, representing projected climate changes. Since the future GCM data provided by WorldClim includes only daily maximum and minimum temperatures, their average is used to compute the daily average temperature to meet the meteorological data requirements necessary for LSTM-based streamflow modeling. These projected climate change data are then integrated into the FS-WG model to generate synthetic daily temperature and precipitation data series that reflect the characteristics of various time periods and scenarios projected for the study area in the future. Detailed results of this integration are described in the following Section 3.2.2.

3.2.2. FS-WG for Reginal Weather Generation

The FS-WG model developed in this study is calibrated based on averages of the historical data from four meteorological stations in the study area from 1957 to 2020, with dry day ratios and dry-to-wet and wet-to-dry transition probabilities as shown in Table 4. Here, “Probability Dry” represents the probability that any given day in a month is a dry day (no precipitation), which is used to generate the dry or wet status for the first day of the month. “Probability Wet to Dry” and “Probability Dry to Wet” represent the probabilities of transitioning from a wet day to a dry day and from a dry day to a wet day, respectively, based on which the daily dry or wet status throughout the month is generated. After generating the daily dry/wet statuses, samples are taken from the kernel density functions corresponding to dry and wet states to obtain daily precipitation and average daily temperatures. The kernel density functions for monthly precipitation amounts and average temperatures on dry and wet days are derived from historical data analysis.
Using the calibrated FS-WG model, a 64-year series of synthetic daily weather data was generated and compared with the historical weather records from weather stations in the study area. This comparison was achieved to assess whether the FS-WG model could produce synthetic weather data series with statistical characteristics similar to those of the historical data in the target region. The results, as shown in Figure 6, indicate that the synthetic daily temperature series generated by the FS-WG closely matches the actual monthly mean temperatures, with both the correlation coefficient and the Nash–Sutcliffe efficiency coefficient reaching as high as 0.9999. The monthly variance of the synthetic temperature series is generally higher than that of the actual series, indicating a slightly greater dispersion in the synthetic data, yet the similarity between them is still high, with a correlation coefficient of 0.9988 and a Nash–Sutcliffe efficiency coefficient of 0.9634. The synthetic daily precipitation series generated by the FS-WG showed relatively lower accuracy compared to the temperature series. The correlation coefficient between the synthetic and actual monthly mean precipitation was 0.9989, and the Nash–Sutcliffe efficiency coefficient was 0.9837. The correlation coefficient for the monthly variance between the synthetic and actual precipitation series was 0.9980, with a Nash–Sutcliffe efficiency coefficient of 0.9858. These results demonstrate that the FS-WG model is capable of generating synthetic daily weather data series that have similar statistical characteristics to those observed in the study area.
Using the FS-WG model calibrated and validated with historical observations (1957–2020), synthetic daily weather data series for the years 2021–2100 were generated to serve as baseline data. This baseline scenario maintains the original model parameters without any climate change adjustments, representing a hypothetical future where climate conditions remain similar to the historical period. In contrast, for climate change scenarios, the ensemble mean changes in monthly precipitation and average daily temperature relative to current levels, based on the six selected GCMs across four future time periods and four scenarios, were calculated to characterize future climate changes, as shown in Table 5. For each SSP scenario, changes in monthly temperatures and precipitation rates for the four periods are used as scenario factors to adjust the probability density functions of the FS-WG model. This modification enables the generation of synthetic daily weather data series for the target study area from 2021 to 2100 under four SSPs climate change scenarios. These scenario-based data, along with the baseline scenario data, are then used as inputs for the McGANs model, which generates synthetic weather data series for the four weather stations in the study area under both baseline and future climate change scenarios. More details about the McGANs model application are described in Section 3.2.3.

3.2.3. McGANs for Multi-Site Weather Generation

Separate McGANs models were trained for temperature and precipitation, aiming to reach a Nash equilibrium where the loss functions converge around 0.5. This convergence point marks the conclusion of the training process, as illustrated in Figure 7. The McGANs model for temperature required 1005 epochs to converge, while the precipitation model converged in 830 epochs. More stringent early stopping conditions were applied to temperature training due to the more regular spatial distribution of temperatures across the four sites, which facilitated convergence. In contrast, precipitation’s spatial distribution exhibits greater randomness, leading to less stability in the loss functions during training. Excessive iterations could result in gradient explosion or divergence issues in the precipitation generator. Consequently, more robust generator hyperparameters and relatively simpler discriminator hyperparameters were set for precipitation generator to achieve a balanced architecture and ensure successful GAN training.
The main hyperparameters for both the generator and discriminator are listed in Table 6. Using the trained McGANs model’s generator in inference mode, the output from the FS-WG model is input as conditional data to generate synthetic daily temperature and precipitation series for the four stations over the next 80 years under different SSPs climate change scenarios. These synthetic data series can then serve as input for the LSTM model to simulate daily streamflow at the watershed outlet under various scenarios, thereby assessing the impact of future climate changes on the watershed’s hydrological processes. Detailed descriptions of these simulations are provided in Section 3.3.

3.3. Future Streamflow Scenario Analysis

Using the synthetic daily weather data for the four stations generated by the McGANs model under various future climate change scenarios, the LSTM model was used to simulate daily streamflow under different SSPs scenarios, as illustrated in Figure 8. Overall, compared to the current baseline level, future climate changes are projected to have a positive impact on watershed streamflow yields. Under the SSP 1-26 and SSP 2-45 scenarios, the increase in streamflow is relatively modest, while under the SSP 3-70 and SSP 5-85 scenarios, the increase in streamflow is more significant. This suggests that proactive climate policies can positively influence the mitigation of climate change impacts on watershed hydrological processes and reduce variability in streamflow. Detailed discussions of the impact of future climate changes on streamflow can be found in Section 4.2.

4. Discussion

4.1. Models Accuracy and Reliability

This study applied a long short-term memory (LSTM) model to simulate watershed streamflow, demonstrating the effectiveness of using daily weather data from multiple sites to model river flow at the watershed outlet. According to the model results, the Nash–Sutcliffe efficiency (NSE) value of 0.79 indicates that the model accurately simulates streamflow dynamics and captures the overall patterns effectively. Compared to traditional hydrological models, the LSTM model proposed in this study shows unique advantages, especially in handling long-term dependencies and nonlinear time series data. The mean absolute percentage error (MAPE) of 81.10% during the validation period requires careful interpretation. This seemingly high value is primarily due to the nature of the MAPE calculation itself, where even small absolute errors during low flow periods can result in large percentage errors. For example, an absolute error of 1 m3/s would represent a 50% error when the observed flow is 2 m3/s, but only a 1% error when the flow is 100 m3/s. Additionally, this characteristic may be related to our choice of loss function during model training. When using mean squared error as the loss function, the squared differences between simulated and observed values during high flow periods may disproportionately influence the optimization process, as the same relative errors at higher flow values produce larger absolute differences and thus larger contributions to the loss function. This mathematical property of the loss function might lead to an optimization bias towards high flow periods. However, this is common in hydrological modeling and does not necessarily indicate poor model performance during low flow periods. The absolute errors remain within acceptable ranges across all flow conditions, and importantly, the predictions for low and medium flow periods do not show order-of-magnitude discrepancies, making the model suitable for analyzing future climate change impacts during dry seasons and assessing drought risks. The model shows consistent performance across different flow magnitudes, with particularly good results in capturing the dynamics of both high and low flows. This comprehensive performance is validated by comparing our results with related daily-scale flow simulation studies [46,47,48], demonstrating that our model achieves accuracy levels suitable for climate change scenario analysis. The LSTM model’s ability to capture both the overall patterns and specific flow characteristics makes it a reliable tool for assessing the impact of future climate changes on watershed hydrology.
The results of the FS-WG weather generator are comparable to those of similar weather generator models based on the concept of statistical downscaling [49]. For this study in particular, the averages of generated daily temperature are almost identical to the observed values, demonstrating high accuracy, a characteristic also observed in other weather generator models [50]. This similarity may be attributed to the relative regularity of temperature data, and because this study models different months separately, resulting in high consistency in monthly temperatures, which enhances the outcomes. The standard deviation of the generated daily temperature dataset is greater than that of the observed dataset, indicating that the data variability produced by the weather generator is higher, probably due to the use of the KDE method for sampling temperature values. The averages of daily precipitation data generated by the FS-WG are slightly higher than the observed data, with an overestimation more noticeable during the rainy season from May to July. This may be because the KDE sampling for wet day precipitation is set with a minimum limit of zero to prevent negative values, but without a maximum limit control. Thus, removing low negative values without similarly eliminating high values can shift the sampled data distribution towards higher values relative to the original distribution. However, overall, the consistency between the FS-WG weather generator’s simulation results and the original data is acceptable. In practical applications, the FS-WG is employed to generate synthetic weather data series for both an unchanged baseline scenario and various climate change scenarios. These synthetic series are then used to model future streamflow to estimate impacts of climate change on watershed hydrological processes. This approach effectively mitigates systematic errors introduced by the FS-WG, thereby enhancing the reliability of comparative scenario analysis.
In this study, the McGAN model was used to generate temperature and precipitation data for four stations based on regional average values as conditional numbers. Since the generator model was obtained through semi-supervised training, the results essentially represent one possible outcome that maintains similar spatial distribution characteristics to the historical temperature and precipitation distributions at these four stations. The output of the McGAN model is not a regression to a specific set of values and therefore cannot be evaluated using common metrics such as correlation coefficients and various errors. The primary purpose of using the McGAN model in this study was to generate input data for the LSTM model to simulate streamflow. Consequently, a streamflow prediction comparison experiment was designed to test the suitability of McGAN. The actual measurement group used real daily temperature and precipitation data from the four stations from 1957 to 2020 as input data for the LSTM model, simulating daily streamflow. In contrast, the simulation group used the average of real daily temperature and precipitation data from the four stations from 2006 to 2020 as conditional inputs for the McGAN model, generating synthetic daily temperature and precipitation data for these stations from 2006 to 2020, which was then used as input for the LSTM model to simulate daily streamflow. The consistency of streamflow results obtained using output from the McGAN model and original historical measured data as inputs for the LSTM model is shown in Figure 9. It was found that the Nash–Sutcliffe efficiency coefficient for the simulated and actual measurement groups was 0.92, and the linear correlation coefficient was 0.94. This indicates that the streamflow modeled using the four stations synthetic data generated by the McGAN model for LSTM model inputs are highly similar to those obtained using real weather data of the four stations as LSTM inputs. Thus, it can be concluded that using McGAN-generated data for four stations, based on regional average temperature and precipitation for LSTM input to simulate watershed streamflow, yields reliable results.
In summary, the results of the LSTM model, FS-WG, and McGAN model used in this study all fall within an acceptable range of accuracy. The modeling results obtained from these models are sufficient for estimating the impacts of future climate change on watershed streamflow, and the related response assessments are reliable. The feature analysis and discussions of climate change impacts on watershed streamflow are detailed in the following Section 4.2.

4.2. Impacts of Climate Changes on Streamflow

From 2021 to 2100, the average annual streamflow under various climate scenarios for each 20-year period (2030s, 2050s, 2070s, and 2090s) is shown in Figure 10. A notable trend of generally increasing streamflow is observed from the 2030s to the 2090s, suggesting that climate change may alter the hydrological cycle, potentially leading to increased streamflow yields. Compared to the baseline scenario, most climate scenarios show higher annual streamflow, particularly under SSP 3-70 and SSP 5-85 scenarios. This may indicate that high greenhouse gas emission scenarios could result in more frequent precipitation events in the future.
SSP 3-70 and SSP 5-85 consistently show higher streamflow than other scenarios across all periods, particularly in the 2070s and 2090s, indicating that under these high-emission scenarios, more precipitation events and increased flood risks may occur. Specifically, the 2030s reflect the changes in the near term, with streamflow slightly increasing across all scenarios relative to current levels. This trend may be attributed to the gradual rise in precipitation, coupled with the fact that temperature increases have not yet become pronounced. By the 2050s, streamflow further increases under all scenarios, especially in SSP 3-70 and SSP 5-85, where streamflows are significantly higher than those in the baseline and other lower emission scenarios (SSP 1-26 and SSP 2-45), indicating that the impacts of climate change are beginning to intensify, causing more precipitation and potential extreme hydrological events. In the 2070s, streamflow continues to rise across all scenarios, with notable increases in SSP 3-70 and SSP 5-85 compared to the 2050s, reflecting the long-term effects of climate change on the water cycle, including more frequent and intense precipitation events. By the end of the century, streamflow peaks in all scenarios, particularly under high-emission scenarios, indicating that the impacts of climate change on streamflow have reached their peak, potentially leading to more severe flood risks, especially under scenarios where greenhouse gas emissions are uncontrolled.
Average monthly streamflow under different climate change scenarios for each month is calculated as shown in Figure 11. The analysis indicates that all scenarios reach peak streamflow between May and September, suggesting that these months constitute the primary rainy season for the region. During these months, the SSP 3-70 and SSP 5-85 scenarios exhibit higher peak precipitation, which may be attributed to the more intense rainfall events projected under these more aggressive climate change scenarios. The baseline scenario generally shows the lowest streamflow, especially during the dry seasons (January to April and October to December), suggesting that future climate change scenarios could potentially increase streamflow, primarily concentrated in the rainy season from May to September. The SSP 1-26 and SSP 2-45 scenarios show mild increases in monthly streamflow compared to the baseline, reflecting the controlled greenhouse gas emissions under positive climate policies. In contrast, the SSP 3-70 and SSP 5-85 scenarios show substantially higher streamflow in almost all months, especially during the rainy season, reflecting the significant impact of uncontrolled greenhouse gas emissions on precipitation-streamflow patterns.
By identifying events of extreme high and low streamflow under different climate scenarios, and analyzing the frequency, magnitude, and temporal distribution of these extreme events, the study further discusses the differences in flood and drought risks under various future climate change scenarios. The 95th percentile is defined as the threshold for identifying events with a higher risk of flooding, and the 5th percentile as the threshold for identifying events with a higher risk of drought. The frequency of extreme events over the next 80 years under each scenario is shown in Figure 12. The results show that there are significant differences in the frequency of extreme events across scenarios. SSP 3-70 and SSP 5-85 exhibit a notably higher number of extreme high flow events compared to other scenarios, indicating a potentially higher flood risk under these scenarios. Conversely, the baseline scenario shows the highest number of extreme low flow events, suggesting that drought risks are more pronounced under current climate conditions, implying that future climate changes may mitigate drought risks. However, it is important to notice that the SSP 5-85 scenario also shows a higher number of extreme low flow events compared to other climate change scenarios, although less than the baseline level. This suggests that under uncontrolled future climate extremes, there might be a scenario where both flood and drought risks are elevated. This could be related to the coupling increase in precipitation and temperature under high greenhouse gas emission conditions: high precipitation may lead to an increased risk of floods during intense short-duration rainfall, while high temperatures may enhance evapotranspiration during dry seasons, elevating drought risks. Relatively, under proactive climate policies represented by SSP 1-26, a balance between flood and drought risks can be achieved, resulting in the lowest risk of extreme flow events.

4.3. Features and Limitations of the Study

The FS-WG-McGANs technique proposed in this study represents a novel approach for generating synthetic weather data series at multiple locations. This technique ensures that the generated weather data retain statistical characteristics similar to historical observations and simultaneously produces multi-site artificial weather data series with related attributes. The conditional generative adversarial network architecture guarantees that the data generated not only follow the overall regional averages (conditional numbers) but also reflect the spatial correlations (through the deconvolutional-convolutional neural network architecture) and randomness (through random noise inputs) between multiple sites. Compared to traditional single-location weather generators that iterate multiple times to produce results, the FS-WG-McGANs model provides more consistent multi-site data series on a daily scale, making it better suited for input in hydrological scenario analysis. Moreover, the inherent properties of generative models that blend rationale with randomness allow for a better representation of potential extreme meteorological conditions, thereby more comprehensively reflecting potential hydrological process impacts.
The limitations of this study primarily include the challenges in effectively using the FS-WG-McGANs model, the limited reliability of generated weather data, and uncertainties surrounding the accuracy of streamflow simulations produced by the LSTM model. First, training the McGANs model requires specific skills, especially for precipitation processes. This is partly due to the inherent limitations of the GAN method, which requires a delicate balance between generator and discriminator capabilities to prevent mode collapse, thus demanding considerable modeling expertise. Additionally, the spatial distribution of precipitation data is highly random. Temperature is mainly influenced by topography and underlying surface conditions, which are relatively fixed, making the spatial distribution of temperature data more regular and easier to model. In contrast, precipitation is mainly influenced by cloud cover, with less impact from topography and underlying conditions, making its spatial distribution less regular. This is also why we utilize generative models to address this issue, but it also presents challenges in training, with precipitation generators often achieving lower stability than temperature generators.
Secondly, FS-WG’s approach to generating future climate change scenario data is deficient in accounting for changes in precipitation patterns. The current assumption in FS-WG is that the mechanisms of regional dry–wet transitions remain constant, and future changes in monthly precipitation are mainly reflected in the daily precipitation amount on wet days. In reality, changes in monthly precipitation could result from maintaining the frequency of precipitation with more or less rainfall each time or from changes in the frequency of rainfall events (continuous rain or dry spells). The monthly-scale data from GCMs currently used lacks support for this analysis, and direct use of GCM daily data is challenging due to the low spatial resolution, which makes it difficult to reflect changes in precipitation frequency at the station scale. Future studies need to further refine the FS-WG model to address this issue.
Lastly, while this study focuses specifically on climate change impacts on streamflow, it is important to acknowledge that future hydrological changes will likely result from multiple interacting factors. These include land use changes, water resource management policies, human activities (such as water withdrawal and reservoir regulation), vegetation dynamics, and changes in watershed characteristics. The coupling effects between climate change and these factors present an important direction for future research. Additionally, the LSTM model used in this study is a classical deep neural network, and recent advances in machine learning offer potential improvements. Various novel technologies have been introduced to address meteorological–hydrological time series issues, including improved recurrent neural network models, combinations of RNNs and CNNs, transformer models focusing on spatiotemporal attention, hybrid models coupling data pattern decomposition with deep neural network predictions, and even applications of general large models on time series problems. These technologies have shown advantages in addressing specific challenges, and future research could explore state-of-the-art time series methods to establish more robust relationships between regional site-scale meteorological data and target river streamflow, potentially leading to more precise predictions and improved decision-making support for water resource management.

5. Conclusions

This study has developed an innovative LSTM-FS-WG-McGANs approach for assessing the impact of climate change on hydrological processes in medium to small-scale watersheds. The key conclusions are as follows:
(1)
The proposed FS-WG-McGANs method effectively generates synthetic multi-site weather data series that preserve both temporal and spatial characteristics of historical observations. The FS-WG model demonstrates high accuracy in reproducing historical patterns, with correlation coefficients and NSE values of 0.9999 for temperature and 0.9989 for precipitation. The McGAN model successfully maintains spatial consistency among stations, as validated by streamflow simulation comparison tests (NSE = 0.92; R = 0.94). This integrated approach enables the robust generation of daily temperature and precipitation data representing various climate change scenarios at multiple sites simultaneously based on ensemble outputs from multiple GCMs.
(2)
The LSTM model effectively simulates river discharge at the watershed outlet, achieving a Nash–Sutcliffe efficiency coefficient (NSE) of 0.79 on daily scale streamflow simulation during the validation period. While the model shows varying prediction accuracy across different flow magnitudes (MAPE = 81.10%), the absolute errors remain within acceptable ranges comparable to other daily-scale flow simulation studies. The model maintains reliable performance across both high and low flow conditions, making it suitable for comprehensive climate change impact assessment including both flood and drought risk analysis.
(3)
Under future climate scenarios, streamflow in the study area is projected to increase, particularly under high emission scenarios (SSP 3-70 and SSP 5-85). Quantitative analysis shows varying degrees of change across different SSPs: compared to the baseline scenario, SSP 1-26 and SSP 2-45 show mild increases in monthly streamflow, while SSP 3-70 and SSP 5-85 exhibit substantially higher flows, especially during the rainy season (May to September). The frequency of extreme hydrological events is likely to increase under high emission scenarios, with analysis indicating higher risks of both floods and droughts. However, proactive climate policies, such as those represented by the SSP 1-26 scenario, can effectively reduce future flood and drought risks in the Jing River watershed and decrease the probability of extreme streamflow events, serving as an effective approach to address the impacts of climate change.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/atmos15111348/s1, Models’ source codes.

Author Contributions

Conceptualization, Y.L. and J.S.; methodology, Y.C.; software, Y.C. and J.S.; validation, Y.L.; formal analysis, J.S.; investigation, Y.C.; resources, Y.L.; data curation, J.S.; writing—original draft preparation, J.S. and Y.C.; writing—review and editing, Y.L.; visualization, Y.C.; supervision, Y.L.; project administration, Y.L. and J.S.; funding acquisition, Y.L. and J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Xin’anjiang Reservoir Drinking Water Source Protection Research Project (grant number: 2024HW022).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original GCM data presented in the study are openly available in the ESGF dataset for CMIP6 at https://aims2.llnl.gov/search/cmip6/ (accessed on 1 October 2024). The original future downscaled GCM data presented in the study are openly available in WorldClim at https://worldclim.org (accessed on 1 October 2024). The original weather data presented in this study are available on request from the China Meteorological Data Service Center at https://data.cma.cn/en (accessed on 1 October 2024). The original hydrological data presented in this study are openly available in the books of Annual Hydrological Report P. R. China in the National Library of China.

Acknowledgments

The authors would like to thank the World Climate Research Programme, which, through its Working Group on Coupled Modeling, coordinated and promoted CMIP6. We extend our appreciation to the climate modeling groups for producing and making available their model output, the Earth System Grid Federation (ESGF) for archiving the data and providing access, and the multiple funding agencies that support CMIP6 and ESGF. We also thank the WorldClim data website (http://www.worldclim.org, accessed on 1 October 2024) for providing high-spatial-resolution global weather and climate data. Furthermore, we acknowledge the invaluable contribution of the China Meteorological Data Service Center (http://data.cma.cn, accessed on 1 October 2024) in supplying long-term historical weather data for the study area.

Conflicts of Interest

Ms. Yaxiu Liu is an employee of Hangzhou Jiayuan Environmental Technology Co., Ltd. The paper reflects the views of the scientists and not the company. The authors declare no conflicts of interest.

References

  1. Hogikyan, A.; Resplandy, L.; Liu, M.; Vecchi, G. Hydrological cycle amplification reshapes warming-driven oxygen loss in the Atlantic Ocean. Nat. Clim. Chang. 2024, 14, 82–90. [Google Scholar] [CrossRef]
  2. Blöschl, G.; Chaffe, P.L.B. Water scarcity is exacerbated in the south. Science 2023, 382, 512–513. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, H.; Liu, J.; Klaar, M.; Chen, A.; Gudmundsson, L.; Holden, J. Anthropogenic climate change has influenced global river flow seasonality. Science 2024, 383, 1009–1014. [Google Scholar] [CrossRef] [PubMed]
  4. Richardson, C.M.; Davis, K.L.; Ruiz-González, C.; Guimond, J.A.; Michael, H.A.; Paldor, A.; Moosdorf, N.; Paytan, A. The impacts of climate change on coastal groundwater. Nat. Rev. Earth Environ. 2024, 5, 100–119. [Google Scholar] [CrossRef]
  5. Davamani, V.; John, J.E.; Poornachandhra, C.; Gopalakrishnan, B.; Arulmani, S.; Parameswari, E.; Santhosh, A.; Srinivasulu, A.; Lal, A.; Naidu, R. A Critical Review of Climate Change Impacts on Groundwater Resources: A Focus on the Current Status, Future Possibilities, and Role of Simulation Models. Atmosphere 2024, 15, 122. [Google Scholar] [CrossRef]
  6. Pokhrel, Y.; Felfelani, F.; Satoh, Y.; Boulange, J.; Burek, P.; Gädeke, A.; Gerten, D.; Gosling, S.N.; Grillakis, M.; Gudmundsson, L.; et al. Global terrestrial water storage and drought severity under climate change. Nat. Clim. Chang. 2021, 11, 226–233. [Google Scholar] [CrossRef]
  7. Nourani, V.; Gholinia, A.; Abbaszadeh, P.; Ahmadi, R.; Ke, C.-Q. Unravelling the impact of climate change and anthropogenic activities on streamflow: The benefit of newly developed evapotranspiration data. Hydrol. Sci. J. 2024, 69, 2026–2043. [Google Scholar] [CrossRef]
  8. Nguyen, B.Q.; Van Binh, D.; Tran, T.-N.-D.; Kantoush, S.A.; Sumi, T. Response of streamflow and sediment variability to cascade dam development and climate change in the Sai Gon Dong Nai River basin. Clim. Dyn. 2024, 62, 7997–8017. [Google Scholar] [CrossRef]
  9. Gudmundsson, L.; Boulange, J.; Do, H.X.; Gosling, S.N.; Grillakis, M.G.; Koutroulis, A.G.; Leonard, M.; Liu, J.; Müller Schmied, H.; Papadimitriou, L.; et al. Globally observed trends in mean and extreme river flow attributed to climate change. Science 2021, 371, 1159–1162. [Google Scholar] [CrossRef]
  10. Buechel, M.; Slater, L.; Dadson, S. Broadleaf afforestation impacts on terrestrial hydrology insignificant compared to climate change in Great Britain. Hydrol. Earth Syst. Sci. 2024, 28, 2081–2105. [Google Scholar] [CrossRef]
  11. Yang, J.; Winrich, A.; Zhang, T.; Qiao, L.; Mattingly, C.; Zou, C. Responses of streamflow to forest expansion in a typical subhumid watershed under future climate conditions. J. Environ. Manag. 2024, 357, 120780. [Google Scholar] [CrossRef] [PubMed]
  12. Mashaly, A.F.; Fernald, A.G.; Geli, H.M.E.; Salim Bawazir, A.; Steiner, R.L. Dynamic simulation modeling for sustainable water management with climate change in a semi-arid environment. J. Hydrol. 2024, 644, 132126. [Google Scholar] [CrossRef]
  13. Zheng, H.; Chiew, F.H.S.; Post, D.A.; Robertson, D.E.; Charles, S.P.; Grose, M.R.; Potter, N.J. Projections of future streamflow for Australia informed by CMIP6 and previous generations of global climate models. J. Hydrol. 2024, 636, 131286. [Google Scholar] [CrossRef]
  14. Li, X.; Sha, J.; Zhao, Y.; Wang, Z.-L. Estimating the Responses of Hydrological and Sedimental Processes to Future Climate Change in Watersheds with Different Landscapes in the Yellow River Basin, China. Int. J. Environ. Res. Public Health 2019, 16, 4054. [Google Scholar] [CrossRef] [PubMed]
  15. Marras, P.A.; Lima, D.C.A.; Soares, P.M.M.; Cardoso, R.M.; Medas, D.; Dore, E.; De Giudici, G. Future precipitation in a Mediterranean island and streamflow changes for a small basin using EURO-CORDEX regional climate simulations and the SWAT model. J. Hydrol. 2021, 603, 127025. [Google Scholar] [CrossRef]
  16. Bosompemaa, P.; Brookfield, A.; Zipper, S.; Hill, M.C. Using national hydrologic models to obtain regional climate change impacts on streamflow basins with unrepresented processes. Environ. Model. Softw. 2024, 138, 106234. [Google Scholar] [CrossRef]
  17. Li, J.; Li, Y.; Zhang, T.; Feng, P. Research on the future climate change and runoff response in the mountainous area of Yongding watershed. J. Hydrol. 2023, 625, 130108. [Google Scholar] [CrossRef]
  18. Li, X.; Sha, J.; Wang, Z.-L. Comparison of daily streamflow forecasts using extreme learning machines and the random forest method. Hydrol. Sci. J. 2019, 64, 1857–1866. [Google Scholar] [CrossRef]
  19. Li, X.; Sha, J.; Li, Y.-m.; Wang, Z.-L. Comparison of hybrid models for daily streamflow prediction in a forested basin. J. Hydroinformatics 2017, 20, 191–205. [Google Scholar] [CrossRef]
  20. Sha, J.; Li, X.; Yang, J. Estimation of Watershed Hydrochemical Responses to Future Climate Changes Based on CMIP6 Scenarios in the Tianhe River (China). Sustainability 2021, 13, 10102. [Google Scholar] [CrossRef]
  21. Kapoor, A.; Pathiraja, S.; Marshall, L.; Chandra, R. DeepGR4J: A deep learning hybridization approach for conceptual rainfall-runoff modelling. Environ. Model. Softw. 2023, 169, 105831. [Google Scholar] [CrossRef]
  22. Sadeghi Tabas, S.; Humaira, N.; Samadi, S.; Hubig, N.C. FlowDyn: A daily streamflow prediction pipeline for dynamical deep neural network applications. Environ. Model. Softw. 2023, 170, 105854. [Google Scholar] [CrossRef]
  23. Yao, Z.; Wang, Z.; Wang, D.; Wu, J.; Chen, L. An ensemble CNN-LSTM and GRU adaptive weighting model based improved sparrow search algorithm for predicting runoff using historical meteorological and runoff data as input. J. Hydrol. 2023, 625, 129977. [Google Scholar] [CrossRef]
  24. Yu, X.; Zhang, L.; Zhou, T.; Zheng, J. Assessing the Performance of CMIP6 Models in Simulating Droughts across Global Drylands. Adv. Atmos. Sci. 2024, 41, 193–208. [Google Scholar] [CrossRef]
  25. Zhang, B.; Song, S.; Wang, H.; Guo, T.; Ding, Y. Evaluation of the performance of CMIP6 models in simulating extreme precipitation and its projected changes in global climate regions. Nat. Hazards 2024. [Google Scholar] [CrossRef]
  26. Eyring, V.; Bony, S.; Meehl, G.A.; Senior, C.A.; Stevens, B.; Stouffer, R.J.; Taylor, K.E. Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization. Geosci. Model Dev. 2016, 9, 1937–1958. [Google Scholar] [CrossRef]
  27. Liu, Y.; Yan, X.; Du, W.; Zhang, T.; Bai, X.; Nan, R. Downscaling Daily Reference Evapotranspiration Using a Super-Resolution Convolutional Transposed Network. Water 2024, 16, 335. [Google Scholar] [CrossRef]
  28. Gibson, P.B.; Stuart, S.; Sood, A.; Stone, D.; Rampal, N.; Lewis, H.; Broadbent, A.; Thatcher, M.; Morgenstern, O. Dynamical downscaling CMIP6 models over New Zealand: Added value of climatology and extremes. Clim. Dyn. 2024, 62, 8255–8281. [Google Scholar] [CrossRef]
  29. Chae, S.T.; Chung, E.-S. Significant contribution of bias correction methods to uncertainty in future runoff projections under CMIP6 climate change. J. Hydrol. Reg. Stud. 2024, 56, 101973. [Google Scholar] [CrossRef]
  30. Kim, Y.; Evans, J.P.; Sharma, A. Multivariate bias correction of regional climate model boundary conditions. Clim. Dyn. 2023, 61, 3253–3269. [Google Scholar] [CrossRef]
  31. Islam, S.A.; Anwar, A.H.M.F.; Bari, M. A simple method of bias correction for GCM derived streamflow at catchment scale. Hydrol. Sci. J. 2023, 68, 1409–1425. [Google Scholar] [CrossRef]
  32. Vogel, E.; Johnson, F.; Marshall, L.; Bende-Michl, U.; Wilson, L.; Peter, J.R.; Wasko, C.; Srikanthan, S.; Sharples, W.; Dowdy, A.; et al. An evaluation framework for downscaling and bias correction in climate change impact studies. J. Hydrol. 2023, 622, 129693. [Google Scholar] [CrossRef]
  33. Wamahiu, K.; Kala, J.; Evans, J.P. The influence of bias correction of global climate models prior to dynamical downscaling on projections of changes in climate: A case study over the CORDEX-Australasia domain. Clim. Dyn. 2024, 62, 1219–1231. [Google Scholar] [CrossRef]
  34. Robertson, D.E.; Chiew, F.H.S.; Potter, N. Adapting rainfall bias-corrections to improve hydrological simulations generated from climate model forcings. J. Hydrol. 2023, 619, 129322. [Google Scholar] [CrossRef]
  35. Parey, S.; Gailhard, J. Extreme Low Flow Estimation under Climate Change. Atmosphere 2022, 13, 164. [Google Scholar] [CrossRef]
  36. Li, Y.; Zeng, X.-M.; Guo, J. Evaluation of a multisite weather generator on precipitation simulation in the Yangtze river basin. Hydrol. Res. 2021, 53, 206–220. [Google Scholar] [CrossRef]
  37. Evin, G.; Favre, A.C.; Hingray, B. Stochastic generation of multi-site daily precipitation focusing on extreme events. Hydrol. Earth Syst. Sci. 2018, 22, 655–672. [Google Scholar] [CrossRef]
  38. Maraun, D.; Widmann, M. Statistical Downscaling and Bias Correction for Climate Research; Cambridge University Press: Cambridge, UK, 2018. [Google Scholar]
  39. Kochkov, D.; Yuval, J.; Langmore, I.; Norgaard, P.; Smith, J.; Mooers, G.; Klöwer, M.; Lottes, J.; Rasp, S.; Düben, P.; et al. Neural general circulation models for weather and climate. Nature 2024, 632, 1060–1066. [Google Scholar] [CrossRef]
  40. Eyring, V.; Gentine, P.; Camps-Valls, G.; Lawrence, D.M.; Reichstein, M. AI-empowered next-generation multiscale climate modelling for mitigation and adaptation. Nat. Geosci. 2024, 17, 963–971. [Google Scholar] [CrossRef]
  41. Leinonen, J.; Nerini, D.; Berne, A. Stochastic Super-Resolution for Downscaling Time-Evolving Atmospheric Fields With a Generative Adversarial Network. IEEE Trans. Geosci. Remote Sens. 2021, 59, 7211–7223. [Google Scholar] [CrossRef]
  42. Stengel, K.; Glaws, A.; Hettinger, D.; King, R.N. Adversarial super-resolution of climatological wind and solar data. Proc. Natl. Acad. Sci. USA 2020, 117, 16805–16815. [Google Scholar] [CrossRef] [PubMed]
  43. Sha, J.; Chen, X.; Chang, Y.; Zhang, M.; Li, X. A spatial weather generator based on conditional deep convolution generative adversarial nets (cDCGAN). Clim. Dyn. 2024, 62, 1275–1290. [Google Scholar] [CrossRef]
  44. Ji, H.K.; Mirzaei, M.; Lai, S.H.; Dehghani, A.; Dehghani, A. Implementing generative adversarial network (GAN) as a data-driven multi-site stochastic weather generator for flood frequency estimation. Environ. Model. Softw. 2024, 172, 105896. [Google Scholar] [CrossRef]
  45. Fick, S.E.; Hijmans, R.J. WorldClim 2: New 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 2017, 37, 4302–4315. [Google Scholar] [CrossRef]
  46. Wang, Y.; Xu, H.-M.; Li, Y.-H.; Liu, L.-L.; Hu, Z.-H.; Xiao, C.; Yang, T.-T. Climate Change Impacts on Runoff in the Fujiang River Basin Based on CMIP6 and SWAT Model. Water 2022, 14, 3614. [Google Scholar] [CrossRef]
  47. Fanta, S.S.; Sime, C.H. Performance assessment of SWAT and HEC-HMS model for runoff simulation of Toba watershed, Ethiopia. Sustain. Water Resour. Manag. 2021, 8, 8. [Google Scholar] [CrossRef]
  48. Guo, J.; Liu, Y.; Zou, Q.; Ye, L.; Zhu, S.; Zhang, H. Study on optimization and combination strategy of multiple daily runoff prediction models coupled with physical mechanism and LSTM. J. Hydrol. 2023, 624, 129969. [Google Scholar] [CrossRef]
  49. Mohammed, Z.M.; Hassan, W.H. Climate change and the projection of future temperature and precipitation in southern Iraq using a LARS-WG model. Model. Earth Syst. Environ. 2022, 8, 4205–4218. [Google Scholar] [CrossRef]
  50. Munawar, S.; Rahman, G.; Moazzam, M.F.U.; Miandad, M.; Ullah, K.; Al-Ansari, N.; Linh, N.T.T. Future Climate Projections Using SDSM and LARS-WG Downscaling Methods for CMIP5 GCMs over the Transboundary Jhelum River Basin of the Himalayas Region. Atmosphere 2022, 13, 898. [Google Scholar] [CrossRef]
Figure 1. Technical framework of the coupled FS-WG-McGAN-LSTM approach for climate change impact assessment on watershed hydrology. Red borders indicate input/source datasets, purple borders represent modeling tools, blue borders denote outputs/derived results, and black dashed borders explain processing steps/contents.
Figure 1. Technical framework of the coupled FS-WG-McGAN-LSTM approach for climate change impact assessment on watershed hydrology. Red borders indicate input/source datasets, purple borders represent modeling tools, blue borders denote outputs/derived results, and black dashed borders explain processing steps/contents.
Atmosphere 15 01348 g001
Figure 2. Geographical location of the study area: (a) location of the Yellow River basin in China; (b) the Jinghe River basin within the Yellow River system; (c) details of the Jinghe River basin showing meteorological and hydrological stations.
Figure 2. Geographical location of the study area: (a) location of the Yellow River basin in China; (b) the Jinghe River basin within the Yellow River system; (c) details of the Jinghe River basin showing meteorological and hydrological stations.
Atmosphere 15 01348 g002
Figure 3. Architectural diagram of the multi-conditional generative adversarial network (McGAN): (a) generator network structure showing the transformation from noise vector and conditional tensor to multi-site weather data; (b) discriminator network structure for authenticity evaluation.
Figure 3. Architectural diagram of the multi-conditional generative adversarial network (McGAN): (a) generator network structure showing the transformation from noise vector and conditional tensor to multi-site weather data; (b) discriminator network structure for authenticity evaluation.
Atmosphere 15 01348 g003
Figure 4. Accuracy of daily water discharge simulations by LSTM model on test set.
Figure 4. Accuracy of daily water discharge simulations by LSTM model on test set.
Atmosphere 15 01348 g004
Figure 5. DBI Scores for clustering numbers of the 11 GCMs.
Figure 5. DBI Scores for clustering numbers of the 11 GCMs.
Atmosphere 15 01348 g005
Figure 6. Comparison of synthetic weather data series with historical data generated by the FS-WG model.
Figure 6. Comparison of synthetic weather data series with historical data generated by the FS-WG model.
Atmosphere 15 01348 g006
Figure 7. Convergence process of the generator and discriminator during McGANs model training.
Figure 7. Convergence process of the generator and discriminator during McGANs model training.
Atmosphere 15 01348 g007
Figure 8. Simulated daily streamflow at the watershed outlet under future climate changes.
Figure 8. Simulated daily streamflow at the watershed outlet under future climate changes.
Atmosphere 15 01348 g008
Figure 9. Comparison of streamflow simulated using observed meteorological data and synthetic data generated by McGANs.
Figure 9. Comparison of streamflow simulated using observed meteorological data and synthetic data generated by McGANs.
Atmosphere 15 01348 g009
Figure 10. Annual streamflow for different future periods under various climate scenarios.
Figure 10. Annual streamflow for different future periods under various climate scenarios.
Atmosphere 15 01348 g010
Figure 11. Average monthly precipitation under different climate change scenarios.
Figure 11. Average monthly precipitation under different climate change scenarios.
Atmosphere 15 01348 g011
Figure 12. Frequency of extreme events under different climate change scenarios.
Figure 12. Frequency of extreme events under different climate change scenarios.
Atmosphere 15 01348 g012
Table 1. Characteristics of meteorological stations in the Jinghe River watershed.
Table 1. Characteristics of meteorological stations in the Jinghe River watershed.
NameIDLongitudeLatitudeElevationPeriods
Huan Xian5382136.583107.3001255.61957-01-01 to 2020-12-31
Ping Liang5391535.550106.6671346.61957-01-01 to 2020-12-31
Xi Feng Zhen5392335.733107.6331421.01957-01-01 to 2020-12-31
Chang Wu5392935.200107.8001206.51957-01-01 to 2020-12-31
Table 2. Summary of the original data source.
Table 2. Summary of the original data source.
NameSource and DescriptionResolutionRemark
Historical Weather RecordsClimatic Data Center, National Meteorological Information Center, China Meteorological Administration (http://data.cma.cn, accessed on 1 October 2024)Daily1957–2020
Historical Hydrological DataAnnual Hydrological Report P. R. China, Volume 4(8), from National Library of ChinaMonthly2006–2020
Historical GCM DataCoupled Model Intercomparison Project Phase 6 Dataset (https://pcmdi.llnl.gov/CMIP6/, accessed on 1 October 2024)1 degree~100 km × 100 km
Future Climate DataGlobal Historical and Future Climate Grid Dataset Downscaled Based on GCMs (www.worldclim.org, accessed on 1 October 2024)10 min18.5 km × 18.5 km
Table 3. Simulation accuracy and clustering analysis results for the 11 selected GCMs for the study area.
Table 3. Simulation accuracy and clustering analysis results for the 11 selected GCMs for the study area.
GCM\ItemsMonthly PrecipitationMonthly Average Daily TemperatureClusters
MSERMSEMAER2NSEMSERMSEMAER2NSE
ACCESS-CM2166.2410.487.830.920.82163.3410.037.570.920.821
BCC-CSM2-MR706.2620.3317.130.940.41668.1818.7815.710.940.454
CMCC-ESM23974.8046.0137.560.93−2.063439.8440.0332.540.93−1.502
INM-CM5-01818.7531.0423.430.80−0.411781.3928.9621.870.82−0.343
IPSL-CM6A-LR-INCA212.5612.099.310.920.72150.1410.188.070.930.771
MIROC61317.2927.1819.550.80−0.071364.5226.5819.030.82−0.090
MRI-ESM2-0319.7014.229.760.840.69301.2913.238.900.850.711
UKESM1-0-LL101.278.696.080.940.84101.738.515.930.940.831
EC-Earth3-Veg124.439.277.480.940.85119.558.796.980.930.851
GISS-E2-1-G994.0723.3020.440.940.211020.3922.5619.620.940.205
MPI-ESM1-2-HR315.0514.0511.270.860.70287.9112.8510.290.870.731
Table 4. Dry–wet transition probability parameters of the FS-WG model.
Table 4. Dry–wet transition probability parameters of the FS-WG model.
MonthProbability DryProbability Wet to DryProbability Dry to Wet
Jan.0.820.360.14
Feb.0.740.350.22
Mar.0.700.350.27
Apr.0.620.330.35
May.0.560.310.44
Jun.0.510.270.50
Jul.0.380.190.54
Aug.0.380.170.49
Sep.0.400.190.46
Oct.0.530.280.39
Nov.0.720.350.21
Dec.0.860.420.13
Table 5. Future climate change scenarios: temperature changes (°C, absolute difference from current) and precipitation changes (ratio to current).
Table 5. Future climate change scenarios: temperature changes (°C, absolute difference from current) and precipitation changes (ratio to current).
PeriodScenarioItemJan.Feb.Mar.Apr.May.Jun.Jul.Aug.Sep.Oct.Nov.Dec.
2030sSSP1-26temperatures1.961.711.461.631.561.982.242.632.361.941.932.11
precipitation1.001.061.041.041.101.011.010.951.111.031.041.00
SSP2-45temperatures1.871.421.541.621.581.952.132.462.381.761.741.88
precipitation1.001.041.051.051.061.021.040.981.051.041.021.02
SSP3-70temperatures1.531.481.251.241.441.832.042.372.191.711.581.75
precipitation1.001.011.061.051.051.011.040.961.041.081.011.01
SSP5-85temperatures1.821.851.471.591.772.082.322.672.421.901.951.98
precipitation1.001.031.071.021.061.041.040.971.081.061.031.02
2050sSSP1-26temperatures2.272.332.121.911.962.552.913.302.992.202.202.43
precipitation1.051.081.081.091.090.991.041.021.151.141.081.02
SSP2-45temperatures2.782.002.332.272.392.933.133.493.462.762.662.57
precipitation1.031.101.061.091.091.001.090.961.111.111.041.01
SSP3-70temperatures2.692.312.142.132.402.993.173.593.432.752.592.63
precipitation1.001.061.121.071.061.011.060.981.081.091.021.00
SSP5-85temperatures3.412.992.902.692.893.593.814.354.073.493.093.39
precipitation1.001.101.121.131.061.031.050.961.181.101.071.00
2070sSSP1-26temperatures2.722.462.242.362.112.723.033.543.102.492.362.59
precipitation1.061.121.101.131.141.041.050.961.081.141.151.05
SSP2-45temperatures3.312.813.032.973.023.713.944.564.293.543.263.32
precipitation1.041.121.091.101.121.011.050.961.141.141.111.05
SSP3-70temperatures3.933.673.433.363.454.184.535.054.903.973.713.96
precipitation1.011.081.111.141.111.051.050.961.111.221.061.01
SSP5-85temperatures4.684.223.983.954.295.125.476.306.024.774.704.70
precipitation1.151.201.201.181.111.051.040.961.131.291.101.08
2090sSSP1-26temperatures2.612.542.311.922.112.703.013.203.022.312.362.54
precipitation1.031.111.111.121.090.991.041.061.141.121.101.06
SSP2-45temperatures3.823.493.243.203.294.114.234.974.813.893.613.80
precipitation1.051.161.171.121.181.031.050.961.141.251.151.06
SSP3-70temperatures5.044.984.614.464.605.465.726.476.535.435.215.17
precipitation1.101.151.201.151.161.071.090.951.191.271.141.04
SSP5-85temperatures6.596.615.805.705.917.057.168.228.276.826.536.40
precipitation1.191.201.291.251.181.071.110.941.201.441.271.19
Table 6. Summary of hyperparameters for the McGANs model.
Table 6. Summary of hyperparameters for the McGANs model.
Hyper-ParametersTemperaturePrecipitation
GeneratorDiscriminatorGeneratorDiscriminator
Noise dimension512NA512NA
Batch size5844584423312331
Learning rate0.0001~0.00050.00010.0001~0.00050.0001
Initial features64326432
Early stop interval0.491~0.5090.491~0.5090.483~0.5170.483~0.517
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sha, J.; Chang, Y.; Liu, Y. Hydrological Response to Climate Change: McGAN for Multi-Site Scenario Weather Series Generation and LSTM for Streamflow Modeling. Atmosphere 2024, 15, 1348. https://doi.org/10.3390/atmos15111348

AMA Style

Sha J, Chang Y, Liu Y. Hydrological Response to Climate Change: McGAN for Multi-Site Scenario Weather Series Generation and LSTM for Streamflow Modeling. Atmosphere. 2024; 15(11):1348. https://doi.org/10.3390/atmos15111348

Chicago/Turabian Style

Sha, Jian, Yaxin Chang, and Yaxiu Liu. 2024. "Hydrological Response to Climate Change: McGAN for Multi-Site Scenario Weather Series Generation and LSTM for Streamflow Modeling" Atmosphere 15, no. 11: 1348. https://doi.org/10.3390/atmos15111348

APA Style

Sha, J., Chang, Y., & Liu, Y. (2024). Hydrological Response to Climate Change: McGAN for Multi-Site Scenario Weather Series Generation and LSTM for Streamflow Modeling. Atmosphere, 15(11), 1348. https://doi.org/10.3390/atmos15111348

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop