Next Article in Journal
Variability of Drinking Water Quality on the Basis of Analysis of Qualitative Monitoring from a Selected Water Supply Network Located in South-Eastern Poland
Previous Article in Journal
Pollution Characteristics and Eutrophication Assessment in Plain River Network Areas: A Case Study of the Beijing–Hangzhou Grand Canal (Changzhou Section)
Previous Article in Special Issue
Synthetic Time Series Data in Groundwater Analytics: Challenges, Insights, and Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantitative Assessment of Future Environmental Changes in Hydrological Risk Components: Integration of Remote Sensing, Machine Learning, and Hydraulic Modeling

1
College of Automation, Qingdao University, Qingdao 266071, China
2
College of Environmental Science and Engineering, Qingdao University, Qingdao 266071, China
3
Carbon Neutrality and Eco-Environmental Technology Innovation Center of Qingdao, Qingdao 266071, China
4
Institute for Future (IFF), Qingdao University, Qingdao 266071, China
*
Authors to whom correspondence should be addressed.
Water 2024, 16(23), 3354; https://doi.org/10.3390/w16233354
Submission received: 13 October 2024 / Revised: 18 November 2024 / Accepted: 20 November 2024 / Published: 22 November 2024

Abstract

:
Floods are one of the most devastating natural hazards that have intensified due to land use land cover (LULC) changes in recent years. Flood risk assessment is a crucial task for disaster management in flood-prone areas. In this study, we proposed a flood risk assessment framework that combines flood vulnerability, hazard, and damages under long-term LULC changes in the Tajan watershed, northern Iran. The research analyzed historical land use change trends and predicted changes up to 2040 by employing a Geographic Information System (GIS), remote sensing, and land change modeling. The flood vulnerability map was generated using the Random Forest model, incorporating historical data from 332 flooded locations and 12 geophysical and anthropogenic flood factors under LULC change scenarios. The potential flood damage costs in residential and agricultural areas, considering long-term LULC changes, were calculated using the HEC-RAS hydraulic model and a global damage function. The results revealed that unplanned urban growth, agricultural expansion, and deforestation near the river downstream amplify flood risk in 2040. High and very high flood vulnerability areas would increase by 43% in 2040 due to human activities and LULC changes. Estimated annual flood damage for agriculture and built-up areas was projected to surge from USD 162 million to USD 376 million and USD 91 million to USD 220 million, respectively, considering 2021 and 2040 land use change scenarios in the flood-prone region. This research highlights the importance of land use planning in mitigating flood-associated risks, both in the studied area and other flood-prone regions.

1. Introduction

Floods are the most extensive global natural phenomena that cause noticeable damage to economic activity, people’s lives, the natural environment, property, and substructures [1]. Land use land cover (LULC) change, encompassing urban sprawl, deforestation, and agricultural land expansion, has substantial impacts on the hydrologic cycle, hydraulic flow, and climate systems [2,3]. These impacts contribute to increased flood frequency and intensity and consequently raise flood risk levels [4]. Accurately assessing the connection between LULC dynamic changes and flood risk provides an effective decision-making tool for risk mitigation [5].
Generally, flood risk encompasses three main components: hazards (flood extent and depth), vulnerabilities (probability of exposure), and flood consequences (loss of human life, property damage, and infrastructure damage) [6,7]. Prior research has addressed flood vulnerability, hazard, and damage as distinct components using a variety of methods [8,9]. These methods include physical modeling, probabilistic analysis, geospatial analysis, and multiple-criteria decision making (MCDM) tools, coupled with hydrological and numerical hydraulic models [10]. For instance, HA-Mim et al., in 2022, generated hazard exposure and vulnerability maps using the MCDM method and geospatial techniques in the Barguna district of Bangladesh [11]. Adnan et al., in 2020, assessed flood damages through the analysis of diverse flood recurrence intervals and LULC change scenarios; an increase in flood damages corresponding to shifts in both recurrence intervals and LULC scenarios was revealed [10]. Guido et al., in 2023, analyzed the potential impacts of flood events and identified vulnerable areas by considering the influence of LULC changes through hydraulic modeling (HEC-RAS) and geospatial analysis in a small watershed in the southeast United States [12]. Previous studies have made valuable contributions to assessing hazards, vulnerabilities, and flood consequences under LULC changes [13]. In fact, flood consequences arise from the interplay of hazards and vulnerabilities. Flood hazard, influenced by variables such as flood vulnerability, occurrence probability, velocity, depth, and duration, determine the extent of consequences [14]. Flood vulnerability directly influences an area or population’s risk level during a flood hazard event, intensifying potential impacts and emphasizing consequences like damage and loss [13]. However, no research has developed a comprehensive framework that effectively and simultaneously integrates the identification of flood vulnerability, hazards, and damages under long-term LULC changes [5].
Furthermore, flood risk assessment may have limitations in providing reliable and accountable outcomes for flood control when dealing with large and complex datasets [15]. In this regard, the machine learning (ML) models and data mining technique models have gained attention for their ability to excel in creating vulnerability maps by efficiently handling complex relationships, processing large datasets, and providing improved accuracy, faster computational times, and reduced model development costs [10]. Notably, significant ML models, such as Random Forest (RF) [16], Linear Regression [17], Support Vector Machine (SVM) [18], Neural Networks (NNs) [19], Frequency Ratio (FR) [20], and the Bayesian Generalized Linear Model [21], have been utilized to forecast various natural hazards. Hence, it is essential to employ efficient ML models and data mining techniques to assist urban planners and decision-makers in identifying high-risk, flood-prone areas under complex data.
Therefore, this study aims to develop a comprehensive framework that integrates multiple techniques and models, including ML, hydraulic modeling, remote sensing (RS), GIS, and Land Change Modeler (LCM), to assess the impacts of long-term LULC changes on flood vulnerability, hazard, and consequences in the Tajan watershed of northern Iran. This framework uniquely combines flood vulnerability, hazard, and damage assessments under long-term LULC change scenarios into a single modeling approach. By utilizing advanced technologies that reduce uncertainties, it provides a comprehensive and dynamic assessment of flood risk, surpassing previous studies that have examined these aspects separately. Using RS-GIS and LCM, we predict future LULC changes in 2040 based on historical trends. The widely used ML model, Random Forest, and the HEC-RAS hydraulic–hydrology model are utilized to map flood vulnerability and hazards, considering the long-term dynamic changes in LULC. Global depth–damage curves and functions under changing LULC conditions are used to calculate the negative consequences of flooding, including direct financial damage to flooded agricultural land and built-up areas. This study will (i) identify high-risk areas, flood extent, and depth under different LULC change scenarios and varying time intervals, and (ii) estimate the potential flood damage costs in residential and agricultural areas, considering long-term LULC changes within flood-prone regions and over different time intervals. The findings will aid urban planners and policymakers in effectively managing and mitigating the consequences of flooding, particularly in rapidly developing countries dealing with urban growth and flood damage.

2. Cases Study

The Tajan River watershed is sited between 53°05′ and 53°18′ Eastern longitude and 36°09′ and 36°29′ Northern latitude in Mazandaran Province, Iran. As shown in Figure 1, it passes through 77 villages, covering an area of roughly 3810 km2 with a length of 140 km. The climate of the area is extremely humid and generally mild, with an annual temperature of nearly 14 °C and an average yearly rainfall precipitation of 830 mm; maximum and minimum precipitation occur in autumn/winter and summer, respectively. About 65–70% of annual rainfall takes place during the September–December period [22]. The altitude in the region varies from −2 m at the outlet to 3720 m above mean sea level. This area’s dominant land cover types include rangeland, forest, agricultural, waterbodies, and built-up areas. The economy of the study area relies mainly on rice cultivation and its associated supply chains [23]. The Tajan River regularly experiences severe floods with high discharge, resulting in damage to built-up areas and agricultural lands every year [22]. For example, the 2019 flood in Mazandaran Province resulted in damages amounting to USD 4.7 billion across agricultural, industry, residential, commercial, and road properties, making it as the most costly natural disaster in Iran [24]. Despite the severity of such floods, research on flood risk management in this watershed remains limited [19,25].

3. Methodology and Data

This research was conducted in the following four main stages: (i) LULC map development and future prediction; (ii) flood vulnerability assessment based on LULC change scenarios; (iii) flood hazard assessment for different return periods under LULC change scenarios; (iv) flood consequences assessment in terms of economic damages. Figure 2 provides a visual representation of the methodology framework.

3.1. LULC Maps Development and Future Prediction

3.1.1. Processing Remote Sensing Images to Generate a LULC Map

Spatial–temporal changes in different land cover categories are revealed using scenes from the Landsat 5, Landsat 7, and Landsat 8 satellites [2]. Various satellite images, including those from sensors such as ETM+, TM, and OLI, with a resolution of 30 m, were obtained from the United States Geological Survey (USGS) Earth Explorer. These images were utilized to create LULC maps for three distinct years: 2001, 2011, and 2021. Detailed information about the features of the different Landsat sensors can be found in Table 1.
In this study, we employed the maximum likelihood algorithm to analyze the LULC categories in the Tajan watershed. To create the LULC maps, we utilized 6 to 8 bands from Landsat images for each year. Radiometric and topographic normalization, as well as geometric correction, were applied using ArcGIS tools to ensure accurate LULC classification and to minimize the impacts of factors such as atmospheric conditions, solar angle, and sensor view angle [26]. Following the pre-processing steps, the maximum likelihood classification method was employed, utilizing nearest neighbor analyses with 150 training samples, to categorize the LULC maps into five classes of land use and land cover, as presented in Equation (1). Accuracy assessment of the classification results is crucial in aerial imagery analysis. We used two uncertainty matrices, the overall accuracy test and the kappa coefficient matrix, to estimate the accuracy of the generated maps [27].
P ( c l a s s i | d a t a ) = P d a t a | c l a s s i · P c l a s s i P d a t a
where P ( c l a s s i | d a t a ) is the probability of class i given the data, P d a t a | c l a s s i is the likelihood of the data given class i, P c l a s s i is the prior probability of class i, and P d a t a is the probability of the data.

3.1.2. Prediction of LULC Changes and Model Validation

The Land Change Modeler (LCM) within the TerrSet geospatial monitoring software 2020 was utilized to measure the biodiversity effects in order to investigate and predict land cover changes. This model incorporates several techniques, including Markov chain matrices, artificial neural networks (ANNs), logistic regression (LR), and multilayer perceptron (MLP) [28]. To generate future LULC changes using LCM, we followed four steps: 1. Calculation of the amount and spatial trend of changes. 2. Estimation of transition potential and identification of essential factors driving LULC changes. 3. Modeling of LULC changes. 4. Model validation and accuracy assessment. The validation and agreement were assessed using the overall accuracy (OA) and kappa index between predicted and actual maps. Detailed explanations for these steps are provided in Appendix A.
The overall accuracy (OA), defined in Equation (2), was calculated by dividing the number of correctly identified pixels by the total number of categorized pixels in the error matrix [29].
A = 1 N P i i
where N represents the total number of accuracy pixels, and P i i denotes the summation of the major diagonal of elements in the error matrix. The kappa index measures the degree of agreement between the predicted and actual maps when considering all elements of the error matrices. It is calculated using Equation (3):
K a p p a = P i P c 1 P c × 100
where P i represents the observed agreement in the error matrices (overall accuracy), while Pc represents the expected agreement. The kappa index could range from −1 to +1, with a value of 1 indicating perfect agreement. A value of 0 indicates random classification, while a negative value implies agreement worse than random [30].

3.2. Flood Vulnerability Assessment Method

To create flood vulnerability maps, we utilized the Random Forest model and a GIS application, incorporating flood inventory data and flood influential factors. The Mean Decrease Accuracy (MDA) method, known for estimating the weights of flood-influencing factors, was employed to select the relevant input variables for modeling [31]. The process involved the spatial modeling of geo-environmental factors to determine flood vulnerability and locations, using testing and training datasets. Subsequently, the vulnerability maps were generated using the RF model and the training datasets. To assess the impact of land use and land cover (LULC) changes on flood probability, the RF model was trained separately for two LULC scenarios: 2021 and 2040. To validate the RF models, different statistical measurements were applied to the testing and training datasets. A comprehensive explanation of the methods and data used in generating flood vulnerability maps based on the two LULC change scenarios is provided in the subsequent section.

3.2.1. Inventory and Mapping of Flood

A flood inventory map is a crucial tool that offers valuable insights into areas that have been affected by flooding in different historical periods. It serves as a valuable resource for researchers for identifying flood patterns and predicting future occurrences [23]. The flood distribution map of the Tajan watershed, shown in Figure 1, was obtained from the Management of Land and Water Resources Company of Iran. The flood inventory includes 332 recorded flood events that occurred between January 1990 and July 2021. These events were compiled using various sources, such as satellite imagery from Google Earth and field surveys. Considering past flood patterns, current flood risk serves as a valuable predictor for future occurrences [32]. To facilitate modeling and analysis, 332 non-flooded locations were randomly selected in ArcGIS 10.5, creating a dichotomous dependent variable for the machine learning models. The dataset was divided into training (70%) and testing (30%) sets using random selection in Python with the Scikit-learn library, following the common practice in machine learning to allocate sufficient data for training and evaluate model performance [33]. Notably, the selection of non-flooded points deliberately focused on areas with lower flood risk, including hills, forested lands with gentle slopes, and mountains. This intentional choice ensured unbiased representation and enhanced the performance of machine learning models [34]. By evaluating the models on the testing set, their performance could be assessed, and the sensitivity of the results to the data split could be determined. This approach offered insights into the models’ generalization capabilities and their ability to predict flood susceptibility beyond the training data [35].

3.2.2. Preparing Flood Influencing Factors

Collecting and constructing influential data on floods is an essential part of simulations. Considering the results published in previous research, we selected twelve influencing factors to predict future floods based on the study area’s topography, geology conditions, hydrology, and climatic and geo-environment conditions [36,37,38]. Figure 3 depicts the selected factors utilized in our analysis, incorporating remote sensing techniques. Spatial data representing these factors were converted into a 30 m resolution raster format using ArcGIS and nearest neighbor interpolation. The natural break method was employed to classify the data into distinct categories, facilitating their seamless integration into the flood susceptibility modeling framework [33]. The topography factors included slope (SL), aspect (AS), altitude (AL), the Topographic Position Index (TPI), and the Terrain Ruggedness Index (TRI). The geology factors consisted of the information value of soil texture (ST) and lithology (LI). Drainage density (DD), distance from river (DR), rainfall (RA), and the Topographic Wetness Index (TWI) made up the hydrology factors, and the geo-environment factor was the land use land cover (LULC). Furthermore, we employed the multicollinearity (MC) test to evaluate the linear relationships among various influencing factors in the context of flood vulnerability assessment. Comprehensive explanations of the flood-influencing factors and the MC test can be found in Appendix B.

3.2.3. Random Forest (RF) and Validation

RF is a supervised machine learning algorithm that combines multiple decision trees to achieve a single result, making it suitable for regression and classification tasks [39]. Before training the model, three main parameters need to be adjusted: node size, the number of trees, and the number of sampled features (variables). The number of features can range from one to the entire set of variables, while the number of trees is typically set between 500 and 1000 [40]. Each decision tree in the Random Forest is trained on a bootstrap sample from the training set, with one-third (30%) of the data reserved as an “out of bag” (OOB) sample for testing. Feature bagging is then applied to introduce additional randomness, increasing dataset diversity and reducing correlation between decision trees [41]. To assess the importance of variables in predicting flood risk, Random Forest measures their impact. In this study, the importance ranking of flood influencing factors was evaluated using the MDA method. MDA calculates the accuracy reduction associated with the presence or absence of specific variables, and the importance of independent factors is evaluated through permutation [42]. The importance factors were evaluated using the MDA method, as presented in Equation (4) [43]:
Variable Importance : V I x i = 1 n t r e e t = 1 n t r e e i ϵ O O I y i = f x i i ϵ O O B I y i = f x i j O O B
The OOB (out-of-bag) method is used in Random Forest to estimate the prediction error of each training sample. It is a subsampling technique that employs bootstrap aggregation (bagging) by creating training samples through subsampling with replacement. OOB accuracy is measured before and after the permuting variable xj, and the difference is calculated to obtain a more accurate prediction value [31]. For each tree t, where t ∈ {1, 2, 3…, ntree}, the variable importance of xj is determined by averaging the difference between the predicted class before permuting xj (yi = f(xi)) and after permuting variable xj (yi = f(xij)) for a specific observation i [42]. Indeed, this study assessed the performance of the Random Forest model by using various statistical measures to evaluate its ability to generate precise flood vulnerability maps. These measures encompass the Sensitivity Test (SST), Specificity (SPF), Positive and Negative Predictive Values (PPV and NPV), Root-Mean-Squared Deviation (RMSD), Coefficient of Determination (R2), and Receiver Operation Characteristics (ROC-AUCS). Detailed descriptions of the validation methods and the statistical measures, including their equations, can be found in Appendix C.

3.3. Flood Hazard Assessment Methods

The evaluation of flood hazards involves conducting hydrological simulations of floods across multiple return periods. For this study, the Hydrological Engineering Center-River Analysis System (HEC-RAS v6) tool was utilized to calculate flood depth and velocity in coastal and fluvial regions [14,44]. The selection of the study area for the flood hazard assessment was based on several factors, including historically flooded locations, the increase in urbanization and cultivation along the river, and the observed rise in stream flow in recent years [45]. These factors collectively led to the identification of a specific part of the study area, covering approximately 205 km2, as a highly flood-susceptible zone for conducting the hazard assessment (shown in Figure 4). It is important to note that the chosen area represents a subset of the overall study region. Due to limitations in data availability, it was not feasible to include the entire region in the analysis. However, the selected area was deemed most relevant for assessing flood hazards based on known flood occurrences and the influence of urbanization and agricultural activities along the river [23]. To initiate the assessment, maximum discharge data recorded by the Sari Water Resources Department of Iran over the past 30 years (1989–2020) were collected from hydrological stations located upstream (Soleiman-Tange) and downstream (Balakoola) of the river (refer to Table 2 and Figure 5). Flood frequency analysis (FFA) was then conducted using the Easyfit application, which utilizes probability distribution functions such as Log Pearson type 3 (LP3), to establish the relationship between the magnitude of extreme runoff or river flow events (Q) and their frequency of occurrence (also known as recurrence interval or return period) (T). The analysis considered various return periods, including Q2, Q5, Q10, Q25, Q50, Q100, Q500, and Q1000 years. Assessing a 1000-year flood event was particularly crucial for effective risk management, given the occurrence of excessive discharge in 2019 and the expansion of rural areas and agricultural lands along the river [14]. Subsequently, the HEC-RAS (1D/2D) combined model, a widely used software tool for flood depth and velocity calculations in coastal and fluvial regions, was employed to estimate the depth and intensity of flood inundation for each pixel (12.5*12.5) in the study region. The simulation process involved extracting river geometry data from the 12.5 m digital elevation model and creating bank lines, centerlines, flow path lines, and cross-section lines. Floodplains were then modeled for the main branch of the river, consisting of 78 cross-sections along a 40.63 km stretch of the Trajan River, with an average distance of approximately 0.59 km per section.
It should be noted that, in this study, manual calibration was required for the HEC-RAS model since it lacks an automatic calibration option. This calibration focused on Manning’s roughness coefficient (N value), a crucial parameter within the HEC-RAS model [46]. We adjusted Manning’s N values for both the main channel and side banks in the upstream and downstream regions. The calibrated values for the main channel were set at (0.04, 0.035, 0.05), while for the side banks, they were (0.045, 0.05, 0.06). Boundary conditions and flow types (sub-critical and super-critical) were determined based on the normal depth and slope profiles upstream and downstream. We conducted a steady-state analysis for all flood flow periods, involving a statistical assessment of 20 out of 30 recorded events from 1989 to 2020 to evaluate differences between simulated and observed water depths. This analysis employed metrics like the correlation coefficient (R), the Nash–Sutcliffe efficiency (NSE), and Root-Mean-Squared-Deviation (RMSD) [13,31].

3.4. Flood Consequences

To estimate the direct financial damages to agricultural land with crops and the built-up area based on land use and land cover (LULC) change scenarios, Equation (5) was employed in different cells and for the total cell area as follows:
D j = i = 1 n x i × f x i × A
where D represents the total damage sustained throughout flood recurrence interval j (in million USD),   x i represents the depth of flood in pixel i (m), f x i represents the function of damage in pixel i, and A represents the pixel area [47]. Accordingly, the adopted depth–damage curve at a continent level presented by [48] has been applied to calculate the perceptible flood damages to agricultural land and the built-up area, as shown in Figure 6. The flood depth–damage curves demonstrate the correlation between flood depth and total or relative economic loss, which directly affect financial losses [5]. While several factors influence the financial impact of a flood, such as inundation time, velocity and wave impacts, contamination, water temperature, and debris, many studies have predominantly considered only inundation depth and extent as influencing factors [14].
In this study, the maximum damage ratios from the depth–damage equation were converted from Euros to US dollars at a conversion rate of 1 Euro = 1.13 US dollars. Table 3 shows the cost of maximum damage considered to the built-up and agricultural land with the crops based on LULC scenarios, which is implemented from earlier research [48]. In this research, the costs of the built-up area and agricultural land with dominant rice crops were taken into account as 60 USD/sq. m. and 863 USD/hectare, respectively. The annual GDP value growth rates in rural areas for built-up areas and agricultural land were assumed to be 4.5% and 5.5%, respectively [14]. Furthermore, the max-damage in the next 20 years (2040) was expected to be 160 USD/sq.m. and 2760 USD/hectare for the built-up area and agricultural land with crops, respectively.
To assess flood damage with a spatial resolution of 30 m, ArcGIS was used to measure damage for nine flood recurrence intervals (2, 5, 10, 20, 50, 100, 200, 500, and 1000 years) under two LULC scenarios: 2021 and 2040. In order to match flood depth with the corresponding LULC type in each pixel, the inundation maps (discussed in Section 3.3) were overlaid on the land use maps in ArcGIS. Finally, the total flood damage at the pixel level was estimated using Equation (5) in MATLAB 23.2 software.
Subsequently, the risk was assessed by evaluating the expected annual damage (EAD) in both LULC change scenarios. The EAD was estimated as the area under the curve, as presented in Equation (6) [10].
EAD = D P d P d A      
In this equation, D(P) represents the damage that happens with a high possibility (P) each year, which is inversely related to the flood recurrence interval (T). The variable A represents the total area. Considering all return times between a high and low probability of flooding provides an accurate estimate of risk because the choice of return periods affects flood risk calculations [13]. To visualize the relationship between flood damage and coupled occurrence probabilities, an EAD curve was constructed. The occurrence probabilities of 0.001 were determined to be the upper and lower boundaries of the probability curve.

4. Results

4.1. Predicted LULC Map and the Change Assessment

This study utilized maps from 2001, 2011, and 2021 were used to project the LULC alterations up to 2040. The historical and recent maps of the Tajan watershed were categorized into five classes: built-up, agricultural, waterbody, forest, and rangeland (see Figure 7). The accuracy of the classified LULC images was evaluated using a Neural Network classification algorithm, and the results are presented in Table 4, which shows the acceptable accuracy rates measured by the kappa coefficient and overall accuracy tests. The MLP model employed in the LULC conversion sub-model demonstrated satisfactory estimates across all sub-models, as indicated by the findings presented in Table 5. Notably, the sub-models for the conversion from forestland to agricultural land and from rangeland to built-up areas achieved the highest accuracy rates of approximately 97.3% and 96.5%, respectively, making them the most precise sub-models among the generated models. Additionally, the LULC change map for 2040, projected using the Markov chain and LCM models, is depicted in Figure 8. The results indicate a satisfactory level of accuracy in the model’s predictions of future LULC changes within the study area, achieving an overall accuracy of 0.86 and a kappa coefficient of 0.89.
Table 6 presents the alterations in different LULC types during various periods calculated by the Markov chain matrix. The results reveal that forestland was the most extensive land cover, occupying about 50% of the entire area in 2001. However, it is projected to experience a significant decline in future changes. Comparing changes between the LULC maps for 2021 and 2040 shows that the forest area will shrink by 88.4 km2 in the next 20 years. The most dynamic classes are agricultural lands and built-up areas, which will increase by approximately 273.3 km2 and 15.84 km2, respectively, by 2040. The results show that the built-up area and agricultural land are expected to double by 2040. The map illustrates that a majority of the expansion in built-up and agricultural land has occurred along the river, which are critical areas prone to flood exposure. Rangelands, including grasslands, shrublands, and woodlands, are projected to decrease significantly by 227.83 km2 from 2021 to 2040. On the other hand, the water body experienced a slight increase of about 1.2% from 2001 to 2040 primarily due to changes in rainfall patterns [49].

4.2. Flood Vulnerability Assessment and Mapping

4.2.1. Flood Influencing Factors for Vulnerability Mapping

Before ranking all flood contributing factors in the watershed, a multicollinearity assessment was conducted using TOL (tolerance) and VIF (variance inflation factor) statistics to evaluate the presence and strength of correlation among the flood influencing factors in the watershed. Based on the results obtained from Table 7, the VIF values range from 1.02 to 3.72, and the TOL values range from 0.30 to 0.97. Typically, VIF values below 10 and TOL values above 0.1 are considered acceptable thresholds, indicating low multicollinearity. These values suggest that multicollinearity is not a significant concern among the flood-influencing factors in this study. The VIF and TOL values for all predictor variables fall within acceptable ranges, indicating low correlation between variables [25]. Therefore, multicollinearity is not a problem in this study, and the predictor variables can be considered independent and suitable for further analysis.
In this study, the MDA selection method was employed to identify and eliminate irrelevant factors, thereby reducing uncertainty in the prediction models using the RF machine learning algorithm. Figure 9a,b depict the ranking of flood influencing factors’ importance based on the RF machine learning model for the two LULC scenarios, 2021 and 2040. The findings indicate that the distance from the river (DR) emerges as the most crucial predictor of flood occurrence, exhibiting the highest values (W = 0.45, 0.47), while the soil texture (ST) factor demonstrates the lowest values (W = 0.03, 0.2) among all the input variables in both the 2021 and 2040 scenarios. In the 2021 LULC scenario, the most significant factors are distance from the river (DR) (W = 0.45), altitude (AL) (W = 0.31), land use land cover (LULC) (W = 0.23), slope (SL) (W = 0.19), drainage density (dd) (W = 0.17), and rainfall (RA) (W = 0.13). However, the rankings of important factors change in the 2040 LULC scenario. Distance from the river (DR) assumes the highest importance (W = 0.48), followed by land use land cover (LULC) (W = 0.39), altitude (AL) (W = 0.28), slope (SL) (W = 0.20), rainfall (RA) (W = 0.19), and drainage density (dd) (W = 0.15).

4.2.2. Validation of Flood Vulnerability Models

The flood vulnerability model was validated using the Receiver Operating Characteristic (ROC) curve and several statistical indices listed in Table 8. These assessments evaluated the model’s ability to identify areas at high risk of flooding. The ROC curve visually displays the match between observed flood points and the model’s predictions, with the area under the curve (AUC) providing a quantitative measure of accuracy. Standard numerical indices were also used to assess the reliability of the model on both the training and testing datasets. Table 8 and Figure 10 present the model’s performance through parameters such as Positive Predictive Value (PPV), Negative Predictive Value (NPV), Sensitivity and Specificity Tradeoff (SST), Specificity (SPF), R2, Root-Mean-Squared Deviation (RMSD), and AUC for the training and validation data. The RF model trained on the 2040 LULC scenario exhibited higher values for PPV (0.87%), NPV (0.90), SST (0.92), SPF (0.89), R2 (0.91), and AUC (0.95%), indicating improved efficiency compared to the 2021 LULC scenario. Conversely, the RF model showed lower RMSD (0.28) for the 2040 scenario compared to 2021 (RMSD = 0.33). The validation dataset also yielded higher values for PPV (0.83), NPV (0.89), SST (0.91), SPF (0.87), R2 (0.90), AUC (0.92), and lower RMSD (0.31) when considering the future scenario. In contrast, the validation data based on the 2021 scenario exhibited slightly lower values for PPV (0.79), NPV (0.87), SST (0.88), SPF (0.78), R2 (0.89), AUC (0.90), and higher RMSD (0.37). These results confirm that the RF model accurately predicts the flood vulnerability pattern in the study area, both historically and under future conditions [16].

4.2.3. Flood Vulnerability Mapping

The flood vulnerability maps of the study area presented in Figure 11a,b were generated by the RF machine learning model based on the impacts of historical and future LULC change scenarios. The flood vulnerability map is classified into five categories: very low, moderate, high, and very high vulnerability, utilizing the natural break classification method in ArcGIS.
From Figure 12a,b, it can be detected that 13.8%, 27.5%, 22.7%, 22%, and 14% of the study area are placed in the very low, low, moderate, high, and very high flood vulnerable zones in the 2021 LULC scenario. In comparison, the future 2040 LULC scenario shows that 17.4%, 18.6%, 21.2%, 23.2%, and 19.6% of the study area are located in very low-, low-, moderate-, high-, and very high-risk vulnerable zones, respectively. These percentages represent the proportion of the total study area that lies within each flood vulnerability category. The maps for both scenarios reveal that high and very high flood vulnerability zones are predominantly located along the main Tajan River and its tributaries. Conversely, low and very low vulnerability zones are concentrated in the high-elevation areas of the watershed’s southwest. The moderate-vulnerability zones in both scenarios are mostly found downstream and in areas with gentler slopes.

4.3. Flood Hazard Mapping

The flood hazard mapping analysis provides crucial insights into the potential flood risks and associated damages within the study area. Using the HEC-RAS flood model, simulations of flood inundation, extents, and flow velocity were conducted for design events with return periods ranging from 2 years to 1000 years.
Figure 13a displays the flood inundation depth and extent map of the selected portion of the Tajan watershed (see Figure 4) for the Q1000 peak discharge at a 30 m topographic resolution. The flood inundation map reveals that the lowland areas adjacent to the Tajan River in the middle and downstream parts are characterized by very high flow depths. Furthermore, Figure 13b presents the percentage of built-up land, agricultural land, and forestland located in the selected portion of the Tajan watershed (Section 3.3) for the years 2021 and 2040. However, in this study, we are only looking at the impact of flooding on built-up areas and agricultural land with crops.
Figure 13c illustrates the relationship between maximum simulated peak discharges and flood return periods, along with the corresponding maximum water depth. The discharge rate is primarily influenced by the stream channel’s depth, width, and cross-sectional shape. As the return period increases, the discharge increases linearly, resulting in a larger area being exposed to flood risks and damages, consistent with previous research [50,51]. The maximum inundation depth varies from 2.2 to 4.6 m for a 2- to 10-year return period and ranges from 6.14 to 15.12 m for return periods of 25, 50, 100, 200, 500, and 1000 years. Figure 13d indicates that the flood inundation area increases linearly with higher return periods. The agricultural lands situated along the riverside, covering a significant portion of the watershed, are particularly vulnerable to flood risks, leading to substantial damage. On the other hand, the built-up areas, occupying a smaller portion of the watershed and located further away from the river, are more susceptible to flood intensity. It is worth mentioning that the model calibration involved adjusting Manning’s roughness coefficient to achieve the best agreement between observed and simulated inundation data. Figure 14 demonstrates the favorable agreement between observed and simulated water depths, with a Coefficient of Determination exceeding 0.90 at both stations for 20 reviewed events. The hydraulic models demonstrate strong performance, indicated by stage calibration statistics (see Table 9) with R values above 0.9, NSE values of at least 0.79, and RMSE below 0.015 at both upstream and downstream stations. Similarly, Table 9 presents the statistical indicators used to assess the models’ performance in flow validation at both stations.

4.4. Flood Damage Analysis

This study performed a flood damage analysis to evaluate the financial impact of flooding on affected areas, focusing on agricultural lands with crops and built-up regions. The analysis applied the flood depth–damage function (Equation (5)) and considered two LULC scenarios: 2021 and 2040.
To calculate the area affected by flooding, the flood hazard grid maps (depth and extents) for different return periods and the LULC maps of 2021 and 2040 were imported into the ARCGIS 10.5 software. By using geometric intersection tools, the affected land cover types in the study area were analyzed against the simulated flood extent for each return period. The total cost of flood damage was then determined by combining the depth of flooding in each cell, the affected area, and the cost of maximum damage (see Table 3) based on the LULC scenarios for 2021 and 2040.
Figure 15 presents the total calculated damages resulting from floods for various return periods under the two LULC scenarios. The analysis revealed that agricultural lands are more vulnerable to flooding due to their proximity to the river, and damage to these regions occurs across all flood return periods (Q2 to Q1000). In contrast, significant damages to built-up areas were observed only in return periods of 25 to 1000 years. The results show an increasing trend in damage costs due to flood depth and flow intensity, as influenced by variations in the return period and LULC scenarios.
The calculated average damage to built-up areas in all flood return periods (Q25 to Q1000) WAS USD 117.43 million in 2021 and is projected to rise to USD 287.24 million in 2040. Additionally, the estimated damages to agricultural lands in 2021 across all flood return periods (Q2 to Q1000) amounted to USD 162.82 million, which is expected to increase to USD 452.10 million by 2040. The most significant flood damage to agricultural lands is estimated to be USD 102.33 million, with a Q1000 return period in the 2040 scenario, attributed to a noticeable increase in cultivated lands. Similarly, the greatest flood damage to the built-up area is estimated to be USD 81.44 million with a Q1000 return period in the 2040 scenario. The observed increase in flood damage from 2021 to 2040 is attributed to the considerable alteration of LULC, resulting in an increase in agricultural land and built-up areas.
Furthermore, the probability of the exceedance curve was derived from the flood damages plot, considering the probabilities of exceedance, and the expected annual damage (EAD) was calculated from the area under the curve. Figure 16 illustrates the probability of the exceedance curve, and Figure 17 shows the calculated EAD, which signifies the influence of LULC alteration on flood risk. The results in Figure 17 indicate that flood damage varies significantly between the maximum and minimum exceedance probabilities in both scenarios. In 2021, when the probability of exceedance is less than 0.02, the maximum average damage to agricultural land increases to USD 34 million with a steeper slope. When the probability of flooding is more than 0.02, the damage decreases with a gentle slope to USD 8 million. Similarly, damages of USD 13 million were calculated for the built-up area in 2021 when the exceedance probability was more than 0.02, while the average annual maximum damage increased to USD 30 million when the exceedance probability was lower than 0.02.
In the 2040 scenario, damages to agricultural land may vary from USD 20.6 million to USD 102 million for a flood with an annual exceedance probability from 0.5 to 0.001. Moreover, the built-up area in 2040 will be exposed to flooding with an annual exceedance probability of 0.04 to 0.001 due to its distance from the river, resulting in damages of USD 30.5 million to USD 81.44 million, respectively. Considering the significant flooding in the area, even during 2- and 5-year return periods, future LULC transformations will lead to increased vulnerability in agricultural land and built-up areas, posing a significant flood risk. As shown in Figure 17, the EAD for built-up areas in 2021 was around USD 91 million, with projections indicating a rise to approximately USD 220 million by 2040. Similarly, the EAD for agricultural land with crops was estimated at USD 162 million in 2021, which could increase by roughly USD 376 million by 2040, given ongoing land use changes.

5. Discussion

The investigation of flood risk, encompassing vulnerability, hazard, and consequences, is crucial, especially in the context of notable events that occur repeatedly due to climate change and land use land cover (LULC) changes. Analyzing the impact of various flood risk variables is necessary to mitigate the loss of life and property damage and promote sustainable development [52]. This study employed multiple methods and approaches to assess flood risk in the Tajan watershed and examine the influence of dynamic human activities and complex geomorphological conditions on LULC changes.

5.1. LULC Modeling and Its Relative Impact

To facilitate socioeconomic development, our study tracked LULC changes using remote sensing, ARCGIS 10.5, and TerrSet software 2020, projecting shifts until 2040. Rigorous accuracy assessments, achieving over 80% overall accuracy and kappa index values, ensured the reliability of our predictions in the study area. The significant decline in forestland and rangeland from 2001 to 2021, attributed to expanding agricultural and built-up areas, aligns with socioeconomic trends in the study area. This result is consistent with other studies that focused on LULC changes in the northern part of Iran [53]. Various factors significantly influenced the LULC modeling process. Cramer’s V analysis highlighted that distance from the road and elevation were the primary factors shaping LULC transformations in the Tajan watershed. The transportation network, particularly road expansion, emerged as a critical element capable of transforming and altering the landscape, thereby escalating the potential for flood damage. Furthermore, road construction in the study area, intended to enhance connectivity between villages and cities, has directly influenced the expansion of residential areas [27]. Most villages and agricultural lands in the Tajan watershed have been established in relatively flat or downstream locations with lower elevations, making these areas more susceptible to floods and waterlogging. Consequently, floods significantly impact people’s lives, their properties, and the region’s infrastructure. The increase in agricultural land and deforestation, along with a reduction in the decrease in impermeable surfaces, has led to greater runoff intensity and velocity, aligning with studies on land cover conversion effects on runoff volume and speed [13,54].
The future LULC projection indicated an increase in agricultural land (+7.2%), a slight increase in the built-up area (+0.42%), notable declines in rangeland (−5.88%) and forestland (−2.3%), and minimal change in water bodies (+0.01%) over the past two decades. Rangeland will experience the most significant reduction compared to other land types due to increased human activities. This result indicates that the percentage of deforestation will be reduced due to the current strict rules for water management projects. Areas projected to become built up under the land use model must be considered as the most suitable areas for future development; however, these areas are prone to flooding and need to be controlled by a disaster management organization. These findings are consistent with previous studies on LULC’s future simulation of land use changes in rapidly urbanizing areas [55,56].

5.2. Flood Vulnerability Mapping Considering LULC Change Scenarios

This study utilized the Random Forest algorithm, an advanced ensemble model, to develop and validate a flood vulnerability map for Iran’s Tajan watershed. The Random Forest model, recognized as an effective classification tool, has shown superior performance in flood vulnerability modeling in previous studies [57,58,59]. Five key factors were identified as the most influential in determining flood vulnerability in the study area, with minor variations between past and future projections due to deforestation, human activities, road construction, and agricultural encroachment. In 2021, the distance from the river was identified as the most significant factor for flood susceptibility, followed by altitude, land use, slope, drainage density, and rainfall. Similarly, in the year 2040, the most important factors remained the same but with some degree of dynamism. The consistent significance of flood susceptibility factors from 2021 to 2040, with minor changes, reflects their inherent hydrological importance [60]. The LULC alteration scenarios utilized in this study were based on the hypothesis that the LULC transformation from 2021 to 2040 would follow the patterns observed in the previous 20 years (2001 to 2021). The results indicate that extreme LULC transformations are likely to occur at lower altitudes near the rivers. Consequently, the reduction in forestland and the increases in farmland and built-up areas at lower altitudes will amplify the intensity and frequency of floods in this region. These findings align with earlier research that links increased runoff velocity and flood risk to deforestation in downstream areas [61,62,63].
The vulnerability map was classified, using the break classification technique, into five categories: very low, low, moderate, high, and very high. The vulnerability map based on the LULC scenario for 2021 shows that the low-risk area (1047.75 km2) encompasses a major part of the study area. In addition, the high- and very high-risk classes comprise approximately 533.4 and 838.2 km2 and are located in the downstream zone near the river. The map of predicted flood vulnerability considering the LULC transformations occurring by 2040 reveals that the very high- and high-flood-risk class areas would increase by about 746.76 and 883.92 km2, respectively. However, the low- and very low-class areas will be reduced by about 708.66 and 662.94 km2. Most of the downstream agricultural lands (rice fields) and built-up areas are located in high-risk and very high-risk flood zones and would be subject to serious flood financial and human losses. The high- and very high-flood-risk zones in both scenarios mainly cover the northern and western parts of the watershed in the vicinity of the main river and its branches on a lower slope. Low drainage density is one of the Tajan watershed’s characteristics, which is one of the main reasons for its high flood vulnerability. Conversely, the southern and eastern parts of the watershed enjoy lower vulnerability to flooding due to their higher elevation. The RF model was successfully validated for both periods, with AUC training and validation rates exceeding 0.90 in both the 2021 and 2040 scenarios. These results are consistent with previous studies’ findings and underscore the RF method’s reliability for accurately mapping flood susceptibility [64]. Several prior studies also support our findings regarding the significant impact of LULC changes on flood probability zones [65,66,67].

5.3. Flood Hazard and Damage Estimation Based on LULC Change Scenarios

This study integrates flood vulnerability and hazard data to assess the impact of flood consequences on different land use categories. The high-risk zone along the river, prone to flooding, was identified for evaluating damages on agricultural land and built-up areas under various LULC change scenarios. The HEC-RAS hydraulic model, calibrated with 30 years of flood discharge data, simulated inundation depth and intensity for different return periods (Q2 to Q1000). The model demonstrated accurate performance, aligning with similar studies [68,69]. For the 2-to-10-year return period, the average maximum inundation depth is 3.4 m, increasing to 10.63 m for higher return periods. In the low and medium return period floods, the difference in the maximum inundation depth is not extreme, probably because there is not a large difference in elevation between the river channel and adjacent areas. The results indicate that topography has a crucial impact on confining the inundated zone during extreme floods, where a further increase in the flood volume is converted into an increase in depth rather than area coverage [50]. In the studied watershed, sometimes, a high inundation depth occurred even beyond the river channel on the border between the river and agricultural land due to the formation of artificial levees and the shifting of the channel. Channel shifting in the Tajan River occurs due to human activity, loose soil, and subsequent erosion [70,71]. The depth–damage function was applied to estimate the direct financial cost of damages to the building and agricultural lands (with the crops) in various flood return periods under previous and future LULC change scenarios. A similar function has been used in other damage studies by Alipour et al. in 2020 and Narayan et al. 2017 at different locations [72,73].
Our results showed that due to the proximity to the river, the agricultural lands are exposed to all flood return periods (Q2 to the Q1000 year); however, the built-up areas a short distance from agricultural lands are susceptible to flood severity in the 25- to 1000-year return period. The results illustrate an increase in average damage between 2021 and 2040. Specifically, the average damage to the built-up area is projected to rise by USD 169.81 million, while the average damage to agricultural lands is expected to increase by USD 289.28 million. The expected annual flood damage (EAD) will increase in the future LULC scenario due to the limited support provided for the flood risk when making land expansion plans. The expected annual damages (EADs) for the built-up area will increase by USD 129 million from 2021 to 2040. Similarly, for agricultural land (with crops), the EAD is expected to rise by USD 214 million, reflecting the ongoing LULC changes. Damage analysis using a depth–damage curve is a standard damage calculation method; however, due to some uncertainties, this method may overestimate flood losses [51]. The findings of this study highlight that the Tajan watershed will face increased flood risk if risk orientation is not considered in future land use planning to mitigate damage and ensure long-term sustainability, so it is crucial to prioritize the implementation of flood risk mitigation strategies. The spatial data on flood risk and land use changes provided in this research can assist stakeholders, including the Ministry of Land, in identifying areas that require land use policy intervention. Additionally, the developed framework can be valuable for land use planners to evaluate the impacts of land use transformations on flood risk and its consequences. This research represents an initial effort to quantify the losses resulting from potential land use transformations and the associated flood risk in the Tajan watershed. While numerous studies have focused on either modeling land use transformation or flood risk, there is a scarcity of assessments that consider their combined impact.
In general, this study aims to reduce uncertainty through the integration of various techniques and models, including historical trend analysis for predicting land use and land cover (LULC) changes, rigorous model evaluation, and standardized data processing and visual analytics. However, it is important to acknowledge that the complete elimination of uncertainty is not achievable. For further assessment, exploring interactions between climate change scenarios—considering greenhouse gas emissions and socioeconomic factors—and their combined impact on flood risk with LULC changes is recommended. Additionally, incorporating a broader range of qualitative factors such as social vulnerability and community resilience into the analysis will contribute to a more comprehensive understanding of flood risk. The utilization of optimized models can further improve the accuracy of projections. The transparent communication of uncertainties and careful interpretation of findings will aid informed decision-making processes.
It is also worth noting that the 2040 damage projections in this study are based on current infrastructure and technology for flood control. This study applied consistent inflation rates and assumed annual GDP growth rates of 4.5% for built-up areas and 5.5% for agricultural land. However, we acknowledge that future factors, such as fluctuations in currency values, unexpected economic shifts, or changes in growth rates, could influence these estimates. To strengthen future projections, it would be beneficial to consider a range of economic scenarios that account for these potential variables.

6. Conclusions

This study developed a flood risk assessment framework that integrates flood vulnerability, hazard, and flood consequences with the spatiotemporal changes in land use and land cover (LULC) scenarios in the Tajan watershed in Iran. We applied a series of computational operations and complex geoprocessing techniques to identify high-risk areas, measure flood depths, and calculate damage influenced by land use changes. The primary findings are as follows:
(i) LULC Transformation: This study identified rapid LULC changes, particularly increases in agricultural and built-up areas, especially near rivers and downstream regions, while forest and rangeland areas decreased. This shift, confirmed by spatial analysis, is anticipated to escalate runoff and flood occurrences, especially downstream. These results underscore the importance of considering flood impacts when planning land expansion to reduce future flood risks.
(ii) Flood Vulnerability Mapping: Random Forest models effectively mapped flood vulnerability under LULC changes, demonstrating high AUC values and accurately identifying the middle and downstream regions of the Tajan watershed as highly vulnerable. This vulnerability is driven by factors such as lower elevations, gentle slopes, increased rainfall, and notable LULC transformations. The study projects a 43% rise in high-vulnerability areas by 2040, primarily due to human activities and ongoing LULC changes. Key contributors to flood risk include proximity to the river, elevation, land use type, slope, drainage density, and rainfall, each varying in influence across different scenarios.
(iii) Economic Impact Assessment: Using HEC-RAS and global damage functions, this study assessed the financial impacts of LULC-induced floods under two scenarios. Direct losses were calculated for buildings and agricultural land in high-risk flood zones across various return periods. The analysis showed that flood impacts on agriculture were largely influenced by flood height and duration, while built-up areas experienced significant damage in events with a return period of 25 years or more. Due to LULC changes, damages are projected to increase, indicating growing exposure to flood risks. The estimated annual flood damage (EAD) for agriculture is set to rise from USD 136.85 million in 2021 to USD 376.10 million in the 2040 scenario. Similarly, EAD for built-up areas is expected to increase from USD 91.43 million in 2021 to USD 220.49 million in the 2040 scenario. These finding underscores the critical role of land use planning in reducing flood-associated risks.
This study highlights flood risk zones resulting from future land cover changes, aiding land development decision-makers in determining appropriate flood conservation and damage mitigation actions. The projected increase in flood damage, primarily due to agricultural and built-up area development in floodplains, allows planning authorities to employ local planning instruments. These measures can redirect land development and promote structural adaptation strategies for agricultural land and buildings in areas prone to potential flood hazards, thereby reducing damage potential.

Author Contributions

Writing—original draft, F.G.; review & editing, J.Z. and A.N.; Supervision, Y.L. and J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Shandong Provincial Natural Science Foundation (Grant No. ZR2024ME242), National Natural Science Foundation of China (42007412), China Sri Lanka Joint Research and Demonstration Center for Water Technology, China-Sri Lanka Joint Center for Education and Research, CAS, and Plan for Youth Innovation Team of Colleges in Shandong Province (Efficient Municipal Wastewater Treatment and Reuse Technology, DC2000000961 and 2022KJ147).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors would like to thank Shari Lin Holderread for proofreading and editing the manuscript.

Conflicts of Interest

The authors declare that they have no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A. LULC Change Modeling and Validation

In this study, the modeling of transition potentials involved the careful selection of five transition sub-models representing various types of land cover changes, including transitions from agricultural land to built-up areas, from forest to agricultural land, from forest to built-up areas, from forest to water bodies, and from rangeland to agricultural land. The choice of these sub-models was based on their proven significance in capturing pertinent land cover dynamics [28]. The multilayer perceptron neural network (MLP) technique was employed to develop these sub-models, considering various training parameters such as the learning rate, momentum, the number of iterations, and hidden layers. The MLP algorithm is renowned for its ability to effectively learn complex patterns and relationships within data [74]. Additionally, five influential input factors, namely slope, elevation, distance from a road, distance from a river, and distance from built-up areas, were considered. These factors played a crucial role in modeling the transition potentials and understanding the drivers of land cover changes. To assess the significance of these factors, this study applied Cramer’s V classification method, enabling the assessment of their association with land cover changes. The outputs obtained from modeling the transition potentials and incorporating the influential input factors served as essential inputs for the prediction stage.

Appendix B. Flood Influencing Factors and Multicollinearity Test Details

Appendix B provides comprehensive descriptions of the flood-influencing factors used in our analysis and outlines the multicollinearity (MC) test conducted to assess the inter-relationships among these factors.
Slope (SL): The slope was computed by the digital elevation model (DEM) and the slope calculation tool in ArcGIS [31]. The slope range map was generated from 0 to 81% via DEM in five categories, as shown in Figure 3a.
Aspect (AS): The local climate, hydrological situation, and type of humidity elements are precisely connected to the slope aspects [75]. The aspect map is classified into nine categories by the ArcGIS toolbox, namely flat, north, northeast, east, southeast, south, southwest, west, and northwest, which were prepared by the DEM map, as shown in Figure 3b.
Altitude (AL): The altitude map is derived from the ALOSPALSAR digital elevation model (DEM) and classified into five classes ranging from −2 to 3700 m, as shown in Figure 3c.
Topographic Wetness Index (TWI): Some research has indicated that the TWI is a significant morphometric factor of flood vulnerability in a catchment [76]. TWI evaluates the impact of topography on water flow accumulation and runoff, as shown in Figure 3d. The TWI is computed as presented in Equation (A1):
T W I = ln A s t a n β
where As is the area of a particular catchment (m2m−1) and β is the slope gradient. This study divided TWI into five categories ranging from 1.1 to 25.
Topographic Position Index (TPI): The TPI represents the elevation of each location compared with its neighboring area. The index value will be a positive number if a point is higher than its surroundings, such as on hilltops and ledges, and will be a negative number in lower topographic areas such as valleys [77]. In this study, the TPI was classified into the five classes presented in Figure 3e.
Terrain Ruggedness Index (TRI): The TRI published by Riley et al. defines the elevation differences between adjacent cells [78]. The TRI map was divided into five classes via the DEM map, and the results are presented in Figure 3f.
Information value of Soil Texture (ST): The soil texture illustrates the various grain sizes in the soil, which supplies the appropriate infiltration for controlling surface runoff [79]. The 1:100,000-scale digitized soil map was acquired from the Soil and Water Research Department of Iran, as shown in Figure 3g.
Rainfall (RA): The amount and speed of the rainfall play an essential role in intensifying the flood frequency in a catchment [80]. The rainfall map in this study was generated by utilizing the annual precipitation data from six rain gauges from 1990 to 2021. The interpolation method (based on the Gaussian process known as Kriging) was used by ArcGIS 10.5 tools to create a rainfall map in seven categories from 420 mm to 770 mm, as shown in Figure 3h.
Drainage Density (DD): The drainage density is calculated by dividing the entire length of the stream in a drainage watershed by the whole area [81]. The drainage density map was generated in five classes in this study, as shown in Figure 3i.
Distance from River (DR): Past studies have shown that floods and damages are more likely to occur near rivers [57]. The distance to the river map was divided into five categories, with the longest distance being 4700 m, as shown in Figure 3j.
Lithology (LI): A lithology map of this study area was developed by the Geological Survey and Mineral Exploration Agency of Iran (GSI). Lithology mainly controls water flow infiltration due to the impenetrability of the rock, which in turn affects the occurrence of floods. The lithology map of the Tajan watershed is presented in Figure 3k.
Land Use Land Cover (LULC): The land use and cover type is an influential factor in determining the rate of roughness coefficient, evapotranspiration, runoff velocity, and volume [57]. In this study, the land use land cover maps made from Landsat 5, 7, and 8 satellite imageries were combined and categorized into five classes: built-up area, agricultural land, water bodies, forestland, and rangeland. Figure 3l illustrates the LULC types of 2021 in the study area.
Multicollinearity (MC) test.
In order to comprehend the linear connection among the multiple influencing factors, a statistical MC test is required for the assessment of flood vulnerability. The MC identifies inter-relationships between two or more independent factors in a dataset. There are two popular ways to measure multicollinearity: tolerance (TOL) and variance inflation factors (VIF). Generally, a VIF above 4 or a tolerance below 0.25 indicates that a multicollinearity problem that needs to be corrected has occurred [45]. Equations (A2) and (A3) display the statistical calculation of the tolerance (TOL) and variance inflation factor (VIF):
T O L = 1 R i 2
V I F = 1 T O L
where Ri2 represents the unadjusted Coefficient of Determination for regressing the ith independent factor on the remaining factors.

Appendix C. Validation Methods for Flood Vulnerability Mapping

In this study, several statistical measures were employed to assess the machine learning model’s ability to develop satisfactory flood vulnerability maps. These measures include the Sensitivity Test (SST), Specificity (SPF), Positive and Negative Predictive Values (PPV and NPV), Root-Mean-Squared Deviation (RMSD), Coefficient of Determination (R2), and Receiver Operation Characteristics (ROC-AUCS). To calculate these statistical measurements, four standard indices were used: true negative (TN), true positive (TP), false negative (FN), and false positive (FP). TP and FP represent the numbers of flooded and non-flooded pixels that were correctly classified, while TN and FN represent the numbers of flooded and non-flooded pixels that were incorrectly classified, respectively. PPV and NPV measure the ratio of true positive and true negative predictions, considering all positive and negative predictions, respectively [57]. RMSD displays the rate of deviation between simulated and existing values. A small RMSD value shows that the model operated adequately. R2 represents the Linear Regression of observed and predicted data in the machine learning model [37]. ROC-AUC is a widely used metric for assessing the quality of classification and predictive capacity in models. The area under the ROC-AUC curve reflects the ratio of the model’s correctly predicted pixels (true positive) to incorrectly predicted pixels (false positive) from the flooded and non-flooded points, and its value ranges between 0 and 1. An ROC-AUC value of 1 indicates unbiased estimation of flood occurrence, while a value greater than 0.5 signifies acceptable flood modeling performance [82,83]. The following Equations (A4)–(A8) illustrate the statistical measurements employed to assess the performance of the machine learning model.
A U C = T P + T N P + N
P P V = T P F P + T P
N P V = T P T P + F N
S e n s i t i v i t y = T P T P + F N
S p e c i f i c i t y = T N F P + T N  

References

  1. Tehrany, M.S.; Shabani, F.; Jebur, M.N.; Hong, H.; Chen, W.; Xie, X. GIS-based spatial prediction of flood prone areas using standalone frequency ratio, logistic regression, weight of evidence and their ensemble techniques. Geomat. Nat. Hazards Risk 2017, 8, 1538–1561. [Google Scholar] [CrossRef]
  2. Gai, L.; Nunes, J.P.; Baartman, J.E.; Zhang, H.; Wang, F.; de Roo, A.; Ritsema, C.J.; Geissen, V. Assessing the impact of human interventions on floods and low flows in the Wei River Basin in China using the LISFLOOD model. Sci. Total Environ. 2019, 653, 1077–1094. [Google Scholar] [CrossRef]
  3. Lee, J.; Pak, G.; Yoo, C.; Kim, S.; Yoon, J. Effects of land use change and water reuse options on urban water cycle. J. Environ. Sci. 2010, 22, 923–928. [Google Scholar] [CrossRef]
  4. Gholami, F.; Nemati, A.; Li, Y.; Zhang, J. Calculation of runoff computation cost and sensitivity analysis of topological attributes. Remote Sens. Appl. Soc. Environ. 2022, 26, 100714. [Google Scholar] [CrossRef]
  5. Shakeri Rostami, H.; Bagheri, A.; Saadodin, A. Assessment of the Flood Risk Governance in Iran According to the Approach of Policy Arrangements. Iran-Water Resour. Res. 2021, 17, 65–103. [Google Scholar]
  6. Hajian, F.; Dykes, A.P.; Cavanagh, S. Assessment of the flood hazard arising from land use change in a forested catchment in northern Iran. J. Flood Risk Manag. 2019, 12, e12481. [Google Scholar] [CrossRef]
  7. Sahana, M.; Rehman, S.; Sajjad, H.; Hong, H. Exploring effectiveness of frequency ratio and support vector machine models in storm surge flood susceptibility assessment: A study of Sundarban Biosphere Reserve, India. Catena 2020, 189, 104450. [Google Scholar] [CrossRef]
  8. Gholami, F.; Sedighifar, Z.; Ghaforpur, P.; Li, Y.; Zhang, J. Spatial-Temporal Analysis of Various Land Use Classifications and Their Long-Term Alteration’s Impact on Hydrological Components; Using Remote Sensing, SAGA-GIS, ARCSWAT Model. Environ. Sci. Water Res. Technol. 2023, 9, 1161–1181. [Google Scholar] [CrossRef]
  9. Zope, P.E.; Eldho, T.I.; Jothiprakash, V. Hydrological impacts of land use–land cover change and detention basins on urban flood hazard: A case study of Poisar River basin, Mumbai, India. Nat. Hazards 2017, 87, 1267–1283. [Google Scholar] [CrossRef]
  10. Adnan, M.S.G.; Abdullah, A.Y.M.; Dewan, A.; Hall, J.W. The effects of changing land use and flood hazard on poverty in coastal Bangladesh. Land Use Policy 2020, 99, 104868. [Google Scholar] [CrossRef]
  11. Ha-Mim, N.M.; Rahman, A.; Hossain, Z.; Fariha, J.N.; Rahaman, K.R. Employing multi-criteria decision analysis and geospatial techniques to assess flood risks: A study of Barguna district in Bangladesh. Int. J. Disaster Risk Reduct. 2022, 77, 103081. [Google Scholar] [CrossRef]
  12. Guido, B.I.; Popescu, I.; Samadi, V.; Bhattacharya, B. An integrated modeling approach to evaluate the impacts of nature-based solutions of flood mitigation across a small watershed in the southeast United States. Nat. Hazards Earth Syst. Sci. 2023, 23, 2663–2681. [Google Scholar] [CrossRef]
  13. Mind’Je, R.; Li, L.; Amanambu, A.C.; Nahayo, L.; Nsengiyumva, J.B.; Gasirabo, A.; Mindje, M. Flood susceptibility modeling and hazard perception in Rwanda. Int. J. Disaster Risk Reduct. 2019, 38, 101211. [Google Scholar] [CrossRef]
  14. Hooshyaripor, F.; Faraji-Ashkavar, S.; Koohyian, F.; Tang, Q.; Noori, R. Annual flood damage influenced by El Niño in the Kan River basin, Iran. Nat. Hazards Earth Syst. Sci. 2020, 20, 2739–2751. [Google Scholar] [CrossRef]
  15. Panahi, M.; Dodangeh, E.; Rezaie, F.; Khosravi, K.; Van Le, H.; Lee, M.-J.; Lee, S.; Pham, B.T. Flood spatial prediction modeling using a hybrid of meta-optimization and support vector regression modeling. Catena 2021, 199, 105114. [Google Scholar] [CrossRef]
  16. Hengl, T.; Nussbaum, M.; Wright, M.N.; Heuvelink, G.B.M.; Gräler, B. Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables. PeerJ 2018, 6, e5518. [Google Scholar] [CrossRef]
  17. Zhao, G.; Pang, B.; Xu, Z.; Peng, D.; Xu, L. Assessment of urban flood susceptibility using semi-supervised machine learning model. Sci. Total Environ. 2019, 659, 940–949. [Google Scholar] [CrossRef]
  18. Choubin, B.; Moradi, E.; Golshan, M.; Adamowski, J.; Sajedi-Hosseini, F.; Mosavi, A. An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Sci. Total Environ. 2019, 651, 2087–2096. [Google Scholar] [CrossRef]
  19. Jahangir, M.H.; Reineh, S.M.M.; Abolghasemi, M. Spatial predication of flood zonation mapping in Kan River Basin, Iran, using artificial neural network algorithm. Weather Clim. Extrem. 2019, 25, 100215. [Google Scholar] [CrossRef]
  20. Wang, Z.; Zeng, Z.; Lai, C.; Lin, W.; Wu, X.; Chen, X. A regional frequency analysis of precipitation extremes in Mainland China with fuzzy c-means and L-moments approaches. Int. J. Clim. 2017, 37, 429–444. [Google Scholar] [CrossRef]
  21. Andaryani, S.; Nourani, V.; Haghighi, A.T.; Keesstra, S. Integration of hard and soft supervised machine learning for flood susceptibility mapping. J. Environ. Manag. 2021, 291, 112731. [Google Scholar] [CrossRef] [PubMed]
  22. Zamani, Y.; Monfared, S.A.H.; Moghaddam, M.A.; Hamidianpour, M. A comparison of CMIP6 and CMIP5 projections for precipitation to observational data: The case of Northeastern Iran. Theor. Appl. Climatol. 2020, 142, 1613–1623. [Google Scholar] [CrossRef]
  23. Khosravi, K.; Pham, B.T.; Chapi, K.; Shirzadi, A.; Shahabi, H.; Revhaug, I.; Prakash, I.; Bui, D.T. A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci. Total Environ. 2018, 627, 744–755. [Google Scholar] [CrossRef]
  24. Bui, D.T.; Khosravi, K.; Shahabi, H.; Daggupati, P.; Adamowski, J.F.; Melesse, A.M.; Pham, B.T.; Pourghasemi, H.R.; Mahmoudi, M.; Bahrami, S.; et al. Flood Spatial Modeling in Northern Iran Using Remote Sensing and GIS: A Comparison between Evidential Belief Functions and Its Ensemble with a Multivariate Logistic Regression Model. Remote Sens. 2019, 11, 1589. [Google Scholar] [CrossRef]
  25. Iosub, M.; Minea, I.; Chelariu, O.E.; Ursu, A. Assessment of flash flood susceptibility potential in Moldavian Plain (Romania). J. Flood Risk Manag. 2020, 13, e12588. [Google Scholar] [CrossRef]
  26. Mishra, V.N.; Rai, P.K. A remote sensing aided multi-layer perceptron-Markov chain analysis for land use and land cover change prediction in Patna district (Bihar), India. Arab. J. Geosci. 2016, 9, 249. [Google Scholar] [CrossRef]
  27. Yousefi, S.; Pourghasemi, H.R.; Emami, S.N.; Pouyan, S.; Eskandari, S.; Tiefenbacher, J.P. A machine learning framework for multi-hazards modeling and mapping in a mountainous area. Sci. Rep. 2020, 10, 12144. [Google Scholar] [CrossRef]
  28. Abdulkareem, J.; Pradhan, B.; Sulaiman, W.; Jamil, N. Prediction of spatial soil loss impacted by long-term land-use/land-cover change in a tropical watershed. Geosci. Front. 2019, 10, 389–403. [Google Scholar] [CrossRef]
  29. Admas, M.; Melesse, A.M.; Tegegne, G. Predicting the Impacts of Land Use/Cover and Climate Changes on Water and Sediment Flows in the Megech Watershed, Upper Blue Nile Basin. Remote Sens. 2024, 16, 2385. [Google Scholar] [CrossRef]
  30. Dash, S.S.; Naik, B.; Kashyap, P.S. Assessment of land use/land cover change derived catchment hydrologic response: An integrated parsimonious hydrological modeling and alteration analysis based approach. J. Environ. Manag. 2024, 356, 120637. [Google Scholar] [CrossRef]
  31. Arabameri, A.; Rezaei, K.; Cerdà, A.; Conoscenti, C.; Kalantari, Z. A comparison of statistical methods and multi-criteria decision making to map flood hazard susceptibility in Northern Iran. Sci. Total Environ. 2019, 660, 443–458. [Google Scholar] [CrossRef] [PubMed]
  32. Hong, H.; Panahi, M.; Shirzadi, A.; Ma, T.; Liu, J.; Zhu, A.-X.; Chen, W.; Kougias, I.; Kazakis, N. Flood susceptibility assessment in Hengfeng area coupling adaptive neuro-fuzzy inference system with genetic algorithm and differential evolution. Sci. Total Environ. 2018, 621, 1124–1141. [Google Scholar] [CrossRef]
  33. Jiang, Z.; Yang, S.; Liu, Z.; Xu, Y.; Xiong, Y.; Qi, S.; Pang, Q.; Xu, J.; Liu, F.; Xu, T. Coupling machine learning and weather forecast to predict farmland flood disaster: A case study in Yangtze River basin. Environ. Model. Softw. 2022, 155, 105436. [Google Scholar] [CrossRef]
  34. Ekmekcioğlu, Ö.; Koc, K. Explainable step-wise binary classification for the susceptibility assessment of geo-hydrological hazards. Catena 2022, 216, 106379. [Google Scholar] [CrossRef]
  35. Costache, R.; Arabameri, A.; Costache, I.; Crăciun, A.; Pham, B.T. New Machine Learning Ensemble for Flood Susceptibility Estimation. Water Resour. Manag. 2022, 36, 4765–4783. [Google Scholar] [CrossRef]
  36. Lechowska, E. What determines flood risk perception? A review of factors of flood risk perception and relations between its basic elements. Nat. Hazards 2018, 94, 1341–1366. [Google Scholar] [CrossRef]
  37. El-Haddad, B.A.; Youssef, A.M.; Pourghasemi, H.R.; Pradhan, B.; El-Shater, A.-H.; El-Khashab, M.H. Flood susceptibility prediction using four machine learning techniques and comparison of their performance at Wadi Qena Basin, Egypt. Nat. Hazards 2021, 105, 83–114. [Google Scholar] [CrossRef]
  38. Hosseini, F.S.; Choubin, B.; Mosavi, A.; Nabipour, N.; Shamshirband, S.; Darabi, H.; Haghighi, A.T. Flash-flood hazard assessment using ensembles and Bayesian-based machine learning models: Application of the simulated annealing feature selection method. Sci. Total Environ. 2020, 711, 135161. [Google Scholar] [CrossRef]
  39. Breiman, L. Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). Stat. Sci. 2001, 16, 199–215. [Google Scholar] [CrossRef]
  40. Mojaddadi, H.; Pradhan, B.; Nampak, H.; Ahmad, N.; Ghazali, A.H.B. Ensemble machine-learning-based geospatial approach for flood risk assessment using multi-sensor remote-sensing data and GIS. Geomat. Nat. Hazards Risk 2017, 8, 1080–1102. [Google Scholar] [CrossRef]
  41. Rafiei-Sardooi, E.; Azareh, A.; Choubin, B.; Mosavi, A.H.; Clague, J.J. Evaluating urban flood risk using hybrid method of TOPSIS and machine learning. Int. J. Disaster Risk Reduct. 2021, 66, 102614. [Google Scholar] [CrossRef]
  42. Xiong, J.; Li, J.; Cheng, W.-M.; Wang, N.; Guo, L. A GIS-based support vector machine model for flash flood vulnerability assessment and mapping in China. ISPRS Int. J. Geo Inf. 2019, 8, 297. [Google Scholar] [CrossRef]
  43. Costache, R. Flood Susceptibility Assessment by Using Bivariate Statistics and Machine Learning Models—A Useful Tool for Flood Risk Management. Water Resour. Manag. 2019, 33, 3239–3256. [Google Scholar] [CrossRef]
  44. Papaioannou, G.; Vasiliades, L.; Loukas, A.; Alamanos, A.; Efstratiadis, A.; Koukouvinos, A.; Tsoukalas, I.; Kossieris, P. A Flood Inundation Modeling Approach for Urban and Rural Areas in Lake and Large-Scale River Basins. Water 2021, 13, 1264. [Google Scholar] [CrossRef]
  45. Lagmay, A.M.F.A.; Racoma, B.A.; Aracan, K.A.; Alconis-Ayco, J.; Saddi, I.L. Disseminating near-real-time hazards information and flood maps in the Philippines through Web-GIS. J. Environ. Sci. 2017, 59, 13–23. [Google Scholar] [CrossRef] [PubMed]
  46. Karamouz, M.; Zahmatkesh, Z.; Goharian, E.; Nazif, S. Coastal floodplain mapping and evaluation using GIS and HEC-GeoRAS models. In Proceedings of the World Environmental and Water Resources Congress 2014, Portland, OR, USA, 1–5 June 2014. [Google Scholar]
  47. Molinari, D.; De Bruijn, K.M.; Castillo-Rodríguez, J.T.; Aronica, G.T.; Bouwer, L.M. Validation of flood risk models: Current practice and possible improvements. Int. J. Disaster Risk Reduct. 2019, 33, 441–448. [Google Scholar] [CrossRef]
  48. Huizinga, J.; De Moel, H.; Szewczyk, W. Global Flood Depth-Damage Functions: Methodology and the Database with Guidelines; Joint Research Centre: Brussels, Belgium, 2017. [Google Scholar]
  49. Maghsood, F.F.; Moradi, H.; Bavani, A.R.M.; Panahi, M.; Berndtsson, R.; Hashemi, H. Climate Change Impact on Flood Frequency and Source Area in Northern Iran under CMIP5 Scenarios. Water 2019, 11, 273. [Google Scholar] [CrossRef]
  50. Mokhtar, E.S.; Pradhan, B.; Ghazali, A.H.; Shafri, H.Z.M. Assessing flood inundation mapping through estimated discharge using GIS and HEC-RAS model. Arab. J. Geosci. 2018, 11, 682. [Google Scholar] [CrossRef]
  51. Olsen, A.S.; Zhou, Q.; Linde, J.J.; Arnbjerg-Nielsen, K. Comparing Methods of Calculating Expected Annual Damage in Urban Pluvial Flood Risk Assessments. Water 2015, 7, 255–270. [Google Scholar] [CrossRef]
  52. Muis, S.; Güneralp, B.; Jongman, B.; Aerts, J.C.; Ward, P.J. Flood risk and adaptation strategies under climate change and urban expansion: A probabilistic analysis using global data. Sci. Total Environ. 2015, 538, 445–457. [Google Scholar] [CrossRef]
  53. Karimi, H.; Jafarnezhad, J.; Khaledi, J.; Ahmadi, P. Monitoring and prediction of land use/land cover changes using CA-Markov model: A case study of Ravansar County in Iran. Arab. J. Geosci. 2018, 11, 592. [Google Scholar] [CrossRef]
  54. Kaykhosravi, S.; Khan, U.T.; Jadidi, M.A. The Effect of Climate Change and Urbanization on the Demand for Low Impact Development for Three Canadian Cities. Water 2020, 12, 1280. [Google Scholar] [CrossRef]
  55. Wu, J.; Liu, H.; Wei, G.; Song, T.; Zhang, C.; Zhou, H. Flash Flood Forecasting Using Support Vector Regression Model in a Small Mountainous Catchment. Water 2019, 11, 1327. [Google Scholar] [CrossRef]
  56. Recanatesi, F.; Petroselli, A. Land Cover Change and Flood Risk in a Peri-Urban Environment of the Metropolitan Area of Rome (Italy). Water Resour. Manag. 2020, 34, 4399–4413. [Google Scholar] [CrossRef]
  57. Lee, S.; Kim, J.-C.; Jung, H.-S.; Lee, M.J.; Lee, S. Spatial prediction of flood susceptibility using random-forest and boosted-tree models in Seoul metropolitan city, Korea. Geomat. Nat. Hazards Risk 2017, 8, 1185–1203. [Google Scholar] [CrossRef]
  58. Hoque, M.A.-A.; Tasfia, S.; Ahmed, N.; Pradhan, B. Assessing Spatial Flood Vulnerability at Kalapara Upazila in Bangladesh Using an Analytic Hierarchy Process. Sensors 2019, 19, 1302. [Google Scholar] [CrossRef]
  59. Yang, T.-H.; Liu, W.-C. A General Overview of the Risk-Reduction Strategies for Floods and Droughts. Sustainability 2020, 12, 2687. [Google Scholar] [CrossRef]
  60. Seleem, O.; Ayzel, G.; de Souza, A.C.T.; Bronstert, A.; Heistermann, M. Towards urban flood susceptibility mapping using data-driven models in Berlin, Germany. Geomat. Nat. Hazards Risk 2022, 13, 1640–1662. [Google Scholar] [CrossRef]
  61. Munawar, H.S.; Hammad, A.W.; Waller, S.T. A review on flood management technologies related to image processing and machine learning. Autom. Constr. 2021, 132, 103916. [Google Scholar] [CrossRef]
  62. Petroselli, A.; Vojtek, M.; Vojteková, J. Flood mapping in small ungauged basins: A comparison of different approaches for two case studies in Slovakia. Hydrol. Res. 2019, 50, 379–392. [Google Scholar] [CrossRef]
  63. Talukdar, S.; Ghose, B.; Shahfahad; Salam, R.; Mahato, S.; Pham, Q.B.; Linh, N.T.T.; Costache, R.; Avand, M. Flood susceptibility modeling in Teesta River basin, Bangladesh using novel ensembles of bagging algorithms. Stoch. Environ. Res. Risk Assess. 2020, 34, 2277–2300. [Google Scholar] [CrossRef]
  64. Mahdizadeh Gharakhanlou, N.; Perez, L. Spatial Prediction of Current and Future Flood Susceptibility: Examining the Implications of Changing Climates on Flood Susceptibility Using Machine Learning Models. Entropy 2022, 24, 1630. [Google Scholar] [CrossRef] [PubMed]
  65. Al Baky, A.; Islam, M.; Paul, S. Flood Hazard, Vulnerability and Risk Assessment for Different Land Use Classes Using a Flow Model. Earth Syst. Environ. 2020, 4, 225–244. [Google Scholar] [CrossRef]
  66. Fan, J.; Galoie, M.; Motamedi, A.; Huang, J. Assessment of land cover resolution impact on flood modeling uncertainty. Hydrol. Res. 2021, 52, 78–90. [Google Scholar] [CrossRef]
  67. Zhou, Q.; Chen, L.; Singh, V.P.; Zhou, J.; Chen, X.; Xiong, L. Rainfall-runoff simulation in karst dominated areas based on a coupled conceptual hydrological model. J. Hydrol. 2019, 573, 524–533. [Google Scholar] [CrossRef]
  68. Khattak, M.S.; Anwar, F.; Saeed, T.U.; Sharif, M.; Sheraz, K.; Ahmed, A. Floodplain Mapping Using HEC-RAS and ArcGIS: A Case Study of Kabul River. Arab. J. Sci. Eng. 2016, 41, 1375–1390. [Google Scholar] [CrossRef]
  69. Prafulkumar, V.T.; Prem Lal, P.; Prakash, D.P. Calibration of HEC-RAS model on prediction of flood for lower Tapi River, India. J. Water Resour. Prot. 2011, 3, 805–811. [Google Scholar]
  70. Zabihi, M.; Mirchooli, F.; Motevalli, A.; Darvishan, A.K.; Pourghasemi, H.R.; Zakeri, M.A.; Sadighi, F. Spatial modelling of gully erosion in Mazandaran Province, northern Iran. Catena 2018, 161, 1–13. [Google Scholar] [CrossRef]
  71. Jang, J.-H.; Vohnicky, P.; Kuo, Y.-L. Improvement of Flood Risk Analysis Via Downscaling of Hazard and Vulnerability Maps. Water Resour. Manag. 2021, 35, 2215–2230. [Google Scholar] [CrossRef]
  72. Alipour, A.; Ahmadalipour, A.; Abbaszadeh, P.; Moradkhani, H. Leveraging machine learning for predicting flash flood damage in the Southeast US. Environ. Res. Lett. 2020, 15, 024011. [Google Scholar] [CrossRef]
  73. Narayan, S.; Beck, M.W.; Wilson, P.; Thomas, C.J.; Guerrero, A.; Shepard, C.C.; Reguero, B.G.; Franco, G.; Ingram, J.C.; Trespalacios, D. The Value of Coastal Wetlands for Flood Damage Reduction in the Northeastern USA. Sci. Rep. 2017, 7, 9463. [Google Scholar] [CrossRef] [PubMed]
  74. Rivas-Tabares, D.; Tarquis, A.M.; De Miguel, Á.; Gobin, A.; Willaarts, B. Enhancing LULC scenarios impact assessment in hydrological dynamics using participatory mapping protocols in semiarid regions. Sci. Total Environ. 2022, 803, 149906. [Google Scholar] [CrossRef] [PubMed]
  75. Dano, U.L.; Balogun, A.-L.; Matori, A.-N.; Yusouf, K.W.; Abubakar, I.R.; Mohamed, M.A.S.; Aina, Y.A.; Pradhan, B. Flood Susceptibility Mapping Using GIS-Based Analytic Network Process: A Case Study of Perlis, Malaysia. Water 2019, 11, 615. [Google Scholar] [CrossRef]
  76. Mandal, S.P.; Chakrabarty, A. Flash flood risk assessment for upper Teesta river basin: Using the hydrological modeling system (HEC-HMS) software. Model. Earth Syst. Environ. 2016, 2, 59. [Google Scholar] [CrossRef]
  77. Tehrany, M.S.; Pradhan, B.; Jebur, M.N. Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS. J. Hydrol. 2013, 504, 69–79. [Google Scholar] [CrossRef]
  78. Riley, S.J.; DeGloria, S.D.; Elliot, R. Index that quantifies topographic heterogeneity. Intermt. J. Sci. 1999, 5, 23–27. [Google Scholar]
  79. Peng, Y.; Wang, Q.; Wang, H.; Lin, Y.; Song, J.; Cui, T.; Fan, M. Does landscape pattern influence the intensity of drought and flood? Ecol. Indic. 2019, 103, 173–181. [Google Scholar] [CrossRef]
  80. Mahmoud, S.H.; Gan, T.Y. Multi-criteria approach to develop flood susceptibility maps in arid regions of Middle East. J. Clean. Prod. 2018, 196, 216–229. [Google Scholar] [CrossRef]
  81. Yousefi, S.; Pourghasemi, H.R.; Emami, S.N.; Rahmati, O.; Tavangar, S.; Pouyan, S.; Tiefenbacher, J.P.; Shamsoddini, S.; Nekoeimehr, M. Assessing the susceptibility of schools to flood events in Iran. Sci. Rep. 2020, 10, 18114. [Google Scholar] [CrossRef]
  82. Islam, A.R.M.T.; Talukdar, S.; Mahato, S.; Kundu, S.; Eibek, K.U.; Pham, Q.B.; Kuriqi, A.; Linh, N.T.T. Flood susceptibility modelling using advanced ensemble machine learning models. Geosci. Front. 2021, 12, 101075. [Google Scholar] [CrossRef]
  83. Nkeki, F.N.; Bello, E.I.; Agbaje, I.G. Flood risk mapping and urban infrastructural susceptibility assessment using a GIS and analytic hierarchical raster fusion approach in the Ona River Basin, Nigeria. Int. J. Disaster Risk Reduct. 2022, 77, 103097. [Google Scholar] [CrossRef]
Figure 1. The location of the study area and the flooded and non-flooded points’ distribution.
Figure 1. The location of the study area and the flooded and non-flooded points’ distribution.
Water 16 03354 g001
Figure 2. The conceptual framework of the methodology used in this study.
Figure 2. The conceptual framework of the methodology used in this study.
Water 16 03354 g002
Figure 3. Influencing flood factor maps: (a) slope, (b) aspect, (c) altitude, (d) TWI, (e) TPI, (f) TRI, (g) soil, (h) rainfall, (i) drainage density, (j) distance from river, (k) lithology, and (l) LULC.
Figure 3. Influencing flood factor maps: (a) slope, (b) aspect, (c) altitude, (d) TWI, (e) TPI, (f) TRI, (g) soil, (h) rainfall, (i) drainage density, (j) distance from river, (k) lithology, and (l) LULC.
Water 16 03354 g003aWater 16 03354 g003bWater 16 03354 g003c
Figure 4. A selected portion of the Tajan watershed for studying flood hazards and damages (a); images of the flood consequences in 2019 in the Tajan watershed (b) [23].
Figure 4. A selected portion of the Tajan watershed for studying flood hazards and damages (a); images of the flood consequences in 2019 in the Tajan watershed (b) [23].
Water 16 03354 g004
Figure 5. The yearly maximum discharge data from 1989 to 2020 upstream and downstream of the Tajan River.
Figure 5. The yearly maximum discharge data from 1989 to 2020 upstream and downstream of the Tajan River.
Water 16 03354 g005
Figure 6. Depth–damage curves adapted from [48].
Figure 6. Depth–damage curves adapted from [48].
Water 16 03354 g006
Figure 7. Land use land cover maps of (a) 2001, (b) 2011, and (c) 2021.
Figure 7. Land use land cover maps of (a) 2001, (b) 2011, and (c) 2021.
Water 16 03354 g007
Figure 8. Predicted land use land cover maps in 2040.
Figure 8. Predicted land use land cover maps in 2040.
Water 16 03354 g008
Figure 9. Ranking flood influencing factors’ importance for LULC scenarios in (a) 2021 and (b) 2040.
Figure 9. Ranking flood influencing factors’ importance for LULC scenarios in (a) 2021 and (b) 2040.
Water 16 03354 g009
Figure 10. ROC-AUC curve of RF model utilizing (a) the training dataset and (b) the validation dataset based on 2021 and 2040 LULC scenarios.
Figure 10. ROC-AUC curve of RF model utilizing (a) the training dataset and (b) the validation dataset based on 2021 and 2040 LULC scenarios.
Water 16 03354 g010
Figure 11. Flood vulnerability maps derived from RF in two scenarios: (a) scenario 2021 and (b) scenario 2040.
Figure 11. Flood vulnerability maps derived from RF in two scenarios: (a) scenario 2021 and (b) scenario 2040.
Water 16 03354 g011
Figure 12. Area of generated flood vulnerability regions: (a) scenario 2021; (b) scenario 2040.
Figure 12. Area of generated flood vulnerability regions: (a) scenario 2021; (b) scenario 2040.
Water 16 03354 g012
Figure 13. The simulated depth and inundation extent for return periods of 1000 years (a); the amount of each LULC class in the selected portion of the Tajan watershed from 2021 to 2040 (b); the simulated peak discharge and maximum depth at different return periods (c); the simulated food inundation extent at various return periods (d).
Figure 13. The simulated depth and inundation extent for return periods of 1000 years (a); the amount of each LULC class in the selected portion of the Tajan watershed from 2021 to 2040 (b); the simulated peak discharge and maximum depth at different return periods (c); the simulated food inundation extent at various return periods (d).
Water 16 03354 g013
Figure 14. Comparison of simulated and observed depths (m) at upstream and downstream stations.
Figure 14. Comparison of simulated and observed depths (m) at upstream and downstream stations.
Water 16 03354 g014
Figure 15. Flood damages estimation at various return periods under LULC scenarios: (a) built-up area; (b) agricultural land.
Figure 15. Flood damages estimation at various return periods under LULC scenarios: (a) built-up area; (b) agricultural land.
Water 16 03354 g015
Figure 16. Probability of exceedance curves: (a) built-up area; (b) agricultural land.
Figure 16. Probability of exceedance curves: (a) built-up area; (b) agricultural land.
Water 16 03354 g016
Figure 17. Total expected annual damage (EAD) assessment based on LULC scenarios for agricultural land and built-up areas in 2021 and 2040.
Figure 17. Total expected annual damage (EAD) assessment based on LULC scenarios for agricultural land and built-up areas in 2021 and 2040.
Water 16 03354 g017
Table 1. Satellite image features.
Table 1. Satellite image features.
Landsat SensorImage Generated DateWRS RowWRS PathSpatial ResolutionNumber of Bands UsedDatum
ETM+30 July 200135163306WGS84
TM28 March 201135163306WGS84
OLI27 June 202135163308WGS84
Table 2. Detailed information of the hydrological stations.
Table 2. Detailed information of the hydrological stations.
Station NameRiverStation CodeLatitudeLongitudeElevation (m)
Soleiman-TangeTajan1301936-15-1253-13-11400
BalakoolaTajan1303536-27-3153-07-15115
Table 3. The cost of max-damage to the built-up and agriculture land with the crops in two LULC scenarios.
Table 3. The cost of max-damage to the built-up and agriculture land with the crops in two LULC scenarios.
ScenarioLULC TypeMax Damage
LULC (2021)Built-up area60 USD/sq.m.
Agricultural land and crops863 USD/hectare
LULC (2040)Built-up160 USD/sq.m.
Agricultural land and crops2760 USD/hectare
Table 4. The accuracy of the land use land cover classification of images in different years.
Table 4. The accuracy of the land use land cover classification of images in different years.
Assessment IndexesLULC2001LULC2011LULC 2021
Cohen’s kappa index0.810.830.81
Overall accuracy (%)88.7492.6790.81
Table 5. General features of various LULC conversion models from 2011–2021.
Table 5. General features of various LULC conversion models from 2011–2021.
Sub-ModelIndependent Factors (Numbers)Factor’s MomentumInteractionsNumber of Hidden LayersAccuracy Rate (%)
Forest–Agriculture50.3510,000797.3
Forest–Water Body50.206000781.7
Forest–Built-Up50.278000586.7
Agriculture–Built-Up50.3210,000684.3
Rangeland–Agriculture50.247000696.5
Table 6. Alterations in LULC area at various periods from 2001 to 2040.
Table 6. Alterations in LULC area at various periods from 2001 to 2040.
LULC TypeLULC
Area till 2001
LULC
Area till 2011
LULC
Area till 2021
LULC
Area till 2040
Rate of LULC Changes 2021–2040 (Km2)Percentage of Changes
2021–2040 (%)
Built-up30.3239.6448.564.3415.84+0.42
Agriculture417.7597.9766.91040.2273.3+7.2
Forest1990.11876.971788.61700.20−88.4−2.3
Water Body3.63.94.14.60.5+0.01
Rangeland13601283.31193.6992.37−201.23−5.88
Table 7. Multicollinearity assessment test.
Table 7. Multicollinearity assessment test.
Flood Influencing FactorsVIFTOL
Slope1.020.97
Aspect1.40.94
Altitude3.720.20
TPI1.810.87
TRI1.640.62
Information value of Soil Texture2.160.46
lithology2.070.51
Drainage Density1.390.78
Distance from River1.50.69
Rainfall3.690.28
TWI1.430.81
LULC1.090.94
Table 8. The validation of flood vulnerability models through standard numerical indices.
Table 8. The validation of flood vulnerability models through standard numerical indices.
IndexTraining Dataset (Scenario 2021)Validation Dataset
(Scenario 2021)
Training Dataset
(Scenario 2040)
Validation Dataset
(Scenario 2040)
PPV (%)0.820.790.870.83
NPV (%)0.870.870.900.89
SST (%)0.910.880.920.91
SPF (%)0.850.780.890.87
R2 (%)0.890.890.910.90
RMSD0.330.370.280.31
AUC (%)0.920.900.950.92
Table 9. Model performance statistics during calibration and validation at two stations.
Table 9. Model performance statistics during calibration and validation at two stations.
PerformanceStationsNSERMSDR
CalibrationSoleiman-Tange0.800.0150.91
Balakoola0.790.0180.90
ValidationSoleiman-Tange0.830.0190.89
Balakoola0.820.0160.88
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gholami, F.; Li, Y.; Zhang, J.; Nemati, A. Quantitative Assessment of Future Environmental Changes in Hydrological Risk Components: Integration of Remote Sensing, Machine Learning, and Hydraulic Modeling. Water 2024, 16, 3354. https://doi.org/10.3390/w16233354

AMA Style

Gholami F, Li Y, Zhang J, Nemati A. Quantitative Assessment of Future Environmental Changes in Hydrological Risk Components: Integration of Remote Sensing, Machine Learning, and Hydraulic Modeling. Water. 2024; 16(23):3354. https://doi.org/10.3390/w16233354

Chicago/Turabian Style

Gholami, Farinaz, Yue Li, Junlong Zhang, and Alireza Nemati. 2024. "Quantitative Assessment of Future Environmental Changes in Hydrological Risk Components: Integration of Remote Sensing, Machine Learning, and Hydraulic Modeling" Water 16, no. 23: 3354. https://doi.org/10.3390/w16233354

APA Style

Gholami, F., Li, Y., Zhang, J., & Nemati, A. (2024). Quantitative Assessment of Future Environmental Changes in Hydrological Risk Components: Integration of Remote Sensing, Machine Learning, and Hydraulic Modeling. Water, 16(23), 3354. https://doi.org/10.3390/w16233354

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop