Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection
Abstract
:1. Introduction
2. Material and Methods
2.1. Study Area
2.2. Data Set
2.3. Predictor Variables of Gully Erosion Susceptibility
2.3.1. Slope Angle
2.3.2. Slope Aspect
2.3.3. Elevation
2.3.4. Soil Texture
2.3.5. Length-Slope Factor
2.3.6. Plan Curvature
2.3.7. Topographic Wetness Index
2.3.8. Land Use
2.3.9. Lithology
2.3.10. Drainage Density
2.3.11. Distance to River
2.4. Collinearity between Independent Variables
2.5. Boruta Variable Selection Algorithm
- (1)
- The information system is extended by generating the shadow attributes (at least five for each attribute).
- (2)
- The random forest algorithms are run on each copy of the new dataset and the Z-scores are computed.
- (3)
- The maximum Z-score (MZS) of shadow attributes is computed.
- (4)
- The importance of each attribute is compared with the MZS.
- (5)
- The attributes with importance significantly lower than MZS are removed (considered as unimportant), and those with importance significantly higher than MZS are considered as important.
- (6)
- All shadow attributes are removed and the procedure is repeated until the importance is assigned to all attributes.
2.6. Investigating the Relationship between Gully Erosion and Conditioning Factors
2.7. Modeling Gully Erosion Susceptibility
2.7.1. Support Vector Machine
2.7.2. Random Forest Model
2.7.3. Multiple Discriminant Analysis
2.7.4. Ensemble Model
2.7.5. Evaluation Model Performance
3. Results
3.1. Multi-Collinearity Test
3.2. Gully Erosion Susceptibility Maps (GESM)
3.3. Evaluation of GESMs Performance
3.4. Computing Variable Importance Using Boruta Algorithm
3.5. Evaluating the Relationship among Conditioning Factors and Gully Erosion Using the Evidential Belief Function (EBF) Model
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
AGNPS | Agricultural Non-point Source | AGNPS is a computer-simulation model that simulates the behavior of runoff, sediment, and nutrient transport from watersheds that have agriculture as their prime use. The model operates on a cell basis and is a distributed parameter, event-based model. |
AUC of ROC | Area-under-the-curve of Receiver Operating Characteristics | The AUC statistics describe the area under the ROC curve and are used as a measure of classification accuracy. The greater AUC indicates a better classification result. |
Bel | Belief | Belief is one of the functions that is used in EBF model. It is the pessimistic measures of the spatial relationship of response variable, i.e., Bel indicates the lower probabilities of evidence that supports a hypothesis. |
DEM | Digital Elevation Model | A digital elevation model (DEM) is a representation of the bare ground (bare earth) topographic surface of the Earth excluding trees, mountains, buildings, and any other surface objects. |
Dis | Disbelief | Disbelief is one of the functions that is used in EBF model. It is a degree of disbelief in evidence for the hypothesis and the Dis value is obtained from 1–Pls or 1-Unc–Bel. |
EBF | Evidential Belief Function | The evidential belief function (EBF) algorithm describes the correlation among predictor variables and response variable. The statistical EBF model is computed based on Dempster-Shafer’s theory to combine the representations of several independent variables to achieve a combined measure of belief. |
GIS | Geographic Information System | A geographic information system (GIS) is a conceptualized framework that provides the ability to capture and analyze spatial and geographic data. |
GLM | Generalized Linear Model | GLM is the extension of the classic linear regression model. Contrasted with the normal linear model, the response variables of GLM are not confined to normal distribution, and these response variables can also obey binomial or Poisson distributions. In addition, the link function is introduced into GLM to establish the relationship between the expectation of the response variable and the linear combination of explanatory variables. |
GPS | Global Positioning System | GPS (Global Positioning System) is a radio wave receiver used to provide coordinates that give the exact position of an element in a certain space. |
LiDAR | Light Detection and Ranging | LiDAR is a method for determining ranges (variable distance) by targeting an object with a laser and measuring the time for the reflected light to return to the receiver. |
LISEM | Limburg Soil Erosion Model | The Limburg soil erosion model (LISEM) is a physically-based hydrological and soil erosion model which can be used for planning and conservation purposes. |
MaxEnt | Maximum Entropy | MaxEnt is a data mining method to predict the occurrence of one event based on maximum entropy that approximates the probability distribution of presence data based on environmental limitations. |
MDA | Multiple Diagnostic Analysis | Multiple discriminant analysis is a classification approach that is used to predict categorical responses. This model, also known as the Fisher discriminant analysis, is defined based on Bayes’ theorem. The MDA attempts to estimate the conditional probability and the predictors are assumed to follow a multivariate normal distribution. |
Pls | Plausibility | Plausibility is one of the functions that is used in EBF model. It is the optimistic measures of the spatial relationship of response variable, i.e., Pls indicates the upper probabilities of evidence that supports a hypothesis. |
RF | Random Forest | Random forest (RF) is a classification approach that is obtained based on the improvement of bagging (bootstrap aggregation) trees. |
RUSLE | Revised Universal Soil Loss Equation | RUSLE is an easily and widely used model that estimates rates of soil erosion caused by rainfall and associated overland flow. |
SAR | Synthetic Aperture Radar | Synthetic aperture radar (SAR) is a form of radar that is used to create two-dimensional images or three-dimensional reconstructions of objects, such as landscapes. SAR uses the motion of the radar antenna over a target region to provide finer spatial resolution than conventional stationary beam-scanning radars. |
SVM | Support Vector Machine | The support vector machine (SVM) is a supervised learning approach that is used for classification or regression modeling. This method is defined based on statistical learning theory and uses the structural risk minimization (SRM) method to obtain an optimized solution. |
TOL | Tolerance | Tolerance is relevant and frequently used quantities that may be consulted to examine individual predictors for potentially strong contributions to (near) multicollinearity. This index reflects estimates of the degree of interrelationship of an independent variable with other explanatory variables in a regression model. The TOL less than 0.1 indicates that there exists a collinearity problem among predictor variables. |
TSS | True Skill Statistic | The true skill statistic (TSS) is known as Hanssen–Kuipers discriminant, and is commonly measure for evaluating classification accuracy. The true skill statistics is defined based on the components of the standard confusion matrix representing matches and mismatches between observations and predictions. |
TWI | Topographic Wetness Index | The topographic wetness index (TWI) is a physically-based index of the effect of local topography on runoff flow direction and accumulation. The index is a function of both the slope and the upstream contributing area. The computation of TWI is performed using both geographic information systems (GIS) and Python, a programing software used to enhance computing capabilities. The indices help identify rainfall runoff patterns, areas of potential increased soil moisture, and ponding areas. |
Unc | Uncertainty | Unc value is one of the functions that is used in EBF model. It is the difference between the Pls and Bel function, which shows the ignorance or doubt that the evidence supports a hypothesis. |
VIF | Variance Inflation Factor | Variance inflation factor measures how much the behavior (variance) of an independent variable is influenced, or inflated, by its interaction/correlation with the other independent variables. Variance inflation factors allow a quick measure of how much a variable is contributing to the standard error in the regression. VIF is the reciprocal of Tolerance. The VIF above 10 indicates that there exists a collinearity problem among predictor variables. |
References
- Liu, G.; Zheng, F.; Wilson, G.V.; Xu, X.; Liu, C. Three decades of ephemeral gully erosion studies. Soil Tillage Res. 2021, 212, 105046. [Google Scholar] [CrossRef]
- Conforti, M.; Aucelli, P.P.C.; Robustelli, G.; Scarciglia, F. Geomorphology and GIS analysis for mapping gully erosion susceptibility in the Turbolo stream catchment (Northern Calabria, Italy). Nat. Hazards 2011, 56, 881–898. [Google Scholar] [CrossRef]
- Mokarram, M.; Zarei, A.R. Determining prone areas to gully erosion and the impact of land use change on it by using multiple-criteria decision-making algorithm in arid and semi-arid regions. Geoderma 2021, 403, 115379. [Google Scholar] [CrossRef]
- Conoscenti, C.; Rotigliano, E. Predicting gully occurrence at watershed scale: Comparing topographic indices and multivariate statistical models. Geomorphology 2020, 359, 107123. [Google Scholar] [CrossRef]
- Gómez-Gutiérrez, A.; Conoscenti, C.; Angileri, S.E.; Rotigliano, E.; Schnabel, S. Using topographical attributes to evaluate gully erosion proneness (susceptibility) in two mediterranean basins: Advantages and limitations. Nat. Hazards 2015, 79, 291–314. [Google Scholar] [CrossRef]
- Amiri, M.; Pourghasemi, H.R.; Ghanbarian, G.A.; Afzali, S.F. Assessment of the importance of gully erosion effective factors using Boruta algorithm and its spatial modeling and mapping using three machine learning algorithms. Geoderma 2019, 340, 55–69. [Google Scholar] [CrossRef]
- Lei, P.; Shrestha, R.; Zhu, B.; Han, S.; Yang, H.; Tan, S.; Ni, J.; Xie, D. A Bibliometric Analysis on Nonpoint Source Pollution: Current Status, Development, and Future. Int. J. Environ. Res. Public Healthy 2021, 18, 7723. [Google Scholar] [CrossRef] [PubMed]
- Hessel, R. Effects of grid cell size and time step length on simulation results of the Limburg soil erosion model (LISEM). Hydrol. Process. 2005, 19, 3037–3049. [Google Scholar] [CrossRef]
- Ferreira, V.; Panagopoulos, T. Seasonality of Soil Erosion Under Mediterranean Conditions at the Alqueva Dam Watershed. Environ. Manag. 2014, 54, 67–83. [Google Scholar] [CrossRef]
- Ferreira, V.; Samora-Arvela, A.; Panagopoulos, T. Soil erosion vulnerability under scenarios of climate land-use changes after the development of a large reservoir in a semi-arid area. J. Environ. Plan. Manag. 2016, 59, 1238–1256. [Google Scholar] [CrossRef]
- Zare, M.; Panagopoulos, T.; Loures, L. Simulating the impacts of future land use change on soil erosion in the Kasilian watershed, Iran. Land Use Policy 2017, 67, 558–572. [Google Scholar] [CrossRef]
- Fiorucci, F.; Ardizzone, F.; Rossi, M.; Torri, D. The Use of Stereoscopic Satellite Images to Map Rills and Ephemeral Gullies. Remote Sens. 2015, 7, 14151–14178. [Google Scholar] [CrossRef] [Green Version]
- Bingner, R.L.; Wells, R.R.; Momm, H.G.; Rigby, J.R.; Theurer, F.D. Ephemeral gully channel width and erosion simulation technology. Nat. Hazards 2015, 80, 1949–1966. [Google Scholar] [CrossRef]
- Rahman, R.; Shi, Z.; Chongfa, C. Soil erosion hazard evaluation—An integrated use of remote sensing, GIS and statistical approaches with biophysical parameters towards management strategies. Ecol. Model. 2009, 220, 1724–1734. [Google Scholar] [CrossRef]
- Soleimanpour, S.M.; Pourghasemi, H.R.; Zare, M. A comparative assessment of gully erosion spatial predictive modeling using statistical and machine learning models. Catena 2021, 207, 105679. [Google Scholar] [CrossRef]
- Hembram, T.K.; Paul, G.C.; Saha, S. Spatial prediction of susceptibility to gully erosion in Jainti River basin, Eastern India: A comparison of information value and logistic regression models. Model. Earth Syst. Environ. 2018, 5, 689–708. [Google Scholar] [CrossRef]
- Arabameri, A.; Pradhan, B.; Rezaei, K. Spatial prediction of gully erosion using ALOS PALSAR data and ensemble bivariate and data mining models. Geosci. J. 2019, 23, 669–686. [Google Scholar] [CrossRef]
- Pourghasemi, H.R.; Sadhasivam, N.; Kariminejad, N.; Collins, A. Gully erosion spatial modelling: Role of machine learning algorithms in selection of the best controlling factors and modelling process. Geosci. Front. 2020, 11, 2207–2219. [Google Scholar] [CrossRef]
- Saha, S.; Roy, J.; Arabameri, A.; Blaschke, T.; Bui, D.T. Machine Learning-Based Gully Erosion Susceptibility Mapping: A Case Study of Eastern India. Sensors 2020, 20, 1313. [Google Scholar] [CrossRef] [Green Version]
- Javidan, N.; Kavian, A.; Pourghasemi, H.R.; Conoscenti, C.; Jafarian, Z. Data Mining Technique (Maximum Entropy Model) for Mapping Gully Erosion Susceptibility in the Gorganrood Watershed, Iran. In Gully Erosion Studies from India and Surrounding Regions; Springer: Cham, Switzerland, 2020; pp. 427–448. [Google Scholar]
- Rivera, J.I.; Bonilla, C.A. Predicting soil aggregate stability using readily available soil properties and machine learning techniques. Catena 2020, 187, 104408. [Google Scholar] [CrossRef]
- Elith, J.; Leathwick, J.R.; Hastie, T. A working guide to boosted regression trees. J. Anim. Ecol. 2008, 77, 802–813. [Google Scholar] [CrossRef] [PubMed]
- Etemadi, H.; Rostamy, A.A.A.; Dehkordi, H.F. A genetic programming model for bankruptcy prediction: Empirical evidence from Iran. Expert Syst. Appl. 2009, 36, 3199–3207. [Google Scholar] [CrossRef]
- Gupta, L.D.; Malviya, A.K.; Singh, S. Performance Analysis of Classification Tree Learning Algorithms. Int. J. Comput. Appl. 2012, 55, 39–44. [Google Scholar] [CrossRef]
- Chen, W.; Lei, X.; Chakrabortty, R.; Pal, S.C.; Sahana, M.; Janizadeh, S. Evaluation of different boosting ensemble machine learning models and novel deep learning and boosting framework for head-cut gully erosion susceptibility. J. Environ. Manag. 2021, 284, 112015. [Google Scholar] [CrossRef]
- Arabameri, A.; Yamani, M.; Pradhan, B.; Melesse, A.; Shirani, K.; Bui, D.T. Novel ensembles of COPRAS multi-criteria decision-making with logistic regression, boosted regression tree, and random forest for spatial prediction of gully erosion susceptibility. Sci. Total. Environ. 2019, 688, 903–916. [Google Scholar] [CrossRef]
- Kursa, M.; Rudnicki, W. Feature Selection with theBorutaPackage. J. Stat. Softw. 2010, 36, 1–13. [Google Scholar] [CrossRef] [Green Version]
- Azhdari, Z.; Sardooi, E.R.; Bazrafshan, O.; Zamani, H.; Singh, V.P.; Saravi, M.M.; Ramezani, M. Impact of climate change on net primary production (NPP) in south Iran. Environ. Monit. Assess. 2020, 192, 1–16. [Google Scholar] [CrossRef]
- Department of Water Resource Management of Iran (DWRMI). Report of Natural Resources Management; Ministry of Energy of Iran: Tehran, Iran, 2012; 245p.
- Statistical Center of Iran. Available online: https://www.amar.org.ir/english/Population-and-Housing-Censuses (accessed on 20 August 2016).
- Cama, M.; Conoscenti, C.; Lombardo, L.; Rotigliano, E. Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: A test in the Giampilieri catchment (Sicily, Italy). Environ. Earth Sci. 2016, 75, 1–21. [Google Scholar] [CrossRef]
- Azareh, A.; Rahmati, O.; Rafiei-Sardooi, E.; Sankey, J.B.; Lee, S.; Shahabi, H.; Bin Ahmad, B. Modelling gully-erosion susceptibility in a semi-arid region, Iran: Investigation of applicability of certainty factor and maximum entropy models. Sci. Total Environ. 2019, 655, 684–696. [Google Scholar] [CrossRef]
- Arabameri, A.; Chen, W.; Loche, M.; Zhao, X.; Li, Y.; Lombardo, L.; Cerda, A.; Pradhan, B.; Bui, D.T. Comparison of machine learning models for gully erosion susceptibility mapping. Geosci. Front. 2020, 11, 1609–1620. [Google Scholar] [CrossRef]
- Tao, Y.; Zou, Z.; Guo, L.; He, Y.; Lin, L.; Lin, H.; Chen, J. Linking soil macropores, subsurface flow and its hydrodynamic characteristics to the development of Benggang erosion. J. Hydrol. 2020, 586, 124829. [Google Scholar] [CrossRef]
- Kumar, R.; Anbalagan, R. Landslide susceptibility zonation in part of Tehri reservoir region using frequency ratio, fuzzy logic and GIS. J. Earth Syst. Sci. 2015, 124, 431–448. [Google Scholar] [CrossRef]
- Gayen, A.; Pourghasemi, H.R.; Saha, S.; Keesstra, S.; Bai, S. Gully erosion susceptibility assessment and management of hazard-prone areas in India using different machine learning algorithms. Sci. Total Environ. 2019, 668, 124–138. [Google Scholar] [CrossRef] [PubMed]
- Auerswald, K.; Fiener, P.; Martin, W.; Elhaus, D. Use and misuse of the K factor equation in soil erosion modeling: An alternative equation for determining USLE nomograph soil erodibility values. Catena 2014, 118, 220–225. [Google Scholar] [CrossRef]
- Agricultural Research, Education and Extension Organization of Hormozgan, Bandar Abbas, Iran. Available online: http://hormozgan.areeo.ac.ir/fa-IR/hormozgan.areeo.ac/3853/page (accessed on 5 July 2019).
- Renard, K.G.; Foster, G.R.; Weesies, G.A.; McCool, D.K.; Yoder, D.C. Predicting Soil Erosion by Water: A Guide to Conservation Planning with the Revised Universal Soil Loss Equation (RUSLE); United States Government Printing: Washington, DC, USA, 1997; Volume 703, pp. 145–220.
- Wilson, J.P.; Gallant, J.C. Digital Terrain Analysis. Principles and Applications; John Wiley: New York, NY, USA, 2000. [Google Scholar]
- Qin, C.-Z.; Zhu, A.-X.; Pei, T.; Li, B.-L.; Scholten, T.; Behrens, T.; Zhou, C.-H. An approach to computing topographic wetness index based on maximum downslope gradient. Precis. Agric. 2011, 12, 32–43. [Google Scholar] [CrossRef]
- Waga, K.; Malinen, J.; Tokola, T. A Topographic Wetness Index for Forest Road Quality Assessment: An Application in the Lakeland Region of Finland. Forests 2020, 11, 1165. [Google Scholar] [CrossRef]
- Geological Survey of Iran [GSI]. Available online: http://www.gsi.ir/en (accessed on 25 August 2019).
- Glennon, A.; Groves, C. An examination of perennial stream drainage patterns within the Mammoth Cave watershed, Kentucky. J. Cave Karst Stud. 2002, 64, 82–91. [Google Scholar]
- Conoscenti, C.; Angileri, S.E.; Cappadonia, C.; Rotigliano, E.; Agnesi, V.; Maerker, M. Gully erosion susceptibility assessment by means of GIS-based logistic regression: A case of Sicily (Italy). Geomorphology 2014, 204, 399–411. [Google Scholar] [CrossRef] [Green Version]
- Greene, W.H. Econometric Analysis; Prentice Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
- Sánchez-Maroño, N.; Alonso-Betanzos, A.; Calvo-Estévez, R.M. A Wrapper Method for Feature Selection in Multiple Classes Datasets; Springer: Berlin/Heidelberg, Germany, 2009; pp. 456–463. [Google Scholar] [CrossRef]
- Kursa, M.B.; Jankowski, A.; Rudnicki, W. Boruta-A system for feature selection. Fundam. Inform. 2010, 101, 271–285. [Google Scholar] [CrossRef]
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
- Thiam, A.K. An Evidential Reasoning Approach to Land Degradation Evaluation: Dempster-Shafer Theory of Evidence. Trans. GIS 2005, 9, 507–520. [Google Scholar] [CrossRef]
- Althuwaynee, O.F.; Pradhan, B.; Park, H.J.; Lee, J.H. A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping. Catena 2014, 114, 21–36. [Google Scholar] [CrossRef]
- Vapnik, V.N. The Nature of Statistical Learning Theory, 2nd ed.; Springer: New York, NY, USA, 1999. [Google Scholar]
- Naimi, B.; Araújo, M.B. sdm: A reproducible and extensible R platform for species distribution modelling. Ecography 2016, 39, 368–375. [Google Scholar] [CrossRef] [Green Version]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: New York, NY, USA, 2013. [Google Scholar]
- Liaw, A.; Wiener, M. Classification and regression by random Forest. R News 2013, 2, 18–22. [Google Scholar]
- Wang, Y.; Brandt, M.; Zhao, M.; Tong, X.; Xing, K.; Xue, F.; Kang, M.; Wang, L.; Jiang, Y.; Fensholt, R. Major forest increase on the Loess Plateau, China (2001–2016). Land Degrad. Dev. 2018, 29, 4080–4091. [Google Scholar] [CrossRef]
- Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis; Prentice Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
- Zabihi, M.; Mirchooli, F.; Motevalli, A.; Darvishan, A.K.; Pourghasemi, H.R.; Zakeri, M.A.; Sadighi, F. Spatial modelling of gully erosion in Mazandaran Province, northern Iran. Catena 2018, 161, 1–13. [Google Scholar] [CrossRef]
- Schumann, G.J.-P.; Vernieuwe, H.; De Baets, B.; Verhoest, N.E.C. ROC-based calibration of flood inundation models. Hydrol. Process. 2014, 28, 5495–5502. [Google Scholar] [CrossRef]
- Evans, R.; Horstman, C.; Conzemius, M. Accuracy and Optimization of Force Platform Gait Analysis in Labradors with Cranial Cruciate Disease Evaluated at a Walking Gait. Veter-Surg. 2005, 34, 445–449. [Google Scholar] [CrossRef]
- Rahmati, O.; Falah, F.; Naghibi, S.A.; Biggs, T.; Soltani, M.; Deo, R.C.; Cerdà, A.; Mohammadi, F.; Bui, D.T. Land subsidence modelling using tree-based machine learning algorithms. Sci. Total Environ. 2019, 672, 239–252. [Google Scholar] [CrossRef]
- Allouche, O.; Tsoar, A.; Kadmon, R. Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS). J. Appl. Ecol. 2006, 43, 1223–1232. [Google Scholar] [CrossRef]
- Choi, Y.; Park, H.-D.; Sunwoo, C. Flood and gully erosion problems at the Pasir open pit coal mine, Indonesia: A case study of the hydrology using GIS. Bull. Eng. Geol. Environment. 2008, 67, 251–258. [Google Scholar] [CrossRef]
- Pourghasemi, H.R.; Yousefi, S.; Kornejady, A.; Cerdà, A. Performance assessment of individual and ensemble data-mining techniques for gully erosion modeling. Sci. Total Environ. 2017, 609, 764–775. [Google Scholar] [CrossRef] [Green Version]
- Hembram, T.K.; Paul, G.C.; Saha, S. Modelling of gully erosion risk using new ensemble of conditional probability and index of entropy in Jainti River basin of Chotanagpur Plateau Fringe Area, India. Appl. Geomat. 2020, 12, 337–360. [Google Scholar] [CrossRef]
- Dickson, J.L.; Head, J.W.; Kreslavsky, M. Martian gullies in the southern mid-latitudes of Mars: Evidence for climate-controlled formation of young fluvial features based upon local and global topography. Icarus 2007, 188, 315–323. [Google Scholar] [CrossRef]
- Raga, M.F.; Palencia, C.; Keesstra, S.; Jordán, A.; Fraile, R.; Angulo-Martinez, M.; Cerdà, A. Splash erosion: A review with unanswered questions. Earth-Sci. Rev. 2017, 171, 463–477. [Google Scholar] [CrossRef] [Green Version]
- Kheir, R.B.; Wilson, J.; Deng, Y. Use of terrain variables for mapping gully erosion susceptibility in Lebanon. Earth Surf. Process. Landforms 2007, 32, 1770–1782. [Google Scholar] [CrossRef]
- Shahrivar, A.; Christopher, T.B.S. The effects of soil physical characteristics on gully erosion development in Kohgiloyeh & Boyer Ahmad province, Iran. Adv. Environ. Biol. 2012, 397–406. [Google Scholar]
- Tsunekawa, A.; Liu, G.; Yamanaka, N.; Du, S. Restoration and Development of the Degraded Loess Plateau, China; Springer: Tokyo, Japan, 2014. [Google Scholar]
- Wang, Z.; Lai, C.; Chen, X.; Yang, B.; Zhao, S.; Bai, X. Flood hazard risk assessment model based on random forest. J. Hydrol. 2015, 527, 1130–1141. [Google Scholar] [CrossRef]
- Savory, A.; Duncan, T. Regenerating agriculture to sustain civilization. In Land Restoration; Chabay, I., Frick, M., Helgeson, J., Eds.; Academic Press: Cambridge, MA, USA, 2016; pp. 289–309. [Google Scholar]
- Al-Kaisi, M.M.; Lal, R. Aligning science and policy of regenerative agriculture. Soil Sci. Soc. Am. J. 2020, 84, 1808–1820. [Google Scholar] [CrossRef]
Factors | Variable Type | Scale |
---|---|---|
Soil texture | Categorical | 1:50,000 |
Elevation (meter) | Continuous | 30 × 30 m |
Distance to stream (meter) | Continuous | 30 × 30 m |
Drainage density (km/km2) | Continuous | 30 × 30 m |
Slope angle | Continuous | 30 × 30 m |
Slope aspect | Categorical | 30 × 30 m |
Land use | Categorical | 1:50,000 |
Lithology | Categorical | 1:50,000 |
Plan curvature | Continuous | 30 × 30 m |
Topographic wetness index (TWI) | Continuous | 30 × 30 m |
LS factor | Continuous | 30 × 30 m |
Factors | TOL | VIF |
---|---|---|
Slope aspect | 0.86 | 1.04 |
Distance to stream | 0.36 | 2.79 |
Drainage density | 0.37 | 2.68 |
Land use | 0.32 | 3.08 |
Lithology | 0.31 | 3.27 |
LS factor | 0.44 | 2.40 |
Plan curvature | 0.56 | 1.80 |
Slope angle | 0.39 | 2.65 |
Soil | 0.58 | 3.64 |
TWI | 0.77 | 1.29 |
Elevation | 0.77 | 2.65 |
Model | Value | Percentage | Area (km2) |
---|---|---|---|
RF | Low | 28.51 | 73.37 |
Moderate | 21.72 | 55.90 | |
High | 17.12 | 44.07 | |
Very high | 32.67 | 84.07 | |
Total area | 100 | 257.4 | |
SVM | Low | 12.07 | 31.06 |
Moderate | 21.23 | 54.64 | |
High | 19.48 | 50.13 | |
Very high | 47.22 | 121.52 | |
Total area | 100 | 257.4 | |
MDA | Low | 30.19 | 77.69 |
Moderate | 1.67 | 4.29 | |
High | 2.14 | 5.51 | |
Very high | 66.03 | 169.93 | |
Total area | 100 | 257.4 | |
Ensemble | Low | 28.88 | 74.32 |
Moderate | 9.34 | 24.04 | |
High | 23.83 | 61.33 | |
Very high | 37.96 | 97.69 | |
Total area | 100 | 257.4 |
Metrics | Ensemble | RF | SVM | MDA |
---|---|---|---|---|
AUC | 0.982 | 0.971 | 0.932 | 0.914 |
TSS | 0.93 | 0.91 | 0.84 | 0.82 |
Factors | Mean Importance | Median Importance | Min. Importance | Max. Importance | Decision |
---|---|---|---|---|---|
Distance to stream | 33.5 | 33.12 | 27.89 | 38.8 | Confirmed |
Land use | 17.41 | 17.22 | 15.62 | 19.12 | Confirmed |
Elevation | 12.18 | 12.04 | 10.06 | 14.29 | Confirmed |
Lithology | 7.34 | 7.15 | 5.47 | 9.26 | Confirmed |
Soil type | 4.6 | 4.21 | 2.86 | 6.43 | Confirmed |
Drainage density | 2.43 | 2.38 | 0.27 | 4.58 | Confirmed |
LS factor | 1.34 | 1.27 | 0.13 | 2.56 | Confirmed |
Slope | 0.57 | 0.53 | 0.045 | 1.08 | Confirmed |
Plan curvature | −1.44 | −1.49 | −2.96 | 0.07 | Rejected |
TWI | −1.98 | −1.96 | −3.63 | −0.34 | Rejected |
Aspect | −2.57 | −2.51 | −4.39 | −0.78 | Rejected |
Factor | Class | Bel | Dis | Unc | Pls |
---|---|---|---|---|---|
Elevation (meter) | 3–125 | 1.0 | 0.0 | 0.0 | 1.00 |
125–381 | 0.00 | 0.27 | 0.73 | 0.73 | |
381–697 | 0.00 | 0.25 | 0.75 | 0.75 | |
697–1014 | 0.00 | 0.25 | 0.75 | 0.75 | |
1014–1406 | 0.00 | 0.24 | 0.76 | 0.76 | |
Distance to stream (meter) | 0–276.5 | 0.51 | 0.12 | 0.36 | 0.88 |
276.5–611.8 | 0.26 | 0.20 | 0.54 | 0.80 | |
611.8–1055.13 | 0.23 | 0.24 | 0.52 | 0.76 | |
1055.13–1806.48 | 0.00 | 0.22 | 0.78 | 0.78 | |
1806.48–3118.41 | 0.00 | 0.21 | 0.79 | 0.79 | |
Slope aspect | Flat | 0.07 | 0.11 | 0.82 | 0.89 |
N | 0.10 | 0.11 | 0.79 | 0.89 | |
NE | 0.25 | 0.10 | 0.65 | 0.90 | |
E | 0.11 | 0.11 | 0.78 | 0.89 | |
SE | 0.11 | 0.11 | 0.78 | 0.89 | |
S | 0.09 | 0.12 | 0.80 | 0.88 | |
SW | 0.04 | 0.12 | 0.83 | 0.88 | |
W | 0.11 | 0.11 | 0.78 | 0.89 | |
NW | 0.11 | 0.11 | 0.78 | 0.89 | |
Land use | Barren lands | 0.71 | 0.11 | 0.19 | 0.89 |
Poor rangeland | 0.00 | 0.33 | 0.67 | 0.67 | |
Bare rock | 0.00 | 0.16 | 0.84 | 0.84 | |
Salt land | 0.00 | 0.16 | 0.84 | 0.84 | |
Shrublands | 0.29 | 0.09 | 0.62 | 0.91 | |
Residential lands | 0.00 | 0.16 | 0.84 | 0.84 | |
Soil texture | Loamy skeletal | 0.00 | 0.36 | 0.64 | 0.64 |
Coarse loamy | 0.06 | 0.17 | 0.78 | 0.83 | |
Fine silty | 0.94 | 0.18 | −0.12 | 0.82 | |
Fine loamy | 0.00 | 0.30 | 0.70 | 0.70 | |
TWI | 2.01–5 | 0.13 | 0.27 | 0.60 | 0.73 |
5–8 | 0.14 | 0.23 | 0.63 | 0.77 | |
8–12 | 0.28 | 0.26 | 0.46 | 0.74 | |
12–22.3 | 0.44 | 0.24 | 0.32 | 0.76 | |
Lithology | Aj | 0.00 | 0.14 | 0.86 | 0.86 |
Gs | 0.00 | 0.16 | 0.84 | 0.84 | |
Mn | 0.00 | 0.14 | 0.86 | 0.86 | |
Qaf | 0.13 | 0.14 | 0.73 | 0.86 | |
Qal | 0.16 | 0.14 | 0.70 | 0.86 | |
Qfp | 0.71 | 0.02 | 0.28 | 0.98 | |
Qp | 0.00 | 0.13 | 0.87 | 0.87 | |
Sd | 0.00 | 0.13 | 0.87 | 0.87 | |
Slope angle (degree) | 0–5 | 0.55 | 0.09 | 0.37 | 0.91 |
5–10 | 0.45 | 0.21 | 0.34 | 0.79 | |
10–20 | 0.00 | 0.24 | 0.76 | 0.76 | |
20–30 | 0.00 | 0.23 | 0.77 | 0.77 | |
<30 | 0.00 | 0.23 | 0.77 | 0.77 | |
Drainage density | 0–0.46 | 0.10 | 0.29 | 0.60 | 0.71 |
0.46–1.15 | 0.14 | 0.28 | 0.57 | 0.72 | |
1.15–1.8 | 0.30 | 0.21 | 0.49 | 0.79 | |
1.8–3.69 | 0.45 | 0.22 | 0.33 | 0.78 | |
Plan curvature | <−0.01 | 0.00 | 0.49 | 0.51 | 0.51 |
−0.01–0.01 | 1.00 | 0.00 | 0.00 | 1.00 | |
<0.01 | 0.00 | 0.51 | 0.49 | 0.49 | |
LS factor | 0–2.05 | 0.88 | 0.02 | 0.10 | 0.98 |
2.05–5.91 | 0.12 | 0.25 | 0.63 | 0.75 | |
5.91–10.28 | 0.00 | 0.25 | 0.75 | 0.75 | |
10.28–15.68 | 0.00 | 0.24 | 0.76 | 0.76 | |
15.68–65.55 | 0.00 | 0.23 | 0.77 | 0.77 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ahmadpour, H.; Bazrafshan, O.; Rafiei-Sardooi, E.; Zamani, H.; Panagopoulos, T. Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection. Sustainability 2021, 13, 10110. https://doi.org/10.3390/su131810110
Ahmadpour H, Bazrafshan O, Rafiei-Sardooi E, Zamani H, Panagopoulos T. Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection. Sustainability. 2021; 13(18):10110. https://doi.org/10.3390/su131810110
Chicago/Turabian StyleAhmadpour, Hamed, Ommolbanin Bazrafshan, Elham Rafiei-Sardooi, Hossein Zamani, and Thomas Panagopoulos. 2021. "Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection" Sustainability 13, no. 18: 10110. https://doi.org/10.3390/su131810110
APA StyleAhmadpour, H., Bazrafshan, O., Rafiei-Sardooi, E., Zamani, H., & Panagopoulos, T. (2021). Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection. Sustainability, 13(18), 10110. https://doi.org/10.3390/su131810110