Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China)
Abstract
:1. Introduction
2. Materials and Methods
2.1. The Study Area
2.2. Datasets
2.2.1. Landslide Inventory
2.2.2. Conditioning Factors of Landslides
Topographic Factors
Geological Factors
Environmental Factors
Triggering Factors
Factors of Human Engineering Activities
2.3. Methodology
2.3.1. Preparation of the Training and Testing Datasets
2.3.2. Random Forest (RF)
2.3.3. Frequency Ratio (FR)
2.3.4. Evaluation of LSM Models
3. Results
3.1. LSM Acquired by RF Model in the Study Area
3.2. LSM Acquired by the FR Model in Study Area
3.3. Validation and Comparison
4. Discussion
4.1. The Comparison of the Two Models
4.2. Distribution Characteristics of New Landslide Events
4.3. Importance of Contributing Factors
5. Conclusions
- (1)
- A total of 987 historical landslides are identified with landslide susceptibility inventory, which contains the historical records, satellite images, and extensive field surveys, and 94.7% of the landslides are soil landslides, while 84.8% are induced by rainfall. Subsequently, 70% of the landslides were used as the training dataset and 30% as the testing dataset. Twenty-two factors in five categories, including elevation, slope, slope position, aspect, and lithology, were selected as the contributing factors of landslides in Yunyang County. By optimizing two important parameters of RF, with 10-fold-cross validation for the best sample on R software, a more efficient RF model can be built to evaluate landslide susceptibility. As a result, the LSM was produced with the two models.
- (2)
- In mapping evaluation, the RF model had 77.5% of historical landslides falling in the regions with high or very high susceptibility, accounting for about 12.8% of the total area. The regions with low or very low susceptibility to landslides accounted for 62.6% of the total area, while only 8.5% of landslides were in these areas. On the other hand, the FR model had 52.7% of the landslide falling in the high or very high susceptibility regions, accounting for of 26.7% of the total area. The regions with very low or low susceptibility accounted for 51.6% of the total area, while 24.0% of the landslides were in these areas. The AUC values under the ROC curve of the RF model and the FR model were 0.988 and 0.716, respectively. Similarly, accuracy, precision, and recall ratio of RF were higher than FR. Furthermore, in high and very low classes, RF performed better. In addition, the susceptibility mapping results of the two models both had a high spatial correlation with new landslides in 2017. The evaluation results above show that the RF model has higher accuracy, reliability, and stability. The RF model is more suitable for landslide susceptibility evaluation in Yunyang County than the FR model. The performance of models depends not only on algorithms, but also on the specific conditions of the study areas and the selection of impacting factors. Therefore, this study cannot conclude that the RF model is definitely the best. Compared with the FR model, the RF model has higher prediction accuracy. This finding is similar to the results of Sun et al. [73], who used RF to study Fengjie County (a neighbor of Yunyang County, with a similar geographic environment).
- (3)
- Finally, the importance-ranking results obtained from the impact factor importance analysis and AUC values of RF model with different reduced landslide influencing factors are in accordance with the basic laws of the geology and consistent with previous research findings. They can provide guidance for landslide management. The elevation, annual average rainfall, slope, lithology, POI kernel density, distance from roads, and distance from rivers were the main important landslide contributors in Yunyang County, while the contribution rate of faults was the smallest. In particular, as the highlight of this study, the POI kernel density proves useful in landslide susceptibility models. There are complex relationships between the factors, and the occurrence of landslides is inseparable from the combined effects of human and natural factors.
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Yin, K.L.; Zhu, L.F. Landslide Hazard Zonation and Application of GIS. Earth Sci. Front. 2001, 2, 279–284. [Google Scholar]
- Kalantar, B.; Pradhan, B.; Naghibi, S.A.; Motevalli, A.; Mansor, S. Assessment of the effects of training data selection on the landslide susceptibility mapping: A comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN). Geomat. Nat. Hazards Risk 2018, 9, 49–69. [Google Scholar] [CrossRef]
- Sang, K. Statistics and analysis of landslide disaster data in China in recent 60 years. Public Commun. Sci. Technol. 2013, 5, 124–129. [Google Scholar]
- Zhong, L.J. The Mountainous Region City Overall Plan Phase Guards against the Geological Disaster to Plan Initially Searches. Master’s Thesis, Chongqing University, Chongqing, China, 1 October 2006. [Google Scholar]
- Pu, P.; Wu, J. Geomatics & Spatial Information Technology. Geomat. Spat. Inf. Technol. 2016, 39, 130–133. [Google Scholar]
- Steger, S.; Brenning, A.; Bell, R.; Glade, T. The influence of systematically incomplete shallow landslide inventories on statistical susceptibility models and suggestions for improvements. Landslides 2017, 14, 1767–1781. [Google Scholar] [CrossRef] [Green Version]
- Neuland, H. A prediction model of landslips. Catena 1976, 3, 215–230. [Google Scholar] [CrossRef]
- Abella, E.A.C.; Van Westen, C.J. Qualitative landslide susceptibility assessment by multicriteria analysis; a case study from San Antoniodel Sur, Guantanamo, Cuba (in GIS technology and models for assessing landslide hazard and risk). Geomorphology 2008, 94, 435–466. [Google Scholar]
- Guimarūes, R.F.; Montgomery, D.R.; Greenberg, H.M. Parameterization of soil properties for a model of topographic controls on shallow landsliding: Application to Rio de Janeiro. Eng. Geol. 2003, 69, 99–108. [Google Scholar] [CrossRef]
- Wang, Q.; Li, W.; Yan, S. GIS based frequency ratio and index of entropy models to landslide susceptibility mapping (Daguan, China). Environ. Earth Sci. 2016, 75, 780. [Google Scholar] [CrossRef]
- Shahabi, H.; Khezri, S.; Ahmad, B.B.; Hashim, M. Landslide susceptibility mapping at central Zab basin, Iran: A comparison between analytical hierarchy process, frequency ratio and logistic regression models. CATENA 2014, 115, 55–70. [Google Scholar] [CrossRef]
- Pradhan, B.; Abokharima, M.H.; Jebur, M.N.; Tehrany, M.S. Land subsidence susceptibility mapping at Kinta Valley (Malaysia) using the evidential belief function model in GIS. Nat. Hazards 2014, 73, 1019–1042. [Google Scholar] [CrossRef]
- Yin, K.L.; Yan, T.Z. Landslide prediction and related models. Chin. J. Rock Mech. Eng. 1996, 1, 1–8. [Google Scholar]
- Bourenane, H.; Guettouche, M.S.; Bouhadad, Y. Landslide hazard mapping in the Constantine city, Northeast Algeria using frequency ratio, weighting factor, logistic regression, weights of evidence, and analytical hierarchy process methods. Arab. J. Geosci. 2016, 9, 154. [Google Scholar] [CrossRef]
- Choi, J.; Oh, H.J.; Lee, H.J.; Lee, C.; Lee, S. Combining landslide susceptibility maps obtained from frequency ratio, logistic regression, and artificial neural network models using ASTER images and GIS. Eng. Geol. 2012, 124, 12–23. [Google Scholar] [CrossRef]
- Yilmaz, I. Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat–Turkey). Comput. Geosci. 2009, 35, 1125–1138. [Google Scholar] [CrossRef]
- Li, Y.T.; Zhu, H.L.; Chen, S.H. Evaluation of landslide susceptibility in the upper reaches of the Yellow River by AHP. Sci. Surv. Mapp. 2016, 41, 67–70. [Google Scholar]
- Mao, Y.M.; Peng, Z.; Chen, Z.G.; Liu, D.L. Application of classification algorithm based on uncertain decision tree in landslide danger prediction. Appl. Res. Comput. 2014, 31, 3646–3650. [Google Scholar]
- Liu, K.Y. Application of Random Forest and Logistic Regression Model to Default Prediction. China Comput. Commun. 2016, 21, 111–112. [Google Scholar]
- Jiang, P.; Chen, J.J.; Gao, C. Application of random forest algorithm to landslide stability analysis. J. Hubei Polytech. Univ. 2018, 34, 29–35. [Google Scholar]
- Chen, X.W.; Liu, M. Prediction of protein-protein interactions using random decision forest framework. Bioinformatics 2005, 21, 4394–4400. [Google Scholar] [CrossRef]
- Pang, H.; Datta, D.; Zhao, H. Pathway analysis using random forests with bivariate node-split for survival outcomes. Bioinformatics 2010, 26, 250–258. [Google Scholar] [CrossRef] [PubMed]
- Ying, W.Y.; Li, X.; Xie, Y.Y.; Johnson, E. Preventing customer churn by using random forests modeling. IEEE Int. Conf. Inf. Reuse Integr. 2008, 429–434. [Google Scholar]
- Xie, Y.Y.; Li, X.; Ngai, E.W.T.; Ying, W.Y. Customer churn prediction using improved balanced random forests. Expert Syst. Appl. 2009, 36, 5445–5449. [Google Scholar] [CrossRef]
- Ward, M.M.; Pajevic, S.; Dreyfuss, J.; Malley, J.D. Short-term prediction of mortality in patients with systemic lupus erythematosus: Classify cation of outcomes using random forests. Arthritis Rheum. 2006, 55, 74–80. [Google Scholar] [CrossRef]
- Kim, S.H.; Lee, J.H.; Ko, B.; Nam, J.Y. X-ray image classification using random forests with local binary patterns. Int. Conf. Mach. Learn. Cybern. 2010, 3190–3194. [Google Scholar]
- Chen, W.T.; LI, X.J.; Wang, Y.X.; Chen, G.; Liu, S.W. Forested landslide detection using LiDAR data and the random forest algorithm: A case study of the Three Gorges, China. Remote Sens. Environ. 2014, 152, 291–301. [Google Scholar] [CrossRef]
- Stumpf, A.; Kerle, N. Obeject-oriented mapping of landslides using random forests. Remote Sens. Environ. 2011, 115, 2564–2577. [Google Scholar] [CrossRef]
- Li, T.; Tian, Y.; Wu, L.; Liu, L. Landslide disaster risk division based on random forest method. Geogr. Geo-Inf. Sci. 2014, 6, 25–30. [Google Scholar]
- Yu, K.Y.; Yao, X.; Qiu, Q.R.; Liu, J. Spatial prediction of landslides based on Stochastic Forest model. Trans. Chin. Soc. Agric. Mach. 2016, 47, 338–345. [Google Scholar]
- Lin, X.S.; Xu, J. The Study of the Complexity of Landslide Hazard. Res. Soil Water Conserv. 2007, 05, 359–363. [Google Scholar]
- Wang, Y.; Lin, Q.G.; Shi, P.J. Spatial pattern and influencing factors of landslide casualty events. J. Geogr. Sci. 2018, 28, 259–374. [Google Scholar] [CrossRef] [Green Version]
- Pradhan, B.; Lee, S. Landslide susceptibility assessment and factor effect analysis: Backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ. Model. Softw. 2010, 25, 747–759. [Google Scholar] [CrossRef]
- Hong, H.Y.; Chen, W.; Xu, C.; Youssef, A.M.; Pradhan, B.; Bui, D.T. Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy. Geocarto. Int. 2016, 32, 139–154. [Google Scholar] [CrossRef]
- Cao, C.; Chen, J.P.; Zheng, W.; Xu, P.H.; Zheng, L.J.; Zhu, C. Geospatial Analysis of Mass-Wasting Susceptibility of Four Small Catchments in Mountainous Area of Miyun County, Beijing. Int. J. Environ. Res. Public Health 2019, 16, 2801. [Google Scholar] [CrossRef] [Green Version]
- Wang, Y.M.; Wu, X.L.; Chen, Z.J.; Ren, F.; Feng, L.W.; Du, Q.Y. Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China. Int. J. Environ. Res. Public Health 2019, 16, 368. [Google Scholar] [CrossRef] [Green Version]
- Guo, Z.Z.; Yin, K.L.; Liu, Q.L.; Huang, F.M.; Gui, L.; Zhang, G.R. Rainfall warning of creeping landslide in Yunyang County of Three Gorges Reservoir Area based on displacement ratio model. Earth Sci. 2019, 45, 672–684. [Google Scholar]
- Kavzoglu, T.; Sahin, E.K.; Colkesen, I. An assessment of multivariate and bivariate approaches in landslide susceptibility mapping: A case study of Duzkoy district. Nat. Hazards 2015, 76, 471–496. [Google Scholar] [CrossRef]
- Guzzetti, F.; Mondini, A.C.; Cardinali, M.; Fiorucci, F.; Santangelo, M.; Chang, K.T. Landslide inventory maps: New tools for an old problem. Earth-Sci. Rev. 2012, 112, 42–66. [Google Scholar] [CrossRef] [Green Version]
- Conforti, M.; Muto, F.; Rago, V.; Critelli, S. Landslide inventory map of north-eastern Calabria (South Italy). J. Maps 2014, 10, 90–102. [Google Scholar] [CrossRef]
- Ercanoglu, M. Landslide susceptibility assessment of SE Bartin (West Black Sea region, Turkey) by artificial neural networks. Nat. Hazards Earth Syst. Sci. 2005, 5, 979–992. [Google Scholar] [CrossRef] [Green Version]
- Hungr, O.; Leroueil, S.; Picarelli, L. The Varnes Classification of Landslide Types, an Update. Landslides 2014, 11, 167–194. [Google Scholar] [CrossRef]
- Cruden, D.M.; Varnes, D.J. Landslide types and processes. In Landslides Investigation and Mitigation; Turner, A.K., Schuster, R.L., Eds.; Special Report 247; Transportation Research Board, US National Research Council: Washington, DC, USA, 1996; Volume 247, pp. 36–75. [Google Scholar]
- Liu, J.; Li, S.L.; Chen, T. Landslide Susceptibility Assessment Based on Optimized Random Forest Model. Geomat. Inf. Sci. Wuhan Univ. 2018, 43, 1085–1091. [Google Scholar]
- Umar, Z.; Pradhan, B.; Ahmad, A.; Jebur, M.N.; Tehrany, M.S. Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in West Sumatera Province, Indonesia. CATENA 2014, 118, 124–135. [Google Scholar] [CrossRef]
- Piacentini, D.; Devoto, S.; Mantovani, M.; Pasuto, A.; Prampolini, M.; Soldati, M. Landslide susceptibility modeling assisted by Persistent Scatterers Interferometry (PSI): An example from the northwestern coast of Malta. Nat. Hazards 2015, 78, 681–697. [Google Scholar] [CrossRef] [Green Version]
- Du, C.J.; Yi, Q.L.; Zhou, B.; Qin, S.L.; Zeng, H.N. Evaluation of landslide susceptibility in Yunyang County of Three Gorges Reservoir Area Based on GIS and weighted information. J. China Three Gorges Univ. (Nat. Sci.) 2017, 39, 48–53. [Google Scholar]
- Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
- Wen, H.J.; Hu, D.P.; Wang, G.L. Comparative study on LR and NN evaluation of susceptibility to earthquake landslides in Wenchuan County. China Civ. Eng. J. 2014, 47, 17–23. [Google Scholar]
- Segoni, S.; Lagomarsino, D.; Fanti, R.; Moretti, S.; Cassagli, N. Integration of rainfall thresholds and susceptibility maps in the Emilia Romagna (Italy) regional-scale landslide warning system. Landslides 2015, 12, 773–785. [Google Scholar] [CrossRef] [Green Version]
- Xie, P.; Wen, H.J.; Hu, D.P. Research on Susceptibility Mapping of Earthquake-induced Landslides Along Highway in Mountainous Region. China J. Highw. Transp. 2018, 31, 106–114. [Google Scholar]
- Wen, H.J.; Wang, G.L.; Huang, X.L.; Xue, J.Y.; Xie, P.; Zhang, Y.Y. A Preliminary Evaluation Method of Slope Stability Based on Topographic Map and Geological Map. Chinese Patent 2017105719823, 14 November 2017. [Google Scholar]
- Zêzere, J.L.; de Brum, F.A.; Rodrigues, M.L. The role of conditioning and triggering factors in the occurrence of landslides: A case study in the area north of Lisbon (Portugal). Geomorphology 1999, 30, 133–146. [Google Scholar] [CrossRef]
- Chen, W.; Pourghasemi, H.R.; Panahi, M.; Kornejady, A.; Wang, J.; Xie, X.S.; Cao, S.B. Spatial prediction of landslide susceptibility using an adaptive neuro-fuzzy inference system combined with frequency ratio, generalized additive model, and support vector machine techniques. Geomorphology 2017, 297, 69–85. [Google Scholar] [CrossRef]
- Du, S.; Du, S.; Liu, B.; Zhang, X.Y.; Zheng, Z.J. Large-scale urban functional zone mapping by integrating remote sensing images and open social data. Giscience Amp. Remote Sens. 2020, 57, 411–430. [Google Scholar] [CrossRef]
- Lu, C.; Pang, M.; Zhang, Y.; Li, H.; Lu, C.P.; Tang, X.L.; Cheng, W. Mapping Urban Spatial Structure Based on POI (Point of Interest) Data: A Case Study of the Central City of Lanzhou, China. Isprs Int. J. Geo-Inf. 2020, 9, 92. [Google Scholar]
- Xue, B.; Xiao, X.; Li, J.Z.; Xie, X. POI-Based Analysis on Retail’s Spatial Hot Blocks at a City Level: A Case Study of Shenyang, China. Econ. Geogr. 2018, 38, 36–43. [Google Scholar]
- Wen, H.J.; Li, Y.; Xue, J.Y.; Xie, P. Landslides susceptibility microzoning along highway in mountainous region based on mining the big data. J. Nat. Disasters 2018, 27, 159–165. [Google Scholar]
- Bourenane, H.; Bouhadad, Y.; Guettouche, M.S.; Braham, M. GIS-based landslide susceptibility zonation using bivariate statistical and expert approaches in the city of Constantine (Northeast Algeria). Bull. Eng. Geol. Environ. 2014, 74, 337–355. [Google Scholar] [CrossRef]
- Cong, W.Q.; Pan, M.; Li, T.F.; Wu, Z.X.; Lv, G.X. Key research on landslide and debris flow hazard zonation based on GIS. Earth Sci. Front. 2006, 13, 185–190. [Google Scholar]
- Das, I.; Stein, A.; Kerle, N.; Dadhwal, V.K. Landslide susceptibility mapping along road corridors in the Indian Himalayas using Bayesian logistic regression models. Geomorphology 2012, 179, 116–125. [Google Scholar] [CrossRef]
- Breiman, L. Bagging Predictors. Mach Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
- Ho, T.K. The random subspace method for constructing decision forests. IEEE Trans. Pattern Aanalysis Mach. Intell. 1998, 20, 832–844. [Google Scholar]
- Lu, W.; Zhou, Z.; Liu, T.; Liu, Y.H. Discrete element simulation analysis of rock slope stability based on UDEC. Adv. Mater. Res. 2012, 461, 384–388. [Google Scholar] [CrossRef]
- Hong, H.; Tsangaratos, P.; Ilia, I.; Liu, J.; Zhu, A.X.; Chen, W. Application of fuzzy weight of evidence and data mining techniques in construction of flood susceptibility map of Poyang County, China. Sci. Total Env. 2018, 625, 575–588. [Google Scholar] [CrossRef] [PubMed]
- Lee, S.; Pradhan, B. Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models. Landslides 2007, 4, 33–41. [Google Scholar] [CrossRef]
- Liu, Y.L.; Yin, K.L.; Liu, B. Application of logical regression and artificial neural network model in spatial prediction of landslide disaster. Hydrogeol. Eng. Geol. 2010, 37, 92–96. [Google Scholar]
- Dou, J.; Yunus, A.P.; Bui, D.T.; Merghadi, A.; Sahana, M.; Zhu, Z.F.; Chen, C.W.; Khosravi, K.; Yang, Y.; Pham, B.T. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 2019, 662, 332–346. [Google Scholar] [CrossRef] [PubMed]
- Chen, W.; Zhang, S.A.; Li, R.W.; Shahabi, H. Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naïve Bayes tree for landslide susceptibility modeling. Sci. Total Environ. 2018, 644, 1006–1018. [Google Scholar] [CrossRef] [PubMed]
- Trigila, A.; Iadanza, C.; Esposito, C.; Mugnozza, G.S. Comparison of Logistic Regression and Random Forests techniques for shallow landslide susceptibility assessment in Giampilieri (NE Sicily, Italy). Geomorphology 2015, 249, 119–136. [Google Scholar] [CrossRef]
- Hong, H.Y.; Pourghasemi, H.R.; Pourtaghi, Z.S. Landslide susceptibility assessment in Lianhua County (China): A comparison between a random forest data mining technique and bivariate and multivariate statistical models. Geomorphology 2016, 259, 105–118. [Google Scholar] [CrossRef]
- Guo, Z.Z.; Yin, K.L.; Huang, F.M.; Fu, S.; Zhang, W. Evaluation of landslide susceptibility based on landslide classification and weighted frequency ratio model. Chin. J. Rock Mech. Eng. 2019, 38, 287–300. [Google Scholar]
- Sun, D.L.; Wen, H.J.; Wang, D.Z.; Xu, J.H. A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm. Geomorphology. 2020, 362, 107201. [Google Scholar] [CrossRef]
- Sahin, E.K.; Colkesen, I.; Kavzoglu, T. A Comparative Assessment of Canonical Correlation Forest, Random Forest, Rotation Forest and Logistic Regression Methods for Landslide Susceptibility Mapping. Geocarto Int. 2018, 35, 341–363. [Google Scholar] [CrossRef]
- Feng, P.F. Research and Application of a Two-Stage Stepwise Variable Selection Algorithm Based on Random Forest. Master’s Thesis, Fujian Agriculture and Forestry University, Fujian, China, 1 April 2016. [Google Scholar]
- Chen, W.; Xie, X.S.; Wang, J.L.; Pradhan, B.; Hong, H.Y.; Bui, D.T.; Duan, Z.; Ma, J.Q. A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. CATENA 2017, 151, 147–160. [Google Scholar] [CrossRef] [Green Version]
- Pandey, V.K.; Sharma, K.K.; Pourghasemi, H.R.; Bandooni, S.K. Sedimentological characteristics and application of machine learning techniques for landslide susceptibility modelling along the highway corridor Nahan to Rajgarh (Himachal Pradesh), India. CATENA 2019, 182, 104150. [Google Scholar] [CrossRef]
- When, H.J.; Xie, P.; Hu, D.P. Rapid susceptibility mapping of earthquake-triggered slope geohazards in Lushan County by combining remote sensing with the AHP model developed for the Wenchuan earthquake. Bull. Eng. Geol. Environ. 2016, 73, 909–921. [Google Scholar] [CrossRef]
Factor | Type | Classification |
---|---|---|
Elevation/m | Continuous | (1) <340; (2) 340~543; (3) 543~690; (4) 690~832; (5) 832~951; (6) 951~1053; (7) 1053~1144; (8) 1144~1302; (9) 1302~1556; (10) 1556~1654; (11) >1654 |
Slope/° | Continuous | (1) <5; (2) 5~10; (3) 10~15; (4) 15~20; (5) 20~25; (6) 25~30; (7) 30~35; (8) 35~40; (9) >40 |
RDLS/m | Continuous | (1) <20; (2) 20~30; (3) 30~40; (4) 40~50; (5) 50~80; (6) 80~120; (7) >120 |
Aspect | Categorical | (1) Flat; (2) North; (3) Northeast; (4) East; (5) Southeast; (6) South; (7) Southwest; (8) West; (9) Northwest |
Slope position | Categorical | (1) Ridge; (2) Upper slope; (3) Middle slope; (4) Flats slope; (5) Lower slope; (6) Valley |
Micro-landform | Categorical | (1) Canyons, and Deeply incised streams; (2) Midslope drainages, and shallow valleys; (3) Upland drainages, and Headwaters; (4) U-shape valleys; (5) Plains; (6) Open slopes; (7) Upper slopes, and Plateau; (8) Local ridges hills in valleys; (9) Midslope ridges, and Small hills in plains; (10) Mountain tops, and High narrow ridges |
Curvature | Continuous | (1) <−1; (2) −1~−0.5; (3) −0.5~0; (4) 0~0.5; (5) 0.5~1; (6) >1 |
Profile Curvature | Continuous | (1) <−1; (2) −1~−0.5; (3) −0.5~0; (4) 0~0.5; (5) 0.5~1; (6) >1 |
Plan Curvature | Continuous | (1) <−1; (2) −1~−0.5; (3) −0.5~0; (4) 0~0.5; (5) 0.5~1; (6) >1 |
TRI | Continuous | (1) <1.05; (2) 1.05~1.1; (3) 1.1~1.15; (4) 1.15~1.2; (5) >1.2 |
TWI | Continuous | (1) <4; (2) 4~6; (3) 6~8; (4) 8~10; (5) >10 |
STI | Continuous | (1) <20; (2) 20~40; (3) 40~70; (4) 70~100; (5) 100~200; (6) >200 |
SPI | Continuous | (1) <15; (2) 15~30; (3) 30~45; (4) 45~60; (5) 60~100; (6) 100~1000; (7) >1000 |
Lithology | Categorical | (1) J3s, J3p, J3zj, J3D; (2) J2xs, J2s; (3) J2z, J1-2z, J1z, J2x, J2zs, J1b-j2Q; (4) T3xj, T3zj, T3z2; (5) T2b2, T2b; (6) T1d; (7) T1-2j1, T1-2j2, T1-2j3, T1j; (8) P2 1+w, P2 l-d, P2; (9) P1 m + g, P1 l + q, P1, C; (10) O |
Distance from fault/m | Continuous | (1) <500; (2) 50~1000; (3) 1000~1500; (4) 1500~2000; (5) 2000~2500; (6) 2500~3000; (7) >3000 |
CRDS | Categorical | (1) Dip-slope I; (2) Dip-slope II; (3) Outward slope; (4) Oblique slope; (5) Tangential slope; (6) Reverse slope; (7) Flat |
NDVI | Continuous | (1) 0~0.1; (2) 0.1~0.15; (3) 0.15~0.2; (4) 0.2~0.25; (5) >0.25 |
Distance from rivers/m | Continuous | (1) <100; (2) 100~200; (3) 200~300; (4) 300~400; (5) 400~500; (6) 500~600; (7) >600 |
Annual average rainfall/mm | Continuous | (1) <1221; (2) 122~1251; (3) 1251~1276; (4) 1276~1308; (5) 1308~1343; (6) 1343~1389; (7) 1389~1440; (8) >1440 |
Land cover | Categorical | (1) Meadow; (2) Farmland; (3) Water area; (4) Forest; (5) Garden plot; (6) Others/14531/0.0000; (7) Residential land; (8) Transportation |
Distance from roads/m | Continuous | (1) <100; (2) 100~200; (3) 200~300; (4) 300~400; (5) 400~500; (6) 500~600; (7) >600 |
POI kernel density | Continuous | (1) 0–1; (2) 1–2; (3) 2–3; (4) 3–4; (5) 4–5; (6) 5–10; (7) >10 |
Data Name | Data Sources | Type | Scale |
---|---|---|---|
Historical landslide | Chongqing Geological Monitoring Station | Dataset | |
Elevation | Aster satellite | Grid | 30 m |
Geological data | National Geological Data Center | Grid | 1:200,000 |
Land cover | Chongqing Municipal Bureau of Land and Resources | Vector | 1:100,000 |
Administrative division | Chongqing Municipal Bureau of Land and Resources | Vector | 1:100,000 |
River network | Chongqing Water Resources Bureau | Vector | 1:100,000 |
Satellite image | Geospatial Data Cloud platform | Grid | 30 m |
Annual rainfall | Chongqing Meteorological Administration | Dataset | 90 m |
Road | Chongqing Transportation Commission | Vector | 1:100,000 |
POI of Chongqing | Web Crawler | Dataset |
No | Metric | Equation | Definition |
---|---|---|---|
1 | Precision | The fraction of relevant instances in the retrieved instances. | |
2 | Sensitivity (SST) | The percentage of landslide cells that are correctly classified. | |
3 | Specificity (SPF) | The percentage of non-landslide cells that are correctly classified. | |
4 | Accuracy (ACC) | The proportion of landslide and non-landslide cells which are correctly classified. | |
5 | Recall | It indicates how many positive examples in the sample are predicted correctly. |
Subset | Accuracy | Subset | Accuracy | ||
---|---|---|---|---|---|
Training | Testing | Training | Testing | ||
1 | 1.000 | 0.900 | 6 | 1.000 | 0.902 |
2 | 1.000 | 0.904 | 7 | 1.000 | 0.906 |
3 | 1.000 | 0.909 | 8 | 1.000 | 0.918 |
4 | 1.000 | 0.915 | 9 | 1.000 | 0.885 |
5 | 1.000 | 0.916 | 10 | 1.000 | 0.916 |
Landslide Probability | Susceptibility Class | Grid Number | Area Proportion | Landslide | Landslide Proportion | Density Proportion (Pcs/km2) |
---|---|---|---|---|---|---|
<0.06 | Very low | 1,373,501 | 34.1% | 26 | 2.6% | 0.021 |
0.06–0.12 | Low | 1,147,495 | 28.5% | 58 | 5.9% | 0.056 |
0.12–0.21 | Medium | 991,347 | 24.6% | 138 | 14.0% | 0.155 |
0.21–0.31 | High | 396,368 | 9.8% | 147 | 14.9% | 0.412 |
>0.31 | Very high | 118,277 | 2.9% | 618 | 62.6% | 5.806 |
Factor | Type | Classification/Grid Number/FR |
---|---|---|
Elevation/m | Continuous | (1) <340/647744/1.8208; (2) 340~543/99121/1.1279; (3) 543~690/783868/1.0606; (4) 690~832/614750/0.7927; (5) 832~951/390895/0.5657; (6) 951~1053/235266/0.5048; (7) 1053~1144/142794/0.3728; (8) 1144~1302/140356/0.2042; (9) 1302~1556/75487/0.0543; (10) 1556~1654/13725/0.0000; (11) >1654/5892/0.0000 |
Slope/° | Continuous | (1) <5/231687/0.5303; (2) 5~10/562181/0.8013 ; (3) 10~15/747696/1.2652; (4) 15~20/743059/1.2952; (5) 20~25/638172/1.1743; (6) 25~30/481841/0.9264; (7) 30~35/317712/0.6316; (8) 35~40/180510/0.5672; (9) >40/139136/0.4415 |
RDLS/m | Continuous | (1) <20/1498140/1.0157; (2) 20~30/976617/1.3054; (3) 30~40/708930/1.0616; (4) 40~50/433899/0.6350; (5) 50~80/397593/0.5689; (6) 80~120/40815/0.2015; (7) >120/3177/0.0000 |
Aspect | Categorical | (1) Flat/832/0.0000; (2) North/559138/1.0693; (3) Northeast/418623/0.8413; (4) East/476389/0.9542; (5) Southeast/493071/1.0382; (6) South/599652/0.9629; (7) Southwest/493884/1.1028 ; (8) West/501655/1.0776; (9) Northwest/498750/0.9278 |
Slope position | Categorical | (1) Ridge/1487219/1.0188; (2) Upper slope/282489/1.0583; (3) Middle slope/74587/0.4941; (4) Flats slope/469978/1.0631; (5) Lower slope/217136/0.9619; (6) Valley/1510585/0.9814 |
Micro-landform | Categorical | (1) Canyons, and Deeply incised streams/1447542/1.0920; (2) Midslope drainages, and shallow valleys/93339/1.0530; (3) Upland drainages, and Headwaters/213973/0.6507; (4) U-shape valleys/269975/1.3197; (5) Plains/14080/0.0000; (6) Open slopes/67096/1.2207; (7) Upper slopes, and Plateau/242500/0.8444; (8) Local ridges hills in valleys/214246/0.9940; (9) Midslope ridges, and Small hills in plains/99198/1.8165; (10) Mountain tops, and High narrow ridges/1380045/0.8606 |
Curvature | Continuous | (1) <−1/739914/0.9354; (2) −1~−0.5/473674/0.9597; (3) −0.5~0/932464/1.2473; (4) 0~0.5/697900/0.9213; (5) 0.5~1/455266/0.9355; (6) >1/742776/0.8932 |
Profile Curvature | Continuous | (1) <−1/397552/0.8344; (2) −1~−0.5/466912/1.0087; (3) −0.5~0/1127645/0.9914; (4) 0~0.5/1146502/1.0787; (5) 0.5~1/495095/1.0836; (6) >1/408288/0.8526 |
Plan Curvature | Continuous | (1) <−1/199130/0.7198; (2) −1~−0.5/399120/0.9440; (3) −0.5~0/1453089/1.0963; (4) 0~0.5/1352014/1.0329; (5) 0.5~1/421528/0.8549; (6) >1/217113/0.7922 |
TRI | Continuous | (1) <1.05/1956974/1.0568; (2) 1.05~1.1/921054/1.1916; (3) 1.1~1.15/495536/0.9917; (4) 1.15~1.2/273264/0.6744; (5) >1.2/395166/0.5078 |
TWI | Continuous | (1) <4/522384/0.7134; (2) 4~6/2078046/1.0327; (3) 6~8/878720/1.0346; (4) 8~10/368110/1.1904; (5) >10/194734/0.9043 |
STI | Continuous | (1) <20/2917053/0.9841; (2) 20~40/522250/0.9410; (3) 40~70/282523/0.9857; (4) 70~100/119897/1.3321; (5) 100~200/119007/1.3076; (6) >200/81264/1.0583 |
SPI | Continuous | (1) <15/1742958/1.0315; (2) 15~30/660771/0.8987; (3) 30~45/276316/0.9485; (4) 45~60/166651/0.9092; (5) 60~100/256386/0.7986; (6) 100~1000/734492/1.0649; (7) >1000/204420/1.2220 |
Lithology | Categorical | (1) J3s, J3p, J3zj, J3D/1151138/1.2333; (2) J2xs, J2s/1168089/1.2294; (3) J2z, J1-2z, J1z, J2x, J2zs, J1b-j2Q/617821/0.8741; (4) T3xj, T3zj, T3z2/359056/0.3418; (5) T2b2, T2b/379171/0.9711; (6) T1d/87456/0.7485; (7) T1-2j1, T1-2j2, T1-2j3, T1j/211999/0.3281; (8) P2 1+w, P2 l-d, P2/53807/0.1521; (9) P1 m + g, P1 l + q, P1, C/1323/0.0000; (10) O/8310/0.4923 |
Distance from fault/m | Continuous | (1) <500/3370/0.0000; (2) 50~1000/517/1.5808; (3) 1000~1500/6879/1.7852; (4) 1500~2000/8551/1.4362; (5) 2000~2500/10401/0.7871;(6) 2500~3000/12126/1.0127; (7) >3000/3993800/0.9983 |
CRDS | Categorical | (1) Dip-slope I/65686/1.5565; (2) Dip-slope II/280703/1.0053; (3) Outward slope/345779/1.2419; (4) Oblique slope/1001302/1.0987; (5) Tangential slope/635257/1.1330; (6) Reverse slope/1476082/0.8672; (7) Flat/231650/0.5296 |
NDVI | Continuous | (1) 0~0.1/208363/0.6878; (2) 0.1~0.15/390391/0.7762; (3) 0.15~0.2/1282355/0.9196; (4) 0.2~0.25/1429333/1.0370; (5) >0.25/731025/1.2771 |
Distance from rivers/m | Continuous | (1) <100/244170/0.8718; (2) 100~200/203437/1.7104; (3) 200~300/214861/1.6194; (4) 300~400/186399/1.6471; (5) 400~500/192759/1.2317; (6) 500~600/182581/1.1210; (7) >600/2816099/0.8460 |
Annual average rainfall/mm | Continuous | (1) <1221/291710/1.6968; (2) 122~1251/521092/1.1775; (3) 1251~1276/626074/0.9735; (4) 1276~1308/854004/1.0346; (5) 1308~1343/832774/1.0463; (6) 1343~1389/663280/0.7401; (7) 1389~1440/199205/0.3491; (8) >1440/49295/0.0830 |
Land cover | Categorical | (1) Meadow/727009/0.8346; (2) Farmland/71553/1.4188; (3) Water area/3057711/0.7712; (4) Forest/9809/0.9130; (5) Garden plot/26616/0.8581; (6) Others/14531/0.0000; (7) Residential land/132682/0.9227; (8) Transportation/109/1.4084 |
Distance from roads/m | Continuous | (1) <100/625582/1.7864; (2) 100~200/424193/1.0905; (3) 200~300/402164/1.2113; (4) 300~400/313648/0.9266; (5) 400~500/298588/0.7129; (6) 500~600/258887/0.9171; (7) >600/1717244/0.7175 |
POI kernel density | Continuous | (1) 0–1/286078/0.5581; (2) 1–2/1291095/0.8624; (3) 2–3/1153628/1.0752; (4) 3–4/513828/1.2508; (5) 4–5/244087/0.8553; (6) 5–10/316584/1.1508; (7) >10/235006/1.3238; |
Landslide Probability | Susceptibility Class | Grid Number | Area Proportion | Landslide | Landslide Proportion | Density Proportion (Pcs/km2) |
---|---|---|---|---|---|---|
<21.10 | Very low | 986,559 | 24.5% | 56 | 5.7% | 0.063 |
21.10–22.19 | Low | 1,093,869 | 27.1% | 181 | 18.3% | 0.184 |
22.19–22.93 | Medium | 875,313 | 21.7% | 230 | 23.3% | 0.292 |
22.93–24.54 | High | 924,737 | 22.9% | 387 | 39.2% | 0.465 |
>24.54 | Very high | 152,601 | 3.8% | 133 | 13.5% | 0.968 |
RF | True Condition | Summation | ||
---|---|---|---|---|
Landslide | Non-Landslide | |||
PredictionCondition | Landslide | 907 | 9 | Precision: 0.990 |
Non-landslide | 80 | 9861 | Precision: 0.992 | |
Summation | Recall: 0.919 | Recall: 0.999 | Accuracy: 0.992 |
FR | True Condition | Summation | ||
---|---|---|---|---|
Landslide | Non-Landslide | |||
PredictionCondition | Landslide | 711 | 4114 | Precision: 0.147 |
Non-landslide | 276 | 5756 | Precision: 0.954 | |
Summation | Recall: 0.720 | Recall: 0.538 | Accuracy: 0.600 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Y.; Sun, D.; Wen, H.; Zhang, H.; Zhang, F. Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China). Int. J. Environ. Res. Public Health 2020, 17, 4206. https://doi.org/10.3390/ijerph17124206
Wang Y, Sun D, Wen H, Zhang H, Zhang F. Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China). International Journal of Environmental Research and Public Health. 2020; 17(12):4206. https://doi.org/10.3390/ijerph17124206
Chicago/Turabian StyleWang, Yue, Deliang Sun, Haijia Wen, Hong Zhang, and Fengtai Zhang. 2020. "Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China)" International Journal of Environmental Research and Public Health 17, no. 12: 4206. https://doi.org/10.3390/ijerph17124206
APA StyleWang, Y., Sun, D., Wen, H., Zhang, H., & Zhang, F. (2020). Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China). International Journal of Environmental Research and Public Health, 17(12), 4206. https://doi.org/10.3390/ijerph17124206