Next Article in Journal
Effect of Functionally-Graded Calcium Titanate Film, Prepared by Metal-Organic Chemical Vapor Deposition, on Titanium Implant
Next Article in Special Issue
Modeling of CO Emissions from Traffic Vehicles Using Artificial Neural Networks
Previous Article in Journal
Fabrication of Novel CeO2/GO/CNTs Ternary Nanocomposites with Enhanced Tribological Performance
Previous Article in Special Issue
A New Weighting Approach with Application to Ionospheric Delay Constraint for GPS/GALILEO Real-Time Precise Point Positioning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Landslide Susceptibility Modeling Using Integrated Ensemble Weights of Evidence with Logistic Regression and Random Forest Models

1
Key Laboratory of Degraded and Unused Land Consolidation Engineering, Ministry of Land and Resources of China, Xi’an 710075, China
2
Shaanxi Provincial Land Engineering Construction Group Co. Ltd., Xi’an 710075, China
3
College of Geology & Environment, Xi’an University of Science and Technology, Xi’an 710054, Shaanxi, China
*
Authors to whom correspondence should be addressed.
These authors contribute equally to this paper.
Appl. Sci. 2019, 9(1), 171; https://doi.org/10.3390/app9010171
Submission received: 19 November 2018 / Revised: 20 December 2018 / Accepted: 27 December 2018 / Published: 4 January 2019

Abstract

:
The main aim of this study was to compare the performances of the hybrid approaches of traditional bivariate weights of evidence (WoE) with multivariate logistic regression (WoE-LR) and machine learning-based random forest (WoE-RF) for landslide susceptibility mapping. The performance of the three landslide models was validated with receiver operating characteristic (ROC) curves and area under the curve (AUC). The results showed that the areas under the curve obtained using the WoE, WoE-LR, and WoE-RF methods were 0.720, 0.773, and 0.802 for the training dataset, and were 0.695, 0.763, and 0.782 for the validation dataset, respectively. The results demonstrate the superiority of hybrid models and that the resultant maps would be useful for land use planning in landslide-prone areas.

1. Introduction

Landslides are common geological hazards caused by multiple factors including landform [1,2], geological evolution [3], groundwater [4], land use type [5], precipitation [6,7], irrigation [8], earthquake [9], engineering construction [10], and climate change [11,12,13]. To avoid casualties caused by landslides and guarantee the stable development of mountainous areas, it is critical to determine a control and prevention scheme for landslides in a region. Generally, regional landslide susceptibility maps are beneficial to mitigate the effects of landslide hazards.
At present, various methods have been proposed and introduced into landslide susceptibility mapping. The existing modeling approaches can be put into two categories: qualitative approaches and quantitative approaches [14,15]. In recent years, conventional qualitative approaches have been gradually abandoned by many researchers due to the risk that expert opinion can make the results stray from objective reality [16]. Compared with qualitative approaches, quantitative approaches are mainly based on the hidden information of objective data instead of subjective experience. Additionally, quantitative approaches mainly include traditional mathematical statistic methods, deterministic models, and some state-of-the-art machine learning algorithms.
For traditional statistical methods, the probability-frequency ratio (FR) [17,18], weight of evidence (WoE) [19,20], statistical index (SI) [21,22], index of entropy (IoE) [23,24], certainty factors (CF) [25,26,27], evidential belief function (EBF) [28,29,30], and logistic regression (LR) [31,32] models have been extensively adopted in landslide susceptibility mapping. However, one limitation for all traditional statistical methods is that some hypotheses exist [33]. In deterministic models, the detail characteristics of slopes are necessary to construct the calculation model [34]. Although deterministic models conform to basic physical laws of landslide, these models are not very suitable for regional landslide susceptibility assessments due to the complex process of modeling and computing [34].
In the past decade, with the rise of machine learning and data mining, a number of relevant algorithms have been developed for landslide susceptibility zonation [35,36,37,38,39]. For instance, the logistic regression model (LRM), artificial neural network (ANN), support vector machine (SVM), and decision tree (DT) were the top four machine learning algorithms in landslide susceptibility mapping during the period of 2005–2016 [16]. It is clear that machine learning algorithms improve the prediction accuracy of regional landslide occurrence, but the generalization performance of single classifiers still needs to be promoted [40]. In this way, a series of ensemble approaches have recently become more and more popular in geo-hazard susceptibility mapping [37,41,42,43].
In terms of ensemble approaches, several single classifiers have been combined using ensemble frameworks including random subspace [44], random forest [45,46], Bagging [47], AdaBoost [48], MultiBoost [49], and so on [37,50,51,52]. Currently, some novel ensemble techniques have been proposed and applied in landslide susceptibility assessment, flood susceptibility mapping, and groundwater potential analysis [41,53,54]. Additionally, the excellent performance of ensemble algorithms on predictive ability and generalization capacity has also been proven. For example, Kadavi et al. [55] compared four ensemble-based machine learning models (AdaBoost, LogitBoost, Multiclass Classifier, and Bagging) with the traditional frequency ratio model (FRM) in the task of landslide susceptibility mapping. Furthermore, the results demonstrated that all of the AUC values of the four ensemble-based machine learning models were higher than that of FRM. In addition, many scholars preferred to construct ensemble learning models by integrating machine learning algorithms with bivariate statistical models because some of the hypotheses of the conventional models can be weakened through hybrid models [56]. Meanwhile, part of the merits of bivariate statistical models and machine learning models can remain by integrating together. Weights of evidence models, as a classic bivariate statistical approach, can calculate the weights of various categories of a conditioning factor based on sturdy mathematical theories [57]. Furthermore, the weights of evidence models can be integrated with other machine learning approaches to reveal the hidden correlations between different conditioning factors and landslide occurrence. Therefore, in the present study, based on GIS tools, the integrated ensemble weights of evidence with logistic regression and random forest models were employed to map landslide susceptibility, and the results were compared and analyzed quantitatively by receiver operating characteristic curves (ROC) and area under the curve (AUC).

2. Study Area

The study area was located in Shaanxi Province, China (Figure 1) where the average annual temperature is 14.2 °C, the average annual rainfall is 909.8 mm, and the evaporation is 1537.1 mm. Topographically, the study area is part of the Qinba Mountain. The general trend is high in the south and low in the north. Elevation ranges from 442 m to 2410 m above sea level, with an average elevation of 1171 m. Slope angles in the study area are in the range of 0 to 70°. Most of the slope angles are in classes of 10–20° (29.27%), followed by 20–30° (26.29%), 0–10° (23.64%), and 30–40° (14.99%). Only 5.81% of slope angles are higher than 40°.
Geologically, the study area is located at the northern margin of the Yangtze plate. There are five major faults crossing the area including (1) the Gangchang fault (SW–NE direction), (2) the Xiaolengba–Qinjiaba fault (NW–SE direction), (3) the Xiaoba–Haitang fault (SW–NE direction), (4) the Moujiaba–Shuimohe fault (W–E direction), and (5) the Jiangjiawan–Zhujiaba–Tuqiangping fault (SW–NE direction) (Figure 2).

3. Materials and Methods

3.1. Data Preparation

A landslide inventory includes the locations of the past and recent landslides [21]. A landslide inventory can give insight into landslide location, dates, type, frequency of occurrence, state of activity, magnitude or size, failure mechanisms, causal factors, and damage caused [58,59]. In the present study, the landslide inventory map was prepared on the basis of satellite images (Google Earth and ZY03 images) and historical landslide records of the area, which were verified by GPS. A total of 202 landslides were identified to prepare the landslide susceptibility map, of which most of the landslides were slides (190), the others included 12 rock falls [60]. According to an analysis in the GIS environment, the smallest landslide was nearly 160 m3, the largest landslide was more than 1,000,000 m3, while the average was 33,000 m3. Finally, 141 landslides were randomly selected as training data and rest of them were used for the verification of the landslide susceptibility map (Figure 1).
There are no universal guidelines for selecting landslide conditioning factors [33,61]. A total of 16 landslide conditioning factors were used for landslide susceptibility mapping including slope angle, slope aspect, elevation, plan curvature, profile curvature, topographic wetness index (TWI), stream power index (SPI), sediment transport index (STI), distance to rivers, distance to roads, distance to faults, soil, land use, normalized difference vegetation index (NDVI), lithology, and rainfall, which are considered as controlling factors in the occurrence of landslides in the study area.
Slope angle is an important factor that affects the stress state of slope mass, and these positions where stress exceeds failure strength may contribute to landslide hazards [62,63]. In this case, as shown in Figure 3a, the thematic data layer of the slope angle was reclassified into seven categories with an interval of 10°, namely, (0–10°), (10–20°), (20–30°), (30–40°), (40–50°), (50–60°), and (60–72.83°).
Slope aspect is another common conditioning factor for the task of landslide susceptibility mapping [64,65]. It has been proven that most landslides usually occur at a certain slope aspect for a given study area, but the mechanism has not been revealed clearly [66]. Therefore, slope aspect was also employed as a conditioning factor. Here, slope aspect categories include flat, north, northeast, east, southeast, south, southwest, west, and northwest (Figure 3b).
Generally, it is considered that elevation has a firm relationship with landslide occurrence [67]. There is no denying that elevation can influence the topography, vegetation, temperature, humidity, human activities, and many other conditions that have a connection with slope stability [30,68]. In Figure 3c, the elevation of the study area was divided into ten classes with an interval of 200 m, i.e., (442–600 m), (600–800 m), (800–1000 m), (1000–1200 m), (1200–1400 m), (1400–1600 m), (1600–1800 m), (1800–2000 m), (2000–2200 m), and (2200–2410 m).
Plan curvature and profile curvature are two quantitative indices that embody topographic characteristics and trend from different perspectives [69]. Various curvature values indicate different runoff and erosion conditions of water. For instance, the upwardly convex surfaces have positive curvature values while negative curvature values mean upwardly concave surfaces [30]. In this study, the plan curvature and profile curvature values were both reclassified into three groups (Figure 3d,e).
TWI was proposed to indicate the local groundwater potential by Moore [70] in 1991. Currently, TWI is regarded as an extensively-used causative factor in landslide susceptibility assessment [71]. It is expressed as TWI = ln ( α tan β ) , where β is the slope angle (radian), and α is the flow accumulation through a point [72]. The TWI values of the study area can be calculated by GIS software and reclassified as (<4), (4–5), (5–6), (6–7), and (>7) with an interval of 1 (Figure 3f).
SPI can directly measure the erosion capacity of the stream. A higher SPI value indicates that the stream has more powerful erosion on the slope surface [55]. The SPI values are mainly determined as SPI = α tan β [54,70]. In this study, the SPI values were identified as five categories with an interval of 20, namely, (<20), (20–40), (40–60), (60–80), and (>80) (Figure 3g).
As another topographic index, STI has also been considered to construct the landslide susceptibility model [73]. Similar to SPI, STI can quantitatively reflect the regional topographic features and erosion conditions [74]. For the present study, STI values contained five categories with an interval of 10: (<10), (10–20), (20–30), (30–40), (>40) (Figure 3h).
Rivers can not only affect the moisture distribution in slopes, but can also erode the toes of slopes, which cause slope deformation and failure [75]. Thus, it is necessary to consider the river effects when producing landslide susceptibility maps. In this study, based on the distance to rivers, five buffer zones with an interval of 200 m were generated for each river: (<200 m), (200–400 m), (400–600 m), (600–800 m), and (>800 m) (Figure 3i).
Generally speaking, road construction in mountainous areas, which always produce an engineering load and destroy the integrity of slope structure, have significant negative impacts on the slope stability [76]. Hence, the distance to roads is usually selected as a conditioning factor to embody the influence of road engineering activities on landslide occurrence [77]. Here, values of the distance to roads were divided into five groups with an interval of 300 m, i.e., (<300 m), (300–600 m), (600–900 m), (900–1200 m), and (>1200 m) (Figure 3j).
Fault structures affect the spatial distribution and characteristics of landslides in a certain region [50]. According to relevant studies [30,78], the integrity of rock and soil mass generally decrease as the distance to the faults shorten. In this way, landslide hazards are more likely to occur in the neighboring area of faults. Ultimately, buffers of various faults in the study area were obtained and reclassified into five categories with an interval of 1000 m: (<1000 m), (1000–2000 m), (2000–3000 m), (3000–4000 m), and (>4000 m) (Figure 3k).
In terms of soil, this is an essential factor that has a strong correlation with landslide occurrence [79]. To a great extent, the strength, root cohesion, permeability, and vegetation coverage of the soil mass depend on the soil type [80,81], which can impact the failure characteristics of slopes [82,83]. In this study area, a total of nine soil types were identified including cumulic anthrosol, dystric cambisol, eutric cambisol, calcaric fluvisol, haplic luvisol, chromic luvisol, eutric planosol, calcaric regosol, and eutric regosol (Figure 3l).
Land use is one of the most frequently used conditioning factors, and the correlation between landslides and land use has been confirmed [84]. For instance, in some farmland regions, landslides are frequent and common under long-term irrigation [85]. For the study area, the types of land use mainly consist of farmland, forestland, grassland, water, residential areas, and bareland (Figure 3m).
NDVI is a very popular index to measure the degree of vegetation in a region. NDVI values can be figured out by the formula NDVI = (I − R)/(IR + R), where IR is the infrared band and R is the red band of the electromagnetic spectrum [86]. The range of NDVI values is from −1 to 1, and a positive value means that the local ground is covered by vegetation. Five categories of NDVI values were generated based on the natural break method [87], namely (–0.21–0.21), (0.21–0.36), (0.36–0.44), (0.44–0.52), and (0.52–0.65) (Figure 3n).
Like soil, lithology is one of the most important factors that directly determines slope stability. According to many existing studies, the physical and mechanical properties of rock mass usually change dramatically with lithological units [88]. Therefore, most landslides occur in the sliding-prone lithological units that have lower strength and a higher moisture content. For this study area, the strata were mainly reclassified into twelve lithological units based on the lithofacies and geological ages, and the specific distribution of various lithologies was illustrated in Figure 3o.
Rainfall is a crucial triggering factor that causes massive landslides by means of raising the groundwater level and increasing pore water pressure [89]. It can be observed that the probability of landslide occurrence indeed grows under the actions of long-term or heavy rainfall. Based on the meteorological data of the study area, the corresponding rainfall map with an interval of 100 mm/yr was produced, i.e., (<900 mm/yr), (900–1000 mm/yr), (1000–1100 mm/yr), (1100–1200 mm/yr), (1200–1300 mm/yr), (1300–1400 mm/yr), (1400–1500 mm/yr), (1500–1600 mm/yr), (1600–1700 mm/yr), and (>1700 mm/yr) (Figure 3p).

3.2. Weight of Evidence

Weight of evidence (WoE) is one of the most popular models that uses the Bayesian theory of conditional probability to quantify spatial associations between evidence layers and known mineral occurrences [90]. In the WoE method, conditional independence is the most important issue that should be considered. The WoE is based on the calculation of positive weight W + and negative weight W as follows:
W + = ln p { B | A } p { B | A ¯ }
W = ln p { B ¯ | A } p { B ¯ | A ¯ }
where B is the presence predictive factor; B ¯ is the absence of the predictive factor; A is the presence of landslide; and A ¯ is the absence of landslide. In landslide susceptibility prediction, the weight contrast W f = W + W was used to measure and reflect the spatial association between the landslide conditioning factors and landslide occurrence [91].

3.3. Logistic Regression

Logistic regression (LR) is one type of regression analysis where categorical outcomes can be predicted based on a certain predictor [92]. By using the logistic functions, probabilities of the possible outcomes can be modeled [93].
The logistic regression model is useful for two-class classification. Assuming there are n samples of the pairs, ( x i , y i ) , i = 1 , 2 , , n , y i { 1 , + 1 } is a binary class label for each sample i = 1 , 2 , , n and weights ( w , b ) . In the logistic regression for binary classification, the occurrence probability of the class is modeled with the below function:
P ( y = ± 1 | x , w ) = 1 1 + exp ( y ( w T x + b ) )
where b is the intercept; T is the matrix transposition; and the k-dimensional coefficient vector, w = ( w 1 , w 2 , w k ) T are parameters to be estimated.

3.4. Random Forest

Random forests (RF) are an ensemble of separately trained binary decision trees [94]. In the random forest algorithm, a random vector i k is naturally produced, independent from the previous random vectors and distributed to all trees, and each tree is grown using the training dataset and random vector i k , and outcomes are in the collection of tree-structured classifiers h ( x , i k ) , k = 1 , 2 , n at input vector x. In this study, i k is the landslide conditioning factors. The random forest consisted of two trees, namely, landslide and non-landslide, each constructed while considering sixteen random features.
Generally, in a random forest algorithm, the generalization error is described as below [95]:
G E = P x , y ( m g ( x , y ) < 0 )
where x and y are the landslide conditioning factors indicating the probability over the x , y space, and m g is the margin function, which is defined as below:
m g ( x , y ) = a v k I ( h k ( x ) = y ) max j y a v k I ( h k ( x ) = j )
Which measures the extent to which the average number of votes at random vectors for the right output exceeds the average vote for any other output. The I ( * ) is the indicator function [96].

4. Results

4.1. Correlation Analysis

The correlation between the conditioning factors and probability of landslides occurrence was measured by the weight contrast W f , and the calculation results of the WoE model are listed in Table 1. The LR method was employed to produce the landslide susceptibility map, and one of the most critical applicable conditions of LR is that the landslide conditioning factors are mutually independent [97]. Therefore, it is necessary to diagnose the multicollinearity of various conditioning factors when evaluating landslide susceptibility [98]. Currently, the tolerance (TOL) ( TOL = 1 R 2 , and R is the coefficient of determination of the regression equation) and variance inflation factor (VIF) ( VIF = 1 / TOL ) have been applied in multicollinearity diagnosis [99,100,101].
Generally, a TOL value less than 0.1 or a VIF value larger than 10 is regarded as a symbol of multicollinearity [61]. In this study, the results of the WoE model were used as inputs to calculate the TOL and VIF values of all of the conditioning factors. In accordance with the calculated results, there was no multicollinearity among the landslide conditioning factors (Table 2).

4.2. Application of the WoE Model

In terms of slope angle, the slope angle between 10°–20° (0.669) is more prone to landslide occurrence. Additionally, the W f values of the region where slope angles larger than 50° are 0. For the slope aspect factor, W f was the highest for south-facing (0.769). Furthermore, southeast-facing (0.310) and east-facing (0.001) also had a positive correlation with landslide occurrence. In the case of elevation, most landslides were distributed in the classes of 442–600 m (0.485), 600–800 m (0.968), and 800–1000 m (0.717). When the elevation was larger than 1200 m, elevation had an inhibitory effect on landslides. In the case of plan curvature, flat areas had a more important impact on landslides, whereas the W f values of convex areas and concave areas were 0.106 and −0.203, respectively. In the case of profile curvature, the W f values of the concave class, flat class, and convex class ere 0.042, 0.751, and −0.271, respectively. For TWI, the highest W f value was observed for the interval of 5–6 (0.497) while the class <4 (−0.945) had the lowest value. For SPI, the class <20 (0.277) had the highest W f value, and the areas of SPI 20–40 and >60 were negative for landslides. For STI, the class <10 had the highest W f value of 0.361, while the class 20–30 had the lowest value of −0.853. In the case of distance to rivers, the classes of <200 m (0.168) and 600–800 m (0.133) occupied higher W f values when compared to the other classes. In the case of distance to roads, the class of <300 m (0.904) had a more intimate correlation with landslide occurrence. In the case of distance to faults, it can be seen that the class >4000 m had the highest W f value of 0.246. For soil, eutric cambisol (1.177) and eutric planosol (1.516) were more likely to induce landslides due to the dramatic falling of soil strength under saturated conditions [85]. For land use, farmland (1.351) had the highest probability of landslide occurrence, which may be essentially caused by irrigation. According to the W f values of NDVI, the class of 0.36–0.44 (1.121) mainly contributed to landslide occurrence, while the lowest value was for the class of 0.52–0.65 (−2.212), which indicates that high vegetation coverage can restrain landslides. In the case of lithology, the W f values of group 1 (Q) (0.588), group 11 (Ar) (0.594), and group 12 (Pt, Pz) (0.417) were larger than 0, indicating that these lithological groups had the highest susceptibility to landslide. In the case of rainfall, the range between 1100–1200 mm/yr (1.030) showed high susceptibility for landslide occurrence.
The calculated W f values for all landslide conditioning factors were summed using the following equation to construct the landslide susceptibility map (LSM):
L S M W o E = S l o p e   a n g l e W f + S l o p e   a s p e c t W f + E l e v a t i o n W f + P l a n   c u r v a t u r e W f + P r o f i l e   c u r v a t u r e W f + T W I W f + S P I W f + S T I W f + D i s t a n c e   t o   r i v e r s W f + D i s t a n c e   t o   r o a d s W f + D i s t a n c e   t o   f a u l t s W f + N D V I W f + S o i l W f + L a n d u s e W f + L i t h o l o g y W f + R a i n f a l l W f
The integrated result of the WoE model is shown in Figure 4. The LSM was reclassified into five classes based on the natural break method: very low, low, moderate, high, and very high.

4.3. Application of the WoE-LR Model

In this case, SPSS 18.0 software was applied to build a landslide susceptibility model with the WoE-LR model. The input table of the LR model can be generated by the determined class values of variables based on the WoE model [54]. In the analysis process, a forward stepwise LR was adopted, and the analysis results are given in Table 3 and Table 4. The Cox and Snell R Square (0.245) and Nagelkerke R Square (0.326) are two pseudo determined coefficients that are used to reflect the degree of independent variables explaining dependent variables [102,103]. According to Table 4, the LR equation and landslide occurrence probability P can be expressed as Equations (7) and (8), respectively.
y = 1.122 × Slope   angle + 2.157 × Slope   aspect + 0.986 × Elevation + 2.505 × Plan   curvature + 0.868 × P r o f i l e   c u r v a t u r e + 1.764 × TWI + 1.427 × SPI + 1.142 × STI + 0.512 × D i s t a n c e   t o   r i v e r s + 1.445 × D i s t a n c e   t o   r o a d s + 0.972 × D i s t a n c e   t o   f a u l t s + 0.859 × N D V I + 1.392 × S o i l + 1.634 × L a n d u s e + 1.032 × L i t h o l o g y + 1.594 × R a i n f a l l + 0.806
P = e y 1 + e y
Ultimately, the landslide susceptibility index (LSI) for the LR model were obtained based on Equation (8), moreover, the LSI values were reclassified into five categories by the natural break method: very low, low, moderate, high, and very high (Figure 5).

4.4. Application of the WoE-RF Model

Similarly, the calculated results of the WoE model can be used as input for the RF model. In this study, training of the RF model was implemented by WEKA software. During the analyzing process, the importance of various conditioning factors can be measured quantitatively and ordered by MDA (mean decrease accuracy) and MDG (mean decrease Gini). Generally, MDA is determined during the Out-Of-Bag error calculation phase, while MDG is a measure of how each variable contributes to the homogeneity of the nodes and leaves [104]. The values of the above-mentioned two metrics of the conditioning factors are illustrated in Figure 6, and a larger value of MDA or MDG means a higher importance of the corresponding variable. Accordingly, in terms of MDA, land use is the most critical factor in the RF model, while soil is second in importance only to land use. For the MDG, the importance of elevation was first, followed by rainfall and land use. Finally, based on ArcGIS software, the landslide susceptibility map using the WoE-RF was generated and is shown in Figure 7.

4.5. Validation of Landslide Models

Currently, the ROC and AUC have been widely applied to validate the performance of determined landslide susceptibility models [64,69,105]. The ROC curve can be generated by plotting the false positive rate (100-specificity) in the x-axis versus the sensitivity in the y-axis [71]. The area under the ROC curve (AUC) is an indicator of the global summary measure of the performance of a model [106,107,108]. In the present study, to assess the validation of the WoE, WoE-LR, and WoE-RF models, the ROC curves of three models with training and validation datasets are described in Figure 8 and Figure 9.
In the case of the training dataset, the WoE-RF model had the best performance with the highest AUC value of 0.802, while the AUC values of the WoE model and WoE-LR model were 0.720 and 0.773, respectively. Meanwhile, the WoE-RF model had the lowest standard error (0.0275) and a 95% confidence interval of 0.729–0.829. Thus, the WoE-RF and WoE-LR models can improve the accuracy of the traditional WoE model in this study, and the WoE-RF model showed a relatively better performance.
In the case of the validation data, it can be seen that the AUC values of the various models decreased slightly when compared with the training dataset. The AUC values were 0.695, 0.763, and 0.782 for the WoE model, WoE-LR model, and WoE-RF model, respectively. Similarly, the lowest standard error was 0.0430 for the WoE-RF model, followed by the WoE-LR model (0.0440), and the WoE model (0.0484). The detailed results demonstrated that the WoE-RF model had a prominent prediction capacity on landslide susceptibility mapping.

5. Discussions

Under the action of environmental factors and human activities, the frequency of landslide occurrence has been increasing in recent decades, which may result in catastrophic losses on lives, resources, and property [109,110]. Currently, numerous approaches have been used in landslide susceptibility mapping such as FR [23], WoE [19], IoE [111], machine learning [64,112], and ensemble learning models [53,54]. In the above-mentioned models, the probabilistic meaning and calculation procedure of the WoE model are relatively concise and specific, which makes the WoE a classical and widely used method in landslide susceptibility mapping. Nevertheless, due to the uncertainties and fuzziness in the data of the conditioning factors [113], for different datasets, the performance of the WoE models were significantly distinguished [114,115,116]. In the present study, the integrated ensemble WoE with LR and RF models were proposed and applied for landslide susceptibility modeling in order to improve the accuracy and generalization ability of the traditional WoE model.
Landslide inventory map is a preliminary step toward landslide susceptibility, hazard and risk assessment [59]. Generally, there are two classes of Landslide inventories: landslide-event inventories that are associated with a trigger and historical landslide inventories [59,117]. In the present study, we adopted the latter formation, which was the sum of many landslide events over a long time. However, the evidence of many smaller landslides might has been lost due to various degrees of modification by subsequent landslides, erosional processes, vegetation growth and anthropic influences [59]. Therefore, application of multi-temporal high-resolution satellites images for interpretation of smaller landslides may be an effective supplement to the current landslide inventory and efficient for improving the accuracies of landslide susceptibility maps.
According to the existing literature and multicollinearity analysis, sixteen conditioning factors were selected: slope angle, slope aspect, elevation, plan curvature, profile curvature, TWI, STI, SPI, distance to rivers, distance to roads, distance to faults, NDVI, soil, land use, lithology, and rainfall. Furthermore, based on the W f values, the relationships between landslide occurrence and these factors were analyzed. It was demonstrated that all factors had nonlinear relationships with landslides. In addition, the RF model was employed to measure the importance of factors with two indices, the MDA and MDG. In terms of MDA, it could be observed that the most critical factor was land use, followed by soil and elevation. Slope angle had the lowest impact on landslide occurrence. However, for MDG, the importance of elevation, rainfall, and land use ranked first, second, and third, respectively, while the lowest MDG value was for profile curvature.
There are some classification techniques for a landslide susceptibility map in GIS software, such as manual, defined interval, natural break, equal interval, quantile, standard deviation, geometrical interval, and landslide percentage [118]. Generally, user-defined classification is more difficult for the reader to interpret and justify. Therefore, current automatic classification systems should be used instead of a user-defined classification [118]. Besides, when landslide susceptibility indexes have positive or negative skewness, the best classification methods are quantile or natural break [119]. In the present study, natural break method, which is the most commonly used models [120,121], is the most suitable method for modelling landslide susceptibility according to the histogram of data distribution.
In this paper, a comparison study of the WoE, WoE-LR, and WoE-RF models was implemented. LR is a widely used model for classification, particularly for binary classification problems [122]. Thus, we integrated the WoE with the LR model to acquire a better classifier. The WoE-RF model is a combination of the weight of evidence and random forest approach. It has been proven that RF is one of the most popular classification algorithms and can improve the performance of single classifiers [96,123]. Moreover, RF can decrease the dependence of the WoE model on independence among the conditioning factors. Accordingly, the results showed that both the LR model (AUC = 0.773 for training data; AUC = 0.763 for validation data) and RF model (AUC = 0.802 for training data; AUC = 0.782 for validation data) can increase the performance of the traditional WoE model (AUC = 0.720 for training data; AUC = 0.695 for validation data), and the WoE-RF model produced the best results.
Comparing the overall classification results of the three models, the results confirmed that the RF model had a better performance on improving the generalization ability of a weak classifier and raising the corresponding prediction accuracy. Therefore, the landslide susceptibility maps generated by the WoE-RF and WoE-LR models contain reference meaning for the study area to a certain extent. Furthermore, the procedure of factor selection and ensemble model construction is of some value to similar studies.

6. Conclusions

The results are indicative of the quality of the maps drawn by the hybrid approaches of traditional bivariate weights of evidence (WoE) with multivariate logistic regression (WoE-LR) and machine learning-based random forest (WoE-RF). In general, the following conclusions can be drawn:
(1) Geomorphological factors, geological factors, geo-environmental factors, and anthropogenic factors were used for the development of the landslide model. The preliminary selection of these 16 conditioning factors was based on the multicollinearity diagnosis. The TOL and VIF values of all the conditioning factors indicated no multicollinearity.
(2) According to the results of the WoE model, most occurred at slopes of 10–20° with the south aspect, elevations of 600–800 m, distance to rivers of <200 m, distance to roads of <300 m, and a farmland land cover category.
(3) WoE-RF possessed relatively good accuracy when compared to the WoE-LR and WoE models. By using the ROC curve, the AUC values of the training dataset produced by these three methods were 0.802, 0.773, and 0.720, respectively. For the validation dataset, the AUC values were 0.782, 0.763, and 0.695, respectively. It can be concluded that the proposed hybrid models are promising approaches for the spatial prediction of landslides and can also be applied in other landslide-prone areas.

Author Contributions

W.C. and Z.S. collected the field data and conducted the landslide mapping and analysis. W.C. and Z.S. wrote the manuscript. J.H. provided critical comments in planning this paper and edited the manuscript. All authors discussed the results and edited the manuscript.

Acknowledgments

This study was supported by the Opening Fund of Key Laboratory of Degraded and Unused Land Consolidation Engineering, the Ministry of Land and Resources (Grant No. SXDJ2018-04), the National Natural Science Foundation of China (Grant No. 41807192), the China Postdoctoral Science Foundation (Grant No. 2018T111084, 2017M613168), and the Project funded by the Shaanxi Province Postdoctoral Science Foundation (Grant No. 2017BSHYDZZ07).

Conflicts of Interest

No potential conflict of interest was reported by the authors.

References

  1. Kim, M.S.; Onda, Y.; Kim, J.K.; Kim, S.W. Effect of topography and soil parameterisation representing soil thicknesses on shallow landslide modelling. Quat. Int. 2015, 384, 91–106. [Google Scholar] [CrossRef]
  2. Liucci, L.; Melelli, L.; Suteanu, C.; Ponziani, F. The role of topography in the scaling distribution of landslide areas: A cellular automata modeling approach. Geomorphology 2017, 290, 236–249. [Google Scholar] [CrossRef]
  3. Agostini, A.; Tofani, V.; Nolesini, T.; Gigli, G.; Tanteri, L.; Rosi, A.; Cardellini, S.; Casagli, N. A new appraisal of the ancona landslide based on geotechnical investigations and stability modelling. Q. J. Eng. Geol. Hydrogeol. 2014, 47, 29–43. [Google Scholar] [CrossRef]
  4. Peng, D.; Xu, Q.; Liu, F.; He, Y.; Zhang, S.; Qi, X.; Zhao, K.; Zhang, X. Distribution and failure modes of the landslides in heitai terrace, china. Eng. Geol. 2018, 236, 97–110. [Google Scholar] [CrossRef]
  5. Persichillo, M.G.; Bordoni, M.; Meisina, C. The role of land use changes in the distribution of shallow landslides. Sci. Total Environ. 2017, 574, 924–937. [Google Scholar] [CrossRef] [PubMed]
  6. Chang, J.-M.; Chen, H.; Jou, B.J.-D.; Tsou, N.-C.; Lin, G.-W. Characteristics of rainfall intensity, duration, and kinetic energy for landslide triggering in taiwan. Eng. Geol. 2017, 231, 81–87. [Google Scholar] [CrossRef]
  7. Segoni, S.; Rosi, A.; Lagomarsino, D.; Fanti, R.; Casagli, N. Brief communication: Using averaged soil moisture estimates to improve the performances of a regional-scale landslide early warning system. Nat. Hazards Earth Syst. Sci. 2018, 18, 807–812. [Google Scholar] [CrossRef] [Green Version]
  8. Hou, X.; Vanapalli, S.K.; Li, T. Water infiltration characteristics in loess associated with irrigation activities and its influence on the slope stability in heifangtai loess highland, china. Eng. Geol. 2018, 234, 27–37. [Google Scholar] [CrossRef]
  9. Wang, T.; Wu, S.R.; Shi, J.S.; Xin, P.; Wu, L.Z. Assessment of the effects of historical strong earthquakes on large-scale landslide groupings in the wei river midstream. Eng. Geol. 2018, 235, 11–19. [Google Scholar] [CrossRef]
  10. Mohammadi, S.; Taiebat, H. Finite element simulation of an excavation-triggered landslide using large deformation theory. Eng. Geol. 2016, 205, 62–72. [Google Scholar] [CrossRef]
  11. Alvioli, M.; Melillo, M.; Guzzetti, F.; Rossi, M.; Palazzi, E.; von Hardenberg, J.; Brunetti, M.T.; Peruccacci, S. Implications of climate change on landslide hazard in central italy. Sci. Total Environ. 2018, 630, 1528–1543. [Google Scholar] [CrossRef] [PubMed]
  12. Peres, D.J.; Cancelliere, A. Modeling impacts of climate change on return period of landslide triggering. J. Hydrol. 2018, 567, 420–434. [Google Scholar] [CrossRef]
  13. Gariano, S.L.; Guzzetti, F. Landslides in a changing climate. Earth-Sci. Rev. 2016, 162, 227–252. [Google Scholar] [CrossRef] [Green Version]
  14. Fell, R.; Corominas, J.; Bonnard, C.; Cascini, L.; Leroi, E.; Savage, W.Z. Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng. Geol. 2008, 102, 85–98. [Google Scholar] [CrossRef] [Green Version]
  15. Corominas, J.; van Westen, C.; Frattini, P.; Cascini, L.; Malet, J.P.; Fotopoulou, S.; Catani, F.; Van Den Eeckhaut, M.; Mavrouli, O.; Agliardi, F.; et al. Recommendations for the quantitative analysis of landslide risk. Bull. Eng. Geol. Environ. 2014, 73, 209–263. [Google Scholar] [CrossRef]
  16. Pourghasemi, H.R.; Teimoori Yansari, Z.; Panagos, P.; Pradhan, B. Analysis and evaluation of landslide susceptibility: A review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016). Arab. J. Geosci. 2018, 11, 193. [Google Scholar] [CrossRef]
  17. Lee, S.; Dan, N.T. Probabilistic landslide susceptibility mapping in the lai chau province of vietnam: Focus on the relationship between tectonic fractures and landslides. Environ. Geol. 2005, 48, 778–787. [Google Scholar] [CrossRef]
  18. Chen, W.; Pourghasemi, H.R.; Panahi, M.; Kornejady, A.; Wang, J.; Xie, X.; Cao, S. Spatial prediction of landslide susceptibility using an adaptive neuro-fuzzy inference system combined with frequency ratio, generalized additive model, and support vector machine techniques. Geomorphology 2017, 297, 69–85. [Google Scholar] [CrossRef]
  19. Xu, C.; Xu, X.; Lee, Y.H.; Tan, X.; Yu, G.; Dai, F. The 2010 yushu earthquake triggered landslide hazard mapping using gis and weight of evidence modeling. Environ. Earth Sci. 2012, 66, 1603–1616. [Google Scholar] [CrossRef]
  20. Xie, Z.; Chen, G.; Meng, X.; Zhang, Y.; Qiao, L.; Tan, L. A comparative study of landslide susceptibility mapping using weight of evidence, logistic regression and support vector machine and evaluated by sbas-insar monitoring: Zhouqu to wudu segment in bailong river basin, china. Environ. Earth Sci. 2017, 76, 313. [Google Scholar] [CrossRef]
  21. Mandal, S.; Mandal, K. Bivariate statistical index for landslide susceptibility mapping in the rorachu river basin of eastern sikkim himalaya, india. Spat. Inf. Res. 2018, 26, 59–75. [Google Scholar] [CrossRef]
  22. Regmi, A.D.; Devkota, K.C.; Yoshida, K.; Pradhan, B.; Pourghasemi, H.R.; Kumamoto, T.; Akgun, A. Application of frequency ratio, statistical index, and weights-of-evidence models and their comparison in landslide susceptibility mapping in central nepal himalaya. Arab. J. Geosci. 2014, 7, 725–742. [Google Scholar] [CrossRef]
  23. Jaafari, A.; Najafi, A.; Pourghasemi, H.R.; Rezaeian, J.; Sattarian, A. Gis-based frequency ratio and index of entropy models for landslide susceptibility assessment in the caspian forest, northern iran. Int. J. Environ. Sci. Technol. 2014, 11, 909–926. [Google Scholar] [CrossRef]
  24. Tien Bui, D.; Shahabi, H.; Shirzadi, A.; Chapi, K.; Alizadeh, M.; Chen, W.; Mohammadi, A.; Ahmad, B.; Panahi, M.; Hong, H.; et al. Landslide detection and susceptibility mapping by airsar data using support vector machine and index of entropy models in cameron highlands, malaysia. Remote Sens. 2018, 10, 1527. [Google Scholar] [CrossRef]
  25. Hong, H.; Chen, W.; Xu, C.; Youssef, A.M.; Pradhan, B.; Tien Bui, D. Rainfall-induced landslide susceptibility assessment at the chongren area (china) using frequency ratio, certainty factor, and index of entropy. Geocarto Int. 2017, 32, 139–154. [Google Scholar] [CrossRef]
  26. Chen, W.; Li, W.; Chai, H.; Hou, E.; Li, X.; Ding, X. Gis-based landslide susceptibility mapping using analytical hierarchy process (ahp) and certainty factor (cf) models for the baozhong region of baoji city, china. Environ. Earth Sci. 2016, 75, 1–14. [Google Scholar] [CrossRef]
  27. Dou, J.; Oguchi, T.; Hayakawa, Y.S.; Uchiyama, S.; Saito, H.; Paudel, U. Gis-based landslide susceptibility mapping using a certainty factor model and its validation in the chuetsu area, central japan. In Landslide Science for a Aafer Geoenvironment; Springer: New York, NY, USA, 2014; pp. 419–424. [Google Scholar]
  28. Chen, W.; Shahabi, H.; Shirzadi, A.; Hong, H.; Akgun, A.; Tian, Y.; Liu, J.; Zhu, A.-X.; Li, S. Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling. Bull. Eng. Geol. Environ. 2018, 1–23. [Google Scholar] [CrossRef]
  29. Pradhan, A.M.S.; Kim, Y.-T. Spatial data analysis and application of evidential belief functions to shallow landslide susceptibility mapping at mt. Umyeon, seoul, korea. Bull. Eng. Geol. Environ. 2017, 76, 1263–1279. [Google Scholar] [CrossRef]
  30. Ding, Q.; Chen, W.; Hong, H. Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int. 2017, 32, 619–639. [Google Scholar] [CrossRef]
  31. Zhang, T.; Han, L.; Chen, W.; Shahabi, H. Hybrid integration approach of entropy with logistic regression and support vector machine for landslide susceptibility modeling. Entropy 2018, 20, 884. [Google Scholar] [CrossRef]
  32. Mandal, S.; Mandal, K. Modeling and mapping landslide susceptibility zones using gis based multivariate binary logistic regression (lr) model in the rorachu river basin of eastern sikkim himalaya, india. Modeling Earth Syst. Environ. 2018, 4, 69–88. [Google Scholar] [CrossRef]
  33. Chen, W.; Zhang, S.; Li, R.; Shahabi, H. Performance evaluation of the gis-based data mining techniques of best-first decision tree, random forest, and naïve bayes tree for landslide susceptibility modeling. Sci. Total Environ. 2018, 644, 1006–1018. [Google Scholar] [CrossRef]
  34. Youssef, A.M.; Pourghasemi, H.R.; Pourtaghi, Z.S.; Al-Katheeri, M.M. Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at wadi tayyah basin, asir region, saudi arabia. Landslides 2016, 13, 839–856. [Google Scholar] [CrossRef]
  35. Zhou, C.; Yin, K.; Cao, Y.; Ahmed, B.; Li, Y.; Catani, F.; Pourghasemi, H.R. Landslide susceptibility modeling applying machine learning methods: A case study from longju in the three gorges reservoir area, china. Comput. Geosci. 2018, 112, 23–37. [Google Scholar] [CrossRef]
  36. Pham, B.T.; Jaafari, A.; Prakash, I.; Bui, D.T. A novel hybrid intelligent model of support vector machines and the multiboost ensemble for landslide susceptibility modeling. Bull. Eng. Geol. Environ. 2018, 1–22. [Google Scholar] [CrossRef]
  37. Hong, H.; Liu, J.; Bui, D.T.; Pradhan, B.; Acharya, T.D.; Pham, B.T.; Zhu, A.X.; Chen, W.; Ahmad, B.B. Landslide susceptibility mapping using j48 decision tree with adaboost, bagging and rotation forest ensembles in the guangchang area (china). Catena 2018, 163, 399–413. [Google Scholar] [CrossRef]
  38. Chen, W.; Shahabi, H.; Shirzadi, A.; Li, T.; Guo, C.; Hong, H.; Li, W.; Pan, D.; Hui, J.; Ma, M.; et al. A novel ensemble approach of bivariate statistical-based logistic model tree classifier for landslide susceptibility assessment. Geocarto Int. 2018, 33, 1398–1420. [Google Scholar] [CrossRef]
  39. Chen, W.; Shahabi, H.; Zhang, S.; Khosravi, K.; Shirzadi, A.; Chapi, K.; Pham, B.T.; Zhang, T.; Zhang, L.; Chai, H.; et al. Landslide susceptibility modeling based on gis and novel bagging-based kernel logistic regression. Appl. Sci. 2018, 8, 2540. [Google Scholar] [CrossRef]
  40. Truong, X.; Mitamura, M.; Kono, Y.; Raghavan, V.; Yonezawa, G.; Truong, X.; Do, T.; Tien Bui, D.; Lee, S. Enhancing prediction performance of landslide susceptibility model using hybrid machine learning approach of bagging ensemble and logistic model tree. Appl. Sci. 2018, 8. [Google Scholar] [CrossRef]
  41. Pham, B.T.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at himalayan area (india) using gis. CATENA 2017, 149, 52–63. [Google Scholar] [CrossRef]
  42. Pham, B.T.; Shirzadi, A.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation forest for landslide susceptibility modeling: A case study in the himalayan area, india. Int. J. Sediment. Res. 2017. [Google Scholar] [CrossRef]
  43. Chen, W.; Pourghasemi, H.R.; Kornejady, A.; Zhang, N. Landslide spatial modeling: Introducing new ensembles of ann, maxent, and svm machine learning techniques. Geoderma 2017, 305, 314–327. [Google Scholar] [CrossRef]
  44. Pham, B.T.; Prakash, I.; Tien Bui, D. Spatial prediction of landslides using a hybrid machine learning approach based on random subspace and classification and regression trees. Geomorphology 2018, 303, 256–270. [Google Scholar] [CrossRef]
  45. Zabihi, M.; Pourghasemi, H.R.; Pourtaghi, Z.S.; Behzadfar, M. Gis-based multivariate adaptive regression spline and random forest models for groundwater potential mapping in iran. Environ. Earth Sci. 2016, 75, 665. [Google Scholar] [CrossRef]
  46. Lagomarsino, D.; Tofani, V.; Segoni, S.; Catani, F.; Casagli, N. A tool for classification and regression using random forest methodology: Applications to landslide susceptibility mapping and soil thickness modeling. Environ. Modeling Assess. 2017, 22, 201–214. [Google Scholar] [CrossRef]
  47. Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
  48. Tien Bui, D.; Ho, T.-C.; Pradhan, B.; Pham, B.-T.; Nhu, V.-H.; Revhaug, I. Gis-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with adaboost, bagging, and multiboost ensemble frameworks. Environ. Earth Sci. 2016, 75, 1101. [Google Scholar] [CrossRef]
  49. Xia, C.-k.; Su, C.-l.; Cao, J.-t.; Li, P. Multiboost with enn-based ensemble fault diagnosis method and its application in complicated chemical process. J. Cent. South. Univ. 2016, 23, 1183–1197. [Google Scholar] [CrossRef]
  50. Pham, B.T.; Shirzadi, A.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation forest for landslide susceptibility modeling: A case study in the himalayan area, india. Int. J. Sediment. Res. 2018, 33, 157–170. [Google Scholar] [CrossRef]
  51. Fanos, A.M.; Pradhan, B.; Mansor, S.; Yusoff, Z.M.; Abdullah, A.F.b. A hybrid model using machine learning methods and gis for potential rockfall source identification from airborne laser scanning data. Landslides 2018, 15, 1833–1850. [Google Scholar] [CrossRef]
  52. Chen, W.; Panahi, M.; Tsangaratos, P.; Shahabi, H.; Ilia, I.; Panahi, S.; Li, S.; Jaafari, A.; Ahmad, B.B. Applying population-based evolutionary algorithms and a neuro-fuzzy system for modeling landslide susceptibility. CATENA 2019, 172, 212–231. [Google Scholar] [CrossRef]
  53. Naghibi, S.A.; Moghaddam, D.D.; Kalantar, B.; Pradhan, B.; Kisi, O. A comparative assessment of gis-based data mining models and a novel ensemble model in groundwater well potential mapping. J. Hydrol. 2017, 548, 471–483. [Google Scholar] [CrossRef]
  54. Chen, W.; Li, H.; Hou, E.; Wang, S.; Wang, G.; Panahi, M.; Li, T.; Peng, T.; Guo, C.; Niu, C.; et al. Gis-based groundwater potential analysis using novel ensemble weights-of-evidence with logistic regression and functional tree models. Sci. Total Environ. 2018, 634, 853–867. [Google Scholar] [CrossRef] [PubMed]
  55. Kadavi, P.; Lee, C.-W.; Lee, S. Application of ensemble-based machine learning models to landslide susceptibility mapping. Remote Sens. 2018, 10, 1252. [Google Scholar] [CrossRef]
  56. Chen, W.; Xie, X.; Peng, J.; Shahabi, H.; Hong, H.; Bui, D.T.; Duan, Z.; Li, S.; Zhu, A.X. Gis-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. CATENA 2018, 164, 135–149. [Google Scholar] [CrossRef]
  57. Vakhshoori, V.; Pourghasemi, H.R. A novel hybrid bivariate statistical method entitled froc for landslide susceptibility assessment. Environ. Earth Sci. 2018, 77, 686. [Google Scholar] [CrossRef]
  58. Fell, R.; Glastonbury, J.; Hunter, G. Rapid landslides: The importance of understanding mechanisms and rupture surface mechanics. Q. J. Eng. Geol. Hydrogeol. 2007, 40, 9–27. [Google Scholar] [CrossRef]
  59. Rosi, A.; Tofani, V.; Tanteri, L.; Stefanelli, C.T.; Agostini, A.; Catani, F.; Casagli, N. The new landslide inventory of tuscany (italy) updated with ps-insar: Geomorphological features and landslide distribution. Landslides 2018, 15, 5–19. [Google Scholar] [CrossRef]
  60. Hungr, O.; Leroueil, S.; Picarelli, L. The varnes classification of landslide types, an update. Landslides 2014, 11, 167–194. [Google Scholar] [CrossRef]
  61. Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  62. Dai, F.C.; Lee, C.F.; Li, J.; Xu, Z.W. Assessment of landslide susceptibility on the natural terrain of lantau island, hong kong. Environ. Geol. 2001, 40, 381–391. [Google Scholar]
  63. Nefeslioglu, H.A.; Duman, T.Y.; Durmaz, S. Landslide susceptibility mapping for a part of tectonic kelkit valley (eastern black sea region of turkey). Geomorphology 2008, 94, 401–418. [Google Scholar] [CrossRef]
  64. Pham, B.T.; Khosravi, K.; Prakash, I. Application and comparison of decision tree-based machine learning methods in landside susceptibility assessment at pauri garhwal area, uttarakhand, india. Environ. Process. 2017, 4, 711–730. [Google Scholar] [CrossRef]
  65. Wu, Y.; Li, W.; Liu, P.; Bai, H.; Wang, Q.; He, J.; Liu, Y.; Sun, S. Application of analytic hierarchy process model for landslide susceptibility mapping in the gangu county, gansu province, china. Environ. Earth Sci. 2016, 75, 422. [Google Scholar] [CrossRef]
  66. Saadatkhah, N.; Kassim, A.; Lee, L.M. Susceptibility assessment of shallow landslides in hulu kelang area, kuala lumpur, malaysia using analytical hierarchy process and frequency ratio. Geotech. Geol. Eng. 2015, 33, 43–57. [Google Scholar] [CrossRef]
  67. Aditian, A.; Kubota, T.; Shinohara, Y. Comparison of gis-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of ambon, indonesia. Geomorphology 2018, 318, 101–111. [Google Scholar] [CrossRef]
  68. Riaz Muhammad, T.; Basharat, M.; Hameed, N.; Shafique, M.; Luo, J. A data-driven approach to landslide-susceptibility mapping in mountainous terrain: Case study from the northwest himalayas, pakistan. Nat. Hazards Rev. 2018, 19, 05018007. [Google Scholar] [CrossRef]
  69. Chen, W.; Li, W.; Hou, E.; Bai, H.; Chai, H.; Wang, D.; Cui, X.; Wang, Q. Application of frequency ratio, statistical index, and index of entropy models and their comparison in landslide susceptibility mapping for the baozhong region of baoji, china. Arab. J. Geosci. 2015, 8, 1829–1841. [Google Scholar] [CrossRef]
  70. Moore, I.D.; Grayson, R.B.; Ladson, A.R. Digital terrain modelling: A review of hydrological, geomorphological, and biological applications. Hydrol. Process. 1991, 5, 3–30. [Google Scholar] [CrossRef]
  71. Chen, W.; Peng, J.; Hong, H.; Shahabi, H.; Pradhan, B.; Liu, J.; Zhu, A.-X.; Pei, X.; Duan, Z. Landslide susceptibility modelling using gis-based machine learning techniques for chongren county, jiangxi province, china. Sci. Total Environ. 2018, 626, 1121–1135. [Google Scholar] [CrossRef]
  72. Beven, K.J.; Kirkby, M.J. A physically based, variable contributing area model of basin hydrology/un modèle à base physique de zone d’appel variable de l’hydrologie du bassin versant. Int. Assoc. Sci. Hydrol. Bull. 1979, 24, 43–69. [Google Scholar] [CrossRef]
  73. Ge, Y.; Chen, H.; Zhao, B.; Tang, H.; Lin, Z.; Xie, Z.; Lv, L.; Zhong, P. A comparison of five methods in landslide susceptibility assessment: A case study from the 330-kv transmission line in gansu region, china. Environ. Earth Sci. 2018, 77, 662. [Google Scholar] [CrossRef]
  74. Wu, Z.; Wu, Y.; Yang, Y.; Chen, F.; Zhang, N.; Ke, Y.; Li, W. A comparative study on the landslide susceptibility mapping using logistic regression and statistical index models. Arab. J. Geosci. 2017, 10, 187. [Google Scholar] [CrossRef]
  75. Yalcin, A. Gis-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in ardesen (turkey): Comparisons of results and confirmations. CATENA 2008, 72, 1–12. [Google Scholar] [CrossRef]
  76. Akgun, A. A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: A case study at İzmir, turkey. Landslides 2012, 9, 93–106. [Google Scholar] [CrossRef]
  77. Chen, W.; Yan, X.; Zhao, Z.; Hong, H.; Bui, D.T.; Pradhan, B. Spatial prediction of landslide susceptibility using data mining-based kernel logistic regression, naive bayes and rbfnetwork models for the long county area (china). Bull. Eng. Geol. Environ. 2018, 1–20. [Google Scholar] [CrossRef]
  78. Xu, C.; Dai, F.; Xu, X.; Lee, Y.H. Gis-based support vector machine modeling of earthquake-triggered landslide susceptibility in the jianjiang river watershed, China. Geomorphology 2012, 145–146, 70–80. [Google Scholar] [CrossRef]
  79. Ohlmacher, G.C.; Davis, J.C. Using multiple logistic regression and gis technology to predict landslide hazard in northeast kansas, USA. Eng. Geol. 2003, 69, 331–343. [Google Scholar] [CrossRef]
  80. Al-Abadi, A.M.; Shahid, S. A comparison between index of entropy and catastrophe theory methods for mapping groundwater potential in an arid region. Environ. Monit. Assess. 2015, 187, 576. [Google Scholar] [CrossRef]
  81. Salvatici, T.; Tofani, V.; Rossi, G.; D’Ambrosio, M.; Tacconi Stefanelli, C.; Masi, E.B.; Rosi, A.; Pazzi, V.; Vannocci, P.; Petrolo, M.; et al. Application of a physically based model to forecast shallow landslides at a regional scale. Nat. Hazards Earth Syst. Sci. 2018, 18, 1919–1935. [Google Scholar] [CrossRef]
  82. Peng, J.; Tong, X.; Wang, S.; Ma, P. Three-dimensional geological structures and sliding factors and modes of loess landslides. Environ. Earth Sci. 2018, 77, 675. [Google Scholar] [CrossRef]
  83. Santo, A.; Di Crescenzo, G.; Forte, G.; Papa, R.; Pirone, M.; Urciuoli, G. Flow-type landslides in pyroclastic soils on flysch bedrock in southern italy: The bosco de’ preti case study. Landslides 2018, 15, 63–82. [Google Scholar] [CrossRef]
  84. Glade, T. Landslide occurrence as a response to land use change: A review of evidence from new zealand. CATENA 2003, 51, 297–314. [Google Scholar] [CrossRef]
  85. Cui, S.-H.; Pei, X.-J.; Wu, H.-Y.; Huang, R.-Q. Centrifuge model test of an irrigation-induced loess landslide in the heifangtai loess platform, northwest china. J. Mt. Sci. 2018, 15, 130–143. [Google Scholar] [CrossRef]
  86. Justice, C.O.; Townshend, J.R.G.; Holben, B.N.; Tucker, C.J. Analysis of the phenology of global vegetation using meteorological satellite data. Int. J. Remote Sens. 1985, 6, 1271–1318. [Google Scholar] [CrossRef] [Green Version]
  87. Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Tien Bui, D.; Duan, Z.; Ma, J. A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. CATENA 2017, 151, 147–160. [Google Scholar] [CrossRef]
  88. Peruccacci, S.; Brunetti, M.T.; Luciani, S.; Vennari, C.; Guzzetti, F. Lithological and seasonal control on rainfall thresholds for the possible initiation of landslides in central italy. Geomorphology 2012, 139–140, 79–90. [Google Scholar] [CrossRef]
  89. Tsukamoto, Y.; Ohta, T. Runoff process on a steep forested slope. J. Hydrol. 1988, 102, 165–178. [Google Scholar] [CrossRef]
  90. Agterberg, F.P. Systematic approach to dealing with uncertainty of geoscience information in mineral exploration. APCO 1989, 89, 165–178. [Google Scholar]
  91. Dahal, R.K.; Hasegawa, S.; Nonomura, A.; Yamanaka, M.; Masuda, T.; Nishino, K. Gis-based weights-of-evidence modelling of rainfall-induced landslides in small catchments for landslide susceptibility mapping. Environ. Geol. 2008, 54, 311–324. [Google Scholar] [CrossRef]
  92. Lachenbruch, P.A.; Mccullagh, P.; Nelder, J.A. Generalized linear models. Biometrics 1990, 46, 291–303. [Google Scholar] [CrossRef]
  93. Agarwal, S.; Kachroo, P.; Regentova, E. A hybrid model using logistic regression and wavelet transformation to detect traffic incidents. IATSS Res. 2016, 40, 56–63. [Google Scholar] [CrossRef] [Green Version]
  94. Ravì, D.; Bober, M.; Farinella, G.M.; Guarnera, M.; Battiato, S. Semantic segmentation of images exploiting dct based features and random forest. Pattern Recognit. 2016, 52, 260–273. [Google Scholar] [CrossRef]
  95. Masetic, Z.; Subasi, A. Congestive heart failure detection using random forest classifier. Comput. Methods Programs Biomed. 2016, 130, 54–64. [Google Scholar] [CrossRef] [PubMed]
  96. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  97. Ozdemir, A. Landslide susceptibility mapping using bayesian approach in the sultan mountains (akşehir, turkey). Nat. Hazards 2011, 59, 1573–1607. [Google Scholar] [CrossRef]
  98. Chen, W.; Xie, X.; Peng, J.; Wang, J.; Duan, Z.; Hong, H. Gis-based landslide susceptibility modelling: A comparative assessment of kernel logistic regression, naïve-bayes tree, and alternating decision tree models. Geomat. Nat. Hazards Risk 2017, 8, 950–973. [Google Scholar] [CrossRef]
  99. Lin, G.-F.; Chang, M.-J.; Huang, Y.-C.; Ho, J.-Y. Assessment of susceptibility to rainfall-induced landslides using improved self-organizing linear output map, support vector machine, and logistic regression. Eng. Geol. 2017, 224, 62–74. [Google Scholar] [CrossRef]
  100. Lee, J.-H.; Sameen, M.I.; Pradhan, B.; Park, H.-J. Modeling landslide susceptibility in data-scarce environments using optimized data mining and statistical methods. Geomorphology 2018, 303, 284–298. [Google Scholar] [CrossRef]
  101. Yu, H.; Jiang, S.; Land, K.C. Multicollinearity in hierarchical linear models. Soc. Sci. Res. 2015, 53, 118–136. [Google Scholar] [CrossRef]
  102. Ozdemir, A.; Altural, T. A comparative study of frequency ratio, weights of evidence and logistic regression methods for landslide susceptibility mapping: Sultan mountains, sw turkey. J. Asian Earth Sci. 2013, 64, 180–197. [Google Scholar] [CrossRef]
  103. Ozdemir, A. Gis-based groundwater spring potential mapping in the sultan mountains (konya, turkey) using frequency ratio, weights of evidence and logistic regression methods and their comparison. J. Hydrol. 2011, 411, 290–308. [Google Scholar] [CrossRef]
  104. Hong, H.; Tsangaratos, P.; Ilia, I.; Chen, W.; Xu, C. Comparing the Performance of a Logistic Regression and a Random Forest Model in Landslide Susceptibility Assessments. the Case of Wuyaun Area, China. In Workshop on World Landslide Forum; Mikos, M., Tiwari, B., Yin, Y., Eds.; Springer: Cham, Switzerland, 2017; pp. 1043–1050. [Google Scholar]
  105. Pham, B.T.; Pradhan, B.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of uttarakhand area (india). Environ. Model. Softw. 2016, 84, 240–250. [Google Scholar] [CrossRef]
  106. Tien Bui, D.; Tuan, T.A.; Hoang, N.-D.; Thanh, N.Q.; Nguyen, D.B.; Van Liem, N.; Pradhan, B. Spatial prediction of rainfall-induced landslides for the lao cai area (vietnam) using a hybrid intelligent approach of least squares support vector machines inference model and artificial bee colony optimization. Landslides 2017, 14, 447–458. [Google Scholar] [CrossRef]
  107. Chen, W.; Pourghasemi, H.R.; Naghibi, S.A. Prioritization of landslide conditioning factors and its spatial modeling in shangnan county, china using gis-based data mining algorithms. Bull. Eng. Geol. Environ. 2018, 77, 611–629. [Google Scholar] [CrossRef]
  108. Chen, W.; Pourghasemi, H.R.; Naghibi, S.A. A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in china. Bull. Eng. Geol. Environ. 2018, 77, 647–664. [Google Scholar] [CrossRef]
  109. He, S.; Pan, P.; Dai, L.; Wang, H.; Liu, J. Application of kernel-based fisher discriminant analysis to map landslide susceptibility in the qinggan river delta, three gorges, china. Geomorphology 2012, 171, 30–41. [Google Scholar] [CrossRef]
  110. Chen, W.; Panahi, M.; Pourghasemi, H.R. Performance evaluation of gis-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (anfis) with genetic algorithm (ga), differential evolution (de), and particle swarm optimization (pso) for landslide spatial modelling. CATENA 2017, 157, 310–324. [Google Scholar] [CrossRef]
  111. Wang, Q.; Li, W.; Wu, Y.; Pei, Y.; Xie, P. Application of statistical index and index of entropy methods to landslide susceptibility assessment in gongliu (xinjiang, china). Environ. Earth Sci. 2016, 75, 599. [Google Scholar] [CrossRef]
  112. Naghibi, S.A.; Ahmadi, K.; Daneshi, A. Application of support vector machine, random forest, and genetic algorithm optimized random forest models in groundwater potential mapping. Water Resour. Manag. 2017, 31, 2761–2775. [Google Scholar] [CrossRef]
  113. Chen, W.; Shirzadi, A.; Shahabi, H.; Ahmad, B.B.; Zhang, S.; Hong, H.; Zhang, N. A novel hybrid artificial intelligence approach based on the rotation forest ensemble and naïve bayes tree classifiers for a landslide susceptibility assessment in langao county, china. Geomat. Nat. Hazards Risk 2017, 8, 1955–1977. [Google Scholar] [CrossRef]
  114. Ilia, I.; Tsangaratos, P. Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map. Landslides 2016, 13, 379–397. [Google Scholar] [CrossRef]
  115. Polykretis, C.; Chalkias, C. Comparison and evaluation of landslide susceptibility maps obtained from weight of evidence, logistic regression, and artificial neural network models. Nat. Hazards 2018. [Google Scholar] [CrossRef]
  116. Lee, S. Landslide detection and susceptibility mapping in the sagimakri area, korea using kompsat-1 and weight of evidence technique. Environ. Earth Sci. 2013, 70, 3197–3215. [Google Scholar] [CrossRef]
  117. Malamud, B.D.; Turcotte, D.L.; Guzzetti, F.; Reichenbach, P. Landslide inventories and their statistical properties. Earth Surf. Process. Landf. 2004, 29, 687–711. [Google Scholar] [CrossRef]
  118. Baeza, C.; Lantada, N.; Amorim, S. Statistical and spatial analysis of landslide susceptibility maps with different classification systems. Environ. Earth Sci. 2016, 75, 1318. [Google Scholar] [CrossRef]
  119. Akgun, A.; Sezer, E.A.; Nefeslioglu, H.A.; Gokceoglu, C.; Pradhan, B. An easy-to-use matlab program (mamland) for the assessment of landslide susceptibility using a mamdani fuzzy algorithm. Comput. Geosci. 2012, 38, 23–34. [Google Scholar] [CrossRef]
  120. Kumar, R.; Anbalagan, R. Landslide susceptibility mapping using analytical hierarchy process (ahp) in tehri reservoir rim region, uttarakhand. J. Geol. Soc. India 2016, 87, 271–286. [Google Scholar] [CrossRef]
  121. Khosravi, K.; Pourghasemi, H.R.; Chapi, K.; Bahri, M. Flash flood susceptibility analysis and its mapping using different bivariate models in iran: A comparison between shannon(‘)s entropy, statistical index, and weighting factor models. Environ. Monit. Assess. 2016, 188, 656. [Google Scholar] [CrossRef] [PubMed]
  122. Kleinbaum, D.G.; Klein, M. Introduction to logistic regression. In Logistic Regression: A Self-Learning Text; Kleinbaum, D.G., Ed.; Springer: New York, NY, USA, 2010. [Google Scholar]
  123. Genuer, R.; Poggi, J.-M.; Tuleau-Malot, C.; Villa-Vialaneix, N. Random forests for big data. Big Data Res. 2017, 9, 28–46. [Google Scholar] [CrossRef]
Figure 1. Location of the study area.
Figure 1. Location of the study area.
Applsci 09 00171 g001
Figure 2. Geological map of the study area.
Figure 2. Geological map of the study area.
Applsci 09 00171 g002
Figure 3. Thematic maps: (a) Slope angle; (b) Slope aspect; (c) Elevation; (d) Plan curvature; (e) Profile curvature; (f) TWI; (g) SPI; (h) STI; (i) Distance to rivers; (j) Distance to roads; (k) Distance to faults; (l) Soil; (m) Land use; (n) NDVI; (o) Lithology; and (p) Rainfall.
Figure 3. Thematic maps: (a) Slope angle; (b) Slope aspect; (c) Elevation; (d) Plan curvature; (e) Profile curvature; (f) TWI; (g) SPI; (h) STI; (i) Distance to rivers; (j) Distance to roads; (k) Distance to faults; (l) Soil; (m) Land use; (n) NDVI; (o) Lithology; and (p) Rainfall.
Applsci 09 00171 g003aApplsci 09 00171 g003bApplsci 09 00171 g003c
Figure 4. Landslide susceptibility map using the WoE model.
Figure 4. Landslide susceptibility map using the WoE model.
Applsci 09 00171 g004
Figure 5. Landslide susceptibility map using the WoE-LR model.
Figure 5. Landslide susceptibility map using the WoE-LR model.
Applsci 09 00171 g005
Figure 6. Mean decrease accuracy and mean decrease Gini.
Figure 6. Mean decrease accuracy and mean decrease Gini.
Applsci 09 00171 g006
Figure 7. Landslide susceptibility map using the WoE-RF model.
Figure 7. Landslide susceptibility map using the WoE-RF model.
Applsci 09 00171 g007
Figure 8. ROC curves using the training dataset.
Figure 8. ROC curves using the training dataset.
Applsci 09 00171 g008
Figure 9. ROC curves using the validation dataset.
Figure 9. ROC curves using the validation dataset.
Applsci 09 00171 g009
Table 1. Correlation between landslides and conditioning factors using the WoE model.
Table 1. Correlation between landslides and conditioning factors using the WoE model.
FactorsClassNo. of LandslideNo. of Pixels in DomainW+WWf
Slope angle (°)0–1036738,3600.077−0.0250.102
10–2063914,1630.423−0.2460.669
20–3029821,142−0.2460.075−0.320
30–4011468,077−0.6530.081−0.734
40–502155,377−1.2550.037−1.292
50–60024,3570.0000.0080.000
60–72.83017100.0000.0010.000
Slope aspectFlat08740.0000.0000.000
North16443,863−0.2250.033−0.258
Northeast16405,251−0.1340.019−0.153
East17376,2070.0010.0000.001
Southeast23390,5470.266−0.0440.310
South32374,2220.639−0.1300.769
Southwest13344,928−0.1810.020−0.201
West9354,647−0.5760.055−0.631
Northwest15432,647−0.2640.037−0.301
Elevation (m)442–60028413,5710.405−0.0790.485
600–80048512,1570.730−0.2370.968
800–100031377,6190.598−0.1190.717
1000–120017326,3810.143−0.0180.161
1200–140015398,407−0.1820.024−0.206
1400–16002385,439−2.1630.117−2.281
1600–18000376,0830.0000.1280.000
1800–20000247,3500.0000.0830.000
2000–2200078,2160.0000.0250.000
2200–2410079630.0000.0030.000
Plan curvature−14.0– −0.0558144,0116−0.1140.088−0.203
−0.05–0.0513215,2900.291−0.0250.316
0.05–13.07701,467,7800.055−0.0510.106
Profile curvature−14.28–−0.05661,428,9520.023−0.0200.042
−0.05–0.0516177,8910.689−0.0620.751
0.05–14.77591,516,343−0.1490.123−0.271
TWI<411558,428−0.8290.116−0.945
4–5501,000,9550.101−0.0520.153
5–648746,5220.354−0.1430.497
6–720393,4900.119−0.0180.137
>712423,791−0.4670.057−0.523
SPI<20881,740,6630.113−0.1640.277
20–4020497,521−0.1160.021−0.137
40–6012231,2360.139−0.0120.151
60–805133,800−0.1890.008−0.197
>8016519,966−0.3830.062−0.445
STI<10901,722,6520.146−0.2150.361
10–2032702,4260.009−0.0030.012
20–306295,062−0.7980.056−0.853
30–405141,300−0.2440.010−0.254
>408261,746−0.3900.029−0.419
Distance to rivers (m)<20027521,1290.138−0.0300.168
200–40022463,3900.050−0.0090.059
400–60018427,717−0.0700.011−0.081
600–80019374,8310.116−0.0170.133
>800551,336,119−0.0920.064−0.156
Distance to roads (m)<30033343,8520.754−0.1500.904
300–60016279,5590.237−0.0270.264
600–9008245,226−0.3250.023−0.348
900–120015219,7520.413−0.0400.453
>1200692,034,797−0.2860.382−0.668
Distance to faults (m)<100032671,7960.054−0.0150.069
1000–200018503,008−0.2320.039−0.271
2000–300021412,1890.121−0.0200.141
3000–40008348,794−0.6770.060−0.737
>4000621,187,3990.145−0.1010.246
SoilCumulic Anthrosol20360,3610.206−0.0300.237
Dystric Cambisol4113,893−0.2510.008−0.259
Eutric Cambisol31249,5921.012−0.1651.177
Calcaric Fluvisol037,0350.0000.0120.000
Haplic Luvisol802,211,459−0.2220.393−0.615
Chromic Luvisol010,0450.0000.0030.000
Eutric Planosol314,8361.500−0.0171.516
Calcaric Regosol182,141−1.3110.020−1.330
Eutric Regosol243,8240.0110.0000.011
Land useFarmland8690,02840.750−0.6011.351
Forestland496,7369−2.3900.342−2.732
Grassland511,202,442−0.0620.037−0.100
Water018,8380.0000.0060.000
Residential areas033,5630.0000.0110.000
Bareland06900.0000.0000.000
NDVI−0.21–0.21467,5020.272−0.0070.279
0.21–0.3610207,9910.063−0.0050.068
0.36–0.4463651,0200.762−0.3581.121
0.44– 0.52561,089,3920.130−0.0770.207
0.52–0.6581,107,281−1.8320.379−2.212
Lithology127363,1390.499−0.0890.588
2016940.0000.0010.000
32136,901−1.1280.031−1.159
46398,403−1.0980.093−1.191
5074700.0000.0020.000
60107,8480.0000.0350.000
75225,834−0.7130.039−0.751
810319,450−0.3660.034−0.401
99276,290−0.3260.027−0.353
10139,158−0.5700.005−0.575
1132435,5390.487−0.1070.594
1249811,4600.291−0.1260.417
Rainfall (mm/yr)<9008189,533−0.0670.004−0.071
900–100029582,2170.098−0.0240.122
1000–110023282,0060.591−0.0830.675
1100–120035329,3190.856−0.1741.030
1200–130016271,0860.268−0.0300.298
1300–140018629,601−0.4570.089−0.545
1400–15007351,254−0.8180.068−0.886
1500–16003270,784−1.4050.069−1.474
1600–17001135,625−1.8120.037−1.849
>1700181,761−1.3060.019−1.325
Table 2. Multicollinearity analysis.
Table 2. Multicollinearity analysis.
Landslide Conditioning FactorsCollinearity Statistics
Tolerance (TOL)Variance inflation factors (VIF)
Slope angle0.7611.315
Slope aspect0.8831.133
Elevation0.6501.539
Plan curvature0.7141.400
Profile curvature0.8551.170
TWI0.8281.208
SPI0.4342.303
STI0.4022.489
Distance to rivers0.9461.057
Distance to roads0.7791.284
Distance to faults0.9081.101
NDVI0.7741.292
Soil0.6421.557
Land use0.6271.595
Lithology0.7651.308
Rainfall0.6641.507
Table 3. Maximum likelihood estimation and Cox and Snell’s and Nagelkerke’s R-square.
Table 3. Maximum likelihood estimation and Cox and Snell’s and Nagelkerke’s R-square.
−2 Log LikelihoodCox & Snell R SquareNagelkerke R Square
311.7800.2450.326
Table 4. Coefficients of WoE-LR model.
Table 4. Coefficients of WoE-LR model.
Landslide Conditioning FactorsCoefficients
Slope angle1.122
Slope aspect2.157
Elevation0.986
Plan curvature2.505
Profile curvature0.868
TWI1.764
SPI1.427
STI1.142
Distance to rivers0.512
Distance to roads1.445
Distance to faults0.972
NDVI0.859
Soil1.392
Land use1.634
Lithology1.032
Rainfall1.594
Constant0.806

Share and Cite

MDPI and ACS Style

Chen, W.; Sun, Z.; Han, J. Landslide Susceptibility Modeling Using Integrated Ensemble Weights of Evidence with Logistic Regression and Random Forest Models. Appl. Sci. 2019, 9, 171. https://doi.org/10.3390/app9010171

AMA Style

Chen W, Sun Z, Han J. Landslide Susceptibility Modeling Using Integrated Ensemble Weights of Evidence with Logistic Regression and Random Forest Models. Applied Sciences. 2019; 9(1):171. https://doi.org/10.3390/app9010171

Chicago/Turabian Style

Chen, Wei, Zenghui Sun, and Jichang Han. 2019. "Landslide Susceptibility Modeling Using Integrated Ensemble Weights of Evidence with Logistic Regression and Random Forest Models" Applied Sciences 9, no. 1: 171. https://doi.org/10.3390/app9010171

APA Style

Chen, W., Sun, Z., & Han, J. (2019). Landslide Susceptibility Modeling Using Integrated Ensemble Weights of Evidence with Logistic Regression and Random Forest Models. Applied Sciences, 9(1), 171. https://doi.org/10.3390/app9010171

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop