Next Article in Journal
The Use of Drone Photo Material to Classify the Purity of Photovoltaic Panels Based on Statistical Classifiers
Next Article in Special Issue
Study on Machine Learning Models for Building Resilience Evaluation in Mountainous Area: A Case Study of Banan District, Chongqing, China
Previous Article in Journal
Event Visualization and Trajectory Tracking of the Load Carried by Rotary Crane
Previous Article in Special Issue
A Novel Model for Landslide Displacement Prediction Based on EDR Selection and Multi-Swarm Intelligence Optimization Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Data Mining and Deep Learning for Predicting the Displacement of “Step-like” Landslides

1
Faculty of Engineering, China University of Geosciences, Wuhan 430074, China
2
Engineering Research Center of Rock-Soil Drilling & Excavation and Protection, Ministry of Education, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(2), 481; https://doi.org/10.3390/s22020481
Submission received: 5 December 2021 / Revised: 5 January 2022 / Accepted: 8 January 2022 / Published: 9 January 2022

Abstract

:
Landslide displacement prediction is one of the unsolved challenges in the field of geological hazards, especially in reservoir areas. Affected by rainfall and cyclic fluctuations in reservoir water levels, a large number of landslide disasters have developed in the Three Gorges Reservoir Area. In this article, the Baishuihe landslide was taken as the research object. Firstly, based on time series theory, the landslide displacement was decomposed into three parts (trend term, periodic term, and random term) by Variational Mode Decomposition (VMD). Next, the landslide was divided into three deformation states according to the deformation rate. A data mining algorithm was introduced for selecting the triggering factors of periodic displacement, and the Fruit Fly Optimization Algorithm–Back Propagation Neural Network (FOA-BPNN) was applied to the training and prediction of periodic and random displacements. The results show that the displacement monitoring curve of the Baishuihe landslide has a “step-like” trend. Using VMD to decompose the displacement of a landslide can indicate the triggering factors, which has clear physical significance. In the proposed model, the R2 values between the measured and predicted displacements of ZG118 and XD01 were 0.977 and 0.978 respectively. Compared with previous studies, the prediction model proposed in this article not only ensures the calculation efficiency but also further improves the accuracy of the prediction results, which could provide guidance for the prediction and prevention of geological disasters.

1. Introduction

Landslides occur frequently around the world and are one of the most destructive geological disasters in the world [1,2]. Landslide displacement prediction is one of the geological engineering problems that at present has not been solved, especially for mountain and reservoir areas. Reservoir impoundment usually affects the surrounding geological environment, resulting in landslide disasters. As the largest power station in terms of installed capacity in the world since 2012, the water level of the Three Gorges Reservoir fluctuates between 145 and 175 m all year round. Hence, a large number of landslide disasters have developed in the Three Gorges reservoir [3,4]. Because the Three Gorges reservoir plays an important role in flood control and power generation, it is of great significance to study geological landslides in the Three Gorges Reservoir area [5,6].
Landslide displacement prediction is a hot topic at the forefront of natural hazard research [7]. Displacement prediction is the basis of early warning systems for landslide disasters. Accurate landslide displacement prediction can reduce the losses caused by such disasters as much as possible, so as to ensure the safety of people’s lives and property. Due to the complex geological environment, the accuracy of current methods for directly predicting total displacement is not sufficient [8]. Hence, landslide displacement should be divided into several parts by the decomposition technique. At present, landslide displacement decomposition mainly adopts two methods. The first is the time series and simple moving average method [9,10]. This method is simple and practical, and the displacement component obtained has a clear physical meaning. However, due to defects of the decomposition method itself, the random displacement cannot be obtained. The second is empirical mode decomposition (EMD), wavelet analysis, and ensemble empirical mode decomposition (EEMD), which can divide the total displacement into a specific number of components, so it has clear physical significance [11,12,13].
The landslide displacement prediction model has experienced rapid development in the past 50 years, which was from the initial empirical model to the mathematical statistical model, and then to the non-linear theoretical model and the comprehensive model [14]. Nowadays, with the development of high-speed computers, various machine learning models including deep learning have been widely used for predicting landslide displacement, such as ELM (Extreme Learning Machine) [15], EML (Evaluating Machine Learning) [16], BPNN (Back Propagation Neural Network) [17,18], SVR (Support Vector Regression) [19], KELM (Kernel Extreme Learning Machine) [20,21], LSTM (Long Short-Term Memory) [9,22], and so on. Many algorithms have been used to optimize the parameters for the prediction models, including GS (Grid Search algorithm) [10], PSO (Particle Swarm Optimization) [23], GA (Genetic Algorithm) [24], FOA (Fruit Fly Optimization Algorithm) [25], GWO (Grey Wolf Optimizer) [26], and so on. Therefore, selection of the influencing factors plays a crucial role in the development of landslide prediction. Besides, for landslides in a reservoir area, the fluctuation of the reservoir level and rainfall are usually used as the hydrologic triggering factors of landslide deformation and failure [27]. However, the increase of input factors does not necessarily lead to higher prediction accuracy in the model of landslide displacement prediction. Based on the above facts, for different types of displacement, it is necessary to select the appropriate inducing factor as the input layer to establish the model. At present, data mining technology has been widely used in the field of geological hazards. Nevertheless, the research on data mining technology in landslides mostly focuses on association criterions and thresholds of triggering factors, while there are few publications on the joint use of data mining technology and deep learning. In order to optimize the triggering factors to find the most suitable factors for displacement prediction, data mining technology could be used.
In this paper, the Baishuihe landslide was taken as an example, which was in the east of Three Gorges Reservoir area. Data mining and deep learning were used for predicting the displacement. Based on the time series analysis of landslides, the displacement and triggering factors are decomposed by VMD. Periodic and random terms were predicted by FOA-BPNN. A flow chart of this work is shown in Figure 1.

2. Methodology

2.1. Two-Step Clustering

The two-step clustering algorithm is usually applied to deal with large-scale types of data, which divides and integrates data through a two-step process of pre-clustering and clustering to complete the data classification [28]. For sample data including both numerical and subtype variables, the two-step clustering algorithm usually uses a log-likelihood function. If clustered into j classes, it is defined as:
l = j = 1 J i I j log p ( X i θ i ) = j = 1 J l j
where, p is the likelihood function; Ij is the set of samples of jth class; θj is the parameter vector of jth class; J is the number of clusters. For all samples, the log-likelihood clusters are obtained as the aggregation of the log-likelihood clusters for each category.
For the certain ith class and jth class, the combination is noted as <i, j>, and then their distance can be defined as:
d ( i , j ) = ξ i + ξ j ξ i , j
where ξi and ξj are the log-likelihood distance of ith class and jth class, respectively. ξ<i,j> is the log-likelihood distance of the combination of <i, j>. ξ is the specific form of the log-likelihood function:
ξ v = N V ( k = 1 K A 1 2 log ( σ ^ k 2 + σ ^ v k 2 ) + k = 1 K B E v k )
where,
E v k = l = 1 L k N v k l N v log N v k l N v
where KA is the number of numerical variables; KB is the number of categorical variable; σ ^ k 2 and σ ^ v k 2 denote the total variance of the kth numerical variable and the variance in vth class respectively; Nv and Nvkl are the sample size of category v and the first category in the kth subtype variable; Lk is the category of the kth subtype variable.
After ith class and jth class are combined, −ξ<i,j> is greater than ξi + ξj, and hence d(i,j) is less than 0. Moreover, the smaller d(i,j) is, the more it means that the merging of ith class and jth class will not cause a significant increase in intra-class differences. Specially, when d(i,j) is less than the threshold C, ith class and jth class can be merged. Conversely, when d(i,j) is greater than the threshold C, indicating that merging will cause a significant increase in variability within the clustered clusters, and the ith class and jth class cannot be merged.
The threshold value C is given by:
C = log ( V )
V = k R k m L m
where Rk is the range of values of the kth numeric variable; Lm is the sample size of the mth subtype variable.

2.2. Apriori Algorithm

The a priori algorithm was proposed by Agrawal [29]. This algorithm can deal only with categorical variables rather than numeric variables. The algorithm mainly includes two steps: (1) generating frequent item sets that meet the minimum support values, and (2) generating association rules that satisfy the minimum credibility in the frequent item set generated in the first step.
The frequent item set T contains item a (frequent item set). If its support is equal to or greater than the support threshold specified by the user, as shown in Equation (1), the a priori algorithm uses the iterative method of layer-by-layer searching to generate frequent item sets.
| T ( a ) | | T | min   sup p
Frequent k-item sets are used to explore and generate (k + 1)-item sets. The algorithm implementation process is shown in Figure 2.
Simple association rules are generated from the frequent item sets, and association rules with confidence levels greater than the threshold value are selected to form an effective rule set. If C L ( L L ) is greater than the confidence threshold specified by the user (see Equation (2)), then the association rule can be generated.
C L ( L L ) = | T ( L ) | | T ( L ) | min   c o n f

2.3. VMD

Based on the EMD model, the VMD model was proposed in 2013, which is an adaptive method for signal processing and modal variation [30]. The constraint variation can be expressed as:
{ min { u k } { ω k } { k = 1 K t [ ( σ ( t ) + j π t ) u k ( t ) ] e j ω k t 2 2 }   s . t .   k = 1 K u k = f ( t ) }
where f(t) is the original signal, K is the number of components, t denotes the Dirac function, { ω k } denotes the actual central frequency, { u k } denotes the component obtained after decomposition, ( σ ( t ) + j π t ) u k ( t ) denotes the analytical signal of each component, e j ω k t denotes the estimated central frequency of each analytical signal, and * denotes the convolution operator. We then obtain the following:
L ( { u k } , { ω k } , λ ) = α k t [ ( σ ( t ) + j π t ) u k ( t ) ] e j ω k t 2 2 + f ( t ) k u k ( t ) 2 2 + λ ( t ) , f ( t ) k u k ( t )
where λ denotes the Lagrange multiplier.
By using the alternative direction method of multipliers (ADMM), the saddle point of the model without an upper constraint can be obtained, which is the optimal solution of the constrained variational model, so that the original signal can be decomposed into IMF components.

2.4. FOA-BPNN

The BPNN is a multilayer feedforward neural network based on error back propagation algorithm training, which was first proposed by Rumelhart and McClelland [31]. BPNNs have arbitrary complex pattern classification and good multidimensional function mapping ability. In addition, it can solve XOR and other problems that simple perceptrons cannot solve. Structurally, BPNNs are composed of an input layer, a hidden layer, and an output layer. The BP algorithm takes the network’s square error as the objective function and uses the gradient descent method to calculate the minimum objective function.
In addition, as proposed by Wen-Tsao Pan [32], the FOA is a new method of global optimization, which is based on the foraging behavior of Drosophila melanogaster. Because the fruit fly is superior to other species in terms of smell and vision, the olfactory organ of Drosophila can collect all kinds of smells floating in the air, even the smells of food sources 40 km away. Then, after flying to the vicinity of the food location, they can use their sharp vision to find the food or observe the gathering position of their companions, and fly in that direction. The optimization procedure of the FOA-BPNN is shown in Figure 3.

3. Case Study

3.1. Geological Settings of Three Gorges Reservoir Area

The Three Gorges reservoir is an artificial lake formed after the completion of the Three Gorges hydropower station, situated in the middle part of China. The total lengths of the Yangtze River and surrounding area are 660 km and 1084 km2 respectively. The altitude drops from the highest part to the west and east, forming a hilly landform and medium altitude mountains, respectively. The trend of the mountains is controlled by the main geological structures. The strata in the Three Gorges Reservoir area are from pre Sinian to Quaternary. Jurassic red strata are dominant in the Three Gorges Reservoir area, mainly exposed in the west of Zigui county east of Fengjie county (the red strata refer to sandstone, mudstone, and sandstone interbedded with mudstone layers). In addition, other sedimentary rocks (limestone, marl, and dolomite) also exist in the area between Fengjie and Zigui. These hard rocks form a steep canyon in Fengjie–Zigui area. Metamorphic complexes and magmatic rocks appear in the area near the dam site on a relatively small scale. Controlled by the complex geological conditions, coupled with seasonal rainfall and periodic fluctuation of reservoir water level, a large number of geological disasters have developed in the Three Gorges Reservoir area. A total of 4429 geological disasters have been found up to the present time, most of which are landslides, rock falls, and debris flows [4]. A geological map of the Three Gorges Reservoir area is shown in Figure 4.

3.2. Local Environmental Conditions

The Baishuihe landslide is in the Zigui County area of the Three Gorges Reservoir area, located in the middle latitude, belonging to a subtropical continental monsoon climate zone, with a warm and humid climate, sufficient light, abundant rainfall, and distinct seasons. The average annual rainfall of Zigui County is 1493.2 mm. Rainfall is generally concentrated in the flood season in this area, and the maximum daily rainfall has historically reached up to 358 mm. The monsoon is mainly southerly. Limited by the terrain, the wind speed is generally low. The Yangtze River is the lowest erosion base level in this area and flows through the front edge of the landslide from west to east. The cross section of the river valley is a “V” shape, steep to the north and gentle to the south. There are several gullies short in length and depth in the landslide area, all of which are trunk gullies. Only temporary flood flows are formed after rainstorms, which constitute the primary discharge channel of surface water in the area.
The Baishuihe landslide is located on the south bank (convex bank) of the Yangtze River. The elevation of the landslide gradually decreases from south to north. The elevation of the toe and rear edges is about 70 m and 400 m, and the bedrock ridge is the boundary between the eastern and western sides. The deformation of the middle and front part of the landslide is relatively strong. Pinnate fissures are continuously distributed on both sides of the boundary, and the boundary between the east side and the rear edge is basically connected. The slope of the landslide is 30° to 35°, and the landslide has an average thickness of 30 m with a volume of 1.26 × 107 m3. The topographic map and a schematic geological profile of the Baishuihe landslide are shown in Figure 5 and Figure 6.

3.3. Deformation of the Landslide

The Baishuihe landslide has been monitored since June 2003, and the layout of the monitoring surface layout is shown in Figure 5b. According to the characteristics of the surface monitoring displacement and surface macro-deformations, the Baishuihe landslide can be divided into two areas; however, the active area (area A) is the middle and front part of the landslide, which has strong deformation. After the completion of the Three Gorges Dam, the landslide has produced obvious displacement due to the impoundment of the reservoir. Several transverse tension cracks have appeared in the east of the landslide. Specifically, the eastern and posterior boundaries are basically connected, and the western boundary’s cracks are in the shape of pinnately distributed cracks. From August 2005 to August 2006, there were many landslides on the inner side slope of the riverside highway with an elevation of about 220 m, and many subsidence and tension cracks appeared on the surface of the landslide. Approximately 100,000 m3 of landslide debris piled on the road towards the rear of the active area in June 2007 (Figure 7).

3.4. Analysis of the Monitoring Data

There are three monitoring sections and six GPS monitoring points in the active Baishuihe landslide area. Among them, the monitoring points ZG93 and ZG118 have been in place since June 2003; XD01 and XD02 were added in May 2005, and XD03, XD04 were added in October 2005. Note that the displacement of all the monitoring points is synchronous. The monitoring period of Points ZG93, ZG118, and XD01 is long, meaning that they are representative and can reflect the entire movement process of the landslide. Therefore, in this study, these three points were taken for detailed analyses (Figure 8). According to the filling scheduling of the reservoir, the monitoring data can be divided into three stages for analysis, as described below.
(1)
Phase I (from June 2003 to June 2006): The water level of the reservoir started at 135 m in September and reached its highest level of 139 m in October. The maximum displacement of ZG93 and ZG118 was 25.8 mm and 30.6 mm, respectively, during the three impoundment periods. During Phase I, the reservoir basically maintained the highest water level from November to January of the next year. The maximum monthly displacement rates of these three points were below 13 mm/month during this period, which was relatively slow. The water level began to drop in February each year and reached the lowest level (135 m) in July. During this period, the minimum increase of these three points was over 80 mm, and the maximum increase was over 150 mm. Especially in May and June, the rate of increase in landslide displacement was the largest. From the end of July to the beginning of September, the reservoir water level remained at the lowest level, but the landslide displacements first continued to grow rapidly and then basically remained the same. In this stage, the water level of the Yangtze River changed from having the natural water level for many years to the manually adjusted reservoir water level, and the landslide was still in the adaptation period of adjustment of the reservoir’s water level. Therefore, we can consider that the deformation of the landslide in this stage was mainly affected by the decline in the reservoir’s water level. In particular, the heavy rainfall in July 2005 did not cause an obvious increase in the displacement of the landslide.
(2)
Phase II (from July 2006 to June 2008): The water level of the reservoir fluctuated between 145 and 155 m, which dropped from 155 m to 145 m for the first time during April to June 2007. Alternatively, a drastic drop in the water level led to an increase in the hydrodynamic pressure inside the landslide, which caused the displacement of each monitoring point to suddenly increase for the first time, increasing by more than 1000 mm.
(3)
Phase III (from July 2008 to December 2016): The water level of the reservoir fluctuated between 145 and 175 m. Before 2015, the annual displacement rate showed a downward trend.
In summary, the fluctuation in the reservoir’s water level resulted in the significant extension of the fluctuating range and immersion range of the reservoir’s water level, then, in turn, the stress field, seepage field, and rock–soil structure characteristics of the sliding mass changed significantly, which had a significant impact on the evolution process of the Baishuihe landslide. In addition, owing to the different stages of the reservoir’s water level operations, there were some differences in the degree of impact on the deformation evolution process of the landslide. Moreover, although the landslide has undergone some adjustment, its shear deformation energy has been released to a certain extent. However, when external effects such as rainfall and the reservoir water level change dramatically again, the landslide will tend to be unstable.

4. Results

4.1. Triggering Factors

Rainfall and periodic fluctuation of reservoir water level are the main inducing factors of landslide deformation [34,35]. The periodic fluctuation of the water level in the Three Gorges causes dynamic osmotic pressure in the slope, resulting in landslide deformation [36]. On the one hand, rainfall can increase the weight of the landslide mass, thus increasing the sliding force of landslides; on the other hand, it can weaken the mechanical strength of the landslide rock and soil mass, resulting in landslide deformation [37,38]. Therefore, rainfall and reservoir water level can be used as trigger factors for landslide deformation [39]. In this research, a total of 10 triggering factors were selected to carry out displacement prediction research, including 5 reservoir level related factors (monthly average water level h ¯ ; monthly maximum daily drop of water level Δ h max d a i l y d r o p ; monthly maximum daily rise of water level Δ h max d a i l y r i s e ; monthly fluctuation of water level Δ h m o n t h ; bimonthly fluctuation of water level Δ h 2 m o n t h ), 4 rainfall related factors (monthly maximum effective continuous rainfall q c o n t i n u o u s e f f e c t i v e ; monthly cumulative rainfall q m o n t h ; bimonthly cumulative rainfall q 2 m o n t h ; monthly maximum daily rainfall q max d a y ), and 1 deformation factor (last monthly velocity of deformation v), as shown in Table 1. In this study, the monitoring data of ZG93 were selected for landslide prediction. In addition, because the monitoring points ZG118 and XD01 have similar deformation characteristics to ZG93, the monitoring data of ZG118 and XD01 were added to increase the sample size and overcome model overfitting errors, as well as to provide a more representative prediction of the overall landslide displacement. The triggering factors are shown in Table 1.

4.2. Clustering Results

In the two-step clustering algorithm, the minimum and maximum categories of the triggering factors were set as 2 and 10 respectively. In clustering algorithms, there are two commonly used clustering criteria: the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). When the number of samples is large, the BIC criterion can effectively avoid the model complexity caused by high model accuracy. Therefore, in this study, the BIC was chosen as the cluster criterion, and the distance measurement method was Euclidean distance. The clustering results of the external triggering factors are shown in Table 2 and Table 3. Monthly velocity (v) was clustered into three categories (Low, V1; Medium, V2; High, V3), as shown in Table 4.

4.3. Association Rules

In the a priori algorithm, the minimum conditional support was set to 0.01 and the minimum rule confidence was set to 100% to ensure that the mining association criteria were absolutely correct. In total, 5447 association rules were generated, most of which were V1 and V2 stages (4247 and 1008, respectively). The main factors controlling V1 deformation of the landslide were smooth fluctuations of the reservoir’s water level and light rainfall. The main factors controlling V2 deformation of the landslide were sharp fluctuations of the water level and medium to heavy rainfall. The main factor controlling V3 deformation of the landslide was heavy rainfall. Nevertheless, there may be some time correlation between these nine factors. In general, a drop in reservoir water and heavy rainfall were the main factors causing landslide deformation in the Three Gorges Reservoir area. It can be seen from Figure 7 that the water level of the Three Gorges reservoir has had a period of slow decline (175 m–165 m) from January to April and a rapid decline (165 m–145 m) from April to June since 2008. The heavy rainfall is concentrated from June to September every year. Moreover, this is also a critical period when the landslide produces severe deformation.
The statistical results of the data mining and association rules are shown in Table 5. The total support, average support, and the contribution without support of each triggering factor were counted, and the comprehensive contribution was the mean value of these three contributions. The comprehensive contribution of each factor according to the association rules is shown in Figure 9. Factors with a degree of contribution less than 0.3 were eliminated and were not used as input layers in the prediction model. Therefore, eight triggering factors were taken as the input layer in the V1 and V3 prediction models (F1, F3, F5, F6, F7, F8, F9, and F10), and eight triggering factors were taken as the input layer in the V2 prediction model (F1, F2, F5, F6, F7, F8, F9, and F10).

4.4. Decomposition of Displacement

The non-stationary time series theory indicated that the time series consisted of three parts: the trend term, the periodic term, and the random term. For the landslide displacement, the time series can be divided into three parts: (1) trend displacement, which is controlled by internal factors, such as geological conditions, geomorphology, geological structure, rock and soil properties, etc.; (2) periodic displacement, which is controlled by external factors, such as rainfall, the reservoir’s water level, wind load, air temperature, etc.; and (3) random displacement, which is controlled by random factors, such as human activities, engineering construction, vehicle loads, vibration loads, etc., as shown in Figure 10.
X ( t ) = α ( t ) + β ( t ) + γ ( t )
Here, X ( t ) denotes the observed value of landslide displacement, and α ( t ) , β ( t ) , and γ ( t ) denote the trend, periodic, and random displacements, respectively.
Therefore, K was set to 3 and 2 in the VMD decomposition of the landslide displacements and triggering factors, respectively. The penalty parameter a and the rising step τ (a = 1.5 and τ = 0.1) were finally determined through multiple trials as follows. (1) In the displacement decomposition, a = 1.5 and τ = 0.1. (2) In the triggering factors decomposing, a = 700 and τ = 0.5. The decomposition results are shown in Table 6 and Figure 11.

4.5. Displacement Prediction

4.5.1. Trend Term Prediction

The displacement of the trend term showed a distinct piecewise function. Therefore, the trend term of ZG93 was divided into three phases: Phase 1 (June 2003~June 2007), Phase 2 (June 2007~June 2014), and Phase 3 (June 2014~December 2016). Multiple fitting results showed that good fitting results can be obtained by using a cubic function and the robust least squares method. The fitting function can be defined as:
S = at3 + bt2 + ct + d
The fitting results and parameters are shown in Figure 12 and Table 7, which indicate that the prediction accuracy’s R2 and the RMSE of the trend term were 99.4% and 4.063.

4.5.2. Periodic and Random Term Prediction

The periodic and random displacements were trained and predicted by FOA-BPNN. In general, a model’s performance is usually affected by its own structure. Through extensive sensitivity analysis, the most reliable structure can be obtained [40,41]. Therefore, in the process of FOA optimization, 12 different structures with population sizes between 10 and 120 (10 intervals) were tested [42,43]. Each network was executed with 100 repetitions, and the MSE (between the actual and predicted periodic displacements of the landslide) was defined as the objective function used to evaluate the performance error of the model. It is worth noting that each structure was tested five times to evaluate its repeatability. The sensitivity curves are shown in Figure 13, which indicate that the MSE of the model decreases with an increase in the population size. However, because FOA-BPNN integration reduces the error in the training process, the model is less sensitive to population size. The computing time of models with different population sizes is shown in Figure 14. After consideration of the calculation costs and error, the population size of FOA-BPNN model was determined as 10. In total, six FOA-BPNN models were built, including individual periodic prediction models for V1, V2, and V3, and individual random prediction models for V1, V2, and V3, as shown in Figure 15.
Figure 15 indicates that the proposed models achieved good prediction results. According to the results of the residual error analysis, in the training process of the model, the residual error of displacement was relatively stable, which also verified the robustness and reliability of the model. For the prediction samples, there were some fluctuations in the residual error. The prediction accuracy of the model will be analyzed in the Discussion section.

4.5.3. Total Displacement

The total displacement prediction results of the landslide can be obtained by superimposing the prediction results of all three types of displacement, as shown in Figure 16, which shows that the prediction model achieved good accuracy for monitoring point ZG93. In June 2007, there was a significant difference between the total displacement training value and the actual value, resulting in the obvious mutation of the residual error. This was because in the three parts of landslide displacement (trend, periodic, random), the trend displacement accounts for more than 85%. In June 2007, it was the boundary between Phase 1 and Phase 2, where there were some differences in the training results of the two polynomial fitting functions, resulting in a large residual error in the total displacement. However, the residual error was relatively stable for the prediction samples.

5. Discussion

As mentioned above, the landslide displacement contains three parts: (1) trend displacement, which is controlled by internal factors; (2) periodic displacement, which is controlled by external factors; and (3) random displacement, which is controlled by random factors. Generally, in predictions of landslide displacement, the selection of triggering factors is based on monitoring data such as rainfall and the reservoir’s water level, which are the main factors causing periodic displacement. Therefore, taking these factors as the input of periodic displacement will not only have clear physical significance but will also significantly improve the accuracy of landslide displacement predictions. When the time series analysis method was used to predict the landslide displacement, the displacement trend was relatively easy to predict. Therefore, choosing the appropriate periodic displacement prediction model is the key to improving the effect of landslide displacement predictions. Moreover, landslide prediction models have experienced rapid development in the past 50 years, and various machine learning models have been widely used for predicting landslide displacements. However, each algorithm has its limitations. For instance, SVM has low computational complexity but it is sensitive to the choice of parameters and kernel function. The decision tree model does not need any prior assumptions on the data, but the required sample size is relatively large, and its ability to deal with missing values is quite limited. ELM uses the principle of least squares and a pseudo-inverse matrix to solve the problem, which is only suitable for single-hidden-layer neural networks. BPNN has strong self-learning, self-adaptive ability, and good generalization ability but it is prone to slow convergence. In this study, based on the VMD and data mining results, the FOA-BPNN was used to predict the periodic and random terms of monitoring point ZG93′s displacement. The BPNN, SVM, and ELM algorithms were chosen as the comparison models (Models 2–4). The performance of various displacement prediction models of the Baishuihe landslide are shown in Table 8 and Figure 17. The prediction accuracy of the FOA-BPNN model was the highest. The R2 reached 0.977 and its RMSE was only 10.041. In contrast, the proposed model could improve the accuracy of landslide displacement predictions.
In this study, ZG93’s monitoring data were selected for predicting displacement, and the monitoring data of points ZG118 and XD01 were added to increase the sample size and overcome the model’s overfitting error, and to provide a better representative prediction of the overall landslide displacement. The accuracy of various models in terms of predicting ZG93’s displacement has been discussed. The monitoring points ZG118 and XD01 in 2016 were used for the model validation. The measured and predicted displacements of ZG93, ZG118, XD01 are shown in Figure 18. The R2 values between the measured and predicted displacements of ZG118 and XD01 were 0.977 and 0.978, respectively. The RMSE of these two monitoring points was 12.40 and 16.04, respectively. In the previous study [10], cumulative displacement was divided into trend term and periodic term by time series model and moving average method. A cubic polynomial model was proposed to predict the trend term of displacement. Then, multiple algorithms were used to determine the optimal support vector regression (SVR) model and train and predict the periodic term. In this paper, data mining technology is used to screen the trigger factors of periodic items, and the more advanced FOA optimization algorithm is used to optimize the parameters of the machine learning model. Furthermore, this paper uses a VMD model to divide the landslide displacement data, which makes great progress compared with the moving average model, and will be more conducive to the integration and automation of the landslide prediction model. Therefore, the prediction accuracy obtained in this paper (R2 = 0.977 and 0.978) is significantly higher than that of previous studies (R2 = 0.963 and 0.951). In general, the model proposed in this study has achieved good results in terms of predicting the displacement of different monitoring points of the landslide, which has high practicability and application value in the study of landslide displacement predictions. However, it is worth noting that due to the small amount of displacement data in the V3 state of the monitoring point (Table 6), the prediction results of XD01 have obvious errors for July 2016. Therefore, in order to obtain satisfactory prediction results, the monitoring data of various states should be supplemented as much as possible.

6. Conclusions

In this paper, the Baishuihe landslide in the Three Gorges Reservoir area was taken as an example. Data mining and deep learning were used for displacement prediction. The following conclusions can be reached:
(1)
Using VMD to decompose the displacement of Baishuihe landslide can correspond to the triggering factors, which had clear physical significance.
(2)
The association rules showed that the main factors controlling the V2 and V3 deformation of the landslide were the sharp fluctuation of reservoir water level and medium–heavy rainfall.
(3)
R2 between the measured and prediction displacements of ZG118 and XD01 were 0.977 and 0.978. RMSE of these two monitoring points were 12.40 and 16.04, respectively.
(4)
An integrated approach for landslide displacement prediction including data mining and deep learning was proposed, which could guide the managers of geological disasters to improve the prediction accuracy, so as to reduce the losses caused by landslides.

Author Contributions

Writing—original draft, F.M.; data curation, X.X.; writing—review and editing, Y.W.; investigation, F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (42007267, 41977244), Science and Technology Project of Hubei Provincial Department of Natural Resources (ZRZY2020KJ12), and the National Key R&D Program of China (2017YFC-1501301).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hong, H.; Pourghasemi, H.; Pourtaghi, Z. Landslide susceptibility assessment in Lianhua County (China): A comparison between a random forest data mining technique and bivariate and multivariate statistical models. Geomorphology 2016, 259, 105–118. [Google Scholar] [CrossRef]
  2. Juang, C.H.; Dijkstra, T.; Wasowski, J.; Meng, X. Loess geohazards research in China: Advances and challenges for mega engineering projects. Eng. Geol. 2019, 251, 1–10. [Google Scholar] [CrossRef]
  3. Wu, Y.; Miao, F.; Li, L.; Xie, Y.; Chang, B. Time-varying reliability analysis of Huangtupo Riverside No. 2 landslide in the Three Gorges Reservoir based on water-soil coupling. Eng. Geol. 2017, 226, 267–276. [Google Scholar] [CrossRef]
  4. Tang, H.; Wasowski, J.; Juang, C.H. Geohazards in the three Gorges Reservoir Area, China–Lessons learned from decades of research. Eng. Geol. 2019, 261, 105267. [Google Scholar] [CrossRef]
  5. Miao, F.; Wu, Y.; Li, L.; Tang, H.; Li, Y. Centrifuge model test on the retrogressive landslide subjected to reservoir water level fluctuation. Eng. Geol. 2018, 245, 169–179. [Google Scholar] [CrossRef]
  6. Li, H.; Xu, Q.; He, Y.; Fan, X.; Li, S. Modeling and predicting reservoir landslide displacement with deep belief network and EWMA control charts: A case study in Three Gorges Reservoir. Landslides 2020, 17, 693–707. [Google Scholar] [CrossRef]
  7. Intrieri, E.; Carlà, T.; Gigli, G. Forecasting the time of failure of landslides at slope-scale: A literature review. Earth-Sci. Rev. 2019, 193, 333–349. [Google Scholar] [CrossRef]
  8. Lenti, L.; Martino, S. The interaction of seismic waves with step-like slopes and its influence on landslide movements. Eng. Geol. 2012, 126, 19–36. [Google Scholar] [CrossRef]
  9. Yang, B.; Yin, K.; Lacasse, S.; Liu, Z. Time series analysis and long short-term memory neural network to predict landslide displacement. Landslides 2019, 16, 677–694. [Google Scholar] [CrossRef]
  10. Miao, F.; Wu, Y.; Xie, Y.; Li, Y. Prediction of landslide displacement with step-like behavior based on multialgorithm optimization and a support vector regression model. Landslides 2018, 15, 475–488. [Google Scholar] [CrossRef]
  11. Huang, F.; Huang, J.; Jiang, S.; Zhou, C. Landslide displacement prediction based on multivariate chaotic model and extreme learning machine. Eng. Geol. 2017, 218, 173–186. [Google Scholar] [CrossRef]
  12. Shihabudheen, K.V.; Pillai, G.N.; Peethambaran, B. Prediction of landslide displacement with controlling factors using extreme learning adaptive neuro-fuzzy inference system (ELANFIS). Appl. Soft Comput. 2017, 61, 892–904. [Google Scholar]
  13. Lu, X.; Miao, F.; Xie, X.; Li, D.; Xie, Y. A new method for displacement prediction of “step-like” landslides based on VMD-FOA-SVR model. Environ. Earth Sci. 2021, 80, 542. [Google Scholar] [CrossRef]
  14. Ska, B.; Dbk, A.; Tsa, C. Investigating the potential of a global precipitation forecast to inform landslide prediction. Weather Clim. Extrem. 2021, 33, 100364. [Google Scholar]
  15. Deng, L.; Smith, A.; Dixon, N.; Yuan, H. Machine learning prediction of landslide deformation behaviour using acoustic emission and rainfall measurements. Eng. Geol. 2021, 293, 106315. [Google Scholar] [CrossRef]
  16. Goetz, J.N.; Brenning, A.; Petschko, H.; Leopold, P. Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput. Geosci. 2015, 81, 1–11. [Google Scholar] [CrossRef]
  17. Chen, H.; Zeng, Z. Deformation prediction of landslide based on improved back-propagation neural network. Cogn. Comput. 2013, 5, 56–62. [Google Scholar] [CrossRef]
  18. Zhang, Y.; Tang, J.; Liao, R.; Zhang, M.-F.; Zhang, Y.; Wang, X.-M.; Su, Z.-Y. Application of an enhanced bp neural network model with water cycle algorithm on landslide prediction. Stoch. Environ. Res. Risk Assess. 2021, 35, 1273–1291. [Google Scholar] [CrossRef]
  19. Liu, Y.; Xu, C.; Huang, B.; Ren, X.; Liu, C.; Hu, B.; Chen, Z. Landslide displacement prediction based on multi-source data fusion and sensitivity states. Eng. Geol. 2020, 271, 105608. [Google Scholar] [CrossRef]
  20. Zhou, C.; Yin, K.; Cao, Y.; Intrieri, E.; Ahmed, B.; Catani, F. Displacement prediction of step-like landslide by applying a novel kernel extreme learning machine method. Landslides 2018, 15, 2211–2225. [Google Scholar] [CrossRef] [Green Version]
  21. Li, L.; Wu, Y.; Miao, F.; Xue, Y.; Huang, Y. A hybrid interval displacement forecasting model for reservoir colluvial landslides with step-like deformation characteristics considering dynamic switching of deformation states. Stoch. Environ. Res. Risk Assess. 2021, 35, 1089–1112. [Google Scholar] [CrossRef]
  22. Xu, S.; Niu, R. Displacement prediction of Baijiabao landslide based on empirical mode decomposition and long short-term memory neural network in Three Gorges area, China. Comput. Geosci. 2018, 111, 87–96. [Google Scholar] [CrossRef]
  23. Zhou, C.; Yin, K.; Cao, Y.; Ahmed, B. Application of time series analysis and PSO–SVM model in predicting the Bazimen landslide in the Three Gorges Reservoir, China. Eng. Geol. 2016, 204, 108–120. [Google Scholar] [CrossRef]
  24. Li, X.Z.; Kong, J.M. Application of GA-SVM method with parameter optimization for landslide development prediction. Nat. Hazards Earth Syst. Sci. 2014, 14, 525. [Google Scholar] [CrossRef] [Green Version]
  25. Wang, J.; Di, Y.; Rui, X. Research and application of machine learning method based on swarm intelligence optimization. J. Comput. Methods Sci. Eng. 2019, 19, 179–187. [Google Scholar] [CrossRef]
  26. Guo, Z.; Chen, L.; Gui, L.; Du, J.; Yin, K.; Do, H.M. Landslide displacement prediction based on variational mode decomposition and WA-GWO-BP model. Landslides 2020, 17, 567–583. [Google Scholar] [CrossRef]
  27. Hong, H.; Chen, W.; Xu, C.; Youssef, A.M.; Pradhan, B.; Bui, D.T. Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy. Geocarto Int. 2017, 32, 139–154. [Google Scholar] [CrossRef]
  28. Wu, X.; Zhan, F.; Zhang, K.; Deng, Q. Application of a two-step cluster analysis and the apriori algorithm to classify the deformation states of two typical colluvial landslides in the three gorges, china. Environ. Earth Sci. 2016, 75, 146. [Google Scholar] [CrossRef]
  29. Agrawal, R.; Srikant, R. Fast algorithms for mining association rules. Int. Conf. Very Large Data Bases VLDB 1994, 1215, 487–499. [Google Scholar]
  30. Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal Processing 2013, 62, 531–544. [Google Scholar] [CrossRef]
  31. McClelland, J.; Rumelhart, D.; PDP Research Group. Parallel distributed processing. Explor. Microstruct. Cogn. 1986, 2, 216–271. [Google Scholar]
  32. Pan, W.T. A new fruit fly optimization algorithm: Taking the financial distress model as an example. Knowl. Based Syst. 2012, 26, 69–74. [Google Scholar] [CrossRef]
  33. Miao, F.; Wu, Y.; Li, L.; Liao, K.; Xue, Y. Triggering factors and threshold analysis of baishuihe landslide based on the data mining methods. Nat. Hazards 2021, 105, 2677–2696. [Google Scholar] [CrossRef]
  34. Song, K.; Wang, F.; Yi, Q.; Lu, S. Landslide deformation behavior influenced by water level fluctuations of the Three Gorges Reservoir (China). Eng. Geol. 2018, 247, 58–68. [Google Scholar] [CrossRef]
  35. Huang, D.; Gu, D.M.; Song, Y.X.; Cen, D.F.; Zeng, B. Towards a complete understanding of the triggering mechanism of a large reactivated landslide in the Three Gorges Reservoir. Eng. Geol. 2018, 238, 36–51. [Google Scholar] [CrossRef]
  36. Wang, G.; Sassa, K. Factors affecting rainfall-induced flowslides in laboratory flume tests. Geotechnique 2001, 51, 587–599. [Google Scholar] [CrossRef]
  37. Wang, J.; Xiao, L.; Zhang, J.; Zhu, Y. Deformation characteristics and failure mechanisms of a rainfall-induced complex landslide in Wanzhou County, Three Gorges Reservoir, China. Landslides 2020, 17, 419–431. [Google Scholar] [CrossRef]
  38. Wu, L.Z.; Zhu, S.R.; Peng, J. Application of the Chebyshev spectral method to the simulation of groundwater flow and rainfall-induced landslides. Appl. Math. Model. 2020, 80, 408–425. [Google Scholar] [CrossRef]
  39. Xiong, X.; Shi, Z.; Xiong, Y.; Peng, M.; Ma, X.; Zhang, F. Unsaturated slope stability around the Three Gorges Reservoir under various combinations of rainfall and water level fluctuation. Eng. Geol. 2019, 261, 105231. [Google Scholar] [CrossRef]
  40. Mehrabi, M.; Pradhan, B.; Moayedi, H.; Alamri, A. Optimizing an adaptive neuro-fuzzy inference system for spatial prediction of landslide susceptibility using four state-of-the-art metaheuristic techniques. Sensors 2020, 20, 1723. [Google Scholar] [CrossRef] [Green Version]
  41. Moayedi, H.; Mehrabi, M.; Bui, D.T.; Pradhan, B.; Foong, L.K. Fuzzy-metaheuristic ensembles for spatial assessment of forest fire susceptibility. J. Environ. Manag. 2020, 260, 109867. [Google Scholar] [CrossRef] [PubMed]
  42. Moayedi, H.; Mehrabi, M.; Kalantar, B.; Mu’Azu, M.A.; Rashid, A.S.A.; Foong, L.K.; Nguyen, H. Novel hybrids of adaptive neuro-fuzzy inference system (ANFIS) with several metaheuristic algorithms for spatial susceptibility assessment of seismic-induced landslide. Geomatics. Nat. Hazards Risk 2019, 10, 1879–1911. [Google Scholar] [CrossRef] [Green Version]
  43. Moayedi, H.; Mehrabi, M.; Mosallanezhad, M.; Rashid, A.S.A.; Pradhan, B. Modification of landslide susceptibility mapping using optimized PSO-ANN technique. Eng. Comput. 2019, 35, 967–984. [Google Scholar] [CrossRef]
Figure 1. Flow chart of the displacement prediction.
Figure 1. Flow chart of the displacement prediction.
Sensors 22 00481 g001
Figure 2. The implementation process of the a priori algorithm.
Figure 2. The implementation process of the a priori algorithm.
Sensors 22 00481 g002
Figure 3. Optimization procedure of an FOA-BPNN.
Figure 3. Optimization procedure of an FOA-BPNN.
Sensors 22 00481 g003
Figure 4. Geological map of the Three Gorges Reservoir area.
Figure 4. Geological map of the Three Gorges Reservoir area.
Sensors 22 00481 g004
Figure 5. (a) Location of the Baishuihe landslide; (b) Topographic map of the Baishuihe landslide; (c) Overall view of the Baishuihe landslide © 2022 Springer Nature [33].
Figure 5. (a) Location of the Baishuihe landslide; (b) Topographic map of the Baishuihe landslide; (c) Overall view of the Baishuihe landslide © 2022 Springer Nature [33].
Sensors 22 00481 g005
Figure 6. Schematic geological profile of the Baishuihe landslide (II–II’) © 2022 Springer Nature [10].
Figure 6. Schematic geological profile of the Baishuihe landslide (II–II’) © 2022 Springer Nature [10].
Sensors 22 00481 g006
Figure 7. Macroscopic deformation of the Baishuihe landslide © 2022 Springer Nature [33].
Figure 7. Macroscopic deformation of the Baishuihe landslide © 2022 Springer Nature [33].
Sensors 22 00481 g007
Figure 8. Long term monitoring data of the Baishuihe landslide (displacement, reservoir level, precipitation).
Figure 8. Long term monitoring data of the Baishuihe landslide (displacement, reservoir level, precipitation).
Sensors 22 00481 g008
Figure 9. Comprehensive contribution of each factor according to the association rules (V1: Low monthly velocity; V2: Medium monthly velocity; V3: High monthly velocity; F1: Monthly average water level; F2: Maximum monthly daily drop in water level; F3: Maximum monthly daily rise in water level; F4: Monthly fluctuation of the water level; F5: Bimonthly fluctuation of the water level; F6: Maximum monthly effective continuous rainfall; F7: Cumulative monthly rainfall; F8: Cumulative bimonthly rainfall; F9: Maximum monthly daily rainfall).
Figure 9. Comprehensive contribution of each factor according to the association rules (V1: Low monthly velocity; V2: Medium monthly velocity; V3: High monthly velocity; F1: Monthly average water level; F2: Maximum monthly daily drop in water level; F3: Maximum monthly daily rise in water level; F4: Monthly fluctuation of the water level; F5: Bimonthly fluctuation of the water level; F6: Maximum monthly effective continuous rainfall; F7: Cumulative monthly rainfall; F8: Cumulative bimonthly rainfall; F9: Maximum monthly daily rainfall).
Sensors 22 00481 g009
Figure 10. Relationship among trend displacement, periodic displacement, and random displacement.
Figure 10. Relationship among trend displacement, periodic displacement, and random displacement.
Sensors 22 00481 g010
Figure 11. VMD decomposition of displacements at the monitoring points (ZG93, ZG118, XD01).
Figure 11. VMD decomposition of displacements at the monitoring points (ZG93, ZG118, XD01).
Sensors 22 00481 g011
Figure 12. Fitting and prediction curves of the trend term.
Figure 12. Fitting and prediction curves of the trend term.
Sensors 22 00481 g012
Figure 13. A population-based sensitivity analysis for the FOA-BPNN model.
Figure 13. A population-based sensitivity analysis for the FOA-BPNN model.
Sensors 22 00481 g013
Figure 14. Computing time of different population sizes in MATLAB2019 software.
Figure 14. Computing time of different population sizes in MATLAB2019 software.
Sensors 22 00481 g014
Figure 15. Training and prediction curves of the periodic and random terms.
Figure 15. Training and prediction curves of the periodic and random terms.
Sensors 22 00481 g015
Figure 16. Training and prediction curves of the total displacement.
Figure 16. Training and prediction curves of the total displacement.
Sensors 22 00481 g016
Figure 17. Prediction curves of the total displacement.
Figure 17. Prediction curves of the total displacement.
Sensors 22 00481 g017
Figure 18. Measured and predicted displacements of ZG93, ZG118, and XD01.
Figure 18. Measured and predicted displacements of ZG93, ZG118, and XD01.
Sensors 22 00481 g018
Table 1. Triggering factors used to carry out displacement predictions.
Table 1. Triggering factors used to carry out displacement predictions.
No.FactorsCategory
F1 Monthly   average   water   level   ( h ¯ ) (m)Reservoir water
F2 Maximum   monthly   daily   drop   in   water   level   ( Δ h max d a i l y d r o p ) (m/day)Reservoir water
F3 Maximum   monthly   daily   rise   in   water   level   ( Δ h max d a i l y r i s e ) (m/day)Reservoir water
F4 Monthly   fluctuation   of   the   water   level   ( Δ h m o n t h ) (m/month)Reservoir water
F5 Bimonthly   fluctuation   of   the   water   level   ( Δ h 2 m o n t h ) (m/2 months)Reservoir water
F6 Maximum   monthly   effective   continuous   rainfall   ( q c o n t i n u o u s e f f e c t i v e ) (mm)Rainfall
F7 Cumulative   monthly   rainfall   ( q m o n t h ) (mm)Rainfall
F8 Cumulative   bimonthly   rainfall   ( q 2 m o n t h ) (mm)Rainfall
F9 Maximum   monthly   daily   rainfall   ( q max d a y ) (mm)Rainfall
F10Monthly velocity (v) (mm/month)Deformation
Table 2. Clustering results of the reservoir water level factors (ZG93, ZG118, XD01).
Table 2. Clustering results of the reservoir water level factors (ZG93, ZG118, XD01).
No.FactorsClustering ResultsCount
F1 h ¯ (135.13~138.95)High Water Level (F11)97
(144.21~158.02)Medium Water Level (F12)186
(160.14~174.74)Low Water Level (F13)183
F2 Δ h max d a i l y d r o p (−0.14~0.58)Slow Daily Drop (F21)339
(0.63~1.87)Medium Daily Drop (F22)92
(1.91~3.69)Sharp Daily Drop (F23)35
F3 Δ h max d a i l y r i s e (−0.43~0.04)Slow Daily Rise (F31)129
(−1.70~−0.49)Sharp Daily Rise (F32)337
F4 Δ h m o n t h (0~6.18)Smooth Fluctuation (F41)349
(6.59~18.25)Sharp Fluctuation (F42)117
F5 Δ h 2 m o n t h (0~6.50)Non-fluctuation (F51)250
(6.68~14.15)Smooth Fluctuation (F52)126
(14.91~28.71)Sharp Fluctuation (F53)90
Table 3. Clustering results of the rainfall factors (ZG93, ZG118, XD01).
Table 3. Clustering results of the rainfall factors (ZG93, ZG118, XD01).
No.FactorsClustering ResultsCount
F6 q c o n t i n u o u s e f f e c t i v e (1.50~30.30)Light Effective Rainfall (F61)182
(31.30~66.00)Moderate Effective Rainfall (F62)151
(67.70~110.50)Medium Effective Rainfall (F63)92
(125.00~239.40)Heavy Effective Rainfall (F64)41
F7 q m o n t h (3.10~66.10)Light Effective Rainfall (F71)198
(69.90~163.70)Moderate Effective Rainfall (F72)191
(168.50~291.50)Medium Effective Rainfall (F73)60
(357.50~517.60)Heavy Effective Rainfall (F74)17
F8 q 2 m o n t h (18.40~135.20)Light Effective Rainfall (F81)197
(143.60~362.90)Moderate Effective Rainfall (F82)212
(367.20~726.30)Heavy Effective Rainfall (F83)57
F9 q max d a y (1.30~25.60)Light Daily Rainfall (F91)234
(26.50~51.30)Moderate Daily Rainfall (F92)151
Table 4. Clustering results of the monthly velocity (ZG93, ZG118, XD01).
Table 4. Clustering results of the monthly velocity (ZG93, ZG118, XD01).
Monthly Velocity (v) (mm/month)Clustering ResultsCount
(−9.61~21.66)Low (V1)358
(22.35~81.89)Medium (V2)81
(137.70~313.24)High (V3)27
Table 5. Statistical results of the data mining and association rules.
Table 5. Statistical results of the data mining and association rules.
ContributionF1F2F3F4F5F6F7F8F9
V1Association rules286019362071168326732780261027702630
Total support4480.981867.493231.691557.063723.313776.063579.763800.013744.18
Average support1.570.961.560.931.391.361.371.371.42
Contribution without support0.670.450.490.400.630.650.610.650.62
Comprehensive contribution0.410.230.330.210.360.370.350.370.36
V2Association rules632463308392630654694628725
Total support453.26344.78195.09289.57438.03467.48506.75447.84478.52
Average support0.720.740.630.740.690.710.730.710.66
Contribution without support0.630.460.310.390.630.650.690.620.72
Comprehensive contribution0.360.300.210.270.350.370.390.360.38
V3Association rules130481090111133126105124
Total support83.4329.4569.32070.5481.5980.9867.2676.07
Average support0.640.610.6400.640.610.640.640.61
Contribution without support0.710.260.6000.610.730.690.580.68
Comprehensive contribution0.420.230.3700.380.420.410.370.40
Table 6. Composition of training and prediction samples.
Table 6. Composition of training and prediction samples.
SamplesTraining SamplesPrediction Samples
Monthly velocityV1V2V3V1V2V3
ZG931163051020
ZG1181192481020
XD019322131011
Total samples32876263051
Table 7. Parameters of the trend term of displacement based on polynomial fitting.
Table 7. Parameters of the trend term of displacement based on polynomial fitting.
PhaseabcdR2MSERMSE
Phase 10.012−0.61021.2082.6980.990518.27122.766
Phase 20.003−1.072124.836−2752.8300.993780.99527.946
Phase 3−0.03615.684−2272.588111,207.8930.99920.8764.569
All training samples////0.994563.72923.743
Prediction samples////0.99116.5104.063
Table 8. Performance of various displacement prediction models of the Baishuihe landslide.
Table 8. Performance of various displacement prediction models of the Baishuihe landslide.
ModelAlgorithm’s CombinationPrediction Term
R2MSERMSE
Model 1VMD + FOA-BPNN0.977100.82810.041
Model 2VMD + BPNN0.923340.48118.452
Model 3VMD + SVM0.944282.56616.81
Model 4VMD + ELM0.877940.46230.667
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Miao, F.; Xie, X.; Wu, Y.; Zhao, F. Data Mining and Deep Learning for Predicting the Displacement of “Step-like” Landslides. Sensors 2022, 22, 481. https://doi.org/10.3390/s22020481

AMA Style

Miao F, Xie X, Wu Y, Zhao F. Data Mining and Deep Learning for Predicting the Displacement of “Step-like” Landslides. Sensors. 2022; 22(2):481. https://doi.org/10.3390/s22020481

Chicago/Turabian Style

Miao, Fasheng, Xiaoxu Xie, Yiping Wu, and Fancheng Zhao. 2022. "Data Mining and Deep Learning for Predicting the Displacement of “Step-like” Landslides" Sensors 22, no. 2: 481. https://doi.org/10.3390/s22020481

APA Style

Miao, F., Xie, X., Wu, Y., & Zhao, F. (2022). Data Mining and Deep Learning for Predicting the Displacement of “Step-like” Landslides. Sensors, 22(2), 481. https://doi.org/10.3390/s22020481

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop