1. Introduction
When, during earthquake motion, pore water pressure rises because of applied dynamic loads, the loose saturated sand layer that is relatively close to the ground surface is liquefied. Liquefaction can be discovered through manifestations such as (1) a sharp decrease in the frequency content of a sand layer, (2) settlement, (3) flow slides, (4) sand boiling, (5) foundation failure, and (6) lateral displacement (DH).
The movements of sand blocks, which have destroyed and affected constructions and infrastructure, ranging from a few centimeters to some meters [
1], have been reported. Lateral displacement can be significantly damaging for piles, piers, and pipe lines during and for a short time after earthquakes and causes more damage to structures and infrastructures than any other type of liquefaction-induced ground failure. In this phenomenon, the large blocks of soil move towards the free face or along the slope. Researchers have developed several different models and approaches to predict the
DH caused by liquefaction for some decades. Some of them have proposed numerical approaches [
2,
3,
4,
5,
6] such as the finite element method (FEM) and the finite difference method (FDM). Next to that, analytical approaches have been developed, for example, minimum potential energy [
7] and the sliding block model [
8,
9,
10,
11,
12].
Among them, due to the complicated input model parameters and difficulties in their calculations, as well as because of the complex mechanism of liquefaction, empirical and semi-empirical models are the most common models that have been performed and developed by engineers and researchers [
13,
14,
15,
16,
17]. However, in most cases, because of the scarcity and shortage in their database, some aspects of this phenomenon, such as geology, fault type, and the effect of near-fault sites, have been ignored, with the exception of Zhang et al. [
18], who used Japanese spectral attenuation models, or Bardet et al. [
19], who considered peak ground velocity (
PGV) to overcome this shortage and improved the model proposed by Youd et al. [
20]. Nevertheless, a shortage of studies in geology and motion frequency effects still exists.
In 2006, Kramer [
21] reported the result of substantial research on around 300 ground motion parameters and declared that the most efficient and sufficient intensity measure on liquefaction is one standardized form of cumulative absolute velocity (
CAV), which eliminated amplitudes less than 5 cm/sec
2 and is defined as
CAV5. Sufficiency defines which parameter is independent to estimate the target (increasing pore water pressure herein), and efficiency expresses which parameter is able to predict the target with lower uncertainties [
22]. This parameter quantifies aspects of applied frequency load, which can be affected by the near-fault region aspect and causative fault type of earthquakes. Hui et al. [
23] proposed an index of
PGV to peak ground acceleration (
PGA) to characterize the effect of liquefaction on the piles in near-fault zones. Further, Kwang et al. [
24], through performing some uniform cyclic simple shear laboratory tests, demonstrated that
CAV5 provides the highest correlation with
DH among ground motion parameters. While the significant correlation between
CAV5 and the evaluation of liquefaction have been characterized, no attempt has been made to take it into the account when developing empirical and semi-empirical models.
Furthermore, artificial intelligence has been applied to develop models and correlations to predict
DH using databases that were collected from sites [
25,
26,
27,
28]. Training is organized to minimize the mean square error (MSE) function. Wang et al. [
25] used a back-propagation neural network to develop a model for the prediction of lateral ground displacements caused by liquefaction. They applied the same records used by Bartlet et al. [
1], along with 19 datasets of Ambraseys et al. [
29]. Among all datasets, 367 data points were used for the training phase, and the extra 99 datasets were used for the testing phase, while no validating phase was conducted. The model was developed using the same parameters suggested by Youd et al. [
14].
Baziar et al. [
26] created two subsets for training, to train a network to predict
DH, and a validating phase, to prevent overtraining of the artificial neural network (ANN) model. Then, they presented an ANN model using STATISTICA software (version of Statistica 5.1, Dell Software, Round Rock, TX, USA) to estimate
DH. They inspected the performance of their model using validating subset data without considering extra available models. Furthermore, a new model was presented by Javadi et al. on the basis of genetic programming (GP) [
27]. They divided the dataset randomly, without paying attention to the statistical properties of the input parameters, into two subsets for the validating and training phase. Garcia et al. established a neuro-fuzzy model to use the advantages of both systems. They randomly separated their dataset into two subsets for training and testing; however, they did not take statistical aspects into account. They also compared the value predicted by their model with extra models to evaluate its performance. Baziar et al. [
28] then applied ANN and GP to propose a new model. They divided their dataset randomly into two subsets for the testing and training phase; a validating process was not performed, and the statistical factors of the parameters were not considered.
Although the effects of fine content (
Fc) in different values on excess pore water pressure have been investigated [
30,
31,
32,
33,
34], to the best knowledge of the authors, no attempts have been made to consider the range of
Fc to establish the models to predict
DH. Most of the studies reveal a range of 20% to 30% for the transition of behavior of the response of sand to earthquake and liquefaction occurrence. Maurer et al. [
35] investigated the Canterbury earthquakes in 2010 and 2011 through 7000 case history datasets and illustrated that a high value of
Fc caused more inaccuracy in liquefaction assessments. Tao performed some laboratory tests and demonstrated that the potential of liquefaction has a significant dependency on initial relative density (
Dr) when the
Fc value is larger than 28% [
32].
This study is based on the database of Youd et al. [
20] and the addition of a new earthquake parameter of
CAV5, which is
CAV with a 5-cm/sec
2 threshold acceleration, through the attenuation equation presented by Kramer et al. [
21]. By adding
CAV5, the dataset was expanded and became more capable of and efficient in considering aspects of earthquakes and geology site situations, such as earthquake motion frequency, near-fault effects and the causative fault type of an earthquake. The second dataset was created by eliminating samples with an average
Fc in a liquefiable soil layer (
F15) less than 28%. The response surface method (RSM) is used for the first time as a novel method to develop two equations to predict lateral displacement due to liquefaction (
DH) in order to two created datasets herein. Furthermore, the meaningful and effective terms of the equations are discovered through hypothesis testing of the
p-value. In this study, two ANNs with back-propagation analysis were developed to measure the coding input data of the RSM. To develop each ANN model, the main dataset is first divided into three subsets for the training, testing, and validating stages, considering statistical properties—instead of random division—to increase the capability and accuracy of the model. To achieve this goal, an attempt is made to create all three subsets with close statistical factors. Finally, the results are compared with data measured from the Chi-Chi earthquake’s near fault zone of Wufeng district (
Figure 1) and Nantou district (
Figure 2), as well as with the predicted
DH through three extra models [
20,
27,
36] to demonstrate the accuracy and capability of the RSM models.
2. Review of Empirical and Semi-Empirical Models
Bartlett and Youd [
1], based on factors in References [
13,
14,
29], developed a new model to predict
DH due to liquefaction; they supposed that earthquake, topographical, geological, and soil factors are the most influential parameters on
DH. They studied 467 displacement vectors from the case history database. Among those vectors, 337 were from the 1964 Niigata and 1983 Nihonkai-Chubu, Japan, earthquakes; 111 were from earthquakes in the United States; and the other 19 cases were selected from Ambraseys’ [
29] database. In the end, they developed a new model by using multiple linear regression (MLR) for free-face and ground slope conditions [
37], but they did not separate earthquakes according to their region because of a database shortage. Youd et al. revised their MLR by adding case history data from three earthquakes (1983 Borah Peak, Idaho; 1989 Loma Prieta; and 1995 Hyogoken-Nanbu (Kobe)), and they considered coarser-grained materials. They removed eight displacement sites with prevented free lateral movement and developed two equations with more accuracy given as follows:
Sloping ground conditions:
where
and
In Equations (1) and (2), DH is the predicted lateral ground displacement (m), Mw is the moment magnitude of the earthquake, and T15 is the cumulative thickness of saturated granular layers (m) with corrected blow counts ((N1)60) less than 15. Moreover, F15 is the average fines content of sediment within T15 (%); D5015 is the average mean grain size for granular materials within T15 (mm); S is the ground slope (%); and W is the free-face ratio (H/L), where H is the height of the free face and L is the distance from the base of the free face to the liquefied point. Finally, r is the nearest horizontal or map distance from the site to the seismic energy source (Km).
Rezania et al. [
37] developed a model, based on evolutionary polynomial regression, for the assessment of liquefaction potential and lateral spreading. According to response spectral acceleration, measured through strong-motion attenuation models, Zhang et al. [
38] revised the empirical model of Youd et al. [
20] and demonstrated the ability of their model by comparing the predicted results with datasets from sites in Turkey and New Zealand [
18]. Goh et al. proposed multivariate adaptive regression splines (MARS) by using data of Youd et al. [
20]. They demonstrated an improvement of the original model [
39].
6. Comparison of RSM Equations with Extra Models
Chu et al. [
47] analyzed five liquefied sites during the Chi-Chi-1999 earthquake in Taiwan, all in the near-fault region from five sites in two districts of Wufeng and Natu as they are illustrated in
Figure 1 and
Figure 2.
Table 7 presents these parameters’ values from the sites. Based on these data samples, the predicted results of the RSM equations are compared to Youd et al. [
20], Javadi et al. [
27], and Rezania et al. [
36]. In total, 28 sites (for which the necessary parameters for the ANN model and the RSM equation are reported by Chu et al. [
45]) are illustrated in
Table 8. The sample numbers from 1 to 26 are from Wufeng’s sites and samples of 27 and 28 are belong to Nantu’s site. The capability and accuracy of the first RSM equation was demonstrated using all 28 site samples from the Chi-Chi earthquake, including samples with
F15 from 13% to 48.5%.
The results of comparison between the predicted values and measured values are summarized in terms of the root mean square error (
RMSE), mean absolute error (
MAE), and
R in
Table 8. It is clear that the larger
R and smaller
RMSE and
MAE reveal higher accuracy of predicted results.
where
N is the number of samples,
Xm is the measured value, and
XP is the predicted value.
Furthermore, all samples with an
F15 greater than 28% were eliminated from the Chi-Chi earthquake cases, and 16 samples consequently remained (samples number 1 to 14 as well as numbers 27 and 28, as can be seen in
Table 7). Then, the second RSM equation was validated by applying it to these samples in comparison with the extra three models.
Table 9 summarizes the results of all models.
Figure 6 and
Figure 7 visualize the comparison between both RSM equations developed in the present study and three extra models with measured data from sites of the Chi-Chi earthquake. Twenty-eight data points are evaluated in
Figure 6 for whole range of the parameters. Meanwhile,
Figure 7 illustrates the comparison for data points with
F15 values of less than 28% at 16 data points.
7. Results and Discussion
The previous sections have compared the first RSM model, which belongs to the full range of parameters, and the second RSM model, which was derived for samples whose
F15 values were less than 28%, with three extra well-known models. The models were examined using new data from the Chi-Chi earthquake, which were not included in the two datasets to establish the two RSM models. As can be seen in
Table 8, the RSM equation of the whole range of parameters indicated a higher
R value of 0.683, in comparison with the extra models whose values were 0.433, −0.74, and 0.514. Furthermore, the RSM model comprises lower
MAE and
RSME values of 0.3 and 0.37, respectively, compared to 0.49 and 0.7 for Rezania et al., 1.04 and 1.19 for Javadi et al., and 3.77 and 4.37 for Youd et al. Therefore, among all of them, the RSM model provided prediction with higher accuracy.
On the other hand, as can be seen in
Table 9, by considering samples with
F15 less than 28%, the model of Youd et al. provided the highest
R value of 0.934, closely followed by the second and the first RSM models with
R values equal to 0.891 and 0.846 respectively.
Table 9 also illustrates the MAE and RSME criteria values for all models for samples with a limited value for
F15 less than 28%. The values of the
MAE and
RSME in the second RSM model—0.29 and 0.39, respectively—indicates the highest accuracy and performance in comparison with the others. In addition, the model of Rezania et al., with 0.42 and 0.57, illustrated lower values for the
MAE and
RSME, respectively. Further, Javadi et al. with 1.2 and 1.3, and Youd et al. with 4.84 and 5.34 provide less accuracy for predicting
DH.
The comparison between the two models developed in this study and the extra three models demonstrates that the second RSM model provided a reasonable correlation and the lowest error. The results indicated that the RSM is a highly efficient tool to perform a liquefaction hazard analysis. Furthermore, performance of the model is increased by taking into account the complex influence of Fc by eliminating an F15 larger than 28% and even by decreasing the number of samples in the dataset.
Another major advantage of the presented models is their consideration of earthquake aspects, such as the near-fault zone, the frequency of earthquake motion, and the causative fault type, by estimating and adding the
CAV5 parameter to the dataset. As can be seen from
Figure 6 and
Figure 7, among all models that were considered in the present study to calculate
DH without any limitation on the parameters’ value, the model of Youd et al. was overpredicted. Meanwhile, Youd et al.’s model provided poor and overpredicted results for samples with a limited value of
F15 less than 28%. Additionally, considering samples with a limited
F15 value shows the first RSM and the model of Javadi et al. present an overpredicted value for
DH. Furthermore, second RSM equation and model of Rezania et al. underpredicted
DH in their predictions.
There are some limitations for applying both first and second RSM equations as follow:
- (1)
Both RSM models require standard penetration test SPT and laboratory tests to determine geotechnical properties parameters of T15, F15, and D5015.
- (2)
Both of the RSM models are valid for free-face conditions but not ground-slope conditions.
- (3)
Second RSM model is valid only for F15 < 28%.
- (4)
Models are only valid for earthquakes with Mw between 6.4 and 8.0.
- (5)
Specify accuracy limits for each model.
- (6)
It is necessary to transfer all six input models’ parameters measured value to a coded value using Equation (15) and then put the coded value in the RSM equations to predict DH.
8. Summary and Conclusions
The determination of lateral displacement due to liquefaction caused by an earthquake (
DH) is the most important aspect of liquefaction hazard analysis. There are two main types of conditions according to the topography of the sites: free-face and sloping ground conditions. First, the parameter of corrected absolute velocity (
CAV5) of sites was calculated due to it being the most efficient and sufficient parameter for the assessment of liquefaction caused by earthquakes [
21], and it was added to develop the dataset to cover all aspects of earthquakes, including the frequency content of earthquake motions and the causative fault type of earthquakes. Then, a statistical parametric analysis was performed by estimating the correlation coefficient (R) between all input parameters and output as
DH. To achieve a more capable and accurate model, based on the estimated values for
R, the horizontal distance from a site to the seismic energy source (
r) and ground slope (
S) was eliminated from the original dataset due to poor correlations to the target. Therefore, the final dataset was created for free-face condition sites.
The significant aspects of earthquakes, such as the near-fault region, frequency content, and causative fault type of earthquakes, which are included in the model established by Kramer et al. [
21], were considered by taking
CAV5 into account. To investigate the complex effect of fine content, the main dataset was divided into two subsets. The first dataset included the whole range of parameters, and in the second one, all samples with average fine content in the liquefiable layer (
F15) larger than 28% were removed from the dataset, in line with Tao [
32]. Furthermore, the RSM was applied to develop two equations in order to the first and the second dataset to examine its performance to assess liquefaction. In the end, the two presented models in this study were compared to three available models to demonstrate their capability and accuracy with regard to predicting
DH in free-face conditions in a near fault zone case history of the Chi-Chi earthquake.
The present study highlights the importance of earthquake aspects, especially CAV5 as the most sufficient and efficient intensity to liquefaction hazard assessments. In addition, the RSM is a strong tool for the evaluation of complex non-linear phenomena such as liquefaction.
The results also confirm the complicated influence of
F15 on the whole range, and they provide significant enhancements to the performance of the model by considering samples with an
F15 less than 28% as a critical value defined by Tao [
32]. One of the most remarkable results, which shows the complex influence of fine content on evaluation of
DH, is that the second model demonstrated higher accuracy and capability, even though it was developed using a database with fewer samples than the first model.