1. Introduction
Concrete gravity dams are a very common type of dam built and used in China, and the total construction volume is among the highest in the world. Ensuring that concrete gravity dams can meet reliability requirements throughout the operation period is of great significance to national security construction and ecological environmental protection. The single safety factor method is mostly used in the design of concrete gravity dams with a long operation stage. The variables affecting the dam structure design, such as material properties and external loads, are random variables, not certain values. The safety factor method only uses a certain safety factor to evaluate the safety of the dam, without considering the objective uncertainty in each design variable. The reliability analysis for the design and safety check of concrete gravity dams can effectively overcome the shortcomings of the safety factor method [
1,
2]. The results obtained by reliability analysis are more reliable.
The traditional reliability analysis methods mainly include the first-order reliability method [
3], MC method [
4], response surface method [
5], and direct integration method [
6]. However, all of these methods have certain shortcomings. For example, the accuracy of the first-order reliability method is not high; the MC method requires a large number of finite element calculations to obtain the results with high accuracy, and the computational cost is relatively large; the response surface method is generally polynomial in form, which, to a certain extent, limits the accuracy of its fitted limit state function, and the calculation will be more complicated when dealing with a high degree of nonlinearity and a large number of variables; and the direct integration method often generates computational difficulties.
To overcome the limitations of the application of the above-mentioned traditional reliability analysis methods, many scholars have conducted useful research. Zhao et al. [
7] proposed the third-order moment method based on the study of the primary second-order moment method, which improved the accuracy of the reliability calculation results. Lu et al. [
8] proposed a second fourth-order moment calculation method for reliability calculations based on the first fourth-order moment of the second-order approximation of the functional function and proved its simplicity and accuracy. Jiang et al. [
9] proposed a reliability analysis method for gravity dams based on the Hermite orthogonal polynomial approximation method and illustrated the correctness and effectiveness of the proposed method by arithmetic examples. Ren et al. [
10] proposed a sensitivity-assisted MC simulation method in which second-order derivatives were introduced to improve the accuracy of reliability calculations. Heydt et al. [
11] proposed a bootstrap-compensated MC method to reduce the computation time of reliability by introducing the concept of augmented samples. The importance sampling method [
12] is an improved MC method, which is mainly used to improve the efficiency of simulation calculations. Guan et al. [
13] discussed that a change in the location of the implicit functional function fitting point in the traditional response surface method can affect the results of structural reliability calculations, indicating that the accuracy of response surface method calculations at the checkpoint is closely related to the location of the checkpoint. Zhu et al. [
14] proposed a weighted dynamic response surface reliability analysis method based on the cat swarm algorithm, which overcomes the drawback of the MC method that requires a large number of repeated samples. Zhao [
15] proposed a reliability solution method for the response surface of a high-dimensional model combined with the first-order reliability method to solve the structural reliability. Balu et al. [
16] proposed a new method for solving implicit functional function reliability problems, which does not require the derivation of response surface functions of random variables in the process of calculating reliability. Sundar et al. [
17] used locally propagated samples to track the response surface function, and an important feature of this algorithm is that the alternative model advances with the choice of samples, enabling the algorithm to converge quickly to the response surface. Yuan et al. [
18] proposed an improved method of conditional boundary integration method for solving reliability problems with high accuracy requirements by using the direct integration method, which adopts a simple modification of binary integration based on the probability addition rule, and the computational accuracy and efficiency of the method were verified by the error analysis of arithmetic cases. Although scholars have improved and proposed many new methods to address the shortcomings of the traditional reliability solution methods, the following shortcomings still exist: the applicability and solution accuracy of some reliability calculation methods are limited due to the complexity of actual structural engineering problems and the existence of highly nonlinear failure surfaces.
The limit state functions of concrete gravity dams are often highly nonlinear when considering the randomness of external loads, material parameters, and other factors. Therefore, it is difficult to express them accurately with explicit functions. When solving such problems using the above reliability solution methods, it is usually difficult to ensure the accuracy and efficiency of the calculation results.
The development of machine learning and intelligent algorithms provides effective tools for solving practical engineering problems, especially in solving engineering reliability problems [
19,
20]. Least squares support vector machine (LSSVM) is a new generation of machine learning method. LSSVM has advantages of rigorous mathematical foundation, strong small-sample learning ability, low over-fitting risk, fast solving speed, etc. However, in practical application, it is difficult to determine hyper-parameters of LSSVM. For example, when the radial basis function is adopted as kernel function, it is hard to determine its penalty parameter
γ and kernel width parameter
σ. At present, optimizing the search algorithm is the main approach to solving the above problem. The particle swarm optimization (PSO) algorithm [
21] is widely used to tune the hyper-parameters of the LSSVM because of its simple form and easy understanding [
22,
23,
24,
25]. However, the PSO algorithm has the drawback of easily falling into the local optimum, so this paper improves the PSO algorithm for the hyper-parameters tuning.
To improve the calculation accuracy and efficiency, this paper proposed a reliability analysis method based on least squares support vector machines with an improved particle swarm optimization algorithm (IPSO-LSSVM). The IPSO-LSSVM is adopted to establish the response surface to approximate the limit state function based on the samples generated by computer experiments. Subsequently, the proposed response surface is utilized in conjunction with the Monte Carlo (MC) method to obtain the desired reliability estimation.
3. Case Study and Results
3.1. Basic Information of a Typical Concrete Gravity Dam
A typical gravity dam located in Yunnan Province of China was selected. The gravity dam is a roller-compacted concrete gravity dam. The dam is 106.00 m high, 17.50 m wide at the top, and 90.10 m wide at the base. The normal storage level is 1472.00 m, with a corresponding downstream water level of 1408.20 m. The designed flood level is 1478.35 m, with a corresponding downstream water level of 1423.46 m. The check flood level is 1480.26 m, with a corresponding downstream water level of 1425.23 m and an upstream dead water level of 1429.00 m. The foundation of this project is mainly hard slate with an elevation of 1375.00 m. A non-overflow dam section was selected for analysis. The profile of the section is shown in
Figure 2. The statistical characteristics and value ranges of the influencing factors considered are shown in
Table 1.
3.2. Finite Element Model of a Typical Concrete Gravity Dam
According to the basic information, a two-dimensional plane strain finite element model of the typical concrete gravity dam was established. The foundation rock is homogeneous characterized by the Mohr–Coulomb model [
30]. The linear elastic model was adopted in dam concrete which is in good condition. A thin layer with a height of 0.2 m was used to simulate the contact between the dam and the foundation [
30], and its material properties were consistent with foundation rock. Foundation boundary conditions were set to apply fixed constraints at the bottom of the foundation and horizontal constraints at the upstream and downstream of the foundation. The foundation and the dam were considered to be impervious. Dam weight and hydrostatic pressure acting on the upstream and downstream surfaces of the dam body and the foundation were taken into account. The main material properties of dam concrete, foundation rock, and contact are shown in
Table 2.
When the finite element method is used to calculate the dam stress, the calculation domain needs to include the dam foundation. To reflect the influence of foundation stiffness on dam stress, the foundation was selected to a certain range, but in fact, the foundation was a semi-infinite space. To simplify the infinite space with limited space, the general way is to simulate by applying boundary constraints. According to the Saint-Venant principle, if the range of foundation considered is larger, the influence of boundary constraints on dam stress is smaller. When it reaches a certain range, the dam stress is almost no longer affected by the expansion of the foundation range. However, scholars do not have the same standard for the selection of foundation range.
In this section, the foundation range of the finite element model for the stress of the dam toe and heel and anti-sliding stability analysis was selected by calculation. The calculation diagram of foundation range selection is shown in
Figure 3, in which a = 0, 1, … 9 and 1 h is equal to 1 times the height of the dam.
With
a increases from 0 to 9, in turn, the influence of the foundation range change on the calculation results of
Kc,
Pmax, and
Pmin was analyzed. The results are shown in
Table 3 and
Figure 4.
The results show that the values of Kc and Pmin first increase and then decrease with the increase in the foundation range. The influence of the foundation range on Kc and Pmin is small. With the foundation range increasing from 1 h to 10 h, the values of Kc and Pmin change to 0.00518 and 480 Pa, respectively. The values tend to become stable after the foundation range is greater than 3 h. The value of Pmax first decreases and then increases with the increase in the foundation range. The influence of the foundation range on the Pmax is also small. After the foundation range is greater than 3 h, the value tends to be stable. Therefore, in the finite element model, the foundation range is as follows: the length of the upstream foundation, length of the downstream foundation, and depth of the foundation were taken as 3 h.
3.3. Process of Establishing the IPSO-LSSVM Model
3.3.1. Selection of the Hyper-Parameters of LSSVM Model
σ and γ have a significant impact on the accuracy of the LSSVM model. In the following steps, the σ and γ of the model are tuned by the IPSO algorithm.
- (1)
Generation of sample sets
Latin hypercube sampling was used to generate 100 sets of samples. The Kc, Pmin, and Pmax of each set of samples were obtained by the finite element model of the gravity dam. A total of 80% of these samples were used to train the IPSO-LSSVM model, and the other 20% of these samples were used to test the accuracy of the model. To facilitate the calculation, a finite element calculation batch program of concrete gravity dams was written using Python to automatically calculate and extract the results of each sample by calling Abaqus. The steps were as follows:
Use Python language to generate inp files for each sample in bulk.
Use Python language to generate bat files to submit calculations in bulk.
Use Python language to extract numerical simulation results from all odb files.
- (2)
Data pre-processing
To make different datasets comparable, it was necessary to normalize all data.
where
xmax and
xmin are the maximum and minimum value of variable
x, respectively, and
xn is normalized data.
- (3)
Setting of the IPSO algorithm parameters
The IPSO algorithm was used for hyper-parameters tuning of the LSSVM. We set the IPSO algorithm parameters as: m = 20, kmax = 40, , Cmax = 2.5, Cmin = 0.5, , and , and the function fitness was the mean relative error (MRE).
- (4)
Optimization results
From
Figure 5,
Figure 6 and
Figure 7, it can be seen that, in the process of the IPSO algorithm to tune the hyper-parameters of the three models, the global optimal fitness curve appears to suddenly bulge. The PSO algorithm appears to be the aggregate phenomenon in the process of optimization, which will lead to falling into the local optimum. The IPSO algorithm introduces the fitness variance of the swarm to control the degree of particle aggregation. When the swarm is too aggregated, the particles will be re-dispersed for optimization. Therefore, this phenomenon occurs. This proves that the IPSO algorithm can overcome the shortcomings of the PSO algorithm as the initial convergence can easily fall into the local optimum.
Table 4 shows the optimal
γ and
σ of the three models, and inputs the optimal hyper-parameters into the LSSVM models to establish the IPSO-LSSVM models.
3.3.2. Comparison of the Accuracy of the Models
The coefficient of determination (
R2), the mean absolute error (
MAE), and the root-mean-square error (
RMSE) were adopted as the evaluation indexes of model accuracy. The accuracy of the IPSO-LSSVM models of
Kc,
Pmin, and
Pmax were compared with that of RSM and PSO-LSSVM models. The comparison results are shown in
Table 5. The comparison between the predicted value and true value of
Kc,
Pmin, and
Pmax is shown in
Figure 8,
Figure 9 and
Figure 10.
It can be seen from
Table 5 that the accuracy of the RSM models is the lowest, indicating a highly nonlinear relationship between the influencing factors and the safety indexes of the concrete gravity dam. The accuracy of the remaining two models is higher, among which the IPSO-LSSVM models have the highest accuracy. The accuracy of the IPSO-LSSVM models is significantly improved compared with that of the PSO-LSSVM models, indicating that the IPSO algorithm is better at tuning the hyper-parameters of the LSSVM model than the PSO algorithm. Therefore, the IPSO-LSSVM model has significant advantages for dealing with highly nonlinear problems, and the IPSO algorithm is outstanding for the optimal solution of complex problems.
Figure 8,
Figure 9 and
Figure 10 show that the linear relationship between the predicted and true value is
y =
x, indicating that the IPSO-LSSVM models is reliable, which can be used for the reliability calculation of concrete gravity dams.
3.4. Reliability Calculation
Based on the IPSO-LSSVM model, the limit state functions (Equations (1), (3), and (4)) of the three failure modes of the concrete gravity dam were constructed. The MC method was used to calculate the
Pf of the typical concrete gravity dam. The MC method requires a sufficiently large sample to achieve high accuracy, and the quality of the calculated results is generally evaluated using the coefficient of variation of
Pf. When the coefficient of variation of
Pf is less than 0.1, the results are considered to meet the accuracy requirements [
31]. Under the most unfavorable conditions, the
Kc and
Pmin of the concrete gravity dam meet the safety requirements, so this section mainly analyzes the probability of stress exceedance at the toe of the dam. The probability of stress exceedance at the toe of the dam was calculated 10 times for different sample sizes to obtain the mean value and the coefficient of variation of
Pf. The results are shown in
Table 6, and the curve of
Pf is shown in
Figure 11.
From the results, it can be seen that, when the number of samples exceeds 106, the coefficient of variation of Pf is already less than 0.1, indicating the results meet the accuracy requirements. The Pf calculated by using the IPSO-LSSVM model is stable at 8.87 × 10−5. Since there is only one failure mode, the Pf of the concrete gravity dam system is 8.87 × 10−5, which is much smaller than the value specified in the specification, so the concrete gravity dam is safe and reliable.
4. Discussion
In many cases, due to the complexity of reliability analysis problems, it is hard to establish an effective response surface to assess the Pf of concrete gravity dams. A reliability analysis method based on the IPSO-LSSVM was proposed to calculate the reliability of concrete gravity dams when explicit nonlinear limit-state functions are difficult to obtain accurately. In the method, the hyper-parameters of the LSSVM model were tuned by the IPSO algorithm to reflect the relationship between the influencing factors and safety indexes. After that, the IPSO-LSSVM model was utilized in conjunction with the MC method to evaluate the failure probability.
Accurate expression of limit state function is the key to obtain reliable calculation results. This paper focused on the improvement method for the accuracy of the model used for reliability analysis of concrete dams. The RSM models were used to test the degree of nonlinearity between the influencing factors and safety indexes and the results show a highly nonlinear relationship between the influencing factors and the safety indexes. The accuracy of the IPSO-LSSVM model was compared with that of the PSO-LSSVM model, and the result shows that the IPSO-LSSVM has higher accuracy, indicating that the IPSO-LSSVM model significantly improves the accuracy of the PSO-LSSVM model.
In the case study, the number of samples required for Pf calculation is 106. If the traditional Monte Carlo-finite element method is used to calculate Pf, then it consumes more than 2000 h. However, the method proposed in this paper can effectively shorten computing time; it only takes about 20 min, so its computing efficiency is greatly improved. Therefore, the method proposed in this paper can effectively improve computing efficiency on the precondition of accuracy maximization.
The main contribution of this paper is its improvement of the established model [
32] and short computing time compared with the traditional reliability analysis method. The proposed reliability analysis method is shown to be an efficient scheme with respect to both computational effort and accuracy.