4.2. Analysis of the Prediction Models
In this chapter, the authors employ the feature importance and SHAP summary plot results to illustrate the predictive models derived in each case.
Figure 14 displays the importance of the features in each case, quantifying the impact of each feature on the predictive model. Specifically,
Figure 14a showcases the impacts of independent features on the predictive models derived by the RF, XGB, and LightGBM algorithms in case 1. For the best-performing LightGBM, the settlement exhibits the highest impact at 43.17%, followed by P
Z and P
L at 9.94%, T
F at 6.44%, O
H at 6.00%, O
V at 5.92%, P
P at 5.40%, P
CTC at 5.24%, EG at 3.44%, P
D at 2.55%, and P
N at 1.95%; B
R does not affect the prediction model at all. The feature importance in case 2 is illustrated in
Figure 14b, where P
L has the highest impact at 22.02%, followed by P
CTC at 15.26%, P
P at 11.42%, E
G at 10.09%, P
D at 10.04%, P
Z at 8.27%, T
F at 6.62%, P
N at 6.55%, and O
V and O
H at 4.40% and 3.99%, respectively; B
R has a negligible impact at 1.35%. Case 3 is summarized in
Figure 14c, where the settlement has the highest impact at 42.01%, followed by P
Z at 10.92%, P
L/P
D at 8.87%, T
F at 6.99%, P
P at 5.74%, O
V at 5.60%, P
CTC at 5.92%, O
H at 5.55%, A
PILE/A
RAFT at 5.09%, and EG at 3.30%.
Figure 14d shows the feature importance in case 4, where P
CTC has the highest impact at 19.02%, followed by A
PILE/A
RAFT at 16.03%, P
P at 15.53%, P
Z at 14.54%, T
F at 9.29%, E
G at 8.07%, P
L/P
D at 7.96%, and O
H and O
V at 4.81% and 4.75%, respectively.
The analysis of the feature importance reveals that settlement is of significant importance in cases 1 and 3, where it is included as an independent variable. Meanwhile, P
CTC has the highest feature importance in cases 2 and 4, where settlement is not considered. However, while the feature importance results can quantify the impacts of features on the predictive value of a target feature, they do not provide insight into whether the prediction value increases or decreases as the number of features increases. Therefore, the impact of each feature on the prediction model, as measured by the SHAP value, is analyzed in the summary plot presented in
Figure 15.
In the summary plot, a large absolute value of the SHAP value indicates a large contribution to the predicted value. A positive SHAP value indicates that as the feature value increases, the prediction value also increases, and as the feature value decreases, the prediction value decreases.
Figure 15a illustrates the contributions of the features in the model derived by the best-performing LightGBM algorithm in case 1 as a function of the SHAP value, with the predicted values shown in order of magnitude. The feature with the largest contribution is P
L, with the SHAP value increasing as the feature value increases. This trend indicates that the pile axial stress increases with the length of the pile, likely due to the larger allowable load of long piles compared with short piles. Next is P
Z, which demonstrates that the pile axial stress increases as the feature value decreases, meaning that the pile axial stress is greater at the top of the pile. E
G indicates that a larger relative density of the ground results in a greater allowable load and, therefore, greater axial stress on the pile. P
D shows that the pile axial stress decreases as the feature value increases, indicating a small change in the pile axial stress due to the tunneling. Settlement is found to decrease as the feature value increases, as the settlement caused by the tunnel excavation reduces the pile axial stress. P
P, which represents the position of the pile, shows that the smaller the feature value, the larger the SHAP value, meaning that the greater distance from the tunnel side results in greater pile axial stress. P
CTC shows that the smaller the feature value, the smaller the SHAP value, suggesting that the closer the piles are in terms of spacing, the smaller the pile axial stress, due to the cluster pile effect. The feature P
N shows that the larger the feature value, the smaller the SHAP value, indicating that a larger number of piles results in smaller pile axial stress, as more piles share the allowable load. However, T
F, B
R, O
H, and O
V do not produce meaningful SHAP values in terms of their impacts on the predictive model.
Figure 15b shows the SHAP values of the prediction model derived by LightGBM in case 2, and the analysis reveals a similar trend to that observed in case 1. However, the B
R in case 2, which was not analyzed in case 1, exhibits a more meaningful SHAP value than in case 1, with the SHAP value decreasing as the feature value increases. This trend is due to the fact that the larger the foundation width, the more of the allowable load is supported by the raft, resulting in less pile axial stress.
The SHAP values in case 3, summarized in
Figure 15c, reveal that the smaller the feature value of P
L/P
D, the smaller the SHAP value, which is consistent with cases 1 and 2. Additionally, the larger the A
PILE/A
RAFT, the smaller the SHAP value, with a larger A
PILE indicating a smaller A
RAFT. A
PILE is a function of P
D and P
N, while A
RAFT is a function of B
R, and larger P
D and P
N values result in smaller pile axial stress.