1. Introduction
The structural settlement changes of urban tunnels may seriously affect the structure, endangering the safety, stability, and lifespan of the tunnel [
1,
2,
3]. Accurately predicting settlement is vital for monitoring changes and implementing preventive maintenance, making it a critical concern in academic and industrial domains [
4,
5,
6]. A preventive maintenance strategy can be made based on estimating future tunnel structural settlement. It identifies areas needing attention and interventions to prevent more severe structural damage. This approach significantly improves the long-term operational performance of the tunnel and ensures its long-term safety and reliability [
7]. Thus, it develops a sustainable and reliable transportation infrastructure for regional economic development aligned with SDG 9 (Industry, Innovation, and Infrastructure). Furthermore, this maintenance strategy also aligns with the target outlined in SDG 11 (Sustainable Cities and Communities), which emphasizes the importance of offering safe, affordable, accessible, and sustainable transport systems for all.
Machine Learning (ML) exhibits remarkable capacity in capturing complex non-linear relationships among multiple variables, introducing a new avenue for tunnel settlement prediction [
8]. Presently, ML-based research focuses on the construction period, employing prediction models like Support Vector Regression (SVR) [
9], Random Forest (RF) [
10], General Regression Neural Network (GRNN) [
11], and Back Propagation Neural Network (BPNN) [
12]. Notably, Shi et al. [
13] utilized Support Vector Machine (SVM) to precisely predict the arch crown settlement of a shallow buried tunnel; Zhang et al. [
14] employed Extreme Gradient Boosting (XGBoost) to estimate excavation-induced settlement considering parameters such as excavation speed, soil pressure, and water content; Moghaddasi and Noorian-Bidgoli [
15] developed a hybrid model of Artificial Neural Network optimized by an Imperialist Competitive Algorithm (ICA-ANN) to forecast maximum surface settlement for minimizing the impact of subway tunnel excavation on the urban area above. To address the challenge of effectively capturing time-dependent characteristics [
16], Deep Learning (DL) models such as Long Short-Term (LSTM) [
17] and Wavenet [
18] have been applied to improve prediction accuracy. Ge et al. [
19] proposed a Deep Belief Network optimized by a Whale Optimization Algorithm (WO-DBN) to predict shield-induced settlement, while Zhang et al. [
20] combined kinetic correlation analysis with Conv1d to introduce an expanding DL method for real-time ground settlement prediction.
Unlike the construction period, where high-frequency settlement data are directly acquired using real-time sensors, the operation period mainly relies on manually placed acquisition points to collect the tunnel surface heights. Due to the monitoring frequency and methods limitations, the collected data exhibit sparse characteristics of short length and univariate form. Therefore, the input data for training the prediction model only contain sparse univariate time-series data. Direct application of sophisticated AI models can result in inadequate parameter training or overfitting. Moreover, tunnel settlement during the operation period is influenced by multiple factors [
21], emphasizing the importance of feature extraction for prediction accuracy. Currently, most prediction models solely utilize the original temporal features of the univariate data without fully exploring the underlying influencing factors, thereby leading to subpar accuracy and generalization. In summary, tunnel settlement prediction during the operation period surpasses conventional time series prediction in complexity.
To overcome these challenges, this paper proposes an improved ML model based on sparse datasets to predict tunnel settlement during the operation period. The model’s effectiveness is validated using real datasets from a cross-river tunnel in Shanghai. This study has the following contributions.
To address ineffective parameter training caused by sparse settlement data, this paper utilizes a K-Means cluster model based on Dynamic Time Wrapping (DTW) to divide data from different acquisition points into distinct groups. It augments training samples and enhances parameter training efficiency;
To address the limitations of exclusively exploring temporal features in settlement data, this paper applies the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) model to decompose the univariate settlement data into multi-dimensional data. This approach fully explores the hidden influencing factors and improves feature mining capability;
To achieve precise tunnel settlement prediction across different locations, this study adopts an effective XGBoost with Bayesian optimization (BO)-informed parameter selection. This predictive model learns decomposed features for each group, dynamically optimizing parameter combinations to improve predictive performance, ensuring precise and stable predictions.
The subsequent sections are structured as follows:
Section 2 provides an overview of the existing research in settlement prediction.
Section 3 introduces the project background,
Section 4 elaborates the proposed methodology, and
Section 5 presents the experimental results, while
Section 6 draws conclusions.
2. Related Works
In general, tunnel settlement prediction models can be classified into model-based methods and AI-based methods. Model-based methods, including empirical solutions and numerical simulations, use professional physical conditions to build rigorous mathematical models. Additionally, empirical methods utilize professional knowledge to construct appropriate mathematical models for fitting observed data. Vorster et al. [
22] developed a conservative estimation method for ground settlement by considering physical variables such as tunnel geometry, pipe stiffness, and soil properties. Fang et al. [
23] established an empirical formula based on the normal distribution function to estimate underground settlement caused by shield tunneling. Lu et al. [
24] designed a tunnel settlement formula based on Gaussian functions using measurements from multiple field sites. However, empirical methods may yield unstable predictions due to variations in environmental conditions. Meanwhile, theoretical approaches are usually established based on certain physical assumptions and constraints to describe, explain, and predict settlements. They mainly use computers to simulate and visualize parameter variations in different scenarios [
25]. Klotoé and Bourgeois [
26] simulated the impact of umbrella arches on tunnel settlement using the CESAR-LCPC 3D finite element mode. Lai et al. [
27] used finite element difference methods to simulate the influence of underpass tunnel construction on settlement. Li et al. [
28] conducted sophisticated simulations using FLAC3D to analyze bridge piles and tunnel lining deformations caused by shield tunnel excavation. Numerical simulation methods demand substantial computational resources for adjustments in response to environmental changes, hindering their ability to offer timely dynamic predictions. While model-based methods offer explicit explanations for tunnel settlement, they are not extensively used due to constraints in valid mathematical assumptions and available multiple influencing variables.
AI-based methods explore potential correlations among input data to generate predictive results. Ling et al. [
29] employed RF to predict settlement in shield tunneling through complex geological formations and determine the relative importance of each input variable. Additionally, some researchers have compared the performance of multiple ML models on the same single dataset. Mahmoodzadeh et al. [
30] compared the tunnel settlement prediction performance of SVR, RF, Gradient Boosting Machine (GBM), XGBoost, and Light Gradient Boosting Machine (LightGBM). Tang and Na [
10] evaluated SVM, RF, BPNN, and Deep Neural Network (DNN), determining that SVM struck the best balance between training time and accuracy. Researchers have also integrated parameter optimization mechanisms like BO [
8], genetic algorithm optimization (GA) [
31], and particle swarm optimization [
32] to improve training effectiveness. Previous studies have demonstrated good settlement prediction accuracy with ML models, but their simple structures may struggle to uncover complex non-linear and non-stationary relations in time-series data. Consequently, they have poor prediction performance and limited applicability. By contrast, DL models can adaptively adjust parameters, mine hidden features [
33], and enhance prediction accuracy. Cao et al. [
34] proposed a Recurrent Neural Network-Gappy Proper Orthogonal Decomposition (RNN-GPOD) model for tunnel surface settlement prediction, aiding accurate tunnel boring machine operation. Wu et al. [
35] showed that LSTM outperforms traditional ML methods in settlement prediction tasks. Zhu et al. [
36] integrated K-Means and LSTM to predict tunnel settlement under different degradation patterns, ultimately assessing tunnel structure performance. While complex network structures excel in capturing intricate input data relationships, prediction model effectiveness hinges on data quality. Sparse input datasets may result in overfitting or diminished accuracy.
The model-driven approach, based on geomechanics, mathematical assumptions, and other theoretical knowledge, can be utilized to calculate the settlement evolution of tunnels under different circumstances. However, these methods simplified the external tunnel environment to meet corresponding mathematical assumptions. Nevertheless, as operational time increases, the surrounding environment undergoes continuous changes, which have a particular impact on the tunnel structure. Therefore, this approach cannot accurately predict the dynamic changes in structural settlement. The effectiveness of data-driven prediction methods has been validated in various engineering projects during the tunnel’s construction. These highly accurate AI models require abundant data samples to support model training and parameter updates. However, during the operation period, manually collected settlement data exhibit sparse characteristics of low frequency, short time series, and single variables. If these data are directly utilized for training AI models, problems like ineffective updates of model parameters or model overfitting may arise, which can result in decreased prediction accuracy. Therefore, current approaches fail to provide precise and stable settlement predictions during the operational period. Consequently, further research is needed to predict sparse settlement data during the operational period.
3. Project Overview
This study validates the proposed model using data from a cross-river tunnel in Shanghai. The tunnel spans 8950 m, with 7500 m crossing the Yangtze River Nan Harbor water area, characterized by a diameter of 15 m. As the longest cross-river tunnel in Shanghai, its structure has undergone spatial–temporal variations due to complex hydrological, geological, and human factors in its vicinity.
According to the structural characteristics of the tunnel, it is divided into five sections. A total of 227 locations within the tunnel are equipped with hydrostatic level gauges, and height is monitored semiannually. These sections and acquisition points are depicted in
Figure 1. The approach sections (Sections 1 and 5) connect the tunnel with surface roads, while the buried section (Sections 2 and 4) links the approach and shield sections (Section 3). The center region of the cross-river area represents the shield-driven section. This study uses datasets from June 2011 to December 2021 for training and validation, re-moving the abnormal data due to device damage, resulting in a dataset dimension 203 × 22.
Figure 2 illustrates accumulated settlement variations between 2011 and 2021, showing similar patterns in some areas but significant differences in others. Sections 1–3 exhibit an overall rise in settlement over time, while Sections 4 and 5 experience degradation. Notably, the area from S217 to S225 has significant long-term cumulative settlement, warranting focused maintenance.
TSI, proposed by Li et al. [
37], thinks settlement is a significant factor influencing the overall condition of shield tunnels. Unlike accumulated settlement, settlement can reveal timely changes in tunnel structure. As a result, this study focuses on predicting settlement to assist maintenance personnel in formulating preventative maintenance strategies and improving the operational performance of tunnels. The original monitored height is transformed into settlement data according to Equation (1).
is the settlement of acquisition point
at time
,
is the height of acquisition point
at
.
6. Conclusions
As a vital component of urban infrastructure, proper management and maintenance of tunnels are crucial for various industries [
51]. In the life cycle of urban tunnels, excessive settlement poses the risk of severe structural damage, jeopardizing tunnel safety, stability, and longevity. Due to the sparsity of tunnel settlement data during the operation period, the accuracy and generalization ability of AI-based prediction models is reduced. This paper proposes an improved ML model that combines time series clustering, time series decomposition, and BO-XGBoost to achieve better predictive performance with sparse settlement data during this period. The main contributions of this paper can be summarized as follows.
By using a DTW-K-Means time series clustering model, the data with similar settlement patterns are aggregated for unified training. This augmentation approach directly increases the number of training samples, thereby facilitating the learning of the future trends in tunnel structures and reducing the likelihood of overfitting.
By utilizing the CEEMDAN time series decomposition model, the univariate settlement data are decomposed into multi-dimensional data containing different temporal frequency information. This method allows the model to effectively reveal the underlying influencing features in the univariate data, enhancing prediction accuracy.
By adopting BO, the high-performing XGBoost model is able to search for the optimal combination of hyperparameters even with limited sample sizes. It enhances the predictive capability of the model.
In summary, the proposed model exhibits accurate and stable predictive performance when facing sparse univariate settlement data during the operational period. Compared to traditional ML models and LSTM, the proposed model achieves the lowest prediction error and highest accuracy on both the training and testing set. The study also utilizes the proposed model to forecast the next 3-year settlement trends of the tunnel in Shanghai. Based on the prediction results, preventive maintenance strategies are suggested. This facilitates the long-term operational performance, safety, and reliability of tunnels, which supports realizing SDG 9 (Industry, Innovation, and Infrastructure) and SDG 11 (Sustainable Cities and Communities). However, due to limitations in data acquisition methods during the operation period, this study only considers the temporal patterns of settlement without incorporating other important environmental factors such as traffic flow and tides. Consequently, the model’s performance on the testing set is inferior, and its generalization capability is limited.
Some further research directions may improve model performance. Firstly, tunnel structural settlement is susceptible to various external factors. Therefore, it is worth considering how to quantify and select multiple data sources to improve the model’s learning ability. Additionally, while existing research has mainly focused on the temporal dependency of settlement, there is explicit spatial dependency in settlement between adjacent areas. Therefore, incorporating spatiotemporal factors can also enhance the model’s performance.