1. Introduction
Prominence in the construction industry has been gained by Concrete-Filled Double-Skin Steel Tube (CFDST) columns due to their inherent advantages, including high load-bearing capacity [
1], high strength-to-weight ratio [
2], and favorable dynamic characteristics [
3] compared to traditional composite columns. An innovative solution for modern load-bearing structures is offered by these columns, contributing to the efficiency and sustainability of engineering projects. However, a complex challenge remains the accurate prediction of the torque capacity of CFDST columns under torsion loading conditions [
4], and provisions for this are still lacked by modern structural design standards.
In a bid to understand this complex behavior, a few experimental (e.g., [
5,
6,
7]) and numerical studies (e.g., [
8,
9]) have been conducted by researchers. Experiments on the torque behavior of tapered CFDST columns with large hollow ratios ranging from 0.75 to 0.9 under two loading modes: pure torsion and combined compression-torsion, were conducted by Deng et al. [
5]. Through test investigation and numerical analysis, the torque behavior of CFDST columns under combined compression and cyclic torsion was studied by Zhang et al. [
6], who proposed axial compression–torque correlation equations and formulas for predicting the torque capacity of CFDST columns under combined compression and torsion via regression analysis. The behavior of CFDST members under pure torsion was investigated by Huang et al. [
7], who employed nonlinear finite element analysis (NLFEA) to examine failure modes, torque–rotation curves, and the interaction between steel tubes and concrete in CFDSTs. Reasonably well with experimental data were aligned the results from NLFEA by their study, which identified six key parameters influencing torque versus torsional rotation curves in CFDST members.
Numerical investigations to understand the torsional behavior of CFDST columns have also been carried out. An NLFEA study on the torsional behavior of circular CFDST columns was conducted by Lu et al. [
4], who established a refined NLFEA model, validated using experimental results. Through parametric analyses, the influence of significant parameters, such as concrete strength, steel tube properties, and aspect ratio, on the torsional capacity of circular CFDST columns, was explored by their research, resulting in the proposal of accurate design formulas for predicting torsional capacity, with demonstrated accuracy in comparison to tests. In a related study, the numerical analysis of torsional behavior was focused on by Lu et al. [
8], but their attention was shifted to rectangular and square CFDST members. They successfully developed NLFEA models, validating them against experimental data. Parametric investigations to assess the effects of factors such as hollow ratio, concrete strength, steel yield strength, and steel tube cross-sectional properties on the torsional response of these members were conducted. The outcome was the formulation of precise design formulas for predicting the torsional capacity of rectangular and square CFDST members.
Turning their attention to an emerging area, the behavior and bearing capacity of tapered CFDST members that fall outside the realm of traditional design codes were explored by Wang et al. [
9]. They used numerical and theoretical research methods, including NLFEA, to understand how these tapered CFDST members respond to torsional and compression–torsion loads. Their study introduced a modified design method, especially suitable for predicting the performance of tapered CFDST members with out-of-code parameters.
While the aforementioned studies have significantly advanced our understanding of CFDST columns’ behavior under torsion and provided foundational insights for design and analysis, several limitations persist within the current body of research. Firstly, many of the experimental [
5,
6,
7]) and numerical studies [
8,
9] focused on specific column configurations or loading conditions, limiting the generalizability of their findings. Furthermore, the complex interactions between the concrete core, inner, and outer steel tubes under torsion are not fully explored, especially for columns with non-standard geometries or material properties. Additionally, the current methods for predicting torque capacity primarily rely on the empirical formulas derived from limited datasets, which may not accurately capture the behavior of CFDST columns under diverse conditions. Moreover, the lack of comprehensive design provisions for torsion in CFDST columns in existing structural design standards underscores the need for a more versatile and robust prediction tool.
Often relying on analytical methods (e.g., [
4,
7]), which may not adequately capture the complex behavior of these columns under pure torsion, are conventional approaches for estimating torsional capacity. To the best of the authors’ knowledge, the torsion provisions for CFDST columns are not addressed by any current structural design standard. To address these limitations, this paper introduces a novel data-driven analysis using machine learning (ML) techniques for predicting the torque capacity of circular CFDST columns.
Significant attention in various engineering disciplines due to their capacity to comprehensively analyze large datasets [
10], identifying complex patterns and relationships that may not be evident through conventional methods [
11], has been gained by ML in recent years. Several recent studies have explored the application of ML techniques in predicting the strength of concrete-filled steel tubular (CFST) columns under various loading conditions. A framework utilizing the gradient tree boosting (GTB) algorithm to forecast the strength of CFST columns subjected to concentric loading was introduced by Vu et al. [
10]. An artificial neural network (ANN) model tailored to estimate the ultimate compressive strength of square CFST columns, demonstrating enhanced performance compared to traditional design codes, was proposed by Le [
12]. The integration of mechanism analysis with machine learning models, particularly Gaussian process regression (GPR), to predict the axial compression strength of circular CFST columns, was investigated by Hou and Zhou [
13]. Their findings suggested superior accuracy and broader applicability compared to existing design standards. Similarly, a machine learning model capable of estimating the axial capacity of both circular and rectangular CFST columns under various loading conditions, offering a high level of accuracy and serving as a viable alternative to empirical and theoretical formulations, was developed by Faridmehr and Nehdi [
11].
The exploration of ML techniques in structural engineering, particularly for predicting the behavior of CFST columns, has showcased the potential for highly accurate predictions that often exceed conventional design standards. Despite these advancements, the ML literature reveals gaps such as a limited focus on the torsional behavior of CFDST columns and challenges in model interpretability and data availability. Our study aims to bridge these gaps by specifically targeting the prediction of torque capacity in CFDST columns under torsion, leveraging a comprehensive database derived from NLFEA simulations for model training and validation. In doing so, we not only enhance the predictive accuracy and applicability of ML models but also contribute to the structural engineering domain by providing interpretable models that elucidate the complex interplay of factors influencing torsional capacity. This approach marks a significant step forward in the application of data-driven methodologies to the nuanced field of structural engineering, addressing both the technical and practical challenges highlighted in previous research.
This study aims to develop reliable and accurate predictive ML models for estimating the torque capacity of circular CFDST columns under pure torsion. By leveraging ML algorithms, the intricate behavior of these columns, considering multiple influencing factors, and providing comprehensive prediction tools for structural engineers, are sought to be captured. A new database, integrating data from multiple NLFEA instances to ensure the adequacy and diversity of the training set, is compiled. The integration of ML in the analysis of CFDST columns represents a significant advancement in structural engineering. This data-driven approach offers the potential to overcome the limitations of conventional methods and provide more accurate and reliable estimates of torque capacity. The outcomes of this study are expected to enhance the design and performance of CFDST columns in various engineering applications, ultimately contributing to safer and more efficient structures.
5. ML Model Deployment and Validation
5.1. Data Preparation and Cleansing
The raw dataset underwent a comprehensive pre-processing regimen to render it apt for both the training and evaluation of ML models. Initially, the dataset was divided through a two-step stratified random sampling method. Firstly, 80% of the data were earmarked for a combined pool of training and validation purposes, with the remaining 20% allocated for testing. Subsequently, the training-validation pool was further split, allocating 75% for training (effectively 60% of the total dataset) and 25% for validation (constituting 20% of the total dataset). This structured division ensures a balanced distribution of data across training, testing, and validation sets, facilitating the development of robust and generalizable ML models [
47].
Feature scaling ensued, employing the StandardScaler from the Scikit-Learn library to normalize the scales of the features, ensuring that no variable disproportionately influences the model’s learning process. The dataset was thoroughly scrutinized for missing values; the absence of such discrepancies negated the need for imputation techniques or the elimination of data points, allowing the dataset to maintain its original volume of 806 entries. A critical step involved segregating the target variable from the predictor features, a practice that is crucial for preserving the dataset’s integrity and optimizing the generalization capability of the ensuing models.
5.2. Details of ML Deployment
5.2.1. Hyperparameters of the ML Models
Grid search and 10-fold cross-validation were employed by the study to fine-tune the hyperparameters for 5 ML algorithms [
38,
39,
40,
43]. Hyperparameters, which are crucial for optimal model performance, are predefined by users and require meticulous optimization [
45,
46]. A systematic exploration of model performance across various hyperparameter combinations is conducted by grid search, while 10-fold cross-validation mitigates overfitting and guards against data-specific biases. Optimal hyperparameters and their ranges were determined based on insights from prior studies on concrete-filled steel columns [
38,
39,
40,
43,
45,
46] and relevant expertise, as summarized in
Table 5. Default values were adopted for hyperparameters that were not explicitly listed in the table. Further details on hyperparameters and algorithms can be found in the provided references.
5.2.2. Evaluation Metrics
Four performance metrics were utilized to evaluate the quality and accuracy of the ML models, notably:
Coefficient of determination (
):
measures the proportion of the variance in the dependent variable explained by independent variables [
48]. A higher
value indicates a better fit of the model to the data. An
value of 1 implies a perfect fit, while 0 suggests no explanatory power. Mathematically,
is defined as:
Mean square error (MSE): MSE quantifies the average deviation between predicted and actual values [
48]. It is computed as the average squared differences between predictions and actual values:
Root mean square error (RMSE): RMSE quantifies the average deviation between predicted and actual values [
48]. It is computed as the square root of the average squared differences between predictions and actual values:
Mean absolute error (MAE): MAE measures the average absolute difference between predicted and actual values. It provides a measure of the average magnitude of the errors made by the model [
48]:
5.3. Performance Comparison among ML Algorithms
The relationship between the predicted torque capacity by ML (
) and the actual torque capacity (
) for the five ML models across both training and testing datasets is illustrated by the scatterplot depicted in
Figure 8. The close alignment of the points on the plot with the ideal prediction line (
) indicates the high accuracy in predicting the torsional capacity of CFDST members. Prediction beyond 500 kNm for the testing and validation sets seems less strong due to the fact that most of the data are poorly represented in this range (
Figure 4). Future studies on this topic should simulate NLFEA CFDST members with torque capacities beyond 500 kNm.
To delve deeper into the performance of these models that only focus on training and validation sets, four metrics—
, RMSE, MAE, and MAPE—were employed. These metrics are outlined in
Table 6 and further visualized in
Figure 9. Each model’s performance is evaluated using four key metrics:
(coefficient of determination), MSE (mean squared error), RMSE (root mean squared error), and MAE (mean absolute error).
Upon analysis, it was revealed that the highest R2 values and the lowest error metrics across both the training and testing sets were exhibited by CatBoost and GBM. Specifically, an R2 of 0.998 on the training set and 0.977 on the testing set was achieved by CatBoost, with the corresponding MSE values of 7.03 and 23.14, RMSE values of 4.91 and 10.99, and MAE values of 6.83 and 12.69, respectively. Similarly, the strong performance with an R2 of 0.998 on the training set and 0.972 on the testing set, accompanied by MSE values of 7.70 and 25.43, RMSE values of 6.43 and 12.00, and MAE values of 7.72 and 14.87, respectively, was demonstrated by GBM.
Conversely, competitive results were also yielded by XGBoost, RF, and DT, but they exhibited slightly lower R2 values and higher error metrics compared to CatBoost and GBM. Notably, the lowest RMSE and MAE values on the training set were demonstrated by RF, indicating its effectiveness in capturing the underlying patterns in the data.
These findings suggest that particularly well suited for predicting the torque capacity of CFDST members under pure torsion are CatBoost and GBM, due to their superior performance in both model fitting and generalization. The importance of selecting an appropriate ML model based on a thorough evaluation of its performance metrics is also highlighted by the results, considering factors such as overfitting tendencies, computational efficiency, and interpretability.
7. SHAP Interpretation
Recent advancements, such as SHAP, have made a significant impact on enhancing the transparency and interpretability of ML predictions [
49]. SHAP provides insights on two levels: it illuminates the influence of each input variable globally, contributing to the overall interpretability, and it enables local interpretability through Shapley values, which assess the importance of features for individual predictions. The explanation model, denoted by
, is described by a linear function of binary features in additive feature attribution methods [
49].
where
M denotes the number of input features and
represents the Shapley value of feature
i, the computation is outlined as follows. Let
S denote the set of all features, and
denote the set of features excluding feature
i.
refers to the power set of
S, containing all possible subsets of
S, while
represents the cardinality, or the number of elements, in set
A. The Shapley value (
) for a particular feature
i is calculated using the given formula [
49]:
In this process, the sum is taken over all possible subsets A that exclude feature i, with F denoting the set of all features. The terms and represent the model’s output with and without the inclusion of feature i in subset A, respectively. This formula quantifies the marginal contribution of feature i across all possible combinations of features, offering a comprehensive view of the possible feature subsets.
Focusing on the prediction of torque capacity in CFDST members under pure torsion, the CatBoost model has been identified as the most effective among the five ML models evaluated. Therefore, for SHAP analysis, the CatBoost model is utilized, highlighting its capability to not only provide accurate predictions but also to offer meaningful insights into the factors influencing those predictions.
7.1. Global SHAP Interpretation
The factors influencing the torque capacity of CFDST columns are understood to be crucial for their optimal design. SHAP analysis has been recognized as a potent tool for unveiling the significance of various features within the CatBoost model, providing insights at both the global and local levels. Globally, the overall impact that each feature exerts on the predictions of torque capacity is elucidated by SHAP. Identified through this analysis (see
Figure 11), the concrete compressive strength (
), the yield strength of the outer steel tube (
), the thickness of the outer steel tube (
), and the thickness of the inner steel tube (
) have been found to be the top four influential parameters. In the SHAP summary plot (
Figure 12), lower feature values are represented by blue dots, while higher feature values are signified by red dots. It is noteworthy that a positive influence on the SHAP values is exhibited by nearly all parameters, indicating that the enhancement of the torque capacity of CFDST columns is contributed to by increasing these parameters. This observation is in alignment with the findings from previous studies [
4,
7], reinforcing the importance of these factors in the structural performance of CFDST columns.
7.2. Local SHAP Interpretation
Averages are only provided by Global SHAP which may not reveal specific critical features, while a closer examination of individual data points is offered by local SHAP. When the CatBoost model for CFDST torque capacity under pure torsion is analyzed, crucial insights are revealed by local SHAP, as depicted in
Figure 13, regarding the interaction among input features. A clear interrelationship between the input features and their contributions to the overall prediction of the CatBoost model is evident across all plots in
Figure 13.
For the outer steel tube diameter (
) against concrete compressive strength (
), it is indicated in
Figure 13a that a positive effect on torque capacity prediction using the CatBoost model is associated with a larger
combined with a high
for CFDST members. Conversely, for CFDST columns with a smaller
, negative SHAP values for various levels of
are exhibited, although these values are only slightly below zero, indicating a minor influence on the reduction in torque capacity.
Regarding the thickness (
) against the yield strength (
), it is revealed by
Figure 13b that a positive impact on torque capacity prediction for the outer steel tube of CFDST members is associated with a larger
combined with a high
. However, for CFDST columns with a smaller
, high negative SHAP values for high values of
are exhibited, indicating a significant influence on the reduction in torque capacity.
For the inner steel tube diameter (
) against concrete compressive strength (
), it is shown by
Figure 13c that a positive effect on torque capacity prediction for CFDST members is associated with a larger
combined with a high
. Conversely, for CFDST columns with a smaller and medium
, negative SHAP values for various levels of
are exhibited, although these values are slightly below zero, indicating a relatively minor influence on the reduction in torque capacity.
Regarding the inner steel tube thickness (
) against the yield strength (
), it is indicated by
Figure 13d that the relationship between
and
is unclear in the CatBoost model, as expected due to the intuitive independence of these two variables.
For the yield strength (
) against concrete compressive strength (
), it is demonstrated by
Figure 13e that a positive impact on torque capacity prediction by the CatBoost model is associated with large values of both
and
. Although a linear relationship is observable, for very small values of
, the SHAP value turns negative only for small values of
, which are positive for the medium and high values of
. This suggests that a consistently positive impact on torque capacity prediction using the current CatBoost model is provided by
, regardless of the values of
.
For the inner steel yield strength (
) against the inner steel tube diameter (
), an exponential relationship between
and
is illustrated by
Figure 13f, indicating that larger values of these parameters lead to an exponential increase in torque capacity prediction via the CatBoost model. However, for smaller values of
and high values of
, negative SHAP values are observed, suggesting a slight negative impact on torque capacity by
when the hollow section ratio of the CFDST column increases.
7.3. Implications for Engineering Practice
The performance evaluation of the CatBoost and XGBoost models, along with comparisons to empirical models and SHAP analysis, provides valuable insights for engineering practices in the design of CFDST columns subjected to pure torsion. Superior predictive capabilities are demonstrated by the CatBoost model, as evidenced by high values and minimal mean errors, establishing it as a reliable tool for engineers in search of precise torque capacity predictions.
Identified through global SHAP analysis, crucial parameters such as concrete compressive strength, outer steel tube yield strength, outer steel tube thickness, and inner steel tube thickness offer specific targets for optimization [
50]. The intricate relationships unveiled by the local SHAP analysis highlight the interconnectedness of these parameters, providing engineers with a comprehensive understanding for customized design choices. Practical implications include the prioritization of key parameters to enhance torque resistance and the integration of advanced machine learning techniques into structural engineering practices [
51]. This research narrows the gap between conventional empirical models and tree-based ML tools, enabling engineers to develop more effective and informed designs of CFDST columns for superior structural performance [
50] by using the presented hyperparameters (e.g., for Catboost) to build a graphical user interface for the ease of use by lame engineers who may not understand ML codes (see
Figure 14).