1. Introduction
Crack propagation in materials is a critical phenomenon that can lead to structural failure and compromise the integrity and safety of various engineering applications. Understanding and predicting crack propagation is crucial for the design and maintenance of durable and reliable structures. Traditional approaches for predicting crack propagation involve complex mathematical models and empirical equations, which often have limitations in capturing the intricate nature of crack behavior under different conditions [
1,
2]. Crack propagation in materials involves complex and nonlinear behavior influenced by factors, such as material properties, loading conditions, and environmental elements. Traditional methods struggle to capture the complex relationships and dynamics involved in crack growth. Conversely, machine learning models excel at identifying complex patterns and nonlinear relationships within data, making them well-suited for modeling crack propagation processes.
Traditional methods often rely on simplified assumptions and limited representations of the factors influencing crack propagation, leading to inaccuracies. In contrast, machine learning models can incorporate a broader range of factors, including material properties, loading conditions, temperature variations, and more. By training on extensive datasets that encompass diverse conditions and parameters, machine learning models can learn the intricate relationships between these factors and crack propagation, enabling more accurate predictions.
Additionally, traditional methods lack generalization capabilities, meaning they struggle to predict crack propagation in materials or loading conditions that differ from those used during model development. On the other hand, machine learning models can generalize well to new and unseen scenarios, provided they are trained on diverse and representative datasets. This ability enables machine learning models to make accurate predictions for various materials, loading conditions, and temperature ranges, enhancing their applicability and reliability [
3,
4].
The utilization of machine learning models empowers researchers to surpass the limitations of traditional methods when it comes to predicting crack propagation. These models provide improved capabilities in capturing intricate behaviors, integrating a wider array of influencing factors, leveraging extensive and diverse datasets, and adapting to new scenarios [
1,
2,
5,
6,
7,
8,
9,
10]. As a result, machine learning-based approaches possess significant potential for advancing our comprehension of crack propagation and enhancing the design and dependability of structures and materials. Previous studies have utilized machine learning techniques for crack prediction in different materials. For instance, Ref. [
11] employed Random Forest Regressor to predict crack propagation in composite materials under various loading conditions. The study demonstrated that the model accurately captured crack behavior and showed improved performance compared to traditional analytical methods. Support Vector Regression (SVR) has also been applied for crack propagation prediction. Ref. [
12] utilized SVR to predict crack growth in metallic materials subjected to cyclic loading. Their results indicated that the SVR model achieved good accuracy in predicting the crack growth rate and exhibited better performance than conventional regression methods. Gradient Boosting Regressor has shown promise in crack propagation prediction as well. Ref. [
13] employed this model to predict crack growth in composite material under different environmental temperatures. Their findings indicated that the Gradient Boosting Regressor achieved high prediction accuracy and outperformed other regression models, such as Random Forest and Support Vector Regression. Ridge, Lasso, and K-Nearest Neighbors are also commonly used models in machine learning applications. In the context of crack propagation prediction, these models have demonstrated effectiveness in capturing complex relationships between crack behavior and influencing factors. For instance, Ref. [
1] employed Ridge regression and K-Nearest Neighbors to predict crack growth in polymer materials. The models exhibited high accuracy and provided valuable insights into the crack propagation process.
Several studies have demonstrated the effectiveness of machine learning models for crack propagation prediction, even with small and moderate datasets. For instance, Ref. [
14] utilized Random Forest and Support Vector Regression models for predicting crack propagation in concrete structures with a small dataset. Their results showed that both models achieved high prediction accuracy and outperformed traditional analytical methods. Similarly, Ref. [
15] employed a Gradient Boosting Machine model for predicting crack growth in metallic materials with a moderate dataset. Their findings showed that the model achieved good accuracy and provided valuable insights into the underlying crack behavior. These studies suggest that machine learning models can effectively predict crack propagation behavior, even with small or moderate datasets. By identifying the most important factors influencing crack behavior and accurately predicting crack growth, these models can aid in the design and development of more reliable and durable structures. In this study, we build upon the existing research by comparing the performance of these machine-learning models for predicting crack propagation in composite, metal, and polymer materials. We utilize experimental data obtained from crack propagation tests performed under varying load and temperature conditions. The performance of each model is evaluated to identify the best-performing models for each material type. The proposed study expands upon previously conducted research and significantly contributes to the current state of the art in crack propagation prediction in several means. The study focuses on crack propagation prediction in three distinct types of materials: composite, metal, and polymer. While previous research has often focused on individual material types, this study provides a comprehensive comparative analysis across multiple materials. By examining crack propagation in different materials, the study offers valuable insights into the varying behaviors, influencing factors, and predictive models specific to each material type. This expanded scope enhances our understanding of crack propagation across a wider range of materials and aids in the development of more versatile and accurate prediction models. The study investigates crack propagation under dynamic load and temperature conditions, which are critical factors influencing crack growth in real-world scenarios. While previous research has often focused on static load conditions, this study expands the understanding of crack propagation by incorporating the effects of dynamic loading and temperature variations. By considering these realistic operational conditions, the study enhances the applicability and relevance of crack propagation predictions to practical engineering scenarios. The proposed study expands upon previous research by conducting a comparative analysis across multiple materials, evaluating various machine learning models, considering dynamic load and temperature effects, validating with experimental data, and emphasizing practical implications for structural design. These contributions enhance our understanding of crack propagation prediction and offer practical tools and insights to improve the reliability and durability of structures in various material systems. The study’s findings also could have respective practical implications and applications such as:
Risk Assessment and Maintenance Strategies: The study’s findings enable better risk assessment and the development of proactive maintenance strategies. This information helps identify critical areas prone to crack initiation and propagation, allowing engineers to focus their efforts on preventive measures and inspections. By integrating machine learning models into structural health monitoring systems, early detection of crack propagation can be achieved, enabling timely maintenance and repair actions. This proactive approach enhances the dependability and longevity of structures by preventing failures and minimizing downtime.
Cost and Resource Optimization: Accurate crack propagation prediction enables optimized allocation of resources and cost-effective maintenance strategies. By identifying critical areas and accurately predicting crack propagation rates, engineers can prioritize inspection and maintenance efforts, allocating resources where they are most needed. This targeted approach optimizes resource utilization, reduces unnecessary maintenance activities, and lowers overall costs while ensuring the long-term dependability of structures.
Our study aims to demonstrate the potential of machine learning models as effective tools for predicting crack propagation in materials.
The remaining sections of this paper are structured as follows:
Section 2 provides an overview of the materials used in this study, along with the machine learning models employed for crack propagation prediction under coupled load and temperature. In
Section 3, we present the results obtained by applying the selected algorithms to three distinct material datasets. A comprehensive analysis of these results is provided, highlighting the performance and effectiveness of the various models. Finally, the concluding section summarizes the key findings of our research, shedding light on the comparative analysis of the machine learning models for crack propagation prediction under coupled load and temperature. It consolidates the main outcomes and implications derived from the study, potentially paving the way for further advancements in this field.
3. Results and Discussion
The study analyses experimental data on natural frequency, structural amplitude, temperature, and crack position. According to [
29], Spearman’s correlation coefficient is alternative measure of correlation that can capture non-linear relationships and are suitable for ranking or ordinal data. Spearman’s correlation coefficient (ρ) is calculated based on the ranks of the variables. Let us denote the ranks of X and Y as R(X) and R(Y), respectively. The Spearman correlation coefficient is given by:
where:
Σ denotes the sum of the respective values.
D represents the difference between ranks (R(X) − R(Y)).
n is the number of data points.
The value of ρ ranges between −1 and 1, where:
ρ = 1 indicates a perfect monotonic increasing relationship.
ρ = −1 indicates a perfect monotonic decreasing relationship.
ρ = 0 indicates no monotonic relationship.
Results show that ABS, concrete, and aluminum exhibit similar patterns in natural frequency, with frequency decreasing as crack depth increases (as shown in
Figure 4) in a nonlinear manner. Additionally, the impact of temperature on natural frequency is more pronounced in aluminum than in concrete and ABS. This is due to the higher elastic modulus and greater consistency of isotropic sheet metal properties relative to additive layer manufactured ABS and concrete [
1,
5].
As the crack depth and temperature increase in aluminum specimens, the amplitude also increases, consistent with the expected effect of reduced natural frequency resulting from a crack in the specimen. In contrast, ABS exhibits the opposite behavior: as crack depth increases, the natural frequency drops similarly to aluminum, but amplitude decreases instead of increasing, as shown in
Figure 5. This discrepancy can be explained by the different temperature conditions during the experiments. The maximum temperature for aluminum specimens is 200 °C, well below the temperature threshold for any significant material transformations during the short test duration. In contrast, the maximum temperature for ABS samples is 70 °C, which is close to the glass transition temperature.
Figure 6 provides a more comprehensive view of the frequency-amplitude relationship in the experimental data, revealing an inverse relationship between frequency and amplitude in aluminum and concrete, while ABS shows a positive relationship. No clear relationship is evident between crack depth and position in ABS and concrete data, but in aluminum specimens, increasing crack position results in increased amplitude and decreased frequency.
To enable a proper comparison and account for the diverse material behavior discussed earlier, three identical but independent models were trained and validated with unseen data. The proposed models were evaluated for training and validation accuracy using a mean squared error (MSE) metric. MSE is a commonly used statistical metric for evaluating the performance of regression models. MSE measures the average squared difference between the predicted values and the actual values of the target variable. For each data point in the test set, the model generates a predicted value based on the input features. The squared difference between the predicted value and the corresponding actual value is calculated. These squared differences are then averaged across all data points in the test set. MSE provides an estimate of the average squared error between the predicted values and the true values. It gives higher weight to larger errors due to the squaring operation. The value of MSE is always non-negative, with lower values indicating better performance. A value of 0 indicates a perfect fit, where the predicted values match the actual values exactly. In the context of the study, MSE was used as an evaluation metric to assess the performance of the machine learning models in predicting crack propagation. Lower MSE values indicate that the model’s predicted crack propagation values closely match the actual values, suggesting better predictive accuracy. It is worth noting that while MSE provides a useful measure of model performance, it is not the only metric available. Depending on the specific objectives of the study, other metrics, such as mean absolute error (MAE), R-squared (coefficient of determination), or domain-specific metrics, could also be considered to evaluate and compare the performance of the models. Essentially, these metrics represent the average predictive error of the models [
1].
Table 2 summarizes the performance metrics for training and validation predictions for the three materials.
ABS Models: The results show that the K-Neighbors Regressor model has a Training MSE of 0.195, indicating that, on average, the model’s predictions are off by 0.195 squared units from the actual values in the training set. The Validation MSE of the same model is 0.086, indicating that the model’s predictions are off by 0.086 squared units from the actual values in the test set. These values indicate that the K Neighbors Regressor model performs well on both the training and test sets, suggesting that it has not overfit the training data. The Lasso model’s Training MSE is 0.195, and the Validation MSE is 0.182. Both values are lower than the MSE of the baseline model (0.437), indicating that the Lasso model outperforms the baseline. The results for the Ridge model show that it has a Training MSE of 0.194 and a Validation MSE of 0.182. These values are also lower than the MSE of the baseline model (0.437), indicating that the Ridge model outperforms the baseline. Overall, all three models (K Neighbors Regressor, Lasso, and Ridge) outperform the baseline model, suggesting that they have learned meaningful patterns in the data and can make reasonably accurate predictions.
Figure 7 shows the actual against predicted ABS crack depth for selected machine learning models.
Aluminum Models: The results show that the K Neighbors Regressor model has a Training MSE score of 0.189 and a Validation MSE score of 0.079, which is lower than the MSE of the baseline model (0.8907), indicating that the K Neighbors Regressor model outperforms the baseline. The lower the MSE score, the better the performance of the model. The Ridge model has a Training MSE score of 0.170 and a Validation MSE score of 0.149, both of which are lower than the MSE of the baseline model. The Lasso model has a Training MSE score of 0.180 and a Validation MSE score of 0.159, which are also lower than the MSE of the baseline model. In all cases, the Validation MSE score is lower than the MSE of the baseline model, suggesting that the models have learned meaningful patterns in the data and can make reasonably accurate predictions.
Figure 8 shows the actual against predicted Aluminum crack depth for selected machine learning models.
Composite Models: For all models (Gradient Boosting Regressor, SVR, and Random Forest Regressor), the Mean Squared Error (MSE) is used to evaluate their performance, which measures the average squared difference between the predicted and actual values. The lower the MSE, the better the performance of the model. The results show that the Gradient Boosting Regressor model has a Training MSE of 0.208 and a Validation MSE of 0.014, indicating that the model performs well on both the training and test sets and has not overfit the training data. The SVR model has a Training MSE of 0.182 and a Validation MSE of 0.094, indicating that the model’s predictions are off by 0.182 and 0.094 squared units from the actual values in the training and test sets, respectively. The Random Forest Regressor model has a Training MSE of 1.649 and a Test MSE of 0.189. Although the Test MSE is lower than the baseline model, it is higher than the Test MSE of the other two models, suggesting that the model may have to overfit the training data. Overall, the Gradient Boosting Regressor and SVR models seem to be the best performers based on these results. They have the lowest Test MSE values and are less likely to overfit compared to the Random Forest Regressor model.
Figure 9 shows the actual against predicted Concrete crack depth for selected machine learning models.
The most important predictive features for aluminium are natural frequency, crack position, and temperature. For ABS and concrete, natural frequency and amplitude dominate, while the position of the crack appears to be of little importance. This may seem counterintuitive, as theory suggests that the closer the crack is to the fixed position of the specimen, the more pronounced the effect on dynamic response. However, the effect of crack position is captured by the model under the natural frequency and amplitude terms, resulting in a small and relatively unimportant crack position coefficient. Using all of the features as predictors yields the lowest mean squared error (MSE) but removing crack location does not significantly increase the error in ABS and concrete models. Removing amplitude has a much more significant effect on the error, increasing it by approximately 60% relative to the all-features baseline. The most significant effect is seen by removing natural frequency from the feature pool, which increases the error significantly. This suggests that natural frequency is the dominant feature in the prediction of crack depth, and its effect on the model error is greater than the rest of the features combined.
According to [
30,
31], the formal verification techniques mentioned further down provide a general framework for assessing the correctness, reliability, and robustness of machine learning models used in predicting crack propagation under coupled load and temperature.
Formal Specification: The first step in applying formal verification methods is to establish a formal specification that defines the desired properties or requirements for the crack propagation prediction models.
Model Checking: Model checking can be used to verify if the machine learning models satisfy the specified properties. In this context, it would involve exhaustively exploring the model’s behavior under various coupled load and temperature conditions to check if the predicted crack propagation adheres to known physical laws or expected behavior.
Sensitivity Analysis: Formal verification methods can also include sensitivity analysis, which involves analyzing the impact of changes in input parameters or model assumptions on the predicted crack propagation. This analysis can provide insights into the robustness of the models and their sensitivity to variations in load, temperature, or other factors.
Statistical Testing: Statistical testing methods can be employed to evaluate the performance of different machine learning models. This involves comparing the predicted crack propagation results from different models using appropriate statistical tests to determine if there are significant differences in their performance or accuracy.
It is important to note that the application of formal verification methods to this specific paper would depend on the details of the research and the methodology employed.
The computational time and complexity associated with training algorithms exhibit variations contingent upon several factors, including dataset size and algorithmic implementation. When dealing with small to moderate datasets, linear regression techniques, such as Ridge and Lasso, demonstrate efficient training times, characterized by a complexity of O(p
3), where p corresponds to the number of features. In contrast, the kNN algorithm, which is non-parametric in nature, requires minimal training time due to its reliance on storing data points. However, the computational expense arises during prediction when kNN necessitates distance calculations, resulting in a complexity of O (n d), with n referring to the number of training instances and d representing the number of features. Support Vector Regression (SVR), an algorithm based on support vector machines, generally exhibits reasonable training time for small to moderate datasets. Nevertheless, the process of solving the quadratic programming problem can become time-consuming for larger datasets. Random Forest Regressor, an ensemble algorithm, exhibits efficient training for small to moderate datasets with a complexity of O (M × n × log(n)), where M denotes the number of trees and n represents the number of training instances. On the other hand, Gradient Boosting Regressor, another ensemble algorithm, typically entails higher computational requirements, especially when considering larger numbers of iterations. Consequently, its complexity generally surpasses that of Random Forests. The selected algorithms can typically achieve training within an acceptable timeframe for small and moderate datasets. However, the actual time and complexity involved are contingent upon specific dataset characteristics, implementation nuances, and the availability of computational resources [
1,
21,
22,
23,
24].
4. Conclusions
In recent years, machine learning has emerged as a powerful tool for analyzing and predicting outcomes in various domains. One such application is the prediction of damage severity in materials, which holds great importance in structural health monitoring and maintenance. This paper presents a comprehensive study that harnesses machine learning techniques to predict crack depth in different materials, namely ABS, aluminum, and concrete.
The study utilizes experimental data obtained from previous research and employs a range of regression models, including K Neighbors Regressor, Lasso, Ridge, Gradient Boosting Regressor, and SVR, to forecast crack depth. Model performance is evaluated using Mean Squared Error (MSE) on both training and test sets. The findings demonstrate that all models surpass the baseline model, indicating their ability to capture meaningful patterns within the data and make reasonably accurate predictions.
Furthermore, the study identifies the most influential predictive features for each material. For instance, the impact of temperature on natural frequency exhibits a more pronounced effect in aluminum compared to concrete and ABS. This discrepancy can be attributed to the higher elastic modulus and greater consistency of isotropic sheet metal properties in aluminum relative to additive layer manufactured ABS and concrete. Moreover, the study reveals that all models outperform the baseline, implying their capability to learn significant patterns and facilitate accurate predictions.
The study offers valuable insights into predictive features and model performance in crack depth prediction, which have practical implications for specialists involved in structural health monitoring and maintenance. By employing machine learning algorithms, it becomes possible to reduce the time and cost associated with traditional testing methods while enhancing the accuracy and reliability of damage severity predictions.
It is worth noting that the training times for the discussed algorithms, such as Ridge, Lasso, kNN, SVR, Random Forest Regressor, and Gradient Boosting Regressor, are generally reasonable for small to moderate datasets. However, the actual time and complexity required for training may vary depending on dataset characteristics, implementation intricacies, and computational resources. It is essential to consider these factors when selecting and applying these algorithms to ensure efficient and effective model training.
Future studies could explore the adoption of more advanced machine learning algorithms or ensemble techniques to further improve the accuracy of crack depth prediction. Additionally, investigating the influence of additional features, such as material composition or environmental factors, could provide valuable insights into the mechanisms of crack formation and propagation. Furthermore, evaluating the models on different datasets and in real-world scenarios would contribute to assessing the generalizability of these models beyond the specific materials and conditions used in this study.
This study underscores the potential of machine learning techniques in predicting damage severity in materials. The results highlight the superiority of machine learning models in providing accurate and reliable predictions, which can significantly benefit structural health monitoring and maintenance practices. Further research is warranted to explore advanced techniques and the impact of additional features on predictive accuracy. Ultimately, the continued development and application of machine learning in this field could lead to remarkable advancements in structural engineering and maintenance.