1. Introduction
Unconfined masonry has been used worldwide for hundreds of years and is still used today. This type of construction is susceptible to earthquakes, environmental degradation, and occupancy changes. Poor construction practices, inferior materials, and structural detailing defects reduce the strength. These factors seek an effective strengthening approach to improve masonry column capacity. Strengthening or upgrading existing structures is treated as more cost-effective than demolishing or rebuilding. For future generations to enjoy historic structures, they must be strengthened and retrofitted. Strengthening compression elements is crucial to preventing catastrophic damage to historic vertical load-bearing structures [
1,
2].
Fiber-reinforced polymer (FRP) materials might be used to reinforce and strengthen concrete columns due to their light weight and high strength [
3]. FRP composites reduce dead weight and simplify application in restricted areas. Several studies have explored the possibility of confining concrete columns externally with FRP by associating the fibers perpendicular to the columns’ vertical axis in the direction of hoop stresses. Bashiri and Toufigh [
4] tested and analyzed a CFRP strips to confine concrete panels to reduce the weight of Buckling-restrained braces. Rodríguez et al. [
5] introduced strengthening heavily damaged beam-column assemblies confined by CFRP wrapping and SFRM casing. Some researchers studied FRP-wrapped concrete columns [
6,
7,
8]. Thamboo et al. [
9] strengthened masonry wallets with CFRP and tested under concentric and eccentric loads. Thamboo [
3] compared experimental and analytical approaches to evaluate the performance of axially loaded masonry columns confined with composites. Wan et al. [
10] tested the mechanical behavior of basalt-reinforced masonry columns under eccentric loading. Li et al. [
11] studied axially loaded masonry columns confined textile reinforced concrete with short fiber. FRP confinement on masonry columns has been studied. [
12,
13] to evaluate masonry wrapped with FRP jackets. Faella et al. [
14] proposed calibrated confinement models. Internally confined masonry with FRP bars injected in column holes with or without FRP wraps and discontinuous confinement by FRP strips have been introduced by Micelli et al. [
15]. Discontinuous confinement by FRP strips were studied in [
16,
17,
18]. Experimental results of FRP confinement were compared to analytical models to validate their accuracy [
3,
14,
19,
20,
21,
22].
The Artificial Neural Networks (ANN) is one of the most popular Artificial Intelligence (AI) techniques that are used a lot in civil and structural engineering to predict how structural elements such as beams and columns will behave based on experimental results considered as training, testing, and verification data [
23,
24]. The ANN-based model is a very effective approach to predict the properties of concrete [
25,
26]. Hasançebi and Dumlupınar [
27] developed linear and nonlinear model upgrading of RC T-beam bridges using ANN networks. The Support Vector Regression (SVR), another popular technique of AI, has also been used in predicting the properties of materials [
28]. Sun et al. [
29] demonstrated the chemical, mechanical, and hydrothermal stimulation for waste glass-reinforced cement. However, very limited work has been conducted in the area of predicting behavior of confined masonry column under axial load using AI techniques such as ANN and SVR. A symbolic regression (SR) approach is a very recent concept of artificial intelligence that could be used to model the behavior of structural element. Such a machine learning system has been employed in several studies [
30,
31]. Rezaei et al. [
32] studied seismic fragility assessment of RC box-girder bridges employing the SR approach. Symbolic regression in materials science was presented by Wang et al. [
33] via dimension-synchronous-computation. Mansourdehghan et al. [
34] conducted data-driven evaluation of RC shear walls utilizing visual damage. However, due to the material’s complexity and heterogeneity in comparison to concrete, there are a number of analytical expressions for determining the compressive strength of FRP-confined masonry.
Therefore, the current study deals with the analysis of FRP confinement of masonry columns in order to propose a new model for estimating compressive strength of FRP confined masonry using symbolic regression approach by adopting genetic programming which is an artificial intelligence technique to solve symbolic regression and produce a closed-form mathematical expression which is much simpler to inspect and implement, particularly for hand calculations. The artificial intelligence model was trained by largest database ever built in the literature with varying parameters and the performance of the proposed model was compared with the performance of twelve most important theoretical models available in the scientific literature. Furthermore, the comparisons of the models have been clearly shown that the proposed model has an excellent agreement with R2 equals to 0.91 which is superior compared to the existing models.
The manuscript is structured in a simple outline. The available analytical FRP confined masonry models and experimental database collected from scientific literature are presented in
Section 2 and
Section 3, respectively. Brief description about symbolic regression model is given in
Section 4. The results and the findings, and comparison of proposed symbolic regression model with existing formulas in literature are explained in the following sections. In
Section 7, reached conclusions are summarized.
2. Analytical FRP Confined Masonry Models
Few analytical models are currently available to compute the compressive strength of FRP-confined masonry. Such existing models rely on both the unconfined masonry strength,
; and the effective lateral confining pressure
. Some of them are linear, while others are nonlinear. However, regardless of the complexity, they were mostly derived from a small number of experimental data. The Italian National Research Council (CNR) [
35] reported common relationship considering the nominal density of the masonry element
, stated in kg/m
3, in the estimation of
. In particular, the parameter approximately estimates the effect of porosity and voids of both the constituent materials as well as texture of masonry on the axial compressive strength.
The generalized expression to obtain the confined masonry’s compressive strength is shown in Equation (1).
The
is compressive strength of masonry and
is compressive strength of FRP confined masonry,
denotes effective confining pressure, where
is a non-dimensional coefficient unless a more detailed analysis is accomplished,
may be derived as follows:
and are coefficients equal to 1.0 if no additional experimental data is obtainable. is the masonry mass-density and equivalent to the symbol ().
Equation (3) is a general formulation for calculating the effective confinement stress, where
is the equivalent confinement stress produced by FRP composite on wrapped column. The effective confining pressure,
is determined considering the cross-section shape and the FRP strengthen system.
The
is the efficiency coefficient derived as a product of the horizontal and vertical efficiency coefficients,
and
, respectively, and
denotes the coefficient incorporating composite topology and material. The lateral confining pressure can be calculated according to the rigid body equilibrium between pressure and force in the external wrap. The lateral pressure produced by FRP jackets on rectangular or square cross-sections can be estimated using Equation (4) for columns with continuous FRP wrapping. Using Equation (5) the lateral pressure can be calculated for columns with discontinuous FRP wrapping:
The
and
denote the tensile modulus of elasticity and thickness of the FRP jacket, respectively. The theoretical prediction is calculated using the nominal thickness of cured laminate. The cross-section dimensions are defined by the width
and the thickness
.
is the width of FRP reinforcement, where
is the center-to-center distance in the middle of two consecutive FRP U-wraps. The physical meaning of
,
, and
are explained in
Figure 1.
Because of a lacking sufficient information about the strain, the efficient factor is used to calculate the FRP strain’s reduced design value, and relate it with ultimate strain of FRP coupon testes. All discussed equations were reported by setting the strain efficiency factor to 1. This is needed in the expressions to calculate the lateral confining pressure .
The FRP strain’s reduced design value,
, can be calculated by Equation (6).
where
denotes the environmental conversion factor,
and
denote the ultimate characteristic strain and partial factors of FRP composite confinement system, respectively. The maximum permissible strain is 0.004 according to CNR-DT 200 R1 [
35]. The environmental and safety coefficients
and
were assumed as 1.0. The maximum permissible strain was not imposed when calculating the value of
to estimate the confining pressure in all theoretical prediction models.
The effectiveness of any composite’s confinement is determined by the shape of the column’s cross-section. Generally, the circular shape is the most effective cross-section for confining material. For rectangular cross sections, the confinement is partially effective along a portion of the section’s total periphery. As a result, the sharp edges of the section are rounded to a specified radius to increase the confinement’s effectiveness. The horizontal coefficient of efficiency, , is calculated by dividing the area of effectively confined masonry by the total area.
For rectangular sections reinforced continuously with FRP, due to the arch effect, the confined section of the masonry column comprises merely a segment of its total surface area. The extent of the confined area is dependent on the rounding radius used.
The efficiency coefficient of rectangular cross-section columns wrapped with external FRP materials is calculated as follows:
Here, and , is corner radius of the cross-section and denotes the area of masonry prism.
If the FRP strengthening system is non-continuous, the vertical coefficient of efficiency is calculated by Equation (6) and
is equal to 1 for a continuous confinement.
It is worth mentioning that FRP-confined masonry columns failure is commonly caused by rupture of the outer jacket. Stress concentration at the cross-section corners is what causes prismatic columns to fail. Due to the arch effect, only a small portion of the masonry column’s surface area is well confined. The confined area’s size is determined by the rounding radius. To calculate confinement efficiency, it is commonly assumed that the column cross-section has parabolic branches. If the column is confined by a discontinuous confinement, achieved by wrapping spaced FRP sheets, then the column could fail before the rupture of FRP. The failure is the outcome of the crushing of a considerable quantity of masonry from the unwrapped areas. Establishing an appropriate spacing between the FRP strips is essential to ensure that the confinement efficiency.
Very few analytical models exist in literature to estimate the compressive strength of the FRP confined masonry. Twelve distinct approaches are considered to calculate the compressive strength: Krevaikas and Triantafillou [
21], Corradi et al. [
19], Di Ludovico et al. [
20] for clay and tuff, Faella et al. [
14] the simplest and accurate, CNR-DT 200 R1 [
35], Rao and Pavan [
36], Ramaglia et al. [
22] for clay and tuff, and Napoli and Realfonzo [
37] the simplest and accurate.
Table 1 summarizes the analytical models described in the literature.
Models developed by Di Ludovico et al. [
20] and Ramaglia et al. [
22] are further differentiated in two model. The first one is calibrated for confined clay masonry and the second for confined tuff masonry. The models proposed by Faella et al. [
14] and Napoli and Realfonzo [
37] also stated two analytical models, one is defined as more accurate, while the second is defined as more straightforward. As mentioned, all models considered same framework except Krevaikas and Triantafillou [
21] model which adopts a nonlinear relationship between the lateral confining pressure
and the axial strength
. However, Krevaikas and Triantafillou [
21], Corradi et al. [
19], Di Ludovico et al. [
20] for clay Rao and Pavan [
36], and Ramaglia et al. [
22] for clay models have applicability for artificial masonry because models were mainly calibrated using a substantial number of experimental data pertaining to FRP-confined clay brick masonry columns. In contrast, Di Ludovico et al. [
20] and Ramaglia et al. [
22] for tuff is appropriate for natural stone masonry columns because it considered a few experimental results of FRP-confined tuff masonry. Lastly, the only analytical models of gaining strength applicable to both natural and artificial masonry are the models presented by Faella et al. [
14] and Napoli and Realfonzo [
37]. Additionally, merely these models estimate the effect of the mass density of masonry on the compressive strength.
3. Experimental Database
To ensure that the machine learning model has the best possible performance, a comprehensive database was used. This section includes a detailed description of the database, as well as an analysis of its input parameters.
3.1. Collected Database from Literature
Recently, Napoli and Realfonzo [
37] reported a comprehensive database of axial compressive tests conducted on unconfined and FRP-confined square/rectangular masonry prisms. The experimental database contains 286 specimens, of which 88 were unconfined and 198 were variable confined with basalt, carbon, glass, or steel FRP layers.
Approximately 76% of columns in the database were prismatic and made of artificial clay brick, of which 54% were variably confined with FRP systems and 22% were unconfined. Twenty-four percent of the remaining prismatic masonry consisted of natural blocks, of which 15 percent were FRP confined, and 9 percent were unconfined. The majority of natural blocks consisted of tuff masonry, after that a considerable quantity of calcareous stone and a few sandstones.
The majority of masonry specimens were constructed with a uniform arrangement of bricks. Nine percent of columns were constructed as hollow columns or cavity-filled hollow columns with a mixture of mortar and stone wastes.
Napoli and Realfonzo [
37] provided the essential data on 198 specimens subjected to axial compression tests and FRP confined. Details include the type and composition of the masonry, as well as arrangement of the masonry units and the mass density. When mass density is not specified in scientific literature, the typical range of
is reported.
The information comprise
and
as width and depth of the cross section, respectively,
is the column height;
is the prismatic column corner radius;
is the per cubic meter weight of the FRP sheet;
,
and
are the mechanical properties of FRP wraps, elastic modulus, ultimate tensile strength, and ultimate strain, respectively.
is the single layer thickness of the FRP sheet;
is the number of FRP layers;
,
and
are the width of the FRP strip, the clear spacing between two consecutive strips and the center-to-center distance, respectively, used for FRP discontinuous confinement;
is the length of overlapping for the FRP strips;
is the unconfined masonry compressive strength;
and
are corresponding coefficient of horizontal and vertical efficiency as per the guideline CNR-DT 200 R1 [
35];
is the effective ultimate lateral confining pressure of FRP wrap normalized by
unconfined masonry compressive strength;
is the compressive strength of the FRP confined masonry;
is FRP strain efficiency factor which is basically the ratio between the experimental ultimate hoop strain in the FRP wraps (
) and the ultimate strain from fiber coupon tensile tests (
). The values of
were calculated according to CNR-DT 200 R1 [
35]. The
≤ 1 is the parameter relying on the shape of the section and the type of FRP wrapping (continuous or discontinuous), estimated through the coefficients
and
. These parameters as well as the effective lateral confining pressure
are characterized by the correlations discussed in
Section 2 for square/rectangular columns. More details about specimen preparation, conditioning of the tests and strength measurements can be found in Napoli and Realfonzo [
37] and relative references.
3.2. Processed Database
Due to the importance of experimental uniaxial compression tests carried out on unconfined and FRP-confined square/rectangular masonry elements reported in the literature, this study uses existing studies to establish input parameters.
For successful a machine learning model, the database is reorganized, by removing and adding parameters and calculated values. In particular, the ratio of highest side over the shortest of cross-section () was introduced to consider the shape of confined cross-section. To consider the effect of the slenderness of columns, the ratio between cross-section thickness and height of the column () was calculated, where is always the longest side of the cross section. The layer thickness of the fiber sheet is recalculated by considering the number of FRP layers.
Researchers selected the following inputs based on literature and their judgment: masonry mass (), specimen size (, , , ), mechanical properties of the strengthening wraps (, , ), compressive strength of the unconfined masonry (), parameters related to the effective ultimate lateral confining pressure (, , ). The output parameter is the axial compressive strength of the masonry confined by FRP ().
Some data points have been removed or changed in the database. Eleven samples tested by Rao and Pavan [
36] were removed because of missing essential data about mechanical properties of the FRP wraps. Four sample from Corradi et al. [
19] were missing the value of
. Zero corner radius is assumed. When the mass density of masonry is not reported in the literature papers, an average typical range of
values is assumed. The final table of database contains 16 columns and 117 rows.
Table 2 summarizes all the ranges of variation of the main parameters in the reorganized database.
According to
Table 2, the side ratio of the cross-section, for all FRP systems varies between 1 (square columns) and 2 for prismatic specimens. The size of the columns as the ratio
was never greater than 0.85. The sheets of fiber used to wrap columns were unidirectional. The mechanical properties of the fiber sheets differed within the expected range. The number of FRP layers varied between 1 and 5 at all times. The investigated compressive strength values of unconfined columns,
were distributed over 2–14.33 MPa range. The compressive strength values of the confined columns,
were dispersed maintaining the range as approximately 2.79 to 44.87 MPa.
In order to extract more information regarding the mutual relationship between all input and output features in the dataset, the correlations between features are analyzed. This statistical measure is useful because it describes the relationship between two or more characteristics. In practice, the findings of this analysis will ultimately lead to the selection of the predictive model to be employed in order to maximize the accuracy of predictions. Among the methods available in the literature, Pearson’s method will be used to calculate the correlation coefficient:
with
as the experimental values,
the regression values, and
is the average of the simulation values. Where
and
are two features while overhead bar and subscript
signify the average value and the
observation, respectively. The expression in Equation (9) guarantees that the coefficient is between −1 and 1. The value 0 indicates that there is no correlation between a specific pair of features, whereas a value of 1 indicates a perfect positive and −1 indicates negative correlation relationship. This indicates that an increase in one quantity causes an increase (if 1) or a decrease (if −1) in the other quantity. The relationship between a quantity and itself, where the correlation coefficient is always 1, is a clear instance of a perfect positive correlation. In contrast, the correlation becomes weaker as the value approaches 0. In Pearson correlation, if two characteristics are independent, the magnitude of the coefficient is close to zero.
Figure 2 shows color map correlation matrix of the dataset features with correlation coefficient. It is revealed that the correlation between unconfined masonry strength
and unconfined masonry strength
is reasonably strong with the coefficient of 0.84. This behavior is expected.
In the recognized database,
Figure 3 illustrates the distribution of the targeted parameter which the confined compressive strength of masonry
.
Figure 4 illustrates the distribution of the selected design parameters.
4. Symbolic Regression Model
Symbolic regression (SR) is a statistical approach that can detect multivariate, nonlinear relationships between variables in sets of data. It generates a closed-form equation that represents a functional mapping of correlations that enables the prediction of the value of a chosen target variable based on the values of other variables. SR is especially interesting since it uses mathematical functions and operations such as addition, subtraction, multiplication, division, logarithm, and exponential functions to build an equation that represents the correlation between the input and target variables. Symbolic Regression is distinguished from conventional regression techniques where in traditional regression, the structure of the equation should be defined and stay fixed during the regression. However, SR can detect a suitable equation structure and optimizes numerical constants simultaneously. In general, SR does not require prior knowledge of model structure or function [
38]. The approach is comparatively simple for use by non-experts in computer-based methods. Additionally, the identified formula can be merely transferred and implemented in other software systems. John Koza [
39] popularized the genetic programming (GP) which developed as an evolutionary algorithm that searches for programs that solve a given problem without explicitly programming as a symbolic regression. Darwin’s Theory of Evolution inspired the description of GP as an evolutionary method for automatic programming. Simulating natural selection, a population of individuals, in the case of symbolic regression as mathematical functions and operations, is selected. By continually picking individuals with high quality and recombining them to minimize the error in the optimized target. GP is a simplification of genetic algorithms [
40] and, when executed, develops problem-solving procedures. GP, unlike genetic algorithms, permits variable-length programming such as symbolic expression trees.
SR, which is a simpler task for GP than full-fledged automatic programming, can identify only a single expression. Consequently, mathematical operators and functions are typically permitted for internal nodes in symbolic expression trees. The collection of final symbols includes some parameters of training dataset as well as constants. Consequently, when SR is performed using GP, the resulting program is a closed-form mathematical expression representing a statistical model.
GP uses the iterative process to evolve symbolic regression models. An initial population is formed by generating and evaluating a set of random expressions. Expressions with a greater capacity to accurately predict the target variable are deemed to be more suitable. Through executing the GP as cycle, new expressions are generated by frequently selecting and recombined old expressions undergoing random mutations. High ranking expressions have a superior likelihood of being selected and may be selected several times. The newly generated expressions are assessed and merged with the existing population to create a new population of expressions. Poorest fit expressions are eliminated in this step. The process typically ends when the stopping criterion is met which is usually reaching the ultimate number of generations. SR employing GP is compatible with sets of data containing millions of observations and dozens of variables.
In the present work, HeuristicLab [
41], an open-source software that provides a module for SR and is employed in C# on top of the .NET Framework, is utilized in this study. Multiple advanced features, including tree structure constraints based on grammar rules, factor variables, and automatic differentiation for memetic gradient-based optimization of numeric parameters, are supported by the software.
For the experiments presented, the following parameters were set population size = 1000, mutation probability = 15%, maximum tree depth = 10, and maximum tree length = 25. The collection of function symbols is (addition, subtraction, multiplication, division). As terminal symbols, numeric constants and variables from the dataset were permitted. All input variables were considered in the reorganized database. As the target variable, the compressive strength of the FRP confined masonry () was chosen. A total of 50% of the data was utilized for training and the remaining data were utilized for testing. Each trial’s data was allocated to either the training set or the test set, ensuring that no test-trial data points were observed during training. The results were achieved after 5000 generations. Symbolic regression is a stochastic technique that yields unique results for each run, despite identical inputs and settings. Thus, multiple repetitions of training were performed with the same settings to select the optimal model.
5. Results and Discussion
Genetic programming-based symbolic regression typically has no model restrictions. However, the desired model should be as clear and precise as possible. Therefore, it is desirable to restrict the size and complexity of the model. The model’s tree length was limited to a maximum of 25. The functional set of mathematical operations consisted solely of mathematical basics (addition, subtraction, multiplication, division). In addition, fewer influencing variables and constants are desirable. The arguments of the function set were further restricted to only variables and constants.
Using the method outlined in
Section 4, the symbolic regression algorithm derived a complex mathematical expression from the training and test matrix data. The procedure’s outcome is depicted in
Section 5.1. The model is then manually simplified by iteratively fitting parameters and removing parts from the model that contribute little to the quality of the prediction using the procedure outlined in
Section 5.2 to arrive at the final expression.
The optimization of model was according to the coefficient of determination (
). The coefficient of determination is in the range of 0 and 1. The value
= 0 indicates that the regression model does not fit the simulation data at all. In contrast,
= 1 means the regression model is fitting the simulation data perfectly.
5.1. Complex Symbolic Regression Modelling
This section describes the results of black-box modeling using symbolic regression and genetic programming. Symbolic regression modeling was achieved by employing freely evolving trees that incorporate no prior physical knowledge. The only limitations here are the maximum length and depth of the tree, as previously described.
After 5000 iterations of symbolic regression modeling, the equations discovered in each iteration are quite random combinations of process variables and are rarely repeatable. There were no similarities between the results of each run. The following is a report on the best quality symbolic regression model discovered in 5000 runs based on the coefficient of determination value over training date.
In
Figure 5, the best prediction symbolic regression model tree is reported. The model is with 23 lengths and 8 depths. Only 7 input variables out of 15 were considered in best quality symbolic regression model. The variables and their impacts are reported in
Table 3. Addition, subtraction, and multiplication are the only mathematical operations that were used.
The prediction quality of the symbolic regression-based black-box model for estimating the compressive strength of FRP-confined masonry is depicted in
Figure 6. In the left panel, the
value of the prediction model on training data is plotted. The right panel illustrates the prediction value of the model on hidden test data. The high coefficient of determination for the model on training data is reasonable and the predictive capability of the model is quite satisfactory on test data.
Statistical indictors for both training and test data are presented in
Table 4. An error analysis is carried out to evaluate the model quality. The coefficient of determination Equation (10), average relative error Equation (11), the mean absolute error Equation (12), mean squared error Equation (13), and root mean squared error Equation (14) are derived as follows:
Even though the model prediction quality in unseen data can be considered good, the training data showed better performance. MAE and MSE for training are 1.51 MPa and 3.48 MPa. Compared to test data, the error was 4.27 for MAE and 30.31 for MSE, correspondingly. It is observed that the error in training data is much lower RMSE than unseen test data.
The coefficient of determination for the whole dataset is determined as 0.83 which implies that the symbolic regression model was performing well.
Figure 7 illustrates the difference between the predicted results of the complex symbolic regression model and experimental data. It can be found that the results obtained from the complex symbolic regression model were well matched with experiments for both training and test data. Generally, the higher value of
indicates a good performance of the model.
5.2. Simplified Model
The algorithm for symbolic regression yields a model with low absolute and mean square errors, but it is quite complex. The model contains seven parameters which is summarized in
Figure 5. MAE and RMSE for training are 1.51 MPa and 1.87 MPa. Therefore, additional training and manually simplifying the model resulted in a simplified version of the complex symbolic regression model reported in
Section 5.1 and
Figure 5, wherein whole data was used in training to fit the model’s parameters and manually simplifying the model by removing model components with a small impact on the estimation quality of the dataset. This procedure yielded straightforward mathematical expressions.
Figure 8 illustrates the simplified symbolic regression model tree. The model has a length of 15 and a depth of 6. Only four of the fifteen input variables were considered in the simple mathematical expression. In terms of the number of inputs, it can be considered a reasonably simple symbolic expression, when compared to the complex symbolic regression model, which is very large and is primarily influenced by a greater number of inputs. The variables and their impacts are reported in
Table 5.
Typically, tree-based genetic programming is utilized, and
Figure 8 depicts the simplified symbolic regression solution as a symbolic expression tree. Symbolic regression based on genetic programming is well-suited to regression problems for two reasons: first, the underlying model structure is unknown, and second, the result is a mathematical expression that can be easily manipulated and implemented in screw-calculation programs:
As demonstrated by Equation (15), the mathematical expressions are quite straightforward, and empirical correction factors are always approximated accurately. As expected, changing some part of the symbolic expression tree would have effects of the performance of the model and its statistical indictors.
The performance of the simplified symbolic regression model, in which the coefficient of determination for the whole dataset is determined as 0.91 which implies that the symbolic regression model was performing well.
Figure 9 shows the resemblance between the predicted results of the simplified symbolic regression model and experimental data. It can be found that the results obtained from the symbolic regression model were well matched with experiments.
Figure 10 depicts four plots for the simplified model’s partial dependence on each of the four input parameters. The proposed model has a smooth response and nonlinear relationship with the compressive strength of masonry.
The simplified model from Equation (15) has a slightly higher prediction accuracy compared to the complex model (
Figure 5) where the value of
R2 is 0.91 against 0.83. In addition, the simplified model is much simpler to inspect and implement, particularly for hand calculations. Statistical indictors for the mathematical expression obtained by simplified symbolic expression are presented in
Table 6 for the whole set of data.
The algorithm’s ability to select only the most influential input variables and ignore other input data is viewed as a positive trait. while keeping in mind that the input covers majority of column properties. Regarding the symbolic regression approach and the other computational methods which are capable of dealing with a wider range of application in different fields, the physical interpretation of the underlying process should be attainable. However, the expression usually possesses black-box model characteristics and should be interpolated carefully when applied to engineering problems where the physical meaning is exceptionally important. The quality of machine learning algorithms is connected to the amount of data to discover the connections within the dataset parameters. Furthermore, a high quality dataset should be representative and holds the required insights to make the truthful predictions. Inconsistency in collecting the dataset or defining the parameters will drastically affect the ability of artificial intelligent model to generalize the prediction and given honest scientific finding.
The relative frequency histogram of residual error is an additional crucial graph derived from the evaluation of the simplified symbolic expression. The relative frequency is the number of occurrences of a value within a data set. Further,
Figure 11 depicts the relative frequency histogram of model error residuals. From the bar chart, the error concentration and minimum and maximum residuals error values can be visualized in advance.
6. Comparison of Proposed Symbolic Regression Model with Existing Formulas
The accuracy of some strength expressions discussed in
Section 2 has been compared and illustrated in
Figure 12. Major theoretical models available to predict strength gaining for prismatic masonry columns confined by FRP in compression are considered. twelve distinct approaches to defining the compressive constitutive law are: Krevaikas and Triantafillou [
21], Corradi et al. [
19], Di Ludovico et al. [
20] for clay and tuff, Faella et al. [
14] simplest and accurate, CNR-DT 200 R1 [
35], Rao and Pavan [
36], Ramaglia et al. [
22] for clay and tuff as well as Napoli and Realfonzo [
37] simplest and accurate.
R2 value, ARR, MAE, MSE and RMSE for each model along with the proposed model are calculated for obtaining the approach’s reliability (
Table 7).
From
Figure 12 and
Figure 13, it can clearly be observed that the proposed model has an excellent agreement of regression value, 0.91 compared to the existing models. Di Ludovico et al. [
20] for tuff, Faella et al. [
14] accurate, CNR-DT 200 R1 [
35], and Napoli and Realfonzo [
37] shows immediate lower regression value as 0.82 which is also well below than the proposed model. Other models followed slightly lower regression. This finding has also been supported by the ARR, MAE, MSE and RMSE outputs. It can be concluded that the Napoli and Realfonzo [
37] simplest model might offer good correlation with the experimental data although the proposed model is more accurate than this model as well. The experimental data for the proposed model shows ARR, MAE, MSE and RMSE as 0.16, 1.86, 5.94 and 2.44 whereas the Napoli and Realfonzo [
37] simplest illustrates 0.18, 2.29, 12.67 and 3.56, respectively. The very less RMSE proves the superior accuracy of the proposed model. The Faella et al. [
14] simplest model the trend with slightly lower value immediately after the Napoli and Realfonzo [
37] simplest model though it has more MSE value.
Figure 13 also reflects the data very close to the regression line without any major deviation. Whereas a few widely deviated values of other model made the differences proving the suitability of the proposed formulation.
During the comparison of the experimental results with the analytical predictions, the average ratio equal to the values of theoretical prediction over experimental values divided by number of samples was reported. Two of the major statistical metrics to evaluate the confinement model’s performance are namely average ratio and coefficient of variation (COV) according to the Equations (16) and (17), respectively.
The average ratio (Prediction/Experiment), coefficient of variance (COV), number of predictions less than 10% error, highest predictions and lowest predictions are also computed to validate the accuracy of the current model to the existing ones. The average ratio for the proposed model is 1.03 which is very good as only 3% variation is there for prediction compared to the experimental date. Faella et al. [
14] accurate model offers the average ratio as 1.0; whereas Ramaglia et al. [
22] for tuff and Napoli and Realfonzo [
37] accurate shows it as 1.01 which are very close to the proposed model. The model better fits with results for both the average ratio and COV. The proposed model makes accurate predictions with an average ratio of approximately 1.03 and COV as 22.4%. Other models have elevated COV values. Nevertheless, overestimated predictions for strength of confined columns were observed by Corradi et al. [
19] model by 33% greater average value than experimental output.
The models of Di Ludovico et al. [
20] for clay, Faella et al. [
14] simplest, CNR-DT 200 R1 [
35], Ramaglia et al. [
22] for clay gives non-conservative predictions of the specimens by 5%, 16%, 23% and 2% of the practical values though the COVs are little higher. Among all the models, the Corradi et al. [
19] seems to be most non-conservative to overestimate the axial strength. The primary discrepancy between experimental data and theoretical predictions is that models overestimate the effective strain in the FRP jacket, resulting in an overestimate of the effective confining pressure. Reducing the maximum allowable strain would improve the model’s accuracy.
Table 8 also displays experimental and theoretically predicted values of compressive strength of FRP-confined prisms subjected to axial loading in terms of number of predictions with less than 10% error. It is evident from this output that the proposed model performs significantly better than the other models. Comparison is made between the performance of existing analytical models and the analytical expression proposed here. The number of predictions less than 10% error is 53 for the proposed model which is well above the immediate lower value of Di Ludovico et al. [
20] as 24 followed by Ramaglia et al. [
22] for Tuff, Krevaikas and Triantafillou [
21], Faella et al. [
14] accurate and Napoli and Realfonzo [
37] accurate showing around 16 in number. This behavior is supported by the highest prediction value of 1.74 and lowest prediction value of 0.46 as well, which are much better than the other models. It means that the percentage variation of the peak highest and lowest values is more sensible for the proposed model. The proposed analytical model predicts the compressive strength of FRP-confined masonry prisms subjected to axial loading better than all existing analytical models.