3.2. Selection of Uncertain Structural Model Parameters
The selection of uncertain structural model parameters is part of the definition of the structural model class and plays an important role on effectiveness of the Bayesian FE model updating and accuracy of model for future response predictions. Therefore, a comprehensive sensitivity analysis is performed to determine the structural model parameters to be updated. To compute sensitivities, the parameter values are perturbed by ±10% and their corresponding natural frequencies are calculated through the FE model. Subsequently, the sensitivity of each parameter for each mode is obtained as the ratio between the frequency change and the parameter range. Moreover, complex structure systems such as long-span bridges are typically composed of a large number of structural members. Ideally, one would like to treat each structural member (e.g., girder beam elements) as an individual substructure in the FE model so that the physical properties (mass and stiffness) of each member can be inferred. However, the information from a limited number of sensors will generally be insufficient to support such a member-level resolution of parameter inference, so substructure assemblages, each consisting of multiple structural members may be necessary in order to reduce the number of model parameters in
. Clustering analysis is therefore employed in this study to group the structural model parameters to form different model classes. Parameters with similar sensitivities are placed in the same cluster to ensure that the parameters from the same cluster have similar effects on the updating process [
33].
Sensitivity analysis in conjunction with clustering is performed herein to generate a series of effective uncertain structural model parameters. Firstly, to perform sensitivity analysis, a set of physical parameters from the FE model is selected. The Jindo Bridge FE model contains 129 beam elements in the bridge girder. Since the bridge girder is symmetric with respect to the location of mid-span, both mass and stiffness (Young’s moduli and moments of inertia in local x and y directions) corresponding to the beam elements of half of the bridge girder are considered, leading to a total of 260 individual physical parameters. Subsequently, the sensitivities of these 260 parameters to the natural frequencies of the structure are analyzed.
To perform clustering, cosine distance [
34] is first used as a proximity measure to compute the closeness among the sensitivities. Then, hierarchical clustering is applied to determine the final clusters. Within the hierarchical clustering, the Un-weighted Pair Group Method with Arithmetic Mean (UPGMA) is employed to link the similar sensitivities. More specifically, the pairwise cosine distances of the sensitivities are calculated for each two parameters, by which the parameters are first grouped into a maximum of 30 binary clusters based on the similarity of sensitivities. In other words, if there are more than 30 parameters, some groups will contain more than one parameter. The horizontal axis of
Figure 3a shows these 30 initial clusters. Subsequently, close binary clusters are put near each other in the horizontal axis of the binary tree and they are linked based on the UPGMA method to form the hierarchical tree. Distance criterion is used to form the final clusters. The distance threshold is selected as 0.4 for mass and 0.7 for all other physical parameters to produce a reasonable number of clusters.
By performing separate clustering, three clusters for each type of physical parameters including mass, Young’s moduli,
Ixx (moment of inertia in x-direction), and
Iyy (moment of inertia in y-direction) and thereby 12 clusters in total are obtained. The binary tree to the three mass clusters is shown in
Figure 3a.
Figure 3b shows the sensitivity matrices of the three mass clusters. Note that clustering is based on the cosine distance of the sensitivity vectors with respect to 14 modes; the parameters within the same cluster show similar sensitivity to these modes on an average sense. As mentioned previously, individual mass parameters are assigned to the beam elements of half of the bridge girder, leading to 65 parameters for mass. In this figure, the x axis refers to these mass parameter numbers. The grouping results from the cluster analysis are presented in
Figure 4 by showing the physical locations of each parameter group using different colors.
Finally, two model classes
and
are defined based on the clustering results. Model class
corresponds to a simplified class of structural models by selecting one parameter for each of the four types of physical parameters (mass, Young’s modulus,
Ixx and
Iyy). Thus, the number of uncertain parameter components in
is only four in total. Regarding model class
, it consiss of 12 parameter components in
corresponding to all the clusters shown in
Figure 4.
Table 1 tabulates the model parameter components for the bridge girder in model classes
and
respectively. As will be discussed later, in the Bayesian updating process, all uncertain structural model parameters are normalized to 10 using their initial physical values. The updated parameters will then be multiplied to their initial physical values to obtain the update model for further analysis.
3.3. Numerical Investigation on the Comparison of the Prediction-Error Precision Parameter Treatments
In this section, Bayesian FE model updating is performed on the bridge model. Three cases are considered corresponding to the three aforementioned treatments of prediction-error precisions, respectively, in which Case 3 considers the proposed method that marginalizes over the prediction error precision parameter. The likelihood function for the structural model parameters is defined in (2) and (9) by using the synthetic modal data of 14 natural frequencies and mode shapes. Mode matching is employed to reorder the analytical modes based on the measured ones, in which Modal Assurance Criteria (MAC) is utilized.
The simulated modal data is generated from model class
, since it has relatively higher complexity. To introduce modeling uncertainties, a coefficient is multiplied to the true value of each mass or stiffness parameter as shown in
Table 2. The noisy modal data of the fourteen modes are simulated by adding 2% and 5% zero-mean Gaussian random noise to the analytical natural frequencies and mode shapes, respectively. By repeating this process, 10 sets of noisy modal data are generated. To perform Bayesian model updating, model class
is employed and all uncertain structural model parameters are normalized to 10 using their nominal values.
To characterize the posterior PDF of the structural model parameters, TMCMC sampler is employed for each case and 1000 samples are drawn in each stage. Uniform distributions are assigned to the prior for all normalized structural model parameter components in with a range of 8 to 12. For the second case with prediction-error precisions updated, the Exponential priors in (3) and (4) are assigned for and with hyper-parameters that is, these priors are almost flat. While for the third case with error-precisions marginalized, the priors of hyper-parameters and are taken as uniform over large open intervals that start at zero. The number of stages in TMCMC sampler corresponding to each case is approximately 20, meaning that in total 20,000 samples are required to be drawn for all stages. The overall computational cost of TMCMC for each case is about 15 h using a Dell computer equipped with an Intel® Xeon® CPU E3-1241 and 16 GB of RAM running a 64-bit version of Windows 7.
In
Figure 5, the results of TMCMC samples of the structural model parameters in the first and last stages of TMCMC are demonstrated, which show how the samples converge from the prior to the posterior as the tempering parameter
increases from 0 to 1. In this numerical investigation, about 20 stages were needed to achieve convergence. Compared with the case with error precisions updated, the samples are much more concentrated in the final stage for the case with the error precisions marginalized, indicating much higher identification confidence. The reason is presumably that the model parameters are much more sensitive to the prediction-error precisions
and
than their corresponding hyper-parameters
and
This is consistent with our discussion in
Section 2.4. However, for the case with constant precisions, the samples of I
yy versus I
xx at the last stage are even more concentrated, though at two incorrect locations. This is due to the fact that the uncertainties of the prediction-error precision parameters have been overlooked by estimating them directly, which leads to inaccuracy in the model updating results and underestimation of the posterior uncertainties of structural model parameters, as we discussed in
Section 2.4.
Table 3 tabulates the results in terms of the relative errors between the predicted natural frequencies by using the posterior mean of
compared with the simulated natural frequencies, as well as the MAC values between the predicted and simulated mode shapes, for all three cases, where the error and MAC values for the
i-th mode are computed as:
where
is the maximum a posteriori (MAP) value of the structural model parameter vector. It is seen that the MAC values are mostly larger than 0.9 for all cases, which are reasonably high. Regarding the frequency errors, the largest error (absolute value) is 6.91% and 6.70% for the cases of constant and updating error precisions, respectively. While for the advanced case with prediction-error precisions marginalized as “nuisance” parameters, the errors range from 0.3% to 3.6%, most of which are smaller than the corresponding values for the other two cases.
From the comparison of the Bayesian updating results for the three cases, it is shown that the performance of the third case with prediction-error precision marginalized is superior. This is due to the fact that the posterior uncertainties of the error precision are preserved properly by integrating them as “nuisance” parameters. However, the first two cases with constant and updated error precisions have limitations that the results depend on the specified values of computed or sampled error-precisions. Therefore, prescribing a robust treatment for prediction-error precisions for Bayesian FE model updating is important because of the significant effect of the prediction error precision parameters, especially for large-scale complex civil structures such as long-span bridges. The merit of the new treatment with error precision marginalized is that the concern of the sensitivity of these parameters on the model updating performance has been alleviated significantly in a rigorous manner.
3.4. Real-World Application of Bayesian FE Model Updating on Jindo Bridge
In this section, Bayesian FE model updating is performed for Jindo Bridge by using long-term monitoring data from a dense deployment of a WSN that include 113 wireless sensor nodes, each composed of an Imote2 (MEMSIC) with an SHM-A [
35] or SHM-H sensor board [
36], both equipped with high-sensitivity accelerometers and with relatively higher sensitivity for the SHM-H. The sensor nodes were attached to the steel bridge surface using magnets. Data was collected during a twelve-month period starting from 1 September 2011 to 30 August 2012 is considered in this study.
Figure 6 presents an example of the acceleration time histories from one channel measured on 5 June 2012 from the Jindo Deck. Details of the sensor network and data are presented in [
37,
38]. The Jindo Deck and Haenam Deck sensor networks were operated by two base stations and hence data collection was conducted independently between these two networks. Therefore, modal identification of Jindo Deck and Haenam Deck were performed separately. A combination of Natural Excitation Technique (NExT) [
39] and Eigensystem Realization Algorithm (ERA) [
40] was used to perform system identification of the bridge. Nine vertical, four lateral and one torsional modal properties of the deck were identified from the acceleration measurements.
Environmental and operational conditions such as temperature and excitation amplitude have been recognized to potentially impact natural frequencies. The wireless smart sensor network measured the temperature during the deployment, offering an excellent opportunity to evaluate the relationship between modal frequencies and temperature. It was observed in Asadollahi and Li [
38] that as temperature increased, the modal frequency decreased, while the excitation amplitude did not appear to have a clear relationship with the modal frequencies in this particular case. Therefore, the effect of temperature was removed from modal frequencies based on the linear relation observed between temperature and modal frequencies before performing Bayesian model updating. Complete descriptions of the modal identification results can be found in [
38].
Figure 7 presents the identified modal frequencies from the 1st vertical mode during the one-year monitoring period as an example. The measured data for model updating typically consists of the extracted modal parameters including the natural frequencies and mode shapes since they significantly affect the dynamic response of the bridge structure. About 400 sets of modal data
were identified from the long-term monitoring and are used in this study.
Model updating problems are intrinsically ill-conditioned and often ill-posed when dealing with noisy measurement data, leading non-unique solutions. Under the Bayesian framework, incorporating as much prior knowledge of the structure as possible can help alleviate the ill-conditioning and ill-posedness [
41,
42]. Therefore, upper and lower bounds are defined for the prior uniform distributions of the structural model parameters. In particular, 5% variation from the nominal values (equal to 10) is considered for the stiffness related parameters because detailed and relative accurate information about the section and material properties of the deck and pylons is available. However, a larger range is selected for the bridge girder mass parameters since only mass of main structural members is considered in the initial FE model while other sources of masses are not. The unconsidered masses could come from structural elements such as deck wearing surface, wind fairings, cable anchors, stiffeners, splices and bolts, and non-structural elements including guard rails and barrier curbs, as well as the live loads (vehicular loads). Based on the length and width information of the deck, calculation indicates that the concrete deck slab alone could add approximately 40% of the considered mass from the structural elements. Therefore, we set the upper bound to 18, which means that we allow mass to be increased by up to 80% of the nominal value of element masses. The lower and upper bounds of the uniform prior for all model parameter components are tabulated in
Table 4.
The modal data of all fourteen natural frequencies and the first three vertical mode shapes are used to construct the likelihood function in (9). The likelihood function is obtained by marginalizing the prediction-error precisions and since in the previous section this treatment was found to achieve the best Bayesian model updating performance. Two runs of TMCMC were performed to draw posterior samples (1000 samples per stage) for the structural model parameters corresponding to the two model classes and . The number of stages for both runs were around 85, leading to a total of 85,000 samples drawn for all stages.
Figure 8 and
Figure 9 show the samples of several structural model parameter components corresponding to the initial (stage number
), middle (stage number
, where
is the total number of stages) and final (stage number
) stages of TMCMC for the two runs corresponding to model classes
and
respectively. It can be seen that the region in the structural model parameter space with significant posterior probability content is very concentrated compared to the prior distribution of
This is achieved efficiently in TMCMC since it constructs a series of intermediate PDFs that gradually converge to the posterior PDF from the broad prior PDF. It is worthwhile noting that although the samples of each parameter component at the final stage of TMCMC look like only a single point as shown in
Figure 6 and
Figure 7, all samples are included in the figures. The concentration of the posterior samples is due to the availability of 400 sets of modal data in the model updating process, leading to high certainty in the posterior distributions. Compared with
Figure 6 and
Figure 7, the samples of the final stage in
Figure 5 (numerical investigation) are more scattered since only 10 sets of modal data were employed in the updating process. In
Table 5, we tabulate the posterior mean and coefficient of variation (C.O.V.) values of model parameter samples together with the percentage change of posterior means compared with those of the initial FE model for both model classes. It is observed that the C.O.V. values are generally smaller for model class
than
, indicating smaller posterior uncertainties and higher confidence in the model updating results. In addition, the percentage change in structural model parameter components compared with the initial calibration values are relatively larger for model class
(up to 79.54%), showing more information exacted from the available data.
By using the prior and posterior distributions of structural model parameters, prior and posterior predictions of modal parameters
can be expressed in (23) and (24), respectively:
where
is the model predictive PDF which has the similar form of Equation (9);
and
are the predictive natural frequency and mode shape vectors, respectively. It is known from the sampling theory that the generation of samples
which characterize the posterior prediction PDF
(prior prediction PDF
), can be achieved simply by the following procedure: first draw samples of
from the posterior PDF
(prior
), and then collect samples of
generated from the predictive PDF
, in which we incorporate the samples of
sequentially.
Figure 10 demonstrates the predicted values of natural frequencies of the first six vertical modes together with the target natural frequencies (identified from the data) for both model classes. Similar to
samples, the predicted natural frequencies from the posterior samples are concentrated within relatively small regions, most of which are close to the target frequencies. By using the error index defined in (21), the errors of the prior and posterior prediction means of natural frequencies are computed and tabulated in
Table 6 for both model classes. It is observed that the errors of posterior predictions are reduced significantly for most modes compared with the values of prior predictions. Only three modal frequency errors are not reduced in posterior predictions, which include the fifth vertical (VM-5) and fourth lateral (LM-4) modes as well as the torsional mode (TM-1). By comparing the results of the two model classes, the large absolute errors of the posterior predictions (e.g., modes VM-5, LM-3, LM-4 and TM-1) for model class
are generally smaller than that of
In addition, the absolute errors of the posterior predictions for the lower modes are generally smaller than 10% for model class
which is thought to be acceptable for damage assessment using these results. Meanwhile, it should be noted that the absolute errors in
Table 6 are not as small as the values in the simulated data case (
Table 3), this could be partially due to the higher noise content in the real-world SHM data for modal identification and model updating. A possible strategy for better model updating performance is to increase the quantity and quality of the available time-domain data and have more identified modes for model updating.
In addition to the investigation of Bayesian model updating, Bayesian model class assessment is also studied to quantify the plausibility of proposed candidate model classes given the measured data. Based on the posterior samples of
from TMCMC, the posterior probability
the log evidence
the posterior mean of log likelihood
, and the relative entropy (information gain)
are computed and compared in
Table 7 for the two model classes
and
. The primary purpose of this table is to compare the trade-off between the data-fit of the model class and the information-theoretic complexity for each model class to select a more optimal one. It is seen that model class
produces a larger posterior mean of log likelihood and a larger relative entropy in comparison with those of
, indicating a better data fit and a higher model complexity for
. Regarding the trade-off between data fit and model complexity, the log evidence value of model class
is much larger than that of
, implying that it gives a much better balance between the data-fit and the model complexity [
8]. Although
has a larger expected information gain (third row), showing that it extracts more information from the data than
, its mean data-fit is so much larger than that of
(second row) because of more detail parameterization that its log evidence is dominant over that of
, making
a much more plausible model class based on the data (first column). Since the posterior probability
is so much larger than
the model class
which has more detailed parameterization is relatively more probable conditional on the data
and so its contribution is dominant over that of
when making response predictions of the bridge structure.
In summary, Bayesian FE model updating of the bridge using real-world data from long-term SHM leads to an accurate FE model which has vibration properties close to the real structure system and produces more reliable predictions of structural responses. In addition, the plausibility of competing model classes for the bridge system can also be assessed based on their posterior probability from Bayes’ Theorem, giving a rigorous method for structural model selection.