1. Introduction
The sensors are the basic instruments of the structural health monitoring system, while the measurements of sensors are the necessary information for estimating the structural safety. However, the operation duration of a sensor is limited and the sensor measurements may be abnormal because of sensor failure or environmental influences [
1]. The technique for reconstructing the sensor information should be developed as a result, which can ensure saving the complete monitoring information for structural safety estimation. The monitoring information is diverse in the different measured locations and the different types of structural responses, which are correlated with each other to some extent [
2]. That is to say, it is necessary to reconstruct the information for the failure sensor using the effective correlation model between the failure sensor and the other normal sensors.
For the sensor information reconstruction, the researchers proposed several methods to solve the question. For the problem of data distortion in the structural health monitoring system of a bridge, Huang [
3] analyzed the relationship of the monitoring data by using Granger causality testing [
4] in which the sensor data with a closer relationship is selected to be the input vector for an extreme learning machine to recover the missing sensor signal data. However, the individual prediction value could be a large error, and the training results need to be further improved because the tendency of the predicted data is not consistent with the theory value. The radial basis function (RBF) neural network model is another method to recover the unreliable data, which is also easy to fall into a local optimum [
5]. The Kalman filtering method was proposed to build the model between structural displacements and structural accelerations [
6], in which the structural model parameters and external excitation should be known in advance. The method of data compression sampling is used for data recovery of wireless sensors, while the recovery data error would be larger for the accelerations when the sparse of the original data becomes worse [
7]. Feng [
8] proposed a data recovery method based on a least square support vector machine. A forecasting model based on the method is established, and signal recovery is accomplished by on-line learning diagnosis. However, this method mainly aims at the data reconstruction of the sensor within short duration, and the effect is not good for a high sampling rate and for long-term failure prediction.
This paper, thus, focuses on the structural responses and the correlation model between the reconstructed variable and the response variable is established, which can be used for reconstructing the sensor information. In this paper, the correlation degree and the partial least square method are applied to the data reconstruction of the fault sensors. As the strain sensors are placed on the roof steel structure of Shenzhen Bay Stadium, the strain measurements represent the distributed responses of the structure and have an inherent association, which is suitable to verify the effectiveness of the proposed method in the paper as well.
2. Correlation Model of Monitoring Data
2.1. Correlation Analysis of Monitoring Data
The monitoring structural responses from different locations and different types of sensors are in the correlated relationship. The Pearson correlation coefficient is an index to measure the linear correlation degree of structural responses.
The structural response matrix of
m measuring points under the load in
n time steps is denoted as
X:
In the Equations (1) and (2),
represents the measurements of the measuring point
i under the load of
n time steps. When the fault sensor works in normal, the measurement of the measuring point where the fault sensor located is denoted as
. The Pearson product-moment correlation coefficient [
9] of the measurement from the arbitrary measuring point and the measuring point where the faulty sensor located is denoted as
ρiIn Equation (3), and represent the response variance of the measurement point i and the fault sensor, respectively; is covariance, and represent the measurement of point i and the fault sensor in the kth time step, respectively; and are the mean value of the measurements and , respectively.
2.2. Correlation Modeling Using Partial Least Squares and Multiple Regression
2.2.1. Reconstructed Variable and Response Variables
The correlation coefficients of the normal measurement of fault sensor and the measurements of the other sensors are calculated, which are also called the correlation coefficients of the reconstructed variable and response variables. The correlation matrix can be denoted as
P:
The correlation coefficients are sorted from larger value to small value based on their absolute values, while the updated correlation coefficient’s sorting matrix can be denoted as
DP:
(
ρ)
i represents the
ith correlation coefficient absolute value, while (
ρ)
1 is the largest correlation coefficient absolute value and (
ρ)
m is the smallest correlation coefficient absolute value, respectively. The corresponding measurement to (
ρ)
i is denoted as (
x)
i, while the updated structural response sorting matrix can be denoted as
DX:
The faulty sensor information is considered as the reconstructed variable
, and
p measurements are selected as response variables
XP:
2.2.2. Establishment of Correlation Model Using the Partial Least Square Method
The reconstructed variables and response variables are standardized as:
The response variables
and the reconstructed variable
are standardized into
and
, respectively:
The main components set of
and
can be obtained by using principal component analysis, which are denoted as
and
, respectively:
The regression equations regarding
and
using partial least squares [
10] and first principal components are:
where the regression coefficient vector is:
The regression equations for the
hth principal component are:
where the regression coefficient vector is:
The relationship between the
hth principal component
th and the
hth axis of
is:
where
is the unit eigenvector corresponding to the maximum eigenvalue
of matrix
.
According to the Equations (18), (19), (22), and (23),
and
can be expressed as:
Substituting Equation (26) into Equation (22) yields:
Substituting Equations (26) and (29) into Equation (27) yields:
Substituting Equation (30) into Equation (26) yields:
Substituting Equation (31) into Equation (28) yields:
Assuming the number of the principal components is
h when the principal components satisfy the cross-validation principle. Meanwhile,
approaches 0, which can be ignored. Equation (33) is written as:
Substituting Equations (14) and (15) into Equation (34) yields:
Substituting Equations (12) and (13) into Equation (37), the correlation model between the reconstructed variable and the response variables is:
2.2.3. The Reconstruction Error Evaluation Index
The average absolute error between the reconstructed value and actual value
ea is:
The average relative error between the reconstructed value and actual value
eb is:
where
represents the reconstructed response of
kth time step of the supposed fault sensor, and
represents the actual response of
kth time step of the supposed fault sensor, respectively.
3. Reconstruction to Stress Measurements of Shenzhen Bay Stadium
3.1. Project Description
The structural health monitoring system of Shenzhen Bay Stadium began to be implemented in 2010, while the continuous monitoring of the steel roof in both the construction phase and service phase began to be carried out at the same time [
11]. The stress measurements of the sensors located at the support members are chosen as the basic analysis data for example. The layout of sensors located at the support members are shown in
Figure 1 and
Figure 2. There are six supports, two rods of which are being monitored and four stress monitoring points are arranged on each rod; the sensor installation of support is shown in
Figure 3. For example, the number of eight stress monitoring points at support 3-1 can be denoted as 3-1-1, 3-1-2, 3-1-3, 3-1-4, 3-1-5, 3-1-6, 3-1-7, and 3-1-8, respectively.
3.2. The Correlation Model of the Reconstructed Variable and the Response Variables
The sensor at location 3-1-4 is assumed to be failure on 16 May 2014. The stress monitoring information of this point is considered as the reconstructed variable. The measurements of the other locations are considered as the available response variables. The stress monitoring data from 7 May 2014 to 10 May 2014 are selected to calculate the correlation coefficients between measurements from location 3-1-4 and the other locations, which are shown in
Table 1.
In Equation (38), there is no exact number demanded of the response variables, which may depend on the detailed examples and the special value of the correlation coefficients. Generally speaking, the historical measurements can be used to identify the effectiveness when the different number of responses variables selected. If four response variables are chosen to build the correlation model, that is to say,
p = 4, then:
The correlation model of the reconstructed variable and the response variable is:
The stress monitoring data of the stress sensor 3-6-6, 3-3-2, 3-1-8, and 3-2-8 are used to reconstruct the stress measurements of the sensor located at point 3-1-4 on 16 May 2014. The contrast curves between reconstructed values and actual values are shown in
Figure 4. The reconstructed values have the same trend as the actual values, and there are small differences between them. The average absolute error is 0.89 MPa and the relative error is 2.71%.
3.3. Influence of Correlation Degree on Reconstruction Error
In order to discuss the influences of the selection of response variables, two additional cases are discussed, which are:
and:
The correlation model using
is:
The contrast curves between reconstructed values and actual values are shown in
Figure 5. The average absolute error is 0.97 MPa and the relative error is 2.97%.
The correlation model using
is:
The contrast curves between reconstructed values and actual values are shown in
Figure 6. The average absolute error is 1.03 MPa and the relative error is 3.13%.
The errors of the reconstructed information using different correlation coefficients of the response variables are summarized in
Table 2.
It can be seen from
Table 2 that the errors become larger when the coefficients of response variables become smaller, in which the average absolute error and relative error are minimum when
XP1 is chosen as the response variables. It indicated that the response variables with closer correlation to the reconstructed variable should be chosen, which is better for the reconstruction of the sensor measurements.
3.4. Influence of Monitoring Data Quantity on Reconstruction Error
In order to discuss the influences of the selection of response variables, two additional cases are discussed. The original stress monitoring data is collected from 7 May 2014 to 10 May 2014 to calculate the correlation coefficients between measurements from location 3-1-4 and the other locations. The stress monitoring data of two additional cases are from 7 May 2014 to 12 May 2014 and from 7 May 2014 to 14 May 2014, respectively.
The correlation model using stress monitoring data from 7 May 2014 to 12 May 2014 is:
The contrast curves between reconstructed values and actual values are shown in
Figure 7. The average absolute error is 0.65 MPa and the relative error is 1.99%.
The correlation model using stress monitoring data from 7 May 2014 to 14 May 2014 is:
The contrast curves between the reconstructed values and the actual values are shown in
Figure 8. The average absolute error is 0.22 MPa and the relative error is 0.68%.
The errors of the reconstructed information using stress monitoring data from different time periods are summarized in
Table 3.
It can be seen from
Table 3 that the errors become smaller when the quantity of stress monitoring data become larger, in which the average absolute error and relative error are a minimum when the stress monitoring data from 7 May 2014 to 12 May 2014 is chosen to build the correlation model. This indicated that the larger quantity of stress monitoring data should be used, which is better for reconstruction of sensor measurements. In theory, the number of measurements larger than the unknown coefficients is the basic requirement for regression function identification; in fact, there are noises and temperature effects included in the measurements, the measurements in at least one full day are better.
3.5. Influence of Response Variable Number on Reconstruction Error
In order to discuss the influences of the different number of response variables, three additional cases are discussed, which are:
The correlation model using
is:
The contrast curves between reconstructed values and actual values are shown in
Figure 9. The average absolute error is 0.80 MPa and the relative error is 2.45%.
The correlation model using
is:
The contrast curves between reconstructed values and actual values are shown in
Figure 10. The average absolute error is 0.80 MPa and the relative error is 2.44%.
The correlation model using
is
The contrast curves between the reconstructed values and the actual values are shown in
Figure 11. The average absolute error is 0.77 MPa and the relative error is 2.36%.
The errors of the reconstructed information using different number of the response variables are summarized in
Table 4.
It can be seen from
Table 4 that the errors become smaller when the number of response variables become larger, in which the average absolute error and relative error are minimum when
XP7 is chosen as the response variables. However, there is only a slight difference between these discussed cases, and the number of the response variables with the same level of correlation coefficients influencing the errors of the reconstructed variables.
4. Conclusions
In this paper, a method of sensor information reconstruction based on the correlation degree is proposed. The monitoring information of a faulty sensor can be reconstructed by the correlation model and the measurements of other normal sensors. Based on the structural health monitoring system of Shenzhen Bay Stadium, the stress measurements of sensors located at support members are used to verify the proposed method. Furthermore, the influences for the errors of reconstructed monitoring information of a faulty sensor caused by three kinds of factors are discussed in the paper, which are the selection of response variables with different correlation, the quantity of the measurements from different time periods, and the number of response variables. The selection of response variables and the quantity of the measurements from different time periods are two main factors, which influence the reconstructed information errors for the faulty sensor directly. The number of the response variables influences the errors of reconstructed information a little if the selected response variables are in the same correlation level to reconstructed variable. Furthermore, the proposed method is verified its effectiveness to a real world project.
Acknowledgments
This research is supported by National Science Foundation of China (Grant No. 51678201, 51308162), and the Fujian National Science Foundation (2014J01171).
Author Contributions
Wei Lu wrote the paper; Wei Lu and Jun Teng conceived and designed the Structural Health Monitoring (SHM) System of Shenzhen Bay Stadium; Chao Li conducted experimental data analysis; Yan Cui conducted experimental data acquisition and collation.
Conflicts of Interest
The authors declare no conflict of interest
References
- Yeung, W.T.; Smith, J.W. Damage detection in bridges using neural networks for pattern recognition of vibration signatures. Eng. Struct. 2005, 27, 685–698. [Google Scholar] [CrossRef]
- Agrawal, R.; Imielinski, T.; Swami, A.N. Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, 26–28 May 1993; ACM: Washington, DC, USA; pp. 207–216.
- Huang, Y.W.; Wu, D.G.; Li, J. Structural healthy monitoring data recovery based on extreme learning machine. Comput. Eng. 2011, 37, 241–243. (In Chinese) [Google Scholar]
- Hu, S.R.; Chen, W.M.; Cai, X.X.; Liang, Z.B.; Zhang, P. Research of deflection data compensation by applying artificial neural networks. In Proceedings of the 2nd International Conference on Structural Health Monitoring of Intelligent Infrastructure, Shenzhen, China, 16–18 November 2005; Taylor & Francis-Balkema: Leiden, The Netherlands; pp. 755–758.
- Huang, G.B.; Zhu, Q.Y.; Siew, C.K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
- Ma, T.W.; Bell, M.; Lu, W.; Xu, N.S. Recovering structural displacements and velocities from acceleration measurements. Smart. Struct. Syst. 2014, 14, 191–207. [Google Scholar] [CrossRef]
- Bao, Y.Q.; Li, H.; Sun, X.D.; Yu, Y.; Ou, J.P. Compressive sampling-based data loss recovery for wireless sensor networks used in civil structural health monitoring. Struct. Health Monit. 2013, 12, 78–95. [Google Scholar] [CrossRef]
- Feng, Z.G.; Shida, K.; Wang, Q. Sensor fault detection and data recovery based on LS-SVM predictor. Chin. J. Sci. Instrum. 2007, 28, 193–197. [Google Scholar]
- Pearson, K. Notes on regression and inheritance in the case of two parents. Proc. R. Soc. Lond. 1895, 58, 240–242. [Google Scholar] [CrossRef]
- Tenenhaus, M.; Esposito Vinzi, V.; Chatelinc, Y.-M.; Lauro, C. PLS path modeling. Comput. Stat. Data Anal. 2005, 48, 159–205. [Google Scholar] [CrossRef]
- Teng, J.; Lu, W.; Cui, Y.; Zhang, R.G. Temperature and displacement monitoring to steel roof construction of Shenzhen Bay Stadium. Int. J. Struct. Stab. Dyn. 2016, 16, 1640020. [Google Scholar] [CrossRef]
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).