1. Introduction
State-of-the-art development of materials production, construction technologies, their automation, and the growth of land prices in large cities has led to a new way of thinking about constructing and assembling complex geometric structures, especially high-rise buildings. This fact complicates the building monitoring task due to the necessity of creating the sensors’ monitoring network, the application of various types of measuring equipment, and the high frequency of observations to acquire and study the kinematic and dynamic properties of the structure. On the other hand, having such a monitoring system will operate remotely without time-consuming and laborious field work, and expensive equipment is preferable. Among the different kinds of building monitoring, the observation of geometric parameters or deformation monitoring, in other words, has an important role. The change in the building geometry causes a reduction of its functional determination and crack emergence and propagation, which may lead to the structure’s collapse. The effect of such external loads as wind, snow, ice, solar radiation, unstable foundation, etc., leads to oscillations and torsion of structures, their bending, and their roll. These parameters may change their values daily and seasonally [
1] and can cause spatial displacements at the tens of centimeters level. The considered geometric deformation parameters that need to be determined during deformation monitoring of the high-rise building are given in
Figure 1.
Evidently, the deformation parameters presented above are subject to monitoring by different geospatial methods and technologies as long as the parameters are functions of linear and angular displacements. Such monitoring is carried out using various methods, geodetics particularly. Today, the global navigation satellite systems (GNSS) are the most widespread element of any monitoring system. GNSS provides reliable and high-frequency information about the monitoring target coordinates’ changes. However, due to the impossibility of measuring coordinates along the structure axis from the ground to the top, GNSS data reveals only the total displacement Δ of the structure. This displacement may portray the structure’s top displacement without reference to the ground floor. Thus, the reason for this displacement is unknown and maybe the simple spatial displacement, displacement and bending, or displacement and roll combinations (see
Figure 2).
It has become clear that GNSS observations alone cannot accurately depict the deformation process. That is why GNSS may primarily be used as a complementary data source and to help detect the structure’s vertical displacement, but additional measurements along the structure are needed to figure out the reasons for the structure displacement and to obtain an accurate picture of the deformation process. The simplest way to overcome the GNSS restriction is the integration of GNSS with other geodetic or non-geodetic equipment and measurements.
Geodetic science has developed many methods to measure structure deformation in given directions. Today, we deal with different terrestrial geodetic measurements, satellite measurements, and photogrammetric technologies and measurements. Terrestrial geodetic measurements are the most widespread [
2,
3]. There is no point in discussing these methods in detail as anyone may find their description in the geodetic literature. Moreover, these methods were the first ones applied for deformation monitoring and, consequently, are well studied. The primary geodetic methods are spirit and hydrostatic/dynamic leveling for vertical displacements, and total stations, including image-assisted total stations [
4,
5], for spatial displacement determination. Other kinds of terrestrial geodetic methods are terrestrial laser scanning [
6,
7], depth cameras [
8], and ground-based radar interferometry [
9]. The papers [
10,
11] further develop InSAR technology. Publication [
8] is focused on comparing a depth camera and terrestrial laser scanner to estimate structural deflection. The comprehensive analysis of the ground-based radar interferometry is given in [
11], where the structural monitoring and damage assessment of constructions are considered as the GIS integration of differential InSAR measurements, geological investigation, historical surveys, and 3D modeling. However, those technologies are laborious, require skilled personnel, and are hard to automate. Especially critical for geodetic methods is the question of the observation frequency. It is clear that leveling, terrestrial laser scanning, or ground-based radar interferometry cannot provide more than one observation epoch per day. That is why pure geodetic methods are being used as an additional information source in combination with other methods. Notable success has been achieved in the joint application of GNSS with terrestrial geodetic measurements and sensors. GNSS was applied for monitoring tasks for the first time more than twenty years ago [
12,
13,
14,
15]. Despite the high observation frequency and relatively high accuracy, the GNSS application is restricted to the number of observation points (i.e., it is impossible to install the necessary number of GNSS antennas on a structure), observation sites (i.e., the necessity to have a relatively open sky for satellites), and sophisticated processing algorithms. The GNSS was combined with other sensors to overcome these shortcomings to detect displacements in many points, e.g., accelerometers, hydrostatic levels, inclinometers, etc. [
16,
17,
18,
19]. The study [
17] presents a good case of GNSS integration with low-cost accelerometers, but the solution is not a monitoring system. Paper [
19] is a different approach to integrating GNSS with other devices. GNSS aids non-overlapping images from two cameras to determine the three-dimensional displacements of high-rise buildings. This case is an example of vision-based technology assistance. The main branch of such technologies is photogrammetry, where the image is the primary data source. Here and below, only close-range photogrammetry is considered.
Close-range photogrammetry can be separated into terrestrial and aerial cases. Terrestrial photogrammetry has been known since its invention, whereas close-range aerial photogrammetry took its place by deploying effective and cheap unmanned aerial vehicles (UAVs). This is why the latter is often called UAV/UAS photogrammetry. On the one hand, its traditional concept applies close-range photogrammetry for structure deformation monitoring. On the other hand, the new achievements in computer vision technologies and digital image processing [
20,
21,
22,
23,
24] have transformed classical photogrammetry into digital photogrammetry, where the opportunities for measurement automatization have risen significantly. Among the applications of the traditional terrestrial close-range photogrammetry for monitoring, it is worth mentioning the studies published recently: [
25] proposes a kind of one-image photogrammetry and its integration with geodetic alignment measurements; [
26,
27] explore photogrammetric deformation monitoring using low-cost cameras, particularly for bridges; [
28,
29] explore photogrammetric deformation monitoring using a target tracking approach (the studies demonstrate one step toward measurement automation); [
30] presents a pan–tilt–zoom camera-based displacement measurement system for the detection of building destruction; and [
31] is a case of monitoring using the roving camera technique. Of course, the main advantage of photogrammetric technologies is the opportunity to measure as many points on the structure as necessary. However, just a simple monitoring task of a 100 m building creates an unsolvable problem for terrestrial photogrammetry due to the impossibility of capturing the building surface from the ground. Even if it were possible, the errors due to perspective distortions and resolution would downgrade the overall accuracy to an unacceptable level.
Unlike terrestrial photogrammetry, UAV photogrammetry, thanks to its higher mobility, permits the collection and, consequently, the reconstruction of a more detailed building model. However, technically, UAV photogrammetry presents the same terrestrial photogrammetry case but with higher data redundancy [
31]. Today, UAV photogrammetry has a versatile application for structural monitoring. A wide range of publications similar to [
32] have been published, e.g., paper [
33], where UAV photogrammetry was applied for vibrations monitoring, and [
34], a presentation of a lab-scale test for a six-story building model where displacement determination was carried out using UAV image correlation. Once more, the automation of UAV data processing is not simple stuff. Besides spatial displacement determination, crack monitoring is a top-rated application of close-range photogrammetry, primarily due to the simplicity of the cracks’ identification in images and their measurements. A comprehensive review of crack detection is given in [
35,
36]. A feature of the crack measurements is the sufficiency of only one image for measurements. On the other hand, the data for crack detection are easy to process and automate. The papers [
37,
38] present monitoring solutions for automated crack detection using machine learning. These papers are based on computer vision principles. The photogrammetric principles for crack monitoring are deployed in [
38,
39]. Except for terrestrial-based monitoring of cracks, UAV-based technologies have become very popular recently [
38,
40,
41]. The paper [
40] studies a new computationally efficient vision-based crack inspection method implemented on a low-cost UAV with a new algorithm designed to extract useful information from the images. The disadvantage of UAV-based observations is the low accuracy that does not satisfy current requirements. So, UAV data can be used as a supplementary data source for structure deformation monitoring.
Since we are on the way to developing a vision-based system, let us pay more attention to the photogrammetric methods and approaches. Considering terrestrial close-range photogrammetry, it is necessary to mention the classic books [
42,
43] that provide the most comprehensive review of close-range photogrammetry. Despite the versatility of the presented photogrammetric instances, they are all based on a standard algorithm and math background. This means that, regardless of the structure, any study comprises a geodetic network creation or assignment of some reference base (coordinate system), target marking and coordinating, and compliance with the general requirements of the photogrammetric survey [
43]. The knowledge that came from computer vision treats and handles images differently than classical photogrammetry. The computer vision approach focuses chiefly on different image refinement methods, a digital correlation between images, etc. Math models for geometric information extraction are less strict but more robust. Computer vision approaches have become very popular thanks to their high robustness and automation [
44,
45,
46,
47,
48,
49,
50,
51,
52,
53,
54,
55,
56,
57,
58]. Let us analyze some of them with unique features regarding the study goal. The study [
44] considers computer vision methods tested on a four-story steel frame in lab conditions. These methods comprise the optical flow with the Lucas–Kanade method, the digital image correlation with bilinear interpolation, and the phase-based motion magnification using the Reisz pyramid. The authors of [
46,
47] developed a novel sensor for displacement measurements using one camera. They suggested an advanced template-matching algorithm. Paper [
48] compares two noncontact measurement methods: the vision-based method underpinned by image correlation and the radar interferometer. The vision-based system uses one or two cameras mounted on a tripod. The paper [
51] proposes an approach based on measurements of retro-reflective targets, while [
52] offers the same approach free of targets. In [
53,
54], the vision-based monitoring task is considered as a problem of the best methods and algorithms for image compression and processing under dark–light conditions. It is worth noting that the computer vision approach can also be used for vibration measurements, as claimed in [
55]. The study was conducted in a lab environment. In virtue of the high level of automation, the computer vision approach can be fused with other sensors, e.g., the fusion between a vision-based system and accelerometers. Another sample of integration is [
58], where computer vision technology is integrated with terrestrial laser scanning. For all the considered cases, the measurements are based on one or two cameras, without external reference to some stable basis, insofar as the detected displacements are relative and do not present the total (global) structure deformation.
The discussion about monitoring will not be comprehensive without the small sensors mentioned. These sensors have recently become one of GNSS’s main integration elements. These sensors supplement GNSS and ensure good results for monitoring high-rise buildings. Among the different measuring sensors, it is worth mentioning the following ones that determine structure deformation and their applications, e.g., inclinometers [
59,
60] to detect inclinations in a particular direction, high-resolution lasers [
61,
62], tilt meters [
63] to measure plane inclination, low-cost radars [
64], and 3-D inclinometers [
65] to determine a spatial rotation. A set of papers and reports give an overall review regarding the use of various sensors [
66,
67,
68,
69,
70]. Whatever sensor is used, it provides superior accuracy. Still, the main drawback is the need to organize them into one system and reference this system to some external coordinate system because any sensor provides relative displacements.
Despite the importance of deformation monitoring, this task is just a small unit of a more significant problem. The technologies mentioned above have become a part of a giant branch named structural health monitoring (SHM). Structural health monitoring has become a pretty widespread problem recently. This problem is highly complex and comprises many methods and technologies to monitor a massive range of various building parameters. Therefore, any vision-based low-cost video observation system should become an integrated part of the SHM system. Many papers regarding this problem have been published recently [
71,
72,
73,
74,
75,
76,
77,
78,
79]. A subject of SHM can be temperature variation inside and outside of a structure, air conditioning, humidity inside or underlying soils, and the status of various structure elements, e.g., cracks, damages, and so on. One of the essential applications of SHM is structure deformation monitoring. The given list [
71,
72,
73,
74,
75,
76,
77,
78,
79] is just about geometric parameters deformation monitoring. Thanks to the development of digital technologies and computer science, it is possible to use different small sensors and combine them into one system that may operate automatically. However, modern SHM systems may also include satellite-based interferometry [
80,
81,
82,
83], UAV technologies [
84], and terrestrial and/or aerial laser scanners [
6]. Moreover, the SHM system is also a part of another more complex system named the building information model (BIM) [
85,
86,
87,
88]. BIM comprises all possible building life-cycle stages and consequently the monitoring steps. The paper [
88] outlines the liaison between monitoring problems and BIM. Thus, the creation and operation of any monitoring system should be considered an inseparable part of the building life cycle and must be embedded into BIM. This premise imposes the conditions of easy installation, repair, operation, relocation, and renovation of the monitoring system. Such complicated requirements can be fulfilled for a system that consists of low-cost sensors. On the other hand, the system design must be as simple as possible and reliable.
Based on the given analysis, it was suggested to deploy low-cost digital cameras operating in an automated mode and organized in a system of chains connected with each other. GNSS is supposed to be used as a supplementary data source that provides external reference and control. The system is assumed to be installed inside the building and integrated into BIM. Such a system is easy to install and use, has high reliability due to the vast redundancy of measurements, and does not need professional users. This study aimed to introduce the GNSS-assisted low-cost vision-based observation system (VOS) concept and demonstrate the system’s preliminary analysis results. This system includes the ideas and approaches from close-range photogrammetry to calibrate and orient images; computer vision to process digital images; geodesy to assign coordinate systems and external control (including the possible application of total stations and GNSS); and adjustment calculus to process and analyze the measurement results. The suggested approach provides complex information about a structure deformation and reduces error accumulation and the effect of external errors, increasing the resulting accuracy in determining the displacement values. The study of [
89,
90,
91] has shown that to carry out complex deformation research, it is recommended to locate the system chains along the principal axes of the structure.
A couple of papers demonstrate a very tentative approach with a similar idea. In [
92], the approach of relative displacement measurements from the inside using an in-room camera is presented. The paper considers just a particular case of observations using one camera from one point. A more detailed review is outlined in [
93], with good analysis and ideas a bit similar to our study. However, the work did not consider the case of combining a set of cameras into a network. The significant contribution of this paper is the analysis of the measuring accuracy ensured by the computer vision monitoring systems and target tracking algorithm examination. The study [
94] demonstrates a concept that is close to the VOS idea but is mostly about camera calibration and does not consider cameras organized into a system without external control.
Therefore, the existing approaches and methods have some similarities to the VOS concept that will be presented and studied in what follows. The following stages have to be examined to achieve the primary goal of the study:
General idea and concept description.
Design of the VOS. At this stage, the distance effect between the VOS elements is considered based on the camera’s technical capabilities and the geometric parameters of the test structure.
Determination of displacements between VOS elements for a single chain. In-field simulation of the displacement measurements for a single chain. A phase correlation algorithm is suggested as a primary processing strategy.
Preliminary analysis of the VOS accuracy for the test structure. The investigation is carried out using statistical simulation and results from stage 3.
Determination of the monitoring parameters for the actual structure. Relative displacements of the VOS elements are used to model the structure frame model and compare with the design model of the structure.
Prediction model. Based on the structure frame model (values of monitoring parameters), a prediction model is built for a given point in time.
In this article, the features of the first four implementation stages will be described and studied in detail, and the simulation and experimental measurement results will be presented. The paper is structured as follows.
Section 2 outlines the general concept of the VOS, its design, and its displacement determination approach.
Section 3 describes the results of experimental studies and simulation of the VOS for a high-rise building. A comprehensive analysis of the results after the simulation and discussion are presented in
Section 4.
Section 5 presents the conclusions.
4. Discussion
The data were analyzed using two approaches: experimental studies analysis and simulation analysis. So far, the obtained results have just demonstrated the opportunities of the VOS. However, what about the acquired accuracy? Is it enough to monitor various engineering structures, especially high-rise buildings? First, let us analyze the results of the experimental studies accomplished in
Section 3.1. To do that, it is necessary to propagate the accuracy for one chain in the case of a multi-chain. This question is essential for the case of strict requirements on the accuracy of monitoring parameter determination. Whereas the requirements for the accuracy of vertical displacement determination are not so severe, the demands for roll or bending measurements are pretty tight. The most widespread condition for the roll and bending determination is based on the requirements for ensuring an allowable deviation
from the building’s vertical axis during construction. The expression defines this requirement as:
where
H is a building height in meters, and
will be millimeters.
The allowable deviation
is turned into monitoring accuracy using the expression:
where
t is the Laplace coefficient that depends on the probability level. Typically,
t equals 2 or 2.5, corresponding to 95% and 99% probability values. However, sometimes, in monitoring practice, it is suggested to use
t equals 5 to increase the reliability of the measurement results. Let us suppose that the accuracy along the
x and
y axis is equal to
. The resulting accuracy will depend on the number of chains
k used for measurements. Under this premise, the final accuracy
M can be determined as:
The expressions (22) permit us to compute the accuracy for multi-chain VOS and compare these values with the allowable values from (20). The following calculations for the various heights have been performed (
Table 8), taking the figures from
Table 7. The accuracy calculations have been carried out based on the VOS installation scheme along the entire height of the structure with a step of 17, 25, and 33 m.
Figure 28 is a graphic summary of the results from
Table 8. The horizontal axis describes a building’s height, while the vertical axis highlights the accuracy propagation.
The experimental results and further calculations yielded some interesting findings. The calculations by (22) prove the impossibility of leveraging the suggested VOS for monitoring alone. The general picture emerging from the results is that the multi-chain VOS can probably ensure the necessary accuracy for monitoring buildings higher than 500 m. The principal stress should be pointed out on the rising efficiency of the VOS with a building height. The inclusion of GNSS measurements changes the final distribution of the RMS errors. However, our findings are not generalizable beyond the subset examined because the calculation approach suggested above does not account for the effect of the interrelated measurements, as seen in
Figure 22.
Thus, it is essential to simulate the VOS measurements to account for the redundancy of measurements. Therefore, the second step is the analysis of the simulation results in
Section 3.2.
Let us summarize the results presented in
Table A1,
Table A2,
Table A3 and
Table A4. The accuracy at each block was averaged, and the mean accuracy value was accepted as final for analysis. These values were compared with allowable values (20). Moreover, the simulation results allow one to estimate two measurement modes: relative and absolute.
As seen in
Figure 29, the simulation results provide a more lifelike picture of the VOS accuracy. The final accuracy has improved thanks to accounting for the measurement redundancy. Therefore, the VOS provides a reliable determination of monitoring parameters starting from the height of 90 m for absolute measurements. That is obvious; the installation of the VOS for such a small building is useless, and the conventional geodetic methods provide the necessary accuracy and are well studied. Things get much more complicated for the higher buildings.
To estimate the efficiency of the VOS for tall structures, with the inclusion of the GNSS measurements, the simulation was performed for the building’s 420 m height. Let us analyze the results in
Figure 26 and
Figure 27. Again, the analysis is better presented graphically. To do so, the RMS errors over each floor were averaged and compared in
Figure 30.
The results of the GNSS-assisted VOS simulation look different from the VOS-only simulation. One may infer a couple of interesting findings. At first, thanks to the GNSS observations, the accuracy of the VOS is saved almost at the same level for the whole structure. This effect grows with the structure height as far as the GNSS restricts the error propagation in the VOS. Secondly, as was expected, the GNSS-assisted VOS may ensure the necessary accuracy starting from 60 m. We obtained a weighted accuracy value for the high structures thanks to combined adjustment. Therefore, the simulation results proved the high capability of the developed GNSS-assisted low-cost vision-based observation system for deformation monitoring.
The specific structure of the VOS puts forward some restrictions on the application of this system. These restrictions are defined by the geometry and construction technology of the monitoring objects. Considering the geometry, one needs to pay attention to the VOS scheme. It is clear that for chains we need straight lines between the sensors. Curvilinear structures require modification of single-chain construction. Moreover, the measurement processing is not straightforward. As an example, let us consider the simplest case, namely, the VOS for horizontal monitoring of a curvilinear structure (e.g., dams, tunnels, shells, etc.) between two reference points (
Figure 31). The QR targets are placed perpendicularly to the sensors but with angles (α, φ) between each other. The measured displacements must be converted regarding coordinate axes. The manufacturing and installation of the system gets complicated. Therefore, the idea of VOS has to be developed and studied in the future for the case of curvilinear structures. So far, the considered and examined scheme applies to high-rise buildings.
The second condition is the material of the monitoring structure that was built. This condition especially makes sense when we deal with temperature deformation. The temperature influence leads to structure bending. In the Introduction, it was pointed out that bending is one of the primary issues of monitoring, and the VOS is the solution to this problem. The bending values will be different for different heights (
Figure 32). In the simplest case, the bending due to temperature is described by
where
is a linear extension coefficient of material (
= 12.1 × 10
−6 1/°C for structures made of steel,
= 10.8 × 10
−6 1/°C for structures made of concrete),
is the temperature difference for different sides of the structure,
is a height, and
is a mean structure size in the plane. For the structure with
= 100 m and
= 420 m, we obtained the values given in
Table 9.
The VOS measurement range for displacements was taken from
Figure 13. Regardless of the material, the VOS measurement range covers the possible deformation by almost three times. So, in this case, there are no special requirements or restrictions on the VOS application.