1. Introduction
Light Detection and Ranging (LIDAR) technology is a cost-effective, reliable, and a fast source to collect dense and accurate elevation data for many surveying and mapping applications [
1,
2,
3]. Aerial LIDAR mapping systems are equipped with an active laser sensor, a Global Positioning System (GPS), and an Inertial Navigation System (INS) [
4]. The laser sensor emits pulses at a near infrared wavelength of about 1000 nm [
5]. Pulses travel through the atmosphere [
6] and are reflected from ground features back to the detector. Their travel time multiplied by the speed of light is used to calculate the distance between the laser scanner and the ground [
7].
Manned and unmanned LIDAR systems are widely employed in different applications including urban planning, damage assessment, natural resources, geomorphology, and archaeology [
8,
9,
10,
11,
12,
13,
14]. The assessment of LIDAR data is becoming increasingly important in the geospatial and mapping communities. Several studies have been conducted to evaluate the quality of LIDAR data and products from such data [
15,
16,
17].
In [
18], two LIDAR datasets were acquired from about 2000 m above ground over two heavy tree coverage areas during leaf-off and snow-free seasons with an Optech GEMINI Airborne Laser Terrain Mapper. Two DEMs were then interpolated from the LIDAR datasets at one-meter ground resolution and resampled to three-, five-, and ten-meter ground resolution using mean cell aggregation. Each DEM underwent a simple low-pass smoothing filter using a mean filtering technique. Ground checkpoints were collected through total station ground surveys. Total stations have a typical accuracy of 1–3 mm ± 1–3 parts per million. Discrepancies of about 0.70 m to 2.10 m were discovered.
In [
19], a Leica ALS50 Phase II was used to map an archaeological site that is ecologically and topographically diverse with alluvial terraces, foothills, mountains, and vegetation. Data were collected at 15 points/m
2 ground density with flight lines overlapping more than 50%. Two control points surveyed with a Trimble R8 GNSS rover and a Trimble R7 base were used to geospatially register the LiDAR data through a 3-D shift of the LIDAR point cloud. Data were classified into bare-ground and manmade and natural features points. A Digital Terrain Model (DTM) was generated from the bare-ground points and a Digital Surface Model (DSM) was generated from all points. Results showed an average accuracy of 0.5 m for the produced DTM and DEM.
LIDAR data popularity is increasing driven mainly by the demand for 3D data for different applications ranging from 3D mapping to earth surface and city models. Common earth surface representations created from LIDAR data are the digital elevation and surface models [
20,
21,
22]. In [
23], the accuracy of LiDAR-driven DEMs interpolated from two LIDAR point clouds collected during the leaf-off season with a Leica ALS60, flying at about 3000 m above ground, for two watershed areas was evaluated. The DEMs were interpolated at one-meter ground resolution using the “LAS dataset to Raster” tool in ArcMap. Four interpolation methods in ArcMap were assessed including the inverse distance weighting and nearest neighbor interpolation techniques. Reference data were collected with a GPS ground survey. The 3-D coordinates of 45 LIDAR ground points were compared with those collected with GPS. The Root Mean Square Error (RMSE) for elevation differences between the LIDAR and GPS coordinates was 0.75 m.
In [
24], the accuracy of LIDAR data collected with an Optech™ ALTM 3100 EA LIDAR sensor, flying at an average altitude of 400 m, over four sites in forestry areas was assessed. Original data was collected with an average point sampling density of 10 points/m
2 then thinned to two datasets of five and one points/m
2. DTMs and DSMs with a grid size of one meter were produced using the TerraScan software. Field surveys for 100 trees were conducted with differentially corrected GPS data collected with a GeoXM 2005 receiver. Mean differences and Root mean square errors (RMSEs) between LIDAR data and field surveys were about 0.10 m and 1.25 m, consecutively.
In [
25], the accuracy and usefulness of point clouds obtained by Airborne Laser Scanners (ALS) and Unmanned Aerial Systems (UASs) in detecting and measuring individual tree heights were compared. ALS data were collected with a Leica ALS80-HP laser scanner over an area of 100 hectares. The average altitude during the scanning was 2700 m above mean sea level. The UAS images were collected with an RGB SONY ILCE-5000 camera mounted on a custom-built fixed wind UAS and processed in pix4D. Analysis of the discrepancies in elevation data showed that elevations measured from the ALS data were lower than those estimated from the UAS data by approximately 1.15 m with a standard deviation of about 1.95 m.
The reliability and of LIDAR data for coastal geomorphological changes was investigated [
26]. Data was provided by the U.S. Geological Survey (USGS) and was collected with the Experimental Advanced Airborne Research Lidar (EAARL) sensor from 300 m above the ground. The LIDAR point cloud contained had a ground point density of 0.65 points/m
2. Ground truth data consisted of 734 points and was surveyed with two Trimble R10 and one Leica GS12 geodetic GPS receivers configured for Real Time Kinematic (RTK) differential correction. In vegetated dune topography areas, dissimilarities of about 0.5 m RMSE were unveiled horizontally. However, better accuracies were attained in non-vegetated flat areas. RMSE was reduced to 0.25 m after systematic offsets between LIDAR data and ground truth were compensated for.
In [
27], a point cloud obtained with a Riegl LMS-Q780 scanner, flying at an average flight height of 1240 m above mean ground, was evaluated. The point cloud had an average density of approximately 15 points/m
2. Ground truth was collected with a DJI Phantom 4 Pro UAS equipped with 5472 pixels × 3648 pixels digital camera. Captured images were processed with Agisoft Metashape Professional
® using structure-from-motion photogrammetric techniques. Geodetic control was established with a Leica TS02 total station. Height comparisons between the LIDAR and UAS point clouds reviled that 98% of the dissimilarities in elevations were in the range from −150 to +150 mm with an absolute maximum of 0.75 m for the whole area.
A Riegl LMS-Q560, flown at an altitude of 750 m, was employed in [
28] to assess the accuracy of nine interpolation methods using cross-validation. The interpolation methods examined were Delauney Triangulation, Natural Neighbor, Nearest neighbor, Ordinary Kriging, Inverse Distance Weighted, and four spline-based interpolators. Data was collected as a point cloud with a point density of 5.13 points/m
2 and a subset was held back for validation. Several DTMs were generated at 0.5-, 1.0-, 1.5-, and 2.0-meter ground resolutions. For most of the algorithms, RMSE values between 0.11 and 0.28 m were reported. Spline-based methods and the Natural Neighbor algorithm had vertical errors of less than 0.20 m for over 90 percent of validation points.
The same dataset was used in another study to map a mountainous terrain area covered by dense forest vegetation with a mixture of spruce and pine trees [
29]. Ground filtering was carried out to distinguish ground points from non-ground points using the LasTools software and then manually checked. Data were resampled at different ground point densities ranging from 0.89 points/m2 to 0.09 points/m
2. It was found that DTM accuracy is highly dependent on the ground point density as the RMSE ranged from 0.15 m for the 0.89 points/m
2 ground point density to 0.62 m for the 0.09 points/m
2 ground point density.
In [
30], LIDAR data from the Golden Gate LiDAR Project was evaluated. The flying altitude was about 2500 m above mean sea level. The data minimum point density was 2 points/m
2 and was filtered using TerraScan providing a set of points classified as bare-earth ground. In addition to the LiDAR data, a set of 753 points was surveyed by the USGS using the RTK differential GPS technique. The distribution of the vertical dissimilarities between the LIDAR elevation values and those observed by RTK was 19% less than 50 cm, 50% between 50 cm and 1 m, 30% between 1 cm and 2 cm, and only about 1% greater than 2 m.
From the presented literature, we can conclude that most of the previous research used GPS ground surveys to assess the quality of the LIDAR data. Despite the fact that GPS in general has higher accuracy than LIDAR data, the reliability of GPS is affected by distances to the base point, time of observation, and satellite geometry. In addition, the lack of a clear line of sight is occasionally found in urban geographic environments and this could harm the quality of the GPS signals. Therefore, in our study we used traditional ground surveying techniques, a total station, to assess the quality of the LIDAR data. Moreover, most of the previous research focused on either relative or absolute assessment of LIDAR data. In our study, we evaluated both the relative and absolute quality of the LIDAR data. The LIDAR data set was collected with an Optech LIDAR system and ground control was surveyed using conventional ground surveying techniques.
4. Data Absolute Accuracy
Many sources of error affect the quality of airborne laser scanning data. Companies that work in collecting such datasets usually publish a fixed number for the uncertainty of their data. However, these numbers are usually not verified by the users because this is not a simple task. Correspondence between laser data points and ground points is not a straightforward matter. The absolute displacements can be found by measuring the location on the ground of a point or feature of known coordinates in the data set and comparing the two measurements.
4.1. Absolute Height Accuracy
The ground points were collected in a specific pattern except when on distinct features (ditches and curves on the ground). This pattern (lines) facilitates establishing a correspondence between the two data types. A buffer zone (with a width of one meter) was constructed around each ground data line and the data points inside that buffer zone were identified. Then, for each laser data point that lies within a meter or less from two ground data points, a corresponding height point with the same planimetric position was interpolated from the surrounding ground-surveyed points. The absolute accuracy was then estimated as the absolute difference between the laser height and the ground height. Since the points are nearby and the surface is flat, a linear interpolation was used. In order to minimize the effect of the planimetric uncertainty of the laser data, the interpolation was limited to totally flat areas. The slope of those areas was restricted to not exceed 10%. The slope was calculated as the rate of change of elevation for each LIDAR point using the Surface Slope tool in ArcMap.
Direct differencing was applied to compute the discrepancy between the two data points from the laser and ground, i.e., the laser height was subtracted from the ground interpolated height. More than a thousand such differences were computed.
Figure 8 shows the histogram of those differences. The histogram shows a bias in the differences of −0.088 m with a standard deviation of ±0.082 m. Hence, the average height of the laser data points included in the test is higher than the estimated ground by 0.088 m. Consequently, the total RMSE of the LIDAR heights from the surveyed ground (zero in the histogram) was 0.12 m with a mean value of zero. The calculated statistical characteristics represent only a small sample of the data. The tested area covers less than 2 s of collection time (about 19,000 data points since the designed scanning rate is 10,000 points/s). The 19,000 data points represent a small portion of the whole data set which is more than 3,000,000 data points. However, only 1008 points were included in the real computation since it is not practical to survey more points on the ground.
4.2. Absolute Planimetric Accuracy
Planimetric offsets are more complicated to determine since they require identifiable spatial features to establish the correspondence between the two data sets. Such features and locations for estimating the offsets might not be available, or when they exist, are usually limited. Moreover, identifying these locations is costly in time and requires great care in order to be reliable. Drainage ditches, terrain features, and building gable roofs are some examples of such features. Procedures and results of estimating the planimetric accuracy in both directions (Easting and Northing) are presented in this section.
Eight positions were identified and used in obtaining the planimetric accuracy. Six of those locations were used to compute the offset in the X (Easting) direction and the other two were utilized to compute the offset in the Y (Northing) direction. In each of these locations the data points from both data sets, LIDAR and ground, were identified as shown in part (a) of
Figure 9,
Figure 10 and
Figure 11. Parts (a) of these figures show the plan view of the points. From each set, an estimate of the two shifts was made to realize the coincidence. The idea here is to translate these two curves and obtain the shift that will maximize the match. Prior to that, the height bias should be removed in order to exclude its effect in the matching. In part (b) of the figures, the green dashed curve represents the LIDAR data after removing the height bias between the two data curves.
To get the best match between the two curves, the LIDAR data curve will be shifted gradually around the ground data set in the direction of the computed offset (X as in
Figure 9 and
Figure 10, and Y as in
Figure 11). The shift search window ranged from −2 m to 2 m with an increment of 0.01m. At each increment, the discrepancy in Z of each ground point and its interpolated-correspondence point from the LIDAR curve data is computed. The sum of the squares of these offsets is considered as the matching cost at each location. Parts (c) of the figures show the matching cost function behavior with respect to different shift values. At the minimum matching cost, which is associated with the best match between the two curves, the corresponding shift is obtained. Parts (d) show the original data curves, after removing the height bias, and after the planimetric shift.
As stated above, eight locations were tested to obtain the planimetric shift.
Table 2 summarizes the results at those selected locations. Regarding the offset in the X (Easting) direction, which is across the flight direction and coincides with the scanning direction, six locations were selected, three at the edge of the swath width of strip two and the rest at the middle of the strip. As expected at the strip edge, the offset in the scanning direction (−0.60 m as an average) was larger than at the middle of the strip (−0.30 m as an average). However, those shifts were in the same direction. The same thing could be said for the height bias, height offsets seem to be larger in magnitude at the edge of the strip. On the other hand, two locations were selected to test the accuracy along the flight direction Y (Northing), one at the middle of the strip and the other at the edge. The two locations had the same direction.
4.3. Error Behavior with Respect to Time
To show how errors changed while the scanner was advancing, the LIDAR data points over the test area were sorted based on the time they were scanned. Then, the calculated height differences (1008 biases) were sorted by time. The test area covered only less than two seconds of the scanning time, which contains about 19,000 data points.
Figure 12 shows the behavior of the absolute height error with respect to the time they were scanned.
In
Figure 12, the height differences between the LIDAR points and their corresponding ground-surveyed points show two types of variations. The two types can be divided in two components: a bias in the geo positioning system and a random variation around that bias. The first type of variation is called short-period variation. This variation seems to be random and has a high frequency. This variation between two consecutive points could reach 0.30 m as a maximum within 0.001 s. This random variation has a standard deviation of 0.082 m. The short period variation,
Figure 13, of the uncertainty of LIDAR heights gives an indication of the system precision since the consecutive points are so close to each other in the time domain and the test was conducted on a flat surface where the height is approximately the same.
On the other hand, as shown in
Figure 12, a trend (green spline) of the differences is observed which is the second form of the variation. A trend is defined as the component of a random phenomenon which has a period larger than the recorded data sample. The trend in this test data represents the biases in the geopositioning system. Although the test data represents only a small sample, this trend is very noticeable. The relative height accuracy between data strips confirms this inference regarding the two forms of random variation since the computation of the relative accuracy covers most of the data.
The total computed uncertainty of the LIDAR heights over the test area is about ±0.121 m. This uncertainty consists of two components: biases in the geopositioning of the platform and random variation around these biases due to the ranging system uncertainty. Regarding the planimetric accuracy, the computed offsets in the scanning direction over the test area show two main outcomes. First, the planimetric uncertainty is larger (almost double) at the end of the swath than at the middle. So, in general, the planimetric accuracy seems to vary based on the location along the scan (cross-track) direction. At the edge of the swath, the average offset was about 0.60 m, while at the middle it was about 0.30 m. The second outcome is that the whole strip seems to be shifted in the east direction since all the computed offsets have the same direction.
Both accuracies, relative and absolute, show two types of variation. The first variation form is the short period random variation which has a high frequency with time. This random variation seems to represent the actual precision of the LIDAR subsystem. The other variation form is the long period variation or a trend. This trend has a much lower frequency and seems to be a result of errors in the positioning/altitude subsystem. If this trend is modeled and the data is adjusted accordingly, the accuracy of data will be improved. Testing more samples that cover different parts with a longer period of time would be needed to strengthen these conclusions.
5. Results Analysis and Discussion
The relative height offsets between LIDAR data strips were examined based on the height offsets between coincident or partially coincident data points. More than 16,600 height offsets from all the 13 test overlap regions were computed. The results show a variation in the height offset between adjacent strips. Among the 13 overlap regions between adjacent strips, the average relative height offset ranges from 0 to 0.12 m. The maximum offset between coincident laser measurements from two different strips is about 0.40 m. Two variation types are observed in the discrepancy between adjacent strips, short- and long-period variations. The short-period variation is attributed with a high frequency with time and has a standard deviation of about ±0.10 m. The long-period variation has a much lower frequency than the first type. The source of the short-period variation appears to be the uncertainty in the LIDAR subsystem. Ranging and scanning observation uncertainty are some examples of error sources in the LIDAR subsystem. The short-period variation appears to be random and represents the LIDAR subsystem precision. On the other hand, the uncertainty in the navigation subsystem appears to be the source of the long-period variation, which can be modeled and corrected and consequently improve the data quality.
More than 1000 height discrepancies between the LIDAR data and collected ground points were calculated in order to examine the height absolute accuracy of the LIDAR data. The histogram of the computed differences shows a bias of about 0.088 m from the surveyed ground with a standard deviation of ±0.082 m around the middle of the histogram. The average height of the laser data points included in the test is higher than the estimated ground by 0.088 m. However, the total variation of the LIDAR heights from the surveyed ground is 0.120 m. This variation contains both the biases and its random differences. This number (0.120 m) represents a sample of the real system height precision of the LIDAR data. The computed discrepancies were then sorted based on the time they were scanned in order to examine the error behavior with respect to time. The height differences between the LIDAR points and their corresponding ground-surveyed points showed the two types of variation that were shown in the relative height accuracy. However, this time the tested sample is much smaller than in the relative height accuracy.
The variation between two consecutive points in time could reach 0.30 m as a maximum. The time span between two consecutive points in the test data is about 0.001 s since the pulse rate is 10 kHz. This short-period variation of the uncertainty of LIDAR heights gives an indication of the system precision since the consecutive points are so close to each other in the time domain and the test was conducted on a generally flat surface where the height is approximately the same.
In the scanning direction (across track), the planimetric offset of the LIDAR data from the collected ground points varies based on the location in the swath. As expected, at the strip edge, the offset in the scanning direction (−0.60 m as an average) was larger than at the middle of the strip (−0.30 m as an average). However, those shifts were in the same direction, which show the presence of a trend or bias. In the other direction (along track), the average offset from the ground was about 0.47 m. In general, the root mean square error of the planimetric offsets in both directions was estimated to be 0.50 m.
6. Conclusions
Coincident LIDAR data points in overlap regions between adjacent strips show a relative discrepancy in height. This discrepancy varies from one position to another and varies with time. Regarding the absolute height accuracy, the data points show a systematic departure from the collected ground reference and a random variation of −0.088 m. The total computed RMSE of the test data from the ground reference was about 12 cm. Planimetric offset of the LIDAR data from the ground reference in the across flight direction varies with the location in the swath. At the edge of the swath, the average planimetric offset was almost double its value in the middle.
Both relative and absolute height discrepancies of the LIDAR data have two components of variation. The first component is the short-period variation, which is random and has high frequency with time. This short-period random variation runs over the second component, which is the long-period variation. The long-period variation has a much lower frequency than the first type and represents the biases in the geopositioning system. The short-period variation is attributed to the LIDAR subsystem measurement uncertainty. System ranging resolution and scanning system-pointing precision are some examples of the error sources in the LIDAR subsystem. On the other hand, the uncertainty in the navigation subsystem appears to be the source of the long period variation, which can be modeled and accounted for in order to improve the quality of the data.
The LIDAR data accuracy evaluation is still an open area for more research and analysis, particularly with the growing application of new UAV-LIDAR systems in many applications. Such systems can provide centimeter-level accuracy and the procedure outlined in this research has the capability to evaluate the accuracy of these systems with a high degree of certainty. This will extend the use of LIDAR data in many applications such as building information models (BIMs), 3D Geographic Information Systems (GIS), mobile communications, disaster management, and city planning. Therefore, future research will focus on assessing the quality of LIDAR data produced from UAV systems and best practices of collecting LIDAR data for different applications.