1. Introduction
Road extraction methodologies based on the digital processing of remote sensing images have been studied extensively by cartography researchers because they help update graphical representations, such as maps, that serve many research areas. Remote sensing deals with information, such as images of land surface targets, that is acquired remotely by sensors mounted on, for example, airplanes or satellites. Cartography collects this information to compose cartographic datasets of geographical data and earth measurements, which are later represented graphically as maps, globes, and cartograms, among other forms. In the scientific literature, many studies have applied road extraction methodologies to different cartographic datasets and reported the respective statistical evaluations and analyses. However, it is difficult to compare road extraction methodologies reliably when their results come from different cartographic datasets. Therefore, there remains a need for studies that compare road extraction methodologies through statistical evaluation and analysis on the same dataset.
A significant number of published studies present methodologies for road extraction based on the digital processing of remote sensing images. For example, road extraction methodologies have been based on many different approaches, such as GPS data [1], texture filters [2], mathematical morphology [3,4,5], and directional filters [6,7], among others. Additionally, owing to the importance of this field to cartography, several studies have reviewed the results of road extraction methodologies [8,9,10]. These three reviews cover several road extraction methodologies, explaining their concepts and comparing the advantages and disadvantages of each one. Although road extraction is studied mostly in cartography, other research areas, such as civil engineering and social studies, are also interested in it because of its importance for traffic management, vehicle navigation systems, location-based services, tourism and industrial development [1].
Although many published studies present methodologies for road extraction based on the digital processing of remote sensing images, most of them, to the knowledge of the authors, focus on a specific type of road, quite often with a limited set of characteristics or conditions. Even though road extraction methodologies have attracted much attention from the scientific community, they remain a challenge because of the difficulty of reproducing the complexity of road characteristics in a general model [8]. Thus, there is a need for more studies that move toward generalizing the problems presented in the scientific literature. Moreover, some review papers compared road extraction methodologies using the results reported by the original authors, instead of performing a statistical evaluation and analysis of the methodologies on a single common image dataset. Comparing road extraction methodologies through statistical evaluation and analysis on a single common dataset would enable a more reliable ranking of future extraction methodologies on the same images and would guide researchers toward studies that generalize the problems presented in the scientific literature. Along these lines, Mayer et al. [11] grouped seven different road extraction methodologies and applied them to the same dataset. That dataset contains only six images, of which three are aerial images and the other three were acquired by the Ikonos satellite. Consequently, it does not cover the great variety of scenes that can occur in road extraction cases.
Considering the importance of studies on road extraction methodologies and the advantages of using a single image dataset to statistically evaluate and compare them, this study proposed a statistical evaluation and analysis of road extraction methodologies applied to the same image dataset. The study aimed to enable a more reliable comparison among different road extraction methodologies from the scientific literature. To achieve this goal, a dataset of remote sensing images was proposed covering three road types (highways, city networks and rural paths), with twenty-five images for each road type. In addition, the dataset was augmented with sixteen images from a public dataset (Vaihingen dataset) [12] from the International Society for Photogrammetry and Remote Sensing (ISPRS), for a total of ninety-one images. Furthermore, three road extraction methodologies were selected from the literature, according to their availability, to be processed and evaluated using well-known statistical metrics defined in the scientific literature for road extraction evaluation [13,14]. The experiment was performed in three steps: creating the image dataset, obtaining the extraction results of each methodology for every image, and statistically evaluating those results. The results indicate that the proposed statistical evaluation and analysis allow researchers to evaluate and compare road extraction methodologies using the same dataset and characteristics, and thus obtain a more reliable comparison among them.
The rest of the paper is organized as follows.
Section 2 summarizes some related work on road extraction and segmentation.
Section 3 provides the necessary definitions and concepts on road extraction methodologies.
Section 4 presents the image dataset and the methodology applied in this paper.
Section 5 shows the extraction results achieved by the three selected methodologies and their respective statistical evaluation.
Section 6 presents a brief discussion about the results obtained.
Section 7 presents a conclusion about the results achieved by this study.
2. Related Works
Due to the importance of road extraction methodologies, several works in the scientific literature proposed many different methods to perform road extraction. The significant variability of methods can be explained by the fact that each road or image type has its own characteristics, which directly interfere in the extraction process and, consequently, hinder the development of a unique extraction methodology for all possible situations presented in static images of roads. In this sense, the scientific literature also includes some review papers that presented extraction methodology concepts and compared their results.
According to Wang et al. [8], extraction methods can be classified by the road aspects they exploit: geometric, radiometric or photometric, topological, functional and texture features. The geometric features are based on the geometric characteristics and shapes of roads, such as the fact that the width does not change suddenly and the ratio between length and width is very large. The radiometric features reflect the fact that the color of a road does not change much along its course, while it differs from that of the neighboring areas. The topological features are related to the fact that roads have intersections and are not suddenly interrupted. The functional features, in turn, concern the specific functions of roads, which impose some constraints that roads must satisfy to perform those functions. Lastly, the texture features are based on the spatial distribution of pixel colors along the road course or in its neighborhood [8,15]. Wang et al. [8] presented extraction methodologies based on, for example, classification, knowledge, mathematical morphology and active contour models. The authors described those methodologies and presented their results in accordance with the information contained in each paper. Consequently, the extraction methodologies were compared using only the information provided by their authors, without using the same characteristics to perform the comparison. Analyzing the results of the papers, [10] showed that it is difficult to obtain high accuracy values using only one extraction algorithm and that studies combining different extraction algorithms in accordance with the application are required.
Aiming to present a summary of the study area, Kaur and Singh [10] presented several automatic road extraction methodologies. The authors provided a comparison table with information from each paper, such as the basic extraction algorithm, the image source, statistical results when available, and advantages and disadvantages. However, the methodologies were analyzed using only the information provided by their authors. The comparison was therefore not direct, because the extraction results of each compared methodology were obtained using different image datasets and characteristics.
According to Kaur and Kaur [16], image segmentation is an important step present in all road extraction methodologies. In their study, segmentation was used to partition an image into meaningful parts having similar features and properties. Considering the large number of segmentation methods, Kaur and Kaur [16] reviewed various segmentation algorithms, classifying them as structural, stochastic (normally known as spectral) or hybrid techniques. The structural techniques are based on the structural information of the feature of interest in the image. The stochastic segmentation methods, on the other hand, are based only on the pixel values, while the hybrid ones combine the structural information with the pixel values. After comparing several segmentation techniques, Kaur and Kaur [16] concluded that a single extraction methodology is not sufficient to correctly extract the road networks from all the different image types and that some methods are not suitable for a particular image type. This fact motivated the present work to perform a systematic analysis of different road extraction methodologies using a single image dataset and, consequently, to enable a direct comparison among them.
A review of the literature on road extraction methodologies shows that several of them use mathematical morphology (MM) theory. MM is widely used because of its large number of functions and its ability to maintain the global geometry of the feature of interest during image processing. MM is most commonly used in post-processing steps that improve the segmentation results by removing noise and linking targets of interest, because it processes the image while preserving the geometric structures of those targets [17].
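To make the noise-removal role of MM concrete, the following Python sketch implements a binary opening (erosion followed by dilation) on a small binary image. It is only an illustration under stated assumptions: the function names `erode`, `dilate` and `opening`, the square structuring element and the treatment of pixels outside the image as background are choices made here, not taken from any of the cited methodologies.

```python
def erode(img, k=1):
    """Binary erosion with a (2k+1)x(2k+1) square structuring element.
    Pixels outside the image are treated as background (assumption)."""
    h, w = len(img), len(img[0])
    return [[int(all(0 <= r + dr < h and 0 <= c + dc < w and img[r + dr][c + dc]
                     for dr in range(-k, k + 1) for dc in range(-k, k + 1)))
             for c in range(w)] for r in range(h)]


def dilate(img, k=1):
    """Binary dilation with the same square structuring element."""
    h, w = len(img), len(img[0])
    return [[int(any(0 <= r + dr < h and 0 <= c + dc < w and img[r + dr][c + dc]
                     for dr in range(-k, k + 1) for dc in range(-k, k + 1)))
             for c in range(w)] for r in range(h)]


def opening(img, k=1):
    """Opening = erosion then dilation: removes structures smaller than
    the structuring element while keeping larger ones."""
    return dilate(erode(img, k), k)


# Hypothetical 7x7 example: a 3-pixel-wide horizontal "road" plus one noise pixel.
road = [[1 if 2 <= r <= 4 else 0 for _ in range(7)] for r in range(7)]
noisy = [row[:] for row in road]
noisy[0][6] = 1                      # isolated salt noise
cleaned = opening(noisy)             # noise pixel removed, road band kept
```

In this toy case the opening removes the isolated pixel and leaves the road band unchanged, which is the behavior that makes MM attractive for post-processing segmentation results.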
Ma et al. [4] presented a methodology for road extraction based on Otsu segmentation and mathematical morphology. After a grayscale image was obtained, the Otsu algorithm was applied to convert it to a binary image highlighting the road network. However, the segmentation created a lot of noise (such as salt-and-pepper noise), which was removed using mathematical morphology operations. Since these steps were based on the gray level of pixels, other features could remain in the image together with the road network. Thus, the methodology proposed by Ma et al. [4] calculated a ratio between the area and the perimeter of each target and removed the targets without an elongated shape to obtain the final extraction result.
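The two ideas in this description, automatic thresholding and a shape test based on area and perimeter, can be sketched in Python as follows. This is not the implementation of Ma et al. [4]: the Otsu threshold follows the standard between-class variance criterion, whereas the compactness measure 4πA/P² is an assumed stand-in for their area-to-perimeter ratio, and all function names are hypothetical.

```python
import math


def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes the between-class variance;
    pixels with value > t are treated as foreground."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * n for i, n in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(levels):
        w0 += hist[t]                 # number of pixels in the low class
        sum0 += t * hist[t]           # gray-level sum of the low class
        if w0 == 0:
            continue
        w1 = total - w0               # number of pixels in the high class
        if w1 == 0:
            break
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var_between = w0 * w1 * (m0 - m1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def area_perimeter(mask):
    """Area = foreground pixel count; perimeter = number of pixel edges
    that border background or the image boundary (4-connectivity)."""
    h, w = len(mask), len(mask[0])
    area = perim = 0
    for r in range(h):
        for c in range(w):
            if not mask[r][c]:
                continue
            area += 1
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                rr, cc = r + dr, c + dc
                if not (0 <= rr < h and 0 <= cc < w) or not mask[rr][cc]:
                    perim += 1
    return area, perim


def compactness(mask):
    """Assumed shape measure 4*pi*A/P^2: low for elongated targets,
    higher for compact blobs."""
    area, perim = area_perimeter(mask)
    return 4 * math.pi * area / perim ** 2 if perim else 0.0
```

Under this measure, a thin strip scores much lower than a square of the same area, so targets above an empirically chosen compactness value could be discarded as non-road.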
Leonardi et al. [18] also used MM and, before the segmentation, applied a smoothing step based on a Partial Differential Equation (PDE). The PDE was used to remove noise, smoothing the image for the segmentation step. The segmentation was performed automatically using the Otsu method to obtain a binary image, which was post-processed using mathematical morphology functions to retain only the road of interest.
Wang and Shan [5] used mathematical morphology to post-process the segmentation result. They classified the images into four groups (linear, curvilinear, crossings and breakages), creating a different post-processing for each road type.
Cardim et al. [19] proposed a methodology based on region growing that segments the image using road samples provided by the user. From the road samples, the algorithm calculated statistical values to determine the range of values belonging to the roads, applied the region growing method within that range and obtained the segmentation result. After the segmentation, a post-processing step based on mathematical morphology produced the extraction result.
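A minimal Python sketch of seeded region growing is given below to illustrate this kind of approach. It is not the algorithm of Cardim et al. [19]: the acceptance range (mean ± k standard deviations of the sample gray levels), the 4-connectivity and the names `region_grow`, `seeds` and `k` are assumptions made for the example.

```python
from collections import deque
from statistics import mean, pstdev


def region_grow(img, seeds, k=3.0):
    """Grow a region from user-provided seed pixels.

    img   : 2D list of gray levels
    seeds : list of (row, col) road samples
    k     : half-width of the acceptance range in standard deviations
            (assumed rule; the original statistics are not specified here)
    """
    samples = [img[r][c] for r, c in seeds]
    mu, sigma = mean(samples), pstdev(samples)
    lo, hi = mu - k * sigma, mu + k * sigma

    h, w = len(img), len(img[0])
    mask = [[0] * w for _ in range(h)]
    for r, c in seeds:
        mask[r][c] = 1
    queue = deque(seeds)

    # Breadth-first growth over 4-connected neighbors within [lo, hi].
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            rr, cc = r + dr, c + dc
            if (0 <= rr < h and 0 <= cc < w and not mask[rr][cc]
                    and lo <= img[rr][cc] <= hi):
                mask[rr][cc] = 1
                queue.append((rr, cc))
    return mask
```

Because growth proceeds only through connected neighbors, a pixel with a road-like gray level that is not connected to any seed is left out of the result, which is one reason such semiautomatic methods depend on well-placed samples.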
Alshehhi and Marpu [20] proposed an extraction methodology in which mathematical morphology and a Gabor filter are used in parallel as pre-processing steps to enhance the contrast between road and non-road pixels. The second step was based on the construction of a graph representation of the image in accordance with the initial segmentation and on a hierarchical merging and splitting of the image segments in accordance with color and shape. Finally, the post-processing step removed the irregular segments.
Valero et al. [7] introduced the use of directional morphological operators for road extraction, which eliminates the need for the previously defined structuring element that traditional MM filters usually require. Directional filters, path openings and path closings were used to create a morphological profile, enabling the extraction of linear geometrical information to classify each pixel as road or non-road.
Another road extraction methodology, proposed by Courtrai and Lefèvre [6], applied a pre-extraction of road segments by filtering the image with background knowledge; the segments were then analyzed and connected, whenever necessary, using a region-based path closing. This last step reconstructed the unconnected road segments, producing a better result.
Sghaier and Lepage [2] presented a road extraction methodology that can be divided into two main steps. The first step, which prepares the image for the road extraction, used a texture filter to extract linear surfaces and a morphological dilation operator to fill the holes caused by shadows, cars or other occlusions, and finally applied the Canny edge detector to extract the road candidates. The second step was based on the beamlet transform, which allowed the identification of the most appropriate scale for each road segment and thus let the methodology use local information for each segment. In this step, only the road candidates whose associated beamlet energy was greater than a threshold were retained [2].
Ibtissam et al. [21] proposed a methodology that applies a Hough transform, after a Canny edge detector, to identify possible road lines. In parallel, their methodology applied a local binary pattern algorithm to analyze the image texture. They then assumed that a road is fairly uniform, with similar local binary pattern values inside it and different values at the road edges detected by the Hough transform. Accordingly, the extraction result consisted of the lines obtained by the Hough transform that have similar values in the local binary pattern result.
Das et al. [22] presented a multistage road extraction methodology that exploits two road features: distinct spectral contrast and locally linear trajectory. To achieve the desired results, four probabilistic support vector machines (P-SVMs) were trained using samples of four different categories. The methodology was divided into two stages. During the first stage, a P-SVM was applied in parallel with a dominant singular measure (DSM). When performed alone, each of these steps produced errors that did not occur when they were applied together in a constraint satisfaction neural network (CSNN), which was modified to have a complementary information integration step (CSNN-CII). The CSNN-CII was used to integrate the information from the edge-based processing (DSM) and the region-based processing (P-SVM). In the second stage, a post-processing step improved the accuracy of the road detection by removing some false alarms and recovering a few short road segments that had been missed due to false rejection.
Using a convolutional neural network (CNN), Li et al. [23] proposed a road extraction methodology. The algorithm predicted whether a pixel belongs to the road, creating a rough map. To improve these results, a line integral convolution (LIC) based processing was used to connect the gaps and cracks, refining the rough road map.
According to Zhong et al. [24], most published CNNs are not designed for high-resolution remote sensing images, because they are adapted from natural scene classification, using thousands of training samples. In this sense, a CNN architecture based on the recent improvements of CNNs was proposed using more efficient convolutional layers and smaller kernels. Nevertheless, the architecture was developed to classify all pixels of the image into different classes and not specifically for road extraction.
As the review of state-of-the-art papers shows, the classification of remote sensing images is an active research topic, which aims to categorize scene images into a discrete set of meaningful land use and land cover classes according to the image contents [25]. For this reason, classification techniques are very commonly confused with road extraction methodologies. However, while classification evaluates multiple spectral bands to assign every pixel to a predetermined class, a road extraction methodology focuses only on the roads and, therefore, can apply techniques beyond spectral data, such as geometric and contextual techniques. Classification methods may therefore be used in a pre-processing step of road extraction to identify possible road regions, allowing a road extraction methodology to identify the road segments better and faster. In this way, a classification method could improve the road extraction result by performing an initial, fast scene analysis and checking whether road candidates were selected, and hence whether a road extraction needs to be performed on the evaluated scene. If a classification method does not find road characteristics in an image scene, the road extraction algorithm does not need to be applied to that scene, making the process faster. Conversely, if the classification finds road characteristics in the scene, then the classified scene can be considered a good candidate for the road extraction methodology, because the features of interest and their characteristics were detected in it.
5. Results
As mentioned before, the purpose of this study was to perform a statistical evaluation and analysis of three road extraction methodologies by applying them to a single image dataset, in order to enable a more reliable comparison among different road extraction methodologies from the scientific literature. Three steps were performed to achieve this purpose. First, a dataset of remote sensing images was created. Second, the extraction result of the roads of interest was obtained for each extraction methodology to be evaluated; that is, the second step is a digital processing method (see Figure 6) that runs the three methodologies to obtain the extraction results for each image of the dataset. Third, the results were statistically evaluated in accordance with a consolidated method [13], in which the results are compared with a reference image that is considered an ideal result, as explained in the Methodology Section.
In this section, the statistical evaluation was performed by calculating the completeness and correctness metrics, given by Equations (4) and (5), respectively. As mentioned in the equation definitions, an acceptance buffer is created with a specified size, which is defined empirically. Since the spatial resolution is known for every image used, we defined the buffer size as 2 m, and the algorithm calculates the corresponding buffer size in pixels for each image.
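The buffer conversion and the two metrics can be illustrated with the following Python sketch. It assumes the commonly used buffer-based definitions (completeness as the fraction of the reference matched by the extraction, correctness as the fraction of the extraction matched by the reference) and approximates lengths by pixel counts and the buffer by a square neighborhood; it is not a reproduction of Equations (4) and (5), and the function names are hypothetical.

```python
def buffer_in_pixels(buffer_m, resolution_m_per_px):
    """Convert a buffer size in meters to pixels for a given image resolution."""
    return max(1, round(buffer_m / resolution_m_per_px))


def matched_fraction(source, target, radius):
    """Fraction of foreground pixels in `source` that have at least one
    foreground pixel of `target` within `radius` pixels (square neighborhood)."""
    h, w = len(source), len(source[0])
    total = matched = 0
    for r in range(h):
        for c in range(w):
            if not source[r][c]:
                continue
            total += 1
            if any(target[rr][cc]
                   for rr in range(max(0, r - radius), min(h, r + radius + 1))
                   for cc in range(max(0, c - radius), min(w, c + radius + 1))):
                matched += 1
    return matched / total if total else 0.0


def completeness(reference, extracted, radius):
    """Share of the reference road that lies inside the buffer of the extraction."""
    return matched_fraction(reference, extracted, radius)


def correctness(reference, extracted, radius):
    """Share of the extraction that lies inside the buffer of the reference."""
    return matched_fraction(extracted, reference, radius)
```

For example, with a hypothetical resolution of 0.5 m per pixel, `buffer_in_pixels(2, 0.5)` gives a 4-pixel buffer, so the same 2 m tolerance is applied consistently to images of different resolutions.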
Figure 8 presents the results obtained in the second step by processing and applying the extraction methodologies of Sghaier and Lepage [2], Cardim et al. [19] and Ma et al. [4] to the images presented in Figure 5.
Table 3 presents the average values for completeness and correctness metrics obtained in the third step for each methodology related to each road type. The bold values are the best values obtained during the experiment.
Table 3 presents the average value obtained by each methodology when applied to the created dataset, but it does not show the individual values for each image. To visualize the statistical values obtained for each image of the dataset, Figure 9 and Figure 10 present, respectively, the completeness and correctness variation across the dataset. It is worth mentioning that Images 1–25 correspond to the highways; Images 26–50 correspond to the city networks; Images 51–75 correspond to the rural paths; and Images 76–91 correspond to the images from the ISPRS dataset.
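The relation between the per-image values and the per-type averages can be sketched as follows. The index ranges come from the text above; the function names, the category labels and the sample scores in the usage lines are hypothetical and are not results from the experiment.

```python
def road_type(image_id):
    """Map an image index (1-91) to its group in the dataset."""
    if 1 <= image_id <= 25:
        return "highway"
    if 26 <= image_id <= 50:
        return "city network"
    if 51 <= image_id <= 75:
        return "rural path"
    if 76 <= image_id <= 91:
        return "ISPRS"
    raise ValueError(f"image index out of range: {image_id}")


def average_by_type(scores):
    """Average a per-image metric (dict: image_id -> value) per road type."""
    sums = {}
    for image_id, value in scores.items():
        group = road_type(image_id)
        total, count = sums.get(group, (0.0, 0))
        sums[group] = (total + value, count + 1)
    return {group: total / count for group, (total, count) in sums.items()}


# Hypothetical completeness values for four images, for illustration only.
example = {1: 0.5, 25: 1.0, 26: 0.25, 91: 1.0}
averages = average_by_type(example)
```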
6. Discussion
Table 3, Figure 9 and Figure 10 show that the statistical values obtained for the extraction of city road networks, Images 26–50, were the lowest. These low values are due to the difficulty the extraction methodologies have in differentiating the road of interest from other city components. City scenes are very complex and contain a lot of similar surrounding information that confuses the extraction methodologies. In this case, the results are in good agreement with the observation made by Maboudi et al. [28] that the spectral and spatial characteristics of non-road structures can be similar to those of roads, making road extraction more difficult in urban areas. Moreover, the second methodology, when applied to highways and rural paths, achieved average completeness and correctness results over 80%, and for some images the statistics reached 100%. As opposed to city road networks, highways and rural paths are well defined in relation to the surrounding scene, although they still have some interruptions along the road course.
Observing the completeness values obtained by the three evaluated methodologies, presented in Figure 9, it is possible to verify that the second methodology (Cardim et al. [19]) obtained the best values for almost all images, except for Images 21 and 26, for which the third methodology (Ma et al. [4]) and the first one (Sghaier and Lepage [2]) obtained, respectively, the best completeness values. Meanwhile, the correctness values, presented in Figure 10, do not single out one methodology as the best. The Cardim et al. [19] methodology achieved the best correctness values for almost all images containing highways and rural paths, whereas the Sghaier and Lepage [2] methodology achieved the best correctness values for the majority of images containing city roads. All those facts confirm the information presented in
Table 3, where the best average values of completeness and correctness are in bold.
For the three methodologies and the dataset analyzed in this paper, it is possible to claim that city road networks are the most difficult road type to extract, because there is a lot of similar information around them. Moreover, the best average values obtained in the statistical evaluation come from the rural path images, which could be explained by the large contrast between the rural paths and their surroundings. Comparing the methodologies, the Cardim et al. [19] methodology achieved better statistical results than the other two for highways and rural paths. This can be explained by the fact that this methodology is semiautomatic and, therefore, needs some road samples before the extraction process begins. However, for images containing city road networks, the Sghaier and Lepage [2] methodology achieved the best correctness results because the segments it detected belong to the road of interest.
The achieved results indicate that the proposed statistical evaluation and analysis could allow researchers to evaluate and compare road extraction methodologies using the same dataset and characteristics to obtain a more reliable comparison among them.
7. Conclusions
In the scientific literature, many studies have applied road extraction methodologies to different cartographic datasets and reported the respective statistical evaluations and analyses. However, it is difficult for any study to compare road extraction methodologies reliably when their results come from different cartographic datasets. Given the importance of studies on road extraction methodologies and the need for more studies that statistically evaluate and compare these methodologies on a single image dataset, this study proposed a statistical evaluation and analysis of road extraction methodologies applied to the same image dataset. The study aimed to enable a more reliable comparison among different road extraction methodologies from the literature.
This study contributed by creating a dataset of remote sensing images containing three different types of roads: highways, city road networks and rural paths. Moreover, the results show that the study also contributed a digital processing method able to perform the statistical evaluation and analysis of road extraction methodologies applied to the same dataset of remote sensing images. The results indicate that the proposed study could provide a more realistic, and therefore more reliable, comparison among the three road extraction methodologies that were selected from the literature.
The proposed study expands prior studies [8,10,16] by applying the evaluated methodologies to a single dataset containing miscellaneous images of different resolutions and contexts, which are available for future work and analysis. Therefore, this study provides an improvement over previous review studies in remote sensing and support for future research that compares road extraction methodologies using a single dataset of remote sensing images.
However, a limitation is worth noting. Although the findings of this study are promising, the difficulty of reproducing the complexity of road characteristics in a general model remains a challenge. Future work should therefore include more studies that generalize the problems presented in the scientific literature.
In the future, it is expected that this study could be applied without difficulty to bring some advantages for researchers enabling a more reliable comparison of other road extraction methodologies with the same image dataset and to support researchers to perform studies that generalize the problems presented by the scientific literature.