1. Introduction
Despite the scientific and technological developments in recent years, there are still natural phenomena that cause loss of human lives and high socio-economic costs in the context of climate change that may increase their severity and frequency. Forest fires, endemic in zones with a Mediterranean climate, together with certain human activities, generate landscape degradation, a barely remediable transformation in the environment, and environmental changes, such as soil erosion or decreasing air quality [
1]. Wildfires take place every year in Spain due to climate conditions, such as the long summer period, combined with low rainfall and high temperatures. These factors affect forest vegetation and facilitate wildfire outbreak and spread. Furthermore, there are anthropogenic causes [
2]; 95% of the fires that occur in the Mediterranean region are produced by human causes, either by accidents, negligence, intentional acts, etc. [
3]
Climate change and changes in land use affect the fire regime [
4]. In the touristic zones and in the metropolitan environments where settlements are in contact with forest zones, this problem is more severe, for obvious reasons, which increase the real state and human exposure to fire risk. In addition, there is an increase in density in these zones due to the proximity of human activities to the forest zone and an increase in the vulnerability of forest vegetation produced by the abandonment of traditional agricultural activities in the peri-urban environments of large cities and holiday dwellings. The expansion of these WUI zones in the United States was very significant [
5] and it has substantially increased the cost of wildfire suppression and the treatment of forest fuels, not to mention the damage that they cause [
6]. This is also the situation in Spain [
7] and has led to detailed studies on its legal regulation in Europe and in Spain, considering the changes in the LULC data and the increased vulnerability that this creates [
8].
The coexistence between forest areas and new settlements, with non-traditional activities in the rural environment, is one of the main characteristics of the concept of the wildland–urban interface (WUI), which is defined in the Spanish forest law (Ley de Montes) 43/2003 as those areas that include housing estates, other buildings, works, electrical installations and transport infrastructures located in or near forest land, that can entail a fire hazard or be affected by them. Royal Decree 893/2013 provides another description of this concept, defining it as zones where buildings meet forest land.
There are particularly good works that delimit the areas in which this urban and forest contact occurs, identifying the typology and the degree of hazard [
9], and indeed the recent evolution of the landscape in many Mediterranean countries is conditioned by the growth in urbanized zones. The expansion of WUI zones in Spain exceeds one million hectares (about 4% of the total forest surface), with an average of 12,500 forest fires per year during the past ten years [
7], gaining prominence in the metropolitan environments of Madrid and Barcelona and in the Mediterranean coast [
10], where urbanizers seek single-family dwellings or second homes, attracted by the aesthetic appeal of a natural environment, which causes a situation that alters the traditional landscape and brings a social and cultural behavior that is alien to that environment and is of danger of a wildfire [
8]. Therefore, it is very important to identify automated methods that help us to delimit this type of WUI zone at different scales and, in this case, the most important metropolitan zones in Spain.
This study is a continuation of a research line initiated in the SIOSE-INNOVA project (subprogram “Retos I + D + I 2016” for 2016 R + D + i challenges, Spanish Ministry of Economy and Competitiveness, CSO2016-79420-RAEI/FEDER UE) in which the exploitation of Geo Open Data repositories of the Spanish national plan for territory observation (PNOT) were tested to obtain the delimitation of these WUI zones in an automated manner for large areas and with a high level of detail. For this purpose, a methodology based on the use of free software tools and standard open data is proposed for testing in the automated detection of WUI zones in the periphery of the metropolitan areas of Madrid and Barcelona (
Figure 1).
Technological developments in remote sensing and in the geographic information process have led to the generation of highly detailed data that are of great use in the study of wildfires in WUI zones [
11]. In this respect, the use of Airborne Laser Scanning (ALS), also known as LiDAR technology, is noteworthy for determining the density of forest fuel, measuring hazard and mapping wildfire risk in settlements, [
12] as well as for generating models for the automated identification of firefighter safety zones [
13]. LiDAR data and aerial photographs were used to determine the volume of forest masses, land slope or flame height. Some methodologies using laser information as fundamental data were developed to model fire behavior during a wildfire [
14].
There are good examples of the delimitation of WUI zones using ALS echoes in Spain, such as the analysis carried out by Badia A. and Gisbert, M. [
11] at an experimental level in the metropolitan area of Barcelona, although limited to a very specific area. In Galicia, Robles et al. [
15] successfully obtained the exposure of settlements to wildfire risk through non-automated analysis of LiDAR data, or by using decision trees and Geographic Information Systems (GIS), as in the case of Fernández-Álvarez, M. et al. [
16]. Initial research in the SIOSE-INNOVA project for the delimitation of WUI zones began with the use of PNOT LiDAR in Valle del Tiétar (Segovia) and in Camp del Turia (Valencia), through active remote sensing of zones where vegetation and buildings were in close proximity to each other [
17,
18,
19].
All these works demonstrated the usefulness of LiDAR data in the study of forest vegetation, wildfires, delimitation of WUI zones, and a highly detailed characterization of vegetation, although in small areas. The advantage of using LiDAR in terms of geographic accuracy also entails its main disadvantage: the large volume of data to be handled and the problem of its processing. When these volumes of information refer to entire regions of Europe, the use of robust databases and the automation of processes is necessary.
The use of LULC databases to delimit WUI zones in Spain has also proved to be useful [
20], although Corine Land Cover (CLC) data are not the most appropriate to achieve detailed results; however, they are unquestionably useful in the creation of evolutionary maps for Europe as a whole [
21], for regional realities [
22], or even provincial realities [
23]. In the research undertaken within the SIOSE-INNOVA project, studies were carried out on the identification of WUI zones, testing the suitability of official land-use databases (Corine Land Cover and SIOSE) in verified scenarios, such as the provinces of Navarra and Castellón [
24]. The problem is the reference scale of this type of database. The Corine Land Cover (CLC) geodatabase is particularly useful for a reference scale of 1:100,000, and SIOSE for a scale of 1:25,000, but, in both cases, there is a geometrical ambiguity that makes it impossible to accurately determine the buildings exposed to risk, as information is essential for planning and preventive work.
In line with the work of this project, the Master’s Thesis by León, P. [
25] and Navarro Carrión, et al. [
26] demonstrated the effectiveness of combining LiDAR data and LULC databases to determine the zones exposed at different scales and with the geometric accuracy needed to precisely delimit the buildings exposed at a detailed scale in the entire province of Alicante. This research began to point out solutions to the problem of Geo Small Data in order to be able to process these sources and identify the zones that are the subject of study within large areas (provinces or regions), with an appropriate level of precision for the local approach.
Our contribution aims to automate the working method and optimize the use of Geo Small Data techniques that allow for the massive processing of data from the combination of LiDAR point clouds, SIOSE, aerial photographs, satellite images, etc. [
26]. For this purpose, the methodology was applied to more complex zones, such as Madrid or Barcelona, in order to set a model that enables the delimitation of WUI zones in any geographic context within the country, or even for Spain as a whole, and that can be easily updated as the official information sources are updated. The results of this research were verified by means of statistical sampling methods, which, along with the photointerpretation of aerial photographs and the use of the web map service (WMS) services of the SIOSE field photographs, allowed us to successfully verify the accuracy obtained in the determination of the WUI zones used as the study areas.
This research began to point to solutions to the Geo Small Data problem in order to process these sources and identify the areas under study within large areas (provinces or regions), but with an adequate level of precision for the local approach. Our work is an evolution of this methodology, which has contributed to increasing the level of automation of the processes of this work and the creation of new automated processes. Fundamentally, all the processes related to the downloading of data from the servers of the National Geographic Institute and their entry into the geodatabase of the research laboratory were automated and improved. We worked in different autonomous communities, not only in the Valencian Community, because the LiDAR information does not have the same characteristics for all the areas; although some minimums are fulfilled in all of them, there are areas, such as Madrid, where the information is denser and more detailed. This research allowed for the contemplation of all these variations in the information and definitive automation of the data download processes. In addition, techniques for checking results with aerial photographs, the systematic sampling of field areas and cross-checking with a repository of field photographs were used and improved.
However, although the obtained results were closely verified through photointerpretation and official graphic documentation, future research should continue with the formulation of qualitative verification protocols that are based in fieldwork and object-oriented remote sensing (OBIA). Therefore, the purpose of this article is limited to the automated method of delimitation of WUI zones based on official and open data, but it does not involve research on WUI in terms of landscape evolution or a qualitative assessment of different WUI typologies in metropolitan zones.
3. Results
The proposed methodology was applied to the selected case studies according to the seven phases: downloading basic information, the creation and structuring of the database, intermixing and interfacing the SIOSE polygon input, the selection of LiDAR points from intermixed and interfaced polygons, the creation of LiDAR point clusters for fuel and building, the determination of intermix and interface exposure, and, finally, the research development and results.
3.1. SIOSE-Based Determination of Exposure to Fire Risk
The implementation of the methodology developed by Moreno, V. [
24], extended in this research and applied to the provinces of Madrid and Barcelona, led to twelve relationships stored in the
WUI schema of the database. The geometries generated from these relationships provide the metrics of the records computed in the processes.
Figure 4 and
Figure 5 represent the entire territory, analyzed according to its type of exposure in an intermix or wildland–urban interface, obtained through the polygons of the SIOSE 2014. The polygons marked in red are the result of intermix; they are areas where residential use coexists with 50% or more of the forest mass. The polygons marked in orange are the interface ones containing residential use and located at a distance equal to or less than 100 m from other polygons with at least 75% or more fuel inside.
The areas exposed to fire risk in the province of Madrid represent a little less than 1% in an intermix situation and almost 2% of the territory in an interface situation according to the results obtained from SIOSE, whereas in Barcelona, they represent a little more than 2% in an intermix situation and almost 8% of the territory in an interface situation. The areas are represented at a larger scale in order to evaluate the degree of detail achieved. In Madrid, the study was conducted in the municipalities of El Escorial, Galapagar and San Lorenzo del Escorial, and in Barcelona, in the Garraf region, with the aim of representing urban areas with abundant forest mass. For that purpose, a sampling of the analyzed data population was carried out, which is described in the section below.
Calculation of the Sample Size
In a move to achieve methodological rigor in the different areas of the research for the development of the study phases, it was necessary to take a sample of the obtained results to identify the effectiveness of the methodology. To validate the results, we accessed the WMS of fieldwork photos taken during the development of SIOSE, and the PNOA aerial photography WMS. However, the following considerations were made to estimate the sample size: for intermix and interface polygons, the work was performed with a 10% error and 90% confidence level, considering that the sample is 50% heterogeneous.
The samples selected in municipalities of Madrid for intermix and interface consisted of 16 and 48 polygons (
Figure 6a), and for Barcelona, in the Garraf region, they consisted of 42 and 60 polygons (
Figure 6b). They were defined by using the random selection tools offered by QGIS, which made it possible to obtain a sample that was spatially well distributed. The selected areas show significant territorial differences in terms of their land-use distribution model, their surface and the relationship between residential zones and forest mass.
The proportion of affected areas (
Table 2) in the intermix is 1.1 to 1, while in the case of the interface, the proportion between polygons of buildings is 1.8 to 1, which may indicate that the exposure in the selected zones of Barcelona is higher than in the selected areas in Madrid.
Based on the samples analyzed for the evaluation of the situation of intermix and interface with SIOSE, it was decided to develop sheets to assess, verify and plot the polygons concerning the sources mentioned above. In order to further refine the results, three levels of exposure (high, moderate, low) were established, as seen in
Figure 7, which will be compared against the results of the operations performed with the LiDAR data. The high exposure value indicates dwellings that, in the event of a fire, would be totally compromised. The moderate value indicates dwellings that could be affected by fire to a lesser extent, due to their distance from the fuel. The low value indicates dwellings that coexist with the fuel but, due to the limited presence of forest mass or to their greater distance from it, would be affected by a fire to a lesser extent than in the previous situations.
It is worth noting that, in the intermix evaluation, the cases found in which SIOSE fails to recognize the exposure or produces “fake positives” (
Table 3) are mostly due to the absence of homes within the polygons. In the case of the interface, it is more associated with the geometric ambiguity of the SIOSE information for the different coverages or uses within the same polygon with more than a 100 m distance from the fuel to the home for these cases. The help of the LiDAR information was essential in these cases to identify the exact location of the thematic information categories of the Land Use Land Cover surfaces (LULC) associated with each of the polygonal entities of SIOSE (association of several thematic attributes to a single geometric object).
It can be pointed out that, for the Garraf region, where the urbanistic model is based on tourism and expansion derived from second homes, the sampled results showed that the assessment was correct for between 85% and 95% of the cases (
Table 3), while for the municipalities of Madrid, it was correct for between 95% and 100% of the cases. In addition, there is a differentiated exposure between the study zones because in the municipalities of El Escorial, Galapagar and San Lorenzo del Escorial, there is a larger number of isolated houses that are in contact with forest and silvopastoral areas, whereas the exposure in the Garraf region mostly affects more consolidated housing areas that coexist with an abundance of fuel. However, the sampling indicates that the methodology applied is effective in identifying exposure and that it is useful for the territory under study.
3.2. LiDAR-Based Determination of Exposure to Fire Risk
Nonetheless, in order to further determine the level of exposure, LiDAR data were used. After obtaining the clusters of vegetation and buildings, it was decided to verify the results generated with the SIOSE database and assess how these results improve with the incorporation of LiDAR data. For this purpose, we generated sheets that show the percentages and levels of exposure determined by the sampling.
Regarding the Garraf region (
Table 4), the sampling determines that the methodology is efficient for between 88.5% (interface) and 100% (intermix) of the cases, and exposure levels reclassified through the initial SIOSE identification are recognized; these results can be inferred for the rest of the region.
It is worth noting that the LiDAR points corresponding to homes or forest mass that were identified in the analyzed sample made it possible to adjust the exposure results and to bring them closer to the reality of the territory, which is also true in the case of polygons that, when using only the SIOSE database, showed a specific exposure level that was represented by a higher level of accuracy when complemented by the LiDAR information.
Unlike the results of Barcelona, the sampling in the municipalities of the metropolitan area of Madrid is effective for between 91.7% and almost 94% of the cases. This difference is related to the higher accuracy of the LiDAR data in this zone (Madrid = 1 echo/sqm vs. Barcelona = 0.5 echoes/sqm), which slightly influences the quality of the results obtained.
According to the approached methodology, the zones exposed to fire risk in the wildland–urban interface within the sampling performed (
Figure 8 and
Figure 9) correspond to the areas represented by the filtered clusters.
Once the quality of the results was determined, the intermix and interface polygons identified by SIOSE were re-classified, with the positive clusters indicating that the exposed area was larger in the intermix than in the interface. These areas corresponded to the intersection between the buildings–forest mass at a distance of less than 100 m. However, within the sample, 14% corresponded to interface polygons, which, for SIOSE, were fake positives and which, when using LiDAR, reduced this percentage to 11% since they allowed for re-classification of the exposure. It must be emphasized that it would have been impossible to reach such a degree of accuracy without the combination of both official repositories (SIOSE + LIDAR).
4. Discussion
This research allowed access to and the handling of information that is difficult to use due to its volume, structure, management and characteristics. A methodology was proposed to enable its use and access for users in different management levels for the development of territorial action plans, emergency plans, plans for self-protection against fire risk, and plans for a more appropriate protection of the environment in the planning of urban development projects.
The research shows that it is possible to create and manage a geodatabase containing land-use information provided by official sources (SIOSE) in combination with the voluminous LiDAR files from PNOA. Storing point clouds in patches and processing them through clusters allows for accurate identification of the WUI zones exposed to fire risk, achieving a high level of detail, applicable at different scales. The use of Geo Small Data approaches, together with the parallelization of processes, allows for high-quality results within a reasonable time.
The proposed methodology enabled the automation of a replicable process of calculation and estimation of the built-up areas exposed to fire risk, at a scale of detail appropriate for any urban planning work, estimating which areas are the most exposed to risk, and is applicable to accelerated urban expansion dynamics, such as the case of Barcelona or Madrid. The data sources are open, standardized and regularly updated, thus responding to a public interest, due to the impact generated by the exposure to fire risk on the safety of people and real state in the study zones. Updates allow for chronological monitoring of the phenomenon, as the official geographic information repositories are renewed.
Obtaining SIOSE polygons in intermix and interface was essential to the recovery of LiDAR points with such a high level of accuracy. To verify the results, the WMS services of the SIOSE 2014 photos and latest PNOA orthophotos were used, which allowed for a visual analysis of the study area and can always be complemented by the corresponding field work.
It should be noted that a correct analysis of cluster filtering is necessary, so that the calculation of the interaction between the variables of forest fuel–distance–residential zones provides a real estimate of the compromised areas. This situation was appropriately solved thanks to the improvements achieved in the proposed methodology in previous works [
24,
26], which were implemented in other geographic environments (the Navarra region in the Pyrenees mountain area, and the region of Valencia in the tourist Mediterranean coast) but which share the same objective and application as this study. It is worth mentioning that the major methodological challenges to be solved refer to errors in the classification of LiDAR pulses and the difficulty of handling a point cloud as large as the ones present in this research work. The total computation time was 20 h, and it was performed with a conventional computer with common features and free software.
The use of LiDAR data acquired by unmanned aerial vehicles (UAVs), combined with other passive and active remote sensing data, has the greatest future for fuel mapping of the wildland–urban interface (WUI), using machine-learning algorithms [
33]. It is also necessary to highlight the importance of short-range LiDAR for field data collection, together with the application of qualitative and quantitative mapping methods to visualize land use in a dynamic context since it is possible to record dynamic phenomena in space, thanks to images obtained cyclically by UAVs [
34]. This information would be of great value in very detailed studies for emergency management and risk prevention in those areas where the existence of fire risk exposure was previously detected. In addition, advanced topographic methods, such as the GNSS method and the object of topographic information based on low-altitude aerial imagery [
35], together with field LiDAR, are decisive tools for these purposes.
However, in the case of this research, the methodological proposal was limited to the maximum exploitation of official repositories of geographic open data with national coverage for studies applied to wide regions, but with very detailed results. Perhaps it would be useful in the future to further the research in the most exposed areas detected with this methodology with the aforementioned technologies.
In the introduction, we referred to many previous studies that demonstrated the usefulness of LiDAR data in the study of forest vegetation, forest fires and delimitation of WUI zones. We also pointed out that the advantage of LiDAR accuracy also entails its main disadvantage: the large volume of data to be processed. This is an advantage or disadvantage that depends on the scale of the work and the level of precision. A single LiDAR file from the Download Center of the National Geographic Institute of Spain (CNIG) only covers an extension of 2 × 2 km in Catalonia and 1 × 1 km in Madrid; each of these files can contain a cloud of up to 9 million points, with more than fifteen thematic information fields associated with each of them.
Thanks to the combination of the official geographic information repositories used—SIOSE and LiDAR—in this research, it was possible to process the information from two European regions in a massive way and, at the same time, obtain a great level of detail. This went beyond the reference scale of the SIOSE database (1:25,000) and ruled out the false-positive scenarios that were classified as positive in the SIOSE polygons due to the ambiguity geometric that presented the uses of the SIOSE enclosures. On the other hand, the SIOSE data and aerial photography also made it possible to improve the LiDAR point cloud classification data, thereby producing a synergy between the two sources of information.