1. Introduction
Fire has played an important role throughout human history, allowing men to transform ecosystems worldwide [
1]. Wildfires can be described as biomass burning, with potential impacts on human life, property, and ecosystems [
1].
Remote Sensing (RS) methods have been widely applied to incorporate data in fire risk assessment studies. The application of RS methods usually falls into three categories: forecasting systems that predict favorable conditions before fire occurrences; monitoring systems for active fire detection; and monitoring of post-fire conditions, such as burned extent (e.g., burn scar maps) or fire severity [
2,
3].
Wildfire forecasting systems usually focus on addressing probabilities of ignition and propagation (e.g., [
4,
5,
6]). Such models often combine dynamic variables (e.g., wind, temperature, or moisture) and static variables (e.g., slope, land cover, or proximity to specific features), which may be derived from RS techniques [
7,
8].
The detection of active fires through RS can be inferred from thermal anomalies, which may be detected from multiple sensors (e.g., MODIS, VIIRS, Landsat series). Rapid burned area estimations may be obtained from short-revisiting-time sensors, although such higher temporal resolutions usually come at the cost of decreased spatial resolutions (e.g., [
9,
10,
11]).
Multispectral indices obtained from optical sensors are amongst the most widely used to obtain variables to evaluate fire hazard (using time series) and to monitor fire-induced changes to vegetation, including burn severity and regeneration [
12,
13]. Such variables may be used to determine fire risk by using methods that correlate predisposing factors to the occurrence of burned areas, as well as by analyzing how the fire intensity influences ecosystem responses and societal impact [
1,
14,
15,
16,
17]. Multispectral indices, which may or may not have been developed specifically for the identification of burned areas, often result from a combination of two or more bands, from visible to short-wave infrared ranges [
18] (some examples are included in
Table 1). Such combinations include Visible (VIS) and Near-Infrared (NIR) couples, e.g., the Burned Area Index (BAI) [
13], which has been developed specifically for burned area discrimination, or the Normalized Difference Vegetation Index (NDVI) [
19,
20] and the Normalized Difference Water Index (NDWI) [
20,
21]. Other combinations consist in grouping Short-Wave Infrared bands (SWIRs) (such as the SWIR1 bands from Landsat sensors) with VIS or NIR bands, e.g., the improved version of BAI (BAIMs) [
9] or the Normalized Burn Ratio (NBRs) [
22], the latter allowing for discriminating different levels of fire severity. Finally, other indices include longer short-wave infrared bands (SWIRl) (such as the SWIR2 from Landsat sensors), e.g., the alternative versions of BAI (BAIMl) and NBR (NBRl), the Mid-Infrared Burned Index (MIRBI) [
23], or the Normalized Burned Ratio 2 (NBR2) combining SWIRs and SWIRl [
24].
Multispectral indices can be used for uni-temporal classifications (post-fire) or bi-temporal (pre- to post-fire difference) approaches [
12,
25]. Among bi-temporal change detections, the Differenced Normalized Burn Ratio (dNBR) is among those with the best performances [
26], demonstrating significant correlations with field measurements using images captured within ca. 30 days after fire events [
27]. However, as with other indices, it shows poor discrimination between burned areas and other surfaces, such as water, bare soil, or areas with little vegetation [
25]. According to [
18], the use of indices including SWIRs and SWIRl, has been demonstrated to mitigate errors, particularly commission errors, such as false assignments of clouds, cloud shadows, topographic shadows, and water. Such improvements have been explained by the distinctive spectral signatures of water and burned areas beyond the NIR region, where water tends to absorb longer wavelengths almost completely, while burned forest reflectance remains fairly constant or shows a slightly growing trend (e.g., [
28]).
The extraction of thematic information from spectral indices is usually carried out using one or more thresholds, which may be used to define two or more classes (density slicing) [
31,
32,
33]. However, as with any spectral index, finding optimal thresholds is often a difficult task, since they are usually scene-dependent [
34,
35]. Additionally, the characteristics and spectral reflectance of burned areas may also be highly variable, depending on fire severity or the density of pre-fire vegetation [
29,
36].
One of the main objectives of this study is to develop and implement a method based on satellite RS imagery and GIS techniques, using an integrated multi-index image-differencing approach aimed at determining burned areas, representing an advancement in respect to the applications of the single indices listed in
Table 1. The method develops from the work of [
34], and it consists of the analysis of the frequency distribution of a set of index differencing statistics, as well as the combination of different single-index classifications by means of a spatial majority analysis. The method was implemented in a fully automatic procedure and tested for a study area in northwestern Portugal.
2. Methods
The method developed in this study consists of an adaptation of the Multi-INDEx Differencing (MINDED) method [
34], which was originally developed to estimate flood extents. For this work, we have incorporated the same methodological principles, yet with a different purpose: the detection of burned areas. The assembly of the Multi-INDEx Differencing method for Burned Areas (MINDED-BA) follows the workflow illustrated in
Figure 1, incorporating Burn-related Indices (BrI). Non-normalized indices introduce data magnitude disparities, which may be exacerbated both in multitemporal analysis contexts and by mathematical derivative operations, potentially requiring additional computational processing and leading to threshold selection issues [
34,
37]. For these reasons, we decided to implement MINDED-BA by using the following normalized indices: NBRs, NBRl, NBR2, NDVI, and NDWI.
Given the specificities of MINDED-BA, particular considerations should be made for defining the requirements of the satellite images dataset. Firstly, we should perform an analysis to identify the typical fire season period of the study area. In the case of the study area selected to test MINDED-BA (northwestern Portugal), the temperate Mediterranean climate fire season spreads from boreal spring to late boreal summer, particularly during the highest temperature months (i.e., July, August, and September) [
38]. This means that the estimation of yearly burned extents may be assumed to be detectable from images acquired prior to (
t0) and after (
t1) each year fire season. Moreover, in order to minimize false change detections associated with different phenological cycle stages,
t0 may be acquired after the end of the previous fire season. This also means that, in a context of a multi-year analysis, every
t1 scene may be used as
t0 for the following year. The calculation of the BrI should be performed using surface reflectance values, which may be readily available for some sensors (e.g., Landsat series Level 2 or Sentinel-2 Level 2A). Another important issue is to address either
t0 or
t1 conditions that may result in false detection of burned areas. These might include the occurrence of features such as clouds, cloud shadows, and topographic shadows, as well as water-related changes (e.g., surface water bodies, soil and vegetation water content). Ideally, to minimize such errors, images should include none of these elements. In practice, additional conventional pre-processing steps may be performed in order to minimize the effects of the above conditions (e.g., [
39,
40,
41,
42,
43,
44,
45]). Moreover, certain land cover changes (e.g., conversion and/or modifications due to clear-cuts, cropland harvesting, or other soil mobilization practices) may produce index differencing results similar to those from burning. In order to address these cases, specific pre-processing analyses should be performed using remote sensing techniques or other GIS operations based on ancillary information. These analyses represent one of the focuses of the methods developed in this research, and they will be later addressed in
Section 4.2.
After the pre-processing phase, the following step of MINDED-BA is to calculate every BrI. This task should be performed (e.g., using the equations of
Table 1) for both
t0 and
t1.
Then, each corresponding ΔBrI may be calculated using Equation (10):
Considering each BrI’s specific spectral reflectance bands and the arrangement of Equation (10) (i.e., with
t0 as the minuend and
t1 as the subtrahend), the following task is to identify the range of values of ΔBrI that are expected to correspond to burn-related changes. For example, considering NDVI, burned areas are usually found to have negative and near-zero values, while green vigorous vegetation is characterized by positive values. Hence, according to Equation (10), those changes from green vegetation to burned result in positive ΔNDVI values. In fact, every other BrI considered in this study performs in a similar way, with recently burned areas corresponding to positive ΔBrI values, whereas longer-term post-fire vegetation regrowth areas are detected as negative values (e.g., [
16]).
Then, thematic classifications of changes are obtained by applying thresholding techniques to every ΔBrI. To this end, we considered an procedure analogous to [
34], consisting in analyzing each ΔBrI data distribution function (
f) and corresponding first and second-order derivative functions,
d1f and
d2f, respectively. Whenever the shape of
f allows separating no-changes from changes (perfectly separated multi-modal values), thresholds may be directly extracted from this distribution curve. This is a conceptual theoretical condition, but in practice, depending on how gradual the transition is between no-change and change conditions, this threshold has to be estimated using either
d1f or
d2f.
Similarly to [
34], we also chose to select two thresholds,
T1 and
T2, to obtain two classes of magnitude of change, Low-Magnitude change (LMc) and High-Magnitude change (HMc). With these, we expect to detect light burning conditions, such as partially burned canopy, or severe burning conditions, such as complete burning of initially vigorous green vegetation (e.g., forests with high leaf chlorophyll concentration), respectively.
The MINDED-BA workflow continues with the combination of the coeval classifications obtained from the single ΔBrI images. Following the same procedures used with MINDED, we performed a spatial majority analysis, allowing us to obtain the overall burn classifications and to estimate uncertainties.
The final step of the model consists in quantifying accuracies, which may be performed by comparison with either ground truth data or reference ancillary spatial information. In the case of the study area of this paper, we compared the MINDED-BA outputs with the official dataset of yearly burned extents by Portugal’s Institute for Nature Conservation and Forests [
46], which was used to provide our reference burned areas (RBA).
All the above steps of MINDED-BA (
Figure 1) were also conceptualized with the aim of developing a fully automatic processing tool for rapid and replicable extraction of burned areas from large multitemporal RS imagery datasets. The model was implemented with a GRASSGIS Python script, while the map outputs were obtained with QGIS, both being free and open-source software.
5. Discussion
This study presents a change detection model for determining burned extents from wildfires. It consists of the development of a previous model (MINDED), originally designed for determining flood extents [
34]. Despite both being weather-related hazards [
60,
61], flooding and wildfires have antagonistic causes and effects. Floods are often the consequence of extreme precipitation events, while wildfires tend to occur and proliferate during dry periods. Nonetheless, this work demonstrates that the same theoretical principles can be applied to determine the extents of both hazardous processes. This can be done by selecting different hazard-specific indices or by focusing on different sides of the same index difference distribution curves. Since MINDED and MINDED-BA share the same theoretical principles, they also share certain advantages and limitations. By combining several indices based on different multispectral regions, both methods allow narrowing the range of types of change, in which burned area detection is also included.
When analyzing processes such as wildfires, which have non-Boolean characteristics and result from interaction with other natural and anthropic processes, it is essential to define an adequate set of procedures to ensure the quality of the modelled results.
Figure 16 highlights both the advantages of the multi-index approach (Multi-index) and the importance of the pre-processing developments (Masking) introduced in MINDED-BA, in that the final result shows clear improvements when compared to the reference burned areas (RBA) [
46]. Nevertheless, as an index-differencing-based approach, it is still not completely able to discriminate each type of change.
As a digital change detection method, MINDED-BA is intrinsically dependent on the quality and quantity of the information conveyed by the images used for both
t0 and
t1 scenes. This means that the choice of the time of acquisition and frequency of the image couples should take into consideration the purpose of the research, as well as the geographical location and the characteristics of the study area. In our case, for the estimation of yearly burned areas in temperate Mediterranean climate regions,
t0 and
t1 should be acquired as close as possible to the end of each year, to ensure that each forest fire season has ended. Considering MINDED-BA relies on multispectral images acquired by sensors with revisiting periods of several days, such as those provided by Landsat or Sentinel-2, this might present additional difficulties. The first difficulty is to find cloud-free images during the wet season, which for particularly rainy years may be hard to obtain, especially for older epochs with a more limited scene availability. For this reason, it was not possible to select images acquired exactly at the same time of every year, ranging from late September to late January of the following year. This may be considered as an important source of error, since more changes other than burning may become detected by MINDED-BA (e.g., phenological cycles, seasonal agricultural or forestry management practices). On the other hand, the selection of only one couple of images per year may prevent detecting changes in recurring fires taking place in the same location within less than a one-year interval (e.g., in areas experiencing fast vegetation regrowth). Additionally, due to such limited image availability and the total time extent of this study (i.e., 19 years), we had to consider couples of images acquired by different sensors (
Table 2). Nevertheless, as referred to by [
34], we verified that using different sensors for
t0 and
t1 does not affect the accuracy of the model significantly, while causing a shift of the ΔBrI modal values and, consequently, of the threshold values (as observed in
Table A3 and
Figure 8). As verified in
Figure 11, no major inconsistencies were found when comparing results obtained from relatively concomitant Landsat 8 or Sentinel-2A scenes. This fact confirms the robustness of the model workflow (
Figure 1), which performed similarly in detecting burned areas (compared to the RBA). Nevertheless, the few differences highlighted in
Table 3 and
Figure 11c,d,g,h and
Figure 13 are more likely resultant from local water-content-related changes occurring during the ca. 11 days separating the
t1 scenes acquired by each sensor. In fact, another difficulty for MINDED-BA is that several phenomena other than fires may constitute false positives for every ΔBrI, which may only be mitigated with further pre-processing steps.
Regarding the atmospheric correction of the multispectral images used as
t0 and
t1, to ensure the replicability and robustness of MINDED-BA results, we considered the Level-2 products of both Landsat and Sentinel-2 (i.e., given as surface reflectance), which are determined by the USGS and ESA algorithms [
41,
62,
63,
64]. For the same reasons, we also considered the quality assessment bands (‘pixel_qa’ and SCL, respectively), which were used to mask clouds, cloud shadows, water, and snow features. Nevertheless, considering the relative stability of permanent water bodies within our study area and the ready availability of spatial thematic data containing their limits, we defined a buffer area with the objective of masking such permanent water bodies. This ensured that none of those pixels remained undetected by the Level-2 algorithms and avoided any further interference with the MINDED-BA results.
Additional criteria are also necessary to discard other known potential interferences such as forest/shrubland clearcutting, agricultural crop harvesting, or soil mobilization occurring between
t0 and
t1. This might be achieved with pre- or post-processing masking of highly reflective surfaces (HRS), which might occur in
t1, resulting in false positive detections. Such a task may be implemented by creating a user-defined training dataset or by considering reference literature values. The first option has the advantage of providing the best correlation with the characteristics of a study area, but may require a priori knowledge of both bare soil and burned area locations (e.g., using analogous approaches to [
35]), being more dependent on the end-user expertise and technical skills. On the other hand, classifications based on reference land cover literature values allow for implementing the model in a fully automatic routine without any subjectivity issues. The latter also has the potential of being applied as a ‘black-box’ tool, targeted for non-experts in remote sensing techniques. Since the thresholding procedures of MINDED-BA rely entirely on the index differencing statistics, there is a clear advantage in implementing masking as a pre-processing step (as opposed to post-processing), because by eliminating such interferences, the signal separation between changes and no-changes is also improved (i.e., noise reduction). For these reasons, we adopted a theoretical reference value approach to define a thresholding range applied to those multispectral bands that maximize the separation between burned areas and HRS. Then, we determined sensor-specific Tasseled Cap Brightness (TCB) values for a series of increments within the thresholding range, which were later used to mask HRS from every
t1 scene. In principle, we expected the best masking to correspond to those increments positioned towards the middle of the thresholding range, as their position was likely to maximize the difference towards the HRS theoretical reflectance signatures and to provide the best trade-off between the Total Changes commission errors (TC
CE) and the Burned Areas omission errors (BA
OE). In fact, if MINDED-BA was to be applied without any complementary information or additional analysis, M2 or M3 would likely be the safest alternatives. Instead, we found that the lowest threshold (i.e., M1) was the best option for our case study (see
Figure 4). This result may be explained by the different spectral characteristics of the burned areas, as well as the reflectivity of HRS due to local pedology and geology. Moreover, we would like to highlight that instead of the TCB, we could have directly integrated the same multispectral bands, as sum or average of reflectance values. After trying such approaches, we verified that despite having similar results, the TCB provided slightly better masking of HRS, obtaining marginal but consistent improvements (ca. 1%) for every parameter of the confusion matrix.
The optimal bin selection approach introduced in this work is a further development to the MINDED threshold selection procedure. It consists of a conceptual interpretation of the shape for the ΔBrI frequency distribution function, depicting what are the expected indications caused by noise and signal variations. Despite the different ΔBrI under analysis being obtained by integrating different multispectral band combinations and the tested bin number values being characterized by a set of 15 samples in a quite large range (10–1259), the obtained range of optimal bin numbers is concentrated in only 3 samples (i.e., 28, 40, 56-see
Figure 6). These findings may be a consequence of the effects of the time span between
t0 and
t1 images, as well as the extent and type of changes. However, further research should be performed to truly understand this behavior. As for the thresholds T1 and T2, in theory these should fall within the range of ΔBrI values 0–2 (as only normalized indices were considered). The results show that T1 is more concentrated in a small range of values (interquartile range 0.10–0.36), while a wider range characterizes T2 (0.45–0.99). We can conclude that the optimal bin number distribution spreads within ca. 20% of the sampled range, while the threshold T1 spreads within ca. 13% of the expected range, hence suggesting high stability regardless of the ΔBrI. On the other hand, T2 spreads within ca. 27%, suggesting higher variability.
Regarding the threshold distribution, there are several factors affecting their magnitude and variability. Such factors include the types of changes occurring between
t0 and
t1, as well as the sensors used to detect them, which in some cases are different between the two epochs (
Table 2). For these reasons, such threshold variations should not be interpreted as an indicator of uncertainty, as they reflect the method’s response to a shift of the modal value of the frequency distribution functions. The higher variability observed for T2 should also be more dependent on the optimal bin number value, because the higher the bin number, more intermediate thresholds may be detected (
Figure 7).
If not addressed correctly, or addressed at all, some of the above-mentioned conditions may still produce effects over the entire MINDED-BA model chain. In those cases, the ΔBrI distribution functions and their corresponding
d1f and
d2f may be affected, triggering the possibility of detecting sub-optimal thresholds and leading to less accurate classification as burn-related changes (either as LMc or HMc). Despite all the pre-processing steps, MINDED-BA seems to have a tendency of overestimating burned areas in comparison to the RBA (as observed in
Figure 10,
Figure 11 and
Figure 14 and
Table 5). This tendency is likely a consequence of the wide and flat topography of the lower section of the Vouga River Watershed. In some cases, we found several LMc areas to occur nearby river bodies that are prone to flooding [
34] (e.g., for 2000, 2009, 2014, and 2018). Moreover, MINDED-BA may be also sensitive to either higher differentials of vegetation greenness or water contents between
t0 and
t1 images, as might be the case for 2013 and 2016. This means that in such cases,
t0 was probably acquired during particularly wetter or even flooded conditions and/or
t1 during a drought. Both these conditions were observed for 2016 (see
Figure 11), especially for
t1, which was acquired during a well-documented drought period [
65,
66]. As an alternative, both
t0 and
t1 scenes could have been acquired earlier than the wet season, but with the risk of missing burned areas from those years.
From the initial set of normalized indices, we found ΔNDWI results to have insufficient performances in detecting burned areas (
Figure 9), so we discarded it from further steps of MINDED-BA. For these reasons, we ended up using four ΔBrI. In general, we found ΔNBRl to have the best individual performance for most parameters of the confusion matrix analysis, while ΔNBR2 achieved the best overall accuracy with the RBA (
Table 6). These findings confirm the indications found in the literature [
18,
28], which highlight the benefits of considering both SWIRs and SWIRl to improve the discrimination between burned areas and water. However, as observed in the examples of
Figure 9, ΔNBR2 seems to underestimate certain burned areas, probably resulting from reduced fire severity conditions. The majority analysis of MINDED-BA has been demonstrated to handle most individual ΔBrI limitations, meaning that the overall maps of
Figure 10 represent the best combination for most of the confusion matrix parameters (
Table 5). Moreover, the uncertainty assessment, which is enabled by the majority analysis, provides additional and relevant information about those locations where MINDED-BA tends to perform with different levels of confidence.
The analysis of the confusion matrix along with the RBA extents indicates that, in general, the larger the burned areas, the better the performance of the MINDED-BA. This is likely the consequence of MINDED-BA being a statistically dependent method, requiring that the frequency of pixels corresponding to burn-related changes should be large enough to produce measurable changes to df1 and df2 functions (though not enough to become the majority). Nevertheless, the method accuracy is also dependent on the occurrence of other types of changes that may lead to detecting false positives, such as those verified in 2016.
Despite being the widest within our study area, wildfires are not exclusive to forests or agricultural areas. In particular, during those years with the largest burned areas, many fires involved other land covers, such as artificial or even wetland areas (e.g., for 2017). Such heterogeneity of burned surfaces increases the difficulty in finding optimal thresholds, which ideally would have to be transversal to detect burn-related changes for every type of land cover.
Regarding the ICNF official burned areas, which were used as our RBA, they are the closest available dataset that may be used as a ground truth alternative. However, this dataset is in practice a product of a multi-source survey, combining fieldwork and remote sensing data interpretation. This means that these are also affected by unknown errors that, as an example, may overestimate the real extent of burned areas (e.g., as seen in
Figure 4, where the outer RBA polygon seems to be including several non-burned islands) and consequently increase the BA
OE. Additionally, this ICNF dataset is mostly Boolean in terms of the occurrence of fires; therefore, we have no way of verifying which fire severities are being considered. Nevertheless, in recent years, ICNF has started releasing additional information, such as date, type of fire (e.g., shrubland, agriculture, and forestry), cause of ignition, or source, which seem to demonstrate coordination with multi-level civil protection agents.
The tendency towards overestimating is a common issue for any image-differencing method based on a bi-temporal analysis, particularly when implemented with couples of images with one year or larger temporal separation. However, since MINDED-BA has been completely integrated in an automatic script, it should be relatively easy to implement it in a systematic routine, in order to analyze different couples of images acquired within a shorter time span (i.e., corresponding to the satellite temporal resolution). This would also enable assessing the yearly burned areas’ extents. In fact, if such analysis was to be performed, the user subjectivity errors when selecting the images would no longer persist, and the overestimation issue should be highly minimized. Such higher frequency multitemporal comparisons would also allow analyzing the persistence (or not) of detected changes across more than a couple of images, which by itself should allow a systematic improvement of burned-extent detection. Finally, considering that MINDED-BA has been demonstrated to be implemented automatically with readily available multispectral satellite data, it may be useful in both regional- and national-level applications. However, when analyzing considerably larger areas, it would be advisable to split the implementation of MINDED-BA by using images acquired along the same orbital path. Any further extents could be joined later as post-classification results, minimizing any potential interferences resulting from different t1 acquisition dates.