4.1. Spatial Transferability vs. Site-Specific LFMC Empirical Models
Extrapolation of empirical models previously fitted for the same shrubland species in a different geographical region (Madrid, central Spain) to the study area (Andalusia, southern Spain) showed that direct transferability of LFMC models based on remote sensing has limitations, even for the same monospecific vegetation. Some authors have assessed the spatial transferability of empirical models derived from satellite data, including LFMC estimation in shrubland and grassland, with moderate results [
11]. In the present study, despite the good fit between observed and predicted LFMC in all formulations tested (R
2 from 0.60 to 0.65), RMSE and MAE were 5% higher (in absolute value) than the errors previously reported for independent validation of the empirical models in the Madrid region [
21]. Nonetheless, our results confirmed the validity of transferring previous
Cistus ladanifer models to different geographical areas with similar shrubland vegetation using a simple recalibration with field data. Once recalibrated, the existing models provided sufficiently reliable LFMC prediction for operational purposes, yielding good estimation errors (RMSE = 16%, MAE = 12%) under the higher fire risk scenarios (LFMC < 100%). Recalibration provided better results regarding the error level and, most importantly, decreased the undesired LFMC overestimation (predicted values higher than field values) that poses a major threat to wildfire risk assessment (
Figure 4).
As expected, the model performance improved when site-specific empirical models based on field LFMC monitoring data available for the study region were fitted. The results showed a better fit between observed and predicted LFMC in the calibration (R
2 ranging from 0.70 to 0.72) as well as the independent validation (R
2 from 0.68 to 0.69). However, at lower LFMC values (<100%) the prediction errors were similar to those found in the extrapolation of previous models after recalibration (
Table A1 and
Table A2). It is worth noting that previous models were fitted with data from a single plot in the region of Madrid [
21], whereas the new models were derived from data from a set of 27 plots located in three different western provinces in the region of Andalusia. Moreover, the performance of the previous models may also be worse because lower LFMC values were less well represented in model calibration, as Sentinel-2 sensors were not fully operational for the whole study period (2016–2019), i.e., fewer images were available in Marino et al. [
21]. Hence, extrapolation of the previous uncalibrated models was also biased towards overestimation, which is of greater concern for LFMC estimation at lower values (<100%). Our results confirm that empirical models perform better when calibrated carefully by including multiple sites and as wide a range of variability as possible in the field LFMC data used as reference values [
33].
In terms of the modelling equations, the exponential formulations performed better than the other models tested (linear, logarithmic and square root), minimizing errors under higher fire risk scenarios (LFMC < 100%) both for the site-specific empirical models fitted in the study area (RMSE = 15%, MAE = 11%) and for the recalibration of previous models from another geographical area (RMSE = 16%, MAE = 12%). These findings regarding the exponential formulation are relevant and not previously reported in the literature. Previous studies involving empirical modelling of LFMC with remote sensing data primarily relied on linear regression analysis [
22,
33,
34,
35] or alternative nonlinear formulations [
21]. Some authors also suggested more complex modelling approaches, such as artificial neural networks (ANNs) [
36] or generalized additive models (GAMs) [
21,
23], with different results. In general terms, while the more complex models (e.g., GAMs) produced better fits during training and cross-validation, they were not well generalized in independent validation testing [
21]. This is crucial for assessing model transferability and extrapolating predictions to new conditions beyond the calibration data [
11].
Concerning the surface reflectance, the SI that included SWIR bands (NDII, NDWI and GVMI) were expected to provide much more direct estimates of LFMC, as they are directly related to plant water absorption [
11,
19]. Conversely, the SIs including visible and NIR (generally indicating vegetation greenness and growth) were better correlated in this case, being indirectly linked with LFMC through the change in leaf pigment content [
19,
33]. Comparing the SI selected as inputs in the empirical models fitted in each geographical area (Madrid vs. Andalusia), we found that VARI yielded the highest correlation with LFMC in both cases. Marino et al. [
21] reported the same result for this shrub species independently of the satellite sensor used (MODIS or Sentinel-2). Yebra et al. [
26] also reported that VARI (in this case, derived from MODIS) was the SI that was most closely correlated with LFMC in
Cistus ladanifer in a different Mediterranean site in Spain (the province of Toledo), located between the regions of Madrid and Andalusia. However, the best combination for predicting LFMC in Andalusia sites included VARI and EVI, whereas the best results in models fitted in the Madrid site were for the combination of VARI and SAVI. The different model inputs selected in both regions for the same shrubland species (EVI vs. SAVI) may explain some of the limitations for direct transfer of previous empirical models to other geographical areas. Our results highlight that, even for the same shrub species, some SI may be more affected by local site conditions than others (depending on the index formulation and the spectral bands included), providing differences in the spectral responses associated with the LFMC dynamics. EVI was originally developed from MODIS to enhance the sensitivity to a wider range of vegetation conditions, improving vegetation monitoring through decoupling of the canopy background signal and a reduction in atmospheric influences [
37]. Previous studies also included EVI for LFMC estimation with different multispectral sensors, including Sentinel-2 [
38] and MODIS [
34,
35]. Our findings suggest that the spectral variation due to different input sites can be addressed by simply recalibrating existing models. However, this approach requires sufficient field data for each new geographical area. Some authors suggest improving LFMC estimation with empirical models by using spectral data in a normalized form (i.e., difference between maximum and minimum SI values) [
25,
33]. This can better account for differences in cover, species composition and soil background between sites, especially for open shrublands [
33]. The drawback of using the recalibration method proposed in the present study is that it requires a sufficient amount of in situ field data to correctly adapt the previous models to the environmental particularities of the new geographical site. Hence, once having invested in field monitoring in a certain region, fitting site-specific empirical models is preferred over recalibrating previous models from other regions, as it would enable more reliable LFMC prediction.
4.2. Temporal Transferability of LFMC Empirical Models for Wildfire Risk Assessment
The best empirical model fitted for
Cistus ladanifer in the study area yielded a mean absolute error of 15% throughout the year in the independent validation dataset, decreasing to only 10% under the higher risk scenarios associated with lower LFMC values (<100%). This level of accuracy obtained with remote sensing data is considered very satisfactory for operational wildfire management, considering that LFMC values from destructive sampling in field monitoring provide a similar range of error (10–15%) [
21]. Compared to previous LFMC studies with empirical modelling based on remote sensing, some authors have reported varying results in different types of Mediterranean shrubland [
11,
22,
23]. Chuvieco et al. [
11] assessed empirical models fitted in Cabañeros National Park (central Spain) in the years subsequent to the calibration period, reporting considerably lower accuracy compared to our models. However, these authors used empirical models that computed LFMC for a mixture of shrubland and grassland, reporting significantly different weather conditions than during the calibration period, which may have strongly affected the results, as herbaceous species exhibit a significantly faster and wider range of LFMC variability [
21]. In the eastern Iberian Peninsula (Valencia region), Costa-Saura et al. [
22] performed a short-term preliminary study also using empirical models with SI derived from Sentinal-2 as input variables. These authors reported similar prediction errors as in the present study but only assessed one summer season by cross-validation (June to October 2019). For the same study area (Valencia region), Arcos et al. [
23] increased the LFMC field monitoring data (June 2020 to November 2021) and also reported good results in independent validation for selected plots during the study period. However, to our knowledge, our study is the first attempt to assess the performance of empirical LFMC models derived from Sentinel-2 data in historical fire events occurring in previous years, i.e., validating temporal transferability outside data from years used in model calibration, which is crucial for operational wildfire risk assessment [
11].
Our results regarding LFMC estimation with the best model, applied to historical wildfires occurring from 2018 to 2022 in the study area, indicated that the empirical model proposed based on spectral information from Sentinel-2 efficiently detected a decreasing trend in moisture content of
Cistus ladanifer before wildfire occurrence. In all cases, LFMC values in this shrub species were below the threshold of 80% generally considered by the regional fire management service as indicating a high fire risk level in the early warning system, i.e., as a value associated with a significant increase in vegetation flammability [
21]. Furthermore, larger wildfire events occurred when LFMC ranged from 60% to 70%. This is consistent with the threshold ascribed to other fire-prone Mediterranean-type shrubland (e.g., California chaparral), with critical LFMC values ranging from 60% to 80% [
8,
11,
15,
35,
39]. Our study validates the performance of the empirical models for monitoring
Cistus ladanifer as a fire-prone indicator species, suggesting a good potential for temporal transferability in predicting future wildfire events without further calibration. The proposed models will be further validated with new LFMC field data during the next wildfire seasons. Although more research is needed, the methods proposed in this study may also be useful in other shrubland with similar trends in LFMC dynamics and wildfire occurrence. Compared to similar studies assessing the performance of empirical models applied in different years, Arcos et al. [
23] included not only SI derived from Sentinel-2 but also ancillary data, e.g., day of year (DOY) to account for seasonal variability of LFMC during the study period. However, this type of input variable may limit the effectiveness of temporal transferability of LFMC empirical models for operational fire risk assessment if they are not recalibrated with new field data, because wildfire occurrence throughout the year is being modified by climate change, with the fire season length being extended in the Mediterranean basin [
5].
4.3. Future Research
This work demonstrates promising results for a very representative fire-prone species found in Mediterranean areas, providing empirical models that are useful for operational fire risk prediction based only on optical data from satellite remote sensing. However, future research should focus on LFMC modelling in other selected indicator species to cover a wider geographical range (e.g.,
Cistus ladanifer is less common in the eastern part of the region of Andalusia and other Mediterranean areas with calcareous soils). Field monitoring may be extended with additional plots to retrieve LFMC in other representative shrub species as reference values to explore the transferability of the proposed models and the identification of other potential indicator species for fire risk prediction in different ecosystem types [
15]. For example, previous studies indicate that
Salvia rosmarinus (i.e.,
Rosmarinus officinalis L.) and
Cistus albidus may be good alternative indicator species for fire risk assessment in other Mediterranean locations due to the higher variability in moisture and more appropriate seasonal patterns than other widely distributed shrub species with less variable LFMC dynamics, such as
Quercus coccifera [
16,
23,
25,
40].
Investing resources in field monitoring is also required to obtain sufficiently long time series for consistent empirical modelling [
11,
33], as well as to explore the proposed methods in tree species, grassland or mixed vegetation [
12]. Other researchers have suggested use of the weighted average of LFMC accounting for each species’ fractional cover in different types of vegetation [
22,
23,
25,
27], which may be useful when monospecific stands are not available regarding fire-prone indicator species. Predicting LFMC at a pixel level in wall-to-wall maps (i.e., independent of the species distribution) would be useful not only for fire risk monitoring but also for assessment of potential wildfire behaviour with simulation tools [
41].
In addition to the vegetation type, LFMC variability is highly dependent on species phenology and weather conditions. Hence, empirical models based on spectral indices could be improved with the inclusion of ancillary variables, integrating spatial and temporal variability associated with climatic or geophysical characteristics [
22,
23,
35], which may be useful for adding information indirectly related to vegetation status and development (e.g., physiology, weather, etc.). Historical seasonal trends are being modified and increasingly unpredictable due to climate change [
5]. Caution should also be applied when considering the potential extrapolation of empirical models that rely on short time series of field data as reference values for calibration and validation, as LFMC estimation errors may be greatly increased when applied in new weather scenarios, even for the same site [
11]. Therefore, topographic variables (i.e., slope, exposure, etc.) and weather inputs may be preferred to DOY-related variables to account for seasonal changes and interannual variability in improved empirical models combining spectral information from remote sensing and ancillary data.