1. Introduction
The spatial data and analysis requirements for modern environmental monitoring and management are increasingly high [
1,
2]. Forest locations, in particular, are subject to a number of management pressures and external threats [
3,
4,
5], being exploited as a resource for a range of cultural and economic activities, as well as being important habitats for a variety of organisms and storing substantial amounts of above-ground carbon [
6]. All of these aspects can influence forest condition, the evaluation of which requires a diverse array of indicator metrics to be measured and interpreted.
The assessment of a given forest site cannot include measurements of every facet of the environment due to temporal and logistical constraints, thus the choice of what to measure is critical. These measurements will be indicative of some ecological function or resource [
7,
8]. At the scale of a stand, indicators are usually placed into one of two categories: those based on the identification of key species or of key structures [
9]. The rationale for key structures is that ecosystems containing a variety of structural components are considered likely to have a variety of resources and species that utilise these resources [
9].
Numerous approaches have been suggested for the assessment of the conservation conditional status of forested areas at national and international levels (e.g., [
10]). Forest condition could be defined in terms of biodiversity, species richness, structural complexity, plant health, ecosystem function and/or productivity, dependent upon the management objective. Furthermore, little consensus exists in the definition of what conditions constitute as ‘favourable’ within each habitat, and definitions can be regionally specific [
11]. Regardless of which, a large number of indicator metrics of favourable conservation status (FCS) have been proposed for this task, for example, the approach defined in [
11] required 17 metrics, including tree size distributions, tree species diversity and understorey composition, amongst others.
The amount of data collected and the area sampled on the ground through fieldwork often tend to be quite limited. Indeed, the need to restrict the cost of fieldwork and gather data over large areas within a short time span often results in relatively small areas being sampled using field plots of typically 1 ha or less [
12]. This emphasises that field data are a subsampling of the environment, with the potential to miss important changes even over short distances, whilst the feasibility of measuring everything is impractical. Large-scale monitoring requirements are really only feasible though the use of remote sensing technologies (e.g., [
13]). With a need to supplement conventional field-based assessment and for standardisation between approaches, an increasing uptake of remote sensing methods is foreseen [
14,
15,
16,
17,
18].
There have been several reviews which have evaluated the potential role of remote sensing (in particular, hyper-spectral, hyper-spatial and hyper-temporal optical data and active airborne laser scanning or radar data) for vegetation condition assessment (e.g., [
19,
20,
21,
22]). These reviews demonstrate a broad range of remote sensing approaches used to estimate various indicators and then infer vegetation or habitat condition; for example, measures of forest extent or canopy cover [
23,
24], composition or diversity [
25,
26], structure, physiology or function [
24,
27,
28,
29,
30], health, stress or disease [
31,
32,
33,
34,
35], seasonal dynamics [
36], disturbance, degradation, vulnerability or recovery [
37,
38,
39].
Recent studies have found that airborne laser scanning, hereafter referred to as ALS, can be a powerful predictor of different vegetation attributes, such as height, basal area, stem density and other vegetation structure parameters at the plot or larger scales [
40,
41,
42,
43,
44]. ALS uses laser pulses to directly measure ground and vegetation height, as well as the vertical distribution of intercepted surfaces, making it an ideal tool for mapping vegetation structure with no saturation at high biomass values [
45]. Thus, ALS measurements have been shown to produce more accurate estimates of vegetation structure parameters than other remotely sensed data, because ALS has the ability to penetrate forest canopies and to detect three-dimensional forest structures [
46,
47]. This is typically accomplished through deriving some statistical relationship between field training data and an ALS metric, although such correlations are often site or ALS acquisition specific (e.g., [
48,
49]). To date, only one study has used ALS to assess individual tree condition, using structural metrics related to crown density and tree height [
50].
Typically, the use of ALS to predict compositional metrics, such as species type, has proven difficult, as ALS metrics are associated with the structural arrangement of features within and below the canopy. Some studies have explored the potential of ALS to model the diversity of insects, spiders and birds [
51,
52,
53]. However, few studies have explicitly analysed the relationships between plant species diversity and ALS measurements [
25,
54,
55]. Radiometric information recorded by ALS, commonly referred to as intensity and recorded from an infrared laser, has proven sufficient to classify species in some cases [
56], however, differences in acquisition specifications and sensor design seriously impact performance. At the very least, classification generally between deciduous and coniferous vegetation is possible (e.g., [
56,
57]). The use of intensity has, however, remained contentious.
ALS return intensity (also referred to as amplitude) presents a number of challenges for its use in any analysis, given the nature of typical sensor design. According to [
42], it is not possible to compare any two discrete-return intensity values. The research presented in [
58] states that intensity data can vary in performance between sensors when used for species classification. In addition, sensors using different wavelengths can also preclude comparison [
59]. Recent work on normalising intensity has been proposed, and the results seem promising (e.g., [
56]). Other studies have combined ALS datasets with multi- or hyper-spectral data which have allowed the classification of overstorey species [
60,
61]. Alternatively, general species characteristics have been correlated with ALS derived metrics, such as mean vegetation return height and the diversity of forest species or land cover (e.g., [
62,
63,
64]).
In general, the area-based statistical models developed for a given dataset are not transferable between acquisitions. For example, in [
65] metrics estimated with models calibrated using leaf-on field and ALS data produced erroneous results when applied to leaf-off ALS data, implying there is a difference in the 3D distribution of returns when considering multi-temporal analysis. The combination of datasets from different time-points (e.g., summer and winter) has the potential to yield additional information. Studies have used multiple acquisitions of airborne ALS for the same site at different times in order to exploit the seasonality of different forest species, to estimate understorey presence [
66] or to compare the accuracy of terrain mapping [
67]. For the latter study, a greater proportion of the returns were from lower portions of the forested area when assessed with leaf-off data than with leaf-on data, due to the absence or presence of foliage. The research outlined in [
68] combined leaf-on and leaf-off datasets and was able to exploit the difference in vegetation structure and return intensity between the acquisitions to classify tree species. The authors of [
69] stated that above-ground biomass estimates were similar between leaf-on and leaf-off ALS data, but that stratification by species type improved estimates. In [
70], a combination of leaf-on and leaf-off data was shown to model diameter at breast height (DBH) and diversity in crown dimensions more accurately, whilst models derived from leaf-on only data performed the poorest in terms of accuracy. Conversely, [
71] stated that differences in canopy conditions manifested in leaf-on or leaf-off datasets have an insignificant impact on estimates of above-ground biomass. The model strength was instead dependent on environmental conditions and the modelling method implemented. The authors of [
55] estimated 23 forest structural or compositional forest metrics using a combination of leaf-on and/or leaf-off metrics, where only ten of these were best estimated using a combination of both datasets. This implies that there are potentially significant changes in vegetation structure captured by acquisitions from different dates, however, some metrics can be estimated from either.
A more recent development of ALS is full-waveform (FW). This sensor type, as opposed to more conventional discrete-return ALS, provides connected profiles of the three-dimensional scene per pulse, which potentially contain more detailed information about the structure of the illuminated surfaces [
72]. Additional processing is required to provide a conventional 3D point cloud. Such a sensor design can potentially return more information from below the canopy [
73] and higher return densities than conventional discrete-return ALS, as noted in [
74], which is of interest in forest studies [
75,
76,
77]. From the research presented in [
55], the analysis of full-waveform (FW) airborne ALS data can yield estimates of both compositional and structural metrics of substantial potential for assessing forests for a variety of possible applications.
Thus, utilising FW ALS data and a combination of leaf-on and leaf-off acquisitions could conceivably yield the highest potential for estimating forest metrics for forest condition assessment. The goal of the current research is to evaluate the capabilities of FW ALS to provide a range of forest structural and compositional metrics across the full vertical profile and to assess the accuracy of these estimates for forest condition monitoring. This research will assess the potential utility that ALS acquisition can provide in future forest monitoring applications, specifically for forest contexts in southern Britain. The field site comprises both semi-natural and managed plantation forests in close proximity. Twenty-seven field metrics for the semi-natural forest have been defined for this location from previous research, which is outlined in [
11,
55]. The objectives of the current research were therefore to: (i) estimate the accuracy of these structural and compositional metrics across a range of forest types; (ii) evaluate if there are any benefits to the use of leaf-on, leaf-off or combined ALS acquisitions in the estimation of forest metrics; (iii) compare the relative merit of height, amplitude and width ALS metrics in derived models; and (iv) assess the use of a subset of these metrics for forest favourable conservation status (FCS) monitoring.
3. Results
3.1. Summary of Models Using Metrics from the Leaf-On Acquisition Only
Model tuning for all of the final RF models produced from metrics generated from the leaf-on ALS acquisition is summarised for reference in
Table S3. The predictor variables used in each of the 27 RF models are summarised in
Table S4.
The final RF models each used between one and seven input ALS variables. In total across the 27 models, height variables were included 55 times, width variables 43 times and amplitude variables 26 times. The most commonly occurring ALS variables were AbsDev of return height (six models), Variance of return height (seven models) and StDev of return height (eight models). Across the 27 RF models, height variables were used in 22 models, width variables in 16 models and amplitude variables in 13 models. Only four models exclusively used height predictors (Shannon–Weiner index for native trees, Downed deadwood decay class, Number of seedlings and Number of ground vegetation species), three models used only amplitude predictors (Simpson index of diversity, Number of saplings and Standing deadwood decay class) and one used width only (Number of sapling species). There were nine models that combined height and width predictors, four that combined height and amplitude predictors and only one model that combined amplitude with width predictors. There were five models which combined height, width and amplitude predictor variables.
The RF models created for each of the 27 forest metrics were assessed against field data, summarised in
Table 2. Across all models, the adj. R
2 ranged from 0.59 to 0.82. In terms of NRMSE, eight models were below 15%, 18 were between 15 and 30% and one was above 30% (Standing deadwood decay class), here implying an unsuitable model. The majority of models had an NBias of below 15%, while three did not (Standing deadwood decay class, Simpson index of diversity and Number of sapling species). Thus, in total three models of the 27 were considered of unacceptable accuracy.
3.2. Summary of Models Using Metrics from the Leaf-Off Acquisition Only
Final RF model tuning parameters for the leaf-off ALS acquisition are provided in
Table S5. Predictor variables used in each model are summarised in
Table S6.
The final RF models created using leaf-off ALS data also had between one and seven input variables. In contrast with the leaf-on models, there was a greater prevalence of width variables. Thus, across the 27 models, width variables were included 54 times, and both height and amplitude variables 36 times. The most commonly occurring ALS variables were Kurtosis of return width (six models), Variance of return width and StDev of return width (seven models) and AbsDev of return width (eight models). Height, width and amplitude predictors were included in 14, 17 and 14 models, respectively, and used exclusively as input for one (Simpson index of diversity), six (Standard deviation of tree diameter, Number of native saplings, Standing deadwood decay class, Number of native seedlings, Mean height to the living crown and Mean crown horizontal area) and three (Total crown horizontal area, Number of sapling species, Percentage bare ground cover) of the RF models, respectively. A combination of height and width variables formed the inputs to three models, height and amplitude for six models and amplitude and width for four models. All three predictor variable types were included in four models.
Comparisons of RF model predictions against field data (
Table 3) produced models with adj. R
2 which ranged from 0.59 to 0.89. In terms of NRMSE values, eight models were below 15%, 17 were between 15 and 30% and two were above 30% (Standing deadwood decay class and Percentage bare ground cover). The majority of models (25 of 27) had an NBias of less than 15%, the exceptions being: Standing deadwood decay class and Number of sapling species. Thus, as with the leaf-on only dataset, three of the 27 RF models were of unacceptable accuracy, of which two were for the same forest metric.
3.3. Summary of Models Using Metrics from Both the Leaf-On and Leaf-Off Acquisition
Final RF model tuning parameters for the combination of leaf-on and leaf-off ALS acquisition are provided for reference in
Table S7. Predictor variables used in each model are summarised in
Table S8.
As with both the separate leaf-on and leaf-off models, the combined leaf-on/leaf-off RF models each had between one and seven input ALS variables. In total across the 27 models, width variables were included 55 times, height variables 45 times and amplitude variables 32 times, of which 50 were from leaf-on data and 82 from leaf-off data. The most common input variables (regardless of leaf-on or leaf-off) were: Maximum of return height, Variance of return height, AbsDev of return height and StDev of width (6 models), StDev of return height and Kurtosis of return width (7 models), Variance of return width (8 models), and AbsDev of return width (10 models). Height, width or amplitude ALS variables were present in 16, 18 and 14 models, respectively. Those models using exclusively height, width or amplitude variables constituted four (Mean tree height, Number of tree species, Shannon–Weiner index for native trees, Number of ground vegetation species), seven (Standard deviation of tree diameters, Mean height to living crown, Mean crown horizontal area, Number of seedlings, Number of native seedlings, Number of sapling species, Downed deadwood decay class) and two (Total crown horizontal area and Number of saplings) RF models, respectively. Models combining height and width, height and amplitude or amplitude and width variables accounted for two, three and two models, respectively. Models produced using all three predictor variable types accounted for seven models.
Predictor variables derived from leaf-on acquisition were present in 20 models, whilst leaf-off variables were present in 23 models. Both leaf-on and leaf-off variables were present together in 16 RF models, whilst four models were derived using leaf-on only variables and seven using leaf-off only variables. For the four models containing only leaf-on ALS data, these were either exactly the same (Mean tree spacing, Number of saplings), a subset (Percentage bare ground) or a highly correlated alternative (Number of sapling species) to the input variables used in the leaf-on only models. By contrast, for the seven models containing only leaf-off ALS variables, four of the models had the same or a subset of the input variables used in the leaf-off only models (Standard deviation of tree diameter, Mean height to the living crown, Number of native seedlings and Number of seedling species), whilst the models for Number of tree species, Downed deadwood decay class and Shannon–Wiener index for native trees had very different input variables to their leaf-off only counterparts. As many of the input variables are highly correlated, in addition to the large number of predictor variables available, and the random subsetting of predictors in the Boruta package over a finite number of iterations (500), this could explain the different predictors being selected. It is conceivable that the predictors indicated in
Table S6 would be repeated if there were enough Boruta iterations. Each of the predictions from these models is illustrated in
Figure S1.
Comparisons of RF model predictions against field data (
Table 4) produced models with adj. R
2 which ranged from 0.56 to 0.89. In terms of NRMSE values, nine models were below 15%, 17 were between 15 and 30% and only Standing deadwood decay class was above 30%, here implying an unsuitable model. The majority of models (25 of 27) had an NBias of less than 15%, with the exceptions being Standing deadwood decay class and Number of sapling species.
3.4. Best Overall Models
Comparing the leaf-on only and leaf-off only RF models, based on NRMSE and NBias, seven models were better using leaf-on data, 11 were better using leaf-off data and nine models were inconsistent between the two ALS datasets. However, in only two models (Number of sapling species and Volume of standing deadwood) were both the NRMSE and NBias different by >1% point between the two datasets. Combining leaf-on and leaf-off data improved ten leaf-on only models and nine leaf-off only models, but only in three cases were both NRMSE and NBias increased by >1% point (Volume of standing deadwood for the leaf-on model and the Simpson index of diversity for both leaf-on and leaf-off models). Therefore, overall, with the exception of Simpson index of diversity which was modelled with a very high NBias using leaf-on only data (22.38%), there was very little or consistent improvement between the leaf-on only, leaf-off only and combined datasets in RF model output quality assessed by NRMSE and NBias.
The most accurate models overall for each of the 27 forest metrics, in terms of minimising NRMSE and NBias, and maximising adj. R
2 (summarised in
Table 5 for the 27 field metrics), were derived from across all ALS datasets. Models derived from leaf-on only ALS data (
Section 3.1) constitute 8 of the best models, those derived from the leaf-off data (
Section 3.2) constitute 12 of the best models and those derived from a combination of leaf-on and leaf-off ALS data (
Section 3.3) account for 7 models.
Overall, slightly lower NRMSE and NBias values were achieved through this process of best model selection. For example, the difference in NRMSE between the leaf-on, leaf-off and combined leaf-on/leaf-off models per forest metric varies between only 0.02 and 4.02 percentage points. The average NRMSE across the 27 models was as follows: 19.57% for leaf-on, 19.28% for leaf-off, 19.06% for combined leaf-on/leaf-off and 18.36% for the best set of models from across all datasets. In terms of NRMSE for the best models, 26 of the models had a value below 30% (nine < 15%), with the exception being Standing deadwood decay class. For NBias, 25 of 27 models were <15%, with the exceptions being Standing deadwood decay class and Number of sapling species. These were therefore the only two RF models considered to be of unacceptable quality.
3.5. Assessment of Accuracy with Regard to Indicators of ‘Favourable Conservation Status’
Favourable status values for the 41 field plots ranged from 3 to 11, where lower values on average were observed for plots located in coniferous stands (mean 6.8; standard error 0.4; n = 20), and higher values for sites in deciduous dominated stands (mean 8.6; standard error 0.5; n = 16) or mixed stands (mean 8.2; standard error 0.7; n = 5). The results of a t-test indicated that the FCS index means from coniferous and deciduous plots were significantly different (p < 0.05).
The most accurate RF models (as in
Table 5) relevant to the 17 FCS indicator metrics were used to predict each field metric, which were then assessed relative to the threshold values. Binomial logistic regression was implemented, and the results are summarised in
Table 6. Overall accuracy of the logistic regression per indicator metric varied between 51 to 100 percent correct (where 11 of 17 were >80% correct). The lowest accuracy (<65%) was observed for the Shannon–Wiener index for native trees and Volume of downed deadwood. All AUC values were above 0.7, implying acceptable discrimination for predicted values. AUC values ranged from 0.71 to 0.98. In addition, the AUC calculated for the Shannon–Wiener index for native seedlings was determined to be rank-deficient and potentially misleading, implying that one or more of the predictor variables were not linearly independent. All of the field measurements for Number of seedlings and Number of native seedlings did not exceed the threshold value and thus were always 0, preventing AUC calculation.
The map presented in
Figure 2 provides the index of favourable conservation status derived from ALS models for the New Forest study site. Generally, high index values are observed for areas containing native beech–oak deciduous forest, whereas low values are consistent with areas of coniferous plantation forest.
5. Conclusions
In this study undertaken in coniferous and deciduous forest in the New Forest, UK, we found that the differences between RF models derived from leaf-on, leaf-off and combined leaf-on/leaf-off datasets were slight, with the exception of estimating the Simpson index of diversity. Thus, whilst there were some detectable trends (e.g., higher accuracies were observed for forest overstorey structure when leaf-on ALS predictor variables were used, whilst forest canopy composition or understorey characteristics achieved a higher accuracy when either leaf-off or combined leaf-on and leaf-off ALS data were used), these were not significant enough to imply that consideration of ALS acquisition time will be required for optimal prediction of forest characteristics if using RF modelling. However, whilst model performance was similar between leaf-on, leaf-off and combined datasets, model composition was often very dissimilar, reiterating that these datasets capture different aspects of the forest and that structure and composition across the full vertical profile are highly inter-connected.
Estimated accuracy overall exceeded estimates produced using linear models in previous research. Whilst there is room for improvement, given the uncertainties associated with estimating a range of metrics and the small training sample size used, the ALS-based method was in good agreement with field-based assessments of FCS. In addition, continuously mapped estimates of FCS were created across the study site and corresponded closely to forest stands and compartments present, demonstrating the utility of such an approach for forest condition mapping and monitoring.
The value of ALS is its ability to estimate a variety of habitat variables related to forest three-dimensional structure. The current research demonstrates the feasibility of predicating spatially explicit forest metrics, derived from ALS covariates, for multiple forest stands. This approach potentially demonstrates a method to rapidly assess forest condition and FCS over large areas, reducing (but not eliminating) the need for costly field surveys. The main advantage of ALS acquisitions of this type is that they allow the sampling of forest structural characteristics at a high spatial resolution over large spatial extents. The ability to characterise these structures through ALS provides a proxy for biophysical processes in forests [
126]. These findings are important for advancing the management of forest resources. Further investigation into such approaches could therefore yield additional data on many potential biological processes and distributions within a landscape. A similar type of approach to that documented in the current study would allow managers to investigate forest changes post-disturbance or treatment. Equipped with this knowledge, managers would have better information and have tools to address the impact of forest changes in the face of a potentially changing climate.
The results of this study show that full-waveform multi-temporal ALS holds a great deal of potential information which is useful for estimating forest characteristics, both within and below the main forest canopy, and for mapping continuously across large extents. The spatial resolution of the data allows for within stand assessment, whereas previously, areas defined as stands, regardless of size, were considered the smallest management unit. We expect that the use of remote sensing technologies and methods will be tested in other sites in the future in order to develop better estimates. The availability of high-resolution forest vegetation maps with acceptable agreement to field validation will significantly advance forest ecological understanding and improve conservation efforts.