1. Introduction
Forest ecosystems play a crucial role in terrestrial ecosystems, acting as massive global carbon reservoirs that contain 80% of the terrestrial carbon stored in aboveground biomass [
1]. The aboveground carbon stock (AGC) of forests is a key parameter for assessing the carbon sequestration capacity and carbon balance above the forest soil layer, making it vital to understand the role of forests in carbon cycling and climate change [
2]. As the issue of climate change becomes increasingly severe, accurately estimating and monitoring changes in forest AGCs are essential for developing sustainable ecosystem management and policies to mitigate climate change [
3].
However, accurately estimating forest AGCs is a challenging task. Although traditional field survey methods are precise and reliable, they are time-consuming and labor-intensive, making it difficult to apply them over large areas or to fully cover diverse ecosystem types. As a result, remote sensing-assisted datasets have gradually become an important supplementary approach, providing spatially and temporally continuous information over extensive regions [
4]. Remote sensing data sources such as LiDAR and UAV multispectral imagery have high-resolution and wide-area surface information, offering the possibility of estimating carbon stocks over large areas [
5,
6,
7,
8]. However, despite numerous studies showing that LiDAR or optical data alone can predict forest AGCs, the accuracy obtained from these methods without support from field measurements remains a challenge. Therefore, calibration and validation of remote sensing-assisted prediction techniques still require field measurement data.
Light detection and ranging (LiDAR) data, as a remote sensing data source, can be used to rapidly and accurately obtain elevation information and vertical structure data for vegetation cover [
9] and are among the latest remote sensing technologies for forest carbon accounting [
10]. Backpack LiDAR, such as LiBackpack, a new type of portable LiDAR, offers high capacity, accessibility, and flexibility in route selection and can obtain high-quality 3D dense point clouds in forests with different vegetation structures [
11]. However, despite the accuracy of backpack LiDAR for measuring tree diameter [
12,
13], identifying the best variables suitable for different tree species remains challenging, especially in the absence of canopy spectral information, where LiDAR data classification accuracy for tree species under complex vegetation conditions is limited [
14]. On the other hand, UAV multispectral imagery includes multispectral information, aiding in studying the spectral characteristics of different tree species. Early research revealed that the high-resolution image texture features of optical remote sensing data are strongly correlated with forest biomass [
15,
16], and the vegetation indices obtained from optical data typically reach saturation at relatively low biomass values [
17,
18]. UAV multispectral imagery has better spatial, spectral, and temporal resolution compared to other optical datasets for similar data volumes and data collection costs [
19], and their time-series data provide high-quality information on seasonal changes in forests. Additionally, UAV multispectral data are effectively used for forest resource monitoring and dynamic management [
20]. Recent studies suggest that combining LiDAR and optical sensors is a feasible approach for estimating biomass and carbon storage in both plantations and natural forests [
21]. Brown et al. [
22] showed that modeling with a large amount of field measurement data added to data fusion can improve the estimation of forest AGBs and AGCs. Kim et al. [
23] stated that combining spectral information with attributes derived from LiDAR data is more suitable for assessing the AGB and AGC than using optical images or LiDAR data alone. However, finding a method for accurately estimating forest AGCs with fewer field measurements to establish regression models is currently challenging.
Extracting vegetation information from remote sensing imagery and integrating it with ground-measured data for modeling has become an effective and popular method for obtaining regional forest AGCs. The study mentioned the use of parametric and nonparametric models. Multiple stepwise linear regression (MSLR) represents the traditional parametric model, assuming a linear relationship between predictive variables and the variable being predicted. However, this assumption limits the inherent nonlinearity of the relationship between them and requires a large sample size [
24]. On the other hand, random forest regression (RF) is a nonparametric model that does not assume a specific distribution for the samples. It can handle complex nonlinear relationships and high-dimensional problems and has been proven effective for estimating forest AGCs [
25]. Additionally, machine learning techniques aid in combining data from different sources to improve outcomes [
26].
In this study,
L. gmelinii and
B. platyphylla, two typical tree species, were chosen as the study species due to their ecological significance and notable differences in carbon storage [
27]. Liu et al. explored the carbon storage capacity of Xing’an larch and birch forests, suggesting that they have an important role in boreal forests [
28].
Larix gmelinii, an important economic tree species, grows in cold and dry conditions and has a high carbon stock. In contrast, compared with
L. gmelinii,
B. platyphylla plays a different role in ecosystems and has distinct growth environments and characteristics, leading to potentially significant differences in carbon storage. By studying these two typical species, we aim to deepen the understanding of the variability in forest carbon storage among different tree species, thereby providing more accurate estimation models to support forest resource management and conservation.
Despite these developments, comprehensive validation of the accuracy of forest AGC estimation is lacking. This study is dedicated to addressing this challenge. We hypothesize that the combination of forest vertical variables with horizontal variables from optical images through allometric relationships can be used to accurately estimate forest AGCs at the plot scale with minimal field measurement data. Therefore, the objectives of this study are: (1) to assess the effectiveness of using backpack LiDAR and UAV multispectral imagery in estimating aboveground carbon stocks (AGC) for L. gmelinii and B. platyphylla; (2) to compare the predictive accuracy of different models; and (3) to validate whether integrating multi-source data can enhance the accuracy of AGC estimation.
4. Discussion
4.1. Potential of LiDAR Combined with Multispectral Imagery for Estimating AGC in Forests
Compared to manual measurements, backpack LiDAR offers precise scanning and real-time data integration while in motion, providing a more flexible and efficient method for forest inventory collection [
60]. In the data collection process, backpack LiDAR requires only one surveyor to carry the equipment across the measurement site, significantly reducing time and costs and improving efficiency [
61]. As shown in
Table 9, when collecting point cloud data for a 10 m × 40 m sample, traditional measurement methods require 3–4 people to complete the data collection, whereas backpack LiDAR needs only one person. While traditional manual measurements take approximately 36 min to measure a plot, backpack LiDAR takes only approximately 5 min. Preprocessing the collected data via traditional methods took approximately 14 min, while preprocessing the backpack LiDAR point cloud data took approximately 10 min. The internal data processing time depends on the size of the dataset and the computer configuration. Overall, compared with traditional methods, backpack LiDAR saves approximately 30 min per plot, illustrating its time efficiency. In terms of optical data, acquiring airborne multispectral images under favorable weather conditions enhances efficiency and reduces costs to a certain extent. Therefore, the combined use of optical imagery and LiDAR further reduces the cost of assessing forest emission reductions. This combination enables the mapping of large areas near real-time carbon stocks [
62]. The findings of this study underscore the high precision and potential of LiDAR technology for estimating AGC, offering significant value for enhancing forest management practices and informing sustainable ecosystem management strategies [
63]. However, scaling up this approach to a broader level may present significant challenges, particularly in low-income countries where limited financial and technical resources could hinder its widespread implementation and reduce its overall effectiveness [
64].
Optical images have been applied in earlier studies to estimate forest AGB and AGCs, but the results showed that optical signals are weakly penetrating. Spectral images mainly record the horizontal structure of the forest and cannot record the vertical structure information of the forest. However, LiDAR can penetrate the forest canopy and record vertical structure information. This approach compensates for the shortcomings of optical images. In this study, there are two main reasons for the small improvement after adding multispectral information. The first reason may be that when the visible light of multispectral data is saturated in dense forest areas [
65], the accuracy is lower in complex forest structures, resulting in the deviation of the AGC estimated by the NDVI from the measured AGC. Another reason for this difference may be that the LiDAR forest structure attributes themselves have a strong correlation with AGC, and the addition of multispectral information did not result in much improvement. Overall, although these improvements are not significant, novel multisensor earth observation methods that involve the combination of satellite-borne LiDAR data with optical data using machine learning techniques enable accurate measurements of carbon stocks and provide effective data support for forest emission reduction. For example, Jiao et al. [
66] proposed a practical framework for assessing forest emission reductions via the fusion of optical satellite imagery and spaceborne LiDAR data. Shen et al. mapped subtropical forest AGB data by combining Landsat TM/ETM+ and ALOS l-band SAR imagery from Guangdong Province, and the results demonstrated that multisensor imagery-based AGBs had a good correlation [
67,
68]. Our results further suggest that combining LiDAR and multispectral data is essential for improving the accuracy of AGB and AGC estimation.
4.2. Main Challenges and Uncertainty Analyses for Estimating Forest AGCs
In response to the challenges in estimating vegetation biomass and carbon storage (specifically, whether obtaining large-scale forest structure and spectral information improves biomass and carbon stock estimations [
69]), this study integrates forest structural attributes and spectral data to estimate forest AGCs at the plot level. Despite the difficulty in accurately capturing AGC changes in forests with complex structures using only structural and spectral information, the heterogeneity of canopy spectral information provided by multispectral images has enhanced the accuracy of our multisource data integration modeling approach, increasing the AGC estimation accuracy from 90.29% to 90.6%. Additionally, we utilized various multiple linear regression and power regression models to select the best-fitting models for AGC estimation. Compared to multiple linear regression models, power regression models exhibited greater accuracy in AGC estimation. This indicates that the dominant tree species in our study area,
L. gmelinii, conforms to a power allometric relationship and that using this relationship can improve the accuracy of forest AGC estimates. Therefore, the power allometric relationship based on forest structural attributes and spectral information represents a new method for enhancing AGC estimation. This method can be used to explore the relationship between tree metabolism and biomass [
70], and such relationships may be more stable in similar landscapes [
71].
However, there are still uncertainties in this study. First, the laser beams of backpack LiDAR cannot penetrate the lower canopy layers in dense forest structures; second, due to obstruction from the understory, backpack LiDAR faces challenges in capturing the treetops of the upper canopy, resulting in notable differences between the LiDAR-estimated and actual measured heights. The results and conclusions of this study are currently valid only for coniferous forests with relatively simple stand structures, and further validation is needed for broadleaf forests, mixed forests, or other forest types with more complex structures based on additional forest plots and remote sensing data. In addition, this study utilized ultra-high-resolution UAV imagery with a spatial resolution of 0.02 m. While such high spatial detail enables capturing fine-scale variations, it may also introduce significant spatial variability, particularly in areas with heterogeneous vegetation distribution. This level of granularity can result in weak correlations between vegetation indices (VI) and AGC, ultimately impacting the model’s predictive accuracy. Despite the observed low correlation in our findings, VI still holds considerable promise for capturing ecosystem dynamics and monitoring environmental changes [
72].
4.3. Research and Perspectives on Estimating Late-Season Forest AGCs
This study revolves around the theme of estimating regional-scale forest AGCs by integrating multispectral imagery and LiDAR data; encompassing a comprehensive and systematic exploration from field data collection to preprocessing steps such as atmospheric, radiometric, and geometric correction of multispectral imagery and cropping; resampling, denoising, filtering, ground classification, and normalization of LiDAR data; constructing forest AGC estimation models suitable for complex terrain conditions; and then spatially extrapolating regional-scale forest AGCs. However, due to the scarcity of field measurement data and the complexity of mountainous terrain, the accuracy of regional forest AGC estimation combined with multisource remote sensing data is still not precise enough, warranting further research.
At the current stage, calibration and validation still require high-quality field measurement data. Due to the complex terrain of mountainous areas, more accessible sites were chosen for field inventory collection, resulting in spatial discontinuity and discreteness in the regional forest AGC density spatial distribution map. Future research should aim to select spatially continuous plots for data collection. Limited by time, the collected samples were insufficient, suitable only for single-tree or regional forest AGC estimation, and not representative of the entire forest AGC storage in the Dulaer forest.
The backpack LiDAR data collection is affected by poor GPS signals, directly impacting the quality of trajectory files and leading to failures in point cloud resolution or significant errors in absolute coordinates. Moreover, obtaining high-precision absolute coordinate point cloud data is crucial for determining individual tree locations within sample areas. Therefore, efficiently and accurately collecting absolute geographical reference point cloud data in dense forests without GPS signals will be a focus of future research. Additionally, in
L. gmelinii plots with dense branches, it was necessary to cut branches in advance along the designed route to ensure the safe operation of the backpack LiDAR, which affects the data collection time and quality. Thus, the accuracy of image data collection via backpack LiDAR needs further verification in more operational environments. The multispectral data used in this study had limited spectral bands, resulting in less correlation between the calculated vegetation spectral indices and forest AGC. Future research should explore the capability of regional forest spectral inversion via hyperspectral imaging via UAVs at different flight altitudes in conjunction with LiDAR data. To further mitigate the impact of spatial variability, future research could explore the use of Object-Based Image Analysis (OBIA) and texture features. These advanced methodologies offer more stable and structured variables by grouping adjacent pixels into cohesive objects based on their spectral and morphological similarities, thereby minimizing the variability inherent in high-resolution data. Additionally, texture features can capture the intricate spatial patterns and distribution characteristics of vegetation, providing a richer representation of the landscape and ultimately enhancing the precision of AGC estimation [
71,
72].
In summary, combining LiDAR data with traditional remote sensing data can complement data sources better, facilitating the acquisition and classification of ground information and improving the accuracy of ecological parameter estimation, ecological monitoring, and simulation. Effectively integrating multisource remote sensing data for ecological research is currently a trending topic.