Next Article in Journal
Growth Response of Pinus tabuliformis and Abies fargesii to Climate Factors in Southern Slope of Central Qinling Mountains of China
Previous Article in Journal
Multi-Level Temporal Variation of Sap Flux Densities in Oil Palm
Previous Article in Special Issue
Bark Biometry Along the Stem for Three Commercial Tree Species in Romania
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Uncertainty Analysis of Remote Sensing Estimation of Chinese Fir (Cunninghamia lanceolata) Aboveground Biomass in Southern China

1
Research Institute of Forest Resource Information Techniques, Chinese Academy of Forestry, Beijing 100091, China
2
College of Forestry, Southwest Forestry University, Kunming 650233, China
3
Guangdong Forestry Survey and Planning Institute, Guangzhou 510520, China
*
Author to whom correspondence should be addressed.
Forests 2025, 16(2), 230; https://doi.org/10.3390/f16020230
Submission received: 24 December 2024 / Revised: 12 January 2025 / Accepted: 22 January 2025 / Published: 25 January 2025
(This article belongs to the Special Issue Forest Biometrics, Inventory, and Modelling of Growth and Yield)

Abstract

:
Forest aboveground biomass (AGB) is not only the basis for forest carbon stock research, but also an important parameter for assessing the forest carbon cycle and ecological functions of forests. However, there are various uncertainties in the estimation process, limiting the accuracy of AGB estimation. Therefore, we extracted the spectral features, vegetation indices and texture factors from remote sensing images based on the field data and Landsat 8 OLI remote sensing images in Southern China to quantify the uncertainties. Then, we established three AGB estimation models, including K Nearest Neighbor Regression (KNN), Gradient Boosted Regression Tree (GBRT) and Random Forest (RF). Uncertainties at the plot scale and models were measured by using error equations to analyze the influences of uncertainties at different scales on AGB estimation. Results were as follows: (1) The R2 of the per-tree biomass model for Cunninghamia lanceolata was 0.970, while the uncertainty of the residual and parameters for per-tree biomass model was 4.62% and 4.81%, respectively; and the uncertainty transferred to the plot scale was 3.23%. (2) The estimation methods had the most significant effects on the remote sensing models. RF was more accurate than other two methods, and had the highest accuracy (R2 = 0.867, RMSE = 19.325 t/ha) and lowest uncertainty (5.93%), which outperformed both the KNN and GBRT models (KNN: R2 = 0.368, RMSE = 42.314 t/ha, uncertainty = 14.88%; GBRT: R2 = 0.636, RMSE = 32.056 t/ha, uncertainty = 6.3%). Compared to KNN and GBRT, the R2 of RF was enhanced by 0.499 and 0.231, while the uncertainty was decreased by 8.95% and 0.37%, respectively. The uncertainty associated with the scale of remote sensing models remains the primary source of uncertainty when compared to the plot scale. On the remote sensing scale, RF is the model with the best estimation effect. This study examines the impact of both plot-scale and remote sensing model-scale methodologies on the estimation of AGB for Cunninghamia lanceolata. The findings aim to offer valuable insights and considerations for enhancing the accuracy of AGB estimations.

1. Introduction

Forests represent the most significant terrestrial ecosystems on the planet, accounting for 80% of terrestrial carbon reserves [1]. They play a crucial role in sustaining regional ecosystems and contributing to the global carbon equilibrium [2,3]. Forest biomass serves as a fundamental parameter within forest ecosystems and is a critical indicator for assessing both the structure and functionality of these ecosystems [4]. Total forest biomass is categorized into aboveground and belowground components, with aboveground biomass (AGB) constituting approximately 70% of the overall biomass [5], and has been acknowledged as a significant biodiversity metric that is effective for assessing ecosystem functionality [6].
Methods for estimating forest biomass mainly include traditional field survey methods and remote-sensing-based methods [7]; the traditional method is to obtain forest resource data through field measurements and data statistics, which has the disadvantages of being time-consuming and labor-intensive [8], and there are obvious limitations in the aspects of a large research scope and a long research cycle due to the limitations of time and cost. In contrast, remote-sensing-based methods are widely used in forest AGB estimation because of their rapid, low-cost, large-scale and low-destructive characteristics [9,10]. LiDAR, as an active remote sensing technique with strong penetration into vegetation, has been extensively utilized in the estimation of forest biomass [11,12]. However, due to its high data cost, complex data processing and susceptibility to the limitations of complex terrain, it is not widely used comparatively in regions with complex terrain [13,14]. In comparison, optical remote sensing images are extensively utilized for the estimation of AGB due to their benefits, which include accessibility, extensive spatial coverage and shorter revisit intervals [15,16]. Furthermore, optical remote sensing images exhibit varying spatial and temporal resolutions, as well as differing spectral bands [17], which can be fully utilized for texture features to reflect more accurate feature information [18]. The estimation of AGB at global, continental or national scales typically necessitates the use of medium-resolution data, exemplified by information acquired from the Moderate Resolution Imaging Spectroradiometer (MODIS), for AGB estimation [19]. In contrast, the estimation of AGB at localized scales typically relies on moderate-resolution data. Landsat 8 Operational Land Imager (OLI) data have been extensively utilized for AGB estimation at regional scales due to their extensive coverage, prolonged monitoring capabilities, moderate resolution, and time series analysis capability [20,21]. Previous studies have demonstrated a significant correlation between spectral bands, vegetation indices, transformed images (utilizing principal component analysis) and texture features with AGB. Consequently, these elements can be employed as effective indicators for estimating AGB [22,23]. A lot of scholars [24,25] have utilized Landsat 8 imagery for estimating forest AGB, and there is growing evidence that optical remote sensing techniques play an integral role in forest resource management [26].
Uncertainty encompasses a broad range of concepts, including inaccuracy, fuzziness and ambiguity, among others. In the context of forest ecosystem productivity, which serves as a critical indicator, three primary categories of uncertainty are pertinent to the calculations associated with forest biomass: model uncertainty, measurement uncertainty and sampling uncertainty [27,28]. For example, Fareed and Numata believed that in a dense forest canopy, when the true height of trees exceeds 30 m, there will be an underestimation phenomenon that has an impact on the accurate estimation of AGB [29]. Shettles et al. [30] posited that model uncertainty constitutes the primary source of uncertainty, representing approximately 70% of the overall uncertainty. This study focuses exclusively on examining the uncertainty associated with the model, while measurement and sampling uncertainty were not considered. The uncertainties associated with the model can be categorized into four primary dimensions: the inherent uncertainty of the input variables, the inaccuracies arising from the specification of the model structure, the residual variance error of the model and the errors related to the model parameters. The uncertainty associated with the input variables primarily pertains to the measurement errors of these variables, specifically diameter at breast height (DBH) and tree height (H) [29]. Such errors may be influenced by the measurement instruments employed, the techniques utilized for measurement and the methodologies applied by the individual conducting the measurements. The inaccuracies in establishing the model framework primarily stem from an insufficient application of advanced modeling techniques [31] or validation data [32], meaning that the model form sets incorrectly. Compared with the first two types of errors, the residual variation and parameter errors of models are comparatively less studied. Therefore, we mainly analyzed the residual error of the model and the model parameter errors. The residual error of a model is indicative of its fitting accuracy, which can be quantitatively assessed using the coefficient of determination (R2). The value of R2 approaching 1 signifies a superior fit of the model (overfitting cannot be excluded). Additionally, the uncertainty associated with the variation in residuals can be evaluated through the standard deviation of the residuals; however, the outcomes may exhibit variability contingent upon the specific study area and the data utilized. For example, Chen et al. [33] conducted an investigation into the uncertainty associated with spatial scales and emphasized the significance of accounting for the uncertainty in AGB estimates at the plot level when utilizing field plots for the calibration of remote-sensing-based biomass models. Silveira et al. [34] employed hierarchical and kriging interpolation techniques to mitigate uncertainty in the modeling process, utilizing both remote sensing data and forest inventory data. Zhang et al. [35] developed a random forest biomass estimation model utilizing remote sensing data in conjunction with forest inventory data, with the aim of examining the uncertainties present during the preprocessing phase of remote sensing images. Therefore, this paper builds on the existing research base and further quantifies two different types of uncertainty: the plot scale and the remote sensing model scale. However, regardless of the scale, there is always uncertainty in the process of estimating AGB, and ignoring it will produce more serious errors. Therefore, the question of how to reduce the uncertainty in the estimation process has become a hot topic in current research [36].
In this paper, we took Shaoguan, Heyuan and Huizhou in Guangdong Province as the study areas and Cunninghamia lanceolata as the study object. Utilizing Landsat 8 OLI remote sensing images in conjunction with field survey data, we developed estimation models employing K nearest neighbor regression (KNN), gradient-boosted regression tree (GBRT) and random forest (RF) methodologies. Additionally, error equations were applied to assess the uncertainty at both the plot scale and in remote sensing models. The aims of this study were as follows: (1) to examine the impact of plot scale and remote sensing models on the precision of AGB estimation; (2) to evaluate and compare the estimation accuracy and associated uncertainties of various remote sensing models; and (3) to identify the primary sources of uncertainty in the AGB estimation process, as well as to quantify the uncertainty arising from different scales.

2. Materials and Methods

2.1. Study Area

The study area is situated in the central and northern regions of Guangdong Province, China (20°13′–25°31′ N, 109°39′–117°19′ E; Figure 1), and includes Shaoguan, Huizhou and Heyuan, with an area of 48,700 square kilometers. Their climate type belongs to the East Asian monsoon zone, with an average precipitation of 1798.8 mm and an average annual temperature of 22.1 °C. These three regions exhibit a diverse topographical landscape characterized by a variety of landforms, including mountains, hills, plateaus, and plains. The region has a well-developed water system, including important rivers such as the Pearl River, the Xijiang River and the Beijiang River, as well as numerous tributaries, lakes and reservoirs, making it an important water resource base in Southern China. Vegetation types in this region mainly include tropical rainforests, evergreen broad-leaved forests and coniferous forests, which are distributed at different altitudes. According to the latest (9th) Forest Resources Inventory, Guangdong Province has a total forest area of 10,535,400 hectares, with a forest coverage rate of 58.59%, and Chinese Fir is the dominant tree species in this region.

2.2. Data Sources and Processing

2.2.1. Field Survey Data

Forest reference data play a vital role in the estimation of biomass. The sample plot data utilized in this research were derived from field surveys conducted in Guangdong Province, spanning from November 2023 to March 2024. Each plot measured 30 m by 30 m, resulting in a total of 87 plots surveyed. Among them, 40 sample plots are in Shaoguan, 25 sample plots are in Heyuan and 22 sample plots are in Huizhou. The field survey was conducted utilizing portable global positioning system (GPS) technology as well as static differential GPS methodologies. Since the DBH, H and biomass measured by the sample plot data are related to the spectral, radiometric and other characteristics of the forest vegetation information from remote sensing data [37], we needed to investigate Cunninghamia lanceolata species with DBH > 5 cm; attributes such as DBH, H, coordinates of the sample plot center, elevation, slope and slope position of every plot were recorded [38], and the biomass of each tree within the plot was determined using the anisotropic growth equation specific to the Cunninghamia lanceolata species [39]. The biomass of each sample plot can be obtained by summing the biomass of each species.
The Cunninghamia lanceolata per-tree biomass model is as follows:
W = 0.06539 × D 2.01735 × H 0.49425
where W is the dry biomass of the individual tree, D is DBH of the individual tree and H is H of the individual tree.

2.2.2. The Collection and Analysis of Remote Sensing Data

The Landsat 8 OLI images utilized in this research can be accessed via the official website of the United States Geological Survey (USGS) (https://earthexplorer.usgs.gov (accessed on 18 November 2023)); the images were acquired in November and December 2023, and their image resolution is 30 m (Table 1). The images were preprocessed using ENVI 5.3 software [40], using radiometric calibration and atmospheric correction to eliminate the effects of terrain and aerosol reflections on ground objects, followed by terrain correction of the image using the slope matching method, by which the mean values of terrain shadows on the north and south slopes were made similar. Finally, the images were mosaicked and cropped by the software to obtain the image.

2.2.3. The Process of Extracting and Selecting Remote Sensing Variables

The images were pre-processed utilizing ENVI 5.3, from which three categories of remote sensing variables were derived from Landsat 8 OLI images: spectral features, vegetation indices and texture features. Spectral features represent the fundamental and most straightforward characteristics in remote sensing imagery. Nonetheless, the occurrence of “different spectra for identical objects and identical spectra for disparate objects” is prevalent in remote sensing data, which complicates the attainment of accurate estimation outcomes when solely depending on spectral features [41,42]. Vegetation index is essentially a multi-band reflectance transformation, which can make the transformed data highlight the vegetation information and weaken the non-vegetation feature information, so it can better reflect the growth conditions of the vegetation and spatial distribution of the location compared to the spectral features. Texture features take into account the spatial relationships among pixels, which can effectively represent variations in grayscale within an image and enhance the recognition of spatial information. Consequently, the integrated application of the three categories of remote sensing factors can more accurately capture the characteristics of features and their transformations [43,44]. In the present study, we identified a total of 121 remote sensing factors, comprising 7 spectral features, 7 vegetation indices, 3 information enhancement factors, and 104 texture features (Table 2). Subsequently, we employed RF to filter the importance of remote sensing factors. Finally, we selected the factors exhibiting high significance for model construction, which primarily included 1 spectral feature and 14 texture features (Figure 2).

2.3. Remote Sensing Models

In this research, remote sensing variables were utilized to develop a model for estimating the AGB of Cunninghamia lanceolata. The modeling methods mainly include KNN, GBRT and RF.
KNN is a typical nonparametric regression model, and its basic principle is to predict variables based on the spatial similarity between observation and prediction points, and this method can solve the problem of parameter estimation for remote sensing data whose probability density distribution function is not normally distributed or is unknown [47]. Consequently, it has been extensively utilized in the assessment of biomass and carbon stock. The parameter range of the K value (number of neighbors) of the KNN model is 2–10, the equidistant spacing is 1, the distance metric is Euclidean distance, the weights are set to mean weights, and the algorithm is chosen as the approximate nearest neighbor algorithm.
GBRT is an iterative algorithm for regression tree modeling introduced by Friedman [48]. In this framework, all regression trees are interconnected, with each subsequent tree being constructed based on the residuals and outcomes of the preceding tree [49]. The fundamental approach involves the creation of multiple weak classifiers, which collectively yield a robust classifier following numerous iterations. The application of the GBRT model in this research is grounded in the “Gradient Boosting Regressor” algorithm available within the “Scikit-learn” library for the Python programming language. In this study, based on the best combination of parameters for goodness-of-fit, the maximum number of iterations was chosen to be 60, subsampling to be 0.5, learning rate to be 0.05, the maximum depth of the decision tree to be 7 and the minimum sample of leaf nodes to be 3.
RF is an ensemble learning model introduced by Breiman [50], characterized by the aggregation of multiple randomly generated decision trees. This methodology effectively mitigates the risk of overfitting through the combination of diverse trees, resulting in superior performance in both classification and regression tasks. Consequently, RF has been widely adopted across various research domains [51,52]. In the present study, the RF model is implemented utilizing the “Random Forest Regression” algorithm available in the “Scikit-learn” library for the Python programming language. The RF model decision tree has a parameter range of 20–200, an aliquot spacing of 10, a minimum sampling spacing of 2, a maximum depth of the decision tree of 10, and a minimum sample of leaf nodes of 1.
In this research, the aforementioned three methodologies were employed independently to develop the AGB estimation model for Cunninghamia lanceolata, after which the uncertainties caused by the residual variance errors of the three remote sensing models were analyzed and quantified.

2.4. Accuracy Assessment

Approximately seventy percent (60 groups) of the dataset was randomly allocated for the purpose of model fitting, while the remaining thirty percent (27 groups) was designated for accuracy validation and cross-validation throughout the model fitting process. The performance of the model was assessed using the metrics R2, RMSE and rRMSE. RMSE is the average degree of deviation between the estimated value and the true value. The smaller the RMSE value, the smaller the difference between the estimated value and the actual value, and the higher the accuracy of the model estimation. And rRMSE is the ratio of RMSE to the mean of the actual observed values; the smaller its value, the smaller the proportion of the size of the estimation error relative to the observed values. To enhance the objectivity of the model outcomes, this study conducted ten iterations for each model and subsequently calculated the average of the evaluation metrics for comparative analysis. The relevant formulas are presented as follows:
R 2 = i = 1 n ( y ^ i y ¯ ) 2 i = 1 n ( y i y ¯ ) 2
R M S E = i = 1 n ( y i y ^ i ) 2 n
r R M S E = R M S E y ¯ × 100 %
where n is the number of plots, y i is the observed value, y ^ i is the predicted value, and y ¯ is the mean of the observed values.

2.5. Analysis of the Uncertainty

2.5.1. Calculation of the Plot Scale Uncertainty

(1)
Calculation of the Residual Uncertainty in the Per-tree Biomass Model
To determine the residual uncertainty associated with the per-tree biomass model, it is essential to calibrate the model utilizing the observed biomass values of Cunninghamia lanceolata. Subsequently, the uncertainty associated with the residual variation can be determined by evaluating the discrepancy between the predicted values and the observed values. Let the observed biomass be denoted as A , and the biomass prediction model be represented as f   ( g , α ^ ) . Therefore, A can be articulated using the following equation:
A = f g , α ^ + ε
where α ^ is the predicted value of the parameter in the biomass model, g is the variable (DBH, H) and ε is the difference between the predicted and measured values. The form of f   ( g , a ) used in this paper is f   g , a = a D b H c . The equation of residual ε was as follows:
ε = A f g , α ^
where ε is the difference between the predicted and measured values, A is the observed biomass value and f   ( g , α ^ ) is the biomass prediction model. The uncertainty associated with the variation in model residuals can be quantified through the standard deviation of the residuals. According to the relevant study [53], the standard deviation of the residuals exhibits a linear correlation with biomass. Consequently, the uncertainty associated with the variation in model residuals can be determined by establishing a linear relationship between the standard deviation of the residuals and biomass. The corresponding formula is presented below:
σ ε = δ ( g , a ^ )
where σ ε is the standard deviation of the residuals and δ is the fitting parameter. In this study, the six-step methodology outlined in reference [54] was employed to compute the value of σ ε . The procedure is delineated as follows: (1) The measured biomass values of Cunninghamia lanceolata ( y ) were organized in ascending order. (2) The residuals ( M ε ) were determined by subtracting the predicted values from the biomass model ( y ^ ), with the residuals representing the discrepancies between the measured and predicted values. (3) The plots were categorized into groups, with each group containing n blocks, and any final group with fewer than n blocks was amalgamated with the preceding group. (4) The mean values of the predicted carbon stock ( y ^ ¯ ) and the standard deviations of the residuals ( σ ε ) were computed for each group. The calculations for the mean value, residuals and standard deviation of the residuals were conducted as follows:
y ^ ¯ = 1 n j = 1 n y ^ j
M ε = y y ^
σ ε = 1 n 1 j = 1 n M ε j M ¯ ε j 2
where y is the measured biomass values, y ^ is the predicted values, y ^ j is the predicted value of AGB in the j th plot, M ε j , M ¯ ε j are the residual and residual standard deviation of the j th plot, respectively, and n is the number of plots.
(5) The predicted mean value y ^ ¯ and standard deviation of the residuals σ ε should be fitted accordingly. The corresponding equation for this fitting can be articulated as follows:
σ ε = θ ( y ^ ¯ )
(6) To calculate the residual uncertainty of the model, substitute the predicted AGB for each plot into the fitted equation. Subsequently, compute the total of the residual standard deviations across all plots and divide this sum by the aggregate of the measured AGB values.
(2)
Calculation of the Parameter Uncertainty in the Per-tree Biomass Model
To assess the uncertainty associated with the model parameters, we can employ the first-order Taylor series expansion. If the function possesses an n th order derivative at x 0 , as x approaches x 0 , the function can be represented as follows:
f x f x 0 + f x 0 x x 0 + f x 0 2 ! ( x x 0 ) 2 + + f n x 0 n ! ( x x 0 ) n
Then, the biomass model A = f   g , a ^ can be expressed after a Taylor series first order expansion as
A = f ( g , a ^ ) f ( g , a ) + f g , α a j ( α ^ α )
where f ( g , a ^ ) is the predicted value of the per-tree biomass model, x is the variable in the biomass model, α ^ is the simulated value of parameters, f g , α a j is the partial derivative of the parameters and f g , α a j ( α ^ α ) reflects the parametrically induced modeling error σ λ , which can be approximated as
σ λ 2 Z j k v a r ( a ) Z j k T
where Z j k represents the matrix with Z designated as j × k , Z j k T signifies the transpose matrix of Z j k and v a r ( α ) denotes the covariance matrix associated with the estimated parameter α within the biomass equation.
(3)
Synthesis of Uncertainty
When the indirectly measured quantity is derived from individual measurements obtained through direct measurements, the uncertainty associated with the indirect measurements can be articulated as a composite uncertainty that encompasses the uncertainties of each direct measurement [55]. Consequently, the overall error can be represented as
K Z = K j 1 2 + K j 2 2 + K j 3 2 + + K j i 2
where K Z is the total error and K j i is the error of the i th variable. According to this equation, the overall uncertainty ( σ z ) is determined as follows:
σ z = σ ε 2 + σ λ 2
where σ ε represents the residual uncertainty associated with the per-tree biomass model, while σ λ denotes the uncertainty pertaining to the model parameters. The propagation of error from the per-tree biomass model to the uncertainty at the plot scale, denoted as σ p , is computed using the following methodology:
σ p = i = 1 N σ t i 2 / S
where N represents the quantity of sampled wood, σ t i denotes the overall uncertainty associated with the per-tree biomass model for the i th plot of wood plants and S signifies the area of the plot.

2.5.2. Calculation of Uncertainty for Remote Sensing Models

The six-step methodology [54] was employed to determine the uncertainty associated with the residual variance of the remote sensing estimation model. The detailed procedures and formulas utilized in this process are outlined in the six-step section of the initial point in Section 2.5.1. Consequently, the uncertainty of the remote sensing model ( σ R ) can be articulated as follows:
σ R = i = 1 n σ ε / y z
where i = 1 n σ ε represents the cumulative sum of the standard deviations of the residuals across all plots, while y z signifies the total of the measured values of AGB.

2.5.3. Calculation of Total Uncertainty from Different Sources

The total uncertainty ( σ ) of the Cunninghamia lanceolata AGB was calculated as follows:
σ = σ p 2 + σ R 2
where σ p is the plot scale uncertainty and σ R is the remote sensing model uncertainty.

3. Results and Analysis

3.1. Accuracy of Different Remote Sensing Models

Scatter plots illustrating the predictive accuracy of AGB estimation models utilizing the KNN, GBRT and RF methodologies are shown in Figure 3. The data presented in the Figure indicate that, under identical modeling conditions, the KNN model exhibited the least favorable performance, achieving an R2 of 0.368 and a RMSE of 42.314 t/ha. The GBRT model ranked second, with an R2 of 0.636 and an RMSE of 32.056 t/ha. In contrast, the RF model demonstrated superior performance, attaining an R2 of 0.867 and an RMSE of 19.325 t/ha. When compared to the KNN and GBRT models, the RF model showed an improvement in R2 of 0.499 and 0.231, respectively, while the RMSE was reduced by 22.989 t/ha and 12.731 t/ha, respectively. Furthermore, an analysis of the predicted and observed values across various models reveals a notable tendency for KNN and GBRT to exhibit a significant overestimation of low values and underestimation of high values. The lowest measured value is 21.942 t/ha, while the highest is 277.09 t/ha. The lowest prediction value based on the KNN model is 70.487 t/ha and the highest is 143.022 t/ha, with a large range, and there are obvious saturation characteristics. While the corresponding GBRT model-predicted values are 60.368 t/ha and 189.963 t/ha, these two models of the relative error between the groups of data are larger. The issue of low values associated with the overestimation and underestimation of high values in the RF model has been notably enhanced, yielding predicted values of 50.946 t/ha and 223.989 t/ha, respectively. The relative errors among the various data groups are minimal, indicating that the RF model demonstrates superior estimation accuracy.

3.2. Calculation of Plot Scale Uncertainty

3.2.1. Uncertainty of the Per-Tree Biomass Model

The parameters of the binary biomass model were determined using the measured AGB of Cunninghamia lanceolata, and the fitted binary biomass model was A = 0.077 D 1.844   H 0.654 ; the R2 of the model was 0.970 (Figure 4a). Figure 4b presents a scatter plot illustrating the error equation associated with the per-tree biomass model. The Figure indicates that the standard deviations of the residuals exhibit an upward trend corresponding to the increase in the predicted biomass values. The fitted model, which accounts for the uncertainty of the model residuals, is expressed as y = 0.0479 x 0.3275 . Then, the predicted values of biomass were substituted into the fitted formula to determine the uncertainty of the residual variance for the binary biomass model, which was 4.62%. To assess the uncertainty associated with the model parameters, we can calculate it by estimating the variance–covariance matrix of the parameters and integrating this information with Equation (14). Through this calculation, it can be determined that the variance–covariance matrix for the parameters of the binary biomass model can be represented as 0.000000012 0.00000047 0.00000250 0.00000047 0.00036000 0.00058000 0.00000250 0.00058000 0.00200000 , which was substituted into Equation (14), with the result that the value of parameter uncertainty was 4.81%, 6%. Synthesizing the two uncertainties, the total uncertainty of the binary biomass model was 6.67%.

3.2.2. Uncertainty at the Plot Scale

The overall uncertainty was determined to be 6.67% by integrating the uncertainty associated with the per-tree biomass model, as outlined in Equation (17). When this uncertainty is applied to the plot scale, it can be quantified using Equation (17), yielding a resultant uncertainty of 3.23%. Table 3 presents the various sources of uncertainty involved in biomass estimation at the plot scale.

3.3. Uncertainty of Remote Sensing Estimation Models

Figure 5 illustrates the correlation between the standard deviation of the residuals for each model and the mean values of the predicted biomass, and we can find that the uncertainties of the KNN, GBRT and RF models are 14.88%, 6.3% and 5.93%, respectively, by substituting the predicted values of each model into the fitting formula. In order to more intuitively reflect the differences between the predicted and measured values of each model, we compared the data of 87 sample plots with the corresponding results in Figure 6; as can be seen from the Figure, there are large differences between the predicted values of the KNN model and the measured values of the plots, with a maximum absolute difference between the two of 134.067 t/ha, and the same phenomenon exists in the GBRT model, with a maximum absolute difference between the two of 87.126 t/ha. For RF, compared to the first two models, the phenomenon of the difference between the predicted value and the measured value has been greatly improved, the maximum absolute difference is 53.1 t/ha, and the minimum absolute difference is only 0.03 t/ha, of which most of the absolute difference is within 20 t/ha, which indicates that the predicted value of the model and the measured value is more consistent with the predicted value, and has a good prediction accuracy, and its uncertainty is also the highest among the three models.

3.4. Total Uncertainty in Biomass Estimation

Table 4 presents the comprehensive uncertainty associated with the remote-sensing-based estimation of AGB for Cunninghamia lanceolata. By integrating the plot scale uncertainty with the uncertainty derived from the remote sensing model, as outlined in Equation (19), we determined the total uncertainties to be 15.22%, 7.08% and 6.75% for the KNN, GBRT and RF models, respectively.

4. Discussion

4.1. Analysis of Plot Scale Uncertainty

Despite the extensive research undertaken to quantify AGB in forest ecosystems, significant uncertainties persist in the estimation methodologies [35,56]. In this paper, we constructed KNN, GBRT and RF models based on remote sensing data, and the error equations were used to analyze the uncertainty at the plot scale and remote sensing model scale. Our analysis indicates that the uncertainty associated with the remote sensing estimation model constitutes the primary source of uncertainty, surpassing that of plot scale uncertainty. Among the models evaluated, the KNN model exhibited the highest level of uncertainty at 14.88%, while the RF model demonstrated the lowest uncertainty at 5.93%. Within the context of plot-scale uncertainties, the predominant source of uncertainty was identified as the parameter error associated with the per-tree biomass model. In comparison, the uncertainty attributed to residual variance was marginally lower. This observation aligns with the findings of Huang et al. [57]. However, it is noteworthy that the parameter error of the model in this study was found to be greater than that reported by Chen et al. [33] and Wang et al. [58]. The discrepancy may be attributed to the fact that Chen et al. [33] employed larger modeling plots compared to the present study, suggesting that the parameter error of the model approaches zero as the plot number increases. In contrast, Wang et al. [58] utilized a grouping approach and assessed the errors of both univariate and bivariate biomass models through the Taylor series method. This indicates that grouping can significantly mitigate the variability of the parameter covariance matrices, thereby yielding a more precise estimation of parameter error. This study focuses solely on the uncertainty associated with the binary biomass model, while the uncertainty of the univariate biomass model warrants further investigation in future research.

4.2. Uncertainty in Different Remote Sensing Estimation Models

In comparison to conventional parametric regression techniques, machine learning approaches offer distinct advantages in the estimation of AGB in forests [59,60]. Consequently, we developed a remote sensing model for estimating the AGB of Cunninghamia lanceolata utilizing Landsat 8 OLI data in conjunction with three different machine learning methodologies. The assessment of AGB in forests utilizing the KNN and GBRT models results in significant overestimations of lower biomass values and underestimations of higher biomass values. Furthermore, the fitting accuracies of both KNN and GBRT models are inferior to those achieved by the RF model. While the inaccuracies in estimating AGB within the RF model have shown considerable improvement, this issue persists. This persistence may be attributed to the substantial variability in measured forest AGB values within the study area, which renders the RF model more susceptible to the influence of outliers [61]. Furthermore, the study area exhibits heterogeneity in its characteristics, which contributes to its diversity. In regions characterized by low AGB in forests, the forest pixels are influenced by the spectral signatures of other non-forest elements, leading to an overestimation of AGB when utilizing the model. Conversely, in areas with elevated forest AGB values, the reflectance spectra of the vegetation tend to become saturated, resulting in estimated AGB values that exceed the actual measured values. For instance, Suwanlee et al. [62] developed support vector machine (SVM) and RF models to map forest AGB utilizing Sentinel-1 and Sentinel-2 data, respectively. Their findings indicated that the RF model demonstrated superior efficiency and consistency in comparison to the SVR model over the two-year period analyzed. Similarly, Zhuo et al. [63] created various remote sensing models to estimate forest AGB using measured biomass data and UAV hyperspectral imagery. Their results further corroborated the RF model’s preeminence, achieving an R2 of 0.82 and a RMSE of 116.14 g/m2, outperforming three other estimation models during the same timeframe. Numerous studies have concluded that the RF model possesses distinct advantages over alternative models in the estimation of forest AGB.

4.3. Limitations and Future Research

In the context of plot scale analysis, it is essential to account for the influences of measurement errors, sampling variability, plot size and plot positioning inaccuracies, alongside the uncertainty introduced by the per-tree biomass model. Notably, several researchers have investigated the uncertainty associated with sample plot size and positioning errors. Mayamanikandan et al. [64] conducted a study examining the uncertainty associated with plot size, shape and location errors utilizing Sentinel data. Their findings indicated that model uncertainty was reduced by approximately 50% when the plot size was increased from 0.01 ha to 0.64 ha. Similarly, Frazer et al. [65] explored the impact of plot size and alignment errors on the uncertainty of LiDAR-derived canopy metrics and AGB estimation models. Their results demonstrated a significant enhancement in AGB predictions and a considerable reduction in model uncertainty when the plot size was expanded from 314 m2 (10 m radius) to 1964 m2 (25 m radius). Persson et al. [66] used continuous forest inventory data to investigate the uncertainty of field survey data due to measurement error, model error and sample location error. By taking these errors into account, the prediction uncertainty based on remote sensing was reduced by 6–18%, and the residuals of the anisotropic growth model and the location error were the main sources of error. All of the above results showed that the plot size showed an inverse relationship with uncertainty, and the model uncertainty decreased with the increase in plot size. In addition, for Cunninghamia lanceolata, as a fast-growing coniferous timber species endemic to subtropical regions in China, the degree of growth adaptation in different regions was not consistent, and its stand biomass also had large differences; the prediction accuracy of the same model at different regional scales will produce large differences, and if regional differences are ignored, this will inevitably lead to inaccurate model estimation results [67]. Therefore, in subsequent studies we should explore the effects of uncertainty in biomass estimation at different scales and regions, and also consider the effects of plot size and positioning errors.

5. Conclusions

In this paper, Shaoguan, Heyuan and Huizhou in Guangdong Province were taken as the study area, and Cunninghamia lanceolata was used as the research object. Biomass estimation models utilizing KNN, GBRT and RF were developed using Landsat 8 OLI remote sensing imagery in conjunction with field survey data. Error equations were employed to assess the uncertainties associated with both plot scale and remote sensing model scale, facilitating an analysis of how variations in scale influence the estimation of AGB. (1) The primary source of uncertainty at the plot scale in the per-tree biomass model was identified as parameter uncertainty, while the residual uncertainty of the model was relatively minor. (2) In contrast, at the plot scale, the uncertainty associated with the remote sensing model emerged as the predominant factor influencing biomass estimation. Among the models evaluated, the KNN model exhibited the lowest estimation accuracy (R2 = 0.368) and the highest uncertainty (14.88%). The GBRT model followed, with an R2 of 0.636 and an uncertainty of 6.3%. Conversely, the RF model demonstrated the highest precision, achieving an R2 of 0.867 and the lowest uncertainty at 5.93%. When compared to the KNN and GBRT models, the RF model’s R2 improved by 0.499 and 0.231, respectively, while its uncertainty decreased by 8.95% and 0.37%. This study analyzed the uncertainties at both the plot scale and the remote sensing model scale to identify the key factors influencing the accuracy of biomass estimation, thereby providing insights that may enhance estimation accuracy and mitigate uncertainty in the estimation process.

Author Contributions

Y.H.: Writing—original draft, software, data curation. B.Q.: methodology, writing—review and editing. L.F.: writing—review and editing, funding acquisition, project administration, supervision. D.X.: writing—review and editing. Z.W.: writing—review and editing. Y.L.: writing—review and editing. J.Y.: writing—review and editing. Q.W.: writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This study is supported by the Fundamental Research Funds for the Central Nonprofit Research Institution of CAF under Grant CAFYBB2022ZB002.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We would like to acknowledge all the other individuals who contributed to this paper.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Shao, W.; Cai, J.; Wu, H.; Liu, J.; Zhang, H.; Huang, H. An assessment of carbon storage in China’s arboreal forests. Forests 2017, 8, 110. [Google Scholar] [CrossRef]
  2. Cook-Patton, S.C.; Shoch, D.; Ellis, P.W. Dynamic global monitoring needed to use restoration of forest cover as a climate solution. Nat. Clim. Chang. 2021, 11, 366–368. [Google Scholar] [CrossRef]
  3. Zhang, M.; Yang, Z.; Liu, L.; Zhou, D. Impact of renewable energy investment on carbon emissions in China—An empirical study using a nonparametric additive regression model. Sci. Total Environ. 2021, 785, 147109. [Google Scholar] [CrossRef]
  4. Cieszewski, C.J.; Zasada, M.; Lowe, R.C.; Liu, S. Estimating Biomass and Carbon Storage by Georgia Forest Types and Species Groups Using the FIA Data Diameters, Basal Areas, Site Indices, and Total Heights. Forests 2021, 12, 141. [Google Scholar] [CrossRef]
  5. Qian, C.; Qiang, H.; Wang, F.; Li, M. Estimation of Forest Aboveground Biomass in Karst Areas Using Multi-Source Remote Sensing Data and the K-DBN Algorithm. Remote Sens. 2021, 13, 5030. [Google Scholar] [CrossRef]
  6. Pettorelli, N.; Wegmann, M.; Skidmore, A.; Mücher, S.; Dawson, T.P.; Fernandez, M.; Geller, G.N. Framing the concept of satellite remote sensing essential biodiversity variables: Challenges and future directions. Remote Sens. Ecol. Conserv. 2016, 2, 122–131. [Google Scholar] [CrossRef]
  7. Luo, K. Spatial pattern of forest carbon storage in the vertical and horizontal directions based on HJ-CCD remote sensing imagery. Remote Sens. 2019, 11, 788. [Google Scholar] [CrossRef]
  8. Long, Y.; Jiang, F.; Sun, H.; Wang, T.; Zou, Q.; Chen, C.S. Estimating vegetation carbon storage based on optimal bandwidth selected from geographically weighted regression model in Shenzhen City. Acta Ecol. Sin. 2022, 42, 4933–4945. [Google Scholar]
  9. Zharko, V.O.; Bartalev, S.A.; Sidorenkov, V.M. Forest growing stock volume estimation using optical remote sensing over snow-covered ground: A case study for Sentinel-2 data and the Russian Southern Taiga region. Remote Sens. Lett. 2020, 11, 677–686. [Google Scholar] [CrossRef]
  10. Zhai, Y.; Wang, L.; Yao, Y.; Jia, J.; Li, R.; Ren, Z.; He, X.Y.; Ye, Z.W.; Zhang, X.Y.; Chen, Y.Y.; et al. Spatially continuous estimation of urban forest aboveground biomass with UAV-LiDAR and multispectral scanning: An allometric model of forest structural diversity. Agric. For. Meteorol. 2025, 360, 110301. [Google Scholar] [CrossRef]
  11. Foody, G.M.; Boyd, D.S.; Cutler, M.E. Predictive relations of tropical forest biomass from Landsat TM data and their transferability between regions. Remote Sens. Environ. 2003, 85, 463–474. [Google Scholar] [CrossRef]
  12. Lu, D.; Chen, Q.; Wang, G.; Moran, E.; Batistella, M.; Zhang, M.; Saah, D. Aboveground forest biomass estimation with Landsat and LiDAR data and uncertainty analysis of the estimates. Int. J. For. Res. 2012, 2012, 436537. [Google Scholar] [CrossRef]
  13. Ehlers, D.; Wang, C.; Coulston, J.; Zhang, Y.; Pavelsky, T.; Frankenberg, E.; Song, C. Mapping forest aboveground biomass using multisource remotely sensed data. Remote Sens. 2022, 14, 1115. [Google Scholar] [CrossRef]
  14. Minh, D.H.T.; Le Toan, T.; Rocca, F.; Tebaldini, S.; d’Alessandro, M.M.; Villard, L. Relating P-band synthetic aperture radar tomography to tropical forest biomass. IEEE Trans. Geosci. Remote Sens. 2013, 52, 967–979. [Google Scholar] [CrossRef]
  15. Liu, J.; Xu, C.; Zhao, B.; Yang, Z.; Liu, Y.; Zhang, S.; Qi, W. Deformation Slope Extraction and Influencing Factor Analysis Using LT-1 Satellite Data: A Case Study of Chongqing and Surrounding Areas, China. Remote Sens. 2025, 17, 156. [Google Scholar] [CrossRef]
  16. Song, J.; Liu, X.; Adingo, S.; Guo, Y.; Li, Q. A Comparative Analysis of Remote Sensing Estimation of Aboveground Biomass in Boreal Forests Using Machine Learning Modeling and Environmental Data. Sustainability (2071–1050) 2024, 16, 7232. [Google Scholar] [CrossRef]
  17. Wei, Y.Z. Inversion of Forest Biomass by Machine Learning Method Based on SAR Images. Master’s Thesis, Chang’an University, Xi’an, China, 2020. [Google Scholar]
  18. Behera, D.; Kumar, V.A.; Rao, J.P.; Padal, S.B.; Ayyappan, N.; Reddy, C.S. Estimating aboveground biomass of a regional forest landscape by integrating textural and spectral variables of Sentinel-2 along with ancillary data. J. Indian Soc. Remote Sens. 2024, 52, 917–929. [Google Scholar] [CrossRef]
  19. Cui, L.; Zhang, J.; Dai, Y.; Xie, R.; Zhu, Z.; Sun, M.; Zhao, K. Influence of BRDF Models and Solar Zenith Angles on Forest Above-Ground Biomass Derived from MODIS Multi-Angular Indices. Forests 2024, 15, 541. [Google Scholar] [CrossRef]
  20. Galidaki, G.; Zianis, D.; Gitas, I.; Radoglou, K.; Karathanassi, V.; Tsakiri–Strati, M.; Mallinis, G. Vegetation biomass estimation with remote sensing: Focus on forest and other wooded land over the Mediterranean ecosystem. Int. J. Remote Sens. 2017, 38, 1940–1966. [Google Scholar] [CrossRef]
  21. Fu, Y.; Li, R.; Zhu, Z.; Xue, Y.; Ding, H.; Wang, X.; Xia, W. SCARF: A new algorithm for continuous prediction of biomass dynamics using machine learning and Landsat time series. Remote Sens. Environ. 2024, 314, 114348. [Google Scholar] [CrossRef]
  22. Nguyen, H.H.; Bich, N.T.N.; Nghia, N.H.; Lan, T.T.N.; Nguyen, T.T.H.; Dang, H.V. Estimation of changes in above-ground biomass and carbon stocks of mangrove forests using sentinel-2a in Thai Thuy district, Thai Binh province during 2015–2019. Vietnam J. Sci. Technol. 2022, 60, 73–91. [Google Scholar] [CrossRef]
  23. Liu, Y.; Gong, W.; Xing, Y.; Hu, X.; Gong, J. Estimation of the forest stand mean height and aboveground biomass in Northeast China using SAR Sentinel-1B, multispectral Sentinel-2A, and DEM imagery. ISPRS J. Photogramm. Remote Sens. 2019, 151, 277–289. [Google Scholar] [CrossRef]
  24. Huang, T.; Ou, G.; Wu, Y.; Zhang, X.; Liu, Z.; Xu, H.; Xu, C. Estimating the Aboveground Biomass of Various Forest Types with High Heterogeneity at the Provincial Scale Based on Multi-Source Data. Remote Sens. 2023, 15, 3550. [Google Scholar] [CrossRef]
  25. Imran, A.B.; Ahmed, S. Potential of Landsat-8 spectral indices to estimate forest biomass. Int. J. Hum. Cap. Urban Manag. 2018, 3, 303. [Google Scholar]
  26. Tian, L.; Wu, X.; Tao, Y.; Li, M.; Qian, C.; Liao, L.; Fu, W. Review of remote sensing-based methods for forest aboveground biomass estimation: Progress, challenges, and prospects. Forests 2023, 14, 1086. [Google Scholar] [CrossRef]
  27. Saarela, S.; Wästlund, A.; Holmström, E.; Mensah, A.A.; Holm, S.; Nilsson, M.; Ståhl, G. Mapping aboveground biomass and its prediction uncertainty using LiDAR and field data, accounting for tree-level allometric and LiDAR model errors. For. Ecosyst. 2020, 7, 43. [Google Scholar] [CrossRef]
  28. Qin, L.; Meng, S.; Zhou, G.; Liu, Q.; Xu, Z. Uncertainties in above ground tree biomass estimation. J. For. Res. 2021, 32, 1989–2000. [Google Scholar] [CrossRef]
  29. Fareed, N.; Numata, I. Evaluating the impact of field-measured tree height errors correction on aboveground biomass modeling using airborne laser scanning and GEDI datasets in Brazilian Amazonia. Trees For. People 2025, 19, 100751. [Google Scholar] [CrossRef]
  30. Shettles, M.; Hilker, T.; Temesgen, H. Examination of uncertainty in per unit area estimates of aboveground biomass using terrestrial LiDAR and ground data. Can. J. For. Res. 2016, 46, 706–715. [Google Scholar] [CrossRef]
  31. Berger, A.; Gschwantner, T.; McRoberts, R.E.; Schadauer, K. Effects of measurement errors on individual tree stem volume estimates for the Austrian National Forest Inventory. For. Sci. 2014, 60, 14–24. [Google Scholar] [CrossRef]
  32. Breidenbach, J.; Antón-Fernández, C.; Petersson, H.; McRoberts, R.E.; Astrup, R. Quantifying the model-related variability of biomass stock and change estimates in the Norwegian National Forest Inventory. For. Sci. 2014, 60, 25–33. [Google Scholar] [CrossRef]
  33. Chen, Q.; Laurin, G.V.; Valentini, R. Uncertainty of remotely sensed aboveground biomass over an African tropical forest: Propagating errors from trees to plots to pixels. Remote Sens. Environ. 2015, 160, 134–143. [Google Scholar] [CrossRef]
  34. Silveira, E.M.; Santo, F.D.E.; Wulder, M.A.; Júnior, F.W.A.; Carvalho, M.C.; Mello, C.R.; Scolforo, J.R. Pre-stratified modelling plus residuals kriging reduces the uncertainty of aboveground biomass estimation and spatial distribution in heterogeneous savannas and forest environments. For. Ecol. Manag. 2019, 445, 96–109. [Google Scholar] [CrossRef]
  35. Zhang, Q.; Xu, L.; Zhang, M.; Wang, Z.; Gu, Z.; Wu, Y.; Lu, Z. Uncertainty analysis of remote sensing pretreatment for biomass estimation on Landsat OLI and Landsat ETM+. ISPRS Int. J. Geo-Inf. 2020, 9, 48. [Google Scholar] [CrossRef]
  36. Gregoire, T.G.; Næsset, E.; McRoberts, R.E.; Ståhl, G.; Andersen, H.E.; Gobakken, T.; Nelson, R. Statistical rigor in LiDAR-assisted estimation of aboveground forest biomass. Remote Sens. Environ. 2016, 173, 98–108. [Google Scholar] [CrossRef]
  37. Coleman, K.; Müller, J.; Kuenzer, C. Remote Sensing of Forests in Bavaria: A Review. Remote Sens. 2024, 16, 1805. [Google Scholar] [CrossRef]
  38. GB/T 43648-2024; Stump Biomass Modeling and Carbon Measurement Parameters for Major Tree Species. State Administration of Market Supervision and Administration (SACMA) and Standardization Administration of PRC (SAC): Beijing, China, 2015.
  39. Tang, X.; Fehrmann, L.; Guan, F.; Forrester, D.I.; Guisasola, R.; Pérez-Cruzado, C.; Kleinn, C. A generalized algebraic difference approach allows an improved estimation of aboveground biomass dynamics of Cunninghamia lanceolata and Castanopsis sclerophylla forests. Ann. For. Sci. 2017, 74, 12. [Google Scholar] [CrossRef]
  40. Song, R. Vegetation coverage evaluation of Yangtze River great protection based on envi software—A case study of Maanshan City. In Journal of Physics: Conference Series; IOP Publishing: Bristol, UK, 2021; Volume 1744, p. 032208. [Google Scholar]
  41. Carleer, A.P.; Wolff, E. Urban land cover multi-level region-based classification of VHR data by selecting relevant features. Int. J. Remote Sens. 2006, 27, 1035–1051. [Google Scholar] [CrossRef]
  42. Wang, W.; Jiang, Y.; Wang, G.; Guo, F.; Li, Z.; Liu, B. Multi-Scale LBP texture feature learning network for remote sensing interpretation of land desertification. Remote Sens. 2022, 14, 3486. [Google Scholar] [CrossRef]
  43. Yuan, J.; Wang, D.; Li, R. Remote sensing image segmentation by combining spectral and texture features. IEEE Trans. Geosci. Remote Sens. 2013, 52, 16–24. [Google Scholar] [CrossRef]
  44. Duan, M.; Song, X.; Liu, X.; Cui, D.; Zhang, X. Mapping the soil types combining multi-temporal remote sensing data with texture features. Comput. Electron. Agric. 2022, 200, 107230. [Google Scholar] [CrossRef]
  45. Wei, H.L.; Qi, Y.J. Analysis of grassland degradation of the Tibet Plateau and human driving forces based on remote sensing. Pratacultural Sci. 2016, 33, 2576–2586. [Google Scholar]
  46. Zhang, C.; Huang, C.; Li, H.; Liu, Q.; Li, J.; Bridhikitti, A.; Liu, G. Effect of textural features in remote sensed data on rubber plantation extraction at different levels of spatial resolution. Forests 2020, 11, 399. [Google Scholar] [CrossRef]
  47. Li, B.; Xu, X.; Zhang, L.; Han, J.; Bian, C.; Li, G.; Liu, L.; Jin, L. Above-ground biomass estimation and yield prediction in potato by using UAV-based RGB and hyperspectral imaging. ISPRS J. Photogramm. Remote Sens. 2020, 162, 161–172. [Google Scholar] [CrossRef]
  48. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 1189–1232. [Google Scholar] [CrossRef]
  49. Lawrence, R.; Bunn, A.; Powell, S.; Zambon, M. Classification of remotely sensed imagery using stochastic gradient boosting as a refinement of classification tree analysis. Remote Sens. Environ. 2004, 90, 331–336. [Google Scholar] [CrossRef]
  50. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  51. Sun, J.; Zhong, G.; Huang, K.; Dong, J. Banzhaf random forests: Cooperative game theory based random forests with consistency. Neural Netw. 2018, 106, 20–29. [Google Scholar] [CrossRef]
  52. Wang, X.; Liu, T.; Zheng, X.; Peng, H.; Xin, J.; Zhang, B. Short-term prediction of groundwater level using improved random forest regression with a combination of random features. Appl. Water Sci. 2018, 8, 125. [Google Scholar] [CrossRef]
  53. Qin, L.H.; Zhang, M.Z.; Zhong, S.H.; Yu, X.H. Model uncertainty analysis in forest biomass estimation. J. Ecol. 2017, 37, 79127919. [Google Scholar]
  54. Hosmer, D.W., Jr.; Lemeshow, S.; Sturdivant, R.X. Applied Logistic Regression; John Wiley & Sons: Hoboken, NJ, USA, 2013; Volume 398. [Google Scholar]
  55. Zeger, S.L.; Thomas, D.; Dominici, F.; Samet, J.M.; Schwartz, J.; Dockery, D.; Cohen, A. Exposure measurement error in time-series studies of air pollution: Concepts and consequences. Environ. Health Perspect. 2000, 108, 419–426. [Google Scholar] [CrossRef] [PubMed]
  56. Li, Z.; Bi, S.; Hao, S.; Cui, Y. Aboveground biomass estimation in forests with random forest and Monte Carlo-based uncertainty analysis. Ecol. Indic. 2022, 142, 109246. [Google Scholar] [CrossRef]
  57. Huang, Y.J.; Zhang, J.L.; Hu, Y.P.; Chen, T. Uncertainty analysis of remote sensing for aboveground biomass estimation of Pinus densata. J. Zhejiang Agric. For. Univ. 2022, 39, 531–539. [Google Scholar]
  58. Wang, K.R.; Shu, Q.T.; Zhao, H.Y.; Tan, D.H.; Yuan, Z.J. Modeling uncertainty of single-timber aboveground biomass of Pinus densata. J. Southwest For. Univ. 2021, 41, 100–106. [Google Scholar]
  59. Anees, S.A.; Mehmood, K.; Khan, W.R.; Sajjad, M.; Alahmadi, T.A.; Alharbi, S.A.; Luo, M. Integration of machine learning and remote sensing for above ground biomass estimation through Landsat-9 and field data in temperate forests of the Himalayan region. Ecol. Indic. 2024, 82, 102732. [Google Scholar] [CrossRef]
  60. Zhang, S.H.; He, L.; Duan, J.Z.; Zang, S.L.; Yang, T.C.; Schulthess, U.R.S.; Feng, W. Aboveground wheat biomass estimation from a low-altitude UAV platform based on multimodal remote sensing data fusion with the introduction of terrain factors. Precis. Agric. 2024, 25, 119–145. [Google Scholar] [CrossRef]
  61. Hosseiny, B.; Mahdianpari, M.; Hemati, M.; Radman, A.; Mohammadimanesh, F.; Chanussot, J. Beyond supervised learning in remote sensing: A systematic review of deep learning approaches. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023, 17, 1035–1052. [Google Scholar] [CrossRef]
  62. Suwanlee, S.R.; Pinasu, D.; Som-ard, J.; Borgogno-Mondino, E.; Sarvia, F. Estimating sugarcane aboveground biomass and carbon stock using the combined time series of sentinel data with machine learning algorithms. Remote Sens. 2024, 16, 750. [Google Scholar] [CrossRef]
  63. Zhuo, W.; Wu, N.; Shi, R.; Liu, P.; Zhang, C.; Fu, X.; Cui, Y. Aboveground biomass retrieval of wetland vegetation at the species level using UAV hyperspectral imagery and machine learning. Ecol. Indic. 2024, 166, 112365. [Google Scholar] [CrossRef]
  64. Mayamanikandan, T.; Reddy, S.; Fararoda, R.; Thumaty, K.C.; Praveen, M.S.S.; Rajashekar, G.; Gummapu, J. Quantifying the influence of plot-level uncertainty in above ground biomass up scaling using remote sensing data in central Indian dry deciduous forest. Geocarto Int. 2022, 37, 3489–3503. [Google Scholar] [CrossRef]
  65. Frazer, G.W.; Magnussen, S.; Wulder, M.A.; Niemann, K.O. Simulated impact of sample plot size and co-registration error on the accuracy and uncertainty of LiDAR-derived estimates of forest stand biomass. Remote Sens. Environ. 2011, 115, 636–649. [Google Scholar] [CrossRef]
  66. Persson, H.J.; Ekström, M.; Ståhl, G. Quantify and account for field reference errors in forest remote sensing studies. Remote Sens. Environ. 2022, 283, 113302. [Google Scholar] [CrossRef]
  67. Liu, Y.F.; Li, Z.J.; Wu, Y.; Lu, C.; Ou, G.L. Remote sensing estimation of above-ground biomass of oak and Simao pine forests in Xishuangbanna. West. For. Sci. 2023, 52, 147–155. [Google Scholar]
Figure 1. Sample point distribution map.
Figure 1. Sample point distribution map.
Forests 16 00230 g001
Figure 2. Importance of different features.
Figure 2. Importance of different features.
Forests 16 00230 g002
Figure 3. (a) Scatter plot representing the KNN model, (b) scatter plot representing the GBRT model and (c) scatter plot representing the RF model.
Figure 3. (a) Scatter plot representing the KNN model, (b) scatter plot representing the GBRT model and (c) scatter plot representing the RF model.
Forests 16 00230 g003
Figure 4. (a) The three-dimensional scatter plot representing the binary biomass model for Cunninghamia lanceolata and (b) residual plot of the model.
Figure 4. (a) The three-dimensional scatter plot representing the binary biomass model for Cunninghamia lanceolata and (b) residual plot of the model.
Forests 16 00230 g004
Figure 5. (a) Scatter plot illustrating the error function associated with the KNN, (b) scatter plot illustrating the error function associated with the GBRT and (c) scatter plot illustrating the error function associated with the RF model.
Figure 5. (a) Scatter plot illustrating the error function associated with the KNN, (b) scatter plot illustrating the error function associated with the GBRT and (c) scatter plot illustrating the error function associated with the RF model.
Forests 16 00230 g005
Figure 6. (a) Variation in each field plot for the KNN, (b) variation in each field plot for the GBRT and (c) variation in each field plot for the RF.
Figure 6. (a) Variation in each field plot for the KNN, (b) variation in each field plot for the GBRT and (c) variation in each field plot for the RF.
Forests 16 00230 g006
Table 1. Images of the study area.
Table 1. Images of the study area.
Survey YearIDAcquisition DateCloudy/%
2023LC08_L2SP_121044_20231118_20231122_02_T118 November 20230.02%
LC08_L2SP_121043_20231118_20231122_02_T118 November 20230.02%
LC08_L2SP_122044_20231125_20231129_02_T125 November 20231.28%
LC08_L2SP_122043_20231227_20240104_02_T127 December 20234.76%
Table 2. Variables obtained from the remote sensing images.
Table 2. Variables obtained from the remote sensing images.
SensorFeature TypeFeature NameDefinition
Landsat 8Spectral bandsB2, B3, B4, B5, B6, B7Blue, Green, Red, NIR, SWIR1, SWIR2
Information enhancementPCA1, PCA2, PCA3Principal component analysis [45]
Vegetation indicesDVINIR-Red
RVINIR/Red
NDVI(NIR − Red)/(NIR + Red)
NDVI2(NIR − Green)/(NIR + Green)
LCI(NIR + Red)/2
SAVI1.5 × (NIR − Red)/8 × (NIR + Red + 0.5)
EVI2.5 × ((NIR − Red)/(NIR + 6 × Red − 7.5 × Blue + 1))
Textural featuresContrast (CON), Dissimilarity (DIS), Angular second moment (ASM), Entropy (ENT), Variance (VAR), Correlation (COR), Homogeneity (HOM), Mean (ME)Gray level co-occurrence matrix [46]
Note: BX refers to a specific single band within the image.
Table 3. Results of each uncertainty source at the plot scale.
Table 3. Results of each uncertainty source at the plot scale.
ModelParameter Error/%Residual Variation Error/%Plot Scale Uncertainty/%
A = 0.077 D 1.844 H 0.654 4.814.623.23
Table 4. Total uncertainty of AGB estimation based on Landsat 8 data.
Table 4. Total uncertainty of AGB estimation based on Landsat 8 data.
ModelUncertainty of Remote Sensing-Based Estimation Models/%Uncertainties at the Plot ScaleTotal Uncertainty/%
Model Parameter Errors/%Model Residual Variance/%Uncertainty at the Plot Scale/%
KNN14.884.814.623.2315.22
GBRT6.37.08
RF5.936.75
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hu, Y.; Fu, L.; Qiu, B.; Xie, D.; Wu, Z.; Lei, Y.; Ye, J.; Wang, Q. Uncertainty Analysis of Remote Sensing Estimation of Chinese Fir (Cunninghamia lanceolata) Aboveground Biomass in Southern China. Forests 2025, 16, 230. https://doi.org/10.3390/f16020230

AMA Style

Hu Y, Fu L, Qiu B, Xie D, Wu Z, Lei Y, Ye J, Wang Q. Uncertainty Analysis of Remote Sensing Estimation of Chinese Fir (Cunninghamia lanceolata) Aboveground Biomass in Southern China. Forests. 2025; 16(2):230. https://doi.org/10.3390/f16020230

Chicago/Turabian Style

Hu, Yaopeng, Liyong Fu, Bo Qiu, Dongbo Xie, Zheyuan Wu, Yuancai Lei, Jinsheng Ye, and Qiulai Wang. 2025. "Uncertainty Analysis of Remote Sensing Estimation of Chinese Fir (Cunninghamia lanceolata) Aboveground Biomass in Southern China" Forests 16, no. 2: 230. https://doi.org/10.3390/f16020230

APA Style

Hu, Y., Fu, L., Qiu, B., Xie, D., Wu, Z., Lei, Y., Ye, J., & Wang, Q. (2025). Uncertainty Analysis of Remote Sensing Estimation of Chinese Fir (Cunninghamia lanceolata) Aboveground Biomass in Southern China. Forests, 16(2), 230. https://doi.org/10.3390/f16020230

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop