Next Article in Journal
Examining Nicotinamide Application Methods in Alleviating Lead-Induced Stress in Spring Barley
Previous Article in Journal
Silicon Fertilization Improves Sunflower Rhizosphere Microbial Community Structure and Reduces Parasitism by Orobanche cumana Wallr.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integration of Unmanned Aerial Vehicle Spectral and Textural Features for Accurate Above-Ground Biomass Estimation in Cotton

1
College of Agriculture, Xinjiang Agricultural University, Urumqi 830052, China
2
Institute of Cash Crops, Xinjiang Academy of Agricultural Sciences, Urumqi 830091, China
3
Key Laboratory of Crop Physiology, Ecology and Farming in Desert Oasis, Agricultural Village Department, Urumqi 830091, China
4
Centre for Agriculture and Bioscience International (CABI), Rawalpindi 467000, Pakistan
5
Key Laboratory of Crop Phyiology and Ecology, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Beijing 100081, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Agronomy 2024, 14(6), 1313; https://doi.org/10.3390/agronomy14061313
Submission received: 11 May 2024 / Revised: 1 June 2024 / Accepted: 12 June 2024 / Published: 18 June 2024

Abstract

:
Timely and accurate estimation of Above-Ground-Biomass (AGB) in cotton is essential for precise production monitoring. The study was conducted in Shaya County, Aksu Region, Xinjiang, China. It employed an unmanned aerial vehicle (UAV) as a low-altitude monitoring platform to capture multispectral images of the cotton canopy. Subsequently, spectral features and textural features were extracted, and feature selection was conducted using Pearson’s correlation (P), Principal Component Analysis (PCA), Multivariate Stepwise Regression (MSR), and the ReliefF algorithm (RfF), combined with the machine learning algorithm to construct an estimation model of cotton AGB. The results indicate a high consistency between the mean (MEA) and the corresponding spectral bands in textural features with the AGB correlation. Moreover, spectral and textural feature fusion proved to be more stable than models utilizing single spectral features or textural features alone. Both the RfF algorithm and ANN model demonstrated optimization effects on features, and their combination effectively reduced the data redundancy while improving the model performance. The RfF-ANN-AGB model constructed based on the spectral and textural features fusion worked better, and using the features SIPI2, RESR, G_COR, and RE_DIS, exhibited the best performance, achieving a test sets R2 of 0.86, RMSE of 0.23 kg·m−2, MAE of 0.16 kg·m−2, and nRMSE of 0.39. The findings offer a comprehensive modeling strategy for the precise and rapid estimation of cotton AGB.

1. Introduction

Cotton holds a pivotal role as an economic crop in China, serving as a strategic resource that significantly influences national welfare and the economy and occupying a prominent position within the national economic framework [1]. Above-Ground-Biomass (AGB) is indispensable in delineating crop growth and yield formation, serving as a critical indicator for assessing the quality and functionality of agricultural ecosystems [2]. Understanding the dynamics of cotton AGB is essential for predicting potential yield variations and evaluating crop phenotypic adaptation to environmental changes [3]. The conventional method of sample harvesting offers precise measurements of crop biomass and yield; however, it is time-consuming, labor-intensive, and detrimental to the agricultural ecosystem, rendering it unsuitable for continuous, large-scale observations [4]. As agricultural modernization and mechanization progress rapidly, large-scale, regionalized agriculture emerges as the future development trajectory. Consequently, the rapid and precise monitoring of crop AGB through scientific means becomes imperative for the intelligent and information-based management of agricultural ecosystems.
In recent years, the rapid advancement of low-altitude remote sensing technology, facilitated by unmanned aerial vehicles (UAVs), has provided a new avenue for swiftly and accurately assessing field crop conditions. Numerous studies have predominantly utilized sensors such as RGB cameras [5,6], multispectral [7,8], LiDAR [9], and hyperspectral [10,11] to capture spectral information of farmland. LiDAR sensors primarily serve to measure crop dimensions [12], whereas RGB sensors provide limited spectral bands [13]. On the other hand, multispectral and hyperspectral imaging offers a more comprehensive spectral band coverage, crucial for effectively capturing the growth status of crops [10]. However, hyperspectral sensors tend to have slower image acquisition rates compared to multispectral sensors, and they encounter additional computational challenges during data processing and analysis. In contrast, multispectral sensors swiftly and accurately provide sufficient spectral information, making multispectral imaging technology a promising tool for agricultural remote sensing [14].
Moreover, the extraction of crop traits from spectral data relies on the established relationship between the desired traits and the derived vegetation indices (VIs). Some scholars have observed that relying solely on vegetation indexes for estimating crop agronomic traits is prone to saturation [15]. Given that image textural features carry spatial information, they serve as valuable complements to spectral features and can effectively extend the model’s capability to estimate the saturation point’s location [16]. With the advancement of remote sensing technology, numerous modeling features have been developed. While multi-source features offer rich information, issues such as covariance and data redundancy often arise among these features. Many studies have employed Pearson’s correlation coefficient method to manually downscale features and filter out highly correlated ones [17,18]. However, traits with high covariance remain a challenge as Pearson’s correlation coefficient method (P) fails to address this issue [19]. Principal component analysis (PCA) excels in selecting the most important variables for known data but struggles with unknown and nonlinear data [20]. On the other hand, techniques like Multivariate Stepwise Regression (MSR) [21] and ReliefF (RfF) [22] can address various issues such as determining the importance of the information contained in features, the weights of each feature concerning the target trait, and the feature’s divergence [23]. However, given the abundance of remote sensing features available for selection [24], the choice of feature selection algorithm for dimensionality reduction is particularly important.
Furthermore, the modeling algorithm implements a degree of dimensionality reduction on the features to mitigate model complexity, thereby enhancing model performance. Deep learning algorithms are unsupervised and construct estimation models with high accuracy. However, more parameters are required, the model is larger, and computer performance is required [25]. Kasper Johansen et al. [26] employed the random forest (RF) algorithm, utilizing morphological, image, and spectral features as input variables to develop models for tomato biomass and yield. The artificial neural network model, based on fully connected layer connections, can capture numerous hidden features from the image and output a specified number of features to mitigate covariance among them, thus enhancing model accuracy. Yuan Huanhuan et al. compared the random forest model (RF), support vector machine (SVM), artificial neural network (ANN), and partial least squares (PLS) for the estimation of leaf area index in soybean and found that the RF and ANN models performed better in terms of trait estimation [27]. Artificial neural networks and random forest regression models excel in mitigating covariance issues across different data perspectives and possess robust data fitting and prediction capabilities. Despite their advantages, there is limited research on integrating feature selection algorithms with machine learning modeling algorithms to downscale features and devise optimal modeling strategies [28].
In summary, this study utilized UAV multispectral technology to construct a cotton AGB estimation model. Mainly through the extraction of image spectral features and texture features. Screened the sensitive features of AGB using P, PCA, MSR, and RfF to explore its potential for accurate monitoring of AGB, aiming to provide reference materials for cotton growth monitoring.

2. Materials and Methods

2.1. Overview of the Study Area

The study area is located in Shaya County, Aksu Region, Xinjiang (Figure 1), characterized by a temperate continental arid climate, an average elevation of 988 m, an annual cumulative temperature of ≥10 °C of 4105 °C, an average annual sunshine of 3031.2 h, a frost-free period of 214 d. Precipitation is scarce in this area, with an average annual precipitation of 47.3 mm and an average annual evapotranspiration of about 2000.7 mm, and sub-film drip irrigation is the main high-yield cultivation technology in the area. The soil of experimental field was a sandy loam soil with an organic matter content of 9.8 g·kg−1, total nitrogen of 0.6 g·kg−2, a bulk density of 1.5 g·cm−3, and a soil pH of 8.3 [29].
The experiment spanned 2 years, commencing with a split-zone design in the first year (Figure 1). The main zone contained three horizontal irrigation levels such as W1 (3150 m3·ha−1), W2 (4050 m3·ha−1), and W3 (4950 m3·ha−1). Meanwhile, the secondary zones encompassed seven mulching treatments, including ordinary PE mulch, high-burger mulch, no mulch, and four types of degradable mulches. The cotton variety (J206-5) was sown on 15 April 2021, utilizing the “one film, three tubes, and six rows planting” method, with a plant row configuration of [(10 + 66 + 10 + 66 + 10) + 66] × 10 cm. In the second year, one degradable film cover treatment was omitted, and same cotton variety was sown on 8 April 2022, while all other experimental conditions remained consistent with the first year.

2.2. Ground Data Acquisition

Following each image acquisition, three cotton plants were randomly selected from each plot (2.28 × 9 = 20.52 m2); the whole plant was harvested above the cotyledon node and packed in paper bags. Subsequently, the samples were subjected to a temperature of 105 °C for 30 min in an oven, followed by a reduction in temperature to 80 °C for drying until a constant weight was achieved, and then the data of dry weight was recorded. After removing outliers, the mean sample mass was utilized to compute the above-ground biomass (AGB) value per unit area (kg·m−2). In the first year of the experiment, 120 samples were collected at each stage, whereas in the second year, with one less treatment, 108 samples were collected at each stage, resulting in a total of 576 samples. These samples were randomly allocated into training and test sets at a ratio of 7:3, and the outcomes are detailed in Table 1.

2.3. UAV Image Acquisition and Feature Extraction

2.3.1. Image Acquisition

In the first year of the experiment, we employed the DJI M300 RTK UAV platform (Matrice 300 Real Time Kinematic, M300RTK) produced by DJI Technology Co., Ltd. in Shenzhen, China, equipped with the U.S. company MicaSense (Seattle, WA, USA), RedEdge-MX sensors to obtain low-altitude remote sensing imagery, and the sensors contain five bands: blue, green, red, red-edge, and near-infrared. In the second year of the experiment, the DJI Phantom 4 Multispectral version of the UAV platform (Phantom 4 Multispectral, P4M), produced by DJI Technology Co., Ltd. in Shenzhen, China, was used. The sensors contain one visible and five multispectral bands. Information about the UAV platforms and sensors is shown in Table 2.
Image was taken during clear, cloudless, and windless conditions at noon (13:00–15:00). Prior to commencing image capture, the UAV underwent DLS Configuration. Additionally, both the take-off and landing points were designated at the same location. An open and level terrain was carefully chosen for operation. Within the cotton field, a calibration plate (Ground Control Point, GCP) was strategically positioned to precisely determine the field’s location within the images. Flight routes were planned, preferably aligning with the direction of mulching. The UAV maintained a flight altitude of 30 m and a speed of 2.6 m per second. A 75% overlap rate was ensured in both side and heading directions. Manually, a set of photos of the reflectance correction plate was taken before each route mission. Images were captured at equidistant intervals along consistent flight paths.

2.3.2. Image Extraction and Parsing

Data preprocessing, including stitching and radiometric calibration of multispectral images, was executed using Pix4D Mapper software (version 4.5.6, Pix4D S.A., Prilly, Switzerland). Initially, the acquired remote sensing imagery was imported into the software, along with the POS geolocation information. Subsequently, the Ag multispectral template was selected, and either an aerial grid or a corridor-type route was chosen in the advanced settings match. Point cloud and 3D mesh texture generation followed, alongside the utilization of a reflectance grayscale correction plate for radiation processing and calibration of the reflection map. Finally, five single-band reflectance grayscale maps were generated and fused into one image.
Soil background mask was performed using ENVI (version 5.3, EXELIS., Boulder, CO, USA) software. The images were imported into the software, and the Band Math tool was used to calculate the NDVI vegetation index mapping, and then create a mask with a threshold of 0.3, and then the orthophotos were masked to exclude soil backgrounds.
Spectral information was extracted in bulk using ArcGIS (version 10.6, Esri Inc., Redlands, CA, USA) software. The reflectance grayscale map, with soil background removed, was imported into the software. Subsequently, a new region of interest, Shapefile, was created, delineating the test plot boundaries. The mean reflectance value within this region of interest was then extracted, serving as the spectral information for the plot.
Spectral features (SF) encompass both raw spectral bands and vegetation indices (VI). Vegetation indices are derived from linear or nonlinear band operations applied to the acquired spectral band data. This process aims to minimize the influence of soil background values, thereby enhancing the vegetation characteristics in the image and improving the accuracy of vegetation phenology estimation. In this study, spectral information from the blue, green, red, red-edge, and near-infrared bands (labeled B, G, R, RE, and NIR, respectively) was utilized to compute multispectral vegetation indices, resulting in a total of 20 indices. The formulas for calculating these vegetation indices are provided in Table 3.
Textural features (TF) pertain to the recurring local patterns and arrangements within an image, reflecting the periodic changes in the surface structure of objects. These features encompass spatial information and possess a strong anti-noise capability. Hence, in this study, we utilize the Gray-Level Co-occurrence Matrix (GLCM) in statistical methods. The Co-occurrence Measures tool in the ENVI 5.3 software is employed to extract textural features from the five channels of the multispectral image. A 3 × 3 window with a constant gray level is selected, with a computational step size of 1. The textural features are averaged over four angles, resulting in eight textural features calculated for each channel. These features include mean (MEA), variance (VAR), Homogeneity (HOM), Contrast (CON), Dissimilarity (DIS), Entropy (ENT), Angular Second-order Moments (ASM), and correlation (COR) [41].

2.3.3. Feature Selection Algorithm

In this study, four feature selection algorithms were employed to address the issue of covariance among the feature variables derived from the computed 5 original spectral bands, 20 vegetation indexes, and 40 textural features. These selected features are subsequently utilized as input variables in the model. The feature selection algorithms employed Pearson correlation, Principal Component Analysis, multiple stepwise regression, and ReliefF algorithm.
Pearson’s correlation coefficient method (P) [17] is a linear correlation coefficient, which is the most used type of correlation coefficient. Denoted as r, it is used to reflect the degree of linear correlation between two variables, with r values ranging from −1 to 1, and the larger the value, the stronger the correlation. The essence is to measure the degree of correlation between the variables through covariance while dividing by the product of their respective standard deviations for standardization.
Principal Component Analysis (PCA) [20] is a method that combines multiple correlated variables into a small number of new uncorrelated variables through orthogonal transformations, and the new variables can maintain the original information of the data as much as possible. In this study, the PCA was configured to retain principal component information with a variance contribution greater than 5%.
The fundamental principle of multiple stepwise regression (MSR) [21] involves systematically introducing independent variables one by one. Variables that demonstrate a significant impact on the target variable are retained, while those that do not maintain significance are systematically removed from the process. This process ensures that only independent variables with a significant effect on the target variable are included, thereby excluding variables that lack significant impact.
ReliefF (RfF) [22] is an enhancement of the RelieF algorithm designed to address multi-category problems. The RelieF algorithm itself is a widely utilized filtered Feature Weighting Algorithm that adjusts feature weights based on the correlation between each feature and the target variable. This correlation is determined by the feature’s capacity to distinguish between k close samples, thus updating the weights accordingly, with the default nearest neighbor distance (Near) set to 10. Notably, the algorithm’s runtime increases linearly with the number of samples drawn and the number of original features, ensuring efficient performance.

2.4. Model Construction and Evaluation

2.4.1. Model Building

In this study, the samples were randomly divided into training and test sets for the model in 7:3, and the cotton AGB was used as the target variable, while the feature set preferred by the feature selection algorithm was used as the input variable for constructing the model. Based on the Python (3.9.13) environment, three modeling algorithms implemented in the scikit-learn (1.1.1) package were Bayesian Ridge regression model, Random Forest regression model, and Artificial Neural Network model.

2.4.2. Bayesian Ridge Regression

Bayesian Ridge Regression (BRR) [42] is a linear model in machine learning. Linear model is an algorithmic model that constructs a hypothesis function (hypothesis) by combining the characteristic attributes of the target in a linear fashion and training it. This approach offers advantages such as simplicity of form, ease of modeling, and high interpretability. However, the data may face noise and covariance problems, and ridge regression, as a regularization method, mitigates the overfitting problem by introducing a regularization parameter, λ, that constrains the regression coefficients from a Bayesian perspective. In this study, the Bayesian ridge regression algorithm and parameters tol = 1.0 × 10−6, alpha_1 and alpha_1 as 1.0 × 10−6, and lambda_1 and lambda_2 as 1.0 × 10−6 were used.

2.4.3. Random Forest Regression

Random forest regression (RFR) [26] is a nonlinear model in machine learning, an algorithm that integrates multiple decision trees through the idea of integrated learning. Based on the Bootstrap method of resampling, multiple training sets are generated and input into each decision tree sub-model using a randomly selected set of split attributes, thus reducing the model variance to improve the model prediction performance. Random forests have good resistance to noise but are prone to overfitting and can handle high-dimensional data. In this study, decision tree depth, number of iterations, and other hyperparameters of stochastic Mori Ridge regression models were fine-tuned by grid search method using 5-fold cross-validation.

2.4.4. Artificial Neural Network

Artificial neural network (ANN) [27] represents a nonlinear model derived from a linear model foundation, offering distinct advantages for addressing nonlinear problems. Particularly useful when the relationship between feature variables and the target is ambiguous, ANN models excel in extracting high-dimensional features. Artificial neural network models can be internally categorized into 3 types: input layer, hidden layer, and output layer. High-dimensional feature information is extracted by increasing the number of layers in the hidden layer and the number of neurons in a single hidden layer, which in turn improves the accuracy of the model. The important parameters of this model are the number of hidden layers and neurons, activation function type, weight optimizer type, and maximum number of iterations. Determination of optimal hyperparameters is achieved through iterative loops utilizing the grid search method with 5-fold cross-validation.

2.4.5. Evaluation of Model Accuracy

In this study, coefficient of determination (R2) was chosen to evaluate the model fitting effect, Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) were used to evaluate the model estimation error, and Normalized Root Mean Squared Error (nRMSE) was used to evaluate the efficiency of different models. A higher R2 value indicates a better fit for the model, while smaller RMSE and MAE values signify lower model error. Additionally, a smaller nRMSE value suggests enhanced model generalization ability. The formulas [43] are as follows:
R 2 = i = 1 n x i x ¯ 2 y i y ¯ 2 n i = 1 n x i x ¯ 2 n i = 1 n y i y ¯ 2
R M S E = 1 n i = 1 n y i x i 2
M A E = 1 n i = 1 n y i x i
n R M S E = R M S E i n y i y ¯ 2 / n
where n is the sample size, i is the data of the ith sample, xi is the measured value of the ith sample, x ¯ is the mean of the measured value, yi is the predicted value of the ith sample, and y ¯ is the mean of the predicted value.

3. Results

3.1. Estimation of AGB in Cotton Based on Pearson’s Correlation Analysis

3.1.1. Correlation Analysis between Spectral Features of Cotton Canopy and AGB

The cotton canopy spectral features and AGB were correlated, and these results are shown in Figure 2. All vegetation indices were positively correlated with AGB, and the blue, green, red, and red-edge spectral bands were negatively correlated with AGB. Among the spectral features with high correlation with AGB were G and RESR, with correlation coefficients of −0.66 and 0.66, respectively.
The textural features of the five original band images of the cotton canopy were calculated and correlated with the AGB, and the results are shown in Table 4. The MEA feature and the ASM feature of each channel showed a highly significant negative correlation with AGB (p < 0.001), while the DIS feature, ENT feature, and COR feature showed positive correlation with AGB, with the DIS feature and ENT feature showing as highly significant (p < 0.001). The ENT and ASM of the five channels showed highly significant positive (r = 0.25, p < 0.001) and highly significant negative (r = −0.25, p < 0.001) correlation coefficients, as did the correlation coefficients of the AGB, respectively, showing the unique canopy textural features of cotton. The MEA features of the extracted textural features based on the five channels agree with the AGB correlation coefficients, and the five original spectral features agree with the cotton AGB correlation coefficients, revealing a degree of consistency between the cotton canopy’s textural and spectral features. The textural features with higher correlation are B_DIS, G_MEA, and R_MEA, with correlation coefficients of 0.57, 0.66, and 0.56, respectively, with G_MEA demonstrating the highest correlation coefficient.

3.1.2. Model Construction of Cotton AGB Estimation Based on Pearson’s Correlation Analysis

The model was developed using the features that exhibited high correlation coefficients with AGB [2]. The first two features were used in this study, and the results are shown in Table 5. When utilizing variables with high correlation, the nonlinear models (RFR and ANN) provided better fits to the AGB compared to the linear model (BRR). The use of spectral and texture feature fusion had a lower error (RMSE of 0.28–0.35 kg·m−2, MAE of 0.19–0.25 kg·m−2) than using spectral feature or texture feature AGB models alone. Specifically, the ANN algorithm exhibited superior fitting in the test sets (R2 of 0.80). On the other hand, the ANN algorithm showed less error between the estimated and measured values (RMSE of 0.26–0.35 kg·m−2). The ANN is more accurate among the three algorithmic models (nRMSE: 0.48–0.64). Consequently, the SF-ANN-AGB model proved to be more effective. In the test set, it showed R2 of 0.82, RMSE of 0.26 kg·m−2, MAE of 0.18 kg·m−2, and nRMSE of 0.48.

3.2. Estimation of AGB in Cotton Based on Principal Component Analysis

3.2.1. Contribution of Cotton Canopy Spectral and Textural Features Based on PCA

Principal Component Analysis was performed on the feature variables, and the results are shown in Figure 3. The spectral features displayed a variance contribution of more than 5%, primarily from the first two principal components (PC1 and PC2), which accounted for 83% and 11% of the variance, respectively (Figure 3a). Conversely, individual principal components of the textural feature variables contributed less, and the first four principal components (PC1 = 58%, PC2 = 21%, PC3 = 11%, and PC4 = 5%) contributed more than 5% of the variance, respectively (Figure 3b). As for the spectral and textural feature fusion, the first four principal components had variance contributions of 57%, 21%, 9%, and 6%, respectively (Figure 3c). The PCA algorithm is more effective in extracting the main information of spectral features than textural features.

3.2.2. Model Construction of Cotton AGB Estimation Based on PCA

AGB model construction feature variable selection of the principal component components with a variance contribution greater than 5%; the model results are shown in Table 6. Nonlinear models (RFR and ANN) outperformed the linear model (BRR). Notably, the PCA algorithm exhibited greater accuracy in constructing models based on spectral features compared to textural features. Moreover, in the PCA-based feature selection algorithm, the fusion of textural features with spectral features significantly enhanced the AGB model’s performance. Considering the potential saturation effect when using only a single feature, the STF-ANN-AGB model emerged as more effective among the AGB estimation models constructed based on the PCA feature selection algorithm. In the test set, it obtained an R2 of 0.84, RMSE of 0.25 kg·m−2, MAE of 0.25 kg·m−2, and nRMSE of 0.42.

3.3. Estimation of AGB in Cotton Based on Multiple Stepwise Regression

The features were selected using a multiple stepwise regression (MSR) algorithm, wherein the feature variables underwent a series of analyses for significance and were ranked based on their association with cotton AGB. Subsequently, feature variables with the largest p-values (p > 0.05) were systematically phased out using an iterative approach. These selected feature variables were utilized as input variables for the model, and the model accuracy based on different numbers of features selected by the MSR algorithm is presented in Table 7 below. Among the selected features, there were 17 spectral features, 21 textural features, and 34 spectral and textural feature fusions, as determined by the MSR. Notably, some of the spectral features and textural features overlapped in the spectral and textural feature fusions, which are highlighted in italic and bold in Table 7, indicating strong consistency in this subset of features. The use of MSR selects more features, and the richness of information makes the model more accurate. The ANN algorithm constructs models with higher accuracy and lower error than the BRR and RFR algorithms. The SF-MSR-ANN-AGB model worked better for cotton AGB estimation with an R2 of 0.90, RMSE of 0.20 kg·m−2, MAE of 0.12 kg·m−2, and nRMSE of 0.33.

3.4. Estimation of AGB in Cotton Based on RfF Algorithm

3.4.1. Importance Analysis and Estimation of AGB in Cotton Based on RfF Algorithm

The ReliefF algorithm was employed for feature selection, assessing the importance of features regarding cotton AGB. Feature variables with low importance scores were excluded, and the feature variables were subsequently updated based on their weights relative to AGB. The results of the importance scores of the feature variables to AGB are depicted in Figure 4. Among spectral features, SIPI2 exhibited the highest importance score (0.33), followed by RESR (0.23). In contrast, textural features displayed high importance scores for G_COR (0.37) and RE_DIS (0.34). Additionally, Figure 4b reveals a notable consistency in importance scores across channel textural features, with ASM and ENT scoring lower while DIS and COR scored higher. Interestingly, the RfF algorithm assigned higher importance scores to textural features compared to spectral features.
The features, sorted according to their importance scores, are sequentially incorporated into the AGB model from highest to lowest, as depicted in Figure 5. The model accuracy results highlight the significant impact of the number of features on the AGB model’s performance: as the number of features increases, the model’s R2 increases while the RMSE and nRMSE decrease. Notably, the contribution to model accuracy primarily stems from features with high importance scores, whereas features with lower importance scores make minimal contributions and tend to increase model complexity, thereby weakening the enhancement of model accuracy. Moreover, as the number of feature variables increases, the accuracy of the AGB models constructed by the three modeling algorithms tends to stabilize. In the AGB model constructed based on textural features, it is observed that the accuracy of the model using a small number of textural features is notably low, making it challenging to estimate AGB accurately. The RfF algorithm assigns higher importance scores to textural features compared to spectral features, and AGB model features constructed based on a small number of spectral and textural features fusion primarily rely on textural features, limiting the improvement in the accuracy of AGB estimation models. However, once the fourth feature, a spectral feature (RESR), is incorporated, the performance of the AGB model improves significantly. This further underscores the importance of multi-source features in compensating for the incomplete information of individual features.

3.4.2. Model Construction of Cotton RfR-AGB Estimation

Based on the results of the above analysis, in this study, the features with the top two importance scores of spectral features (SIPI2 (2nd), RESR (1st)) and the top two importance scores of textural features (G_COR (1st), RE_DIS (2nd)) were selected to construct the AGB model separately. Similarly, the features with the top two importance scores of spectral features and textural features in the spectral and textural features fusion (SIPI2 (17th), RESR (4th), G_COR (1st), RE_DIS (2nd)) were selected to construct the model AGB model, and the results are shown in Table 8. The fusion of spectral features with textural features demonstrated enhanced model performance. Specifically, the AGB model constructed using the ANN modeling algorithm exhibited more stable performance. Notably, the SF-RfF-ANN-AGB model and STF-RfR-ANN-AGB model displayed higher accuracy (R2 of 0.86), but the STF-RfR-ANN-AGB model had lower error (RMSE of 0.23 kg·m−2, MAE of 0.16 kg·m−2 and nRMSE of 0.39).

3.5. Model Inversion for Cotton AGB Estimation Based on Optimal Modeling Strategy

Figure 6 shows the results of the cotton AGB model constructed based on spectral and textural feature fusions, a combined feature selection and modeling algorithm approach. Artificial neural network (ANN) models consistently outperformed other models in estimating cotton AGB, demonstrating high accuracy across the board. Among the models, the MSR-ANN-AGB model achieved the highest accuracy, with a test set R2 of 0.89, RMSE of 0.20 kg·m−2, MAE of 0.14 kg·m−2, and nRMSE of 0.33. Notably, the cotton AGB model constructed based on the MSR feature selection algorithm selected numerous features (17–34), leading to higher accuracy compared to other feature selection algorithms. However, the optimal modeling strategy was observed in the RfF-ANN-AGB model, which employed a smaller number of features. This strategy reduced the utilization of redundant data and improved model performance, yielding an R2 of 0.86, RMSE of 0.23 kg·m−2, MAE of 0.16 kg·m−2, and nRMSE of 0.39.

4. Discussion

4.1. Response of Cotton AGB Estimation Models to Spectral and Textural Features

The literature demonstrates that constructing models based on multimodal data fusion can enhance prediction accuracy and mitigate the saturation effect of the model [44,45]. For instance, Guo et al. achieved an RMSE of 5.77 days in estimating the date of summer maize tasseling by integrating spectral and image texture features using an improved adaptive feature weighting method [46]. Similarly, the findings of this study underscore the superiority of spectral and textural features fusion over models constructed solely with either spectral or image texture features, showcasing higher accuracy and stability while effectively delaying the saturation effect of spectral features. Furthermore, Wang et al. highlighted that combining image texture features can mitigate overestimation and underestimation phenomena observed in spectral index models alone [16]. Moreover, while the resolution of acquired images appears to have no impact on models constructed solely from spectral features [47], it does influence models constructed through spectral and textural feature fusion. This discrepancy stems from the fact that image texture features contain spatial information, making them highly sensitive to the spatial resolution of the image. Therefore, further research on spectral and textural feature fusions should consider the role of spatial resolution in image analysis.

4.2. Estimation of AGB in Cotton Based on Spectral and Textural Feature Fusions by Feature Selection Algorithm

In this study, it was found that the five original spectral features and textural features MEA corresponding to the channels exhibited agreement with the AGB correlation coefficients. This suggests that spectral features can, to some extent, characterize certain textural features, thereby highlighting the issue of covariance between features. Similarly, Jiang et al. found that models constructed after screening characteristic wavelength variables using CARS outperformed those utilizing full-wavelength, SPA, and Monte Carlo Uninformative Variable Elimination (MC-UVE) methods [48]. In this paper, four feature selection algorithms were compared and analyzed. The results indicated that feature selection based on Pearson’s correlation coefficient method failed to address the covariance problem between features, resulting in low model accuracy, with an R2 of 0.80. Conversely, employing PCA to analyze spectral and textural feature fusions and selecting the greater 5% of the principal component components improved the model’s R2 to 0.84. Moreover, the AGB model based on multiple stepwise regression retained more features, utilized the STF-MSR-ANN-AGB model strategy, and exhibited the highest accuracy with an R2 of 0.89 and an RMSE of 0.20 kg·m−2. The MSR method efficiently eliminated features insignificantly related to the AGB, thereby enhancing the model’s performance while also effectively downsizing the feature data. Additionally, it has been demonstrated that the ReliefF-RFE algorithm significantly improves plantation classification results for multi-period and single-period data compared to SVM, CART, and RF methods, achieving the highest accuracy [49]. This aligns with the results obtained using the RfF algorithm in this study. Utilizing the RfF algorithm in conjunction with the ANN model, the AGB estimation model constructed based on STF exhibited high accuracy, with an R2 of 0.86, RMSE of 0.23 kg·m−2, MAE of 0.16 kg·m−2, and nRMSE of 0.39 for the test sets.

4.3. Impact of Machine Learning Algorithms on the Accuracy of Cotton AGB Estimation Models

Enhancing the accuracy of remote sensing estimation models has been studied extensively, primarily focusing on the integration of multi-source features and various modeling algorithms. Zhang et al. conducted a comparison between support vector machine and random forest regression for constructing a winter wheat LAI estimation model, revealing that the random forest regression modeling algorithm yielded superior accuracy [50]. Similarly, the findings of this study indicate that the RFR model provides a better fit, with significantly higher accuracy in both the training and test sets compared to the linear BRR model. This improvement is likely attributed to the inherent dimensionality reduction capabilities of the RFR and ANN algorithms. Furthermore, the ANN-MLP model predicted maize yields with optimum accuracy, achieving an R2 of 0.98, RMSE of 0.23, and rRMSE of 20.44% [51]. In line with these findings, this paper suggests that utilizing the ReliefF algorithm for feature selection of spectral and textural features fusion, followed by integration with the ANN modeling algorithm, represents the optimal modeling strategy for constructing the cotton AGB estimation model.
Indeed, the feature selection algorithm employed in this study solely analyzes the spectral and textural features of multispectral images, overlooking the inclusion of high spatial resolution RGB images and high-dimensional hyperspectral data. This presents a path toward further research and exploration. We used full-fertility data to construct the cotton AGB model, and the specific availability at different fertility periods needs to be further explored. Spectral and textural feature fusion have shown promise in delaying the onset of spectral feature saturation points, but its effects on textural features of varying spatial resolutions remain unexplored. Furthermore, the impact of feature selection algorithms on the extent to which textural features delay spectral saturation points across different spatial resolutions warrants investigation. This avenue of inquiry holds the potential to advance our understanding of feature selection methods and their applicability across diverse remote sensing datasets.

5. Conclusions

The textural features MEA and the corresponding spectral features bands exhibited high consistency with AGB correlation. Models incorporating spectral and textural feature fusions demonstrated greater stability compared to models utilizing single spectral features or textural features alone. The cotton AGB estimation model (STF-RfF-ANN-AGB), constructed using the ReliefF algorithm for spectral and textural features fusion for feature selection combined with the ANN algorithm, proved to be the most effective. It showed a 7.44% increase in R2 and a 16.68% decrease in RMSE, as well as a 13.50% decrease in MAE and a 22.68% decrease in nRMSE compared to the ANN-AGB model construction (STF-P-ANN-AGB) based solely on Pearson’s correlation coefficients. The STF-RfF-ANN-AGB model achieved an R2 of 0.86, RMSE of 0.23 kg·m−2, MAE of 0.16 kg·m−2, and nRMSE of 0.39. The features used in this optimal modeling strategy were SIPI2, RESR, G_COR, and RE_DIS.
Conclusively, leveraging spectral and textural feature fusions alongside feature selection algorithms enhanced the generalization ability of cotton AGB estimation models. The RfF algorithm and ANN algorithm for constructing cotton AGB proved to be the most effective. The optimal modeling strategy was the STF-RfF-ANN-AGB model. This approach lays the strong foundation for improving remote sensing model accuracy through multi-feature fusion and feature selection algorithms, thus contributing to the advancement of digital and precision agriculture.

Author Contributions

Conceptualization, M.C., C.Y., T.L., P.J., X.J. and Q.T.; data curation, M.C., H.L. and Z.W.; funding acquisition, T.L. and Q.T.; methodology, M.C., C.Y., T.L. and X.J.; software, M.C. and Z.W.; validation, C.Y.; visualization, M.C. and H.L.; writing—original draft, M.C., C.Y. and S.A.; writing—review and editing, M.C., C.Y., T.L., P.J., S.A., X.J. and Q.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Modern Agricultural Industry Technology System—Cotton Industry Technology System (CARS-15-13), the earmarked fund for Xinjiang Agriculture Research System-03 (XJARS-03), the Xinjiang “Tianshan Talents” Training Program “Youth Top-notch Talent Project-Young Science and Technology Innovation Talents” (2023TSYCCX0019), the Xinjiang Uygur Autonomous Region Major Science and Technology Project “Research on Key Technologies of Farm Digitalization and Intelligence” Sub-project “Dynamic Prediction and Intelligent Decision-making of Field Crop Growth and Development” (2022A02011-2-1), and the Xinjiang Academy of Agricultural Sciences Agricultural Science and Technology Innovation Stability Support Project (xinkywdzc-2023007).

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Li, F.; Bai, J.Y.; Zhang, M.Y.; Zhang, R.Y. Yield estimation of high-density cotton fields using low-altitude UAV imaging and deep learning. Plant Methods 2022, 18, 55. [Google Scholar] [CrossRef] [PubMed]
  2. Meng, L.; Yin, D.M.; Cheng, M.H.; Liu, S.B.; Bai, Y.; Liu, Y.; Liu, Y.D.; Jia, X.; Nan, F.; Song, Y.; et al. Improved Crop Biomass Algorithm with Piecewise Function (iCBA-PF) for Maize Using Multi-Source UAV Data. Drones 2023, 7, 254. [Google Scholar] [CrossRef]
  3. Liu, Y.W.; Shen, X.J.; Wang, Y.J.; Zhang, J.Q.; Ma, R.; Lu, X.G.; Jiang, M. Spatiotemporal Variation in Aboveground Biomass and Its Response to Climate Change in the Marsh of Sanjiang Plain. Front. Plant Sci. 2022, 13, 920086. [Google Scholar] [CrossRef] [PubMed]
  4. Yue, J.B.; Yang, G.J.; Tian, Q.J.; Feng, H.K.; Xu, K.J.; Zhou, C.Q. Estimate of winter-wheat above-ground biomass based on UAV ultrahigh-ground-resolution image textures and vegetation indices. ISPRS J. Photogramm. Remote Sens. 2019, 150, 226–244. [Google Scholar] [CrossRef]
  5. Zhou, X.; Zheng, H.B.; Xu, X.Q.; He, J.Y.; Ge, X.K.; Yao, X.; Cheng, T.; Zhu, Y.; Cao, W.X.; Tian, Y.C. Predicting grain yield in rice using multi-temporal vegetation indices from UAV-based multispectral and digital imagery. ISPRS J. Photogramm. Remote Sens. 2017, 130, 246–255. [Google Scholar] [CrossRef]
  6. Li, B.; Xu, X.M.; Zhang, L.; Han, J.W.; Bian, C.S.; Li, G.C.; Liu, J.G.; Jin, L.P. Above-ground biomass estimation and yield prediction in potato by using UAV-based RGB and hyperspectral imaging. ISPRS J. Photogramm. Remote Sens. 2020, 162, 161–172. [Google Scholar] [CrossRef]
  7. Li, D.; Miao, Y.X.; Gupta, S.K.; Rosen, C.J.; Yuan, F.; Wang, C.Y.; Wang, L.; Huang, Y.B. Improving Potato Yield Prediction by Combining Cultivar Information and UAV Remote Sensing Data Using Machine Learning. Remote Sens. 2021, 13, 3322. [Google Scholar] [CrossRef]
  8. Hassan, M.A.; Yang, M.J.; Rasheed, A.; Yang, G.J.; Reynolds, M.; Xia, X.C.; Xiao, Y.G.; He, Z.H. A rapid monitoring of NDVI across the wheat growth cycle for grain yield prediction using a multi-spectral UAV platform. Plant Sci. 2019, 282, 95–103. [Google Scholar] [CrossRef] [PubMed]
  9. Bronson, K.F.; French, A.N.; Conley, M.M.; Barnes, E.M. Use of an ultrasonic sensor for plant height estimation in irrigated cotton. Agron. J. 2020, 113, 2175–2183. [Google Scholar] [CrossRef]
  10. Yin, C.X.; Lv, X.; Zhang, L.F.; Ma, L.L.; Wang, H.H.; Zhang, L.S.; Zhang, Z. Hyperspectral UAV Images at Different Altitudes for Monitoring the Leaf Nitrogen Content in Cotton Crops. Remote Sens. 2022, 14, 2576. [Google Scholar] [CrossRef]
  11. Zhou, X.T.; Yang, M.; Chen, X.Y.; Ma, L.L.; Yin, C.X.; Qin, S.Z.; Wang, L.; Lv, X.; Zhang, Z. Estimation of Cotton Nitrogen Content Based on Multi-Angle Hyperspectral Data and Machine Learning Models. Remote Sens. 2023, 15, 955. [Google Scholar] [CrossRef]
  12. Li, M.H.; Shamshiri, R.R.; Schirrmann, M.; Weltzien, C.; Shafian, S.; Laursen, S. UAV Oblique Imagery with an Adaptive Micro-Terrain Model for Estimation of Leaf Area Index and Height of Maize Canopy from 3D Point Clouds. Remote Sens. 2022, 14, 585. [Google Scholar] [CrossRef]
  13. Javier, H.C.; Edgar, O.M.; Roger, M.; Olmán, M. Evaluation of Unmanned Aerial Vehicles (UAV) as a Tool to Predict Biomass and Carbon of Tectona grandis in Silvopastoral Systems (SPS) in Costa Rica. Drones 2021, 5, 47. [Google Scholar] [CrossRef]
  14. Bascon, M.V.; Nakata, T.; Shibata, S.; Takata, I.; Kobayashi, N.; Kato, Y.; Inoue, S.; Doi, K.; Murase, J.; Nishuchi, S. Estimating Yield-Related Traits Using UAV-Derived Multispectral Images to Improve Rice Grain Yield Prediction. Agriculture 2022, 12, 1141. [Google Scholar] [CrossRef]
  15. Liu, Y.N.; Liu, S.S.; Li, L.; Gou, X.Y.; Wang, S.Q.; Lu, J.W. Estimating biomass of winter oilseed rape using vegetation indices and texture metrics derived from UAV multispectral images. Comput. Electron. Agric. 2019, 166, 105026. [Google Scholar] [CrossRef]
  16. Wang, F.M.; Yi, Q.X.; Hu, J.H.; Xie, L.L.; Yao, X.P.; Xu, T.Y.; Zheng, J.Y. Combining spectral and textural information in UAV hyperspectral images to estimate rice grain yield. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102397. [Google Scholar] [CrossRef]
  17. Bao, L.; Li, X.; Yu, J.X.; Li, G.S.; Chang, X.Y.; Yu, L.X.; Li, Y. Forecasting spring maize yield using vegetation indices and crop phenology metrics from UAV observations. Food Energy Secur. 2023, 13, e505. [Google Scholar] [CrossRef]
  18. Su, X.; Nian, Y.; Yue, H.; Zhu, Y.; Li, J.; Wang, W.; Sheng, Y.; Ma, Q.; Liu, J.; Wang, W.; et al. Improving Wheat Leaf Nitrogen Concentration (LNC) Estimation across Multiple Growth Stages Using Feature Combination Indices (FCIs) from UAV Multispectral Imagery. Agronomy 2024, 14, 1052. [Google Scholar] [CrossRef]
  19. Yin, Q.; Zhang, Y.T.; Li, W.L.; Wang, J.J.; Wang, W.L.; Ajmad, I.; Zhou, G.S.; Huo, Z.Y. Estimation of Winter Wheat SPAD Values Based on UAV Multispectral Remote Sensing. Remote Sens. 2023, 15, 3595. [Google Scholar] [CrossRef]
  20. Mosleth, E.F.; Wan, Y.F.; Lysenko, A.; Chope, G.A.; Pemson, S.P.; Shewry, P.R.; Hawkesford, M.J. A novel approach to identify genes that determine grain protein deviation in cereals. Plant Biotechnol. J. 2015, 13, 625–635. [Google Scholar] [CrossRef]
  21. Zhu, X.F.; Hou, C.Y.; Xu, K.; Liu, Y. Establishment of agricultural drought loss models: A comparison of statistical methods. Ecol. Indic. 2020, 112, 106084. [Google Scholar] [CrossRef]
  22. Lapajne, J.; Knapič, M.; Žibrat, U. Comparison of Selected Dimensionality Reduction Methods for Detection of Root-Knot Nematode Infestations in Potato Tubers Using Hyperspectral Imaging. Sensors 2022, 22, 367. [Google Scholar] [CrossRef]
  23. Liu, Y.; Pan, K.; Liu, Z.; Dai, Y.; Duan, X.; Wang, M.; Shen, Q. Simultaneous Determination of Four Catechins in Black Tea via NIR Spectroscopy and Feature Wavelength Selection: A Novel Approach. Sensors 2024, 24, 3362. [Google Scholar] [CrossRef]
  24. Karthigesu, J.; Owari, T.; Tsuyuki, S.; Hiroshima, T. Improving the Estimation of Structural Parameters of a Mixed Conifer–Broadleaf Forest Using Structural, Textural, and Spectral Metrics Derived from Unmanned Aerial Vehicle Red Green Blue (RGB) Imagery. Remote Sens. 2024, 16, 1783. [Google Scholar] [CrossRef]
  25. Chen, D.W.; Hu, F.; Nian, G.K.; Yang, T.T. Deep Residual Learning for Nonlinear Regression. Entropy 2020, 22, 193. [Google Scholar] [CrossRef] [PubMed]
  26. Kasper, J.; Mitchell, J.L.M.; Yoann, M.; Bruno, A.; Samer, A.; Matteo, G.Z.; Yoseline, A.; Gabriele, F.; Sónia, N.; Magdi, A.A.M.; et al. Predicting Biomass and Yield in a Tomato Phenotyping Experiment Using UAV Imagery and Random Forest. Front. Artif. Intell. 2020, 3, 28. [Google Scholar] [CrossRef]
  27. Yuan, H.H.; Yang, G.J.; Li, C.C.; Wang, Y.J.; Liu, J.G.; Yu, H.Y.; Feng, H.K.; Xu, B.; Zhao, X.Q.; Yang, X.D. Retrieving Soybean Leaf Area Index from Unmanned Aerial Vehicle Hyperspectral Remote Sensing: Analysis of RF, ANN, and SVM Regression Models. Remote Sens. 2017, 9, 309. [Google Scholar] [CrossRef]
  28. Zhang, C.; Yi, Y.; Wang, L.; Zhang, X.; Chen, S.; Su, Z.; Zhang, S.; Xue, Y. Estimation of the Bio-Parameters of Winter Wheat by Combining Feature Selection with Machine Learning Using Multi-Temporal Unmanned Aerial Vehicle Multispectral Images. Remote Sens. 2024, 16, 469. [Google Scholar] [CrossRef]
  29. Chen, M.G.; Lin, T.; Zhang, H.; Liu, H.J.; Wang, Y.F.; Tang, Q.X. Effects of mulch film types on cotton growth and analysisof self-degradation recycling characteristics. Xinjiang Agric. Sci. 2023, 60, 2101–2108. [Google Scholar] [CrossRef]
  30. Hernandez, A.; Jensen, K.; Larson, S.; Larsen, R.; Rigby, C.; Johnson, B.; Spickermann, C.; Sinton, S. Using Unmanned Aerial Vehicles and Multispectral Sensors to Model Forage Yield for Grasses of Semiarid Landscapes. Grasses 2024, 3, 84–109. [Google Scholar] [CrossRef]
  31. Niu, Q.L.; Feng, H.K.; Zhou, X.G.; Zhu, J.Q.; Yong, B.B.; Li, H.Z. Combining UAV Visible Light and Multispectral Vegetation Indices forEstimating SPAD Value of Winter Whea. Trans. Chin. Soc. Agric. Mach. 2021, 52, 183–194. [Google Scholar] [CrossRef]
  32. Oteng-Frimpong, R.; Karikari, B.; Sie, E.K.; Kassim, Y.B.; Puozaa, D.K.; Rasheed, M.A.; Fonceka, D.; Okello, D.K.; Balota, M.; Burow, M.; et al. Multi-locus genome-wide association studies reveal genomic regions and putative candidate genes associated with leaf spot diseases in African groundnut (Arachis hypogaea L.) germplasm. Front. Plant Sci. 2023, 13, 1076744. [Google Scholar] [CrossRef] [PubMed]
  33. Yang, Q.; Shi, L.S.; Han, J.Y.; Zha, Y.Y.; Zhu, P.H. Deep convolutional neural networks for rice grain yield estimation at the ripening stage using UAV-based remotely sensed images. Field Crop. Res. 2019, 235, 142–153. [Google Scholar] [CrossRef]
  34. Sadeed, H. Application and Assessment of UAV on-Board Multispectral Sensor for Non-Destructive Site-Specific Rapeseed Crop (Brassica napus L.) Phenotype Variable and Weed Discrimination at Early Phenological Stage; Huazhong Agricultural University: Wuhan, China, 2021. [Google Scholar]
  35. Yang, Z.; Yu, Z.; Wang, X.; Yan, W.; Sun, S.; Feng, M.; Sun, J.; Su, P.; Sun, X.; Wang, Z.; et al. Estimation of Millet Aboveground Biomass Utilizing Multi-Source UAV Image Feature Fusion. Agronomy 2024, 14, 701. [Google Scholar] [CrossRef]
  36. Killeen, P.; Kiringa, I.; Yeap, T.; Branco, P. Corn Grain Yield Prediction Using UAV-Based High Spatiotemporal Resolution Imagery, Machine Learning, and Spatial Cross-Validation. Remote Sens. 2024, 16, 683. [Google Scholar] [CrossRef]
  37. Shen, Y.; Yan, Z.; Yang, Y.; Tang, W.; Sun, J.; Zhang, Y. Application of UAV-Borne Visible-Infared Pushbroom Imaging Hyperspectral for Rice Yield Estimation Using Feature Selection Regression Methods. Sustainability 2024, 16, 632. [Google Scholar] [CrossRef]
  38. Rozenstein, O.; Haymann, N.; Kaplan, G.; Tanny, J. Validation of the cotton crop coefficient estimation model based on Sentinel-2 imagery and eddy covariance measurements. Agric. Water Manag. 2019, 223, 105715. [Google Scholar] [CrossRef]
  39. Shammi, S.A.; Huang, Y.; Feng, G.; Tewolde, H.; Zhang, X.; Jenkins, J.; Shankle, M. Application of UAV Multispectral Imaging to Monitor Soybean Growth with Yield Prediction through Machine Learning. Agronomy 2024, 14, 672. [Google Scholar] [CrossRef]
  40. Liu, S.B.; Jin, X.L.; Bai, Y.; Wu, W.B.; Cui, N.B.; Cheng, M.H.; Liu, Y.D.; Meng, L.; Jia, X.; Nie, C.W.; et al. UAV multispectral images for accurate estimation of the maize LAI considering the effect of soil background. Int. J. Appl. Earth Obs. Geoinf. 2023, 121, 103383. [Google Scholar] [CrossRef]
  41. Mryka, H.B. Practical guidelines for choosing GLCM textures to use in landscape classification tasks over a range of moderate spatial scales. Int. J. Remote Sens. 2017, 38, 1312–1338. [Google Scholar] [CrossRef]
  42. Abelardo, M.L.; Osval, A.M.L.; Gustavo, D.L.C.; José, C.; Juan, B.; Francisco, J.L.V. Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture. Plant Methods 2018, 14, 46. [Google Scholar] [CrossRef] [PubMed]
  43. Jin, X.L.; Li, Z.H.; Feng, H.K.; Ren, Z.B.; Li, S.K. Deep neural network algorithm for estimating maize biomass based on simulated Sentinel 2A vegetation indices and leaf area index. Crop. J. 2020, 8, 87–97. [Google Scholar] [CrossRef]
  44. Mia, S.M.; Ryoya, T.; Luthfan, N.H.; Naoyuki, H.; Koki, H.; Masayasu, M.; Tsutomu, M.; Takashi, S.T.T. Multimodal Deep Learning for Rice Yield Prediction Using UAV-Based Multispectral Imagery and Weather Data. Remote Sens. 2023, 15, 2511. [CrossRef]
  45. Maitiniyazi, M.; Vasit, S.; Paheding, S.; Sean, H.; Flavio, E.; Felix, B.F. Soybean yield prediction from UAV using multimodal data fusion and deep learning. Remote Sens. Environ. 2020, 237, 111599. [Google Scholar] [CrossRef]
  46. Guo, Y.H.; Fu, Y.S.H.; Chen, S.Z.; Bryant, C.R.; Li, X.X.; Senthilnath, J.; Sun, H.Y.; Wang, S.X.; Wu, Z.F.; Beurs, K. Integrating spectral and textural information for identifying the tasseling date of summer maize using UAV based RGB images. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102435. [Google Scholar] [CrossRef]
  47. Francisco, A.V.; Fernando, C.R.; Saiz, M.P.; Francisco, O.R. Multi-temporal Imaging Using an Unmanned Aerial Vehicle for Monitoring a Sunflower Crop. Biosyst. Eng. 2015, 132, 19–27. [Google Scholar]
  48. Jiang, X.Y.; Li, F.S.; Wang, Q.Y.; Luo, J.; Hao, J.; Xu, M.Q. Determination of Lead and Arsenic in Soil Samples by X Fluorescence Spectrum Combined with CARS Variables Screening Method. Spectrosc. Spectr. Anal. 2022, 42, 1535–1540. [Google Scholar]
  49. Lin, X.Q.; Ren, C.; Li, Y.; Yue, W.T.; Liang, J.Y.; Yin, A.C. Eucalyptus Plantation Area Extraction Based on SLPSO-RFE Feature Selection and Multi-Temporal Sentinel-1/2 Data. Forests 2023, 14, 1864. [Google Scholar] [CrossRef]
  50. Zhang, D.Y.; Han, X.X.; Lin, F.F.; Du, S.Z.; Zhang, G.; Hong, Q. Estimation of winter wheat leaf area index using multi-source UAV image feature fusion. Trans. Chin. Soc. Agric. Eng. 2022, 38, 171–179. [Google Scholar] [CrossRef]
  51. Harsányi, E.; Bashir, B.; Arshad, S.; Ocwa, A.; Vad, A.; Alsalman, A.; Bácskai, I.; Rátonyi, T.; Hijazi, O.; Széles, A.; et al. Data Mining and Machine Learning Algorithms for Optimizing Maize Yield Forecasting in Central Europe. Agronomy 2023, 13, 1297. [Google Scholar] [CrossRef]
Figure 1. Study area and plot layout.
Figure 1. Study area and plot layout.
Agronomy 14 01313 g001
Figure 2. Correlation analysis between AGB and canopy spectral features.
Figure 2. Correlation analysis between AGB and canopy spectral features.
Agronomy 14 01313 g002
Figure 3. PCA based on spectral and textural features. (a) Spectral features. (b) Textural features. (c) Spectral and textural feature fusion.
Figure 3. PCA based on spectral and textural features. (a) Spectral features. (b) Textural features. (c) Spectral and textural feature fusion.
Agronomy 14 01313 g003
Figure 4. Importance analysis of spectral and textural features with cotton AGB. (a) Spectral features importance scores. (b) Textural features importance scores.
Figure 4. Importance analysis of spectral and textural features with cotton AGB. (a) Spectral features importance scores. (b) Textural features importance scores.
Agronomy 14 01313 g004
Figure 5. Effect of feature numbers on AGB model based on RfF algorithm. (a) AGB model R2 based on SF. (b) AGB model R2 based on TF. (c) AGB model R2 based on STF. (d) AGB model RMSE based on SF. (e) AGB model RMSE based on TF. (f) AGB model RMSE based on STF. (g) AGB model nRMSE based on SF. (h) AGB model nRMSE based on TF. (i) AGB model nRMSE based on STF.
Figure 5. Effect of feature numbers on AGB model based on RfF algorithm. (a) AGB model R2 based on SF. (b) AGB model R2 based on TF. (c) AGB model R2 based on STF. (d) AGB model RMSE based on SF. (e) AGB model RMSE based on TF. (f) AGB model RMSE based on STF. (g) AGB model nRMSE based on SF. (h) AGB model nRMSE based on TF. (i) AGB model nRMSE based on STF.
Agronomy 14 01313 g005
Figure 6. Evaluation of AGB of cotton model based on optimal modeling strategy based on spectral and textural features fusion. (a) P-BRR-AGB model. (b) PCA-BRR-AGB model. (c) MSR-BRR-AGB model. (d) RfF-BRR-AGB model. (e) P-RFR-AGB model. (f) PCA-RFR-AGB model. (g) MSR-RFR-AGB model. (h) RfF-RFR-AGB model. (i) P-ANN-AGB model. (j) PCA-ANN-AGB model. (k) MSR-ANN-AGB model. (l) RfF-ANN-AGB model.
Figure 6. Evaluation of AGB of cotton model based on optimal modeling strategy based on spectral and textural features fusion. (a) P-BRR-AGB model. (b) PCA-BRR-AGB model. (c) MSR-BRR-AGB model. (d) RfF-BRR-AGB model. (e) P-RFR-AGB model. (f) PCA-RFR-AGB model. (g) MSR-RFR-AGB model. (h) RfF-RFR-AGB model. (i) P-ANN-AGB model. (j) PCA-ANN-AGB model. (k) MSR-ANN-AGB model. (l) RfF-ANN-AGB model.
Agronomy 14 01313 g006
Table 1. Above ground biomass real measurement dataset in cotton.
Table 1. Above ground biomass real measurement dataset in cotton.
Data SetsSample SizeMean
(kg·m−2)
Maximum
(kg·m−2)
Minimum
(kg·m−2)
Standard Deviation
(kg·m−2)
VarianceCoefficient of Variation
Training Sets4031.122.870.190.610.3754%
Test Sets1731.053.610.240.620.3960%
Table 2. UAV platform-specific information.
Table 2. UAV platform-specific information.
Date of Data AcquisitionUnmanned Aircraft PlatformsSensorSpectral Region (nm)Flight Altitude (m)Spacing Resolution (m)Longitudinal and Lateral Overlap Rate
5 July 2021M300RTKRedEdge-MXBlue: 475 ± 20300.0211775% + 75%
5 August 2021M300RTKGreen: 560 ± 20300.0211775% + 75%
22 August 2021M300RTKRed: 668 ± 10300.0211775% + 75%
RedEdge: 717 ± 10
NIR: 840 ± 40
29 June 2022P4M Blue: 450 ± 16300.0209175% + 75%
Green: 560 ± 16
7 August 2022P4MRed: 650 ± 16300.0209175% + 75%
RedEdge: 730 ± 16
NIR: 840 ± 26
Table 3. Spectral features calculation formula.
Table 3. Spectral features calculation formula.
NumberSpectral FeaturesAbbreviationFormulaReference
1Difference Vegetation IndexDVI N I R R [30]
2Green-difference Vegetation Index (g)GDVI N I R G [31]
3Normalized Difference Vegetation IndexNDVI N I R R / N I R + R [32]
4Green-normalized Difference Vegetation IndexGNDVI N I R G / N I R + G [33]
5Blue-normalized Difference Vegetation IndexBNDVI N I R B / N I R + B [34]
6Ration Vegetation IndexRVI N I R / R [35]
7Green-Ration Vegetation IndexGRVI N I R / G [35]
8Structurally Insensitive Pigment IndexSIPI2 N I R G / N I R R [28]
9Source Address Validation ImprovementSAVI 1 + L N I R R / N I R + R + L [36]
10Optimization of Soil Regulatory Vegetation IndexOSAVI 1.16 × N I R R / N I R + R + 0.16 [36]
11Green-Optimization of Soil Regulatory Vegetation IndexGOSAVI 1.16 × N I R G / N I R + G + 0.16 [31]
12Green Chlorophyll IndexCIgreen N I R / G 1 [37]
13RedEdge Simple Ratio Vegetation IndexRESR R E / R [38]
14Enhanced Vegetation IndexEVI 2.5 × N I R R / N I R + 6 × R 7.5 × B + 1 [37]
15Non-Linear IndexNLI N I R 2 R / N I R 2 + R [31]
16Atmospheric Resistance Vegetation IndexARVI N I R 2 × R + B / N I R + B [39]
17Transformed Difference Vegetation IndexTVI 0.5 + N I R R / N I R + R 2 [37]
18Green-Renormalized Difference Vegetation IndexGRDVI N I R G / N I R + G 2 [31]
19Modified Simple Ration IndexMSR N I R / R 1 / N I R / R + 1 2 [40]
20Green-Modified Simple Ration IndexGMSR N I R / G 1 / N I R / G + 1 2 [31]
L: the soil adjustment coefficient, which is taken as 0.5 here.
Table 4. Correlation analysis between AGB and textural features of cotton.
Table 4. Correlation analysis between AGB and textural features of cotton.
Channel BrChannel GrChannel RrChannel NIRrChannel REr
B_MEA−0.20 ***G_MEA−0.66 ***R_MEA−0.56 ***NIR_MEA0.26 ***RE_MEA−0.22 ***
B_VAR0.06G_VAR−0.20 ***R_VAR−0.20 ***NIR_VAR0.10 *RE_VAR0.01
B_HOM−0.08G_HOM0.16 ***R_HOM0.19 ***NIR_HOM−0.28 ***RE_HOM−0.11 **
B_CON0.07G_CON−0.16 ***R_CON−0.20 ***NIR_CON0.25 ***RE_CON0.10 *
B_DIS0.57 ***G_DIS0.03R_DIS0.11 **NIR_DIS0.43 ***RE_DIS0.33 ***
B_ENT0.25 ***G_ENT0.25 ***R_ENT0.25 ***NIR_ENT0.25 ***RE_ENT0.25 ***
B_ASM−0.25 ***G_ASM−0.25 ***R_ASM−0.25 ***NIR_ASM−0.25 ***RE_ASM−0.25 ***
B_COR0.15 ***G_COR0.07R_COR0.20 ***NIR_COR0.01RE_COR0.04
***: p < 0.001, **: p < 0.01, and *: p < 0.05.
Table 5. AGB estimation model for cotton based on Pearson’s correlation.
Table 5. AGB estimation model for cotton based on Pearson’s correlation.
FeaturesMethodFeature NumbersFeature VariablesTest Sets
R2RMSE (kg·m−2)MAE (kg·m−2)nRMSE
SFBRR2G, RESR0.560.410.320.93
RFR0.780.290.200.50
ANN0.820.260.180.48
TFBRR2B_DIS, G_MEA0.530.430.330.99
RFR0.670.360.240.63
ANN0.680.350.240.64
STFBRR4G, RESR, B_DIS, G_MEA0.690.350.250.71
RFR0.780.290.190.50
ANN0.800.280.190.50
Bolding indicates optimal results.
Table 6. Accuracy of AGB estimation model based on PCA feature selection algorithm.
Table 6. Accuracy of AGB estimation model based on PCA feature selection algorithm.
FeaturesMethodpc NumbersTest Sets
R2RMSE (kg·m−2)MAE (kg·m−2)nRMSE
SFBRR20.510.440.341.00
RFR0.840.250.150.42
ANN0.860.230.150.38
TFBRR40.570.410.310.92
RFR0.690.350.230.66
ANN0.620.390.270.74
STFBRR40.640.370.270.79
RFR0.810.270.180.48
ANN0.840.250.250.42
Bolding indicates optimal results.
Table 7. Accuracy of AGB estimation model based on MSR algorithm.
Table 7. Accuracy of AGB estimation model based on MSR algorithm.
FeaturesMethodFeatures NumbersFeatures VariablesTest Sets
R2RMSE (kg·m−2)MAE (kg·m−2)nRMSE
SFBRR17B, G, RE, R, DVI, NDVI, BNDVI, SIPI2, RVI, CIgreen, GOSAVI, RESR, SAVI, OSAVI, GRDVI, GMSR, MSR0.870.220.150.37
RFR0.880.220.130.37
ANN0.900.200.120.33
TFBRR21B_MEA, B_DIS, G_VAR, G_CON, G_COR, NIR_MEA, NIR_VAR, NIR_HOM, NIR_DIS, NIR_ENT, NIR_ASM, RE_MEA, RE_VAR, RE_CON, RE_DIS, RE_COR, R_MEA, R_VAR, R_HOM, R_ENT, R_ASM0.820.260.170.46
RFR0.870.230.140.39
ANN0.890.200.130.33
STFBRR34GDVI, NDVI, GNDVI, BNDVI, SIPI2, RVI, CIgreen, GOSAVI, EVI, OSAVI, NLI, GRDVI, GMSR, MSR, B_VAR, B_DIS, G_MEA, G_VAR, G_HOM, G_CON, G_DIS, G_COR, NIR_HOM, NIR_DIS, NIR_ENT, NIR_ASM, RE_VAR, RE_HOM, RE_CON, RE_DIS, RE_COR, R_CON, R_ENT, R_ASM0.870.230.150.38
RFR0.860.230.120.40
ANN0.890.200.140.33
Italics and bolding indicate repeated features. Bolding indicates optimal results.
Table 8. Accuracy of AGB estimation model based on RfF feature selection algorithm.
Table 8. Accuracy of AGB estimation model based on RfF feature selection algorithm.
FeaturesMethodFeature NumbersFeature VariablesTest Sets
R2RMSE (kg·m−2)MAE (kg·m−2)nRMSE
SFBRR2SIPI2, RESR0.510.430.330.96
RFR0.820.260.190.47
ANN0.860.240.170.41
TFBRR2G_COR, RE_DIS0.080.600.512.77
RFR0.260.540.411.44
ANN0.130.580.482.40
STFBRR4SIPI2, RESR, G_COR, RE_DIS0.610.390.290.82
RFR0.830.250.170.47
ANN0.860.230.160.39
Bolding indicates optimal results.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, M.; Yin, C.; Lin, T.; Liu, H.; Wang, Z.; Jiang, P.; Ali, S.; Tang, Q.; Jin, X. Integration of Unmanned Aerial Vehicle Spectral and Textural Features for Accurate Above-Ground Biomass Estimation in Cotton. Agronomy 2024, 14, 1313. https://doi.org/10.3390/agronomy14061313

AMA Style

Chen M, Yin C, Lin T, Liu H, Wang Z, Jiang P, Ali S, Tang Q, Jin X. Integration of Unmanned Aerial Vehicle Spectral and Textural Features for Accurate Above-Ground Biomass Estimation in Cotton. Agronomy. 2024; 14(6):1313. https://doi.org/10.3390/agronomy14061313

Chicago/Turabian Style

Chen, Maoguang, Caixia Yin, Tao Lin, Haijun Liu, Zhenyang Wang, Pingan Jiang, Saif Ali, Qiuxiang Tang, and Xiuliang Jin. 2024. "Integration of Unmanned Aerial Vehicle Spectral and Textural Features for Accurate Above-Ground Biomass Estimation in Cotton" Agronomy 14, no. 6: 1313. https://doi.org/10.3390/agronomy14061313

APA Style

Chen, M., Yin, C., Lin, T., Liu, H., Wang, Z., Jiang, P., Ali, S., Tang, Q., & Jin, X. (2024). Integration of Unmanned Aerial Vehicle Spectral and Textural Features for Accurate Above-Ground Biomass Estimation in Cotton. Agronomy, 14(6), 1313. https://doi.org/10.3390/agronomy14061313

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop