1. Introduction
Known for its high organic matter and fertility, black soil is a non-renewable resource that plays a very important role in agricultural development [
1]. In order to curb the degradation of black soil and restore and enhance ground strength, conservation tillage has become a world-recognized sustainable production method in agriculture; it is an important part of modern eco-agriculture and plays an important role in environmental protection [
2]. Conservation tillage techniques centered on no- and minimum-tillage plus crop straw mulching have become important measures to improve soil degradation on farmland and increase soil organic matter content and moisture retention capacity. Straw mulching has greatly improved the degree of protection of black soil, facilitated the function of carbon metabolism in no-till soil microbial communities, and increased crop yields. Conservation tillage requires that the straw mulch rate be not less than 0.3 [
3]. Straw mulch quantity is also an important indicator for determining the level of straw mulching. Straw mulch rate refers to the ratio of the area covered by crop straw or stubble on the surface soil to the total surface area; straw mulch quantity refers to the quantity of crop straw or stubble covered per unit area of surface soil. Our team is committed to the study of straw mulch detection, and we have obtained good results in the detection of straw mulch [
4,
5,
6]. However, it was found that the straw mulch rate can only accomplish the detection of “with” and “without” straw mulch and not the detection of “more” and “less” straw mulch. Liu et al. [
7] suggested that different straw mulch quantities have different effects on soil improvement and that a straw return quantity of about 7500 kg/hm
2 had the best effect on soil improvement. Therefore, the accurate detection of straw mulch quantity in the field is an important judgment index to measure the level of straw mulching and also reflects the quality and effect of straw return to the field [
8]. However, in the detection of straw mulch quantity, there are problems with cumbersome work tasks, diversified influencing factors, and huge measurement errors, and there is no automated measurement method of straw mulch quantity on a large scale yet.
In recent years, remote-sensing technology has been developed with the advantage of being available on temporal and spatial scales, allowing for rapid and accurate large-scale feature detection [
9]. Aerial multispectral technology has potential in remote sensing research, which combines the integrated features of UAV monitoring and spectral remote sensing and describes external characterization information more accurately while acquiring rich information [
10]. The input–output ratio of UAV multi-spectral cameras is higher, so they have a short data acquisition cycle and are more flexible in operation and suitable for multiple scenarios. Huang et al. [
11], Daughtry et al. [
12], Daughtry et al. [
13], and Burud et al. [
14] proposed detecting crops in farmland through remote sensing data and achieved good results. Kavoosi et al. [
15] compared the feasibility of satellite and drone images in monitoring straw mulch, and the results showed that drones have certain advantages in straw mulch detection. The mapping relationship established between spectral data and straw dry matter content is one of the simplest methods used to obtain the straw mulch quantity. Yang et al. [
16] used a machine learning algorithm combined with a vegetation index approach to estimate the leaf dry matter content. Liu et al. [
17] used PCA transform to reconstruct leaf reflectance spectra and retrieve the leaf biochemical composition, which included the leaf mass per area. Machine learning techniques have the advantage of solving nonlinear tasks and have been widely applied to the remote sensing of biochemical composition in recent years. Meanwhile, in the application of remote-sensing technology, machine learning algorithms are considered to be reliable and efficient methods for inverting models; they can greatly improve processing speeds and analysis accuracy [
18]. Fu et al. [
19], Han et al. [
20], Ban et al. [
21], and Nasi et al. [
22] used machine learning to construct an inversion model for crop performance evaluation under different conditions related to, for example, crop yield estimation and soil salinity. Although machine learning has reached a certain degree of accuracy, the prediction performance is greatly affected by the model parameters [
23]. The Meta-Heuristic Algorithm can help models be less likely to fall into a local optimum during the search process and find the global optimal solution with great probability, which is a commonly used optimization method [
24]. Zhu et al. [
25] and Mo et al. [
26] proposed that the hybrid algorithm is superior and can address the shortcomings of a single optimization algorithm well, allowing the model to be further optimized in terms of convergence speed and computational accuracy.
In practical application scenarios, we cannot accurately estimate the global straw mulch quantity using only the inversion model because we can only obtain local samples at each site and are limited by the sampling environment and data quantity. The Monte Carlo method [
27], as a flexible and efficient numerical solution, has become a key tool in the field of scientific computing and engineering and plays a crucial role in solving practical problems. The idea of the Monte Carlo method addresses the problem of estimating overall sample distribution by random sampling, which is precisely the problem we wanted to solve. It can approximate the global straw mulch quantity by using local samples [
28]. Therefore, to complete the transformation from “with” and “without” to “more” and “less” of the global field straw mulch and return effect, the establishment of an efficient and accurate regional straw mulch quantity estimation model is the most important task.
Aiming at the above problems, this study adopts a UAV carrying a multispectral camera to acquire corn straw mulch remote sensing images of the study areas and simultaneously determines the data of straw mulch quantity at the sampling points. Based on the RF inversion model, we proposed the hybrid GA-HPSO algorithm to improve the performance of the inversion model and optimize the model to obtain the sample data results. The Monte Carlo idea is used to estimate the global straw mulch quantity and complete the detection of straw return to the field, which provides important modern information technology support to effectively solve the hidden dangers and problems of conservation tillage, such as “difficult to verify the area, difficult to monitor the quality, and high risk of subsidy fund issuance”.
2. Materials and Methods
2.1. Overview of the Study Area
To facilitate a comprehensive discussion on straw mulch quantity estimation, the study area was divided into experimental and validation segments. The experimental areas were primarily designed to assess the feasibility of the optimal inversion model and estimation methods, while the validation areas aimed to demonstrate the practical applicability of these methodologies. The study site was located in Changchun City, Jilin Province, China, situated in the temperate semi-humid continental monsoon climate zone, characterized by an average annual temperature of 5.5 °C, average annual precipitation of 558 mm, and an average elevation of approximately 220 m. Jilin Province has actively promoted conservation tillage practices involving straw mulch, with widespread adoption of full straw mulch applications. Details of the study area selection are presented in
Table 1.
The experimental fields across the eight study areas employed crop rotation and wide–narrow row planting patterns, predominantly utilizing the Udi 598 maize hybrid (Hongxiang Seed Industry Company Limited, Jilin, China). Spring corn is typically sown in early May and harvested in early October. The terrain within the study area is relatively flat, with rows fully covered by returned straw post-crushing, as illustrated in
Figure 1. The return-row plowing method allows for substantial or complete straw return to the field, thereby reducing soil erosion from wind and water while enhancing soil fertility [
29]. This setting provides an optimal environment for conducting maize straw mulch detection research.
2.2. Data Acquisition and Analysis
2.2.1. Determination of Straw Mulch Quantity
To provide a better understanding of the study areas, the weather information under sunny skies, including wind direction, wind speed, and temperature, was collected and shown in
Table 2. The experimental data were collected after harvesting in the autumn but before planting in the spring of the second year concerning the timeliness and diversity of the data. Meanwhile, the locations and number of sampling points were determined based on the size of the area. The corn straw mulch quantity parameters were measured at the sampling points.
We defined each sampling unit as approximately 2500 m
2, randomly selecting six 0.5 m × 0.5 m squares within each unit as sampling points. After measuring straw mulch quantity within these points, we assessed thickness at five designated corners and the center for error verification, utilizing an electronic scale (Hengzi KC-833, accuracy: 0.001 kg). A total of 408 sampling points were collected during the experimental phase and 258 during the validation phase, with the sampling process illustrated in
Figure 2. For example, in Study Area 1, the area was divided into nine sampling units, each containing six randomly distributed sampling points. The UAV captured multispectral images above these points to measure straw mulch quantity.
2.2.2. UAV Remote Sensing Data
The DJI Matric 300 RTK drone (DJI Technology Company Limited, Shenzhen, China) was equipped with an AQ600 PRO multispectral camera (Changguang Yuchen Information Technology and Equipment Company Limited, Qingdao, China). The AQ600 PRO sensor collects data across discrete spectral bands: blue (450 ± 35 nm), green (555 ± 25 nm), red (660 ± 20 nm), red edge (720 ± 10 nm), and near-infrared (840 ± 35 nm), with a pixel resolution of 2048 × 1536. Real-time radiometric correction was supported, with radiometric calibration images captured prior to each flight. Aerial image acquisition was conducted during the optimal timeframe of 10:00 to 15:00.
To ensure comprehensive data collection from the study area plots, we utilized a high-altitude UAV flight pattern. The UAV followed a pre-planned route at a speed of 3 m/s, maintaining an 80% overlap for both bypass and heading. After conducting several trial flights, an optimal flight altitude of 40 m was determined. Additionally, to guarantee the accuracy of spectral images, we employed a fixed-point manual control mode, hovering at a determined altitude of 12 m for image capture above each sampling point.
The multispectral images acquired via UAV were processed and analyzed using Yusense Map V 2.2.3 software, which is specifically designed for the multispectral camera. Initially, we employed the band alignment function within Yusense Map V 2.2.3 software. Subsequently, geometric and radiometric corrections were applied to eliminate potential noise and interference. The corrected UAV multispectral images were then imported into ENVI 5.3 software, where the corresponding spectral image portions were cropped according to the predefined sampling points. Finally, we calculated the average reflectance spectra for the straw mulch samples within the region of interest, yielding the spectral reflectance data.
2.3. Spectral Reflectance Measurement
The objective of the spectral reflectance measurements was to simulate and analyze variations in spectral reflectance associated with different straw mulch quantities within the same area, thereby assessing the impact of these variations on remote sensing estimates of straw mulch quantity and validating the feasibility of this study.
To facilitate this, we collected straw samples from the experimental area across four datasets in 2022 and 2023, placing approximately 0.7 kg of straw into black bags. The experimental environment was maintained under natural light conditions. A square area measuring 0.5 m × 0.5 m was designated for sampling, with a layer of black soil (0.01 m thick) spread evenly. The multispectral camera (AQ600 PRO) was fixed at a height of 1 m above the black soil. We collected multispectral data corresponding to varying straw mulch quantities, starting with 0.1 kg of straw evenly distributed across the sampling area and increasing in 0.1 kg increments up to 0.5 kg. This acquisition process is illustrated in
Figure 3, and we captured radiometric calibration target images prior to each set of measurements.
2.4. Spectral Index Construction
Spectral indices serve as metrics for analyzing and characterizing spectral data, enabling the study of the composition, structure, and properties of substances. By enhancing the separability of targets, spectral indices improve the accuracy of feature detection. Given their sensitivity to feature characteristics, we selected 12 widely used spectral indices for our experiments, with their calculation formulas detailed in
Table 3.
2.5. Optimization of the Inverse Model for Straw Mulch Quantity
Previously, our team compared various inversion models in the context of straw mulch quantity estimation, concluding that the Random Forest (RF) algorithm yielded the most effective inversion model [
42]. Therefore, we employed the RF algorithm in this study [
43]. This algorithm integrates ensemble learning with decision trees, utilizing resampling methods to enhance the prediction accuracy and convergence speed. However, empirical evidence suggests that hyperparameters—specifically, the number of decision trees (n_estimators), maximum tree depth (max_depth), and maximum number of leaf nodes (max_leaf_nodes)—significantly influence RF performance. Consequently, it is essential to balance accuracy and computational efficiency in parameter selection.
To address this, we introduced a hybrid optimization approach combining Genetic Algorithm (GA) [
44] and Particle Swarm Optimization (PSO) [
45], resulting in the GA-HPSO optimization algorithm. This method enables the RF model to autonomously identify optimal solutions, mitigating the subjectivity associated with manual parameter tuning and alleviating issues related to parameter sensitivity in model prediction performance.
The GA is advantageous due to its population diversity and global search capabilities; however, it lacks individual memory and exhibits slower convergence. Conversely, the PSO benefits from memory and rapid convergence but is susceptible to premature convergence and diminished diversity. By merging these two algorithms, we aim to leverage their complementary strengths. Parameter values such as inertia weights and learning factors in the PSO significantly affect convergence performance. Conventional approaches to these parameters can hinder convergence speed and overall efficiency. Additionally, initialization plays a critical role in accuracy and speed; uneven solution space distribution may lead to local optima.
Traditional PSO algorithms typically employ random initialization to determine initial population distributions. This method can result in uneven initial particle distribution within the solution space, adversely affecting early convergence and increasing the likelihood of local optima. To overcome this, we adopted chaotic initialization to replace the conventional random method. Chaotic initialization incorporates randomness, traversal, and regularity, allowing the search space to be traversed systematically without repetition, thereby enhancing both solution accuracy and convergence speed. In this study, we employed the Logistic Chaos Model for initialization [
46], as represented in Equation (1):
where
is the chaotic model mapping parameter. When
, the system exhibits chaotic behavior; at
, it reaches a fully chaotic state, which we employed in this study.
Next, the inertia weight parameter in the PSO algorithm significantly influences convergence performance. A larger inertia weight favors global exploration, while a smaller weight accelerates local convergence. To balance these capabilities, we introduced a dynamic weight factor (ω) to enhance both convergence speed and accuracy. The formulation of ω is shown in Equation (2):
where
represents the real-time objective function value of the particle.
and
denote the average and minimum objective value of all current particles, respectively. A larger weight at the search’s initial phase promotes global exploration, while a smaller weight in later phases fosters local optimization.
Furthermore, the learning factors
and
in the PSO algorithm impact overall performance and are typically fixed. In this study, to prevent rapid convergence around local optima during the initial stages, we allowed
to be larger and
to be smaller. Conversely, in the later stages, we adjusted
to be smaller and
to be larger to facilitate quick and accurate convergence to the global optimum. Thus,
and
are represented in Equations (3) and (4) as follows:
where
is the current iteration number.
is the maximum iteration number of the particle swarm.
,
are the
,
initial values and
,
are the
,
final values. When
and
, it is an asymmetric learning factor method.
Lastly, we incorporated crossover and mutation operators from GA into the PSO framework. These operations enhance population diversity and broaden the algorithm’s search range, thereby aiding in the attainment of a global optimum. The computational flow of the GA-HPSO algorithm is illustrated in
Figure 4.
The research environment utilized MATLAB 2018a, running on a Windows 10 operating system with an Intel (R) Core (TM) i7-8550U
[email protected] GHz 2.00 GHz processor and an NVIDIA GeForce 940MX graphics card.
2.6. Global Estimation Methods
To relate sample point data to global straw mulch quantity, this study employed the Monte Carlo method. This method’s fundamental principle involves obtaining desired results through random sampling and statistical averaging. The steps typically include defining the problem and required data; calculating necessary functions and probability distributions; designing a random sampling approach (e.g., utilizing pseudo-random numbers); and deriving statistical quantities from the sampled data, such as sums, averages, and variances. Conclusions are drawn based on these results.
Utilizing the Monte Carlo method, the straw mulch quantities obtained from sampling points via the optimal inversion model are treated as known samples. From these samples, an empirical distribution is established, leading to the formulation of a Gaussian mixture model. The probability density function of this model, constituted by the superposition of
Gaussian distributions, is expressed in Equation (5):
where
is the full Gaussian model parameter,
is 1,
is the weight,
is the sample,
is the model expectation, and
is the model variance.
The parameters for each class in the hybrid model are derived using an Expectation-Maximization (EM) algorithm with an iterative optimization strategy, comprising an Expectation step (E step) and a maximization step (M step). The E step is represented in Equation (6):
where
denotes the hidden variable associated with the unknown straw mulch quantity data.
is the parameter estimate of the initial Gaussian model.
is the parameter estimate of the Gaussian model with i iterations. The joint distribution is
. Given the observed data
and parameter estimates
, the conditional probability distribution of the hidden variable
is sought
.
The M step is shown in Equation (7):
This involves maximizing
to determine the parameter estimates for the subsequent iteration.
Employing the Monte Carlo principle, the global estimation method entails n random draws from a known distribution function, yielding n known sample data. Given the unknown probability distribution formula and normalization constant of the Gaussian mixture model, direct sampling from the distribution function proves infeasible. Thus, we employed the choice algorithm for random sampling, employing the accept–reject principle. In this process, a random point is accepted if it lies within the designated region; otherwise, it is rejected.
The ultimate goal of this research is to assess straw mulch quantity in the field and evaluate the effectiveness of straw return in conservation tillage. To achieve this, we need to establish the theoretical straw resources to facilitate subsequent calculations of straw volume return rates. The grass-to-grain ratio method [
47], as outlined in NY/T 1701–2009 [
48], was utilized to obtain the theoretical straw resources. The grass-to-grain ratio
is defined in Equation (8):
where
is the mass of straw,
is the mass of grain,
is the moisture content of straw, and
is the moisture content of grain. The standard moisture content of air-dried straw is 15%, while the national standard moisture content for grain crops is 12.5%. The theoretical corn straw resource
is articulated in Equation (9):
where
is the maize yield. The straw collectible resource
is shown in Equation (10):
where
is the straw collection coefficient, as shown in Equation (11):
where
is the average plant height of maize,
is the stubble height of mechanically harvested crops,
is the stubble height of manually harvested crops,
is the mechanised harvesting rate of the crop, and
is the rate of loss of the crop stalks during harvesting and transportation. The theoretical straw resources and collectible straw resources were calculated based on relevant data obtained from the study area, with results presented in
Table 4.