1. Introduction
In recent years, the Food and Agriculture Organization (FAO) reports show an increasing demand for wheat production to meet the world’s current population’s consumption [
1,
2,
3]. Wheat is the world’s largest cultivated cereal crop, covering approximately 220.62 ha (million hectares) in 2022/23 [
1,
4]. Wheat products contribute significantly to eradicating hunger, reducing poverty, and achieving global food security, which are central to the United Nations Sustainable Development Goals (SDGs) number 1 and number 2 [
5,
6,
7,
8]. The SDG 1 pertains to the eradication of poverty in all its manifestations worldwide, whereas SDG 2 focuses on ending hunger, attaining food security, enhancing nutrition, and fostering sustainable agriculture. However, several factors have gradually reduced wheat production over the years contributing to the increasing demands estimated at 70% for a population of 10 billion by 2050 [
9,
10,
11,
12]. The declining trend in wheat production is due to socio-economic factors, as well as abiotic and biotic stresses that have negative impacts on crop growth conditions and food security [
13,
14,
15,
16]. Abiotic stress factors include extreme heat, drought, frost, salinity, waterlogging, and nutrient deficits, while biotic stresses are due to crop diseases, weeds, and pests [
17,
18,
19]. Crop height is one of the important agronomic indicators related to production, providing an indication of crop stress and essential information and alerts about intra-field crop health conditions. Therefore, monitoring crop height throughout the growing season is critical to understanding the anticipated yields as well as improving wheat production forecasting.
The traditional methods such as Light Detection and Ranging (LiDAR) and Unmanned Aerial Services (UAS) have high accuracy when monitoring crop height development [
20,
21,
22,
23,
24]. Another accurate manual approach for crop height measurement is using a yardstick (ruler or meter scale) [
20,
21]. However, these approaches are generally time-consuming, expensive, labor-intensive, prone to human error, and unable to generate digital crop-height spatial distribution maps for large-scale farms [
22,
25,
26,
27]. Remote sensing satellite data have shown great potential to alleviate limitations of traditional measurements due to the satellites’ efficiency, digital wide intra-field of view, and ability to revisit time characteristics, which are key for accurate monitoring of crop biophysical parameters [
28,
29,
30,
31,
32]. The S-1 and S-2 satellite sensors offer high-resolution imagery and revisit time data for monitoring agricultural applications and land use/land cover changes for free [
33]. The S-1 and S-2 satellite sensors have better spatial resolution compared to the moderate-resolution imaging spectroradiometer (MODIS) and Landsat collections; as a result, these sensors are useful for farm-scale applications [
34,
35,
36]. The improvement in the spatial and spectral resolutions of both S-1 and S-2 sensors makes these satellites suitable for monitoring crop biophysical parameters such as height, among others. Additionally, both the S-1 and S-2 satellite sensors increase the frequency of data acquisition for field-scale crop monitoring. For example, the Copernicus Open Access Hub provides S-1 and S-2 images at 5-day intervals, which is suitable for timely crop monitoring and subsequent management and intervention [
16]. The S-2 satellite data are affected by cloudy atmospheric conditions and rain, which limit the acquisition of data during the crop growing season [
37]. However, S-2 images provide a red-edge band that is very sensitive to crops changes as the plants grow, which can help time-series monitoring. The S-1 satellite can provide data that are not obstructed by clouds and can complement S-2 satellite images [
38,
39]. Previous studies showed that there is a strong relationship between crop height and S-1 satellite variables with 66%, 82%, and 92% precision for wheat-height estimation, respectively [
40,
41,
42]. These S-1 satellite imagery variables include local incidence angle normalization, Alpha (α) and dual-polarization VV (vertical transmits and vertical receive), and VH (vertical transmit and horizontal receive). Furthermore, the application of S-1 satellite data has revealed 66% and 68% sensitivity for wheat-crop height estimation with linear regression (LR) and exponential models, respectively [
43,
44]. The Alpha (α) decomposition parameter has achieved a 67% wheat-crop height sensitivity prediction based on the use of SAR data [
42]. The use of S-1 satellite data has produced 87% and 62% correlations with RFR and LR for wheat-crop height sensitivity estimation [
45]. Nevertheless, S-1 satellite data have not been investigated extensively for agricultural applications because of the complex data structure, in comparison to S-2 satellite data [
35,
46,
47].
The synergistic use of SAR S-1 and optical S-2 sensors provides feasibility of capturing both spectral and textural information [
35,
40,
48,
49]. Additionally, the fusion of both S-1 and S-2 satellite data has been reported to improve the estimation accuracy of different biophysical parameters with the application of machine-learning algorithms [
48,
49,
50]. For example, RFR and particle filter (PF) models have achieved 92% and 95% correlation coefficient precision for estimation of rice-plant height using S-1 satellite data, respectively [
41,
51]. Furthermore, the LR model obtained 66% accuracy with SAR data for estimating wheat-crop height. Kaplan et al. [
40] have found that the Enhanced Vegetation Index (EVI) and the local incidence angle normalization model attained 88% and 66% precision for estimating cotton crop height using SAR and optical data, respectively. The local incidence angle normalization method has achieved accuracies of 86% and 87% for wheat and cotton-crop height estimation using SAR imagery [
52]. The crop coefficient (K
c) model achieved more than 80% estimation accuracy for tomato-crop heights when applied to the synergetic use of S-2 MSI and VENµS imagery [
53]. Abdikan et al. [
49] and Xi et al. [
54] also observed that the artificial neural network (ANN) model attained 91% of sunflower-crop height estimation, and the gradient boosting decision tree (GBDT) model attained 72% of forest-canopy height estimation using a S-1 and S-2 satellite data combination. In most studies, SAR and optical data are explored separately for crop height and biophysical variable estimation [
40,
41,
49,
52,
55]. Recently, studies have shown that these data products display limitations when used independently, which are overcome by using them synergistically [
56,
57,
58]. Xi et al. [
54] used ICESat-2, S-1, and S-2 satellite imagery to develop a crop-height model of different forest types in China. The derived matrices from S-1 and S-2 data showed that they played a critical role in modeling different forest types. Narin et al. [
45] use S-1 data to estimate the crop height of wheat in Turkey, and they found that VH was more sensitive compared to VV polarization. Ndikumana et al. [
41] used multitemporal S-1 data to model rice-plant height and dry aboveground biomass in southern France. They also reported that VH was more correlated with in situ measurements compared to VV and that the RFR model yielded an accuracy of more than 80% for both height and biomass models. Similar results were obtained by Sharifi et al. [
59]. Abdikan et al. [
49] used both S-1 and S-2 satellite data to predict crop height of sunflower crops in Turkey. The authors found that ANN and GBDT models produced higher accuracies of more than 90% and that NDVI with a red-edge band was one of the most important contributions. However, studies that have explored the feasibility of combining S-1 and S-2 satellite data to monitor intra-season and intra-field crop height variability for winter wheat fields are limited.
This current study examines the sensitivity of VV, VH, VH–VV ratio, and S-1 polarization indices’ data to monitor the time-series monthly intra-field crop-height growth of a wheat farm. Secondly, the combined use of SAR S-1 and optical S-2 spectral indices’ time-series monthly data for monitoring intra-field crop height variability is investigated. The RFR, SVMR, DTR, and NNR machine-learning algorithms (MLAs) are applied to both experiments. Both RFR and SVMR have been shown to be effective models for estimating crop parameters in previous studies [
41,
60]. The selection criteria for RFR, SVMR, DTR, and NNR MLA are associated with their reported capabilities to handle nonlinear and noisy data with precision when monitoring crop biophysical variables [
2,
41,
47,
54,
60]. This present study investigates the following detailed objectives: (1) assessing the relationship between intra-field crop height growth variation with S-1 VH, VV, VH–VV ratio polarizations, and S-2 vegetation indices; (2) investigating the effectiveness of the selected RFR, SVMR, DTR, and NNR algorithms in predicting crop height variations; (3) determining which input features are important for modeling crop height growth; (4) evaluating the measurement method that is fundamental for estimating crop height variation; and finally, (5) mapping the pattern of intra-field and intra-season crop height spatial distribution at the wheat-farm level.
4. Discussion
This study evaluated the feasibility of S-1 polarization backscatter bands, S-1 polarization indices, and data integration of S-1 and S-2 spectral indices for predicting intra-field and intra-season crop height variation in the wheat farm of Free State Clarens. Crop height was investigated for the different development stages. RFR, SVMR, NNR, and DTR machine-learning algorithm performances for this purpose were evaluated using SAR S-1 imagery separately and then utilizing a SAR S-1 and optical S-2 data fusion, respectively. The correlation matrix was used for determination, and predictions had significant correlation with actual crop height variation. The study also evaluated feature importance to assess the influence of each input feature on the evaluated models. The optimal machine learning regression algorithms’ estimation accuracies during the experiments were used to generate spatial distribution of crop height variation maps for the entire growing period. The findings revealed that the data integration of SAR S-1 and optical S-2 imagery has superior performance for estimating the actual intra-season crop height variability in comparison to S-1 satellite data only.
The crop height increase is expected because of wheat productivity over time. Furthermore, the correlation matrix indicates that the SARRI2 polarization index has a strong positive and significant correlation with the actual crop height variation in comparison to other features. Other features such as VV, VH, VH + VV, and SARRI1 were positively correlated with crop height at
p > 0.05. Similar findings have demonstrated that VH backscatter is strongly correlated to the estimation of crop height in rice, forest, and mangrove in comparison to other polarizations [
51,
88,
111]. Inversely, other studies revealed that maize, rice, wheat, and sunflower crop height estimation is well correlated with VV polarization over VH and VH/VV [
41,
42,
112]. The contribution of each type of S-1 SAR polarization cannot be generalized and was inconstant in this study and with every other estimated crop height biophysical parameter. Overall, for the VV polarization, the higher attenuation of the signal in vertical structure crop stems often decreases as the crop grows, while VH backscatter increases as the crop grows [
51,
82,
113,
114]. This may suggest the superior performance of the VH polarization scenario over VV in actual crop height estimation.
The evaluation of the four non-parametric RFR, SVMR, NNR, and DTR analytical models for predicting the wheat-crop height showed that the RFR model outperformed SVMR, NNR, and DTR models with data integration of S-1 and S-2 satellite data. The reasonable performance of RFR for crop height estimation in the current study is consistent with and similar to findings from previous studies, which focused on global vegetation canopy-, rice-, and maize-crop height [
24,
41,
82]. These results are frequently attributed to the higher capacity of RFR in predicting crop biophysical variables [
82]. In addition, RFR can handle large datasets using regression tree average values, while preserving high accuracy and minimizing model overfitting risk [
115]. This study attained an accuracy R
2 of 0.93 and 8.43 cm RMSE with RFR as the superior model. These findings are similar to those of Ndikumana et al. [
41], which revealed an accuracy of R
2 = 0.92 and 7.9 cm RMSE during the prediction of rice-crop height using RFR with multitemporal S-1 SAR data. Additionally, Han et al. [
116] used UAV imagery for estimating wheat growth and achieved R
2 of 0.6 to 0.79 and 1.68 to 2.32 cm RMSE with RFR. Furthermore, RFR obtained an R
2 of 0.96 for estimation of plant height in eucalyptus using the traditional measurement method (dendrometry) [
117]. Contrary to the above results. SVMR performed better than RFR, NNR, and DTR for wheat-crop height estimation using S-1 SAR data though with unsatisfactory accuracy results. Ji et al. [
118] found similar results that SVMR can enhance the precision of crop height prediction. The better performance of SVMR is usually aligned to the use of kernel functions that obtain optimal hyperparameter values and influence SVMR model accuracy [
119]. The hyperparameter tuning, feature selection, dataset volume, and quality can also influence the performance of ML models. These alterations in model performance precision are associated with differences in the input variables and may vary across studies.
Variable importance ranking was conducted to show the contribution of each input predictor feature for predicting wheat-crop height. The findings showed that RVI_S1 and Pol have higher ranking in both RFR and SVMR when predicting crop height. However, other input variables were important during the training of the models, but their rankings were low and varied. These results are in contrast to other studies that found the VV backscatter band as more important than RVI and other S-1 SAR polarization indices [
46,
76,
82,
112,
120]. The contrasting findings and differences existing in variable importance assessments are related to crop parameters and variations within the input features for model training in these studies. For instance, other studies focused on biomass of maize, grassland, barley, and wheat [
82,
120], while this current study focuses on crop height biophysical parameters for wheat growth.
The crop height estimations were compared using in situ measurement and RFR and SVMR machine-learning models. The findings showed that in situ measured crop height was similar to that measured using RFR at the early development stages. However, SVMR overestimated the crop height during development stages except during the maturity stage. The RFR model underestimated the crop height in the last two stages of crop development. Generally, both RFR and SVMR models overestimated crop height at late development stages. Our findings validate that RFR and SVMR can estimate plant height using satellite imagery, which is consistent with other previous studies [
41,
49,
59,
121]. The increase in crop height estimation was expected as the crop grew, and it reached maximum growth greater than 100 cm at the maturity stage before its decline at the senescence stage. Our findings are similar to other studies, which observed that crop height reaches up to 41.83, 53, 65, 75, 85, and greater than 90 cm for wheat at the maturity stage [
22,
42,
55,
112,
113]. In general, wheat crops could have similar and different height observations at the maturity stage due to differences in growth conditions, which cause variations within regular phenology [
42,
122,
123].
The spatial distribution of intra-field and intra-season crop height variability was mapped. In general, the crop height spatial distribution maps generated from RFR and SVMR MLA were similar, except during the first and last growth stages. These results demonstrated the feasibility of MLA to estimate intra-field and intra-season crop height variation. The crop height maps produced in this study can be used as a practical guide to identify real-time growth problems within the intra-field level and inform decision making for management zones. Additionally, agricultural research institutions and extension officers can also benefit from these crop height maps to make accurate recommendations that help wheat farmers avoid yield losses and customize wheat-crop insurance. Monitoring wheat-crop height throughout the growth cycle provides useful spatiotemporal information for crop management and could enhance crop yields to meet increasing demands in the global market [
14,
22]. The application of this study is the contribution to precision agriculture farming management, sustainable agriculture, and realization of SDG number 2 (Zero Huger) interventions to improve food security [
5,
124]. Also, it contributes to the SDG number 1 (No Poverty) initiative for eradicating hunger and reducing poverty. The limitations of this study, because of high fieldwork costs, include one planting season visited monthly for crop height measurements and the lack of yield data at the end of the planting season to compare with wheat-crop growth productivity. Future research may consider including seasonal crop growth datasets, meteorological data, identification of crop disease, and other crop structural parameters including leaf chlorophyll content (LCC), the leaf area index (LAI), and crop density for holistic understanding of crop growth monitoring. Such datasets will be useful to famers when identfying appropriate windows for early production assessment [
64].