Next Article in Journal
Central Difference Variational Filtering Based on Conjugate Gradient Method for Distributed Imaging Application
Previous Article in Journal
Classification and Mapping of Fuels in Mediterranean Forest Landscapes Using a UAV-LiDAR System and Integration Possibilities with Handheld Mobile Laser Scanner Systems
Previous Article in Special Issue
Inferring the Variability of Dielectric Constant on the Moon from Mini-RF S-Band Observations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning Modelling for Soil Moisture Retrieval from Simulated NASA-ISRO SAR (NISAR) L-Band Data

1
Indian Institute of Remote Sensing (IIRS), Indian Space Research Organisation (ISRO), Dehradun 248001, India
2
Regional Remote Sensing Center-North, Indian Space Research Organisation (ISRO), New Delhi 110049, India
*
Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(18), 3539; https://doi.org/10.3390/rs16183539
Submission received: 12 April 2024 / Revised: 1 July 2024 / Accepted: 20 September 2024 / Published: 23 September 2024

Abstract

:
Soil moisture is a critical factor that supports plant growth, improves crop yields, and reduces erosion. Therefore, obtaining accurate and timely information about soil moisture across large regions is crucial. Remote sensing techniques, such as microwave remote sensing, have emerged as powerful tools for monitoring and mapping soil moisture. Synthetic aperture radar (SAR) is beneficial for estimating soil moisture at both global and local levels. This study aimed to assess soil moisture and dielectric constant retrieval over agricultural land using machine learning (ML) algorithms and decomposition techniques. Three polarimetric decomposition models were used to extract features from simulated NASA-ISRO SAR (NISAR) L-Band radar images. Machine learning techniques such as random forest regression, decision tree regression, stochastic gradient descent (SGD), XGBoost, K-nearest neighbors (KNN) regression, neural network regression, and multilinear regression were used to retrieve soil moisture from three different crop fields: wheat, soybean, and corn. The study found that the random forest regression technique produced the most precise soil moisture estimations for soybean fields, with an R2 of 0.89 and RMSE of 0.050 without considering vegetation effects and an R2 of 0.92 and RMSE of 0.042 considering vegetation effects. The results for real dielectric constant retrieval for the soybean field were an R2 of 0.89 and RMSE of 6.79 without considering vegetation effects and an R2 of 0.89 and RMSE of 6.78 with considering vegetation effects. These findings suggest that machine learning algorithms and decomposition techniques, along with a semi-empirical technique like Water Cloud Model (WCM), can be effective tools for estimating soil moisture and dielectric constant values precisely. The methodology applied in the current research contributes essential insights that could benefit upcoming missions, such as the Radar Observing System for Europe in L-band (ROSE-L) and the collaborative NASA-ISRO SAR (NISAR) mission, for future data analysis in soil moisture applications.

1. Introduction

Soil moisture (SM) plays an important role in the hydrologic event that connects the earth’s surface and atmospheric processes [1,2]. An in-depth understanding of soil moisture dynamics is essential for a wide range of meteorological, climatologic, and hydrologic applications [1,3,4]. Soil Moisture is an influencing component in crop growth and development in agricultural applications and an early warning indicator of agricultural drought emergencies [5,6]. Manual soil moisture content (SWC) mapping through field surveys is difficult as it consumes time and labour and provides limited data due to sampling constraints [7,8]. Temporal mapping of large areas is also challenging, as surface soil moisture is dynamic in nature and changes frequently over time.
Airborne and spaceborne remote sensing techniques keep track of the soil on a large scale and provide detailed data at reasonable temporal and spatial resolutions, allowing us to study soil moisture dynamics over a large area more effectively [1]. Soil moisture can be obtained from optical and microwave remote sensing [9]. Optical remote sensing is mainly sensitive to the topsoil surface as it relies on the reflection of light from the earth’s surface [10]. Microwave remote sensing provides a unique capability and has shown great potential for soil moisture estimation as it removes most of the constraints of optical remote sensing over retrieving soil moisture [11]. Active and passive microwave remote sensing has shown the capability to attain observations from a regional level to a global level [12]. Due to the spatial resolution limits of passive radiometers, which are often coarse, active microwave remote sensing is frequently utilised to estimate soil moisture [13]. Although active microwave remote sensing provides high spatial resolution, it is more sensitive to dielectric characteristics, surface parameters, vegetation, soil texture and sensor parameters [14]. The scattering behaviour of a surface is influenced by its geometric and dielectric properties concerning incident radiation [15]. The presence of vegetation canopy and surface roughness can modify the radar backscatter’s sensitivity to soil moisture. As a result, research efforts focused on estimating soil moisture generally prioritise examining bare surfaces where these factors are minimised. Polarimetric SAR provides polarised information on the targets, which enhances the radar capability in the scattering mechanism. Polarimetric decomposition is a useful technique to separate microwave scattering resulting from different targets and can distinguish between the scattering from the surface and volume targets, this method could be useful for measuring soil moisture in both bare and cropped conditions.
To estimate the dielectric constant values, various models have been developed in the microwave region of the electromagnetic spectrum to simulate the dielectric behaviour of soil [16,17]. The empirical model enhances our understanding of how soil–water mixtures behave. The model provides a way to estimate soil properties at different frequencies based on soil texture and water dielectric constants [18]. One notable example is Dobson’s work [19]. In that study, an empirical relationship was developed to estimate the real and imaginary parts of the dielectric constant at frequencies of 1.4 and 5 GHz [20]. Similarly, Hallikainen later proposed an empirical relationship for the dielectric constant of soil samples measured at frequencies ranging from 1.4 to 18 GHz [21]. While these models have their advantages and disadvantages, it is worth noting that some are site-specific, require several parameters, and are mainly developed for the dual pol dataset. The Dobson and Hallikainen models are limited to specific frequency ranges and require several sensor and target parameters to estimate the permittivity. The semi-empirical models, like the modified Dubois models, which are widely used to estimate the dielectric constant from dual-pol data, have limitations, such as being site-specific, usable only in bare or sparsely vegetated fields, and mainly developed for dual-pol datasets. A new mineralogy-based dielectric model (MBSDM) was developed using clay content as the input, which showed lower prediction errors compared to the semi-empirical mixing dielectric model (SMDM). The MBSDM balances predictions with the application, making it an excellent choice for estimating moist soil dielectric properties [22]. A frequency dielectric model designed for tundra organic soil at 1.4 GHz effectively forecasts the soil’s dielectric features under different moisture levels, densities, and temperatures. The model’s accuracy is comparable to the measurement error, ensuring performance [23]. Integrating matter (OM) into mixing models enhances the precision of microwave radiative transfer modeling by minimising errors in brightness temperature (TB) calculations. This adjustment improves the soil moisture readings from sensors like Soil Moisture Active Passive (SMAP), Soil Moisture and Ocean Salinity (SMOS), and Advanced Microwave Scanning Radiometer 2 (AMSR2) by lowering the constant with higher organic matter content. The enhanced model decreased the root mean square error (RMSE) of SMAP observed brightness temperature by 11%, showcasing its effectiveness in regions with matter [24]. The new dielectric mixing model for phase soil significantly enhances the precision of microwave remote sensing and portable soil sensors, reducing RMSE by 23–35% for SMAP, SMOS, and Advanced Microwave Scanning Radiometer for EOS (AMSR-E) and up to 53% for time domain reflectometer (TDR) and ground penetrating radar (GPR). It offers a fit to data without the need for nonphysical parameters and accurately forecasts dielectric constants in both saturated and unsaturated conditions. This model is expected to improve flood monitoring accuracy via satellite [25].
Several Studies have been conducted to estimate the soil moisture using L-band data. The research showcased how L-band quad-polarised data and decomposition methods effectively determined surface soil moisture in a small field. The Van Zyl decomposition revealed the surface scattering element, while the Yamaguchi decomposition displayed a correlation with field-measured soil moisture and demonstrated high accuracy. Tien-Hao Liao et al., 2021 [26] studied UAVSAR data combined with a radar scattering model based on physics that effectively calculates near-surface soil moisture in an area affected by grassland landslides, achieving an RMSE of 0.058 m3/m3. The Freeman Durden decomposition indicates relationships between soil moisture, surface scattering, and double bounce. This technique can be applied to grassland landslide areas to monitor soil moisture. The utilization of data and multiple incidence angles significantly improves the alpha approximation method for estimating soil moisture, reducing RMSE by 0.4% to 5% compared to single pol and single incidence data. This enhanced approach demonstrates accuracy when applied to corn, soybean, and wheat crops [27]. Burgin et al., 2017 [28] introduce a method that minimises the reliance on data by modeling radar reflections as a linear relationship with soil moisture utilizing coefficients obtained from Aquarius radar and radiometer information. This approach showcases soil moisture estimation based on SMAP radar data, which has been verified against the SMAP Level 2 radiometer-only soil moisture dataset.
Vegetation influences the soil moisture; thus, quantifying the vegetation contribution is sometimes important in the agriculture area. Using SAR data to estimate soil properties and vegetation presents challenges due to the interactions between vegetation water content and electromagnetic radiation, which affect scattering and absorption phenomena. The radar signals wavelength-dependent nature adds another layer of complexity, making it difficult to separate soil and vegetation contributions. Remote sensing models encounter difficulties in determining these properties, recommending the need for techniques in SAR data analysis [29,30,31,32]. A vegetation descriptor called Dual-pol Radar Vegetation Index (DpRVIc) in a dual-pol L-band SAR setup was introduced, which effectively accounts for vegetation influences to estimate soil moisture using the Water Cloud Model (WCM). The proposed descriptor has shown accuracy, followed by NDVI and RVIdp across types of crops and test locations. These results underscore the potential of using dual pol GRD SAR L band data for soil moisture estimation for upcoming missions like NISAR [33]. In regions with dense vegetation cover, the L-band SAR data can effectively predict the soil moisture by incorporating the semi-empirical water cloud model parameterised with Leaf Area Index [34]. Integrating the water cloud model and machine learning model like neural network algorithms suggests that SAR data can be effectively used for soil moisture estimation and improves the model’s overall accuracy [35]. Therefore, physical models inform and justify the application of ML algorithms
To utilise the potential of fully polarimetric datasets, Machine Learning can be employed. Using machine learning techniques, it is possible to accurately model the complex relationships between different factors and input data to estimate soil moisture levels. When it comes to this task, many people rely on two popular techniques: neural network and random forest algorithms. Hajj et al., 2017 [36] and their team successfully trained a neural network algorithm by utilising a simulated database. High-accuracy soil moisture estimations were achieved successfully. Carranza et al., 2021 [37] utilised a random forest algorithm to examine the correlation between ground-based soil moisture measurements and remotely sensed variables. They were able to achieve strong performance and capture the nonlinear relationship between the two. The first thorough assessment of twelve sophisticated statistical and machine learning (ML) methods for the estimate of soil moisture (SM) using dual-pol sentinel 1 data was conducted. The top three models with similar and promising SM estimates, according to the study, were the random forest (RF), subtractive clustering (SBC), and Adaptive Neuro Fuzzy Inference System (ANFIS). To assess the resilience of these top models (RF, SBC, and ANFIS), further performance study was conducted on different datasets. The results show that all three models performed consistently, with SBC being the most recommended model for SM estimation [38]. These studies show that machine learning techniques have the potential to estimate soil moisture accurately and efficiently.
The primary aim of this investigation is to assess the capacity of fully polarimetric synthetic aperture radar data to examine L-Band for estimating soil moisture using polarimetric decomposition technique and Machine Learning modelling. The study employed three different decompositions, namely Freeman-Durden three-component, Van Zyl, and H/A/Alpha, to identify different scattering mechanisms along with additional input polarimetric features to estimate soil moisture and soil dielectric constant in three different crop fields (i.e., corn, wheat, and soybean). Multiple machine learning algorithms were assessed to determine the optimal method for estimating soil dielectric constant and soil moisture.

2. Materials and Methods

2.1. Study Area and Dataset

The Soil Moisture Active Passive Experiment 2012 (SMAPVEX12) was a field campaign established to develop soil moisture retrieval algorithms and support the early adopter project of the Soil Moisture Active Passive (SMAP) mission. The campaign was conducted in a region of Manitoba, Canada (98°00′23″W, 49°40′48″N) situated within the Red River Watershed, and the Assiniboine River flows through the northern part of the area [39]. The field data collected in the region covers approximately 6 weeks in-situ soil dielectric constant and soil moisture data, which was the main reason to consider it as the study area. It covers a distance of 12.8 km × 70 km and demonstrates substantial changes in surface soil moisture as a result of varying soil textures.
The region experiences a notable variation in surface soil moisture due to the transition from dense clay to light, loamy sands from the east to the west. Microwave soil moisture techniques have been extensively employed to study this region due to the significance of agriculture in the area and the spatial variability of soil moisture [15]. The SMAPVEX12 site has a heterogeneous crop distribution, with cereals encompassing 23.4% of the crop acreage, trailed by canola at 16.0%, corn at 9.1%, soybeans at 18.5%, and perennial cover at 14.6%. Figure 1 Illustrate the position of crop fields and the field data point distribution strategy.
During the SMAPVEX12 campaign, field staff collected soil and vegetation data from 55 farmland fields and 4 forest locations between 7 June and 19 July 2012. In-flight days, soil moisture readings were taken on both the farmland and forest sites, while surface roughness measurements were taken on non-flight days. According to Figure 1b, there were 16 sample points per farmland pitch, with crews gathering three replicate soil moisture readings at each sample site to record regional heterogeneity caused by soil characteristics or management approaches [40]. The in-situ soil dielectric constant and soil moisture data from farmland were utilised alongside the QuadPol SAR (simulated NISAR) dataset for the same date, as detailed in Section 2.2.

2.2. Simulated NISAR Dataset

The data used in this study were provided by Jet Propulsion Laboratory (JPL), NASA. The data was acquired through the UAVSAR sensor, which utilises L-band frequencies it was then converted into a Simulated NISAR product. Its descriptive information about the conversion of UAVSAR data into simulated NISAR products is provided at this link: https://uavsar.jpl.nasa.gov/science/documents.html (accessed on 29 June 2024). The purpose of simulating NISAR data is to help the scientific community test algorithms and assess the quality of NISAR products before the NISAR launch. The QuadPol simulated NISAR 138A mode of 1.253 GHz Frequency and a (Ground Range Detected) resolution of 7.3 m × 7.3 m was used. The angle of incidence was 33.88 in the near range and 47.2 in the far range. A total number of 12 NISAR Simulated datasets were used from Flight Line ID: 31606 provided in Table 1 with their respective acquisition dates, in-situ soil moisture measurement, and roughness availability. The data is available in a multi-looked complex format on the website https://uavsar.jpl.nasa.gov/ accessed on 29 June 2024.

2.3. Methodology

This study aims to retrieve soil moisture and soil dielectric constant using various Machine Learning algorithms. The flowchart in Figure 2 illustrates the process of machine learning. The covariance matrix, denoted as [C3], was extracted from the QuadPol Simulated NISAR orthorectified product. Polarimetric parameters were extracted using both model-based and eigenvalue-eigenvector-based decompositions. The model was trained using the following polarimetric input parameters: Alpha, Anisotropy, Entropy, Freeman-Durden dihedral, Freeman-Durden surface, Freeman-Durden volume, Surface/Dihedral, Surface/(Surface +Dihedral +Volume), Van Zyl dihedral, Van Zyl surface, Van Zyl volume, Soil roughness parameters RMS-Height, Correlation Length, VH(dB), VV(dB), HH(dB), HV(dB), HV(dB)/HH(dB), HV(dB)/VV(dB), VH(dB)/HH(dB), and VH(dB)/HH(dB). The estimation of soil moisture and soil dielectric have been done in three ways in three different crop fields. The first model does not consider the vegetation effect, the second model considers the vegetation effect by incorporating two-way attenuation and RVI parameters, and the third model does not consider the RMS and correlation length.
Seven different machine learning algorithms, namely random forest regression, decision tree regression, stochastic gradient descent (SGD), XGBoost, K-nearest neighbors (KNN) regression, neural network regression, and multiple linear regression were trained on 70% and tested on 30% dataset which is a thumb rule in machine learning modelling. The in-situ soil moisture and soil dielectric constant data obtained from the SMAPVEX12 field campaign conducted in Manitoba, Canada, as mentioned in Section 2.1 was used to test and validate the results. The best model was used to retrieve the soil moisture and soil dielectric constant map.

2.3.1. Polarimetric Decomposition Technique

The Freeman-Durden decomposition model was used as an input parameter in the machine learning algorithm because it provides valuable insights into the scattering properties of different targets, such as vegetation, urban structures, and bare soil. This specification allows the algorithm to distinguish between these different land cover types effectively [41]. The scattering vector utilised in Freeman-Durden analysis is defined as in Equation (1).
k L = [ S H H 2 S H V S V V ] T
C = C v + C d + C s
C v = f v 8 3 0 1 0 2 0 1 0 3 P v = f v
C d = f d | α | 2 0 α 0 0 0 α 0 1 P d = f d 1 + | α | 2
C s = f s | β | 2 0 β 0 0 0 β 0 1 P s = f s 1 + | β | 2
Equation (2) is composed of three submatrices, where [Cv], [Cd], and [Cs] refers to the covariance matrix of the volume scattering (Equation (3)), dihedral (Equation (4)), and single scattering (Equation (5)) and fv, fd, and fs represent the energy, respectively. Ps, Pd, and Pv are freeman attributes. The α and β represent the data of SHH/SVV for the dihedral scattering and the single bounce scattering respectively.
Another technique used as an input parameter to the ML model was the H/A/α decomposition method. The H/A/α decomposition approach is useful for characterising target scattering processes, such as surface scattering, double-bounce scattering, and volume scattering, which are associated with different types of land cover and man-made structures [42]. The coherency matrix can be written as Equation (6).
T = u 3 Σ u 3 1 = i = 1 i = 3 λ i u i u i T
H = i = 1 3 P i l o g 3 ( P i )
α = i = 1 i = 3 P i α i
P i = λ i Σ j = 1 3 λ j
A = λ 2 λ 3 λ 2 + λ 3 for   H > 0.7
where u3 represents three unit orthogonal eigenvectors, Equation (7) represents the logarithmic sum of eigenvalue, Equation (8) represents α, where Pi is the probability obtained from the eigenvalues Equation (9). The third parameter extracted from the H/A/α, known as anisotropy, is given in Equation (10).
The Van Zyl decomposition was used to recover useful polarimetric information, particularly those covered in vegetation. It is intended particularly for estimating the volume backscattering component from vegetated regions [43]. It determines the volume contribution of backscattering and separates it from other components, allowing for a more accurate study of soil moisture in vegetated regions.
C = α C m o d e l + C r e m a i n d e r
C r e m a i n d e r = ξ 0 ρ 0 η 0 ρ 0 ς α ξ α 0 ρ α 0 η α 0 ρ α 0 ς α
λ 1 = 1 2 { Y + X } λ 2 = 1 2 { Y X } λ 3 = η a η α X = Y 2 4 ξ a ξ α ς a ς α + 4 ρ a ρ α 2 Y = ξ + ς a ξ α a ς α
where [Cmodel] is the covariance matrix, [Cremainder] is symmetric in Equation (11). The average covariance matrix in Equation (12). The eigenvalues of this matrix are described by Equation (13). The ξ s, ς , and, ρ , η all depend on the size, shape, and electrical properties of the scatterers, as well as their statistical angular distribution.

2.3.2. Vegetation Attenuation

The WCM technique estimates the backscattering coefficient obtained from the area with vegetation coverage as expressed in Equations (14)–(17). The expression given below describes the interaction process between radar signal and vegetation. In this model, the total backscattering coefficient is a sum of the normalised radar cross-section of the vegetation and the two-way attenuation coefficient of the vegetation from the soil. The variables V1 and V2 are vegetation descriptors and improve the overall performance of the WCM. Here, the Leaf Area Index (LAI) and Plant Water Content (PWC) given in Equation (18) have been used as V1 and V2, respectively. The LAI and PWC data collected from the field campaign are available at https://smapvex12.espaceweb.usherbrooke.ca/home.php accessed on 29 June 2024.
σ P P 0 = σ v e g 0 + L 2 σ s o i l 0
σ v e g 0 = A V 1 c o s θ i ( 1 L 2 )
L 2 = exp ( 2 B V 2 s e c θ i )
σ s o i l 0 = C + D × M V
p l a n t   w a t e r   c o n t e n t   ( K g / m 2 ) = f r e s h b i o m a s s   ( K g / m 2 ) d r y b i o m a s s   ( K g / m 2 )
where A and B are vegetation parameters, C and D are bare soil parameters, θi is incidence angle, and MV is the volumetric soil moisture.
The machine learning model receives the two-way attenuation that it derives from the WCM model. To increase overall accuracy, the ML model also incorporates the Radar Vegetation Index (RVI), which is obtained from radar observation as expressed in Equation (19).
R V I = 8 σ H V σ H H + σ V V + 2 σ H V

2.3.3. Machine Learning (ML) Modelling

Random forest regression was utilised, and numerous input factors, such as polarimetric data, were employed. During the training phase, each tree is constructed by recursively splitting the data based on the characteristics that offer the greatest information gain, it has a lower risk of overfitting, and it can account for missing data [44].
When estimating soil moisture, multilinear regression is used to make predictions about the soil moisture content based on several polarimetric input variables. The data that are fed into the method are used to calibrate a linear equation that, when applied, provides the most accurate description of the connection between the independent variables and the dependent variable [45].
In decision tree regression, the method models the relationship between the input polarimetric factors and the output variable (in this case, the soil moisture content) by building a binary tree structure. The nodes of the tree structure stand for the decisions that were made based on the input variables, and the branches of the tree structure stand for the effects of those decisions [46]. Decision tree regression is prone to be affected by overfitting and noise in the data.
Another regression known as the Stochastic gradient descent (SGD) model is iteratively adjusted based on a randomly chosen part of the training data that is referred to as a mini-batch. Nevertheless, SGD regression can be sensitive to the choice of learning rate hyperparameter [47], which can affect the algorithm’s ability to converge and its accuracy. To guarantee that the model is accurate, it is necessary to make use of relevant strategies such as early halting and learning rate scheduling.
XGBoost combines the abilities of several weaker models to create a more powerful predictive model. It uses a technique called gradient boosting. This means that it creates a predictive model by repeatedly improving it in small steps [48].
KNN regression works with a wide range of metrics that measure distance. In addition to this, it is simple to construct and does not need any assumptions to be made about the data that is being used [49].
Another effective method used in this study is the neural network, which can learn complicated patterns and correlations between the polarimetric variables fed into it and the soil moisture it produces as an output [50].

3. Results

This section discusses the results obtained from the machine learning models. This section is divided into three parts. The first section shows the results obtained from the ML models without considering the vegetation effects (not including the WCM and RVI). The second section presents the results obtained after considering the vegetation effect (including the WCM and RVI as input parameters). The third section discusses the results without considering the roughness parameters (i.e., RMS height and correlation length).

3.1. Results without Considering Vegetation Effects

Before applying machine learning modelling, correlation matrixes were generated for all the crop fields, which shows the correlation between the soil moisture and other polarimetric features. The correlation analysis was conducted to examine the strength of the relationship between soil moisture and various polarimetric features, including surface roughness parameters provided in Figure 3.
Soil dielectric constant and soil moisture were estimated in three distinct agricultural fields (corn, wheat, and soybeans) using the polarimetric decomposition technique and machine learning. Radar backscattering and numerous soil-related characteristics were gathered as a collection of SAR polarimetric inputs. The model’s soil dielectric constant outputs were validated using field-measured values through statistical measures such as the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE) shown in Figure 4. A confidence interval (CI) and prediction interval (PI) were plotted in addition to a regression line fitting the data. The confidence interval (CI) is represented graphically by a yellow line, whereas the prediction interval (PI) is illustrated as a continuous blue line. The regression line that provides the closest fit to the data is depicted in red. The yellow line represents the upper and lower limits of the best-fit line, which are determined by the 95% confidence interval (CI). This means there is a 95% probability that the true value of the best-fit line falls within these limits. It is important to acknowledge that despite having a 95% confidence interval, there is still a 5% chance that the actual linear regression line that best fits the data may fall outside of the boundaries of the confidence interval.
In Figure 4a, the random forest output for the soil dielectric constant estimation in the soybean field exhibits a coefficient of determination of 0.89, an RMSE of 6.79, and an MAE of 5.15. Except for two data points, all other points fall within the prediction interval. Moving on to the wheat field shown in Figure 5a, random forest outperforms other algorithms with a coefficient of determination of 0.73, an RMSE of 3.99, and an MAE of 2.64. However, one data point lies completely outside the 95% prediction interval, and another data point falls outside the prediction interval but remains relatively close to it. In Figure 6a, the random forest algorithm was not efficient in the corn field compared to other algorithms; however, it still outperformed them. The output shows a coefficient of determination of 0.72, an RMSE of 1.96, and an MAE of 1.62. Only one data point was observed outside the prediction interval.
Decision tree regression also showed better performance in the soybean field, as depicted in Figure 4b. The majority of the data points fall within the upper and lower bounds of the 95% prediction interval, with only two data points lying outside. The coefficient of determination is 0.84, the second highest after random forest, while the RMSE is 8.16 and the MAE is 6.05. The minimum accuracy for the decision tree was found in the Corn field, depicted in Figure 6b, where the data points are found within the upper and lower bounds of the 95% prediction interval, except for one data point. However, the coefficient of determination is only 0.30, with an RMSE of 3.10 and an MAE of 2.42. For the corn field, the lowest accuracy was also found in the multilinear regression. It seems that the model does not fit the data well.
In Figure 7a, the random forest output for the soil moisture estimation in the soybean field exhibits a coefficient of determination of 0.89, an RMSE of 0.05, and an MAE of 0.04. Except for two data points, all other points fall within the prediction interval. Moving on to the wheat field shown in Figure 8a, random forest outperforms other algorithms to estimate soil moisture with a coefficient of determination of 0.88, an RMSE of 0.03, and an MAE of 0.02. However, two data points fall outside the prediction interval but remain relatively close to it. In Figure 9a, the random forest algorithm was less efficient in corn than in soybeans and wheat fields. The output shows a coefficient of determination of 0.78, an RMSE of 0.03, and an MAE of 0.02. Only one data point was observed outside the prediction interval. Based on the results, it is evident that random forest outperformed the other algorithms.
Additionally, the decision tree regression showed better performance in estimating soil moisture from the soybean field, as depicted in Figure 7b. The majority of the data points fall within the upper and lower bounds of the 95% prediction interval, with only one data point lying near the lower prediction interval. The coefficient of determination for the decision tree was 0.88, the second highest after random forest, while the RMSE is 0.05 and the MAE is 0.04. The lowest accuracy for the decision tree was found in the corn field, depicted in Figure 9b, where the data points are found within the upper and lower bounds of the 95% prediction interval, except for two data points, the coefficient of determination for the decision tree was 0.34, with an RMSE of 0.05 and MAE of 0.04. For the corn field, the lowest accuracy was found in XGBoost and the multilinear regression with the coefficient of determination of −0.25 and −0.07. It seems that the model does not fit the data well. For other algorithms, such as XGBoost, KNN, MLR, Stochastic gradient descent, and multilinear regression, the analysis is given in Table 2.
The lowest accuracy was found for the corn field, where the coefficient of determination in both multilinear regression and XGboost was negative. This could be due to problems with the data or the model itself, as it is unable to capture any significant relationship between the independent and dependent variables—that is, it is unable to explain or predict the variation. In such cases, the coefficient of determination becomes negative. One of the reasons for the best results in soybean corn was the soybean open canopy, which means that there is less vegetation cover and spacing between the plants. Also, there is less plant material above the ground, which creates a situation where SAR signals can more easily penetrate through the canopy and interact directly with the soil beneath. When the SAR signal interacts with the soil, the sensor gets sufficient backscattering, which provides valuable information on soil moisture content. In the case of corn, it has denser canopies with a large amount of above-ground biomass, making it more challenging for the SAR signals to penetrate through the canopy.
More than 20 polarimetric features were used to train the machine learning model. Certain features, such as surface roughness parameters like correlation length and RMS-H cannot be eliminated as they play a crucial role and affect the SAR backscattering. The Decomposition technique and machine learning–based modelling played a crucial role in estimating soil moisture without considering the RMS-H and correlation length. This type of experiment is useful where the surface roughness parameter is not available as input to the model and machine learning–based retrieval, which suffer from unavailability. The results are presented in Table 2, which shows the modelled output of soil moisture without considering the roughness parameter. It was observed that random forest performed very well on the wheat field, with a coefficient of determination of 0.87. For the corn field, the coefficient of determination was 0.75, while it was 0.45 for the soybean field, which is comparatively less compared to the other crop fields. Interestingly, XGBoost and the neural network were found to be ineffective in predicting the variations in the dependent variable.

3.2. Results with Consideration of Vegetation Effects

The vegetation effect was considered, and all seven machine learning models were re-modeled, incorporating the two attenuation factors from the water cloud model and the Radar Vegetation Index retrieved from the full polarimetric data. The parameters C and D were obtained through linear model fitting. Parameters V1 and V2 represent the LAI and PWC. Parameters A and B were calibrated using the vegetation descriptors along with the soil parameters. The dielectric constant retrieved from the machine learning for the soybean field is shown in Figure 10.
The random forest algorithm was the best performer in the soybean field, as shown in Figure 10a, and demonstrated good agreement with an R2 of 0.89, RMSE of 6.78, and MAE of 4.38. The decision tree algorithm, shown in Figure 10b, performed as well as the random forest algorithm, showing the capability to estimate the dielectric values in the soybean field. The decision tree algorithm had an R2 of 0.88, RMSE of 7.10, and MAE of 4.69. The XGBoost algorithm in Figure 10c and the neural network algorithm in Figure 10g provided the same R2 of 0.74; however, the RMSE and MAE values for XGBoost were lower, at 10.27 and 7.40, respectively, whereas the RMSE and MAE of the neural network algorithm were 10.35 and 7.90. The performance of the SGD algorithm, shown in Figure 10d, was lower compared to the random forest, decision tree, XGBoost, and neural network algorithms. The SGD algorithm had an R2 of 0.61, RMSE of 12.69, and MAE of 10.54. The KNN had an R2 of 0.72 (Figure 10e). The lowest performance was observed in the multilinear regression algorithm (Figure 10f), where the R2 was 0.47, RMSE was 14.77, and MAE was 11.90. All the machine learning algorithms performed well in the soybean field except the multilinear regression.
Figure 11 shows the dielectric constant estimated result from the machine learning in the wheat field. Here, the random forest shown in Figure 11a performed well with an R2 of 0.82, RMSE of 3.21, and MAE of 2.08. The decision tree algorithm performance, as shown in Figure 11b, was not effective in the wheat field with an R2 of 0.60, RMSE of 4.85, and MAE of 3.20. The XGBoost algorithm shown in Figure 11c performed better in estimating the dielectric constant from the wheat field with an R2 of 0.66, RMSE of 4.46, and MAE of 2.90. The stochastic gradient descent performed better than the decision tree; the SGD shown in Figure 11d had an R2 of 0.62, RMSE of 4.73, and MAE of 3.26. The KNN algorithm performance in the wheat field was not as effective; Figure 11e shows that the KNN had an R2 of 0.56, RMSE of 5.07, and MAE of 3.66. The multilinear regression did not perform well, having an R2 of 0.48, RMSE of 5.53, and MAE of 3.70 in Figure 11f. The neural network algorithm shown in Figure 11g shows an R2 of 0.59, RMSE of 4.92, and MAE of 3.65. The neural network performance was better than the K-nearest neighbor and multilinear regression.
The ML results for the corn field are shown in Figure 12. The random forest (Figure 12a) has shown good results with an R2 of 0.77, RMSE of 1.77, and MAE of 1.45. In contrast, the decision tree algorithm in Figure 12b drastically changed to negative, showing an R2 of −0.20, RMSE of 4.08, and MAE of 3.10. The XGBoost algorithm (Figure 12c) shows an R2 of 0.45, RMSE of 2.77, and MAE of 2.07, and it performed better than the decision tree, multilinear regression, and neural network. The stochastic gradient descent algorithm shown in Figure 12d shows an R2 of 0.43, RMSE of 2.62, and MAE of 2.24, which is better than the multilinear regression and neural network. The K-nearest neighbor performed better than the XGBoost, decision tree, SGD, MLR, and NN. The KNN was the second-best performer in estimating the dielectric constant in the corn field with an R2 of 0.64, RMSE of 2.23, and MAE of 1.76, as shown in Figure 12e.
The MLR model did not perform well with the corn data and provided an R2 of −0.24, RMSE of 4.14, and MAE of 3.49, as shown in Figure 12f. The neural network model performance was better than the decision tree and multilinear regression, with an R2 of 0.42, RMSE of 2.82, and MAE of 2.18, as shown in Figure 12g.
The estimation of soil moisture in the soybean field is shown in Figure 13. Here, the random forest (Figure 13a) performed extensively well, with an R2 of 0.92, RMSE of 0.042, and MAE of 0.03. The decision tree (Figure 13b) performed well but not as much, with an R2 of 0.83, RMSE of 0.061, and MAE of 0.04. The XGBoost performance was outstanding, with an R2 of 0.91, RMSE of 0.046, and MAE of 0.03. It performed well compared to all the algorithms except random forest, as shown in Figure 13c. The SGD estimated the soil moisture with an R2 of 0.65, RMSE of 0.088, and MAE of 0.07. The SGD performance is better than the MLR model in this case, as shown in Figure 13d. The KNN performance was better than the SGD, MLR, and NN. The KNN shows an R2 of 0.72, RMSE of 0.078, and MAE of 0.05, as shown in Figure 13e. The MLR in Figure 13f shows an R2 of 0.62, RMSE of 0.092, and MAE of 0.07, which is not significant compared to other models. The least performer in the soybean field is the neural network, with an R2 of 0.61, RMSE of 0.093, and MAE of 0.07, as shown in Figure 13g.
The random forest performed very well in the estimation of soil moisture in the wheat field with an R2 of 0.91, RMSE of 0.027, and MAE of 0.02, as shown in Figure 14a. The second-best performing algorithm is the decision tree (Figure 14b), with an R2 of 0.85, RMSE of 0.035, and MAE of 0.03. The XGBoost model did not perform very well, showing an R2 of 0.62, RMSE of 0.056, and MAE of 0.05 in Figure 14c, but the XGBoost model performed better than the neural network. The SGD algorithm (Figure 14d) had an R2 of 0.69, RMSE of 0.051, and MAE of 0.04, performing similarly to KNN but better than XGBoost. The KNN results shown in Figure 14e demonstrate an R2 of 0.69, RMSE of 0.050, and MAE of 0.04, which are better results than the MLR, NN, and XGB models. The MLR algorithm also performed well in estimating the soil moisture in the wheat field with an R2 of 0.65, RMSE of 0.054, and MAE of 0.04, as shown in Figure 14f, better than XGB and NN.
The neural network performed the least in this case, with an R2 of 0.40, RMSE of 0.070, and MAE of 0.06, as shown in Figure 14g. In estimating soil moisture in the wheat field, the random forest again performed better without underfitting or overfitting the model.
The ML results from the corn field are shown in Figure 15. The random forest in Figure 15a had an R2 of 0.80, RMSE of 0.028, and MAE of 0.02, which means the performance is again good and satisfactory compared to all other algorithms. The decision tree modeled the soil moisture in the corn field with an R2 of 0.73, RMSE of 0.033, and MAE of 0.03, as shown in Figure 15b. The XGBoost (Figure 15c) performance was not satisfactory, with a negative value of R2 of −0.07, RMSE of 0.067, and MAE of 0.06. Here, the performance of the SGD algorithm was better than that of the XGBoost, as well as the MLR and NN. The SGD had an R2 of 0.52, RMSE of 0.045, and MAE of 0.03, as shown in Figure 15d. The KNN model performed well, better than the XGBoost, SGD, MLR, and neural network. The KNN had an R2 of 0.68, RMSE of 0.038, and MAE of 0.03, as shown in Figure 15e. The MLR (Figure 15f) modeling for soil moisture in the corn field was the least effective, with an R2 of −0.11, RMSE of 0.061, and MAE of 0.05. The neural network modeling, as shown in Figure 15g, was the second least effective in soil moisture estimation with an R2 of 0.37, RMSE of 0.051, and MAE of 0.04. All the analyses of the modelled output of soil dielectric constant and soil moisture (considering vegetation effect in the model) are combined in Table 3.

4. Discussion

It is significant to highlight that the models’ effectiveness varied depending on the agricultural fields and machine learning techniques used. The findings imply that certain models could be more appropriate for particular crops or datasets. Further investigation, validation, and comparison with other models are required to determine if the machine learning techniques for estimating soil dielectric constant are robust and generalizable. The choice of features, model improvement, and possible errors caused during data collecting should all be considered.
The analysis of feature importance derived from the random forest model provides valuable insights into the relative significance of various features in predicting the target variable, as shown in Figure 16. The feature that holds the most significant influence is the Van Zyl dihedral, as indicated by its high importance score of 0.294013. The score of 0.213926 is closely associated with anisotropy, signifying its significant influence on the predictions. Soil roughness RMS and correlation length are notable contributors, with respective scores of 0.136134 and 0.13075. These four characteristics are identified as the principal factors that propel the model’s prediction. The features of entropy, Van Zyl surface, Freeman-Durden volume, and Freeman-Durden dihedral exhibit a moderate level of significance, whereas the remaining features demonstrate comparatively lower importance scores. In general, comprehending the relative significance of these characteristics offers valuable perspectives for subsequent investigation and decision-making pertaining to the given dataset.
The random forest performance was best in all the scenarios. The soil dielectric constant and soil moisture map was generated using the random forest in Figure 17, which provides the spatial distribution of soil dielectric constant and soil moisture across the study area. The model combines the information from various polarimetric input features to predict the soil moisture values at unsampled locations. The darker shades denote the lower soil moisture, and the lighter shades denote the higher soil moisture. It is important to highlight that the accuracy and reliability of the soil moisture map are dependent on the quality and representativeness of the input data, as well as the random forest model’s success in capturing the underlying relationships. Validation and comparison with ground-truth measurements or other reference data sources improved the resulting map’s accuracy.
The study aimed to assess the potential of fully polarimetric synthetic aperture radar data to estimate soil dielectric constant and moisture values at L-band. It also demonstrated the usefulness of decomposition techniques and machine learning models in estimating soil moisture with or without the surface roughness parameter. The study was conducted in Winnipeg, Canada, using simulated NISAR data, which sets it apart from prior studies. Over 20 input features, such as Freeman-Durden decomposition, Van Zyl decomposition, H/A/alpha, polarization ratio, and surface parameter, were used to train seven different machine learning algorithms. Akhavan et al., 2021 [39] conducted a study on airborne UAVSAR data and used a feature selection approach to choose input variables for the machine learning model. They used random forest and neural network algorithms to model soil moisture and found that random forests performed better than neural networks. The best accuracy was achieved in the soybean field with R2 = 0.86, while the lowest accuracy was seen in a corn field with R2 = 0.40. The study concluded that the number of input features doesn’t necessarily lead to better accuracy and that the surface roughness parameter is an important variable for soil moisture modeling. Wang et al., 2017 [41] compared the Freeman-Durden, Hajnsek, and An decomposition techniques for soil moisture retrieval over corn, canola, soybean, and wheat fields. They removed the volume scattering component and normalised the incidence angle. The best result had an RMSE of 0.06 m3/m3, while the lowest was an RMSE of 0.11 m3/m3. Wang et al., 2016 [51] removed the volume scattering contribution and focused on surface scattering to estimate soil moisture. They used airborne UAVSAR and Sampex12 data for various crop phenology stages. Using a simplified polarimetric decomposition, they found an inversion rate of 26–38% and an RMSE of 0.06–0.12 m3/m3 for soil moisture retrieval.
The present study examines the previous literature and understands that the impact of decomposition techniques and machine learning algorithms is crucial in estimating the soil dielectric constant and soil moisture. Thus, these techniques are implemented in the current study. All three scattering components (surface, dihedral, and volume) were used without removing any of them in the model, and no feature selection approach (such as trial and error, backward feature selection, or forward feature selection) was utilised, as previous authors have shown that it does not affect the overall accuracy of the model. The lack of effect on the results was observed with the feature selection method because it performs the same process of selecting the best features iteratively, which already exists in ML models, adding nothing new to the results. The feature selection method might select features that work well with the training dataset but not for unseen data, which can lead to overfitting. Conversely, removing too many features might lead to underfitting. Some features, even if they are not strongly predictive individually, can significantly contribute when combined with other features. One of the main reasons for not considering the feature selection method was that machine learning is capable on its own of prioritizing or ranking the best features without discarding any features, maximising model performance. One example is the random forest, which provides feature importance. The study estimated soil dielectric constant and soil moisture values, identifying the Van Zyl dihedral, anisotropy, RMS-H, and correlation length as the most significant features. The results were evaluated both with and without the surface roughness parameter.
The incorporation of vegetation correction using the Water Cloud Model (WCM) has consistently yielded better results compared to those obtained without this correction. The WCM integrates parameters such as Leaf Area Index (LAI) and Plant Water Content (PWC). It is found that the Water Cloud Model is highly suitable for incorporating these parameters, making it ideal for soil moisture studies. The results of our random forest analysis show significant improvements after implementing the WCM. For the soybean field, the R2 increased from 0.89 to 0.92; for the wheat field, from 0.88 to 0.91; and for the corn field, from 0.78 to 0.80. Similarly, the root mean square error (RMSE) values decreased across these fields, from 0.050 to 0.042 for soybean, from 0.031 to 0.027 for wheat, and from 0.030 to 0.028 for corn. These improvements highlight the consistent enhancement of results post-WCM implementation.
All the algorithms demonstrated better consistent results. However, a few algorithms did not exhibit uniform improvements. Specifically, the decision tree algorithm for the soybean field saw a decrease in R2 from 0.88 to 0.83, the KNN algorithm for the corn field decreased from an R2 of 0.68 to 0.66, and the neural network for the wheat field showed an R2 drop from 0.75 to 0.40. These exceptions are attributed to the inherent limitations of certain algorithms in handling the complexity added by the vegetation correction parameters. Decision trees, for instance, are sensitive to noise and prone to overfitting; a very deep decision tree might be overfitted to noise in the training data, explaining the decreased performance in the soybean field. The KNN algorithm, known for its simplicity, struggled with the additional dimensions introduced by the WCM and RVI, leading to reduced accuracy in the corn field. Additionally, the limited data for the corn field may have resulted in underfitting. On the other hand, neural networks, which require extensive training data and fine-tuning, may also be impacted by the choice of activation functions and overfitting, which can lead to learning noise in the training data and result in artifacts in the predictions. This explains the drop in performance for the wheat field.
The implementation of the Water Cloud Model has proven to be beneficial, significantly improving the performance of most algorithms in soil dielectric constant and soil moisture estimation by effectively accounting for vegetation attenuation.
The simulated NISAR L-band data (provided by the NASA-JPL) used in this study replicate future NISAR data characteristics and help to understand the quality of future NISAR products. The future NISAR quad-polarization data will be available for regions like India and the United States [52], with an L-band frequency of 1.257 GHz. The L-SAR system provides a 7-m resolution along its direction of travel (along a track) and a resolution between 2 and 8 m across the track, depending on the viewing mode [53]. On the other hand, the future European Space Agency ROSE-L mission has a spatial resolution of 5–10 m, which will use advanced radar techniques, including full polarimetry, with a frequency of 1.257 GHz for soil moisture and forestry applications [54]. The current study aims to provide essential insights that will be helpful for researchers in the data analysis of the upcoming mission.

5. Conclusions

The results were obtained for three crop types: wheat, soybean, and corn fields. Random forest showed the best results for the soybean field with a coefficient of determination of 0.89, RMS of 0.033 m3/m3, MAE of 0.001 m3/m3, and MBE of 0.023 m3/m3. For the wheat field, a coefficient of determination of 0.87 was observed. Random forest performed better in the soybean field compared to the wheat field, but it still outperformed the other machine learning algorithms. Adverse performance was reported for neural networks, XGBoost, and multilinear regression. For soil dielectric constant estimation, the random forest again provided better results in the soybean crop field with an R2 of 0.89, RMSE of 6.79, and MAE of 5.15. Decision trees, SGD, and KNN also worked better in corn and wheat fields. The results were good in the soybean field due to the lower amount of vegetation present. The corn field model performed less accurately than the other fields due to vegetation and a lower amount of in-situ soil moisture than the other fields. To analyse the effect of vegetation, two-way attenuation from WCM and RVI parameters were incorporated into the ML model. The results observed while considering the vegetation effects were better as compared with the results without considering the vegetation effect. The random forest again showed the best results for the soybean with a coefficient of determination of 0.92 m3/m3, RMSE of 0.042 m3/m3, and MAE of 0.03 m3/m3. The random forest algorithm demonstrates the best overall performance for predicting soil moisture even without considering RMS-H and correlation length surface roughness parameters, particularly for wheat with an R2 of 0.87 and a low RMSE of 0.032 m3/m3. Other Algorithms also performed well. One of the reasons the random forest performed better than other ML algorithms was its ability to handle a larger number of input variables without overfitting. The most important input parameters for soil moisture estimation identified by random forest were Van Zyl dihedral, anisotropy, surface roughness, correlation length, entropy, Van Zyl surface, and Freeman-Durden volume. The soil moisture was predicted for the whole study area, and a range between 0.10 to 0.50 m3/m3 soil moisture is estimated using the random forest model. Dielectric and soil moisture maps were generated for the study area. The methodology applied in the current research contributes essential insights that could benefit upcoming missions, such as the Radar Observing System for Europe in L-band (ROSE-L) and the collaborative NASA-ISRO SAR (NISAR) mission, for future data analysis in soil moisture applications. The datasets from these NISAR and ROSE-L missions, when available in the future, can be used in a similar way for the soil moisture estimation as presented in the study. Future researchers can benefit from our study when handling the actual NISAR or ROSE-L datasets. The following points illustrate how the findings of this study assist researchers in the analysis of future datasets:
  • To retrieve the dielectric constant and soil moisture in different types of crop fields using seven different ML algorithms incorporated by the SAR decomposition models from fully polarimetric L-band data.
  • The random forest algorithm performed better than the decision tree, XGBoost, SGD, KNN, NN, and MLR algorithms in handling complex nonlinear data.
  • The incorporation of vegetation correction using the Water Cloud Model (WCM) has consistently yielded better results than those obtained without this correction.
  • The Water Cloud Model is highly suitable for incorporating Leaf Area Index (LAI) and Plant Water Content (PWC) parameters, making it ideal for soil moisture studies within vegetated areas using the L-band data.
  • The surface roughness parameter significantly plays an important role when modelling SAR-based soil moisture. The study estimates soil moisture with and without the incorporation of the surface roughness parameter in the model.

Author Contributions

D.D.: conceptualization, methodology, software, investigation, validation, formal analysis, visualization, writing—original draft preparation, writing—review and editing. S.K.: conceptualization, resources, methodology, software, investigation, validation, formal analysis, visualization, writing—original draft preparation, writing—review and editing, supervision, project administration. S.S.: conceptualization, writing—review and editing, supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The Simulated NISAR datasets are freely available at https://uavsar.jpl.nasa.gov/. All the datasets used in this study are available on the provided website. Python programming language was used in this study which is available to download from https://www.python.org/. The following open source Python packages were used during the study: sklearn, pandas, folium, gdal, seaborn, scipy.

Acknowledgments

The authors are grateful to the National Aeronautics and Space Administration (NASA) for generously providing the Quad Polarimetric NISAR Simulated datasets. This data source played a significant role in our research and contributed to the depth and breadth of our analysis. The authors extend their heartfelt gratitude to all those involved for their contributions and support. The authors would like to express their sincere appreciation to the crews of the SMAPVEX12 project for their dedicated efforts in collecting the valuable dataset used in this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Babaeian, E.; Sadeghi, M.; Jones, S.B.; Montzka, C.; Vereecken, H.; Tuller, M. Ground, Proximal, and Satellite Remote Sensing of Soil Moisture. Rev. Geophys. 2019, 57, 530–616. [Google Scholar] [CrossRef]
  2. Robinson, D.A.; Abdu, H.; Lebron, I.; Jones, S.B. Imaging of Hill-Slope Soil Moisture Wetting Patterns in a Semi-Arid Oak Savanna Catchment Using Time-Lapse Electromagnetic Induction. J. Hydrol. 2012, 416–417, 39–49. [Google Scholar] [CrossRef]
  3. Li, H.; Robock, A.; Wild, M. Evaluation of Intergovernmental Panel on Climate Change Fourth Assessment Soil Moisture Simulations for the Second Half of the Twentieth Century. J. Geophys. Res. Atmos. 2007, 112, 1–15. [Google Scholar] [CrossRef]
  4. Vereecken, H.; Huisman, J.A.; Pachepsky, Y.; Montzka, C.; van der Kruk, J.; Bogena, H.; Weihermüller, L.; Herbst, M.; Martinez, G.; Vanderborght, J. On the Spatio-Temporal Dynamics of Soil Moisture at the Field Scale. J. Hydrol. 2014, 516, 76–96. [Google Scholar] [CrossRef]
  5. Xing, M.; He, B.; Ni, X.; Wang, J.; An, G.; Shang, J.; Huang, X. Retrieving Surface Soil Moisture over Wheat and Soybean Fields during Growing Season Using Modified Water Cloud Model from Radarsat-2 SAR Data. Remote Sens. 2019, 11, 1956. [Google Scholar] [CrossRef]
  6. He, B.; Xing, M.; Bai, X. A Synergistic Methodology for Soil Moisture Estimation in an Alpine Prairie Using Radar and Optical Satellite Data. Remote Sens. 2014, 6, 10966–10985. [Google Scholar] [CrossRef]
  7. Hajnsek, I.; Jagdhuber, T.; Schon, H.; Papathanassiou, K.P. Potential of Estimating Soil Moisture Under Vegetation Cover by Means of PolSAR. IEEE Trans. Geosci. Remote Sens. 2009, 47, 442–454. [Google Scholar] [CrossRef]
  8. Chen, L.; Xing, M.; He, B.; Wang, J.; Shang, J.; Huang, X.; Xu, M. Estimating Soil Moisture Over Winter Wheat Fields During Growing Season Using Machine-Learning Methods. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 3706–3718. [Google Scholar] [CrossRef]
  9. Chauhan, N.S.; Miller, S.; Ardanuy, P. Spaceborne Soil Moisture Estimation at High Resolution: A Microwave-Optical/IR Synergistic Approach. Int. J. Remote Sens. 2003, 24, 4599–4622. [Google Scholar] [CrossRef]
  10. Ahmed, A.; Zhang, Y.; Nichols, S. Review and Evaluation of Remote Sensing Methods for Soil-Moisture Estimation. SPIE Rev. 2011, 2, 28001. [Google Scholar] [CrossRef]
  11. Barrett, B.W.; Dwyer, E.; Whelan, P. Soil Moisture Retrieval from Active Spaceborne Microwave Observations: An Evaluation of Current Techniques. Remote Sens. 2009, 1, 210–242. [Google Scholar] [CrossRef]
  12. Sehler, R.; Li, J.; Reager, J.T.; Ye, H. Investigating Relationship Between Soil Moisture and Precipitation Globally Using Remote Sensing Observations. J. Contemp. Water Res. Educ. 2019, 168, 106–118. [Google Scholar] [CrossRef]
  13. Lakshmi, V. Remote Sensing of Soil Moisture. ISRN Soil Sci. 2013, 2013, 424178. [Google Scholar] [CrossRef]
  14. Walker, J.P.; Houser, P.R.; Willgoose, G.R. Active Microwave Remote Sensing for Soil Moisture Measurement: A Field Evaluation Using ERS-2. Hydrol. Process. 2004, 18, 1975–1997. [Google Scholar] [CrossRef]
  15. Zhang, L.; Lv, X.; Wang, R. Soil Moisture Estimation Based on Polarimetric Decomposition and Quantile Regression Forests. Remote Sens. 2022, 14, 4183. [Google Scholar] [CrossRef]
  16. Minasny, B. Microwave Dielectric Behavior of Wet Soils; Springer: Dordrecht, The Netherlands, 2006; Volume 133, ISBN 1402032714. [Google Scholar]
  17. Kabir, H.; Khan, M.J.; Brodie, G.; Gupta, D.; Pang, A.; Jacob, M.V.; Antunes, E. Measurement and Modelling of Soil Dielectric Properties as a Function of Soil Class and Moisture Content. J. Microw. Power Electromagn. Energy 2020, 54, 3–18. [Google Scholar] [CrossRef]
  18. Wang, J.R.; Schmugge, T.J. An Empirical Model for the Complex Dielectric Permittivity of Soils as a Function of Water Content. IEEE Trans. Geosci. Remote Sens. 1980, GE-18, 288–295. [Google Scholar] [CrossRef]
  19. Singh, A.; Meena, G.K.; Kumar, S.; Gaurav, K. Analysis of the Effect of Incidence Angle and Moisture Content on the Penetration Depth of L- and S-Band SAR Signals into the Ground Surface. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2018, 4, 197–202. [Google Scholar] [CrossRef]
  20. Dobson, M.C.; Kouyate, F.; Ulaby, F.T. A Reexamination of Soil Textural Effects on Microwave Emission and Backscattering. IEEE Trans. Geosci. Remote Sens. 1983, GE-22, 530–536. [Google Scholar] [CrossRef]
  21. Hallikainen, M.T.; Ulabz, F.T.; Dobson, M.C.; El-Rayes, M.A.; Wu, L.K. Microwave Dielectric Behavior of Wet Soil-Part I: Empirical Models and Experimental Observations. IEEE Trans. Geosci. Remote Sens. 1985, GE-23, 25–34. [Google Scholar] [CrossRef]
  22. Mironov, V.L.; Kosolapova, L.G.; Fomin, S. V Physically and Mineralogically Based Spectroscopic Dielectric Model for Moist Soils. IEEE Trans. Geosci. Remote Sens. 2009, 47, 2059–2070. [Google Scholar] [CrossRef]
  23. Mironov, V.L.; Kosolapova, L.G.; Savin, I.V.; Muzalevskiy, K.V. Temperature Dependent Dielectric Model at 1.4 GHz for a Tundra Organic-Rich Soil Thawed and Frozen. In Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Milan, Italy, 26–31 July 2015; pp. 2016–2019. [Google Scholar]
  24. Park, C.-H.; Montzka, C.; Jagdhuber, T.; Jonard, F.; De Lannoy, G.; Hong, J.; Jackson, T.J.; Wulfmeyer, V. A Dielectric Mixing Model Accounting for Soil Organic Matter. Vadose Zone J. 2019, 18, 190036. [Google Scholar] [CrossRef]
  25. Park, C.H.; Behrendt, A.; LeDrew, E.; Wulfmeyer, V. New Approach for Calculating the Effective Dielectric Constant of the Moist Soil for Microwaves. Remote Sens. 2017, 9, 732. [Google Scholar] [CrossRef]
  26. Liao, T.-H.; Kim, S.-B.; Handwerger, A.L.; Fielding, E.J. Soil Moisture Retrieval Using L-Band SAR Over Landslide Regions in Northern California Grasslands. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; pp. 5881–5883. [Google Scholar]
  27. Shi, H.; Lopez-Sanchez, J.M.; Yang, J.; Li, P.; Zhao, L.; Zhao, J. Contribution of Polarimetry and Multi-Incidence to Soil Moisture Estimation Over Agricultural Fields Based on Time Series of L-Band SAR Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 300–313. [Google Scholar] [CrossRef]
  28. Burgin, M.S.; van Zyl, J.J. Regression-Based Soil Moisture Estimation from SMAP Polarimetric Radar Data with Aquarius Derived Coefficients. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; pp. 4136–4138. [Google Scholar]
  29. Bindlish, R.; Barros, A.P. Parameterization of Vegetation Backscatter in Radar-Based, Soil Moisture Estimation. Remote Sens. Environ. 2001, 76, 130–137. [Google Scholar] [CrossRef]
  30. Dobson, M.C.; Ulaby, F.T. Active Microwave Soil Moisture Research. IEEE Trans. Geosci. Remote Sens. 1986, GE-24, 23–36. [Google Scholar] [CrossRef]
  31. Zribi, M.; Chahbi, A.; Shabou, M.; Lili-Chabaane, Z.; Duchemin, B.; Baghdadi, N.; Amri, R.; Chehbouni, A. Soil Surface Moisture Estimation over a Semi-Arid Region Using ENVISAT ASAR Radar Data for Soil Evaporation Evaluation. Hydrol. Earth Syst. Sci. 2011, 15, 345–358. [Google Scholar] [CrossRef]
  32. Singh, S.K.; Prasad, R.; Srivastava, P.K.; Yadav, S.A.; Yadav, V.P.; Sharma, J. Incorporation of First-Order Backscattered Power in Water Cloud Model for Improving the Leaf Area Index and Soil Moisture Retrieval Using Dual-Polarized Sentinel-1 SAR Data. Remote Sens. Environ. 2023, 296, 113756. [Google Scholar] [CrossRef]
  33. Bhogapurapu, N.; Dey, S.; Mandal, D.; Bhattacharya, A.; Karthikeyan, L.; McNairn, H.; Rao, Y.S. Soil Moisture Retrieval over Croplands Using Dual-Pol L-Band GRD SAR Data. Remote Sens. Environ. 2022, 271, 112900. [Google Scholar] [CrossRef]
  34. Zribi, M.; Muddu, S.; Bousbih, S.; Al Bitar, A.; Tomer, S.K.; Baghdadi, N.; Bandyopadhyay, S. Analysis of L-Band SAR Data for Soil Moisture Estimations over Agricultural Areas in the Tropics. Remote Sens. 2019, 11, 1122. [Google Scholar] [CrossRef]
  35. Srivastava, H.S.; Sivasankar, T.; Gavali, M.D.; Patel, P. Soil Moisture Estimation underneath Crop Cover Using High Incidence Angle C-Band Sentinel-1 SAR Data. Kuwait J. Sci. 2024, 51, 100101. [Google Scholar] [CrossRef]
  36. Hajj, M.E.; Baghdadi, N.; Zribi, M.; Bazzi, H. Synergic Use of Sentinel-1 and Sentinel-2 Images for Operational Soil Moisture Mapping at High Spatial Resolution over Agricultural Areas. Remote Sens. 2017, 9, 1292. [Google Scholar] [CrossRef]
  37. Carranza, C.; Nolet, C.; Pezij, M.; van der Ploeg, M. Root Zone Soil Moisture Estimation with Random Forest. J. Hydrol. 2021, 593, 125840. [Google Scholar] [CrossRef]
  38. Chaudhary, S.K.; Srivastava, P.K.; Gupta, D.K.; Kumar, P.; Prasad, R.; Pandey, D.K.; Das, A.K.; Gupta, M. Machine Learning Algorithms for Soil Moisture Estimation Using Sentinel-1: Model Development and Implementation. Adv. Sp. Res. 2022, 69, 1799–1812. [Google Scholar] [CrossRef]
  39. Akhavan, Z.; Hasanlou, M.; Hosseini, M.; McNairn, H. Decomposition-Based Soil Moisture Estimation Using Uavsar Fully Polarimetric Images. Agronomy 2021, 11, 145. [Google Scholar] [CrossRef]
  40. McNairn, H.; Jackson, T.J.; Wiseman, G.; Bélair, S.; Berg, A.; Bullock, P.; Colliander, A.; Cosh, M.H.; Kim, S.B.; Magagi, R.; et al. The Soil Moisture Active Passive Validation Experiment 2012 (SMAPVEX12): Prelaunch Calibration and Validation of the SMAP Soil Moisture Algorithms. IEEE Trans. Geosci. Remote Sens. 2015, 53, 2784–2801. [Google Scholar] [CrossRef]
  41. Wang, H.; Magagi, R.; Goita, K. Comparison of Different Polarimetric Decompositions for Soil Moisture Retrieval over Vegetation Covered Agricultural Area. Remote Sens. Environ. 2017, 199, 120–136. [Google Scholar] [CrossRef]
  42. Cloude, S.R.; Pottier, E. An Entropy Based Classification Scheme for Land Applications of Polarimetric SAR. IEEE Trans. Geosci. Remote Sens. 1997, 35, 68–78. [Google Scholar] [CrossRef]
  43. Kumar, S.; Siqueira, P.; Govil, H.; Agrawal, S. Spaceborne Synthetic Aperture Radar Remote Sensing: Techniques and Applications; CRC Press: Boca Raton, FL, USA, 2023; pp. 1–418. [Google Scholar] [CrossRef]
  44. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  45. Mukhopadhyay, R.; Kumar, S.; Aghababaei, H.; Kulshrestha, A. Estimation of Aboveground Biomass from PolSAR and PolInSAR Using Regression-Based Modelling Techniques. Geocarto Int. 2022, 37, 4181–4207. [Google Scholar] [CrossRef]
  46. Pekel, E. Estimation of Soil Moisture Using Decision Tree Regression. Theor. Appl. Climatol. 2020, 139, 1111–1119. [Google Scholar] [CrossRef]
  47. Zhang, T. Solving Large Scale Linear Prediction Problems Using Stochastic Gradient Descent Algorithms. In Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada, 4–8 July 2004; Association for Computing Machinery: New York, NY, USA, 2004; p. 116. [Google Scholar]
  48. Jia, Y.; Jin, S.; Savi, P.; Gao, Y.; Tang, J.; Chen, Y.; Li, W. GNSS-R Soil Moisture Retrieval Based on a XGboost Machine Learning Aided Method: Performance and Validation. Remote Sens. 2019, 11, 1655. [Google Scholar] [CrossRef]
  49. Ali, I.; Greifeneder, F.; Stamenkovic, J.; Neumann, M.; Notarnicola, C. Review of Machine Learning Approaches for Biomass and Soil Moisture Retrievals from Remote Sensing Data. Remote Sens. 2015, 7, 16398–16421. [Google Scholar] [CrossRef]
  50. Eroglu, O.; Kurum, M.; Boyd, D.; Gurbuz, A.C. High Spatio-Temporal Resolution CYGNSS Soil Moisture Estimates Using Artificial Neural Networks. Remote Sens. 2019, 11, 2272. [Google Scholar] [CrossRef]
  51. Wang, H.; Magagi, R.; Goita, K.; Jagdhuber, T.; Hajnsek, I. Evaluation of Simplified Polarimetric Decomposition for Soil Moisture Retrieval over Vegetated Agricultural Fields. Remote Sens. 2016, 8, 142. [Google Scholar] [CrossRef]
  52. Polarimetry|Get to Know SAR—NASA-ISRO SAR Mission (NISAR). Available online: https://nisar.jpl.nasa.gov/mission/get-to-know-sar/polarimetry/ (accessed on 29 June 2024).
  53. Radars|Observatory—NASA-ISRO SAR Mission (NISAR). Available online: https://nisar.jpl.nasa.gov/mission/observatory/radars/ (accessed on 29 June 2024).
  54. WMO OSCAR|Details for Instrument ROSE-L. Available online: https://space.oscar.wmo.int/instruments/view/rose_l (accessed on 29 June 2024).
Figure 1. (a) Study area and (b) sampling strategy of SMAPVEX12 campaign (http://smapvex12.espaceweb.usherbrooke.ca/ (accessed on 29 June 2024)).
Figure 1. (a) Study area and (b) sampling strategy of SMAPVEX12 campaign (http://smapvex12.espaceweb.usherbrooke.ca/ (accessed on 29 June 2024)).
Remotesensing 16 03539 g001
Figure 2. Methodology for the estimation of dielectric constant and soil moisture using machine leaning modelling.
Figure 2. Methodology for the estimation of dielectric constant and soil moisture using machine leaning modelling.
Remotesensing 16 03539 g002
Figure 3. Correlation between soil moisture and other polarimetric features. (a) soybean field, (b) wheat field, and (c) corn field.
Figure 3. Correlation between soil moisture and other polarimetric features. (a) soybean field, (b) wheat field, and (c) corn field.
Remotesensing 16 03539 g003
Figure 4. Soil dielectric constant retrieval from a soybean field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 4. Soil dielectric constant retrieval from a soybean field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g004
Figure 5. Soil dielectric constant retrieval from a wheat field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multilinear regression, and (g) neural network.
Figure 5. Soil dielectric constant retrieval from a wheat field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multilinear regression, and (g) neural network.
Remotesensing 16 03539 g005
Figure 6. Soil dielectric constant retrieval from a corn field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multilinear regression, and (g) neural network.
Figure 6. Soil dielectric constant retrieval from a corn field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multilinear regression, and (g) neural network.
Remotesensing 16 03539 g006
Figure 7. Soil moisture retrieval from soybean field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multilinear regression, and (g) neural network.
Figure 7. Soil moisture retrieval from soybean field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multilinear regression, and (g) neural network.
Remotesensing 16 03539 g007
Figure 8. Soil moisture retrieval from the wheat field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) SGD, (d) KNN, (e) MLR, (f) XGBoost, and (g) neural network.
Figure 8. Soil moisture retrieval from the wheat field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) SGD, (d) KNN, (e) MLR, (f) XGBoost, and (g) neural network.
Remotesensing 16 03539 g008
Figure 9. Soil moisture retrieval from corn field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) SGD, (d) KNN, (e) MLR, (f) XGBoost, and (g) neural network.
Figure 9. Soil moisture retrieval from corn field without considering vegetation effects. (a) Random forest, (b) decision tree, (c) SGD, (d) KNN, (e) MLR, (f) XGBoost, and (g) neural network.
Remotesensing 16 03539 g009
Figure 10. Soil dielectric constant retrieval from soybean field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 10. Soil dielectric constant retrieval from soybean field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g010
Figure 11. Soil dielectric constant retrieval from wheat field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 11. Soil dielectric constant retrieval from wheat field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g011
Figure 12. Soil dielectric constant retrieval from corn field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 12. Soil dielectric constant retrieval from corn field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g012
Figure 13. Soil moisture retrieval from soybean field considering vegetation effects. (a) Random forest, (b) decision tree (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 13. Soil moisture retrieval from soybean field considering vegetation effects. (a) Random forest, (b) decision tree (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g013
Figure 14. Soil moisture retrieval from the wheat field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 14. Soil moisture retrieval from the wheat field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g014
Figure 15. Soil moisture retrieval from the corn field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Figure 15. Soil moisture retrieval from the corn field considering vegetation effects. (a) Random forest, (b) decision tree, (c) extreme gradient boosting, (d) stochastic gradient descent, (e) K-nearest neighbor, (f) multiple linear regression, and (g) neural network.
Remotesensing 16 03539 g015
Figure 16. Feature importance in random forest.
Figure 16. Feature importance in random forest.
Remotesensing 16 03539 g016
Figure 17. Estimated soil dielectric constant and soil moisture using random forest.
Figure 17. Estimated soil dielectric constant and soil moisture using random forest.
Remotesensing 16 03539 g017
Table 1. Soil Moisture Measurement Available Dates.
Table 1. Soil Moisture Measurement Available Dates.
Date(Flight Line ID: 31606)
Flight ID
Soil Moisture and Soil Dielectric Constant In-Situ Measurement AvailableSurface Roughness In-Situ Measurement AvailableQuadPol SAR Simulated NISAR Products Availabale
7th June --Yes----
10th June ----Yes--
11th June ----Yes--
12th June --YesYes--
13th June ----Yes--
15th June --YesYes--
16th June ----Yes--
17th June Flight 12044YesYesYes
18th June ----Yes--
19th June ----YesYes
21st June ----Yes--
22nd June Flight 12046Yes--Yes
23rd June Flight 12047Yes--Yes
24th June ----Yes--
25th June Flight 12048Yes----
27th June Flight 12049Yes--Yes
29th June Flight 12050Yes--Yes
30th June ----YesYes
3rd July Flight 12055Yes--Yes
5th July Flight 12056Yes--Yes
7th July ----Yes--
8th July Flight 12057Yes--Yes
10th July --Yes--Yes
13th July Flight 12059Yes--Yes
14th July Flight 12060Yes--Yes
17th July Flight 12061Yes--Yes
19th July --Yes----
Table 2. Analysis of modelled output of soil dielectric constant and soil moisture.
Table 2. Analysis of modelled output of soil dielectric constant and soil moisture.
Soil Dielectric ConstantSoil Moisture Soil Moisture
without RMS-H and Correlation Length Variable
AlgorithmsField TypeR2RMSEMAE R2RMSE (m3/m3)MAE (m3/m3)R2RMSE (m3/m3)MAE (m3/m3)
Random ForestSoybeans0.896.795.150.890.0500.040.450.1110.09
Wheat0.734.012.670.880.0310.020.870.0320.02
Corn0.721.961.620.780.0300.020.750.0320.03
Decision TreeSoybeans0.848.166.050.880.0510.040.360.1190.08
Wheat0.584.983.530.770.0440.030.660.0530.44
Corn0.303.102.420.340.0520.040.450.0480.04
XGBoostSoybeans0.6512.058.280.770.0720.050.360.1190.09
Wheat0.545.213.450.580.0590.050.540.0610.05
Corn0.472.711.94−0.250.0720.06−0.330.0740.06
Stochastic Gradient DescentSoybeans0.5813.1110.920.620.0920.080.410.1150.09
Wheat0.624.713.230.680.0520.040.660.0540.04
Corn0.412.862.280.500.0450.040.480.0460.04
KNNSoybeans0.6412.179.010.650.0890.060.430.1130.08
Wheat0.535.283.790.660.0530.040.680.0510.04
Corn0.652.221.860.680.0360.030.560.0430.03
Multilinear RegressionSoybeans0.4515.0611.920.580.0960.080.450.110.09
Wheat0.505.403.850.620.0560.050.600.0580.05
Corn−0.524.583.77−0.070.0670.060.220.0570.05
Neural NetworkSoybeans0.6012.798.840.510.1000.080.250.1380.12
Wheat0.405.944.650.450.0750.050.270.0810.06
Corn0.382.932.260.360.0540.04−0.630.0780.07
Table 3. Analysis of modelled output of soil dielectric constant and soil moisture (considering vegetation effect in the model).
Table 3. Analysis of modelled output of soil dielectric constant and soil moisture (considering vegetation effect in the model).
Soil Dielectric ConstantSoil Moisture
AlgorithmsField TypeR2RMSEMAE R2RMSE (m3/m3)MAE (m3/m3)
Random ForestSoybeans0.896.784.380.920.0420.03
Wheat0.823.212.080.910.0270.02
Corn0.771.771.450.800.0280.02
Decision TreeSoybeans0.887.104.690.830.0610.04
Wheat0.604.853.200.850.0350.03
Corn−0.204.083.100.730.0330.03
XGBoostSoybeans0.7410.277.400.910.0460.03
Wheat0.664.462.900.620.0560.05
Corn0.452.772.07−0.070.0670.06
Stochastic Gradient DescentSoybeans0.6112.6910.540.650.0880.07
Wheat0.624.733.260.690.0510.04
Corn0.432.822.240.520.0450.03
KNNSoybeans0.7210.727.800.720.0780.05
Wheat0.565.073.660.6900500.04
Corn0.642.231.760.660.0380.03
Multilinear RegressionSoybeans0.4714.7711.900.620.0920.07
Wheat0.485.533.700.650.0540.04
Corn−0.244.143.490.110.0610.05
Neural NetworkSoybeans0.7410.357.900.610.0930.07
Wheat0.594.923.650.400.0700.06
Corn0.422.822.180.370.0510.04
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dinesh, D.; Kumar, S.; Saran, S. Machine Learning Modelling for Soil Moisture Retrieval from Simulated NASA-ISRO SAR (NISAR) L-Band Data. Remote Sens. 2024, 16, 3539. https://doi.org/10.3390/rs16183539

AMA Style

Dinesh D, Kumar S, Saran S. Machine Learning Modelling for Soil Moisture Retrieval from Simulated NASA-ISRO SAR (NISAR) L-Band Data. Remote Sensing. 2024; 16(18):3539. https://doi.org/10.3390/rs16183539

Chicago/Turabian Style

Dinesh, Dev, Shashi Kumar, and Sameer Saran. 2024. "Machine Learning Modelling for Soil Moisture Retrieval from Simulated NASA-ISRO SAR (NISAR) L-Band Data" Remote Sensing 16, no. 18: 3539. https://doi.org/10.3390/rs16183539

APA Style

Dinesh, D., Kumar, S., & Saran, S. (2024). Machine Learning Modelling for Soil Moisture Retrieval from Simulated NASA-ISRO SAR (NISAR) L-Band Data. Remote Sensing, 16(18), 3539. https://doi.org/10.3390/rs16183539

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop