5.1. Impacts of Uncertainties in Retrieval Algorithms
In the SMAP mission, the tau-omega model has been applied to describe the components from the soil, and the vegetation canopy contributes to the L-band brightness temperature [
9,
14]. If the air, vegetation, and near surface soil can be assumed to be in thermal equilibrium, then the vegetation temperature (
TC) is approximately equal to soil effective temperature (
Ts), and both temperatures
TC and
Ts can be replaced by the single effective temperature for the scene (
Teff) in the radiative transfer equation. To retrieve soil moisture data, it is necessary to isolate the soil surface emissivity
in Formula (8) by inversion of the tau-omega model.
where
and
and
TBp is the brightness temperature of each SMAP grid cell,
is the nadir vegetation opacity,
is the vegetation effective scattering albedo,
is the surface incidence angle of 40°. Based on the derived value of
, the smooth surface soil emissivity is determined by removing the roughness effects, and then soil moisture can be retrieved based on Fresnel equation and dielectric mixing model. Therefore, the accuracy of
TBp and
Teff, as well as the suitability of
and
are crucial for the retrieval of soil moisture data [
9,
14].
There are uncertainties in estimating both
TBp and
Teff in the SMAP mission. In mountainous areas, angles of incidence in the target area could not be derived solely from the radiometer observation angle, and the surrounding reflection significantly affects brightness temperature simulation [
44]. Nevertheless, both of them have been ignored in the SMAP mission [
14]. Meanwhile, the L3_SM_P_E product with a resolution of 9 km by 9 km, has been retrieved from the downscaled brightness temperatures in the L2_SM_P product [
45]. The downscaling algorithms would cause uncertainties in
TBp estimates, especially in areas with complex land surface conditions [
45,
46]. In the SMAP mission, the effective temperature
Teff has been provided by the GMAO (GSFC Global Modeling and Assimilation Office) model, also with unclear uncertainty. Moreover, the uncertainties of both
TBp and
Teff would propagate through the soil moisture inversion algorithm [
45], which is still unclear and requires further investigations in the future. Therefore, although well catching the temporal trend of in situ observations, both the L3 and L4 products did not achieve the accuracy of 0.04 m
3/m
3 at each overpassing moment in the study area.
The L3 product shows “dry bias” for all the vegetation types at each overpassing moment in this study, which has also been revealed at both footprint scale and global scale in previous studies, especially in mountainous areas (
Table 5). Colliander et al. [
12] attributed the “dry bias” to growth effects of vegetation. However, as shown in
Table 6, both the L3 ascending and descending products underestimated soil moisture for all the vegetation types in both growing seasons (summer and autumn) and non-growing season (spring), with even larger underestimation in non-growing season. It implies that the “dry bias” is more related to the system structure of retrieval algorithms rather than the growth effects. Both
and
are the factors describing vegetation effects, and the retrieval parameters of them have been trained or validated by observations from the CVS [
12], thus they can be assumed approximately precise at the CVS. However, the evaluations of the SMAP product at the CVS also show the “dry bias” [
12]. Therefore, the “dry bias” of the SMAP L3 product is mainly caused by the uncertainties of the
TBp and
Teff estimates rather than other factors. Furthermore, because the assumptions of thermal equilibrium are more likely to be true at the 6 a.m. SMAP overpass [
9,
14], the L3 ascending product fits the observation better than the L3 descending product.
5.2. Impacts of Parameters bp and h under Different Vegetation Types
According to Formula (8),
and
are two important factors related to vegetation effects [
9,
14]. In the SMAP mission, the nadir vegetation opacity
is related to the total columnar vegetation water content
W (kg/m
2) by
with the coefficient
bp dependent on vegetation type [
9,
14]. Both
bp and
have been determined before launch [
12,
14]. Meanwhile, surface soil reflectivity
, related to the soil emissivity
by
, has been smoothed by
in the SMAP retrieval. As a linear function of the root mean square of surface heights, surface roughness
h is also an important factor for soil moisture estimation in mountainous area [
51,
53].
As the parameters
and
bp are functions of vegetation geometry and vegetation water content, both of them are important factors affecting the performance under vegetation conditions. Better characterizing parameter
is important in soil moisture retrievals from space-borne observations [
54,
55,
56,
57]. However, as shown in
Table 7, differing significantly in the values of
, both the SMAP L3 and L4 products show same performance trends under different vegetation types in the study area. Thus, the effects of
on the performance differences of both products under different vegetation types are not discussed in this study.
The
bp factor is more sensitive than surface roughness
h in vegetated condition, thus the
bp factor is the most important parameter in soil moisture retrieval of the SMAP product for vegetation [
58]. In the study area, the main crop is corn in cropland, which has been considered in the SMAP mission [
14,
59], thus leading to best performance in cropland in the study area far beyond the accuracy of 0.04 m
3/m
3. For coniferous forest, the parameters of evergreen needle leaf have been calibrated and validated by the CanExSM10 (The Canadian Experiment for Soil Moisture in 2010) measurements [
14,
60,
61]. The main tree types are
pine and
spruce over the BERMS (Boreal Ecosystem Research and Monitoring Sites) forested sites in CanExSM10 [
61], both of them are
coniferae as well as
picea crassifolia in our study area. Thus, the parameters of forest are suitable in the study area because of the similarity of tree species, resulting in better performance of the SMAP products in coniferous forest far beyond the accuracy of 0.04 m
3/m
3.
The parameters of grasslands have also been considered in the SMAP mission [
14,
62], but performed not as well as in coniferous forest. This is because the
bp factor varies depending on different soil and vegetation conditions [
58], but only average values for each type have been applied in the SMAP mission [
14,
62]. Compared with the coniferous forest, the invariable
bp factor results in worse performance because of stronger intra-annual variation in grasslands (alpine meadow, sparse and dense grasslands). Since the brightness temperature shows higher sensitivity to vegetation under wet soil conditions [
9,
63], the misevaluation impacts of the
bp factor are much more significant in wet conditions [
61,
64]. Because of the wetness conditions of three grasslands (
Table 8 and
Figure 4), the L3 product shows best performance in sparse grassland under the driest condition, then dense grassland, and alpine meadow under the wettest condition.
The SMAP L3 product shows poor performance in estimating soil moisture in shrub land. This is because the dominant species is
potentilla fruticosa in the shrub land in the upper reach of the Heihe River Watershed, which belongs to the vegetation class of open shrublands in the SMAP mission. However, in the study area, the shrubland is often mixed with grassland, which leads to complexity in soil moisture retrieval. Moreover,
potentilla fruticosa shows strong intra-annual variations because of growth effects. Thus, the invariable
bp factor results in errors of soil moisture retrievals in the shrubland. Meanwhile, the brightness temperature is much more sensitive to vegetation under wet soil conditions [
9,
63]. Under the wettest condition with higher soil moisture data in the shrub (
Table 8 and
Figure 4), the misevaluation impacts of the
bp factor is most significant and thus leading to worst performance in the shrub in the study area.
The L4 product is the assimilation results of the SMAP retrievals and the GEOS-5 model simulations, thus it shows similar performance with the L3 product under different vegetation types. Overall, the suitability as well as the variability of the bp factor result in the performance differences of both the L3 and L4 products under different vegetation types in the study area. The impacts of the bp factor is more significant under wet conditions, resulting in performance differences in the three grasslands with a declining order of sparse grassland, dense grassland and alpine meadow.
For barren land, compared to surface roughness and soil moisture, the backscatter weakly depends on soil type [
65]. The uncertainties of surface roughness
h dominate the error budget of
TBp modeling over barren soil surface [
53]. Meanwhile, although depending on surface soil moisture, the parameterization of
h performs better when soil surface is relatively smooth than when the soil surface gets rougher [
53]. Because of the complex topography in the study area, the errors in parameterization of
h lead to poor performance of the SMAP products in barren land in the study area.
5.3. Impacts of Seasonal Frozen Soil
In the SMAP mission, the soil moisture data are retrieved by the relationship between soil moisture and dielectric constant. As soil moisture increases, the soil dielectric constant increases, which leads to an increase in soil reflectivity or a decrease in soil emissivity [
14]. However, besides dry soil, low dielectric constant can also be associated with frozen soil which has a similar dielectric constant to dry soil independent of water content [
14]. Thus, landscape freeze/thaw state is important in soil moisture data retrieval in the SMAP mission [
66].
In the upper reach of the Heihe River Watershed, seasonal frozen soils account for about 65% of the study area [
67], which strongly affects the soil moisture retrieval in the SMAP mission. In winter, the freeze_thaw_fraction values of all grids are near 1.0, leading to missing estimates, thus there is no data from the L3 product in the study area. For those frozen grids with estimates in the study area, the SMAP mission estimates unfrozen soil water, and the freeze_thaw_fraction values range from 0.008 to 0.892, the
R values are 0.464 and 0.334 for the ascending and descending products, respectively, while the
ubRMSE values are 0.051 and 0.053 m
3/m
3, respectively. Because the in situ soil moisture sensor 5TE can measure free water in frozen soil [
68], the L3 product well catches the trend of observations under the frozen states. Under the unfrozen state, the
R values are 0.624 and 0.555 for the ascending and descending products, respectively, and the corresponding
ubRMSE values are 0.054 m
3/m
3 for both products. With slight differences in bias, the L3 product better catches the temporal trend of the in situ observations under the unfrozen state than the frozen state. Meanwhile, because of the soil frozen/thaw processes in spring in the study area, both the L3 and L4 products showed better performance in summer and autumn than in spring. However, the better performance of both products in autumn than in summer is caused by growth effects of the vegetation during the growing season.
5.4. Impacts of Assimilation System
The L4 product is soil moisture estimate by assimilating the SMAP observation (downscaled brightness temperature from L2 product) with simulation by a catchment land surface model, GEOS-5 [
14]. The assimilation causes the performance differences in the L3 and L4 products. Because the L-band brightness temperatures generated by the GEOS-5 model and its associated microwave radiative transfer model have been calibrated to match the climatology of satellite observations, the GEOS-5 model provides unbiased modeled brightness temperatures in long-term mean ignoring seasonal variations in bias [
17]. Meanwhile, the brightness temperature by the SMAP observations is converted to anomalies for assimilation by removing the climatology of satellite observations [
17]. With lacking of SMAP-only climatology, the aforementioned climatology of satellite observations has been derived by the Soil Moisture and Ocean Salinity (SMOS) product, thus leading to the systemic errors of soil moisture estimates by the SMAP L4 product.
The calibration of the GEOS-5 model makes its simulation results more capable in reflecting the climatological trends with smaller variation. In winter, as there is no SMAP observed brightness temperature because of frozen states as for the L3 product, the estimates of the L4 product are all the simulation results by the GEOS-5 model, thus leading to linear temporal trends of the L4 product in winter. Also because of the better reflection in the climatological trends by the GEOS-5 model, the L4 product shows smaller variations with significantly smaller coefficient of variation (CV) values than the L3 product (
Figure 4), thus the L3 product is more dynamic than the L4 product in catching the tempo-spatial distributions of soil moisture at the watershed scale.
In winter, the larger areal average values of soil moisture indicate that the GEOS-5 model overestimates soil moisture in the study area. This is because the GEOS-5 model has been calibrated over unfrozen land to simulate all the water in the soils [
69]. However, the observed soil moisture is free water in the frozen soils, thus the values would be much smaller than the model simulations. Meanwhile, as shown in
Table 4 and
Figure 4, compared to the L3 product, the L4 product shows larger soil moisture estimates in all the seasons and under all the vegetation types. In the SMAP mission, the L3 product is the retrieval of downscaled (9-km) brightness temperatures, and the L4 product is assimilation result by retrieval of downscaled (9-km) brightness temperatures and GEOS-5 simulation. Because the same algorithm has been applied to retrieve soil moisture from brightness temperatures in both the L3 and L4 products, the larger estimates of the L4 product are mainly caused by the GEOS-5 model simulations. Moreover, the L3 product underestimates soil moisture in the study area, the overestimation of the L4 product indicates that the GEOS-5 model overestimates soil moisture in the study area. In summary, the assimilation makes the L4 product show similar temporal trends with the L3 product, but significantly differ in tempo-spatial distributions. Because of the relative errors of the SMAP brightness temperature observations and the corresponding land model forecast in the assimilation [
14], the L3 product shows better performance than the L4 product in the study area under all the evaluation cases.