3.1. Soils in the Study Area
Traditional soil map with lower detail showed a heterogeneous variation of soil classes (
Figure 4a). The predominant soils of the studied area were classified as: Typic Hapludox (THud), Typic Hapludalf (THa), Typic Hapludult (THu), Typic Eutrudept (TE) and Typic Quartzipsamment (TQ) [
43] (
Table 1).
When we perform more detailed taxonomy (
Figure 4b), each soil (except TQ) was divided in two “groups” according to their main characteristics. The short explanation of each “group” is listed as follows: Typic Hapludox 1 (THud1): Soils with hue 2.5YR or even reddish in the first 100 cm of depth with low cation saturation (<50%) in the B horizon. Typic Hapludox 2 (THud2): Soils with red-yellow and with high cation saturation (≥50%) in B horizon. Typic Hapludalf 1 (THa1): Soils with high levels of cation saturation (≥50%) and Fe
2O
3 values ranging by 15%–36% in most of the first 100 cm of B horizon. Typic Hapludalf 2 (THa2): Soils also with high levels of cation saturation (≥50%) but Fe
2O
3 levels lower than 15% in B horizon. Typic Hapludult 1 (THu1): Soils with hue 2.5YR or even reddish and cation saturation lower than 50% in the first 100 cm of depth. Typic Hapludult 2 (THu2): Soils with hue 2.5YR or even reddish and cation saturation higher than 50% in the first 100 cm of depth. Typic Eutrudept 1 (TE1): Soils with high cation saturation levels (≥50%) and Fe
2O
3 levels lower than 18% in B horizon. Typic Eutrudepts 2 (TE2): Soils with high cation saturation levels (≥50%) in the B horizon and Fe
2O
3 levels vary in the range of 18%–36% in the B horizon.
The area is dominated by the Typic Hapludox (THud), which is the most deep and weathered one (
Figure 4a,b). These soils have high drainage in all profile, and occur in a plain topography. There are no significant differences between textures from surface until undersurface horizons, what gives a characteristic of low water retention (
Table 1). The Typic Quartzipsamment (TQ) is very similar to the THud, but different in texture. It is a highly well drained soil. In the study area, these two soils have similarities on chemical, physical and morphological attributes. In some cases, they can be discriminated only by the clay content at the undersurface diagnostic horizon. The THud has clay content higher than 15%, while TQ have lower contents. In the study site, TQ presented clay contents close to the upper limit of its taxonomical range (15%) of clay, which increase the chance of confusion with Thud with loamy texture.
In the study area, we also have found soils with an argillic horizon, such as the Typic Hapludult. This soil class occurs in rolling topography and has considerable textural differentiation from horizon A to B. Higher values of clay in the second horizon gives lower drainage condition in relation to the Typic Hapludox, which perform better water retention. Shallow soils such as the Typic Eutrudepts were also found in the area. These soils have cambic horizon and occur in strong dissected topography (
Figure 4a,b). Since they are very shallow, they occur very close to parent material at lower altitudes.
3.2. Spectral Data from Laboratory to Satellite
An important strategy on soil survey is to understand the variability between surface and undersurface layers. It is important to ratify that differences detected could assist on soil classification. Soils with same spectral information in both layers will have more chance to be an Oxisol, and when the differences are great, the possibilities move for Ultisol or Alfisol classes, for example. Araújo et al. [
55,
56] observed great differences on spectra from sandy to clayey tropical soils. In fact, spectral information shapes differ between these granulometric groups (
Figure 4c). A low albedo was observed in the VIS and high in SWIR for sandy soils. On the other hand, clayey soil does not change significantly between these regions. This occurs due to the organic matter that absorbs energy in VIS, as quartz reflects in SWIR. This is a clear indication of correlation between granulometry and spectral data as determined by [
57] in tropic soils. In fact, the soils have a great variation on clay content (21% to 84%,
Table 1).
From laboratory to satellite data, it was observed the same tendency (
Figure 4c,d). The spectrum loses the information of details on shapes because Landsat has only six bands against 1500 of the laboratory sensor. On the other hand, the aspect from VIS to NIR to SWIR on albedo shows a positive tendency with ascendant shape, in agreement with [
55]. Since the SWIR spectra remained stable in laboratory measurements, the satellite data from band 5 to 7 had a slight declining tendency on absorption (
Figure 4d). This behavior is due to the soil moist in field condition when the satellite sensor acquired the information. Despite this, band 5 remains remarkable on discrimination of soils with different textures, in agreement with [
56].
When there are high levels of iron in the soil, the Landsat image shows purple coloration in composition 543 (RGB) and brown coloration in the 321 (
Figure 3a,b). The point is that the image “color” gives a clue of the soil information, but does not express a spectral pattern. That is why we should go forward to analyze punctual spectral signature. The wavelengths of iron oxides absorption are between 630 and 690, and 760 and 900 nm, corresponding, respectively, to the wavelengths of bands 3 and 4 of Landsat. Thus, with the presence of oxides, the images will be brown or purple according to the composition of the bands and the presence of the elements. Therefore, there is a great correspondence of the spectra from the top of the topossequence (sandy and low iron) to the bottom (clayey and high iron), with the color indicated in composition 5/4/3, in the sequence from point 1 to 4 (
Figure 3b).
As mentioned,
Figure 3a,b presents the differences on color related with the mineralogy. At the location 4 (THa2) in
Figure 3b (image 543), soils with high clay and Fe
2O
3 content presents dark blue color. On the other hand, when soils have less iron content and gets more sandy (TQ, location 4 in
Figure 3b), the color goes to light pink. Composition 321 indicates the real information and is in agreement with field observation.
When we analyzed the top soil spectra, differences were observed in each soil class (
Figure 5a). We acquired the satellite spectra from a bare soil pixel. Then, a soil sample collected in the center of the same pixel at field was used for comparison with the laboratory spectra. We observed shape and intensity spectral alteration from laboratory to satellite data for THud and TQ (
Figure 5a). Indeed, THud has high iron and clay content, but low quartz. THa is very clayey with high iron (Fe
2O
3) content, magnetite and ilmenite, which brings us to a flat spectral-shape in accordance with [
57]. This tendency is observed in both laboratory and satellite data (
Figure 5c,d). In convergence with spectra, color goes from weak to strong purple, as well (
Figure 5b).
We saw differences between the shapes related with iron oxides (
Figure 5c). At 530 nm, hematite absorbs more energy producing a wider shape, and at 850 nm, it presents a strong concavity. At 1900 nm, there is the OH-stretch of kaolinite associated with the alumina vibration in 2200 nm. These differences cannot be observed in
Figure 5a because the multispectral sensor has limitation. However, the spectral intensity tendencies were similar in both satellite and laboratory sensors. Great differentiation between soils occurred at bands 5 and 7 related with quartz.
Indeed, image colors indicate soil variations.
Figure 3 shows that sandy soils at top altitude has weaker color. As we go to lower altitudes, pedogenetic alteration occurs. The area at the top have soils developed from sandstone and at lower altitudes basalt. Geology map (
Figure 3d) indicates these observations. Thus, the image plus top soil spectra and geology map helps on the detection of soil alteration, and perhaps soil units. On the other hand, spectra from top soil are important but cannot discriminate them accurately, because the diagnostic horizon is in undersurface (B horizon). In this case, we must change the strategy and evaluate surface and undersurface as indicated by [
58], or use relief parameters strategy.
Comparison between laboratory and satellite information can also be realized by soil line concept [
59].
Figure 6a indicates the relation between bands 4/3 and 5/7, where spectra are clearly close to 1:1 line since we know they are bare and dried samples, in agreement with [
59]. The same occurs for band 5/7 (
Figure 6b) but more dispersed along the soil line. The use of bands 5 and 7 for better discrimination of soil texture corroborates with findings [
56]. When using satellite data, the quality acquisition is not always controlled at field level for some aspects, such as moist, litter, roughness and spatial distribution. These issues can be diminished by using images acquired in dry season and after plowing. Pre-processing and reflectance transformation is also necessary. Thus, soil line observations associated with the spectra indicate that satellite data were ratified by laboratory with quality control.
While in laboratory all samples are dried, bands 5 and 7 are similar on intensities and take the dispersion to 1:1 line (
Figure 6). On the other hand, soil represented by satellite images is not completely dry and causes a lower reflectance intensity from band 5 to 7 [
60]. This makes the plots more dispersed. The same happens with clayey soils, but with less intensity. Moisture in clayey soils affect band 7 but not band 5. Instead, band 5 is influenced by quartz. Thus, the greater dispersion out of the soil line would be for sandy soils, and the opposite for clayey ones. This is clear in
Figure 6b for loam-sandy soil, where the dispersion is low for either bands 3 and 4 (
Figure 6a). It was observed that sandy textures were also more dispersed. The explanations for these findings are based on mineralogy, not moisture. Sandy soils have less magnetite, ilmenite and hematite, and, therefore, show differences between these bands. Clayey soils with high Fe
2O
3 (
Table 1) and magnetite does not change intensity between these bands, and thus, adjust to 1:1 line.
The characterization of soil classes by spectra was better verified by profile samples.
Figure 7a indicates no alteration between horizons for TQ. This is true since this soil had less than 15% clay in all profile with very homogenous color. The same happened with Thud (
Figure 7b), although having more clay and lower reflectance. Both soils do not have texture differences between A and B horizons, which is expressed by similar spectra. The Typic Hapludalf (
Figure 7c) has its characteristics as with low reflectance in all bands, negative tendency and flat shape. This occur due to its mineralogy, developed from basalt it has high clay content, high presence of magnetite and ilmenite, dark red color and high hematite content. The magnetite “pull” the spectra to lower reflectance in SWIR, and thus have a negative tendency. A Typic Hapludult has different clay contents from A to B horizon (
Table 1). This characteristic reflects on spectra (
Figure 7d). Horizon A presents higher reflectance at SWIR and horizon B has lower reflectance. In fact, there is a shift between horizons in about 1200 nm, which is an indicative of the gradient texture as previously observed by [
57].
Soils alter along topossequence from point 1 to 5 (
Figure 7). As we start in upper altitudes, soils are mostly developed from sandstone (TQ) and are sandy. This characteristic goes all over the profile. When we go down in altitude, sandy texture starts to shift to loam and we reach the THud soils. Despite this, the texture does not change along the profile. On the other hand, on point 3, the loam texture on horizon A continues the same, but B horizon is interrupted and shifts from loam to clay. This gradient of texture creates the THu soil. The alteration occur because during weathering, clay translocated from A to B horizon. Since the parent material is also changing, as sandstone is giving place to basalt, soils starts also to be more clayey. At point 4, clay is no more translocated, but developed in-situ directly from basaltic rocks. On point 5, A horizon is still clayey, and have a B horizon (cambic horizon), with presence of rocks and primary minerals. Thus, point 5 reveals a low deep soil.
Laboratory spectral strategy has the ability to obtain information from surface and undersurface soil. This information is much more detailed since the equipment is hyperspectral, and absorption features can express soil mineralogy. The topsoil information can only be compared with satellite data if the soil is exposed. However, Landsat spectra provide soil information with low detail since it has only six bands. Despite this limitation, the spectral pattern and intensity provided similar tendency with laboratory data (
Figure 6). This kind of information shows the importance of satellite spectra. On the other hand, satellite image also provides color composition (
Figure 3 and
Figure 5). This information showed great relationship with mineralogy and soil attributes, such as texture. Therefore, soil evaluation and spacialization presented good results when punctual spectral patterns and image composition were applied together. Moreover, laboratory data is very important to validate and understand satellite information as well.
A limitation of image spectral patterns on soil classification could be the impossibility to observe undersurface soil (diagnostic horizon). In fact, this is impossible for satellite images. An example could be the evaluation of THu, for which spectra are different between horizons (
Figure 7c). In contrast, the classification of this soil presupposes the presence of texture gradient between A and B horizons.
Table 1 shows that clay goes from 19.2 to 39.8 g·kg
−1 for this soil class. Thereby, soils with similar characteristics of A and B horizons have better chance for discrimination by satellite images, such as the THud and TQ.
The fact is that surface information of THud (
Figure 6a) is expressed by a high reflectance in band 5, which is related with its low clay content. Although the texture could be identified, the soil classification could not be afforded with that single information. On the other hand, we saw that all soils were discriminated only by surface analysis. This led us to agree with [
61], which obtained 70% of agreement on soil discrimination by Landsat image assisted with geological map.
Thus, despite image only detect surface layer, this information was important for soil discrimination, not classification. We can infer from the soil line, but not make a final decision of soil classes. Indeed, Zeng et al. [
35] observed an agreement between different soils at field with surface satellite information using discriminant analyses. This work shows that, despite the fact that satellite image does not detect undersurface soil, surface should have a specific combination of attributes that differs between them. Nevertheless, caution must be taken because in many cases, only surface information will not be sufficient to discriminate. These first results indicated that satellite and laboratory data, used together, aggregated strong evidence on the discrimination of pedological classes. Thus, we used this knowledge for the next steps to reach the digital soil maps.
3.7. Supervised Classification and Digital Soil Maps
Supervised classification of the DsSA-PC1, DsSB-PC1 and DEM parameters resulted in two soil maps. For the digital TFS-1 map (
Figure 4e), a kappa index (κ) of 0.52 and overall accuracy of 69% were obtained. The TFS-2 map (
Figure 4f) obtained κ = 0.45 and an overall accuracy of 62%. These results corresponded to a good performance [
54].
The degree of spatial correspondence (DSC) between the digital and traditional TFS-1 maps for each soil class showed similar values with the
Sc (
Table 6 and
Table 7). According to the DSC, Typic Hapludox, Typic Hapludult and Typic Hapludalf were the soils that presented good agreement between the digital and traditional assessments. Typic Quartzipsamment and Typic Eutrudepts showed low values of spatial correspondence.
Most of the soil classes of the TFS-1 and TFS-2 digital maps showed relatively low levels of discrepancies with their correspondent traditional soil class maps. Therefore, the similarity with field maps was high (
Figure 4). The predicted Typic Hapludox covered some areas of the traditional Typic Quartzipsamment. This is related to the clay fraction of the Typic Hapludox, which was lower than 20%. In this case, these soil classes had similar clay content. This discrepancy was expected, since Typic Hapludox and Typic Quartzipsamment occur in similar relief compartments.
Concerning the discrepancies between Typic Hapludult in areas of Typic Hapludalf of the soil traditional map, which showed a
Sc = 23.7 (
Table 7), the chemical attributes could not allow the discrimination between them. Thus, the weight of the PC for soil attributes estimated in the undersurface layer was low because it did not provide sufficient information for the discrimination of these soil classes. With the surface satellite image, it is possible that their differentiation was less complex. However, since these soils have different texture between surface and undersurface, the discrimination remains complicated even sometimes by using the traditional method. On the other hand, most of the classification discrepancies between TFS-2 digital map and its correspondent soil traditional map occurred between subgroups of the same soil class (
Table 8).
Finally, the purity of the conventional and traditional soil maps could have affected the purity of the digital soil map. For instance, Beckett et al. [
64] found a range of mapping purity between 65% and 86% for a conventional soil survey at 1:25,000 scale. Therefore, errors in traditional soil maps could interfere in the creation of the training and prediction rules, decreasing the performance of the digital technique. On the other hand, we can infer that the purity of digital soil maps could be closer to the conventional approach ranges mentioned before, if we take into account the overall accuracy indexes found in this work.
Indeed, other authors have obtained variation in the quality indicators when they attempted to develop a digital soil map. There is an indication that the results vary according to some parameters such as prediction co-variables, the scale, the complexity of the area and the statistical methods [
9,
65]. Although [
34] have reached 91% of agreement for Oxisol class in their study, Alfisol and Ultisol classes just reached 46% of concordance between conventional and digital maps. The study was carried out in a very complex area with heterogeneous soils, in a semidetailed scale level (1:50,000), using hyperspectral data, Landsat images and stream density as prediction variables.
The area mapped in the current paper reached good results (mean of 69%) regarding DSM approach with remote and proximal sensing, and with relief parameters. Arruda et al. [
65] used the same area as well the information to extrapolate by pedrotransference neural net systems the entire area, reaching 75% agreement. Their results converge to the importance of DSM. Thus, the strategy to make a pedological map can vary, depending on the tools you have available and the final objective.
Bazaglia Filho et al. [
9] also reached a range from 12% to 35% agreement between the traditional and digital soil map in an area of tropical soils. In that occasion, only terrain attributes derived from relif were used for soil classes prediction. The authors used a neural net system, and the bad prediction power was associated with the geological complexity of the area and the scale level of mapping (ultra-detailed map with 1:5000 scale). Nanni et al. [
66] used laboratory and orbital spectra looking towards pedological mapping. Demattê et al. [
67] used spectroscopy and relief variables to reach very good similarity with the traditional map. In fact, this was also indicated by [
68] when using aerial photographs in addition with field spectroscopy. These papers corroborates with [
69] for whom DSM can support the production of functional maps. These studies are also in agreement with the present work, where a good performance (69%) between the digital and traditional map was reached. It is also important to state that satellite images (specifically of Landsat) can be obtained free on the Internet. Regardless, the image must be taken in the same time of soil sampling, with clear sky and good climate, processed and transformed into reflectance, and with complete bare soil, Thus, the variations of quality seems to be a matter of the objective of the user (e.g., scale), and mainly, on how user develops the strategy to map the soil.