1. Introduction
Earth-observing satellite sensor data can be used for land-cover mapping and monitoring, which is essential for estimating land-cover change. The increase in land use and land cover changes (LULC) in natural ecosystems has adversely affected carbon stocks, climate change, and biodiversity, as well as the global climate over the past few decades [
1,
2,
3,
4]. It is believed that deforestation due to urbanization and agricultural expansion is one of the most critical threats to the environment in the 21st century [
5]. The United Nations (U.N.) sustainable development goal (SDG) 15 has emphasized measures to “
protect, restore, and promote sustainable use of terrestrial ecosystems, sustainably manage forests, combat desertification, and halt and reverse land degradation and biodiversity loss” [
6]. Priority is placed on combating desertification, recovering degraded land and soil, particularly in areas affected by desertification, drought, and floods, and combating land degradation by 2030. Satellite Earth observation data offer one of the most reliable options for monitoring land degradation in the context of the SDGs due to their consistency and repeatability at local and large spatial scales. Information about the land cover of a country is an essential part of the planning and development process. It is useful for environmental reporting [
7], assessing the impact of land use on the natural environment [
8], conserving biodiversity and habitats [
9], mapping population distributions [
10], forecasting crops, studying urban heat islands, managing insurance risks, planning telecommunications, and others [
11,
12,
13].
Even though traditional methods (e.g., field surveying) yield accurate results, they are expensive and inefficient in monitoring large and inaccessible areas. To overcome these limitations, remote sensing scientists have developed analytical tools for detecting, characterizing, parameterizing, and monitoring land variables based on space observations. Remote sensing has experienced rapid advances over the past 40 years. Based on remote sensing technology, data are usually collected across different regions of the electromagnetic spectrum at wide spatiotemporal scales (e.g., the recent Copernicus program/Sentinel missions and the Landsat program/missions- which has been available for over 40 years). Hence, remote sensing provides an interesting option for policymakers to make informed decisions about our environment and also to improve the methodology of assessing ecosystem vulnerability [
14,
15].
Over the past decades, the scientific community has fully recognized remote sensing/Earth observation data from space for LULC monitoring. These data offer an unparalleled opportunity for large-area measurement and high temporal precision for land cover mapping and monitoring. Today, a large number of global land cover maps are produced (e.g., GLOBCOVER and MODIS land cover products). However, these products have their limitations for regional as well as local assessments due to their low spatial resolution (e.g., 1 km, 250 m), temporal frequency, and inconsistencies in their assigned thematic classes [
16]. These limitations primarily occur due to (1) the small number of training data relative to the size of the area being mapped, (2) mismatch definitions/propriety in land cover classification schemes, (3) and the need for a readily and automated algorithm to handle large datasets. In this light, many regional governments have embarked on research projects to provide high and medium-resolution (e.g., 30 m) land cover maps, which are accurate and consistent with their local demands. For example, the operational land cover databases (e.g., the National Land Cover Database for the United States of America (U.S.A.) and the United Kingdom’s Land Cover product which is based on the European CORINE land cover mapping scheme [
11].
A widespread increase in anthropogenic activities, land use, and land cover changes are occurring at an unprecedented rate, requiring policymakers and stakeholders to pay greater attention to the measures to manage and control environmental degradation. In Nigeria, the threat to environmental sustainability, for example, is encapsulated in the need to ensure the quality of the environment is appropriate for good health and well-being, as well as to protect and utilize the environment and natural resources for the benefit of present and future generations. The policy encourages the compilation of detailed land capability inventories, comprehensive land classifications, assessment of the current land use practices, causes and extent of land degradation, and regulatory framework for sustainable land use [
17]. However, despite recent advancements in Earth observation and remote sensing, there is no reliable land LULC for the country. Most of the previous global land cover maps were not also developed based on adequate or training data sets covering Nigeria. And their class labeling and definitions (e.g., International Geosphere-Biosphere Programme) have mixed land cover classes, which are unsuitable for discerning LULC characteristics in Nigeria. Conservation policies in Nigeria have emphasized undertaking land capability classifications based on evolving methods of land evaluation suitable to local conditions.
Land cover monitoring using remotely sensed data involves precise mapping of complex land cover and land use categories, necessitating the employment of strong classification systems [
18]. Waske and Braun [
19], who compare the ensemble classifiers with approaches such as the maximum likelihood classifier for land cover classification using C-band multi-temporal SAR data, observed that random forest (RF) outperformed maximum likelihood by more than 10%. A comprehensive comparison of machine learning algorithms has been conducted by Lawrence and Moran (2015) using uniform procedures and 30 distinct datasets. Their results showed that RF had the highest classification accuracy of 73.19% than SVM, which had an accuracy of 62.28%. Of the total 30 classifications, RF was the most accurate in 18 classification scenarios. Talukdar et al. [
20] reviewed six machine-learning classifiers for LULC classification using satellite observations. Based on overall accuracy, results indicate that RF is the best machine-learning LULC classifier (0.89, RMSE = 0.006), compared to support vector machine (Kappa = 0.86, RMSE = 0.11), artificial neural network (Kappa = 0.87, RMSE = 0.09), fuzzy adaptive resonance theory-supervised predictive mapping (0.85, RMSE = 0.17), spectral angle mapper (Kappa = 0.84, RMSE = 0.23), and Mahalanobis distance (Kappa = 0.82, RMSE = 0.28). This makes the machine learning algorithm suitable for LULC classification. Furthermore, a recent study by Adugna et al. [
21], who compare RF and SVM machine learning methods, found that RF outperformed SVM, yielding overall accuracy (OA) of 0.86 and a kappa (k) statistic of 0.83, respectively, which is 1–2% and 3% higher than the best SVM model.
Nowadays, machine learning technology is used for feature selection to assist in mapping LULC categories. The advantage of RF is its capability for feature selection, which has been proven to improve classification accuracy in previous studies [
22,
23,
24]. A study by Balzter et al. [
11], who developed a method for CORINE Land Cover mapping using RFs, demonstrates the importance of variable selection using Sentinel-1A radar backscatter coefficient at HH and HV polarizations (summer acquisitions) and VV and VH polarization (winter acquisitions) and SRTM Digital Elevation Model Data. The classification out-of-bag error rate was 52.5%, and kappa (κ) = 0.38 for the Sentinel-1 variables. When the variables generated from the S.R.T.M. data were added, the quality of the classified map was improved substantially, with an out-of-bag error rate of 31.6% (68.4% accuracy) and κ = 0.63. R.F. clearly describes the benefits of including variable selection in the land cover classification process in a complex environment [
25].
The RF technique is well-established in land remote sensing today. Still, it has not been adequately evaluated by the remote sensing community as compared to more traditional pattern recognition algorithms. In addition, there have been observations about how the importance of variables varies depending on the data and ecosystem in question, necessitating further exploration [
23,
25,
26]. To assist decision-makers in a variety of spatial planning applications (e.g., cropland management, irrigated agriculture intensification, flood vulnerability assessment, water management, or human settlement/resettlement planning in floodplains), the thematic LULC classes were created to represent the local characteristics of the semi-arid region, in Nigeria. Specifically, the objectives of this study were; (1) to evaluate the applicability of an RF classification algorithm for LULC mapping using local class definitions and training data sets in an agriculturally dominated landscape in Nigeria; (2) to assess the contribution of an individual satellite band in the RF model; (3) to improve model performance and reduce prediction errors of LULC maps based on RF feature selection. The novelty of this study is the synergistic use of different sources of satellite data to identify the most important variables to reduce prediction error. Therefore, one of the most important contributions of the work is the methodology developed to improve classification performance. The insights gained in this work to improve model performance and reduce prediction errors not only support policymakers in applying accurate LULC maps in spatial planning but also enrich the methodological system of LULC assessment through machine learning.
4. Discussion
This study reports on the production of LULC maps based on random feature selection to evaluate its application in an agriculturally dominated landscape in Nigeria. The potential of using Sentinel-1 optical data, Copernicus Sentinel-1 c radar backscatter, and SRTM topographic variables were investigated to ascertain whether this synergy could improve classification accuracy. The general findings that emerged from this study suggest that: (1) the application of RF classification appears promising in this ecosystem; (2) the use of multiple remote sensing and environmental variables is an important contribution to quantitative remote sensing applications; (3) feature selection methods can improve classification accuracy; however, the evaluation of classification accuracy requires a thorough and critical assessments.
The mapping performed in this study was guided by the RF feature selection procedure based on the ranking of MDA as a function of OOB error estimates. The contribution of each satellite band varies. Some bands make a better contribution than others. What makes these results interesting is the procedure used to test each data set individually and then in combinations. Interestingly, the most important bands also provide the largest spectral differences between classes, except for the normalized backscatter polarizations, where the spread between classes is not very large, but this is similar behavior observed for the Sentinel-2 blue band (
Figure 3a,b). Among the Sentinel-2 data variables, the blue, SWIR1, and NIR bands were found to be the most important variables (
Figure 4a). Similar behavior for the SWIR1 and blue spectral bands of Sentinel-2 has been observed in previous vegetation, tree species, and crop mapping studies [
54,
55]. ED Chaves et al. [
56] have explained that Sentinel’s two SWIR bands are very sensitive to chlorophyll content, which allows them to distinguish different vegetation types and determine classification accuracy for LULC. ED Chaves, CA Picoli, and D. Sanches [
56] stated that Sentinel’s two SWIR bands are very sensitive to chlorophyll content, allowing them to distinguish different vegetation types and determine classification accuracy for LULC. In addition, visible and shortwave infrared wavelengths are known for their spectral variations, which can explain variations caused by chlorophyll content, soil type, and soil color [
57].
Using the normalized backscatter, the VH polarization has the highest rank, which is due to the combinations of the different polarizations (
Figure 4b). For the topographic SRTM variables, the elevation data had the highest rank (
Figure 4c). The stand-alone classification results for the Sentinel-1 data (
Table 3), as well as for the topographic SRTM variables (
Table 4), achieved very low accuracy compared to the Sentinel-2 data (
Table 2). Therefore, the synergy between VH, elevation data, and Sentinel spectral bands was evaluated to see if the accuracy of the model could be improved. The ranking of the most important variables shows that elevation, blue band, VH, NIR8a, and SWIR1 are the five most important variables (
Figure 4d). Elevation makes the largest contribution to the classification. These results are consistent with a recent study by Zhao, Zhu, Wei, Fang, Zhang, Yan, Liu, Zhao, and Wu [
57], the only difference being that they do not include radar backscatter as one of their input variables. This study highlights the importance of altitude and radar backscatter data with Sentinel-2 data to improve the classification accuracy of LULC.
The accuracy of the classified maps in this study suggests that it is reasonable to use different remote sensing data for LULC, as has been done in many previous studies. Based on the OOB error estimates, two scenarios were considered the most important, so the comparison is limited to these. The overall OOB classification results for Sentinel-2 data show an overall accuracy of 84.2% and a κ of 0.38, with the lowest and highest class errors for classification at 4.54% and 21.81% for built-up areas and grassland, respectively (
Table 2). This level of accuracy is achieved by the Sentinel-2 data alone, further emphasizing their applicability in LULC mapping in this particular ecosystem. However, when the SRTM elevation data and VH backscatter were added to the Sentinel-2 spectral bands, the overall accuracy was 89.1%, and the κ value was 0.4, an increase of 4.9% in overall accuracy (
Table 3) compared to the Sentinel-2 data alone. The lowest classification error was found in the irrigated areas, with only 2.15%, while the highest error occurred in the grassland areas (18.09%), which in this case were reduced by 3.72%. The cultivated areas had a class error of 3.41%, which is a further reduction of 1.13% compared to the Sentinel-2 data. For trees and shrubland, the study found a 6.28% difference between the sentinel data and their combination with elevation and VH backscatter data.
This is not independent of the role of the elevation and backscatter data in the overall performance of the model. The topography of the area is heterogeneous, and some of the classes are located in the floodplain, which is typically undulating compared to developed and agricultural areas. Several studies have shown the importance of elevation data to increase the accuracy of the classified map [
11,
26,
40,
58]. In the same vein, radar backscatter was found to improve model performance because it can normalize or reduce the effects of the atmosphere, topography, instrument noise, etc., to provide consistent spatial and temporal comparisons [
59]. The results from this study are consistent with Meneghini [
60], who evaluates the synergy between the Sentinel-1 and Sentinel-2 data for land cover classification. Their results show an overall accuracy of 74% and 78% for Sentinel-2 (Only) and in combination with Sentinel-1 data, respectively. Similarly, several studies have reported the importance of synergy between sentinel-1 and -2 data for increasing model performance for biomass estimation [
61], crop type classification [
62], irrigation mapping [
63], and land cover mapping [
64,
65].
It has been observed that in a setting in which there is a strong interest in predicting observations from the smaller classes, sampling the same number of observations from each class for validation is a promising alternative [
53]. Moreover, one of the objectives of this study was to compare the validation of OOB error estimates of the RF normally performed internally by the model with another independent validation (external) which was performed based on equal-size random stratified sampling using 100 polygons for each LULC category. The overall accuracy of the classification results were 69.9% and 75.2% for Sentinel’s 2 data only and the combination of the same data with VH backscatter and elevation data, respectively. The difference between the two is 5.3% which conformed to the OOB estimates of errors even though the overall accuracy obtained from the OOB is higher. The consistency of these two validation results manifested even within the class error. Similar to OOB estimates of error, grassland had the lowest producer’s accuracy with an 11.2% difference between the Sentinel’s data only and in combination with VH and elevation data based on the independent validation. In this context, the estimates from the OOB are, therefore, reliable since the two validation results have maintained a consistent pattern. The only difference between the two is in kappa statistics, where the external validation shows higher kappa (
k = 0.71,
Table 7) than the estimates from the RF internal validation (
k = 0.4,
Table 3). This is one of the advantages of a balanced setting for applying the equalized stratified random sampling for validation [
66], but balancing may not always be possible due to costs or other reasons [
4]. But kappa is not a measure of accuracy but of agreement beyond chance, and chance correction is rarely needed [
67,
68]. The comparison results obtained in this study are consistent with findings by Adelabu et al. [
69], who tested the reliability of the internal accuracy assessments of the RF for classifying tree defoliation levels using different validation methods. One of the most important deductions that can be made in this context is that where only the RF approach is applied to the LULC classification, independent validation is not necessary because validation requires a large number of points, and therefore manual class labeling based on external validation is tedious and time-consuming. The findings of this current study provide insights into the reliability and applicability of OOB error estimates.
One of the limitations of this study is the lack of reference ground truth datasets from a field campaign. Although this study relied on RGB composite images and Google Earth data for the selection of training and validation datasets, it should be noted that such datasets are well-acknowledged as a source of training and validation for land cover mapping [
70,
71]. Furthermore, a comparison of the quantitative and qualitative results showed that the LULC categories are detailed and very accurate (
Table 2,
Table 3,
Table 6,
Table 7 and
Table 8 and
Figure 4 and
Figure 5). The area estimated from the two most accurate results shows that there is extensive agricultural land. The two maps show slight differences for the area of different LULC categories. The study, however, acknowledged the confusion between the barren land and the built-up areas, which occurred primarily due to the presence of settlements in or near the floodplain areas, in addition to the similarity of the spectral reflectance signatures of these LULC classes. Moreover, the difference between the spectral reflectance signatures between the barren land on the upland and in the floodplain probably led to the underestimation of barren land in the upland areas. However, the class error for barren is minimal, as observed for the RF internal validation (7.43%) as well as for independent validation (producer’s accuracy = 91.9% and user’s accuracy = 84.3%). From these results, it is obvious that further research in this particular ecosystem may require the need to incorporate vegetation (e.g., NDVI), bare soil indices (e.g., modified normalized difference bare-land index), and water indices (e.g., Modified Normalized Difference Water Index) to improve classification performance. The study also noted confusion between the river network and wetlands. Earlier reports indicated that significant flooding occurred in the area on October 1 [
72,
73]. At this time, the volume of rivers usually increases, and flooding is easily possible when the amount of rainfall is significant, and the dams along these rivers have been opened. These floods have left many people homeless and severely damaged agricultural land and crops. Future research could focus on flood vulnerability assessment based on change detection using sentinel data. In this situation, flood vulnerability mapping can provide critical information to assess flood risk in the region. Policymakers could be well informed about the risk and thus develop appropriate mitigation strategies based on the severity of the impacts [
74,
75].
Similarly, the study observed confusion between the grassland and farmland. Mapping LULC with Sentinel-2 data in the semi-arid region is quite promising [
34] but challenging because most crops are planted during the rainy season, and their growing season is in July and August, during which the cloud cover is high in the area. And the reliance on dry season imagery may not be feasible as there is a transition from cropland to barren land in the area, especially from early November. Since cropland makes up most of the LULC in the area, this is not the most appropriate time for LULC mapping. This study minimized this problem by integrating Sentinel-1 and -2 data in early and mid-October. Van Tricht, Gobin, Gilliams, and Piccard [
63] demonstrate the importance of choosing phenological cycles for crop mapping based on the synergy between the sentinel-1 and -2 data using an RF classifier for increasing model performance. Similarly, many studies demonstrated the importance of Sentinel-1 and -2 for rice mapping in a lowland area [
76], mapping paddy rice [
77], and mapping Maize Areas in heterogeneous agriculture [
78] based on RF. By understanding this trade-off, the current study can help in the selection of datasets and periods for LULC classification with specific applications to agricultural landscapes in semi-arid regions. Although cloud cover may result in a lack of cloud-free imagery in this region, a potential area for further research would be to examine crop and vegetation phenological cycles and by incorporating more variables from the Sentinel-1 data during the rainy season to minimize the challenge of cloud cover.
5. Conclusions
This paper proposed LULC mapping by applying an RF classifier to Sentinel-1, -2, and SRTM digital elevation data to evaluate its applicability based on local class definitions and training datasets in an agricultural landscape in Nigeria. The main objective was to develop a methodology to improve model performance and reduce prediction error in LULC classifications. A feature selection method (RF) was implemented to evaluate the contribution of individual bands based on a standard OOB error estimate (MDA). The study showed that the combination of spectral bands, backscatter, and topographic features could improve classification accuracy. The results show that among the variables in the sentinel-2 data, the blue, SWIR1, and NIR bands are the most important variables. Using the normalized backscatter, the VH polarization has the highest rank, which is due to the combination of the different polarizations. For the SRTM topographic variables, the elevation data had the highest rank. The ranking of the most important variables when combining the different data sets shows that height, blue band, VH backscatter, NIR8a, and SWIR1 are the five most important variables.
The overall OOB classification results for Sentinel-2 data show an overall accuracy of 84.2%, with the lowest and highest class errors for classification of 4.54% and 21.81% for built-up areas and grassland, respectively. This level of accuracy is achieved by the Sentinel-2 data alone (scenario 1), further emphasizing its applicability in LULC mapping in this particular ecosystem. On the other hand, the class errors for Sentinel-1 (scenario 2) and SRTM data (scenario 3) show high-class errors. However, when the Sentinel-1, -2, and SRTM elevation data were added to the model, the overall accuracy was 89.1%. This represents a 4.9% improvement in overall accuracy compared to Sentinel-2 alone and a 6.1% and 12.66% improvement compared to Sentinel-1 and SRTM data, respectively. The lowest classification error was found in the irrigated areas at only 2.15%. In comparison, the highest error occurred in the grassland areas (18.09%), which in this case were reduced by 3.72% compared to the Sentinel data alone. According to the study, there was a 6.28% difference between sentinel data and their combination with elevation and VH backscatter data for trees and shrubland. The results of an independent validation based on an equal-size random sampling of 800 points are consistent with OOB error estimates. The study shows how the synergy of optical, radar, and elevation data can significantly improve LULC map accuracy. Based on these results, LULC maps could be used in a broad range of spatial planning applications.