1. Introduction
The concepts of geological units and lithological mapping in geology are closely related; however, they are distinct from one another. On one hand, different geological units can be categorized based on their characteristics, such as composition, age, and origin, including rock layers, formations, and other distinct bodies of rock. On the other hand, lithological mapping provides information about the distribution and geological history of the Earth’s crust, together with its characteristics [
1]. Therefore, it plays a significant role in bedrock surveys and mineral exploration [
2]. In many cases, ore deposits have been first discovered on the ground by recognizing hydrothermally altered host rocks. To understand the distribution, properties, and characteristics of different rock types within a particular area, regional lithological maps can play a significant role in lithological mapping as part of geology and mineral exploration [
1]. Acquiring such lithological maps is time-consuming, needs intensive fieldwork, and can be challenging in cases where the study area is difficult to reach.
Remote sensing is one of the valuable approaches for lithological mapping to explore commercially viable mineral resources. Remote-sensing mapping techniques can locate high potential zones for ore mineralization in a vast area by recognizing hydrothermally altered rocks, while consuming less time and money and achieving a higher accuracy than ground-based field surveys [
3,
4]. However, the medium and coarse spatial and spectral resolution of remote-sensing data may make the implementation of such systems difficult. The technical characteristics of multispectral and hyperspectral remote-sensing sensors are crucial for lithological mapping and mineral exploration [
5,
6,
7]. Sensors that are equipped with hyperspectral technology are capable of simultaneously acquiring images with 100 to 200 contiguous spectral bands, allowing for a unique combination of spectrally contiguous images [
8]. A substantial amount of spectral information can be derived from satellite-based hyperspectral data, allowing mineral compositions to be determined from the spectra [
9]. Yet, in addition to spectral confusion, difficulties in data processing, relatively narrow swath widths, and atmospheric interference, high-resolution hyperspectral images are prone to spectral interference [
10]. Thus, a single pixel in the image provides coverage for a large ground surface area (e.g., 1000 m
2), making selections of pure pixel spectra for training samples in the supervised classifier difficult and challenging; furthermore, the lithological classification accuracy is potentially low as a result [
11,
12]. Hyperspectral data are often not openly available or are costly.
Thanks to the availability of high-resolution multispectral data, such as SPOT and GF-2, the problem of low classification accuracy has been solved to a certain extent. Despite an impressive capability of showing structural and textural features, high-spatial-resolution multispectral satellite data have a narrow spectral range. There are only a few visible and near-infrared bands and a marked absence of other spectral bands such as short wave and thermal infrared. Most high-resolution images are also costly and not publicly available.
The advanced spaceborne thermal emission and reflection radiometer (ASTER) sensor can identify lithological units and hydrothermal alteration mineral zones [
13,
14,
15,
16,
17]. ASTER (Ministry of Economy, Trade, and Industry, Tokyo, Japan) provides worldwide coverage with high revisit times (16 days) at a relatively high spatial resolution (15–90 m). Using this technique, it is possible to identify imagery that is free from cloud cover or that is seasonal in order to minimize the effects of vegetation. It has been demonstrated that the remote identification of iron oxide minerals can be easily achieved using ASTER’s visible and near-infrared (VNIR) bands [
14,
18,
19,
20]. The fundamental absorption features of Al–O–H, Mg–O–H, Si–O–H, and CO
3 for identifying hydrothermal alteration minerals (e.g., phyllosilicates, sorosilicate, and carbonates) can be detected using the shortwave infrared (SWIR) bands of ASTER [
21,
22,
23,
24]. Furthermore, ASTER’s thermal infrared bands (TIR) can distinguish silicate lithological groups through the emissivity spectra that are derived from Si–O–Si stretching vibrations [
15,
25,
26,
27].
To map lithological units and identify alteration mineral zones, several image processing algorithms, namely band math, minimum noise fraction, spectral angle mapper, principal component analysis, false color composite, and matched filter, have been commonly applied to ASTER data [
5,
8,
9,
11,
28]. The results from these conventional algorithms contain some drawbacks, such as unclassified and misclassified units, which are challenging. Hence, these techniques typically might reduce the accuracy of lithological and alteration mapping [
29,
30]. Recently, machine-learning (ML) algorithms have been more effective than conventional classification methods when classifying geological targets [
31,
32]. ML, which is a sub-domain of artificial intelligence, is a data-driven technique that helps to extract useful information and recognize patterns in data with minimal human involvement [
33,
34,
35]. ML algorithms have several advantages, especially in automatically solving the most complex nonlinear problems, and are more robust in handling missing data than traditional image-processing methods [
33,
36]. Particular attention is devoted to the task of supervised lithology classification for the prediction of classes representing the spatial distributions of geological materials.
Some researchers, such as Bachri et al. [
37] and Cracknell and Reading [
33], have assessed and evaluated applications of ML algorithms in geological mapping using remote-sensing imagery. They showed considerable potential in various areas, such as mapping lithological units and the identification of alteration zones that are associated with a variety of ore mineralization processes [
37]. Extensively applied ML algorithms in geology and mineral mapping include support vector machines (SVM) [
33], artificial neural networks (ANNs) [
33], random forest (RF) [
38], maximum likelihood classifier (MLC) [
38], k-nearest neighbors (k-NN) [
33], and naïve Bayes (NB) [
33]. Advancement in ML algorithms for image processing based on satellite data has considerably assisted in enhancing the detection of lithological and structural features, and in identifying alteration zones for mineral exploration. Lithological mapping could be made more feasible by using state-of-the-art ML algorithms like gradient boosting (GB), extreme gradient boosting (XGB), and artificial neural networks (ANNs).
A neural network is an artificial intelligence algorithm that is capable of analyzing patterns, learning tasks, and solving problems like humans [
39]. The ANN is widely used to solve complex problems in diverse fields, including regression and classification problems [
40]. The performance of ANNs depends on several key parameters, such as activation functions, loss functions, optimizers, hidden layers, the number of nodes, and regularization layers [
41]. The GB [
42] is a sequential ensemble learning technique where the model’s performance improves over iterations [
43]. This method creates the model in a stage-wise fashion. It infers the model by enabling the optimization of an absolute differentiable loss function [
43,
44]. The XGB algorithm is an extended version of the gradient boosting algorithm. It is designed to enhance an ML model’s performance and speed. Xiong et al. [
45] analyzed deep-learning algorithms and big data in skarn-type (sedimentary–igneous intrusion contacts) iron mineralization in China. Their results showed a strong spatial relationship between known mineralization areas, which were mapped prospectively by a deep-learning method. Elahi et al. [
46] investigated the potential of two ML algorithms, including SVM and ANN, using Sentinel-2 optical data for lithological mapping in Pakistan. They reported an overall accuracy of 95.78% and 95.73% for SVM and ANN, respectively. Utilizing ML methods of XGB and ANN algorithms on ASTER data has a high potential and great advantages for lithological mapping and mineral exploration.
The study’s main objective is to propose an approach for identifying the most optimized and efficient ML approach for lithological mapping using ASTER remote-sensing data. This research compares several traditional machine-learning algorithms, such as RF and SVM, to novel ensemble machine learning techniques, such as GB, XGB, and deep-learning ANN, for the spatial modeling of lithological units. It also aims to find the most relevant features and spectral regions for lithological mapping. This study represents an inclusive evaluation of RF, SVM, GB, XGB, and deep ANN algorithms for lithological mapping using ASTER data. The models can provide geologists with accurate lithological mapping and mineral exploration, especially when applied to similar regions.
2. Geology of the Study Area
The current study focuses on mineral exploration in Iran. Most of the country is semi-arid with sparse, mainly herbaceous vegetation on surfaces that are well exposed. This makes remote-sensing-based geological mapping an ideal method of study [
47]. The Sar-Cheshmeh copper mining region in Kerman Province (southeast Iran) was selected as a case study (
Figure 1A,B). The Sar-Cheshmeh porphyry copper deposit is considered the second largest global deposit of this metal, the most important in Iran, and has been exploited since ancient times. It contains roughly 1200 million tonnes of ore with an average grade of 1.2% copper, 0.03% molybdenum, 3.9 g/t Ag, and 0.11 g/t Au [
48]. It is the first time that ML-based techniques (RF, SVM, GB, XGB, and deep-learning ANN) have been used for lithological mapping using ASTER remote-sensing data in the Sar-Cheshmeh copper mining region (
Figure 1). The study area is 160 km southeast of Kerman City (55.865556° E, 29.946111° N) and south of the Urmia-Dokhtar volcanic belt (
Figure 1A). It is located in an area of Eocene volcanic rock and Oligo-Miocene subvolcanic granitoid rock.
It is believed that the oldest host rocks of the Sar-Cheshmeh porphyry copper deposit are derived from the Eocene volcanogenic complex [
49], which consists of the following: pyroxene trachybasalt, potassic and shoshonitic pyroxene andesite [
50], less abundant andesite, agglomerate, tuff, and tuffaceous sandstone. During the Oligocene–Miocene transition (~23 Ma), granitoid phases such as quartz diorite, quartz monzonite, and granodiorite were intruded into these rocks. These granitoid rocks are cut by intramineral porphyry dikes composed of hornblende porphyry, feldspar porphyry, and biotite porphyry. The Sar-Cheshmeh copper deposit is placed in Eocene volcanic rocks, where a Miocene sub-volcanic granitoid unit intruded into andesitic host rocks [
48]. Porphyry copper mineralization in this area is associated with well-developed zones of hydrothermal phyllic, argillic, propylitic, silicification, and jarositic alteration zones.
The deposit is located at an average altitude of 2620 m asl and its highest altitude reaches 3280 m asl. Generally speaking, the regional climate is characterized by cold, snowy, and windy winters, and mild summers. The temperature ranges from −15 to +35 °C. The average rainfall is reported to be 250 mm or less per year. As a result, the surface of the earth is well exposed, given that there is little or no vegetation cover, which makes the remote-sensing approach very suitable.
5. Discussion
One factor that must be considered in lithological mapping is the specific spectral reflectance that is associated with each mineral. In other words, lithological units consist of a mixture of spectral reflections from minerals that make up their composition [
79,
80]. Pixel size is yet another factor that must be considered when classifying lithological units. It should be noted that different datasets generate pixels of different sizes. Even in a single specific sensor, various spectra have different resolutions. In this study, ASTER bands were resampled to the resolution of VNIR bands (i.e., 15 m). According to the results of FI, the TIR bands and indices that were associated with these bands have higher accuracy. However, the TIR resolution is coarser than that of other spectral regions (e.g., VNIR and SWIR). Including thermal data at a lower resolution may lead to more accurate lithological mapping. Yet, it should be noted that some classes were narrow and elongated in the study area. Therefore, classifying these classes may be difficult, as in the case of high-spatial resolution, two or more lithological classes mix in one pixel. Lower spatial resolution due to too many detailed objects can equally pose a problem [
81]. Another parameter that can affect the overall accuracy of ML algorithms is how training and testing samples are selected. There was some uncertainty in selecting training samples based on the visual interpretation of geological maps [
82]. Yet, the training and testing samples were chosen randomly in this study. In addition, the training models have been repeated by a diverse selection of training samples, and an average accuracy across various situations has been attained to ensure that this study’s overall accuracy is stable.
This study investigated the accuracy of five ML algorithms (RF, SVM, GB, XGB, and a deep-learning ANN) to map lithological units in the Sar-Cheshmeh copper mining region and compared their overall accuracy to one another. A comparison with other studies of ML algorithms in lithological mapping has been made herein. Shebl et al. [
83] utilized Sentinel-2 multispectral data and radiometric data to assess the potential of SVM to classify 13 lithological classes in Egypt, including igneous, metamorphic, and sedimentary rocks. Their dataset contained from 955 to 3397 observations per class for training the model. They reported an overall accuracy of between 0.756 to 0.857. Nugroho et al. [
84] investigated the potential of several forms of remote-sensing imagery, including Sentinel-2, ALOS PALSAR, and DEM, together with geophysical data. This included magnetic and electromagnetic data to map lithology in Indonesia using an RF algorithm. Their number of training samples per class was between 14 to 337. They reported an accuracy of 0.73 to 0.81 for lithology classes. Bachri et al. [
82] utilized several forms of remote-sensing data, including Landsat 8 OLI, DEM, and ALOS PALSAR, to assess lithological mapping in Morocco. They reported an overall accuracy of 0.85 using the SVM ML algorithm. Manap and San [
81] reported that adding SAR and DEM data improved the model’s overall accuracy by roughly 10%. They also reported that SVM and ANN were more accurate than RF. In the current study, SVM outperformed other ML algorithms with respect to overall accuracy in the Sar-Cheshmeh copper mining region. Adding DEM could not significantly improve the overall accuracy of ML algorithms (<2%).
According to
Table 2, the number of training samples for minor and major classes is considerably different in this study, so the imbalance ratio is quite high. The number of features also reached 33 (both bands and indices). This clarifies that in this study, a complicated problem was confronted, as seen from the accuracy that was reported for the class of feldspar dike by almost all models given that it had the lowest number of sampling data points. Considering the above conditions, it is clear that the results of all ML algorithms provided a relatively high accuracy. One way to improve the accuracy of the ML algorithms in this study is simply by adding more data to train the model. It also should be noted that adding data is not always the best case in ML algorithms. Adding additional data, such as geophysical measurements, may improve the accuracy, as has been well stated by Nugroho et al. [
84]. One limitation of such ML algorithms is the presence of vegetation in the area, affecting the spectral information fed back to the sensor and, therefore, classification accuracy [
85]. However, the case study region in this analysis was arid, lacking extensive vegetation. The presence of vegetation in the study area can be addressed by utilizing synthetic aperture radar (SAR) coverage (depending on the type and height of vegetation).