Next Article in Journal
Study on the Factors Affecting the Humus Horizon Thickness in the Black Soil Region of Liaoning Province, China
Next Article in Special Issue
Leveraging Convolutional Neural Networks for Disease Detection in Vegetables: A Comprehensive Review
Previous Article in Journal
Epigenetic Regulation for Heat Stress Adaptation in Plants: New Horizons for Crop Improvement under Climate Change
Previous Article in Special Issue
EFS-Former: An Efficient Network for Fruit Tree Leaf Disease Segmentation and Severity Assessment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using Pseudo-Color Maps and Machine Learning Methods to Estimate Long-Term Salinity of Soils

by
Ravil I. Mukhamediev
1,2,
Alexey Terekhov
2,
Yedilkhan Amirgaliyev
2,
Yelena Popova
3,
Dmitry Malakhov
4,
Yan Kuchin
1,2,
Gulshat Sagatdinova
2,
Adilkhan Symagulov
1,2,*,
Elena Muhamedijeva
2 and
Pavel Gricenko
5,6
1
Institute of Automation and Information Technologies, Satbayev University (KazNRTU), Almaty 050013, Kazakhstan
2
Institute of Information and Computational Technologies, Almaty 050010, Kazakhstan
3
Transport and Telecommunication Institute, Lauvas iela 2, LV-1003 Riga, Latvia
4
Institute of Zoology SC MES RK, Al-Faraby Av., 93, Almaty 050060, Kazakhstan
5
Institute of Mechanics and Engineering, Almaty 050010, Kazakhstan
6
Faculty of Mechanics and Mathematics, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan
*
Author to whom correspondence should be addressed.
Agronomy 2024, 14(9), 2103; https://doi.org/10.3390/agronomy14092103
Submission received: 7 July 2024 / Revised: 25 August 2024 / Accepted: 13 September 2024 / Published: 15 September 2024
(This article belongs to the Special Issue The Applications of Deep Learning in Smart Agriculture)

Abstract

:
Soil salinity assessment methods based on remote sensing data are a common topic of scientific research. However, the developed methods, as a rule, estimate relatively small areas of the land surface at certain moments of the season, tied to the timing of ground surveys. Considerable variability of weather conditions and the state of the earth surface makes it difficult to assess the salinity level with the help of remote sensing data and to verify it within a year. At the same time, the assessment of salinity on the basis of multiyear data allows reducing the level of seasonal fluctuations to a considerable extent and revealing the statistically stable characteristics of cultivated areas of land surface. Such an approach allows, in our opinion, the processes of mapping the salinity of large areas of cultivated lands to be automated considerably. The authors propose an approach to assess the salinization of cultivated and non-cultivated soils of arid zones on the basis of long-term averaged values of vegetation indices and salinity indices. This approach allows revealing the consistent relationships between the characteristics of spectral indices and salinization parameters. Based on this approach, this paper presents a mapping method including the use of multiyear data and machine learning algorithms to classify soil salinity levels in one of the regions of South Kazakhstan. Verification of the method was carried out by comparing the obtained salinity assessment with the expert data and the results of laboratory tests of soil samples. The percentage of “gross” errors of the method, in other words, errors when the predicted salinity class differs by more than one position compared to the actual one, is 22–28% (accuracy is 0.78–0.72). The obtained results allow recommending the developed method for the assessment of long-term trends of secondary salinization of irrigated arable land in arid areas.

1. Introduction

Secondary salinization of arable land is one of the important negative factors limiting agricultural production in many regions of irrigated agriculture [1]. Over the past 30 years, the number of lands exposed to salinization has increased 2.4-fold [2] and covers up to 20% of the world’s irrigated land reserve [3]. Anthropogenic impacts and climate change, causing the increased use of soil and water resources, has led to increased salinity and land degradation, especially in arid areas [4]. Moreover, it is supposed that by 2050 the salinity of arable lands in the world could exceed 50% [5]. In Kazakhstan, secondary salinization of irrigated arable land is widespread in the Aral Sea basin [6,7]. This factor has a large-scale impact on irrigated areas in the middle and lower parts of the Syr Darya river basin in the Turkestan and Kyzylorda regions. Significant areas of salinization are observed in all southern regions of the country. To assess them and to develop the measures to reduce the negative impact, various monitoring methods are applied, one of which is the use of remote sensing of the earth’s surface. Remote methods for assessing soil salinity have been widely discussed by the scientific community since the 1980s [8,9,10,11]. However, a reliable and universal method has not yet been proposed [12]. The main difficulties are related to the fact that remote sensing methods assess the surface layer of soil, while the influence of salt is revealed in the root layer [13,14,15], the state of which is rarely directly related to the spectral characteristics of the soil surface [16]. Such a relationship can be detected only in a number of cases considering the local conditions [17,18,19]. In recent years, the development of machine learning methods has facilitated a solution to such a problem [20,21,22,23].
Nevertheless, soil salinization is a recurring phenomenon, caused not only by the specific conditions of one season but also by the long-term stable factors leading to an increased or decreased salt content in some parts of the regions under consideration [24].
In this study, a model of soil salinity assessment taking into account long-term remote sensing data is proposed. There are very few works of this type in the scientific literature. Only old studies [17,24] and a relatively recent study, which, however, is based on a low-resolution satellite product [25], can be noted. We hope that this paper will cover this gap.
The main contributions of this work are as follows:
  • A method of salinity assessment for cultivated and non-cultivated soils of arid zones based on long-term averaged values of vegetation indices and salinity indices is proposed;
  • The method is based on remote sensing data of the optical spectrum range and machine learning algorithms;
  • The method allows revealing the stable relations between characteristics of corresponding spectral ranges and salinity parameters. In this sense, it is relatively stable.
An analysis of the literature shows that such an approach, which takes into account the long-term averaged data of the optical range of high spatial resolution, is being proposed for the first time.
The paper consists of the following sections:
  • The second section describes the research area, which serves as an example for testing the proposed model.
  • The third section provides a literature review describing the methods of soil salinity assessment based on RS data and machine learning.
  • In the fourth section, we describe the research methodology.
  • In the fifth section, we presented the results obtained.
  • In the sixth section, we discuss the results obtained.
We conclude with a description of the advantages and limitations of the method, as well as the anticipated objectives for future research.

2. Research Field

A large irrigation massif—“Hungry Steppe”—with a total area of about 10,000 km2, is located in the basin of the transboundary Syrdarya River. The river is part of the Aral Sea basin and ranks second in Central Asia (after the Amu Darya) in terms of annual runoff—23 km3. By territory, the main irrigated arable areas of this irrigation massif belong to Uzbekistan (upper part) and Kazakhstan (lower part). After the collapse of the USSR, the upstream hydrosystems with reservoirs, in particular Kayrakkum (4.2 km3; Tajikistan) and Toktogul (19.5 km3; Kyrgyzstan), changed their operation modes from irrigation to energy. Therefore, part of the river flow was used for power generation in the cold periods. This action significantly reduced the water potential in the irrigation season. In addition, after the collapse of the USSR, the processes of drainage system degradation were intensified, manifested in the siltation of drains and the abandonment of drainage wells. As a result, the developed process of secondary salinization of the “Hungry Steppe” arable land has taken place in recent decades.
There are about 140,000 hectares of irrigated arable land in the Kazakhstan sector of the “Hungry Steppe” irrigation array. The main crops are as follows: cotton, wheat, maize, melons, alfalfa, rice, etc. Relatively insignificant areas are allocated to rice crops. These arable lands administratively belong to two districts of the Turkestan district of Kazakhstan: Zhetysai and Maktaaral (see Figure 1). The latter is considered an object of application of the methodology for the estimation of the average multiyear salinity of lands, proposed in this article.
It should be noted that the task of assessing the secondary salinity of irrigated arable lands is complicated by the fact that we have to deal with different crops, crop rotations and agricultural practices. A significant role is played by seasonal weather conditions, air temperature, and water content of the river, which is the source of irrigation. Moreover, as a consequence, there is a great variety of soil surface conditions.

3. Related Works

Morphological diversity of soils (different depths and salinity of soil waters), different modes of economic use, and differences in chemistry and origin of certain types of salinity—all these factors complicate the task of predicting the degree of salinity. Soil salinity is measured both in the laboratory and in the field by assessing the salt content in the soil or the electrical conductivity of the water extract. The electrical conductivity of the water extract is more closely related to the negative effect of salinity on vegetation; therefore, it is used more often. Depending on the salt content in the soil, salinity is usually divided into five or six classes [26,27,28] (see Table 1). When divided into five classes, the “severely saline” and “extremely saline” classes are combined.
Field studies are labor-intensive, so their wide application is limited. During a ground survey, it is necessary to take soil samples from different depths (up to 1–2 m). Soil salinization is based on the physical process of fluid infiltration through porous media. The spatial feature of the processes of the three-dimensional percolation processes is a contrasting mosaic, manifested on the ground surface in the form of different-scale patchiness of local salinization zones [29]. Therefore, it is extremely difficult to conduct a detailed mapping of large areas of arable land on the basis of ground surveys alone.
To assess the level of soil salinity on a regional and global scale, remote sensing (RS) data, including aerial photography, satellite sensing, or UAV data are used [8,11,30].
One of the traditional approaches is the development of empirical models based on the correlation of the optical spectral indices and the soil electrical conductivity. The application of spectral indices reflecting the level of soil salinity in one way or another is widely used in regional studies. The literature devoted to the results of the application of spectral indices is extensive, and some idea of the variety of used indices and the reliability of modeling on their basis can be obtained from review papers [31,32,33,34,35]. Such models have different indicators of the correlation coefficient, which can sometimes be quite high, but the approach itself, as we have already noted, has limitations associated with its application on a local scale. In addition, the use of single satellite images, timed to the dates of ground-based measurements, does not provide an opportunity to assess the seasonal and multiyear dynamics of salinity. The authors of [36] note that “Surface salinity is a highly dynamic process, causing identification constraints derived from the proper behavior of the salt features, spectrally, spatially, and temporally” (see page 2). In addition, salt detection can also be blurred by the presence of vegetation and other surface features, which might contribute to creating spectral confusions with the salt reflectance properties (ibid.). These limitations make it necessary to develop methods for identifying and mapping saline soils using more complex, integrated approaches. One of the promising directions is the application of machine learning (ML) methods [37]. Machine learning technologies help to solve the nonlinear problem of the relationship between soil properties and environmental factors [38]. Among the many machine learning algorithms for solving the problem of estimating the soil salinity, the classical algorithms, boosting models, and random forest are most commonly used. For example, ref. [39] applied a complex calculation system that combined ground data, spectral indices, a regression model, and machine learning elements to map the salinity of sandy soils. The authors of [40] conducted a comparative study of several machine learning algorithms for the same set of ground measurements and showed that of the three used algorithms (gradient boost machine, X gradient boost and random forest), XGBoost finds the best result, despite the fact that the difference coefficient’s R2 between the three models is small and amounts to a few percent. Similar studies [41], which applied four machine learning algorithms (support vector machine, back propagation neural network, extreme learning machine, random forest) to calculate salinity from Sentinel-2 data, confirmed the similarity of the results of all algorithms used, with some differences in the calculation of salinity at different soil depths. A comparison of the least absolute shrinkage and selection operator (LASSO), multiple adaptive regression splines (MARS), classification and regression trees (CART), stochastic gradient treeboost (SGT), and random forest [42] models showed that SGT and random forest were the best models. Other models also have some drawbacks, such as negativity and outliers, binarization, an unreasonable range of values, and instability during multi-period predictions (large fluctuations in the range of soil salinity values).
The research [43] considers the basin of the Keriya River, northwest China. For this area, using the radar data (PALSAR and Radarsat-2) and the SVM classifier with optimal parameters, an accuracy of 93.01% was achieved. There are other examples of the application of machine learning models using radar data [23,44,45], optical data [46], or their combinations [5,47]. Table 2 lists the machine learning algorithms most commonly used for solving the problem of salinity estimation using satellite data.
The number of such researches is steadily growing, confirming the perspective and acceptable accuracy of the recognition and mapping of saline lands with the use of mixed methods of satellite data processing and, in particular, the perspective of machine learning for solving such a non-trivial task as the recognition of different types of salinity on different types of soils. At the same time, the listed methods of soil salinity assessment operate with operational data and are applied, as a rule, to relatively small areas of the earth’s surface at a certain time period.
This is especially true for methods using low-altitude multispectral images obtained using UAVs [69,70,71]. An exception is the method described in [55], which is aimed at constructing a global map. The authors used the SVM, CART, and random forest algorithms to develop a global salinity map on the Google Earth Engine platform. The authors claimed that the accuracy of mapping the various classes of saline soils reached 67–70%.
However, this method, as shown in Figure 2, has limited accuracy for marginal regions, such as Kazakhstan. Along with the location map (1), the figure shows changes in the forecast of the model developed by the authors of this work, based on XGBoost using Landsat 8 data of optical range (2), and the assessment of salinity using an RF model based on temperature data (hereafter Mtemp) [55] (3). To assess the salinity of the area marked in the figure on 23 May 2022, 80 samples of soil were collected, and several regression models were built; the best of these models was XGBoost (R2 = 0.54). ② in Figure 2 shows the model estimates using the satellite data from 1, 9, 10, 17, and 26 April 2022. It should be noted that the model predictions from 9 and 10 April are distorted by cloudy weather.
It can be seen that the predictions of the XGBoost model vary significantly even within one month and that the XGBoost model and the Mtemp model estimate salinity differently, with the Mtemp model predicting only two levels of soil salinity, which is not consistent with field survey data. In other words, for extensive arid zones with a sharp change in climatic conditions, which are the foothills and adjacent areas of southern Kazakhstan, the methods proposed in the literature can be applied in relatively localized areas with similar conditions, or they will have low accuracy. This is due to seasonal fluctuations in humidity, long-term changes in the water content of the seasons [72,73], and changes in irrigation. As a result, soil salinity is often unstable with active processes of salinization–desalination [17].
To identify statistically stable patterns of soil salinization, the authors of this paper propose to use the data of multiyear observations. It should be noted that the number of available ground measurements in Kazakhstan is still not great. For example, in [55], the data from the ground-based measurements of electrical conductivity in the territory of the Republic of Kazakhstan are absent. Therefore, it is reasonable to use the statistical data of the Ministry of Agriculture (MA) of the country, which contains information on the percentage of saline lands and develops a method that can reveal the distribution of salinity in the southern regions of Kazakhstan taking into account these data.

4. Method

The proposed method is based on multiyear values of vegetation and salinity indices and statistical data on salinity provided by the Ministry of Agriculture (MA) of Kazakhstan. To verify the method, the available data from expert assessments of salinity of large areas of cultivated lands in southern Kazakhstan are used. The methodological scheme of the study includes the steps below (see Figure 3).

4.1. Building a Pseudo-Color Map

The effect of arable land salinity on the spectral characteristics of the underlying surface is mainly associated with the depression of agricultural vegetation and possible salt outcrops (salt crusts) during the period of maximum salinization of the surface soil layer at the end of the growing season. The quantitative characteristics of these phenomena vary greatly from year to year, which is caused by variations in weather and hydrological conditions in individual seasons. More objective characteristics can be obtained as a result of the processing of the multiyear data and the transition to average long-term characteristics. In this case, the systematic effect of salinity will be reflected in quantitative characteristics.
In the case of low-dimensional (3- or 4-dimensional) phase spaces, which can be encoded in chrominance components, such as red-green-blue (RGB formula) or cyan-magenta-yellow-black (CMYK formula), it is possible to combine machine processing with expert analysis since the feature space is additionally encoded in a color image. In this case, the expert has the opportunity to choose the options for numerical processing and, as a first approximation, to control their effectiveness. In this study, a 3-dimensional feature space was used, in which 2 independent parameters were assigned to vegetation characteristics and one to the degree of salinity of the soil surface.
Monitoring of the satellite vegetation index values, for example, NDVI [74,75], during the vegetation season gives a fairly complete picture of the state of vegetation:
N D V I = n i r r e d r e d + n i r
Parameterization of the vegetative growth curve with two independent variables automatically leads to the use of its two main points. The first is the maximum value of N D V I m a x on the graph: [calendar dates]—[NDVI]. The second is the area that the vegetation curve cuts off on its graph. If the same time period is always considered, for example, May–August, the area cut off by the vegetation curve is uniquely approximated by the mean value, N D V I m e a n .
There are a large number of different salinity indices using satellite channels in the optical part of the spectrum (see, for example, [76]). They are most effective in the case of open soil, i.e., when the interfering (shielding) influence of green vegetation and mortuary mass is minimal. In the case of agricultural vegetation, open soil covers are typical for the sowing period in spring and in autumn after harvesting. For irrigated arable land, the maximum salinity is timed to the end of the growing season. The separate satellite scenes in the autumn period reflect fields in different phases. There are fields with ripening crops, either after harvesting or even after raising the seedbed. This condition is typical and related to the limited agricultural equipment, inability to sow the entire area in a limited time, as well as due to the use of different varieties and crops that have their own maturity.
The variety of soil surface conditions in the fields in the autumn period simultaneously presented on a separate satellite scene leads to the impossibility of uniform processing and interpretation of the results of estimating the salinity degree by means of salinity indices. In the case of multiyear data processing, the maximum values of the salinity index seem to be the most informative ones. The maximum values appear at the minimization of disturbing factors, and they characterize the underlying surface in the most uniform way in the analysis of multiyear data. The type of chosen salinity index can be different. Selecting the best index requires a separate study. In this study, the well-known VSSI salinity index [77] was chosen:
V S S I = 2 g r e e n 5 r e d + n i r
Thus, three parameters were chosen to form a three-dimensional phase space with its own color image in the form of a composite pseudo-color image reflecting the peculiarities of vegetation dynamics and soil salinity: N D V I m a x ; N D V I m e a n ; V S S I m a x . The following parameters were used to create the corresponding color formula, namely red-green-blue, to build the visual image so that the maximum of the VSSI index is fixed in the red channel at each pixel position:
V S S I m a x = m a x V S S I 1 , V S S I 2 , , V S S I i , , V S S I n
The NDVI maximum is in the green channel:
N D V I m a x = m a x n i r 1 r e d 1 r e d 1 + n i r 1 , n i r 2 r e d 2 r e d 2 + n i r 2 , , n i r n r e d n r e d n + n i r n
The average NDVI value is in the blue channel:
N D V I m e a n = 1 n i = 1 n n i r i r e d i r e d i + n i r i
where 1 i n , and n is the number of Sentinel-2 satellite images annually from May to September, starting in 2016 and ending in 2022.
Since the purpose of the analysis was to rank the territory, in other words, a comparative analysis, the absolute values of the selected indices were not important. Therefore, the variability of each index was encoded by a byte of information (256 discrete values), and histogram processing of each channel was applied to form the most information-rich, from the point of view of an expert, visual image (see Figure 4).

4.2. Sampling of Local Zones with a Simple Spatial Organization of the Target Feature

Gradient fragments are used to construct a 5-class salinity classification according to [50]. Simple linear gradients determine the rank order of composite classes (colors) from non-salinity to very strong salinity (expert selection of gradient). The set of gradient fragments, through the centers of gravity of each class (color), forms the universal rank order (the ranks may vary slightly in each fragment) (see Figure 5).

4.3. Formation of a 5-Class Representation of Salinity

The created rank order of classes (colors) of the pseudo-color composite is used to construct a 5-class coarsening. Grouping of ranked colors into 5 classes attributed to 5 salinity classes is carried out. As a result, the formation of a 5-class salinity representation on the basis of a pseudo-color composite (see Figure 6) takes place.

4.4. Development and Training of the Classifier

When training the classifiers, a pseudo-color composite is used as the input data, and the salinity map corresponding to it is used as the target value (see Figure 7). During data preparation, the surface area is divided into fragments of 4 × 4 pixels, which forms the input vector of values 4 × 4 × 3, where 3 is the number of color channels (RGB). The target image is also fragmented into similar 4 × 4 areas, each of which is set to one of 6 colors. The color of the section is selected according to the maximum criterion. That is, for example, if the selected target fragment has 4 pixels of yellow, 2 pixels of green, and 10 of blue, then this fragment is colored completely in blue—there is no salinity.

4.5. Applying the Classifier to Specified Areas of the Earth’s Surface

The classifier model selected according to the maximum harmonic measure criterion (F1 score) is used to estimate the salinity of the entire significant area on the basis of its pseudo-color composite (see Figure 8).
The purpose of mapping is a numerical assessment of the relative state of the salinity of rural districts (or other fragments) in the area under consideration. The resulting map of the salinity of the district is analyzed for the balance of areas of salinity classes. It is estimated how close it is to the official data of the Ministry of Agriculture. Therefore, indirect validation of the results is performed. The above steps can be carried out repeatedly, so that the initial 5-class division is varied, and new training and validation is undertaken. The repetition of action continues until a given error level is obtained in the test section. Thus, the issue of the information content of the initial pseudo-color composite in the task of constructing a map of the average long-term salinity of arable land is solved.
To perform computational experiments, a software system was developed in Python using the numpy, sklearn, matplotlib, cv2, alive_progress, pickle, and tensorflow libraries, which provides the following:
  • Cutting out part of the image;
  • Segmentation of images by a given number of elements;
  • Connecting image segments to form the final image;
  • Converting the image fragments to one color;
  • Color adjustment, which is necessary to exclude halftone pixels;
  • Saving the prepared data sets and performing some additional functions related to obtaining the results and the application of machine learning models;
  • Application of machine learning models.
Computational experiments were performed on a computer equipped with 32 GB of RAM, an Intel (R) Core (TM) i7-10750H processor, and a discrete video card, Nvidia GeForce GTX 1650 Ti.
The results of the machine learning models in this case were evaluated using the confusion matrix and the indicators of accuracy (Ac), precision (Precision), completeness (Recall) and harmonic measure (F1-Score):
A c = N t N
where N t is the number of correct answers, and N is the total number of possible answers of the model.
Precision:
P = T P T P + F P
Recall:
R = T P T P + F N
F1 score:
F 1 s c o r e = 2 P R P + R
where true positive (TP) and true negative (TN) are the cases of correct operation of the classifier, in other words, when the predicted class coincides with the expected one. Correspondingly, false negative (FN) and false positive (FP) are cases of misclassification.
It should be noted that when classifying the unbalanced datasets, when there are significantly more or fewer objects of some classes than others, it is possible to use two options for estimating the basic indicators. The first one, called the macro-average, assumes that each class, regardless of the number of its members, is equivalent in weight to the overall score of the indicator. This means that the quality metric is calculated within the objects of each class and then averaged over all classes. The second version of averaging, the micro-average, assumes that objects in all classes contribute equally to the quality metrics. Since it is important for us to consider the contribution of small classes, the macro-average will give a more objective estimate.

5. Results

The resulting pseudo-color composite is shown in Figure 8. A portion of this composite (40,000 pixels) was used to train and test machine learning models. In the process of conducting the computational experiments, nine machine learning models were used. Table 3 shows the results of the classifiers.
As can be seen from the table, the best value of the class-averaged value of the harmonic measure is presented by the models based on convolution neural network and XGBoost classifier.
The XGBoost classifier model has the regularization parameters reg_lambda = 4, reg_alpha = 0.1 and a maximum tree depth of max_depth = 12.
The convolutional network contains three convolutional layers, two maxpooling layers, and a fully connected neural network of two output layers. The total number of layers including flatten and dropout layers is 12. The total number of trainable params is 724,230. The program code of the model and the graph illustrating the learning process of the network are given in Appendix A.
The detailed results of the XGBoost classifier (XGB) and convolution neural network (CNN) obtained using the classification_report utility are shown in Figure 9.
Comparison of the models’ performance (Figure 9) shows that the convolutional network is better at classifying non-saline (class 0) and very saline (class 4) surface areas. At the same time, both algorithms make more frequent errors in the region of extreme values, which may be due to the fact that there are fewer such data than average values.

6. Discussion

The obtained results show that the machine learning models perform the classification of salinity based on the pseudo-color composite quite accurately. However, a full evaluation of the method, including the construction of the composite, is not a trivial matter. To perform this, it is necessary to compare the obtained result with ground-based salinity estimates made by an independent method. Such a comparison of the current salinity with the long-term result will be approximate due to the statistical nature of the pseudo-color composite construction. To evaluate the results of this comparison, in addition to using the accuracy metric in the standard interpretation, an additional metric of “rough” errors (RE) was used, which shows the number of serious errors to the total number of responses excluding background pixels. To calculate RE, the class values are ordered from 0 (no salinity) to 4 (extreme salinity). In addition, an additional class number (5) is allocated to the background values (white is the unclassifiable zone). A classification error is considered serious if the predicted value of the class differs from the expected value by more than 1 up or down. For example, if the predicted value is 1—low salinity, but the real value is 0 or 2, in this case it is considered an error. If the predicted value is 1 and the real value is 3, this situation is considered a serious (“rough”) error. Accordingly,
A c R E = 1 R E
To evaluate the method in this paper, we compared the obtained results with expert estimates and laboratory studies of soil salinity.

6.1. Comparison with Expert Evaluation

The trained model was applied to assess the salinity of one of the sites in the Maktaaral district of the Kyzylorda region. This area is characterized by a pseudo-color composite (see Figure 10a). To verify the obtained results, a salinity map of this area, created within the framework of a survey on the assessment of the salinity of rural areas of the Maktaaral district, carried out by the South Kazakhstan Hydrogeological and Meliorative Expedition of the Committee on Water Resources of the Ministry of Agriculture of the Republic of Kazakhstan in 2019, was used. Each field was evaluated by the expedition’s experts as one of five salinity levels in whole or in large fragments (see Figure 10b).
The map proposed in Figure 10a is a multiyear average salinity estimate for the period of 2018–2022. Accurate assessment of the reliability of the developed map requires many years of laboratory measurements. It is impossible to obtain such data due to the absence of relevant archives. Therefore, it is reasonable to evaluate the accuracy of the obtained results using a special metric. The metric should evaluate serious errors of the model by comparing expert estimates (ground measurements) of salinity and model predictions.
Figure 11 shows the result of applying the pre-trained model to classify the specified area of the map using the pseudo-color compository.
In Figure 11b, the black dots indicate “rough” classification errors, i.e., when the value predicted by the classifier differs by more than one class up or down from the expert estimate shown in Figure 10b. Of the total number of the 5239 significant pixels (not background), the gross errors are 1175. That is, RE = 22.4%, or A c R E = 0.78 . The total number of errors is 3506, or 66.9% (see Figure 11c). It should be noted that hydrogeology experts of the meliorative expedition perform an estimation at once, and it is approximate, as a whole, for the big areas of a field based on its visible condition at the moment of estimation. However, the fields often have a patchy mosaic structure of salinity. It should also be taken into account that the simulation results refer to a multiyear period. Under these conditions, the method shows very high correspondence with one-time expert assessments.

6.2. Comparison with Laboratory Conductivity Measurements

In June 2022, the ground surveys of salinity were carried out during a field expedition. Surface soil samples were collected in a localized area during the expedition.
In total, 75 samples were collected from a depth of up to 30 cm. Each sample was processed in the standard way in the following order: drying, sieving to remove insoluble fractions, mixing with water in the proportion of 1:5, settling the solution for at least 24 h, and measuring the electrical conductivity using Hanna GroLine HI9814. The results of laboratory conductivity measurements are given in Appendix B. The results of the comparison of the long-term salinization map and the performed electrical conductivity measurements are shown in Figure 12.
Part of the samples (9) were received from sites where modeling data are missing (white background). The number of gross errors is 19 (28%, A c R E = 0.72 ); in other words, the laboratory measurements show approximately the same correspondence to the forecast as the expert estimates shown in Figure 10. At the same time, the main gross errors appear where the method predicts low salinity.

6.3. Long-Term Salinity Map of the Maktaaral District and Method Restrictions

This indicates both the stability of the salinization processes in the area under consideration and the stability of the method itself, which allows revealing the long-term regularities of salinization processes in the cultivated arable land. The resulting long-term salinity map of the Maktaaral District is shown in Figure 13.
The obtained results allow drawing the following conclusions:
  • The method based on taking into account the long-term salinization patterns demonstrates good correspondence to the actual salinization of fields in the Maktaaral region.
  • Salinization in this area has a stable nature. The salinization pattern is repeated year after year with slight variations in the level of electrical conductivity.
  • Although we have obtained a much more detailed salinity map, it is necessary to note the limitations of the proposed method.
  • The method allows us to construct a map of long-term salinity trends, but its compliance with the current state of the field will not be complete due to both weather anomalies and long-term climate changes.
  • Accurate verification of the obtained result is generally difficult. It requires many years of work on salinity assessment.
  • The spatial resolution of the map is limited by the resolution of satellite images, which may not be sufficient in cases of small-scale salinity.

7. Conclusions

As a result of the performed research, a method of estimating the salinity of the cultivated soil areas, based on the account of long-term statistically stable states of the earth surface, is offered.
The method comprises two stages. In the first stage, a pseudo-colour model of a local area of the earth’s surface is built. Then, it is used to tune the machine learning models. The quality of the machine learning models is assessed quantitatively using the standard metrics. The obtained estimates are presented in Table 2. The tuned model is then used for salinity mapping.
The method is verified by comparing the resulting map with expert estimates obtained in 2019 and laboratory measurements of soil samples obtained in 2022.
Comparison of the obtained result with the data available to the authors from ground assessments of fields’ salinity in the Maktaaral district of Kyzylorda region (2019) shows rather good agreement between the assessments of specialist hydrogeologists of the meliorative expedition and the model conclusions (RE = 22%, or A c R E = 0.78 ).
Comparison of the results with the employment of laboratory studies of electrical conductivity (2022) showed a slightly worse result (RE = 28% or A c R E = 0.72 ).
The performed double verification demonstrates the stability of the method. This allows extending the model from a local area to the entire Maktaaral region, where the conditions of irrigated agriculture are approximately the same. In addition, verification estimates allow expecting positive results in other arid zones where irrigated agriculture is used.
The proposed method for assessing the salinity of cultivated lands has the following advantages:
  • Relatively high resistance to annual fluctuations in weather conditions;
  • The possibility of extrapolating the results obtained on the relatively small areas of the earth’s surface to large areas and regions;
  • The gradient of the relief, the amount of precipitation (water availability of the year) and the state of the irrigation system are indirectly taken into account in the process of constructing a pseudo-color composite and subsequent training of the machine learning.
The main limitations of the method are as follows:
  • Significant weather changes and the conditions of a single year can make serious adjustments to the salinity of certain areas of the earth’s surface. Such rapid changes cannot be captured using the proposed method;
  • The pseudo-color composite, which is formed at the Nth step of the method, essentially depends on the conditions in a particular area of the earth’s surface and requires a significant share of manual labor.
Future Research:
  • Improving the accuracy of the method by using a wider spectral range of remote sensing data, including radar;
  • Increasing the coverage of the method by collecting more field data on soil electrical conductivity;
  • Increasing the spatial accuracy of the method.

Author Contributions

Conceptualization, A.T. and R.I.M.; methodology, R.I.M. and A.T.; software, R.I.M. and G.S.; validation, E.M., Y.K. and A.S.; investigation, R.I.M., A.T., G.S. and P.G.; resources, R.I.M., G.S. and A.T.; data curation, G.S., E.M., P.G. and Y.K.; writing—original draft preparation, R.I.M., A.T., D.M. and Y.K.; writing—review and editing, Y.P., Y.K. and A.S.; visualization, R.I.M., E.M. and G.S.; supervision, R.I.M.; project administration, Y.A.; funding acquisition, Y.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Committee of Science of the Ministry of Science and Higher Education of the Republic of Kazakhstan under Grants: AP23488745 “Rapid assessment of soil salinity using low-altitude unmanned aerial platforms (RASS)”, AP14869972 “Development and adaptation of computer vision and machine learning methods for solving precision agriculture problems using unmanned aerial systems”, BR24992908 “Support system for agricultural crop production optimization via remote monitoring and artificial intelligence methods (Agroscope)”, BR21881908 “Complex of urban ecological support (CUES)”, BR18574144 “Development of a data mining system for monitoring dams and other engineering structures under technogenic and natural impacts”, and BR10965172 “Space monitoring and GIS for quantitative assessment of soil salinity and degradation of agricultural lands in South Kazakhstan”.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Parameters of Convolution Neural Network

Appendix A.1. Fragment of Code That Creates a Convolution Neural Network

def model_CNN(optimizer=‘adam’, flt1=32,flt2=64,flt3=128):
# Define the model
model = Sequential()
model.add(Conv2D(flt1, kernel_size=(2, 2), activation=‘relu’,padding=‘same’, input_shape=(4, 4, 3)))
model.add(MaxPooling2D(pool_size=(2, 2),padding=‘same’))
model.add(Dropout(0.25))
model.add(Conv2D(flt2, kernel_size=(2, 2), activation=‘relu’,padding=‘same’))
model.add(MaxPooling2D(pool_size=(2, 2),padding=‘same’))
model.add(Dropout(0.25))
model.add(Conv2D(flt3, kernel_size=(2, 2), activation=‘relu’,padding=‘same’))
model.add(Dropout(0.25))
model.add(Flatten()
model.add(Dense(128, activation=‘relu’))
model.add(Dropout(0.2))
model.add(Dense(6, activation=‘softmax’))
# Compile the model
model.compile(loss=‘categorical_crossentropy’,
optimizer=optimizer,
metrics=[‘accuracy’])
return model

Appendix A.2. Model Params

Table A1. Model: “sequential_41”.
Table A1. Model: “sequential_41”.
Layer (Type)Output ShapeParam #
conv2d_91 (Conv2D)(None, 4, 4, 128)1664
max_pooling2d_67 (MaxPooling)(None, 2, 2, 128)0
dropout_125 (Dropout)(None, 2, 2, 128)0
conv2d_92 (Conv2D)(None, 2, 2, 256)131,328
max_pooling2d_68 (MaxPooling)(None, 1, 1, 256)0
dropout_126 (Dropout)(None, 1, 1, 256)0
conv2d_93 (Conv2D)(None, 1, 1, 512)524,800
dropout_127 (Dropout)(None, 1, 1, 512)0
flatten_37 (Flatten)(None, 512)0
dense_77 (Dense)(None, 128)65,664
dropout_128 (Dropout)(None, 128)0
dense_78 (Dense)(None, 6)774
Total params: 724,230; trainable params: 724,230; non-trainable params: 0. # means number.
Figure A1. Convolution neural network training process. # means number.
Figure A1. Convolution neural network training process. # means number.
Agronomy 14 02103 g0a1

Appendix B. Results of Laboratory Testing of Soil Samples

Table A2. Results of laboratory testing of soil samples.
Table A2. Results of laboratory testing of soil samples.
NUMLatitudeLongitudeConductivity mS/cm
141.01086368.1648341.75
241.01094668.1645591.37
341.0111268.1640531.02
441.01129868.1635831.38
541.01142368.1632681.8
641.01156668.1628951.39
741.01156768.1628961.39
841.011368.1627060.98
941.01117468.1630651.44
1041.01100768.1635241.64
1141.01071868.164241.92
1241.01054268.1647180.45
1341.01024768.1645720.68
1441.01042268.1641150.41
1541.01063168.1635520.71
1641.01087768.1629110.64
1741.01112568.1623490.77
1841.0113568.1617550.73
1941.01153868.1612471.29
2041.0118268.1604650.97
2141.01150868.1602330.92
2241.01130868.1607040.78
2341.01113368.1611490.96
2441.01093668.1616650.73
2541.01076568.1620890.82
2641.01061668.162480.76
2741.01040968.1630391.01
2841.01019968.1635910.79
2941.01001268.1641081.26
3041.0093368.1643541.09
3141.00948368.1639590.92
3241.0096768.1634910.72
3341.00989568.1629940.47
3441.01006468.1625690.52
3541.01029868.1619570.82
3641.01050168.161450.87
3741.01078868.1607322.18
3841.01097468.1602251.86
3941.01120168.1596591.29
4041.01139868.1591680.77
4141.01157568.1586961.75
4241.01110668.1583551.43
4341.01090468.1588221.57
4441.0106868.1593641.4
4541.01051468.1597721.66
4641.01039668.1601351.2
4741.01019868.1606251.37
4841.01001368.1610450.8
4941.00982968.1615181.5
5041.00965568.1619541.43
5141.00950868.1623371.13
5241.00933768.1627740.97
5341.00914768.1632550.94
5441.00894968.1637631.62
5541.0087868.1642051.05
5641.01125968.1650371.5
5741.01133668.1647972.06
5841.01144868.1644921.47
5941.01153868.1641981.81
6041.01160568.1639892.86
6141.01171568.1636472.86
6241.01183568.1632493.12
6341.0119968.1628452.32
6441.01187668.1627174.73
6541.01173468.1631424.2
6641.01155768.1636282.26
6741.01140868.1640823.33
6841.01130468.1644522.4
6941.01114568.1649151.41
7041.01091268.1650893.3
7141.01050668.1649456.57
7241.01126768.1659341.21
7341.01116568.1663540.84
7441.01107768.1667351.23
7541.0109568.1672640.91

References

  1. Bernstein, L. Crop growth and salinity. Drain. Agric. 1974, 17, 39–54. [Google Scholar]
  2. Qadir, M.; Quillérou, E.; Nangia, V.; Murtaza, G.; Singh, M.; Thomas, R.J.; Drechsel, P.; Noble, A.D. Economics of salt-induced land degradation and restoration. Nat. Resour. Forum 2014, 38, 282–295. [Google Scholar] [CrossRef]
  3. Hossain, M.S. Present scenario of global salt affected soils, its management and importance of salinity research. Int. Res. J. Biol. Sci. 2019, 1, 1–3. [Google Scholar]
  4. Li, X.; Wang, Z.; Song, K.; Zhang, B.; Liu, D.; Guo, Z. Assessment for salinized wasteland expansion and land use change using GIS and remote sensing in the west part of Northeast China. Environ. Monit. Assess. 2007, 131, 421–437. [Google Scholar] [CrossRef]
  5. Muhetaer, N.; Nurmemet, I.; Abulaiti, A.; Xiao, S.; Zhao, J. An efficient approach for inverting the soil salinity in Keriya Oasis, northwestern China, based on the optical-radar feature-space model. Sensors 2022, 22, 7226. [Google Scholar] [CrossRef] [PubMed]
  6. Kitamura, Y.; Yano, T.; Honna, T.; Yamamoto, S.; Inosako, K. Causes of farmland salinization and remedial measures in the Aral Sea basin—Research on water management to prevent secondary salinization in rice-based cropping system in arid land. Agric. Water Manag. 2006, 85, 1–14. [Google Scholar] [CrossRef]
  7. Duan, Y.; Ma, L.; Abuduwaili, J.; Liu, W.; Saparov, G.; Smanov, Z. Driving factor identification for the spatial distribution of soil salinity in the irrigation area of the Syr Darya river, Kazakhstan. Agronomy 2022, 12, 1912. [Google Scholar] [CrossRef]
  8. Панкoва, Е.И.; Мазикoв, В.М.; Исаев, В.А.; Джамнoва, И.А. Опыт испoльзoвания аэрoфoтoснимкoв для характеристики засoленнoсти пoчв серoземных пoливных участкoв (Experience in the use of aerial photographs for the characteristics of soil salinity rainfed areas serozem area). Пoчвoведение 1978, 3, 82–85. [Google Scholar]
  9. Мамедoв, Э. Изучение засoленных земель и сoлoнчакoв с пoмoщью кoсмических метoдoв (Study of saline lands and salt marshes with the help of space methods). Исследoвание Земли Кoсмoса 1985, 1, 60–61. [Google Scholar]
  10. Singh, A.; Dwivedi, R. Delineation of salt-affected soils through digital analysis of Landsat MSS data. Remote Sens. 1989, 10, 83–92. [Google Scholar] [CrossRef]
  11. Allbed, A.; Kumar, L. Soil salinity mapping and monitoring in arid and semi-arid regions using remote sensing technology: A review. Adv. Remote Sens. 2013, 2013, 373–385. [Google Scholar] [CrossRef]
  12. Li, Y. Research Progress of Remote Sensing Monitoring of Soil Salinization. IOP Conf. Ser. Earth Environ. Sci. 2021, 692, 042007. [Google Scholar] [CrossRef]
  13. Abbas, A.; Khan, S.; Hussain, N.; Hanjra, M.A.; Akbar, S. Characterizing soil salinity in irrigated agriculture using a remote sensing approach. Phys. Chem. Earth Parts A/B/C 2013, 55, 43–52. [Google Scholar] [CrossRef]
  14. Bannari, A.; Guédon, A.; El-Ghmari, A. Mapping slight and moderate saline soils in irrigated agricultural land using advanced land imager sensor (EO-1) data and semi-empirical models. Commun. Soil Sci. Plant Anal. 2016, 47, 1883–1906. [Google Scholar] [CrossRef]
  15. Scudiero, E.; Corwin, D.L.; Anderson, R.G.; Yemoto, K.; Clary, W.; Wang, Z.; Skaggs, T.H. Remote sensing is a viable tool for mapping soil salinity in agricultural lands. Calif. Agric. 2017, 71, 231–238. [Google Scholar] [CrossRef]
  16. Wang, N.; Peng, J.; Xue, J.; Zhang, X.; Huang, J.; Biswas, A.; He, Y.; Shi, Z. A framework for determining the total salt content of soil profiles using time-series Sentinel-2 images and a random forest-temporal convolution network. Geoderma 2022, 409, 115–126. [Google Scholar] [CrossRef]
  17. Rukhovich, D.; Pankova, E.; Chernousenko, G.; Koroleva, P. Long-term salinization dynamics in irrigated soils of the Golodnaya Steppe and methods of their assessment on the basis of remote sensing data. Eurasian Soil Sci. 2010, 43, 682–692. [Google Scholar] [CrossRef]
  18. Lobell, D.; Lesch, S.; Corwin, D.; Ulmer, M.; Anderson, K.; Potts, D.; Doolittle, J.; Matos, M.; Baltes, M. Regional-scale assessment of soil salinity in the Red River Valley using multi-year MODIS EVI and NDVI. J. Environ. Qual. 2010, 39, 35–41. [Google Scholar] [CrossRef]
  19. Laiskhanov, S.U.; Otarov, A.; Savin, I.Y.; Tanirbergenov, S.I.; Mamutov, Z.U.; Duisekov, S.N.; Zhogolev, A. Dynamics of Soil Salinity in Irrigation Areas in South Kazakhstan. Pol. J. Environ. Stud. 2016, 25, 2469–2475. [Google Scholar] [CrossRef]
  20. Wei, G.; Li, Y.; Zhang, Z.; Chen, Y.; Chen, J.; Yao, Z.; Lao, C.; Chen, H. Estimation of soil salt content by combining UAV-borne multispectral sensor and machine learning algorithms. PeerJ 2020, 8, e9087. [Google Scholar] [CrossRef]
  21. Guan, Y.; Grote, K.; Schott, J.; Leverett, K. Prediction of soil water content and electrical conductivity using random forest methods with UAV multispectral and ground-coupled geophysical data. Remote Sens. 2022, 14, 1023. [Google Scholar] [CrossRef]
  22. Chen, B.; Zheng, H.; Luo, G.; Chen, C.; Bao, A.; Liu, T.; Chen, X. Adaptive estimation of multi-regional soil salinization using extreme gradient boosting with Bayesian TPE optimization. Int. J. Remote Sens. 2022, 43, 778–811. [Google Scholar] [CrossRef]
  23. Hoa, P.V.; Giang, N.V.; Binh, N.A.; Hai, L.V.H.; Pham, T.-D.; Hasanlou, M.; Tien Bui, D. Soil salinity mapping using SAR sentinel-1 data and advanced machine learning algorithms: A case study at Ben Tre Province of the Mekong River Delta (Vietnam). Remote Sens. 2019, 11, 128. [Google Scholar] [CrossRef]
  24. Fan, X.; Weng, Y.; Tao, J. Towards decadal soil salinity mapping using Landsat time series data. Int. J. Appl. Earth Obs. Geoinf. 2016, 52, 32–41. [Google Scholar] [CrossRef]
  25. Терехoв, А.; Сагатдинoва, Г.; Мурзабаев, Б. Принципы региoнальнoй oценки мнoгoлетней засoленнoсти пашни в Казахстанскoм сектoре дoлины реки Сырдарьи пo данным MODIS (Principles of regional assessment of perennial salinity of arable land in the Kazakhstan sector of the Syrdarya river valley based on MODIS data). Curr. Probl. Remote Sens. Earth Space 2022, 19, 169–179. [Google Scholar]
  26. Richards, L.A. Diagnosis and Improvement of Saline and Alkali Soils; US Government Printing Office: Washington, DC, USA, 1954.
  27. Measuring Soil Salinity. Available online: https://www.agric.wa.gov.au/soil-salinity/measuring-soil-salinity (accessed on 28 April 2023).
  28. Кузнецoв, В.В.; Дмитриева, Г.А. Физиoлoгия Растений (Plant Physiology). 2016, 742. Textbook for the academic Bachelor’s Degree, 4th ed.; revised and supplemented—M.: Publishing House Yurait: Ussuriisk, Russia, 2016; Volume 1 437 p., Volume 2 459 p., UMO and VO stamp; ISBN 978-5-9916-5644-3; 978-5-9916-5645-0. [Google Scholar]
  29. Isichenko, M.B. Percolation, statistical topography, and transport in random media. Rev. Mod. Phys. 1992, 64, 961. [Google Scholar] [CrossRef]
  30. Ivushkin, K.; Bartholomeus, H.; Bregt, A.K.; Pulatov, A.; Franceschini, M.H.; Kramer, H.; van Loo, E.N.; Roman, V.J.; Finkers, R. UAV based soil salinity assessment of cropland. Geoderma 2019, 338, 502–512. [Google Scholar] [CrossRef]
  31. Панкoва, Е.; Кoнюшкoва, М.; Гoрoхoва, И. О прoблеме oценки засoленнoсти пoчв и метoдике крупнoмасштабнoгo цифрoвoгo картoграфирoвания засoленных пoчв (On the problem of soil salinity assessment and methodology of large-scale digital mapping of saline soils). Экoсистемы Экoлoгия Динамика 2017, 1, 26–54. [Google Scholar]
  32. Elhag, M. Evaluation of different soil salinity mapping using remote sensing techniques in arid ecosystems, Saudi Arabia. J. Sens. 2016, 2016, 7596175. [Google Scholar] [CrossRef]
  33. Gorji, T.; Yıldırım, A.; Sertel, E.; Tanık, A. Remote sensing approaches and mapping methods for monitoring soil salinity under different climate regimes. Int. J. Environ. Geoinform. 2019, 6, 33–49. [Google Scholar] [CrossRef]
  34. Bannari, A.; Guedon, A.; El-Harti, A.; Cherkaoui, F.; El-Ghmari, A. Characterization of slightly and moderately saline and sodic soils in irrigated agricultural land using simulated data of advanced land imaging (EO-1) sensor. Commun. Soil Sci. Plant Anal. 2008, 39, 2795–2811. [Google Scholar] [CrossRef]
  35. Saleh, A.M. Evaluation of different soil salinity mapping using remote sensing indicators and regression techniques, Basrah, Iraq. J. Am. Sci. 2017, 13, 85–97. [Google Scholar]
  36. Metternicht, G.I.; Zinck, J. Remote sensing of soil salinity: Potentials and constraints. Remote Sens. Environ. 2003, 85, 1–20. [Google Scholar] [CrossRef]
  37. Mukhamediev, R.I.; Popova, Y.; Kuchin, Y.; Zaitseva, E.; Kalimoldayev, A.; Symagulov, A.; Levashenko, V.; Abdoldina, F.; Gopejenko, V.; Yakunin, K. Review of artificial intelligence and machine learning technologies: Classification, restrictions, opportunities and challenges. Mathematics 2022, 10, 2552. [Google Scholar] [CrossRef]
  38. Allbed, A.; Kumar, L.; Aldakheel, Y.Y. Assessing soil salinity using soil salinity and vegetation indices derived from IKONOS high-spatial resolution imageries: Applications in a date palm dominated region. Geoderma 2014, 230, 1–8. [Google Scholar] [CrossRef]
  39. Ngabire, M.; Wang, T.; Xue, X.; Liao, J.; Sahbeni, G.; Huang, C.; Duan, H.; Song, X. Soil salinization mapping across different sandy land-cover types in the Shiyang River Basin: A remote sensing and multiple linear regression approach. Remote Sens. Appl. Soc. Environ. 2022, 28, 100847. [Google Scholar] [CrossRef]
  40. Zarei, A.; Hasanlou, M.; Mahdianpari, M. A comparison of machine learning models for soil salinity estimation using multi-spectral earth observation data. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2021, 3, 257–263. [Google Scholar] [CrossRef]
  41. Chen, Y.; Qiu, Y.; Zhang, Z.; Zhang, J.; Chen, C.; Han, J.; Liu, D. Estimating salt content of vegetated soil at different depths with Sentinel-2 data. PeerJ 2020, 8, e10585. [Google Scholar] [CrossRef] [PubMed]
  42. Wang, F.; Yang, S.; Yang, W.; Yang, X.; Jianli, D. Comparison of machine learning algorithms for soil salinity predictions in three dryland oases located in Xinjiang Uyghur Autonomous Region (XJUAR) of China. Eur. J. Remote Sens. 2019, 52, 256–276. [Google Scholar] [CrossRef]
  43. Nurmemet, I.; Ghulam, A.; Tiyip, T.; Elkadiri, R.; Ding, J.-L.; Maimaitiyiming, M.; Abliz, A.; Sawut, M.; Zhang, F.; Abliz, A. Monitoring soil salinization in Keriya River Basin, Northwestern China using passive reflective and active microwave remote sensing data. Remote Sens. 2015, 7, 8803–8829. [Google Scholar] [CrossRef]
  44. Tripathi, A.; Tiwari, R.K. A simplified subsurface soil salinity estimation using synergy of SENTINEL-1 SAR and SENTINEL-2 multispectral satellite data, for early stages of wheat crop growth in Rupnagar, Punjab, India. Land Degrad. Dev. 2021, 32, 3905–3919. [Google Scholar] [CrossRef]
  45. Taghadosi, M.M.; Hasanlou, M.; Eftekhari, K. Soil salinity mapping using dual-polarized SAR Sentinel-1 imagery. Int. J. Remote Sens. 2019, 40, 237–252. [Google Scholar] [CrossRef]
  46. Wang, J.; Ding, J.; Yu, D.; Teng, D.; He, B.; Chen, X.; Ge, X.; Zhang, Z.; Wang, Y.; Yang, X. Machine learning-based detection of soil salinity in an arid desert region, Northwest China: A comparison between Landsat-8 OLI and Sentinel-2 MSI. Sci. Total Environ. 2020, 707, 136092. [Google Scholar] [CrossRef] [PubMed]
  47. Ma, G.; Ding, J.; Han, L.; Zhang, Z.; Ran, S. Digital mapping of soil salinization based on Sentinel-1 and Sentinel-2 data combined with machine learning algorithms. Reg. Sustain. 2021, 2, 177–188. [Google Scholar] [CrossRef]
  48. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  49. Wei, L.; Yuan, Z.; Yu, M.; Huang, C.; Cao, L. Estimation of arsenic content in soil based on laboratory and field reflectance spectroscopy. Sensors 2019, 19, 3904. [Google Scholar] [CrossRef]
  50. Mukhamediev, R.I.; Merembayev, T.; Kuchin, Y.; Malakhov, D.; Zaitseva, E.; Levashenko, V.; Popova, Y.; Symagulov, A.; Sagatdinova, G.; Amirgaliyev, Y. Soil Salinity Estimation for South Kazakhstan Based on SAR Sentinel-1 and Landsat-8, 9 OLI Data with Machine Learning Models. Remote Sens. 2023, 15, 4269. [Google Scholar] [CrossRef]
  51. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  52. Zhu, C.; Ding, J.; Zhang, Z.; Wang, Z. Exploring the potential of UAV hyperspectral image for estimating soil salinity: Effects of optimal band combination algorithm and random forest. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 279, 121416. [Google Scholar] [CrossRef]
  53. Hu, J.; Peng, J.; Zhou, Y.; Xu, D.; Zhao, R.; Jiang, Q.; Fu, T.; Wang, F.; Shi, Z. Quantitative estimation of soil salinity using UAV-borne hyperspectral and satellite multispectral images. Remote Sens. 2019, 11, 736. [Google Scholar] [CrossRef]
  54. Mohamed, S.A.; Metwaly, M.M.; Metwalli, M.R.; AbdelRahman, M.A.; Badreldin, N. Integrating active and passive remote sensing data for mapping soil salinity using machine learning and feature selection approaches in arid regions. Remote Sens. 2023, 15, 1751. [Google Scholar] [CrossRef]
  55. Ivushkin, K.; Bartholomeus, H.; Bregt, A.K.; Pulatov, A.; Kempen, B.; De Sousa, L. Global mapping of soil salinity change. Remote Sens. Environ. 2019, 231, 111260. [Google Scholar] [CrossRef]
  56. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  57. Wang, D.-Y.; Chen, H.-Y.; Wang, G.-F.; Cong, J.-Q.; Wang, X.-F.; Wei, X.-W. Salinity inversion of severe saline soil in the yellow river estuary based on UAV multi-spectra. Sci. Agric. Sin. 2019, 52, 1698–1709. [Google Scholar]
  58. Guan, X.; Wang, S.; Gao, Z.; Lv, Y. Dynamic prediction of soil salinization in an irrigation district based on the support vector machine. Math. Comput. Model. 2013, 58, 719–724. [Google Scholar] [CrossRef]
  59. Yu, H.-F.; Huang, F.-L.; Lin, C.-J. Dual coordinate descent methods for logistic regression and maximum entropy models. Mach. Learn. 2011, 85, 41–75. [Google Scholar] [CrossRef]
  60. Santosa, F.; Symes, W.W. Linear inversion of band-limited reflection seismograms. SIAM J. Sci. Stat. Comput. 1986, 7, 1307–1330. [Google Scholar] [CrossRef]
  61. Svozil, D.; Kvasnicka, V.; Pospichal, J. Introduction to multi-layer feed-forward neural networks. Chemom. Intell. Lab. Syst. 1997, 39, 43–62. [Google Scholar] [CrossRef]
  62. Nosair, A.M.; Shams, M.Y.; AbouElmagd, L.M.; Hassanein, A.E.; Fryar, A.E.; Abu Salem, H.S. Predictive model for progressive salinization in a coastal aquifer using artificial intelligence and hydrogeochemical techniques: A case study of the Nile Delta aquifer, Egypt. Environ. Sci. Pollut. Res. 2022, 29, 9318–9340. [Google Scholar] [CrossRef]
  63. Akramkhanov, A.; Vlek, P.L. The assessment of spatial distribution of soil salinity risk using neural network. Environ. Monit. Assess. 2012, 184, 2475–2485. [Google Scholar] [CrossRef]
  64. Yang, N.; Yang, S.; Cui, W.; Zhang, Z.; Zhang, J.; Chen, J.; Ma, Y.; Lao, C.; Song, Z.; Chen, Y. Effect of spring irrigation on soil salinity monitoring with UAV-borne multispectral sensor. Int. J. Remote Sens. 2021, 42, 8952–8978. [Google Scholar] [CrossRef]
  65. Cui, X.; Han, W.; Zhang, H.; Cui, J.; Ma, W.; Zhang, L.; Li, G. Estimating soil salinity under sunflower cover in the Hetao Irrigation District based on unmanned aerial vehicle remote sensing. Land Degrad. Dev. 2023, 34, 84–97. [Google Scholar] [CrossRef]
  66. Zhang, Z.; Niu, B.; Li, X.; Kang, X.; Hu, Z. Estimation and Dynamic Analysis of Soil Salinity Based on UAV and Sentinel-2A Multispectral Imagery in the Coastal Area, China. Land 2022, 11, 2307. [Google Scholar] [CrossRef]
  67. Shahabi, M.; Jafarzadeh, A.A.; Neyshabouri, M.R.; Ghorbani, M.A.; Valizadeh Kamran, K. Spatial modeling of soil salinity using multiple linear regression, ordinary kriging and artificial neural network methods. Arch. Agron. Soil Sci. 2017, 63, 151–160. [Google Scholar] [CrossRef]
  68. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
  69. Mukhamediev, R.; Amirgaliyev, Y.; Kuchin, Y.; Aubakirov, M.; Terekhov, A.; Merembayev, T.; Yelis, M.; Zaitseva, E.; Levashenko, V.; Popova, Y. Operational mapping of salinization areas in agricultural fields using machine learning models based on low-altitude multispectral images. Drones 2023, 7, 357. [Google Scholar] [CrossRef]
  70. Zhao, W.; Zhou, C.; Zhou, C.; Ma, H.; Wang, Z. Soil salinity inversion model of oasis in arid area based on UAV multispectral remote sensing. Remote Sens. 2022, 14, 1804. [Google Scholar] [CrossRef]
  71. Yu, X.; Chang, C.; Song, J.; Zhuge, Y.; Wang, A. Precise monitoring of soil salinity in China’s Yellow River Delta using UAV-borne multispectral imagery and a soil salinity retrieval index. Sensors 2022, 22, 546. [Google Scholar] [CrossRef]
  72. Абаев, Н.; Терехoв, А.; Тиллакарим, Т.; Елтай, А. Спутникoвый мoнитoринг малoвoдных сoстoяний в китайскoй части бассейна реки или в сезoне 2020 гoда (Satellite monitoring of low-water conditions in the Chinese part of the river basin or in the 2020 season). In Proceedings of the Всерoссийская Научнo-Практ. Кoнф. с Междунарoд. Участием «Трансграничные Вoдные Объекты: Испoльзoвание, Управление, Охрана», Sochi, Russia, 20–25 September 2021; Лик: Novocherkassk, Russia, 2021; pp. 13–17. [Google Scholar]
  73. Терехoв, А. Вoзмoжнoсти спутникoвых данных в прoблеме oценки сезoннoгo сoстoяния вoды в китайскoй части бассейна реки Или (Possibilities of satellite data in the problem of estimating seasonal water conditions in the Chinese part of the Ili River Basin). Гидрoметеoрoлoгия Экoлoгия 2019, 4, 184–189. [Google Scholar]
  74. Kriegler, F.J. Preprocessing transformations and their effects on multspectral recognition. In Proceedings of the Sixth International Symposium on Remote Sesning of Environment, Ann Arbor, MI, USA, 13–16 October 1969; pp. 97–131. [Google Scholar]
  75. Rouse, J.W.; Haas, R.H.; Schell, J.A.; Deering, D.W. Monitoring vegetation systems in the Great Plains with ERTS. NASA Spec. Publ. 1974, 351, 309. [Google Scholar]
  76. Kholdorov, S.; Gopakumar, L.; Katsura, K.; Jabbarov, Z.; Jobborov, O.; Shamsiddinov, T.; Khakimov, A. Soil salinity assessment research using remote sensing techniques: A special focus on recent research. IOP Conf. Ser. Earth Environ. Sci. 2022, 1068, 012037. [Google Scholar] [CrossRef]
  77. Nicolas, H.; Walter, C. Detecting salinity hazards within a semiarid context by means of combining soil and remote-sensing data. Geoderma 2006, 134, 217–230. [Google Scholar]
Figure 1. Maktaaral district of irrigated lands in the south of Kazakhstan.
Figure 1. Maktaaral district of irrigated lands in the south of Kazakhstan.
Agronomy 14 02103 g001
Figure 2. Using two models to estimate the salinity of a site.
Figure 2. Using two models to estimate the salinity of a site.
Agronomy 14 02103 g002
Figure 3. The principal steps of the salinity map construction. Blue- non-saline, green—slightly saline, yellow—moderately saline, red—highly saline, crimson- extremely saline.
Figure 3. The principal steps of the salinity map construction. Blue- non-saline, green—slightly saline, yellow—moderately saline, red—highly saline, crimson- extremely saline.
Agronomy 14 02103 g003
Figure 4. Pseudo-colored composite of one of the sites of Maktaaral district of Kyzylordin region.
Figure 4. Pseudo-colored composite of one of the sites of Maktaaral district of Kyzylordin region.
Agronomy 14 02103 g004
Figure 5. Gradient fragments.
Figure 5. Gradient fragments.
Agronomy 14 02103 g005
Figure 6. Correspondence of the composite map to the levels of soil salinity (Crimson—extreme salinity, red—very high salinity, yellow—strong salinity, green—weak salinity, blue—no salinity).
Figure 6. Correspondence of the composite map to the levels of soil salinity (Crimson—extreme salinity, red—very high salinity, yellow—strong salinity, green—weak salinity, blue—no salinity).
Agronomy 14 02103 g006
Figure 7. Initial pseudo-color image (a) and target classification by salinity levels (b), where crimson is extreme salinity, red is very high salinity, yellow is strong salinity, green is weak salinity, and blue is no salinity.
Figure 7. Initial pseudo-color image (a) and target classification by salinity levels (b), where crimson is extreme salinity, red is very high salinity, yellow is strong salinity, green is weak salinity, and blue is no salinity.
Agronomy 14 02103 g007
Figure 8. The position of the study area (left) and the original pseudo-color image (right).
Figure 8. The position of the study area (left) and the original pseudo-color image (right).
Agronomy 14 02103 g008
Figure 9. Classification results of the test image using XGB and CNN models.
Figure 9. Classification results of the test image using XGB and CNN models.
Agronomy 14 02103 g009
Figure 10. Pseudo-color composite and salinity map of one of the sites of Maktaaral district of the Kyzylorda region. (a) Pseudo-colored composite of Maktaaral district site. (b) Salinity map created by experts in the course of the field work.
Figure 10. Pseudo-color composite and salinity map of one of the sites of Maktaaral district of the Kyzylorda region. (a) Pseudo-colored composite of Maktaaral district site. (b) Salinity map created by experts in the course of the field work.
Agronomy 14 02103 g010
Figure 11. The result of applying the pre-trained XGBoost classifier model to the verified site of Maktaaral district. (a) The result of the XGBoost Classifier. (b) Comparison of model results and expert assessments. The black dots indicate “rough” classification errors. (c) Comparison of model results and expert assessments. The black dots indicate all classification errors.
Figure 11. The result of applying the pre-trained XGBoost classifier model to the verified site of Maktaaral district. (a) The result of the XGBoost Classifier. (b) Comparison of model results and expert assessments. The black dots indicate “rough” classification errors. (c) Comparison of model results and expert assessments. The black dots indicate all classification errors.
Agronomy 14 02103 g011
Figure 12. Correspondence of ground-based salinity research to the predicted values in a local area of the Maktaaral region.
Figure 12. Correspondence of ground-based salinity research to the predicted values in a local area of the Maktaaral region.
Agronomy 14 02103 g012
Figure 13. The resulting average long-term salinity map of the Maktaaral District.
Figure 13. The resulting average long-term salinity map of the Maktaaral District.
Agronomy 14 02103 g013
Table 1. Threshold values of 6 classes of soil salinity and its impact on plant growth.
Table 1. Threshold values of 6 classes of soil salinity and its impact on plant growth.
Salinity ClassEC1:5 Range for Loams (dS/m)Impact on Crop Growth Types of Crops Growing at a Given Level of Salinity
Non-saline0–0.18MinorCereals other than corn: vetch, alfalfa
Slightly saline0.19–0.36The yield of crops that are sensitive to salinity is reduced. Cotton, timothy, cocksfoot, sweet clover, wheat
Moderately saline0.37–0.72The yield of most crops is decreasing. Fodder swede, fodder cabbage, wheatgrass, sorghum
Highly saline0.73–1.45Only salt-tolerant crops give a satisfactory harvest. Sugar beets, sunflowers, western wheatgrass, French ryegrass, awnless brome
Severely saline1.46–2.90Most salt-tolerant crops can produce satisfactory yields.Forage grasses: tall wheatgrass, brome grass, Canadian hair grass, tall fescue
Extremely saline>2.90Unsatisfactory yieldFitting varieties
N.B. EC1:5 is the electrical conductivity of the soil solution (one weight fraction of soil dissolves in five parts of water).
Table 2. Application of machine learning models for estimation of soil salinity.
Table 2. Application of machine learning models for estimation of soil salinity.
RegressorAbbreviationRefs.
XGBoost [48]XGB[40,47,49,50]
Random forest [51]RF[20,40,41,42,52,53,54,55]
Support vector machines [56]SVM[41,45,54,57,58]
Linear regression [59]LR[39]
Lasso regression [60]Lasso[42]
Feed forward neural networks [61]FFNN[41,54,62,63,64,65,66,67]
Extreme learning machine [68]ELM[41]
Table 3. Results of the operation of machine learning models on a test set of cultivated soil plots.
Table 3. Results of the operation of machine learning models on a test set of cultivated soil plots.
Machine Learning ModelF1_Score Macro AverageAccuracy
XGBoost classifier0.830.89
Convolution neural network 0.840.89
Random forest classifier0.760.84
KneighborsClassifier0.760.84
SVC (kernel = “linear”, C = 0.025)0.710.74
DecisionTreeClassifier (max_depth = 36)0.660.77
GaussianNB0.620.72
AdaBoostClassifier0.600.60
MLP 0.590.72
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mukhamediev, R.I.; Terekhov, A.; Amirgaliyev, Y.; Popova, Y.; Malakhov, D.; Kuchin, Y.; Sagatdinova, G.; Symagulov, A.; Muhamedijeva, E.; Gricenko, P. Using Pseudo-Color Maps and Machine Learning Methods to Estimate Long-Term Salinity of Soils. Agronomy 2024, 14, 2103. https://doi.org/10.3390/agronomy14092103

AMA Style

Mukhamediev RI, Terekhov A, Amirgaliyev Y, Popova Y, Malakhov D, Kuchin Y, Sagatdinova G, Symagulov A, Muhamedijeva E, Gricenko P. Using Pseudo-Color Maps and Machine Learning Methods to Estimate Long-Term Salinity of Soils. Agronomy. 2024; 14(9):2103. https://doi.org/10.3390/agronomy14092103

Chicago/Turabian Style

Mukhamediev, Ravil I., Alexey Terekhov, Yedilkhan Amirgaliyev, Yelena Popova, Dmitry Malakhov, Yan Kuchin, Gulshat Sagatdinova, Adilkhan Symagulov, Elena Muhamedijeva, and Pavel Gricenko. 2024. "Using Pseudo-Color Maps and Machine Learning Methods to Estimate Long-Term Salinity of Soils" Agronomy 14, no. 9: 2103. https://doi.org/10.3390/agronomy14092103

APA Style

Mukhamediev, R. I., Terekhov, A., Amirgaliyev, Y., Popova, Y., Malakhov, D., Kuchin, Y., Sagatdinova, G., Symagulov, A., Muhamedijeva, E., & Gricenko, P. (2024). Using Pseudo-Color Maps and Machine Learning Methods to Estimate Long-Term Salinity of Soils. Agronomy, 14(9), 2103. https://doi.org/10.3390/agronomy14092103

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop