Digital Visualization of Environmental Risk Indicators in the Territory of the Urban Industrial Zone
Abstract
:1. Introduction
- -
- Inverse Distance Weighting (IDW), where weights are assigned to nearby data points based on their distance to the target location, with closer points having greater influence on the interpolated value [22];
- -
- Kriging, a geostatistical method that incorporates spatial autocorrelation and variogram analysis to estimate risk at unsampled locations, considering spatial dependence and uncertainty [23];
- -
- spline interpolation, where splines are a type of piecewise polynomial function used for interpolation. They connect multiple polynomial segments, called splines, to create a smooth curve that passes through or near a set of data points [24].
2. Materials and Methods
2.1. Research Area and Sample Analysis
2.2. Environmental Risk Indicator Calculation
2.3. Methods for Spatial Interpolation and Evaluation Measures
2.4. Data Analysis and Visualization
2.4.1. Cross Validation
2.4.2. Self-Assessment
2.4.3. Statistics and Visualization
3. Results
3.1. Statistical Distribution of Heavy Metal Concentrations in Soil
3.2. Analysis of PLI Statistical Distribution
3.3. Modeling of PLI Spatial Distribution Using Machine Learning Methods
3.3.1. k-Nearest Neighbors and Weighted k-Nearest Neighbors
3.3.2. Gradient Boosting (CatBoost Regression)
- -
- depth: the term ‘depth’ refers to the maximum number of splits allowed in each individual decision tree used within the ensemble;
- -
- learning rate: a crucial hyperparameter that controls the magnitude of the update each new tree makes to the overall model prediction. Smaller learning rates lead to smaller updates, potentially slower learning, but better generalization and reduced overfitting;
- -
- iterations: the number of individual decision trees built in the ensemble model. More iterations (more trees) can lead to higher complexity and a potentially better fit to the training data. However, too many iterations can also lead to overfitting, where the model becomes too specific to the training data and does not generalize well to unseen data;
- -
- loss function: a critical component that defines how the algorithm measures the discrepancy between the model’s predictions and the actual target values. It plays a fundamental role in driving the learning process and influencing the final model’s performance.
3.3.3. Artificial Neural Network (MLP Regression)
- -
- input layer, including two neurons, taking geographical X- and Y-coordinates in the metric system or four neurons when added to the distance to the nearest known neighbor and the PLI at the nearest known neighbor location;
- -
- one or two hidden layers, including n neurons;
- -
- output layer, consisting of one neuron, yielding a predicted value of the PLI as a result of the regression.
- -
- the function of activation (‘activation’), which plays a crucial role in introducing non-linearity into the network. This is vital because without non-linearity, a network would simply be performing linear regressions at each layer, ultimately leading to a limited ability to learn complex patterns in the data.
- -
- The number of neurons in the hidden layer (‘hidden_layer_sizes’) is a crucial parameter that defines the architecture of the hidden layers. These layers lie between the input and output layers and play a vital role in learning complex relationships between the data and the target variable. Experimenting with different ‘hidden_layer_sizes’ values through grid search and cross-validation helps find the optimal architecture for the specific problem.
- -
- The optimization algorithm (‘solver’) used to train the model and adjust its internal parameters (weights and biases) during the learning process. These algorithms aim to minimize a specific loss function, which measures the discrepancy between the model’s predictions and the true target values. Different solvers come with various strengths and weaknesses, making the choice crucial for achieving optimal performance.
3.3.4. Kriging Model
- -
- method: ordinary or universal;
- -
- variogram model: ‘linear’, ‘power’, ‘gaussian’, ‘spherical’, ‘hole-effect’, ‘exponential’;
- -
- number of lags (nlags): specifies the number of lags used in the variogram calculation. This determines the level of detail captured in the spatial relationship;
- -
- calculation of weights (weight): we should choose to use or not perform calculations. Each weight reflects the influence of a specific sampled data point on the prediction at a particular unsampled location. Points closer to the target location and points that exhibit similar values tend to have higher weights, contributing more significantly to the prediction.
3.3.5. Multilevel b-Spline Interpolation
- -
- the calculated value cannot be equal to the average (or close to) value in every predicted point;
- -
- the distribution cannot follow a simple linear dependence (like simple gradients), at least for difficult dependencies such as the distribution of pollutants in environmental objects, which are characterized by numerous reciprocal influences with many factors;
- -
- the calculated distribution has to reflect the observed values, assuming the method of observation and observed data are correct.
4. Discussion
5. Conclusions
- The study focused on soil samples collected from 39 locations in the research area, with elemental analysis of heavy metals including Pb, Hg, Zn, Mo, Cu, Co, Ni, Cr, Sr, Mn, and V. The data revealed significant variability and potential contamination in several elements, particularly chromium (Cr), zinc (Zn), and lead (Pb), where a substantial percentage of samples exceeded their respective MPCs. For example, 79% of chromium samples exceeded the MPC of 200 mg·kg−1, with concentrations ranging from 0 to 820 mg·kg−1 and an average of 203.85 mg·kg−1. Zinc also showed high levels of contamination, with 79% of samples exceeding the MPC of 55 mg·kg−1, and concentrations ranging from 30 to 910 mg·kg−1 and an average of 121.28 mg·kg−1. Additionally, 67% of lead samples exceeded the MPC of 32 mg·kg−1, with concentrations ranging from 0 to 200 mg·kg−1 and an average of 46.92 mg·kg−1. Conversely, elements such as iron (Fe), strontium (Sr), and vanadium (V) showed minimal exceedance of MPCs. All iron, strontium, and vanadium samples were below the corresponding MPCs. Overall, the analysis underscored the importance of continuous monitoring and targeted intervention strategies to address the identified contamination issues, particularly for chromium, zinc, and lead. Efforts should focus on remediation and stricter regulatory measures to manage and mitigate the environmental and health risks associated with these contaminants.
- Nine mathematical models based on kNN, gradient boosting, artificial neural networks, Kriging, and multilevel b-spline interpolation methods were employed to predict pollution levels, with the primary accuracy metric being MSE. In this study, we were focused on visual expressiveness and empirical consistency. As a result of the model’s benchmarking, Kriging and MLBS interpolation were ultimately chosen for their ability to create smooth, visually appealing maps that accurately represent the spatial distribution of environmental indicators, with MSE values of 0.282 and 0.404, respectively. While kNN and WkNN are straightforward and adaptable, they are computationally intensive and less visually smooth. Gradient boosting and ANNs offer high predictive accuracy, with MSE values around 0.287 and 0.289 for gradient boosting, but are resource-heavy, complex, and visually unacceptable from the point of view of empirical consistency.
- Taking into account the impossibility to accurately assess the accuracy of the model because of the small initial dataset, the following four visual parameters were used for the assessment of the appropriateness of the maps obtained with the computational models: non-uniform prediction, non-linearity, empirical consistency, and smoothness. Based on the conducted correlation analysis, the most important features were empirical consistency and non-linearity.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Thompson, P.B. Ethics and Environmental Risk Assessment. In Food and Agricultural Biotechnology in Ethical Perspective; Thompson, P.B., Ed.; The International Library of Environmental, Agricultural and Food Ethics; Springer International Publishing: Cham, Switzerland, 2020; pp. 137–165. ISBN 978-3-030-61214-6. [Google Scholar] [CrossRef]
- Wang, J.; Kim, H.-S. Visualizing the Landscape of Home IoT Research: A Bibliometric Analysis Using VOSviewer. Sensors 2023, 23, 3086. [Google Scholar] [CrossRef]
- Bahari, M.; Arpaci, I.; Der, O.; Akkoyun, F.; Ercetin, A. Driving Agricultural Transformation: Unraveling Key Factors Shaping IoT Adoption in Smart Farming with Empirical Insights. Sustainability 2024, 16, 2129. [Google Scholar] [CrossRef]
- Rodier, D.J.; Zeeman, M.G. Ecological Risk Assessment. In Basic Environmental Toxicology; Cockerham, L.G., Shane, B.S., Eds.; CRC Press: Boca Raton, FL, USA, 2019; pp. 581–604. ISBN 978-1-315-13809-1. [Google Scholar] [CrossRef]
- Latif, U.; Dickert, F.L. Biochemical Oxygen Demand (BOD). In Environmental Analysis by Electrochemical Sensors and Biosensors; Moretto, L.M., Kalcher, K., Eds.; Nanostructure Science and Technology; Springer: New York, NY, USA, 2015; pp. 729–734. ISBN 978-1-4939-1300-8. [Google Scholar] [CrossRef]
- Latif, U.; Dickert, F.L. Chemical Oxygen Demand. In Environmental Analysis by Electrochemical Sensors and Biosensors; Moretto, L.M., Kalcher, K., Eds.; Nanostructure Science and Technology; Springer: New York, NY, USA, 2015; pp. 719–728. ISBN 978-1-4939-1300-8. [Google Scholar] [CrossRef]
- Wackernagel, M. Measuring Ecological Footprints. In Measuring Sustainable Production; OECD Sustainable Development Studies; OECD: Paris, France, 2008; pp. 49–59. ISBN 978-92-64-04412-8. [Google Scholar] [CrossRef]
- Sivaranjani, S.; Rakshit, A.; Singh, S. Water Quality Assessment with Water Quality Indices. Int. J. Bioresour. Sci. 2015, 2, 85. [Google Scholar] [CrossRef]
- Horn, S.A.; Dasgupta, P.K. The Air Quality Index (AQI) in Historical and Analytical Perspective a Tutorial Review. Talanta 2024, 267, 125260. [Google Scholar] [CrossRef]
- Safarov, R.Z.; Shomanova, Z.K.; Kopishev, E.E.; Nossenko, Y.G.; Bexeitova, Z.B. Spatial Distribution of PM2.5 and PM10 Pollutants in Residential Area of Pavlodar, Kazakhstan. News Nat. Acad. Sci. Repub. Kazakhstan Ser. Chem. Technol. 2023, 457, 181–200. [Google Scholar] [CrossRef]
- Ferreira, S.L.C.; da Silva, J.B.; dos Santos, I.F.; de Oliveira, O.M.C.; Cerda, V.; Queiroz, A.F.S. Use of Pollution Indices and Ecological Risk in the Assessment of Contamination from Chemical Elements in Soils and Sediments—Practical Aspects. Trends Environ. Anal. Chem. 2022, 35, e00169. [Google Scholar] [CrossRef]
- Rinklebe, J.; Antoniadis, V.; Shaheen, S.M.; Rosche, O.; Altermann, M. Health Risk Assessment of Potentially Toxic Elements in Soils along the Central Elbe River, Germany. Environ. Int. 2019, 126, 76–88. [Google Scholar] [CrossRef] [PubMed]
- Rabee, A.M.; Al-Fatlawy, Y.F.; Abd, A.-A.-H.N.; Nameer, M. Using Pollution Load Index (PLI) and Geoaccumulation Index (I-Geo) for the Assessment of Heavy Metals Pollution in Tigris River Sediment in Baghdad Region. Nahrain J. Sci. 2011, 14, 108–114. [Google Scholar] [CrossRef]
- Angulo, E. The Tomlinson Pollution Load Index Applied to Heavy Metal, ‘Mussel-Watch’ Data: A Useful Index to Assess Coastal Pollution. Sci. Total Environ. 1996, 187, 19–56. [Google Scholar] [CrossRef]
- Chen, T.-B.; Zheng, Y.-M.; Lei, M.; Huang, Z.-C.; Wu, H.-T.; Chen, H.; Fan, K.-K.; Yu, K.; Wu, X.; Tian, Q.-Z. Assessment of Heavy Metal Pollution in Surface Soils of Urban Parks in Beijing, China. Chemosphere 2005, 60, 542–551. [Google Scholar] [CrossRef]
- Jorfi, S.; Maleki, R.; Jaafarzadeh, N.; Ahmadi, M. Pollution Load Index for Heavy Metals in Mian-Ab Plain Soil, Khuzestan, Iran. Data Brief 2017, 15, 584–590. [Google Scholar] [CrossRef]
- Gorgoglione, A.; Castro, A.; Chreties, C.; Etcheverry, L. Overcoming Data Scarcity in Earth Science. Data 2020, 5, 5. [Google Scholar] [CrossRef]
- Grunwald, S. Disaggregation and Scientific Visualization of Earthscapes Considering Trends and Spatial Dependence Structures. New J. Phys. 2008, 10, 125011. [Google Scholar] [CrossRef]
- James, P.M.A.; Fortin, M.-J. Ecosystems and Spatial Patterns. In Encyclopedia of Sustainability Science and Technology; Meyers, R.A., Ed.; Springer: New York, NY, USA, 2012; pp. 3326–3342. ISBN 978-0-387-89469-0. [Google Scholar] [CrossRef]
- Phillips, D.L.; Marks, D.G. Spatial Uncertainty Analysis: Propagation of Interpolation Errors in Spatially Distributed Models. Ecol. Modell. 1996, 91, 213–229. [Google Scholar] [CrossRef]
- Igaz, D.; Šinka, K.; Varga, P.; Vrbičanová, G.; Aydın, E.; Tárník, A. The Evaluation of the Accuracy of Interpolation Methods in Crafting Maps of Physical and Hydro-Physical Soil Properties. Water 2021, 13, 212. [Google Scholar] [CrossRef]
- Wong, D.W.S. Interpolation: Inverse-Distance Weighting. In International Encyclopedia of Geography; Richardson, D., Castree, N., Goodchild, M.F., Kobayashi, A., Liu, W., Marston, R.A., Eds.; Wiley: Hoboken, NJ, USA, 2017; pp. 1–7. ISBN 978-0-470-65963-2. [Google Scholar] [CrossRef]
- Calder, C.A.; Cressie, N. Kriging and Variogram Models. In International Encyclopedia of Human Geography; Elsevier: Amsterdam, The Netherlands, 2009; pp. 49–55. ISBN 978-0-08-044910-4. [Google Scholar] [CrossRef]
- Cuevas, E.; Luque, A.; Escobar, H. Spline Interpolation. In Computational Methods with MATLAB®; Synthesis Lectures on Engineering, Science, and Technology; Springer Nature: Cham, Switzerland, 2024; pp. 151–177. ISBN 978-3-031-40477-1. [Google Scholar] [CrossRef]
- Huete-Morales, M.D.; Villar-Rubio, E.; Galán-Valdivieso, F. Spatial Distribution of CO2 Verified Emissions: A Kriging-Based Approach. Emiss. Control Sci. Technol. 2021, 7, 63–77. [Google Scholar] [CrossRef]
- Gilbert, R.O.; Simpson, J.C. Kriging for Estimating Spatial Pattern of Contaminants: Potential and Problems. Environ. Monit. Assess. 1985, 5, 113–135. [Google Scholar] [CrossRef] [PubMed]
- Wang, C.; Zhu, H. Combination of Kriging Methods and Multi-Fractal Analysis for Estimating Spatial Distribution of Geotechnical Parameters. Bull. Eng. Geol. Environ. 2016, 75, 413–423. [Google Scholar] [CrossRef]
- Khadka, D.; Lamichhane, S.; Giri, R.K.; Chalise, B.; Amgain, R.; Joshi, S. Geostatistical Based Soil Fertility Mapping of Horticultural Research Station, Rajikot, Jumla, Nepal. J. Agric. Nat. Res. 2020, 3, 257–275. [Google Scholar] [CrossRef]
- Krivoruchko, K.; Gribov, A. Distance Metrics for Data Interpolation over Large Areas on Earth’s Surface. Spat. Stat. 2020, 35, 100396. [Google Scholar] [CrossRef]
- Faisal, M.; Jaelani, L.M. Spatio-Temporal Analysis of Nitrogen Dioxide (NO2) from Sentinel-5P Imageries Using Google Earth Engine Changes during the COVID-19 Social Restriction Policy in Jakarta. Nat. Hazards Res. 2023, 3, 344–352. [Google Scholar] [CrossRef]
- Gribov, A.; Krivoruchko, K. Empirical Bayesian Kriging Implementation and Usage. Sci. Total Environ. 2020, 722, 137290. [Google Scholar] [CrossRef]
- Griffin, T.; Brown, J.; Lowenberg-DeBoer, J. Yield Monitor Data Analysis Protocol: A Primer in the Management and Analysis of Precision Agriculture Data. SSRN J. 2007. [Google Scholar] [CrossRef]
- Lee, S.; Wolberg, G.; Shin, S.Y. Scattered Data Interpolation with Multilevel B-Splines. IEEE Trans. Vis. Comput. Graph. 1997, 3, 228–244. [Google Scholar] [CrossRef]
- Conrad, O. Tool Multilevel B-Spline Interpolation/SAGA-GIS Tool Library Documentation (v6.1.0). Available online: https://saga-gis.sourceforge.io/saga_tool_doc/6.1.0/grid_spline_4.html (accessed on 18 February 2024).
- Hjelle, H. MBA: Multilevel B-Splines Reference Manual. Available online: https://www.sintef.no/globalassets/upload/ikt/9011/geometri/mba/mba_doc/index.html (accessed on 18 February 2024).
- Hardy, D.J.; Wolff, M.A.; Xia, J.; Schulten, K.; Skeel, R.D. Multilevel Summation with B-Spline Interpolation for Pairwise Interactions in Molecular Dynamics Simulations. J. Chem. Phys. 2016, 144, 114112. [Google Scholar] [CrossRef] [PubMed]
- Maximov, G.A.; Larichev, V.A.; Lesonen, D.N.; Derov, A.V. Geospline: Mathematical Model of 3D Complex Geological Medium. In Proceedings of the SPE Arctic and Extreme Environments Technical Conference and Exhibition, Moscow, Russia, 15–17 October 2013; SPE: Moscow, Russia, 2013; p. SPE-166834-MS. [Google Scholar] [CrossRef]
- Li, M.; Shi, W.; Liu, S.; Fu, S.; Fei, Y.; Zhou, L.; Li, Y. Fast and Universal Single Molecule Localization Using Multi-Dimensional Point Spread Functions. bioRxiv 2023. [Google Scholar] [CrossRef]
- Cunningham, P.; Delany, S.J. K-Nearest Neighbour Classifiers—A Tutorial. ACM Comput. Surv. 2022, 54, 1–25. [Google Scholar] [CrossRef]
- Syriopoulos, P.K.; Kalampalikis, N.G.; Kotsiantis, S.B.; Vrahatis, M.N. kNN Classification: A Review. Ann. Math. Artif. Intell. 2023. [Google Scholar] [CrossRef]
- Tabatabaei, M.; Kimiaefar, R.; Hajian, A.; Akbari, A. Robust Outlier Detection in Geo-Spatial Data Based on LOLIMOT and KNN Search. Earth Sci. Inform. 2021, 14, 1065–1072. [Google Scholar] [CrossRef]
- Ahmed, M.-S.; N’diaye, M.; Attouch, M.K.; Dabo-Niange, S. K-Nearest Neighbors Prediction and Classification for Spatial Data. J. Spat. Econom. 2023, 4, 12. [Google Scholar] [CrossRef]
- Ver Hoef, J.M.; Temesgen, H. A Comparison of the Spatial Linear Model to Nearest Neighbor (k-NN) Methods for Forestry Applications. PLoS ONE 2013, 8, e59129. [Google Scholar] [CrossRef] [PubMed]
- Hechenbichler, K.; Schliep, K. Weighted K-Nearest-Neighbor Techniques and Ordinal Classification; LMU: Munich, Germany, 2004; Paper 399. [Google Scholar] [CrossRef]
- Mladenova, T. A Feature-Weighted Rule for the K-Nearest Neighbor. In Proceedings of the 2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), Ankara, Turkey, 21–23 October 2021; pp. 493–497. [Google Scholar] [CrossRef]
- Sahin, E.K. Comparative Analysis of Gradient Boosting Algorithms for Landslide Susceptibility Mapping. Geocarto Int. 2022, 37, 2441–2465. [Google Scholar] [CrossRef]
- Zhang, Z.; Wu, C.; Qu, S.; Chen, X. An Explainable Artificial Intelligence Approach for Financial Distress Prediction. Inform. Process Manag. 2022, 59, 102988. [Google Scholar] [CrossRef]
- Hancock, J.T.; Khoshgoftaar, T.M. CatBoost for Big Data: An Interdisciplinary Review. J. Big Data 2020, 7, 94. [Google Scholar] [CrossRef] [PubMed]
- Thomas, J.; Mayr, A.; Bischl, B.; Schmid, M.; Smith, A.; Hofner, B. Gradient Boosting for Distributional Regression: Faster Tuning and Improved Variable Selection via Noncyclical Updates. Stat. Comput. 2018, 28, 673–687. [Google Scholar] [CrossRef]
- Biau, G.; Cadre, B.; Rouvière, L. Accelerated Gradient Boosting. Mach. Learn. 2019, 108, 971–992. [Google Scholar] [CrossRef]
- Bailly, J.S.; Arnaud, M.; Puech, C. Boosting: A Classification Method for Remote Sensing. Int. J. Remote Sens. 2007, 28, 1687–1710. [Google Scholar] [CrossRef]
- Taşpınar, S.; Doğan, O.; Bera, A.K. GMM Gradient Tests for Spatial Dynamic Panel Data Models. Reg. Sci. Urban Econ. 2017, 65, 65–88. [Google Scholar] [CrossRef]
- Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of Deep Learning: Concepts, CNN Architectures, Challenges, Applications, Future Directions. J. Big Data 2021, 8, 53. [Google Scholar] [CrossRef]
- Taud, H.; Mas, J.F. Multilayer Perceptron (MLP). In Geomatic Approaches for Modeling Land Change Scenarios; Camacho Olmedo, M.T., Paegelow, M., Mas, J.-F., Escobar, F., Eds.; Lecture Notes in Geoinformation and Cartography; Springer International Publishing: Cham, Switzerland, 2018; pp. 451–455. ISBN 978-3-319-60800-6. [Google Scholar] [CrossRef]
- Yalaltdinova, A.; Baranovskaya, N.; Rikhvanov, L.; Matveenko, I. Geochemical Peculiarities of Black Poplar Leaves (Populus nigra L.) in the Sites with Heavy Metals Intensive Fallouts. Available online: https://ui.adsabs.harvard.edu/abs/2013EGUGA..15..258Y (accessed on 5 October 2023).
- Mimura, N.T.; Alibekova, A.; Abisheva, A.K.; Aldungarova, M. Study of Geotechnical Conditions of the City of Pavlodar. In Smart Geotechnics for Smart Societies; CRC Press: Boca Raton, FL, USA, 2023; pp. 505–510. ISBN 978-1-00-329912-7. [Google Scholar]
- Alimbaev, T.; Beksultanova, C.; Mazhitova, Z.; Choybekova, G.; Zhunushalieva, G.; Tentigul Kyzy, N. The Beginning of Virgin Lands Development in Pavlodar Region (in 1954). E3S Web Conf. 2023, 371, 06017. [Google Scholar] [CrossRef]
- Kakabayev, A.; Yermekova, A.; Baikenova, G.; Abdurahmanov, I.; Baituk, G. Technogenic Impact Assessment on the Environment of Pavlodar Region Using GIS Technologies. E3S Web Conf. 2023, 386, 06001. [Google Scholar] [CrossRef]
- Guney, M.; Akimzhanova, Z.; Kumisbek, A.; Beisova, K.; Kismelyeva, S.; Satayeva, A.; Inglezakis, V.; Karaca, F. Mercury (Hg) Contaminated Sites in Kazakhstan: Review of Current Cases and Site Remediation Responses. Int. J. Environ. Res. Public Health 2020, 17, 8936. [Google Scholar] [CrossRef] [PubMed]
- Kismelyeva, S.; Khalikhan, R.; Torezhan, A.; Kumisbek, A.; Akimzhanova, Z.; Karaca, F.; Guney, M. Potential Human Exposure to Mercury (Hg) in a Chlor-Alkali Plant Impacted Zone: Risk Characterization Using Updated Site Assessment Data. Sustainability 2021, 13, 13816. [Google Scholar] [CrossRef]
- Woodruff, S.; Dack, S. Analysis of Risk from Mercury Contamination at the Khimprom Plant in Kazakhstan. Land Contam. Reclam. 2004, 12, 213–218. [Google Scholar] [CrossRef]
- Yessenbayev, D.; Khamidullina, Z.; Tarzhanova, D.; Orazova, G.; Zhakupova, T.; Kassenova, D.; Bilyalova, Z.; Igissinova, G.; Sayakov, U.; Dzhumabayeva, F.; et al. Epidemiology of Lung Cancer in Kazakhstan: Trends and Geographic Distribution. Asian Pac. J. Cancer Prev. 2023, 24, 1521–1532. [Google Scholar] [CrossRef] [PubMed]
- Safarov, R.; Berdenov, Z.; Urlibay, R.; Nossenko, Y.; Shomanova, Z.; Bexeitova, Z.; Kulak, A.; Varga, I.; Balog, A.; Domjánné, R.N.; et al. Spatial Distribution of Elements, Environmental Effects, and Economic Potential of Waste from the Aksu Ferroalloy Plant [Kazakhstan]. PLoS ONE 2023, 18, e0283251. [Google Scholar] [CrossRef]
- Zhakhina, G.; Zhalmagambetov, B.; Gusmanov, A.; Sakko, Y.; Yerdessov, S.; Matmusaeva, E.; Imanova, A.; Crape, B.; Sarria-Santamera, A.; Gaipov, A. Incidence and Mortality Rates of Strokes in Kazakhstan in 2014–2019. Sci. Rep. 2022, 12, 16041. [Google Scholar] [CrossRef] [PubMed]
- Mukataeva, Z.; Dinmukhamedova, A.; Kabieva, S.; Baidalinova, B.; Khamzina, S.; Zekenova, L.; Aizman, R. Comparative Characteristics of Developing Morphofunctional Features of Schoolchildren from Different Climatic and Geographical Regions. J. Pediatr. Endocrinol. Metab. 2023, 36, 158–166. [Google Scholar] [CrossRef] [PubMed]
- Guney, M.; Akimzhanova, Z.; Kumisbek, A.; Kismelyeva, S.; Guney, A.; Karaca, F.; Inglezakis, V. Assessment of Distribution of Potentially Toxic Elements in Different Environmental Media Impacted by a Former Chlor-Alkali Plant. Sustainability 2021, 13, 13829. [Google Scholar] [CrossRef]
- Kanibolotskaya, Y.; Listkov, W.; Shmidt, N. Heavy Metals in Soil and Plants (Agropyron Pectiniforme Roem. et Schult.) of the Pavlodar Region (Kazakhstan). In Proceedings of the International Conference on Sustainable Development of Cross-Border Regions, Barnaul, Russia, 19–20 April 2019; IOP Publishing: Bristol, UK, 2019; Volume 395, p. 012037. [Google Scholar] [CrossRef]
- Pogodaiklimat.ru Pavlodar Weather Archive. Available online: http://www.pogodaiklimat.ru/weather.php?id=36003 (accessed on 4 February 2024).
- Esshaimi, M.; Ouazzani, N.; Avila, M.; Perez, G.; Valiente, M.; Mandi, L. Heavy Metal Contamination of Soils and Water Resources Kettara Abandoned Mine. Am. J. Environ. Sci. 2012, 8, 253–261. [Google Scholar] [CrossRef]
- Sánchez-Donoso, R.; García Lorenzo, M.L.; Esbrí, J.M.; García-Noguero, E.M.; Higueras, P.; Crespo, E. Geochemical Characterization and Trace-Element Mobility Assessment for Metallic Mine Reclamation in Soils Affected by Mine Activities in the Iberian Pyrite Belt. Geosciences 2021, 11, 233. [Google Scholar] [CrossRef]
- Wang, Y.; Duan, X.; Wang, L. Spatial Distribution and Source Analysis of Heavy Metals in Soils Influenced by Industrial Enterprise Distribution: Case Study in Jiangsu Province. Sci. Total Environ. 2020, 710, 134953. [Google Scholar] [CrossRef] [PubMed]
- Chan, S.; Reddy, V.; Myers, B.; Thibodeaux, Q.; Brownstone, N.; Liao, W. Machine Learning in Dermatology: Current Applications, Opportunities, and Limitations. Dermatol. Ther. 2020, 10, 365–386. [Google Scholar] [CrossRef] [PubMed]
- Taneja, S.; Gupta, C.; Goyal, K.; Gureja, D. An Enhanced K-Nearest Neighbor Algorithm Using Information Gain and Clustering. In Proceedings of the 2014 Fourth International Conference on Advanced Computing & Communication Technologies, Rohtak, India, 8–9 February 2014; pp. 325–329. [Google Scholar] [CrossRef]
- Li, P.; Gou, J.; Yang, H. The Distance-Weighted K-Nearest Centroid Neighbor Classification. J. Inf. Hiding Multimed. Signal Process 2017, 8, 611–622. Available online: https://www.semanticscholar.org/paper/The-Distance-Weighted-K-nearest-Centroid-Neighbor-Li-Gou/6055e3c54be1d095029b37cf2bd7586bf9125546#citing-papers (accessed on 1 May 2024).
- Yan, X. Weighted K-Nearest Neighbor Classification Algorithm Based on Genetic Algorithm. TELKOMNIKA 2013, 11, 6173–6178. [Google Scholar] [CrossRef]
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased Boosting with Categorical Features. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 3–8 December 2018; Bengio, S., Wallach, H., Larochelle, H., Grauman, K., Cesa-Bianchi, N., Garnett, R., Eds.; Curran Associates, Inc.: Brooklyn, NY, USA, 2018; Volume 31. [Google Scholar]
- Benuskova, L.; Kasabov, N. Artificial Neural Networks (ANN). In Computational Neurogenetic Modeling; Springer: Boston, MA, USA, 2007; pp. 81–106. ISBN 978-0-387-48353-5. [Google Scholar] [CrossRef]
- Almeida, J.S. Predictive Non-Linear Modeling of Complex Data by Artificial Neural Networks. Curr. Opin. Biotechnol. 2002, 13, 72–76. [Google Scholar] [CrossRef] [PubMed]
- Corazza, M.; Fasano, G.; Mason, F. An Artificial Neural Network-Based Technique for On-Line Hotel Booking. Procedia Econ. Financ. 2014, 15, 45–55. [Google Scholar] [CrossRef]
- Dnyaneshwar, K.; Tushar, J.; Shivam, K.; Sagar, T. Analysis of Car Selling Prediction Based On AIML. Int. J. Innov. Eng. Sci. 2023, 8, 14–17. [Google Scholar] [CrossRef]
- Ma, T.; Wang, F.; Cheng, J.; Yu, Y.; Chen, X. A Hybrid Spectral Clustering and Deep Neural Network Ensemble Algorithm for Intrusion Detection in Sensor Networks. Sensors 2016, 16, 1701. [Google Scholar] [CrossRef]
- Cerezo, M.; Arrasmith, A.; Babbush, R.; Benjamin, S.C.; Endo, S.; Fujii, K.; McClean, J.R.; Mitarai, K.; Yuan, X.; Cincio, L.; et al. Variational Quantum Algorithms. Nat. Rev. Phys. 2021, 3, 625–644. [Google Scholar] [CrossRef]
- Qiao, P.; Lei, M.; Yang, S.; Yang, J.; Guo, G.; Zhou, X. Comparing Ordinary Kriging and Inverse Distance Weighting for Soil as Pollution in Beijing. Environ. Sci. Pollut. Res. 2018, 25, 15597–15608. [Google Scholar] [CrossRef] [PubMed]
- Berrocal, V.J.; Guan, Y.; Muyskens, A.; Wang, H.; Reich, B.J.; Mulholland, J.A.; Chang, H.H. A Comparison of Statistical and Machine Learning Methods for Creating National Daily Maps of Ambient PM2.5 Concentration. Atmos. Environ. 2020, 222, 117130. [Google Scholar] [CrossRef] [PubMed]
- Paci, L.; Gelfand, A.E.; Holland, D.M. Spatio-Temporal Modeling for Real-Time Ozone Forecasting. Spat. Stat. 2013, 4, 79–93. [Google Scholar] [CrossRef]
- Vural, A. Evaluation of Soil Geochemistry Data of Canca Area (Gümüşhane, Turkey) by Means of Inverse Distance Weighting (IDW) and Kriging Methods-Preliminary Findings. Bull. Min. Res. Exp. 2019, 158, 195–216. [Google Scholar] [CrossRef]
- Zhang, Y.; Li, M.; Han, S.; Ren, Q.; Shi, J. Intelligent Identification for Rock-Mineral Microscopic Images Using Ensemble Machine Learning Algorithms. Sensors 2019, 19, 3914. [Google Scholar] [CrossRef] [PubMed]
- Saeb, S.; Lonini, L.; Jayaraman, A.; Mohr, D.C.; Kording, K.P. The Need to Approximate the Use-Case in Clinical Machine Learning. GigaScience 2017, 6, gix019. [Google Scholar] [CrossRef]
- Jin, H.; Montufar, G. Implicit Bias of Gradient Descent for Mean Squared Error Regression with Two-Layer Wide Neural Networks. J. Mach. Learn. Res. 2023, 24, 137. Available online: https://www.webofscience.com/wos/woscc/full-record/WOS:001111702800001 (accessed on 3 May 2024).
- Vodyanitskii, Y.N. Standards for the Contents of Heavy Metals and Metalloids in Soils. Eurasian Soil. Sci. 2012, 45, 321–328. [Google Scholar] [CrossRef]
- GN 2.1.7.2041-06; Hygienic Standards of Russian Federation “GN 2.1.7.2041-06 Maximum Permissible Concentrations (MPC) of Chemicals in the Soil”. Goskomsanepidnadzor of Russia: Moscow, Russia, 2006.
- adilet.zan.kz Regulatory Legal Act of the Republic of Kazakhstan No. 11755 “Hygienic Standards for the Safety of the Environment”. Available online: https://adilet.zan.kz/rus/docs/V2100022595 (accessed on 6 August 2023).
- Mineev, V.G. Practical Course in Agrochemistry, 2nd ed.; M.V. Lomonosov Moscow State University: Moscow, Russia, 2001; p. 689. ISBN 5-211-04265-4. [Google Scholar]
- Kazantsev, I.V.; Matveyeva, T.B. Contents of heavy metals in the soil cover in the conditions of technogenesis. Samara J. Sci. 2016, 5, 34–37. (In Russian) [Google Scholar] [CrossRef]
- Vodyanitskii, Y.N. Chromium and Arsenic in Contaminated Soils (Review of Publications). Eurasian Soil. Sci. 2009, 42, 507–515. [Google Scholar] [CrossRef]
- Semendyayeva, N.V.; Dobrotvorskaya, N.I.; Morozova, A.A. The Elemental Composition of Soils of Saline Agrolandscapes and Sanitary-Hygienic Conditions of the Southern Part of the Prichanovskaya Depression. Open Access J. Environ. Soil Sci. 2019, 2, 166–174. [Google Scholar] [CrossRef]
- Timofeeva, T.A.; Chebotar, V.K.; Demidov, D.V.; Gaidukova, S.E.; Yakovleva, I.V.; Kamionskaya, A.M. Effects of Apatite Concentrate in Combination with Phosphate-Solubilizing Microorganisms on the Yield of Ryegrass Cultivar Izorskiy. Agronomy 2023, 13, 1568. [Google Scholar] [CrossRef]
- Noskova, L.N. Justification of maximum permissible concentration of molybdenum in soil. Hyg. Sanit. 1988, 1, 69–70. Available online: https://cyberleninka.ru/article/n/obosnovanie-predelno-dopustimoy-kontsentratsii-molibdena-v-pochve (accessed on 3 May 2024). (In Russian).
- Borovskaya, N.E. Materials for justification of maximum permissible concentration of cobalt in soil. Hyg. Sanit. 1986, 5, 70–71. Available online: https://cyberleninka.ru/article/n/materialy-k-obosnovaniyu-predelno-dopustimoy-kontsentratsii-kobalta-v-pochve (accessed on 3 May 2024). (In Russian).
- Li, Q.; Pin, M.; Zhongyang, G. Research on the Spatial Distribution of Rural Settlements Based on GIS. In Computer Science and Applications: Proceedings of the 2014 Asia-Pacific Conference on Computer Science and Applications (CSAC 2014), Shanghai, China, 27–28 December 2014; CRC Press: Shanghai, China, 2014; pp. 111–116. ISBN 978-1-138-02811-1. [Google Scholar]
- Chidunchi, I.; Kulikov, M.; Safarov, R.; Kopishev, E. Extraction of Platinum Group Metals from Catalytic Converters. Heliyon 2024, 10, e25283. [Google Scholar] [CrossRef]
- Roy, S.; Gupta, S.K.; Prakash, J.; Habib, G.; Kumar, P. A Global Perspective of the Current State of Heavy Metal Contamination in Road Dust. Environ. Sci. Pollut. Res. 2022, 29, 33230–33251. [Google Scholar] [CrossRef]
- Li, K.; Wang, F. Global Hotspots and Trends in Interactions of Microplastics and Heavy Metals: A Bibliometric Analysis and Literature Review. Environ. Sci. Pollut. Res. 2023, 30, 93309–93322. [Google Scholar] [CrossRef]
- Sadyrova, G.; Tanybayeva, A.; Bazarbaeva, T.; Mukanova, G.; Jamilova, S.; Nurmakhanova, A. Analysis of the Ecological State of Urban Green Spaces in the Medeu District of Almaty. Bull. L.N. Gumilyov Eurasian Natl. Univ. Chem. Geogr. Ecol. Ser. 2023, 145, 83–92. [Google Scholar] [CrossRef]
- Wang, J.; Christopher, S.A.; Nair, U.S.; Reid, J.S.; Prins, E.M.; Szykman, J.; Hand, J.L. Mesoscale Modeling of Central American Smoke Transport to the United States: 1. “Top-down” Assessment of Emission Strength and Diurnal Variation Impacts. J. Geophys. Res. 2006, 111, 2005JD006416. [Google Scholar] [CrossRef]
- Meteoblue.com Modeling Historical Climate and Weather Data for Pavlodar. Available online: https://www.meteoblue.com/en/weather/historyclimate/climatemodelled/pavlodar_kazakhstan_1520240 (accessed on 20 February 2024).
- Ahad, N.; Sin Yin, T.; Othman, A.; Yaacob, C. Sensitivity of Normality Tests to Non-Normal Data. Sains Malays. 2011, 40, 637–641. Available online: https://www.ukm.my/jsm/english_journals/vol40num6_2011/vol40num6_2011pg637-641.html (accessed on 7 May 2024).
- Liu, X.S. A Probabilistic Explanation of Pearson’s Correlation. Teach. Stat. 2019, 41, 115–117. [Google Scholar] [CrossRef]
Class Number | CF | PLI | Classification Description |
---|---|---|---|
1 | CF ≤ 0.7 | PLI ≤ 0.7 | No contamination |
2 | 0.7 < CF ≤ 1 | 0.7 < PLI ≤ 1 | Low contamination |
3 | 1 < CF ≤ 3 | 1 < PLI ≤ 3 | Moderate contamination |
4 | 3 < CF ≤ 6 | 3 < PLI ≤ 6 | Considerable contamination |
5 | 6 < CF | 6 < PLI | Very high contamination |
Name | Description | Instruments | References |
---|---|---|---|
k-Nearest Neighbors (kNN) | Simple, intuitive, and widely used non-parametric algorithm for both classification and regression in machine learning. In regression, it assigns the property value determined by the average of the values of its k-nearest neighbors. k-Nearest neighbors is used in various applications like recommendation systems, pattern recognition, and anomaly detection due to its simplicity, effectiveness, and ease of interpretation. | KNeighborsRegressor() function of Scikit-learn package in Python | [72,73] |
Weighted k-Nearest Neighbors (WkNN) | A variant of the basic kNN algorithm where different weights are assigned to the contributions of the neighbors, so the nearest neighbors contribute more to the average than the more distant ones. This approach can be particularly useful when dealing with heterogeneous datasets, where certain data points are more closely clustered together, and outliers could disproportionately affect the result if an unweighted approach were used. Weighted kNN can help to mitigate the influence of outliers and provide a more nuanced classification or regression outcome. | KNeighborsRegressor() function of Scikit-learn package in Python | [74,75] |
Gradient Boosting. CatBoost (CB) | CatBoost is an open-source gradient boosting algorithm developed by Yandex, which is designed to work with categorical features without the need for the extensive data preprocessing that is typically required by other machine learning algorithms. CatBoost is particularly powerful for datasets with lots of categorical features and has been successfully used in various applications, including ranking tasks, regression problems, and classification problems. It is known for its robustness, handling of large datasets, and high-quality predictions, along with its ease of use. | CatBoostRegressor() function of Catboost package in Python was utilized with RMSE as loss function | [48,76] |
Artificial Neural Networks (ANN) | Computational models consisting of interconnected groups of artificial neurons or nodes that process information using a connectionist approach to computation. They can learn to approximate non-linear functions, which makes them suitable for a wide range of problems including but not limited to classification, regression, and time series prediction. Learning occurs in the network through a process of adjusting the synaptic weights of the connections between neurons, usually executed by a learning algorithm like backpropagation. | MLPRegressor() function from the Scikit-learn package in Python | [77,78,79,80,81,82] |
Kriging | A geostatistical interpolation technique that is widely used for spatial analysis and modeling. Kriging assumes that the spatial variation in the data can be modeled as a stochastic process with a structured dependence captured by the variogram or covariance function. To generate a prediction, ordinary Kriging computes a set of weights for the known data points, ensuring that the sum of these weights equals one to maintain the estimator’s unbiasedness. These weights are determined by solving a system of linear equations that arises from the variogram model, incorporating a constraint that enforces the unbiasedness of the predictions. This method is especially favored in fields such as geostatistics and environmental modeling, where understanding and accounting for spatial variability is critical | rk.Krige() function from the PyKrige package in Python. The best Kriging model for the given dataset was selected using the GridSearchCV function from the Scikit-learn package with cross validation. | [83,84,85,86] |
Multilevel b-spline interpolation (MLBS) | A sophisticated mathematical method used for smooth curve fitting and surface approximation, which is particularly useful when working with complex data in multiple dimensions. b-Splines, short for basis splines, are a series of piecewise polynomials that are defined by a set of control points that determine the shape of the curve or surface. The benefits of this method include its inherent smoothness, the ability to handle large and potentially irregularly spaced data sets, and the control it provides over the smoothness of the interpolation. Moreover, it is quite robust, reducing the impact of noise in the data, and it is capable of capturing the underlying trend without overfitting to the precise data points. This method serves as a powerful tool in applications that require a blend of accuracy and visual or analytical smoothness. Because MLBS interpolation strictly uses initial points for interpolation, conducting a self-assessment is not useful. This is because the error will be driven to zero, which does not reflect the true accuracy of the prediction. | The function was used with SAGA ver. 9.1.0 GIS software’s command line API through PySAGA_cmd package in Python. | [33,34,35,36] |
Element | Minimum (mg·kg−1) | Maximum (mg·kg−1) | Average (mg·kg−1) | Median (mg·kg−1) | Standard Deviation (mg·kg−1) | Variation Coefficient, % | Background (mg·kg−1) | Average CF | PLI | MPC (mg·kg−1) 1 | Sample over MPC, % |
---|---|---|---|---|---|---|---|---|---|---|---|
Fe | 11,380 | 34,360 | 20,406.92 | 19,860 | 4810.28 | 24 | 20,680 | 0.99 | 1.31 | 40,000 3 | 0 |
Mn | 240 | 1900 | 553.85 | 460 | 366.24 | 66 | 600 | 0.92 | 1500 | 8 | |
Cr | 0 | 820 | 203.85 | 150 | 206.88 | 101 | 440 | 0.46 | 200 4 | 79 | |
Sr | 110 | 250 | 173.33 | 170 | 23.24 | 13 | 90 | 1.93 | 600 5 | 0 | |
Zn | 30 | 910 | 121.28 | 70 | 159.10 | 131 | 60 | 2.02 | 55 | 79 | |
Cu | 0 | 630 | 64.87 | 0 | 110.06 | 170 | 80 | 0.81 | 33 | 46 | |
Pb | 0 | 200 | 46.92 | 50 | 42.56 | 91 | 32 2 | 1.47 | 32 | 67 | |
Ni | 0 | 150 | 34.10 | 0 | 47.65 | 140 | 20 2 | 1.71 | 20 | 36 | |
Mo | 0 | 440 | 11.28 | 0 | 69.55 | 617 | 50 2 | 0.23 | 50 6 | 3 | |
V | 0 | 90 | 7.69 | 0 | 23.26 | 302 | 150 2 | 0.05 | 150 | 0 | |
Hg | 0 | 100 | 2.56 | 0 | 15.81 | 618 | 2.1 2 | 1.22 | 2.1 | 3 | |
Co | 0 | 80 | 2.05 | 0 | 12.64 | 617 | 50 2 | 0.04 | 50 7 | 3 |
Sample ID | In Degrees | In Metrical Units | PLI | ||
---|---|---|---|---|---|
X coord | Y coord | X coord | Y coord | ||
1 | 76.94639 | 52.30136 | 8,565,633.076 | 6,854,799.322 | 3.38 |
2 | 76.92663 | 52.31008 | 8,563,432.835 | 6,856,388.113 | 1.67 |
3 | 76.91402 | 52.33016 | 8,562,029.675 | 6,860,044.77 | 1.41 |
4 | 76.92706 | 52.33927 | 8,563,481.57 | 6,861,703.848 | 1.05 |
5 | 76.95862 | 52.3016 | 8,566,994.468 | 6,854,844.032 | 1.13 |
6 | 76.92942 | 52.37143 | 8,563,743.427 | 6,867,565.049 | 1.28 |
7 | 76.92623 | 52.3585 | 8,563,389.086 | 6,865,207.782 | 2.76 |
8 | 76.9341 | 52.37226 | 8,564,265.137 | 6,867,717.35 | 1.43 |
9 | 76.92768 | 52.37319 | 8,563,550.388 | 6,867,886.849 | 1.36 |
10 | 76.92706 | 52.39318 | 8,563,480.713 | 6,871,532.878 | 1.8 |
11 | 76.92536 | 52.33002 | 8,563,291.559 | 6,860,019.086 | 1.61 |
12 | 76.92515 | 52.37943 | 8,563,268.572 | 6,869,024.966 | 1.35 |
13 | 76.92655 | 52.3668 | 8,563,424.619 | 6,866,720.58 | 1.64 |
14 | 76.91806 | 52.4022 | 8,562,479.194 | 6,873,177.94 | 2.73 |
15 | 76.93407 | 52.37103 | 8,564,261.018 | 6,867,492.227 | 1.08 |
16 | 76.9173 | 52.38409 | 8,562,394.658 | 6,869,874.255 | 2.85 |
17 | 76.96258 | 52.3236 | 8,567,435.216 | 6,858,849.493 | 1.7 |
18 | 76.93838 | 52.36898 | 8,564,741.061 | 6,867,118.117 | 1.33 |
19 | 76.95269 | 52.40022 | 8,566,334.444 | 6,872,816.447 | 1.87 |
20 | 76.9298 | 52.38515 | 8,563,786.085 | 6,870,068.148 | 1.29 |
21 | 76.89894 | 52.36555 | 8,560,350.899 | 6,866,494.044 | 1.1 |
22 | 76.91425 | 52.31876 | 8,562,054.699 | 6,857,968.202 | 1.12 |
23 | 76.90053 | 52.3614 | 8,560,528.031 | 6,865,736.674 | 1.16 |
24 | 76.91124 | 52.38316 | 8,561,719.672 | 6,869,705.171 | 1.54 |
25 | 76.91913 | 52.31395 | 8,562,597.838 | 6,857,091.725 | 1.37 |
26 | 76.94659 | 52.32635 | 8,565,655.173 | 6,859,350.008 | 1.71 |
27 | 76.94892 | 52.31256 | 8,565,914.903 | 6,856,838.966 | 1.29 |
28 | 76.96725 | 52.3199 | 8,567,954.777 | 6,858,174.939 | 1.83 |
29 | 76.96179 | 52.33491 | 8,567,347.24 | 6,860,909.126 | 2.49 |
30 | 76.97273 | 52.31845 | 8,568,564.663 | 6,857,911.93 | 1.88 |
31 | 76.89961 | 52.33391 | 8,560,425.55 | 6,860,726.512 | 1.16 |
32 | 76.96859 | 52.32524 | 8,568,104.691 | 6,859,147.102 | 1.31 |
33 | 76.95853 | 52.38353 | 8,566,983.938 | 6,869,771.082 | 1.46 |
34 | 76.97169 | 52.325 | 8,568,449.67 | 6,859,104.846 | 2.12 |
35 | 76.9583 | 52.31219 | 8,566,958.88 | 6,856,770.774 | 1.56 |
36 | 76.9286 | 52.3197 | 8,563,652.546 | 6,858,138.17 | 1.85 |
37 | 76.88773 | 52.36098 | 8,559,102.718 | 6,865,659.803 | 1.41 |
38 | 76.96241 | 52.3002 | 8,567,416.403 | 6,854,588.54 | 1.64 |
39 | 76.99945 | 52.30655 | 8,571,539.677 | 6,855,744.188 | 1.39 |
kNN | WkNN | |||
---|---|---|---|---|
k | Mean MSE Cross-Validation | MSE Self-Assessment | Mean MSE Cross-Validation | MSE Self-Assessment |
2 | 0.413 | 0.105 | 0.378 | 0.000 |
3 | 0.379 | 0.183 | 0.362 | 0.000 |
4 | 0.348 | 0.213 | 0.343 | 0.000 |
5 | 0.332 | 0.223 | 0.329 | 0.000 |
6 | 0.330 | 0.230 | 0.326 | 0.000 |
7 | 0.333 | 0.242 | 0.327 | 0.000 |
8 | 0.329 | 0.255 | 0.324 | 0.000 |
9 | 0.326 | 0.260 | 0.318 | 0.000 |
10 | 0.337 | 0.264 | 0.323 | 0.000 |
ANN1 2 Input (‘activation’: ‘ tanh ‘, ‘hidden_layer_sizes’: (150), ‘solver’: ‘sgd’) | ANN2 2 Input (activation = ‘relu’, hidden_layer_sizes = (5), solver = ‘lbfgs’) | ANN3 4 Input (activation = ‘relu’, hidden_layer_sizes = (5), solver = ‘lbfgs’) | |||
---|---|---|---|---|---|
Mean MSE Cross-Validation | MSE Self-Assessment | Mean MSE Cross-Validation | MSE Self-Assessment | Mean MSE Cross-Validation | MSE Self-Assessment |
0.285 | 0.286 | 0.313 | 0.285 | 0.346 | 0.278 |
Model | kNN | WkNN | CB1 | CB2 | ANN1 | ANN2 | ANN3 | Kriging | MLBS |
---|---|---|---|---|---|---|---|---|---|
Parameters | k: 5 | k: 9, weights: ‘distance’ | 2 features: (X coord, Y coord) ‘depth’: 10, ‘iterations’: 60, ‘learning_rate’: 0.03, ‘loss_function’: ‘RMSE’ | 4 features: (X coord, Y coord, DNN, PLI NN) ‘depth’: 5, ‘iterations’: 100, ‘learning_rate’: 0.03, ‘loss_function’: ‘MAE’ | 2 inputs (X coord, Y coord) ‘activation’: ‘tanh’, ‘hidden_layer_sizes’: (150), ‘solver’: ‘sgd’ | 2 inputs (X coord, Y coord) activation: ‘relu’, hid-den_layer_sizes: (5), solver: ‘lbfgs’ | 4 inputs (X coord, Y coord, DNN, PLI NN) activation: ‘relu’, hidden_layer_sizes: (5), solver: ‘lbfgs’ | ‘method’: ‘universal’, ‘nlags’: 6, ‘variogram_model’: ‘hole-effect’, ‘weight’: True | n.a. |
Mean MSE Cross-Validation | 0.332 | 0.318 | 0.287 | 0.289 | 0.285 | 0.313 | 0.346 | 0.282 | 0.404 |
MSE Self-Assessment | 0.223 | 0.000 | 0.090 | 0.000 | 0.286 | 0.285 | 0.278 | 0.000 | 0.000 |
Shapiro–Wilk statistic (W) | 0.9758 | 0.9682 | 0.9486 | 0.9839 | 0.0012 | 0.9911 | 0.9065 | 0.8629 | 0.9332 |
Shapiro–Wilk p-value | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
Non-Uniform Prediction | Yes | Yes | Yes | Yes | No | Yes | Yes | Yes | Yes |
Non-Linearity | Yes | Yes | Yes | Yes | No | No | No | Yes | Yes |
Empirical Consistency | Yes | Yes | Yes | No | No | No | No | Yes | Yes |
Smoothness | No | No | No | No | No | Yes | No | Yes | Yes |
Min | 1.176 | 1.083 | 1.132 | 1.148 | 1.629 | 1.605 | 1.432 | 1.061 | 1.001 |
Max | 2.108 | 3.272 | 3.312 | 2.068 | 1.629 | 1.684 | 2.479 | 3.353 | 3.425 |
Min–Max Difference | 1.398 | 0.141 | 0.150 | 1.410 | 2.330 | 2.251 | 1.283 | 0.038 | 0.094 |
Appropriateness Conclusion | Yes | Yes | Yes | No | No | No | No | Yes | Yes |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Safarov, R.; Shomanova, Z.; Nossenko, Y.; Mussayev, Z.; Shomanova, A. Digital Visualization of Environmental Risk Indicators in the Territory of the Urban Industrial Zone. Sustainability 2024, 16, 5190. https://doi.org/10.3390/su16125190
Safarov R, Shomanova Z, Nossenko Y, Mussayev Z, Shomanova A. Digital Visualization of Environmental Risk Indicators in the Territory of the Urban Industrial Zone. Sustainability. 2024; 16(12):5190. https://doi.org/10.3390/su16125190
Chicago/Turabian StyleSafarov, Ruslan, Zhanat Shomanova, Yuriy Nossenko, Zhandos Mussayev, and Ayana Shomanova. 2024. "Digital Visualization of Environmental Risk Indicators in the Territory of the Urban Industrial Zone" Sustainability 16, no. 12: 5190. https://doi.org/10.3390/su16125190
APA StyleSafarov, R., Shomanova, Z., Nossenko, Y., Mussayev, Z., & Shomanova, A. (2024). Digital Visualization of Environmental Risk Indicators in the Territory of the Urban Industrial Zone. Sustainability, 16(12), 5190. https://doi.org/10.3390/su16125190