Next Article in Journal
Managed Aquifer Recharge for Sustainable Groundwater Management: New Developments, Challenges, and Future Prospects
Previous Article in Journal
Impact of Flooding on Lands with Emerging Contaminants on the Quality of Receiving Water Bodies
Previous Article in Special Issue
Development on Surrogate Models for Predicting Plume Evolution Features of Groundwater Contamination with Natural Attenuation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hydrogeochemical Characteristics and Formation Mechanisms of High-Arsenic Groundwater in the North China Plain: Insights from Hydrogeochemical Analysis and Unsupervised Machine Learning

1
The Chinese Academy of Environmental Sciences, Beijing 100012, China
2
Soil and Agricultural Ecological Environment Supervision Technology Center, Ministry of Ecology and Environment, Beijing 100012, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Water 2024, 16(22), 3215; https://doi.org/10.3390/w16223215
Submission received: 29 September 2024 / Revised: 21 October 2024 / Accepted: 29 October 2024 / Published: 8 November 2024

Abstract

:
Hydrochemical data were utilized in this study to elucidate the hydrogeochemical characteristics and genesis of high-arsenic groundwater in the North China Plain, employing both traditional hydrogeochemical analysis and unsupervised machine learning techniques. The findings indicate that the predominant hydrochemical types of groundwater in the study area are HCO3-Ca·Na and SO4·Cl-Na·Ca. The primary mechanism influencing groundwater chemistry has been identified as rock weathering. The unsupervised machine learning framework incorporates various methods, such as principal component analysis (PCA), non-negative matrix factorization (NMF), machine learning models (gradient boosting trees and random forests), and cluster analysis to explore the characteristics and genesis of groundwater hydrochemical types within the study area. This study demonstrated that the formation mechanism of high-arsenic groundwater results from multiple interacting factors.

1. Introduction

Groundwater arsenic contamination represents a significant environmental challenge worldwide [1]. This issue is particularly pronounced in East Asia. For instance, rural areas surrounding Hanoi, Vietnam, exhibit groundwater arsenic concentrations averaging 430 µg/L in drinking water [2]. In the southwestern region of Bangladesh, located within the Ganges Delta Plain, groundwater arsenic concentrations vary from 43.1 to 1352 µg/L, substantially exceeding the World Health Organization (WHO) guidelines for drinking water (10 µg/L) [3]. In China, the central Datong Basin has recorded groundwater arsenic concentrations reaching up to 366 µg/L [4].
Consumption of high-arsenic groundwater presents considerable health risks to humans, while utilizing such water for crop irrigation can result in arsenic accumulation in agricultural produce, leading to a series of subsequent pollution hazards. Consequently, identifying the factors that influence arsenic concentrations in high-arsenic groundwater and understanding the mechanisms that govern its release are urgent and essential tasks.
The study area is located in Xinxiang City, Henan Province, China, which falls within the arid and semi-arid regions of Northern China. Numerous previous studies have indicated that closed inland basins and aquifer environments under strongly reducing conditions in arid and semi-arid regions are conducive to the accumulation of arsenic in groundwater [5,6,7]. Additionally, the extremely high arsenic concentrations in this area are influenced directly or indirectly by groundwater overextraction and paleoriver channels. Previous researchers, such as Bian Jianmin et al. [8], reported that the spatial distribution of high-arsenic groundwater in Tongyu County shows arsenic enrichment in the low-lying central and northern areas along the horizontal plane, with relatively high arsenic concentrations in aquifers at depths ranging from 30 to 50 m. The total arsenic content is highest in Cl-HCO3-Na-type water.
As research into groundwater contamination mechanisms has progressed, subsequent studies have concentrated on the enrichment mechanisms and genesis of high-arsenic groundwater. For instance, Zhang Di [9] identified the reductive dissolution of iron oxides as the primary cause of elevated arsenic levels in groundwater in Tongyu County. In contrast, sulfide precipitation was found to inhibit arsenic mobility and enrichment in deeper groundwater, while human agricultural activities emerged as the principal factor contributing to arsenic enrichment in groundwater.
This study aims to analyze the spatial distribution and characteristics of high-arsenic groundwater in the study area presented within this paper through field sampling and testing. By integrating these findings with those from prior research, further investigation into the formation mechanisms and influencing factors of arsenic in the region is pursued. This research offers scientific evidence to enhance the understanding and management of high-arsenic groundwater contamination in similar areas [10].

2. Study Area

Overview of the Study Area

The study area is located in Xinxiang City, Henan Province, within the North China Plain, covering a total area of 4847.8 km2. To the west, the region is bordered by the Taihang Mountains, while the Yellow River forms the boundary to the east and south, with Anyang City situated to the north. The terrain generally slopes from west to east, with elevations ranging from 60 to 95 m. The region experiences a warm temperate, semi-humid, and semi-arid continental monsoon climate. Historically, the Yellow River has breached its banks and changed course multiple times in northern Henan, complicating the stratigraphy and depositional environment in areas of the paleoriver channel. This has led to the formation of distinct geomorphological features such as alluvial fans, floodplains, and Yellow River breakout fans across the surface [11] (Figure 1a).
The study area lies within the downstream segment of the paleo-Yellow River, between Wuzhi’s and Neihuang’s groundwater systems. The aquifers in the region primarily consist of loose rock pore aquifers, with lithologies that include loose fine sand, medium-coarse sand, gravel, loess-like silty sand, thin layers of sand, and silty clay with gravel lenses. This configuration results in a pronounced interbedded structure of sand and clay. Furthermore, the region is characterized by relatively low hydraulic conductivity, which contributes to the development of anaerobic or hypoxic conditions, thus affecting the formation of high-arsenic groundwater in the area.
The depth to the aquifer base typically ranges from 60 to 110 m, with some areas near the Yellow River exhibiting highly productive aquifers, where individual wells can yield between 3000 and 5000 m3/day (Figure 1b). Groundwater recharge in the system occurs through atmospheric precipitation, river infiltration, and irrigation return flow. Groundwater extraction, primarily driven by agricultural irrigation and industrial water use, has become the main means of groundwater discharge [12].

3. Methods and Materials

3.1. Sample Collection and Analysis

The hydrochemical data in this study were obtained from field-sampled spring water analyses and collected from the literature. In July 2023, 176 spring water samples were collected from the study area, all of which are located within the Xinxiang city region of Henan Province, China. Sample bottles were prewashed with distilled water and rinsed three times with well water from each target sampling site before collection. On site pH values were measured, and the samples were sealed, labeled, stored at 4 °C, and transported to the laboratory for water quality testing. The analysis was performed in accordance with the “Standards for Drinking Natural Mineral Water” (GB 8537-2018, National Food Safety Standards) and “Standards for Maximum Levels of Contaminants in Food” (GB 2762-2017, National Food Safety Standards).
A Shimadzu AA-6000CF atomic absorption spectrophotometer(Shimadzu Corporation, Kyoto, Japan ) and a Dionex ICS-2100 ion chromatograph (Thermo Fisher Scientific, Waltham, MA, USA) were utilized to measure sodium, potassium, calcium, magnesium, bicarbonate, chloride, sulfate, iron, and hydroxide ions. A spectrophotometer was employed to determine the contents of nitrate (NO3), nitrite (NO2), and ammonium nitrogen (NH4+), while temperature, odor, pH, and other physical characteristics were assessed in the field.

3.2. Data Processing and Analysis

3.2.1. Traditional Hydrogeochemical Techniques

To analyze the hydrochemical characteristics, Origin 2021 and Python(3.11) were utilized to create Piper trilinear diagrams, Pearson correlation plots, and Gibbs diagrams. These tools qualitatively describe the relationships among various components in water samples and their environmental behaviors, aiding in the identification of water–rock interactions and other processes affecting water chemistry. This methodology was employed to evaluate the water type and ascertain redox processes in the groundwater.

3.2.2. Unsupervised Machine Learning

The unsupervised machine learning module in this study processed the data using Python with applied principal component analysis (PCA) and K-means clustering techniques for the validation and analysis. PCA was utilized to identify and quantify the most significant factors influencing groundwater chemistry, extracting the primary influencing factors and sources of variation. The K-means clustering method was subsequently employed to categorize groundwater data based on the optimal number of clusters indicated by specific metrics [13].
To accurately represent the hydrochemical properties of the groundwater in the study area, 12 variables (pH, Na+, K+, Ca2+, Mg2+, Fe2+, HCO3, SO42−, Cl, NO3, NH4+, and TDS) were utilized as the input dataset. Following data processing, the self-organizing map (SOM) technique was applied for initial classification, followed by PCA and K-means clustering for additional analysis. Through 10,000 iterations of PCA and K-means clustering, stability was achieved by the random forest and gradient boosting tree models, allowing for the validation of the raw hydrochemical data and the analysis of mechanisms governing hydrochemical formation.

4. Results and Discussion

4.1. Spatial Distribution of High-Arsenic Groundwater

The statistical results indicate that the arsenic (As) concentration in the 176 groundwater samples collected from the study area ranged from 0.01 to 190 μg/L, with an average concentration of 24.88 μg/L (Table 1 and Figure 2). Among these samples, 61 (34.65%) exhibited As concentrations surpassing the World Health Organization’s guideline of 10 μg/L. The data reveal that high-arsenic groundwater (As > 10 μg/L) is generally weakly alkaline, with pH values ranging from 6.93 to 7.9 and an average of 7.748. This suggests that strongly reducing conditions are more conducive to arsenic enrichment. In high-arsenic groundwater, the average concentration of NO3 was 0.95 mg/L, which was lower than that in low-arsenic groundwater. Additionally, the average concentration of Fe2+ in the high-arsenic groundwater was 2.2 mg/L, whereas it was 0.95 mg/L in the low-arsenic samples. The average NH4+ concentration was 0.5 mg/L in high-arsenic groundwater, whereas it was only 0.28 mg/L in low-arsenic groundwater [14,15,16,17,18,19].
For the water chemistry indicators, pH is dimensionless, As is measured in μg·L−1, and the rest are measured in mg·L−1 [20,21].
From a spatial distribution perspective (Figure 3), high-arsenic groundwater is predominantly located in the central interfluvial zone and paleoriver channels of the study area (Table 2). The highest arsenic concentrations were observed in groundwater from Dongtun township and Zuocheng township in Yanjin County, where the concentrations reached 150 and 190 μg/L, respectively. These areas are situated in low-lying regions at the front of alluvial fans and within the Wuzhi interfluvial zone. The likelihood of high-arsenic groundwater occurring near paleoriver channels has further increased due to historical changes in the Yellow River.
The correlation between total arsenic concentration and well depth (Figure 4) indicates that high-arsenic groundwater is predominantly found at depths ranging from 30 to 75 m. Groundwater samples with arsenic levels surpassing 50 μg/L are primarily located in aquifers at depths of 30–45 m. Hydrogeological information demonstrates that the thickness of quaternary loose sediments in the piedmont plain ranges from 2 to 50 m. The active hydrogeochemical processes influenced by groundwater flow in this area promote the mobilization of arsenic from sedimentary environments [14].

4.2. Hydrochemical Characteristics

Based on the analysis of the Piper trilinear diagram (Figure 5), the majority of the groundwater sample points are located in the central area of the cation triangle, suggesting that Na+, K+, and Ca2+ are the predominant cations in the groundwater. In contrast, the distribution of groundwater samples in the anion triangle exhibits greater variability. A considerable number of samples group in the region characterized by HCO3.
Figure 5 indicates that the predominant hydrochemical type of groundwater in the study area is HCO3-Ca·Na, suggesting that groundwater composition is primarily influenced by rock weathering and recharge processes. In some areas, SO4·Cl-Na·Ca-type water is also observed, indicating potential influences from sulfate, chloride, and evaporation processes.
The distribution of cations in the groundwater displays a distinct trend of increased Na+ and K+ concentrations, likely associated with cation exchange processes and extended water–rock interactions. The relatively high levels of Ca2+ and Mg2+ imply that the weathering of carbonate rocks significantly affects groundwater composition. Furthermore, the location of the study area within a plain region results in slower flow velocities, which contribute to prolonged evaporation—leading to elevated concentrations of Cl and SO42− in the groundwater and the formation of SO4·Cl-Na·Ca-type water [15,16,17,18,19,22].
According to the Schoeller classification system, the ionic composition of groundwater is mainly governed by cation exchange, evaporation, and carbonate rock weathering. The presence of ions exceeding 25% milliequivalent percentages for both cations and anions reflects a varied hydrochemical landscape within the study area. High-arsenic groundwater is primarily located in regions characterized by high total dissolved solids (TDS), indicating that these samples exhibit relatively high mineralization, potentially influenced by a combination of evaporation, rock weathering, and anthropogenic activities.
In summary, the primary groundwater types in the study area are HCO3-Ca·Na and SO4·Cl-Na·Ca, reflecting the complex hydrogeological conditions across different regions [23].
Another useful index for water classification is Ionic Salinity or Total Ionic Salinity (TIS), showing the sum of anion and cation total contents (expressed in meq/L). Iso-TIS lines are reported in Figure 6 (SO4 2− vs. HCO3 + Cl), where low-arsenic groundwater falls in the 4 and 56 meq/L range of the iso-TIS lines and high-arsenic groundwater show TIS (from 12 to 44 meq/L).

4.3. Mechanism of High-Arsenic Groundwater Formation

Through comprehensive analysis of surface water hydrochemical data, Gibson suggested that the primary mechanisms influencing natural water systems can be categorized into three types: evaporation concentration, rock weathering, and atmospheric precipitation. The Gibbs diagram employs Na+/(Na+ + Ca2+) and Cl/(Cl + HCO3) as the horizontal axes, with the logarithm of total dissolved solids (TDS) as the vertical axis. Characteristic regions corresponding to these mechanisms are illustrated in Figure 5 [24,25,26,27].
In the Gibbs diagram, the TDS exceeds 1000 mg/L and ion ratios are above 0.5. The region dominated by rock weathering is found in the central-left area of the diagram, characterized by TDS values ranging from 100 to 1000 mg/L and ion ratios below 0.5. The area dominated by atmospheric precipitation is located in the lower left, where TDS is under 100 mg/L and ion concentration ratios are greater than 0.5. Numerous studies have established the applicability of the Gibbs diagram to groundwater, and it has been extensively utilized in research concerning groundwater formation mechanisms; however, its inability to account for human activities introduces certain limitations.
The results (Figure 7) indicate that the majority of groundwater samples exhibit TDS values between 100 and 1000 mg/L, positioning them within the central region of the diagram. These findings suggest that rock weathering (water–rock interaction) is the primary process influencing groundwater geochemistry in the study area. It is inferred that the chemical composition of high-arsenic groundwater mainly results from the weathering of carbonate and silicate rocks. Previous research has indicated that Ca2+ is the predominant cation in groundwater, implying that groundwater composition may be influenced primarily by carbonate weathering.
The region dominated by evaporation concentration is positioned in the upper right of both figures. The Cl/(Cl + HCO3) ratio represents the relative amounts of chloride and bicarbonate ions among the total anions. A ratio near 0 signifies HCO3 dominance, while a ratio close to 1 reflects Cl dominance. Data reveal that most groundwater samples possess Cl/(Cl + HCO3) ratios ranging from 0.2 to 0.6, indicating significant contributions from both bicarbonate and chloride ions in these waters.
The total dissolved solids (TDS) values indicate the overall mineral content dissolved in the water and serve as crucial indicators of water hardness and salinity. TDS values in the study area range from about 10 mg/L to over 1000 mg/L, demonstrating considerable variations in groundwater mineralization. In the TDS versus Cl/(Cl + HCO3) ratio plot, most high-arsenic groundwater samples (represented by red dots) show relatively high TDS values. This suggests a correlation between high-arsenic groundwater and increased mineralization, often associated with deeper groundwater flow or prolonged contact between groundwater and minerals in the slow-flowing waters of the plain region.
Additionally, further investigations have indicated that carbonate weathering has a more substantial influence on groundwater chemistry than silicate weathering in the study area. In arid to semi-arid inland basins, evaporation concentration also impacts the chemical characteristics of high-arsenic groundwater, alongside weathering. Figure 5 illustrates an upward trend of sample points toward the upper right, with some samples located within the evaporation concentration-dominated region, indicating a secondary role of evaporation in the groundwater geochemistry of the study area. Nevertheless, the effect of evaporation on arsenic concentration in groundwater remains limited.
In the analysis of groundwater sources (Figure 8), correlations among various ions can be utilized to investigate groundwater formation mechanisms and hydrogeochemical processes. Pearson correlation matrices were applied to examine the relationships between key ions that may influence arsenic enrichment in the groundwater of Zhouzhi and its surrounding areas. The results are presented in Figure 7. The correlation coefficients range within [−1, 1], where positive values indicate a positive correlation and negative values signify a negative correlation.
As illustrated in Figure 7, arsenic enrichment is primarily governed by reducing conditions. Correlation analysis indicated that arsenic is significantly positively correlated with Fe2+ and NH4+ (0.621 and 0.563, respectively), suggesting that under reducing conditions, the reductive dissolution of iron and the decomposition of organic matter—which generate ammonium—may enhance the release and migration of arsenic. In such environments, arsenic predominantly exists as As(III), which is more mobile and soluble, leading to increased arsenic concentrations.
Conversely, arsenic displays a negative correlation with oxidative indicators such as SO42− and NO3 (−0.173 and −0.102, respectively), implying that oxidizing conditions may promote arsenic immobilization, thereby inhibiting its dissolution. The presence of sulfate and nitrate generally indicates oxidizing conditions, where As(V) is more stable and tends to adsorb onto mineral surfaces, such as iron oxides, limiting arsenic migration. Therefore, dynamic variations in redox conditions are crucial in influencing arsenic transport in groundwater [27,28,29,30,31].
Additionally, the correlation between arsenic and the TDS (total dissolved solids) is relatively low (−0.148), indicating no significant linear relationship between arsenic enrichment and groundwater mineralization. This suggests that while mineral dissolution increases the overall mineral content in groundwater, it does not directly contribute to higher arsenic concentrations. Similarly, the correlation between arsenic and pH is weak (−0.017), suggesting that pH has a limited direct effect on arsenic concentration, although it may indirectly influence arsenic migration by affecting its speciation and adsorption behavior.
In summary, the mechanism behind high arsenic concentrations in groundwater in this area is closely associated with the stable presence of As(III) in localized reducing environments. Specifically, processes such as the reductive dissolution of iron minerals and the degradation of organic matter facilitate the release of arsenic from the solid phase into the aqueous phase. Moreover, changes in redox conditions are critical for arsenic mobilization and immobilization. Therefore, future strategies to mitigate high arsenic levels in groundwater should focus on regulating redox conditions to minimize arsenic migration and accumulation.

4.4. Redox Reactions

Indicators such as Fe2+ and NH4+ serve as sensitive components reflecting the redox environment of groundwater. Analyzing the correlations among nitrogen, iron, and arsenic in groundwater allows for the qualitative identification of environmental characteristics associated with arsenic occurrence. Iron oxides act as the primary carriers of arsenic within aquifers. Arsenic generally exists on the surface of iron oxides in either adsorbed or bonded forms, and nitrogen can affect arsenic enrichment through dynamic interactions with Fe(II) present in iron oxides [32,33].
Figure 9 depicts the relationships among the concentrations of Fe2+ (ferrous iron), NH4+ (ammonium), and arsenic in groundwater, contrasting high-arsenic groundwater with groundwater with normal arsenic levels. This emphasizes the influence of reductive dissolution of iron oxides on arsenic enrichment in reducing environments. Examining the variations in Fe2+ and NH4+ concentrations aids in understanding the mechanisms of arsenic release under reducing conditions. The figure indicates that as the Fe2+ concentration rises, NH4+ concentration also increases, particularly in high-arsenic groundwater samples. This trend suggests an interaction between the reductive dissolution of iron oxides and the degradation of organic matter under reducing conditions. The rise in Fe2+ signifies the dissolution of iron oxides, while NH4+ results from the anaerobic decomposition of organic matter. The concurrent increase in these two components reflects the intensity of the reducing environment.
The elevation of Fe2+ and NH4+ concentrations in groundwater is often linked with the release of arsenic, particularly As(III), which is adsorbed onto iron oxide surfaces. As illustrated in the circled region of the figure, Fe2+ and NH4+ concentrations between 1 and 10 mg/L are densely clustered, especially in high-arsenic groundwater. This phenomenon indicates that when reducing conditions are sufficiently strong, the dissolution of iron oxides releases both Fe2+ and arsenic into the groundwater. Under such conditions, Fe2+ acts as an indicator of reducing environments and is closely associated with arsenic enrichment. High-arsenic groundwater exhibits Fe2+ concentrations ranging from 1 to 10 mg/L and NH4+ concentrations from 0.1 to 10 mg/L, suggesting that arsenic dissolution and migration predominantly occur under strongly reducing conditions.
In contrast, groundwater samples with normal values of arsenic (blue squares) present lower concentrations of Fe2+ and NH4+, indicating weaker reducing conditions that limit arsenic dissolution and migration. These samples are primarily located in regions where Fe2+ concentrations fall below 1 mg/L and NH4+ concentrations are below 0.1 mg/L, suggesting that the reductive dissolution of iron oxides is relatively weak, thereby restricting the release of arsenic adsorbed onto iron oxide surfaces. Typically, more oxidizing environments hinder the accumulation of Fe2+ and NH4+, consequently restricting arsenic migration.
The process of the reductive dissolution of iron oxides plays a key role in arsenic release under reducing conditions. When the Fe2+ concentration reaches about 10 mg/L, the arsenic concentration in high-arsenic groundwater increases significantly, highlighting the close association between arsenic enrichment and the reductive dissolution of iron oxides. During this process, iron serves as a major carrier of arsenic in reducing environments, and arsenic adsorbed on iron oxide surfaces is released into groundwater as the iron dissolves. In strongly reducing environments, As(III) is the more stable and soluble form, further enhancing arsenic migration and accumulation in groundwater.
The increase in ammonium (NH4+) concentrations reflects the degradation and decomposition of organic matter in groundwater. The correlation between NH4+ and Fe2+ indicates that in reducing environments, the decomposition of organic matter not only enhances the accumulation of ammonium but also indirectly intensifies reductive processes, which further accelerates the dissolution of iron oxides. This process is accompanied by the migration of arsenic from the solid phase to the aqueous phase. Consequently, the degradation of organic matter and the presence of anaerobic conditions establish favorable conditions for arsenic release.
The components or elements sensitive to oxidation–reduction in groundwater, such as NO3 and the SO42−/Cl molar ratio, serve as indicators of the surrounding redox environment, as evidenced by the correlation between these components and arsenic concentrations (Figure 10). In instances where the NO3 concentration exceeds 15 mg/L, the aquifer is characterized by a (weak) oxidizing state and the arsenic concentration remains relatively low, below 10 μg/L. This phenomenon occurs because the elevated redox potential of the NO3/NO2 pair inhibits the reduction of As(V) to As(III) and hinders the reductive dissolution of Fe/Mn (hydr)oxides, thereby preventing the mobilization of arsenic into the aqueous phase.
Conversely, samples exhibiting high arsenic concentrations are associated with extremely low SO42−/Cl ratios, suggesting that the groundwater environment is in a pronounced sulfate-reducing state. The reduced sulfur species (HS and S2) generated during sulfate reduction can form thioarsenic complexes that exhibit high mobility and facilitate the migration and release of arsenic, which results in arsenic accumulation in these samples. Furthermore, HS and S2 react with Fe2+ to produce insoluble iron sulfides, leading to reduced Fe2+ concentrations in these samples. Some studies have indicated that arsenic coprecipitates with iron sulfides under conditions of strong reduction, thus decreasing its mobility. However, this conclusion appears inconsistent with the present findings, which is potentially attributable to the specific types of iron sulfide minerals that form within the groundwater environment. Laboratory simulations have demonstrated that amorphous FeS has a limited impact on arsenic coprecipitation, while FeS2 plays a more significant role [5,6,7,31,34,35,36,37,38,39,40,41,42].
Compared with oxidized As(V), reduced As(III) is more mobile and more easily activated in the aqueous phase. In oxidizing environments, arsenic typically coexists with Fe oxides or hydroxides through adsorption or coprecipitation; however, in reducing environments, the reductive dissolution of Fe oxides results in the release of coexisting arsenic into the aqueous phase. In strongly reducing environments, reduced sulfur species (S°, HS, and S2) formed during sulfate reduction can either react with Fe2+ to form iron sulfides (e.g., FeS and FeS2) that partially immobilize arsenic or form thioarsenic complexes with arsenic, increasing its mobility. Additionally, sulfur species such as S° and HS can act as electron shuttles, facilitating the reductive dissolution of iron oxides, especially under microbial mediation. This is in line with previous studies on the influence of iron and sulfur on arsenic, and it can also be confirmed that iron and sulfur affect the morphology and content of arsenic by affecting the soil redox potential.

4.5. Verification and Analysis of Factors Influencing as Formation in Shallow Groundwater

In this study, traditional hydrogeochemical methods were initially utilized to analyze the water chemistry characteristics and genesis of the study area. To facilitate a more comprehensive and precise analysis, machine learning techniques—including principal component analysis (PCA), K-means clustering, non-negative matrix factorization (NMF), random forest analysis, and gradient boosting tree analysis—were employed to systematically assess the hydrochemical data. The mechanisms underlying the formation of high-arsenic groundwater in the study area were further discussed and validated.
PCA was employed to extract key hydrochemical information from the groundwater samples, with the first five principal components explaining 74.04% of the total variance, underscoring their importance in characterizing the chemical properties of the water samples. Analysis of the principal components (Table 3) indicated that PC1 accounted for 35.46% of the variance, with significant contributing factors including TDS (loading 0.4128), Ca2+ (loading 0.2922), and Mg2+ (loading 0.3933). However, while previous correlation coefficient analyses suggested a weak relationship between TDS and As, the PCA findings imply that elevated TDS levels reflect high groundwater mineralization, which may be linked to the dissolution of arsenic-bearing minerals. This aligns with the mechanisms of arsenic formation in high-arsenic groundwater as increased mineralization is often associated with the aquifer depth, water–rock interactions, and evaporation processes. Thus, PC1 signifies the primary role of mineralization in the release and migration of arsenic.
PC2 accounted for 12.52% of the variance, with a pH loading of −0.2991. Groundwater pH directly influences arsenic speciation: at elevated pH levels, arsenic is more likely to exist as As(V), which is less mobile, while under acidic conditions, As(III) is more mobile, facilitating arsenic entry into water. In high-arsenic groundwater, the carbonate system and acid–base balance regulate arsenic behavior. Carbonates act as buffers for pH levels and affect arsenic speciation and solubility. PC2 indicates that arsenic mobility is enhanced in environments characterized by high alkalinity and elevated HCO3 concentrations. Carbonate equilibrium may decrease arsenic adsorption by forming calcium carbonate precipitates, consequently increasing dissolved arsenic levels.
PC3 explained 10.95% of the variance, primarily associated with Fe2+ and Mg2+. Redox reactions involving Fe2+ are critical in arsenic release and migration. Under reducing conditions, the reductive dissolution of the iron oxides releases adsorbed arsenic, thereby increasing arsenic concentrations in the groundwater. This process typically occurs in anaerobic environments enriched with organic matter. PC3 suggests that metal ions, especially iron, significantly contribute to arsenic enrichment, aligning with the observations depicted in Figure 8.
Subsequent cluster analysis categorized the water samples into three distinct groups. Table 4 outlines the hydrochemical characteristics among these clusters. The results reveal that Cluster 2 possesses a markedly higher mean PC1 value (27.21) compared to the other clusters, indicating extremely high mineralization and potentially elevated arsenic concentrations. Further examination of cluster centers indicated that Cluster 1, with a mean PC1 value of 4.89, represents groundwater with moderate TDS and metal ion concentrations, likely corresponding to typical mineralized groundwater without particularly high arsenic levels. Cluster 2, exhibiting the highest mineralization, is likely influenced by enhanced mineral dissolution and water–rock interactions, leading to arsenic enrichment. Cluster 3 displays a lower mean PC1 value (−0.43), suggesting reduced mineralization, potentially correlating with shallow groundwater characterized by lower arsenic mobility.
Through additional factor decomposition of the PCA results using non-negative matrix factorization (NMF), three primary factors were identified: Factor 1 (pH factor), Factor 2 (salinity factor), and Factor 3 (metal content factor) (Figure 11c).
Factor 1 demonstrates the greatest characteristic weight for pH, recorded at 10.61, signifying the importance of pH in regulating arsenic behavior in groundwater. In mildly alkaline conditions (pH ≈ 7.4), As(V) exhibits increased stability in oxidizing environments and tends to be significantly adsorbed, reducing its likelihood of dissolution in water. Conversely, in reducing conditions, the presence of As(III) promotes arsenic mobility and enhances its solubility. Consequently, fluctuations in pH directly influence the chemical speciation of arsenic and its bioavailability.
Factor 2 is strongly linked to salinity, displaying high weights for Na+ (10.43) and Cl (5.43). This indicates a direct connection between this factor and water salinity. Elevated salinity is frequently associated with phenomena such as seawater intrusion or evaporation, which may result in arsenic release through mineral dissolution. For instance, in areas impacted by seawater intrusion, increased salinity in groundwater can facilitate the dissolution of arsenic-bearing minerals, consequently elevating arsenic concentrations [42,43,44,45,46,47,48,49,50,51].
Factor 3 is primarily associated with metal ions, particularly Fe2+ and Mg2+, which have weights of 7.36 and 7.90, respectively. Iron fulfills a dual function in groundwater systems: under oxidizing conditions, iron oxides can adsorb arsenic, while under reducing conditions, the dissolution of iron oxides can lead to arsenic release into the water. This underscores the significant role of metal ions, especially iron, in the migration and accumulation of arsenic in groundwater.
In research on high-arsenic groundwater, employing machine learning models like gradient boosting trees (GBTs) and random forests (RFs) has been effective in pinpointing crucial hydrochemical features that impact arsenic levels. By analyzing the importance of these features after training the models, not only can arsenic concentrations in groundwater be predicted accurately but the primary factors influencing arsenic activity can also be identified. This section examines in detail the outcomes of both models (refer to Table 5) and investigates the influence of significant features on predicting arsenic concentration.
The gradient boosting tree model reported an R2 value of 0.9111 and a mean squared error (MSE) of 12,784.6562, demonstrating substantial accuracy in forecasting arsenic concentrations. Such precision indicates that the model effectively represents the complex nonlinear interactions between the arsenic concentration and various hydrochemical parameters.
During the analysis of feature importance, Feature_16 was found to have an importance score of 95.06%, significantly higher than other features, which suggests its major role in arsenic concentration predictions. Assuming Feature_16 relates to total dissolved solids (TDS) or specific key metal ions (like Fe2+ or Mg2+), the findings imply that mineralization (TDS) is a predominant factor affecting arsenic levels.
TDS denotes the overall quantity of dissolved ions in water and serves as a crucial marker of mineralization. Elevated TDS levels often suggest extensive interactions between water and rock, potentially leading to significant dissolution of arsenic-containing minerals (e.g., arsenopyrite), resulting in arsenic’s presence in the groundwater. Waters with higher mineralization levels exhibit stronger dissolving capabilities, enhancing arsenic release from minerals. This underlines the exceptional importance of TDS in the gradient boosting tree model. Furthermore, TDS might also relate to groundwater salinity and concentrations of various metal ions. In environments with high salinity, the competitive ionic conditions may hinder the adsorption of arsenic, maintaining its solubility and thus elevating its groundwater concentration [52].
In addition to TDS, metal ions such as Fe2+ and Mg2+ are pivotal in redox conditions. Particularly, the solubilization of Fe2+ under reductive conditions can mobilize arsenic previously adsorbed onto iron oxides into the water. Considering the elevated levels of Fe2+ and Mg2+ in some water samples, these ions could affect arsenic levels by altering the redox state. Therefore, the prominent importance of Feature_16 might also indicate its link to the redox conditions of the groundwater.
The random forest model recorded an R2 value of 0.8716 and an MSE of 18,470.7957, indicating a slightly less accurate prediction capability compared to the gradient boosting tree model, yet it remains effective in illustrating the intricate relationships between arsenic concentrations and hydrochemical variables.
Similar to the gradient boosting tree model, Feature_16 also held a high importance score in the random forest model, further corroborating the critical role of mineralization (TDS) or significant metal ions (like Fe2+) in influencing arsenic mobility. Additionally, Feature_3 and Feature_11 were also identified as highly important in the random forest model, indicating their significant roles in arsenic prediction. Feature_3 is likely associated with specific metal content in the water, such as Ca2+ or Mg2+. These metals are intricately linked to arsenic’s chemical behavior in water, especially in the dissolution of aquifer minerals that can liberate arsenic. Furthermore, these metals might affect arsenic adsorption mechanisms, particularly in environments with higher pH levels, influencing arsenic’s precipitation or dissolution.
Feature_11 is associated with the redox environment of groundwater, potentially tied to the oxidation–reduction processes of Fe2+. Iron not only serves as an adsorption carrier for arsenic but also directly influences its speciation and mobility through redox reactions. In oxidizing environments, iron oxides adsorb arsenic, reducing its mobility; under reducing conditions, iron oxides dissolve, releasing arsenic into the water. Therefore, the redox environment plays a significant role in arsenic migration and enrichment.
From the feature importance analyses of both the gradient boosting tree and random forest models, several key conclusions can be drawn:
Mineralization, particularly TDS, is identified as a critical factor for predicting arsenic concentrations in groundwater. Highly mineralized waters, characterized by stronger dissolving capabilities, frequently exhibit elevated arsenic levels. This phenomenon is attributed to the concentration of dissolved solids, such as metal ions, in the water. As TDS increases, the release and migration of arsenic become more pronounced.
Fe2+ and other metal ions significantly influence arsenic behavior. The redox reactions involving iron ions directly affect the release or adsorption of arsenic, while other metal ions, such as magnesium, may indirectly impact arsenic mobility by altering groundwater hardness or pH levels.
Through the gradient boosting tree and random forest models, key hydrochemical factors affecting arsenic concentrations were successfully identified. These factors, including TDS, metal ions (such as Fe2+ and Mg2+), and redox conditions, play crucial roles in the migration and enrichment of arsenic. The insights provided by these findings are vital for understanding the mechanisms behind the formation of high-arsenic groundwater, emphasizing the roles of mineralization and redox conditions in arsenic mobility.
In summary, arsenic behavior in groundwater is influenced by multiple hydrochemical factors, with mineralization (TDS) being particularly significant. Higher mineralization implies greater dissolving power, facilitating the release of more arsenic-bearing minerals. Redox conditions, closely associated with Fe2+ concentrations, are also vital as the dissolution and precipitation of iron oxides are key to arsenic fixation and migration. In oxidizing environments, iron oxides effectively adsorb arsenic, while under reducing conditions, their dissolution releases arsenic into the water, explaining the correlation between high iron concentrations and elevated arsenic levels. In addition, pH variations significantly impact arsenic speciation and mobility. Under neutral to mildly alkaline conditions, As(V) tends to bind with iron oxides, reducing its mobility; conversely, under acidic or strongly alkaline conditions, As(III) becomes more soluble and mobile, facilitating arsenic’s entry into water.
PCA and NMF analyses indicate that the formation of high-arsenic groundwater is closely related to the dissolution of arsenic-bearing minerals. As minerals like arsenopyrite dissolve due to changes in the hydrochemical environment (e.g., increased salinity or mineralization), arsenic is progressively released. In water samples with high TDS, the enhanced solubility of mineralized water leads to greater arsenic release and enrichment. Iron oxides serve as primary arsenic carriers in groundwater. In oxidizing conditions, they can effectively adsorb arsenic; however, under reducing conditions, the reduction of iron oxides liberates arsenic, increasing its concentration in groundwater. NMF analysis suggests that metal contents of Fe2+ and Mn are important for arsenic release. This mechanism explains the higher arsenic concentrations found in reducing environments, especially in deeper aquifers or groundwater influenced by reducing conditions. Hydrogeological factors, including aquifer types, groundwater flow paths, recharge sources, and geological structures also significantly influence arsenic distribution. According to clustering analysis, different hydrochemical groups may represent distinct hydrogeological environments. Samples in Cluster 2 (high-mineralization clusters) are likely influenced by deeper hydrological conditions, where mineral dissolution and reducing environments lead to arsenic release and migration. In contrast, samples in Cluster 3, characterized by lower mineralization, likely originate from shallow aquifers with less mineral dissolution, resulting in relatively lower arsenic concentrations [51,53,54].

5. Conclusions

  • The groundwater in the study area is predominantly weakly alkaline with low mineralization. Anions are primarily dominated by HCO3, whereas cations are mainly Ca2+ and Mg2+. The primary hydrochemical types are HCO3-Ca·Na and SO4·Cl-Na·Ca;
  • The Gibbs diagram indicates that the dominant mechanisms influencing groundwater chemistry are rock weathering and evaporation, with a significant influence from carbonate dissolution. Correlation analysis of the groundwater chemical parameters revealed strong positive relationships among Fe2+, NH4+, and As. Subsequent redox analysis confirmed that Fe2+ and NH4+ are key factors contributing to the formation of high-arsenic groundwater in the region;
  • This study comprehensively investigated the mechanisms behind the formation of high-arsenic groundwater using multiple analytical methods, including principal component analysis (PCA), non-negative matrix factorization (NMF), machine learning models (gradient boosting trees and random forest), and cluster analysis. The PCA results indicated that the first principal component (PC1) accounted for 35.46% of the total variance, primarily associated with TDS, Ca2+, and Mg2+ ions. Samples with high TDS levels exhibited strong correlations with elevated arsenic concentrations, underscoring the pivotal role of mineral dissolution in arsenic release. The second principal component (PC2), which explained 12.52% of the variance, underscored the substantial influence of pH on arsenic speciation. The third principal component (PC3), capturing 10.95% of the variance, highlighted the significant roles of Fe2+ and Mg2+ in the fixation and release of arsenic;
  • NMF analysis identified three influential factors that further clarified the interactions between hydrochemical parameters and arsenic behavior. Factor 1 (pH factor) demonstrated the profound impact of pH on the dissolution and mobility of arsenic. Factor 2 (salinity factor) emphasized the effects of Na+ and Cl on arsenic solubility, particularly in environments with high mineralization where increased salinity promotes mineral dissolution and subsequent arsenic release. Factor 3 (metal content factor) was closely linked to the concentrations of Fe2+ and Mg2+;
  • The gradient boosting tree model reported an R2 value of 0.9111, reflecting a robust predictive capability. Feature importance analysis revealed that Feature_16 (likely representing TDS or a crucial metal ion) held the highest importance, with a score of 95.06%, indicating that mineralization is a key factor in arsenic migration. The random forest model, achieving an R2 value of 0.8716, similarly highlighted the strong connections between mineralization (TDS) and metal ion concentrations with arsenic enrichment.

Author Contributions

Conceptualization, Y.L. and X.W.; methodology, Y.L.; software, Y.L.; validation, X.W., W.L. and Q.H.; formal analysis, Q.H.; investigation, Y.L. and G.Z; resources, G.Z., W.L. and X.W.; data curation, Y.L.; writing—original draft preparation, Y.L.; writing—review and editing, Y.L. and X.W.; visualization, Y.L. and G.Z.; supervision, W.L.; project administration, W.L. and G.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded to explore the occurrence characteristics and evolution patterns of typical pollutants in groundwater in agricultural irrigation areas in accordance with North China law (2022YFC3703701) and by the Fundamental Research Funds for Central Universities Postgraduate Science and Technology Innovation Fund Project (ZY20240310).

Data Availability Statement

Data is available upon request due to privacy or ethical restrictions The data presented in this study are available upon request from the corresponding authors. The data is not publicly available due to [some data is not suitable for public disclosure and is sensitive].

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kumar, P.J.S. Groundwater fluoride contamination in Coimbatore district: A geochemical characterization, multivariate analysis, and human health risk perspective. Environ. Earth Sci. 2021, 80, 232. [Google Scholar] [CrossRef]
  2. Berg, M.; Tran, H.C.; Nguyen, T.C.; Pham, H.V.; Schertenleib, R.; Giger, W. Arsenic contamination of groundwater and drinking water in Vietnam: A human health threat. Environ. Sci. Technol. 2001, 35, 2621–2626. [Google Scholar] [CrossRef] [PubMed]
  3. Halim, M.A.; Majumder, R.K.; Nessa, S.A.; Hiroshiro, Y.; Uddin, M.J.; Shimada, J.; Jinno, K. Hydrogeochemistry and arsenic contamination of groundwater in the Ganges Delta Plain, Bangladesh. J. Hazard. Mater. 2009, 164, 1335–1345. [Google Scholar] [CrossRef] [PubMed]
  4. Guo, H.; Wang, Y. Geochemical characteristics of shallow groundwater in Datong basin, northwestern China. J. Geochem. Explor. 2005, 87, 109–120. [Google Scholar] [CrossRef]
  5. Deng, Y.M. Geochemical Processes in the High Arsenic Groundwater System in the Western Hetao Basin. Ph.D. Thesis, China University of Geosciences, Wuhan, China, 2008. [Google Scholar]
  6. Zhang, D.; Guo, H.M.; Ni, P.; Wu, Y. Influence of redox conditions on the migration of arsenic release from groundwater-an example of high arsenic groundwater in Tongyu County. Quat. Res. 2014, 34, 1072–1081. [Google Scholar]
  7. Yuan, H.Q.; Li, Q.; Tao, H.F.M.; Aihemaiti; Yang, W.X.; Su, Y.P. Groundwater arsenic enrichment factors of Kuitun river basin, Xinjiang. Environ. Chem. 2020, 39, 524–530. [Google Scholar]
  8. Bian, J.M.; Tang, J.; Feng, L.; Zha, E.S. Hydrogeochemical characteristics in the arsenic poisoning area in western Jilin Province. Hydrogeol. Eng. Geol. 2009, 36, 80–83. [Google Scholar]
  9. Zhao, J.; Li, Y.; Bian, J.; Zhang, L.; Yang, Z. Threshold analysis and health risk assessment of arsenic in groundwater in western Jilin Province. J. Jilin Univ. (Earth Sci. Edit.) 2013, 43, 251–258. [Google Scholar]
  10. Zhang, D.; Guo, H.; Xiu, W.; Ni, P.; Zheng, H.; Wei, C. In-situ mobilization and transformation of iron oxides-adsorbed arsenate in natural groundwater. J. Hazard. Mater. 2017, 321, 228–237. [Google Scholar] [CrossRef]
  11. Jiang, J.; Wang, X.; Su, C.; Wang, M.; Ren, F.; Huq, M.E. Unraveling the impact of dissolved organic matter on arsenic mobilization in alluvial aquifer of the lower Yellow River basin, Northern China. Appl. Geochem. 2023, 158, 105781. [Google Scholar] [CrossRef]
  12. Rivett, M.O.; Buss, S.R.; Morgan, P.; Smith, J.W.N.; Bemment, C.D. Nitrate attenuation in groundwater: A review of biogeochemical controlling processes. Water Res. 2008, 42, 4215–4232. [Google Scholar] [CrossRef] [PubMed]
  13. Liu, Y.; Li, M.; Zhang, Y.; Wu, X.; Zhang, C. Analysis of the hydrogeochemical characteristics and origins of groundwater in the changbai mountain region via inverse hydrogeochemical modeling and unsupervised machine learning. Water 2024, 16, 1853. [Google Scholar] [CrossRef]
  14. Cao, W.; Gao, Z.; Guo, H.; Pan, D.; Qiao, W.; Wang, S.; Ren, Y.; Li, Z. Increases in groundwater arsenic concentrations and risk under decadal groundwater withdrawal in the lower reaches of the Yellow River basin, Henan Province, China. Environ. Pollut. 2022, 296, 118741. [Google Scholar] [CrossRef] [PubMed]
  15. Knappett, P.S.K.; Li, Y.; Loza, I.; Hernandez, H.; Avilés, M.; Haaf, D.; Majumder, S.; Huang, Y.; Lynch, B.; Piña, V. Rising arsenic concentrations from dewatering a geothermally influenced aquifer in central Mexico. Water Res. 2020, 185, 116257. [Google Scholar] [CrossRef]
  16. Liao, Z.; Lin, X.; Shi, Q.; Gnansounou, R.; Du, X. Study on the groundwater exploitation test in the Yellow River lower reaches: A case study on the north suburb waterworks of Zhengzhou China. Sci. China Ser. E 2004, 47, 14–24. [Google Scholar] [CrossRef]
  17. Liu, J.; Cao, G.; Zheng, C. Sustainability of groundwater resources in the North China Plain. In Sustaining Groundwater Resources; Jones, J., Ed.; Springer: Dordrecht, The Netherlands, 2011; pp. 69–87. [Google Scholar]
  18. Mozumder, M.R.H.; Bostick, B.C.; Selim, M.; Islam, M.A.; Shoenfelt, E.M.; Ellis, T.; Mailloux, B.J.; Choudhury, I.; Ahmed, K.M.; van Geen, A. Similar retardation of arsenic in gray Holocene and orange Pleistocene sediments: Evidence from field-based column experiments in Bangladesh. Water Res. 2020, 183, 116081. [Google Scholar] [CrossRef]
  19. Nath, B.; Chakraborty, S.; Burnol, A.; Stüben, D.; Chatterjee, D.; Charlet, L. Mobility of arsenic in the sub-surface environment: An integrated hydrogeochemical study and sorption model of the sandy aquifer materials. J. Hydrol. 2009, 364, 236–248. [Google Scholar] [CrossRef]
  20. Apollaro, C.; Caracausi, A.; Paternoster, M.; Randazzo, P.; Aiuppa, A.; De Rosa, R.; Fuoco, I.; Mongelli, G.; Muto, F.; Vanni, E.; et al. Fluid geochemistry in a low-enthalpy geothermal field along a sector of southern Apennines chain (Italy). J. Geochem. Explor. 2020, 219, 106618. [Google Scholar] [CrossRef]
  21. Apollaro, C.; Buccianti, A.; Vespasiano, G.; Vardè, M.; Fuoco, I.; Barca, D.; Bloise, A.; Miriello, D.; Cofone, F.; Servidio, A.; et al. Comparative geochemical study between the tap waters and the bottled mineral waters in Calabria (Southern Italy) by compositional data analysis (CoDA) developments. Appl. Geochem. 2019, 107, 19–33. [Google Scholar] [CrossRef]
  22. Neidhardt, H.; Rudischer, S.; Eiche, E.; Schneider, M.; Stopelli, E.; Duyen, V.T.; Trang, P.T.K.; Viet, P.H.; Neumann, T.; Berg, M. Phosphate immobilisation dynamics and interaction with arsenic sorption at redox transition zones in floodplain aquifers: Insights from the Red River Delta, Vietnam. J. Hazard. Mater. 2021, 411, 125128. [Google Scholar] [CrossRef]
  23. Cao, J.S.; Zhang, W.J. Research on shallow groundwater recharge and control in Taihang Mountain area of north China. Adv. Mater. Res. 2010, 113, 1572–1576. [Google Scholar] [CrossRef]
  24. Chen, J.; Gu, B.; LeBoeuf, E.J.; Pan, H.; Dai, S. Spectroscopic characterization of the structural and functional properties of natural organic matter fractions. Chemosphere 2002, 48, 59–68. [Google Scholar] [CrossRef] [PubMed]
  25. Rütting, T.; Huygens, D.; Müller, C.; Van Cleemput, O.; Godoy, R.; Boeckx, P. Functional role of DNRA and nitrite reduction in a pristine south Chilean Nothofagus forest. Biogeochemistry 2008, 90, 243–258. [Google Scholar] [CrossRef]
  26. Saxena, V.K.; Mondal, N.C.; Singh, V.S. Reducing arsenic concentration in groundwater. Curr. Sci. 2005, 88, 707–708. [Google Scholar]
  27. Skierszkan, E.K.; Dockrey, J.W.; Mayer, K.U.; Beckie, R.D. Release of geogenic uranium and arsenic results in water-quality impacts in a subarctic permafrost region of granitic and metamorphic geology. J. Geochem. Explor. 2020, 217, 106607. [Google Scholar] [CrossRef]
  28. Tisserand, D.; Pili, E.; Hellmann, R.; Boullier, A.-M.; Charlet, L. Geogenic arsenic in groundwaters in the western Alps. J. Hydrol. 2014, 518, 317–325. [Google Scholar] [CrossRef]
  29. Xie, X.; Wang, Y.; Li, J.; Yu, Q.; Wu, Y.; Su, C.; Duan, M. Effect of irrigation on Fe (III)–SO42− redox cycling and arsenic mobilization in shallow groundwater from the Datong basin, China: Evidence from hydrochemical monitoring and modeling. J. Hydrol. 2015, 523, 128–138. [Google Scholar] [CrossRef]
  30. Xiu, W.; Lloyd, J.; Guo, H.; Dai, W.; Nixon, S.; Bassil, N.M.; Ren, C.; Zhang, C.; Ke, T.; Polya, D. Linking microbial community composition to hydrogeochemistry in the western Hetao Basin: Potential importance of ammonium as an electron donor during arsenic mobilization. Environ. Int. 2020, 136, 105489. [Google Scholar] [CrossRef]
  31. Yang, B.; Zhou, L.; Xue, N.; Li, F.; Li, Y.; Vogt, R.D.; Cong, X.; Yan, Y.; Liu, B. Source apportionment of polycyclic aromatic hydrocarbons in soils of Huanghuai Plain, China: Comparison of three receptor models. Sci. Total Environ. 2013, 443, 31–39. [Google Scholar] [CrossRef]
  32. Liu, Y.; Cai, L.; Wang, X.; Chen, Z.; Yang, W. Efficient adsorption of arsenic in groundwater by hydrated iron oxide and ferromanganese oxide chitosan gel beads. Sep. Purif. Technol. 2023, 315, 123692. [Google Scholar] [CrossRef]
  33. Duan, L.; Song, J.; Zhang, Y.; Yin, M.; Yuan, H.; Li, X. Unraveling seasonal shifts in microbial and geochemical mediated arsenic mobilization at the estuarine sediment-water interface under redox changes. Sci. Total Environ. 2023, 912, 168939. [Google Scholar] [CrossRef] [PubMed]
  34. Schaefer, M.V.; Ying, S.C.; Benner, S.G.; Duan, Y.; Wang, Y.; Fendorf, S. Aquifer arsenic cycling induced by seasonal hydrologic changes within the Yangtze River Basin. Environ. Sci. Technol. 2016, 50, 3521–3529. [Google Scholar] [CrossRef] [PubMed]
  35. Wang, X.; Luo, Y.L.; Deng, W.W. The 3D-EEM characteristics of DOM in high arsenic groundwater of Kuitun, Xinjiang. China Environ. Sci. 2020, 40, 4974–4981. [Google Scholar]
  36. Shakoor, M.B.; Bibi, I.; Niazi, N.K.; Shahid, M.; Nawaz, M.F.; Farooqi, A.; Naidu, R.; Rahman, M.M.; Murtaza, G.; Lüttge, A. The evaluation of arsenic contamination potential, speciation and hydrogeochemical behaviour in aquifers of Punjab, Pakistan. Chemosphere 2018, 199, 737–746. [Google Scholar] [CrossRef]
  37. Rasool, A.; Xiao, T.; Farooqi, A.; Shafeeque, M.; Masood, S.; Ali, S.; Fahad, S.; Nasim, W. Arsenic and heavy metal contaminations in the tube well water of Punjab, Pakistan and risk assessment: A case study. Ecol. Eng. 2016, 95, 90–100. [Google Scholar] [CrossRef]
  38. Lawson, M.; Polya, D.A.; Boyce, A.J.; Bryant, C.; Ballentine, C.J. Tracing organic matter composition and distribution and its role on arsenic release in shallow Cambodian groundwaters. Geochim. Cosmochim. Ac. 2016, 178, 160–177. [Google Scholar] [CrossRef]
  39. Liao, V.H.C.; Chu, Y.J.; Su, Y.C.; Hsiao, S.Y.; Wei, C.C.; Liu, C.W.; Liao, C.M.; Shen, W.C.; Chang, F.J. Arsenite-oxidizing and arsenate-reducing bacteria associated with arsenic-rich groundwater in Taiwan. J. Contam. Hydrol. 2011, 123, 20–29. [Google Scholar] [CrossRef]
  40. Li, P.; Jiang, Z.; Wang, Y.; Deng, Y.; Van Nostrand, J.D.; Yuan, T.; Liu, H.; Wei, D.; Zhou, J. Analysis of the functional gene structure and metabolic potential of microbial community in high arsenic groundwater. Water Res. 2017, 123, 268–276. [Google Scholar] [CrossRef]
  41. Li, Y.; Guo, H.; Hao, C. Arsenic release from shallow aquifers of the Hetao basin, Inner Mongolia: Evidence from bacterial community in aquifer sediments and groundwater. Ecotoxicology 2014, 23, 1900–1914. [Google Scholar] [CrossRef]
  42. Seddique, A.A.; Masuda, H.; Mitamura, M.; Shinoda, K.; Yamanaka, T.; Itai, T.; Maruoka, T.; Uesugi, K.; Ahmed, K.M.; Biswas, D.K. Arsenic release from biotite into a Holocene groundwater aquifer in Bangladesh. Appl. Geochem. 2008, 23, 2236–2248. [Google Scholar] [CrossRef]
  43. Senn, D.B.; Hemond, H.F. Nitrate controls on iron and arsenic in an urban lake. Science 2002, 296, 2373–2376. [Google Scholar] [CrossRef] [PubMed]
  44. Shah, B.A. Groundwater arsenic contamination from parts of the Ghaghara Basin, India: Influence of fluvial geomorphology and Quaternary morphostratigraphy. Appl. Water Sci. 2017, 7, 2587–2595. [Google Scholar] [CrossRef]
  45. Sheikhy Narany, T.; Ramli, M.F.; Aris, A.Z.; Sulaiman, W.N.A.; Juahir, H.; Fakharian, K. Identification of the hydrogeochemical processes in groundwater using classic integrated geochemical methods and geostatistical techniques, in amol-babol plain, iran. Sci. World J. 2014, 2014, 419058. [Google Scholar] [CrossRef]
  46. Singh, H.; Pandey, R.; Singh, S.K.; Shukla, D.N. Assessment of heavy metal contamination in the sediment of the River Ghaghara, a major tributary of the River Ganga in Northern India. Appl. Water Sci. 2017, 7, 4133–4149. [Google Scholar] [CrossRef]
  47. Singh, C.K.; Kumar, A.; Bindal, S. Arsenic contamination in Rapti River basin, Terai region of India. J. Geochem. Explor. 2018, 192, 120–131. [Google Scholar] [CrossRef]
  48. Smedley, P.L.; Kinniburgh, D.G. A review of the source, behaviour and distribution of arsenic in natural waters. Appl. Geochem. 2002, 17, 517–568. [Google Scholar] [CrossRef]
  49. Smedley, P.L.; Zhang, M.; Zhang, G.; Luo, Z. Mobilisation of arsenic and other trace elements in fluviolacustrine aquifers of the Huhhot Basin, Inner Mongolia. Appl. Geochem. 2003, 18, 1453–1477. [Google Scholar] [CrossRef]
  50. Stuckey, J.W.; Schaefer, M.V.; Kocar, B.D.; Benner, S.G.; Fendorf, S. Arsenic release metabolically limited to permanently water-saturated soil in Mekong Delta. Nat. Geosci. 2016, 9, 70–76. [Google Scholar] [CrossRef]
  51. Tufano, K.J.; Fendorf, S. Confounding impacts of iron reduction on arsenic retention. Environ. Sci. Technol. 2008, 42, 4777–4783. [Google Scholar] [CrossRef]
  52. Qu, H.; Ding, K.; Ao, M.; Ye, Z.; Liu, T.; Hu, Z.; Cao, Y.; Morel, J.; Baker, A.; Tang, Y.; et al. New insights into the controversy of reactive mineral-controlled arsenopyrite dissolution and arsenic release. Water Res. 2024, 262, 122051. [Google Scholar] [CrossRef]
  53. Tseng, W.P.; Chu, H.M.; How, S.W.; Fong, J.M.; Lin, C.S.; Yeh, S.H.U. Prevalence of skin cancer in an endemic area of chronic arsenicism in Taiwan. J. Natl. Cancer I 1968, 40, 453–463. [Google Scholar]
  54. Van Geen, A.; Radloff, K.; Aziz, Z.; Cheng, Z.; Huq, M.R.; Ahmed, K.M.; Weinman, B.; Goodbred, S.; Jung, H.B.; Zheng, Y. Comparison of arsenic concentrations in simultaneously-collected groundwater and aquifer particles from Bangladesh, India, Vietnam, and Nepal. Appl. Geochem. 2008, 23, 3244–3251. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Location, topography, landforms, sampling points, and hydrogeological zoning map of the study area: (a) sampling point location map based on elevation; (b) hydrogeological profile map; (c) field drilling map.
Figure 1. Location, topography, landforms, sampling points, and hydrogeological zoning map of the study area: (a) sampling point location map based on elevation; (b) hydrogeological profile map; (c) field drilling map.
Water 16 03215 g001
Figure 2. Box plots of waters displaying hydrochemical factors ((a) is a low-arsenic groundwater of less than 10 mg; (b) is high-arsenic groundwater greater than 10 mg).
Figure 2. Box plots of waters displaying hydrochemical factors ((a) is a low-arsenic groundwater of less than 10 mg; (b) is high-arsenic groundwater greater than 10 mg).
Water 16 03215 g002
Figure 3. Spatial distribution map of high-arsenic groundwater.
Figure 3. Spatial distribution map of high-arsenic groundwater.
Water 16 03215 g003
Figure 4. Distribution relationship between the arsenic concentration and well depth.
Figure 4. Distribution relationship between the arsenic concentration and well depth.
Water 16 03215 g004
Figure 5. (a) Piper triple plot of high-arsenic groundwater and low-arsenic groundwater; (b,c) indicate local tsi supplementary Piper plots and supplementary local water chemical feature types.
Figure 5. (a) Piper triple plot of high-arsenic groundwater and low-arsenic groundwater; (b,c) indicate local tsi supplementary Piper plots and supplementary local water chemical feature types.
Water 16 03215 g005aWater 16 03215 g005b
Figure 6. SO4 vs. Cl + HCO3. Data are in meq/L. Iso–salinity lines are drawn for reference.
Figure 6. SO4 vs. Cl + HCO3. Data are in meq/L. Iso–salinity lines are drawn for reference.
Water 16 03215 g006
Figure 7. Gibbs distribution map of high-arsenic groundwater and normal-value groundwater in the study area.
Figure 7. Gibbs distribution map of high-arsenic groundwater and normal-value groundwater in the study area.
Water 16 03215 g007
Figure 8. Pearson analysis chart of hydrochemical ions.
Figure 8. Pearson analysis chart of hydrochemical ions.
Water 16 03215 g008
Figure 9. Correlations between Fe2+, NH4+, and As in groundwater samples.
Figure 9. Correlations between Fe2+, NH4+, and As in groundwater samples.
Water 16 03215 g009
Figure 10. Scatter plots of As vs. NO3 and SO42−/Cl in groundwater samples.
Figure 10. Scatter plots of As vs. NO3 and SO42−/Cl in groundwater samples.
Water 16 03215 g010
Figure 11. (a) Data distribution box plot; (b) PCA clustering analysis; (c) data distribution histogram; (d) NMF factor analysis.
Figure 11. (a) Data distribution box plot; (b) PCA clustering analysis; (c) data distribution histogram; (d) NMF factor analysis.
Water 16 03215 g011
Table 1. Statistics of groundwater quality parameters.
Table 1. Statistics of groundwater quality parameters.
AsPHCa2+Mg2+K+Na+ClSO42−HCO3NO3NO2NH4+Fe2+FTDS
As < 10Max9.88.49351.98450.97666.451193.92120.63435.9946.697239.350.492.77.63.297825
Min0.016.857.766.170.017.296.355.57265.740.0500.020.030.12271.29
Mean2.867.4398.0273.3811.27168.11137.89274.99541.7511.150.030.280.950.811098.37
As ≥ 10Max1907.9160.48151.8410.13375.79674.65320.55280.0217.570.212.829.251.82413.94
Min106.9324.8522.170.5342.8514.1121.76771.60.045000.070.341990.4
Mean35.137.4886.4156.142.73112.5399.6116.57527.890.950.020.52.20.62767.47
Table 2. Proportion of high-arsenic groundwater in geological divisions.
Table 2. Proportion of high-arsenic groundwater in geological divisions.
Partition TypeCountingProportion
Modern Yellow River Impact Zone3150.8%
Neihuang Ancient River Channel1931.1%
Wuzhi River Interbank Zone914.8%
Puyang ancient river interzone23.3%
Table 3. Dataset of parameters after training the PCA model.
Table 3. Dataset of parameters after training the PCA model.
Methodology
PCAStatistical IndexPC1PC2PC3
Max0.410.580.57
Min−0.13−0.30−0.42
Mean0.180.030.01
Contribution rate35%12%10%
Table 4. Cluster analysis.
Table 4. Cluster analysis.
Cluster 1 MeanCluster 2 MeanCluster 3 Mean
PC14.89321845327.21070411−0.435959501
PC20.163452669−1.665889574−0.003922947
PC3−1.1413527544.8206887520.055541138
PC41.423497914−5.429592234−0.071679233
PC50.37764131−0.392501558−0.023346113
PC60.2010971743.27863246−0.026843749
PC7−1.177240327.164169380.048230065
PC80.1066002131.312202174−0.012470271
PC9−0.057678840.5339115580.001607231
PC100.311394195−0.215691252−0.01969676
PC11−0.05150223−0.0008668720.003408688
PC120.522860824−1.438479223−0.028625182
PC13−0.0069318720.179685313−0.000284196
PC140.176629866−0.31228099−0.01038759
PC15−0.0128503460.026185170.000741406
PC160.0231035390.019183984−0.001606779
Table 5. Random forest feature importance and gradient boosting tree feature importance.
Table 5. Random forest feature importance and gradient boosting tree feature importance.
NumberRandom Forest Model ImportanceGradient Boosting Tree Model Importance
Feature 19.59 × 10−60
Feature 21.09 × 10−51.74 × 10−9
Feature 32.39 × 10−22.45 × 10−2
Feature 49.18 × 10−62.70 × 10−9
Feature 51.20 × 10−55.55 × 10−5
Feature 64.36 × 10−63.57 × 10−4
Feature 76.93 × 10−60
Feature 81.58 × 10−50
Feature 92.54 × 10−62.06 × 10−9
Feature 101.83 × 10−80
Feature 112.87 × 10−22.44 × 10−2
Feature 123.09 × 10−61.73 × 10−5
Feature 132.31 × 10−51.78 × 10−5
Feature 143.43 × 10−42.85 × 10−5
Feature 151.83 × 10−52.30 × 10−5
Feature 169.46 × 10−19.51 × 10−1
Feature 1700
Feature 1800
Feature 1900
Feature 2000
Feature 2100
Feature 2200
Feature 2300
Feature 2400
Feature 2500
Feature 2600
Feature 2700
Feature 284.68 × 10−52.05 × 10−6
Feature 299.41 × 10−60
Feature 302.15 × 10−50
Feature 3100
Feature 323.36 × 10−70
Feature 331.70 × 10−76.33 × 10−8
Feature 341.35 × 10−71.89 × 10−8
Feature 353.86 × 10−84.48 × 10−8
Feature 364.77 × 10−60
Feature 374.69 × 10−60
Feature 382.32 × 10−60
Feature 396.74 × 10−70
Feature 409.75 × 10−43.62 × 10−5
Feature 413.10 × 10−70
Feature 427.61 × 10−69.65 × 10−9
Feature 432.86 × 10−63.04 × 10−8
Feature 445.27 × 10−80
Feature 455.88 × 10−70
Feature 462.97 × 10−70
Feature 4700
Feature 489.00 × 10−70
Feature 491.22 × 10−60
Feature 501.54 × 10−70
Feature 511.79 × 10−60
Feature 526.15 × 10−70
Feature 533.00 × 10−64.12 × 10−8
Feature 549.89 × 10−70
Feature 551.68 × 10−51.14 × 10−7
Feature 567.56 × 10−68.41 × 10−7
Feature 572.27 × 10−67.55 × 10−7
Feature 586.11 × 10−60
Feature 592.28 × 10−55.70 × 10−8
Feature 601.91 × 10−67.48 × 10−8
Feature 611.68 × 10−60
Feature 6200
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, X.; Liu, W.; Liu, Y.; Zhu, G.; Han, Q. Hydrogeochemical Characteristics and Formation Mechanisms of High-Arsenic Groundwater in the North China Plain: Insights from Hydrogeochemical Analysis and Unsupervised Machine Learning. Water 2024, 16, 3215. https://doi.org/10.3390/w16223215

AMA Style

Wu X, Liu W, Liu Y, Zhu G, Han Q. Hydrogeochemical Characteristics and Formation Mechanisms of High-Arsenic Groundwater in the North China Plain: Insights from Hydrogeochemical Analysis and Unsupervised Machine Learning. Water. 2024; 16(22):3215. https://doi.org/10.3390/w16223215

Chicago/Turabian Style

Wu, Xiaofang, Weijiang Liu, Yi Liu, Ganghui Zhu, and Qiaochu Han. 2024. "Hydrogeochemical Characteristics and Formation Mechanisms of High-Arsenic Groundwater in the North China Plain: Insights from Hydrogeochemical Analysis and Unsupervised Machine Learning" Water 16, no. 22: 3215. https://doi.org/10.3390/w16223215

APA Style

Wu, X., Liu, W., Liu, Y., Zhu, G., & Han, Q. (2024). Hydrogeochemical Characteristics and Formation Mechanisms of High-Arsenic Groundwater in the North China Plain: Insights from Hydrogeochemical Analysis and Unsupervised Machine Learning. Water, 16(22), 3215. https://doi.org/10.3390/w16223215

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop