Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis

Ahmadi, Arman; Olyaei, Mohammadali; Heydari, Zahra; Emami, Mohammad; Zeynolabedin, Amin; Ghomlaghi, Arash; Daccache, Andre; Fogg, Graham E.; Sadegh, Mojtaba

doi:10.3390/w14060949

Open AccessEditor’s ChoiceReview

Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis

by

Arman Ahmadi

¹

,

Mohammadali Olyaei

^2,3

,

Zahra Heydari

⁴,

Mohammad Emami

^5,1

,

Amin Zeynolabedin

²,

Arash Ghomlaghi

²

,

Andre Daccache

^1,*

,

Graham E. Fogg

^6,7

and

Mojtaba Sadegh

⁸

¹

Department of Biological and Agricultural Engineering, University of California, Davis, CA 95616, USA

²

School of Civil Engineering, College of Engineering, University of Tehran, Tehran 1417935840, Iran

³

Department of Civil Environmental and Geo-Engineering, University of Minnesota, Minneapolis, MN 55455, USA

⁴

Department of Civil and Environmental Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA

⁵

Department of Water Engineering and Hydraulic Structures, Faculty of Civil Engineering, Semnan University, Semnan 3513119111, Iran

⁶

Hydrologic Sciences Graduate Group, University of California, Davis, CA 95616, USA

⁷

Department of Land, Air and Water Resources, University of California, Davis, CA 95616, USA

⁸

Department of Civil Engineering, Boise State University, Boise, ID 83706, USA

^*

Author to whom correspondence should be addressed.

Water 2022, 14(6), 949; https://doi.org/10.3390/w14060949

Submission received: 9 February 2022 / Revised: 7 March 2022 / Accepted: 15 March 2022 / Published: 17 March 2022

Download

Browse Figures

Versions Notes

Abstract

:

Groundwater is a vital source of freshwater, supporting the livelihood of over two billion people worldwide. The quantitative assessment of groundwater resources is critical for sustainable management of this strained resource, particularly as climate warming, population growth, and socioeconomic development further press the water resources. Rapid growth in the availability of a plethora of in-situ and remotely sensed data alongside advancements in data-driven methods and machine learning offer immense opportunities for an improved assessment of groundwater resources at the local to global levels. This systematic review documents the advancements in this field and evaluates the accuracy of various models, following the protocol developed by the Center for Evidence-Based Conservation. A total of 197 original peer-reviewed articles from 2010–2020 and from 28 countries that employ regression machine learning algorithms for groundwater monitoring or prediction are analyzed and their results are aggregated through a meta-analysis. Our analysis points to the capability of machine learning models to monitor/predict different characteristics of groundwater resources effectively and efficiently. Modeling the groundwater level is the most popular application of machine learning models, and the groundwater level in previous time steps is the most employed input data. The feed-forward artificial neural network is the most employed and accurate model, although the model performance does not exhibit a striking dependence on the model choice, but rather the information content of the input variables. Around 10–12 years of data are required to develop an acceptable machine learning model with a monthly temporal resolution. Finally, advances in machine and deep learning algorithms and computational advancements to merge them with physics-based models offer unprecedented opportunities to employ new information, e.g., InSAR data, for increased spatiotemporal resolution and accuracy of groundwater monitoring and prediction.

Keywords:

groundwater hydrology; water resources; data science; regression machine learning; hydrogeology; artificial neural networks

1. Introduction

Groundwater is the largest global reservoir of liquid freshwater, which is under increasing stress due to overdraft [1]. Groundwater is “the water stored beneath earth’s surface in soil and porous rock aquifers” [2], and plays a principal role in sustaining ecosystems and producing food in a vast area of arid and semi-arid land globally [3]. Groundwater accounts for around 33% of total worldwide water withdrawals [4], and over two billion people rely on groundwater as their main water source [5]. Over-drafting is causing groundwater levels to drop continuously and dramatically in many regions, leading to a global groundwater crisis [2,6,7].

To address the challenges of sustainable groundwater management, it is crucial to have a good understanding of the current status and to forecast future estates of this indispensable resource. There are numerous mechanistic groundwater models, for example using finite difference and finite element techniques to simulate the dynamic behavior of a groundwater system [8,9,10,11], such as MODFLOW [12,13,14]. Numerous studies also applied soft computing techniques for groundwater level or contamination prediction, including GA [15,16,17], ANN [18,19,20,21], and ANFIS [22,23,24,25,26] Generally, physical and numerical models have been the main tool in modeling and forecasting the groundwater level. However, because these traditional methods rely on various inputs and the underlying mechanisms are usually too complicated to grasp, data-driven approaches are used in several recent studies [27,28].

In recent years, there has been a growing interest to employ ML and data-driven approaches to groundwater modeling [29,30,31,32,33,34,35,36,37,38]. Due to the complex nature of groundwater problems, resolving all governing processes is very difficult, and simulation and prediction models are constrained with numerous simplifications and assumptions and endure significant uncertainties [39]. The application of black-box models, such as ML, that can resolve the nonlinear interdependencies of all influential input variables, without the need for complete knowledge of underlying physical or mathematical processes, is appealing [33,40]. Moreover, novel strategies such as linear stochastic approaches and pre-processing techniques have recently been proved to be promising in groundwater level forecasting [27].

This study attempts to systematically review the state-of-the-art application of ML methods in the modeling and prediction of groundwater resources. By conducting a rigorous meta-analysis on the congregated results, this study investigates the suitability of ML models to predict the quality and quantity of groundwater resources. Although the scope of this systematic review is not limited to any specific characteristic, its focus is on groundwater level prediction, as it is by far the most popular application of data-driven techniques in groundwater studies [21,41,42,43]. This study builds upon the previous review articles on the application of ML and deep learning models in hydrology, water resources, and groundwater [44,45,46,47], and bridges the gap for a comprehensive, consistent, and systematic meta-analysis of various ML models in studying groundwater. ML studies of groundwater heavily vary in spatial and temporal scale, background meteorology, ML model construction, sample division, and input variables. As a result, predicted groundwater indices, their spatiotemporal resolution, and their forecast lead time vary widely. Consequently, a robust comparison of the performance of ML models in monitoring and forecasting groundwater characteristics can be challenging. A systematic meta-analysis makes these inter-study comparisons possible by communicating through a pooled summary of combined individual study results [48]. The current study fills these research gaps by following the CEBC protocol for conducting a systematic review [49]. According to CEBC, “A Systematic Review is an evidence synthesis method that aims to answer a specific question as precisely as possible in an unbiased way” [49]. We pose the question: how accurately can ML methods model and predict groundwater resources’ quantitative characteristics? By answering this question through a meta-analysis, we aim to cast light on the performance of ML methods in groundwater resources studies.

2. Methodology

Formulating a well-focused and clearly-framed question is the first and one of the most important steps in the systematic review process. Without a pre-defined question and inclusion-exclusion criteria, it can be challenging and time-consuming to identify appropriate resources and search for relevant literature. Following the procedure developed by the CEBC, we used a specialized framework, called Population, Intervention, Comparator, Outcome—PICO, to form the question systematically and facilitate the literature search [49]. Here, PICO was defined as:

Population: time series of groundwater resources’ quantity or quality characteristics
Intervention: regression ML algorithms
Comparator: observation and measurement
Outcome: predictive capabilities (through quantitative measures of performance like the coefficient of determination)

Using the PICO framework, we designed a search string and used it to search title, abstract, and keywords of literature through two online databases: “Scopus” and “Web of Science”. We used the same search string for both databases simultaneously to avoid any discrepancies. The literature sample was drawn from English, peer-reviewed journal articles, and conference proceedings published between January 2010 and September 2020. The process of searching was performed on 21 September 2020. Adopting a high-sensitivity and low-specificity approach search strategy, the search string was designed to encompass all regression ML methods that have been used in hydrology and hydrogeology, excluding ML methods that are specific to classification. Many articles were initially identified but removed later at the title and abstract screening stage (Figure 1). The search string is presented in the Supplementary Materials.

Adding up the records from both databases, a total of 5762 articles were identified to meet the search string criteria and were stored in a reference manager software (Mendeley). Since we used two databases, there were a considerable number of duplicate articles, and we used two methods to deal with this: an automatic duplicate removal process, which was conducted through Mendeley, and to check the reliability of this process, in parallel, we checked if the title of the records and their DOI was identical in Excel and removed the duplications accordingly. We also used the Fuzzy Lookup procedure in Excel to find similar titles (i.e., titles of the same article with different wordings).

After duplicate removal, 3677 records were retained in the next step: title screening. Assessment of the titles was undertaken by two reviewers, simultaneously and independently, qualifying articles to be retained or removed. If both reviewers agreed to either keep or remove a specific record, the final decision was the agreement. However, in case of a conflict in decisions, a third reviewer checked the record and made the final decision to either keep or remove it. In the end, 878 records remained for the next round of review, namely abstract screening. The records were divided randomly between 6 reviewers. Everyone reviewed the abstracts of assigned records to decide whether each record met the inclusion criteria or not. Inclusion criteria for both title and abstract screening were the same and based on the PICO framework. Specifically, the following criteria should have been met to include the record:

The article should present original research on one or more case studies (i.e., aquifers) that employ a regression ML algorithm to predict a specific and measurable aquifer characteristic in different time steps.
The article should use a time series of input data to train its algorithm.
The article should evaluate the accuracy of the prediction by comparing the ML algorithm outputs with observation.
The article should report its goodness of prediction with quantitative measures of performance (i.e., statistical indices).

After the abstract screening, 347 records were retained, of which 23 were either not retrievable or not in English in their full-text form, which left us with 324 articles for the full-text screening (Figure 1). Six individuals reviewed the retrieved full texts according to inclusion–exclusion criteria as the third step of article screening. After the full-text screening, 127 articles were removed based on the exclusion criteria (Figure 1). Eventually, a total of 197 articles remained to be included in the systematic review and to be investigated in meta-analysis. Figure 1 depicts different steps of the systematic review and the number of records in each step.

Finally, key characteristics of the final papers were extracted in the data extraction stage. A second reviewer also checked a random subset of the included studies to ensure that data had been extracted accurately. All team members involved in the extraction process also appeared as second reviewers and were assigned to check the extracted data by other team members to ensure data hygiene and minimize human error. Finally, the extracted data went through data curation.

3. Results and Discussion

3.1. Statistical Analysis

The number of research articles using ML to predict groundwater characteristics is growing after 2014 (Figure 2), with a spike in 2017. Out of 197 articles included for meta-analysis, 33 (16.75%) were published in 2017.

Included records were published in various journals, of which the Journal of Hydrology (10.66%) published the largest set of papers, followed by Water Resources Management (7.11%), and Environmental Earth Sciences (5.08%) (Figure S1 in the Supplementary Materials).

The systematic literature search showed that Iran (24%), India (18%), China (16%), and the United States (10%) had the highest number of articles, respectively (Figure 3). Iran as the leading country in the number of articles in this systematic review also deals with a state of water bankruptcy partly due to anthropogenic depletion of its aquifers (Noori et al., 2021). The list of countries with the highest number of articles also agrees well with the list of countries with the highest dependency on groundwater resources. According to [50], the top five nations with the largest estimated annual groundwater extractions in 2010 are India (251.00 km³/year), China (111.95 km³/year), the United States (111.70 km³/year), Pakistan (64.82 km³/year), and Iran (63.40 km³/year). It is worth mentioning that Iran, India, China, and the United States use 87%, 89%, 54%, 71% of their groundwater extraction for irrigation, respectively (Margat and Van der Gun, 2013). It should be noted that groundwater depletion due to overdraft for mainly irrigation purposes is reported as a worldwide problem. According to the findings of [51,52], Iran, India, China, and United states are among the countries with the most reliance on groundwater resources for food production and deal with the consequences of overdraft. Our findings reveal that the hotspots of groundwater consumption and depletion are the popular case studies for the application of ML in groundwater modeling and prediction. In total, the included articles in this study were from 28 countries (Figure 3). Moreover, our findings show that the countries with the highest number of articles are the countries suffering from groundwater stress (Figure S7 in the Supplementary Materials).

Most of the papers (56%) had a case study with an area less than 1000 km², followed by study areas between 1000 km² and 2000 km² (22%), and the remaining 23% had a case study with an area of more than 2000 km² (Figure S2 in the Supplementary Materials). Only 6% of the articles studied a confined aquifer, while 5% had a semi-confined aquifer and 89% had worked on an unconfined aquifer or did not mention the type of aquifer in their manuscript. Twenty-seven percent of the articles studied coastal aquifers and the remaining (73%) had a non-coastal aquifer as their case study (Figure S3 in the Supplementary Materials). Being prone to seawater intrusion, groundwater salinization is a common problem in coastal aquifers, particularly where excessive groundwater pumping induces a decrease in the piezometric head [53], and therefore, some of the reviewed studies had focused on predicting groundwater salinity in coastal aquifers [29,54].

As shown in Figure 4, a high percentage of the reviewed articles are from arid and semi-arid regions of the world, where surface water resources are generally scarce and highly unreliable [55]. Moving from arid to humid regions, the reliability of surface water resources increases and, as a result, the interest in studying groundwater resources decreases (Figure 4).

In total, 26 different ML methods were reported in the articles as tools to predict various characteristics of groundwater resources. Among them, ANN, SVM, and ANFIS were the most popular methods with 53%, 16%, and 10% of total records, while GEP, LR, and GP were applied much less (Figure 5).

The employed ANN models had different architectures, but FFNN was the most used (around 66% of records), followed by NARX with 11.3% of records (Figure S4 in the Supplementary Materials). Gradient descent (64.3%), LMA (19.5%), and PSO (5%) were the most used optimization algorithms for training ANN models (Figure S5 in the Supplementary Materials). Most of the papers that used gradient descent mentioned using back-propagation for calculating gradients for the weights of the network. Seventy-nine records used wavelet transformation along with ML models, where 54.4%, 13.9%, and 10.1% of them utilized ANN, ANFIS, SVM models, respectively (Figure S6 in the Supplementary Materials). According to the studies that used wavelet transformation, determining the appropriate decomposition level is an important step as it affects the ML models’ performance [56,57]. Moosavi et al. (2013) suggest considering the periodicity and seasonality of data series to determine the appropriate number of decomposition levels. In summary, our meta-analysis shows that FFNN with gradient descent as an optimization algorithm is the most employed ML model to predict characteristics of groundwater resources. Based on its wide use and acceptable performance, it can be inferred that this model structure is a suitable choice for the prediction of groundwater characteristics.

Sample division into training, validation, and test sets is one of the important factors in designing ML models. Although some researchers divided the data into only training-testing subsets, using three subsets as training, validation, and testing is generally preferable. In the latter scenario, the testing set is never used in the process of model building while the validation set helps with the fine-tuning of the model hyperparameters and even choosing the best model structure. This procedure eliminates the risk of over-fitting (i.e., where an ML model will “memorize” the features of the training input data instead of actual “learning”) and ends up with more reliable results where the ML model shows its generality to work well with new, unseen data.

Cross-validation is another model validation technique that uses a resampling procedure and is especially useful when the sample data are limited. In the cross-validation process, instead of a fixed test set, input data are divided into some “folds” and in each training step, one fold is held out as the test set and the model is trained with the remaining data. After training the model, its performance is measured on the unseen test set (i.e., the held-out fold). This process repeats k times, where k is the number of folds, and at the end, the average of k measures of performance is reported as the final measure of model fitness. According to our meta-analysis, 16.2% of the articles used cross-validation, while 12.4% of records used both cross-validation and sample division strategies. A 96.2% of the articles divided their dataset into subsets, while around 80% of these articles only had train-test subsets and 20% had three subsets division. From a data science point of view, this can be a weakness, especially if the models have been exposed to the validation data before the final model evaluation.

As shown in Figure 6, most of the articles have used 70–80% of the data as the training subset and the remaining as the test subset. Similarly, most of the articles having three subsets have used 60–70% of the data as the train set and divided the remaining into validation and test sets (Figure S8 in the Supplementary Materials).

The input data length, temporal resolution, and the number of categories are other important factors in ML modeling in general and particularly in hydrological studies. To train a reliable data-driven model in groundwater studies, the model needs to be fed with temporally inclusive input data to be able to predict variable geohydrological conditions and to learn the seasonality. As depicted in Figure 7, while most of the articles had lower than 8 input categories, a considerable portion had between 3 to 4 input categories. This might have two main reasons; first, in many case studies, many potential variables are poorly measured, and secondly, increasing the number of input variables would cause some unfavorable phenomenon in modeling such as the curse of dimensionality. Additionally, the use of fewer input variables to training ML models can imply the efficacy of these models in predicting groundwater characteristics. This is especially important in ungauged regions. The use of ML models in these regions can also be favorable from an economic point of view since these regions usually rely on agriculture, and an accurate estimation of, for example, the groundwater level using limited input data can assist with more cost-efficient irrigation scheduling.

As shown in Figure 8, the length of the input data time series was mostly up to around 12 years, and rarely more than 20 years, with very few studies having more than 40 years of input data to train the ML models.

The monthly temporal resolution was by far the most popular among the articles (around 65% of the records), followed by the daily resolution with 19.6% (Figure S9 in the Supplementary Materials). This could imply a higher availability of groundwater data in the monthly temporal resolution more than other resolutions. Furthermore, the monthly resolution might be more favorable for large-scale water managing stakeholders and policymakers.

Although our research question was not limited to any specific characteristic, we found that most of the research articles using ML algorithms in groundwater studies were focused on the prediction of the groundwater level (82.5%). The possible explanation for this large number might be related to denser measurements of the groundwater level compared to other variables in practice. Moreover, the groundwater level is a continuous variable that could be regionalized through various interpolation methods. In total, 17 groundwater characteristics were found in the reviewed articles to be predicted using ML, with a discharge or baseflow (6.1%), groundwater recharge (2.7%), and freshwater-saltwater interface level (2.5%) being the most popular ones after groundwater level (Table 1 and Figure S10 in the Supplementary Materials). Our analysis shows that the most adopted input variables for training ML models to predict the groundwater level were groundwater levels at earlier time steps (26.7%), precipitation (25.1%), temperature (13.6%), and evaporation or evapotranspiration (10.5%) (Figure S11 in the Supplementary Materials). Humidity or moisture (2.2%), river discharge (1.9%), surface runoff (1.8%), pumping data (1.7%), and river stage (1.6%) were other important input variables. Table S1 in the Supplementary Materials presents the percentage of the most employed input variables for other predicted characteristics.

Around 40% of the reports have used input variable selection techniques to determine what variables should be included in the ML model based on their importance. Cross-correlation analysis (36.2%), autocorrelation analysis (19.9%), and partial autocorrelation function (17.1%) were the most adopted techniques (Figure S12 in the Supplementary Materials). After training the ML model, 61.3% of the reviewed articles used their model to forecast future states of groundwater resources. Figure 9 shows the relative frequency of the forecast timespan.

Figure 10 presents the percentage of statistical indicators used to measure the accuracy of the ML model of the groundwater level. RMSE (27.4%), NSE (17.8%), the correlation coefficient (14.3%), coefficient of determination (13.7%), and MAE (9.4%) were the most popular measures of performance. RMSE is also the most adopted measure of performance for other predicted characteristics. RMSE indicates the absolute fit of the model to the data and is a suitable measure of performance with the same units as the predicted variable. On the other hand, the coefficient of determination (R²) is a relative measure and does not indicate the absolute precision of the model.

3.2. Meta-Analysis

As mentioned earlier, more than 82% of reviewed articles had used ML models to predict the groundwater level and only around 18% of articles were focused on other groundwater characteristics. As a result, our meta-analysis is mostly focused on groundwater level forecasting. We also presented the outcome of the meta-analysis for other characteristics, where possible. Here, we used violin plots that show the probability density of the data at different values using a rotated kernel density plot, which provides insights into the distribution of data and facilitates data analysis and exploration [58,59]. In all violin plots, the red dot shows the mean, while the box demonstrates the first, second and third quartiles, where the middle bar is the median. Figure 11 shows the results of the meta-analysis on the predictive capability of ML models for groundwater level prediction through various measures of performance.

The statistics of these violin plots are presented in Table S2 in the Supplementary Materials. As shown in Figure 11, meta-analysis confirms the ability of ML models to predict groundwater levels with high accuracy. Table S2 shows the number of reports for each violin plot. For instance, 546 records with an RMSE performance were used to construct the violin plot of RMSE in Figure 11 (mean RMSE of 0.52 m). It should be noted that different papers had various case studies with distinct groundwater levels, therefore, comparing RMSEs might lead to misleading results in some cases. In other words, the variation of the groundwater level in a shallow aquifer is inherently different from that of a deep aquifer. As shown in Figure 11, the results of R² presented from 270 records are promising.

Figure 12 illustrates the results for other characteristics that had enough records (more than 15) to conduct a meta-analysis (Table S3 in the Supplementary Materials). These violin plots show an acceptable accuracy of ML models to predict a variety of groundwater characteristics. Contrary to the groundwater level prediction (Figure 11), these results are from fewer records (Table S3 in the Supplementary Materials), therefore, general conclusions should be drawn with caution. What is obvious, however, is the potential of data-driven models to estimate miscellaneous groundwater characteristics accurately with a lower number of input data and easier model structures compared to physical models.

Along with a one-dimensional meta-analysis on the capability of the ML models to predict groundwater characteristics, we categorized the reviewed papers’ reports based on different criteria to cast light on the different aspects of data-driven modeling in groundwater studies. Figure 13 represents the results for different ML methods and ANN architectures with a threshold of 15 records in each category (also see Table S4 in the Supplementary Materials). Most employed ML methods (e.g., ANFIS, ANN, SVM) have a comparable and even similar performance according to reported statistical measures. However, ANN slightly outperforms other models in most cases. Generally, it can be inferred that the most influencing factor in the performance of ML models in groundwater studies is the quality and quantity of the input data and not the model. Comparing different ANN architectures, we see that NARX outperforms FFNN, but due to the much lower number of records for NARX, this finding is not conclusive, and more investigation is required.

Figure 14 contrasts the results for the type of the aquifer, whether the aquifer is coastal or not, whether cross-validation is used or not, and various schemes for sample division (Table S5 in the Supplementary Materials). As we can see in Figure 14, results from different aquifer types are comparable and no obvious trend can be found. Although the number of records is different for coastal and non-coastal aquifers, from Figure 14 we can infer that the model results for the coastal aquifers are slightly superior. Moreover, Figure 14 shows that in the case of sample division without cross-validation, models are working slightly better. This might be because in cross-validation the considered dataset is divided into different training and test sets multiple times, and the total performance of a model would be the average of all individual performances; however, in classical validation, there is only one training and one test set. Therefore, even one subset with a low performance would decrease the total performance in the cross-validation technique. There is no meaningful trend in the results for different sample division proportions.

Figure 15 shows the outcome of meta-analysis for input data’s temporal resolution, the input variable selection technique, and forecast for the future (Table S6 in the Supplementary Materials). The daily time series is marginally better than the monthly time series in terms of model accuracy. Studies that used input variable selection techniques had superior results to those without these techniques. It can be inferred that input variable selection is a useful step in setting up ML models to predict groundwater characteristics. According to Figure 15, there is no meaningful trend in the results comparing papers that do forecast for the future and papers that do not. Figures S13 and S14 in the Supplementary Materials depict the results of our meta-analysis for other categories and combinations.

4. Opportunities

Advances in ML and AI algorithms (e.g., boosting algorithms and deep learning) alongside exponential growth in the availability of computational resources (e.g., Google and Amazon cloud) provide unprecedented opportunities for breakthroughs in groundwater monitoring and forecasting (e.g., reliable forecast with longer lead times). Arguably, the most lucrative opportunity for future work lies in the flexibility of new algorithms to fuse data with widely different spatio-temporal resolutions from various remote sensors, ground observations, and numerical and physics-based models. The new algorithms also allow for the inclusion of physics into the traditionally black-box methods (e.g., physics-based AI) and quantify uncertainties (e.g., uncertainty-aware AI). Physics-based AI may resolve a longstanding issue that AI methods could not reliably predict/forecast states/outputs that are outside the bounds of observed/training data. Reliable AI/ML methods for the prediction of groundwater states should include a combination of initial states (e.g., groundwater level at the current time, snowpack, surface water availability, temperature, wind, cultivated area), sub-seasonal to seasonal forecasts from numerical models (e.g., from National Oceanic and Atmospheric Administration’s Global Forecast System), and large-scale climate signals (e.g., El Niño-Southern Oscillation). The skill of these variables to predict future groundwater states vary across regions and temporal lags, but our understanding of all these predictors is improving rapidly. Remote sensing, tele-stations, and citizen science are providing an unprecedented quantity and quality of surface observations. There, however, exists an opportunity for a significant scientific contribution through developing homogenized, quality-controlled, global products of an in situ observation of groundwater states. Numerical weather prediction models are transforming by the hour and their predictive skills are rapidly enhancing, but there remain great opportunities in this field to resolve microphysics and improve weather forecasts. Finally, new climate signals are being explored, and important advances in convolutional, geospatial, and memory-enabling ML models are being leveraged to explore the entire sea surface temperature (SST) domain to devise new teleconnections, which were not captured by traditional climate signals that mainly depended on differences in SST in specific zones. Anthropogenic factors (e.g., groundwater pumping and artificial recharge) can also be integrated into ML/AI models of groundwater. Finally, while still in its infancy, advances in Interferometric Synthetic Aperture Radar technology and data to estimate surface elevation changes, when merged with physics-based models of elastic and non-elastic ground deformation, can infer groundwater levels at unprecedented spatial (a few dozen meters) and temporal (a few weeks) scales.

5. Summary and Conclusions

In this paper, we posed the question of how accurately can ML methods model and predict groundwater resources’ characteristics? Questions of this nature require systematic review methodologies with explicit inclusion and exclusion criteria that are developed to identify and analyze the relevant literature. Here, by conducting a systematic literature search on the application of regression ML in groundwater resources studies, we found that:

Groundwater level modeling and forecasting is the most popular use of ML in the literature.
Groundwater level at the previous time step and precipitation were the most employed input variables to feed groundwater models.
Countries with more dependence on groundwater as a freshwater source produced the majority of studies on the application of ML in groundwater modeling.
Feed-forward ANN with gradient descent as the optimization algorithm is the most employed and effective ML model to predict quantitative characteristics of groundwater. This might be due to the simplicity of this architecture and according to the availability of models and codes.
A considerable portion of reports used only 3 to 4 input variables to train the ML models. The acceptable accuracy reported from these models can imply the capability of data-driven models to simulate the complicated nature of groundwater resources efficiently and effectively, even in the case of few input parameters.
The monthly scale is the most employed temporal resolution in time series and, generally, finer temporal resolutions result in higher accuracy.
Around 10–12 years of data are required to develop an acceptable ML model with monthly temporal resolution.
Input variable selection is a highly used technique to choose the most appropriate input variables to train the models, and studies that used these techniques outperformed those that did not.
A high portion of studies use their data-driven model to forecast the future states of groundwater resources.
RMSE is the most employed measure of performance between different studies and for various characteristics.
While different ML methods have a similar accuracy in predicting groundwater characteristics, ANN is slightly superior to other methods.
When using traditional sample division without cross-validation, models generally result in higher quantitative measures of performance. However, results of cross-validation are generally expected to be a more accurate estimate of the true performance of the model since cross-validation reduces the risk of overfitting and increases the model generality.

With the groundwater modeling literature expanding rapidly and interest in using ML tools in this area gaining higher momentum, meta-analyses, like our study, can help us grasp what we know, don’t know, and need to know. Systematic reviews and meta-analyses such as the present study can augment recent comprehensive reviews on the application of ML in groundwater studies (e.g., 28). Future systematic reviews and meta-analysis studies can focus on the application of ML models in other areas of water resources, such as streamflow modeling and forecasting, extreme hydro-meteorological events induced by climate change, and fine-tuning the estimation of evapotranspiration and soil moisture along with remote sensing datasets [60,61,62]. Moreover, since hydrological models always deal with inherent uncertainties and ambiguity of model structure, parameters, and input variables, systematic reviews can shed light on the state-of-the-art of uncertainty, reliability, and sensitivity analysis of hydrological models [63,64,65]. Although aggregating results from different studies, as done here, have some obvious shortcomings, doing so can shed light on the subject by generating comprehensive and multidimensional findings. The aggregation of results is a two-sided sword though, and since each original research article is specific in its methodology, representation, and interpretation of the results, researchers should be cautious in interpreting the results.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/w14060949/s1, Figure S1. Journals with high numbers of included records; Figure S2. Percentage of reviewed articles based on the area of their case study; Figure S3. Proportion of the reviewed articles’ case studies; Figure S4. Percentage of network architectures for the reports employing ANN as their machine learning method; Figure S5. Percentage of optimization algorithms for ANN methods; Figure S6. Proportion of the reports using wavelet transform with machine learning methods; Figure S7. The relationship between groundwater stress and the number of articles in different countries; Figure S8. Frequency of data division strategies for articles having three subsets; Figure S9. Percentage of articles based on their input data temporal resolution; Figure S10. Groundwater characteristics predicted using machine learning algorithms; Figure S11. Proportion of input variables to predict groundwater level; Figure S12. Percentage of most adopted input variable selection techniques; Figure S13. Results of meta-analysis for subcategories combination; Figure S14. Results of meta-analysis for subcategories combination; Table S1. Input variables for predicted characteristics; Table S2. Statistics of groundwater level’s violin plots depicted in Figure 11; Table S3. Statistics of other characteristics’ violin plots illustrated in Figure 12; Table S4. Statistics of the violin plots represented in Figure 13; Table S5. Statistics of the violin plots represented in Figure 14; Table S6. Statistics of the violin plots represented in Figure 15.

Author Contributions

Conceptualization: A.A., M.O., Z.H., M.E., A.Z., A.G. and A.D.; Methodology: A.A., M.O., Z.H., M.E., A.Z., A.G. and G.E.F.; Validation: A.A., M.O., Z.H., M.E., A.Z. and A.G.; Formal Analysis: M.E., A.G. and A.A.; Data Curation: M.O. and M.E.; Writing—Original Draft: A.A.; Writing—Review and Editing: Z.H., G.E.F., A.D., M.E., M.O. and M.S.; Visualization: M.O., A.A., M.E. and Z.H.; Supervision: A.A. and A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

AIC	Akaike information criterion
ANFIS	adaptive network-based fuzzy inference system
ANN	artificial neural network
CEBC	Center for Evidence-Based Conservation
FFNN	feed-forward neural networks
GEP	gene expression programming
GP	genetic programming
GA	genetic algorithm
LMA	Levenberg–Marquardt
LR	linear regression
MAE	mean absolute error
MAPE	mean absolute percentage error
MSE	mean squared error
ML	machine learning
MLR	multiple linear regression
NARX	nonlinear autoregressive network with exogenous inputs
NRMSE	normalized root mean square error
NSE	Nash–Sutcliffe efficiency
RF	random forest
RMAE	relative mean absolute error
RMSE	root mean square error
PSO	particle swarm optimization
SST	sea surface temperature
SVM	support vector machine
SWAT	soil and water assessment tool

References

McDonough, L.; Santos, I.R.; Andersen, M.S.; O’Carroll, D.; Rutlidge, H.; Meredith, K.; Oudone, P.; Bridgeman, J.; Gooddy, D.C.; Sorensen, J.P.R.; et al. Changes in global groundwater organic carbon driven by climate change and urbanization. Nat. Commun. 2020, 11, 1279. [Google Scholar] [CrossRef] [Green Version]
Famiglietti, J.S. The global groundwater crisis. Nat. Clim. Chang. 2014, 4, 945–948. [Google Scholar] [CrossRef]
Taylor, R.G.; Scanlon, B.; Döll, P.; Rodell, M.; Van Beek, R.; Wada, Y.; Longuevergne, L.; Leblanc, M.; Famiglietti, J.S.; Edmunds, M.; et al. Ground water and climate change. Nat. Clim. Chang. 2013, 3, 322–329. [Google Scholar] [CrossRef] [Green Version]
Siebert, S.; Burke, J.; Faures, J.M.; Frenken, K.; Hoogeveen, J.; Döll, P.; Portmann, F.T. Groundwater use for irrigation–a global inventory. Hydrol. Earth Syst. Sci. 2010, 14, 1863–1880. [Google Scholar] [CrossRef] [Green Version]
Alley, W.M.; Healy, R.W.; LaBaugh, J.W.; Reilly, T.E. Flow and storage in groundwater systems. Science 2002, 296, 1985–1990. [Google Scholar] [CrossRef] [Green Version]
Rodell, M.; Velicogna, I.; Famiglietti, J.S. Satellite-based estimates of groundwater depletion in In-dia. Nature 2009, 460, 999–1002. [Google Scholar] [CrossRef] [Green Version]
Zaki, N.A.; Haghighi, A.T.; Rossi, P.M.; Tourian, M.J.; Kløve, B. Monitoring Groundwater Storage Depletion Using Gravity Recovery and Climate Experiment (GRACE) Data in Bakhtegan Catchment, Iran. Water 2019, 11, 1456. [Google Scholar] [CrossRef] [Green Version]
Narasimhan, T.N.; Witherspoon, P.A. An integrated finite difference method for analyzing fluid flow in porous media. Water Resour. Res. 1976, 12, 57–64. [Google Scholar] [CrossRef] [Green Version]
Huyakorn, P.S.; Lester, B.H.; Faust, C.R. Finite element techniques for modeling groundwater flow in fractured aquifers. Water Resour. Res. 1983, 19, 1019–1035. [Google Scholar] [CrossRef]
Mosé, R.; Siegel, P.; Ackerer, P.; Chavent, G. Application of the mixed hybrid finite element ap-proximation in a groundwater flow model: Luxury or necessity? Water Resour. Res. 1994, 30, 3001–3012. [Google Scholar] [CrossRef]
Wang, H.F.; Anderson, M.P. Introduction to Groundwater Modeling: Finite Difference and Finite Element Methods; Academic Press: Cambridge, CA, USA, 1995. [Google Scholar]
Harbaugh, A.W.; Banta, E.R.; Hill, M.C.; McDonald, M.G. Modflow-2000, the U.S. geological survey modular ground-water model-user guide to modularization concepts and the ground-water flow process. Open-file Report. U.S. Geol. Surv. 2000, 92, 134. [Google Scholar]
McDonald, M.G.; Harbaugh, A.W.; Original authors of MODFLOW. The history of MOD-FLOW. Groundwater 2003, 41, 280–283. [Google Scholar] [CrossRef] [PubMed]
Singh, A. Groundwater resources management through the applications of simulation modeling: A re-view. Sci. Total Environ. 2014, 499, 414–423. [Google Scholar] [CrossRef] [PubMed]
Jalalkamali, A.; Jalalkamali, N. Groundwater modeling using hybrid of artificial neural network with genetic algorithm. Afr. J. Agric. Res. 2011, 6, 5775–5784. [Google Scholar] [CrossRef]
Fallah-Mehdipour, E.; Haddad, O.B.; Mariño, M.A. Prediction and simulation of monthly ground-water levels by genetic programming. J. Hydro. Environ. Res. 2013, 7, 253–260. [Google Scholar] [CrossRef]
Sivapragasam, C.; Kannabiran, K.; Karthik, G.; Raja, S. Assessing Suitability of GP Modeling for Groundwater Level. Aquat. Procedia 2015, 4, 693–699. [Google Scholar] [CrossRef]
Nayak, P.C.; Rao, Y.S.; Sudheer, K.P. Groundwater level forecasting in a shallow aquifer using artificial neural network approach. Water Resour. Manag. 2006, 20, 77–90. [Google Scholar] [CrossRef]
Dash, N.B.; Panda, S.N.; Remesan, R.; Sahoo, N. Hybrid neural modeling for groundwater level prediction. Neural Comput. Appl. 2010, 19, 1251–1263. [Google Scholar] [CrossRef]
Seyam, M.; Mogheir, Y. Application of Artificial Neural Networks Model as Analytical Tool for Groundwater Salinity. J. Environ. Prot. 2011, 2, 56–71. [Google Scholar] [CrossRef]
Yoon, H.; Jun, S.-C.; Hyun, Y.; Bae, G.-O.; Lee, K.-K. A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer. J. Hydrol. 2011, 396, 128–138. [Google Scholar] [CrossRef]
Shiri, J.; Kisi, O. Comparison of genetic programming with neuro-fuzzy systems for predicting short-term water table depth fluctuations. Comput. Geosci. 2011, 37, 1692–1701. [Google Scholar] [CrossRef]
Moosavi, V.; Vafakhah, M.; Shirmohammadi, B.; Behnia, N. A Wavelet-ANFIS Hybrid Model for Groundwater Level Forecasting for Different Prediction Periods. Water Resour. Manag. 2013, 27, 1301–1321. [Google Scholar] [CrossRef]
Sahoo, S.; Jha, M.K. Groundwater-level prediction using multiple linear regression and artificial neural network techniques: A comparative assessment. Appl. Hydrogeol. 2013, 21, 1865–1887. [Google Scholar] [CrossRef]
Shiri, J.; Kisi, O.; Yoon, H.; Lee, K.-K.; Nazemi, A.H. Predicting groundwater level fluctuations with meteorological effect implications—A comparative study among soft computing techniques. Comput. Geosci. 2013, 56, 32–44. [Google Scholar] [CrossRef]
Nourani, V.; Alami, M.T.; Vousoughi, F.D. Hybrid of SOM-Clustering Method and Wavelet-ANFIS Approach to Model and Infill Missing Groundwater Level Data. J. Hydrol. Eng. 2016, 21, 5016018. [Google Scholar] [CrossRef]
Azari, A.; Zeynoddin, M.; Ebtehaj, I.; Sattar, A.; Gharabaghi, B.; Bonakdari, H. Integrated prepro-cessing techniques with linear stochastic approaches in groundwater level forecasting. Acta Geophys. 2021, 69, 1395–1411. [Google Scholar] [CrossRef]
Osman, A.I.A.; Ahmed, A.N.; Huang, Y.F.; Kumar, P.; Birima, A.H.; Sherif, M.; Sefelnasr, A.; Ebraheemand, A.A.; El-Shafie, A. Past, Present and Perspective Methodology for Groundwater Modeling-Based Machine Learning Approaches. Arch. Comput. Methods Eng. 2022, 1–17. [Google Scholar] [CrossRef]
Banerjee, P.; Singh, V.S.; Chatttopadhyay, K.; Chandra, P.; Singh, B. Artificial neural network model as a potential alternative for groundwater salinity forecasting. J. Hydrol. 2011, 398, 212–220. [Google Scholar] [CrossRef]
Taormina, R.; Chau, K.W.; Sethi, R. Artificial neural network simulation of hourly groundwater lev-els in a coastal aquifer system of the Venice lagoon. Eng. Appl. Artif. Intell. 2012, 25, 1670–1676. [Google Scholar] [CrossRef] [Green Version]
Hosseini, F.S.; Malekian, A.; Choubin, B.; Rahmati, O.; Cipullo, S.; Coulon, F.; Pradhan, B. A novel machine learning-based approach for the risk assessment of nitrate groundwater contamination. Sci. Total Environ. 2018, 644, 954–962. [Google Scholar] [CrossRef] [Green Version]
Barzegar, R.; Moghaddam, A.A.; Deo, R.; Fijani, E.; Tziritis, E. Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms. Sci. Total Environ. 2018, 621, 697–712. [Google Scholar] [CrossRef] [PubMed]
Chen, C.; He, W.; Zhou, H.; Xue, Y.; Zhu, M. A comparative study among machine learning and numerical models for simulating groundwater dynamics in the Heihe River Basin, northwestern China. Sci. Rep. 2020, 10, 1–13. [Google Scholar] [CrossRef] [Green Version]
Moghaddam, D.D.; Rahmati, O.; Panahi, M.; Tiefenbacher, J.; Darabi, H.; Haghizadeh, A.; Haghighi, A.T.; Nalivan, O.A.; Tien Bui, D. The effect of sample size on different machine learning models for groundwater potential mapping in mountain bedrock aquifers. CATENA 2020, 187, 104421. [Google Scholar] [CrossRef]
Rahman, A.S.; Hosono, T.; Quilty, J.M.; Das, J.; Basak, A. Multiscale groundwater level forecasting: Coupling new machine learning approaches with wavelet transforms. Adv. Water Resour. 2020, 141, 103595. [Google Scholar] [CrossRef]
Mosavi, A.; Hosseini, F.S.; Choubin, B.; Abdolshahnejad, M.; Gharechaee, H.; Lahijanzadeh, A.; Dineva, A.A. Susceptibility prediction of groundwater hardness using ensemble machine learning models. Water 2020, 12, 2770. [Google Scholar] [CrossRef]
Hussein, E.A.; Thron, C.; Ghaziasgar, M.; Bagula, A.; Vaccari, M. Groundwater Prediction Using Machine-Learning Tools. Algorithms 2020, 13, 300. [Google Scholar] [CrossRef]
Farzin, M.; Avand, M.; Ahmadzadeh, H.; Zelenakova, M.; Tiefenbacher, J.P. Assessment of Ensemble Models for Groundwater Potential Modeling and Prediction in a Karst Watershed. Water 2021, 13, 2540. [Google Scholar] [CrossRef]
Refsgaard, J.C.; Christensen, S.; Sonnenborg, O.T.; Seifert, D.; Højberg, A.L.; Troldborg, L. Review of strategies for handling geological uncertainty in groundwater flow and transport modeling. Adv. Water Resour. 2012, 36, 36–50. [Google Scholar] [CrossRef]
Sahoo, S.; Russo, T.A.; Elliott, J.; Foster, I. Machine learning algorithms for modeling groundwater level changes in agricultural regions of the U.S. Water Resour. Res. 2017, 53, 3878–3895. [Google Scholar] [CrossRef]
Adamowski, J.; Chan, H.F. A wavelet neural network conjunction model for groundwater level forecasting. J. Hydrol. 2011, 407, 28–40. [Google Scholar] [CrossRef]
Gholami, V.; Chau, K.W.; Fadaee, F.; Torkaman, J.; Ghaffari, A. Modeling of groundwater level fluctuations using dendrochronology in alluvial aquifers. J. Hydrol. 2015, 529, 1060–1069. [Google Scholar] [CrossRef]
Zhang, J.; Zhu, Y.; Zhang, X.; Ye, M.; Yang, J. Developing a Long Short-Term Memory (LSTM) based model for predicting water table depth in agricultural areas. J. Hydrol. 2018, 561, 918–929. [Google Scholar] [CrossRef]
Raghavendra, N.S.; Deka, P.C. Support vector machine applications in the field of hydrology: A review. Appl. Soft Comput. 2014, 19, 372–386. [Google Scholar] [CrossRef]
Shen, C. A transdisciplinary review of deep learning research and its relevance for water resources sci-entists. Water Resour. Res. 2018, 54, 8558–8593. [Google Scholar] [CrossRef]
Sit, M.A.; Demiray, B.Z.; Xiang, Z.; Ewing, G.; Sermet, Y.; Demir, I. A comprehensive review of deep learning applications in hydrology and water resources. Water Sci. Technol. 2020, 82, 2635–2670. [Google Scholar] [CrossRef] [PubMed]
Zounemat-Kermani, M.; Matta, E.; Cominola, A.; Xia, X.; Zhang, Q.; Liang, Q.; Hinkelmann, R. Neurocomputing in surface water hydrology and hydraulics: A review of two decades retrospective, current status and future prospects. J. Hydrol. 2020, 588, 125085. [Google Scholar] [CrossRef]
Garg, A.X.; Hackam, D.; Tonelli, M. Systematic review and meta-analysis: When one study is just not enough. Clin. J. Am. Soc. Nephrol. 2008, 3, 253–260. [Google Scholar] [CrossRef]
Collaboration for Environmental Evidence. Guidelines and Standards for Evidence Synthesis in Environmental Management; Version 5.0; Pullin, A.S., Frampton, G.K., Livoreil, B., Petrokofsky, G., Eds.; Collaboration for Environmental Evidence: Johannesburg, South Africa, 2018; Available online: https://environmentalevidence.org/ (accessed on 3 January 2021).
Margat, J.; Van der Gun, J. Groundwater around the World: A Geographic Synopsis; CRC Press: Boca Raton, FL, USA, 2013. [Google Scholar]
Dalin, C.; Wada, Y.; Kastner, T.; Puma, Y.W.M.J. Groundwater depletion embedded in international food trade. Nature 2017, 543, 700–704. [Google Scholar] [CrossRef] [Green Version]
Döll, P.; Mueller Schmied, H.; Schuh, C.; Portmann, F.T.; Eicker, A. Global-scale assessment of groundwater depletion and related groundwater abstractions: Combining hydrological modeling with information from well observations and GRACE satellites. Water Resour. Res. 2014, 50, 5698–5720. [Google Scholar] [CrossRef]
Sahour, H.; Gholami, V.; Vazifedan, M. A comparative analysis of statistical and machine learning techniques for mapping the spatial distribution of groundwater salinity in a coastal aquifer. J. Hydrol. 2020, 591, 125321. [Google Scholar] [CrossRef]
Alagha, J.S.; Seyam, M.; Said, M.A.M.; Mogheir, Y. Integrating an artificial intelligence approach with k-means clustering to model groundwater salinity: The case of Gaza coastal aquifer (Pales-tine). Hydrogeol. J. 2017, 25, 2347–2361. [Google Scholar] [CrossRef]
Scanlon, B.R.; Keese, K.E.; Flint, A.L.; Flint, L.E.; Gaye, C.B.; Edmunds, W.M.; Simmers, I. Global synthesis of groundwater recharge in semiarid and arid regions. Hydrol. Process. 2006, 20, 3335–3370. [Google Scholar] [CrossRef]
Suryanarayana, C.; Sudheer, C.; Mahammood, V.; Panigrahi, B. An integrated wavelet-support vector machine for groundwater level prediction in Visakhapatnam, India. Neurocomputing 2014, 145, 324–335. [Google Scholar] [CrossRef]
Ebrahimi, H.; Rajaee, T. Simulation of groundwater level variations using wavelet combined with neural network, linear regression and support vector machine. Glob. Planet. Chang. 2017, 148, 181–191. [Google Scholar] [CrossRef]
Hintze, J.L.; Nelson, R.D. Violin plots: A box plot-density trace synergism. Am. Stat. 1998, 52, 181–184. [Google Scholar]
Ahmadi, A.; Emami, M.; Daccache, A.; He, L. Soil Properties Prediction for Precision Agriculture Using Visible and Near-Infrared Spectroscopy: A Systematic Review and Meta-Analysis. Agronomy 2021, 11, 433. [Google Scholar] [CrossRef]
Pan, S.; Pan, N.; Tian, H.; Friedlingstein, P.; Sitch, S.; Shi, H.; Arora, V.K.; Haverd, V.; Jain, A.K.; Kato, E.; et al. Evaluation of global terrestrial evapotranspiration using state-of-the-art approaches in re-mote sensing, machine learning and land surface modeling. Hydrol. Earth Syst. Sci. 2020, 24, 1485–1509. [Google Scholar] [CrossRef] [Green Version]
AghaKouchak, A.; Chiang, F.; Huning, L.S.; Love, C.A.; Mallakpour, I.; Mazdiyasni, O.; Moftakhari, H.; Papalexiou, S.M.; Ragno, E.; Sadegh, M. Climate Extremes and Compound Hazards in a Warming World. Annu. Rev. Earth Planet. Sci. 2020, 48, 519–548. [Google Scholar] [CrossRef] [Green Version]
Mokhtari, A.; Ahmadi, A.; Daccache, A.; Drechsler, K. Actual Evapotranspiration from UAV Images: A Multi-Sensor Data Fusion Approach. Remote Sens. 2021, 13, 2315. [Google Scholar] [CrossRef]
Ahmadi, A.; Nasseri, M.; Solomatine, D.P. Parametric uncertainty assessment of hydrological models: Coupling UNEEC-P and a fuzzy general regression neural network. Hydrol. Sci. J. 2019, 64, 1080–1094. [Google Scholar] [CrossRef]
Ahmadi, A.; Nasseri, M. Do direct and inverse uncertainty assessment methods present the same results? J. Hydroinformatics 2020, 22, 842–855. [Google Scholar] [CrossRef]
Saberi-Movahed, F.; Najafzadeh, M.; Mehrpooya, A. Receiving more accurate predictions for longi-tudinal dispersion coefficients in water pipelines: Training group method of data handling using extreme learning machine conceptions. Water Resour. Manag. 2020, 34, 529–561. [Google Scholar] [CrossRef]

Figure 1. Flow diagram of the systematic review.

Figure 2. Number of research records included in the systematic review based on their date of publication.

Figure 3. Pie chart of the included research articles based on the country of origin.

Figure 4. Reviewed articles’ proportion according to the average annual precipitation of their case studies.

Figure 5. The proportion of the reports according to the ML method that they have employed.

Figure 6. The proportion of articles dividing the data into two training-testing subsets.

Figure 7. The proportion of articles according to their number of inputs.

Figure 8. Percentage of the reviewed articles according to the length of the input data time series.

Figure 9. Percentage of reports according to their forecast periods.

Figure 10. The proportion of employed quantitative measures of performance.

Figure 11. Quantitative measures of performance for ML models predicting groundwater levels.

Figure 12. Results of meta-analysis for various groundwater characteristics.

Figure 13. Results of meta-analysis for ML models and ANN architectures to predict groundwater level.

Figure 14. Meta-analysis results according to various subcategories in the reviewed reports.

Figure 15. Meta-analysis results for three subcategories in the reviewed reports for groundwater level prediction.

Table 1. Groundwater characteristics predicted by ML models in the reviewed articles.

Predicted Variable	Percentage of Reports
Groundwater level	82.5%
Discharge	6.1%
Groundwater recharge	2.7%
Freshwater–saltwater interface level	2.5%
Salinity	1.3%
Groundwater level fluctuation	1.4%
Total dissolved solids	0.6%
Electrical conductivity	0.6%
Aquifer loss coefficient	0.5%
Fluoride	0.5%
Sodium adsorption ratio	0.4%
Nitrate nitrogen (NO₃-N)	0.2%
Contamination level	0.2%
Sulfate (SO₄)	0.2%
Hydraulic head change	0.1%
Dissolved oxygen	0.1%
Groundwater storage variation	0.1%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ahmadi, A.; Olyaei, M.; Heydari, Z.; Emami, M.; Zeynolabedin, A.; Ghomlaghi, A.; Daccache, A.; Fogg, G.E.; Sadegh, M. Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis. Water 2022, 14, 949. https://doi.org/10.3390/w14060949

AMA Style

Ahmadi A, Olyaei M, Heydari Z, Emami M, Zeynolabedin A, Ghomlaghi A, Daccache A, Fogg GE, Sadegh M. Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis. Water. 2022; 14(6):949. https://doi.org/10.3390/w14060949

Chicago/Turabian Style

Ahmadi, Arman, Mohammadali Olyaei, Zahra Heydari, Mohammad Emami, Amin Zeynolabedin, Arash Ghomlaghi, Andre Daccache, Graham E. Fogg, and Mojtaba Sadegh. 2022. "Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis" Water 14, no. 6: 949. https://doi.org/10.3390/w14060949

APA Style

Ahmadi, A., Olyaei, M., Heydari, Z., Emami, M., Zeynolabedin, A., Ghomlaghi, A., Daccache, A., Fogg, G. E., & Sadegh, M. (2022). Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis. Water, 14(6), 949. https://doi.org/10.3390/w14060949

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Groundwater Level Modeling with Machine Learning: A Systematic Review and Meta-Analysis

Abstract

1. Introduction

2. Methodology

3. Results and Discussion

3.1. Statistical Analysis

3.2. Meta-Analysis

4. Opportunities

5. Summary and Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI