1. Introduction
The natural gas produced from the subterranean gas fields and subsequently transported through pipelines should meet certain specifications such as environmental and safety standards as well as those of sale gas sectors. The products destined for sale should be free of undesirable contaminants, e.g., carbon dioxide (CO
2) and hydrogen sulfide (H
2S), which are both toxic and unfriendly from an environmental point of view. For instance, CO
2 is considered the main contributor to global warming and climate change. Several treatments are employed to remove acidic gases from natural gas. The most famous are the alkanolamine-based treatment operations [
1,
2,
3]. This technique was firstly introduced for carbon dioxide removal in 1991 [
4,
5]. The used alkanolamines are organic compounds, such as monoethanolamine (MEA), diethanolamine (DEA), and triethanolamine (TEA). By far, MEA was the most preferred alkanolamine compared to DEA and TEA because of its reactivity, low molecular weight, and lower required circulation to maintain a given amine to acid-gas mole ratio.
The gas–liquid absorption in amine-based solvents is an efficient process in gas sweetening. Nevertheless, some imperfections have been observed, such as the creation of corrosive byproducts due to the amine degradation, water transfer to the gas stream during the desorption stage, and loss of the feedstock (amine), making the treatment operations expensive [
6,
7,
8,
9,
10]. As an alternative, a new class of non-aqueous and environmentally friendly innovating fluids, known as ionic liquids (ILs), has emerged. ILs have many industrial applications such as catalysis for clean technology [
11] and the removal of contaminants from refinery feedstock [
12]. Furthermore, Ion Engineering Company is intended to use the know-how of ionic liquids for industrial-scale sweetening of natural gas and flue gas CO
2 separation [
13,
14], as stated by Hasib-ur-Rahman et al. [
15].
Ionic liquids are molten salts, which are liquid (non-volatile) at room temperature. They are comprised exclusively of positively and negatively charged ions. Due to their bulky and asymmetrical cation structure, ILs have a low affinity to constitute crystals [
16]. Manipulation of the cation and/or anion allows designing ILs adaptable to any particular application requirements [
17]. Moreover, ILs are a perfect medium for acid gas solubilization over wide ranges of temperature and pressure. Thus, great attention was paid to evaluate the performance of ILs as a gas-cleaning agent in gas refinery plants [
10]. The only and the most discussed disadvantage related to the use of ILs is their high viscosity, but it can be bypassed as the viscosity can be regulated over a reasonable range of about <50 cP to >10,000 cP by selecting an adequate mixture of cation and anion [
15].
In the past few years, many experimental studies were conducted to estimate the solubility of acid gases in ILs [
1,
6,
8,
10,
18,
19], especially the carbon dioxide solubility [
20,
21,
22,
23,
24]. The obtained results confirmed the ILs to be very efficient in carbon dioxide removal. Unfortunately, the experimental studies require many laboratory tests, which are expensive, difficult, tedious, and time-consuming. As an alternative strategy, the solubility of acid gases in ILs has been modeled using thermodynamical laws and the equation of state (EoS). The thermodynamic laws used for modeling the solubility of acid gases in ILs can be divided into four groups that include cubic equations, quantum mechanics-based methods, activity coefficient methods, and statistical mechanics-based molecular approaches [
25]. The most used models are the Peng–Robinson equation of state (EoS), the generic Van der Waals EoS, the generalized Redlich–Kwong cubic EoS, the law and extended law of Henry, and the equation of Krichevsky–Kasarnovsky [
8,
26,
27,
28,
29,
30,
31,
32]. It was noticed that the models describe well the systems at low and moderate pressures [
33], however, the equations of state suffer from many weaknesses. The equation of state can be reliable only for an individual system and not for more interestingly, multiple systems. They require various adjustable parameters, which should be optimized based on real data within a particular and limited range of thermodynamic conditions. Consequently, developing more general and powerful models to predict the solubility of acid gases, especially carbon dioxide, in ILs is of paramount importance.
Recently, many soft computing methods have been applied to model gas solubility and phase equilibrium. One of these methods is the artificial neural network (ANN), which represents an important embranchment computational intelligence method that can be used without any pre-assumption of the input–output relationship [
34,
35]. Multilayer perceptron (MLP), radial basis function network (RBF), multi-layer feed-forward network, and gene expression programming (GEP) are the general categories of ANN. Fuzzy logic (FL) is also one of the computational intelligence methods, which can model complicated nonlinear relations [
36]. Likewise, the support vector machine (SVM) proposed by Vapnik was shown to be a very performant smart model [
37]. Several researchers have used these smart models in the petroleum industry and to predict the solubility of acid gases in ILs. Baghban et al. [
36] have predicted CO
2 solubility in the presence of various ILs using MLP and Adaptive Neuro-Fuzzy Inference System (ANFIS). Amedi et al. [
38] have used MLP, RBF, and ANFIS to predict H
2S solubility in the presence of various ILs. Otherwise, in 2017, Rostami et al. [
39] have applied the GEP method to model CO
2 solubility in crude oil during carbon dioxide enhanced oil recovery.
The current work aims at developing highly robust and easy-to-use machine learning models that can be applied for forecasting the solubility of CO
2 in 13 different ionic liquids at different temperature and pressure conditions. Two rigorous connectionist techniques, namely multilayer perceptron (MLP), and gene expression programming (GEP) are applied on a set of experimental data that was gathered from different literature sources [
40,
41,
42,
43,
44]. The MLP method was optimized using either Levenberg–Marquardt (LMA) or Bayesian Regularization (BR) techniques. The results obtained using three methods (MLP-LMA, MLP-BR, GEP) are then compared with results calculated using thermodynamic models based on Peng–Robinson (PR) and Soave–Redlich–Kwong (SRK) equation of states. Statistical indicators, including the determination coefficient (R
2) and Root Mean Square Error (RMSE), are used to evaluate the accuracy of the methods, in addition to graphical assessments using cross plots and bar plots. In the end, outliers detection is performed to test and analyze the validity of the best-developed model and quantify the doubtful experimental points from the database. It is worth noting that the two backpropagation-based learning algorithms (LMA and BR) that were employed in the training process of MLP, alongside the explicit correlations established to predict the CO
2 solubility in ILs, make the current work different from previously published works in the literature.
This paper is constructed as follows;
Section 2 depicts the data used in the study and the input and output parameters in the models.
Section 3 describes in detail the rigorous connectionist models and optimization techniques. An overview of the PR and SRK equations of state is presented in
Section 4. In
Section 5, the results are presented and discussed, and in
Section 6, the conclusions of the study are summarized.
4. Equations of State for Modeling the Solubility of Acid Gases
Many equations of state have been used by researchers to model the solubility of acid gases in ILs. Two of the most widely used EoS are the Soave–Redlich–Kwong (SRK) and the Peng–Robinson (PR) EoS, which are defined with Equations (10) and (15), respectively [
35]. The results of these two EoS are used to contrast those obtained by using the proposed models.
Noting that the calculation of solubility using EoS is related to the calculation of the mole fraction, the equations below express how to determine the mole fraction.
Soave–Redlich–Kwong (SRK) EoS:
where
T,
P,
v, and
R indicate the temperature, pressure, molar volume, and gas constant respectively, and
a and
b represent the EoS variables.
For computing the variables, a and b in the case of mixtures, the classic van der Waals one-fluid rules of mixing is used [
58]:
with
and
denote the mole fractions of components
i and
j, and
N indicates the number of the mixture’s components. The parameter
is a parameter of binary interaction, that enlarges the molecular interactions between molecules
i and
j. For pure materials, the parameters can be computed as follows:
with
, denotes the reduced temperature. We note that the above description is valid for a system of arbitrarily many components. In our system, it was assumed that we had two components where CO
2 was one and the IL the other.
Peng–Robinson (PR) EoS:
with:
For mixtures, the parameters a and b are determined using the same formulas as in SRK EoS. The parameter of binary interaction (
) is the only adjustable parameter for both EoS, SRK, and PR. This parameter is determined using a genetic algorithm (GA) with the next objective function [
58]:
with
and
indicate the experimental and EoS-calculated mole fractions of the solute respectively.
5. Computational Procedure
For the training phase, the mean square error (MSE) was used as the assessment criterion, which is defined mathematically as follows:
where
stands for mole fraction of CO
2,
and
indicate the experimental and the predicted values, respectively, and
represents the number of samples. The model tuning parameters giving the lowest MSE on the training set were considered the choice for the trained model.
For the modeling task using MLP, the data points were normalized between −1 and 1. To select appropriate topologies for the MLP approach, trial and error were used. The obtained models were designated MLP-LMA, and MLP-BR, respectively, and both included 3 hidden layers with 11, 11, and 9 neurons, respectively. The suitable activation functions in all the hidden layers and for the output layer were Tansig and Pureline, respectively.