1. Introduction
Petroleum fractions consist of tens of thousands of molecular species that contain carbon, hydrogen, sulfur, nitrogen, oxygen and metals. Due to the diverse origins of crude oils and refining technologies, the compositions of material streams in the refining processes vary widely. In recent years, with increasingly strict environmental regulations for fossil fuels as well as more and more heavy crude oils, it is urgent to improve refining technologies [
1,
2]. As the foundation for the study of refining technologies, the characterization of petroleum fractions and the construction of conversion mechanisms at the molecular level have become the most important issues in refineries [
3,
4].
Due to the limitation of analytical chemistry technologies, computer-aided molecular reconstruction is currently the most popular method to reflect the complicated compositions of petroleum with limited properties [
4]. Since the early 1990s when the first molecular reconstruction method was reported, a lot of molecular reconstruction methods have been proposed and applied in the molecular modeling of refining processes [
5,
6,
7,
8], including the stochastic reconstruction method (SR), structural-oriented lumping (SOL), molecular type homologous series (MTHS) method and SR-reconstruction by entropy maximization (SR-REM). Among them, the SR method is the most popular method in the characterization of heavy petroleum fractions.
Klein et al. [
9,
10] first introduced the SR method to characterize heavy residue feedstocks. In the SR method, molecules are treated as the collection of structural attributes (SA), and a probability density function (PDF) is imposed on each structural attribute. The type of PDF varies for different structural attributes, such as the histogram distribution for the determination of molecular families and the gamma distribution for ring numbers and the length of chains. Monte Carlo sampling with a quadrature method is applied to generate an equimolar set of molecules from PDFs. The parameters for each PDF were adjusted in an optimization loop for simulated annealing or genetic algorithm to make the bulk properties of generated mixtures close to those of the actual samples.
Many studies that use the SR method have been reported. Petti et al. examined [
11] the usage of CPU resources in the SR method and suggested that a sample size of 10,000 molecules could balance the simulation accuracy and computation expense. Zhang et al. [
12] extended a novel SR model to heavy vacuum residue fractions. The residue molecules were treated as a combination of approximately 600 building substructures. Deniz et al. [
13] introduced a new structure parameter set for detailed ring and chain configurations into the SR method to improve the method performance in heavy petroleum fractions. Moreover, Deniz et al. investigated [
14] the effects of methods for estimating the boiling point temperature and density of pure compounds on the simulating accuracy of the SR method. It is observed that the SR method has the highest accuracy with the group contribution method by Gani [
15,
16,
17,
18] and the Yen-woods equations [
19]. Haktanlr et al. [
20] proposed a novel SR method based on a custom predefined molecular library. This novel method focuses on characterizing the petroleum fractions with exhausted molecular species. Meanwhile, a sieving mechanism is introduced to make sure that the generated molecules are reasonable in structure. Glazov et al. studied [
21] the relationships between different PDFs of structural attributes and bulk properties.
The general expression of the SR method is shown as below:
where
stands for the parameters to be optimized,
are series of uniform random numbers between 0 and 1.
xlow and
xupper are the lower and upper boundaries of
. The function
f stands for the gap between the experimental values of bulk properties and the predicted values. The optimal parameters that make the predicted values of bulk properties close enough to the experimental values are obtained by effective algorithm optimization.
In the SR method, the input parameters are the parameters of PDFs. The structural attributes generally contain the type of molecule, the number of naphthenic rings and aromatic rings, the length of paraffin chains and sidechains, the type of heteroatom-containing molecule, etc. The types of PDF used to represent them are histogram distribution and gamma distribution. The parameters in histogram distribution are real numbers between 0 and 1. If more than one parameter exists in a histogram distribution, they should increase progressively. Given the total number of parameters in the model and the simulating accuracy, the two-parameter gamma distribution is generally adopted in the SR method. The two parameters are the shape parameter (SP) and scale parameter, or the shape parameter and mean. To characterize the complicated and diverse compositions in petroleum fractions, it is crucial to reasonably configure the lower and upper boundaries of the parameters, which is the domain of parameters in the SR method. However, as far as the authors know, no study has been reported on this issue. To this point, this paper aims to study the determination of domain and its effects on the performance of the SR method.
Section 2 illustrates the SR model based on gas oils. The bulk properties by experiments, the setting of structural attributes and the building diagram are provided in
Section 2.1.
Section 2.2 provides the configuration of the lower and upper boundaries for the parameters in the histogram distributions and gamma distributions.
Section 3 provides the results and discussions. Finally, the conclusion is given in
Section 4.
4. Conclusions
Based on gas oil samples, this work studies the performance of the SR model in different domains of parameters. In the setting of domains, multiple cases for the upper boundaries of shape parameters in gamma distributions are considered. The results show that in each case, the SR model can generate pseudo mixture, the bulk properties of which agree well with the experimental values. The bulk properties adopted are elemental analysis, density, H/C ratio, aromatic sulfur content, PINA analysis and simulated distillation. By investigating the variations in average objective function value in each case, it is observed that even though the decrease of average objective function value in smaller domains is faster at the beginning of the simulation, the SR model can find parameters in wider domains that make the average objective function value smaller. Case 3 is the best case where the simulating accuracy and convergence performance are maintained compared with other cases. The prediction of the SR model on compositions of gas oil sample is not acceptable, despite the estimated values of bulk properties being very close to the experimental values. Large derivations are observed in all cases.
The detailed group-type analysis contributes to more accurate predictions on compositions. The predicted distributions in normal paraffins and isoparaffins become higher as the upper boundaries of shape parameters increase. In cases 3 and 4, the predicted distributions fit best with the experimental distributions, compared to the other cases. As for naphthenes and aromatics, the predicted distributions agree well with experimental values in all cases. Because case 3 performed best in maintaining simulating accuracy and convergence rate, the best upper boundary of shape parameters is 20, compared to other values in this work.
The domain of parameters in probability functions has a great influence on the performance of the SR method. Due to the differences in bulk properties and compositions of petroleum fractions, reasonably setting the domain of parameters is indispensable for the accurate reconstruction of petroleum fractions.