Next Article in Journal
Chemical and Enzymatic Approaches to Carbohydrate-Derived Spiroketals: Di-D-Fructose Dianhydrides (DFAs)
Previous Article in Journal
Composition and Applications of Aloe vera Leaf Gel
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Statistical Assessment of Solvent Mixture Models Used for Separation of Biological Active Compounds

by
Sorana D. Bolboacă
1,2,*,
Elena M. Pică
2,
Claudia V. Cimpoiu
3 and
Lorentz Jäntschi
2
1
“Iuliu Haţieganu” University of Medicine and Pharmacy Cluj-Napoca, Department of Medical Informatics and Biostatistics, 6 Louis Pasteur, 400349 Cluj-Napoca, Romania.
2
Technical University of Cluj-Napoca, 103-105 Muncii Bvd, 400641 Cluj-Napoca, Romania
3
“Babeş-Bolyai” University, Department of Analytical Chemistry, 11 Arany Janos, 400028 Cluj-Napoca, Romania
*
Author to whom correspondence should be addressed.
Molecules 2008, 13(8), 1617-1639; https://doi.org/10.3390/molecules13081617
Submission received: 23 May 2008 / Revised: 7 August 2008 / Accepted: 7 August 2008 / Published: 11 August 2008

Abstract

:
Two mathematical models with seven and six parameters have been created for use as methods for identification of the optimum mobile phase in chromatographic separations. A series of chromatographic response functions were proposed and implemented in order to assess and validate the models. The assessment was performed on a set of androstane isomers. Pearson, Spearman, Kendall tau-a,b,c and Goodman-Kruskal correlation coefficients were used in order to identify and to quantify the link and its nature (quantitative, categorical, semi-quantitative, both quantitative and categorical) between experimental values and the values estimated by the mathematical models. The study revealed that the six parameter model is valid and reliable for five chromatographic response factors (retardation factor, retardation factor ordered ascending by the chromatographic peak, resolution of pairs of compound, resolution matrix of successive chromatographic peaks, and quality factor). Furthermore, the model could be used as an instrument in analysis of the quality of experimental data. The results obtained by applying the model with six parameters for deviations of rank sums suggest that the data of the experiment no. 8 are questionable.

Introduction

Chromatographic analysis, defined as the technique used for separation of a mixture of compounds by their distribution between two phases, was invented in 1901 by the Russian botanist Mikhail Semyonovich Tsvet, during his research on plant pigments [1]. He used liquid-adsorption column chromatography with calcium carbonate as adsorbent and petroleum ether/ethanol mixtures as eluent to separate chlorophylls and carotenoids. The method was presented at the XI Congress of Naturalists and Doctors in St. Petersburg in 1901 but the term “chromatography” was used for the first time in 1906 in a paper published in the Berichte der Deutschen Botanischen Gesellschaft Journal [2].
Today, chromatography is a separation method frequently used in chemistry [3,4], biology [5,6], and medicine [7,8] as an analytical technique. The choice of the optimum mobile phase composition is the most difficult and most time-consuming task [9,10]. Optimization procedures have been developed by many researchers in order to obtain the optimum mobile phase. Some geometrical or mathematical models were proposed and assessed [11,12,13,14,15]. Moreover, some optimization methods that use neural networks have been introduced [16,17].
A series of experiments were performed and the optimum mobile phases were obtained for different classes of compounds (steroids [18,19,20] and benzodiazepines [13,17]). The aim of the present research was to assess through statistical parameters and tests two mathematical models created for optimization of mobile phase in chromatographic separation applied on a set of androstane isomers.

Material and Methods

Experimental Measurements

A set of five previously investigated androstane isomers (5α-androstane-3β-ol, 5α-androstane-3α-ol, 5α-androstane-17β-ol, 5β-androstane-3α,17β-diol, and 5β-androstane-3β,17β-diol) [20] was included into the study. Eleven experimental values were considered (see Table 1). The 11th experiment was a result of an optimization method applied on an objective function that suggests the composition of the optimum mobile phase as 55:19:26 (trichloromethane:propanone:petroleum ether).
The mathematical models presented in Eqs. (1.1) and (1.2) were used in some studies, including in the investigation of the set of androstane isomers presented in [20,21]:
M7(x1,x2,x3) = a1x1 + a2x2 + a3x3 + a4x1x2 + a5x1x3 + a6x2x3 + a7x1x2x3
M6(x1,x2,x3) = a1x1 + a2x2 + a3x3 + a4x1x2 + a5x1x3 + a6x2x3
where M7, M6 = estimator and predictor of the selected chromatographic parameter; x1, x2, and x3 = molar fractions of the three solvents (where x1 + x2 + x3 = 1); a1, a2, a3, a4, a5, a6, and a7 = model coefficients; first determined based on the best estimation of the selected chromatographic parameter (using 7 experiments for M7; 6 experiments for M6) and then used to predict used chromatographic parameter for any composition of the mobile phase (used for not included in model experiments).
The concentration of the solvents into the mixture and the experimental data are presented in Table 1. The chromatography for experiments 1 - 11 was performed on 5×10 cm glass HPTLC plates pre-coated with silica gel 60 F254 (Merck) (Table 1).
Table 1. Experimental data: androstane isomers.
Table 1. Experimental data: androstane isomers.
No.TCM:Prop:PE*Ll1w1l2w2l3w3l4w4l5w5
133:33:338.706.650.487.360.357.260.234.000.384.760.98
20:0:1008.830.000.420.000.440.000.220.000.250.000.21
30:100:08.758.290.378.490.268.490.117.930.287.790.59
4100:0:09.001.210.622.050.451.430.410.050.230.190.30
550:0:508.930.540.560.980.380.680.270.000.260.000.25
650:50:08.846.710.557.120.317.050.205.310.365.560.69
70:50:508.768.440.368.560.118.560.057.350.317.201.38
810:10:808.863.490.604.710.424.510.280.530.270.641.41
980:10:108.875.080.696.710.516.060.341.010.322.320.63
1010:80:108.828.240.528.410.248.460.147.380.327.270.96
1155:19:2618.953.430.825.861.1611.521.4313.441.2514.381.32
* Trichloromethane (CHCl3 [mL]) : Propanone (CH3COCH3 [mL]) : Petroleum ether ([mL])
L = migration distance of the eluent e
li = migration distance of ithe compound in eluent e (where i = 1, 2, …5)
wi = spot width of ith compound (where i = 1, 2, …, 5)

Statistical Validation

The statistical hypothesis of the research was as follows: the mathematical model with seven parameters proposed by Eq (1.1.), and the model with six parameters proposed by Eq (1.2) are reliable and valid models for estimation of a given response function (H0).
A series of parameters were defined and proposed to be used as estimators of the separation quality (see Table 2). The terms and symbols used in the assessment of the mathematical model are as were established by the International Union of Pure and Applied Chemistry (IUPAC) in the Nomenclature for Chromatography [22].
The model is not considered reliable and valid if the null hypothesis (H0) is rejected at a significance level of 5% in the investigation of the response factors (see Table 2). Pearson (r), Spearman (ρ), Kendall (τ-a,b,c) and Gamma (Γ) correlation coefficients [23,24,25] were used in order to identify and to quantify the nature of the link (quantitative, categorical, semi-quantitative, quantitative and categorical) between experimental and estimated values. The correlation approach was choose for analysis of the quality of the models (Eq (1.1), and Eq (1.2), respectively) due to its ability of identification of linear relationship between two variable (in our case the experimental and estimated values by the proposed mathematical models from Eq (1.1) and Eq (1.2), respectively).
Table 2. Chromatographic response function for statistical assessment.
Table 2. Chromatographic response function for statistical assessment.
ParameterFormulaEq.Notes
Retardation factors (RF) matrixRF(i,e) =
l(i,e)/l(e)
(2)ia separated compound
ethe mobile phase
l(i,e)migration distance of i in e
l(e)migration distance of e
Ordered RFRFO(i,e) =
2·(lo(i+1,e)-lo(i,e))/l(e)
(3)lo(i,e)ith migration coordinate in the list of migration, ordered by length
Resolution matrixRSM(i,j,e) =
2·(l(i,e)-l(j,e))/(w(i,e)+w(j,e))
(4)ja separated compound
w(i,e)spot width of i
w(j,e)spot width of j
Resolution of adjacent spots matrixRSO(i,e) =
2·(lo(i+1,e)-lo(i,e))/(w(i+1,e)+w(i,e))
(5)lo(i,e)ith migration coordinate in the list of migration, ordered by length
Number of componentsnc(e) =
Σi 1 | lo(i+1,e)-lo(i,e)>(w(i+1,e)+w(i,e))/8
(6)nc(e)number of components observed in e at least 1σ (σ = standard deviation)
Maximum number of componentsmnc =
maxe nc(e)
(7)mncfrom all experimented mobile phases (or previous knowledge)
Retardation factors deviationRFD(e) =
√(∑i (ΔRF(i,e)-1/mnc)2/√nc(e)(nc(e)+1)
(8)1/mnctheoretical difference between two retardation factors
ΔRF(i,e)RFO(i+1,e)-RFO(i,e)
Informational energyIEne(e) =
mnc2 - Σi (ni)2
(9)ninumber of compounds that migrate into ith equidistant interval from mnc intervals
Informational entropyIEnt(e) =
Σi (ni)log2(ni)
(10)
Resolution sumRSS(e) =
i RSO(i,e)
(11)RSS(e)average indicator for separation
Effective plates number squared rootQNeff(e) =
4·l(e)/(Σi w(e,i))
(12)QNeff(e)average indicator for a hypothetic quantitative analysis
Resolution divided by the number of effective platesRSP(e) =
25·RSS(e)/QNeff(e)
(13)RSP(e)composite indicator for separation expressed as proportion; note that 4·RSS(e) → QNeff(e) for an ideal separation
Average resolution for separationRSA(e) =
RSS(e)/nc(e)
(14)RSA(e)average indicator for separation
Relative resolution productRRP(e) =
Πi RSO(i,e)/ Σi RSO(i,e)
(15)RRP(e)average indicator for separation
Minkowski type mean of resolutionsRSR(e) =
(∑i (RSO(i,e))1/p/nc(e))p; p = 2
(16)RSR(e)is better descriptor for separation than RSA
Quality factorQF(e) =
mini,j RSM(i,j,e) = mini RSO(i,e)
(17)QF(e)worst one define the resolution of separation
Notes:
÷
Informational energy is a quality factor computed by the Logit method, which is equal with 0 for an ideal separation
÷
Informational energy is a quality factor which is equal with 0 for an ideal separation
÷
Part of the entries in Table 2 were previously used in the literature, with different names; thus, relating [20], RFD was reported as Sm.
A significant correlation coefficient (at a significance level of 5%) sustains the validity of the model from Eq (1.1) and Eq (1.2) in the estimation of chromatographic response factors of interest. An online resource hosted by AcademicDirect was used in order to calculate correlation coefficients and associated statistic parameters (http://l.academicdirect.org/Statistics/linear_dependence/).
The difference between two correlation coefficients was tested by using Statistica 6.0 software, Basic Statistics and Tables - Differences tests - Differences between two correlation coefficients.
If Pearson’s correlation coefficient is much smaller than Spearman’s correlation coefficient applied to the same variables, it can be concluded that the variable of interest correlated consistently (if both are statistically significant), but not in a linear quantitative manner (i.e. may not be linear; may not be quantitative).
A linear relationship between experimental and estimated values (given by Eq (1.1) and Eq (1.2), respectively) was considered to be proven when Pearson’s and Spearman’s correlation coefficients had similar values (i.e. there is no significant statistical difference) that are statistically significant different from zero. The Kendall’s as well as Gamma correlation coefficients make no assumption whatsoever about the distribution of the investigated values (it is a ratio that investigate the “concordant” and “discordant” pairs). Opposite to Pearson's correlation coefficient, Kendall’s tau correlation coefficients are measures of correlation between two ordinal-level variables. When Kendall’s tau correlation coefficient are applied, for more concordant than discordant pairs the value of the coefficient is positive, for equal concordant and discordant pairs the coefficient is zero.

Results and Discussion

A series of results were obtained by applying the parameters described in Table 2 on the set of androstane isomers. Four response functions were computed as the measurement space (Eq (2)-Eq (5)): retardation factor, retardation factors ordered ascending by the chromatographic peak, resolution of separation, and resolution of separation of successive peaks. The proposed response functions (Eq (6)-Eq (17)) for investigation of the reliability and validity of the model presented in Eq (1.1) and Eq (1.2) where computed based on the results of Eq (2)-Eq (5).
The retardation factor matrix (the response function of the compound separation in chromatography, defined as the relative migration distance) for each investigated compound is presented in Table 3.
Table 3. Matrix of retardation factor: experimental vs estimated.
Table 3. Matrix of retardation factor: experimental vs estimated.
No.ExperimentalEstimated by Eq(1.1)Estimated by Eq(1.2)
AI1AI2AI3AI4AI5AI1AI2AI3AI4AI5AI1AI2AI3AI4AI5
10.7640.8450.8340.4600.547 0.006 0.021 0.134 0.159 0.228
20.0000.0000.0000.0000.000
30.9470.9700.9700.9060.890
40.1340.2280.1590.0060.021
50.0600.1100.0760.0000.000
60.7590.8050.7980.6010.629
70.9630.9770.9770.8390.822
80.3940.5320.5090.0600.0720.2910.3140.3080.2030.2160.2150.2190.2710.2790.284
90.5730.7560.6830.1140.2620.3090.3930.3470.1390.1740.1570.1720.2890.3170.364
100.9340.9540.9590.8370.8241.0171.0521.0530.8780.8910.8820.9030.9971.0241.022
110.1810.3090.6080.7090.7590.5050.5910.5650.2530.3240.3090.3230.4380.4650.491
AI1 = 5α-androstane-3β-ol; AI2 = 5α-androstane-3α-ol
AI3 = 5a-androstane-17β-ol; AI4 = 5β-androstane-3α,17β-diol; AI5 = 5β-androstane-3β,17β-diol
The experimental and estimated retardation factor had identical values for first experiments because these were used to construct the model: the coefficients of the model with seven parameters proposed by Eq (1.1) were calculated based on experiments 1...7 presented in Table 1. A similar procedure was followed for experiments 2...7 included in the model with six parameters proposed by Eq (1.2). The differences between experimental and estimated values (for the experiments not included in learning set) showed that the great variation was obtained by the experiment no. 11. The difference varied from -0.324 (AI1 = 5α-androstane-3β-ol, experiment no. 11, model M7) to 0.584 (AI4 = 5β-androstane-3α,17β-diol, experiment no. 11, model M6). The estimated values were greater than the experimental values in half of the cases. Systematically, the estimated values were greater than experimental values for experiment no. 10 - Eq (1.1) and lower than experimental values for experiment no. 1 - Eq (1.2).
The correlation analysis on experimental versus estimated retardation factor give the results presented in Table 4. The analysis of the values of correlation coefficients (all values are higher than 0.5 and all correlation are statistically significant p < 1.39·10-2) revealed that the models from Eq (1.1) and Eq (1.2) have good abilities in estimation of chromatographic retardation factor.
Table 4. Correlation analysis on retardation factor: experimental versus estimated.
Table 4. Correlation analysis on retardation factor: experimental versus estimated.
NameCorrelation coefficientp-valueStatistical parameter
Eq(1.1), n = 20
Pearsonr = 0.72143.31·10-4tPrs,1 = 4.42
Spearmanρ = 0.77895.19·10-5tSpm,1 = 5.27
Semi-QrsQ = 0.74961.42·10-4tsQ = 4.80
Kendall τaτKen,a = 0.63169.89·10-5ZKen,τa = 3.89
Kendall τbτKen,b = 0.63169.89·10-5ZKen,τb = 3.89
Kendall τcτKen,c = 0.60002.17·10-4ZKen,τc = 3.70
GammaΓ = 0.63161.39·10-2ZΓ = 2.46
Eq (1.2), n = 25
Pearsonr = 0.82923.02·10-7tPrs,1 = 7.11
Spearmanρ = 0.90088.45·10-10tSpm,1 = 9.95
Semi-QrsQ = 0.86422.58·10-8tsQ = 8.24
Kendall τaτKen,a = 0.76677.80·10-8ZKen,τa = 5.37
Kendall τbτKen,b = 0.76677.80·10-8ZKen,τb = 5.37
Kendall τcτKen,c = 0.73602.51·10-7ZKen,τc = 5.16
GammaΓ = 0.76673.82·10-5ZΓ = 4.12
The highest correlation coefficient is obtained by the Spearman method and leads to the idea that the retardation factor is a categorical not a quantitative variable. Statistically, there was not identified any significant difference between correlation coefficients obtained by different methods, neither for Eq (1.1) nor for Eq (1.2) (the lowest value of 0.3465 was obtained in comparison of Spearman and Kendall - Eq (1.1); a p-value of 0.0780 was obtained in comparison of Spearman and Kendall τc - Eq (1.2)).
The retardation matrix ordered ascending for each chromatographic peak was obtained based on Eq (3). The results for experimental and estimated determinations are presented in Table 5. The correspondence between each peak and the compound was known on the data presented in Table 5. The difference between experimental and estimated values varied from -0.155 (1st peak, experiment no. 8, Eq (1.2)) to 0.363 (5th peak, experiment no. 9, Eq (1.1)) and 0.675 (4th peak, experiment no. 1, Eq (1.2)). Systematically, the estimated values were higher than experimental values for experiment no. 10, for both Eq (1.1) and Eq (1.2). The correlation analysis between experimental and estimated values (by Eq (1.1) and Eq (1.2), respectively) leads to the results presented in Table 6.
Table 5. Matrix of retardation factors ordered by the chromatographic peak: experimental vs estimated.
Table 5. Matrix of retardation factors ordered by the chromatographic peak: experimental vs estimated.
No.Experimental peakEstimated peak by Eq(1.1)Estimated peak by Eq(1.2)
1st2nd3rd4th5th1st2nd3rd4th5th1st2nd3rd4th5th
10.4600.5470.7640.8340.845 0.0060.0210.1340.1590.228
20.0000.0000.0000.0000.000
30.8900.9060.9470.9700.970
40.0060.0210.1340.1590.228
50.0000.0000.0600.0760.110
60.6010.6290.7590.7980.805
70.8220.8390.9630.9770.977
80.0600.0720.3940.5090.5320.2000.2190.2910.3080.3140.2150.2190.2710.2790.284
90.1140.2620.5730.6830.7560.1410.1720.3090.3470.3930.1570.1720.2890.3170.364
100.8240.8370.9340.9540.9590.8660.9021.0171.0531.0520.8820.9030.9971.0241.022
110.1810.3090.6080.7090.7590.2560.3210.5050.5650.5910.3090.3230.4380.4650.491
L = migration distance of the eluent
The difference between two correlation coefficients was tested and the results are presented in Table 7. By analyzing the results from Table 7 it can be seen that the investigated chromatographic response function is more like to be a categorical variable but not a rank variable then a quantitative variable (the Spearman rank correlation coefficient is statistically significant greater that Kendall τc applied on Eq (1.1)). The data presented in Table 6 revealed that all correlation coefficient were statistically significant (p ≤ 3.82·10-5). Thus, it can be concluded that the link between experimental and estimated by Eq (1.1) and Eq (1.2) data are linear related and sustain the validity of the models from Eq (1.1) and Eq (1.2) for this chromatographic response function.
Table 6. Correlation analysis on retardation factor ordered ascending by the chromatographic peak (experimental vs estimated values).
Table 6. Correlation analysis on retardation factor ordered ascending by the chromatographic peak (experimental vs estimated values).
NameCorrelation coefficientp-valueStatistical parameter
Eq(1.1), n = 20
Pearsonr = 0.86548.38·10-7tPrs,1 = 7.33
Spearmanρ = 0.95793.39·10-11tSpm,1 = 14.15
Semi-QrsQ = 0.91052.53·10-8tsQ = 9.34
Kendall τaτKen,a = 0.85261.47·10-7ZKen,τa = 5.26
Kendall τbτKen,b = 0.85261.47·10-7ZKen,τb = 5.26
Kendall τcτKen,c = 0.81005.94·10-7ZKen,τc = 4.99
GammaΓ = 0.85267.42·10-6ZΓ = 4.48
Eq(1.2), n = 25
Pearsonr = 0.82923.02·10-7tPrs,1 = 7.11
Spearmanρ = 0.90088.45·10-10tSpm,1 = 9.95
Semi-QrsQ = 0.86422.58·10-8tsQ = 8.24
Kendall τaτKen,a = 0.76677.80·10-8ZKen,τa = 5.37
Kendall τbτKen,b = 0.76677.80·10-8ZKen,τb = 5.37
Kendall τcτKen,c = 0.73602.51·10-7ZKen,τc = 5.16
GammaΓ = 0.76673.82·10-5ZΓ = 4.12
Table 7. Matrix of p-values: test of difference between two correlation coefficients.
Table 7. Matrix of p-values: test of difference between two correlation coefficients.
PearsonSpearmanSemi-QKendall τaKendall τbKendall τcGamma
Eq(1.1), n = 20 Eq(1.2), n = 25
Pearson1.00000.35100.72870.58050.58050.43460.5805
Spearman0.08241.00000.55590.14080.14080.09030.1408
Semi-Q0.57430.23051.00000.36990.36990.26140.3699
Kendall τa0.74160.05190.24681.00000.36990.26140.3699
Kendall τb0.74160.05190.24681.00001.00001.00000.8178
Kendall τc0.37830.02230.08900.58030.58031.00000.8178
Gamma0.74160.05190.24681.00001.00000.58031.0000
The resolution of separation between any two investigated androstane isomers could be considered one of the top-three quality measurements of a chromatographic separation. The resolution matrix between pairs of androstane isomers are presented in Table 8. The highest the resolution value between two compounds, the better the separation is considered.
The difference between experimental and estimated values for experiments 8-11 varied from to -7.232 (the resolution between 5β-androstane-3α,17β-diol and 5β-androstane-3β,17β-diol, experiment no. 8, Eq (1.1)) to 11.827 (the resolution between 5a-androstane-17β-ol and 5β-androstane-3α,17β-dio, experiment no.8, Eq (1.2)). With a single exception, the estimated exceed the experimental values (experiment no. 10, for both Eq (1.1) and Eq (1.2), respectively). The results of the correlation analysis are presented in Table 9.
Table 8. Resolution matrix of pairs of compounds: experimental vs estimated.
Table 8. Resolution matrix of pairs of compounds: experimental vs estimated.
No.AI1-AI2AI1-AI3AI1-AI4AI1-AI5AI2-AI3AI2-AI4AI2-AI5AI3-AI4AI3-AI5AI4-AI5
Experimental
11.6871.7186.1632.5890.3109.1783.89510.6894.1321.118
20.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
30.6350.8331.1081.0420.0002.0741.6472.8722.0000.322
41.5700.4272.7292.2171.4425.8824.9604.3133.4930.528
50.9360.3371.3171.3330.9233.0633.1112.5662.6150.000
60.9530.9073.0771.8550.2755.4033.1206.2143.3480.476
70.5110.5853.2541.4250.0005.7621.8266.7221.9020.178
82.3922.3186.8052.8360.57112.1164.44814.4734.5800.131
92.7171.9038.0594.1821.52913.7357.70215.3037.7112.758
100.4470.6672.0481.3110.2633.6791.9004.6962.1640.172
112.4557.1919.67110.2344.3716.2906.8711.4332.0800.732
Estimated by Eq(1.1)
80.5120.4561.9680.8910.1783.2111.4023.6391.4090.207
91.5150.7863.4472.2531.0626.4054.5425.9123.7700.586
100.8721.0823.0811.620-0.0045.0282.3276.2692.7340.521
111.6811.3805.0422.4800.6777.9714.2928.6654.1420.901
Estimated by Eq(1.2)
10.8220.6732.9731.6880.3725.4402.8476.0912.885 0.196
80.3250.2311.2790.6960.1912.4041.1762.6461.140.008
91.3280.562.7582.0581.0765.5974.3164.9193.5010.387
100.6850.8572.3921.4260.014.222.1015.2762.4650.322
111.0460.6132.7021.8190.7225.2283.5235.2923.2280.225
AI1 = 5α-androstane-3β-ol
AI2 = 5α-androstane-3α-ol
AI3 = 5a-androstane-17β-ol
AI4 = 5β-androstane-3α,17β-diol
AI5 = 5β-androstane-3β,17β-diol
Two out of seven correlation coefficients were not statistically significant according to the Pearson and Gamma correlation coefficients. The values of the other correlation coefficients were not significantly different by each other (the lowest p-value was obtained when Spearman and Kendall τc were compared with values of 0.2578 - Eq (1.1), and 0.1862 - Eq (1.2), respectively).
The experimental matrices of the successive chromatographic peaks and those estimated by the models Eq (1.1) and Eq (1.2) are presented in Table 10. The difference between the experimental and estimated values varied from -0.635 (1st peak - experiment no. 9, Eq (1.1)) and 2.172 (2nd peak - experiment no. 10, Eq (1.1)). In most cases (four out of five), the estimated by Eq (1.1) values were greater than the experimental values for experiment no. 10.
Table 9. Correlation analysis on resolutions: experimental vs estimated.
Table 9. Correlation analysis on resolutions: experimental vs estimated.
NameCorrelation coefficientp-valueStatistical parameter
Estimated by Eq(1.1), n = 40
Pearsonr = 0.51736.30·10-4tPrs,1 = 3.72
Spearmanρ = 0.62141.88·10-5tSpm,1 = 4.89
Semi-QrsQ = 0.56701.36·10-4tsQ = 4.24
Kendall τaτKen,a = 0.44625.02·10-5ZKen,τa = 4.05
Kendall τbτKen,b = 0.44625.02·10-5ZKen,τb = 4.05
Kendall τcτKen,c = 0.43507.71·10-5ZKen,τc = 3.95
GammaΓ = 0.44627.05·10-2ZΓ = 1.81
Estimated by Eq(1.2), n = 50
Pearsonr = 0.61851.70·10-6tPrs,1 = 5.45
Spearmanρ = 0.67866.12·10-8tSpm,1 = 6.40
Semi-QrsQ = 0.64783.67·10-7tsQ = 5.89
Kendall τaτKen,a = 0.49394.18·10-7ZKen,τa = 5.06
Kendall τbτKen,b = 0.49394.18·10-7ZKen,τb = 5.06
Kendall τcτKen,c = 0.48407.07·10-7ZKen,τc = 4.96
GammaΓ = 0.49391.24·10-2ZΓ = 2.50
Table 10. Resolution matrices of successive chromatographic peaks: experimental vs estimated.
Table 10. Resolution matrices of successive chromatographic peaks: experimental vs estimated.
No.Experimental peakEstimated peak by Eq(1.1)Estimated peak by Eq(1.2)
1st2nd3rd4th5th1st2nd3rd4th5th1st2nd3rd4th5th
11.1182.5891.7180.3100.845 0.1962.4870.6730.3720.196
20.0000.0000.0000.0000.000
30.3221.1080.8330.0000.970
40.5282.2170.4271.4420.228
50.0001.3170.3370.9230.110
60.4761.8550.9070.2750.805
70.1783.2540.5850.0000.977
80.1312.8362.3180.5710.5320.2071.2930.4560.1780.3140.0081.2710.2310.1910.008
92.7584.1821.9031.5290.7560.5862.1430.7861.0620.3930.3872.1210.5601.0760.387
100.1722.0480.4470.2630.9590.5212.0641.082-0.0041.0520.3222.0420.8570.010.322
112.4554.3711.4330.7320.7590.9012.2381.3800.6770.5910.2252.1630.6130.7220.225
The results of the correlation analysis of the resolution matrix of successive chromatographic peaks are presented in Table 11. The values and associated significances of the Pearson and Spearman correlation coefficients sustained the linearity of the relationship between experimental and estimated values. The analysis of the results presented in Table 11 revealed that for both equations the Pearson correlation coefficient is greater than the Spearman correlation coefficient, leading to the conclusion that the investigated response function is a quantitative variable. With a single exceptions (Gamma correlation analysis), all correlation method sustained the validity of the model from Eq (1.1) and Eq (1.2) (p ≤ 5.54·10-3). With one exception (Kendall τc, τKen,c = 0.4900), the values of correlation coefficient were higher than 0.5, indicating moderate to good correlations between experimental and estimated values. Four out of seven correlation coefficients (Kendall τa, Kendall τb, Kendall τc, and Gamma) had values less than or equal with 0.4737, indicating a weak correlation between experimental and estimated by Eq (1.2) values. There could not be identified any statistically significant differences between correlation coefficients presented in Table 11. The lower p-values (0.2348 - Eq (1.1), and 0.2592 - Eq (1.2)) were obtained when Pearson and Kendall τc correlation coefficients were compared.
Table 11. Results of correlation analysis: resolution matrix of successive chromatographic peaks (experimental vs estimated).
Table 11. Results of correlation analysis: resolution matrix of successive chromatographic peaks (experimental vs estimated).
NameCorrelation coefficientp-valueStatistical parameter
Estimated by Eq(1.1), n = 20
Pearsonr = 0.74461.66·10-4tPrs,1 = 4.73
Spearmanρ = 0.66921.25·10-3tSpm,1 = 3.82
Semi-QrsQ = 0.70595.06·10-4tsQ = 4.23
Kendall τaτKen,a = 0.51581.47·10-3ZKen,τa = 3.18
Kendall τbτKen,b = 0.51581.47·10-3ZKen,τb = 3.18
Kendall τcτKen,c = 0.49002.52·10-3ZKen,τc = 3.02
GammaΓ = 0.51581.01·10-1ZΓ = 1.64
Estimated by Eq(1.2), n = 25
Pearsonr = 0.68219.24·10-4tPrs,1 = 3.96
Spearmanρ = 0.63612.57·10-3tSpm,1 = 3.50
Semi-QrsQ = 0.65871.59·10-3tsQ = 3.71
Kendall τaτKen,a = 0.47373.50·10-3ZKen,τa = 2.92
Kendall τbτKen,b = 0.47373.50·10-3ZKen,τb = 2.92
Kendall τcτKen,c = 0.45005.54·10-3ZKen,τc = 2.77
GammaΓ = 0.47371.67·10-1ZΓ = 1.38
Seven global indicators of separation were introduced and calculated (see Table 2):
÷
The number distinct compounds on chromatogram DCN - Eq (7);
÷
The string of standard deviation of retardation factors ordered ascending and estimated by Eq (1) compared with ideal positions of the peaks obtained through experiment RFD - Eq (9);
÷
The string of sum of the peak resolutions obtained through experiment RSS - Eq (12);
÷
The squared of effective plate number QN- Eq (13);
÷
Average peaks separation (into experiment) RSA - Eq (15);
÷
The string of mean resolution calculated with Minkowski experimental peaks RSR - Eq (17);
÷
The string of experimental peaks with minimal resolution QF- Eq (18).
The values associated to the global indicators of separation are presented in Table 12 and Table 13, respectively. The results of correlation analysis are presented in Table 14 and Table 15, respectively.
In investigation of the number of distinct compounds on chromatogram, no difference was obtained by both models (Eq (1.1), and Eq (1.2), respectively) for the experiments from 9 to 11 (integer numbers were considered). The same difference of two was obtained between experimental and estimated by both models for experiment no. 8; a difference of 1 for experiment no. 1 when Eq (1.2) was investigated.
Regarding the string of standard deviation of retardation factors ordered ascending and estimated by Eq (1.1) compared with the ideal positions of the peaks obtained through experiment, the lowest difference between experimental and estimated value is obtained by experiment no. 11 (0.001, Eq (1.1)), while the higher difference by the experiment no. 8 (-0.114, Eq (1.1)). The lowest difference of -0.002 was obtained by experiments nos. 8,9 & 10 and the largest difference of -0.010 was obtained by experiment no. 1 when the Eq (1.2) was investigated.
Table 12. Four (response functions) global indicators on chromatography: experimental vs estimated.
Table 12. Four (response functions) global indicators on chromatography: experimental vs estimated.
No.ExperimentalEstimated by Eq(1.1), n = 4Estimated by Eq(1.2), n = 5
DCNRFDRSSQNDCNRFDRSSQNDCNRFDRSSQN
150.0475.73071.900 4 (4.222)0.0573.7383.857
210.2830.000114.680
340.0812.260108.700
450.0554.61089.550
540.0782.580103.840
650.0573.51083.790
730.0974.02079.280
840.0675.86059.4602 (2.368)0.1812.13598.5542 (2.200)0.1831.703101.14
950.03610.37071.2405 (5.168)0.0424.57485.7445 (5.000)0.0444.14288.326
1040.0762.93080.9204 (4.328)0.0623.66189.5914 (4.160)0.0643.22992.174
1150.0408.99063.3805 (5.220)0.0395.19279.1235 (4.650)0.0463.72587.895
DCN = number of distinct compounds on chromatogram;
RFD = string of standard deviation of retardation factors ordered ascending and estimated by Eq(1.1) and Eq(1.2), respectively compared with ideal positions of the peaks obtained through experiment;
RSS = string of sum of the peak resolutions obtained through experiment; QN = squared of effective plate number.
The analysis of the string of sum of the peak resolutions revealed a difference between experimental and estimated of -0.731 (experiment no. 10, Eq (1.1)) and of 5.795 (experiment no. 9, Eq (1.1)). The investigation of Eq (1.2) revealed that the lowest difference of 0.432 was obtained by experiments 8, 9 & 10, and the largest of 2.000 by experiment no. 1. The lowest difference for the squared of effective plate number response function was obtained in the experiments 8, 9 & 10 (-2.582, Eq (1.2)) and the highest one by the experiment no. 8 (-39.094, Eq (1.2)).
Table 13. Other three global indicators of chromatographic parameters: experimental vs estimated.
Table 13. Other three global indicators of chromatographic parameters: experimental vs estimated.
NoExperimentalEstimated by Eq(1.1), n = 4Estimated by Eq(1.2), n = 5
RSARSRQFRSARSRQFRSARSRQF
11.4341.2850.310 0.9320.6360.075
20.0000.0000.000
30.5660.4010.000
41.1531.0350.427
50.6440.4520.000
60.8780.7780.275
71.0040.5590.000
81.4641.1690.1310.5330.3800.0280.4250.2400.000
92.5932.4981.5291.1441.0210.3441.0350.8810.293
100.7330.5730.1720.9160.6960.1050.8070.5560.054
112.2482.0380.7321.2991.1800.3110.9310.7040.138
RSA = average peaks separation (into experiment); RSR = string of mean resolution calculated with Minkowski of experimental peaks;
QF = string of experimental peaks with minimal resolution.
The last three global response functions had the same difference pattern between experimental and estimated values. The lowest difference is obtained in experiment no. 10 and the highest difference by experiment no. 9 for Eq (1.1) and by experiment no. 8, and experiment no. 11, respectively, for Eq (1.2).
The correlation analysis was applied also on the global quality factors. The lower sample size of experimental data is the major limitation of this analysis and explained the absence of the significance (p ≥ 0.05).
Other three response functions were implemented and computed resolution divided by the number of effective plates - RSP; informationa energy - IEne; and informational entropy - IEnt. The experimental an estimated values are presented in Table 16.
The results of correlation analysis on these response functions are presented in Table 17 and Table 18.
The difference of resolution divided by the number of effective plates obtained experimentally and estimated by model from Eq (1.1) varied into a large range: from -3.176 for experiment no. 10 to 36.548 for experiment no. 9 (see Table 15). The variation of the informational energy and informational entropy varied on the same pattern: the lowest difference between experimental and estimated values was obtained by experiment no. 11, while the highest values were obtained by the experiment no. 8 - Eq (1.1).
All response functions presented in Table 16 varied by the same pattern for Eq (1.2): the lowest difference is obtained by experiment no. 8 and the highest one by the experiment no. 1.
Informational response functions, energy and entropy, are parameters that investigated the disorder into the system (in our case the disorders into chromatographic analysis). For both response functions, values lower than 0.41 were obtained, these values being not statistically significant.
Table 14. Results of correlation analysis on global quality factors: Eq (1.1), n = 4.
Table 14. Results of correlation analysis on global quality factors: Eq (1.1), n = 4.
NameCorrelation coefficientp-valueStatistical parameterNameCorrelation coefficientp-valueStatistical parameter
DCNRSA
Pearsonr = 0.81651.80·10-1tPrs,1 = 2.0Pearsonr = 0.59054.09·10-1tPrs,1 = 1.03
Spearmanρ = 0.94285.72·10-2tSpm,1 = 4.0Spearmanρ = 0.60004.00·10-1tSpm,1 = 1.06
Semi-QrsQ = 0.84571.23·10-1tsQ = 2.59Semi-QrsQ = 0.59524.05·10-1tsQ = 1.05
Kendall tau-aτKen,a = 0.66671.75·10-1ZKen,τa = 1.36Kendall tau-aτKen,a = 0.33334.97·10-1ZKen,τa = 0.68
Kendall tau-bτKen,b = 0.73031.49·10-1ZKen,τb = 1.44Kendall tau-bτKen,b = 0.33334.97·10-1ZKen,τb = 0.68
Kendall tau-cτKen,c = 0.50002.79·10-1ZKen,τc = 1.08Kendall tau-cτKen,c = 0.25006.10·10-1ZKen,τc = 0.51
GammaΓ = 1.00004.15·10-2ZΓ = 2.04GammaΓ = 0.33338.21·10-1ZΓ = 0.23
RFDRSR
Pearsonr = 0.54344.56·10-1tPrs,1 = 0.92Pearsonr = 0.71182.88·10-1tPrs,1 = 2.05
Spearmanρ = 0.60004.00·10-1tSpm,1 = 1.06Spearmanρ = 0.60003.46·10-1tSpm,1 = 1.06
Semi-QrsQ = 0.57104.29·10-1tsQ = 0.98Semi-QrsQ = 0.65355.73·10-1tsQ = 1.22
Kendall tau-aτKen,a = 0.33334.97·10-1ZKen,τa = 0.68Kendall tau-aτKen,a = 0.33334.47·10-1ZKen,τa = 0.68
Kendall tau-bτKen,b = 0.33334.97·10-1ZKen,τb = 0.68Kendall tau-bτKen,b = 0.33334.97·10-1ZKen,τb = 0.68
Kendall tau-cτKen,c = 0.2506.10·10-1ZKen,τc = 0.51Kendall tau-cτKen,c = 0.25006.10·10-1ZKen,τc = 0.51
GammaΓ = 0.33338.21·10-1ZΓ = 0.23GammaΓ = 0.33338.21·10-1ZΓ = 0.23
RSSQF
Pearsonr = 0.59064.09·10-1tPrs,1 = 1.04Pearsonr = 0.89361.06·10-1tPrs,1 = 2.82
Spearmanρ = 0.60004.00·10-1tSpm,1 = 1.13Spearmanρ = 1.00005.47·10-2tSpm,1 = 4.10
Semi-QrsQ = 0.59534.05·10-1tsQ = 1.05Semi-QrsQ = 0.94536.68·10-2tsQ = 2.82
KendallτKen,a = 0.33331.97·10-1ZKen,τa = 0.68KendallτKen,a = 1.00004.15·10-2ZKen,τa = 2.04
KendallτKen,b = 0.33334.97·10-1ZKen,τb = 0.68KendallτKen,b = 1.00004.15·10-2ZKen,τb = 2.04
KendallτKen,c = 0.25006.10·10-1ZKen,τc = 0.51KendallτKen,c = 0.75001.26·10-1ZKen,τc = 1.53
GammaΓ = 0.33338.21·10-1ZΓ = 0.23GammaΓ = 1.00004.15·10-2ZΓ = 2.04
QN
Pearsonr = -0.15889.85·10-1tPrs,1 = 0.22n = sample size;
DCN = number of distinct compounds on chromatogram;
RFD = string of standard deviation of retardation factors estimated by Eq(1) ordered ascending compared with ideal positions of the peaks obtained through experiment;
RSS = string of sum of the peak resolutions obtained through experiment;
QN = squared of effective plate number;RSA = average peaks separation (into experiment);
RSR = string of Minkowski mean resolution of experimental peaks;
QF = string of experimental peaks with minimal resolution.
Spearmanρ = -0.20008.22·10-1tSpm,1 = 0.29
Semi-QrsQ = 0.17828.00·10-1tsQ = 0.25
Kendall tau-aτKen,a = 0.00001.00ZKen,τa = 0.00
Kendall tau-bτKen,b = 0.00001.00ZKen,τb = 0.00
Kendall tau-cτKen,c = 0.00001.00ZKen,τc = 0.00
GammaΓ = 0.00001.00ZΓ = 0.00
Table 15. Results of correlation analysis on global quality factors: Eq (1.2), n = 5.
Table 15. Results of correlation analysis on global quality factors: Eq (1.2), n = 5.
NameCorrelation coefficientp-valueStatistical parameterNameCorrelation coefficientp-valueStatistical parameter
DCNRSA
Pearsonr = 0.74541.48·10-1tPrs,1 = 3.75Pearsonr = 0.46984.25·10-1tPrs,1 = 0.92
Spearmanρ = 0.47224.22·10-1tSpm,1 = 0.93Spearmanρ = 0.50003.91·10-1tSpm,1 = 1.00
Semi-QrsQ = 0.59332.92·10-1tsQ = 1.28Semi-QrsQ = 0.48474.08·10-1tsQ = 0.96
Kendall tau-aτKen,a = 0.30004.62·10-1ZKen,τa = 0.73Kendall tau-aτKen,a = 0.40003.27·10-1ZKen,τa = 0.98
Kendall tau-bτKen,b = 0.31624.49·10-1ZKen,τb = 0.76Kendall tau-bτKen,b = 0.40003.27·10-1ZKen,τb = 0.98
Kendall tau-cτKen,c = 0.24005.44·10-1ZKen,τc = 0.61Kendall tau-cτKen,c = 0.32004.33·10-1ZKen,τc = 0.78
GammaΓ = 0.42866.53·10-1ZΓ = 0.45GammaΓ = 0.40006.95·10-1ZΓ = 0.39
RFDRSR
Pearsonr = 0.55203.35·10-1tPrs,1 = 1.15Pearsonr = 0.68272.04·10-1tPrs,1 = 2.62
Spearmanρ = 0.90003.74·10-2tSpm,1 = 3.58Spearmanρ = 0.90003.74·10-2tSpm,1 = 3.58
Semi-QrsQ = 0.70491.84·10-1tsQ = 1.72Semi-QrsQ = 0.78381.17·10-1tsQ = 2.19
Kendall tau-aτKen,a = 0.80005.00·10-2ZKen,τa = 1.96Kendall tau-aτKen,a = 0.80005.00·10-2ZKen,τa = 1.96
Kendall tau-bτKen,b = 0.80005.00·10-2ZKen,τb = 1.96Kendall tau-bτKen,b = 0.80005.00·10-2ZKen,τb = 1.96
Kendall tau-cτKen,c = 0.64001.17·10-1ZKen,τc = 1.57Kendall tau-cτKen,c = 0.64001.17·10-1ZKen,τc = 1.57
GammaΓ = 0.80001.17·10-1ZΓ = 1.57GammaΓ = 0.80001.17·10-1ZΓ = 1.57
RSSQF
Pearsonr = 0.46914.25·10-1tPrs,1 = 0.92Pearsonr = 0.98711.76·10-3tPrs,1 = 10.67
Spearmanρ = 0.50003.91·10-1tSpm,1 = 1.00Spearmanρ = 1.00001.24·10-2tSpm,1 = 5.41
Semi-QrsQ = 0.48434.08·10-1tsQ = 0.96Semi-QrsQ = 0.99356.26·10-4tsQ = 15.14
KendallτKen,a = 0.40003.27·10-1ZKen,τa = 0.98KendallτKen,a = 1.00001.43·10-2ZKen,τa = 2.45
KendallτKen,b = 0.40003.27·10-1ZKen,τb = 0.98KendallτKen,b = 1.00001.43·10-2ZKen,τb = 2.45
KendallτKen,c = 0.32004.33·10-1ZKen,τc = 0.78KendallτKen,c = 0.80005.00·10-2ZKen,τc = 1.96
GammaΓ = 0.40006.95·10-1ZΓ = 0.39GammaΓ = 1.00001.43·10-2ZΓ = 2.45
QN
Pearsonr = -0.41894.82·10-1tPrs,1 = 0.80n = sample size; DCN = number of distinct compounds on chromatogram;
RFD = string of standard deviation of retardation factors ordered ascending and estimated by Eq(1) compared with ideal positions of the peaks obtained through experiment;
RSS = string of sum of the peak resolutions obtained through experiment; QN = squared of effective plate number; RSA = average peaks separation (into experiment);
RSR = string of mean resolution calculated with Minkowski experimental peaks;
QF = string of experimental peaks with minimal resolution.
Spearmanρ = -0.30006.24·10-1tSpm,1 = 0.54
Semi-QrsQ = 0.35455.58·10-1tsQ = 0.66
Kendall tau-aτKen,a = 0.20006.24·10-1ZKen,τa = 0.49
Kendall tau-bτKen,b = 0.20006.24·10-1ZKen,τb = 0.49
Kendall tau-cτKen,c = 0.16007.05·10-1ZKen,τc = 0.39
GammaΓ = 0.20009.22·10-1ZΓ = 0.10
Table 16. Resolution ratio, informational energy and entropy: experimental vs estimated.
Table 16. Resolution ratio, informational energy and entropy: experimental vs estimated.
No.ExperimentalEstimated by Eq(1.1), n = 4Estimated by Eq(1.2), n = 5
RSPIEntIEneRSPIEntIEneRSPIEntIEne
131.9004.00016.000 17.67810.4072.67
20.00011.6100.000
38.30011.6100.000
420.6008.0008.000
59.90011.6100.000
616.8008.0008.000
720.30011.6100.000
839.4004.00016.00011.09610.3712.5608.02411.7540.00
958.2002.00018.00021.6527.3389.28018.588.7226.40
1014.50011.6100.00017.6769.3604.80014.60410.7441.92
1156.7004.75014.00027.2855.20313.56516.8529.9023.78
RSP = resolution divided by the number of effective plates;
IEnt = informational energy;
IEne = informational entropy.
Table 17. Results of correlation analysis for response functions presented in Table 15: Eq (1.1), n = 4.
Table 17. Results of correlation analysis for response functions presented in Table 15: Eq (1.1), n = 4.
NameCorrelation coefficientp-valueStatistical parameterNameCorrelation coefficientp-valueStatistical parameter
RSPIEnt
Pearsonr = 0.53264.67·10-1tPrs,1 = 0.90Pearsonr = 0.31886.81·10-1tPrs,1 = 0.48
Spearmanρ = 0.60004.00·10-1tSpm,1 = 1.06Spearmanρ = 0.00001.00tSpm,1 = 0.00
Semi-QrsQ = 0.56534.35·10-1tsQ = 0.97Semi-QrsQ = 0.00001.00tsQ = 0.00
Kendall tau-aτKen,a = 0.33334.97·10-1ZKen,τa = 0.68Kendall tau-aτKen,a = 0.00001.00ZKen,τa = 0.00
Kendall tau-bτKen,b = 0.33334.97·10-1ZKen,τb = 0.68Kendall tau-bτKen,b = 0.00001.00ZKen,τb = 0.00
Kendall tau-cτKen,c = 0.25006.10·10-1ZKen,τc = 0.51Kendall tau-cτKen,c = 0.00001.00ZKen,τc = 0.00
GammaΓ = 0.33338.21·10-1ZΓ = 0.23GammaΓ = 0.00001.00ZΓ = 0.00
IEne
Pearsonr = 0.29627.04·10-1tPrs,1 = 0.44RSP = resolution divided by the number of effective plates;
IEnt = informational energy;
IEne = informational entropy;
n = sample size.
Spearmanρ = 0.00001.00tSpm,1 = 0.00
Semi-QrsQ = 0.00001.00tsQ = 0.00
KendallτKen,a = 0.00001.00ZKen,τa = 0.00
KendallτKen,b = 0.00001.00ZKen,τb = 0.00
KendallτKen,c = 0.00001.00ZKen,τc = 0.00
GammaΓ = 0.00001.00ZΓ = 0.00
Table 18. Results of correlation analysis for response functions presented in Table 15: Eq (1.2), n = 5.
Table 18. Results of correlation analysis for response functions presented in Table 15: Eq (1.2), n = 5.
NameCorrelation coefficientp-valueStatistical parameterNameCorrelation coefficientp-valueStatistical parameter
RSPIEnt
Pearsonr = 0.28646.40·10-1tPrs,1 = 0.52Pearsonr = 0.37705.32·10-1tPrs,1 = 0.71
Spearmanρ = 0.50003.91·10-1tSpm,1 = 1.00Spearmanρ = 0.41044.92·10-1tSpm,1 = 0.78
Semi-QrsQ = 0.37845.30·10-1tsQ = 0.71Semi-QrsQ = 0.39345.12·10-1tsQ = 0.74
Kendall tau-aτKen,a = 0.40003.27·10-1ZKen,τa = 0.98Kendall tau-aτKen,a = 0.30004.62·10-1ZKen,τa = 0.73
Kendall tau-bτKen,b = 0.40003.27·10-1ZKen,τb = 0.98Kendall tau-bτKen,b = 0.31624.48·10-1ZKen,τb = 0.76
Kendall tau-cτKen,c = 0.32004.33·10-1ZKen,τc = 0.78Kendall tau-cτKen,c = 0.24005.44·10-1ZKen,τc = 0.61
GammaΓ = 0.40006.95·10-1ZΓ = 0.39GammaΓ = 0.33337.85·10-1ZΓ = 0.27
IEne
Pearsonr = 0.31526.05·10-1tPrs,1 = 0.58RSP = resolution divided by the number of effective plates;
IEnt = informational energy;
IEne = informational entropy;
n = sample size.
Spearmanρ = 0.41044.92·10-1tSpm,1 = 0.78
Semi-QrsQ = 0.35965.52·10-1tsQ = 0.67
KendallτKen,a = 0.30004.62·10-1ZKen,τa = 0.73
KendallτKen,b = 0.31624.48·10-1ZKen,τb = 0.76
KendallτKen,c = 0.24005.44·10-1ZKen,τc = 0.61
GammaΓ = 0.33337.85·10-1ZΓ = 0.27
The summary of the acceptance of the linear relationships between experimental and estimated values by the Eq (1.1) and Eq (1.2) for the investigated response functions is presented in Table 19.

Experiments Quality Assessment

RF, RFO, RSM, RSO, and QF chromatographic response functions can be accepted as being dependent on mobile phase composition with a good confidence, according to the results presented in Table 19. This partial conclusion can be used now backward, in order to see what is wrong (if there is something) in the experiments. Table 20 presented the biggest 20% differences between experimental and predicted by the models values. The Diff column from Table 20 contains relative differences calculated as follows:
Diff = 50·|Exp-Est|/(Exp+Est) [%]
Results from Table 20 can be analyzed in terms of relative deviation using Eq (18) if the values of relative deviation obtained for a given model and chromatographic response functions are ordered by rank.
Table 19. Summary of validation the response functions estimated by the model from Eq (1.1) and Eq (1.2), respectively.
Table 19. Summary of validation the response functions estimated by the model from Eq (1.1) and Eq (1.2), respectively.
ParameterPearsonSpearmanSemi-QKendall τaKendall τbKendall τcGamma
Eq (1.1)
RF
RFO
RSM
RSO
QF
DCN
RFD
RSS
QN
RSA
RSR
RSP
IEne
IEnt
Eq (1.2)
RF
RFO
RSM
RSO
QF
DCN
RFD
RSS
QN
RSA
RSR
RSP
IEne
IEnt
✓ = statistical significant at a significance level of 5%; ✕ = statistical insignificant at a significance level of 5%;
RF = retardation factor; RFO = retardation factor ordered ascending by the chromatographic peak;
RSM = resolution of pairs of compounds;
RSO = resolution matrix of successive chromatographic peaks;
DCN = number of distinct compounds on chromatogram;
RFD = string of standard deviation of retardation factors ordered ascending and estimated by Eq(1.1), Eq(1.2) respectively, compared with ideal positions of the peaks obtained through experiment;
RSS = string of sum of the peak resolutions obtained through experiment;
QN = squared of effective plate number;
RSA = average peaks separation (into experiment);
RSR = string of mean resolution calculated with Minkowski experimental peaks;
QF = string of experimental peaks with minimal resolution;
RSP = resolution divided by the number of effective plates;
IEnt = informational energy; IEne = informational entropy;
Table 20. Summary of validation the response functions estimated by the model from Eq (1.1) and Eq (1.2), respectively.
Table 20. Summary of validation the response functions estimated by the model from Eq (1.1) and Eq (1.2), respectively.
NoModelCRFEstimatedExperimentalDifference (%)Group RankExp No
1Eq (1.1)RF0.5050.18123.62111
2Eq (1.1)RF0.2530.70923.70211
3Eq (1.1)RF0.2160.07225.0038
4Eq (1.1)RF0.2030.06027.1948
5Eq (1.1)RSM0.4562.31833.5618
6Eq (1.1)RSM1.387.19133.90211
7Eq (1.1)RSM8.6651.43335.81311
8Eq (1.1)RSM0.6774.37136.59411
9Eq (1.1)RSM-0.0040.26351.54510
10Eq (1.1)RFO0.3930.75615.8019
11Eq (1.1)RFO0.3470.68316.3129
12Eq (1.1)RFO0.2190.07225.2638
13Eq (1.1)RFO0.2000.0626.9248
14Eq (1.1)RSO0.5862.75832.4819
15Eq (1.1)RSO0.4562.31833.5628
16Eq (1.1)RSO-0.0040.26351.54310
18Eq (1.1)QF0.0280.13132.3918
19Eq (1.2)RF0.3110.70919.51111
20Eq (1.2)RF0.3220.75920.21211
21Eq (1.2)RF0.4380.18120.76311
22Eq (1.2)RF0.2150.07224.9148
23Eq (1.2)RF0.2200.06028.5758
24Eq (1.2)RSM2.64614.47334.5418
25Eq (1.2)RSM1.81910.23434.91211
26Eq (1.2)RSM0.1961.11835.0831
27Eq (1.2)RSM0.7224.37135.82411
28Eq (1.2)RSM0.3872.75837.6959
29Eq (1.2)RSM0.3252.39238.0468
30Eq (1.2)RSM0.2312.31840.9478
31Eq(1.2)RSM0.6137.19142.15811
32Eq (1.2)RSM0.0080.13144.2498
33Eq (1.2)RSM0.0100.26346.341010
34Eq (1.2)RFO0.2890.57316.4719
35Eq (1.2)RFO0.3640.75617.5029
36Eq (1.2)RFO0.3170.68318.3039
37Eq (1.2)RFO0.2190.07225.2648
38Eq (1.2)RFO0.2150.06028.1858
39Eq (1.2)RSO0.2312.31840.9418
40Eq (1.2)RSO0.2252.45541.60211
41Eq (1.2)RSO0.0080.13144.2438
42Eq (1.2)RSO0.0100.26346.34410
43Eq (1.2)QF-0.0230.13171.3018
A qualitative measure of experiments results can be obtained by constructing the graphical representation of experiments based on the rank sums of relative deviations. Figure 1 presents the plot of rank sums deviations for the experiments included in the estimation.
Figure 1. Rank sums for relative deviations of investigated experiments.
Figure 1. Rank sums for relative deviations of investigated experiments.
Molecules 13 01617 g001
The analysis of values presented in Figure 1 showed that over 60 relative deviation ranks sum are recorded for experiment no. 8 (about 46.4%), followed by over 30 relative deviation ranks sum for experiment no. 11 (about 24.6%, about half relative to the experiment no. 8). Considering a normal distribution of unbiased error, the expected distribution of deviations rank sum is uniform. Thus, the expected frequencies for ranks sums in investigated experiments are no more than 25% (no more than 34 relative deviation ranks sum). Concluding, the results obtained for deviations of rank sums suggest that results from experiment no. 8 are questionable.

Conclusions

The analyses of the data presented in Table 19, Table 20 and Figure 1 leads to the following conclusions:
  • The model presented in Eq (1.2) seems to be more reliable for the estimation of chromatographic response functions on investigated androstane isomers. Four response functions (RF - retardation factor; RFO - retardation factor ordered ascending by the chromatographic peak; RSM - resolution of pairs of compounds; QF - string of experimental peaks with minimal resolution) revealed statistically significant linear relationships between experimental and estimated values.
  • The models presented in Eq (1.1) is valid and reliable in investigation of retardation factor, retardation factor ordered ascending by the chromatographic response, resolution of pairs of compounds and resolution matrix of successive chromatographic peaks;
  • Good performances are obtained in estimation of resolution of pairs of compounds but the relationship between experimental and estimated values by Eq (1.1) and Eq (1.2) could be questionable due to the absence of significantly statistic Gamma correlation coefficient;
  • Some estimation abilities were observed in investigation of the string of standard deviation of retardation factors ordered ascending estimated by Eq (1.2) compared with ideal positions of the peaks obtained through experiment; and of the string of Minkowski type mean resolution calculated by Eq (1.2) with experimental peaks. These two chromatographic response functions seem to be qualitative and rank variables.
  • Two global response functions for the separation, abbreviated as QF and DCN recorded a weak acceptance in investigation of Eq (1.1). Thus, QF are rejected at 95% confidence by Spearman (with 5.47% error), Semi-Q (6.68% error), Pearson (with 10.6% error) and Kendall τc (with 12.6% error) even if the correlations are good (over 0.75). The small dimension of the sample size, not grater enough to provide statistical significance of the obtained correlations, explained with a good confidence the rejection of these correlations. Note that QF chromatographic response function is in fact a minimum function of resolutions of the separation, resolutions that are accepted by the model from Eq (1.1) - see 2nd conclusion. DCN is statistically significant by the Goodman-Kruskal method (concordant vs. discordant) and is near to be statistically significant by the Spearman method (5.72% error). Thus, the rejection is recorded for a quantitative correlation, but a possible acceptance is seen by the qualitative correlation. Again, small sample size is against of a solid statistical conclusion for DCN.
  • The results presented in Experiments Quality Assessment subsection sustain the hypothesis that the proposed equations (Eq (1.1) and Eq (1.2), respectively) could be used in order to verify the quality of experimental data. The results obtained for deviations of rank sums suggest that the experimental data of the experiment no. 8 are questionable.

Acknowledgements

The research was partly supported by the UEFISCSU Romania through research grants (ET108/2006, AT93/2007 & CEEX2444/2008).

References and Notes

  1. Senchenkova, E.M. Tsvet (or Tswett). Mikhail Semenovich (1872 - 1919). In Dictionary of scientific biography; Gillispie, Ch. C., Ed.; Charles Scribner Sons: New York, 1976; pp. 486–488. [Google Scholar]
  2. Tswett, M. Adsorption analysis and chromatographic method. Application to the chemistry of the chlorophyll. (in German). Ber. Deut. Bot. Ges 1906, 24, 316–323. [Google Scholar]
  3. Duarte, A.C.; Capelo, S. Application of chemometrics in separation science. J. Liq. Chrom. Rel. Technol 2006, 29, 1143–1176. [Google Scholar] [CrossRef]
  4. Xu, L.; Tang, L.-J.; Cai, C.-B.; Wu, H.-L.; Shen, G.-L.; Yu, R.-Q.; Jiang, J.-H. Chemometric methods for evaluation of chromatographic separation quality from two-way data-A review. Anal. Chim. Acta 2008, 613, 121–134. [Google Scholar] [CrossRef]
  5. Miyake, T.; Yafuso M. Pollination of Alocasia cucullata (Araceae) by two Colocasiomyia flies known to be specific pollinators for Alocasia odora. Plant Species Biol 2005, 20, 201–208. [Google Scholar] [CrossRef]
  6. De Abreu, I.N.; Sawaya, A.C.; Eberlin, M.N.; Mazzafera, P. Production of pilocarpine in callus of jaborandi (Pilocarpus microphyllus stapf). In Vitro Cell. Dev. Biol.: Plant 2005, 41, 806–811. [Google Scholar] [CrossRef]
  7. Sharma, L.; Desai, A.; Sharma, A. A thin layer chromatography laboratory experiment of medical importance. Biochem. Mol. Biol. Educ 2006, 34, 44–48. [Google Scholar]
  8. Reddy, B.S.; Rozati, R.; Reddy, B.V.R.; Raman, N.V.V.S.S. Association of phthalate esters with endometriosis in Indian women. BJOG 2006, 113, 515–520. [Google Scholar] [CrossRef]
  9. Mulija, M.; Indrayanto, G. Steroid Analysis by TLC. In Encyclopedia of Chromatography; Cazes, J., Ed.; Marcel Dekker: New York, 2001; pp. 794–797. [Google Scholar]
  10. Scott, R.P.W. Thin Layer Chromatography; Library4Science: Letchworth Garden City, UK, 2007. [Google Scholar]
  11. Nyiredy, Sz.; Meier, B.; Erdelmeier, C.A.J.; Sticher, O. PRISMA: A Geometrical Design for Solvent Optimization in HPLC. J. High Resolut. Chromatogr. Chromatogr. Commun 1985, 8, 186–188. [Google Scholar] [CrossRef]
  12. Zhang, Y.P.; Zhang, Y.J.; Gong, W.J.; Gopalan, A.I.; Lee, K.-P. Rapid separation of Sudan dyes by reverse-phase high performance liquid chromatography through statistically designed experiments. J. Chromatogr. A 2005, 1098, 183–187. [Google Scholar]
  13. Cimpoiu, C.; Jäntschi, L.; Hodişan, T. A New Method for Mobile Phase Optimization in High-Performance Thin-Layer Chromatography (HPTLC). J. Planar Chromatogr. - Mod. TLC 1998, 11, 191–194. [Google Scholar]
  14. Cimpoiu, C.; Hodisan, T. Application of numerical taxonomy techniques to the choice of optimum mobile phase in high-performance thin-layer chromatography (HPTLC). J. Pharm. Biomed. Anal 1999, 21((5)), 895–900. [Google Scholar] [CrossRef]
  15. Vasiljević, T.; Onjia, A.; Čokeša, Đ.; Laušević, M. Optimization of artificial neural network for retention modeling in high-performance liquid chromatography. Talanta 2004, 64, 785–790. [Google Scholar] [CrossRef]
  16. Tran, A.T.K.; Hyne, R.V.; Pablo, F. Optimisation of the separation of herbicides by linear gradient high performance liquid chromatography utilising artificial neural networks. Talanta 2007, 71, 1268–1275. [Google Scholar] [CrossRef]
  17. Cimpoiu, C.; Jäntschi, L.; Hodişan, T. A New Mathematical Model for the Optimization of the Mobile Phase Composition in HPTLC and the Comparision with Other Models. J. Liq. Chromatogr. Relat. Technol 1999, 22, 1429–1441. [Google Scholar] [CrossRef]
  18. Jäntschi, L.; Hodişan, S.; Cimpoiu, C.; Ceteraş, I. Analysis of Some Steroids by TLC Using Optimum Mobile Phases. Acta Univ. Cibin. - Ser. F Chem 2005, 8, 67–76. [Google Scholar]
  19. Jäntschi, L.; Hodişan, S.; Cimpoiu, C.; Hosu, A.; Darvasi, E.; Hodişan, T. Modeling of thin-layer chromatographic separation of androstane isomers. J. Planar Chromatogr. - Mod. TLC 2007, 20((2)), 91–94. [Google Scholar]
  20. Hodişan, S.; Cimpoiu, C.; Casoni, D. Separation of Androstan Isomers by High Performance Thin Layer Chromatography (HPTLC) Using an Optimum Mobile Phase. Acta Univ. Cibin. - Ser. F Chem 2004, 7, 27–31. [Google Scholar]
  21. Stoenoiu, C.E.; Bolboacă, S. D.; Jäntschi, L. Mobile Phase Optimization Method for Steroids Separation. Appl. Med. Inform 2006, 18, 17–24. [Google Scholar]
  22. Ettre, L.S. Nomenclature for Chromatography. (IUPAC Recommendations 1993). Pure Appl. Chem 1993, 65, 819–872. [Google Scholar] [CrossRef]
  23. Bolboacă, S.D.; Jäntschi, L. Pearson versus Spearman, Kendall's Tau Correlation Analysis on Structure-Activity Relationships of Biologic Active Compounds. Leonardo J. Sci 2006, 9, 179–200. [Google Scholar]
  24. Jäntschi, L.; Bolboacă, S.D. Triazines herbicidal assessed activity. Stud. Cercet. Stiint. - Ser. Biol., Univ. Bacau 2007, 12, 57–62. [Google Scholar]
  25. Bolboacă, S.D.; Jäntschi, L. Modelling the Property of Compounds from Structure: Statistical Methods for Models Validation. Environ. Chem. Lett 2008, 6, 175–181. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Bolboacă, S.D.; Pică, E.M.; Cimpoiu, C.V.; Jäntschi, L. Statistical Assessment of Solvent Mixture Models Used for Separation of Biological Active Compounds. Molecules 2008, 13, 1617-1639. https://doi.org/10.3390/molecules13081617

AMA Style

Bolboacă SD, Pică EM, Cimpoiu CV, Jäntschi L. Statistical Assessment of Solvent Mixture Models Used for Separation of Biological Active Compounds. Molecules. 2008; 13(8):1617-1639. https://doi.org/10.3390/molecules13081617

Chicago/Turabian Style

Bolboacă, Sorana D., Elena M. Pică, Claudia V. Cimpoiu, and Lorentz Jäntschi. 2008. "Statistical Assessment of Solvent Mixture Models Used for Separation of Biological Active Compounds" Molecules 13, no. 8: 1617-1639. https://doi.org/10.3390/molecules13081617

APA Style

Bolboacă, S. D., Pică, E. M., Cimpoiu, C. V., & Jäntschi, L. (2008). Statistical Assessment of Solvent Mixture Models Used for Separation of Biological Active Compounds. Molecules, 13(8), 1617-1639. https://doi.org/10.3390/molecules13081617

Article Metrics

Back to TopTop