1. Introduction
The human scent and its molecular profile are one of the major topics of current forensic interest. The possibility of its chemical analysis and identification of the individual based on scent composition would mean a significant breakthrough in forensic practice, providing more convincing evidence than specially trained dogs [
1,
2,
3,
4,
5]. In addition to forensic practice, monitoring changes in the molecular profile of an individual would be of great importance for the evaluation of physical or mental conditions in biometric and medical applications [
6,
7,
8].
Concerning human scent, a number of publications [
9,
10] focus mainly on volatile organic compounds (VOCs), which are largely contained in human skin scent and which can be found in all body fluids and excrement: in breath, saliva, blood, milk, urine, feces, and skin secretions (sweat and follicular fluids). In 2014, de Lacy Costello et al. [
6] published a review with several hundred VOCs of the human scent. However, some authors point to the importance of low volatile compounds [
11,
12,
13]. In general, the human skin scent contains several thousand different compounds, e.g., acids, alcohols, aldehydes, amides, amines, esters, hydrocarbons, ketones, heterocyclic compounds, fatty acids, and fatty acid esters. These compounds, in addition, can vary in concentrations by several orders of magnitude [
12]. The qualitative and quantitative variability of the human scent is influenced by many circumstances, e.g., by the sampling condition and sampled body parts, age, sex, heredity, diet, drugs, mood and emotions, cosmetics, and environmental factors [
14,
15].
For the analysis of volatile compounds in real complex samples, such as human scent, conventional gas chromatography is often an insufficient separation tool. Some analytes cannot be separated during a single analysis due to their similar properties, and either overlapping or insufficiently distributed peaks appear in the chromatogram, causing problems for both identification and quantification.
The aim of the experiments described in this work is to find the optimal arrangement of the 2D chromatographic system and a suitable chromatographic method to achieve the best separation condition for non-targeted analysis of human scent samples.
2. Materials and Methods
2.1. Materials
Hexane (quality for GC-MS) and all standards used (see
Table S1) were obtained from Sigma-Aldrich (St. Louis, MO, USA). Ethanol (for UV–VIS spectroscopy, min. 99.8%) was purchased from Penta (CZ). The chromatographic columns Rxi-5MS, Rxi-17Sil MS, Rtx-200 MS (Restek, Bellefonte, PA, USA) and TG-5HT (Thermo Scientific, Waltham, MA, USA) had the same column diameter of 0.25 mm and a stationary phase thickness of 0.25 μm. The carrier gas was helium 5.5 (purity ≥ 99.9995) from Linde (CZ).
2.2. Preparation of a Mixture of Standards
To develop and optimize the chromatographic measurement method, a model mixture of human skin scent containing 98 compounds (see
Supplementary Materials, Table S1) with a concentration of 100 mg·L
−1 of each component in hexane was created. A solution with a concentration of 25 mg·L
−1 of each component in hexane was used for the work. The compounds were chosen to cover the widest possible range (chemical family, polarity, molecular weight) of compounds found in human skin scent samples.
2.3. Preparation of Real Scent Samples
The developed chromatographic methods were also verified by measuring real scent samples. The scent samples were taken on glass beads of the palms of volunteers. Details on the purification of the sampling material and the scent sampling are given in
Appendix A. The samples were subsequently extracted by double extraction into 1 mL of ethanol, each extraction step lasting 20 min, of which 10 min extraction was supported by shaking (1000 rpm, IKAVIBRAX VXR basic, Staufen im Breisgau, Germany) and 10 min by placing the samples in an ultrasonic bath (Fisherbrand FB15059, Fisher Scientific, Loughborough, UK). The individual extracts were gradually evaporated to dryness in a 1.5 mL vial in a Genevac EZ-2 concentrator under reduced pressure (approx. 100 mbar). The residue was then redissolved in 100 μL of a 1:1 mixture of ethanol and hexane. The samples prepared in this way were stored in a refrigerator at 7 °C until the analysis.
2.4. Instrumentation
The samples were analyzed on a 7890B GC gas chromatograph (Agilent, Santa Clara, CA, USA) with a Pegasus® 4D-C time-of-life mass detector from LECO Corp. (St. Joseph, MI, USA). A MultiPurpose Sampler MPS Robotic from Gerstel (Mülheim, Germany) was used to dose samples to the column. The sample measurement and data evaluation were performed in the ChromaTOF® program (version 4.72.0.0, LECO Corp., St. Joseph, MI, USA).
During the development of the method, five main column configurations were tested, namely:
Classic arrangement: non-polar Rxi-5MS (2 + 30 m) + medium polar Rxi-17Sil MS (1 m);
Classic arrangement: non-polar Rxi-5MS (2 + 30 m) + medium polar Rtx-200MS (1 m);
Reverse arrangement: medium polar Rxi-17Sil MS (2 + 30 m) + non-polar TG-5HT (1 m);
Reverse arrangement: medium polar Rtx-200 MS (2 + 30 m) + non-polar TG-5HT (1 m);
Reverse arrangement: medium polar Rtx-200 MS (2 + 15 m) + non-polar TG-5HT (1 m),
where the first bracket shows the length of the pre-column and the primary column, the pre-column being the same as the primary column, and the second bracket shows the total length of the secondary column, of which 10 cm is in the modulator, 21 cm in the GC-MS transfer line and 69 cm in the secondary oven.
The injection of samples (1 µL) took place at a temperature of 280 °C in the splitless mode with a period of 2 min. The carrier gas flow was set at a constant 1.5 mL/min. The optimized temperature program was: 40 °C—2 min—5 °C/min—320 °C—10 min, the total time of one analysis was 68 min (4080 s). The secondary column was heated to a temperature 5 °C higher than the primary column and the modulator to a temperature 15 °C higher than the secondary column. In the case of a 15 m primary column instead of 30 m, the temperature program was shortened by 10 min (the final holding of the final temperature was eliminated), and the secondary column was heated 10 °C higher than the primary column.
Cryogenic modulation was performed on a secondary column using dried air at −80 °C. The modulation periods and the corresponding hot and cold pulse times were different for different column configurations; see
Table 1. The warm pulse length corresponds to 0.3 times the modulation period length and the cold pulse length to 0.2 times the modulation period length.
The transfer line was set to 280 °C. The mass detector was in electron ionization (EI) mode with an ionizing electron energy of 70 eV, and the temperature of the ion source was 250 °C. Data collection took place in the Total Ion Current (TIC) mode; in the range 29–800 m/z, the data collection speed was 200 spectra/s. The acquisition delay was set to 500 s for a 30 m primary column, and in the case of a 15 m column it was shortened to 300 s. The voltage on the detector was set 200 V higher than tuning to increase the detection sensitivity.
2.5. Choosing the Best System Configuration
The different systems were compared with each other in terms of orthogonality, the used separation space, the maximum theoretical peak capacity and the condition peak capacity of the system, the achieved resolution, and the number of coelutions.
2.5.1. Spearman’s Correlation Coefficient
Spearman’s correlation coefficient is used to measure monotonic nonlinear correlations between retention times and can be used to measure the orthogonality of systems [
16]. First, the order of retention times in the first (
rit1) and second dimensions (
rit2) is determined for compound
i. The Spearman’s correlation coefficient (
ρ) is then calculated according to Equation (1),
where
di =
rit1 −
rit2 and
n is the number of compounds.
2.5.2. Bin Counting Method
The bin counting method [
17,
18] serves both as an indicator of orthogonality and an indicator of the coverage of the separation space. In the bin counting method, retention times in both dimensions must first be normalized according to Equation (2),
where
1tr,norm(i) and
2tr,norm(i) are normalized retention times in the first and the second dimension,
tr(i) is the retention time of compound
i, and
tr,first and
tr,last are the retention times of the first and last observed peaks in the separation space, respectively.
Then, the separation space is divided into bins, whose number is the same or very close to the number of compounds in the sample. Subsequently, the number of bins containing a peak is counted and compared with the total number of bins according to Equation (3),
where
Pmax is the total number of bins. Occupied space
O takes values from 0 (completely non-orthogonal system) to 1 (completely orthogonal system). The coefficient of 0.63 is based on the observation that systems close to orthogonality have a ratio of filled and total bins (so-called
fcoverage) close to 0.63.
For reverse systems, the separation space was divided into three parts, which corresponded to the individual modulation periods. The total value O of the given reverse system was calculated as a weighted average of partial O.
2.5.3. Peak Capacities
The theoretical peak capacities [
19] of the system in both dimensions (
1nc and
2nc) were calculated according to Equation (4).
where
tr is the retention time of the last of first observed peaks in the separation space, and
wb is the measured average 4σ peak width. The maximum theoretical GC × GC capacity of the
2Dnc system was then determined as the product of the peak capacities in the first (
1nc) and the second (
2nc) dimensions (Equation (5)).
However, orthogonality and surface coverage have a huge impact on the calculations of the conditional (practical) peak capacity
2Dnc,cond of a comprehensive 2D system [
19]. It is given by Equation (6),
where
fcoverage is the ratio of filled and total bins (see
Section 2.5.2), and
β is an undersampling of the first dimension, and it is given by Equation (7),
where
ts is the first-dimension sampling time, and
1σ is the peak standard deviation in the first dimension.
For reverse systems, the separation space was divided into three parts, which corresponded to the individual modulation periods. The total value of theoretical peak capacity and the conditional peak capacity of the given reverse system were calculated as a sum of partial capacitances.
2.5.4. Resolutions
With knowledge of the maximum theoretical peak capacities (see Equation (4)), the average resolutions (
Rs,av) in both dimensions can be calculated [
20]. It is given by Equation (8),
where
n is the number of compounds.
The resolutions of the two adjacent peaks in the individual dimensions (
1Rs and
2Rs) were calculated from the retention times (
tr1 and
tr2) and the width of the peaks at the base (
wb1 and
wb2) according to Equation (9) [
21],
and the total resolution in a 2D system of
2DRs was then calculated as the square root of the sum of the squares of the resolutions obtained in the first and second dimensions; see Equation (10) [
21].
3. Results
3.1. Classic Arrangement: Non-Polar Rxi-5MS (2 + 30 m) + Medium Polar Rxi-17Sil MS (1 m)
The first column arrangement tested was the 30 m Rxi-5ms column with the 2 m Rxi-5ms precolumn in the first dimension and the 1.5 m Rxi-17Sil MS column in the second dimension. The Rxi-5ms column with a (5%-phenyl)-methylpolysiloxane phase has a low polarity, which is suitable for a wide range of analyses. The second Rxi-17Sil MS column with a (50%-phenyl)-methylpolysiloxane phase is characterized by medium polarity and higher selectivity to aromatic substances and is usually recommended for environmental analyses. After the initial analyses, however, it was observed that when using a column length of 1.5 m in the second dimension, the retention times in the second dimension are too high related to the modulation period and a large area of unused separation space was formed at the bottom of the chromatogram. For this reason, the secondary column was shortened to 1 m, and this column length was used as the standard for the other column arrangements.
The ramp of the temperature program was also tested under the above conditions (see
Section 2.4). At a temperature rate of 3 °C/min, the temperature program was very long (105 min), at 8 °C/min, the temperature program was shortened to 47 min, however, the resolution of the peaks in the first dimension deteriorated significantly. The rate of 5 °C/min was chosen as the most suitable, as a compromise between the total length of the program and the achieved resolution in the first dimension.
Figure 1 shows the chromatogram obtained under the optimized conditions (see
Section 2.4). The classification zones of homologues of various compounds are highlighted.
n-alkanes are marked with black contour, with which amines partially coelute (lower dark yellow zone). For instance, heptadecane with N,N-dimethyltetradecanamide achieved an overall resolution of
2DRs = 0.65. Acids and esters (left red zone), aldehydes and ketones (yellow zone), alcohols (left white zone), and chloroalkanes (green zone) showed similar retention on the secondary column. For example, dodecan-1-ol with undecanoic acid achieved an overall resolution of
2DRs = 0.52 and dodecan-1-ol with 1-chlordodecane
2DRs = 0.56. Ethers (right white zone) and amides (upper dark yellow zone) are then clearly separated from these compounds. The right red zone then represents the area of higher fatty acid esters. Unlabeled compounds with higher retention on the secondary column represent compounds classified as “others”, which are often part of cosmetic products. The list of standards including retention indices calculated from absolute retention times according to [
22] is given in
Supplementary Material,
Table S1.
Unfortunately, it was not possible to distinguish structural isomers such as hexadecyl ester of tetradecanoic acid and dodecyl ester of octadecanoic acid in the model mixture (
2DRs = 0, both compounds have the general formula C
30H
60O
2 and M
r = 452.80, their mixed mass spectrum see
Figure 2). The total number of analyte coelutions (with
2DRs < 1.5) in the mixture was 38.
As described above, the Rxi-17Sil MS column is selective for aromatics, resulting in high retention of aromatic compounds in the second dimension. This was also shown in the subsequent analysis of the real sample, when the modulation period of 4 s was too short, and the so-called wrap-around effect occurred; see
Figure 3.
3.2. Classic Arrangement: Non-Polar Rxi-5MS (2 + 30 m) + Medium Polar Rtx-200MS (1 m)
In another tested arrangement, the secondary column was replaced with the Rtx-200MS column with a (trifluoropropyl)-methylpolysiloxane phase, which, like Rxi-17Sil MS, belongs to the group of the mid-polar columns; however, the polarity is slightly higher. The great advantage of the Rtx-200MS column is its unique selectivity for electronegative substances, which changes the elution order and can ensure a resolution that cannot be achieved on the phenyl, cyano or polyethylene glycol phases.
The chromatogram of the model mixture measured on the Rxi-5ms and the Rtx-200MS columns under optimized conditions (see
Section 2.4) is then shown in
Figure 4. Several differences can be observed compared to the chromatogram in
Figure 1. First, there was a clear separation of aldehydes and ketones in the second dimension (yellow classification zone) from acids and esters (first red zone). Second, there was a significant reduction in aromatic retention in the second dimension (see caffeine, which had
2tr = 3.18 s for the Rxi-17Sil MS column,
2tr = 1.66 s for the Rtx-200 MS column). However, the worst--separated places remained poorly differentiated. Amines (lower dark yellow zone) still coeluted with
n-alkanes (black zone), e.g., the previously discussed heptadecane with N,N-dimethyltetradecanamine reached
2DRs = 0.57. Alcohols (left white zone) then still showed similar retention as acids (left red zone) and chloroalkanes (green zone). For example, dodecan-1-ol with undecanoic acid achieved an overall resolution of
2DRs = 0.65 and dodecan-1-ol with 1-chlordodecane
2DRs = 0.61. The structural isomers in the area of higher fatty acid esters (right red zone) were not distinguished at all (
2DRs = 0). The total number of analyte coelutions (with
2DRs < 1.5) in the mixture was 34.
Due to the lower retention of aromatic compounds, in the case of a real sample, there was no longer a wrap-around effect during the modulation period of 4 s. See
Figure 5 for the chromatogram of the real sample.
3.3. Reverse Arrangement: Medium Polar Rxi-17Sil MS (2 + 30 m) + Non-Polar TG-5HT (1 m)
Due to the failure of achieving satisfactory separation in the classical arrangement of the columns, the order of the tested columns was reversed. The first reverse system consisted of the 30 m primary column Rxi-17Sil MS with the 2 m precolumn, also Rxi-17Sil MS. A 1 m long TG-5HT was chosen as the secondary column, which is formed as the Rxi-5ms by a non-polar (5%-phenyl)-methylpolysiloxane phase; however, it is designed up to temperatures of 400 °C and the manufacturer guarantees up to 40% longer life compared to similar columns.
Figure 6 shows the measured chromatogram obtained under optimal conditions (see
Section 2.4), from which it is clear, that the elution of different types of substances was reversed compared to the classical systems. The highest retention on the secondary column had
n-alkanes (black classification zone), from which amines (upper dark yellow zone) were visibly separated, e.g., the mentioned problem site of heptadecane and N,N-dimethyltetradecanamine reached an overall resolution of
2DRs = 4.15. Acids and esters (left red zone) and aldehydes and ketones (yellow zone) were separated from each other within the first dimension, but not within the second dimension. Alcohols (left white zone), such as dodecan-1-ol, were further separated from the acids in the first dimension, and undecanoic acid reached a resolution of
2DRs = 1.15. Previously mentioned problematic places, namely, alcohols (white zone) and chloroalkanes (green zone), were successfully separated within the first and second dimensions (the distinction between dodecan-1-ol and 1-chlordodecane reached
2DRs = 3.51). The structural isomers in the area of higher fatty acid esters (right red zone) again remained indistinguishable (
2DRs = 0). However, the distribution of amines and chloroalkanes deteriorated, e.g., for N,N-dimethyldodecanamine with 1-chlordodecane the resolution was
2DRs = 0.71 (in both classical column arrangements the resolutions were
2DRs~3.7). The total number of analyte coelutions (with
2DRs < 1.5) in the mixture was 24. The chromatogram of a real scent sample is shown in
Figure 7.
3.4. Reverse Arrangement: Medium Polar Rtx-200 MS (2 + 30 m) + Non-Polar TG-5HT (1 m)
The second reverse system consisted of the 30 m primary column Rtx-200 MS with the 2 m precolumn, also Rtx-200 MS. The 1 m long TG-5HT remained as a secondary column.
Figure 8 shows that all studied compounds’ groups have already been separated. In the second dimension, there was a more significant separation of amines (upper dark yellow classification zone) from chloroalkanes (green zone) within the second dimension, alcohols (left white zone) were more significantly separated from the area of acids and esters (left red zone), from whose zone also separated aldehydes and ketones (yellow zone). At the same time, however, there was a change in the retention of aromatic or polyunsaturated hydrocarbons (see squalene, cholesterol, caffeine). In this arrangement, squalene eluted near octacosane with a resolution of
2DRs = 1.34 and cholesterol near tetradecyl ester of hexadecanoic acid (
2DRs = 2.15). It was again not possible to distinguish the structural isomers in the region of higher fatty acid esters (second red bubble,
2DRs = 0). The total number of analyte coelutions (with
2DRs < 1.5) in the mixture was 7. The chromatogram of a real scent sample is shown in
Figure 9.
3.5. Reverse Arrangement 3: Medium Polar Rtx-200 MS (2 + 15 m) + Non-Polar TG-5HT (1 m)
To shorten the analysis time, an alternative to a 15 m primary column instead of a 30 m column was tested. While maintaining the original temperature gradient, the total analysis time was reduced by 10 min (for more details on setting up the analysis, see
Section 2.4). In this configuration, however, the resolution of the compounds within the first dimension deteriorated (compared to the 30 m system, see
Table 2) and the shape of the peaks deteriorated, in particular, their profile (see
Figure 10). The total number of analyte coelutions (with
2DRs < 1.5) in the mixture was 21.
4. Discussion
Five different configurations of the 2D chromatographic system were tested to find its optimal arrangement and at the same time with the intention of finding a suitable chromatographic method for non-targeted analysis of human odor samples. In testing chromatographic methods, mainly the temperature ramp was optimized. The rate of 5 °C/min was chosen as the most suitable, as a compromise between the total length of the program and the achieved resolution in the first dimension. Individual chromatographic systems were evaluated in terms of orthogonality, use of separation space, peak capacities, resolutions, and finally the number of coelutions (see
Table 2). The first of the tested systems in the classical arrangement (Rxi-5 ms/Rxi-17Sil MS) reached a high theoretical peak capacity; however, the use of separation space was the worst compared to the other tested systems. Thanks to this, the conditional peak capacity was considerably lower, and the number of coelutions of analytes of the model mixture was the highest (38). In the case of the analysis of a real scent sample, which contained, e.g., many aromatic heterocyclic compounds with high retention, the so-called wrap-around effect. The second system in the classical arrangement (Rxi-5ms/Rtx-200 MS) showed the highest orthogonality according to Spearman’s correlation coefficient, and the use of the separation space was considerably higher than in the first case. The theoretical and conditional peak capacities were generally lower than in the first case, as the peak capacity in the second dimension decreased due to the lower retention of aromatic compounds (e.g., caffeine). However, the average resolution of the analytes improved slightly, but multiple coelutions still occurred (34 in total). Due to the lower retention of aromatic compounds, there was no longer a wrap-around effect in the analysis of a real scent sample. Reverse systems (except Rxi-17Sil MS/TG-5HT system) according to Spearman’s correlation coefficient, showed lower orthogonality than systems in the classical arrangement; nevertheless, the use of separation space was considerably higher, thanks to which the maximum theoretical and conditional peak capacities were higher than in the case of classical systems. The highest conditional peak capacity was then achieved with the Rtx-200 MS/TG-5HT system. The average 2D resolution was lower than in the case of classical systems; however, the number of coelutions (7 in total) was the lowest in this case, mainly due to the much higher resolution in the second dimension than was the case in previous systems. This was also shown in the case of analyses of real samples when, in the case of Rxi-17 Sil MS/TG-5HT columns, the separation of the above-mentioned aromatic heterocyclic compounds was insufficient. In the case of the Rtx 200 MS/TG-5HT system, even these heterocyclic compounds were sufficiently separated.
The reverse system (Rtx-200 MS/TG-5HT) was evaluated as the optimal system for human scent analysis and an alternative of a 15 m primary column instead of a 30 m column was tested on this arrangement to shorten the analysis time. However, when using the 15 m column, the theoretical and conditional peak capacities decreased considerably, especially in the first dimension, which is due to the deterioration of the peak shape and, above all, peaks expansion.
The deterioration of the peak shape is mainly related to the long modulation periods (15, 18, and 20 s), which were needed to avoid the wrap-around effect. The high retention in the second dimension can be explained by the fact that, at the same temperature program, the analytes are eluted from the shorter primary column at a lower temperature, i.e., the separation on the secondary column also occurs at a lower temperature, which is associated with prolongation of retention. With a higher setting of the temperature offset between the primary and secondary ovens, there was a significant loss of resolution in the second dimension. Furthermore, it must be said that long modulation periods also correlate with long periods of cold and warm pulses (see
Table 1 in
Section 2.4). The problem is that short columns generate relatively narrow primary peaks, which require rapid and sharp reinjections into the secondary system to maintain properly performed modulation and cryofocusing, which was not followed in this case. The solution could be a further optimization of the modulation process, a change of the length of the secondary column, or change of the diameter and thickness of the stationary phase on the primary and secondary columns, respectively.
None of the systems was able to distinguish the structural isomers of higher fatty acid esters, which were represented in the model mixture by hexadecyl ester of tetradecanoic acid and dodecyl ester of octadecanoic acid (both compounds have the general formula C30H60O2 and Mr = 452.80). Due to their structure and the long aliphatic chains, it would probably be necessary to use a very non-polar column, such as Rxi-1ms, to distinguish between them. In addition to changing the stationary phase, it would also probably be necessary to increase the separation efficiency, either by increasing the length of the column, which would, however, lead to very long temperature programs, or by reducing the internal diameter of the column and the thickness of the stationary phase, which in turn could cause insufficient column capacity. However, although the isomers could not be resolved chromatographically, they could be distinguished based on the characteristic masses in the mass spectrum. These esters have a typical m/z, which corresponds to the loss of the alkyl chain of the alcohol associated with the migration of two hydrogens.