Next Article in Journal
Steroidomics for the Prevention, Assessment, and Management of Cancers: A Systematic Review and Functional Analysis
Previous Article in Journal
Extending the Metabolite Diversity of the Endophyte Dimorphosporicola tragani
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Normalizing Untargeted Periconceptional Urinary Metabolomics Data: A Comparison of Approaches

by
Ana K. Rosen Vollmar
1,
Nicholas J. W. Rattray
1,2,
Yuping Cai
1,
Álvaro J. Santos-Neto
1,3,
Nicole C. Deziel
1,
Anne Marie Z. Jukic
4 and
Caroline H. Johnson
1,*
1
Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT 06510, USA
2
Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, G4 0RE Glasgow, UK
3
São Carlos Institute of Chemistry, University of São Paulo, São Carlos 13566-590, SP, Brazil
4
Epidemiology Branch, National Institute of Environmental Health Sciences, Durham, NC 27709, USA
*
Author to whom correspondence should be addressed.
Metabolites 2019, 9(10), 198; https://doi.org/10.3390/metabo9100198
Submission received: 7 August 2019 / Revised: 9 September 2019 / Accepted: 19 September 2019 / Published: 21 September 2019

Abstract

:
Metabolomics studies of the early-life exposome often use maternal urine specimens to investigate critical developmental windows, including the periconceptional period and early pregnancy. During these windows changes in kidney function can impact urine concentration. This makes accounting for differential urinary dilution across samples challenging. Because there is no consensus on the ideal normalization approach for urinary metabolomics data, this study’s objective was to determine the optimal post-analytical normalization approach for untargeted metabolomics analysis from a periconceptional cohort of 45 women. Urine samples consisted of 90 paired pre- and post-implantation samples. After untargeted mass spectrometry-based metabolomics analysis, we systematically compared the performance of three common approaches to adjust for urinary dilution—creatinine adjustment, specific gravity adjustment, and probabilistic quotient normalization (PQN)—using unsupervised principal components analysis, relative standard deviation (RSD) of pooled quality control samples, and orthogonal partial least-squares discriminant analysis (OPLS-DA). Results showed that creatinine adjustment is not a reliable approach to normalize urinary periconceptional metabolomics data. Either specific gravity or PQN are more reliable methods to adjust for urinary concentration, with tighter quality control sample clustering, lower RSD, and better OPLS-DA performance compared to creatinine adjustment. These findings have implications for metabolomics analyses on urine samples taken around the time of conception and in contexts where kidney function may be altered.

1. Introduction

With growing interest in the exposome—the totality of an individual’s exposures across the life course—metabolomics is a powerful tool for measuring both chemical exposures and biological responses at a molecular level [1]. Because of the challenges in continuous longitudinal exposome monitoring, efforts to characterize the exposome often focus on critical developmental windows, including the periconceptional period and pregnancy [2,3]. In environmental health research, urine is a valuable biospecimen not only for evaluating the presence of environmental chemicals through biomonitoring, but also for assessing the biological impact of exposures through changes to endogenous metabolites. However, accounting for differential urinary dilution across samples through application of a normalization strategy is essential to reduce measurement error and accurately interpret findings [4]. Although urinary creatinine concentration is commonly used to adjust for urinary dilution in biomonitoring studies, creatinine excretion is known to vary based on multiple factors, including sex, age, diet, exercise, muscle mass, body mass index (BMI), and health conditions that impact kidney function [4,5,6,7,8,9,10]. Normal pregnancy substantially alters kidney function and creatinine excretion, with changes documented as early as 3–6 weeks after the last menstrual period (LMP) [10], and clinically observable through increased urine production and decreased urinary creatinine concentrations [11]. Hormonal changes of pregnancy promote fluid and electrolyte retention, with concomitant vasodilation to accommodate the increase in water and blood volume [12]. Kidney volume increases 30% and glomerular filtration rate increases 50–80% above pre-pregnancy levels by the end of the first trimester [13,14]. Using creatinine to adjust for urinary concentration during times of changing kidney function, such as the critical window of early pregnancy, has the potential to introduce confounding and measurement errors [5,8,15,16].
In metabolomics research, there is no consensus on the optimal normalization strategy to account for concentration variability in urinary metabolomics data, especially in situations where creatinine excretion or kidney function may be impacted [17,18,19]. Metabolomics normalization strategies adjust data for both analytical and biological variability: analytical variability results from changes in instrument performance over the course of an experiment [20,21], while biological variability is due to biological differences between subjects, including sample concentration [17,18]. Additional variability also can be introduced by experimental conditions and study design decisions. Normalization can occur pre-acquisition, in which samples are diluted to a common concentration before analysis, or post-acquisition, which adjusts the data after analysis [22]. Other common approaches used in metabolomics include normalization to an internal standard or reference compound such as creatinine, and adjustment using specific gravity or osmolality. Specific gravity is a ratio of the density of urine (in this case) to pure water at constant temperature, while osmolality measures the solute concentration in a given solution in moles per kilogram, or osmoles [23]. Additionally, there are exclusively statistical approaches developed specifically for metabolomics research, such as probabilistic quotient normalization (PQN) [24,25]. These methods typically use a summary statistic of metabolite responses for normalization. PQN uses the distribution of metabolites across pooled quality control (QC) samples as a normalization factor, and is a fundamentally different approach to normalization from creatinine or specific gravity adjustment because it relies on the attributes of QC samples as a reference, rather than on the study samples themselves. First developed for nuclear magnetic resonance (NMR) spectroscopy [24], PQN has since been applied widely within mass spectrometry-based metabolomics [25].
Most studies systematically comparing post-acquisition normalization approaches for untargeted urinary metabolomics data have just compared statistical approaches [22,25,26,27]. Only two studies have compared both statistical approaches and methods that incorporate direct measurements from samples, such as using an internal standard, creatinine, or specific gravity to adjust for concentration [22,28]. Although pre-acquisition normalization strategies have yielded promising results [19,28,29], application of this strategy can be impractical in the large cohorts increasingly common in exposome and environmental health studies, given the intensive sample preparation processes. Thus, the objective of this study was to determine the optimal post-analytical normalization approach for untargeted metabolomics analysis of periconceptional urine samples. We used a subset of women from a periconceptional cohort and compared the performance of three common approaches to adjust for urinary dilution: creatinine, specific gravity, and PQN.

2. Results

To compare normalization approaches, we used 90 paired pre- and post-implantation pooled urine samples from 45 women. After mass spectrometry analysis and data processing, we used support vector regression (SVR) to correct for analytical variability in all datasets. We then compared three approaches to adjust for metabolite concentration variability—creatinine, specific gravity, and PQN—using unsupervised principal components analysis (PCA), and the relative standard deviation (RSD) of pooled QC samples. Orthogonal partial least-squares discriminant analysis (OPLS-DA) was used to assess the variance in the data associated with pre- and post-implantation status for each normalization method, and models were validated with a permutation test. Statistically significant changes in peaks from pre- to post-implantation for each normalization approach were evaluated with the Wilcoxon paired signed-rank test.
PCA scores plots of the raw data without application of normalization methods for analytical or biological variability show that the QC samples are not well-clustered (Figure 1 and Figure 2, panel A). PCA plots were assessed using visual inspection of the relative dispersion of QC samples [22,30]; tightly clustered QC samples are indicative of good reproducibility, as theoretically the QC samples should be in an identical position within the plot. Data normalized for analytical variability alone using SVR demonstrate tight QC clustering. When the data are further adjusted for concentration variability, QC clustering is tightest for specific gravity normalization, followed by PQN, and then creatinine. Creatinine normalization shows poor QC clustering especially for reversed-phase liquid chromatography (RPLC) data (Figure 1, panel C).
Comparison of the RSD across QC samples by normalization method (Table 1) shows that adjusting for analytical variability alone with SVR results in 94.0% (RPLC data) and 80.3% (hydrophilic interaction chromatography, HILIC, data) of peaks having an RSD < 0.3, which is a substantial improvement over the raw data. RSD is a measure of the variability of the peak area across QC samples, with a lower RSD indicating less variability and more similarity across the QC samples, which again theoretically should be identical. There is general consensus in the metabolomics research community that an RSD < 0.3 is indicative of good reproducibility [31]. Additional adjustment for concentration variability using creatinine and specific gravity normalization results in a similar RSD for both RPLC and HILIC data compared to SVR normalization. However, PQN normalization yields a substantially lower median RSD (0.08 for RPLC and 0.11 for HILIC) compared to other methods, even though when visualized via PCA plot the QCs are slightly more dispersed compared to specific gravity-normalized data (Figure 1 and Figure 2). These low RSD results are an artifact of the PQN normalization process: PQN-normalized QCs have a peak area very close to 1 because the PQN method divides the QC peak area by the median of the mean QC peak area for a given m/z, and the given QC peak area. Because normalized QC peak areas are close to 1, the standard deviation is small, resulting in an even smaller RSD. Therefore, using RSD alone to compare PQN-normalized QCs with QCs normalized by other methods may not be reliable.
We then compared the mean difference in RSD between each normalization method using paired t-tests (Table 2). Positive mean differences indicate the mean RSD of the first method listed in Table 2 is larger and therefore less optimal than that of the second method. These comparisons confirmed that additional normalization for urinary concentration using PQN improves the RSD compared to creatinine and specific gravity for both RPLC and HILIC data. Specific gravity performed better than creatinine for RPLC data, but there was no statistically significant difference in RSD between specific gravity and creatinine for HILIC data. The equivalent RSD between SVR and specific gravity is a result of the specific gravity normalization method, in which QC peak areas are the same as those normalized by SVR alone, leading to no difference in RSD when comparing these methods (see Section 4.6, Equation (1)).
We carried out OPLS-DA analysis to determine the variance in the data associated with pre- and post-implantation status, as well as the predictability of the model (Table 3), including only those peaks with RSD < 0.3. As a result of using this RSD threshold, PQN data included more features than any other approach because of the low RSD associated with the PQN process. The variation in the data related to pre- and post-implantation status was extremely similar for specific gravity and creatinine for RPLC data (R2X = 0.36 and 0.37 respectively), and much lower for PQN (R2X = 0.18). The low R2X of PQN may be related to the fact that more peaks are included that might not have relevance to implantation status because of the method’s low RSD from the PQN calculation process, which could be a source of bias. For HILIC data, it was highest for creatinine (R2X = 0.37), with PQN and specific gravity much lower (R2X = 0.25 and 0.20 respectively). For all methods, there was good group separation by implantation status (R2Y > 0.80 for all RPLC data, and R2Y > 0.75 for all HILIC data). Best group separation by implantation status was observed in PQN-normalized data, followed by specific gravity and then creatinine. The overall predictive performance of the model was similar across normalization methods for RPLC data, with the creatinine-adjusted model performing the poorest (Q2 = 0.58) and PQN the best (Q2 = 0.69). For HILIC data, the creatinine-adjusted model had a substantially lower Q2 (0.27) compared to specific gravity or PQN (Q2 = 0.53 and 0.52 respectively).
This suggests that for both RPLC and HILIC modes, models that use features that have been creatinine-adjusted result in poorer predictions for features correlated with implantation status. The difference between R2Y and Q2 is < 0.3 for all three normalization approaches in RPLC data; however, it is > 0.3 for all three approaches in the HILIC data, suggesting overfitting of the model [32]. Results of the permutation test (Supplementary Materials, Figures S1 and S2) to validate the model were similar across all normalization methods, and in all cases had a Q2 intercept close to 0 (range of −0.08 to 0.09), showing reasonable model performance. For RPLC data, creatinine, specific gravity, and PQN all had a similar number of features with variable importance in projection or VIP > 1 and q < 0.05 (Table 3). Features meeting both of these criteria are of higher importance in the OPLS-DA model and show statistically significant differences from pre- to post-implantation. For HILIC data, PQN had the most features meeting these criteria (n = 2578), followed by specific gravity (n = 2358) and then creatinine (n = 1491).
Taken together, these analyses indicate that PQN and specific gravity normalization both outperform creatinine adjustment; in some tests PQN performs better than specific gravity, while in others specific gravity outperforms PQN. Specific gravity has tighter clustering of QC samples in unsupervised PCA plots compared to PQN (Figure 1 and Figure 2), and while PQN has a lower RSD than specific gravity, this is an artifact of the PQN normalization process, suggesting the two approaches perform similarly when evaluated by RSD (Table 1 and Table 2). In OPLS-DA results (Table 3) for RPLC data, PQN has a high Q2 but a low R2X, indicating good group separation but relatively little variability explained by implantation status, while specific gravity performs consistently well across assessments. For HILIC data, OPLS-DA results for PQN and specific gravity are similar. Finally, for RPLC data, PQN and specific gravity have a similar number of features meeting the VIP > 1 and q < 0.05 criteria, while for HILIC data PQN has more features meeting this threshold than does specific gravity. Creatinine is a poor choice for concentration normalization of periconceptional urinary metabolomics data, and either PQN or specific gravity are good alternative options.

3. Discussion

We found that either specific gravity or PQN are good approaches for post-acquisition normalization of untargeted, periconceptional urinary metabolomics data. Our findings suggest that creatinine adjustment may not be an effective normalization approach for samples collected around the time of conception and during early pregnancy. Although there is some consensus in the metabolomics literature that normalization for urinary dilution using creatinine may be inaccurate given the many biological factors that can influence kidney function and creatinine excretion, there is not agreement on the optimal normalization approach [17,18,22,33,34,35,36].
The performance of urinary creatinine compared to specific gravity adjustment for accurate biomarker measurement has been explored in biomedical and environmental health contexts, with studies finding specific gravity robust to the physiologic parameters that influence creatinine (i.e., age, sex), and to have less within-person variability [5,6,7,37,38,39]. These studies conclude that specific gravity is either better than or equivalent to creatinine when adjusting for urinary concentration. To our knowledge, no metabolomics studies have directly compared creatinine and specific gravity normalization, although osmolality and creatinine normalization have been compared. Osmolality, which measures the number of solute particles per kilogram of solution, is considered the gold standard approach to assessing urine concentration, and is a closely related measure to specific gravity [23]. Comparison of pre-acquisition normalization using osmolality and creatinine in a small (n = 5) group of hospital patients with unspecified conditions found osmolality to perform better [29], while two targeted metabolomics studies found post-acquisition normalization using either osmolality or creatinine did not impact a model’s predictive ability to identify patient diagnoses [35], or alter biomarker identification [36]. Together, these studies suggest that while there may be a growing consensus among biomedical and environmental health literature that specific gravity is more reliable than creatinine when adjusting for urinary dilution, there is scant research in the field of metabolomics comparing the approaches.
Similarly, while multiple studies have found PQN to be an optimal technique for normalizing untargeted urinary metabolomics data, to our knowledge none have compared it to non-statistical approaches for adjusting for concentration differences, such as creatinine or specific gravity [22,25,26,27]. In a study of how metabolomics data structures are impacted by normalization methods, Saccenti [34] found that application of a variety of normalization methods to adjust for urinary dilution, including PQN and creatinine, induced spurious correlations. Saccenti suggests that normalization to urine output (volume/unit time), a physiologic parameter, better preserves data structure than internal standard or statistical approaches. We found specific gravity and PQN to perform similarly based on the combined assessment of PCA plots, RSD scores, OPLS-DA results, and Wilcoxon paired sign-rank tests. Importantly, we also found that comparing PQN to other techniques using RSD can yield low RSDs that are an artifact of the calculation process, as QCs normalized using PQN are very close to 1, resulting in a small standard deviation and RSD. In normalization comparisons that include PQN, relying on RSD alone for evaluation could result in misleading conclusions.
The strengths of our study include that it is the first to systematically compare metabolomics normalization strategies for urinary concentration in periconceptional cohorts, using data from both RPLC and HILIC analyses. To our knowledge, no other studies have examined normalization approaches in periconceptional cohorts. Only one other study has examined normalization methods in the context of kidney function changes, focusing on kidney failure patients and comparing pre- and post-acquisition normalization methods [22]. Our study focuses on post-acquisition normalization techniques, which in many cases may be simpler and more feasible, especially for large cohort studies with hundreds or even thousands of subjects where dilution to a common concentration may be impractical. Comparison of normalization strategies in other periconceptional and pregnancy cohorts could provide valuable information on optimal strategies, with application to research efforts seeking to characterize the early life exposome.
Of note, our study has some limitations. The conclusion that specific gravity and PQN are similarly effective normalization approaches for the urinary metabolomics data generated from our cohort may not be generalizable to all periconceptional cohorts. Other methods may be optimal in different settings, in different kinds of cohorts, and at different times during pregnancy. Using specific gravity for normalization demands performing an additional analysis of each specimen. While this is less labor- and cost-intensive than pre-analytical normalization, methods that make use of data collected by mass spectrometry have distinct advantages in large cohort metabolomics studies. Because specific gravity is a measure of density, the presence of heavy molecular weight compounds such as protein or glucose in a urine sample can produce a dense solution, leading to an overestimation of specific gravity [23,39]. Beginning in mid-pregnancy at approximately 20 weeks gestation, glucose resorption by the proximal tubule of the kidneys is less effective, and increases in glomerular filtration rate can lead to protein excretion, although in normal pregnancies glucose and protein excretion do not exceed normal limits [14]. Additionally, in study populations with clinical conditions such as diabetes or hypertension, including during pregnancy, the use of specific gravity for normalization may be less reliable. Because our samples were collected at 3–6 weeks gestation, and subjects were all healthy, this is unlikely to impact our findings related to specific gravity.
Our findings suggest that during the periconceptional period, use of creatinine adjustment for post-analytical concentration normalization of urinary metabolomics data is not a reliable approach. Instead, either specific gravity normalization or PQN are good alternative approaches to adjust for urinary dilution, with application to metabolomics research during the periconceptional period, and other times of changing kidney function.

4. Materials and Methods

4.1. Study Populations

We used urine specimens from the Early Pregnancy Study (EPS), a prospective cohort study conducted in North Carolina from 1982–86 of 221 women without known fertility problems or chronic illnesses who were attempting to become pregnant [40,41]. Briefly, the participants collected daily first morning urine specimens for 6 months if they did not conceive or through 8 weeks gestation if they did conceive. Specimens were collected in polypropylene jars without preservatives and stored for up to 2 weeks in home freezers before being transported on ice to a central freezer and stored at −20 °C [42]. After completion of the original study, the samples were placed in long-term storage at −80 °C at the National Institute of Environmental Health Sciences (NIEHS).
To compare normalization approaches, we randomly selected a subset of 45 subjects from the EPS cohort with a pregnancy resulting in live birth. This study was approved by the Yale University Institutional Review Board (ID 2000022845). Pregnancy was defined as a human chorionic gonadotropin (hCG) level of > 0.025 ng/mL for ≥ 3 consecutive days [41]. After identification of a pregnancy, the day of implantation was defined as the first day of the pregnancy where hCG was ≥ 0.01 ng/mL. Inclusion criteria were no tobacco, alcohol, or marijuana use; white; age 27 to 31; and BMI < 25. Study conceptions that resulted in multiple gestations or that ended with a pregnancy loss were excluded.

4.2. Pooled Urine Samples

As part of a previous research study (NCT01132924) [42], stored daily urine specimens were pooled within menstrual cycles for each individual woman in order to facilitate the analysis of exposures to rapidly metabolized environmental chemicals with high intra-individual variability. Pools were created by selecting three first-morning specimens that were approximately one week apart, and that were collected after the end of menses. Equal aliquots were then drawn from each of these three specimens (Figure 3). The first two samples comprising the pool were pre-ovulation and the third was post-ovulation, but pre-implantation if a participant conceived. Together, these pooled samples reflect an individual’s entire menstrual cycle, and are referred to as “pre-implantation.” “Post-implantation” pooled samples consisted of three specimens approximately one week apart that were collected after the day of implantation. The post-implantation samples correspond to the first 3 to 6 weeks of gestation as measured from the LMP. For all 45 subjects, pooled urine samples from the conception cycle (pre-implantation sample) and from early pregnancy (post-implantation sample) were analyzed, for a total of 90 paired urine samples from two biological states.

4.3. Specimen Preparation

Urine specimens were sent overnight on dry ice and then stored at −80 °C until preparation using a solid phase extraction technique [43]. We created a pooled QC sample consisting of equal aliquots from each sample to account for analytical variability, from which multiple QC samples were extracted. As these QC samples are identical in composition, any changes in data acquired from QC samples during an experiment represents analytical variability, and can be corrected using statistical techniques. All samples were run on 96-well plates consisting of 90 urine samples, 3 pooled QCs, 2 extraction blanks used to ensure the absence of contamination, and 1 internal standard. Urine (75 µL) was combined with distilled water (75 µL), and 120 µL of each sample with 5 µL internal standard was extracted using an Oasis HLB96 µElution Plate (Waters Corporation, Milford, MA, USA). This extract was used for analysis of polar metabolites by HILIC-mass spectrometry (MS). We then passed 150 µL methanol through the extraction plate, and transferred 120 µL of the eluent to the 96-well plate. This plate was used for analysis of non-polar metabolites by RPLC-MS.

4.4. UHPLC/MS Analysis

All specimens were analyzed by ultra high performance liquid chromatography (UHPLC) quadrupole time-of-flight (QToF) mass spectrometry (Xevo-G2-XS-QToFMS, Waters Corporation, Milford, MA, USA), with run-order randomized and QC samples included every 6 injections. For HILIC-MS, 2 µL of each sample were injected onto an Acquity UPLC BEH Amide column (1.7 µm particle size, 2.1 mm × 150 mm ID; Waters Corporation), using a mobile phase composed of A (10 mM ammonium bicarbonate in 5:95 acetonitrile:water) and B (95:5 acetonitrile:water) at 30 °C and a solvent flow rate of 300 µL/min in electrospray ionization (ESI) negative mode. The linear gradient elution began at 95% B (0–1 min), then went to 65% B (1–14 min), 40% B (14–18 min), back to 95% B (18–18.1 min), and held at 95% B to clean the column (18.1–23 min). For RPLC-MS, 2 µL of each sample were injected onto an Acquity UPLC BEH RPLC column (1.7 µm particle size, 2.1 mm × 50 mm; Waters Corporation, Milford, MA, USA), using a mobile phase composed of A (95% water, 5% methanol, and 0.1% formic acid) and B (95% water, 5% methanol, and 0.1% formic acid) at 40 °C and a flow rate of 400 µL/min in ESI positive mode. The linear gradient elution began at 95% A (0–2min), then changed to 5% A (2–12 min) and held at 5% for 30 s, back to 95% A (12.5–13 min), and held at 95% A to regenerate the column (13–14 min).
Mass calibration of the mass spectrometer followed the manufacturer’s guidelines and used leucine enkephalin (200 pg/mL), which was also used as the locking mass compound. The QToF was set to acquire at 30,000 resolution in sensitivity mode over the 25–1000 m/z range, with 1 scan per 300 ms. Desolvation gas temperature was 400 °C, desolvation gas flow 800 L/h of N2, and cone gas flow 100 L/h. The ESI source voltage was set to 3.5 V, and source temperature set at 120 °C. MassLynx software (Waters Corporation, Milford, MA, USA) was used as the operating system.

4.5. Data Processing

ProteoWizard (version 3.0.10158) was used to convert .RAW files to .mzML files. The XCMS package (version 3.8) running on R (version 3.4) was used for deconvolution and peak alignment [44,45,46]. The XCMS-generated peak tables were further processed with the MetCleaning [47] package in R using support vector regression (SVR) to account for analytical variability [20,21,48], and then normalized using creatinine, specific gravity, or PQN in Excel 2016. Before processing the HILIC data using MetCleaning, 95 features (out of 18,977 features) were removed from the dataset where peak area = 0 for ≥ 3 QC samples, resulting in 18,882 features in the SVR-normalized dataset (see Table 1). For the RPLC dataset, use of MetCleaning filtered 17 features (out of 12,811 features), resulting in 12,794 features in the SVR-normalized dataset (see Table 1). The data processing parameters used for XCMS and MetCleaning are included in Appendix A. Please contact the authors regarding data availability.

4.6. Normalization Methods

In metabolomics analyses, concentration normalization techniques may be applied before data acquisition, such as diluting all samples to the same concentration before analysis based on specific gravity or osmolality [22,28,29], or post-acquisition, such as correction using creatinine [29], specific gravity [19,28], or PQN [24,25]. We first used SVR, a non-parametric machine learning method that has been found to out-perform LOESS and robust spline alignments, to account for analytical variability with the MetCleaning R package [20,21,48]. After applying SVR to all datasets, we compared the performance of three post-acquisition normalization approaches to adjust for concentration variability: creatinine, specific gravity, and PQN. In the context of post-analytical normalization, we first carried out analytical normalization and then corrected for concentration differences. In the case of PQN and specific gravity, where QC samples are used for normalization, first accounting for instrumental bias such as signal drift ensures QC peak areas are correct before normalization of each sample. For approaches like creatinine adjustment that use measurements collected by mass spectrometry, normalization of uncorrected peak areas using an uncorrected creatinine value could introduce further error. We therefore chose to first apply analytical normalization to all datasets, and then compare the performance of methods adjusting for concentration variability.
Creatinine normalization was carried out by dividing each peak area by the peak area of creatinine for a given sample [5,6,7,49]. In the HILIC-MS dataset, creatinine was identified as m/z = 112.05132 [M-H] (mass-to-charge ratio, m/z) at rt = 277.282 s (retention time in seconds, rt). In the RPLC-MS dataset, creatinine was identified as m/z = 114.0662 [M+H]+ at rt = 21 s; because this peak was not picked in the initial XCMS processing, peak areas were obtained by manual integration of the raw data in MassLynx (version 4.2, Waters Corporation). Creatinine normalization was applied to the peak table resulting from MetCleaning, after SVR normalization.
Specific gravity (SG) was measured using an Atago PAL-10S Refractometer. Specific gravity normalization was carried out using the following formula:
P A c o r r = P A × S G r e f 1 S G 1
where P A c o r r = specific gravity-corrected peak area, P A = peak area, S G r e f = reference specific gravity (in this case 1.020, the median and mean specific gravity of all samples), and S G = specific gravity measured in sample [5,6,38]. The difference S G 1 approximates the total dissolved urinary solids present in the sample [48]. The effect of this normalization method is to increase peak area when the sample is dilute ( S G < S G r e f ), and decrease peak area when the sample is concentrated ( S G > S G r e f ). Because specific gravity was not measured in the pooled QC samples, the median specific gravity across all samples was used for QC correction. While this could reduce variability due to measurement error of QC specific gravity measurements, because the QCs are comprised of equal aliquots from each sample, it is unlikely that this contributed to improving the reproducibility profile of specific gravity in this study. Specific gravity normalization was applied to the peak table resulting from MetCleaning, after SVR normalization.
The PQN technique uses a “gold standard” spectrum to normalize test features [24,25]. In this study, as per common practice, we used the distribution of QC sample m/z values as the gold standard spectrum. First, the mean peak area of the QC features was calculated for each m/z value. Then, the median between a given test feature peak area and the QC mean peak area was calculated. Finally, the test feature peak area was divided by the median, resulting in a normalized peak area. Features with peak area of 0 for all QC samples were removed before normalization (n = 2 for RPLC data, n = 1 for HILIC data). As with creatinine- and specific gravity-adjustment, PQN was applied to the peak table resulting from MetCleaning, after SVR normalization.

4.7. Statistical Comparison of Normalization Methods

To compare normalization techniques, we first used unsupervised PCA to visually inspect the clustering and relative dispersion of the QC samples [22,30]. Then, we calculated the RSD by dividing the standard deviation of the peak area by the mean peak area across QC samples as a measure of feature reproducibility across QCs, with ideal RSD < 0.3 or 30% [17,20,22,50]. Paired t-tests were used to compare the mean difference in RSD between each normalization method. OPLS-DA was used to determine the variance in the data associated with pre- and post-implantation status, and we included only those peaks for which RSD < 0.3. For each normalization method, we calculated R2X, R2Y, Q2, and the number of features where the variable importance in projection or VIP was greater than 1 [51]. A permutation test was conducted to validate the model, correlating the Q2 and R2 of the original data with the distribution of Q2 and R2 after 200 iterations randomly assigning pre- and post-implantation status [32]. The Wilcoxon paired signed-rank test was used to determine whether there was a statistically significant change in peak area for a given feature from pre- to post-implantation status, for each normalization method. The level of significance was set using a Benjamini–Hochberg false discovery rate (FDR)-corrected p-value (q-value) of q < 0.05 [52]. For each normalization method, we reported the number of features with both q < 0.05 and VIP > 1, common criteria applied to identify potential biomarkers [51]. All analyses were carried out in R (version 3.4.1).

5. Conclusions

In metabolomics studies using urine specimens, accounting for urinary concentration is essential. However, when a health status or outcome of interest alters kidney function, as is the case with normal pregnancy, creatinine may not be a reliable normalization approach. In this paper, we show that specific gravity and PQN are both more effective methods for normalizing urinary periconceptional metabolomics data compared to creatinine. These findings have implications for metabolomics analyses around the time of conception and in contexts where normal kidney function is altered.

Supplementary Materials

The following are available online at https://www.mdpi.com/2218-1989/9/10/198/s1, Figure S1: Results of 200 random permutation tests (permutation by implantation status) performed on RPLC datasets for each normalization method, Figure S2: Results of 200 random permutation tests (permutation by implantation status) performed on HILIC datasets for each normalization method, Table S1: Difference in the original R2 from OPLS-DA, and the R2 calculated from the permutation test.

Author Contributions

Conceptualization, A.K.R.V., N.J.W.R., and C.H.J.; methodology, A.K.R.V., N.J.W.R., and C.H.J.; formal analysis, A.K.R.V., Y.C., and Á.J.S.-N.; investigation, A.K.R.V., N.J.W.R., and C.H.J.; resources, C.H.J. and A.M.Z.J.; writing—original draft preparation, A.K.R.V.; writing—review and editing, A.K.R.V., C.H.J., N.C.D., A.M.Z.J., N.J.W.R., and Y.C.; supervision, C.H.J., A.M.Z.J., and N.C.D.

Funding

This research was funded by a Yale School of Public Health Doctoral Fellowship, and was supported in part by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences under project number Z01ES103333. A.S.N.’s research fellowship abroad (BPE) was funded by grant #2018/01448-9, from São Paulo Research Foundation (FAPESP).

Acknowledgments

The authors would like to thank the Principal Investigators of the EPS cohort, Allen Wilcox, Clarice Weinberg, and Donna Baird, for sharing data and specimens. We would also like to thank staff at NIEHS for measuring specific gravity.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Metabolomics Data Processing Parameters

For XCMS (version 3.8) data processing, the centWave algorithm was used with the following parameters: ppm = 20, snthresh = 5, peakwidth = c (3,25). The “obiwarp” method was used for peak alignment, and the FillPeaks function used to impute missing values. Polarity was set to positive for RPLC data processing and negative for HILIC data processing.
For data processing using the MetCleaning package in R, the following parameters were used: zero.filter = FALSE, obs.zero.cutoff = 0, var.zero.cutoff = 0, qc.outlier.filer = FALSE, subject.outlier.filter = FALSE, threads = 1, mass.tolerance = 30. Polarity was positive for RPLC data and negative for HILIC data.

References

  1. Cai, Y.; Rosen Vollmar, A.K.; Johnson, C.H. Analyzing metabolomics data for environmental health and exposome research. In Computational Methods and Data Analysis for Metabolomics; Li, S., Ed.; Springer Nature: New York, NY, USA, 2019; in press. [Google Scholar]
  2. Robinson, O.; Vrijheid, M. The pregnancy exposome. Curr. Environ. Heal. Rep. 2015, 2, 204–213. [Google Scholar] [CrossRef] [PubMed]
  3. Fleming, T.P.; Watkins, A.J.; Velazquez, M.A.; Mathers, J.C.; Prentice, A.M.; Stephenson, J.; Barker, M.; Saffery, R.; Yajnik, C.S.; Eckert, J.J.; et al. Origins of lifetime health around the time of conception: Causes and consequences. Lancet 2018, 391, 1842–1852. [Google Scholar] [CrossRef]
  4. Barr, D.B.; Wilder, L.C.; Caudill, S.P.; Gonzalez, A.J.; Needham, L.L.; Pirkle, J.L. Urinary creatinine concentrations in the U.S. population: Implications for urinary biologic monitoring measurements. Environ. Health Perspect. 2005, 113, 192–200. [Google Scholar] [CrossRef] [PubMed]
  5. MacPherson, S.; Arbuckle, T.E.; Fisher, M. Adjusting urinary chemical biomarkers for hydration status during pregnancy. J. Expo. Sci. Environ. Epidemiol. 2018, 28, 481–493. [Google Scholar] [CrossRef] [PubMed]
  6. Miller, R.C.; Brindle, E.; Holman, D.J.; Shofer, J.; Klein, N.A.; Soules, M.R.; O’Connor, K.A. Comparison of specific gravity and creatinine for normalizing urinary reproductive hormone concentrations. Clin. Chem. 2004, 50, 924–932. [Google Scholar] [CrossRef] [PubMed]
  7. Carrieri, M.; Trevisan, A.; Bartolucci, G.B. Adjustment to concentration-dilution of spot urine samples: Correlation between specific gravity and creatinine. Int. Arch. Occup. Environ. Health 2001, 74, 63–67. [Google Scholar] [CrossRef]
  8. Hays, S.M.; Aylward, L.L.; Blount, B.C. Variation in urinary flow rates according to demographic characteristics and body mass index in NHANES: Potential confounding of associations between health outcomes and urinary biomarker concentrations. Environ. Health Perspect. 2015, 123, 293–300. [Google Scholar] [CrossRef]
  9. Boeniger, M.F.; Lowry, L.K.; Rosenberg, J. Interpretation of urine results used to assess chemical exposure with emphasis on creatinine adjustments: A review. Am. Ind. Hyg. Assoc. J. 1993, 54, 615–627. [Google Scholar] [CrossRef]
  10. Davison, J.M.; Noble, M.C. Serial changes in 24 hour creatinine clearance during normal menstrual cycles and the first trimester of pregnancy. Br. J. Obstet. Gynaecol. 1981, 88, 10–17. [Google Scholar] [CrossRef]
  11. Odutayo, A.; Hladunewich, M. Obstetric nephrology: Renal hemodynamic and metabolic physiology in normal pregnancy. Clin. J. Am. Soc. Nephrol. 2012, 7, 2073–2080. [Google Scholar] [CrossRef]
  12. Blackburn, S.T. Maternal, Fetal, and Neonatal Physiology, 3rd ed.; Elsevier: St. Louis, MO, USA, 2007. [Google Scholar]
  13. Abduljalil, K.; Furness, P.; Johnson, T.N.; Rostami-Hodjegan, A.; Soltani, H. Anatomical, physiological and metabolic changes with gestational age during normal pregnancy: A database for parameters required in physiologically based pharmacokinetic modelling. Clin. Pharmacokinet. 2012, 51, 365–396. [Google Scholar] [CrossRef] [PubMed]
  14. Cheung, K.L.; Lafayette, R.A. Renal physiology of pregnancy. Adv. Chronic Kidney Dis. 2013, 20, 209–214. [Google Scholar] [CrossRef] [PubMed]
  15. Weaver, V.M.; Kotchmar, D.J.; Fadrowski, J.J.; Silbergeld, E.K. Challenges for environmental epidemiology research: Are biomarker concentrations altered by kidney function or urine concentration adjustment? J. Expo. Sci. Environ. Epidemiol. 2016, 26, 1–8. [Google Scholar] [CrossRef] [PubMed]
  16. Aylward, L.L.; Hays, S.M.; Smolders, R.; Koch, H.M.; Cocker, J.; Jones, K.; Warren, N.; Levy, L.; Bevan, R. Sources of variability in biomarker concentrations. J. Toxicol. Environ. Heal. Part B Crit. Rev. 2014, 17, 45–61. [Google Scholar] [CrossRef] [PubMed]
  17. Warrack, B.M.; Hnatyshyn, S.; Ott, K.H.; Reily, M.D.; Sanders, M.; Zhang, H.; Drexler, D.M. Normalization strategies for metabonomic analysis of urine samples. J. Chromatogr. B Anal. Technol. Biomed Life Sci. 2009, 877, 547–552. [Google Scholar] [CrossRef]
  18. Zhang, T.; Watson, D.G. A short review of applications of liquid chromatography mass spectrometry based metabolomics techniques to the analysis of human urine. Analyst 2015, 140, 2907–2915. [Google Scholar] [CrossRef] [PubMed]
  19. Jacob, C.C.; Dervilly-Pinel, G.; Biancotto, G.; Le Bizec, B. Evaluation of specific gravity as normalization strategy for cattle urinary metabolome analysis. Metabolomics 2014, 10, 627–637. [Google Scholar] [CrossRef]
  20. Shen, X.; Gong, X.; Cai, Y.; Guo, Y.; Tu, J.; Li, H.; Zhang, T.; Wang, J.; Xue, F.; Zhu, Z.J. Normalization and integration of large-scale metabolomics data using support vector regression. Metabolomics 2016, 12, 1–12. [Google Scholar] [CrossRef]
  21. Kuligowski, J.; Sánchez-Illana, Á.; Sanjuán-Herráez, D.; Vento, M.; Quintás, G. Intra-batch effect correction in liquid chromatography-mass spectrometry using quality control samples and support vector regression (QC-SVRC). Analyst 2015, 140, 7810–7817. [Google Scholar] [CrossRef]
  22. Gagnebin, Y.; Tonoli, D.; Lescuyer, P.; Ponte, B.; de Seigneux, S.; Martin, P.Y.; Schappler, J.; Boccard, J.; Rudaz, S. Metabolomic analysis of urine samples by UHPLC-QTOF-MS: Impact of normalization strategies. Anal. Chim. Acta 2017, 955, 27–35. [Google Scholar] [CrossRef]
  23. Chadha, V.; Garg, U.; Alon, U.S. Measurement of urinary concentration: A critical appraisal of methodologies. Pediatr. Nephrol. 2001, 16, 374–382. [Google Scholar] [CrossRef]
  24. Dieterle, F.; Ross, A.; Schlotterbeck, G.; Senn, H. Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. Anal. Chem. 2006, 78, 4281–4290. [Google Scholar] [CrossRef] [PubMed]
  25. Di Guida, R.; Engel, J.; Allwood, J.W.; Weber, R.J.M.; Jones, M.R.; Sommer, U.; Viant, M.R.; Dunn, W.B. Non-targeted UHPLC-MS metabolomic data processing methods: A comparative investigation of normalisation, missing value imputation, transformation and scaling. Metabolomics 2016, 12, 1–14. [Google Scholar] [CrossRef] [PubMed]
  26. Chen, J.; Zhang, P.; Lv, M.; Guo, H.; Huang, Y.; Zhang, Z.; Xu, F. Influences of normalization method on biomarker discovery in gas chromatography-mass spectrometry-based untargeted metabolomics: What should be considered? Anal. Chem. 2017, 89, 5342–5348. [Google Scholar] [CrossRef] [PubMed]
  27. Filzmoser, P.; Walczak, B. What can go wrong at the data normalization step for identification of biomarkers? J. Chromatogr. A 2014, 1362, 194–205. [Google Scholar] [CrossRef]
  28. Edmands, W.M.B.; Ferrari, P.; Scalbert, A. Normalization to specific gravity prior to analysis improves information recovery from high resolution mass spectrometry metabolomic profiles of human urine. Anal. Chem. 2014, 86, 10925–10931. [Google Scholar] [CrossRef]
  29. Chetwynd, A.J.; Abdul-Sada, A.; Holt, S.G.; Hill, E.M. Use of a pre-analysis osmolality normalisation method to correct for variable urine concentrations and for improved metabolomic analyses. J. Chromatogr. A 2016, 1431, 103–110. [Google Scholar] [CrossRef]
  30. Gika, H.G.; Zisi, C.; Theodoridis, G.; Wilson, I.D. Protocol for quality control in metabolic profiling of biological fluids by U(H)PLC-MS. J. Chromatogr. B 2016, 1008, 15–25. [Google Scholar] [CrossRef]
  31. Naz, S.; Vallejo, M.; García, A.; Barbas, C. Method validation strategies involved in non-targeted metabolomics. J. Chromatogr. A 2014, 1353, 99–105. [Google Scholar] [CrossRef]
  32. Triba, M.N.; Le Moyec, L.; Amathieu, R.; Goossens, C.; Bouchemal, N.; Nahon, P.; Rutledge, D.N.; Savarin, P. PLS/OPLS models in metabolomics: The impact of permutation of dataset rows on the K-fold cross-validation quality parameters. Mol. Biosyst. 2015, 11, 13–19. [Google Scholar] [CrossRef]
  33. Ryan, D.; Robards, K.; Prenzler, P.D.; Kendall, M. Recent and potential developments in the analysis of urine: A review. Anal. Chim. Acta 2011, 684, 17–29. [Google Scholar] [CrossRef]
  34. Saccenti, E. Correlation patterns in experimental data are affected by normalization procedures: Consequences for data analysis and network inference. J. Proteome Res. 2017, 16, 619–634. [Google Scholar] [CrossRef]
  35. Khamis, M.M.; Holt, T.; Awad, H.; El-Aneed, A.; Adamko, D.J. Comparative analysis of creatinine and osmolality as urine normalization strategies in targeted metabolomics for the differential diagnosis of asthma and COPD. Metabolomics 2018, 14, 1–8. [Google Scholar] [CrossRef]
  36. Kennedy, A.D.; Miller, M.J.; Beebe, K.; Wulff, J.E.; Evans, A.M.; Miller, L.A.D.; Sutton, V.R.; Sun, Q.; Elsea, S.H. Metabolomic profiling of human urine as a screen for multiple inborn errors of metabolism. Genet. Test. Mol. Biomark. 2016, 20, 485–495. [Google Scholar] [CrossRef]
  37. Heavner, D.L.; Morgan, W.T.; Sears, S.B.; Richardson, J.D.; Byrd, G.D.; Ogden, M.W. Effect of creatinine and specific gravity normalization techniques on xenobiotic biomarkers in smokers’ spot and 24-h urines. J. Pharm. Biomed. Anal. 2006, 40, 928–942. [Google Scholar] [CrossRef]
  38. Haddow, J.E.; Knight, G.J.; Palomaki, G.E.; Neveux, L.M.; Chilmonczyk, B.A. Replacing creatinine measurements with specific gravity values to adjust urine cotinine concentrations. Clin. Chem. 1994, 40, 562–564. [Google Scholar]
  39. Sauvé, J.F.; Lévesque, M.; Huard, M.; Drolet, D.; Lavoué, J.; Tardif, R.; Truchon, G. Creatinine and specific gravity normalization in biological monitoring of occupational exposures. J. Occup. Environ. Hyg. 2015, 12, 123–129. [Google Scholar] [CrossRef]
  40. Wilcox, A.J.; Weinberg, C.R.; Wehmann, R.E.; Armstrong, E.G.; Canfield, R.E.; Nisula, B.C. Measuring early pregnancy loss: Laboratory and field methods. Fertil. Steril. 1985, 44, 366–374. [Google Scholar] [CrossRef]
  41. Wilcox, A.J.; Weinberg, C.R.; O’Connor, J.F.; Baird, D.D.; Schlatterer, J.P.; Canfield, R.E.; Armstrong, E.G.; Nisula, B.C. Incidence of early loss of pregnancy. N. Engl. J. Med. 1988, 319, 189–194. [Google Scholar] [CrossRef]
  42. Jukic, A.M.; Calafat, A.M.; McConnaughey, D.R.; Longnecker, M.P.; Hoppin, J.A.; Weinberg, C.R.; Wilcox, A.J.; Baird, D.D. Urinary concentrations of phthalate metabolites and bisphenol A and associations with follicular-phase length, luteal-phase length, fecundability, and early pregnancy loss. Environ. Health Perspect. 2016, 124, 321–328. [Google Scholar] [CrossRef]
  43. Vuckovic, D. Current trends and challenges in sample preparation for global metabolomics using liquid chromatography-mass spectrometry. Anal. Bioanal. Chem. 2012, 403, 1523–1548. [Google Scholar] [CrossRef]
  44. Smith, C.A.; Want, E.J.; O’Maille, G.; Abagyan, R.; Siuzdak, G. XCMS: Processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal. Chem. 2006, 78, 779–787. [Google Scholar] [CrossRef]
  45. Tautenhahn, R.; Bottcher, C.; Neumann, S. Highly sensitive feature detection for high resolution LC/MS. BMC Bioinform. 2008, 9, 504. [Google Scholar] [CrossRef]
  46. Benton, H.P.; Want, E.J.; Ebbels, T.M.D. Correction of mass calibration gaps in liquid chromatography-mass spectrometry metabolomics data. Bioinformatics 2010, 26, 2488–2489. [Google Scholar] [CrossRef]
  47. Shen, X.; Zhu, Z. MetCleaning. R package version 0.99.85. 2018. [Google Scholar]
  48. Sánchez-Illana, Á.; Pérez-Guaita, D.; Cuesta-García, D.; Sanjuan-Herráez, J.D.; Vento, M.; Ruiz-Cerdá, J.L.; Quintás, G.; Kuligowski, J. Model selection for within-batch effect correction in UPLC-MS metabolomics using quality control—Support vector regression. Anal. Chim. Acta 2018, 1026, 62–68. [Google Scholar] [CrossRef]
  49. Vij, H.S.; Howell, S. Improving the specific gravity adjustment method for assessing urinary concentrations of toxic substances. Am. Ind. Hyg. Assoc. J. 1998, 59, 375–380. [Google Scholar] [CrossRef]
  50. Wu, Y.; Li, L. Sample normalization methods in quantitative metabolomics. J. Chromatogr. A 2016, 1430, 80–95. [Google Scholar] [CrossRef]
  51. Wheelock, Å.M.; Wheelock, C.E. Trials and tribulations of ’omics data analysis: Assessing quality of SIMCA-based multivariate models using examples from pulmonary medicine. Mol. Biosyst. 2013, 9, 2589–2596. [Google Scholar] [CrossRef]
  52. Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
Figure 1. Unsupervised PCA plots comparing normalization approaches for RPLC data showing (A) raw data, (B) SVR-normalized data, (C) SVR and creatinine-normalized data, (D) SVR and specific gravity-normalized data, and (E) SVR and PQN-normalized data. Abbreviations: RPLC, reversed-phase liquid chromatography; SVR, support vector regression; PQN, probabilistic quotient normalization; PC1, principal component 1; PC2, principal component 2; ellipse is a 95% confidence ellipse.
Figure 1. Unsupervised PCA plots comparing normalization approaches for RPLC data showing (A) raw data, (B) SVR-normalized data, (C) SVR and creatinine-normalized data, (D) SVR and specific gravity-normalized data, and (E) SVR and PQN-normalized data. Abbreviations: RPLC, reversed-phase liquid chromatography; SVR, support vector regression; PQN, probabilistic quotient normalization; PC1, principal component 1; PC2, principal component 2; ellipse is a 95% confidence ellipse.
Metabolites 09 00198 g001
Figure 2. Unsupervised PCA plots comparing normalization approaches for HILIC data showing (A) raw data, (B) SVR-normalized data, (C) SVR and creatinine-normalized data, (D) SVR and specific gravity-normalized data, and (E) SVR and PQN-normalized data. Abbreviations: HILIC, hydrophilic interaction chromatography; SVR, support vector regression; PQN, probabilistic quotient normalization; PC1, principal component 1; PC2, principal component 2; ellipse is a 95% confidence ellipse.
Figure 2. Unsupervised PCA plots comparing normalization approaches for HILIC data showing (A) raw data, (B) SVR-normalized data, (C) SVR and creatinine-normalized data, (D) SVR and specific gravity-normalized data, and (E) SVR and PQN-normalized data. Abbreviations: HILIC, hydrophilic interaction chromatography; SVR, support vector regression; PQN, probabilistic quotient normalization; PC1, principal component 1; PC2, principal component 2; ellipse is a 95% confidence ellipse.
Metabolites 09 00198 g002
Figure 3. Pooled sampling strategy for pre- and post-implantation urine specimens. The pre-implantation sample depicted here is a from a theoretical conception cycle during which ovulation occurs on day 14 (the mode in this population). In most conceptions, implantation occurs approximately 8–10 days after ovulation. Gestational weeks are measured from the start of the last menstrual period (LMP), so post-implantation early pregnancy pooled samples span 3–6 weeks gestation.
Figure 3. Pooled sampling strategy for pre- and post-implantation urine specimens. The pre-implantation sample depicted here is a from a theoretical conception cycle during which ovulation occurs on day 14 (the mode in this population). In most conceptions, implantation occurs approximately 8–10 days after ovulation. Gestational weeks are measured from the start of the last menstrual period (LMP), so post-implantation early pregnancy pooled samples span 3–6 weeks gestation.
Metabolites 09 00198 g003
Table 1. Comparison of relative standard deviation of peak area for quality control samples across normalization approaches.1
Table 1. Comparison of relative standard deviation of peak area for quality control samples across normalization approaches.1
Normalization ApproachRPLC DataHILIC Data
Median RSD
(IQR)
Peaks with RSD
< 0.3, n (%)
Median RSD
(IQR)
Peaks with RSD
< 0.3, n (%)
Raw Data0.23 9827/12,811 0.33 8271/18,977
(0.16–0.29)(76.7%)(0.23–0.48)(43.6%)
SVR0.15 12,023/12,794 0.19 15,158/18,882
(0.10–0.20)(94.0%)(0.13–0.27)(80.3%)
SVR and Creatinine0.18 11,744/12,794 0.20 15,104/18,882
(0.15–0.23)(91.8%)(0.13–0.27)(80.0%)
SVR and Specific Gravity0.15 12,023/12,794 0.19 15,158/18,882
(0.10–0.20)(94.0%)(0.13–0.27)(80.3%)
SVR and PQN0.08 12,667/12,794 0.11 18,106/18,882
(0.05–0.11)(99.0%)(0.07–0.16)(95.9%)
1 SVR addresses analytical variability, while creatinine, specific gravity, and PQN address concentration variability. RPLC data represent hydrophobic metabolites, and HILIC data represent hydrophilic metabolites. Raw data has been processed using XCMS without any application of a normalization technique. Abbreviations: RPLC, reversed-phase liquid chromatography; HILIC, hydrophilic interaction chromatography; RSD, relative standard deviation; IQR, interquartile range; SVR, support vector regression; PQN, probabilistic quotient normalization.
Table 2. Comparison of the mean difference in relative standard deviation across quality control samples between normalization approaches using paired t-tests.1
Table 2. Comparison of the mean difference in relative standard deviation across quality control samples between normalization approaches using paired t-tests.1
Compared Normalization ApproachesRPLC DataHILIC Data
RSD Mean Difference 3p-valueRSD Mean Difference 3p-value
Raw and SVR 20.100p < 2.2 × 10−160.160p < 2.2 × 10−16
SVR and Creatinine 2,4−0.040p < 2.2 × 10−16−0.0020.21
SVR and Specific Gravity 2,40--0--
SVR and PQN 20.075p < 2.2 × 10−160.100p < 2.2 × 10−16
Creatinine and Specific Gravity 2,40.040p < 2.2 × 10−160.0020.21
Creatinine and PQN 20.116p < 2.2 × 10−160.103p < 2.2 × 10−16
Specific gravity and PQN 20.075p < 2.2 × 10−160.101p < 2.2 × 10−16
1 Abbreviations: RPLC, reversed-phase liquid chromatography; HILIC, hydrophilic interaction chromatography; RSD, relative standard deviation; SVR, support vector regression; PQN, probabilistic quotient normalization; SGref, reference specific gravity; SG, specific gravity; QC, quality control sample. 2 Raw data were processed using XCMS. SVR normalization was then applied to all datasets. Creatinine, specific gravity, and PQN adjustments were carried out after SVR normalization. 3 RSD mean differences are computed in the order listed, for example, RSDraw–RSDSVR, and expressed as percentages. 4 Because of the specific gravity normalization method in which SGref = SG for QC samples (see Section 4.6, Equation (1)), QC peak areas are the same as those normalized by SVR alone, and therefore there is no difference in the RSD when comparing these methods. Similarly, because QC peak areas are the same for specific gravity and SVR, comparisons between creatinine and SVR and creatinine and specific gravity result in the same mean differences in terms of absolute value, and the same p-values.
Table 3. Results of OPLS-DA analysis of variation associated with pre- and post-implantation status, and Wilcoxon paired signed-rank test of changes in peak area from pre- to post-implantation.1
Table 3. Results of OPLS-DA analysis of variation associated with pre- and post-implantation status, and Wilcoxon paired signed-rank test of changes in peak area from pre- to post-implantation.1
Normalization ApproachRPLC DataHILIC Data
Features, n 3R2XR2YQ2VIP > 1 and q < 0.05, n 4Features, n 3R2XR2YQ2VIP > 1 and q < 0.05, n 4
Raw 298160.180.930.66130382710.250.870.511385
SVR 212,0230.130.950.62142515,1580.220.900.492161
Creatinine 211,7440.360.820.58158915,1040.370.750.271491
Specific Gravity 212,0230.370.860.62159115,1580.200.870.532358
PQN212,6670.180.940.69155118,1060.250.910.522578
1 Abbreviations: OPLS-DA, orthogonal partial least-squares discriminant analysis; RPLC, reversed-phase liquid chromatography; HILIC, hydrophilic interaction chromatography; VIP, variable importance in projection; SVR, support vector regression; PQN, probabilistic quotient normalization. 2 Raw data were processed using XCMS. SVR normalization was then applied to all datasets. Creatinine, specific gravity, and PQN normalizations were then carried out after SVR normalization. 3 Only those features with RSD < 0.3 were included in this analysis. 4 Number of discriminant features with VIP > 1 based on OPLS-DA analysis, and q < 0.05 based on the Wilcoxon paired signed-rank test.

Share and Cite

MDPI and ACS Style

Rosen Vollmar, A.K.; Rattray, N.J.W.; Cai, Y.; Santos-Neto, Á.J.; Deziel, N.C.; Jukic, A.M.Z.; Johnson, C.H. Normalizing Untargeted Periconceptional Urinary Metabolomics Data: A Comparison of Approaches. Metabolites 2019, 9, 198. https://doi.org/10.3390/metabo9100198

AMA Style

Rosen Vollmar AK, Rattray NJW, Cai Y, Santos-Neto ÁJ, Deziel NC, Jukic AMZ, Johnson CH. Normalizing Untargeted Periconceptional Urinary Metabolomics Data: A Comparison of Approaches. Metabolites. 2019; 9(10):198. https://doi.org/10.3390/metabo9100198

Chicago/Turabian Style

Rosen Vollmar, Ana K., Nicholas J. W. Rattray, Yuping Cai, Álvaro J. Santos-Neto, Nicole C. Deziel, Anne Marie Z. Jukic, and Caroline H. Johnson. 2019. "Normalizing Untargeted Periconceptional Urinary Metabolomics Data: A Comparison of Approaches" Metabolites 9, no. 10: 198. https://doi.org/10.3390/metabo9100198

APA Style

Rosen Vollmar, A. K., Rattray, N. J. W., Cai, Y., Santos-Neto, Á. J., Deziel, N. C., Jukic, A. M. Z., & Johnson, C. H. (2019). Normalizing Untargeted Periconceptional Urinary Metabolomics Data: A Comparison of Approaches. Metabolites, 9(10), 198. https://doi.org/10.3390/metabo9100198

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop