Next Article in Journal
Mass Spectrometry Imaging for Reliable and Fast Classification of Non-Small Cell Lung Cancer Subtypes
Next Article in Special Issue
Prognostic Role of Serum Cytokeratin-19 Fragment (CYFRA 21-1) in Patients with Hepatocellular Carcinoma
Previous Article in Journal
Secreted Frizzled-Related Protein 1 as a Biomarker against Incomplete Age-Related Lobular Involution and Microcalcifications’ Development
Previous Article in Special Issue
Immunotherapy in Hepatocellular Cancer Patients with Mild to Severe Liver Dysfunction: Adjunctive Role of the ALBI Grade
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Pathway-Based Integrative Analysis of Metabolome and Microbiome Data from Hepatocellular Carcinoma and Liver Cirrhosis Patients

1
Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Korea
2
Department of Internal Medicine and Liver Research Institute, Seoul National University College of Medicine, Seoul 03080, Korea
3
Department of Gastroenterology, Ajou University School of Medicine, Suwon 16499, Korea
4
Department of Statistics, Seoul National University, Seoul 08826, Korea
*
Author to whom correspondence should be addressed.
Cancers 2020, 12(9), 2705; https://doi.org/10.3390/cancers12092705
Submission received: 29 July 2020 / Revised: 14 September 2020 / Accepted: 16 September 2020 / Published: 21 September 2020
(This article belongs to the Special Issue Hepatocellular Carcinoma)

Abstract

:

Simple Summary

Hepatocellular carcinoma (HCC) is one of the diseases associated with human microbiome. The human microbiome is known to affect human disease through the metabolites. The aim of this study was to identify the pathways associated with HCC by integrating microbiome and metabolomic data via a novel pathway-based integrative method named HisCoM-MnM representing Hierarchical structural Component Model for pathway analysis of Microbiome and Metabolome. Application of HisCoM-MnM to datasets from HCC and liver cirrhosis (LC) patients successfully identified HCC-related pathways related to cancer metabolic reprogramming along with the significant metabolome and metagenome that make up those pathways.

Abstract

Aberrations of the human microbiome are associated with diverse liver diseases, including hepatocellular carcinoma (HCC). Even if we can associate specific microbes with particular diseases, it is difficult to know mechanistically how the microbe contributes to the pathophysiology. Here, we sought to reveal the functional potential of the HCC-associated microbiome with the human metabolome which is known to play a role in connecting host phenotype to microbiome function. To utilize both microbiome and metabolomic data sets, we propose an innovative, pathway-based analysis, Hierarchical structural Component Model for pathway analysis of Microbiome and Metabolome (HisCoM-MnM), for integrating microbiome and metabolomic data. In particular, we used pathway information to integrate these two omics data sets, thus providing insight into biological interactions between different biological layers, with regard to the host’s phenotype. The application of HisCoM-MnM to data sets from 103 and 97 patients with HCC and liver cirrhosis (LC), respectively, showed that this approach could identify HCC-related pathways related to cancer metabolic reprogramming, in addition to the significant metabolome and metagenome that make up those pathways.

Graphical Abstract

1. Introduction

Recent advances in throughput and improvement in the accuracy of metagenomic sequencing have enabled active research regarding specific microbiomes associated with diseases. The human microbiome has been well implicated in the progression of various diseases. Such diseases include infectious diseases [1,2], metabolic disorders (e.g., obesity [3], type 2 diabetes [4]), respiratory diseases [5], autoimmune disorders [6], mental or psychological illnesses [7,8], gastrointestinal disease (e.g., inflammatory bowel disease [9]), and liver diseases, such as alcoholic liver disease, nonalcoholic fatty liver disease, cirrhosis, and hepatocellular carcinoma (HCC) [10,11,12]. Despite our extensive findings, little progress has been made in discovering the pathways that underlie these diseases, and the role in the pathophysiology of disease. Therefore, the use of other types of biological data can enhance the understanding of biological processes, and functions of the microbiome, in specific disease phenotypes. There are now many studies that associate microbiome data with genomic, epigenomic, transcriptomic, and metabolomic data in humans [13]. Among the numerous types of omics data, metabolome have additionally improved our ability to understand the structure and function of the microbiome, in numerous disease states [14,15]. Because the metagenome contains more metabolic genes than those found in the entire human genome, it plays an essential role in human physiology, with regard to factors such as enzymes, metabolites, and biological pathways. Several recent studies have now paired microbiome and metabolomic data to identify distinct mechanisms connecting the microbiome to human disease. For example, Noecker et al. [16] used a community-based metabolite potential score to estimate relative metabolic creation or depletion capacity of the microbiome, and also attempted to identify key microbial species. This group also compared predicted and measured metabolites, to identify interactions between humans and the microbiome.
However, many challenges remain. Most studies have analyzed microbiome and metabolomic data independently, or focused only on analyzing statistical associations between these two data types, underscoring a need for methods of integration. In this study, we present such a new integration method, Hierarchical structural Component Model for pathway analysis of Microbiome and Metabolome (HisCoM-MnM). This model extends our previously developed Pathway-based approach using HierArchical structure of collapsed RAre variant Of High-throughput sequencing data (PHARAOH) method [17]. PHARAOH is a pathway analysis method for rare genetic variants that analyzes pathways using a single hierarchical model consisting of collapsed gene-level summaries and pathways. This method was later extended to various data types such as common genetic variants, and miRNA and mRNA expression [18,19,20,21,22].
In the current work, we used the main framework of PHARAOH to construct a hierarchical component model for integrating microbiome and metabolomic data, using pathway information. Based on HisCoM-MnM, we aimed to identify disease-related pathways, and elucidate the underlying “crosstalk” between human microbiome and metabolome, by discovering pathways associated with pathogenic metagenome and metabolome specific to distinct diseases, using a single model to consider potential correlations, between pathways, and between microbiome and metabolomic data, using a ridge-type penalty. HisCoM-MnM allows us to understand the relationships between different biological “layers,” based on pathway information related to specific phenotypes.
Here, we applied our HisCoM-MnM model to analyze microbiome and metabolomic data from HCC and liver cirrhosis (LC) patients [23,24]. HCC is the most common type of primary liver cancer, making up > 90% of cases, and is the major leading cause of cancer-related death worldwide [25]. To identify biomarkers for the early detection of HCC, many studies have analyzed its possible associations with the human microbiome [11]. We further aimed to identify HCC-related pathways and KEGG orthologs and metabolites that could discriminate HCC from LC, using HisCoM-MnM, thus providing biological relevance between HCC and LC, in the context of pathways.

2. Results

2.1. Baseline Characteristics

The baseline characteristics of subjects in the HCC and LC groups are shown in Table 1. Subjects in both groups were similar in age and gender, although males were slightly predominant. Subjects in the LC group were more likely to have decompensated liver disease than the HCC group, wherein most of the subjects had compensated liver disease, and all HCC subjects had hepatitis B virus-related liver disease. Serum α-fetoprotein (AFP) levels were higher in the HCC than LC group. About 80% of the HCC cases were of stage I and II, according to the 7th American Joint Committee on Cancer staging system [26].

2.2. Pathway Analysis of HCC and LC Data

We next applied HisCoM-MnM to microbiome and metabolomic data from HCC and LC patients. Hospital information, age, gender, and AFP were included as covariates in the pathway analysis. A total of 111 pathways were commonly mapped between metagenomic and metabolomic data. For these 111 pathways, 2236 unique KEGG orthologs, and 74 unique metabolites, were mapped. HisCoM-MnM successfully found 39 significant pathways with p-values smaller than 0.01, after Bonferroni adjustment. Among these 39 pathways, most were related to metabolism (24 pathways), with other pathways related to organismal systems (5 pathways), human diseases (4 pathways), environmental information processing (3 pathways), cellular processes (2 pathways), and genetic information processing (1 pathway), according to classifications by the KEGG pathway database (Table 2). Among four human disease-related pathways, central carbon metabolism in cancer (KEGG pathway map05230) was in a cancer-related subcategory (Table 2).
The central carbon metabolism cancer pathway contained a total of 25 KEGG orthologs, and metabolites, in HisCoM-MnM. Among these 25, only serine was significant, with a nominal p-value of 0.024. To delineate the details of serine metabolism in this pathway, we checked the results of pathways related to metabolic pathway reprogramming of cancer cells, out of our total 111 pathways, finding that the serine metabolism pathway related to another pathway named glycine, serine, and threonine metabolism. This pathway was significant (nominal p-value of 2.0 × 10−4), and the involvement of serine in this pathway was also significant, with a nominal p-value of 0.041. In Figure 1, we can see the entire metabolic pathways of glycine, serine, and threonine, as generated by KEGG Mapper. Among metabolites, only serine (red circle) showed significant results, and several KEGG orthologs marked with red rectangles showed significant results. Among significant KEGG orthologs, PHGDH (K00058), which encodes a metabolic enzyme involved in serine synthesis, was identified to be significant in HCC, with a nominal p-value of 0.029.
In addition, we identified other amino acid and lipid-metabolic pathways related to HCC, as well as the choline metabolism in cancer (map05231) pathway (Table 3). Choline metabolism in cancer is another significant (p-value of 0.0025) cancer-related pathway. We also found glycerophospholipid metabolism (map00564) as a significant result, with a p-value of 0.0012, consistent with previous findings [27].

2.3. Comparison of HisCoM-MnM to HisCoM-Single Omics

Next, we confirmed whether the multiomics approach, HisCoM-MnM, could find significant results, as well as interpretive aspects, when compared to results from single omics data analysis. First, we applied HisCoM to the same HCC and LC metabolomic and metagenomic data used by HisCoM-MnM. Hospital information, age, gender, and AFP were also used as covariates, and the 111 pathways analyzed by HisCoM-MnM were also analyzed by single omics analysis. Figure 2 shows a pairwise scatter plot comparing p-values of the 111 pathways, as calculated by each method. The correlation between HisCoM-MnM and metagenomic data was higher than that between HisCoM-MnM and metabolomic data, showing that HisCoM-MnM and metagenomic data tend to have more similar pathway results than HisCoM-MnM and metabolomic data. Figure 3 is a Venn diagram showing that six significant pathways were shared between HisCoM-MnM, metagenome, and metabolome, including a cancer-related pathway, central carbon metabolism in cancer (as described earlier). In particular, HisCoM-MnM identified 11 significant pathways not found by the metagenomic and metabolomic single omics model. Many of these were previously observed in HCC, including fatty acid metabolism [28], valine, leucine, and isoleucine biosynthesis [28], selenocompound metabolism [29], and the forkhead box, class O (FOXO) signaling pathway [30]. These HCC-related pathways were only identified by HisCoM-MnM, but not by single omics analysis. Based on the results so far, HisCoM-MnM has successfully identified important pathways well known to associate with HCC and LC.

3. Discussion

This study was designed to identify HCC-related pathways, and KEGG orthologs and metabolites that discriminate HCC from LC. To utilize both microbiome and metabolomic data, we proposed a novel pathway analysis method, HisCoM-MnM, for integrating microbiome and metabolomic data sets. HisCoM-MnM has several advantages over traditional approaches, in that it simultaneously performs not only pathway analysis, but also analyzes components of pathways, such as metabolic genes and metabolites. This method also considers correlations between pathways and between its components. To consider correlations, HisCoM-MnM applies a ridge-type penalization approach. As a result, HisCoM-MnM identified pathways well linked to HCC, suggesting that the development of HCC is more related to amino acid and lipid metabolic reprogramming, compared to LC.
One of the significant pathways was central carbon metabolism in cancer [31]. It is well known that central metabolic pathways operating in cancer cells are fundamentally different from normal cells, in that cancer cells must generate the additional energy and biomass required to support rapid cell division. Therefore, cancer cells use different metabolic pathways to meet increased energy and biosynthesis demands during the process of tumor progression. One well-characterized metabolic phenotype observed in cancer cells is the Warburg effect, a shift from aerobic adenosine triphosphate (ATP) production, through mitochondrial oxidative phosphorylation (OXPHOS), to ATP production through anaerobic glycolysis, which converts glucose to lactate (without OXPHOS). Paradoxically, the Warburg effect is seen even under normal oxygen concentrations, and is widely observed in human cancers, including HCC [32].
We also identified the pathway named glycine, serine and threonine metabolism. It is well known that the serine/glycine biosynthetic pathway plays a crucial role in cancer cell proliferation. Serine and glycine are amino acids that are incorporated into proteins, and also serve as precursor molecules and substrates for cell growth and proliferation [33]. This shared biosynthetic pathway starts by oxidizing the glycolytic intermediate 3-phosphoglycerate into 3-phospho-hydroxy- pyruvate, catalyzed by the enzyme phosphoglycerate dehydrogenase (PHGDH, KEGG component K00058). 3-Phospho-hydroxypyruvate is subsequently metabolized into serine, via phosphoserine aminotransferase 1 (PSAT1, K00831), and phosphoserine phosphatase (PSPH, K01079) [34] and serine hydroxymethyltransferase (SHMT, K00600) next catalyze the conversion of serine into glycine. Moreover, HCC cells are known to support liver tumorigenesis by increasing serine and glycine synthesis by upregulating PHGDH, and other metabolic genes [35]. This is consistent with our findings that we found serine and PHGDH as significant results.
In addition, the dysregulation of amino acids (e.g., glutamine, glutamate, glycine, and aspartate), fatty acids, and bile acids, in HCC versus LC, was reported in previous studies [27,28,36,37,38,39]. In particular, we also found amino-acid- and lipid-metabolism-related pathways to be significant in our study. These findings reflect the Warburg effect, involving a metabolic shift for energy production.
Another significant cancer-related pathway was choline metabolism in cancer. Choline is an essential component of cell membranes, and abnormal choline metabolism is a metabolic feature of oncogenesis and tumor progression [40]. In particular, levels of glycerophosphocholine, phosphatidylcholine, and choline were reported to be elevated in HCC, compared to LC. In addition, increases in the phospholipid, phosphatidylcholine, were observed in early-stage HCC. These findings suggest dysregulation of glycerophospholipid metabolism in the early stages of HCC development. In this study, most HCC patients were of early-stage disease, with about 80% in stage I and II (Table 1). Our study also showed a significant result for glycerophospholipid metabolism, consistent with previous findings [27].
Compared to single omics approaches, HisCoM-MnM has the advantages of identifying significant pathways and aspects of interpretation. By a multiomics approach, HisCoM-MnM identified major HCC-related pathways not found by single omics approaches. For example, HisCoM-MnM detected significant pathways, such as metabolic reprogramming of fatty acids and amino acids. Furthermore, there were pathways known to relate to the suppression of cancer. One of the significant pathway was selenocompound metabolism. The potential for the anticancer effects of the micronutrient element selenium was demonstrated previously [29]. Another significant pathway, FOXO signaling, was reported to play a role in HCC growth inhibition by activating transcription of a thioredoxin-interacting protein considered to be a tumor suppressor [30].
In addition, HisCoM-MnM is a highly utilized method. First, HisCoM-MnM is flexible to the methods used to obtain the pathway-matching information. In this study, we used 16S rRNA-based microbiome data, and PICRUSt, to predict putative functions of the microbiome. Because PICRUSt provides KEGG-based information, we obtained metabolic gene information, as KEGG orthologs matched to KEGG pathways. To integrate metabolomic and microbiome data, through pathway information, we annotated metabolites to KEGG compound IDs, for pathway matching. Although we used the KEGG database for pathway matching, other pathway database and other types of data, such as shotgun sequencing data or untargeted metabolomic data, can also be used, because HisCoM-MnM only requires matching information between pathways, and components of pathways. Second, while HisCoM-MnM was developed for analyzing hepatocellular carcinoma and liver cirrhosis data, it can be applicable to any complex diseases associated with the microbiome.
There are some limitations to the present study. First, this study mainly focused on hepatitis B virus (HBV)-related HCC patients, with no patients with etiologies such as HCV and non-viral. Therefore, further research is required to confirm the generalizability of our results to other HCC etiologies. Second, the information in the microbiome and metabolomic data has not been fully utilized, due to limitations of the annotation. As described before, we used the KEGG database for pathway matching. We annotated metabolites to KEGG compound IDs, generating KEGG orthology tables from 16S rRNA-based microbiome data, using PICRUSt. Then, we matched KEGG IDs with KEGG pathways. In this step, out of 188 metabolites, in targeted metabolomic data sets, only 74 could be annotated to KEGG compound IDs, and matched to distinct pathways. KEGG orthology also excluded those that were not matched to pathways, leaving only 3394 out of 6241 KEGG orthologs. Despite this limitation, HisCoM-MnM successfully discovered HCC-related pathways.

4. Materials and Methods

4.1. HCC and LC Data Sets

We performed integrated analysis of microbiome and metabolomic data originating from our previous studies [23,24]. We considered data sets consisting of 103 HCC patients and 97 LC patients who had both microbiome and metabolomic data. Among them, 100 patients were from Seoul National University Hospital (Seoul, Korea), and the others from Ajou University Hospital (Suwon, Korea). This study was approved by the institutional review board of Seoul National University Hospital (IRB No. 1704-021-843), and was conducted in accordance with the Declaration of Helsinki principles. The samples provided by the Korea Biobank Network were obtained, with informed consent, according to IRB-approved protocols.

4.2. Microbiome Data

For microbiome data, circulating cell-free DNA was extracted for 16S rRNA gene analysis from serum extracellular vesicles (EVs) [23]. First, we processed a 16S rRNA sequencing data set, via a closed-reference operational taxonomic unit (OTU)-picking pipeline, using QIIME [41], against the Greengenes reference database (gg_13_5). OTUs with all sample zero counts were removed from the OTU table. After filtering, a minimum of 731 and a maximum of 46,024 reads per sample were obtained. The OTU table was rarefied to 3000 reads, with 13 samples having low numbers of read counts (< 3000) being filtered.
For microbiome data, normalization of the OTU table was performed by dividing each OTU by the known or predicted 16S copy number abundance. Next, we generated a functional table, using PICRUSt [42], to predict metagenome contents. The output is a table of KEGG orthology (KO) abundances. The resulting table was normalized using MUSiCC [43]. Finally, 3394 unique KEGG orthologs were mapped to 356 pathways, using the KEGG pathway database.

4.3. Metabolomic Data

For metabolomic data, targeted metabolomic analysis using the AbsoluteIDQ® p180 kit (BIOCRATES Life Science AG, Innsbruck, Austria), with liquid chromatography and tandem mass spectrometry, was used to quantify 188 metabolites [24]. We filtered out metabolites with high zero proportion in samples, and normalized by autoscaling for each metabolite.
For metabolomic data, first, we mapped each metabolite to the KEGG COMPOUND ID database, using MetaboAnalyst [44], and checked matching results using databases such as the Human Metabolome Database (HMDB). Finally, 74 unique metabolites were mapped to 126 KEGG pathways.

4.4. Merging Metagenomic and Metabolomic Data, Using Pathway Information

For the pathway analysis, we merged metagenomic and metabolomic data sets using KEGG pathway information [45]. KEGG orthologs and metabolites that mapped to the same pathways were grouped together. There were 111 common pathways between 356 metagenomic pathways and 126 metabolomic pathways. Each common pathway consisted of at least one KEGG orthology and metabolite, and the normalized value for each KEGG orthology and metabolite were used as an input data set for HisCoM.

4.5. Hierarchical Structural Component Model for Pathway Analysis of Microbiome and Metabolome (HisCoM-MnM)

After merging two different types of data sets, pathway analysis could be performed using HisCoM-MnM. The structure of the proposed model is given in Figure 4. This model extends the pathway analysis model of rare variants [17], in that pathways are defined as a weighted component of a set of KEGG orthologs and metabolites.
Let y j denote the phenotype of the j th subject ( j = 1 , , N ) . Let K be the number of pathways and T k and S k be the number of KEGG orthologs and metabolites, respectively, in the k th pathway ( k = 1 , , K ; t = 1 , , T k ; s = 1 , , S k ) . Denote x j k t and x j k s as the respective t th and s th KEGG orthology and metabolite in the k th pathway, for the j th subject. Let w k t and w k s denote weights assigned to x j k t and x j k s . Let f j k be a latent variable representing the main effect of the k th pathway, which is defined as a weighted sum of T k KEGG orthologs and S k metabolites, such that:
f j k = t = 1 T k x j k t w k t + s = 1 S k x j k s w k s
Let β k denote the coefficient connecting the k th pathway to the phenotype. Let c j denote the covariate of the j th subject, and β p denote the coefficients of covariates. Then, the relationship between the binary phenotype and latent variables are established such that:
l o g i t ( π ) =   β 0 + k = 1 K [ t = 1 T k x j k t w k t + s = 1 S k x j k s w k s ] β k + c j β p = β 0 + k = 1 K f j k β k + c j β p
In Figure 4, the first and second layers consist of rectangles and circles that represent the observed variables and latent variables, respectively. As described above, each pathway is defined as a latent variable constructed by a weighted sum of its observed variable consisting of KEGG orthologs and metabolites.
To estimate the model parameters, we used the alternating least squares (ALS) algorithm, which alternates two steps until convergence. In the first step, the pathway coefficient estimates are updated, in the sense of least squares, for fixing the weight coefficient estimates. In the second step, the weight coefficient estimates are updated in the sense of least squares for fixing pathway coefficient estimates. To control the potential correlation between metabolites and KEGG orthologs, and between pathways, we used a ridge-type penalty. Then, we sought to maximize the following penalized log-likelihood function:
ϕ = j = 1 N log p ( y j ; β k ,   δ ) 1 2 λ m k = 1 K [ t = 1 T k w k t 2 + s = 1 S k w k s 2 ] 1 2 λ p k = 0 K β k 2 ,
where p ( y j ; β k ,   δ ) is the probability distribution for y j , and λ m and λ p are ridge parameters for the metabolite, KEGG orthology, and pathway. The values of λ m and λ p are determined before applying the parameter estimation procedure. These tuning parameters are chosen based on five-fold cross validation. After parameter estimation, to test the pathway and each KEGG orthology and metabolite’s significance, p-values were calculated using permutation tests, by generating 100,000 permuted phenotypes.

5. Conclusions

In conclusion, we have developed an innovative method, HisCoM-MnM, applied it to 16S rRNA-based microbiome data, and targeted metabolomic data from HCC and LC patients to identify HCC-related pathways, and KEGG orthologs and metabolites. We found that HisCoM-MnM can discover HCC-related pathways that are consistent with previous findings. Further studies with other etiologies of HCC are warranted, to validate the generalizability of our identification of specific pathology-related pathways.

Author Contributions

Conceptualization, T.P.; validation, T.P.; methodology, B.K. and T.P.; software, B.K.; formal analysis, B.K.; investigation, B.K. and T.P.; resources, S.S.K., J.Y.C., S.W.C., E.J.C. and J.-H.Y.; data curation, B.K.; visualization, B.K.; writing—original draft preparation, B.K.; writing—review and editing, B.K., E.J.C. and T.P.; supervision, T.P.; project administration, T.P.; funding acquisition, E.J.C., J.-H.Y. and T.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Korea Health Technology R&D Project through the Korea Health Industry Development Institute funded by the Ministry of Health & Welfare (No. HI16C2037), and the Bio-Synergy Research Project (2013M3A9C4078158) of the Ministry of Science, ICT and Future Planning through the National Research Foundation (NRF).

Acknowledgments

The biospecimens used in this study were provided by the Ajou University Human Bio-Resource Bank and Seoul National University Hospital-Human Biobank, members of Korea Biobank Network, which are supported by the Ministry of Health and Welfare, Republic of Korea. The author would like to thank Curt Balch, Bioscience Advising (https://bioscienceadvising.com/) for editing this manuscript for English language.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; the collection, analyses, or interpretation of data; the writing of the manuscript; or the decision to publish the results.

Glossary

Microbiotamicroorganisms present in a defined environment [46]
Metagenomethe collection of genomes and genes sequenced from members of a microbiota [46]
Microbiomethe entire habitat, including the microorganisms, their genomes, and the surrounding environmental conditions [46]
Metabolomecollection of all metabolites present in the cell that are participants in general metabolic reactions [47]
Omicslarge-scale study of data representing an entire set of molecules, such as proteins, lipids, or metabolites, in a cell, organ, or organism
Multiomicsan approach that examines multiple omics data sets such as genomics, proteomics, transcriptomics, metabolomics, and microbiomics

References

  1. Hofer, U.; Speck, R.F. Disturbance of the gut-associated lymphoid tissue is associated with disease progression in chronic HIV infection. Semin. Immunopathol. 2009, 31, 257–266. [Google Scholar] [CrossRef] [Green Version]
  2. Sartor, R.B. Microbial influences in inflammatory bowel diseases. Gastroenterology 2008, 134, 577–594. [Google Scholar] [CrossRef]
  3. Ley, R.E.; Turnbaugh, P.J.; Klein, S.; Gordon, J.I. Human gut microbes associated with obesity. Nature 2006, 444, 1022–1023. [Google Scholar] [CrossRef]
  4. Qin, J.; Li, Y.; Cai, Z.; Li, S.; Zhu, J.; Zhang, F.; Liang, S.; Zhang, W.; Guan, Y.; Shen, D.; et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 2012, 490, 55–60. [Google Scholar] [CrossRef] [PubMed]
  5. Noverr, M.C.; Noggle, R.M.; Toews, G.B.; Huffnagle, G.B. Role of antibiotics and fungal microbiota in driving pulmonary allergic responses. Infect. Immun. 2004, 72, 4996–5003. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Wen, L.; Ley, R.E.; Volchkov, P.Y.; Stranges, P.B.; Avanesyan, L.; Stonebraker, A.C.; Hu, C.; Wong, F.S.; Szot, G.L.; Bluestone, J.A.; et al. Innate immunity and intestinal microbiota in the development of Type 1 diabetes. Nature 2008, 455, 1109–1113. [Google Scholar] [CrossRef] [PubMed]
  7. Cryan, J.F.; Dinan, T.G. Mind-altering microorganisms: The impact of the gut microbiota on brain and behaviour. Nat. Rev. Neurosci. 2012, 13, 701–712. [Google Scholar] [CrossRef]
  8. Wang, L.; Christophersen, C.T.; Sorich, M.J.; Gerber, J.P.; Angley, M.T.; Conlon, M.A. Low relative abundances of the mucolytic bacterium akkermansia muciniphila and bifidobacterium spp. in feces of children with autism. Appl. Environ. Microbiol. 2011, 77, 6718–6721. [Google Scholar] [CrossRef] [Green Version]
  9. Packey, C.D.; Sartor, R.B. Commensal bacteria, traditional and opportunistic pathogens, dysbiosis and bacterial killing in inflammatory bowel diseases. Curr. Opin. Infect. Dis. 2009, 22, 292. [Google Scholar] [CrossRef] [Green Version]
  10. Cesaro, C.; Tiso, A.; Del Prete, A.; Cariello, R.; Tuccillo, C.; Cotticelli, G.; del Vecchio Blanco, C.; Loguercio, C.J.D.; Disease, L. Gut microbiota and probiotics in chronic liver diseases. Dig. Liver Dis. 2011, 43, 431–438. [Google Scholar] [CrossRef]
  11. Tripathi, A.; Debelius, J.; Brenner, D.A.; Karin, M.; Loomba, R.; Schnabl, B.; Knight, R. The gut-liver axis and the intersection with the microbiome. Nat. Rev. Gastroenterol. Hepatol. 2018, 15, 397–411. [Google Scholar] [CrossRef] [PubMed]
  12. Wang, B.; Jiang, X.; Cao, M.; Ge, J.; Bao, Q.; Tang, L.; Chen, Y.; Li, L. Altered fecal microbiota correlates with liver biochemistry in nonobese patients with non-alcoholic fatty liver disease. Sci. Rep. 2016, 6, 32002. [Google Scholar] [CrossRef] [PubMed]
  13. Wang, Q.; Wang, K.; Wu, W.; Giannoulatou, E.; Ho, J.W.; Li, L. Host and microbiome multi-omics integration: Applications and methodologies. Biophys. Rev. 2019, 11, 55–65. [Google Scholar] [CrossRef] [PubMed]
  14. Marcobal, A.; Kashyap, P.C.; Nelson, T.A.; Aronov, P.A.; Donia, M.S.; Spormann, A.; Fischbach, M.A.; Sonnenburg, J.L. A metabolomic view of how the human gut microbiota impacts the host metabolome using humanized and gnotobiotic mice. ISME J. 2013, 7, 1933–1943. [Google Scholar] [CrossRef] [Green Version]
  15. Shankar, V.; Homer, D.; Rigsbee, L.; Khamis, H.J.; Michail, S.; Raymer, M.; Reo, N.V.; Paliy, O. The networks of human gut microbe-metabolite associations are different between health and irritable bowel syndrome. ISME J. 2015, 9, 1899–1903. [Google Scholar] [CrossRef] [Green Version]
  16. Noecker, C.; Eng, A.; Srinivasan, S.; Theriot, C.M.; Young, V.B.; Jansson, J.K.; Fredricks, D.N.; Borenstein, E. Metabolic model-based integration of microbiome taxonomic and metabolomic profiles elucidates mechanistic links between ecological and metabolic variation. Msystems 2016, 1, e00013–e00015. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Lee, S.; Choi, S.; Kim, Y.J.; Kim, B.-J.; Consortium, T.d.-G.; Hwang, H.; Park, T. Pathway-based approach using hierarchical components of collapsed rare variants. Bioinformatics 2016, 32, i586–i594. [Google Scholar] [CrossRef] [Green Version]
  18. Choi, S.; Lee, S.; Kim, Y.; Hwang, H.; Park, T. HisCoM-GGI: Hierarchical structural component analysis of gene–gene interactions. J. Bioinf. Comput. Biol. 2018, 16, 1840026. [Google Scholar] [CrossRef] [PubMed]
  19. Jiang, N.; Lee, S.; Park, T. Hierarchical structural component model for pathway analysis of common variants. BMC Med. Genom. 2020, 13, 26. [Google Scholar] [CrossRef]
  20. Kim, Y.; Lee, S.; Choi, S.; Jang, J.-Y.; Park, T. Hierarchical structural component modeling of microRNA-mRNA integration analysis. BMC Bioinf. 2018, 19, 75. [Google Scholar] [CrossRef] [Green Version]
  21. Lee, S.; Kim, Y.; Choi, S.; Hwang, H.; Park, T. Pathway-based approach using hierarchical components of rare variants to analyze multiple phenotypes. BMC Bioinf. 2018, 19, 79. [Google Scholar] [CrossRef] [Green Version]
  22. Mok, L.; Kim, Y.; Lee, S.; Choi, S.; Lee, S.; Jang, J.-Y.; Park, T.J.G. HisCoM-PAGE: Hierarchical structural component models for pathway analysis of gene expression data. Genes 2019, 10, 931. [Google Scholar] [CrossRef] [Green Version]
  23. Cho, E.J.; Leem, S.; Kim, S.A.; Yang, J.; Lee, Y.B.; Kim, S.S.; Cheong, J.Y.; Cho, S.W.; Kim, J.W.; Kim, S.-M.; et al. Circulating microbiota-based metagenomic signature for detection of hepatocellular carcinoma. Sci. Rep. 2019, 9, 7536. [Google Scholar] [CrossRef] [PubMed]
  24. Kim, D.J.; Cho, E.J.; Yu, K.-S.; Jang, I.-J.; Yoon, J.-H.; Park, T.; Cho, J.-Y.J.C. Comprehensive metabolomic search for biomarkers to differentiate early stage hepatocellular carcinoma from cirrhosis. Cancers 2019, 11, 1497. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Galle, P.R.; Forner, A.; Llovet, J.M.; Mazzaferro, V.; Piscaglia, F.; Raoul, J.-L.; Schirmacher, P.; Vilgrain, V. EASL clinical practice guidelines: Management of hepatocellular carcinoma. J. Hepatol. 2018, 69, 182–236. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Edge, S.B.; Compton, C.C. The American joint committee on cancer: The 7th edition of the AJCC cancer staging manual and the future of TNM. Ann. Surg. Oncol. 2010, 17, 1471–1474. [Google Scholar] [CrossRef] [PubMed]
  27. Yang, Y.; Li, C.; Nie, X.; Feng, X.; Chen, W.; Yue, Y.; Tang, H.; Deng, F. Metabonomic studies of human hepatocellular carcinoma using high-resolution magic-angle spinning 1H NMR spectroscopy in conjunction with multivariate data analysis. J. Proteome Res. 2007, 6, 2605–2614. [Google Scholar] [CrossRef] [PubMed]
  28. Gao, R.; Cheng, J.; Fan, C.; Shi, X.; Cao, Y.; Sun, B.; Ding, H.; Hu, C.; Dong, F.; Yan, X. Serum metabolomics to identify the liver disease-specific biomarkers for the progression of hepatitis to hepatocellular carcinoma. Sci. Rep. 2015, 5, 18175. [Google Scholar] [CrossRef] [PubMed]
  29. Bartolini, D.; Sancineto, L.; de Bem, A.F.; Tew, K.D.; Santi, C.; Radi, R.; Toquato, P.; Galli, F. Selenocompounds in cancer therapy: An overview. Adv. Cancer Res. 2017, 136, 259–302. [Google Scholar] [CrossRef]
  30. Yamaguchi, F.; Hirata, Y.; Akram, H.; Kamitori, K.; Dong, Y.; Sui, L.; Tokuda, M. FOXO/TXNIP pathway is involved in the suppression of hepatocellular carcinoma growth by glutamate antagonist MK-801. BMC Cancer 2013, 13, 468. [Google Scholar] [CrossRef] [Green Version]
  31. Cairns, R.A.; Harris, I.S.; Mak, T.W. Regulation of cancer cell metabolism. Nat. Rev. Cancer 2011, 11, 85–95. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Iansante, V.; Choy, P.M.; Fung, S.W.; Liu, Y.; Chai, J.-G.; Dyson, J.; Del Rio, A.; D’Santos, C.; Williams, R.; Chokshi, S.; et al. PARP14 promotes the Warburg effect in hepatocellular carcinoma by inhibiting JNK1-dependent PKM2 phosphorylation and activation. Nat. Commun. 2015, 6, 7882. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. DeBerardinis, R.J. Serine metabolism: Some tumors take the road less traveled. Cell Metab. 2011, 14, 285–286. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Amelio, I.; Cutruzzolá, F.; Antonov, A.; Agostini, M.; Melino, G. Serine and glycine metabolism in cancer. Trends Biochem. Sci. 2014, 39, 191–198. [Google Scholar] [CrossRef] [PubMed]
  35. Woo, C.C.; Chen, W.C.; Teo, X.Q.; Radda, G.K.; Lee, P.T.H. Downregulating serine hydroxymethyltransferase 2 (SHMT2) suppresses tumorigenesis in human hepatocellular carcinoma. Oncotarget 2016, 7, 53005. [Google Scholar] [CrossRef] [PubMed]
  36. Beyoğlu, D.; Imbeaud, S.; Maurhofer, O.; Bioulac-Sage, P.; Zucman-Rossi, J.; Dufour, J.F.; Idle, J.R. Tissue metabolomics of hepatocellular carcinoma: Tumor energy metabolism and the role of transcriptomic classification. Hepatology 2013, 58, 229–238. [Google Scholar] [CrossRef] [Green Version]
  37. Fitian, A.I.; Nelson, D.R.; Liu, C.; Xu, Y.; Ararat, M.; Cabrera, R. Integrated metabolomic profiling of hepatocellular carcinoma in hepatitis C cirrhosis through GC/MS and UPLC/MS-MS. Liver Int. 2014, 34, 1428–1444. [Google Scholar] [CrossRef] [Green Version]
  38. Nahon, P.; Amathieu, R.; Triba, M.N.; Bouchemal, N.; Nault, J.-C.; Ziol, M.; Seror, O.; Dhonneur, G.; Trinchet, J.-C.; Beaugrand, M.; et al. Identification of serum proton NMR metabolomic fingerprints associated with hepatocellular carcinoma in patients with alcoholic cirrhosis. Clin. Cancer Res. 2012, 18, 6714–6722. [Google Scholar] [CrossRef] [Green Version]
  39. Xiao, J.F.; Varghese, R.S.; Zhou, B.; Nezami Ranjbar, M.R.; Zhao, Y.; Tsai, T.-H.; Di Poto, C.; Wang, J.; Goerlitz, D.; Luo, Y. LC-MS based serum metabolomics for identification of hepatocellular carcinoma biomarkers in Egyptian cohort. J. Proteome Res. 2012, 11, 5914–5923. [Google Scholar] [CrossRef] [Green Version]
  40. Glunde, K.; Bhujwalla, Z.M.; Ronen, S.M. Choline metabolism in malignant transformation. Nat. Rev. Cancer 2011, 11, 835–848. [Google Scholar] [CrossRef] [Green Version]
  41. Caporaso, J.G.; Kuczynski, J.; Stombaugh, J.; Bittinger, K.; Bushman, F.D.; Costello, E.K.; Fierer, N.; Peña, A.G.; Goodrich, J.K.; Gordon, J.I.; et al. QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 2010, 7, 335–336. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Langille, M.G.I.; Zaneveld, J.; Caporaso, J.G.; McDonald, D.; Knights, D.; Reyes, J.A.; Clemente, J.C.; Burkepile, D.E.; Vega Thurber, R.L.; Knight, R.; et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat. Biotechnol. 2013, 31, 814–821. [Google Scholar] [CrossRef] [PubMed]
  43. Manor, O.; Borenstein, E. MUSiCC: A marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome. Genome Biol. 2015, 16, 53. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Xia, J.; Psychogios, N.; Young, N.; Wishart, D.S. MetaboAnalyst: A web server for metabolomic data analysis and interpretation. Nucleic Acids Res. 2009, 37, W652–W660. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Kanehisa, M.; Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28, 27–30. [Google Scholar] [CrossRef]
  46. Marchesi, J.R.; Ravel, J. The vocabulary of microbiome research: A proposal. Microbiome 2015, 3, 31. [Google Scholar] [CrossRef] [Green Version]
  47. Mosleth, E.; McLeod, A.; Rud, I.; Axelsson, L.; Solberg, L.; Moen, B.; Gilman, K.; Færgestad, E.; Lysenko, A.; Rawlings, C. Comprehensive Chemometrics, 2nd ed.; Elsevier: Amsterdam, The Netherlands, 2020; pp. 515–567. [Google Scholar]
Figure 1. Glycine, serine, and threonine metabolic pathways. Circles represent compounds and rectangles are enzymes. The KEGG orthologs and metabolites that were significantly related with HCC are marked in red, otherwise in grey.
Figure 1. Glycine, serine, and threonine metabolic pathways. Circles represent compounds and rectangles are enzymes. The KEGG orthologs and metabolites that were significantly related with HCC are marked in red, otherwise in grey.
Cancers 12 02705 g001
Figure 2. Pairwise scatter plot of 111 pathways’ p-values calculated by each method. The upper panel shows the Spearman’s rank correlation coefficient, and the lower panel is a scatter plot of the 111 pathways’ p-values. These p-values, from each method, were adjusted by Bonferroni correction, and a cutoff of 0.01 is represented by the red dotted lines.
Figure 2. Pairwise scatter plot of 111 pathways’ p-values calculated by each method. The upper panel shows the Spearman’s rank correlation coefficient, and the lower panel is a scatter plot of the 111 pathways’ p-values. These p-values, from each method, were adjusted by Bonferroni correction, and a cutoff of 0.01 is represented by the red dotted lines.
Cancers 12 02705 g002
Figure 3. Venn diagram summarizing the number of significant pathways shared among three methods, HisCoM-MnM, metagenome, and metabolome. The number of significant pathways in the Venn diagram was determined by Bonferroni-corrected p-values with a cutoff 0.01.
Figure 3. Venn diagram summarizing the number of significant pathways shared among three methods, HisCoM-MnM, metagenome, and metabolome. The number of significant pathways in the Venn diagram was determined by Bonferroni-corrected p-values with a cutoff 0.01.
Cancers 12 02705 g003
Figure 4. A diagram of the HisCoM-MnM model. Green rectangles represent metabolic genes predicted from microbiome data and blue rectangles represent metabolites. Each pathway consists of metabolic genes and metabolites.
Figure 4. A diagram of the HisCoM-MnM model. Green rectangles represent metabolic genes predicted from microbiome data and blue rectangles represent metabolites. Each pathway consists of metabolic genes and metabolites.
Cancers 12 02705 g004
Table 1. Basic characteristics of the subjects in each HCC (hepatocellular carcinoma) and LC (liver cirrhosis) group.
Table 1. Basic characteristics of the subjects in each HCC (hepatocellular carcinoma) and LC (liver cirrhosis) group.
VariableHCCLCp-Value
(n = 103)(n = 97)
Age, Years59 (52–64)57 (49–62)0.08
Male77 (74.76%)68 (70.10%)0.53
Etiology of Liver Disease <0.001
HBV103 (100.00%)83 (85.57%)
HCV012 (12.37%)
Non-Viral02 (2.06%)
Liver Function <0.001
Compensated94 (91.26%)66 (68.04%)
Decompensated9 (8.74%)31 (31.96%)
α-Fetoprotein, ng/ml14.8 (4.2–216.8)3.1 (1.8–9.8)<0.001
AJCC TNM Stage
I52 (50.48%)
II30 (29.13%)
III13 (12.62%)
IV8 (7.77%)
Data are presented as medians with interquartile ranges (IQRs) or numbers (%), unless otherwise indicated. AJCC, the American Joint Committee on Cancer; HBV, hepatitis B virus; HCV, hepatitis C virus; HCC, hepatocellular carcinoma; LC, liver cirrhosis.
Table 2. Categories and subcategory of KEGG pathways.
Table 2. Categories and subcategory of KEGG pathways.
Pathway CategorySubcategoryNumber of Pathways
MetabolismBiosynthesis of other secondary metabolites724
Metabolism of other amino acids7
Amino acid metabolism4
Carbohydrate metabolism2
Energy metabolism1
Global and overview maps1
Lipid metabolism1
Metabolism of cofactors and vitamins1
Organismal SystemsDigestive system25
Nervous system2
Excretory system1
Human DiseasesCancer: overview14
Drug resistance: antimicrobial1
Neurodegenerative disease1
Substance dependence1
Environmental Information ProcessingSignal transduction33
Cellular ProcessesCell growth and death12
Cell motility1
Genetic Information ProcessingTranslation11
Table 3. Significant pathways, within specific categories and subcategories, related to HCC.
Table 3. Significant pathways, within specific categories and subcategories, related to HCC.
CategoryPathway CategorySub CategoryNamep-Value
CancerHuman DiseasesCancer: overviewCentral carbon metabolism in cancer2.0 × 10−5
Human DiseasesCancer: overviewCholine metabolism in cancer2.54 × 10−3
Amino Acid MetabolismMetabolismAmino acid metabolismGlycine, serine and threonine metabolism2.0 × 10−4
MetabolismAmino acid metabolismValine, leucine and isoleucine biosynthesis8.0 × 10−5
MetabolismAmino acid metabolismAlanine, aspartate and glutamate metabolism9.80 × 10−4
MetabolismAmino acid metabolismValine, leucine and isoleucine degradation3.18 × 10−3
MetabolismMetabolism of other amino acidsD-Glutamine and D-glutamate metabolism2.0 × 10−5
MetabolismMetabolism of other amino acidsD-Alanine metabolism2.0 × 10−5
MetabolismMetabolism of other amino acidsGlutathione metabolism1.40 × 10−4
Lipid MetabolismMetabolismGlobal and overview mapsFatty acid metabolism6.0 × 10−5
MetabolismLipid metabolismFatty acid degradation1.40 × 10−4
MetabolismLipid metabolismGlycerophospholipid metabolism1.18 × 10−3
MetabolismLipid metabolismPrimary bile acid biosynthesis3.78 × 10−3
MetabolismLipid metabolismLinoleic acid metabolism0.029

Share and Cite

MDPI and ACS Style

Kim, B.; Cho, E.J.; Yoon, J.-H.; Kim, S.S.; Cheong, J.Y.; Cho, S.W.; Park, T. Pathway-Based Integrative Analysis of Metabolome and Microbiome Data from Hepatocellular Carcinoma and Liver Cirrhosis Patients. Cancers 2020, 12, 2705. https://doi.org/10.3390/cancers12092705

AMA Style

Kim B, Cho EJ, Yoon J-H, Kim SS, Cheong JY, Cho SW, Park T. Pathway-Based Integrative Analysis of Metabolome and Microbiome Data from Hepatocellular Carcinoma and Liver Cirrhosis Patients. Cancers. 2020; 12(9):2705. https://doi.org/10.3390/cancers12092705

Chicago/Turabian Style

Kim, Boram, Eun Ju Cho, Jung-Hwan Yoon, Soon Sun Kim, Jae Youn Cheong, Sung Won Cho, and Taesung Park. 2020. "Pathway-Based Integrative Analysis of Metabolome and Microbiome Data from Hepatocellular Carcinoma and Liver Cirrhosis Patients" Cancers 12, no. 9: 2705. https://doi.org/10.3390/cancers12092705

APA Style

Kim, B., Cho, E. J., Yoon, J. -H., Kim, S. S., Cheong, J. Y., Cho, S. W., & Park, T. (2020). Pathway-Based Integrative Analysis of Metabolome and Microbiome Data from Hepatocellular Carcinoma and Liver Cirrhosis Patients. Cancers, 12(9), 2705. https://doi.org/10.3390/cancers12092705

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop