Next Article in Journal
Impact of Developmental Changes of GABAA Receptors on Interneuron-NG2 Glia Transmission in the Hippocampus
Next Article in Special Issue
Detection of Interleukin-1 β (IL-1β) in Single Human Blastocyst-Conditioned Medium Using Ultrasensitive Bead-Based Digital Microfluidic Chip and Its Relationship with Embryonic Implantation Potential
Previous Article in Journal
Toxic Effects and Mechanisms of Polybrominated Diphenyl Ethers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Recurrent Implantation Failure: Bioinformatic Discovery of Biomarkers and Identification of Metabolic Subtypes

1
Department of Obstetrics and Gynecology, Peking University People’s Hospital, Beijing 100044, China
2
Reproductive Medical Center, Peking University People’s Hospital, Beijing 100044, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2023, 24(17), 13488; https://doi.org/10.3390/ijms241713488
Submission received: 13 July 2023 / Revised: 2 August 2023 / Accepted: 14 August 2023 / Published: 30 August 2023
(This article belongs to the Special Issue Advancements in Molecular Research on Infertility)

Abstract

:
Recurrent implantation failure (RIF) is a challenging scenario from different standpoints. This study aimed to investigate its correlation with the endometrial metabolic characteristics. Transcriptomics data of 70 RIF and 99 normal endometrium tissues were retrieved from the Gene Expression Omnibus database. Common differentially expressed metabolism-related genes were extracted and various enrichment analyses were applied. Then, RIF was classified using a consensus clustering approach. Three machine learning methods were employed for screening key genes, and they were validated through the RT-qPCR experiment in the endometrium of 10 RIF and 10 healthy individuals. Receiver operator characteristic (ROC) curves were generated and validated by 20 RIF and 20 healthy individuals from Peking University People’s Hospital. We uncovered 109 RIF-related metabolic genes and proposed a novel two-subtype RIF classification according to their metabolic features. Eight characteristic genes (SRD5A1, POLR3E, PPA2, PAPSS1, PRUNE, CA12, PDE6D, and RBKS) were identified, and the area under curve (AUC) was 0.902 and the external validated AUC was 0.867. Higher immune cell infiltration levels were found in RIF patients and a metabolism-related regulatory network was constructed. Our work has explored the metabolic and immune characteristics of RIF, which paves a new road to future investigation of the related pathogenic mechanisms.

1. Introduction

Embryo implantation is a delicate and tightly regulated process, achieved by a synchronized and coordinated crosstalk between the embryo and the endometrium [1]. In recent decades, despite the tremendous advances in reproductive medicine, recurrent implantation failure (RIF) is still a controversial and poorly understood clinical problem. RIF affects about 10% worldwide patients undergoing in vitro fertilization and embryo transfer, being a challenge and a setback for both patients and clinicians [2]. RIF etiology is complex and is usually grouped into three categories involving the receptivity of the endometrium, the embryo, and their interacting environment. Within assisted reproductive technologies in which good-quality embryos are implanted, insufficient endometrial receptivity has resulted in approximately two-thirds of all implantation failures [3]. Thus, identifying molecular markers and clarifying the mechanisms is a strategy with important theoretical and clinical value.
The dynamic endometrial changes during the menstrual cycle are metabolically demanding. Aerobic glycolysis and lactate accumulation are important additional metabolic requirements of the implantation process [4,5,6]. During early pregnancy, the lactated uterine microenvironment seems to be favorable to embryo implantation [7]. During the decidualization process, genes and other factors related to aerobic glycolysis are extensively induced. On the other hand, the inhibition of lactate production can lead to decidua damage [8]. In addition, hyperinsulinemia and insulin resistance have been proven to inhibit the expression of endometrial receptivity markers, such as the insulin-like growth factor type 1 receptor [9]. Clinically, the low fertility of patients with metabolic diseases, such as polycystic ovary syndrome and diabetes, suggests that metabolic imbalance may affect endometrial receptivity and embryo implantation [10]. Nevertheless, the metabolic characteristics related to endometrial receptivity in RIF patients, and their potential influencing mechanisms and regulatory pathways, are still unclear.
Embryo invasion also requires specific immune activation at the maternal-fetal interface [11]. There are several types of immune cells in the endometrium, such as natural killer cells, macrophages, dendritic cells, and T cells. These cells are essential in regulating endometrial receptivity and embryo implantation [12]. Local immune dysfunction can damage endometrial receptivity and lead to RIF. Moreover, impairments in the maternal immune system during pregnancy can improve the susceptibility to viral infections, ultimately leading to an impairment of the embryo formation [13]. Several comprehensive bioinformatic analyses have indicated differences in immune cell infiltration levels between RIF patients and healthy individuals [14]. However, the mechanisms underlying immune cell infiltration in RIF patients are still worthy of further exploration.
In recent years, the rapid development of sequencing technology has provided us with a novel view of gene and protein expression patterns that could help to understand the mechanisms of implantation failure from different aspects [15,16,17]. However, most of these studies have a small sample size and have not been validated through an independent cohort. Moreover, the characteristics and mechanisms related to their metabolic component were not comprehensively analyzed in these studies. In view of the RIF complexity, it is necessary to explore its characteristics and pathogenesis from different perspectives, with the help of the rapidly developing high-throughput sequencing technology. This alternative approach can help to seek more efficient treatment strategies for RIF patients.
In the present study, we aimed to accurately explore metabolism-related genes related to endometrial receptivity, which enables the prediction of RIF occurrence from metabolic features. Moreover, a novel RIF classification, containing two subtypes with different metabolic characteristics, is proposed. Furthermore, we also investigated the immune infiltration and constructed the microRNA (miRNA)-transcription factor (TF)-genes network to gain a better understanding of the potential molecular metabolic process related to RIF occurrence.

2. Results

2.1. Identification of Differentially Expressed Genes and Functional Enrichment Analysis

An overview of the workflow is shown in Figure 1. Initially, we combined the expression profiles of 70 RIF and 99 normal endometrium tissues from the GSE58144, GSE103465 and GSE111974 datasets cohorts. Before removing batch effects, endometrium tissues from different platforms showed significantly different clustering patterns but grouped together after batch correlation (Figure 2A,B and Figure S1A,B). According to the predefined cut-off criteria, we detected a total of 1185 differentially expressed genes (DEGs), including 619 downregulated genes and 566 upregulated genes between the two groups of endometrium tissues among 17407 genes, as presented in Figure 2C,D. Gene Ontology (GO) enrichment analysis revealed that DEGs were enriched in regulation of mRNA metabolic process, histone modification, phosphoprotein phosphatase activity and so on (Figure 2E and Figure S1C,D). In addition, the results of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis indicated that DEGs were enriched in ubiquitin mediated proteolysis, glycerophospholipid metabolism, protein processing in endoplasmic reticulum and other functions (Figure 2F). Moreover, the GO and KEGG enrichment analyses of separated upregulated and downregulated RIF-related DEGs are shown in Figure S2. These results demonstrated that these DEGs were involved in various metabolic processes.

2.2. Expression of Metabolism-Related Genes in Recurrent Implantation Failure Patients

We overlapped 1660 metabolic genes with the DEGs from the three merged datasets. The Venn diagram analysis revealed 109 overlapped metabolism-related genes, which were selected for further analysis. Of these, 54 were upregulated and 55 were downregulated (Figure 3A,B). To analyze the overall expression levels of metabolism-related genes, a volcano plot (Figure S3A) and heatmap (Figure S3B) of the expression levels of these metabolism-related genes were constructed. Through the GO and KEGG functional analyses, these metabolism-related genes were mainly linked to glycolipid metabolic processes, such as the phospholipid, glycerolipid, glycerophospholipid, and cholesterol metabolism, as well as fatty acid metabolism and biosynthesis, glycolysis/gluconeogenesis, and other metabolic pathways (Figure 3C–E, Figures S3C,D and S4A–D).

2.3. Construction and Characterization of Two Metabolic Subtype Models of Recurrent Implantation Failure

Through the consensus clustering approach, RIFs were clustered in accordance with expression profiling of 109 metabolism-related genes. The optimal clustering stability was identified when K = 2, which was determined using a consensus matrix plot, a cumulative distribution function (CDF) plot, relative alterations in the area under the CDF curve, and a tracking plot (Figure 4A and Figure S5A–C). The two metabolic subtypes were termed subtype-A and subtype-B, including 31 and 39 patients, respectively. Principal component analysis (PCA) revealed a remarkable difference between the two subtypes (Figure 4B). The heatmap and boxplot revealed a notable heterogeneity in the expression levels of these metabolism-related genes between two RIF subtypes (Figure 4C,D).
Gene set variation analysis (GSVA) was further conducted to explore any differences in metabolic pathway enrichment between the two subgroups. As shown in Figure 4E,F, the subtype A group was enriched in inflammasomes, inflammatory response, and adhesion molecules. The subtype B group was enriched in the biosynthesis of unsaturated fatty acids, fatty acid metabolism, mitochondrial fatty acid beta-oxidation, and cholesterol biosynthesis and homeostasis. While subtype A might have connections with inflammation pathways, subtype B was more closely associated with lipid metabolism.
DEGs were identified between the two clusters. To explore the underlying signaling mechanisms, functional analyses were performed. A total of 296 DEGs were detected, from which 98 genes were downregulated and 198 genes were upregulated in cluster 2, as compared with cluster 1 (Figure 5A). GO enrichment analysis showed that the DEGs were enriched in the hormone metabolic process, alcohol metabolic process, peptidase regulator activity, and endopeptidase regulator activity (Figure 5B). KEGG analysis showed that the DEGs were enriched in arachidonic acid metabolism, retinol metabolism, and linoleic acid metabolism (Figure 5C).

2.4. Selection of Characteristic Genes by Machine Learning Methods

Three algorithms were utilized for screening characteristic genes among key metabolism-related genes. For the least absolute shrinkage and selection operator (LASSO) algorithm, we selected the minimum criteria for building the LASSO classifier due to higher accuracy by comparisons and 40 characteristic genes were identified (Figure 6A,B). For the random forest algorithm, the top 10 characteristic genes with relative importance > 0.5 were determined (Figure 6C,D). For the support vector machine-recursive feature elimination (SVM-RFE) algorithm, when the feature number was 28, the classifier had the minimum error (Figure 6E). Following the intersection, 8 characteristic genes shared by the three algorithms were finally identified (SRD5A1, POLR3E, PPA2, PAPSS1, PRUNE, CA12, PDE6D, and RBKS; Figure 6F). Compared with the control group, the expression of PAPSS1 was upregulated in the RIF group, while the other genes were downregulated.

2.5. Characteristics of Metabolism-Related Hub Genes

The analysis revealed a positive correlation between the expression levels of SRD5A1, POLR3E, PPA2, PRUNE, CA12, PDE6D, and RBKS. PAPSS1 showed a significant negative correlation with SRD5A1, POLR3E, CA12, and RBKS. Among these genes, the strongest correlation existed between PAPSS1 and RBKS (Figure 7A). Correlation analysis between the eight characteristic genes and all genes from the three datasets was carried out. The 50 genes with the strongest correlation were displayed in a heatmap (Figure S6). Based on the correlation analysis results, the gene set enrichment analysis (GSEA) of the single gene based on KEGG was prosecuted to evaluate signaling pathways involved in the characteristic genes (Figure 7B–I). Among them, PAPSS1 was mainly involved in the metabolic pathways and amino sugar and nucleotide sugar metabolism. PRUNE, PPA2 and CA12 were mainly involved in the metabolic pathways, carbon metabolism and fatty acid metabolism. RBKS was mainly involved in the carbon metabolism and propanoate metabolism. SRD5A1 was mainly involved in the glycolysis/gluconeogenesis, steroid hormone biosynthesis and carbon metabolism.

2.6. Immunological Infiltration Features of Recurrent Implantation Failure

Immunological features were evaluated through the assessment of immune cell infiltration. There were also some interactions between immune cell populations across RIF (Figure 8A). Compared with the control group, more activated B cells, mast cells, and monocytes were found in RIF patients presenting higher immune infiltration levels (Figure 8B). Moreover, as illustrated in Figure 8C, the analyses displayed positive interactions between characteristic genes and immune cell infiltrations. The most strongly associated pairs included SRD5A1 and immature B cell (r = 0.26, p < 0.001), POLR3E and type 17 T helper (Th17) cell (r = −0.42, p < 0.001), PPA2 and neutrophil (r = −0.41, p < 0.001), PRUNE and CD56bright natural killer (NK) cell (r = −0.24, p = 0.002), CA12 and T follicular helper cell (r = 0.39, p < 0.001), PDE6D and CD56bright NK cell (r = −0.44, p < 0.001), RBKS and CD56dim NK cell (r = 0.47, p < 0.001), PAPSS1 and type 1 T helper (Th1) cell (r = −0.47, p < 0.001). Hence, the characteristic genes might modulate immunological features during the occurrence of RIF.

2.7. Diagnostic Efficacy and Validation of Characteristic Genes for Recurrent Implantation Failure Prediction

In the three combined cohorts, the diagnostic performance of each characteristic gene in RIF prediction was evaluated. The AUC values of the receiver operator characteristic (ROC) curves are shown in Figure 9A, indicating that these characteristic genes enabled to estimate the occurrence of RIF. When all of them were fitted into one variable, the area under curves (AUC) of the ROC curve was 0.902. Then, the diagnostic performance of the characteristic genes was validated by our patient cohort. Clinical characteristics of the recruited healthy individuals and RIF patients are presented in Table 1. The AUC was 0.867, which indicates that these eight genes were also potentially diagnostic markers for RIF (Figure 9B).
Next, to predict the prevalence of RIF in patients, a diagnostic nomogram was created based on the eight characteristic genes (Figure 9C). The calibration curves revealed that the line graph model predictions were nearly identical to those of the ideal model (Figure 9D). In addition, the single predicted risk score in the decision curve analysis or the composite genetic model was better than that in the random model. These results imply that decision-making based on the line graph model could be beneficial for RIF patients (Figure 9E).

2.8. Verification of Characteristic Genes

In order to experimentally validate the bioinformatics results, real time-quantitative PCR (RT-qPCR) experiments were performed to compare the expression levels of eight characteristic genes in the endometrium of 10 RIF patients and 10 healthy individuals. The results revealed that, in comparison with the healthy individuals, the mRNA levels of SRD5A1, POLR3E, PPA2, PRUNE, CA12, and RBKS were significantly lower, and the mRNA expression levels of PAPSS1 were significantly higher in the RIF group. Moreover, the mRNA levels of PDE6D in the RIF group were lower than in the control group; however, there was no statistically significant difference between the two groups. This observation indicated that the data mining results are reliable and have further research value (Figure 10).

2.9. Establishment of the microRNA-Transcription Factor-Genes Network

To investigate the associated molecular mechanism, the RegNetwork database was utilized to identify upstream miRNAs and TFs of the eight target genes. Moreover, their interaction pairs (potentially involved in RIF regulation) were retrieved, to generate the regulatory interaction network (Figure 11). In this network, hsa-miR-134 was confirmed as a coregulator of PAPSS1 and POLR3E, hsa-miR-206 was confirmed as a coregulator of PAPSS1 and PDE6D, and hsa-miR-485-5p was confirmed as a coregulator of PRUNE and PDE6D. Furthermore, hsa-miR-580, hsa-miR-30a, and hsa-miR-30b were confirmed as coregulators of PDE6D and POLR3E. In addition, the TFs of MYC and YY1 were identified as a coregulator of SRD5A1, POLR3E, and PDE6D; the TFs of MAX and USF1 were proven to be coregulators of SRD5A1 and POLR3E, and the TF of EGR1 was found to be a coregulator of SRD5A1. In addition, RBKS and PAPSS1, the TFs of HNF4A and BPTF were confirmed to be coregulators of RBKS and PRUNE, and the TFs of REST and SP1 were found to be coregulators of PRUNE and POLR3E.

3. Discussion

In infertile couples, RIF is a highly frustrating and distressing reproductive problem. Understanding its pathogenesis is helpful for its treatment and for improving patient outcomes [18]. Timely implantation of the blastocyst into the uterine endometrium is essential for the initiation of pregnancy. During early pregnancy in mammals, metabolic regulation plays important roles in embryo development and uterine receptivity, ultimately influencing pregnancy efficiency [4,5,6,7,8,19]. Therefore, uncovering metabolic characteristics related to endometrial receptivity and exploring potential druggable mechanisms can have a profound impact on improving the outcome of RIF patients.
Nowadays, the etiology, effective diagnostic markers, and therapeutic strategies of RIF remain to be fully elucidated. In this study, we studied the pathological mechanisms of RIF from a new perspective. Furthermore, we explored a new strategy for the diagnosis of RIF, based on bioinformatic analysis combined with machine learning. Overall, we probed the involvement of metabolism in the pathological mechanisms of RIF. We revealed the metabolic characteristics in the endometrium during the days 20–24 in the luteal phase of the menstrual cycle: (1) Through differential expression analysis, we obtained 109 metabolism-related genes in RIF and investigated their specific molecular mechanisms through functional enrichment analysis. (2) We proposed a novel RIF classification, containing two subtypes, according to their metabolic features, and analyzed their correlation with immune infiltration levels. With this approach, we gain a better understanding of the potential molecular metabolic processes during the occurrence of RIF. (3) We implemented three machine learning algorithms to accurately explore the metabolic genes related to endometrial receptivity, including SRD5A1, POLR3E, PPA2, PAPSS1, PRUNE, CA12, PDE6D, and RBKS, which enabled the identification of RIF occurrence through metabolic characteristics. (4) Then, the diagnostic value of the eight gene expression signatures was evaluated by ROC analysis. This model was validated by an external clinical patient cohort from Peking University People’s Hospital (PKUPH). In addition, the eight key genes involved in RIF were validated in the endometrium samples of the enrolled patients. (5) Higher immune cell infiltration levels were found in RIF, which were positively linked to the characteristic genes. Our study provided some new insights into the potential pathogenesis of RIF and future research direction in this field.
To establish endometrial receptivity, a large amount of energy metabolism is required for the adaptation to changes in endometrial morphology, regulation of the endometrial environment and function, and the preparation for embryo implantation [20]. Among them, glucose and lipid metabolism play an important role in cellular energy and material sources [21]. In the present study, the functional enrichment analysis has uncovered 1185 DEGs between the RIF group and the control group. In addition, 109 overlapped metabolism-related genes were mainly involved in specific glucose and lipid metabolism processes. Previous studies have indicated that adequate glucose uptake and normal metabolism were essential for endometrial differentiation and decidualization, providing a nutritional and receptive milieu that supports embryo implantation [22]. The process of endometrial decidualization is inseparable from the activation of glucose metabolism [23]. In RIF patients, glucose transporter 1 expression in endometrial stromal cells is decreased, indicating impaired glucose metabolism [24]. When the embryo is implanted, the endometrium undergoes epithelial-mesenchymal transition. During this stage, the movement of epithelial cells also depends on sufficient energy supply [25]. In addition, progesterone regulates glucose metabolism through the glucose transporter 1, to promote endometrial receptivity. Such receptivity indirectly reflects the impact of energy metabolism/glycolysis on embryo implantation [26]. Obesity has various deleterious effects on human reproduction [27]. Substantial evidence suggests that an increased body mass index, as well as dyslipidemia and LDL-C, not only affect the quality of the oocytes and embryos, but also interfere with embryo implantation and endometrial receptivity, resulting in poor pregnancy outcomes [28,29,30]. As the essential energy sources of the human body, fatty acids contribute to successful embryo implantation. In both animal and human studies, prostaglandin and phospholipid-derived endocannabinoids are two widely studied lipid-derived molecules involved in the establishment of endometrial receptivity [6]. During implantation, prostaglandin levels increase and affect endometrial receptivity through interactions with endometrial epithelial and stromal cells. Compared with fertile controls, prostaglandin synthesis appears to be disrupted in RIF patients [31]. The levels of most phospholipids remarkably increase in the stroma immediately surrounding the implantation sites, uncovering the complexity of the biological processes involved in embryo implantation [32]. Lysophosphatidic acid 3 signaling is a positive factor in embryo implantation and decidualization. RIF patients have decreased levels of LPA3 in the endometrium [31]. Thus, glucose and lipid metabolism imbalance in the endometrium may be a phenotypic alteration that occurs in RIF patients.
Based on the expression profiling of 109 metabolism-related genes, RIF patients were clustered into two subtypes, which had different metabolic characteristics. GSVA analysis further identified that subtype B was more closely associated with lipid metabolism, while subtype A might be related to inflammation pathways. An inflammatory process within the endometrium may cause cellular and biochemical alterations, leading to the risk of RIF. Endometrial inflammation changes the mechanisms securing the timely arrival of a viable blastocyst in a receptive endometrium [33]. Studies have demonstrated that the expression of the genes potentially associated with embryo receptivity and decidualization is likely downregulated in the endometrium in RIF cases with chronic endometritis [34]. On the other hand, embryo attachment-associated inflammation is a balanced and delicately controlled process. Inflammation-prone locations are favorable sites for embryo implantation. In women with RIF, a local biopsy-induced inflammatory response may facilitate the preparation of the endometrium for implantation [35]. Therefore, this new classification helps to accurately determine the RIF etiology and may provide targeted specific intervention strategies to increase pregnancy probability and promote reproductive health.
Then, the eight metabolism-related genes were selected by performing LASSO regression analysis, and by the random forest and the SVM-RFE algorithms. These genes included SRD5A1, POLR3E, PPA2, PAPSS1, PRUNE, CA12, PDE6D, and RBKS. According to the GSEA analysis, these genes are involved in various metabolic pathways, including amino acid, sugar, and nucleotide sugar metabolism, carbon metabolism, propanoate metabolism, glycolysis/gluconeogenesis, steroid hormone biosynthesis, fatty acid metabolism, and others. They influence the function of the endometrium from different metabolic pathways. A recent study has demonstrated that SRD5A1 deficiency leads to impaired decidualization, structural and functional changes to decidual blood vessels, and transcriptomic changes affecting angiogenesis signaling pathways [36]. SRD5A1 silencing has led to a decrease in the progesterone metabolism rate (with higher concentrations of unmetabolized progesterone), indicating that SRD5A1 plays a crucial role in progesterone metabolism [37]. SULT1E1 and PAPSS1 are responsible for estrogen sulfation, by providing enzymes and a universal sulfate donor [38]. The expression levels of PAPSS1 during the decidualization of human endometrial stromal fibroblast cells were variable [39]. PPA2 is an important mitochondrial metabolic gene, and its abnormal expression levels can result in mitochondrial dysfunction, leading to embryo implantation failure [40,41]. The expression levels of CA12 are significantly upregulated in macrophages of human hepatocellular carcinoma, which can enhance the epithelial mesenchymal transition ability of cancer cells and promote tumor metastasis [42]. The changes prepared for implantation in the endometrium include epithelial-mesenchymal transition and proliferation of endometrial cells, suggesting that CA12 mediated carbon metabolism balance also plays an important role in the formation of endometrial receptivity.
In this study, the immunological features of RIF and the correlation between eight hub genes and immune cell infiltration levels were analyzed through ssGSEA. As depicted above, the results have shown that most innate and adaptive immune cells presented higher infiltration levels in RIF, compared with control patients. A previous study has indicated that both innate (cytotoxic NK cells, M1 macrophages) and adaptive (Th1 cells, Th17 cells, and B cells) immune cell activation led to embryoblast miscarriage in RIF patients [43]. Abnormal uNK cells may generate adverse outcomes during embryo invasion, such as vascular remodeling, local ischemia, and oxidative stress, which are detrimental to implantation [44]. A recent meta-analysis showed that the proportion of CD56+ uNK is significantly increased in RIF patients, when compared with healthy controls [45]. Further experimental research is still needed to elucidate the potential pathophysiology of these immunological characteristic changes.
Furthermore, a risk signature model was established based on gene expression profiles and the coefficients of their association with RIF. Based on this model, the AUC of the ROC curves of the enrolled cohort from our own center was as high as 0.867. With sufficient evidence, we constructed an integrative nomogram to allow for a better prediction of the occurrence of RIF. Last but not least, the eight metabolism-related genes were validated by RT-qPCR in samples from patients from our center. Then, a miRNA-TF-genes network was constructed based on the RegNetwork database, to provide clues for further mechanistic exploration. Further attention to these essential genes may help doctors better manage RIF patients.
Up to now, our study is the first to explore the involvement of metabolism in the pathological mechanisms of RIF. The eight-gene-based metabolism-related characteristic model and the novel classification of RIF, containing two subtypes with different metabolic characteristics, have not been previously published. Nevertheless, in the current study, several limitations should be acknowledged. First, given the complexity and variability of metabolism, larger sample sizes and broader perspectives are needed in order to explore the relations and associations between metabolism and RIF. Second, the sample size of our study was relatively small. Moreover, in vivo and in vitro basic experimental verification is also needed, to elucidate the molecular mechanisms and increase the reliability and accuracy of these results. Additionally, functional validation of the constructed miRNA-mRNA network should be performed in future research.

4. Materials and Methods

4.1. Data Preprocessing

Raw gene expression data of RIF patients were accessed from the GSE58144 [46], GSE103465 [47] and GSE111974 [48] datasets of the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/gds, accessed on 9 February 2023) database, using the “GEOquery” package [49]. The three datasets were separately based on the platforms of GPL15789 (A-UMCU-HS44K-2.0), GPL16043 [GeneChip® PrimeView™ Human Gene Expression Array (with External spike-in RNAs)] and GPL17077 (Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray). They contained endometrium samples from 43 RIF patients and 72 healthy individuals, 3 RIF patients and 3 healthy individuals, 24 RIF patients and 24 healthy individuals, respectively. Their expression profiles were incorporated, and batch effects were directly adjusted utilizing the Combat function of the “sva” package [50]. PCA was applied for the dimensional reduction of the transcriptomics data and for evaluating the performance of the Combat function [51].

4.2. Differentially Expressed Genes Screening

The “limma” R package [52] was applied to perform DEG screening analysis between the RIF and the healthy control groups. The occurrence of false positives was corrected by the Benjamini-Hochberg multiple-test method. To uncover DEGs, volcano maps and heatmaps were generated using the “ggplot2” and the “pheatmap” packages [53,54]. Statistically significantly upregulated or downregulated genes were used for subsequent analysis.
A total of 1660 human metabolism-related genes (from 86 metabolic pathways) were obtained from the KEGG database (https://www.genome.jp/kegg/, accessed on 9 February 2023) [55]. Differentially expressed metabolism-related genes were identified by intersecting DEGs and metabolic genes and further displayed using the “VennDiagram” package [56].

4.3. Molecular Subtypes Identification

Consistency clustering was a resampling-based approach for identifying each member and their subgroup number, as well as validating the cluster. To discover various metabolic patterns, on the basis of those significant metabolism-related DEGs, the “ConsensusClusterPlus” package [57] was implemented.

4.4. Functional Enrichment Analysis

To explore the biological significance of DEGs, we performed Gene Ontology (GO) classification [58] and KEGG pathway analysis using the “clusterProfiler” package [59]. Furthermore, to explore the differences in biological processes between the different subgroups, the KEGG and Reactome gene sets were downloaded from the Molecular Signature Database (http://software.broadinstitute.org/gsea/msigdb, accessed on 9 February 2023) as the reference set [60]. GSVA was performed to demonstrate the signaling pathways alteration between the two clusters using the “GSVA” R package [61]. GSEA was implemented for functionally elucidating the biological significance of characteristic genes. For achieving a normalized enrichment score for each analysis, gene set permutations with 1000 times were conducted. Only terms with a false discovery rate < 0.05 were considered as statistically significant enrichment.

4.5. Characteristic Gene Selection

Three machine learning algorithms, LASSO [62], random forest [63] and SVM-RFE [64], were employed for screening key genes. We used the “randomForest” package [65] for random forest and the “glmnet” package [66] to perform LASSO logistic regression with a turning/penalty parameter utilizing a 10-fold cross-verification. The SVM classifier was created using the R package “kernlab” [67]. The three aforementioned classifiable models’ overlapping genes were then uncovered.

4.6. Receiver Operator Characteristic Analysis and Nomogram Construction

Subsequently, the ROC curves were plotted, and the AUC were separately calculated to evaluate the performance of each signature, using the “rms” and “ROCR” packages [68,69]. Next, characteristic genes were incorporated to establish a nomogram using logistic regression analysis. The calibration curve was utilized for evaluating the accuracy of the nomogram. The clinical usefulness of the nomogram was assessed through decision curve analysis.

4.7. Patient Recruitment for External Validation

Human endometrial tissues were collected from women attending the Department of Obstetrics and Gynecology, PKUPH, China. All samples were surplus tissue from endometrial biopsies obtained from patients for diagnostic purposes, between January 2018 and December 2022. The timing of the endometrial biopsy was 5 days after ovulation (evaluated by ultrasonography), which is equivalent to LH + 7 in a natural cycle. The patients in the RIF group had suffered at least three embryo transfer failures, in which at least four morphologically high-grade embryos were transferred in total. There were no other obvious explanations for the occurrence of RIF. The control group included women who received artificial reproductive technology due to obstruction of the fallopian tube or male infertility and confirmed their ability to conceive.
The total RNA was extracted from patient tissue with TRIzol® (Tiangen Biotech Co., Ltd., Beijing, China) and reverse transcribed into cDNA using the FastQuant RT Kit (Tiangen Biotech Co., Ltd., Beijing, China). RT-qPCR was performed on a MiniOpticon Real-Time PCR #CFB3120EDU (Bio-Rad, Hercules, CA, USA) machine, using the SuperReal PreMix Plus reaction mixture (SYBR Green) (Tiangen Biotech Co., Ltd., Beijing, China). Gene expression levels relative to GAPDH expression levels were assessed using the 2−ΔΔCt method. Experiments were conducted in triplicate. Primer sequences (Sangon Biotech, Shanghai, China) for the reference and candidate genes are listed in Table 2.

4.8. Immune Cell Infiltration Evaluation

Single-sample gene set enrichment analysis (ssGSEA) of the R package “GSVA” was utilized to evaluate the infiltration levels of immune cells in individuals with different metabolic patterns [70].

4.9. Metabolism-Related Transcription Factor/miRNA Regulatory Network Construction

To study the regulatory mechanisms related to the eight crucial genes, TFs, and miRNAs that bind to hub genes, the RegNetwork target gene prediction database (http://www.regnetworkweb.org/, accessed on 12 February 2023) was used. A TF-miRNA-gene regulatory network was constructed and visualized using Cytoscape (Version 3.9.1).

4.10. Statistical Analysis

Statistical analyses were performed using R software (Version 4.2.2). Continuous variables were presented as the mean ± SD (standardized deviation). Categorical variables were described by frequency (n) and proportion (%). Statistically significant differences among variables were assessed using Student’s t-tests, nonparametric tests, Chi-square tests, or one-way ANOVA tests. The correlation between the variables was determined using Pearson’s or Spearman’s correlation test. All statistical tests were two-tailed, and a p value threshold of 0.05 was considered statistically significant.

5. Conclusions

In summary, our study investigated the specific metabolism-related molecular mechanisms of RIF and determined eight characteristic metabolism-related genes (SRD5A1, POLR3E, PPA2, PAPSS1, PRUNE, CA12, PDE6D and RBKS) that could possibly predict the occurrence of RIF. Moreover, we proposed a new molecular classification comprising two RIF subtypes with different metabolic characteristics. Thus, our study may provide new insights into the pathogenesis of metabolic disorders affecting human reproductive health.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms241713488/s1.

Author Contributions

Conceptualization, Y.F.; methodology, Y.F.; experiment, Y.F.; validation, Y.F., N.H. and F.F.; formal analysis, Y.F.; investigation, Y.F.; resources, Y.F. and C.S.; data curation, Y.F. and C.S.; writing—original draft preparation, Y.F.; writing—review and editing, Y.F., L.T. and J.W.; visualization, Y.F.; supervision, L.T. and J.W.; funding acquisition, Y.F. and C.S. All authors have read and agreed to the published version of the manuscript.

Funding

This study is supported by Peking University Medicine Sailing Program for Young Scholars’ Scientific & Technological Innovation (Grant No. BMU2023YFJHPY004), the Natural Science Foundation of Beijing, China (Grant No. 7222200) and the Project Supported By Peking University People’s Hospital Research and Development Funds (Grant No. RDJP2022-16).

Institutional Review Board Statement

This research was approved by the Ethics Committees of Peking University People’s Hospital (approval No. 2018PHB141-01).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets used and/or analyzed in the current study are available in the GEO database (https://www.ncbi.nlm.nih.gov/gds, accessed on 12 July 2023). Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Urman, B.; Yakin, K.; Balaban, B. Recurrent implantation failure in assisted reproduction: How to counsel and manage. B. Treatment options that have not been proven to benefit the couple. Reprod. BioMed. Online 2005, 11, 382–391. [Google Scholar] [CrossRef]
  2. Busnelli, A.; Reschini, M.; Cardellicchio, L.; Vegetti, W.; Somigliana, E.; Vercellini, P. How common is real repeated implantation failure? An indirect estimate of the prevalence. Reprod. BioMed. Online 2020, 40, 91–97. [Google Scholar] [CrossRef]
  3. Simón, C.; Moreno, C.; Remohí, J.; Pellicer, A. Molecular interactions between embryo and uterus in the adhesion phase of human implantation. Hum. Reprod. 1998, 13 (Suppl. S3), 219–232. [Google Scholar] [CrossRef] [PubMed]
  4. Ma, L.N.; Huang, X.B.; Muyayalo, K.P.; Mor, G.; Liao, A.H. Lactic Acid: A Novel Signaling Molecule in Early Pregnancy? Front. Immunol. 2020, 11, 279. [Google Scholar] [CrossRef] [PubMed]
  5. Zhang, H.; Qi, J.; Pei, J.; Zhang, M.; Shang, Y.; Li, Z.; Wang, Y.; Guo, J.; Sun, K.; Fan, J.; et al. O-GlcNAc modification mediates aquaporin 3 to coordinate endometrial cell glycolysis and affects embryo implantation. J. Adv. Res. 2022, 37, 119–131. [Google Scholar] [CrossRef] [PubMed]
  6. Yang, T.; Zhao, J.; Liu, F.; Li, Y. Lipid metabolism and endometrial receptivity. Hum. Reprod. Update 2022, 28, 858–889. [Google Scholar] [CrossRef]
  7. Xiao, S.; Li, R.; El Zowalaty, A.E.; Diao, H.; Zhao, F.; Choi, Y.; Ye, X. Acidification of uterine epithelium during embryo implantation in mice. Biol. Reprod. 2017, 96, 232–243. [Google Scholar] [CrossRef]
  8. Zuo, R.J.; Gu, X.W.; Qi, Q.R.; Wang, T.S.; Zhao, X.Y.; Liu, J.L.; Yang, Z.M. Warburg-like Glycolysis and Lactate Shuttle in Mouse Decidua during Early Pregnancy. J. Biol. Chem. 2015, 290, 21280–21291. [Google Scholar] [CrossRef]
  9. Lathi, R.B.; Hess, A.P.; Tulac, S.; Nayak, N.R.; Conti, M.; Giudice, L.C. Dose-dependent insulin regulation of insulin-like growth factor binding protein-1 in human endometrial stromal cells is mediated by distinct signaling pathways. J. Clin. Endocrinol. Metab. 2005, 90, 1599–1606. [Google Scholar] [CrossRef]
  10. Schulte, M.M.; Tsai, J.H.; Moley, K.H. Obesity and PCOS: The effect of metabolic derangements on endometrial receptivity at the time of implantation. Reprod. Sci. 2015, 22, 6–14. [Google Scholar] [CrossRef]
  11. Díaz-Hernández, I.; Alecsandru, D.; García-Velasco, J.A.; Domínguez, F. Uterine natural killer cells: From foe to friend in reproduction. Hum. Reprod. Update 2021, 27, 720–746. [Google Scholar] [CrossRef] [PubMed]
  12. Erlebacher, A. Immunology of the maternal-fetal interface. Annu. Rev. Immunol. 2013, 31, 387–411. [Google Scholar] [CrossRef]
  13. Mazziotta, C.; Pellielo, G.; Tognon, M.; Martini, F.; Rotondo, J.C. Significantly Low Levels of IgG Antibodies Against Oncogenic Merkel Cell Polyomavirus in Sera From Females Affected by Spontaneous Abortion. Front. Microbiol. 2021, 12, 789991. [Google Scholar] [CrossRef]
  14. Feng, X.; Meng, X.; Guo, S.; Li, K.; Wang, L.; Ai, J. Identification of key genes and immune cell infiltration in recurrent implantation failure: A study based on integrated analysis of multiple microarray studies. Am. J. Reprod. Immunol. 2022, 88, e13607. [Google Scholar] [CrossRef] [PubMed]
  15. Mrozikiewicz, A.E.; Ożarowski, M.; Jędrzejczak, P. Biomolecular Markers of Recurrent Implantation Failure-A Review. Int. J. Mol. Sci. 2021, 22, 82. [Google Scholar] [CrossRef] [PubMed]
  16. Huang, J.; Song, N.; Xia, L.; Tian, L.; Tan, J.; Chen, Q.; Zhu, J.; Wu, Q. Construction of lncRNA-related competing endogenous RNA network and identification of hub genes in recurrent implantation failure. Reprod. Biol. Endocrinol. 2021, 19, 108. [Google Scholar] [CrossRef]
  17. Ma, J.; Gao, W.; Li, D. Recurrent implantation failure: A comprehensive summary from etiology to treatment. Front. Endocrinol. 2022, 13, 1061766. [Google Scholar] [CrossRef]
  18. Coughlan, C.; Ledger, W.; Wang, Q.; Liu, F.; Demirol, A.; Gurgan, T.; Cutting, R.; Ong, K.; Sallam, H.; Li, T.C. Recurrent implantation failure: Definition and management. Reprod. BioMed. Online 2014, 28, 14–38. [Google Scholar] [CrossRef]
  19. Kim, S.T.; Moley, K.H. Regulation of facilitative glucose transporters and AKT/MAPK/PRKAA signaling via estradiol and progesterone in the mouse uterine epithelium. Biol. Reprod. 2009, 81, 188–198. [Google Scholar] [CrossRef]
  20. Chen, Q.; Zhang, A.; Yu, F.; Gao, J.; Liu, Y.; Yu, C.; Zhou, H.; Xu, C. Label-free proteomics uncovers energy metabolism and focal adhesion regulations responsive for endometrium receptivity. J. Proteome Res. 2015, 14, 1831–1842. [Google Scholar] [CrossRef]
  21. Achache, H.; Revel, A. Endometrial receptivity markers, the journey to successful embryo implantation. Hum. Reprod. Update 2006, 12, 731–746. [Google Scholar] [CrossRef]
  22. Frolova, A.I.; Moley, K.H. Glucose transporters in the uterus: An analysis of tissue distribution and proposed physiological roles. Reproduction 2011, 142, 211–220. [Google Scholar] [CrossRef]
  23. Kommagani, R.; Szwarc, M.M.; Kovanci, E.; Gibbons, W.E.; Putluri, N.; Maity, S.; Creighton, C.J.; Sreekumar, A.; DeMayo, F.J.; Lydon, J.P.; et al. Acceleration of the glycolytic flux by steroid receptor coactivator-2 is essential for endometrial decidualization. PLoS Genet. 2013, 9, e1003900. [Google Scholar] [CrossRef]
  24. von Wolff, M.; Ursel, S.; Hahn, U.; Steldinger, R.; Strowitzki, T. Glucose transporter proteins (GLUT) in human endometrium: Expression, regulation, and function throughout the menstrual cycle and in early pregnancy. J. Clin. Endocrinol. Metab. 2003, 88, 3885–3892. [Google Scholar] [CrossRef]
  25. Owusu-Akyaw, A.; Krishnamoorthy, K.; Goldsmith, L.T.; Morelli, S.S. The role of mesenchymal-epithelial transition in endometrial function. Hum. Reprod. Update 2019, 25, 114–133. [Google Scholar] [CrossRef]
  26. Zhang, H.; Qi, J.; Wang, Y.; Sun, J.; Li, Z.; Sui, L.; Fan, J.; Liu, C.; Shang, Y.; Kong, L.; et al. Progesterone Regulates Glucose Metabolism Through Glucose Transporter 1 to Promote Endometrial Receptivity. Front. Physiol. 2020, 11, 543148. [Google Scholar] [CrossRef]
  27. Boutari, C.; Pappas, P.D.; Mintziori, G.; Nigdelis, M.P.; Athanasiadis, L.; Goulis, D.G.; Mantzoros, C.S. The effect of underweight on female and male reproduction. Metabolism 2020, 107, 154229. [Google Scholar] [CrossRef] [PubMed]
  28. Zhang, J.; Liu, H.; Mao, X.; Chen, Q.; Fan, Y.; Xiao, Y.; Wang, Y.; Kuang, Y. Effect of body mass index on pregnancy outcomes in a freeze-all policy: An analysis of 22,043 first autologous frozen-thawed embryo transfer cycles in China. BMC Med. 2019, 17, 114. [Google Scholar] [CrossRef]
  29. Liu, Z.; Cong, J.; Liu, X.; Zhao, H.; Lai, S.; He, S.; Bao, H. Dyslipidemia Is Negatively Associated With the Cumulative Live-Birth Rate in Patients Without PCOS Following IVF/ICSI. Front. Physiol. 2021, 12, 713356. [Google Scholar] [CrossRef] [PubMed]
  30. Cai, W.Y.; Luo, X.; Chen, E.; Lv, H.; Fu, K.; Wu, X.K.; Xu, J. Serum Lipid Levels and Treatment Outcomes in Women Undergoing Assisted Reproduction: A Retrospective Cohort Study. Front. Endocrinol. 2021, 12, 633766. [Google Scholar] [CrossRef] [PubMed]
  31. Achache, H.; Tsafrir, A.; Prus, D.; Reich, R.; Revel, A. Defective endometrial prostaglandin synthesis identified in patients with repeated implantation failure undergoing in vitro fertilization. Fertil. Steril. 2010, 94, 1271–1278. [Google Scholar] [CrossRef]
  32. Burnum, K.E.; Cornett, D.S.; Puolitaival, S.M.; Milne, S.B.; Myers, D.S.; Tranguch, S.; Brown, H.A.; Dey, S.K.; Caprioli, R.M. Spatial and temporal alterations of phospholipids determined by mass spectrometry during mouse embryo implantation. J. Lipid Res. 2009, 50, 2290–2298. [Google Scholar] [CrossRef] [PubMed]
  33. Chen, X.; Liu, Y.; Zhao, Y.; Cheung, W.C.; Zhang, T.; Qi, R.; Chung, J.P.W.; Wang, C.C.; Li, T.C. Association between chronic endometritis and uterine natural killer cell density in women with recurrent miscarriage: Clinical implications. J. Obstet. Gynaecol. Res. 2020, 46, 858–863. [Google Scholar] [CrossRef] [PubMed]
  34. Di Pietro, C.; Cicinelli, E.; Guglielmino, M.R.; Ragusa, M.; Farina, M.; Palumbo, M.A.; Cianci, A. Altered transcriptional regulation of cytokines, growth factors, and apoptotic proteins in the endometrium of infertile women with chronic endometritis. Am. J. Reprod. Immunol. 2013, 69, 509–517. [Google Scholar] [CrossRef] [PubMed]
  35. Goel, T.; Mahey, R.; Bhatla, N.; Kalaivani, M.; Pant, S.; Kriplani, A. Pregnancy after endometrial scratching in infertile couples undergoing ovulation induction and intrauterine insemination cycles-a randomized controlled trial. J. Assist. Reprod. Genet. 2017, 34, 1051–1058. [Google Scholar] [CrossRef]
  36. Shaw, I.W.; Kirkwood, P.M.; Rebourcet, D.; Cousins, F.L.; Ainslie, R.J.; Livingstone, D.E.W.; Smith, L.B.; Saunders, P.T.K.; Gibson, D.A. A role for steroid 5 alpha-reductase 1 in vascular remodeling during endometrial decidualization. Front. Endocrinol. 2022, 13, 1027164. [Google Scholar] [CrossRef]
  37. Sinreih, M.; Anko, M.; Zukunft, S.; Adamski, J.; Rižner, T.L. Important roles of the AKR1C2 and SRD5A1 enzymes in progesterone metabolism in endometrial cancer model cell lines. Chem. Biol. Interact. 2015, 234, 297–308. [Google Scholar] [CrossRef]
  38. Xu, Y.; Liu, X.; Guo, F.; Ning, Y.; Zhi, X.; Wang, X.; Chen, S.; Yin, L.; Li, X. Effect of estrogen sulfation by SULT1E1 and PAPSS on the development of estrogen-dependent cancers. Cancer Sci. 2012, 103, 1000–1009. [Google Scholar] [CrossRef]
  39. Gibson, D.A.; Foster, P.A.; Simitsidellis, I.; Critchley, H.O.D.; Kelepouri, O.; Collins, F.; Saunders, P.T.K. SULFATION PATHWAYS: A role for steroid sulphatase in intracrine regulation of endometrial decidualisation. J. Mol. Endocrinol. 2018, 61, M57–M65. [Google Scholar] [CrossRef] [PubMed]
  40. Muid, K.A.; Kimyon, Ö.; Reza, S.H.; Karakaya, H.C.; Koc, A. Characterization of long living yeast deletion mutants that lack mitochondrial metabolism genes DSS1, PPA2 and AFG3. Gene 2019, 706, 172–180. [Google Scholar] [CrossRef]
  41. Kim, J.; Seli, E. Mitochondria as a biomarker for IVF outcome. Reproduction 2019, 157, R235–R242. [Google Scholar] [CrossRef] [PubMed]
  42. Ning, W.R.; Jiang, D.; Liu, X.C.; Huang, Y.F.; Peng, Z.P.; Jiang, Z.Z.; Kang, T.; Zhuang, S.M.; Wu, Y.; Zheng, L. Carbonic anhydrase XII mediates the survival and prometastatic functions of macrophages in human hepatocellular carcinoma. J. Clin. Investig. 2022, 132, e153110. [Google Scholar] [CrossRef]
  43. Amjadi, F.; Zandieh, Z.; Mehdizadeh, M.; Aghajanpour, S.; Raoufi, E.; Aghamajidi, A.; Aflatoonian, R. The uterine immunological changes may be responsible for repeated implantation failure. J. Reprod. Immunol. 2020, 138, 103080. [Google Scholar] [CrossRef]
  44. Donoghue, J.F.; Paiva, P.; Teh, W.T.; Cann, L.M.; Nowell, C.; Rees, H.; Bittinger, S.; Obers, V.; Bulmer, J.N.; Stern, C.; et al. Endometrial uNK cell counts do not predict successful implantation in an IVF population. Hum. Reprod. 2019, 34, 2456–2466. [Google Scholar] [CrossRef]
  45. Von Woon, E.; Greer, O.; Shah, N.; Nikolaou, D.; Johnson, M.; Male, V. Number and function of uterine natural killer cells in recurrent miscarriage and implantation failure: A systematic review and meta-analysis. Hum. Reprod. Update 2022, 28, 548–582. [Google Scholar] [CrossRef]
  46. Koot, Y.E.; van Hooff, S.R.; Boomsma, C.M.; van Leenen, D.; Groot Koerkamp, M.J.; Goddijn, M.; Eijkemans, M.J.; Fauser, B.C.; Holstege, F.C.; Macklon, N.S. An endometrial gene expression signature accurately predicts recurrent implantation failure after IVF. Sci. Rep. 2016, 6, 19411. [Google Scholar] [CrossRef] [PubMed]
  47. Guo, F.; Si, C.; Zhou, M.; Wang, J.; Zhang, D.; Leung, P.C.K.; Xu, B.; Zhang, A. Decreased PECAM1-mediated TGF-β1 expression in the mid-secretory endometrium in women with recurrent implantation failure. Hum. Reprod. 2018, 33, 832–843. [Google Scholar] [CrossRef]
  48. Bastu, E.; Demiral, I.; Gunel, T.; Ulgen, E.; Gumusoglu, E.; Hosseini, M.K.; Sezerman, U.; Buyru, F.; Yeh, J. Potential Marker Pathways in the Endometrium That May Cause Recurrent Implantation Failure. Reprod. Sci. 2019, 26, 879–890. [Google Scholar] [CrossRef]
  49. Davis, S.; Meltzer, P.S. GEOquery: A bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics 2007, 23, 1846–1847. [Google Scholar] [CrossRef]
  50. Leek, J.T.; Johnson, W.E.; Parker, H.S.; Jaffe, A.E.; Storey, J.D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 2012, 28, 882–883. [Google Scholar] [CrossRef] [PubMed]
  51. Mazziotta, C.; Cervellera, C.F.; Badiale, G.; Vitali, I.; Touzé, A.; Tognon, M.; Martini, F.; Rotondo, J.C. Distinct retinoic gene signatures discriminate Merkel cell polyomavirus-positive from-negative Merkel cell carcinoma cells. J. Med. Virol. 2023, 95, e28949. [Google Scholar] [CrossRef] [PubMed]
  52. Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
  53. Ginestet, C. ggplot2: Elegant Graphics for Data Analysis. J. R. Stat. Soc. Ser. A 2011, 174, 245–246. [Google Scholar] [CrossRef]
  54. Wu, Y.; Zhang, L.; Zhang, Y.; Zhen, Y.; Liu, S. Bioinformatics analysis to screen for critical genes between survived and non-survived patients with sepsis. Mol. Med. Rep. 2018, 18, 3737–3743. [Google Scholar] [CrossRef] [PubMed]
  55. Ogata, H.; Goto, S.; Sato, K.; Fujibuchi, W.; Bono, H.; Kanehisa, M. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 1999, 27, 29–34. [Google Scholar] [CrossRef]
  56. Chen, H.; Boutros, P.C. VennDiagram: A package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinform. 2011, 12, 35. [Google Scholar] [CrossRef]
  57. Wilkerson, M.D.; Hayes, D.N. ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking. Bioinformatics 2010, 26, 1572–1573. [Google Scholar] [CrossRef]
  58. Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef]
  59. clusterProfiler: An R Package for Comparing Biological Themes among Gene Clusters. OMICS J. Integr. Biol. 2012, 16, 284–287. [CrossRef]
  60. Liberzon, A.; Birger, C.; Thorvaldsdóttir, H.; Ghandi, M.; Mesirov, J.P.; Tamayo, P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 2015, 1, 417–425. [Google Scholar] [CrossRef]
  61. Hänzelmann, S.; Castelo, R.; Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-seq data. BMC Bioinform. 2013, 14, 7. [Google Scholar] [CrossRef] [PubMed]
  62. Friedman, J.; Hastie, T.; Tibshirani, R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J. Stat. Softw. 2010, 33, 1–22. [Google Scholar] [CrossRef]
  63. Wang, H.; Yang, F.; Luo, Z. An experimental study of the intrinsic stability of random forest variable importance measures. BMC Bioinform. 2016, 17, 60. [Google Scholar] [CrossRef]
  64. Huang, S.; Cai, N.; Pacheco, P.P.; Narrandes, S.; Wang, Y.; Xu, W. Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genom. Proteom. 2018, 15, 41–51. [Google Scholar] [CrossRef]
  65. RColorBrewer, S.; Liaw, M.A. Package ‘Randomforest’; University of California: Berkeley, CA, USA, 2018; Available online: https://www.stat.berkeley.edu/~breiman/RandomForests/ (accessed on 25 March 2018).
  66. Engebretsen, S.; Bohlin, J. Statistical predictions with glmnet. Clin. Epigenetics 2019, 11, 123. [Google Scholar] [CrossRef]
  67. Karatzoglou, A.; Smola, A.; Hornik, K.; Karatzoglou, M.A. “Package ‘kernlab’”, CRAN R Project. 2019. Available online: http://cran.rediris.es/web/packages/kernlab/index.html (accessed on 31 January 2023).
  68. Harrell, F.E., Jr.; Harrell, M.F.E., Jr.; Hmisc, D. Package ‘rms’. Vanderbilt Univ. 2017, 229, Q8. [Google Scholar]
  69. Sing, T.; Sander, O.; Beerenwinkel, N.; Lengauer, T. ROCR: Visualizing the Performance of Scoring Classifiers. 2020. Available online: https://cran.r-project.org/web/packages/ROCR/index.html (accessed on 12 October 2022).
  70. Barbie, D.A.; Tamayo, P.; Boehm, J.S.; Kim, S.Y.; Moody, S.E.; Dunn, I.F.; Schinzel, A.C.; Sandy, P.; Meylan, E.; Scholl, C.; et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature 2009, 462, 108–112. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Diagram of the study design. Abbreviations: DEGs: differently expressed gene, GO: Gene Oncology, ssGSEA: single-sample gene set enrichment analysis, GSVA: Gene set variation analysis, KEGG: Kyoto Encyclopedia of Genes and Genomes, LASSO: least absolute shrinkage and selection operator, miRNA: microRNA, PKUPH: Peking University People’s Hospital, RIF: recurrent implantation failure, ROC: receiver operating characteristic, RT-qPCR: real time-quantitative PCR, SVM-RFE: support vector machine-recursive feature elimination, TF: transcription factor.
Figure 1. Diagram of the study design. Abbreviations: DEGs: differently expressed gene, GO: Gene Oncology, ssGSEA: single-sample gene set enrichment analysis, GSVA: Gene set variation analysis, KEGG: Kyoto Encyclopedia of Genes and Genomes, LASSO: least absolute shrinkage and selection operator, miRNA: microRNA, PKUPH: Peking University People’s Hospital, RIF: recurrent implantation failure, ROC: receiver operating characteristic, RT-qPCR: real time-quantitative PCR, SVM-RFE: support vector machine-recursive feature elimination, TF: transcription factor.
Ijms 24 13488 g001
Figure 2. Data processing and DEGs identified of the derivation cohort. PCA of GSE58144, GSE103465 and GSE111974 datasets before (A) and after (B) batch correlation. (C) RIF-related DEGs volcano plot with log2FoldChange in the horizontal coordinate and −log10 (adjust p value) in the vertical coordinate. (D) Heatmap of RIF-related DEG expression levels: pink indicates high gene expression, and light blue indicates low gene expression. (E) Main BPs and (F) KEGG pathways enriched by RIF-related DEGs. Each term’s p value is colored according to the legend. Abbreviations: BP: biological process, DEG: differentially expressed gene, GO: Gene Oncology, KEGG: Kyoto Encyclopedia of Genes and Genomes, PCA: principal component analysis, RIF: recurrent implantation failure.
Figure 2. Data processing and DEGs identified of the derivation cohort. PCA of GSE58144, GSE103465 and GSE111974 datasets before (A) and after (B) batch correlation. (C) RIF-related DEGs volcano plot with log2FoldChange in the horizontal coordinate and −log10 (adjust p value) in the vertical coordinate. (D) Heatmap of RIF-related DEG expression levels: pink indicates high gene expression, and light blue indicates low gene expression. (E) Main BPs and (F) KEGG pathways enriched by RIF-related DEGs. Each term’s p value is colored according to the legend. Abbreviations: BP: biological process, DEG: differentially expressed gene, GO: Gene Oncology, KEGG: Kyoto Encyclopedia of Genes and Genomes, PCA: principal component analysis, RIF: recurrent implantation failure.
Ijms 24 13488 g002
Figure 3. Screening of metabolism-related DEGs and enriched items in GO and KEGG analyses. Venn diagram of (A) metabolic genes and upregulated DEGs, and (B) metabolic genes and downregulated DEGs. (C) Enriched items in GO-BP analysis. (D) Enriched items in KEGG pathway analysis. Each term’s p value is colored according to the legend. Different colored bubbles reflect different pathway terms. (E) Net plot showing the top 5 signaling pathways enriched by KEGG analysis. Abbreviations: DEG: differentially expressed gene, GO: Gene Ontology, BP: biological process, KEGG: Kyoto Encyclopedia of Genes and Genomes.
Figure 3. Screening of metabolism-related DEGs and enriched items in GO and KEGG analyses. Venn diagram of (A) metabolic genes and upregulated DEGs, and (B) metabolic genes and downregulated DEGs. (C) Enriched items in GO-BP analysis. (D) Enriched items in KEGG pathway analysis. Each term’s p value is colored according to the legend. Different colored bubbles reflect different pathway terms. (E) Net plot showing the top 5 signaling pathways enriched by KEGG analysis. Abbreviations: DEG: differentially expressed gene, GO: Gene Ontology, BP: biological process, KEGG: Kyoto Encyclopedia of Genes and Genomes.
Ijms 24 13488 g003
Figure 4. Construction of two metabolic subtypes of RIF based on metabolism-related DEGs. (A) Consensus matrix heatmap when K = 2. (B) PCA plots demonstrating that RIF specimens are categorized as two subtypes (subtype-A and subtype-B) in accordance with the expression profiling of metabolism-related DEGs. (C) Heatmap visualizing the expression of metabolism-related genes in the two subgroups. (D) Box plots showing the mRNA expression of characteristic genes in two metabolic subtypes. * p < 0.05; ** p < 0.01; and *** p < 0.001. (E) Reactome and (F) KEGG terms are utilized for GSVA illustrating the difference in metabolic pathways between two subtypes. Abbreviations: DEG: differentially expressed gene, GSVA: gene set variation analysis, KEGG: Kyoto Encyclopedia of Genes and Genomes. PCA: principal component analysis, RIF: recurrent implantation failure.
Figure 4. Construction of two metabolic subtypes of RIF based on metabolism-related DEGs. (A) Consensus matrix heatmap when K = 2. (B) PCA plots demonstrating that RIF specimens are categorized as two subtypes (subtype-A and subtype-B) in accordance with the expression profiling of metabolism-related DEGs. (C) Heatmap visualizing the expression of metabolism-related genes in the two subgroups. (D) Box plots showing the mRNA expression of characteristic genes in two metabolic subtypes. * p < 0.05; ** p < 0.01; and *** p < 0.001. (E) Reactome and (F) KEGG terms are utilized for GSVA illustrating the difference in metabolic pathways between two subtypes. Abbreviations: DEG: differentially expressed gene, GSVA: gene set variation analysis, KEGG: Kyoto Encyclopedia of Genes and Genomes. PCA: principal component analysis, RIF: recurrent implantation failure.
Ijms 24 13488 g004
Figure 5. DEGs analysis of two subtypes and functional analyses. (A) Volcano plot showing the DEGs between the two subgroups. (B) Bar plot visualizing the biological processes enriched by GO analysis. (C) Net plot showing the top 5 signaling pathways enriched by KEGG analysis. Abbreviations: DEG: differentially expressed genes, BP: biological process, CC: cellular component, MF: molecular function, GO: Gene Ontology, KEGG: Kyoto Encyclopedia of Genes and Genomes.
Figure 5. DEGs analysis of two subtypes and functional analyses. (A) Volcano plot showing the DEGs between the two subgroups. (B) Bar plot visualizing the biological processes enriched by GO analysis. (C) Net plot showing the top 5 signaling pathways enriched by KEGG analysis. Abbreviations: DEG: differentially expressed genes, BP: biological process, CC: cellular component, MF: molecular function, GO: Gene Ontology, KEGG: Kyoto Encyclopedia of Genes and Genomes.
Ijms 24 13488 g005
Figure 6. Selection of characteristic genes among key metabolism-related DEGs. (A) LASSO logistic regression algorithm to screen associated genes with ten-time cross-verification. Each curve corresponds to a single gene. (B) LASSO coefficient profiling. (C) Random forest for the relationships between the number of trees and error rate. (D) The rank of genes in accordance with their relative importance. (E) SVM-RFE algorithm for feature selection. (F) Venn diagram showing the overlapping genes. Abbreviations: LASSO: least absolute shrinkage and selection operator, SVM-RFE: support vector machine-recursive feature elimination.
Figure 6. Selection of characteristic genes among key metabolism-related DEGs. (A) LASSO logistic regression algorithm to screen associated genes with ten-time cross-verification. Each curve corresponds to a single gene. (B) LASSO coefficient profiling. (C) Random forest for the relationships between the number of trees and error rate. (D) The rank of genes in accordance with their relative importance. (E) SVM-RFE algorithm for feature selection. (F) Venn diagram showing the overlapping genes. Abbreviations: LASSO: least absolute shrinkage and selection operator, SVM-RFE: support vector machine-recursive feature elimination.
Ijms 24 13488 g006
Figure 7. The correlation analysis and GSEA signaling pathways involved in the characteristic genes. (A) Interactions between characteristic genes at the molecular level. The red line represents positive correlation, the green line represents negative correlation, and the darker the color, the stronger the correlation. (BI) The main signaling pathways that are significantly enriched in high expressions of characteristic genes. The x-axis displays the enrichment fractions, and greater than 0 indicates a positive correlation between genes and pathways, and less than 0 indicates a negative correlation. Abbreviations: GSEA: gene set enrichment analysis.
Figure 7. The correlation analysis and GSEA signaling pathways involved in the characteristic genes. (A) Interactions between characteristic genes at the molecular level. The red line represents positive correlation, the green line represents negative correlation, and the darker the color, the stronger the correlation. (BI) The main signaling pathways that are significantly enriched in high expressions of characteristic genes. The x-axis displays the enrichment fractions, and greater than 0 indicates a positive correlation between genes and pathways, and less than 0 indicates a negative correlation. Abbreviations: GSEA: gene set enrichment analysis.
Ijms 24 13488 g007
Figure 8. Immunological infiltration features of RIF. (A) Heatmaps depicting the correlations between distinct immune cell compositions in RIF. The size of the colored bubbles represents the strength of correlation. The red bubble represents positive correlation, the blue bubble represents negative correlation, and the bigger and darker the color, the stronger the correlation. (B) Box plots depicting the infiltration levels of immune cells in RIF (orange) and normal (blue) tissues. ns p ≥ 0.05, * p < 0.05, and ** p < 0.01. (C) Correlation between each characteristic gene and infiltrating immune cells. The size of the dots represents the strength of the correlation between genes and immune cells; the larger the dots, the stronger the correlation. The color of the dots represents the p value. Abbreviations: RIF: recurrent implantation failure.
Figure 8. Immunological infiltration features of RIF. (A) Heatmaps depicting the correlations between distinct immune cell compositions in RIF. The size of the colored bubbles represents the strength of correlation. The red bubble represents positive correlation, the blue bubble represents negative correlation, and the bigger and darker the color, the stronger the correlation. (B) Box plots depicting the infiltration levels of immune cells in RIF (orange) and normal (blue) tissues. ns p ≥ 0.05, * p < 0.05, and ** p < 0.01. (C) Correlation between each characteristic gene and infiltrating immune cells. The size of the dots represents the strength of the correlation between genes and immune cells; the larger the dots, the stronger the correlation. The color of the dots represents the p value. Abbreviations: RIF: recurrent implantation failure.
Ijms 24 13488 g008
Figure 9. Diagnostic efficacy and validation of characteristic genes in predicting RIF. The ROC curves estimating the diagnostic performance of characteristic genes (A) in the combined GSE58144, GSE103465 and GSE111974 datasets (AUC = 0.902), and (B) in the validated cohort patients enrolled from PKUPH (AUC = 0.867). (C) Establishment of a nomogram integrating characteristic genes for predicting RIF. In the nomogram, each variable corresponds to a score, and the total score can be calculated by adding the scores for all variables. (D) Calibration curve estimates the prediction accuracy of the nomogram. (E) Decision curve analysis shows the clinical benefit of the nomogram. Abbreviations: AUC: area under curve, RIF: recurrent implantation failure, PKUPH: Peking University People’s Hospital, ROC: receiver operating characteristic.
Figure 9. Diagnostic efficacy and validation of characteristic genes in predicting RIF. The ROC curves estimating the diagnostic performance of characteristic genes (A) in the combined GSE58144, GSE103465 and GSE111974 datasets (AUC = 0.902), and (B) in the validated cohort patients enrolled from PKUPH (AUC = 0.867). (C) Establishment of a nomogram integrating characteristic genes for predicting RIF. In the nomogram, each variable corresponds to a score, and the total score can be calculated by adding the scores for all variables. (D) Calibration curve estimates the prediction accuracy of the nomogram. (E) Decision curve analysis shows the clinical benefit of the nomogram. Abbreviations: AUC: area under curve, RIF: recurrent implantation failure, PKUPH: Peking University People’s Hospital, ROC: receiver operating characteristic.
Ijms 24 13488 g009
Figure 10. RT-qPCR validation in the endometrium of days 20–24 in the luteal phase of the menstrual cycle in 20 patients enrolled from PKUPH. the mRNA expression levels of (A) PRUNE, (B) SRD5A1, (C) RBKS, (D) PPA2, (E) POLR3E, (F) CA12 were significantly lower, and the mRNA expression levels of (G) PAPSS1 were significantly higher in the RIF group. The mRNA expression of (H) PDE6D in the RIF group was lower, but there was no statistical difference between the two groups. ns p ≥ 0.05, * p < 0.05; ** p < 0.01. Abbreviations: PKUPH: Peking University People’s Hospital, RIF: recurrent implantation failure, RT-qPCR: real time-quantitative PCR.
Figure 10. RT-qPCR validation in the endometrium of days 20–24 in the luteal phase of the menstrual cycle in 20 patients enrolled from PKUPH. the mRNA expression levels of (A) PRUNE, (B) SRD5A1, (C) RBKS, (D) PPA2, (E) POLR3E, (F) CA12 were significantly lower, and the mRNA expression levels of (G) PAPSS1 were significantly higher in the RIF group. The mRNA expression of (H) PDE6D in the RIF group was lower, but there was no statistical difference between the two groups. ns p ≥ 0.05, * p < 0.05; ** p < 0.01. Abbreviations: PKUPH: Peking University People’s Hospital, RIF: recurrent implantation failure, RT-qPCR: real time-quantitative PCR.
Ijms 24 13488 g010
Figure 11. The miRNAs-TFs-genes networks. Abbreviations: miRNA: microRNA; TFs: transcription factor.
Figure 11. The miRNAs-TFs-genes networks. Abbreviations: miRNA: microRNA; TFs: transcription factor.
Ijms 24 13488 g011
Table 1. Characteristics of patients in validation cohort from Peking University People’s Hospital.
Table 1. Characteristics of patients in validation cohort from Peking University People’s Hospital.
Clinical ParameterControl N = 20RIF N = 20p Value
Age (years)34.00 ± 3.3436.85 ± 2.460.004
BMI (kg/m2)20.10 ± 5.1721.90 ± 5.790.306
Infertility duration (years)3.11 ± 1.764.30 ± 2.230.072
Infertility 0.507
Primary infertility14 (70.00%)12 (60.00%)
Secondary infertility6 (30.00%)8 (40.00%)
Basal FSH (IU/L)8.75 ± 1.778.50 ± 5.120.838
Basal LH (IU/L)4.21 ± 1.813.40 ± 1.880.179
Prolactin (pg/mL)12.37 ± 5.5012.53 ± 6.440.936
Basal estradiol (pg/mL)38.70 ± 13.1167.76 ± 117.020.277
Androgen (pg/mL)1.84 ± 0.601.73 ± 0.960.689
AMH (ng/mL)3.55 ± 2.505.32 ± 7.080.301
AFC6.67 ± 2.645.08 ± 2.760.083
Endometrial type 0.885
Type-A11 (64.71%)9 (64.29%)
Type-B4 (23.53%)4 (28.57%)
Type-C2 (11.76%)1 (7.14%)
Abbreviations: AFC, antral follicle count; AMH, anti-Müllerian hormone; BMI, body mass index; FSH, follicle-stimulating hormone; LH, luteinizing hormone; RIF, recurrent implantation failure.
Table 2. PCR primers.
Table 2. PCR primers.
GeneForward Primer SequenceReverse Primer Sequence
PRUNECTTGAAGATAGGCATGGAGGTTAGGCAACGATCTGTGAAGTCCTGGAAC
SRD5A1CCTGCCGCTCTACCAGTACGTCCTCCTCGCATCAGAAATGGG
RBKSGAAGCAGTTCCTGTAGCAGCATCTGGTGTGTAAGGTTGGCAAAGATTC
PPA2AAGGGAAGATATTCGCCACATAGCGCCACCAAGGAGCCAATGAATC
PDE6DTGAACCTTCGGGATGCTGAGACCCACACCAGGGACAGACAGG
PAPSS1AGCAACCAATGTCACCTACCAAGCAACCACGAAAGCCACCTCTG
CA12TCTTGGTGGCTGGCTTGTAAATGCATCTGTATTGTGGTGGTGGTGTC
POLR3EGCCAACTTGATGAGCCTCCTGGACCAACATCGCCACCTTCTG
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fan, Y.; Shi, C.; Huang, N.; Fang, F.; Tian, L.; Wang, J. Recurrent Implantation Failure: Bioinformatic Discovery of Biomarkers and Identification of Metabolic Subtypes. Int. J. Mol. Sci. 2023, 24, 13488. https://doi.org/10.3390/ijms241713488

AMA Style

Fan Y, Shi C, Huang N, Fang F, Tian L, Wang J. Recurrent Implantation Failure: Bioinformatic Discovery of Biomarkers and Identification of Metabolic Subtypes. International Journal of Molecular Sciences. 2023; 24(17):13488. https://doi.org/10.3390/ijms241713488

Chicago/Turabian Style

Fan, Yuan, Cheng Shi, Nannan Huang, Fang Fang, Li Tian, and Jianliu Wang. 2023. "Recurrent Implantation Failure: Bioinformatic Discovery of Biomarkers and Identification of Metabolic Subtypes" International Journal of Molecular Sciences 24, no. 17: 13488. https://doi.org/10.3390/ijms241713488

APA Style

Fan, Y., Shi, C., Huang, N., Fang, F., Tian, L., & Wang, J. (2023). Recurrent Implantation Failure: Bioinformatic Discovery of Biomarkers and Identification of Metabolic Subtypes. International Journal of Molecular Sciences, 24(17), 13488. https://doi.org/10.3390/ijms241713488

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop