Next Article in Journal
Cytochrome P450 1B1 Expression Regulates Intracellular Iron Levels and Oxidative Stress in the Retinal Endothelium
Previous Article in Journal
Organic–Inorganic Hybrid Pigments Based on Bentonite: Strategies to Stabilize the Quinoidal Base Form of Anthocyanin
Previous Article in Special Issue
Follow-Up Biomarkers in the Evolution of Prostate Cancer, Levels of S100A4 as a Detector in Plasma
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Lymphatic Dissemination in Prostate Cancer: Features of the Transcriptomic Profile and Prognostic Models

by
Elena A. Pudova
1,*,
Anastasiya A. Kobelyatskaya
1,
Irina V. Katunina
1,
Anastasiya V. Snezhkina
1,
Maria S. Fedorova
1,
Vladislav S. Pavlov
1,
Ildar R. Bakhtogarimov
1,
Margarita S. Lantsova
1,
Sergey P. Kokin
2,
Kirill M. Nyushko
2,
Boris Ya. Alekseev
2,
Dmitry V. Kalinin
3,
Nataliya V. Melnikova
1,
Alexey A. Dmitriev
1,
George S. Krasnov
1 and
Anna V. Kudryavtseva
1
1
Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
2
National Medical Research Radiological Center, Ministry of Health of the Russian Federation, 125284 Moscow, Russia
3
Vishnevsky Institute of Surgery, Ministry of Health of the Russian Federation, 117997 Moscow, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(3), 2418; https://doi.org/10.3390/ijms24032418
Submission received: 31 October 2022 / Revised: 19 December 2022 / Accepted: 23 January 2023 / Published: 26 January 2023
(This article belongs to the Special Issue Advances in Agressive Prostate Cancer)

Abstract

:
Radical prostatectomy is the gold standard treatment for prostate cancer (PCa); however, it does not always completely cure PCa, and patients often experience a recurrence of the disease. In addition, the clinical and pathological parameters used to assess the prognosis and choose further tactics for treating a patient are insufficiently informative and need to be supplemented with new markers. In this study, we performed RNA-Seq of PCa tissue samples, aimed at identifying potential prognostic markers at the level of gene expression and miRNAs associated with one of the key signs of cancer aggressiveness—lymphatic dissemination. The relative expression of candidate markers was validated by quantitative PCR, including an independent sample of patients based on archival material. Statistically significant results, derived from an independent set of samples, were confirmed for miR-148a-3p and miR-615-3p, as well as for the CST2, OCLN, and PCAT4 genes. Considering the obtained validation data, we also analyzed the predictive value of models based on various combinations of identified markers using algorithms based on machine learning. The highest predictive potential was shown for the “CST2 + OCLN + pT” model (AUC = 0.863) based on the CatBoost Classifier algorithm.

1. Introduction

Prostate cancer (PCa) is one of the most common cancers in men worldwide; more than one million new cases are diagnosed annually [1]. Prostate cancer is characterized by high clinical heterogeneity, which is manifested by a different propensity for recurrence of the disease, as well as the onset of progression after surgical treatment. The clinical heterogeneity of PCa, in turn, is due to molecular heterogeneity—in the tumors of patients, there are disorders that are accompanied by various changes in signaling pathways and metabolic processes [2,3]. At the same time, for the development of aggressive forms of PCa, the occurrence of only a few driver disorders is sufficient [4,5,6].
Despite a wide range of therapeutic approaches, the problem of choosing the tactics for treating a patient after radical prostatectomy is quite acute, especially for the category of patients with locally advanced PCa (LAPC). In addition to invasion of the prostatic capsule, metastasis to regional lymph nodes is often observed in LAPC. In this case, as a rule, the choice is between the immediate start of adjuvant hormone therapy or active monitoring of the level of prostate-specific antigen (PSA) in the blood.
The choice of a therapeutic concept is based on determining the prognosis for the patient, which, in turn, is based on such clinical and pathological parameters as the PSA level, the size of the primary tumor (stage T), and the Gleason score. Indeed, it is worth noting the emergence of an improved classification of cancer cell differentiation based on the Gleason score, a system developed by the ISUP (International Society of Uropathologists) [7]. The division into groups occurs between 1 and 5, depending on the Gleason score, the method providing the most accurate stratification of tumors.
However, these indicators are not informative enough to identify a group of patients with tumors that have high potential for progression, the development of an aggressive phenotype, and metastasis [8,9]. Thus, reliable prognostic markers are needed, the use of which, in combination with clinicopathological parameters, will help to reliably identify an aggressive tumor phenotype and thus choose the best therapeutic approach for the patient. One of the promising approaches in the search for prognostic markers may be the analysis of the transcriptomic data of tumors [10,11,12]. Currently, several prognostic expression panels of PCa markers based on tissue analysis have been identified, such as Decipher and Oncotype DX Genomic Prostate Score (GPS).
The Decipher test measures the RNA expression levels of 22 different genes, selected based on unique differential expression patterns in early metastasis [13]. The Decipher test has shown high discrimination in the prediction of clinical metastases (AUC = 0.75–0.83) and mortality from PCa (AUC = 0.78) in validation studies, significantly exceeding the available clinicopathological characteristics (AUC = 0.69) [14]. The Oncotype DX panel is a tissue biopsy-based genomic assay that measures the mRNA expression of 17 genes responsible for tumor cell growth and survival [15]. At the development stage of this test, we used the results of quantitative PCR, based on the archival material of tissue samples obtained after the surgical treatment of patients with PCa in low- and intermediate-risk groups. Based on validation studies, the Oncotype DX panel has been shown to be highly correlated with the biochemical recurrence of PCa and a poor prognosis, highlighting its predictive value for patients with PCa in low- and intermediate-risk groups.
Previously, we conducted a study that included miRNA-Seq analysis of 44 PCa tissue samples with and without lymphatic dissemination (N1 group = 20 samples; N0 group = 24 samples), as a result of which we identified a number of miRNAs, the expression of which could potentially be associated with lymphatic dissemination [16]. In the present study, we performed RNA-Seq profiling of an expanded sample of 73 PCa tissue samples from Russian patients and validated the obtained results by quantitative PCR (qPCR). This study also included an independent sample of 37 PCa tissue samples based on archival material. The obtained qPCR data for both PCa samples were used to analyze the predictive value of models based on combinations of candidate markers and clinicopathological parameters, using various machine learning algorithms (Logistic Regression (LR; scikit-learn ver.1.1.3), Light Gradient Boosting Machine (LGBM; ver.3.3.2), CatBoost (ver.1.0.6), Random Forest (scikit-learn ver.1.1.3) and XGBoost (ver.1.6.2)). The results of the study can be used to develop an expression panel for assessing the metastatic potential of high-risk PCa when choosing a therapeutic concept for a patient.

2. Results

2.1. Transcriptome Profile Associated with Lymphatic Dissemination

Based on the obtained RNA-Seq data, the differential expression (DE) of genes was analyzed between groups of patients with and without lymphatic dissemination (N1 and N0 groups, respectively). The list of obtained statistically significant DE genes (DEGs) is presented in Table S1 (p value of the QLF/U tests < 0.05). A heat map of the gene expression profile is shown in Figure 1.
Based on the obtained list of DEGs, pathway enrichment analysis was performed based on the GSEA algorithm using the Reactome 2022 database (Table 1).
According to the results of the pathway enrichment analysis, in the case of tumor samples of the N1 group, we predominantly observed the activation of the cell cycle and translation pathways. It is also worth noting the decrease in the activity of the pathway—"Fatty acid metabolism.”

2.2. Selection of Candidate Markers of Lymphatic Dissemination

One of the key objectives of this study was the identification of promising markers of lymphatic dissemination. To solve this problem, the resulting list of DEGs was filtered based on the following parameters: FDR U-test < 0.05; −1 < LogFC < 1; p value rs < 0.05. As a result of the filtering, the following genes were selected that best match the specified criteria: OCLN, F5, TBX1, CST2, RAB27A, PCAT4, and VGLL3 (Table 2).
Based on the previously obtained list of miRNAs associated with the N1 group, candidates were also selected based on filtering by similar parameters [16]. As a result, the following were selected: miR-148a-3p (Log2FC = −1.69; LogCPM = 18.6; FDR U test = 4 × 10−2; p value rs = 4 × 10−2) and miR-615-3p (Log2FC = 1.32; LogCPM = 7.6; FDR U test = 9 × 10−3; p value rs = 8 × 10−3).
After selecting a number of candidate markers, we evaluated their multicollinearity, including those with clinical and pathological parameters—ISUP and pT stage. The analysis of the correlation showed that there is no functional strong relationship between the markers and that they can be jointly considered as part of the models (Figure 2).

2.3. Validation of the Relative Expression of the Potential Markers by qPCR

Validation of the expression of promising markers was primarily carried out on a previously sequenced sample of Russian patients with LAPC based on freshly frozen surgical material (FFT samples).
As a result of the validation, it was shown that, based on the expression of all considered genes, a statistically significant difference between the studied groups was confirmed (Figure 3a and Table 3). Next, we validated the relative expression of the selected genes in an independent sample of Russian patients (FFPE samples).
Statistically significant results were obtained based on the relative expression of the CST2, OCLN, and PCAT4 genes (Figure 3b and Table 3). The results of the calculations of the average change in the relative expression of the genes between the studied groups, performed based on the FFPE samples, showed the greatest decrease in expression in the presence of lymphatic dissemination for the PCAT4 gene (a 1.64-fold decrease) and the greatest increase in expression was shown for the CST2 gene (a 3.25-fold increase).
Next, we validated the relative expression of the selected promising miRNAs. In the case of the FFT samples, a statistically significant difference between groups based on the relative expression of miR-148a-3p and miR-615-3p was confirmed (Figure 4a and Table 4).
Next, we validated the expression of miR-148a-3p and miR-615-3p in the FFPE samples. According to the results obtained, based on the relative expression of these miRNAs, there was also a statistically significant difference between the groups (Figure 4b).
In the case of miR-615-3p, we observed a 4.08-fold increase in expression in the case of the FFT samples (p = 0.001) and a 2.23-fold increase in the case of the FFPE samples (p = 0.04). The expression of miR-148a-3p was reduced in the N1 group by 1.5-fold in the FFPE samples (p = 0.04) and 2.04-fold in the FFPE samples (p = 0.04).

2.4. Relative Expression of the Candidate Markers in Lymph Node Metastases

We also analyzed the relative expression of the candidate markers in lymph node metastasis samples. Relative expression of CST2, OCLN, miR-148a-3p and miR-615-3p was found in all PCa metastasis samples.
It was also found that in the metastasis samples, the expression of the CST2 gene was statistically significantly reduced by an average of 3.5-fold compared to the primary tumor samples (p = 0.002) (Figure 5a). In the case of the OCLN gene, in metastases, on the contrary, an average 3.34-fold increase in expression (p = 0.02) was observed (Figure 5b).
When evaluating the expression of candidate microRNAs in metastasis samples, a statistically significant increase in miR-615-3p expression was found—being, on average, 6.57-fold in comparison to primary tumor samples (p = 0.003) (Figure 5c). In the case of miR-148a-3p, an average 2.16-fold increase in expression (p = 0.01) was also noted in the metastatic samples (Figure 5d).

2.5. ROC–AUC Analysis of Models Based on Combinations of the Markers

We assessed the predictive value of the identified candidate markers in various combinations (from single predictors to a model based on all seven predictors). Each combination of predictors was evaluated based on five machine learning algorithms: Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost. The results of all predictor combinations for the five algorithms are presented in Table S2.
Based on the results obtained in each category of combinations, we selected the best models in terms of the ROC–AUC value using the test data, which passed the threshold of 0.7 for the accuracy parameter in the training data (Table 4).
The model based on the combination of «CST2+ OCLN+ pT» predictors and the CatBoost algorithm was chosen as the best one, characterized by the highest performance in all parameters considered. According to the other considered machine learning algorithms, we also observed high AUC values for this model (Table 5 and Figure 6).

3. Discussion

In the present study, a comprehensive transcriptomic analysis of LAPC tissue samples in a sample of Russian patients was carried out. This procedure aimed at identifying promising candidate markers of early metastasis. Based on the obtained transcriptomic profile, the biological pathways associated with lymphatic dissemination were first considered. The results of the biological pathways analysis with high enrichment predominantly demonstrated the activation of translational processes in tumor cells. This translational activity is likely to be closely related to increased metabolism, directed at obtaining energy from various sources to promote the epithelial–mesenchymal transition of tumor cells and metastasis.
Among all of the pathways identified, it is worth noting the decreased activation of the “Fatty Acid Metabolism” pathway. It is known that malignant transformation of a tumor depends on complex intercellular interactions, supported by a wide network of physical and chemical mediators that make up the tumor microenvironment [17]. Recently, various researchers have emphasized the key role of adipose tissue as a key component in the progression of solid tumors [18]. The prostate gland is surrounded by periprostatic adipose tissue, and extraprostatic expansion to adipose tissue is a widely recognized poor prognostic factor in PCa and an important predictor of recurrence after treatment [19]. The positive relationship between obesity and aggressive PCa, determined by an increase in local and distant spread, also supports the role of adipose tissue in tumor progression [20].
Interactions between adipocytes and tumor cells in the tumor microenvironment can create a metabolic symbiosis, leading to growth and metastasis. In combination with glucose, fatty acids are also vital for the synthesis of membrane lipids in tumor cells, energy production, and the synthesis of carcinogenesis-associated lipid-signaling molecules such as lysophosphatidic acids [21,22,23]. As a result, tumor cells activate de novo fatty acid synthesis, and elevated levels of fatty acid synthase are negatively correlated with prognosis [24]. Thus, in addition to synthesis, tumor cells can also use exogenous fatty acids as a source of nutrition. Thus, our results also highlight the importance of lipid metabolism in the progression of PCa.
Furthermore, based on the obtained transcriptomic profile, we searched for promising candidate markers based on gene expression. As a result of the validation, we confirmed the statistical significance of the expression of the PCAT4, OCLN and CST2 genes in lymphatic dissemination in an independent sample.
The PCAT4 (prostate cancer associated transcript 4; PCAN1; GDEP) gene is characterized by high tissue specificity for prostate tissue, but there are no published data on the biological function of this gene. According to the data obtained, we saw a significant decrease in the expression of this gene in the N1 group.
The OCLN (occludin) gene encodes the occludin protein, which belongs to tight junction proteins. Tight junctions are one of the key components in tumor metastasis, as tumor cells must pass through a series of barriers to successfully metastasize to secondary lesions [25]. OCLN is widely expressed in tissues and cells with tight junctions and is a membrane protein with four trans-membrane domains [26]. According to the literature, high expression of OCLN has been found in lung cancer, and when OCLN was knocked down in cell lines of lung cancer (A549, NCL-H1650, SPC-A1, HCC827, NCI-H1299, and MSTO-211H), inhibition of cell proliferation was observed in vitro and in vivo. In addition, OCLN knockdown promoted apoptosis of lung cancer cell lines and reduced their ability to invade, on the basis of which the role of OCLN as a tumor promoter and prometastatic factor was shown for the first time [27]. Based on our data, for the first time, an association between increased expression of the OCLN gene and the presence of lymphatic dissemination in LAPC was shown.
The CST2 (cystatin SA) gene is a member of the cystatin family. Based on several studies, it has been shown that high expression of this gene is associated with the development of carcinogenesis. In breast cancer, increased expression of the CST2 gene has been shown to be associated with tumor cell proliferation, movement, and adhesion [28]. Based on our data, we observed an association between increased expression of the CST2 gene and lymphatic dissemination in LAPC.
In addition, we also validated previously identified promising markers based on miRNA expression, specifically miR-615-3p and miR-148a-3p, which also confirmed their association with lymphatic dissemination in the case of an independent sample of LAPC.
Aberrant expression of miR-615-3p has been described in many forms of cancer, including PCa, where overexpression of miR-615-3p has been observed in the most aggressive forms [29,30,31]. Experiments on cell lines of various types of cancer have shown that miR-615-3p overexpression supports cell proliferation and migration [29,30]. Functional studies performed on PCa cell lines have shown that miR-615-3p promotes proliferation, apoptosis, and migration of the PC3M cell line in vitro, indicating that miR-615-3p is an important oncogenic microRNA in PCa [31].
miR-148a-3p is one of the most highly expressed miRNAs in PCa tissues, as well as the most dominant in PCa metastasis [32]. High-grade tumors have been shown to exhibit reduced levels of miR-148a-3 expression. miR-148a expression has also been shown to be downregulated in docetaxel-resistant variants of PCa cell lines, including PC-3 and DU145, and downregulation of miR-148a has been observed in PCa with a risk of biochemical recurrence [33].
Evaluation of the expression of these candidate markers in samples of affected lymph nodes showed a further linear increase in the expression of the OCLN gene and miR-615-3p, as well as increased expression of miR-148a-3p. It can be assumed that the increased expression of these markers is not only associated with lymphatic dissemination in LAPC, but also supports the formation of secondary tumor foci.
We assessed the prognostic significance of PCAT4, OCLN, CST2, miR-615-3p, and miR-148a-3p for various combinations of predictors, both with each other and with such clinicopathological parameters as ISUP and pT. We considered the main metrics for models based on five machine learning algorithms for a classification problem.
The “CST2 + OCLN + pT” model, based on the CatBoost algorithm, had the highest metrics. In addition to the highest AUC (0.863), this model also had the highest sensitivity (SE = 0.83), specificity (SP = 0.79), and accuracy (AC = 0.81) of an independent sample of archival material. The parameters of this model based on other algorithms also differed in the highest rates (AUC = 0.73–0.81; SE = 0.56–0.78; SP = 0.68–0.84; AC = 0.62–0.73). Thus, the model we identified based on the predictor combination of «CST2 + OCLN + pT» had the highest prognostic potential for determining lymphatic dissemination in LAPC, both based on freshly frozen surgical material and, in the case of archival material, based on FFPE blocks.

4. Materials and Methods

4.1. Material

The present study included 73 samples of freshly frozen LAPC tissues, obtained as a result of radical prostatectomy with extended pelvic lymphadenectomy, a procedure performed on the basis of the research of National Medical Research Center for Radiology of the Ministry of Health of the Russian Federation.
The main criteria for sample inclusion in the study were the following: tumor type adenocarcinoma, LAPC (pT3a/3b), no neoadjuvant therapy, known lymph node status (N0/N1), and a negative resection margin for samples with stage N0. The sample of FFT samples was divided into groups, both with and without lymphatic dissemination (groups N1 n = 31 and N0 n = 42, respectively).
As an independent sample, 37 FFPE LAPC tissue samples were used, obtained as a result of radical prostatectomy with extended pelvic lymphadenectomy on the basis of the A.V. Vishnevsky National Medical Research Center for Surgery of the Ministry of Health of Russia. The sample was also divided into groups N1 (n = 19) and N0 (n = 18). Samples of affected regional lymph nodes were also included in the study of patients from group N1 (n = 14).
All samples of tumor tissues were characterized in the pathological anatomical departments of the respective medical institutions, on the basis of which the material was obtained, and they were found to contain at least 70% of tumor cells. The main clinical and pathological characteristics of patients are presented in Table 6.

4.2. Methods

4.2.1. Isolation of Total RNA from Tissue Samples

Samples of fresh frozen tumor tissues were preliminarily homogenized using a MagNA Lyser device (Roche, Basel, Switzerland). Subsequent total RNA isolation was performed using the MagNA Pure Compact RNA Kit (Roche) on the MagNA Pure Compact System (Roche) according to the manufacturer’s protocol. For FFPE samples of tumor tissues and lymph nodes, total RNA isolation was performed using the High Pure FFPET RNA Isolation Kit (Roche) according to the manufacturer’s protocol. The concentration of isolated total RNA was assessed on a Quibit 4.0 fluorimeter (Thermo Fisher Scientific, Waltham, MA, USA) using the Qubit RNA BR Assay Kit (Thermo Fisher Scientific).

4.2.2. Library Preparation and High Throughput Sequencing

For the obtained samples of total RNA, the quality was assessed on the Agilent Bioanalyzer 2100 instrument (gilent Technologies, Santa Clara, CA, USA) using the Agilent RNA 6000 Nano Kit (Agilent Technologies) in accordance with the manufacturer’s protocol. For the subsequent preparation of mRNA libraries, tumor tissue samples with a RIN value of at least 7 were used. Sample preparation of mRNA libraries was performed using the TruSeq Stranded mRNA Kit (Illumina, San Diego, CA, USA) in accordance with the manufacturer’s protocol. The concentration of the resulting libraries was measured on a Quibit 4.0 fluorimeter using the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific). The quality of the resulting libraries was assessed on an Agilent Bioanalyzer 2100 instrument using the Agilent High Sensitivity DNA Kit (Thermo Fisher Scientific) in accordance with the manufacturer’s protocol. The size of the resulting mRNA library was ~260 bp. High-throughput sequencing of mRNA libraries was performed on a NextSeq 500 System (Illumina) using NextSeq 500/550 High Output Kit v2.5 (Illumina) in 75 bp single-ended read mode. As a result of the sequencing, at least 14 million reads were obtained for each sample.

4.2.3. Bioinformatics Data Analysis

For the obtained RNA-Seq data in the fastqc format, quality assessment was per-formed using the FastqQC and MultiQC programs (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/, accessed on 10 May 2022). The Trimmomatic tool was used to remove adapter sequences from RNA-Seq data, which was followed by mapping to the reference genome (GRCh38 assembly) using the STAR tool [34,35]. FeatureCounts (Subread package v.1.6.4, Parkville, Australia) was used to calculate the read counts per gene [36]. The analysis of differential gene expression was performed in the R statistical environment using the edgeR package [37]. The TMM (Trimmed Mean of M-values) method was used to normalize the data. In the analysis of differential gene expression, the following quasi-likelihood F-test (QLF test) and the non-parametric Mann–Whitney test were applied (U-test). The Benjamini–Hochberg correction was applied to calculate the false positive rate (FDR). Spearman’s rank correlation coefficients (rs) were calculated between N0 and N1groups. Differences in the level of gene expression were considered statistically significant at test p values < 0.05. The visualization of heat maps of transcriptome profiles was performed using the ggplot2 package [38]. Biological pathway enrichment analysis based on RNA-Seq data was performed based on the GSEA algorithm using the Reactome 2022 database. The results were considered significant at FDR < 0.05.

4.2.4. Quantitative PCR (qPCR)

cDNA samples were obtained from the mRNA template using Mint reverse transcriptase and oligo(dT) primer (20 µM) according to the protocol of the manufacturer (Evrogen, Moscow, Russia). cDNA was obtained from the miRNA template using the TaqMan Advanced miRNA cDNA Synthesis Kit (Thermo Fisher Scientific) according to the manufacturer’s protocol. qPCR was performed in three technical replicates on an Applied Biosystems 7500 instrument (Thermo Fisher Scientific). The HPRT1 gene was used as a reference gene for analysis of relative mRNA expression. The sequences of primers used to validate markers based on mRNA expression are shown in Table 7. When validating microRNAs, miR-28-3p was used as a control. For the detection of control and target miRNAs, commercial sets of primers and probes, all contained on the TaqManTM Advanced miRNA Assay (Thermo Fisher Scientific), were used: 477814_mir (miR-148a-3p), 478175_mir (miR-615-3p), 477999_mir (miR-28-3p). The level of relative expression of genes and microRNA for each study group was calculated by the ΔCt method. Visualization and statistical analysis of expression results were performed using paired Wilcoxon tests in Jupyter Notebook, Python (ver. 3.6).

4.2.5. Model Analysis

Based on the qPCR results, we used five algorithms for supervised machine learning, including Logistic Regression (LR), Light Gradient Boosting Machine (LGBM), CatBoost, Random Forest and XGBoost. These methods were implemented using scikit-learn, lightgbm, catboost, xgboost libraries in Jupyter Notebook, Python (ver. 3.6). The presence of lymphatic dissemination was used as a target. FFT samples were used as the train set, FFPE samples were used as the test set. The models were trained using cross-validation (cv = 5). Receiver operating characteristics (ROC) curves were used to compare the performance of these five algorithms.

5. Conclusions

We performed RNA-Seq profiling of 73 LAPC tissue samples, obtained from radical prostatectomy, with extended lymphectomy. Using bioinformatics analysis, enriched biological pathways associated with lymphatic dissemination in LAPC were identified in a sample of Russian patients, a group which can be further studied in the search for new potential therapeutic targets. Moreover, based on the bioinformatics analysis, we identified a number of genes and microRNAs, the expression of which can be considered potential prognostic markers. As a result of the validation of candidate markers by qPCR on an independent sample of patients, statistically significant results were confirmed for the PCAT4, OCLN, and CST2 genes, as well as miR-615-3p and miR-148a-3p. Based on the qPCR data obtained, we analyzed the prognostic significance of various combinations of these candidate markers, including those with the clinicopathological parameters ISUP and pT, using various machine learning algorithms. As a result, we showed that the model, based on the combination of «CST2 + OCLN + pT», was characterized by the highest predictive value (AUC = 0.863) for determining lymphatic dissemination, both on samples of freshly frozen PCa tissues and on samples of archival material.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24032418/s1.

Author Contributions

Conceptualization, E.A.P., K.M.N., D.V.K. and B.Y.A.; methodology, E.A.P. and M.S.F.; formal analysis, A.A.K. and G.S.K.; investigation, I.V.K., I.R.B., S.P.K. and M.S.L.; re-sources, A.V.K.; writing—original draft preparation, E.A.P.; writing—review and editing, A.V.S., A.A.D. and N.V.M.; visualization, V.S.P. and M.S.F.; project administration, E.A.P. and A.V.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Russian Science Foundation, grant 18-75-10127.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the N. Lopatkin Scientific Research Institute of Urology and Interventional Radiology-branch of the National Medical Research Radiological Center of the Ministry of Health of Russian Federation (ethics committee approval no. 159, 09.11.2021) and Vishnevsky Institute of Surgery Ministry of Health of the Russian Federation (ethics committee approval no. 007/18, 02.10.2018).

Informed Consent Statement

Written informed consent has been obtained from the patients to publish this paper.

Data Availability Statement

All data generated or analyzed during this study are available within the article or upon request from the corresponding author.

Acknowledgments

The authors thank the National Medical Research Center of Radiology and Vishnevsky Institute of Surgery for assistance in data interpretation and discussion, and EIMB RAS for providing the computing power and techniques for the data analysis. This work was performed using the equipment of EIMB RAS “Genome” center (http://www.eimb.ru/ru1/ckp/ccu_genome_c.php, accessed on 10 May 2022).

Conflicts of Interest

The author has no conflict of interest to declare.

References

  1. Pernar, C.H.; Ebot, E.M.; Wilson, K.M.; Mucci, L.A. The Epidemiology of Prostate Cancer. Cold Spring Harb. Perspect. Med. 2018, 8, a030361. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Grasso, C.S.; Wu, Y.-M.; Robinson, D.R.; Cao, X.; Dhanasekaran, S.M.; Khan, A.P.; Quist, M.J.; Jing, X.; Lonigro, R.J.; Brenner, J.C.; et al. The mutational landscape of lethal castration-resistant prostate cancer. Nature 2012, 487, 239–243. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Taylor, B.S.; Schultz, N.; Hieronymus, H.; Gopalan, A.; Xiao, Y.; Carver, B.S.; Arora, V.K.; Kaushik, P.; Cerami, E.; Reva, B.; et al. Integrative Genomic Profiling of Human Prostate Cancer. Cancer Cell 2010, 18, 11–22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Barbieri, C.; Bangma, C.H.; Bjartell, A.; Catto, J.; Culig, Z.; Grönberg, H.; Luo, J.; Visakorpi, T.; Rubin, M. The Mutational Landscape of Prostate Cancer. Eur. Urol. 2013, 64, 567–576. [Google Scholar] [CrossRef] [Green Version]
  5. Haffner, M.C.; Mosbruger, T.; Esopi, D.M.; Fedor, H.; Heaphy, C.M.; Walker, D.A.; Adejola, N.; Gã¼Rel, M.; Hicks, J.; Meeker, A.K.; et al. Tracking the clonal origin of lethal prostate cancer. J. Clin. Investig. 2013, 123, 4918–4922. [Google Scholar] [CrossRef] [Green Version]
  6. Kumar, A.; White, T.A.; MacKenzie, A.P.; Clegg, N.; Lee, C.; Dumpit, R.F.; Coleman, I.; Ng, S.B.; Salipante, S.J.; Rieder, M.J.; et al. Exome sequencing identifies a spectrum of mutation frequencies in advanced and lethal prostate cancers. Proc. Natl. Acad. Sci. USA 2011, 108, 17087–17092. [Google Scholar] [CrossRef] [Green Version]
  7. Epstein, J.I.; Egevad, L.; Amin, M.B.; Delahunt, B.; Srigley, J.R.; Humphrey, P.A.; Grading, C. The 2014 International Society of Urological Pathology (ISUP) Consensus Conference on Gleason Grading of Prostatic Carcinoma: Definition of Grading Patterns and Proposal for a New Grading System. Am. J. Surg. Pathol. 2016, 40, 244–252. [Google Scholar] [CrossRef]
  8. Moschini, M.; Spahn, M.; Mattei, A.; Cheville, J.; Karnes, R.J. Incorporation of tissue-based genomic biomarkers into localized prostate cancer clinics. BMC Med. 2016, 14, 67. [Google Scholar] [CrossRef] [Green Version]
  9. Vela, I.; Scardino, P.T. Molecular Profiles of Prostate Cancer: To Treat or Not to Treat. Annu. Rev. Med. 2016, 67, 119–135. [Google Scholar] [CrossRef]
  10. Kumar, A.; Kasikci, Y.; Badredine, A.; Azzag, K.; Quintyn Ranty, M.L.; Zaidi, F.; Aragou, N.; Mazerolles, C.; Malavaud, B.; Mendoza-Parra, M.A.; et al. Patient-matched analysis identifies deregulated networks in prostate cancer to guide personalized therapeutic intervention. Am. J. Cancer Res. 2021, 11, 5299–5318. [Google Scholar]
  11. Bolis, M.; Bossi, D.; Vallerga, A.; Ceserani, V.; Cavalli, M.; Impellizzieri, D.; Di Rito, L.; Zoni, E.; Mosole, S.; Elia, A.R.; et al. Dynamic prostate cancer transcriptome analysis delineates the trajectory to disease progression. Nat. Commun. 2021, 12, 7033. [Google Scholar] [CrossRef]
  12. Pudova, E.A.; Krasnov, G.S.; Kobelyatskaya, A.A.; Savvateeva, M.V.; Fedorova, M.S.; Pavlov, V.S.; Nyushko, K.M.; Kaprin, A.D.; Alekseev, B.Y.; Trofimov, D.Y.; et al. Gene Expression Changes and Associated Pathways Involved in the Progression of Prostate Cancer Advanced Stages. Front. Genet. 2021, 11, 613162. [Google Scholar] [CrossRef]
  13. Karnes, R.J.; Bergstralh, E.J.; Davicioni, E.; Ghadessi, M.; Buerki, C.; Mitra, A.P.; Crisan, A.; Erho, N.; Vergara, I.; Lam, L.L.; et al. Validation of a Genomic Classifier that Predicts Metastasis Following Radical Prostatectomy in an At Risk Patient Population. J. Urol. 2013, 190, 2047–2053. [Google Scholar] [CrossRef]
  14. Klein, E.A.; Haddad, Z.; Yousefi, K.; Lam, L.L.; Wang, Q.; Choeurng, V.; Palmer-Aronsten, B.; Buerki, C.; Davicioni, E.; Li, J.; et al. Decipher Genomic Classifier Measured on Prostate Biopsy Predicts Metastasis Risk. Urology 2016, 90, 148–152. [Google Scholar] [CrossRef] [Green Version]
  15. Van den Eeden, S.K.; Lu, R.; Zhang, N.; Quesenberry, C.P., Jr.; Shan, J.; Han, J.S.; Tsiatis, A.C.; Leimpeter, A.D.; Lawrence, H.J.; Febbo, P.G.; et al. A Biopsy-based 17-gene Genomic Prostate Score as a Predictor of Metastases and Prostate Cancer Death in Surgically Treated Men with Clinically Localized Disease. Eur. Urol. 2018, 73, 129–138. [Google Scholar] [CrossRef]
  16. Pudova, E.A.; Krasnov, G.S.; Nyushko, K.M.; Kobelyatskaya, A.A.; Savvateeva, M.V.; Poloznikov, A.A.; Dolotkazin, D.R.; Klimina, K.M.; Guvatova, Z.G.; Simanovsky, S.A.; et al. miRNAs expression signature potentially associated with lymphatic dissemination in locally advanced prostate cancer. BMC Med Genom. 2020, 13 (Suppl. 8), 129. [Google Scholar] [CrossRef]
  17. Hanahan, D.; Weinberg, R.A. Hallmarks of cancer: The next generation. Cell 2011, 144, 646–674. [Google Scholar] [CrossRef] [Green Version]
  18. Duong, M.N.; Geneste, A.; Fallone, F.; Li, X.; Dumontet, C.; Muller, C. The fat and the bad: Mature adipocytes, key actors in tumor progression and resistance. Oncotarget 2017, 8, 57622–57641. [Google Scholar] [CrossRef] [Green Version]
  19. Kapoor, J.; Namdarian, B.; Pedersen, J.; Hovens, C.; Moon, D.; Peters, J.; Costello, A.J.; Ruljancich, P.; Corcoran, N.M. Extraprostatic Extension into Periprostatic Fat is a More Important Determinant of Prostate Cancer Recurrence than an Invasive Phenotype. J. Urol. 2013, 190, 2061–2067. [Google Scholar] [CrossRef]
  20. Parker, A.S.; Thiel, D.D.; Bergstralh, E.; Carlson, R.E.; Rangel, L.J.; Joseph, R.; Diehl, N.; Karnes, R.J. Obese men have more advanced and more aggressive prostate cancer at time of surgery than non-obese men after adjusting for screening PSA level and age: Results from two independent nested case–control studies. Prostate Cancer Prostatic Dis. 2013, 16, 352–356. [Google Scholar] [CrossRef]
  21. Rysman, E.; Brusselmans, K.; Scheys, K.; Timmermans, L.; Derua, R.; Munck, S.; Van Veldhoven, P.P.; Waltregny, D.; Daniëls, V.W.; Machiels, J.; et al. De novo Lipogenesis Protects Cancer Cells from Free Radicals and Chemotherapeutics by Promoting Membrane Lipid Saturation. Cancer Res. 2010, 70, 8117–8126. [Google Scholar] [CrossRef] [Green Version]
  22. Zha, S.; Ferdinandusse, S.; Hicks, J.L.; Denis, S.; Dunn, T.A.; Wanders, R.J.; Luo, J.; De Marzo, A.M.; Isaacs, W.B. Peroxisomal branched chain fatty acid beta-oxidation pathway is upregulated in prostate cancer. Prostate 2005, 63, 316–323. [Google Scholar] [CrossRef]
  23. Ren, J.; Xiao, Y.-J.; Singh, L.S.; Zhao, X.; Zhao, Z.; Feng, L.; Rose, T.M.; Prestwich, G.D.; Xu, Y. Lysophosphatidic Acid Is Constitutively Produced by Human Peritoneal Mesothelial Cells and Enhances Adhesion, Migration, and Invasion of Ovarian Cancer Cells. Cancer Res. 2006, 66, 3006–3014. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Kuhajda, F.P. Fatty-acid synthase and human cancer: New perspectives on its role in tumor biology. Nutrition 2000, 16, 202–208. [Google Scholar] [CrossRef] [PubMed]
  25. Haynes, M.D.; Martin, T.A.; Jenkins, S.A.; Kynaston, H.G.; Matthews, P.N.; Jiang, W.G. Tight junctions and bladder cancer (review). Int. J. Mol. Med. 2005, 16, 3–9. [Google Scholar] [CrossRef] [PubMed]
  26. Tsukita, S.; Furuse, M.; Itoh, M. Multifunctional strands in tight junctions. Nat. Rev. Mol. Cell Biol. 2001, 2, 285–293. [Google Scholar] [CrossRef]
  27. Wang, M.; Liu, Y.; Qian, X.; Wei, N.; Tang, Y.; Yang, J. Downregulation of occludin affects the proliferation, apoptosis and metastatic properties of human lung carcinoma. Oncol. Rep. 2018, 40, 454–462. [Google Scholar] [CrossRef] [Green Version]
  28. Bao, Y.; Wang, L.; Shi, L.; Yun, F.; Liu, X.; Chen, Y.; Chen, C.; Ren, Y.; Jia, Y. Transcriptome profiling revealed multiple genes and ECM-receptor interaction pathways that may be associated with breast cancer. Cell. Mol. Biol. Lett. 2019, 24, 38. [Google Scholar] [CrossRef] [Green Version]
  29. Mukai, R.; Tomimaru, Y.; Nagano, H.; Eguchi, H.; Mimori, K.; Tomokuni, A.; Asaoka, T.; Wada, H.; Kawamoto, K.; Marubashi, S.; et al. miR-615-3p expression level in bone marrow is associated with tumor recurrence in hepatocellular carcinoma. Mol. Clin. Oncol. 2015, 3, 487–494. [Google Scholar] [CrossRef] [Green Version]
  30. Wang, J.; Liu, L.; Sun, Y.; Xue, Y.; Qu, J.; Pan, S.; Li, H.; Qu, H.; Wang, J.; Zhang, J. miR-615-3p promotes proliferation and migration and inhibits apoptosis through its potential target CELF2 in gastric cancer. Biomed. Pharmacother. 2018, 101, 406–413. [Google Scholar] [CrossRef]
  31. Laursen, E.B.; Fredsøe, J.; Schmidt, L.; Strand, S.H.; Kristensen, H.; Rasmussen, A.K.I.; Daugaard, T.F.; Mouritzen, P.; Høyer, S.; Kristensen, G.; et al. Elevated miR-615-3p Expression Predicts Adverse Clinical Outcome and Promotes Proliferation and Migration of Prostate Cancer Cells. Am. J. Pathol. 2019, 189, 2377–2388. [Google Scholar] [CrossRef] [Green Version]
  32. Watahiki, A.; Wang, Y.; Morris, J.; Dennis, K.; O’Dwyer, H.M.; Gleave, M.; Gout, P.W.; Wang, Y. MicroRNAs Associated with Metastatic Prostate Cancer. PLoS ONE 2011, 6, e24950. [Google Scholar] [CrossRef]
  33. Lichner, Z.; Ding, Q.; Samaan, S.; Saleh, C.; Nasser, A.; Al-Haddad, S.; Samuel, J.N.; Fleshner, N.E.; Stephan, C.; Jung, K.; et al. miRNAs dysregulated in association with Gleason grade regulate extracellular matrix, cytoskeleton and androgen receptor pathways. J. Pathol. 2015, 237, 226–237. [Google Scholar] [CrossRef]
  34. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [Green Version]
  35. Dobin, A.; Davis, C.A.; Schlesinger, F.; Drenkow, J.; Zaleski, C.; Jha, S.; Batut, P.; Chaisson, M.; Gingeras, T.R. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 2013, 29, 15–21. [Google Scholar] [CrossRef]
  36. Liao, Y.; Smyth, G.K.; Shi, W. feature Counts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 2014, 30, 923–930. [Google Scholar] [CrossRef] [Green Version]
  37. Robinson, M.D.; McCarthy, D.J.; Smyth, G.K. EdgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26, 139–140. [Google Scholar] [CrossRef] [Green Version]
  38. Ito, K.; Murphy, D. Application of ggplot2 to Pharmacometric Graphics. CPT Pharmacomet. Syst. Pharmacol. 2013, 2, 1–16. [Google Scholar] [CrossRef]
Figure 1. Heat map of the relative level of expression of 50 DEGs between groups N0 and N1 for Russian patients with LAPC. Cell colors correspond to the binary logarithm of the ratio of the expression level in the current sample to the average level for all samples (for each gene).
Figure 1. Heat map of the relative level of expression of 50 DEGs between groups N0 and N1 for Russian patients with LAPC. Cell colors correspond to the binary logarithm of the ratio of the expression level in the current sample to the average level for all samples (for each gene).
Ijms 24 02418 g001
Figure 2. Correlation matrix of candidate markers and clinicopathological parameters expression levels. Cell colors correspond to the value of the Pearson correlation coefficient.
Figure 2. Correlation matrix of candidate markers and clinicopathological parameters expression levels. Cell colors correspond to the value of the Pearson correlation coefficient.
Ijms 24 02418 g002
Figure 3. Box plot of the relative mRNA expression levels (in log scale) of CST2, F5, OCLN, PCAT4, RAB27A, TBX1, and VGLL3 genes between N0 and N1 groups in (a) FFT samples, (b) FFPE samples. Group N0 is marked in blue, group N1 is marked in red.
Figure 3. Box plot of the relative mRNA expression levels (in log scale) of CST2, F5, OCLN, PCAT4, RAB27A, TBX1, and VGLL3 genes between N0 and N1 groups in (a) FFT samples, (b) FFPE samples. Group N0 is marked in blue, group N1 is marked in red.
Ijms 24 02418 g003
Figure 4. Relative expression of miR-148a-3p and miR-615-3p between groups N0 and N1 (a) on FFT samples (b) on FFPE samples.
Figure 4. Relative expression of miR-148a-3p and miR-615-3p between groups N0 and N1 (a) on FFT samples (b) on FFPE samples.
Ijms 24 02418 g004
Figure 5. Dot plots showing relative expression results for (a) CST2 gene, (b) OCLN gene, (c) miR-615-3p, and (d) miR-148a-3p between groups of primary tumors and lymph node metastases. The y axis represents the 2−ΔCT values. The gray line indicates the direction of relative expression of the transcript between groups for each patient.
Figure 5. Dot plots showing relative expression results for (a) CST2 gene, (b) OCLN gene, (c) miR-615-3p, and (d) miR-148a-3p between groups of primary tumors and lymph node metastases. The y axis represents the 2−ΔCT values. The gray line indicates the direction of relative expression of the transcript between groups for each patient.
Ijms 24 02418 g005
Figure 6. ROC curves of the “CST2 + OCLN + pT” model from Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost algorithms.
Figure 6. ROC curves of the “CST2 + OCLN + pT” model from Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost algorithms.
Ijms 24 02418 g006
Table 1. Enriched biological pathways, according to the Reactome 2022 database, associated with lymphatic dissemination in LAPC in a sample of Russian patients.
Table 1. Enriched biological pathways, according to the Reactome 2022 database, associated with lymphatic dissemination in LAPC in a sample of Russian patients.
Pathway IDPathway NameESNESFDRGene Set SizeMatched Size
R-HSA-156842Eukaryotic Translation Elongation 0.642.511 × 10−39039
R-HSA-6791226Major Pathway of rRNA Processing in Nucleolus and Cytosol 0.582.471 × 10−317958
R-HSA-9633012Response Of EIF2AK4 (GCN2) to Amino Acid Deficiency0.622.411 × 10−39841
R-HSA-72312rRNA Processing 0.562.31 × 10−319964
R-HSA-975956Nonsense-Mediated Decay (NMD) Independent of Exon Junction Complex (EJC) 0.642.291 × 10−39237
R-HSA-156902Peptide Chain Elongation0.642.261 × 10−38637
R-HSA-8868773rRNA Processing in Nucleus and Cytosol0.562.221 × 10−318962
R-HSA-168273Influenza Viral RNA Transcription and Replication 0.572.211 × 10−313747
R-HSA-192823Viral mRNA Translation 0.612.211 × 10−39038
R-HSA-69278Cell Cycle Mitotic 0.472.13 × 10−3523124
R-HSA-72689Formation of a Pool of Free 40S Subunits 0.582.13 × 10−39844
R-HSA-975957Nonsense-Mediated Decay (NMD) Enhanced by Exon Junction Complex (EJC) 0.562.13 × 10−311242
R-HSA-2408557Selenocysteine Synthesis 0.612.123 × 10−39038
R-HSA-72764Eukaryotic Translation Termination0.642.133 × 10−39037
R-HSA-156827L13a-mediated Translational Silencing of Ceruloplasmin Expression 0.582.134 × 10−310845
R-HSA-72737Cap-dependent Translation Initiation 0.562.164 × 10−311646
R-HSA-397014Muscle Contraction −0.61−2.214 × 10−319630
R-HSA-2408522Selenoamino Acid Metabolism 0.552.031 × 10−211441
R-HSA-72706GTP Hydrolysis and Joining of 60S Ribosomal Subunit 0.562.011 × 10−210946
R-HSA-1640170Cell Cycle 0.451.991 × 10−2654142
R-HSA-168255Influenza Infection0.531.962 × 10−215751
R-HSA-9010553Regulation of Expression of SLITs and ROBOs 0.521.982 × 10−216755
R-HSA-68877Mitotic Prometaphase0.521.913 × 10−218641
R-HSA-1799339SRP-dependent Cotranslational Protein Targeting to Membrane 0.491.873 × 10−210843
R-HSA-68886M Phase0.441.893 × 10−238086
R-HSA-2500257Resolution of Sister Chromatid Cohesion0.541.883 × 10−210630
R-HSA-500792GPCR Ligand Binding0.531.93 × 10−245835
R-HSA-9711097Cellular Response to Starvation 0.481.863 × 10−215350
R-HSA-8953854Metabolism of RNA 0.411.93 × 10−2666156
R-HSA-446652Interleukin-1 Family Signaling 0.511.844 × 10−215230
R-HSA-68882Mitotic Anaphase 0.471.834 × 10−223265
R-HSA-72766Translation0.441.844 × 10−228176
R-HSA-69620Cell Cycle Checkpoints 0.451.844 × 10−227169
R-HSA-9754678SARS-CoV-2 Modulates Host Translation Machinery0.611.824 × 10−24717
R-HSA-141444Unattached Kinetochores Signal Amplification Via A MAD2 Inhibitory Signal 0.51.814 × 10−29328
R-HSA-69618Mitotic Spindle Checkpoint 0.481.814 × 10−211036
R-HSA-8978868Fatty Acid Metabolism −0.49−1.954 × 10−217332
R-HSA-72662mRNA Activation Upon Binding of Cap-Binding Complex and eIFs Subsequent Binding to 43S 0.531.84 × 10−25820
Table 2. Differential expression of promising genes as markers of lymphatic dissemination based on a sample of Russian patients with LAPC.
Table 2. Differential expression of promising genes as markers of lymphatic dissemination based on a sample of Russian patients with LAPC.
Gene IDSymbolNameLog2FCLog2CPMFDR (U Test)rsp (rs)
ENSG00000197822OCLNOccludin1.093.59 × 10−30.479 × 10−4
ENSG00000184058TBX1T-Box Transcription Factor 11.154.93 × 10−30.561 × 10−5
ENSG00000198734F5Coagulation Factor V1.8961 × 10−20.442 × 10−3
ENSG00000170369CST2Cystatin SA1.712.71 × 10−20.479 × 10−4
ENSG00000069974RAB27ARas-Related Protein Rab-27A−1.176.55 × 10−3−0.529 × 10−5
ENSG00000251321PCAT4Prostate Cancer-associated Transcript 4−2.515.22 × 10−2−0.49 × 10−3
ENSG00000206538VGLL3Vestigial-like Family Member 3−2.165.13 × 10−2−0.372 × 10−3
Table 3. The results of the relative expression of CST2, F5, OCLN, PCAT4, RAB27A, TBX1, and VGLL3 genes between groups N0 and N1 based on the U test.
Table 3. The results of the relative expression of CST2, F5, OCLN, PCAT4, RAB27A, TBX1, and VGLL3 genes between groups N0 and N1 based on the U test.
GeneFC, FFTp Value (U Test), FFTFC, FFPEp Value (U Test), FFPE
PCAT4↓ 3.731 × 10−3↓ 1.644 × 10−2
OCLN↑ 1.631 × 10−2↑ 1.774 × 10−2
F5↑ 2.466 × 10−3↓ 1.010.8
CST2↑ 3.894 × 10−3↑ 3.242 × 10−3
RAB27A↓ 1.721 × 10−3↑ 1.050.6
TBX1↑ 21 × 10−2↑ 1.230.3
VGLL3↓ 3.56 × 10−3↑ 1.350.2
↑, increase in relative gene expression; ↓, decrease in relative gene expression.
Table 4. The best models by AUC in each category of predictor combinations based on Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost algorithms.
Table 4. The best models by AUC in each category of predictor combinations based on Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost algorithms.
ModelAccuracyAUCF1SensitivitySpecificity
Logistic Regression
CST2 + pT0.650.790.720.390.89
ISUP + OCLN + pT0.650.820.670.610.68
ISUP + PCAT4 + OCLN + pT0.650.830.670.610.68
ISUP + PCAT4 + OCLN + miR148a + pT0.650.830.670.610.68
ISUP + PCAT4 + CST2 + OCLN + miR148a + pT0.650.830.670.610.68
LGBM
OCLN + pT0.620.700.630.610.63
ISUP + OCLN + pT0.730.760.740.720.74
ISUP + OCLN + miR148a + pT0.730.810.740.720.74
ISUP + PCAT4 + CST2 + OCLN + pT0.730.800.740.720.74
PCAT4 + CST2 + OCLN + miR148a + miR615 + pT0.590.750.690.280.89
Catboost
miR615 + pT0.680.710.740.440.89
CST2 + OCLN + pT0.810.860.810.830.79
ISUP + OCLN + miR148a + pT0.780.860.790.780.79
ISUP + PCAT4 + OCLN + miR148a + pT0.700.820.700.720.68
ISUP + PCAT4 + CST2 + OCLN + miR148a + pT0.700.800.720.670.68
Random Forest
ISUP + miR6150.590.650.630.500.68
OCLN + miR615 + pT0.700.760.730.610.79
ISUP + OCLN + miR615 + pT0.680.770.730.500.84
PCAT4 + OCLN + miR148a + miR615 + pT0.650.770.720.390.89
ISUP + PCAT4 + OCLN + miR148a + miR615 + pT0.700.760.740.560.84
XGBoost
ISUP + miR6150.570.630.600.500.63
OCLN + miR615 + pT0.760.760.780.670.84
ISUP + OCLN + miR615 + pT0.700.780.720.670.74
ISUP + OCLN + miR148a + miR615 + pT0.680.760.710.560.79
ISUP + PCAT4 + OCLN + miR148a + miR615 + pT0.680.760.710.560.79
Table 5. ROC analysis results for the «CST2+ OCLN+ pT» model based on the Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost algorithms.
Table 5. ROC analysis results for the «CST2+ OCLN+ pT» model based on the Logistic Regression, LGBM, Catboost, Random Forest, and XGBoost algorithms.
MetricsLogistic RegressionLGBMCatboostRandom ForestXGBoost
Accuracy, Train0.760.720.700.690.67
AUC, Train0.840.880.981.001.00
F1, Train0.700.630.630.620.61
Accuracy, Test0.700.620.810.730.73
AUC, Test0.730.740.860.800.81
F1, Test0.740.650.810.720.74
Sensitivity, Test0.560.560.830.780.72
Specificity, Test0.840.680.790.680.74
Table 6. Clinicopathological characteristics of LAPC specimens from two cohorts of Russian patients.
Table 6. Clinicopathological characteristics of LAPC specimens from two cohorts of Russian patients.
FFT SamplesFFPE Samples
GroupN0N1N0N1
Age46–77 (63)41–73 (63)56–75 (65)51–72 (65)
pT3a stage29782
pT3b stage13241017
ISUP group 15251
ISUP group 217245
ISUP group 3121142
ISUP group 45542
ISUP group 531119
PSA, ng/ml3.4–27.6 (14.81)5.49–44 (14.28)7.7–46 (14.6)4–20 (12.9)
Table 7. Primer sequences for detection of marker gene expression by qPCR.
Table 7. Primer sequences for detection of marker gene expression by qPCR.
GenePrimer Sequence (5′→3′)Product Size, bp
HPRT1F: CGAGATGTGATGAAGGAGATGG98
R: TTGATGTAATCCAGCAGGTCAG
F5F: TCTTACCTTGACCACACATTCC99
R: TCCACTGTCCTCACTGATACT
PCAT4F: GGATTCGCAAGAGAACACAATC111
R: CATCACAAACCGGCCAATATC
OCLNF: GGTTCACTTCTCCCAGTCTTTC96
R:AGACACAATCAACAGGGTTAGG
RAB27AF: CATGCCTGGGATCTTCTCTATG111
R: CCGGATGCTTTATTCGTAGGT
TBX1F: CGACAACGGCCACATTATTC100
R: CTCGGCATATTTCTCGCTATCT
CST2F: AGGAGGACAGGATAATCGAGG84
R: TGATGACAAAGTGAAGGGCAC
VGLL3F: CTCTCAAGCCAGCGGAATAG102
R: GACCTGGAAGTCAGGATGAAC
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pudova, E.A.; Kobelyatskaya, A.A.; Katunina, I.V.; Snezhkina, A.V.; Fedorova, M.S.; Pavlov, V.S.; Bakhtogarimov, I.R.; Lantsova, M.S.; Kokin, S.P.; Nyushko, K.M.; et al. Lymphatic Dissemination in Prostate Cancer: Features of the Transcriptomic Profile and Prognostic Models. Int. J. Mol. Sci. 2023, 24, 2418. https://doi.org/10.3390/ijms24032418

AMA Style

Pudova EA, Kobelyatskaya AA, Katunina IV, Snezhkina AV, Fedorova MS, Pavlov VS, Bakhtogarimov IR, Lantsova MS, Kokin SP, Nyushko KM, et al. Lymphatic Dissemination in Prostate Cancer: Features of the Transcriptomic Profile and Prognostic Models. International Journal of Molecular Sciences. 2023; 24(3):2418. https://doi.org/10.3390/ijms24032418

Chicago/Turabian Style

Pudova, Elena A., Anastasiya A. Kobelyatskaya, Irina V. Katunina, Anastasiya V. Snezhkina, Maria S. Fedorova, Vladislav S. Pavlov, Ildar R. Bakhtogarimov, Margarita S. Lantsova, Sergey P. Kokin, Kirill M. Nyushko, and et al. 2023. "Lymphatic Dissemination in Prostate Cancer: Features of the Transcriptomic Profile and Prognostic Models" International Journal of Molecular Sciences 24, no. 3: 2418. https://doi.org/10.3390/ijms24032418

APA Style

Pudova, E. A., Kobelyatskaya, A. A., Katunina, I. V., Snezhkina, A. V., Fedorova, M. S., Pavlov, V. S., Bakhtogarimov, I. R., Lantsova, M. S., Kokin, S. P., Nyushko, K. M., Alekseev, B. Y., Kalinin, D. V., Melnikova, N. V., Dmitriev, A. A., Krasnov, G. S., & Kudryavtseva, A. V. (2023). Lymphatic Dissemination in Prostate Cancer: Features of the Transcriptomic Profile and Prognostic Models. International Journal of Molecular Sciences, 24(3), 2418. https://doi.org/10.3390/ijms24032418

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop