Next Article in Journal
Mutational Activation of the NRF2 Pathway Upregulates Kynureninase Resulting in Tumor Immunosuppression and Poor Outcome in Lung Adenocarcinoma
Previous Article in Journal
Angiopoietin-1 Upregulates Cancer Cell Motility in Colorectal Cancer Liver Metastases through Actin-Related Protein 2/3
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Gene Expression Monotonicity across Bladder Cancer Stages Informs on the Molecular Pathogenesis and Identifies a Prognostic Eight-Gene Signature

by
Rafael Stroggilos
1,
Maria Frantzi
2,
Jerome Zoidakis
1,
Marika Mokou
2,
Napoleon Moulavasilis
3,
Emmanouil Mavrogeorgis
1,†,
Anna Melidi
1,
Manousos Makridakis
1,
Konstantinos Stravodimos
3,
Maria G. Roubelakis
4,5,
Harald Mischak
2 and
Antonia Vlahou
1,*
1
Systems Biology Center, Biomedical Research Foundation of the Academy of Athens, Soranou Efessiou 4, 11527 Athens, Greece
2
Mosaiques Diagnostics GmbH, 30659 Hannover, Germany
3
1st Department of Urology, Laiko Hospital, National and Kapodistrian University of Athens, 11527 Athens, Greece
4
Laboratory of Biology, School of Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
5
Cell and Gene Therapy Laboratory, Biomedical Research Foundation of the Academy of Athens, Soranou Efessiou 4, 11527 Athens, Greece
*
Author to whom correspondence should be addressed.
Current address: Mosaiques Diagnostics GmbH, 30659 Hannover, Germany.
Cancers 2022, 14(10), 2542; https://doi.org/10.3390/cancers14102542
Submission received: 29 March 2022 / Revised: 14 May 2022 / Accepted: 17 May 2022 / Published: 21 May 2022

Abstract

:

Simple Summary

Gene expression monotonicity is an important feature in the evolution and progression of cancer, yet has not been investigated in bladder cancer (BLCA). Most of the transcriptomic stage investigations of BLCA are limited either to a subset of stages, or to a small number of samples. Here, we leverage publicly available data to create a meta-dataset of 1135 primary BLCA transcriptomes, to identify genes and processes with a monotonal change related to higher clinical or pathologic stages. Our analysis aims to deepen the current understanding of the disease’s molecular pathogenesis, as well as to propose a prognostic gene signature based on the trait of monotonicity. Results demonstrate tumor dependencies on specific cell-cycle and metabolic microprocesses, and highlight an eight-gene signature capable of prognosing 5-year outcomes in both the discovery and validation sets.

Abstract

Despite advancements in molecular classification, tumor stage and grade still remain the most relevant prognosticators used by clinicians to decide on patient management. Here, we leverage publicly available data to characterize bladder cancer (BLCA)’s stage biology based on increased sample sizes, identify potential therapeutic targets, and extract putative biomarkers. A total of 1135 primary BLCA transcriptomes from 12 microarray studies were compiled in a meta-cohort and analyzed for monotonal alterations in pathway activities, gene expression, and co-expression patterns with increasing stage (Ta–T1–T2–T3–T4), starting from the non-malignant tumor-adjacent urothelium. The TCGA-2017 and IMvigor-210 RNA-Seq data were used to validate our findings. Wnt, MTORC1 signaling, and MYC activity were monotonically increased with increasing stage, while an opposite trend was detected for the catabolism of fatty acids, circadian clock genes, and the metabolism of heme. Co-expression network analysis highlighted stage- and cell-type-specific genes of potentially synergistic therapeutic value. An eight-gene signature, consisting of the genes AKAP7, ANLN, CBX7, CDC14B, ENO1, GTPBP4, MED19, and ZFP2, had independent prognostic value in both the discovery and validation sets. This novel eight-gene signature may increase the granularity of current risk-to-progression estimators.

1. Introduction

Bladder cancer (BLCA) accounts for approximately 200,000 annual deaths worldwide, and had an estimated number of 570,000 new cases in 2020 [1]. It comprises a spectrum of diseases including variably recurrent low- and high-risk non-muscle-invasive (NMI) tumors, along with muscle-invasive (MI) cases characterized by poor prognosis. The disease is diagnosed pathologically according to a well-established protocol—including cystoscopy, microscopic examination of the transurethral resection of the bladder tumor material (TURBT), and urine cytology—to determine the grade of malignancy [2]. Based on the spread of the tumor within the organ, BLCA is staged as Ta when the tumor is confined within the urothelium, T1 when it spreads to the lamina propria, T2 when it invades the detrusor muscle, T3 when it grows to the peri-vesical tissue, and T4 when it spreads to the surrounding or distant tissues and organs [3]. Advances in diagnostic tools (mostly imaging technologies) and drug discovery over the years have improved patient survival and quality of life [1]. However, disease monitoring and treatment selection still rely on clinical and histological features, which is not optimal. This is well reflected in the highly heterogeneous 5-year recurrence rates (35–76%) in NMI tumors, and in the poor 5-year survival of MI and metastatic cases [4]. The suboptimal predictability of tumor recurrence leads to frequent surveillance revisits, making BLCA the most expensive malignancy to treat over the lifetime of patients [5]. An estimated 1/4 patients diagnosed with BLCA experiences financial toxicity—a term describing the mental and emotional stress related to the burden of unaffordable cancer care, which negatively affects quality of life [5,6].
In an effort to replace or enhance cytological findings, several studies have reported molecular biomarkers capable of diagnosing primary or recurrent disease in the urine samples of patients [7]. However, reliably detecting those BLCA patients who are at true risk of recurrence or progression remains an unmet challenge. Additional studies have investigated tumor tissue, which is thought to be better suited for prognosis. In a large multicenter cohort of NMI patients—particularly among those patients who had been classified as high-risk based on the traditional EORTC score table—patients with a high 12-gene expression score experienced significantly shorter progression-free survival time than those with low scores [8]. A number of studies investigating MIBC prognosis in the TCGA data on BLCA tissue have recently been published, all demonstrating the significant predictive value of molecular (omics) information from tissue [9,10,11,12,13,14].
Bladder cancer is strongly linked to environmental factors such as smoking or occupational exposure to DNA-modifying compounds [15]. At the genomic level, normal cells adjacent to dysplastic regions manifest a number of alterations on genes implicated in carcinogenesis. These are the forerunner genes (FR genes), and are often mutated or epigenetically silenced in wide areas beyond the tumor initiation site [16]. According to the combined action of field effects and clonal expansion, due to the variability in the affected FR genes among neighboring cells, some cells can acquire small advantages in growth and survival. If obtained by the progenitor cells, these properties are propagated to descendant cells, and any additional cumulative genomic abnormalities in the descendants can potentially lead to tumor initiation [17].
In BLCA, tumor initiation is believed to take place in the basal layer of the urothelium, where stem cells reside [18]. Increasing evidence suggests that there are at least two sets of altered FR genes, giving rise to dominant clones with distinct biology: the luminal and the basal types. Luminal tumors correlate with hyperactivation of the FGFR3 and AKT/PI3K/mTOR pathways, while basal tumors present with losses or inactivation of tumor-suppressor genes, such as RB1, TP53, and PTEN [4]. However, common genomic abnormalities also exist between these two entities, including mutations on chromatin-modifying genes as well as the loss of the 9q locus—events linked to the pro-tumorigenic effect of the environmental chemicals ending up in the bladder [18]. A growing body of transcriptomic investigations of the biology of luminal and basal tumors has uncovered further molecular phenotypes [19], adding to our understanding of the disease. However, due to intratumor heterogeneity, different areas within the same tumor may present with different molecular phenotypes [20], which may complicate their clinical utility.
For the study reported here, we compiled a discovery meta-cohort of 1135 BLCA microarray transcriptomes along with two RNA-Seq validation sets, and addressed the disease as a molecular continuum of alterations. Using the stage as a checkpoint variable that reflects the cumulative processes of tumor progression, we investigated how molecular processes and gene expression levels change, starting from the non-malignant adjacent urothelium (NAU), and continuing through the disease stages Ta, T1, T2, T3, and T4. Our analysis aims to enrich the current understanding of the molecular pathogenesis of BLCA, uncovering pathways whose activation progressively increases or diminishes with cancer growth, while also reporting for the first time on the gene co-expression profiles of the disease stages. Based on the hypothesis that the expression levels of the most optimal prognostic biomarkers should follow a monotonal trend with higher disease stage, we propose an eight-gene signature with the potential of prognosing 5-year survival outcomes.

2. Materials and Methods

A comprehensive data mining strategy was employed to retrieve studies applying -omics technologies in BLCA. The overall workflow is summarized in Figure 1, and described in detail in Appendix A. This analysis focused on primary, treatment-naïve tumor transcriptomes. The selection of datasets and samples eligible for the analysis, as well as the processing and integration of raw microarray data, is detailed in Appendix A. Briefly, we compiled a microarray discovery cohort from 12 well-characterized GEO datasets (GSE121711, GSE93527, E-MTAB-1940, GSE31684, GSE104922, GSE128959, GSE83586, GSE48276, GSE52219, GSE69795, GSE13507, and GSE48075). These data (summarized in Figure 1), comprised 1054 primary bladder cancer tumor transcriptomes of treatment-naïve patients without any prior cancer history, along with profiles from 81 non-malignant urothelium tissues adjacent to the tumor site (NAU), corresponding to a total of 1135 gene expression profiles. Stage distribution among the utilized datasets is shown in Table A1. Two RNA-Seq datasets were used for validation (TCGA2017 and IMvigor210), as per availability. Table 1 shows sample allocation to the clinical variables for both the discovery and validation sets. Figure 1 illustrates the overall study design and workflow. In the discovery set, the ratio of men:women was 3.5:1, with equal distribution between NMI and MI disease (p = 0.99), and similar mean age at baseline diagnosis (68 years, p = 0.81). Percentages of NAU, NMI, and MI in the dataset were 7.1%, 43.5%, and 49.4%, respectively, with the grade distribution being as follows: 16.5% low-grade and 48.8% high-grade disease, with the remaining samples lacking available grade information. Detailed histological records were missing for 71.5% of the cohort, with the most frequently reported histology among the available records being urothelial/papillary (23.3%), and squamous differentiation being the most frequent variant (1.3%).
Gene expression across the 12 GEO datasets was harmonized using the empirical Bayes approach of the ComBat algorithm. Batch effect removal was assessed with the algorithm BatchQC (Figures S1 and S2) [21], with relative log expression and principal component analysis plots (Figure S3), with comparative analysis of expression levels between housekeeping and non-housekeeping genes (Figure S4), or using a set of 11 positive BLCA markers with known regulation across non-malignant–NMI–MI or non-malignant–low-grade–high-grade disease (Figure S5). ComBat successfully eliminated batch effects (Figures S1–S3), while maintaining the biological variability (Figures S3 and S4), BatchQC reports indicated that the variability in the corrected data was explained by stage rather than by the batch variable (Figure S2), allowing for an in-depth downstream analysis. The ComBat corrected expression matrix along with the sample clinical data are available as File S1 and File S2, respectively.
To identify genes that form a continuum of changes across BLCA stages, each of the 5 disease stages was initially compared against non-malignant adjacent urothelium (NAU). A total of 3018 genes differed significantly (Mann–Whitney p < 0.05), while having the same orientation of change in all comparisons. We refer to this set of genes (n = 3108) as concordantly differentially expressed genes (CDEGs). CDEGs were utilized to infer pathway activation scores, and to create stage co-expression networks. Pathway activation scores per sample were calculated with the ssGSEA–GSVA method [22], using the molecular signature database libraries of Hallmark, Canonical Pathways (Reactome subset), C3 (GO biological processes subset), and C5 (GTRD subset of transcription factor targets). Dorothea (https://github.com/saezlab/dorothea, accessed on 16 March 2021) [23] was utilized to assess regulon activity. To further identify the subset of pathways whose activation had monotonal traits across non-malignant adjacent urothelium (NAU) and disease stages, each pathway’s activation scores across stages were compared to NAU with Mann–Whitney tests, and the direction of change was defined based on fold change (= mean of stage − mean of NAU). Monotonicity for a pathway was defined as being significantly different in all stage comparisons to NAU, and also having a continuously larger/smaller fold change with increasing stage. Stromal infiltration scores were imputed using the ESTIMATE algorithm [24].
Gene-pair co-expression weights among the 3108 CEDEGs were approximated with ensemble learning, using GENIE3 [25], while the direction of co-expression (positive/negative) was determined using Spearman’s coefficient. Out of the 3108 × 3108 = 9,659,664 gene-pair weights calculated individually per condition (i.e., NAU and 5 BLCA stages), gene-pairs with the highest GENIE3 weights, which were also positively correlated based on the Spearman’s coefficient, were used to construct networks. The cutoff for this selection was determined based on the gene size of the resulting networks; to avoid network saturation, we opted to keep their gene sizes close to half the number of CDEGS (= 1554 genes), which resulted in setting the cutoff to the top 5600 gene-pairs. Networks were constructed with igraph and were analyzed with Louvain clustering [26] to identify local modules of co-expression relationships (communities) per condition. The top five in size (= number of genes) of the communities of co-expressed genes per condition were analyzed for Gene Ontology Biological Processes with clusterProfiler [27]. Potential drug targets in the co-expression networks were defined based on the betweenness centrality metric [28], using default cutoffs (computed with igraph).
We utilized CDEGs to extract genes whose expression levels was monotonically increasing or decreasing with higher stage. Monotonicity for a gene was defined as being a CDEG and additionally having a continuously larger/smaller fold change with increasing stage (fold change, as defined in each disease stage versus NAU). Functional annotation and enrichment were performed using PubMed and the online tool GeneCards (https://ga.genecards.org/, accessed on 22 March 2021), respectively. Out of the monotonal subset, 43 genes were found to be of prognostic value (Cox univariate association with 5-year outcome). Eight of these genes, validated in the TCGA-BLCA dataset, were further utilized to construct a sample-wise scoring system—the 8-gene prognostic signature—by summing the expression values of the upregulated genes (n = 4) while subtracting the downregulated genes (n = 4). Before calculations, each of the gene expression values per sample was divided by the gene’s variation across the dataset, in order to minimize the effect of individual gene variability on the final signature score:
Si = Ai/VarA + Bi/VarB + Ci/VarC + Di/VarD − Ei/VarE − Fi/VarF − Gi/VarG − Hi/VarH
where Si denotes the sample-wise derived signature score, Ai Bi, Ci, Di, and Ei, Fi, Gi, Hi, denote the sample-wise gene expression of the 4 upregulated and 4 downregulated genes, respectively, and Var denotes the gene expression variance across the entire set of samples.
The 8-gene signature and the disease stage were further used as inputs in a multivariate Cox regression model, to identify whether they had independent prognostic value. This procedure was applied on both the discovery meta-cohort and the TCGA validation data.
Since our procedure for the detection of monotonal traits in genes and pathway activities involved multiple filtering criteria, to avoid over-elimination due to Type-II error, significance was defined at an unadjusted Mann–Whitney p < 0.05. In contrast, significance for pathway over-representation (clusterProfiler output) was determined by FDR correction at p < 0.05. Categorical variables were investigated for significance with Pearson’s chi-squared test, and were adjusted for multiple hypotheses (with the package RVAideMemoire). All reported correlation scores corresponded to Spearman’s rank coefficient. Cox proportional hazards regression was performed with the packages survminer and survival, and statistical significance was determined via the log-rank method. CIBERSORT analysis was conducted on the web platform https://cibersort.stanford.edu/, (accessed on 9 April 2021) and only samples with successful deconvolution (p < 0.05, n = 350 samples) were further used for the statistical comparisons of relative immune populations between stages. Read counts from the IMvigor data were acquired from the IMvigor210CoreBiologies, and were normalized with the variance stabilization transformation [29]. Unless stated otherwise, all processing, analyses, and visualizations were conducted in the R statistical environment (version 4.0.2).

3. Results

3.1. Pathway Activation and Gene Co-Expression Link the Cell Cycle to ECM and Metabolic Alterations

For initial assessment of the gene expression relationships between the non-malignant adjacent urothelium (referred to as NAU) and cancerous samples, we performed differential expression analysis of NAU versus NMI and NAU versus MI samples. When compared to NAU, a strong positive correlation (r = 0.83, p < 2.1 × 10−16) was observed between the two sets of log fold change values, suggesting concordance in abundance changes and directionality between NMI and MI tumors. To investigate transcriptional changes associated with increasing malignancy, we compiled genes being differentially expressed in all stages compared to NAU, also showing a persistent change—either up or down—and we further denoted them as concordantly differentially expressed genes (defined in the Materials and Methods Section; CDEGs, n = 3108). Due to their consistent regulation compared to NAU, CDEGs likely reflect fundamental alterations occurring during carcinogenesis; thus, we focused our analysis on this particular set of genes. We initially performed a ssGSEA–GSVA analysis using CDEGs, aiming to identify pathways and biological processes whose activation is continually enhanced or diminished throughout the disease stages. With increasing disease stage, our results indicated gradually stronger activation of several mitotic processes, positive regulation of the canonical Wnt pathway, mTORC1 signaling, expression of MYC targets, degradation of anaphase inhibitors, metabolism of nucleotides, mobility of formins, and the TNFR2/non-canonical NF-κB pathway (Figure 2, Table S1). Conversely, diminished activity was recorded for the lipid and fatty acid catabolic processes, for the metabolism of heme and, interestingly, for the circadian clock process (Figure 2). Regulon activity per sample was additionally estimated, and respective scores between disease stages and normal tissue were compared. This analysis highlighted the GATA3 and GLI2 regulons, whose activity was significantly diminished with increasing malignancy (Figure 2).
To further investigate co-expression alterations occurring in the disease stages, an integrated network–pathway analysis was performed. Stage-specific networks were constructed and clustered to identify communities (subnetworks) of co-expressed genes (described in methods). We analyzed the five largest communities (based on the number of genes) per disease stage and NAU, and used the Gene Ontology—Biological Process (GO–BP) library to identify affected molecular processes (Figure 3, Tables S2–S7). The analysis revealed large differences in gene co-expression between NAU and disease stages, with four out of the five largest communities clearly associated with specific biological processes.
Three out of the top five communities were consistently detected in all BLCA stage networks. Based on examination of their enriched processes, these were labeled as (1) the cell-cycle community, (2) the ECM and developmental community, and (3) the metabolic and translational community (Figure 3A,B).
The cell-cycle communities of the different tumor stages involved a total of 288 genes, 178 of which had a proliferation related GO–BP annotation. Hypergeometric tests for each of the stage networks indicated highly similar cell-cycle BPs being over-represented across stages. An excerpt of the statistically most significant BPs, along with the number of implicated genes, is presented in Figure 4B. Out of the 178 cell-cycle genes, 80 were co-expressed consistently in all stage networks. These were also upregulated in tumors compared to NAU, possibly forming the backbone of cell proliferation in BLCA (Table S8). The gene size of this community increased with increasing disease stage (Ta n = 118, T1 n = 148, and MIBC n = 168–170 genes). The communities also included genes lacking cell-cycle GO annotation (11.9% for Ta, 17.6% for T1, 21.9% for T2, 24.9% for T3, and 29.8% for T4). An over-representation test of the 110 genes lacking GO–BP annotation across the stages revealed that 20 of them participate in the metabolism of nucleotides (unadjusted p = 0.03), likely suggesting a rewiring component controlling both the regulation of proliferation and the processing of nucleotides. To detect the most relevant potential drug targets within the cell-cycle communities, we determined their betweenness centrality scores. The most prominent genes included CDC5, KIF2C, FOXM1, AURKB, CDT1, SMC4, CCNB1, RRM2, and KIF14 (Figure 4C).
The community of ECM and developmental processes encompassed a total of 291 genes, and was enriched in cell–cell communication and cell–matrix interaction processes, in responses to microenvironmental stress, and in differentiation programs of epithelial, mesenchymal, and stem cells (Figure 5B). This GO–BP composition suggests that these co-expression signals originate either from tumoral or non-tumoral cells, or might be the product of their interaction. For example, the process of extracellular matrix organization included co-expression of 15–36 genes (depending on the stage network), of which COL13A1, FGFR4, FOXF2, and SCUBE1 were co-expressed only in the NAU samples compared to the disease stages. Conversely, 26 genes, including mediators of epithelial–mesenchymal transition (i.e., COL6A1/A2, COL16A1, MFAP5, MMP11), were co-expressed in tumor tissue but not in NAU. In line with recent observations [30], we noticed that NAU presented with an active ECM remodeling profile. Sixteen of the ECM-associated genes were co-expressed both in NAU and in the NMIBC stages, including genes promoting basolateral tumor cell migration (i.e., MMP2, CTSK, PDPN), fibrotic collagens (i.e., COL1A2, COL6A3, COL14A1, COL15A1), and pro-angiogenic factors (i.e., PDGFRA, RECK), suggesting a pro-tumorigenic potential in the NAU. However, expression in the NAU was predicted to be driven by ALDH1A2 and MFAP4 (Figure 4C)—genes that are both notoriously downregulated in other genitourinary malignancies compared to normal samples [31,32], likely suggesting tumor-suppressive roles. In contrast, co-expression in the Ta stage (confined to the internal lining of the bladder) was predicted to be regulated by the hub genes COL16A1 and CLIP3. CLIP3 interacts with both AKT1 and AKT2 [33], and may therefore play an important role in the early AKT/PI3K/mTOR axis of hyperplastic carcinogenesis. A list containing the community’s betweenness centrality scores for each stage can be found in Table S9.
The community of metabolism and translation encompassed mitochondrial, translational, and multiple metabolic processes being activated during carcinogenesis, and was more profound in the T1 and more advanced tumors. Cellular respiration, translational initiation, mRNA catabolic processes, nonsense-mediated decay, and protein targeting to the ER were consistently enriched in most BLCA stages. The results highlighted a set of 12 genes commonly co-expressed across stages for these processes, including COX7B, DLD, NDUFS4, UQCRFS1, PAIP2, RPL15, RPL30, RPL7, RPS23, RPS27, RPS27A, and RPS4X (Table S10).
In addition to the abovementioned consistently detected communities in BLCA, a community enriched in processes of immune cell differentiation, cytokine secretion, and GPCR activity was identified in the NAU and the MIBC stages, and involved both innate and adaptive responses, as well as processes of immune cell adhesion and migration. This immune-associated community presented low variation in the composition of genes participating in the co-expression networks between NAU and the MIBC stages. Out of the 17 genes involved in the process of T-cell activation that were commonly co-expressed at the MIBC stages, 15 were also co-expressed in the NAU samples. To further investigate these observations, the transcriptome data per sample were deconvoluted into relative abundances of immune cell populations using CIBERSORT, and cell fractions between disease stages were compared. Significant results were obtained for the following populations: CD8+, activated CD4+, activated NK, monocytes, M2 macrophages, and activated dendritic cells (Figure S6). Our results indicated differential commitment of immune cells to NAU and the BLCA stages. NAU samples (n = 37) were significantly more infiltrated with CD8+ (p = 0.046) and monocytes (p = 4.6 × 10−4) than tumor samples (n = 313), consistent with an overrepresentation of the excluded over the inflamed phenotype, as previously seen in the IMvigor210 trial data [34]. However, compared to tumor samples, NAU samples had significantly less abundance of activated CD4+ cells (p = 5.28 × 10−3), macrophages (p = 16.8 × 10−5), activated dendritic cells (p = 0.0002), and activated NK cells (p = 0.015). Generally, NMIBC had lower infiltration burden than MIBC. Activated dendritic cells were significantly increased in Ta tumors (n = 34) compared to other BLCA stages (p = 0.024). The abundance of CD8+, activated NK cells, and M2 macrophages increased linearly with higher malignancy. Interestingly, AIF1—a gene that promotes macrophage survival and M2 polarization [35]—was predicted to be a driver of immune co-expression in the T4 tumors. Along with the CIBERSORT results, which indicate a greater abundance of M2 macrophages in the T4 samples, we hypothesize that AIF1 is actively involved in the immune suppression and, thus, its expression levels might indicate putative candidates for immune checkpoint inhibition. A list containing the community’s betweenness centrality scores for each stage can be found in Table S11.

3.2. Monotonicity in Individual Genes, Prognostic Signatures, and Validation

Using the 3108 CDEGs, we extracted genes with a monotonal (i.e., continuously increasing or decreasing) change in expression in the spectrum NAU–Ta–T1–T2–T3–T4. A total of 157 genes was identified with the trait of monotonicity, of which 118 were up- and 39 were downregulated with increasing stage (Figure 4, Table S12). Functional analysis revealed that for 46 of these genes, experimental evidence on mediating cell-cycle progression exists (Table S12). Upregulated cell-cycle-associated genes (n = 44) were not phase-specific, and included cyclins, DNA polymerases, regulators of the cohesin complex, and kinetochore components. The list also included 23 genes involved in signal transduction (Table S12), 6 of which (ARHGAP11A, AURKA, CDKN3, PBK, PLK1, and RRM2) promote cell-cycle progression, and were all upregulated with increased stage. The data also indicated an overactivation of the Wnt pathway with increasing disease stage, with its upstream inhibitor APCDD1 being downregulated and its activating ligand WNT2 upregulated (Table S12). In total, 14 of the 157 genes were transcriptional or translational regulators (Table S12), including genes with known upregulation in bladder cancer (e.g., the transcription factors E2F1 and DEPDC1 [36,37]). Based on the monotonal changes with higher stage, increased androgen receptor activity may be predicted, as both its translational enhancer BUD31 [38] and its downstream transcription factor ELK1 [39] were upregulated. In total, 4 of the 157 genes (HTR2C, LRP8, NENF, and NMU) are involved in neurotransmission or neuronal development, all of which were upregulated (Table S12). Among the 157 genes, 21 were of poorly described or unknown function (Table S12), including the oncogenic factor TRIM65 [40], which was found to be upregulated with increasing bladder cancer stage. Further functional enrichment using GeneCards for the 157 genes verified their involvement in cell-cycle pathways, with the top hits being related to the regulation of the anaphase-promoting (APC) complex (score = 31.53), to PLK1 (score = 24.47) and Aurora B (score = 20.95) signaling, and to TP53 (score = 19.06) and RB1 (score = 17.85) cell-cycle checkpoint control (Figure 4C, Table S13). Univariate Cox regression analysis indicated 48 genes with potentially prognostic impact at p < 0.01 (Table S14).
Due to the lack of an RNA-Seq dataset comprising the entire disease stage spectrum of BLCA incidents, the observed stage alterations in the discovery set were investigated for their reproducibility in the TCGA-BLCA RNA-Seq data [41]. To align the validation samples to the discovery set, patients with unknown history of prior treatment for non-muscle-invasive bladder cancer, as well as patients with a history of other malignancies, were excluded. Differential expression analysis between the available stage comparisons (T3 vs. T2 and T4 vs. T3) in the TCGA data validated 43 of the 157 monotonal genes (Table S15). Cox regression analysis in the TCGA data validated 8 out of the 48 monotonal genes that were found to be of prognostic value in the discovery set (Table S14), including MED19, ENO1, ANLN, and GTPBP4, higher levels of which were associated with worse survival, and CBX7, ZFP2, AKAP7, and CDC14B higher levels of which were associated with better survival probability (Figure 5A). We utilized these eight genes to construct a combined score characterizing each individual sample (see the Materials and Methods Section). Along with the disease stage, the eight-gene signature had independent prognostic value, both in the discovery and in the validation set (Figure 5B). Specifically, we constructed a survival model to compare 5-year survival rates between those with high and low eight-gene signature scores (defined by a median cutoff). Patients with a high eight-gene signature score had a worse 5-year survival probability in the discovery set, and this finding was validated in the TCGA data (Figure 5C). The eight-gene signature did not differ significantly between males and females (p = 0.36), and was weakly associated with age and stage (Figure S7), suggesting its independent value with respect to other clinical variables. In an attempt to verify the co-expression analysis findings, stage-specific co-expression networks were also created using the TCGA data, and were clustered using the Louvain algorithm. GO–Biological Process analysis of the communities validated the differential segregation of the cell cycle, extracellular matrix, and immune activation processes to distinct communities (Figure S8). In order to validate the value of AIF1 as a candidate biomarker of response to immunotherapy, we analyzed RNA-Seq data from the IMvigor210 study—a trial investigating responses to atezolizumab immunotherapy in patients with metastatic BLCA. High AIF1 expression in the IMvigor data was associated with a complete response to atezolizumab (Figure 5D).

4. Discussion

Integration of BLCA molecular data has been previously performed in the context of characterizing molecular subtypes [42] or validating results of either (single-cell) scRNA-Seq [43] or RNA-Seq re-analysis [44]. In this study, we performed an integrated meta-analysis of datasets from non-tumor-bearing adjacent urothelium and BLCA stages, aiming to identify continuous as well as concerted gene expression alterations with increasing malignancy. To our knowledge, this is the first attempt to associate molecular alterations with clinical classification based on the analysis of more than 1000 well-characterized primary tumor datasets. Instead of focusing on molecular subtypes, we increased the power and addressed the disease as a continuum under the assumption that individual samples reflect different snapshots of the whole process. Our novel design, based on the hypothesis of continuous evolution through the stages, was successfully applied here, and resulted in novel findings on gene regulation associated with cancer progression.
Starting from a normalized expression dataset comprising 12 microarray cohorts, we identified genes that were differentially expressed between disease stages and NAU (CDEGs), and further analyzed them for pathways/processes that were progressively altered with higher stage, for changes in the co-expression profiles of stages, and for genes showing a monotonal change in expression with higher stage. Our analysis highlighted expected landmark pathways, such as the mTORC1 pathway [45] and MYC targets [46], which were upregulated, but also novel downregulated pathways, such as the circadian clock and the metabolism of heme. These results, for the first time, associate BLCA progression with the disruption of the circadian homeostasis, and with iron metabolism deficiencies—events that are thought to be tumorigenic [47,48], but their exact mechanism of action is not well understood. In addition to the GATA3 regulon—a known driver of luminal biology—we found a novel progressive downregulation of the GLI2 regulon. GLI proteins are transcription factors of the Sonic hedgehog (Shh) pathway, and although GLI2 expression levels are positively correlated with more invasive BLCA cell lines, Shh genes do not behave accordingly [49]. Our results validate these observations, as the entire regulon of the GLI2 TF was inactivated with increasing BLCA malignancy, suggesting no potential therapeutic effect of its inhibition.
Tumor initiation in the bladder is thought to occur within the basal layers of the urothelium, when the accumulated burden of mutations dysregulates cells’ homeostatic mechanisms, favoring uncontrolled proliferation over apoptosis [18]. The process of initiation is lengthy in time, and affects the entire neighborhood of the adjacent cells, which are continuously exposed to a pro-tumorigenic environment. Here, we found that most of the alterations in the non-tumor-bearing adjacent cells involve genes operating during embryogenesis or during ECM remodeling (Community 3, Table S2). These are likely among the first to acquire an organized pattern of co-expression. Interestingly, co-expression in NAU tissue was driven mostly by ALDH1A2 (and partly by MFAP4), which catalyzes the formation of retinoic acid (RA). In the progenitor cells, during embryonic development, receptors of RA form complexes with chromatin modifiers, leading to the activation of self-renewal and differentiation programs [50]. These data give rise to the hypothesis that the cells in the NAU may express and maintain parts of a stem-cell-like RA-related program. Indeed, the biological process of response to RA appears to be enriched in the ECM communities of NAU (p = 4.57 × 10−5) and the T2 (p = 1.12 × 10−2) stage. T2 tumors are far more de-differentiated in comparison to both NAU and NMBIC, and can host multiple genetic differentiation components [41].
The immune activation community was present in both the NAU and MIBC samples. The results of the CIBERSORT analysis showed that within the bladder tumors, and with increasing stage, most monocytes preferentially differentiate into macrophages with M2 polarization. This is consistent with the findings of Chen et al. [43], who analyzed scRNA-Seq data of BLCA patients, and observed a similar pattern of differentiation for monocytes. In our data, T4 samples had the highest proportion of M2 macrophages, which could partially explain their immunosuppressive state. A novel finding here is the observation that AIF1 appears to drive co-expression in the immune cells of T4 tumors. Interestingly, high AIF1 expression was associated with complete response to atezolizumab (Figure 5D)—a finding that may have implications for patient selection for immunotherapy. Together with the observation that AIF1 is highly expressed in macrophages responding to M2 stimulation [51], the data suggest that AIF1 expression in the M2 macrophages could potentially trigger a PD-L1 signature in the tumor and the surrounding immune cells, leading to immune suppression [52], but further work is required to validate these preliminary observations.
We specifically searched for genes showing a monotonal trend in their expression level with increasing disease stage, as this property may mark those genes whose quantification could offer additive prognostic value. Out of the 157 identified monotonal genes, almost half of them were components of the cell-cycle machinery, kinases signaling positively for it, or transcription factors responsible for the expression of cell-cycle genes. In total, 8 out of the 48 monotonal genes with prognostic value in the discovery cohort were validated in independent RNA-Seq data, and were utilized to develop a sample-wise gene signature. Of these, only ENO1 (higher levels of which had been previously linked with worse BLCA outcomes [53]) and CBX7 (downregulation of which was associated with worse survival [54]) were described previously; the remaining six genes associated with survival are novel. MED19, a component of the mediator complex that regulates the transcription of RNA polymerases, was found via IHC to be overexpressed in human BLCA compared to normal tissues, and its knockdown in the T24 and 5637 bladder cell lines resulted in cell-cycle arrest at the G0/G1 checkpoint, along with attenuation of cell growth [55]. The involvement of GTPBP4 in BLCA’s development has not been characterized, but oncogenic properties have been attributed to this gene in hepatocellular carcinoma [56]. ANLN, AKAP7, and CDC14B are thought to regulate bladder cell growth and apoptosis in a TP53-independent manner [57]. ICA1L is naturally expressed in sperm cells. Its role in BLCA has not yet been described. The CDC14B gene is located on the 9q chromosome—a region that is often deleted in BLCA. This might also explain its overall downregulation in malignancy in comparison to NAU, with additional mechanisms (such as the increasing number of tumor cells) resulting in the observed further downregulation with increasing stage, as observed in the discovery set. CDC14B is believed to dephosphorylate TP53 [58], but the functional consequences on the mitotic or DNA damage repair pathways are not yet well clarified [59]. CBX7 is a component of the chromatin-modifier PRC1 complex, and is required for the propagation of the transcriptionally repressive state of multiple genes through cell division during embryonic development [60], including Hox genes [61]. Expectedly, while CBX7 levels monotonically decreased, we noted that HOXC6 and HOXC9 were both monotonically upregulated with increasing malignancy (Table S1). ZFP2 is a probable transcription factor, and evidence suggests that it plays an epigenetic role as well [62]. High loads of mutations in ZFP36—another member of the ZFP family—were associated with upper tract urothelial carcinoma [63].
Our study has its limitations, including the retrospective nature of the analysis, and restrictions in the validation set imposed by a lack of samples from all disease stages, which prevented validation of some of our observations—particularly for NMIBC. Clinical stage assignment is known to have varying rates of error. However, the high number of samples used in each of the stage categories is expected to balance out the error to some extent, while increasing the power of the received results. It should also be noted that our scope was to identify, if present, any common “core” molecular themes with high statistical power during the evolution of BLCA. This does not rule out the existence of intra-tumoral heterogeneity, which still has to be considered (together with the observed molecular changes, as defined in our study) when predicting therapeutic response.

5. Conclusions

The progression of BLCA is associated with monotonal alterations both in individual genes and in molecular pathways, and these alterations can be leveraged as prognosticators, as well as for the development of novel combinatorial drug therapies. The suggested eight-gene signature may help in decision making with regards to therapeutic options and surveillance schedules. However, further validation of the signature in clinical trials is required before clinical implementation.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers14102542/s1, Figure S1: Batch effect in the non-corrected data. (A) Variation explained by clinical stage and dataset ID. (B) Fraction of genes affected, by batch. (C) Heatmap of Pearson’s correlation coefficients between sample pairs. (D) First two principal components showing sample relationships; Figure S2: Batch effect in the corrected data. (A) Variation explained by clinical stage and dataset ID. (B) Fraction of genes affected by batch. (C) Heatmap of Pearson’s correlation coefficients between sample pairs. (D) First two principal components showing sample relationships; Figure S3: Gene expression and sample distribution in the corrected data. (A) Gene expression distribution (y-axis) among the 1135 samples (x-axis) in the discovery cohort, after ComBat normalization. (B) A 3D representation of the principal component analysis results, showing sample allocation in the discovery cohort after ComBat normalization; Figure S4: Mean expression and median absolute deviation of the ComBat data. Plots demonstrating preservation of the housekeeping properties in the adjusted data, i.e., higher mean expression and lower dispersion when compared to non-housekeeping genes. Left: housekeeping genes defined based on microarray analysis of several cancers. Right: housekeeping genes defined based on RNA-Seq analysis of several cancers; Figure S5: Gene expression of 11 known BLCA markers among clinical conditions in the ComBat data; Figure S6: CIBERSORT analysis of the ComBat data, showing statistically significant results on the relative proportions of immune cells between BLCA stages; Figure S7: Correlation analysis between the 8-gene signature score and the age (left) and stage of the patients (right); Figure S8: Functional annotation of the top 5 largest molecular communities of co-expressed genes detected in the T2, T3, and T4 stages of the TCGA cohort, demonstrating a significant overlap with the cell-cycle, extracellular matrix, and immunity communities of the discovery data; Table S1: Processes, pathways and regulons that follow a monotonically increasing/decreasing activation with increasing Bladder Cancer stage (NAU = Non-malignant Adjacent Urothelium); Table S2: Gene Ontology Biological Processes for the Non-malignant Adjacent Urothelium (NAU) communities of coexpressed genes; Table S3: Gene Ontology Biological Processes for the Ta stage communities of coexpressed genes; Table S4: Gene Ontology Biological Processes for the T1 stage communities of coexpressed genes; Table S5: Gene Ontology Biological Processes for the T2 stage communities of coexpressed genes; Table S6: Gene Ontology Biological Processes for the T3 stage communities of coexpressed genes; Table S7: Gene Ontology Biological Processes for the T4 stage communities of coexpressed genes; Table S8: Genes of the cell-cycle community and their betweeness centrality scores; Table S9: Genes of the ECM and developmental community and their betweeness centrality scores; Table S10: Genes of the metabolism and translation community and their betweeness centrality scores; Table S11: Genes of the immune community and their betweeness centrality scores; Table S12: Functionanl analysis of the 157 monotonically changing genes based on searches in Pubmed and GeneCards; Table S13: Functional analysis results for implicated pathways of the 157 monotonically changing genes using the GeneCards tool and database; Table S14: Univariate cox regression analysis of the 157 monotonal genes for the discovery and the TCGA validation cohorts. In green are highlighted genes who were validated with cox in the TCGA data; Table S15: Results of differential expression analysis (Mann-Whitney) between bladder cancer stages using only primary samples (70 T2, 90 T3, and 28 T4) from the TCGA Bladder Cancer 2017 cohort. Blue and red color indicate downregulation and upregulation, respectively; File S1: ComBat_expression_data; File S2: Clinical_data.

Author Contributions

H.M., R.S., M.F. and A.V. conceived and designed the investigation; R.S., M.F., J.Z., J.Z., N.M. and E.M. processed and analyzed the data; R.S. and E.M. wrote the software for analysis and visualizations; R.S. and A.M. developed the R package for the replication of findings; R.S., M.F., J.Z., M.G.R., M.M. (Marika Mokou), M.M. (Manousos Makridakis), N.M., H.M. and A.V. interpreted the results; R.S. wrote the manuscript; M.M. (Marika Mokou), M.G.R., H.M., J.Z., K.S. and A.V. provided critical revisions. All authors have read and agreed to the published version of the manuscript.

Funding

Marika Mokou is supported by the European Union’s Horizon-2020 Marie Sklodowska-Curie Research and Innovation Program H2020-MSCA-IF-2019, ReDrugBC (Grant agreement ID: 898260). Emmanouil Mavrogeorgis is supported by the European Union’s Horizon-2020 Marie Sklodowska-Curie Research and Innovation Program, STRATEGY-CKD (Grant agreement ID: 860329).

Institutional Review Board Statement

The study is a meta-analysis of publicly available data and thus did not require ethical approval.

Informed Consent Statement

Patient consent was waived due to the data being already publicly available in online repositories. Individual datasets used here, are all accompanied with their respective patient consent, which can be found in the original publications.

Data Availability Statement

This study used publicly available data. The raw microarrays used here are publicly available in the Gene Expression Omnibus under the accession codes GSE121711, GSE93527, GSE31684, GSE104922, GSE128959, GSE83586, GSE48276, GSE52219, GSE69795, GSE13507, and GSE48075, and in ArrayExpress under the accession code E-MTAB-1940. TCGA-BLCA-2017 molecular and clinical data were downloaded from cBioPortal. IMvigor molecular and clinical data were acquired through the IMvigor210CoreBiologies R package. The processed, ComBat-normalized matrix (discovery set) and the sample clinical data are available as Supplements. The codes used to generate the results and figures are available upon reasonable request.

Conflicts of Interest

Harald Mischak is the co-founder and co-owner of Mosaiques Diagnostics. Maria Frantzi, Marika Mokou, and Emmanouil Mavrogeorgis are employed by Mosaiques Diagnostics. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

Appendix A.1. Database Search Strategy, Cohorts, and Processing

To perform a comprehensive investigation of available molecular data, we searched for bladder cancer (BLCA) omics studies in public repositories. All genomic urothelial cancer data from cBioPortal (including The Cancer Genome Atlas) were downloaded. Gene Expression Omnibus (GEO) was queried for transcriptomics, additional genomics, or protein array datasets using the search terms “bladder cancer” and “urothelial carcinoma”. We also queried ArrayExpress using the special filter “Array express data only” to obtain any additional datasets missing from GEO. All cohort data published or updated between 2010 and 2020, annotated as Homo sapiens, coming from tissue samples with sample size > 10, were retrieved. All used datasets were published and downloaded anonymized. This resulted in the collection of 105 datasets, comprising more than 8000 individuals (Figure 1, main text), encompassing genomics, methylomics, transcriptomics, and proteomics data, derived from a variety of technologies used in the field.

Appendix A.2. Inclusion and Exclusion Criteria

We analyzed tumor transcriptomes and selected studies with at least clinical or pathological stage information per subject. As chemotherapy is known to significantly alter the molecular phenotype of the tumor cells, samples or datasets collected after the administration of (neo)adjuvant chemotherapy, as well as secondary or recurrent bladder cancers, were excluded. To compile a microarray discovery set while preserving as high integrity as possible, we chose microarray data quantified by the most frequently used single-color channel vendors (Affymetrix and Illumina). This resulted in a final dataset of 1135 patients sourced from 12 different studies (Table A1). We used the TCGA-BLCA-2017 and the IMvigor210 studies for validation purposes, and particularly analyzed primary BLCA samples collected prior to the administration of (neo)adjuvant chemotherapy (n samples: TCGA = 188, IMvigor = 132).

Appendix A.3. Cohorts

Discovery (Microarray, Datasets = 12, Samples = 1135)
  • GSE121711 [64]: Loras A, et al., 2019. Cancers (Basel): Study of the metabolome and transcriptome of bladder cancer patients, aiming to identify tissue and urinary metabolic signatures as biomarkers of BC. Contains 8 tumoral and 10 normal urothelium samples taken from regions adjacent to the tumor (NAU).
  • GSE93527 [65]: Hurst CD, et al., 2017. Cancer Cell: Subtyping of Ta-staged non-muscle-invasive bladder cancer patients based on copy number alterations, and subsequent expression profiling of 79 samples.
  • E-MTAB-1940 [66]: Biton A, et al., 2014. Cell Reports: Characterization of the luminal and basal muscle-invasive bladder cancer subtypes based on an independent component analysis of 86 transcriptomes.
  • GSE31684 [67]: Riester M, et al., 2012. Clinical Cancer Research: A study for the identification and validation of high-risk bladder cancer. It included 93 samples, taken from patients undergoing radical cystectomy (RC) at the Memorial Sloan-Kettering Cancer Center (MSKCC), of which 3 were collected post-treatment and were excluded from the analysis.
  • GSE104922 [68]: Therkildsen C, et al., 2018. Molecular Oncology: Identification of molecular profiles in 41 bladder cancer patients with Lynch syndrome.
  • GSE128959 [69]: Sjödahl G, et al., 2020. International Journal of Cancer: An analysis of multiple samples taken from 73 patients upon recurrence/progression, investigating transcriptional alterations and subtype shifts during disease development. Expression profiles are available for 55 bladder tumors, of which 2 were excluded due to missing stage information.
  • GSE83586 [70]: Sjödahl G, et al., 2017. The Journal of Pathology: This study utilizes tumor samples from 307 advanced bladder cancer patients in order to compare molecular classification to tumor cell phenotype (the latter assessed via immunohistochemistry). We excluded six samples due to the absence of stage information.
  • GSE48276 [71]: Choi W, et al., 2014. Cancer Cell: The MDA molecular classification of muscle-invasive bladder cancer. The GEO dataset contains 116 expression profiles, 16 of which were collected post-treatment and, thus, excluded. An additional set of 20 profiles had no information on the timing of sample collection, and were also excluded.
  • GSE52219 [71]: Choi W, et al., 2014. Cancer Cell: The MDA molecular classification of muscle-invasive bladder cancer. The GEO dataset contains 23 FFPE pretreatment tumors from a phase III clinical trial, which were used to validate the MDA classification.
  • GSE69795 [72]: McConkey, et al., 2016. European Urology: Investigation of gene expression signatures predicting response to methotrexate, vinblastine, doxorubicin, and cisplatin with bevacizumab. The GEO dataset contains 61 profiles, of which 23 and 1 were excluded due to post-treatment collection and the absence of stage information, respectively.
  • GSE13507 [73]: Kim WJ, et al., 2010. Journal of Clinical Oncology: Analysis of primary bladder cancer for the identification of a prognostic gene expression classifier. The GEO dataset contains 256 profiles, of which 9 are normal bladders, 58 are NAU, 1 profile is taken from a C57BL/6 mouse sub-strain, 165 are primary tumor samples, and 23 correspond to recurrences. Due to the low number of normal bladder samples represented in the compiled discovery cohort, we grouped them together with the NAU samples. Samples collected after recurrence (n = 23), as well as the expression profile of the murine strain, were excluded from analysis.
  • GSE48075 [71]: Choi W, et al., 2014. Cancer Cell: The MDA molecular classification of muscle-invasive bladder cancer. It includes 142 primary tumors of both non-muscle-invasive and muscle-invasive bladder cancer.
Validation (RNA-Seq, Datasets 2, Samples = 318)
  • TCGA-BLCA-2017 [42]: Robertson AG, et al., 2017. Cell: The TCGA molecular subtyping of muscle-invasive bladder cancer. Molecular and clinical data were downloaded from cBioPortal, containing 412 tumor samples. Of these, 109 had history of other malignancies, and were excluded as potentially being secondary bladder cancers. Additionally, 105 samples had unknown history of prior treatment for non-muscle-invasive bladder cancer, 2 samples were collected post-NAC, and 8 more samples had no stage information, so they were all excluded from further analysis (n analyzed = 186 samples).
  • IMvigor210 [34]: Mariathasan S, et al., 2018. Nature: A study investigating molecular determinants of tumor response to atezolizumab immunotherapy, administered to metastatic bladder cancer patients. Transcriptomic profiles are available on GitHub (R package “IMvigor210CoreBiologies”) for 298 patients, from which we analyzed only samples collected pre-NAC (n = 132 samples).
Table A1. Stage distribution across the 12 microarray datasets used for the discovery set.
Table A1. Stage distribution across the 12 microarray datasets used for the discovery set.
TechnologyPlatformDataset IDNAUTaT1T2T3T4Study Size
AffymetrixGPL17586GSE12171110323 18
AffymetrixGPL17586GSE93527 79 79
AffymetrixGPL570E-MTAB-194044141 86
AffymetrixGPL570GSE31684 51016411890
AffymetrixGPL6244GSE104922 197105041
AffymetrixGPL6244GSE128959 1339 53
AffymetrixGPL6244GSE83586 134424111301
IlluminaGPL14951GSE48276 2623637
IlluminaGPL14951GSE52219 157123
IlluminaGPL14951GSE69795 3727 37
IlluminaGPL6102GSE13507672380311912232
IlluminaGPL6947GSE48075 333442238142
Stage size81229262371146461135

Appendix A.4. Integration and Processing of the Transcriptomes

Pretreatment, raw microarray profiles from the 12 GEO datasets were summarized to the gene level with the package oligo [74]. Affymetrix arrays were normalized via the RMA method, while Illumina arrays were filtered based on detection p values < 0.05, followed by quantile normalization, the addition of 1, and transformation to the natural logarithmic scale. Data were annotated using biomaRt (v2.42.1) [75], and microarray probes matching to multiple genes were excluded from downstream analysis. The probe with the highest mean across arrays was selected as representative in cases where multiple probes matched to the same gene. To completely avoid missing values, merging of expression matrices was based on the intersection of genes between datasets, using the HUGO Gene Symbol as a key. Batch effects across different studies in the discovery data were removed with ComBat [76], and the quality of the corrected data was assessed with BatchQC [21] (Figures S1 and S2), with boxplots of expression distribution per sample and principal component analysis plots of sample relationships (Figure S3), with gene expression comparisons of housekeeping genes to other genes (Figure S4) [77], and with a set of 12 positive control BLCA markers with known regulation across normal–NMI–MI or across normal–low-grade–high-grade disease (Figure S5).

References

  1. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018, 68, 394–424. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Humphrey, P.A.; Moch, H.; Cubilla, A.L.; Ulbright, T.M.; Reuter, V.E. The 2016 WHO Classification of Tumours of the Urinary System and Male Genital Organs—Part B: Prostate and Bladder Tumours. Eur. Urol. 2016, 70, 106–119. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Magers, M.J.; Lopez-Beltran, A.; Montironi, R.; Williamson, S.R.; Kaimakliotis, H.Z.; Cheng, L. Staging of bladder cancer. Histopathology 2019, 74, 112–134. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Knowles, M.A.; Hurst, C.D. Molecular biology of bladder cancer: New insights into pathogenesis and clinical diversity. Nat. Rev. Cancer 2014, 15, 25–41. [Google Scholar] [CrossRef]
  5. Messing, E.M. Financial Toxicity of Having Bladder Cancer. Bladder Cancer 2018, 4, 351–352. [Google Scholar] [CrossRef] [Green Version]
  6. Casilla-Lennon, M.M.; Choi, S.K.; Deal, A.M.; Bensen, J.T.; Narang, G.; Filippou, P.; McCormick, B.; Pruthi, R.; Wallen, E.; Tan, H.J.; et al. Financial Toxicity among Patients with Bladder Cancer: Reasons for Delay in Care and Effect on Quality of Life. J. Urol. 2018, 199, 1166–1173. [Google Scholar] [CrossRef]
  7. Batista, R.; Vinagre, N.; Meireles, S.; Vinagre, J.; Prazeres, H.; Leao, R.; Maximo, V.; Soares, P. Biomarkers for Bladder Cancer Diagnosis and Surveillance: A Comprehensive Review. Diagnostics 2020, 10, 39. [Google Scholar] [CrossRef] [Green Version]
  8. Dyrskjot, L.; Reinert, T.; Algaba, F.; Christensen, E.; Nieboer, D.; Hermann, G.G.; Mogensen, K.; Beukers, W.; Marquez, M.; Segersten, U.; et al. Prognostic Impact of a 12-gene Progression Score in Non-muscle-invasive Bladder Cancer: A Prospective Multicentre Validation Study. Eur. Urol. 2017, 72, 461–469. [Google Scholar] [CrossRef]
  9. Wang, L.; Wang, Y.; Wang, J.; Li, L.; Bi, J. Identification of a Prognosis-Related Risk Signature for Bladder Cancer to Predict Survival and Immune Landscapes. J. Immunol. Res. 2021, 2021, 3236384. [Google Scholar] [CrossRef]
  10. Dong, B.; Liang, J.; Li, D.; Song, W.; Song, J.; Zhu, M.; Zhao, S.; Ma, Y.; Yang, T. Identification of a Prognostic Signature Associated With the Homeobox Gene Family for Bladder Cancer. Front. Mol. Biosci. 2021, 8, 688298. [Google Scholar] [CrossRef]
  11. Qu, G.; Liu, Z.; Yang, G.; Xu, Y.; Xiang, M.; Tang, C. Development of a prognostic index and screening of prognosis related genes based on an immunogenomic landscape analysis of bladder cancer. Aging 2021, 13, 12099–12112. [Google Scholar] [CrossRef] [PubMed]
  12. Deng, Y.; Hong, X.; Yu, C.; Li, H.; Wang, Q.; Zhang, Y.; Wang, T.; Wang, X. Preclinical analysis of novel prognostic transcription factors and immune-related gene signatures for bladder cancer via TCGA-based bioinformatic analysis. Oncol. Lett. 2021, 21, 344. [Google Scholar] [CrossRef] [PubMed]
  13. Cao, R.; Yuan, L.; Ma, B.; Wang, G.; Qiu, W.; Tian, Y. An EMT-related gene signature for the prognosis of human bladder cancer. J. Cell. Mol. Med. 2019, 24, 605–617. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. He, R.Q.; Zhou, X.G.; Yi, Q.Y.; Deng, C.W.; Gao, J.M.; Chen, G.; Wang, Q.Y. Prognostic Signature of Alternative Splicing Events in Bladder Urothelial Carcinoma Based on Spliceseq Data from 317 Cases. Cell. Physiol. Biochem. 2018, 48, 1355–1368. [Google Scholar] [CrossRef] [PubMed]
  15. Cumberbatch, M.G.K.; Jubber, I.; Black, P.C.; Esperto, F.; Figueroa, J.D.; Kamat, A.M.; Kiemeney, L.; Lotan, Y.; Pang, K.; Silverman, D.T.; et al. Epidemiology of Bladder Cancer: A Systematic Review and Contemporary Update of Risk Factors in 2018. Eur. Urol. 2018, 74, 784–795. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Crawford, J.M. The origins of bladder cancer. Lab. Investig. 2008, 88, 686–693. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Bajaj, J.; Diaz, E.; Reya, T. Stem cells in cancer initiation and progression. J. Cell Biol. 2019, 219, e201911053. [Google Scholar] [CrossRef]
  18. Czerniak, B.; Dinney, C.; McConkey, D. Origins of Bladder Cancer. Annu. Rev. Pathol. 2016, 11, 149–174. [Google Scholar] [CrossRef]
  19. Fong, M.H.Y.; Feng, M.; McConkey, D.J.; Choi, W. Update on bladder cancer molecular subtypes. Transl. Androl. Urol. 2020, 9, 2881–2889. [Google Scholar] [CrossRef]
  20. Warrick, J.I.; Sjodahl, G.; Kaag, M.; Raman, J.D.; Merrill, S.; Shuman, L.; Chen, G.; Walter, V.; DeGraff, D.J. Intratumoral Heterogeneity of Bladder Cancer by Molecular Subtypes and Histologic Variants. Eur. Urol. 2019, 75, 18–22. [Google Scholar] [CrossRef]
  21. Manimaran, S.; Selby, H.M.; Okrah, K.; Ruberman, C.; Leek, J.T.; Quackenbush, J.; Haibe-Kains, B.; Bravo, H.C.; Johnson, W.E. BatchQC: Interactive software for evaluating sample and batch effects in genomic data. Bioinformatics 2016, 32, 3836–3838. [Google Scholar] [CrossRef] [Green Version]
  22. Hanzelmann, S.; Castelo, R.; Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-Seq data. BMC Bioinform. 2013, 14, 7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Garcia-Alonso, L.; Holland, C.H.; Ibrahim, M.M.; Turei, D.; Saez-Rodriguez, J. Benchmark and integration of resources for the estimation of human transcription factor activities. Genome Res. 2019, 29, 1363–1375. [Google Scholar] [CrossRef] [Green Version]
  24. Yoshihara, K.; Shahmoradgoli, M.; Martinez, E.; Vegesna, R.; Kim, H.; Torres-Garcia, W.; Trevino, V.; Shen, H.; Laird, P.W.; Levine, D.A.; et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 2013, 4, 2612. [Google Scholar] [CrossRef] [PubMed]
  25. Huynh-Thu, V.A.; Irrthum, A.; Wehenkel, L.; Geurts, P. Inferring Regulatory Networks from Expression Data Using Tree-Based Methods. PLoS ONE 2010, 5, e12776. [Google Scholar] [CrossRef] [PubMed]
  26. Blondel, V.D.; Guillaume, J.L.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef] [Green Version]
  27. Yu, G.; Wang, L.G.; Han, Y.; He, Q.Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS J. Integr. Biol. 2012, 16, 284–287. [Google Scholar] [CrossRef]
  28. Duron, C.; Pan, Y.; Gutmann, D.H.; Hardin, J.; Radunskaya, A. Variability of Betweenness Centrality and Its Effect on Identifying Essential Genes. Bull. Math. Biol. 2019, 81, 3655–3673. [Google Scholar] [CrossRef]
  29. Anders, S.; Huber, W. Differential expression analysis for sequence count data. Genome Biol. 2010, 11, R106. [Google Scholar] [CrossRef] [Green Version]
  30. Wullweber, A.; Strick, R.; Lange, F.; Sikic, D.; Taubert, H.; Wach, S.; Wullich, B.; Bertz, S.; Weyerer, V.; Stoehr, R.; et al. Bladder Tumor Subtype Commitment Occurs in Carcinoma In Situ Driven by Key Signaling Pathways Including ECM Remodeling. Cancer Res. 2021, 81, 1552–1566. [Google Scholar] [CrossRef]
  31. Yang, J.; Song, H.; Chen, L.; Cao, K.; Zhang, Y.; Li, Y.; Hao, X. Integrated analysis of microfibrillar-associated proteins reveals MFAP4 as a novel biomarker in human cancers. Epigenomics 2019, 11, 5–21. [Google Scholar] [CrossRef] [Green Version]
  32. Kim, H.; Lapointe, J.; Kaygusuz, G.; Ong, D.E.; Li, C.; van de Rijn, M.; Brooks, J.D.; Pollack, J.R. The retinoic acid synthesis gene ALDH1a2 is a candidate tumor suppressor in prostate cancer. Cancer Res. 2005, 65, 8118–8124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Ding, J.; Du, K. ClipR-59 interacts with Akt and regulates Akt cellular compartmentalization. Mol. Cell. Biol. 2009, 29, 1459–1471. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Mariathasan, S.; Turley, S.J.; Nickles, D.; Castiglioni, A.; Yuen, K.; Wang, Y.; Kadel, E.E., III; Koeppen, H.; Astarita, J.L.; Cubas, R.; et al. TGFbeta attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature 2018, 554, 544–548. [Google Scholar] [CrossRef] [PubMed]
  35. Egana-Gorrono, L.; Chinnasamy, P.; Casimiro, I.; Almonte, V.M.; Parikh, D.; Oliveira-Paula, G.H.; Jayakumar, S.; Law, C.; Riascos-Bernal, D.F.; Sibinga, N.E.S. Allograft inflammatory factor-1 supports macrophage survival and efferocytosis and limits necrosis in atherosclerotic plaques. Atherosclerosis 2019, 289, 184–194. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Lee, S.R.; Roh, Y.G.; Kim, S.K.; Lee, J.S.; Seol, S.Y.; Lee, H.H.; Kim, W.T.; Kim, W.J.; Heo, J.; Cha, H.J.; et al. Activation of EZH2 and SUZ12 Regulated by E2F1 Predicts the Disease Progression and Aggressive Characteristics of Bladder Cancer. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 2015, 21, 5391–5403. [Google Scholar] [CrossRef] [Green Version]
  37. Kanehira, M.; Harada, Y.; Takata, R.; Shuin, T.; Miki, T.; Fujioka, T.; Nakamura, Y.; Katagiri, T. Involvement of upregulation of DEPDC1 (DEP domain containing 1) in bladder carcinogenesis. Oncogene 2007, 26, 6448–6455. [Google Scholar] [CrossRef] [Green Version]
  38. Hsu, C.L.; Liu, J.S.; Wu, P.L.; Guan, H.H.; Chen, Y.L.; Lin, A.C.; Ting, H.J.; Pang, S.T.; Yeh, S.D.; Ma, W.L.; et al. Identification of a new androgen receptor (AR) co-regulator BUD31 and related peptides to suppress wild-type and mutated AR-mediated prostate cancer growth via peptide screening and X-ray structure analysis. Mol. Oncol. 2014, 8, 1575–1587. [Google Scholar] [CrossRef]
  39. Kawahara, T.; Shareef, H.K.; Aljarah, A.K.; Ide, H.; Li, Y.; Kashiwagi, E.; Netto, G.J.; Zheng, Y.; Miyamoto, H. ELK1 is up-regulated by androgen in bladder cancer cells and promotes tumor progression. Oncotarget 2015, 6, 29860–29876. [Google Scholar] [CrossRef] [Green Version]
  40. Wei, W.S.; Chen, X.; Guo, L.Y.; Li, X.D.; Deng, M.H.; Yuan, G.J.; He, L.Y.; Li, Y.H.; Zhang, Z.L.; Jiang, L.J.; et al. TRIM65 supports bladder urothelial carcinoma cell aggressiveness by promoting ANXA2 ubiquitination and degradation. Cancer Lett. 2018, 435, 10–22. [Google Scholar] [CrossRef]
  41. Robertson, A.G.; Kim, J.; Al-Ahmadie, H.; Bellmunt, J.; Guo, G.W.; Cherniack, A.D.; Hinoue, T.; Laird, P.W.; Hoadley, K.A.; Akbani, R.; et al. Comprehensive Molecular Characterization of Muscle-Invasive Bladder Cancer. Cell 2017, 171, 540–556.e25. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Tan, T.Z.; Rouanne, M.; Tan, K.T.; Huang, R.Y.; Thiery, J.P. Molecular Subtypes of Urothelial Bladder Cancer: Results from a Meta-cohort Analysis of 2411 Tumors. Eur. Urol. 2019, 75, 423–432. [Google Scholar] [CrossRef]
  43. Chen, Z.; Zhou, L.; Liu, L.; Hou, Y.; Xiong, M.; Yang, Y.; Hu, J.; Chen, K. Single-cell RNA sequencing highlights the role of inflammatory cancer-associated fibroblasts in bladder urothelial carcinoma. Nat. Commun. 2020, 11, 5077. [Google Scholar] [CrossRef] [PubMed]
  44. Chen, X.; Chen, H.; He, D.; Cheng, Y.; Zhu, Y.; Xiao, M.; Lan, H.; Wang, Z.; Cao, K. Analysis of Tumor Microenvironment Characteristics in Bladder Cancer: Implications for Immune Checkpoint Inhibitor Therapy. Front. Immunol. 2021, 12, 672158. [Google Scholar] [CrossRef]
  45. Ching, C.B.; Hansel, D.E. Expanding therapeutic targets in bladder cancer: The PI3K/Akt/mTOR pathway. Lab. Investig. 2010, 90, 1406–1414. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Mahe, M.; Dufour, F.; Neyret-Kahn, H.; Moreno-Vega, A.; Beraud, C.; Shi, M.; Hamaidi, I.; Sanchez-Quiles, V.; Krucker, C.; Dorland-Galliot, M.; et al. An FGFR3/MYC positive feedback loop provides new opportunities for targeted therapies in bladder cancers. EMBO Mol. Med. 2018, 10, e8163. [Google Scholar] [CrossRef] [PubMed]
  47. Fu, L.; Kettner, N.M. The circadian clock in cancer development and therapy. Prog. Mol. Biol. Transl. Sci. 2013, 119, 221–282. [Google Scholar] [PubMed] [Green Version]
  48. Chen, Y.; Fan, Z.; Yang, Y.; Gu, C. Iron metabolism and its contribution to cancer (Review). Int. J. Oncol. 2019, 54, 1143–1154. [Google Scholar] [CrossRef] [Green Version]
  49. Mechlin, C.W.; Tanner, M.J.; Chen, M.; Buttyan, R.; Levin, R.M.; Mian, B.M. Gli2 expression and human bladder transitional carcinoma cell invasiveness. J. Urol. 2010, 184, 344–351. [Google Scholar] [CrossRef]
  50. Ozgun, G.; Senturk, S.; Erkek-Ozhan, S. Retinoic acid signaling and bladder cancer: Epigenetic deregulation, therapy and beyond. Int. J. Cancer 2020, 148, 2364–2374. [Google Scholar] [CrossRef]
  51. Cai, H.; Zhu, X.D.; Ao, J.Y.; Ye, B.G.; Zhang, Y.Y.; Chai, Z.T.; Wang, C.H.; Shi, W.K.; Cao, M.Q.; Li, X.L.; et al. Colony-stimulating factor-1-induced AIF1 expression in tumor-associated macrophages enhances the progression of hepatocellular carcinoma. OncoImmunology 2017, 6, e1333213. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Fanelli, G.; Romano, M.; Nova-Lamperti, E.; Werner Sunderland, M.; Nerviani, A.; Scotta, C.; Bombardieri, M.; Quezada, S.A.; Sacks, S.H.; Noelle, R.J.; et al. PD-L1 signaling on human memory CD4+ T cells induces a regulatory phenotype. PLoS Biol. 2021, 19, e3001199. [Google Scholar] [CrossRef] [PubMed]
  53. Ji, M.; Wang, Z.; Chen, J.; Gu, L.; Chen, M.; Ding, Y.; Liu, T. Up-regulated ENO1 promotes the bladder cancer cell growth and proliferation via regulating beta-catenin. Biosci. Rep. 2019, 39, BSR20190503. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Huang, Z.; Yan, Y.; Zhu, Z.; Liu, J.; He, X.; Dalangood, S.; Li, M.; Tan, M.; Cai, J.; Tang, P.; et al. CBX7 suppresses urinary bladder cancer progression via modulating AKR1B10-ERK signaling. Cell Death Dis. 2021, 12, 537. [Google Scholar] [CrossRef]
  55. Zhang, H.; Jiang, H.; Wang, W.; Gong, J.; Zhang, L.; Chen, Z.; Ding, Q. Expression of Med19 in bladder cancer tissues and its role on bladder cancer cell growth. Urol. Oncol. Semin. Orig. Investig. 2012, 30, 920–927. [Google Scholar] [CrossRef]
  56. Liu, W.B.; Jia, W.D.; Ma, J.L.; Xu, G.L.; Zhou, H.C.; Peng, Y.; Wang, W. Knockdown of GTPBP4 inhibits cell growth and survival in human hepatocellular carcinoma and its prognostic significance. Oncotarget 2017, 8, 93984–93997. [Google Scholar] [CrossRef] [Green Version]
  57. da Silva, G.N.; Evangelista, A.F.; Magalhaes, D.A.; Macedo, C.; Bufalo, M.C.; Sakamoto-Hojo, E.T.; Passos, G.A.; Salvadori, D.M. Expression of genes related to apoptosis, cell cycle and signaling pathways are independent of TP53 status in urinary bladder cancer cells. Mol. Biol. Rep. 2011, 38, 4159–4170. [Google Scholar] [CrossRef]
  58. Li, L.; Ljungman, M.; Dixon, J.E. The human Cdc14 phosphatases interact with and dephosphorylate the tumor suppressor protein p53. J. Biol. Chem. 2000, 275, 2410–2414. [Google Scholar] [CrossRef] [Green Version]
  59. Dietachmayr, M.; Rathakrishnan, A.; Karpiuk, O.; von Zweydorf, F.; Engleitner, T.; Fernandez-Saiz, V.; Schenk, P.; Ueffing, M.; Rad, R.; Eilers, M.; et al. Antagonistic activities of CDC14B and CDK1 on USP9X regulate WT1-dependent mitotic transcription and survival. Nat. Commun. 2020, 11, 1268. [Google Scholar] [CrossRef] [Green Version]
  60. Moussa, H.F.; Bsteh, D.; Yelagandula, R.; Pribitzer, C.; Stecher, K.; Bartalska, K.; Michetti, L.; Wang, J.; Zepeda-Martinez, J.A.; Elling, U.; et al. Canonical PRC1 controls sequence-independent propagation of Polycomb-mediated gene silencing. Nat. Commun. 2019, 10, 1931. [Google Scholar] [CrossRef]
  61. Grossniklaus, U.; Paro, R. Transcriptional silencing by polycomb-group proteins. Cold Spring Harb. Perspect. Biol. 2014, 6, a019331. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Sobocinska, J.; Molenda, S.; Machnik, M.; Oleksiewicz, U. KRAB-ZFP Transcriptional Regulators Acting as Oncogenes and Tumor Suppressors: An Overview. Int. J. Mol. Sci. 2021, 22, 2212. [Google Scholar] [CrossRef] [PubMed]
  63. Su, X.; Lu, X.; Bazai, S.K.; Comperat, E.; Mouawad, R.; Yao, H.; Roupret, M.; Spano, J.P.; Khayat, D.; Davidson, I.; et al. Comprehensive integrative profiling of upper tract urothelial carcinomas. Genome Biol. 2021, 22, 7. [Google Scholar] [CrossRef] [PubMed]
  64. Loras, A.; Suarez-Cabrera, C.; Martinez-Bisbal, M.C.; Quintas, G.; Paramio, J.M.; Martinez-Manez, R.; Gil, S.; Ruiz-Cerda, J.L. Integrative Metabolomic and Transcriptomic Analysis for the Study of Bladder Cancer. Cancers 2019, 11, 686. [Google Scholar] [CrossRef] [Green Version]
  65. Hurst, C.D.; Alder, O.; Platt, F.M.; Droop, A.; Stead, L.F.; Burns, J.E.; Burghel, G.J.; Jain, S.; Klimczak, L.J.; Lindsay, H.; et al. Genomic Subtypes of Non-invasive Bladder Cancer with Distinct Metabolic Profile and Female Gender Bias in KDM6A Mutation Frequency. Cancer Cell 2017, 32, 701–715.e7. [Google Scholar] [CrossRef]
  66. Biton, A.; Bernard-Pierrot, I.; Lou, Y.; Krucker, C.; Chapeaublanc, E.; Rubio-Perez, C.; Lopez-Bigas, N.; Kamoun, A.; Neuzillet, Y.; Gestraud, P.; et al. Independent component analysis uncovers the landscape of the bladder tumor transcriptome and reveals insights into luminal and basal subtypes. Cell Rep. 2014, 9, 1235–1245. [Google Scholar] [CrossRef] [Green Version]
  67. Riester, M.; Taylor, J.M.; Feifer, A.; Koppie, T.; Rosenberg, J.E.; Downey, R.J.; Bochner, B.H.; Michor, F. Combination of a novel gene expression signature with a clinical nomogram improves the prediction of survival in high-risk bladder cancer. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 2012, 18, 1323–1333. [Google Scholar] [CrossRef] [Green Version]
  68. Therkildsen, C.; Eriksson, P.; Hoglund, M.; Jonsson, M.; Sjodahl, G.; Nilbert, M.; Liedberg, F. Molecular subtype classification of urothelial carcinoma in Lynch syndrome. Mol. Oncol. 2018, 12, 1286–1295. [Google Scholar] [CrossRef] [Green Version]
  69. Sjodahl, G.; Eriksson, P.; Patschan, O.; Marzouka, N.A.; Jakobsson, L.; Bernardo, C.; Lovgren, K.; Chebil, G.; Zwarthoff, E.; Liedberg, F.; et al. Molecular changes during progression from nonmuscle invasive to advanced urothelial carcinoma. Int. J. Cancer 2020, 146, 2636–2647. [Google Scholar] [CrossRef] [Green Version]
  70. Sjodahl, G.; Eriksson, P.; Liedberg, F.; Hoglund, M. Molecular classification of urothelial carcinoma: Global mRNA classification versus tumour-cell phenotype classification. J. Pathol. 2017, 242, 113–125. [Google Scholar] [CrossRef]
  71. Choi, W.; Porten, S.; Kim, S.; Willis, D.; Plimack, E.R.; Hoffman-Censits, J.; Roth, B.; Cheng, T.; Tran, M.; Lee, I.L.; et al. Identification of distinct basal and luminal subtypes of muscle-invasive bladder cancer with different sensitivities to frontline chemotherapy. Cancer Cell 2014, 25, 152–165. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. McConkey, D.J.; Choi, W.; Shen, Y.; Lee, I.L.; Porten, S.; Matin, S.F.; Kamat, A.M.; Corn, P.; Millikan, R.E.; Dinney, C.; et al. A Prognostic Gene Expression Signature in the Molecular Classification of Chemotherapy-naive Urothelial Cancer is Predictive of Clinical Outcomes from Neoadjuvant Chemotherapy: A Phase 2 Trial of Dose-dense Methotrexate, Vinblastine, Doxorubicin, and Cisplatin with Bevacizumab in Urothelial Cancer. Eur. Urol. 2016, 69, 855–862. [Google Scholar] [PubMed] [Green Version]
  73. Kim, W.J.; Kim, E.J.; Kim, S.K.; Kim, Y.J.; Ha, Y.S.; Jeong, P.; Kim, M.J.; Yun, S.J.; Lee, K.M.; Moon, S.K.; et al. Predictive value of progression-related gene classifier in primary non-muscle invasive bladder cancer. Mol. Cancer 2010, 9, 3. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  74. Carvalho, B.S.; Irizarry, R.A. A framework for oligonucleotide microarray preprocessing. Bioinformatics 2010, 26, 2363–2367. [Google Scholar] [CrossRef] [PubMed]
  75. Durinck, S.; Spellman, P.T.; Birney, E.; Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 2009, 4, 1184–1191. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Johnson, W.E.; Li, C.; Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 2007, 8, 118–127. [Google Scholar] [CrossRef]
  77. Weishaupt, H.; Johansson, P.; Sundstrom, A.; Lubovac-Pilav, Z.; Olsson, B.; Nelander, S.; Swartling, F.J. Batch-normalization of cerebellar and medulloblastoma gene expression datasets utilizing empirically defined negative control genes. Bioinformatics 2019, 35, 3357–3364. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Study design and workflow for the analysis of the selected primary BLCA transcriptomes.
Figure 1. Study design and workflow for the analysis of the selected primary BLCA transcriptomes.
Cancers 14 02542 g001
Figure 2. Excerpt of the pathways showing a monotonal increase or decline in their activation scores with higher stage. Pathway activation scores were z-scaled across samples for visualization. Pathways are colored based on their database of origin. The top side of the heatmap presents dataset and clinical information. Samples (columns) are ordered based on the stage variable; from left to right: non-malignant adjacent urothelium (NAU), Ta, T1, T2, T3, and T4. Red lines indicate boundaries between adjacent stages.
Figure 2. Excerpt of the pathways showing a monotonal increase or decline in their activation scores with higher stage. Pathway activation scores were z-scaled across samples for visualization. Pathways are colored based on their database of origin. The top side of the heatmap presents dataset and clinical information. Samples (columns) are ordered based on the stage variable; from left to right: non-malignant adjacent urothelium (NAU), Ta, T1, T2, T3, and T4. Red lines indicate boundaries between adjacent stages.
Cancers 14 02542 g002
Figure 3. Biological process analysis of the largest (in size) co-expressed communities identified in each BLCA stage network: (A) Coherent communities identified and characterized across the non-malignant adjacent urothelium (NAU) and disease stages. The presence of a community is indicated by the + symbol. Numbers in parentheses show the fraction of genes with GO Biological Process annotation relevant to the community, with respect to the total number of genes found to be co-expressed in the community. (B) Bar plots of the most significantly enriched biological processes per community, depicting the number of co-expressed genes for each. (C) Hub genes identified across the studied conditions based on the betweenness centrality scores (y-axis).
Figure 3. Biological process analysis of the largest (in size) co-expressed communities identified in each BLCA stage network: (A) Coherent communities identified and characterized across the non-malignant adjacent urothelium (NAU) and disease stages. The presence of a community is indicated by the + symbol. Numbers in parentheses show the fraction of genes with GO Biological Process annotation relevant to the community, with respect to the total number of genes found to be co-expressed in the community. (B) Bar plots of the most significantly enriched biological processes per community, depicting the number of co-expressed genes for each. (C) Hub genes identified across the studied conditions based on the betweenness centrality scores (y-axis).
Cancers 14 02542 g003
Figure 4. Differential expression analysis between non-malignant adjacent urothelium (NAU) and BLCA stages: (A) Volcano plots of the five stages’ comparisons to NAU, with the color red indicating the 157 genes showing a monotonal trend of expression across stages. (B) Heatmap of the absolute fold changes of the 157 monotonal genes, which were continuously either up- (yellow) or downregulated (blue), in the comparisons between disease stages and NAU. (C) Top 15 pathways of the 157 monotonal genes, sorted by their GeneCards enrichment score.
Figure 4. Differential expression analysis between non-malignant adjacent urothelium (NAU) and BLCA stages: (A) Volcano plots of the five stages’ comparisons to NAU, with the color red indicating the 157 genes showing a monotonal trend of expression across stages. (B) Heatmap of the absolute fold changes of the 157 monotonal genes, which were continuously either up- (yellow) or downregulated (blue), in the comparisons between disease stages and NAU. (C) Top 15 pathways of the 157 monotonal genes, sorted by their GeneCards enrichment score.
Cancers 14 02542 g004
Figure 5. Validation of key findings in the TCGA-2017 and the IMvigor210 cohorts: (A) Forest plots showing hazard ratios (HRs) for the 8 monotonal genes with univariate prognostic value in both the discovery and TCGA validation datasets. (B) Multivariate analysis of stage and the 8-gene signature scores in the discovery and TCGA validation sets. (C) 5-year survival analysis between patients with high and low 8-gene signature scores, in the discovery and the TCGA data. (D) Data from the IMvigor210 trial illustrating AIF1 expression across responses to atezolizumab in immunotherapy groups.
Figure 5. Validation of key findings in the TCGA-2017 and the IMvigor210 cohorts: (A) Forest plots showing hazard ratios (HRs) for the 8 monotonal genes with univariate prognostic value in both the discovery and TCGA validation datasets. (B) Multivariate analysis of stage and the 8-gene signature scores in the discovery and TCGA validation sets. (C) 5-year survival analysis between patients with high and low 8-gene signature scores, in the discovery and the TCGA data. (D) Data from the IMvigor210 trial illustrating AIF1 expression across responses to atezolizumab in immunotherapy groups.
Cancers 14 02542 g005
Table 1. Clinical data of the cohorts used.
Table 1. Clinical data of the cohorts used.
KerrypnxDiscovery SetTCGA-BLCA-2017IMvigor210
No. of patients1135188132
Median age (range)67 (24–95)69 (34–90)
Sex
Female135 (11.9%)45 (23.9%)29 (22.0%)
Male459 (40.5%)143 (76.1%)103 (78.0%)
No info541 (47.6%)0 (0.0%)0 (0.0%)
Stage
NAU81 (7.1%)0 (0.0%)
Ta229 (20.2%)0 (0.0%)
T1262 (23.1%)0 (0.0%)
T2371 (32.7%)70 (37.3%)
T3146 (12.9%)90 (47.9%)
T446 (4.0%)28 (14.8%)
Grade
G113 (1.1%)0 (0.0%)
G2164 (14.5%)0 (0.0%)
G3372 (32.8%)188 (100%)
Gx13 (1.1%)0 (0.0%)
No info573 (50.5%)0 (0.0%)
Response to atezolizumab
Complete 11 (8.3%)
Partial 20 (15.2%)
Stable disease 25 (18.9%)
Progressed disease 76 (57.6%)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Stroggilos, R.; Frantzi, M.; Zoidakis, J.; Mokou, M.; Moulavasilis, N.; Mavrogeorgis, E.; Melidi, A.; Makridakis, M.; Stravodimos, K.; Roubelakis, M.G.; et al. Gene Expression Monotonicity across Bladder Cancer Stages Informs on the Molecular Pathogenesis and Identifies a Prognostic Eight-Gene Signature. Cancers 2022, 14, 2542. https://doi.org/10.3390/cancers14102542

AMA Style

Stroggilos R, Frantzi M, Zoidakis J, Mokou M, Moulavasilis N, Mavrogeorgis E, Melidi A, Makridakis M, Stravodimos K, Roubelakis MG, et al. Gene Expression Monotonicity across Bladder Cancer Stages Informs on the Molecular Pathogenesis and Identifies a Prognostic Eight-Gene Signature. Cancers. 2022; 14(10):2542. https://doi.org/10.3390/cancers14102542

Chicago/Turabian Style

Stroggilos, Rafael, Maria Frantzi, Jerome Zoidakis, Marika Mokou, Napoleon Moulavasilis, Emmanouil Mavrogeorgis, Anna Melidi, Manousos Makridakis, Konstantinos Stravodimos, Maria G. Roubelakis, and et al. 2022. "Gene Expression Monotonicity across Bladder Cancer Stages Informs on the Molecular Pathogenesis and Identifies a Prognostic Eight-Gene Signature" Cancers 14, no. 10: 2542. https://doi.org/10.3390/cancers14102542

APA Style

Stroggilos, R., Frantzi, M., Zoidakis, J., Mokou, M., Moulavasilis, N., Mavrogeorgis, E., Melidi, A., Makridakis, M., Stravodimos, K., Roubelakis, M. G., Mischak, H., & Vlahou, A. (2022). Gene Expression Monotonicity across Bladder Cancer Stages Informs on the Molecular Pathogenesis and Identifies a Prognostic Eight-Gene Signature. Cancers, 14(10), 2542. https://doi.org/10.3390/cancers14102542

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop