Next Article in Journal
An Eye on Kupffer Cells: Development, Phenotype and the Macrophage Niche
Next Article in Special Issue
An Epigenetic LINE-1-Based Mechanism in Cancer
Previous Article in Journal
Improving the Laboratory Diagnosis of M-like Variants Related to Alpha1-Antitrypsin Deficiency
Previous Article in Special Issue
Polyploidy and Myc Proto-Oncogenes Promote Stress Adaptation via Epigenetic Plasticity and Gene Regulatory Network Rewiring
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of Dormancy-Associated Transcriptional Networks Reveals a Shared Quiescence Signature in Lung and Colorectal Cancer

by
Adriano Cuccu
1,
Federica Francescangeli
1,
Maria Laura De Angelis
1,
Alessandro Bruselles
1,
Alessandro Giuliani
2 and
Ann Zeuner
1,*
1
Department of Oncology and Molecular Medicine, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
2
Environment and Health Department, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(17), 9869; https://doi.org/10.3390/ijms23179869
Submission received: 28 June 2022 / Revised: 23 August 2022 / Accepted: 25 August 2022 / Published: 30 August 2022
(This article belongs to the Special Issue Advances in Genome Regulation in Cancer)

Abstract

:
Quiescent cancer cells (QCCs) are a common feature of solid tumors, representing a major obstacle to the long-term success of cancer therapies. We isolated QCCs ex vivo from non-small cell lung cancer (NSCLC) and colorectal cancer (CRC) xenografts with a label-retaining strategy and compared QCCs gene expression profiles to identify a shared “quiescence signature”. Principal Component Analysis (PCA) revealed a specific component neatly discriminating quiescent and replicative phenotypes in NSCLC and CRC. The discriminating component showed significant overlapping, with 688 genes in common including ZEB2, a master regulator of stem cell plasticity and epithelial-to-mesenchymal transition (EMT). Gene set enrichment analysis showed that QCCs of both NSCLC and CRC had an increased expression of factors related to stemness/self renewal, EMT, TGF-β, morphogenesis, cell adhesion and chemotaxis, whereas proliferating cells overexpressed Myc targets and factors involved in RNA metabolism. Eventually, we analyzed in depth by means of a complex network approach, both the ‘morphogenesis module’ and the subset of differentially expressed genes shared by NCSLC and CRC. This allowed us to recognize different gene regulation network wiring for quiescent and proliferating cells and to underpin few genes central for network integration that may represent new therapeutic vulnerabilities. Altogether, our results highlight common regulatory pathways in QCCs of lung and colorectal tumors that may be the target of future therapeutic interventions.

1. Introduction

In mammalians, most tissues and organs contain small populations of quiescent stem cells that serve as a reservoir during cellular turnover and upon tissue damage [1]. Likewise, many tumors contain subpopulations of quiescent/slow cycling cells involved in clinically relevant steps of tumor development such as chemo-resistance, metastatic dissemination, dormancy and relapse [2,3]. In the last decade, there has been a progressive understanding of QCCs importance for cancer therapy. Consequently, the number of studies focused on QCCs and tumor dormancy increased exponentially, highlighting the complexity of the quiescent state. In fact, quiescence is a heterogeneous state, both in normal tissues and in cancer [4,5]. First, different levels of quiescence depth can be found in normal and possibly in neoplastic cells [6,7,8,9]. Second, quiescent cancer cells are found in different moments of the tumor’s lifetime such as before, during and after treatments; during latency; or in actively growing cancers. Third, QCCs can reside in different intra-tumor or extra-tumor locations, thus being subject to a different spectrum of microenvironmental signals [3]. The peculiar behavior of ‘regime change’ from quiescent to proliferative state of QCCs stimulated biophysical studies along the line of non-linear dynamics approach [10,11] that in turn opened new general perspectives on cancer development [12]. New insights into the mechanisms driving cancer cell quiescence have led to the development of new experimental models of cancer cell dormancy in vitro and in vivo [4] and in the identification of new dormancy-specific therapies that are currently under clinical evaluation [13]. In fact, therapies targeting QCCs would be useful in several clinical circumstances, i.e., to overcome chemo/radioresistance and to prevent metastatic dissemination, tumor reawakening and relapse. Dormancy-specific therapeutic strategies include drugs that target a plethora of cellular processes including autophagy, stress signaling, nuclear receptors, adhesion signaling, metabolism and epigenetic regulation [14]. Clearly, the possibility to identify effective therapies directed against QCCs depends on the identification of QCCs-specific vulnerabilities. The identification and isolation of QCCs has been previously performed through different methods, such as the histone H2B-GFP approach or through labelling techniques, including PKH26, bromodeoxyuridine or carboxyfluorescein succinimidyl ester. PKH26 is a dye that incorporates into the membrane of cells and is equally distributed to daughter cells when they divide [15]. PKH26 labelling was used to isolate QCCs from breast cancer [16], colorectal cancer [17,18], glioblastoma [19,20] and osteosarcoma [21]. QCCs populations obtained through PKH26 labelling contain tumor cells negative for the proliferation marker Ki67 [22] and are enriched in cells with enhanced cancer initiation potential, expression of “stemness” genes and resistance to chemotherapy [16,18,20,21,22]. Starting from the hypothesis that QCCs relative to different tumors may share a core of epigenetic regulators responsible for the specification of the quiescent/slow cycling state, we compared QCCs gene expression profiles of quiescent/slow cycling cells with the profiles of rapidly proliferating cells separately in NSCLC and CRC. Then, we performed a cross-analysis of genes and pathways differentially expressed in QCCs of NSCLC and CRC, identifying a signature of quiescence-associated genes and pathways common to both tumors. On a more theoretical perspective, we analyzed in detail the morphogenesis module in quiescent and proliferating cells of both tumors, pointing to relevant differences in gene regulatory network structure. The network approach both indicated ‘high centrality’ genes that differentiate quiescent and proliferative status and a wiring architecture of gene expression correlation for quiescent cells state more fit to a metastable state like dormancy with respect to actively proliferating cells [23]. These results show for the first time the existence of common traits that characterize QCCs of lung and colorectal cancer, providing new insights into the mechanisms responsible for quiescence regulation in solid tumors.

2. Results

2.1. Data Generation and Analysis

NSCLC and CRC specimens were obtained from patients undergoing surgical resection and used to generate multicellular spheroid cultures that were validated by genomic and proteomic analyses, marker expression and ability to reproduce phenocopies of the parental tumors when inoculated in immunocompromised mice [3,22,24,25,26]. NSCLC and CRC spheroids were stained with the lipophilic membrane dye PKH26, which is rapidly lost by actively proliferating cells and retained by slowly proliferating/quiescent cells. Bulk populations of PKH26-stained spheroid cells were inoculated subcutaneously into immunocompromised mice, and tumors were allowed to grow for three or four weeks (respectively for CRC and NSCLC), until the percentage of PKH26-positive cells reached approximately 5% of total tumor cells. At that time, tumors were collected and PKH26+ were separated from PKH26 cells by fluorescence-activated cell sorting (FACS) upon additional labeling with anti-human EpCAM antibody, in order to avoid contamination with non-tumor cells. PKH26-positive and -negative fractions were lysed immediately after sorting and used for gene expression analysis as described in the Methods Section. Having obtained profiles of differentially expressed genes in colon and lung QCCs as compared to their actively proliferating counterpart, we asked whether we could identify a sub-set of genes, common to the two tumors, discriminating quiescent and replicative cell states. To do this, we adopted a largely unsupervised, data-driven strategy. This was an almost essential choice to escape the ‘curse of dimensionality’ with the consequent risk of chance correlation stemming from the very high number of gene products observed in both colon and lung experiments. Thus, we adopted a strategy based on Principal Component Analysis (PCA) as applied to the data set having different samples (gene expression profiles) as variables and gene products as statistical units. Only a posteriori the loading space generated by PCA was checked for the presence of one component exactly matching the PKH26+/PKH26 in terms of loading values [27], thus collapsing the number of inferential tests to three-four. After having found a component that discriminates with no error and in completely unsupervised way the two groups, we stepped into the analysis of component scores identifying the genes with higher absolute scores, thus being ‘most representative’ of the discriminating component. Out of this selection, we focused on ‘high score’ elements shared by the two colon and lung tissues. These shared genes correspond to the most relevant genes discriminating PKH26+/PKH26 conditions across different tissue contexts and were analyzed in terms of putative pathways they pertain. In parallel to the unsupervised PCA approach, we carried out a differential expression analysis of PKH26+/PKH26 datasets with the supervised limma (Linear Models for MicroArray Data) approach in order to get a consistent extraction of relevant genes. Limma provides a measure of the differential expression between the two quiescent and replicative states for each gene, the log2 fold change (logFC). Limma trend is indeed a standard and powerful approach to detect significantly differentially expressed genes, developed to work with both Microarray and RNA-seq data [28]. Furthermore, focusing on the first 6000 genes and the last 6000 of the ranking built by sorting them in descending order by logFC in the colon and lung experiments, we performed Gene Set Enrichment Analysis (GSEA) and reported the significantly enriched pathways shared by the two tissues. These common pathways correspond to the most relevant ones characterizing the two PKH26+ and PKH26 conditions regardless of the type of tissue. The consistency between unsupervised (deriving from PCA) and supervised (limma) gene selection demonstrated the consistency of the results. Finally, we applied complex network analysis of differentially expressed genes to the morphogenesis-related gene subset, which was differentially expressed by lung and colorectal QCCs.

2.2. PCA and Limma Trend Analysis of Lung Cancer Quiescent and Proliferating Cells

Starting from the assumption that quiescence dynamics are different in cancer models in vitro and in vivo, and that in vivo tumor models likely reproduce more faithfully the features of QCCs naturally present in human cancer, we used molecularly annotated 3D cultures of primary NSCLC cells [24,29] to generate subcutaneous tumor xenografts in immunocompromised mice. Cells were stained with PKH26 before inoculation, and PKH26+/PKH26 tumor cells were collected by flow cytometry after three weeks of tumor growth. Gene expression profiles were generated by RNA-sequencing and the results were analyzed in parallel by PCA and limma analysis. PCA on log-transformed original expression data gave rise to a three-component solution explaining around 67% of total variance. Each component conveys an immediate meaning in terms of loading pattern (Table 1A,B):
Having assessed the presence of a specific (PC3) component discriminating, in a totally unsupervised way, the two PKH26 and PKH26+ phenotypes (Figure 1A), we went in depth into the most discriminating genes by means of a limma trend analysis [28]. The analysis was performed on log counts per million and gave rise to the results reported in Figure 1B: the genes sorted by the log fold changes in decreasing order are reported in Table S1. The first and the last 6000 genes were then compared with the first and the last 6000 genes of the ranking built on the scores on PC3 of the PCA in terms of their score on PC3 (top 6000 genes = genes characterizing PKH26+, last 6000 genes = genes characterizing PKH26).
The limma result shares the 75% of the most up-regulated genes and the 72% of the most down-regulated genes with PCA (Figure 1C,D). This is a remarkable finding because it highlights a quite high consistency between the standard limma and unsupervised PCA results, giving us greater confidence on the reliability of the limma output.
Finally, we asked whether the quiescent cell population was characterized by an increased expression of stemness-related factors. Indeed, limma analysis of differentially expressed genes indicated an increased expression of four genes related to cancer stemness: the pluripotency factor KLF4, the PLAUR gene encoding for the urokinase plasminogen activator surface receptor (uPAR), CD44 and CD166 (Table 2). These findings indicate that, in lung cancer, there is at least a partial overlapping between quiescent label-retaining cells and cancer stem cells (CSCs).

2.3. PCA and Limma Trend Analysis of Colorectal Cancer Quiescent and Proliferating Cells

CRC cells were isolated from tumor xenografts generated with PKH26-stained multicellular spheroid cultures and gene expression profiles were obtained as described in our previous studies [22]. PCA on expression data gave rise to a four-component solution explaining around 99.5% of total variance. At odds with lung case, due to the lack of relevant information about line of origin or other relevant phenotypic features, we can only assign a meaning to PC1 and PC4 (Table 3A,B). PC1 corresponds (consistently with lung) to the ‘tissue attractor effect’, while PC4 exactly discriminates the two quiescent and proliferative phenotypes.
In this case, thanks to the more homogeneous biological material, the ‘tissue attractor’ component (PC1) is much more relevant than in the lung case, accounting for 98.7% of variance. All the samples have loading of the same sign on this component (the fact the sign is negative only depends on computational details, given that the directionality of the eigenvector is arbitrary) so it can be considered as a size component. The discrimination between PKH26+ and PKH26 phenotypes stems from the fourth component (PC4) in which the two groups have opposite sign loadings (Table 3B, Figure 2A).
The comparison between PCA and limma results for the colon case is reported in Figure 2.
Having assessed the presence of a specific (PC4) component discriminating the two PKH26 and PKH26+ phenotypes (Figure 2A), we focused on the most discriminating genes by means of a limma trend analysis [28], which gave rise to the results reported in Figure 2B: the genes sorted by the log fold changes in decreasing order are reported in Table S2. The limma result shares 69% of the most up-regulated genes and 58% of the most down-regulated genes with PCA (Figure 2C,D). This finding is consistent with the results found in NSCLC.
Finally, we asked whether also in the case of colorectal cancer quiescent cells were characterized by an increased expression of stemness-related factors. Among the top genes emerging from limma analysis, we found four genes particularly related to cancer stemness and CRC CSCs: besides the pluripotency factor KLF4, we found three genes were specifically related to colorectal cancer stemness and self renewal, i.e., AXIN2, LGR5 and BMI1 (Table 4). These findings indicate that in colorectal cancer label-retaining quiescent cancer cells are enriched in CSCs.

2.4. Comparison between Genes Involved in the Quiescent State in Lung and Colon Cancer

In order to get a full picture of the quiescent/proliferative discrimination, it is important to focus on the discriminating genes shared by the two tissues (Figure 3).
The differential expression analysis of genes shared by lung and colon QCCs showed a partial overlapping of the two transcriptional programs, with approximately 15.8% of genes upregulated in lung cancer QCCs and 21.3% of genes upregulated in CRC QCCs, for a total of 688 genes shared between QCCs of both tumors (Figure 3A). Genes downregulated in QCCs of both tumors (599 in total) include approximately 14% of quiescence-associated genes for lung cancer and 17.1% for CRC (Figure 3B). KEGG pathway selection was applied to the 688 genes representing the “core transcriptional program” of QCCs. The results are shown in Figure 3C and highlight and increased involvement of pathways implicated in angiogenesis, EGFR and RAS signaling, integrins, TGF-β and WNT signaling, suggesting a central role of these pathways in solid tumor quiescence.
Pathway Analysis was performed through the GSEA software (version 4.2.2, UC San Diego and the Broad Institute, San Diego, CA, USA) according to the gene-pathway matching algorithm reported in [32]. The application of GSEA to the limma-ranked results gave rise to the significant pathways (nominal p-value < 0.05) shared by the two lung and colon tissues shown in Table 5. The shared significantly enriched pathways between the two tissues are shown in Table S3 for the quiescent state and in Table S4 for the replicative one. The set of all significantly enriched pathways in the two tissues is reported in Table S5 for the lung system and in Table S6 for the colon one.
Table 5 reports the pathways shared by the two lung and colon systems that differentiate quiescent and replicative phenotypes. These shared pathways can be thus considered as proxies of the ‘tissue independent’ possible mechanisms governing the quiescent/replicative transition.
Figure 4 reports the enrichment plots of some interesting common pathways significantly enriched in the quiescent condition between the lung and colon tissues, and the Venn diagrams of the enriched pathways in the two PKH26+ and PKH26 states.
Overall, the lung system shares almost 30% of the significantly enriched pathways in either state with the colon system. This finding leads us to state that the common trait of the quiescent/replicative transition between the two tissues is not negligible.

2.5. Network Analysis in Quiescent and Proliferating Cancer Cells (with a Specific Emphasis on Morphogenesis Module)

In order to get some hints on the gene expression regulation network constituted by the ‘transcriptional core’ discriminating quiescent and proliferative states in both tissues, we performed a co-expression network analysis.
The existence of a strong correlation among differentially expressed genes shared by the two tissues is the image in light of a coordinated gene regulation network in charge of the transition between the two states. The gene expression network has genes as nodes: an edge between two nodes is drawn (and consequently the two genes are considered to interact) if their correlation coefficient exceeds the threshold of r = |0.7|.
Figure 5A shows the network built on the 688 genes up-regulated in the PKH26+ state shared by the lung and colon systems, considering the PKH26+ samples of both tumors. Table S7 reports relevant information on the interactions between the nodes of this network. Table S8 reports the betweenness centralities and the clustering coefficients of the nodes and these measures are displayed in Figure 5B. Table S9 shows the centrality scores for the genes in the TGF-β pathways. Tables S8 and S9 and Figure 5B refer to the network built considering the usual threshold of Pearson r = |0.7|, while Table S7 and Figure 5A refer to a more stringent threshold r = |0.98|, which is useful to get a graphical picture of the network and, still more important, highlights the quasi-deterministic character of the gene regulation network underlying quiescent/proliferative transition.
In Figure 5A, is worth noting that ZEB2 occupies a central position in the same network domain as c-FOS, consistently with its role in stem cell regulation, plasticity and proliferation transition. Looking at the network, it is evident to note the extremely high density of edges, corresponding to a strictly coordinated behavior of the entire system. This density goes hand-in-hand with extremely elevated Clustering coefficient values (Clustering coefficient corresponds to the relative frequency of ‘complete triads’, which holds the condition: if gene A is correlated with gene B and gene B is correlated with gene C, then even gene A is directly correlated with gene C). While the Clustering coefficient has to do with the local network structure, betweenness deals with the long range signal transmission throughout the co-expression network. Betweenness of a node corresponds to the number of ‘shortest paths’ traversing it, in other words, a “high betweenness” gene corresponds to a gene involved in many signal transmission pathways across the network.
The usefulness of centrality indicators in gene co-expression networks have been reported in various application domains, such as cancer research [33], neuroscience [34] and protein science [35]. For instance, genes exhibiting high betweenness centrality scores have been proposed as candidate targets in different human and animal models [36].
The betweenness centrality encapsulates the property of node i as a bridging node in the network, corresponding to the number of shortest paths connecting any two nodes, j and k, which pass through node i. Nodes with large betweenness-centrality values are often called “high traffic” nodes [36].
Figure 5B projects the genes in the Clustering coefficient/betweenness space. In Figure 5B, the genes in the TGF-β pathways are coded in blue, and they occupy a specific location in the graph where they are easily recognizable (and not scattered around) consistently, pertaining to a specific pathway. The most striking peculiarity of these group of genes is the singularity of JAK1 and LDLRAD4, which are by far the ‘highest betweenness’ genes (Figure 5B), suggesting a possible role as therapeutic targets.
Among the significantly enriched pathways in the PKH26+ state shared by the two lung and colon tissues, we paid particular attention on those related to the process of morphogenesis (Table 6). This focus comes from both the mounting evidence of carcinogenesis as ‘embryo development gone awry’ [37] and the crucial role played by cell morphological changes in cancer and metastatic phenomena [38,39]. In particular, quiescent cancer cells from several tumors have been shown to express increased levels of stemness-associated and embryonic factors [16,22,40,41]. Moreover, quiescent/chemoresistant cells have been shown to adopt a transcriptional program resembling that of embryonic diapause, characterized by c-Myc inactivation and expression of genes typical of the epiblast stage [42,43]. According to these considerations, we focused on the analysis of a morphogenesis module, which was common to QCCs of lung and colorectal cancer.
Going more in detail, for the pathways reported in Table 6, we focused on the genes differentially expressed for quiescent/proliferative states in both lung and colon tissues. For these genes, we built a dataset that reports their expression levels in both lung and colon samples (Table S10), one that reports the expression levels in both tissues for the PKH26+ state only (Table S11) and one that reports the expression levels in both tissues only for the PKH26 state (Table S12).
To build the networks, for each dataset and for all pairs of genes in it, the between-genes Pearson correlation coefficient was calculated to define the strength of the gene-gene pairwise connections. We concentrated on gene correlation values, and we adopted the largely accepted r = |0.7| as minimal threshold to consider two genes as correlated, thus establishing an edge between the two genes (nodes) in the resulting interaction network. The characteristics of the resulting networks are summarized in Table S13. Networks were visualized and analyzed with Cytoscape software (version 3.9.1., Institute for System Biology, Seattle, WA, USA)
Table S13 tells us something interesting: the networks built from the dataset specific for PKH26+ (Table S11) or for PKH26 (Table S12) have many more edges than the one built on all samples (Table S10). This suggests that the correlations are not biased by the differential expression between the two PKH26+ and PKH26 states (range restriction effect [44]), but refer to constitutive between-genes correlation (intra-class correlations outnumbering entire data set correlations) and thus can be considered as proxies of actual regulatory co-expression circuits.
To get a graphical picture of the networks, we adopted and extremely elevated threshold (r > |0.98|) for two nodes (genes) to be considered as connected. This near to unity threshold indicates the ‘obligated’ relations between gene couples, thus highlighting the ‘hard wired’ modules in morphogenesis. Figure 6B–D reports the obtained graphs. It is immediate to note the presence of a ‘giant connected component’ collecting the majority of obligated contacts in both PKH26+ (B) and PKH26 (C), while the all-samples network is much less dense (D). This giant component corresponds to the ‘core’ of the morphogenesis network.
Figure 6A highlights that the high-betweenness genes (i.e., those more involved in long-range regulation and communication between different network domains) are different in the two cases. The fact that different genes play the role of ‘strong’ connectors in the two gene expression network points to a rewiring of the network going from PKH26+ and PKH26. This rewiring implies a local ‘role exchange’ of single genes that keeps invariant the average betweenness of the two networks (Table S17). In this case we considered as connection the usual r > |0.7| threshold.
The Clustering coefficient (Table S18) has to do with the edge density in the neighborhood of a node, and to be more specific, it corresponds to the proportion of ‘complete triplets’. That is to say the ratio between the number of times in which holds the rule ‘IF an edge exists between node(i) and node(j) AND an edge exists between node(j) and node(k) THEN must exist an edge between node(i) and node(k)’ and the total number of triplets connected by at least two edges. The near to unity mean values of clustering centrality have to do with very high density of edges of the co-expression networks (that in turn derives from the need of a strict coordination of morphogenesis processes), again the higher Clustering coefficient of PKH26 has to do with the higher number of edges. Betweenness analysis (Figure 6A) indicates possible targets specific for quiescent cells (it is worth noting the neat discrimination between elevated centrality nodes relative to PKH26+ and PKH26 conditions). As a matter of fact, high centrality corresponds to a major role of the node in signal spreading across the network and thus is a proxy for the character of a gene as a potential target. Passing to the deterministic (r > |0.98|) core of morphogenesis module (Figure 6B–E), Figure 6E suggests that 49 genes of the giant components make up an irreplaceable module of deterministic behavior, while 40 and 61 genes characterize the PKH26+ and PKH26 classes, respectively.
Looking at the pathways corresponding to the genes composing the different classes (Table 7), it emerges that the distribution of the pathways seems not to vary across the different classes. This means that the genes across these classes participate with interchangeable roles to the ‘deterministic’ core of the network. This finding is confirmed by the correlation matrix (Table 8) for classes reported in Table 7. Therefore the involved genes have superimposable biological functions and the role played by a gene in one class can be carried out by a different one in the other (Figure 5A, Table 7 and Table 8).
All in all, morphogenesis appears as a highly structured module, constituted by genes strongly interacting among them in a quasi-deterministic way. This is consistent with the presence of strongly interconnected gene regulation networks supporting the activation of atavistic (unicellular-like) development programs in [38,39].

3. Discussion

Understanding quiescence-associated transcriptional programs is essential to devise more effective strategies for cancer treatment, as QCCs are implicated in tumor chemoresistance, dissemination and dormancy/reawakening [3]. Previous efforts were made to generate a quiescence-associated signature through the cross-analysis of transcriptomes in normal quiescent stem cells, such as hematopoietic stem cells, muscle stem cells and hair follicle stem cells [45]. By contrast, cancer quiescence is much less understood and the possibility to identify shared quiescence-associated targets among different tumors is largely unexplored. We compared the gene expression profiles of QCCs isolated from NSCLC and CRC tumor xenografts, thus identifying a core quiescence-associated transcriptional program common to both tumors. From a methodological point of view, QCCs isolation performed in this study was characterized by two distinctive features. First, QCCs were isolated ex vivo from tumor xenografts, thus reproducing the features of QCCs present in human tumors with increased faithfulness as compared to in vitro cultured cells. In fact, tumor xenografts were generated by inoculating primary tumor spheroids (which have been reported to be enriched in CSCs and to retain several properties of parental tumors [24,46]) stained with PKH26. Label-retaining/PKH26+ cells were then isolated from established xenografts, thus providing the possibility to identify cells that were quiescent/slow cycling from the initial stages of tumor development (“historically quiescent cells”) and not just in a given moment, as occurs instead with the H2B-GFP system or with cell cycle-based reporters. Supporting the soundness of this experimental method, PKH26+ QCCs populations were previously isolated from CRC, lung cancer, breast cancer, glioblastoma and osteosarcoma [16,18,20,21,22,29,47]. Importantly, in all the studies cited above, the PKH26+ subpopulation was demonstrated to possess stemness traits. According with these findings, a specific search for stemness-related genes in the PKH26+ subset revealed an increased expression of the pluripotency factor KLF4 that was common to NSCLC and CRC. Interestingly, besides its known role in pluripotency determination [48,49], KLF4 has been reported to inhibit cancer cell proliferation both in NSCLC and in CRC [50,51,52]. Based on these observations, it may be speculated that KLF4 may play a double role in the maintenance of the QCCs pool by maintaining a stemness state and restraining proliferation at the same time. In addition to KLF4, lung QCCs were characterized by an increased expression of urokinase plasminogen activator surface receptor (uPAR), a component of the plasminogen activation system involved in tumorigenesis, metastasis, EMT and chemoresistance [53], and by the two stemness-associated surface markers CD44 and CD166. Similarly, colorectal QCCs showed an increased expression of factors involved in stemness and self-renewal such as AXIN2, LGR5 and BMI1. Among these, BMI1 was previously identified as the marker for a subpopulation of quiescent intestinal stem cells (ISCs) [54], whereas LGR5 can be found on both proliferating and slow cycling/chemoresistant ISCs [55]. Among the highest-ranking genes expressed in lung QCCs, we also found ZEB2 (Zinc finger E-box-binding homeobox 2), a transcriptional inhibitor involved in EMT, stem cell plasticity and chemoresistance [56,57]. As ZEB2 was previously implicated in quiescence/chemoresistance in CRC [22], its involvement also in NSCLC QCCs suggests a broad role of this factor in regulating cancer quiescence. Altogether, these conditions provided a robust experimental system for the investigation of quiescence-associated features that was validated at several levels during data analysis. In fact, the existence of a PCA component allowing for a perfect partition of the loading component space into PKH26+ and PKH26 areas for each tumor represents an intrinsic quality control of samples and suggests the presence of shared gene-regulatory networks supporting a highly organized autonomous dormancy attractor. Furthermore, as discussed in detail below, genes upregulated in NSCLC and CRC QCCs were coherent with a known quiescence-associated phenotype characterized by an elevated expression of pro-EMT and pro-metastatic factors. Specifically, the analysis of genes upregulated in both NSCLC and CRC quiescent cells showed a “core quiescence program” including approximately 11.5% of genes upregulated in lung cancer and 22% of genes upregulated in CRC. Among the 688 genes common to NSCLC and CRC QCCs, we found an enrichment in gene categories involved in angiogenesis, cancer stemness and TGF-β signaling and several factors involved in metastatization (Cathepsin A, ROCK2), hypoxia regulation (EGLN1) and in the ubiquitin proteasome system (Cul3, Cul4). GSEA analysis of gene expression profiles further revealed a common involvement of pathways related to EMT, TGF-β, chemotaxis and cell adhesion in NSCLC and CRC quiescent cells. Factors involved in the negative regulation of cell proliferation were also more represented in QCCs, providing an internal control of population quality. A large body of evidence has previously associated EMT with chemoresistance [58]. Moreover, several studies highlighted the existence of aggressive cancer cells characterized by combined properties of EMT, quiescence and stemness in breast and colorectal tumors [22,59,60,61]. Likewise, TGF-β pathway activation was shown to be implicated in cancer cell quiescence and metastasis [62], in line with its conspicuous presence in the “core quiescence program” shared by NSCLC and CRC. The presence of cell adhesion pathways in the shared QCCs signature is also in line with previous studies showing a central role of adhesion molecules in maintaining cancer dormancy [63] and in protecting dormant cells from chemotherapy through interaction with the perivascular niche [64]. By contrast, the role of chemotaxis in cancer quiescence is less understood. However, deregulated chemokines produced by tumor cells have been shown to drive chemotaxis of immune cells into the tumor and to manipulate them in order to promote cancer cell dissemination [65]. By contrast, rapidly proliferating cells exhibited a higher expression of pathways involved in RNA processing, in line with their increased RNA content and transcriptional activity. Importantly, Myc targets were found to be upregulated in rapidly proliferating cells, in line with Myc activation as a turning point for quiescent and proliferative states in embryos and cancer [42,66]. Proliferating cells also showed a common upregulation of mitosis-associated pathways, in accordance with their replicative phenotype. The dormancy attractor was analyzed in depth in the case of morphogenesis module, confirming the presence of a coherent gene expression interaction pattern. Network analysis of the morphogenesis module shown in Figure 5A highlights different ‘high centrality’ nodes (genes) for the PKH26+ and PKH26 states. Such genes are of utmost importance for therapeutic intervention, as they represent crucial nodes for intracellular signaling and provide a distinctive vulnerability profile for quiescent and proliferating cancer cells. Importantly, the key network nodes (high betweenness) for the quiescent subpopulation are represented by Annexin A1, a protein involved in metastasis, suppression of inflammation and of the immune response [67,68,69,70,71], by the antioxidant enzyme catalase, involved in the protection from reactive oxygen species and in chemotherapy resistance [72], and by Methaderin, a protein involved in multiple steps of tumor progression and metastasis [73]. It is worth noting these genes emerged by a pure data-driven strategy on the entire gene expression profiles.
Altogether, our results provide for the first time a core quiescence program shared between lung and colorectal tumors, highlighting the upregulation of pro-metastatic, EMT-like, stemness and chemoresistance pathways in QCCs. Moreover, network analysis of QCCs-associated morphogenesis module revealed new factors that represent crucial nodes in QCCs signaling networks, indicating potential targets for novel QCCs-directed therapeutic approaches.

4. Materials and Methods

4.1. Primary Non-Small Cell Lung Cancer and Colorectal Cancer Cells

NSCLC and CRC cells were isolated as previously described [15,18,19,20] from surgically resected tumor samples in accordance with the standards of the ethics committee on human experimentation of the Italian National Institute of Health (Istituto Superiore di Sanità, ISS authorization no. CE5ISS 09/282) and subsequently stored at the institutional Cancer Stem Cell Biobank. Tumor cells were cultured as multicellular spheroids in serum-free medium containing epidermal growth factor 20 ng/mL and basic fibroblast growth factor 10 ng/mL (PeproTech, Cranbury, NJ, USA) in nontreated polystyrene flasks (Thermo Fischer Scientific, Waltham, MA, USA). Such culture conditions were used to reduce cell adherence and to support the growth of lung and colorectal cells as stem cell-enriched multicellular spheroids. Regular thawing of early-passage cells was carried out to avoid the accumulation of culture-related changes. Short Tandem Repeats (STR) analysis was performed with the AmpFlSTRIdentifiler Plus Kit (Applied Biosystems, Waltham, MA, USA) and used to generate a unique STR profile for each primary CRC and NSCLC cell line in order to monitor cell purity over time and to confirm matching with the parental tumor. Multicellular spheroid cultures of NSCLC and CRC were routinely tested for their ability to produce carcinomas histologically identical to the parental tumors when injected into NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ (NSG) mice (Charles River, Calco, Italy). Cells were routinely tested for mycoplasma contamination with the PCR Mycoplasma Test Kit (PanReac AppliChem GmbH, Darmstadt, Germany).

4.2. Xenograft Generation

NSCLC and CRC spheroids were dissociated with TrypLE Express (Thermo Fisher Scientific, Waltham, MA, USA), stained for 2 min at 37 °C with PKH26 (Sigma, St. Louis, MO, USA), then washed extensively with PBS. PKH26 staining was assessed by flow cytometry and cells were used for subsequent experiments only when positivity was ≥98%. NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ (NSG) mice (The Jackson Laboratory, Sacramento, CA, USA) were injected subcutaneously with 5 × 105 cells derived from the multicellular spheroid culture of primary tumor specimens (as described above) resuspended in 100 μL 1:1 PBS/Matrigel (BD Biosciences, San Jose, CA, USA). After 3 weeks (for colorectal cancer xenografts) or 4 weeks (for lung cancer xenografts) of tumor growth, animals were euthanized, and tumors were extracted for subsequent sorting of PKH26+/PKH26 cells. The time for PKH26+ cell isolation was determined through previous experiments and corresponded to a PKH26+ positivity of approximately 5%. All animal procedures were performed according to the Italian National Animal Experimentation Guidelines (D.L.116/92) upon approval of the experimental protocol by the Italian Ministry of Health’s Animal Experimentation Committee (DM n. 292/2015 PR 23/4/2015).

4.3. Sample Preparation

Xenografts derived from PKH26-stained cells were cut into small pieces and subsequently digested with TrypLE express for 15 min at 37 °C with vigorous pipetting every 5 min. Freshly isolated cells were stained with anti-human EpCAM antibody and 10 μg/mL 7-aminoactinomycin D was used for dead cell exclusion. Cells were sorted with a FACSAria (Becton Dickinson, Franklin Lakes, NJ, USA) to obtain EpCAM+/PKH26+ and EpCAM+/PKH26 populations.

4.4. Transcriptional Profiling of NSCLC and CRC Quiescent and Proliferating Cells Derived from Human Xenografts

The transcriptome of PKH26+ and PKH26 colorectal cancer cells was obtained as described in [22]. For lung cancer cells, PKH26+ and PKH26 cells were isolated from NSCLC xenografts in triplicate samples and subsequently analyzed by RNA-sequencing on the Illumina HiSeq platform as 2 × 150 bp paired-end reads by the company Biodiversa (Rovereto, Italy). RNA libraries were prepared using the SMARTer Stranded Total RNA-Seq Kit v2 (Takara Bio, San Jose, CA, USA) technology that enables removal of ribosomal cDNA following cDNA synthesis (as opposed to direct removal of corresponding rRNA molecules prior to reverse transcription). An average of 98 Mb reads per library was obtained. Trimmomatic [74] was used to remove sequence adapters, discard low-quality reads and trim poor-quality bases. Transcript abundance estimation was performed by means of Salmon 1.3.0 [75], which accurately quantify transcripts without the need of previously aligning the reads, using NCBI hg19 RefSeq reference transcripts. The overall average mapping rate was 14.5%. We performed a series of quality controls in order to understand the reason of such a low mapping rate (we checked for the presence of rRNA, tried both a more aggressive quality trimming of the reads and ran Salmon with different parameters, tried switching reference to GRCh38 and mapping reads to all the genome with STAR aligner and calculated multiple metrics using Picard CollectRnaSeqMetrics and RSeQC), but none of these approaches incremented the mapping rate. Therefore, we infer that the low mapping rate was likely due to the small amount of starting material, as PKH26+ cells isolated from tumor xenografts after 3–4 weeks of growth are few. Nonetheless, the percentage of remaining reads were high enough to produce results that were reliable and very consistent among lung and colorectal samples.

4.5. Statistical and Bioinformatics Analysis

Gene expression profiles were analyzed by means of Principal Component Analysis so as to select a specific component that neatly discriminates quiescent and proliferating samples in the loading space. PCA was performed in R using the ‘prcomp’ function of the ‘stats’ library. The score distribution of the discriminating component was then used to select the genes most involved in the component (score > |3|). This ‘PCA-derived signature’ was then compared with the signature coming from an independent approach and the signature coming from a standard RNA-seq analysis tool (limma trend analysis) obtaining a very strong superposition. The genes coming from limma approach were in turn assigned to different putative pathways and functions by GSEA software. The morphogenesis module coming from GSEA was the basis of the complex network approach. Adopting a correlation threshold of Pearson r > |0.7| for the insertion of an edge between two nodes (genes) of the morphogenesis GRN, we generated the networks relative to the quiescent and proliferative states. These networks were analyzed by Cytoscape software in terms of their relative density (frequency of connections) and, on a gene-by-gene basis, for the centrality of different nodes adopting the betweenness centrality index. The identification of the ‘core’ of the GRNs was obtained by the setting of the correlation threshold to r > |0.98|.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms23179869/s1.

Author Contributions

A.C. performed data analysis and wrote the manuscript, F.F. and M.L.D.A. produced experimental results, A.B. performed data analysis, A.G. provided essential expertise and wrote the manuscript, A.Z. managed the project and wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Italian Association for Cancer Research (AIRC) Grant N. 20744 to A.Z.

Institutional Review Board Statement

The animal study was reviewed and approved by the Italian Ministry of Health’s Animal Experimentation Committee (DM n. 292/2015 PR 23/4/2015).

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Acknowledgments

We thank Rachele Rossi for manuscript editing and excellent technical assistance.

Conflicts of Interest

The authors declare no competing interests.

References

  1. van Velthoven, C.T.J.; Rando, T.A. Stem Cell Quiescence: Dynamism, Restraint, and Cellular Idling. Cell Stem Cell 2019, 24, 213–225. [Google Scholar] [CrossRef] [PubMed]
  2. Aguirre-Ghiso, J.A. Models, mechanisms and clinical evidence for cancer dormancy. Nat. Rev. Cancer 2007, 7, 834–846. [Google Scholar] [CrossRef] [PubMed]
  3. De Angelis, M.L.; Francescangeli, F.; La Torre, F.; Zeuner, A. Stem Cell Plasticity and Dormancy in the Development of Cancer Therapy Resistance. Front. Oncol. 2019, 9, 626. [Google Scholar] [CrossRef] [PubMed]
  4. Nik Nabil, W.N.; Xi, Z.; Song, Z.; Jin, L.; Zhang, X.D.; Zhou, H.; De Souza, P.; Dong, Q.; Xu, H. Towards a Framework for Better Understanding of Quiescent Cancer Cells. Cells 2021, 10, 562. [Google Scholar] [CrossRef]
  5. Yao, G. Modelling mammalian cellular quiescence. Interface Focus 2014, 4, 20130074. [Google Scholar] [CrossRef]
  6. Fujimaki, K.; Li, R.; Chen, H.; Della Croce, K.; Zhang, H.H.; Xing, J.; Bai, F.; Yao, G. Graded regulation of cellular quiescence depth between proliferation and senescence by a lysosomal dimmer switch. Proc. Natl. Acad. Sci. USA 2019, 116, 22624–22634. [Google Scholar] [CrossRef]
  7. Fujimaki, K.; Yao, G. Crack the state of silence: Tune the depth of cellular quiescence for cancer therapy. Mol. Cell Oncol. 2018, 5, e1403531. [Google Scholar] [CrossRef]
  8. Liu, B.; Shen, Y.; Huang, H.; Croce, K.D.; Wu, M.; Fan, Y.; Liu, Y.; Xu, J.; Yao, G. Curcumin derivative C212 inhibits Hsp90 and eliminates both growing and quiescent leukemia cells in deep dormancy. Cell Commun. Signal. 2020, 18, 159. [Google Scholar] [CrossRef]
  9. Wang, X.; Fujimaki, K.; Mitchell, G.C.; Kwon, J.S.; Della Croce, K.; Langsdorf, C.; Zhang, H.H.; Yao, G. Exit from quiescence displays a memory of cell growth and division. Nat. Commun. 2017, 8, 321. [Google Scholar] [CrossRef]
  10. Furusawa, C.; Kaneko, K. A dynamical-systems view of stem cell biology. Science 2012, 338, 215–217. [Google Scholar] [CrossRef]
  11. Prager, B.C.; Bhargava, S.; Mahadev, V.; Hubert, C.G.; Rich, J.N. Glioblastoma Stem Cells: Driving Resilience through Chaos. Trends Cancer 2020, 6, 223–235. [Google Scholar] [CrossRef]
  12. Huang, S. Reconciling Non-Genetic Plasticity with Somatic Evolution in Cancer. Trends Cancer 2021, 7, 309–322. [Google Scholar] [CrossRef] [PubMed]
  13. Aguirre-Ghiso, J.A. Translating the Science of Cancer Dormancy to the Clinic. Cancer Res. 2021, 81, 4673–4675. [Google Scholar] [CrossRef] [PubMed]
  14. Risson, E.; Nobre, A.R.; Maguer-Satta, V.; Aguirre-Ghiso, J.A. The current paradigm and challenges ahead for the dormancy of disseminated tumor cells. Nat. Cancer 2020, 1, 672–680. [Google Scholar] [CrossRef] [PubMed]
  15. Lanzkron, S.M.; Collector, M.I.; Sharkis, S.J. Hematopoietic stem cell tracking in vivo: A comparison of short-term and long-term repopulating cells. Blood 1999, 93, 1916–1921. [Google Scholar] [CrossRef]
  16. Pece, S.; Tosoni, D.; Confalonieri, S.; Mazzarol, G.; Vecchi, M.; Ronzoni, S.; Bernard, L.; Viale, G.; Pelicci, P.G.; Di Fiore, P.P. Biological and molecular heterogeneity of breast cancers correlates with their cancer stem cell content. Cell 2010, 140, 62–73. [Google Scholar] [CrossRef] [PubMed]
  17. Francescangeli, F.; Patrizii, M.; Signore, M.; Federici, G.; Di Franco, S.; Pagliuca, A.; Baiocchi, M.; Biffoni, M.; Ricci Vitiani, L.; Todaro, M.; et al. Proliferation state and polo-like kinase1 dependence of tumorigenic colon cancer cells. Stem Cells 2012, 30, 1819–1830. [Google Scholar] [CrossRef]
  18. Regan, J.L.; Schumacher, D.; Staudte, S.; Steffen, A.; Lesche, R.; Toedling, J.; Jourdan, T.; Haybaeck, J.; Mumberg, D.; Henderson, D.; et al. RNA sequencing of long-term label-retaining colon cancer stem cells identifies novel regulators of quiescence. iScience 2021, 24, 102618. [Google Scholar] [CrossRef]
  19. Zeng, L.; Zhao, Y.; Ouyang, T.; Zhao, T.; Zhang, S.; Chen, J.; Yu, J.; Lei, T. Label-retaining assay enriches tumor-initiating cells in glioblastoma spheres cultivated in serum-free medium. Oncol. Lett. 2016, 12, 815–824. [Google Scholar] [CrossRef]
  20. Richichi, C.; Brescia, P.; Alberizzi, V.; Fornasari, L.; Pelicci, G. Marker-independent method for isolating slow-dividing cancer stem cells in human glioblastoma. Neoplasia 2013, 15, 840–847. [Google Scholar] [CrossRef] [Green Version]
  21. Rainusso, N.; Man, T.K.; Lau, C.C.; Hicks, J.; Shen, J.J.; Yu, A.; Wang, L.L.; Rosen, J.M. Identification and gene expression profiling of tumor-initiating cells isolated from human osteosarcoma cell lines in an orthotopic mouse model. Cancer Biol. Ther. 2011, 12, 278–287. [Google Scholar] [CrossRef] [PubMed]
  22. Francescangeli, F.; Contavalli, P.; De Angelis, M.L.; Careccia, S.; Signore, M.; Haas, T.L.; Salaris, F.; Baiocchi, M.; Boe, A.; Giuliani, A.; et al. A pre-existing population of ZEB2(+) quiescent cells with stemness and mesenchymal features dictate chemoresistance in colorectal cancer. J. Exp. Clin. Cancer Res. 2020, 39, 2. [Google Scholar] [CrossRef] [PubMed]
  23. Liu, X.; Li, D.; Ma, M.; Szymanski, B.; Stanley, H.; Gao, J. Network resilience. Phys. Rep. 2022, 971, 108. [Google Scholar] [CrossRef]
  24. Eramo, A.; Lotti, F.; Sette, G.; Pilozzi, E.; Biffoni, M.; Di Virgilio, A.; Conticello, C.; Ruco, L.; Peschle, C.; De Maria, R. Identification and expansion of the tumorigenic lung cancer stem cell population. Cell Death Differ. 2008, 15, 504–514. [Google Scholar] [CrossRef] [PubMed]
  25. Francescangeli, F.; De Angelis, M.L.; Rossi, R.; Sette, G.; Eramo, A.; Boe, A.; Guardiola, O.; Tang, T.; Yu, S.C.; Minchiotti, G.; et al. CRIPTO Is a Marker of Chemotherapy-Induced Stem Cell Expansion in Non-Small Cell Lung Cancer. Front. Oncol. 2022, 12, 830873. [Google Scholar] [CrossRef] [PubMed]
  26. Orienti, I.; Salvati, V.; Sette, G.; Zucchetti, M.; Bongiorno-Borbone, L.; Peschiaroli, A.; Zolla, L.; Francescangeli, F.; Ferrari, M.; Matteo, C.; et al. A novel oral micellar fenretinide formulation with enhanced bioavailability and antitumour activity against multiple tumours from cancer stem cells. J. Exp. Clin. Cancer Res. 2019, 38, 373. [Google Scholar] [CrossRef]
  27. Roden, J.C.; King, B.W.; Trout, D.; Mortazavi, A.; Wold, B.J.; Hart, C.E. Mining gene expression data by interpreting principal components. BMC Bioinform. 2006, 7, 194. [Google Scholar] [CrossRef]
  28. Law, C.W.; Chen, Y.; Shi, W.; Smyth, G.K. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014, 15, R29. [Google Scholar] [CrossRef]
  29. Zeuner, A.; Francescangeli, F.; Contavalli, P.; Zapparelli, G.; Apuzzo, T.; Eramo, A.; Baiocchi, M.; De Angelis, M.L.; Biffoni, M.; Sette, G.; et al. Elimination of quiescent/slow-proliferating cancer stem cells by Bcl-XL inhibition in non-small cell lung cancer. Cell Death Differ. 2014, 21, 1877–1888. [Google Scholar] [CrossRef]
  30. Zimatore, G.; Tsuchiya, M.; Hashimoto, M.; Kasperski, A.; Giuliani, A. Self-organization of whole-gene expression through coordinated chromatin structural transition. Biophys. Rev. 2021, 2, 031303. [Google Scholar] [CrossRef]
  31. Jolicoeur, P.; Mosimann, J. Size and shape variation in the painted turtle. A principal component analysis. Growth 1960, 24, 16. [Google Scholar]
  32. Subramanian, A.; Tamayo, P.; Mootha, V.K.; Mukherjee, S.; Ebert, B.L.; Gillette, M.A.; Paulovich, A.; Pomeroy, S.L.; Golub, T.R.; Lander, E.S.; et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 2005, 102, 15545–15550. [Google Scholar] [CrossRef] [PubMed]
  33. Bai, Q.; Liu, H.; Guo, H.; Lin, H.; Song, X.; Jin, Y.; Liu, Y.; Guo, H.; Liang, S.; Song, R.; et al. Identification of Hub Genes Associated With Development and Microenvironment of Hepatocellular Carcinoma by Weighted Gene Co-expression Network Analysis and Differential Gene Expression Analysis. Front. Genet. 2020, 11, 615308. [Google Scholar] [CrossRef] [PubMed]
  34. Wang, Y.; Jiang, L.; Wang, X.Y.; Chen, W.; Shao, Y.; Chen, Q.K.; Lv, J.L. Evidence of altered brain network centrality in patients with diabetic nephropathy and retinopathy: An fMRI study using a voxel-wise degree centrality approach. Ther. Adv. Endocrinol. Metab. 2019, 10, 2042018819865723. [Google Scholar] [CrossRef]
  35. Di Paola, L.; Giuliani, A. Protein contact network topology: A natural language for allostery. Curr. Opin. Struct. Biol. 2015, 31, 43–48. [Google Scholar] [CrossRef]
  36. Azuaje, F.J. Selecting biologically informative genes in co-expression networks with a centrality score. Biol. Direct. 2014, 9, 12. [Google Scholar] [CrossRef]
  37. Dong, J.; Kislinger, T.; Jurisica, I.; Wigle, D.A. Lung cancer: Developmental networks gone awry? Cancer Biol. Ther. 2009, 8, 312–318. [Google Scholar] [CrossRef]
  38. Bizzarri, M.; Giuliani, A.; Cucina, A.; D’Anselmi, F.; Soto, A.M.; Sonnenschein, C. Fractal analysis in a systems biology approach to cancer. Semin. Cancer Biol. 2011, 21, 175–182. [Google Scholar] [CrossRef]
  39. Northcott, J.M.; Dean, I.S.; Mouw, J.K.; Weaver, V.M. Feeling Stress: The Mechanics of Cancer Progression and Aggression. Front. Cell Dev. Biol. 2018, 6, 17. [Google Scholar] [CrossRef]
  40. Kojima, H.; Okumura, T.; Yamaguchi, T.; Miwa, T.; Shimada, Y.; Nagata, T. Enhanced cancer stem cell properties of a mitotically quiescent subpopulation of p75NTR-positive cells in esophageal squamous cell carcinoma. Int. J. Oncol. 2017, 51, 49–62. [Google Scholar] [CrossRef]
  41. Robinson, M.; Gilbert, S.F.; Waters, J.A.; Lujano-Olazaba, O.; Lara, J.; Alexander, L.J.; Green, S.E.; Burkeen, G.A.; Patrus, O.; Sarwar, Z.; et al. Characterization of SOX2, OCT4 and NANOG in Ovarian Cancer Tumor-Initiating Cells. Cancers 2021, 13, 262. [Google Scholar] [CrossRef] [PubMed]
  42. Dhimolea, E.; de Matos Simoes, R.; Kansara, D.; Al’Khafaji, A.; Bouyssou, J.; Weng, X.; Sharma, S.; Raja, J.; Awate, P.; Shirasaki, R.; et al. An Embryonic Diapause-like Adaptation with Suppressed Myc Activity Enables Tumor Treatment Persistence. Cancer Cell 2021, 39, 240–256.e211. [Google Scholar] [CrossRef] [PubMed]
  43. Rehman, S.K.; Haynes, J.; Collignon, E.; Brown, K.R.; Wang, Y.; Nixon, A.M.L.; Bruce, J.P.; Wintersinger, J.A.; Singh Mer, A.; Lo, E.B.L.; et al. Colorectal Cancer Cells Enter a Diapause-like DTP State to Survive Chemotherapy. Cell 2021, 184, 226–242.e221. [Google Scholar] [CrossRef] [PubMed]
  44. Sackett, P.R.; Lievens, F.; Berry, C.M.; Landers, R.N. A cautionary note on the effects of range restriction on predictor intercorrelations. J. Appl. Psychol. 2007, 92, 538–544. [Google Scholar] [CrossRef]
  45. Cheung, T.H.; Rando, T.A. Molecular regulation of stem cell quiescence. Nat. Rev. Mol. Cell Biol. 2013, 14, 329–340. [Google Scholar] [CrossRef]
  46. De Angelis, M.L.; Zeuner, A.; Policicchio, E.; Russo, G.; Bruselles, A.; Signore, M.; Vitale, S.; De Luca, G.; Pilozzi, E.; Boe, A.; et al. Cancer Stem Cell-Based Models of Colorectal Cancer Reveal Molecular Determinants of Therapy Resistance. Stem Cells Transl Med. 2016, 5, 511–523. [Google Scholar] [CrossRef]
  47. Puig, I.; Tenbaum, S.P.; Chicote, I.; Arques, O.; Martinez-Quintanilla, J.; Cuesta-Borras, E.; Ramirez, L.; Gonzalo, P.; Soto, A.; Aguilar, S.; et al. TET2 controls chemoresistant slow-cycling cancer cell survival and tumor recurrence. J. Clin. Investig. 2018, 128, 3887–3905. [Google Scholar] [CrossRef]
  48. Takahashi, K.; Tanabe, K.; Ohnuki, M.; Narita, M.; Ichisaka, T.; Tomoda, K.; Yamanaka, S. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 2007, 131, 861–872. [Google Scholar] [CrossRef]
  49. Takahashi, K.; Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 2006, 126, 663–676. [Google Scholar] [CrossRef]
  50. Chen, X.; Johns, D.C.; Geiman, D.E.; Marban, E.; Dang, D.T.; Hamlin, G.; Sun, R.; Yang, V.W. Kruppel-like factor 4 (gut-enriched Kruppel-like factor) inhibits cell proliferation by blocking G1/S progression of the cell cycle. J. Biol. Chem. 2001, 276, 30423–30428. [Google Scholar] [CrossRef]
  51. Ma, Y.; Wu, L.; Liu, X.; Xu, Y.; Shi, W.; Liang, Y.; Yao, L.; Zheng, J.; Zhang, J. KLF4 inhibits colorectal cancer cell proliferation dependent on NDRG2 signaling. Oncol. Rep. 2017, 38, 975–984. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Yu, T.; Chen, X.; Zhang, W.; Liu, J.; Avdiushko, R.; Napier, D.L.; Liu, A.X.; Neltner, J.M.; Wang, C.; Cohen, D.; et al. KLF4 regulates adult lung tumor-initiating cells and represses K-Ras-mediated lung cancer. Cell Death Differ. 2016, 23, 207–215. [Google Scholar] [CrossRef] [PubMed]
  53. Lv, T.; Zhao, Y.; Jiang, X.; Yuan, H.; Wang, H.; Cui, X.; Xu, J.; Zhao, J.; Wang, J. uPAR: An Essential Factor for Tumor Development. J. Cancer 2021, 12, 7026–7040. [Google Scholar] [CrossRef] [PubMed]
  54. Yan, K.S.; Chia, L.A.; Li, X.; Ootani, A.; Su, J.; Lee, J.Y.; Su, N.; Luo, Y.; Heilshorn, S.C.; Amieva, M.R.; et al. The intestinal stem cell markers Bmi1 and Lgr5 identify two functionally distinct populations. Proc. Natl. Acad. Sci. USA 2012, 109, 466–471. [Google Scholar] [CrossRef]
  55. Barriga, F.M.; Montagni, E.; Mana, M.; Mendez-Lago, M.; Hernando-Momblona, X.; Sevillano, M.; Guillaumet-Adkins, A.; Rodriguez-Esteban, G.; Buczacki, S.J.A.; Gut, M.; et al. Mex3a Marks a Slowly Dividing Subpopulation of Lgr5+ Intestinal Stem Cells. Cell Stem Cell 2017, 20, 801–816.e807. [Google Scholar] [CrossRef]
  56. Fardi, M.; Alivand, M.; Baradaran, B.; Farshdousti Hagh, M.; Solali, S. The crucial role of ZEB2: From development to epithelial-to-mesenchymal transition and cancer complexity. J. Cell Physiol. 2019, 234, 14783–14799. [Google Scholar] [CrossRef]
  57. Li, N.; Babaei-Jadidi, R.; Lorenzi, F.; Spencer-Dene, B.; Clarke, P.; Domingo, E.; Tulchinsky, E.; Vries, R.G.J.; Kerr, D.; Pan, Y.; et al. An FBXW7-ZEB2 axis links EMT and tumour microenvironment to promote colorectal cancer stem cells and chemoresistance. Oncogenesis 2019, 8, 13. [Google Scholar] [CrossRef]
  58. Shibue, T.; Weinberg, R.A. EMT, CSCs, and drug resistance: The mechanistic link and clinical implications. Nat. Rev. Clin. Oncol. 2017, 14, 611–629. [Google Scholar] [CrossRef]
  59. Hosseini, H.; Obradovic, M.M.S.; Hoffmann, M.; Harper, K.L.; Sosa, M.S.; Werner-Klein, M.; Nanduri, L.K.; Werno, C.; Ehrl, C.; Maneck, M.; et al. Early dissemination seeds metastasis in breast cancer. Nature 2016, 540, 552–558. [Google Scholar] [CrossRef]
  60. Ju, S.; Wang, F.; Wang, Y.; Ju, S. CSN8 is a key regulator in hypoxia-induced epithelial-mesenchymal transition and dormancy of colorectal cancer cells. Mol. Cancer 2020, 19, 168. [Google Scholar] [CrossRef]
  61. Lawson, D.A.; Bhakta, N.R.; Kessenbrock, K.; Prummel, K.D.; Yu, Y.; Takai, K.; Zhou, A.; Eyob, H.; Balakrishnan, S.; Wang, C.Y.; et al. Single-cell analysis reveals a stem-cell program in human metastatic breast cancer cells. Nature 2015, 526, 131–135. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Prunier, C.; Baker, D.; Ten Dijke, P.; Ritsma, L. TGF-beta Family Signaling Pathways in Cellular Dormancy. Trends Cancer 2019, 5, 66–78. [Google Scholar] [CrossRef]
  63. Barney, L.E.; Hall, C.L.; Schwartz, A.D.; Parks, A.N.; Sparages, C.; Galarza, S.; Platt, M.O.; Mercurio, A.M.; Peyton, S.R. Tumor cell-organized fibronectin maintenance of a dormant breast cancer population. Sci. Adv. 2020, 6, eaaz4157. [Google Scholar] [CrossRef] [PubMed]
  64. Carlson, P.; Dasgupta, A.; Grzelak, C.A.; Kim, J.; Barrett, A.; Coleman, I.M.; Shor, R.E.; Goddard, E.T.; Dai, J.; Schweitzer, E.M. Targeting the perivascular niche sensitizes disseminated tumour cells to chemotherapy. Nat. Cell Biol. 2019, 21, 238–250. [Google Scholar] [CrossRef] [PubMed]
  65. Roussos, E.T.; Condeelis, J.S.; Patsialou, A.J.N.R.C. Chemotaxis in cancer. Nat. Rev. Cancer 2011, 11, 573–587. [Google Scholar] [CrossRef]
  66. Scognamiglio, R.; Cabezas-Wallscheid, N.; Thier, M.C.; Altamura, S.; Reyes, A.; Prendergast, A.M.; Baumgartner, D.; Carnevalli, L.S.; Atzberger, A.; Haas, S.; et al. Myc Depletion Induces a Pluripotent Dormant State Mimicking Diapause. Cell 2016, 164, 668–680. [Google Scholar] [CrossRef]
  67. Araujo, T.G.; Mota, S.T.S.; Ferreira, H.S.V.; Ribeiro, M.A.; Goulart, L.R.; Vecchi, L. Annexin A1 as a Regulator of Immune Response in Cancer. Cells 2021, 10, 2245. [Google Scholar] [CrossRef]
  68. Babbin, B.A.; Lee, W.Y.; Parkos, C.A.; Winfree, L.M.; Akyildiz, A.; Perretti, M.; Nusrat, A. Annexin I regulates SKCO-15 cell invasion by signaling through formyl peptide receptors. J. Biol. Chem. 2006, 281, 19588–19599. [Google Scholar] [CrossRef]
  69. Foo, S.L.; Sachaphibulkij, K.; Lee, C.L.Y.; Yap, G.L.R.; Cui, J.; Arumugam, T.; Lim, L.H.K. Breast cancer metastasis to brain results in recruitment and activation of microglia through annexin-A1/formyl peptide receptor signaling. Breast Cancer Res. 2022, 24, 25. [Google Scholar] [CrossRef]
  70. Han, G.; Lu, K.; Xu, W.; Zhang, S.; Huang, J.; Dai, C.; Sun, G.; Ye, J. Annexin A1-mediated inhibition of inflammatory cytokines may facilitate the resolution of inflammation in acute radiation-induced lung injury. Oncol. Lett. 2019, 18, 321–329. [Google Scholar] [CrossRef]
  71. Oshi, M.; Tokumaru, Y.; Mukhopadhyay, S.; Yan, L.; Matsuyama, R.; Endo, I.; Takabe, K. Annexin A1 Expression Is Associated with Epithelial-Mesenchymal Transition (EMT), Cell Proliferation, Prognosis, and Drug Response in Pancreatic Cancer. Cells 2021, 10, 653. [Google Scholar] [CrossRef] [PubMed]
  72. Glorieux, C.; Dejeans, N.; Sid, B.; Beck, R.; Calderon, P.B.; Verrax, J. Catalase overexpression in mammary cancer cells leads to a less aggressive phenotype and an altered response to chemotherapy. Biochem. Pharmacol. 2011, 82, 1384–1390. [Google Scholar] [CrossRef] [PubMed]
  73. Dhiman, G.; Srivastava, N.; Goyal, M.; Rakha, E.; Lothion-Roy, J.; Mongan, N.P.; Miftakhova, R.R.; Khaiboullina, S.F.; Rizvanov, A.A.; Baranwal, M. Metadherin: A Therapeutic Target in Multiple Cancers. Front. Oncol. 2019, 9, 349. [Google Scholar] [CrossRef]
  74. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed]
  75. Patro, R.; Duggal, G.; Love, M.I.; Irizarry, R.A.; Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 2017, 14, 417–419. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. PCA and limma results. (A) Samples projection on the loading space of the second and third components. Each point corresponds to a sample of the log-transformed original expression data matrix, whose coordinates are given by the PC2 and PC3 loadings of that sample (see Table 1B). It is worth noting the cell line discrimination on PC2 and the quiescent/proliferative separation on PC3. (B) Mean difference plot reporting average log expression values versus log2 fold changes (logFC). LogFC are obtained performing differential expression analysis on the log-transformed original expression data matrix. Significantly differentially expressed genes are obtained by sorting in decreasing order the LogFC distribution and keeping the top 6000 values (highlighted in red—PKH26+ specific) and the bottom 6000 (highlighted in blue—PKH26 specific), which resulted to be associated to |LogFC| > 1.95. (C,D) Venn diagrams reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma and PCA approaches highlighting the relevant superposition between the two methods. Duplicated genes are removed. Statistical significance of the overlap: (C) p-value < 1 × 10−4; RR = 24.0; (D) p-value < 1 × 10−4; RR = 23.4.
Figure 1. PCA and limma results. (A) Samples projection on the loading space of the second and third components. Each point corresponds to a sample of the log-transformed original expression data matrix, whose coordinates are given by the PC2 and PC3 loadings of that sample (see Table 1B). It is worth noting the cell line discrimination on PC2 and the quiescent/proliferative separation on PC3. (B) Mean difference plot reporting average log expression values versus log2 fold changes (logFC). LogFC are obtained performing differential expression analysis on the log-transformed original expression data matrix. Significantly differentially expressed genes are obtained by sorting in decreasing order the LogFC distribution and keeping the top 6000 values (highlighted in red—PKH26+ specific) and the bottom 6000 (highlighted in blue—PKH26 specific), which resulted to be associated to |LogFC| > 1.95. (C,D) Venn diagrams reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma and PCA approaches highlighting the relevant superposition between the two methods. Duplicated genes are removed. Statistical significance of the overlap: (C) p-value < 1 × 10−4; RR = 24.0; (D) p-value < 1 × 10−4; RR = 23.4.
Ijms 23 09869 g001
Figure 2. Differential expression analysis performed with Principal Component Analysis is consistent with that performed with the standard limma. (A) Samples projection on the loading space of the first and fourth components. Each point corresponds to a sample of the log-transformed original expression data matrix, whose coordinates correspond to PC2 and PC3 loadings of that sample (see Table 2B). (B) Mean Difference plot reporting average log expression values versus log2 fold changes (logFC). LogFC are obtained performing differential expression analysis on the log-transformed original expression data matrix. Significantly differentially expressed genes are achieved sorting in decreasing order the LogFC distribution and keeping the top 6000 values (highlighted in red—PKH26+ specific) and the bottom 6000 (highlighted in blue—PKH26 specific), which resulted to be associated to |LogFC| > 0.2. (C,D) Venn diagrams reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma and PCA approaches highlighting the relevant superposition between the two methods. Duplicated genes were removed. Statistical significance of the overlap: (C) p-value < 1 × 10−4; RR = 12.9; (D) p-value < 1 × 10−4; RR = 11.8.
Figure 2. Differential expression analysis performed with Principal Component Analysis is consistent with that performed with the standard limma. (A) Samples projection on the loading space of the first and fourth components. Each point corresponds to a sample of the log-transformed original expression data matrix, whose coordinates correspond to PC2 and PC3 loadings of that sample (see Table 2B). (B) Mean Difference plot reporting average log expression values versus log2 fold changes (logFC). LogFC are obtained performing differential expression analysis on the log-transformed original expression data matrix. Significantly differentially expressed genes are achieved sorting in decreasing order the LogFC distribution and keeping the top 6000 values (highlighted in red—PKH26+ specific) and the bottom 6000 (highlighted in blue—PKH26 specific), which resulted to be associated to |LogFC| > 0.2. (C,D) Venn diagrams reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma and PCA approaches highlighting the relevant superposition between the two methods. Duplicated genes were removed. Statistical significance of the overlap: (C) p-value < 1 × 10−4; RR = 12.9; (D) p-value < 1 × 10−4; RR = 11.8.
Ijms 23 09869 g002
Figure 3. Shared genes resulting from differential expression analysis performed with the standard limma. (A) Venn diagram reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma approach in the lung (3661 genes) and colon (2539 genes) systems, and in both systems (688 genes). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 7.2. Genes are considered significant if they appear in the first 6000 of the ranking built on the logFC, with a nominal p-value < 0.05. Duplicated genes are removed. (B) Venn diagram reporting the number of significantly down-regulated genes in the PKH26+ state identified by the limma approach in the lung (3672 genes) and colon (2898 genes) systems and in both systems (599 genes). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 5.9. Genes are considered significant if they appear in the last 6000 of the ranking built on the logFC, with a nominal p-value < 0.05. Duplicated genes are removed. (C) Barplot reporting the number of significantly up- and down-regulated genes in the PKH26+ state, identified by the limma approach and shared by the lung and colon systems, belonging to some interesting KEGG categories (reported on the x-axis) found using Panther. Abbreviations: num_genes = number of genes; down = down-regulated; up = up-regulated.
Figure 3. Shared genes resulting from differential expression analysis performed with the standard limma. (A) Venn diagram reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma approach in the lung (3661 genes) and colon (2539 genes) systems, and in both systems (688 genes). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 7.2. Genes are considered significant if they appear in the first 6000 of the ranking built on the logFC, with a nominal p-value < 0.05. Duplicated genes are removed. (B) Venn diagram reporting the number of significantly down-regulated genes in the PKH26+ state identified by the limma approach in the lung (3672 genes) and colon (2898 genes) systems and in both systems (599 genes). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 5.9. Genes are considered significant if they appear in the last 6000 of the ranking built on the logFC, with a nominal p-value < 0.05. Duplicated genes are removed. (C) Barplot reporting the number of significantly up- and down-regulated genes in the PKH26+ state, identified by the limma approach and shared by the lung and colon systems, belonging to some interesting KEGG categories (reported on the x-axis) found using Panther. Abbreviations: num_genes = number of genes; down = down-regulated; up = up-regulated.
Ijms 23 09869 g003
Figure 4. Lung QCCs and colorectal QCCs share some interesting significantly enriched pathways. (A) RNA sequencing-generated gene set enrichment analysis for Mesenchyme Development (nominal p-value = 0.032, FDR q-value = 0.212), regulation of chemotaxis (nominal p-value = 0.012, FDR q-value = 0.127), cell chemotaxis (nominal p-value = 0.005, FDR q-value = 0.105), cytokine production (nominal p-value = 0.050, FDR q-value = 0.271), response to transforming growth factor beta (nominal p-value < 1 × 10−10, FDR q-value = 0.058), negative regulation of cell population proliferation (nominal p-value = 0.018, FDR q-value = 0.248), transforming growth factor beta receptor signaling (nominal p-value < 1 × 10−10, FDR q-value = 0.057), positive regulation of catalytic activity (nominal p-value = 0.019, FDR q-value = 0.256), regulation of cell adhesion (nominal p-value < 1 × 10−10, FDR q-value = 0.064) and epithelial mesenchymal transition (nominal p-value = 0.005, FDR q-value = 0.051) in PKH26+ lung cancer cells (compared with PKH26 cells). (B) RNA sequencing-generated gene set enrichment analysis for Mesenchyme Development (nominal p-value < 1 × 10−10, FDR q-value = 0.015), regulation of chemotaxis (nominal p-value < 1 × 10−10, FDR q-value = 0.026), cell chemotaxis (nominal p-value = 0.002, FDR q-value = 0.038), cytokine production (nominal pvalue = 0.002, FDR q-value = 0.069), response to transforming growth factor beta (nominal p-value = 0.011, FDR q-value = 0.108), negative regulation of cell population proliferation (nominal p-value = 0.002, FDR q-value = 0.105), transforming growth factor beta receptor signaling (nominal p-value = 0.046, FDR q-value = 0.178), positive regulation of catalytic activity (nominal p-value = 0.030, FDR q-value = 0.230), regulation of cell adhesion (nominal p-value = 0.047, FDR q-value = 0.250) and epithelial mesenchymal transition (nominal p-value < 1 × 10−10, FDR q-value = 0.002) in PKH26+ colon cancer cells (compared with PKH26 cells). (C) Venn diagram reporting the number of significantly enriched gene sets identified in lung PKH26+ cells (282 enriched pathways, p-value < 0.05), colon PKH26+ cells (233 enriched pathways, p-value < 0.05) and in both lung and colon PKH26+ cells (137 enriched pathways, p-value < 0.05). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 3.4. (D) Venn diagram reporting the number of significantly enriched gene sets identified in lung PKH26 cells (91 enriched pathways, p-value < 0.05), colon PKH26 cells (166 enriched pathways, p-value < 0.05) and in both lung and colon PKH26 cells (22 enriched pathways, p-value < 0.05). Statistical significance of the overlap: p-value = 0.003; RR = 4.4.
Figure 4. Lung QCCs and colorectal QCCs share some interesting significantly enriched pathways. (A) RNA sequencing-generated gene set enrichment analysis for Mesenchyme Development (nominal p-value = 0.032, FDR q-value = 0.212), regulation of chemotaxis (nominal p-value = 0.012, FDR q-value = 0.127), cell chemotaxis (nominal p-value = 0.005, FDR q-value = 0.105), cytokine production (nominal p-value = 0.050, FDR q-value = 0.271), response to transforming growth factor beta (nominal p-value < 1 × 10−10, FDR q-value = 0.058), negative regulation of cell population proliferation (nominal p-value = 0.018, FDR q-value = 0.248), transforming growth factor beta receptor signaling (nominal p-value < 1 × 10−10, FDR q-value = 0.057), positive regulation of catalytic activity (nominal p-value = 0.019, FDR q-value = 0.256), regulation of cell adhesion (nominal p-value < 1 × 10−10, FDR q-value = 0.064) and epithelial mesenchymal transition (nominal p-value = 0.005, FDR q-value = 0.051) in PKH26+ lung cancer cells (compared with PKH26 cells). (B) RNA sequencing-generated gene set enrichment analysis for Mesenchyme Development (nominal p-value < 1 × 10−10, FDR q-value = 0.015), regulation of chemotaxis (nominal p-value < 1 × 10−10, FDR q-value = 0.026), cell chemotaxis (nominal p-value = 0.002, FDR q-value = 0.038), cytokine production (nominal pvalue = 0.002, FDR q-value = 0.069), response to transforming growth factor beta (nominal p-value = 0.011, FDR q-value = 0.108), negative regulation of cell population proliferation (nominal p-value = 0.002, FDR q-value = 0.105), transforming growth factor beta receptor signaling (nominal p-value = 0.046, FDR q-value = 0.178), positive regulation of catalytic activity (nominal p-value = 0.030, FDR q-value = 0.230), regulation of cell adhesion (nominal p-value = 0.047, FDR q-value = 0.250) and epithelial mesenchymal transition (nominal p-value < 1 × 10−10, FDR q-value = 0.002) in PKH26+ colon cancer cells (compared with PKH26 cells). (C) Venn diagram reporting the number of significantly enriched gene sets identified in lung PKH26+ cells (282 enriched pathways, p-value < 0.05), colon PKH26+ cells (233 enriched pathways, p-value < 0.05) and in both lung and colon PKH26+ cells (137 enriched pathways, p-value < 0.05). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 3.4. (D) Venn diagram reporting the number of significantly enriched gene sets identified in lung PKH26 cells (91 enriched pathways, p-value < 0.05), colon PKH26 cells (166 enriched pathways, p-value < 0.05) and in both lung and colon PKH26 cells (22 enriched pathways, p-value < 0.05). Statistical significance of the overlap: p-value = 0.003; RR = 4.4.
Ijms 23 09869 g004
Figure 5. Network built on the genes up-regulated in PKH26+ shared by the lung and colon systems. (A) Network visualization. It was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). (B) Genes’ projection in the Clustering coefficient/betweenness space. It refers to the network built considering the usual threshold of Pearson r = |0.7|. Genes linked to the TGF-β pathway are highlighted in blue.
Figure 5. Network built on the genes up-regulated in PKH26+ shared by the lung and colon systems. (A) Network visualization. It was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). (B) Genes’ projection in the Clustering coefficient/betweenness space. It refers to the network built considering the usual threshold of Pearson r = |0.7|. Genes linked to the TGF-β pathway are highlighted in blue.
Ijms 23 09869 g005
Figure 6. Network built on the PKH26+, PKH26 and all-samples datasets. (A) Betweenness centrality scores in both PKH26+ (x-axis) and PKH26 (y-axis) networks. “High traffic” genes behaving differently between PKH26+ and PKH26 are highlighted in red. (B) Network visualization from the PKH26+ dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S14 for more details. (C) Network visualization from the PKH26 dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S15 for more details. (D) Network visualization from the all-samples dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S16 for more details. (E) Venn diagram reporting the number of genes identified in the PKH26+ (40 genes) and in the PKH26 network (61 genes), and in both PKH26+ and PKH26 networks (49 genes). Networks were built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 3.6.
Figure 6. Network built on the PKH26+, PKH26 and all-samples datasets. (A) Betweenness centrality scores in both PKH26+ (x-axis) and PKH26 (y-axis) networks. “High traffic” genes behaving differently between PKH26+ and PKH26 are highlighted in red. (B) Network visualization from the PKH26+ dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S14 for more details. (C) Network visualization from the PKH26 dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S15 for more details. (D) Network visualization from the all-samples dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S16 for more details. (E) Venn diagram reporting the number of genes identified in the PKH26+ (40 genes) and in the PKH26 network (61 genes), and in both PKH26+ and PKH26 networks (49 genes). Networks were built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 3.6.
Ijms 23 09869 g006
Table 1. PCA results. (A) Decomposition of the correlation matrix expressed as eigenvalues, difference between adjacent eigenvalues, proportion of variance explained by each component and cumulative variance. (B) Component pattern (loadings). Each loading corresponds to the correlation coefficient between original variables (samples) and corresponding component.
Table 1. PCA results. (A) Decomposition of the correlation matrix expressed as eigenvalues, difference between adjacent eigenvalues, proportion of variance explained by each component and cumulative variance. (B) Component pattern (loadings). Each loading corresponds to the correlation coefficient between original variables (samples) and corresponding component.
(A)
ComponentEigenvalueDifferenceProportionCumulative
15.9704.8160.4980.498
21.1540.2480.0960.594
30.9060.3070.0760.670
40.5990.0490.0500.720
(B)
SamplePC1PC2PC3PC4
p136.10.2730.3640.3000.053
p136.20.2300.4950.224−0.037
p136.30.2750.3640.2750.068
m136.10.305−0.040−0.2250.358
m136.20.2130.298−0.600−0.689
m136.30.2460.268−0.4620.531
p229.10.320−0.2000.178−0.160
p229.20.318−0.1780.226−0.166
p229.30.316−0.1590.201−0.140
m229.10.318−0.285−0.1760.133
m229.20.325−0.307−0.0520.068
m229.30.296−0.235−0.064−0.113
PC1 = ‘tissue attractor’ [30]. This component is by far the most relevant in terms of percent of variance explained (around 50%) and has all positive loadings; thus, it can be considered as a ‘size’ component [31] stemming from the largely invariant tissue-specific average genome expression profile. PC2 = ‘cell line effect’ this component accounts for approximately 10% of total variance; it is a ‘shape’ component [31] having both positive and negative loading correspondent to a directionality of effect having as poles (opposite loadings, see Table 1B) the lsc136 and lsc229 cell lines. This implies the presence of a neat ‘effect line’ (no exception to this loading sign rule, Table 1B) pointing to the presence of genes with a correlated expression with opposite up (down) expression in the two lines independently of the PKH26+/PKH26− phenotype. PC3 = ‘quiescent/replicative effect’ this component accounts for 7.5% of total variance and discriminates PKH26+ and PKH26 (p vs. m) phenotypes in terms of loading sign. Again, this is a shape component pointing to the presence of a clear opposite pattern of gene activation/repression discriminating the two conditions. It is worth noting that principal components are each other orthogonal by construction, thus implying the above effects (tissue attractor, line effect, phenotype effect) do not superimpose. This means that the quiescent/dormant discrimination happens through the same underlying mechanism in both lines.
Table 2. Typical ‘stemness’ genes up-regulated in the lung cancer PKH26+ state being part of the top 6000 genes selected by limma analysis. Differential expression evaluated through limma as previously described.
Table 2. Typical ‘stemness’ genes up-regulated in the lung cancer PKH26+ state being part of the top 6000 genes selected by limma analysis. Differential expression evaluated through limma as previously described.
Gene NamelogFCp-Value
KLF42.11540.0770
PLAUR/CD872.33510.0837
CD443.78350.0186
ALCAM/CD1666.48520.0000
Table 3. PCA results. (A) Decomposition of the correlation matrix expressed as eigenvalues, difference between adjacent eigenvalues, proportion of variance explained by each component and cumulative variance. (B) Component pattern (loadings). Each loading corresponds to the correlation coefficient between original variables (samples) and corresponding component.
Table 3. PCA results. (A) Decomposition of the correlation matrix expressed as eigenvalues, difference between adjacent eigenvalues, proportion of variance explained by each component and cumulative variance. (B) Component pattern (loadings). Each loading corresponds to the correlation coefficient between original variables (samples) and corresponding component.
(A)
ComponentEigenvalueDifferenceProportionCumulative
19.8749.8320.9870.987
20.0420.0170.0040.991
30.0250.0100.0030.994
40.0150.0030.0020.996
(B)
SamplePC1PC2PC3PC4
PKH26+1−0.3170.156−0.2720.336
PKH26+2−0.3170.065−0.1510.255
PKH26+3−0.3170.100−0.2210.166
PKH26+4−0.316−0.0380.6940.478
PKH26+5−0.314−0.799−0.0400.087
PKH261−0.3160.3470.353−0.396
PKH262−0.3170.143−0.363−0.013
PKH263−0.3170.121−0.262−0.157
PKH264−0.3170.2330.208−0.163
PKH265−0.316−0.3360.057−0.591
Table 4. Typical ‘stemness’ genes up-regulated in the colorectal cancer PKH26+ state. Differential expression evaluated through the limma approach.
Table 4. Typical ‘stemness’ genes up-regulated in the colorectal cancer PKH26+ state. Differential expression evaluated through the limma approach.
Gene NamelogFCp-Value
KLF40.38380.0078
AXIN20.42810.0481
LGR50.80000.0095
BMI10.39260.0015
Table 5. Relevant pathways shared by the two lung and colon systems. In yellow are highlighted the pathways enriched in PKH26; in red are highlighted those enriched in PKH26+.
Table 5. Relevant pathways shared by the two lung and colon systems. In yellow are highlighted the pathways enriched in PKH26; in red are highlighted those enriched in PKH26+.
DatabasePathway
HallmarkMYC_TARGETS_V1
HallmarkMYC_TARGETS_V2
HallmarkEPITHELIAL_MESENCHYMAL_TRANSITION
Gene OntologyNCRNA_METABOLIC_PROCESS
Gene OntologyTRNA_METABOLIC_PROCESS
Gene OntologyRRNA_METABOLIC_PROCESS
Gene OntologyMITOTIC_CELL_CYCLE
Gene OntologyMITOTIC_CELL_CYCLE_PROCESS
Gene OntologyMRNA_PROCESSING
Gene OntologyRNA_PROCESSING
Gene OntologyMESENCHIME_DEVELOPMENT
Gene OntologyREGULATION_OF_CHEMOTAXIS
Gene OntologyCELL_CHEMOTAXIS
Gene OntologyCYTOKINE_PRODUCTION
Gene OntologyRESPONSE_TO_TRANSFORMING_GROWTH_FACTOR_BETA
Gene OntologyNEGATIVE_REGULATION_OF_CELL_POPULATION_PROLIFERATION
Gene OntologyTRANSFORMING_GROWTH_FACTOR_BETA_RECEPTOR_SIGNALING_PATHWAY
Gene OntologyPOSITIVE_REGULATION_OF_CATALYTIC_ACTIVITY
Gene OntologyREGULATION_OF_CELL_ADHESION
Table 6. Morphogenesis pathways, significantly enriched in the PKH26+ state, shared by the two lung and colon systems.
Table 6. Morphogenesis pathways, significantly enriched in the PKH26+ state, shared by the two lung and colon systems.
DatabasePathway
Gene OntologyANATOMICAL_STRUCTURE_FORMATION_INVOLVED_IN_MORPHOGENESIS
Gene OntologyANIMAL_ORGAN_MORPHOGENESIS
Gene OntologyBLOOD_VESSEL_MORPHOGENESIS
Gene OntologyEMBRYONIC_MORPHOGENESIS
Gene OntologyHEART_MORPHOGENESIS
Gene OntologyMORPHOGENESIS_OF_AN_EPITHELIUM
Gene OntologyREGULATION_OF_ANATOMICAL_STRUCTURE_MORPHOGENESIS
Gene OntologyTISSUE_MORPHOGENESIS
Gene OntologyTUBE_DEVELOPMENT
Gene OntologyTUBE_MORPHOGENESIS
Table 7. Pathway distribution across the classes. PKH26+, PKH26 and all-samples datasets refer to the genes composing the giant components reported in Figure 6B–E.
Table 7. Pathway distribution across the classes. PKH26+, PKH26 and all-samples datasets refer to the genes composing the giant components reported in Figure 6B–E.
PathwayPKH26+PKH26All_Samples
ANATOMICAL_STRUCTURE_FORMATION_INVOLVED_IN_MORPHOGENESIS162818
ANIMAL_ORGAN_MORPHOGENESIS172315
BLOOD_VESSEL_MORPHOGENESIS7177
EMBRYONIC_MORPHOGENESIS7128
HEART_MORPHOGENESIS741
MORPHOGENESIS_OF_AN_EPITHELIUM9126
REGULATION_OF_ANATOMICAL_STRUCTURE_MORPHOGENESIS122414
TISSUE_MORPHOGENESIS12158
TUBE_DEVELOPMENT142311
TUBE_MORPHOGENESIS11229
Table 8. Correlation matrix for Table 5.
Table 8. Correlation matrix for Table 5.
PKH26+PKH26All_Samples
PKH26+1.000.850.93
PKH260.851.000.80
all_samples0.930.801.00
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cuccu, A.; Francescangeli, F.; De Angelis, M.L.; Bruselles, A.; Giuliani, A.; Zeuner, A. Analysis of Dormancy-Associated Transcriptional Networks Reveals a Shared Quiescence Signature in Lung and Colorectal Cancer. Int. J. Mol. Sci. 2022, 23, 9869. https://doi.org/10.3390/ijms23179869

AMA Style

Cuccu A, Francescangeli F, De Angelis ML, Bruselles A, Giuliani A, Zeuner A. Analysis of Dormancy-Associated Transcriptional Networks Reveals a Shared Quiescence Signature in Lung and Colorectal Cancer. International Journal of Molecular Sciences. 2022; 23(17):9869. https://doi.org/10.3390/ijms23179869

Chicago/Turabian Style

Cuccu, Adriano, Federica Francescangeli, Maria Laura De Angelis, Alessandro Bruselles, Alessandro Giuliani, and Ann Zeuner. 2022. "Analysis of Dormancy-Associated Transcriptional Networks Reveals a Shared Quiescence Signature in Lung and Colorectal Cancer" International Journal of Molecular Sciences 23, no. 17: 9869. https://doi.org/10.3390/ijms23179869

APA Style

Cuccu, A., Francescangeli, F., De Angelis, M. L., Bruselles, A., Giuliani, A., & Zeuner, A. (2022). Analysis of Dormancy-Associated Transcriptional Networks Reveals a Shared Quiescence Signature in Lung and Colorectal Cancer. International Journal of Molecular Sciences, 23(17), 9869. https://doi.org/10.3390/ijms23179869

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop