Next Article in Journal
Genetic and Pathophysiological Basis of Cardiac and Skeletal Muscle Laminopathies
Previous Article in Journal
Novel Loss-of-Function SYCP2 Variants in Infertile Males Upgrade the Gene–Disease Clinical Validity Classification for SYCP2 and Male Infertility to Strong
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integration of Bioinformatics and Machine Learning to Identify CD8+ T Cell-Related Prognostic Signature to Predict Clinical Outcomes and Treatment Response in Breast Cancer Patients

1
Institute of Physical Education and Sport, Shanxi University, Taiyuan 030006, China
2
Capital University of Physical Education and Sports, Beijing 100191, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2024, 15(8), 1093; https://doi.org/10.3390/genes15081093
Submission received: 19 July 2024 / Revised: 15 August 2024 / Accepted: 16 August 2024 / Published: 19 August 2024
(This article belongs to the Section Bioinformatics)

Abstract

:
The incidence of breast cancer (BC) continues to rise steadily, posing a significant burden on the public health systems of various countries worldwide. As a member of the tumor microenvironment (TME), CD8+ T cells inhibit cancer progression through their protective role. This study aims to investigate the role of CD8+ T cell-related genes (CTRGs) in breast cancer patients. Methods: We assessed the abundance of CD8+ T cells in the TCGA and METABRIC datasets and obtained CTRGs through WGCNA. Subsequently, a prognostic signature (CTR score) was constructed from CTRGs screened by seven machine learning algorithms, and the relationship between the CTR score and TME, immunotherapy, and drug sensitivity was analyzed. Additionally, CTRGs’ expression in different cells within TME was identified through single-cell analysis and spatial transcriptomics. Finally, the expression of CTRGs in clinical tissues was verified via RT-PCR. Results: The CD8+ T cell-related prognostic signature consists of two CTRGs. In the TCGA and METABRIC datasets, the CTR score appeared to be negatively linked to the abundance of CD8+ T cells, and BC patients with higher risk score show a worse prognosis. The low CTR score group exhibits higher immune infiltration levels, closely associated with inhibiting the tumor microenvironment. Compared with the high CTR score group, the low CTR score group shows better responses to chemotherapy and immune checkpoint therapy. Single-cell analysis and spatial transcriptomics reveal the heterogeneity of two CTRGs in different cells. Compared with the adjacent tissues, CD163L1 and KLRB1 mRNA are downregulated in tumor tissues. Conclusions: This study establishes a robust CD8+ T cell-related prognostic signature, providing new insights for predicting the clinical outcomes and treatment responses of breast cancer patients.

1. Introduction

Breast cancer is one of the main diseases that threaten women’s health and cause disability and death [1]. Global data from 2020 reveal 2.3 million new cases of breast cancer, representing approximately 12% of all newly diagnosed cancers, along with 685,000 related deaths [2]. With the success of novel systemic therapies, the prognosis of patients with early and advanced breast cancer has steadily and progressively improved [3]. Despite the fact that the majority of early-stage breast cancer cases are curable with various treatment options, the incidence of breast cancer continues to increase [4,5]. Identification of biomarkers is essential for the diagnosis and treatment of breast cancer. Several recent studies have found that circulating tumor cells (CTC), circulating tumor DNA (ctDNA), circulating tumor RNA (ctRNA) and extracellular vesicles (EV) in liquid biopsies have shown great potential as BC biomarkers [6,7,8,9]. As one of the biomarkers, prognostic signature can identify high-risk patients for effective treatment and potentially reduce mortality.
Cancer cells proliferate and survive in complex tissue ecosystems called tumor microenvironments (TMEs), which crucially contribute to cancer cell growth, invasion and metastasis [10]. A diverse range of immune and non-immune cells, as well as many factors secreted by them, are present in the TME, and the interactions of these cells form an intra-tumor environment that is chronically inflammatory, immunosuppressive, and pro-angiogenic [11]. TME plays a key role in dynamically regulating tumor progression and targeting the malignant components of TME shows great potential in the treatment of cancer [12,13]. As part of TME, CD8+ T cells play an important role in inhibiting cancer development. CD8+ T cells have the ability to recognize and kill cancer cells, and tumor cell-specific CD8+ T cells can be measured in tumors [14,15]. CD8+ T cells residing in tumor tissues were found to correlate with the expression of PD-1/PD-L1, which suppresses the immune activity of tumors [16]. In addition, immunotherapies (cytokine therapy, immune checkpoint blockade, chimeric antigen receptor therapy, and overt T cell transfer) that enhance the activity of antigen-specific CD8+ T cells have been successful in many tumors [17,18]. Therefore, constructing CD8+ T cell-related prognostic traits is important for predicting the prognosis of breast cancer as well as the response to treatment.
In this study, we built the CD8+ T cell-related prognostic signature containing two genes by WGCNA and machine learning in different cohorts, and investigated its relationship with immune cell infiltration, immunotherapy and drug sensitivity. These provide important references for predicting clinical outcomes as well as treatment response in breast cancer patients.

2. Materials and Methods

2.1. Data Acquisition and Processing

Gene expression data and clinical data for TCGA-BRCA, METABRIC, GSE96058, and GSE20685 were obtained from The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/, accessed on 8 January 2024), cBioPortal for Cancer Genomics (https://www.cbioportal.org/, accessed on 8 January 2024), and Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/, accessed on 15 February 2024). Table S1 shows the baseline characteristics of these datasets. Differential expression genes (DEGs) in the TCGA-BRCA were obtained using the “esayTCGA” R package (Version 0.0.1.7000).

2.2. Assessment of CD8+ T Cells Abundance

The abundance of CD8+ T cells was evaluated using four algorithms in the “IBOR” R package (Version 0.99.9), including the Microenvironment Cell Populations-counter (MCP-counter) algorithm, xCell algorithm, quantiseq algorithm, and ssGSEA algorithm.

2.3. WGCNA Analysis

Weighted Gene Co-expression Network Analysis (WGCNA) related to CD8+ T cell abundance was conducted using the “WGCNA” R package (Version 1.72-1). We selected the top 5000 genes from the TCGA-BRCA and METABRIC cohorts to construct the network and determined the soft thresholding power. Subsequently, gene modules were identified through clustering, and the relationship between CD8+ T cell abundance and module eigengenes (MEs) was assessed. The gene modules with the most relevance were extracted for further study. Finally, overlapping genes among module genes from TCGA-BRCA, METABRIC, and DEGs were extracted to serve as CD8+ T cell-related genes (CTRGs).

2.4. Selection of the Prognostic CTRGs

Prognosis-related CTRGs were screened using the univariate Cox analysis with a screening criterion of p < 0.05. These CTRGs were further selected using six machine learning methods, including CoxBoost, random forest algorithm (RFS), extreme gradient boosting (XGBoost), the least absolute shrinkage and selection operator (LASSO), stepwise, and gradient boosting machine (GBM). Eventually, we extracted overlapping genes identified by six machine learning algorithms and further screened genes for constructing prognostic signature by multifactorial Cox.

2.5. Construction of the CD8+ T Cell-Related Prognostic Signature

To avoid underfitting of the prognostic model, we used 8 machine learning algorithms to construct the prognostic signature: CTR score. The 8 machine learning algorithms are LASSO, Ridge, SurvReg, StepCox, Survival-Svm, plsRcox GBM, and CoxBoost. To ensure the accuracy of CTR scores, we split the TCGA-BRCA randomly into a training set and a validation set according to a ratio of 6:4. We build the CTR score in the training set and evaluate the performance of the CTR score in the test set. In addition, the METABRIC, GSE96058 and GSE20685 datasets were applied as exterior validation sets to further check the performance of the CTR score. Finally, we selected the highest C-index of the 8 machine learning algorithms to construct the CTR score. The sample in the TGCA cohort was categorized into high and low CTR score groups based on the median CTR score. The impact of CTR score on the overall survival (OS) of BC patients was then assessed through survival analysis. Additionally, ROC curves for CTR score at 1, 3, and 5 years were plotted.

2.6. Construction of the Nomogram

The relationship between clinical characteristics and CTR score was analyzed using univariate and multivariate Cox regression analyses. We constructed a nomogram by combining the CTR score with clinical characteristics with independent prognostic value in a multifactorial Cox. Additionally, the ROC curve and calibration curve of the nomogram were plotted.

2.7. Functional Enrichment Analysis

Gene Ontology (GO) and Kyoto Encyclopedia of Genomes (KEGG) enrichment analyses were carried out with the ”clusterProfiler” R package (version 4.8.2). Differential biological processes between the various CTR score groups were analyzed by the “GSVA” R package (Version 1.48.0).

2.8. Immune Cell Infiltration Analysis

The CIBERSORT, EPIC, TIMER, MCPcounter, quantiseq, and ESTIMATE algorithms were used to measure immune cell infiltration in both high and low CTR score groups. In addition, the correlation of the CTR score with the immunoinhibitor, immunostimulator, major histocompatibility complex (MHC) molecule, chemokine, and chemokine receptor was analyzed. Referring to previous studies [19,20], we analyzed differences in 29 immune signatures and the cancer immune cycle between CTR score groups.

2.9. Analysis of Immune Therapy and Drug Sensitivity

Immunophenotype score (IPS) data were downloaded from The Cancer Immunome Atlas (TCIA, https://tcia.at/, accessed on 15 April 2024) and variations in IPS between CTR score groups were assessed. IPS indicates a patient’s sensitivity to the immune checkpoint suppressant. The “oncoPredict” R package (Version 0.2) was used for drug sensitivity analysis of chemotherapy-related drugs, using the half-maximal inhibition concentrations (IC50) to represent drug sensitivity.

2.10. Single-Cell and Spatial Transcriptomics Analysis

The expression of CTRGs in different cell types was analyzed using the single-cell database TISCH2 (http://tisch.comp-genomics.org/, accessed on 15 April 2024). Additionally, the spatial distribution of CTRGs was analyzed using the Sparkle database (https://grswsci.top/analyze, accessed on 8 January 2024).

2.11. Validation of Prognostic CTRGs Expression Levels

Immunohistochemistry results for CTRGs were downloaded from the Human Protein Atlas (https://www.proteinatlas.org/, accessed on 18 April 2024). Expression data for the CD163L1 protein in breast cancer were obtained from UALCAN (https://ualcan.path.uab.edu/index.html, accessed on 18 April 2024). Expression data for CTRGs in breast cancer cell lines were downloaded from the Cancer Cell Line Encyclopedia (CCLE) (https://depmap.org/portal/, accessed on 22 April 2024), and visualized using the “ggplot2” R package (Version 3.4.2).

2.12. Real-Time Fluorescent Polymerase Chain Reaction (RT-PCR)

Clinical tissue from BC patients was provided by the Shanxi Cancer Hospital. The study received permission from the Research Ethics Committee of the Shanxi Cancer Hospital (approval number: KY2023163), and all participants signed informed consent forms. Total RNA was extracted from tissues or cells using the Trizol reagent kit. Reverse transcription was performed using the Takara reagent kit. The real-time fluorescent quantitative PCR steps are as follows: configuration of the reverse transcription system; real-time PCR reaction; calculation of the target gene mRNA expression levels. Primers for two genes are listed in Supplementary Table S2.

2.13. Statistical Analysis

The Wilcoxon test for comparing the difference between two groups was used. The Kaplan–Meier analysis was used to evaluate the difference in overall survival (OS) between the low and high CTR score groups. All statistical analyses were performed using R version 4.3.0, with p < 0.05 indicating statistical significance.

3. Results

3.1. Relationship between CD8+ T Cell Infiltration Level and Prognosis

We calculated the abundance of CD8+ T cells in TCGA-BRCA and METABRIC using four algorithms: MCP-counter, xCell, quantiseq, and ssGSEA. According to the median CD8+ T cell abundance, BC samples from both cohorts were classified into high and low groups. In the TCGA-BRCA, survival curves indicated that lower CD8+ T cell abundance was associated with a poorer prognosis (Figure 1A–D). Similarly, in the METABRIC cohort, BC patients with lower CD8+ T cell abundance had a worse prognosis (Figure 1E–I).

3.2. WGCNA Analysis Based on the CD8+ T Cell Abundance

The CD8+ T cell abundance of the four algorithms mentioned above were selected for WGCNA analysis in the TCGA-BRCA and METABRIC cohorts, respectively. The soft threshold power was six for TCGA-BRCA (Figure 2A) and four for METABRIC (Figure 2B). In the TCGA-BRCA cohort, system clustering identified six modules (Figure 2C), with the green module showing the highest correlation with Activated_CD8_T_cell_ssGSEA (Cor = 0.76), T_cells_CD8_MCPcounter (Cor = 0.61), T_cells_CD8_xCell (Cor = 0.67), and T_cells_CD8_quantiseq (Cor = 0.73). In the METABRIC cohort, ten modules were identified (Figure 2D), with the turquoise module showing the highest correlation. Within the green and turquoise modules, a positive correlation was found between module membership (MM) and gene significance (GS) (Figure 2E–L).

3.3. Identification and Enrichment Analysis of CTRGs

A Venn diagram revealed 147 intersecting genes between the green module, turquoise module, and DEGs (Figure S1A, Table S3). GO and KEGG enrichment analyses were undertaken to understand the biological processes of these genes. KEGG results showed that CTRGs were mainly enriched in cytokine–cytokine receptor interaction, cell adhesion molecules, and the TNF signaling pathway (Figure S1B). GO results indicated that CTRGs were mainly enriched in leukocyte cell–cell adhesion, regulation of T cell activation, regulation of cell–cell adhesion, and leukocyte-mediated immunity, among other immune-related biological processes (Figure S1C).

3.4. Screening for Prognostic-Related CTRGs

A total of 41 CTRGs were successfully screened by the one-factor Cox analysis (Figure S1D). Subsequently, seven machine learning algorithms were employed to further screen prognostic-associated CTRGs. The LASSO, CoxBoost, stepwise, and RFS algorithms identified six, seven, eight, and 27 genes, respectively (Figure 3A–E). Additionally, the top 10 most important genes in the XGboost and GBM algorithms were identified (Figure 3F,G). We further screened for the overlapping genes in the six algorithms by multivariate Cox analysis. Figure 4A shows the two intersecting genes from the six algorithms. The results of the multifactorial Cox showed that KLRB1 and CD163L1 had independent prognostic value (Figure 4B). Compared to normal tissue, the expression of KLRB1 and CD163L1 is lower in tumor tissue (Figure 4C).

3.5. Construction of Prognostic Signature Related to CD8+ T Cells

We build the CTR score by the GBM algorithm with the highest C-index (Figure S2). CTR score and survival statuses in the TCGA-train and TCGA-validation sets are shown in Figure S3A,B. Figure 4D–G show CTR score and survival statuses in the TCGA-BRCA, METABRIC, GSE96058 and GSE20685. In the TCGA-train set, the AUC values of the CTR score predicting the overall survival (OS) of BC patients at 1, 3, and 5 years were 0.785, 0.860, and 0.796 (Figure S3C). The AUC values in the TCGA-validation set were 0.599, 0.754 and 0.727. (Figure S3D). In the TCGA-all cohort, the AUC values were 0.779, 0.842, and 0.777 (Figure 4H). In the METABRIC cohort, the AUC values were 0.638, 0.562, and 0.550 (Figure 4I). In the GSE96058 cohort, the AUC values were 0.735, 0.689, and 0.689 (Figure 4G). In the GSE20685 cohort, the AUC values of were 0.793, 0.761, and 0.752 (Figure 4K). Survival curves indicate that in the TCGA-train, TCGA-validation, TCGA-all, METABRIC, and GSE96058 sets, BC patients in the high CTR score group have a worse prognosis (Figure 4L–O; Figure S3E,F).

3.6. Relationship between CTR Score and CD8+ T Cell Abundance

We analyzed the relationship between CD8+ T cell levels, as assessed by the previous four algorithms, and the CTR score. The low CTR score group exhibited higher levels of CD8+ T cells in both TCGA-BRCA (Figure 5A–D) and METABRIC (Figure 5E–H). Correlation analyses revealed that the CTR score was negatively correlated with the levels of CD8+ T cells as determined by the four algorithms in TCGA-BRCA (Figure 5I) and METABRIC (Figure 5J) (p < 0.001). Additionally, we found that the expression levels of the two genes were positively correlated with the levels of CD8+ T cells by the four algorithms (Figure 5K–L).

3.7. Relationship between CTR Score and Clinical Characteristics

To further estimate the performance of the CTR score, we analyzed its relationship with clinical features. Results revealed that in the TCGA-BRCA cohort, the CTR score showed significant differences across various statuses, stages, T stages, and age groups (Figure 6A–D). In the METABRIC cohort, significant differences in the risk score were observed across different statuses, estrogen receptor (ER) status, and age groups (Figure 6E–G). Next, we assessed the predictive ability of the prognostic feature across different clinical subgroups. The prognosis of BC patients in different CTR score groups showed significant differences among subgroups under 65 years old, 65 years and older, stages I–II, stages III–IV, T1–T2, T3–T4, N0–N1, and N2–N3 (Figure 6H–O). In addition, we analyzed the performance of CTR score in patients with different subtypes of breast cancer in the TCGA-BRCA and GSE96058 cohorts. Notably, CTR score showed robust predictive performance across all subtypes (Figure S4A–D).

3.8. Construction of a Nomogram

Univariate and multivariate Cox analyses of CTR score and clinical characteristics were applied aiming to explore the independence of CTR scores. The results demonstrated that the CTR score served as an independent predictive indicator separate from other clinical features in the TCGA-BRCA, METABRIC, GSE96058, and GSE20685 (Figure 7A–D; Figure S5A–D). In the TCGA-BRCA cohort, a nomogram was constructed combining age and stage that could predict the 1-, 3-, and 5-year survival of BC patients (Figure 7E). The ROC curves indicated that the AUC values of the nomogram for predicting 1-, 3-, and 5-year survival were 0.856, 0.882, and 0.821 (Figure 7F). Besides, the survival probabilities predicted by the nomogram matched the actual survival probabilities substantially (Figure 7G).

3.9. Differential Biological Processes between Different CTR Score Groups

The GSVA-GO analysis revealed that the high CTR score group exhibited enrichment in metabolism-related biological processes, such as nucleoside phosphate metabolic process, thioester metabolic process, and fatty acyl CoA synthase activity. Conversely, the low CTR score group demonstrated enrichment in immune-related biological processes, including lymphocyte anergy, positive regulation of T cell receptor signaling pathway, and CD40 signaling pathway (Figure S6A). Similarly, GSVA-KEGG analysis indicated that the high CTR score group exhibited enrichment in butanoate metabolism, histidine metabolism, and phenylalanine metabolism, while the low-risk group was in the ERBB signaling pathway, leukocyte transendothelial migration, and tight junction (Figure S6B). In addition, we investigated differences in cancer-related biological processes across CTR score groups. The low CTR score group displayed higher scores in angiogenesis, apoptosis, proliferation, differentiation, and inflammation, whereas the high-risk group showed elevated scores in cell cycle, DNA repair, and glycolysis (Figure S6C–J).

3.10. Relationship between CTR Score and Immune Cell Infiltration

As depicted in Figure 8A,B, the low CTR score group exhibited elevated levels of immune cells and demonstrated higher immune, stromal, and estimate scores. Moreover, significant differences were observed in immunoinhibitor, immunostimulator, MHC, chemokine, and chemokine receptor expression between the CTR score groups (Figure 8C). Furthermore, our analysis revealed that the cancer immune cycle score and 29 immune characteristics score were notably higher in the low CTR score group (Figure 8D,E).

3.11. Prediction of Immune Therapy and Chemotherapy Sensitivity

The Immune Phenotype Score (IPS) is a predictive factor for the response to CTLA-4 and PD-1 antibodies [21]. As depicted in Figure 9A–D, the IPS was higher in the low CTR score group, suggesting a better response to immune checkpoint inhibitors. We also analyzed the sensitivity to chemotherapy-related drugs across different CTR score groups. The low CTR score group had better sensitivity to docetaxel, cisplatin, 5-fluorouracil, cyclophosphamide, epirubicin, vinorelbine, vincristine, and gemcitabine (Figure 9E–L).

3.12. Single-Cell Analysis of Prognostic Signature Genes

Figure 10A displays various cell types in the EMTAB8107 dataset. Single-cell analysis revealed that KLRB1 is mainly enriched in CD8+ T cells, while CD163L1 is expressed at lower levels across different cells (Figure 10B,C). Figure 10D shows the distribution of different cells in GSE203612-GSM6177603-NYU-BRCA2. CD163L1 is abundantly expressed in malignant cells, whereas KLRB1 shows lower expression across various cells (Figure 10D,E).

3.13. Validation of Expression Levels of Prognostic Signature Genes

Compared to paired normal tissues, the expression levels of KLRB1 and CD163L1 were reduced in cancer tissues (Figure 11A,B). Consistent with previous results, the protein expression level of CD163L1 is also downregulated in cancer tissues (Figure 11C). Figure 11D,E, show the expression levels of KLRB1 and CD163L1 in breast cancer cell lines. Immunohistochemistry (IHC) images of the prognostic feature genes in tumor tissues were downloaded from the Human Protein Atlas (HPA) database. As shown in Figure 11F,G, the staining of KLRB1 and CD163L1 proteins is lighter in tumor tissues. The expression of KLRB1 and CD163L1 mRNA is downregulated in tumor tissues compared to paracancerous tissue (Figure 11H,I).

4. Discussion

Over the past 50 years, the mortality rate for breast cancer has decreased by 58% due to advancements in early screening techniques and treatment methods [22,23]. However, despite these improvements, the incidence of breast cancer remains on the rise [24,25]. Breast cancer encompasses various biological entities, each characterized by specific genomic alterations, gene expression patterns, and influences from the TME, all of which collectively determine clinical behaviors and treatment responses [26]. The TME itself is a complex ecosystem, comprising cancer cells, various non-cancerous cells, and an altered, vascularized extracellular matrix [27]. It includes immune cells (such as T lymphocytes, B lymphocytes, natural killer cells, mast cells, and neutrophils), stromal cells, extracellular matrix, and secretory molecules [28]. The interactions among these diverse cells can either promote or inhibit the progression of tumors, influencing the developmental trajectory of the TME [29]. Given the complexity of the TME, a variety of therapeutic strategies targeting it have been developed, such as immunotherapies, anti-angiogenic drugs, and treatments that target tumor-promoting cells [30].
As a protective component of immune cells within the TME, cytotoxic CD8+ T cells are critical in the elimination of malignant cells [31]. Although these cells are present in tumor tissues, they often find themselves in an exhausted state within cancerous environments, which impairs their anti-tumor activities [32,33]. Additionally, CD8+ T cells work in concert with other cells within the TME to exert anti-tumor effects. For instance, FOLR2+ macrophages have been shown to interact with CD8+ T cells to enhance the anti-tumor immune response [34]. Recent studies have underscored the importance of tumor-resident type 1 conventional dendritic cells (cDC1) in orchestrating CD8+ T cell-mediated anti-cancer responses at the tumor-intrinsic stage [35]. In cancers such as gastric, lung, colon, and ovarian, higher levels of CD8+ T cell infiltration have correlated with improved clinical outcomes [36,37,38,39]. Moreover, studies have highlighted that tumor-resident memory CD8+ T cells with a structured residency phenotype are associated with favorable prognoses in patients with triple-negative breast cancer and play significant roles in the efficacy of checkpoint blockade therapies and sustaining protective immunity against breast cancer [40]. In this study, we also observed that breast cancer patients with higher levels of CD8+ T cell infiltration exhibited better prognoses. Although CD8+ T cells are crucial for anti-tumor immunity, their specific mechanisms of action in the TME still require further investigation.
Given the pivotal role of CD8+ T cells in the tumor microenvironment (TME), we developed the CTR score based on these cells using machine learning algorithms. We found that breast cancer patients categorized in the high CTR score group exhibited a poorer prognosis and had reduced infiltration of CD8+ T cells. The CTR score is an independent prognostic predictor for BC patients and demonstrated robust predictive performance in different datasets. In addition, we found that CTR score could well predict the prognosis of BC patients with different clinicopathological features. These results suggest that CTR score can serve as a potential biomarker for predicting the prognosis of BC patients. To enhance clinical utility, we integrated the CTR score with clinical pathological characteristics to construct a nomogram designed to predict the prognosis of breast cancer patients. Both calibration curves and ROC curves demonstrate that this nomogram provides good predictive accuracy.
The CTR score incorporates two genes: Killer Cell Lectin Like Receptor B1 (KLRB1) and CD163 Molecule Like 1 (CD163L1). Prior research has established a strong link between these genes and cancer progression. For instance, high expression of KLRB1 has been shown to promote the progression and evolution of gliomas by influencing T cell dysfunction [41]. Another study revealed that inactivation of the KLRB1 gene or antibody-mediated blockade of CD161 can enhance T cell-mediated anti-tumor functions in glioblastoma [42]. In our analysis, KLRB1 was found to be downregulated in breast cancer while being abundantly expressed in CD8+ T cells. As for CD163L1, which is a homolog co-localized with CD163, it is typically highly expressed in macrophages and associated with tissue-resident macrophages that exhibit anti-inflammatory or immunologically incompetent phenotypes, potentially playing a role in the resolution of inflammation [43,44]. In our study, we observed that CD163L1 is downregulated in breast cancer and closely linked to CD8+ T cell dynamics. Furthermore, RT-PCR validation of these genes corroborated our database analyses, thereby reinforcing the predictive accuracy of our prognostic signature.
The complex interactions between immune cells and cancer cells within the TME are considered key mechanisms regulating tumor progression [45]. In this study, we found that the levels of B cells, CD8+ T cells, and M1 macrophages were higher in the low-risk group. Previous studies have demonstrated that these immune cells play a significant role in inhibiting cancer progression [46,47,48]. Conversely, the infiltration of M0 and M2 macrophages was higher in the high-risk group. M0 and M2 macrophages have also been shown to promote tumor development [49,50,51]. Furthermore, we observed that the scores of cancer immune cycles were higher in the low CTR score group, indicating a heightened level of anti-cancer immune activity within this group. These results suggest that TME in the high CTR score group is strongly associated with immunosuppression, which may be the cause of the poorer prognosis in this group.
Recent research emphasizes the importance of a multidisciplinary approach in decision-making for breast cancer treatment, particularly in selecting systemic therapies tailored to individual patients [52]. The primary local treatment for breast cancer remains surgical intervention, supplemented by adjuvant therapies such as chemotherapy, radiotherapy, and endocrine therapy [53]. Our study also revealed that the low CTR score group demonstrated lower IC50 values for chemotherapy drugs, suggesting a more favorable response to these treatments. Furthermore, the last decade has seen significant advancements in immunotherapy, which has revitalized cancer treatment protocols. Various immunotherapeutic approaches, including immune checkpoint inhibitors, vaccination, and adoptive cell transfer, have been rigorously tested in clinical settings, especially for patients with triple-negative breast cancer (TNBC) [54]. Exhausted T cells, which are prevalent in the TME, typically overexpress various inhibitory surface molecules that hinder T cell activation, such as CTLA-4 and PD-1 [55,56]. These molecules are targeted by immune checkpoint inhibitors, which aim to disrupt regulatory pathways in T cells and enhance anti-tumor immune responses [57]. In this context, our findings indicate that the low CTR score group exhibited higher IPS, suggesting a more effective response to anti-PD-1 and anti-CTLA-4 therapies. These results further underscore the potential of using the CTR score as biomarker for predicting clinical treatment responses in breast cancer.
Certainly, there are some limitations in this study that should be further addressed. Firstly, our prognostic feature was constructed based on databases and still needs further validation in real cohorts. Moreover, the role and potential molecular mechanisms of the two prognostic feature genes in the progression of breast cancer need to be further explored in in vitro or in vivo experiments.

5. Conclusions

In conclusion, we established a CD8+ T cell-related prognostic signature in BC by machine learning, which can be used as an independent prognostic factor for BC. In addition, the prognostic signature can assess the tumor microenvironment as well as the response to immunotherapy and chemotherapy in BC patients. These results provide an important reference for clinical outcome monitoring as well as early treatment of breast cancer patients.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15081093/s1, Figure S1: Identification and enrichment analysis of CTRGs; Figure S2: Constructing the prognostic signature through machine learning algorithms; Figure S3: Construction of prognostic signatures related to CD8+ T cells; Figure S4: Relationship between CTR score and different subtypes; Figure S5: Investigate the independence of the CTR score; Figure S6: Differences in biological processes between different CTR score groups; Table S1: Comparison of baseline characteristics; Table S2: Primer sequences; Table S3: Intersection genes between green module, turquoise module and differentially expressed genes.

Author Contributions

Conceptualization, B.W. and J.Z.; Methodology, L.L. (Longpeng Li) and L.L. (Longhui Li); validation, L.L. (Longpeng Li) and L.L. (Longhui Li); Formal analysis, L.L. (Longpeng Li); Investigation, Y.C.; Resources, Y.G.; Data curation, Y.C. and Y.G.; Writing—original draft preparation, L.L. (Longpeng Li) and B.W.; Writing—review and editing, B.W. and J.Z.; Visualization, L.L. (Longpeng Li); Supervision, B.W.; Project administration, B.W.; Funding acquisition, B.W. and J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the grants from the Postgraduate Education Innovation Project in Shanxi Province [Grant number: 2022Y061], the Basic Research Program of Shanxi Province (Free Exploration Category) [Grant number: 202103021224428], and the Basic Research Program of Shanxi (Free Exploration Categories) [Grant number: 202203021221022].

Institutional Review Board Statement

The Research Ethics Committee of the Shanxi Cancer Hospital has approved this study (batch number: KY2023163). Informed consent was obtained from all subjects included in this study. It is necessary to respect the privacy of the individuals involved in the study.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets used in this study can be found in The Cancer Genome Atlas database (https://portal.gdc.cancer.gov/, accessed on 8 January 2024), cBioPortal for Cancer Genomics (https://www.cbioportal.org/, accessed on 8 February 2024), and Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/, accessed on 15 February 2024).

Acknowledgments

The authors thank TCGA, GEO, and cBioPortal for providing data.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Trapani, D.; Ginsburg, O.; Fadelu, T.; Lin, N.U.; Hassett, M.; Ilbawi, A.M.; Anderson, B.O.; Curigliano, G. Global Challenges and Policy Solutions in Breast Cancer Control. Cancer Treat. Rev. 2022, 104, 102339. [Google Scholar] [CrossRef]
  2. Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J. Clin. 2021, 71, 209–249. [Google Scholar] [CrossRef]
  3. Goel, S.; Chandarlapaty, S. Emerging Therapies for Breast Cancer. Cold Spring Harb. Perspect. Med. 2023, 13, a041333. [Google Scholar] [CrossRef] [PubMed]
  4. Loibl, S.; André, F.; Bachelot, T.; Barrios, C.H.; Bergh, J.; Burstein, H.J.; Cardoso, M.J.; Carey, L.A.; Dawood, S.; Del Mastro, L.; et al. Early Breast Cancer: ESMO Clinical Practice Guideline for Diagnosis, Treatment and Follow-Up. Ann. Oncol. 2024, 35, 159–182. [Google Scholar] [CrossRef]
  5. Siegel, R.L.; Giaquinto, A.N.; Jemal, A. Cancer Statistics, 2024. CA Cancer J. Clin. 2024, 74, 12–49. [Google Scholar] [CrossRef]
  6. Cani, A.K. Breast Cancer Circulating Tumor Cells: Current Clinical Applications and Future Prospects. Clin. Chem. 2024, 70, 68–80. [Google Scholar] [CrossRef]
  7. Khadka, V.S.; Nasu, M.; Deng, Y.; Jijiwa, M. Circulating microRNA Biomarker for Detecting Breast Cancer in High-Risk Benign Breast Tumors. Int. J. Mol. Sci. 2023, 24, 7553. [Google Scholar] [CrossRef]
  8. Lee, Y.; Ni, J.; Beretov, J.; Wasinger, V.C.; Graham, P.; Li, Y. Recent Advances of Small Extracellular Vesicle Biomarkers in Breast Cancer Diagnosis and Prognosis. Mol. Cancer 2023, 22, 33. [Google Scholar] [CrossRef] [PubMed]
  9. Gao, L.; Medford, A.; Spring, L.; Bar, Y.; Hu, B.; Jimenez, R.; Isakoff, S.J.; Bardia, A.; Peppercorn, J. Searching for the “Holy Grail” of Breast Cancer Recurrence Risk: A Narrative Review of the Hunt for a Better Biomarker and the Promise of Circulating Tumor DNA (ctDNA). Breast Cancer Res. Treat. 2024, 205, 211–226. [Google Scholar] [CrossRef]
  10. Quail, D.F.; Joyce, J.A. Microenvironmental Regulation of Tumor Progression and Metastasis. Nat. Med. 2013, 19, 1423–1437. [Google Scholar] [CrossRef]
  11. Pitt, J.M.; Marabelle, A.; Eggermont, A.; Soria, J.-C.; Kroemer, G.; Zitvogel, L. Targeting the Tumor Microenvironment: Removing Obstruction to Anticancer Immune Responses and Immunotherapy. Ann. Oncol. 2016, 27, 1482–1492. [Google Scholar] [CrossRef] [PubMed]
  12. Babar, Q.; Saeed, A.; Tabish, T.A.; Sarwar, M.; Thorat, N.D. Targeting the Tumor Microenvironment: Potential Strategy for Cancer Therapeutics. Biochim. Biophys. Acta (BBA) Mol. Basis Dis. 2023, 1869, 166746. [Google Scholar] [CrossRef]
  13. Goenka, A.; Khan, F.; Verma, B.; Sinha, P.; Dmello, C.C.; Jogalekar, M.P.; Gangadaran, P.; Ahn, B. Tumor Microenvironment Signaling and Therapeutics in Cancer Progression. Cancer Commun. 2023, 43, 525–561. [Google Scholar] [CrossRef]
  14. Novellino, L.; Castelli, C.; Parmiani, G. A Listing of Human Tumor Antigens Recognized by T Cells: March 2004 Update. Cancer Immunol. Immunother. 2005, 54, 187–207. [Google Scholar] [CrossRef]
  15. Schumacher, T.N.; Scheper, W.; Kvistborg, P. Cancer Neoantigens. Annu. Rev. Immunol. 2019, 37, 173–200. [Google Scholar] [CrossRef]
  16. Tumeh, P.C.; Harview, C.L.; Yearley, J.H.; Shintaku, I.P.; Taylor, E.J.M.; Robert, L.; Chmielowski, B.; Spasic, M.; Henry, G.; Ciobanu, V.; et al. PD-1 Blockade Induces Responses by Inhibiting Adaptive Immune Resistance. Nature 2014, 515, 568–571. [Google Scholar] [CrossRef]
  17. Reiser, J.; Banerjee, A. Effector, Memory, and Dysfunctional CD8+ T Cell Fates in the Antitumor Immune Response. J. Immunol. Res. 2016, 2016, 8941260. [Google Scholar] [CrossRef]
  18. Wang, M.; Yin, B.; Wang, H.Y.; Wang, R.-F. Current Advances in T-Cell-Based Cancer Immunotherapy. Immunotherapy 2014, 6, 1265–1278. [Google Scholar] [CrossRef] [PubMed]
  19. Xu, L.; Deng, C.; Pang, B.; Zhang, X.; Liu, W.; Liao, G.; Yuan, H.; Cheng, P.; Li, F.; Long, Z.; et al. TIP: A Web Server for Resolving Tumor Immunophenotype Profiling. Cancer Res. 2018, 78, 6575–6580. [Google Scholar] [CrossRef]
  20. He, Y.; Jiang, Z.; Chen, C.; Wang, X. Classification of Triple-Negative Breast Cancers Based on Immunogenomic Profiling. J. Exp. Clin. Cancer Res. 2018, 37, 327. [Google Scholar] [CrossRef]
  21. Charoentong, P.; Finotello, F.; Angelova, M.; Mayer, C.; Efremova, M.; Rieder, D.; Hackl, H.; Trajanoski, Z. Pan-Cancer Immunogenomic Analyses Reveal Genotype-Immunophenotype Relationships and Predictors of Response to Checkpoint Blockade. Cell Rep. 2017, 18, 248–262. [Google Scholar] [CrossRef]
  22. Caswell-Jin, J.L.; Sun, L.P.; Munoz, D.; Lu, Y.; Li, Y.; Huang, H.; Hampton, J.M.; Song, J.; Jayasekera, J.; Schechter, C.; et al. Analysis of Breast Cancer Mortality in the US—1975 to 2019. JAMA 2024, 331, 233. [Google Scholar] [CrossRef] [PubMed]
  23. Wheeler, S.B.; Rocque, G.; Basch, E. Benefits of Breast Cancer Screening and Treatment on Mortality. JAMA 2024, 331, 199. [Google Scholar] [CrossRef]
  24. Siegel, R.L.; Miller, K.D.; Jemal, A. Cancer Statistics, 2020. CA Cancer J. Clin. 2020, 70, 7–30. [Google Scholar] [CrossRef]
  25. Siegel, R.L.; Miller, K.D.; Wagle, N.S.; Jemal, A. Cancer Statistics, 2023. CA Cancer J. Clin. 2023, 73, 17–48. [Google Scholar] [CrossRef]
  26. Nolan, E.; Lindeman, G.J.; Visvader, J.E. Deciphering Breast Cancer: From Biology to the Clinic. Cell 2023, 186, 1708–1728. [Google Scholar] [CrossRef]
  27. De Visser, K.E.; Joyce, J.A. The Evolving Tumor Microenvironment: From Cancer Initiation to Metastatic Outgrowth. Cancer Cell 2023, 41, 374–403. [Google Scholar] [CrossRef] [PubMed]
  28. Junttila, M.R.; De Sauvage, F.J. Influence of Tumour Micro-Environment Heterogeneity on Therapeutic Response. Nature 2013, 501, 346–354. [Google Scholar] [CrossRef]
  29. Dominiak, A.; Chełstowska, B.; Olejarz, W.; Nowicka, G. Communication in the Cancer Microenvironment as a Target for Therapeutic Interventions. Cancers 2020, 12, 1232. [Google Scholar] [CrossRef]
  30. Bejarano, L.; Jordāo, M.J.C.; Joyce, J.A. Therapeutic Targeting of the Tumor Microenvironment. Cancer Discov. 2021, 11, 933–959. [Google Scholar] [CrossRef]
  31. Reina-Campos, M.; Scharping, N.E.; Goldrath, A.W. CD8+ T Cell Metabolism in Infection and Cancer. Nat. Rev. Immunol. 2021, 21, 718–738. [Google Scholar] [CrossRef] [PubMed]
  32. Dolina, J.S.; Van Braeckel-Budimir, N.; Thomas, G.D.; Salek-Ardakani, S. CD8+ T Cell Exhaustion in Cancer. Front. Immunol. 2021, 12, 715234. [Google Scholar] [CrossRef] [PubMed]
  33. Philip, M.; Schietinger, A. CD8+ T Cell Differentiation and Dysfunction in Cancer. Nat. Rev. Immunol. 2022, 22, 209–223. [Google Scholar] [CrossRef] [PubMed]
  34. Nalio Ramos, R.; Missolo-Koussou, Y.; Gerber-Ferder, Y.; Bromley, C.P.; Bugatti, M.; Núñez, N.G.; Tosello Boari, J.; Richer, W.; Menger, L.; Denizeau, J.; et al. Tissue-Resident FOLR2+ Macrophages Associate with CD8+ T Cell Infiltration in Human Breast Cancer. Cell 2022, 185, 1189–1207.e25. [Google Scholar] [CrossRef] [PubMed]
  35. Meiser, P.; Knolle, M.A.; Hirschberger, A.; De Almeida, G.P.; Bayerl, F.; Lacher, S.; Pedde, A.-M.; Flommersfeld, S.; Hönninger, J.; Stark, L.; et al. A Distinct Stimulatory cDC1 Subpopulation Amplifies CD8+ T Cell Responses in Tumors for Protective Anti-Cancer Immunity. Cancer Cell 2023, 41, 1498–1515.e10. [Google Scholar] [CrossRef] [PubMed]
  36. Liao, K.; Yang, Q.; Xu, Y.; He, Y.; Wang, J.; Li, Z.; Wu, C.; Hu, J.; Wang, X. Identification of Signature of Tumor-Infiltrating CD8 T Lymphocytes in Prognosis and Immunotherapy of Colon Cancer by Machine Learning. Clin. Immunol. 2023, 257, 109811. [Google Scholar] [CrossRef] [PubMed]
  37. Zhu, Y.; Liang, L.; Li, J.; Zeng, J.; Yao, H.; Wu, L. Establishing Molecular Subgroups of CD8+ T Cell-Associated Genes in the Ovarian Cancer Tumour Microenvironment and Predicting the Immunotherapy Response. Biomedicines 2023, 11, 2399. [Google Scholar] [CrossRef]
  38. Li, J.; Han, T.; Wang, X.; Wang, Y.; Yang, R.; Yang, Q. Development of a CD8+ T Cell Associated Signature for Predicting the Prognosis and Immunological Characteristics of Gastric Cancer by Integrating Single-Cell and Bulk RNA-Sequencing. Sci. Rep. 2024, 14, 4524. [Google Scholar] [CrossRef] [PubMed]
  39. Chen, L.; Weng, Y.; Cui, X.; Li, Q.; Peng, M.; Song, Q. Comprehensive Analyses of a CD8+ T Cell Infiltration Related Gene Signature with Regard to the Prediction of Prognosis and Immunotherapy Response in Lung Squamous Cell Carcinoma. BMC Bioinform. 2023, 24, 238. [Google Scholar] [CrossRef]
  40. Virassamy, B.; Caramia, F.; Savas, P.; Sant, S.; Wang, J.; Christo, S.N.; Byrne, A.; Clarke, K.; Brown, E.; Teo, Z.L.; et al. Intratumoral CD8+ T Cells with a Tissue-Resident Memory Phenotype Mediate Local Immunity and Immune Checkpoint Responses in Breast Cancer. Cancer Cell 2023, 41, 585–601.e8. [Google Scholar] [CrossRef] [PubMed]
  41. Di, W.; Fan, W.; Wu, F.; Shi, Z.; Wang, Z.; Yu, M.; Zhai, Y.; Chang, Y.; Pan, C.; Li, G.; et al. Clinical Characterization and Immunosuppressive Regulation of CD161 (KLRB1) in Glioma through 916 Samples. Cancer Sci. 2022, 113, 756–769. [Google Scholar] [CrossRef]
  42. Mathewson, N.D.; Ashenberg, O.; Tirosh, I.; Gritsch, S.; Perez, E.M.; Marx, S.; Jerby-Arnon, L.; Chanoch-Myers, R.; Hara, T.; Richman, A.R.; et al. Inhibitory CD161 Receptor Identified in Glioma-Infiltrating T Cells by Single-Cell Analysis. Cell 2021, 184, 1281–1298.e26. [Google Scholar] [CrossRef]
  43. Moeller, J.B.; Nielsen, M.J.; Reichhardt, M.P.; Schlosser, A.; Sorensen, G.L.; Nielsen, O.; Tornøe, I.; Grønlund, J.; Nielsen, M.E.; Jørgensen, J.S.; et al. CD163-L1 Is an Endocytic Macrophage Protein Strongly Regulated by Mediators in the Inflammatory Response. J. Immunol. 2012, 188, 2399–2409. [Google Scholar] [CrossRef]
  44. González-Domínguez, É.; Samaniego, R.; Flores-Sevilla, J.L.; Campos-Campos, S.F.; Gómez-Campos, G.; Salas, A.; Campos-Peña, V.; Corbí, Á.L.; Sánchez-Mateos, P.; Sánchez-Torres, C. CD163L1 and CLEC5A Discriminate Subsets of Human Resident and Inflammatory Macrophages In Vivo. J. Leukoc. Biol. 2015, 98, 453–466. [Google Scholar] [CrossRef]
  45. Zhou, Y.; Wang, H.; Luo, Y.; Tuo, B.; Liu, X.; Li, T. Effect of Metabolism on the Immune Microenvironment of Breast Cancer. Biochim. Biophys. Acta (BBA) Rev. Cancer 2023, 1878, 188861. [Google Scholar] [CrossRef]
  46. Tokunaga, R.; Naseem, M.; Lo, J.H.; Battaglin, F.; Soni, S.; Puccini, A.; Berger, M.D.; Zhang, W.; Baba, H.; Lenz, H.-J. B Cell and B Cell-Related Pathways for Novel Cancer Treatments. Cancer Treat. Rev. 2019, 73, 10–19. [Google Scholar] [CrossRef] [PubMed]
  47. Michaud, D.; Steward, C.R.; Mirlekar, B.; Pylayeva-Gupta, Y. Regulatory B Cells in Cancer. Immunol. Rev. 2021, 299, 74–92. [Google Scholar] [CrossRef] [PubMed]
  48. Gunassekaran, G.R.; Poongkavithai Vadevoo, S.M.; Baek, M.-C.; Lee, B. M1 Macrophage Exosomes Engineered to Foster M1 Polarization and Target the IL-4 Receptor Inhibit Tumor Growth by Reprogramming Tumor-Associated Macrophages into M1-like Macrophages. Biomaterials 2021, 278, 121137. [Google Scholar] [CrossRef]
  49. Wang, Y.; Lyu, Z.; Qin, Y.; Wang, X.; Sun, L.; Zhang, Y.; Gong, L.; Wu, S.; Han, S.; Tang, Y.; et al. FOXO1 Promotes Tumor Progression by Increased M2 Macrophage Infiltration in Esophageal Squamous Cell Carcinoma. Theranostics 2020, 10, 11535–11548. [Google Scholar] [CrossRef] [PubMed]
  50. Chen, H.; Ma, R.; Zhou, B.; Yang, X.; Duan, F.; Wang, G. Integrated Immunological Analysis of Single-Cell and Bulky Tissue Transcriptomes Reveals the Role of Interactions between M0 Macrophages and Naïve CD4+ T Cells in the Immunosuppressive Microenvironment of Cervical Cancer. Comput. Biol. Med. 2023, 163, 107151. [Google Scholar] [CrossRef]
  51. Lu, Y.; Han, G.; Zhang, Y.; Zhang, L.; Li, Z.; Wang, Q.; Chen, Z.; Wang, X.; Wu, J. M2 Macrophage-Secreted Exosomes Promote Metastasis and Increase Vascular Permeability in Hepatocellular Carcinoma. Cell Commun. Signal 2023, 21, 299. [Google Scholar] [CrossRef] [PubMed]
  52. Curigliano, G.; Burstein, H.J.; Gnant, M.; Loibl, S.; Cameron, D.; Regan, M.M.; Denkert, C.; Poortmans, P.; Weber, W.P.; Thürlimann, B.; et al. Understanding Breast Cancer Complexity to Improve Patient Outcomes: The St Gallen International Consensus Conference for the Primary Therapy of Individuals with Early Breast Cancer 2023. Ann. Oncol. 2023, 34, 970–986. [Google Scholar] [CrossRef]
  53. McDonald, E.S.; Clark, A.S.; Tchou, J.; Zhang, P.; Freedman, G.M. Clinical Diagnosis and Management of Breast Cancer. J. Nucl. Med. 2016, 57, 9S–16S. [Google Scholar] [CrossRef] [PubMed]
  54. Ye, F.; Dewanjee, S.; Li, Y.; Jha, N.K.; Chen, Z.-S.; Kumar, A.; Vishakha; Behl, T.; Jha, S.K.; Tang, H. Advancements in Clinical Aspects of Targeted Therapy and Immunotherapy in Breast Cancer. Mol. Cancer 2023, 22, 105. [Google Scholar] [CrossRef]
  55. Wherry, E.J. T Cell Exhaustion. Nat. Immunol. 2011, 12, 492–499. [Google Scholar] [CrossRef]
  56. Javid, H.; Attarian, F.; Saadatmand, T.; Rezagholinejad, N.; Mehri, A.; Amiri, H.; Karimi-Shahri, M. The Therapeutic Potential of Immunotherapy in the Treatment of Breast Cancer: Rational Strategies and Recent Progress. J. Cell. Biochem. 2023, 124, 477–494. [Google Scholar] [CrossRef] [PubMed]
  57. Rui, R.; Zhou, L.; He, S. Cancer Immunotherapies: Advances and Bottlenecks. Front. Immunol. 2023, 14, 1212476. [Google Scholar] [CrossRef]
Figure 1. Survival analysis based on CD8+ T cell abundance. Survival curves for different CD8+ T cell abundances in the TCGA-BRCA and METABRIC cohorts. (A,E) The xCell algorithm. (B,F) The quantiseq algorithm. (C,G) The ssGSEA algorithm. (D,H) MCPcounter algorithm.
Figure 1. Survival analysis based on CD8+ T cell abundance. Survival curves for different CD8+ T cell abundances in the TCGA-BRCA and METABRIC cohorts. (A,E) The xCell algorithm. (B,F) The quantiseq algorithm. (C,G) The ssGSEA algorithm. (D,H) MCPcounter algorithm.
Genes 15 01093 g001
Figure 2. WGCNA analysis based on CD8+ T cell abundance. (A,B) Evaluation of the soft threshold in the TCGA-BRCA and METABRIC. (C,D) Correlation between each gene co-expression module and CD8+ T cell abundance in both datasets. Scatter plots of the green module MM and GS in the TCGA-BRCA cohort, including Activated_CD8_T_cell_ssGSEA (E), T_cells_CD8_MCPcounter (F), T_cells_CD8_xCell (G), and T_cells_CD8_quantiseq (H). Scatter plots of the turquoise module MM and GS in the METABRIC cohort, including Activated_CD8_T_cell_ssGSEA (I), T_cells_CD8_MCPcounter (J), T_cells_CD8_xCell (K), and T_cells_CD8_quantiseq (L).
Figure 2. WGCNA analysis based on CD8+ T cell abundance. (A,B) Evaluation of the soft threshold in the TCGA-BRCA and METABRIC. (C,D) Correlation between each gene co-expression module and CD8+ T cell abundance in both datasets. Scatter plots of the green module MM and GS in the TCGA-BRCA cohort, including Activated_CD8_T_cell_ssGSEA (E), T_cells_CD8_MCPcounter (F), T_cells_CD8_xCell (G), and T_cells_CD8_quantiseq (H). Scatter plots of the turquoise module MM and GS in the METABRIC cohort, including Activated_CD8_T_cell_ssGSEA (I), T_cells_CD8_MCPcounter (J), T_cells_CD8_xCell (K), and T_cells_CD8_quantiseq (L).
Genes 15 01093 g002
Figure 3. Screening of prognostic CTRGs through machine learning algorithm. (A) Plot of ten-fold cross-validations. (B) Plot of the LASSO coefficient. Screening of prognostic CTRGs using the Random Forest algorithm (C), and CoxBoost algorithm (D). (E) The top 10 most vital genes selected by the XGboost algorithm. (F) The top 10 most vital genes selected by the GBM algorithm.
Figure 3. Screening of prognostic CTRGs through machine learning algorithm. (A) Plot of ten-fold cross-validations. (B) Plot of the LASSO coefficient. Screening of prognostic CTRGs using the Random Forest algorithm (C), and CoxBoost algorithm (D). (E) The top 10 most vital genes selected by the XGboost algorithm. (F) The top 10 most vital genes selected by the GBM algorithm.
Genes 15 01093 g003
Figure 4. Construction of prognostic signature related to CD8+ T cells. (A) Venn diagram of six algorithms. (B) Identifying CTRGs with independent prognostic value. (C) Differential expression of two prognostic signature genes in normal and tumor tissues (**** p < 0.0001). Scatter plots of CTR score and survival status in the TCGA-BRCA (D), METABRIC (E), GSE96058 (F), and GSE20685 (G) datasets. (HK) ROC curves of CTR score in the TCGA-BRCA, METABRIC, GSE96058, and GSE20685 datasets. Survival curves in the TCGA-BRCA (L), METABRIC (M), GSE96058 (N), and GSE20685 (O) datasets.
Figure 4. Construction of prognostic signature related to CD8+ T cells. (A) Venn diagram of six algorithms. (B) Identifying CTRGs with independent prognostic value. (C) Differential expression of two prognostic signature genes in normal and tumor tissues (**** p < 0.0001). Scatter plots of CTR score and survival status in the TCGA-BRCA (D), METABRIC (E), GSE96058 (F), and GSE20685 (G) datasets. (HK) ROC curves of CTR score in the TCGA-BRCA, METABRIC, GSE96058, and GSE20685 datasets. Survival curves in the TCGA-BRCA (L), METABRIC (M), GSE96058 (N), and GSE20685 (O) datasets.
Genes 15 01093 g004
Figure 5. Association of CTR score with CD8+ T cell abundance. Comparison of CD8+ T cell abundance estimated by four algorithms in low and high CTR score groups in TCGA-BRCA (AD) and METABRIC (EH) (**** p < 0.0001). Assessment of the association between CTR score and CD8+ T cell abundance estimated by four algorithms in TCGA-BRCA (I) and METABRIC (J) (*** p < 0.001). Correlation analysis between two prognostic signature genes and CD8+ T cell abundance, estimated by four algorithms in TCGA-BRCA (K) and METABRIC (L).
Figure 5. Association of CTR score with CD8+ T cell abundance. Comparison of CD8+ T cell abundance estimated by four algorithms in low and high CTR score groups in TCGA-BRCA (AD) and METABRIC (EH) (**** p < 0.0001). Assessment of the association between CTR score and CD8+ T cell abundance estimated by four algorithms in TCGA-BRCA (I) and METABRIC (J) (*** p < 0.001). Correlation analysis between two prognostic signature genes and CD8+ T cell abundance, estimated by four algorithms in TCGA-BRCA (K) and METABRIC (L).
Genes 15 01093 g005
Figure 6. Relationship between CTR score and clinical characteristics. Differences in CTR score in the TCGA-BRCA cohort by status (A), stage (B), T stage (C), and age (D) (* p-value < 0.05; ** p-value < 0.01; *** p-value < 0.001; **** p-value < 0.0001). Differences in CTR score in the METABRIC cohort by status (E), ER (F), and age (G). Survival curves for different risk groups in the TCGA-BRCA cohort for ages < 65 (H), ages ≥ 65 (I), stages I–II (J), stages III–VI (K), T1–T2 (L), T3–T4 (M), N0–N1 (N), and N2–N3 (O).
Figure 6. Relationship between CTR score and clinical characteristics. Differences in CTR score in the TCGA-BRCA cohort by status (A), stage (B), T stage (C), and age (D) (* p-value < 0.05; ** p-value < 0.01; *** p-value < 0.001; **** p-value < 0.0001). Differences in CTR score in the METABRIC cohort by status (E), ER (F), and age (G). Survival curves for different risk groups in the TCGA-BRCA cohort for ages < 65 (H), ages ≥ 65 (I), stages I–II (J), stages III–VI (K), T1–T2 (L), T3–T4 (M), N0–N1 (N), and N2–N3 (O).
Genes 15 01093 g006
Figure 7. Construction of a nomogram. Assessment of the independence of the CTR score in the TCGA-BRCA (A,B) and METABRIC (C,D) cohorts (* p-value < 0.05; ** p-value < 0.01; *** p-value < 0.001). (E) Nomogram in the TCGA-BRCA. (F) ROC curve of the nomogram. (G) Calibration curve of the nomogram.
Figure 7. Construction of a nomogram. Assessment of the independence of the CTR score in the TCGA-BRCA (A,B) and METABRIC (C,D) cohorts (* p-value < 0.05; ** p-value < 0.01; *** p-value < 0.001). (E) Nomogram in the TCGA-BRCA. (F) ROC curve of the nomogram. (G) Calibration curve of the nomogram.
Genes 15 01093 g007
Figure 8. Relationship between CTR score and immune cell infiltration. (A) Differences in immune infiltration levels between different CTR score groups assessed by five algorithms. (B) Differences in tumor microenvironment scores (immune, stromal, and estimate score) between CTR score groups (**** p < 0.0001). (CE) Association of CTR score with immune checkpoints, cancer immune cycles and 29 immune characteristics (* p < 0.05, ** p < 0.01, *** p < 0.001, **** p < 0.0001).
Figure 8. Relationship between CTR score and immune cell infiltration. (A) Differences in immune infiltration levels between different CTR score groups assessed by five algorithms. (B) Differences in tumor microenvironment scores (immune, stromal, and estimate score) between CTR score groups (**** p < 0.0001). (CE) Association of CTR score with immune checkpoints, cancer immune cycles and 29 immune characteristics (* p < 0.05, ** p < 0.01, *** p < 0.001, **** p < 0.0001).
Genes 15 01093 g008
Figure 9. Differences in sensitivity to immunotherapy and chemotherapeutic agents in various CTR score groups. (AD) Differences in Immune Phenotype Scores (IPS) between different groups (*** p < 0.001). (EL) Differences in chemotherapy sensitivity to docetaxel, cisplatin, 5-fluorouracil, cyclophosphamide, epirubicin, vinorelbine, vincristine, and gemcitabine between different groups (* p < 0.05, **** p < 0.0001).
Figure 9. Differences in sensitivity to immunotherapy and chemotherapeutic agents in various CTR score groups. (AD) Differences in Immune Phenotype Scores (IPS) between different groups (*** p < 0.001). (EL) Differences in chemotherapy sensitivity to docetaxel, cisplatin, 5-fluorouracil, cyclophosphamide, epirubicin, vinorelbine, vincristine, and gemcitabine between different groups (* p < 0.05, **** p < 0.0001).
Genes 15 01093 g009
Figure 10. Single-cell and spatial transcriptomic analysis of prognostic feature genes. (A) Different cell types in the EMTAB8107 dataset. Expression levels of KLRB1 (B) and CD163L1 (C) in different cells. (D) Distribution of different cells in GSE203612-GSM6177603-NYU-BRCA2. (E,F) Distribution of CD163L1 and KLRB1 in different cells.
Figure 10. Single-cell and spatial transcriptomic analysis of prognostic feature genes. (A) Different cell types in the EMTAB8107 dataset. Expression levels of KLRB1 (B) and CD163L1 (C) in different cells. (D) Distribution of different cells in GSE203612-GSM6177603-NYU-BRCA2. (E,F) Distribution of CD163L1 and KLRB1 in different cells.
Genes 15 01093 g010
Figure 11. Validation of expression levels of prognostic-related CTRGs. (A,B) Expression levels of KLRB1 and CD163L1 in cancerous tissues and paired adjacent non-cancerous tissues (**** p < 0.0001). (C) Expression of CD163L1 protein in the UALCAN database (**** p < 0.0001). (D) Expression levels of CD163L1 in normal epithelial cell line (HMEL) and breast cancer cell lines (CAL51, T47D, and HMC18) in the CCLE database. (E) Expression level of KLRB1 in breast cancer cell lines. (F,G) IHC staining images of KLRB1 and CD163L1 proteins in cancerous tissues. (H,I) Experimental validation of the prognostic CTRGs (* p < 0.05, *** p < 0.001).
Figure 11. Validation of expression levels of prognostic-related CTRGs. (A,B) Expression levels of KLRB1 and CD163L1 in cancerous tissues and paired adjacent non-cancerous tissues (**** p < 0.0001). (C) Expression of CD163L1 protein in the UALCAN database (**** p < 0.0001). (D) Expression levels of CD163L1 in normal epithelial cell line (HMEL) and breast cancer cell lines (CAL51, T47D, and HMC18) in the CCLE database. (E) Expression level of KLRB1 in breast cancer cell lines. (F,G) IHC staining images of KLRB1 and CD163L1 proteins in cancerous tissues. (H,I) Experimental validation of the prognostic CTRGs (* p < 0.05, *** p < 0.001).
Genes 15 01093 g011
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, B.; Li, L.; Li, L.; Chen, Y.; Guan, Y.; Zhao, J. Integration of Bioinformatics and Machine Learning to Identify CD8+ T Cell-Related Prognostic Signature to Predict Clinical Outcomes and Treatment Response in Breast Cancer Patients. Genes 2024, 15, 1093. https://doi.org/10.3390/genes15081093

AMA Style

Wu B, Li L, Li L, Chen Y, Guan Y, Zhao J. Integration of Bioinformatics and Machine Learning to Identify CD8+ T Cell-Related Prognostic Signature to Predict Clinical Outcomes and Treatment Response in Breast Cancer Patients. Genes. 2024; 15(8):1093. https://doi.org/10.3390/genes15081093

Chicago/Turabian Style

Wu, Baoai, Longpeng Li, Longhui Li, Yinghua Chen, Yue Guan, and Jinfeng Zhao. 2024. "Integration of Bioinformatics and Machine Learning to Identify CD8+ T Cell-Related Prognostic Signature to Predict Clinical Outcomes and Treatment Response in Breast Cancer Patients" Genes 15, no. 8: 1093. https://doi.org/10.3390/genes15081093

APA Style

Wu, B., Li, L., Li, L., Chen, Y., Guan, Y., & Zhao, J. (2024). Integration of Bioinformatics and Machine Learning to Identify CD8+ T Cell-Related Prognostic Signature to Predict Clinical Outcomes and Treatment Response in Breast Cancer Patients. Genes, 15(8), 1093. https://doi.org/10.3390/genes15081093

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop