Next Article in Journal
Long Noncoding RNAs and Circular RNAs Regulate AKT and Its Effectors to Control Cell Functions of Cancer Cells
Next Article in Special Issue
The Proliferating Cell Nuclear Antigen (PCNA) Transcript Variants as Potential Relapse Markers in B-Cell Acute Lymphoblastic Leukemia
Previous Article in Journal
Functional Repercussions of Hypoxia-Inducible Factor-2α in Idiopathic Pulmonary Fibrosis
Previous Article in Special Issue
Nicotinamide Adenine Dinucleotide (NAD) Metabolism as a Relevant Target in Cancer
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Locus-Specific Enrichment Analysis of 5-Hydroxymethylcytosine Reveals Novel Genes Associated with Breast Carcinogenesis

by
Deepa Ramasamy
1,
Arunagiri Kuha Deva Magendhra Rao
1,
Meenakumari Balaiah
1,
Arvinden Vittal Rangan
1,
Shirley Sundersingh
2,
Sridevi Veluswami
3,
Rajkumar Thangarajan
1 and
Samson Mani
1,*
1
Department of Molecular Oncology, Cancer Institute (WIA), 38, Sardar Patel Road, Chennai 600036, Tamilnadu, India
2
Department of Oncopathology, Cancer Institute (WIA), 38, Sardar Patel Road, Chennai 600036, Tamilnadu, India
3
Department of Surgical Oncology, Cancer Institute (WIA), 38, Sardar Patel Road, Chennai 600036, Tamilnadu, India
*
Author to whom correspondence should be addressed.
Cells 2022, 11(19), 2939; https://doi.org/10.3390/cells11192939
Submission received: 3 August 2022 / Revised: 16 August 2022 / Accepted: 17 August 2022 / Published: 20 September 2022
(This article belongs to the Special Issue Cancers: Genetics and Cellular Perspective)

Abstract

:

Highlights

  • The conversion of 5-methylcytosine to 5-hydroxymethylcytosine relaxes the repressive marks of gene expression, and their imbalance leads to cancer development.
  • Global loss of 5-hmC is found to be associated with the downregulation of TET1 and TET3 genes in breast cancer.
  • Genome-wide analysis revealed 4809 differentially methylated regions and 4841 differentially hydroxymethylated regions in breast cancer.
  • The abundance of 5-mC was observed at gene promoter regions while the 5-hmC was profusely distributed at the distal regulatory regions of the breast cancer genome. The accumulation of 5-hmC at distal regulatory sites can potentially enhance gene transcription.
  • Alteration in the 5-hmC levels was positively associated with the respective gene expression. Novel 5-hmC candidates such as TXNL1, CNIH3, BNIPL, and CHODL were found to be promising diagnostic and therapeutic markers for breast cancer.

Abstract

An imbalance in DNA methylation is a hallmark epigenetic alteration in cancer. The conversion of 5-methylcytosine (5-mC) to 5-hydroxymethyl cytosine (5-hmC), which causes the imbalance, results in aberrant gene expression. The precise functional role of 5-hydroxymethylcytosine in breast cancer remains elusive. In this study, we describe the landscape of 5-mC and 5-hmC and their association with breast cancer development. We found a distinguishable global loss of 5-hmC in the localized and invasive types of breast cancer that strongly correlate with TET expression. Genome-wide analysis revealed a unique 5-mC and 5-hmC signature in breast cancer. The differentially methylated regions (DMRs) were primarily concentrated in the proximal regulatory regions such as the promoters and UTRs, while the differentially hydroxymethylated regions (DhMRs) were densely packed in the distal regulatory regions, such as the intergenic regions (>−5 kb from TSSs). Our results indicate 4809 DMRs and 4841 DhMRs associated with breast cancer. Validation of nine 5-hmC enriched loci in a distinct set of breast cancer and normal samples positively correlated with their corresponding gene expression. The novel 5-hmC candidates such as TXNL1, and CNIH3 implicate a pro-oncogenic role in breast cancer. Overall, these results provide new insights into the loci-specific accumulation of 5-mC and 5-hmC, which are aberrantly methylated and demethylated in breast cancer.

1. Introduction

DNA methylation imbalance is one of the hallmark epigenetic events in cancer. Cytosine DNA methylation (5-methylcytosine or 5-mC) occurs at the gene promoter and is often associated with gene repression, while its oxidized form, 5-hydroxymethylcytosine (5-hmC), relaxes the repression [1,2]. The oxidation of 5-mC to 5-hmC is catalyzed by the TET family of genes TET1, TET2, and TET3 [3,4,5,6,7]. Based on tissue specificity, 5-hmC levels can vary between 0.1% and 1% of the human genome [8]. The increase in 5-hmC is strongly associated with transcriptional activation [9]. Effective binding of methylation readers such as MBD3 and MeCP2 preferentially to 5-hmC results in active transcriptional assembly and activity [10,11]. It turned out that 5-hmC is the stable epigenetic modification involved in the transcription machinery, and does not just serve as an intermediate in the demethylation process. The imbalance between 5-mC and 5-hmC is of recent interest as both are associated with gene expression and lead to carcinogenesis.
A global reduction of 5-hmC is evident in several cancers [8,12,13,14,15]. Studies on melanoma, pancreatic cancer, lung cancer, and prostate cancer suggest that aberrant 5-mC and 5-hmC levels may predispose to tumor progression [16,17,18,19,20,21,22,23]. However, there is limited evidence for 5-mC and 5-hmC dynamics in breast cancer. We hypothesize an imbalance among the genomic 5-mC and 5-hmC levels that contribute to breast carcinogenesis. Previous reports affirm that 5-hmC levels depend on tissue-specific TET expression. Particularly, downregulation of the genes TET1 and TET2 have been reported to alter the 5-hmC levels [24]. A recent study on breast cancer showed the altered 5-hmC profiles and their association with lymph node metastases [25]. In breast cancer, the locus-specific deposition of 5-hmC and its functional role in the control of gene expression are poorly understood. Emerging enrichment approaches can identify 5-mC and 5-hmC genomic regions with single-base resolution and describe the differentially methylated regions (DMRs) and differentially hydroxymethylated regions (DhMRs) in cancer [26,27]. Determining the 5-hmC modified genomic regions in breast cancer will be useful for diagnostic and therapeutic markers.
In this study, we found that global methylation and hydroxymethylation levels were drastically reduced in breast cancer tissues. The global 5-hmC reduction was associated with the downregulation of the TET1 and TET3 genes. The genome-wide analysis revealed differentially methylated and differentially hydroxymethylated breast cancer loci. We also identified a strong correlation between the 5-hmC alterations and gene expression changes. Altogether, the study provides a comprehensive genome-wide distribution of 5-mC and 5-hmC, and also an imbalance in the DNA methylation machinery that leads to breast cancer development.

2. Materials and Methods

2.1. Clinical Specimen

Breast cancer tissues of stages IIA–IV and paired normal (PN) tissue were obtained from the Tumor Bank, Cancer Institute (WIA), Chennai, India. Tissue samples were collected from patients undergoing direct surgery for invasive ductal carcinoma (IDC) or ductal carcinoma in situ (DCIS). Tumor tissues (n = 15) were histopathologically confirmed to consist of >70% tumor cells, and paired non-cancerous tissue (n = 15) free of tumor cells was excised away from the tumor margin. Similarly, DCIS (n = 5) were obtained from patients undergoing a wide-excision biopsy, and absolute normal samples (n = 5) were collected from patients undergoing wide-excision biopsy for non-tumorous conditions, such as fibrosis or adenosis, and histopathologically confirmed to be free of any tumor cells. An additional set of tumour samples (n = 30) and non-cancerous (n = 6) tissues were also collected for a validation study. Informed consent for participation and sampling was obtained from all patients. The Cancer Institute (WIA), Institutional Ethics Committee (IEC/2016/05) approved the study.

2.2. Isolation of Genomic DNA

About 25 mg of tissue was homogenized and DNA was isolated using the Nucleospin Tissue DNA Kit (Macherey Nagel, GmbH, Düren, Germany) according to the manufacturer’s instructions. The isolated DNA was quantified with Nanodrop ND-2000 and stored at −20 °C until further use.

2.3. Estimation of Global Levels of 5-hmC and 5-mC

Genomic DNA (100 ng) was used for the estimation of global 5-hmC and 5-mC levels by ELISA using the Quest 5-hmC ELISA kit and the 5-mC DNA ELISA kit (Zymo Research Inc., Irvine, CA, USA) according to the manufacturer’s instructions.

2.4. RNA Isolation and TET Expression Assay

Briefly, tissues were homogenized, and RNA was isolated using the Nucleospin® RNA Isolation Kit (Macherey Nagel, GmbH). RNA was quantitated using Nanodrop ND-2000 and cDNA was synthesized from 500 ng of total RNA using a Quantitect® reverse transcription kit (Qiagen, Germantown, MD, USA). Gene expression analysis of TET 1, 2, and 3 was performed using TaqMan probes (Supplemetary File S4), TaqMan™ Universal Master Mix II, no UNG (Applied Biosystems, Waltham, MA, USA), and the Quant studio 12Kflex system (Applied Biosystems, Waltham, MA, USA).

2.5. Enrichment of 5-mC Modified DNA Regions and Library Preparation

Genomic DNA (1 μg) was fragmented to 300–600 bp after 25 cycles of 30 s of pulsed sonication in Bioruptor (Diagenode, Liège, Belgium). Fragmented DNA was end-repaired, and adapter ligation was performed using the NEBNext Ultra II DNA Library Prep Kit (New England Biolabs, Ipswich, MA, USA) for Illumina. Furthermore, fragments with a length of 400–500 bp (300 bp insert + 120 bp adapter) were size-selected using AMPure beads (Beckman Coulter, Brea, CA, USA). Immunoprecipitation of methylated DNA was performed using the MagMeDIP kit (Diagenode, Belgium), and enriched DNA was purified using the I Pure kit (Diagenode, Belgium) according to the manufacturer’s protocol. Purified DNA was indexed and amplified using the NEBNext Ultra II DNA Library Prep Kit (New England Biolabs, Ipswich, MA, USA).

2.6. Enrichment of 5-hmC Modified DNA Regions and Library Preparation

For the enrichment of 5-hmC-modified DNA, a reduced representation hydroxymethylation profiling by RRHP kit (Zymo Research Inc., Irvine, CA, USA) was carried out. The genomic DNA (1 μg) was digested using the Msp1 enzyme and ligated with p5 and p7 adapters. The adapter-ligated fragments were glycosylated with UDP-glucose and T4-glycosyltransferase and digested again with Msp1 to cleave adapters from non-glycosylated fragments. The fragments were size selected 400–500 bp (300 bp insert + 120 bp adapter) using AMPure XP beads (Beckman Coulter, Brea, CA, USA) and amplified as libraries using RRHP™ 5-hmC Library Prep Kit (Zymo Research. Inc., Irvine, CA, USA).

2.7. Sequencing and Data Analysis

Enriched libraries were sequenced by 150 × 2 paired ends to generate 25 million reads in the Illumina Nextseq 500. The FASTQ files were quality checked by FastQC. Trimmomatic was used to trim the adapter sequence and the low-quality reads (Phred > 30) were discarded. Using the BWA mem algorithm, the processed fastq files were then aligned to the reference genome (hg19). Aligned files were then converted into sorted bam files using samtools. The peaks were called with MACS2, and the peak files were used to find overlapping peaks in multiple files, and their raw counts were extracted with the DiffBind R package.

2.8. DMR Analysis

DMR analyses were carried out using the DESeq2 R package and the likelihood ratio test (LRT) was used to determine DMR with the padj value cut-off of ≤0.01. Furthermore, the DMR regions were filtered based on the number of peaks called in biological replicate with at least 10 | 10 | 5 | 5 (T | PN | DCIS | AN). Filtered DMR was annotated using the ChIPseeker R package with the UCSC hg19 known gene sets. The Z-Score was calculated from normalized counts and the heatmap was plotted using the Complex Heat map R package. The sorted bam files were indexed, and the coverage profile was calculated using the Deep Tools bam Coverage with RPKM normalization and a bin size of 20 bp. The resulted bigwig files were used to visualize peak regions with IGV.

2.9. DhMR Analysis

For the DhMR analysis, RPKM values were extracted from the DiffBind R package and a global rank-invariant set normalization was carried out using the rank-invariant function from the Lumi R package. The Kruskal–Wallis test was performed, and the p-value was adjusted with FDR. A cut-off of padj ≤ 0.1 was set to define DhMR. In addition, the DhMR peaks were filtered with the same criteria as DMR. The chromosomal distributions of DMR and DhMR were analyzed with the Karyoplot R-Package with a p-value of 0.05. The ideogram was examined for the DMR and DhMR regions with an FDR of 0.05.

2.10. Pathway Enrichment and Gene Ontology Analysis

Pathway enrichment and gene ontology analysis were performed with the g: Profiler and Cluster Profiler R package. For the p-adjusted corrected method, FDR with a p-value cut-off of 0.05 was used. Dot blots were generated based on the criteria mentioned above both for DMRs and DhMRs.

2.11. Validation of Loci-Specific 5-hmC Enriched Regions Using qPCR Assays

Briefly, the tumor (n = 30) and normal tissue (n = 6) were homogenized, and DNA was isolated using the Nucleospin Tissue DNA Kit (Macherey Nagel, GmBH) according to the manufacturer’s instructions. The genomic DNA was subjected to a 5-hmC-specific enrichment with EpiMark 5-hmC and 5-mC analysis kit (New England Biolabs, Ipswich, MA, USA). The processed DNA samples were analyzed with qPCR using loci-specific primers (Supplemetary File S4). Gene expression, methylation analysis, and survival analysis were carried out with the UALCAN [28] web tool for 5-mC- and 5-hmC-specific candidate genes.

2.12. Statistical Analysis

The correlation analysis was performed between the expression levels of TET enzymes and the global levels of 5-mC and 5-hmC distribution using GraphPad Prism v 7.0a (GraphPad Software, La Jolla, CA, USA). Spearman’s rank correlation test was performed with a confidence interval of 95% for generating the correlation plot. Linear regression lines were generated based on the R-value of the entities. The Wilcoxon signed-rank test was used for the nonparametric paired analysis. The Mann–Whitney U test was performed for the nonparametric unpaired analysis. The p-value of <0.05 is considered a significant outcome.

3. Results

3.1. Loss of 5-hmC Is Associated with TET 1 and TET3 Downregulation in Breast Cancer

Global levels of 5-hmC and 5-mC were first quantified in breast cancer (invasive ductal carcinoma (IDC) and ductal carcinoma in situ (DCIS)) paired normal (PN), and apparent normal (AN) tissues. We found a significant decrease in 5-hmC levels in IDC vs. PN (FC = −2.58, p = 0.0003) and DCIS vs. AN (FC = −2.08, p = 0.0324) (Figure 1a). Although global 5-mC levels were reduced in IDC vs. PN (FC = −2.28, p = 0.0014), there was no significant difference between the DCIS vs. AN group (p = 0.5467) (Figure 1b). The results imply that the global loss of 5-hmC is a characteristic epigenetic alteration of localized and invasive breast cancer. The differential expression of TET genes leads to the altered 5-hmC levels in breast cancer. Therefore, we quantified the TET gene expression levels and found that the genes TET2 (FC = +2.02, p = 0.0317) (Figure 1d), and TET3 (FC = +2.0, p = 0.0159) (Figure 1e) were upregulated in DCIS vs. AN group. However, TET1 (FC = −2.08, p = 0.0266) (Figure 1c) and TET3 (FC = −2.0, p = 0.026) (Figure 1e) genes were downregulated in the IDC vs. PN and IDC vs. AN groups. Spearman’s rank test showed no significant correlation of TET genes with global 5-hmC levels in PN tissues (Figure 1f–h), while it showed a significant positive correlation of TET1 (r = 0.544, p = 0.05) (Figure 1i) and TET3 (r = 0.5662, p = 0.0437) (Figure 1k) genes with the global loss of 5-hmC in breast cancer but not with the TET2 gene (Figure 1j). In addition to TET1, we report here that TET3 is also associated with a global loss of 5-hmC in breast cancer.

3.2. Relative Abundance of 5-hmC in Breast Cancer and Their Enrichment at Distal Regulatory Sites

We performed the enrichment of genomic 5-mC and 5-hmC specific regions followed by high-throughput sequencing and achieved ~14 million reads from 5-hmC-enriched libraries and ~21 million reads from 5-mC-enriched libraries. Almost 99.7% of the 5-hmC reads and ~98.94% of 5-mC reads were mapped effectively against the reference genome. Principal component analysis (PCA) and hierarchical clustering analysis (HCA) showed a clear pattern of segregation from the tumor (IDC and DCIS) to normal samples (PN and AN) (Supplemetary Figure S1a–d). MACS-2 peak calling of the mapped reads resulted in a total of 3.3 million 5-hmC-enriched peak sets and 4.7 million 5-mC-enriched peak sets. Differential peak calling analysis between breast cancer and normal groups identified 4809 differential methylated regions (DMRs) (p < 0.01, FDR < 0.05) (Figure 2a) and 4841 differential hydroxymethylated regions (DhMRs) (p < 0.01, FDR < 0.05) (Figure 2b) (Supplemetary File S1a,b). The distribution of peaks across the chromosomes showed a higher peak intensity of DhMR over DMR (Figure 2c,d) (window size: 1 × 10−6). Significantly higher peak intensity of DhMR indicates a potential difference in loci-specific 5-hmC levels between breast tumor and paired normal samples.
The peaks characterized by their genomic features indicated that both DMR and DhMR were moderately found in the gene body (28.97% and 37.31%, respectively), particularly in the intronic regions, but not in the exons. The DMR profile was high in the promoter (28.57%) regions. However, only a mere 8.82% enrichment of DhMR was found in the promoter region compared to massive accumulation in the distal intergenic regions (43.11%) ((Figure 2e) (Supplemetary Figure S2a,b and Table S1a,b)). Therefore, we show here that the distribution in the gene body does not invariably differ between the two modifications, but DMR is typically enriched at the proximal regulatory sites (promoter), while DhMR at the distal regulatory sites (intergenic region). We found the enrichment of 5-mC and 5-hmC were significantly distinguishable between tumor and normal tissues. Further, the enriched peak sets of the DhMRs and DMRs were analyzed for transcription factor binding sites. We found that the accumulation in the promoter regions of DMRs in the interval 1500 bp upstream and 1500 bp downstream of the TSSs (read count frequency > 6.5 × 10−5) was higher than that of DhMRs (read count frequency < 3 × 10−5) (Supplemetary Figure S2c). While in DhMRs, the accumulation was observed in the distal intergenic regions upstream 5 kb from the TSSs (read count frequency > 6.5 × 10−5) ((Figure 2f–g) (Supplemetary Figure S2d)). We also used LOLA-Web to identify the locus overlap between the 5-hmC sites and the regulatory sites in the distal intergenic regions (i.e., regions > −5 kb from the TSSs). Tumor and PN peak sets of 5-hmC were tested against the ENCODE data set with the reference genome hg19 and normalized with preloaded Tiles1000.hg19.bed. Tumor-specific 5-hmC peak sets are significantly associated with the enhancer sites of the breast cancer cell line MCF-7 regions such as H3K4me1, H3K4me3, H3K14ac, and H3K9ac (log (p-value) > 300). Enhancer sites overlapped with loci-specific 5-hmC levels of the candidate genes in the breast cancer cell lines, such as MDA-MD-468, MDA-MB-231, and MCF-7 regions, but not in the normal luminal cell line MCF-10A (Supplemetary Figure S3a–f). Hence, the results suggest that DhMRs were widespread in the distal regulatory regions, while DMRs accumulated in the proximal regulatory regions of the breast cancer genome.

3.3. Locus-Specific Imbalance of DhMRs and DMRs in Breast Cancer

To determine the exact loci and the differential distribution of 5-hmC and 5-mC accumulation in breast cancer, we identified 35 hyper-hmC loci (Supplemetary File S2a), and 30 hypo-hmC loci (Supplemetary File S2b). The hyper-hmC loci included coding genes (GALC, BNIPL, TXNL1, CNIH3, etc.), lncRNA (LINC00535, LINC00662, and PTPRN2 lncRNA), and microRNA (MIR4278, MIR1204, MIR944, and MIR921). We found 26 coding genes (ZBTB16, SP8, THRB, HIC2, etc.,) and only four non-coding loci (MIR4417, MIR3612, LINC00911, and LINC00417) among the hypo-hmC-specific regions. A total of 57 hyper-mC loci inclusive of 53 coding genes (CCDC181, SIM2, ID4, etc.) and 4 non-coding genes (LINC01257, LOC728989, MIR5087, and MIR183) were obtained. The hypo-mC loci consisted of 24 coding genes (OPCML, MKI67, and SPOCK1) and 6 were non-coding (MIR548AR, LINC02347, MIR744, MIR3612, etc.) (Figure 3a) (Supplemetary File S2c,d). Further, we validated five hyper-hmC and four hypo-hmC loci in breast cancer and normal tissues. The results confirmed the 5-hmC gain of TXNL1 (FC = 4, p = 0.0102) (Figure 3b,c), CNIH3 (FC = 2, p = 0.0242) (Figure 3d), while for BNIPL, A4GALT and CBLN4 no statistical significance was observed, although they followed the same trend (Figure 3e–g). On the other hand, hypo-hmC candidates CHODL showed a two-fold loss of 5-hmC (p = 0.0416) (Figure 3h). Other loci such as ZBTB16, HIC2, and SP8 showed a trend towards 5-hmC loss in tumors but did not show any statistical significance (Figure 3i–k). Our validation analysis confirmed that the gain of 5-hmC in TXNL1, BNIPL, CNIH3 and loss of 5-hmC in CHODL, ZBTB16, SP8, and HIC2 in breast cancer samples. The gene function prediction by g: Profiler indicated that 5-mC and 5-hmC genes were related to cell cycle regulation, cell cycle inhibition, and cell proliferative signals that could potentially affect breast cancer development and progression (Supplemetary File S3a–d). The gene ontology and KEGG pathway enrichment map of DhMRs and DMRs also revealed the association of breast cancer-specific 5-hmC and 5-mC with transcriptional machinery (Supplemetary Figure S4a–d). Altered levels of 5-mC and 5-hmC eventually control and determine the progression of breast cancer development. Extensive functional analysis of the identified loci will elucidate the significance of the 5-mC and 5-hmC imbalance in breast cancer development.

3.4. Association of 5-mC and 5-hmC Modifications with Gene Expression

We investigated the aberrant levels of methylation and hydroxymethylation in the locus-specific aspect and its impact on the regulation of gene expression using the TCGA-breast cancer data set (UALCAN webtool) (Table 1). We found the hyper-methylation of CCDC181 (beta value > 0.5, p < 0.05), ID4 (beta value > 0.5, p < 0.05) and hypo-methylation of MKI67, OPCML, and SPOCK1 (beta value < 0.3, p < 0.05). Correspondingly, CCDC181 (normalized TPM count = −0.094, p < 0.1) (Figure 4a), OR4F29 (normalized TPM count = −0.005, p < 0.1) (Figure 4b), and ID4 (normalized TPM count = −58.839, p < 0.001) (Figure 4c), were downregulated in tumor samples (n = 1094) while, MKI67 (normalized TPM count = 9.94, p < 0.001) and SPOCK1 (normalized TPM count = 3.26, p < 0.001) were found to be over-expressed in tumor samples but not OPCML (Figure 4d–f). The inverse correlation confirmed that hypermethylation leads to the suppression of gene expression and hypomethylation leads to overexpression of the gene. Further, we found that hyper-hmC candidates, BNIPL (normalized TPM count = 5.344, p < 0.05), CNIH3 (normalized TPM count = 0.778, p < 0.005), and TXNL1 (normalized TPM count = 14.549, p < 0.001) to be upregulated and the hypo-hmC candidates such as ZBTB16 (normalized TPM count = −13.933, p < 0.001), HIC2 (normalized TPM count = −0.436, p < 0.001), CHODL (normalized TPM count = −1.093, p < 0.001), THRB (normalized TPM count = −13.792, p < 0.001) and RAPGEF2 (normalized TPM count = −9.296, p < 0.001) were significantly downregulated (Figure 4g–p). Several loci relax the repressive methylation marks by increasing the 5-hmC levels leading to gene activation. In this study, we found that three candidate genes, TXNL1, CNIH3, and BNIPL, showed an increase in 5-hmC levels associated with gene overexpression. The direct proportionality between 5-hmC and gene expression reinstates that gain of 5-hmC activates gene transcription. We further investigated gene expression influence and the candidate gene’s overall survival with hyper-hmC specifications. The results indicate that overexpression of genes such as TXNL1 (p = 0.042) (Supplemetary Figure S5a), CNIH3 (p = 0.26) (Supplemetary Figure S5b) and BNIPL (p = 0.0001) (Supplemetary Figure S5c) was associated with poor overall survival in breast cancer patients.

4. Discussion

The present study illustrated the methylation and hydroxymethylation landscape of the breast cancer genome. Initial findings showed that the downregulation of the TET genes causes a global 5-hmC reduction in breast cancer. The genome-wide profiling revealed a higher 5-mC accumulation around the TSSs from −1.5 k to +1.5 k, while the 5-hmC accumulated in the intergenic regions (>−5 kb away from TSSs). Thus, DMRs are mainly associated with proximal gene regulation and DhMRs with distal regulation. We found an intergenic and gene body gain of 5-hmC associated with gene overexpression and a loss of 5-hmC towards downregulation of the corresponding genes. The study results show that the imbalance between 5-mC and 5-hmC is a novel phenomenon orchestrating the epigenetic machinery of breast cancer.
Previous studies also reported global 5-mC and 5-hmC loss in various cancers, including breast cancer [16,22,29,30]. The tissue-specificity and the alterations of TET gene expression in advancing cancer stages determine the 5-hmC levels and the demethylation process [13,17,23,31]. The cytoplasmic mislocalization of TET1 in ER/PR-negative subtypes of IDC and DCIS was directly proportional to the global reduction in 5-hmC levels [24]. We show here that the global reduction in the 5-hmC content in IDC is dependent on TET1 and TET3 genes. The CXXC domains of TET1 and TET3 enhance the DNA binding efficiency at DNA demethylation sites [32]. The expression and nuclear import of TET1 and TET3 facilitate an active oxidation process. Our finding shows the downregulation of TET3 also contributes to 5hmC loss in breast cancer tissues.
Genome-wide profiling showed the effects of promoter methylation in breast cancer. The advancement in enrichment strategies and next-generation sequencing has resulted in the contemporary analysis of 5-mC and 5-hmC describing their genomic signatures in breast cancer [33,34,35]. In our study, we found the enrichment of 5-mC and 5-hmC significantly distinguishable between tumor and normal tissues. Higher hydroxymethylation levels were observed in the promoter and UTRs of DCIS tissues. On the other hand, the invasive type showed a higher accumulation in the gene body and intergenic regions than in the promoter regions. Hence, we speculate that accumulation of 5-hmC in the preliminary stage of breast cancer occurs mainly at the proximal regulatory regions, reducing the suppression caused by 5-mC. In the locally advanced breast tumors, the enrichment at the distal intergenic regions implicates an enhancer-like activity of 5-hmC. Previous studies also reported that 5-hmC could potentially act as an enhancer or super-enhancer elements ~5–10 kb and >20 kb away from the TSSs [36,37]. The overlapping histone markers from the ENCODE roadmap project and 5-hmC sites emphasized the association of active enhancer sites in the breast cancer genome. The positive association of histone activation and 5-hmC gain suggests a synergistically enhanced gene.
Several loci relax the repressive methylation marks by increasing the 5-hmC levels leading to gene activation. Our results found TXNL1 as a novel 5-hmC candidate gene in breast cancer with an increased 5-hmC level and corresponding gene overexpression. Survival analysis also indicated the overexpression of TXNL1 and BNIPL in breast cancer is significantly associated with poor overall survival. Previous studies reported that induction of oxidative stress led to the overexpression of TXNL1 associated with the downregulation of the DNA repair protein XRCC1, an accumulation of DNA damage, and BCL-2 regulation [38]. Further functional analysis on the 5-hmC gain at TXNL1 and other loci would be warranted to elucidate the mechanistic insight of 5-hmC in the transcriptional activation and breast cancer development.

5. Conclusions

The present study opens a new paradigm on the imbalance of 5-mC and 5-hmC in breast cancer. The study offers a detailed perspective on an epigenomic instability substantiated by the loss of 5-mC and 5-hmC in breast tumors. Global loss of 5-hmC is associated with TET1 and TET3 downregulation. Genome-wide profiling has revealed a profound imbalance in breast cancer’s region-specific distribution of 5-mC and 5-hmC. Predominant 5-hmC modifications localized at distal gene regulatory sites implicating a transcription enhancing function. The novel 5-hmC candidates identified in the study can be promising diagnostic and therapeutic markers for breast cancer.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/cells11192939/s1. Figure S1: (a) Principal component analysis of 5-mC enriched breast tumor and normal samples. (b) Principal component analysis of 5-hmC enriched breast tumor and normal samples (c) Hierarchical clustering analysis of 5-mC enriched breast tumor and normal samples. (d) Hierarchical clustering analysis of 5-hmC enriched breast tumor and normal samples. Figure S2: (a) Annopie representing DMRs. (b) Annopie representing DhMRs. (c) Relative peak count frequency of DMRs from TSSs (−1.5 kb to +1.5 kb from the TSSs). (d) Relative peak count frequency of DhMRs from TSSs (−1.5 kb to +1.5 kb from the TSSs). Figure S3: Locus overlap analysis and ChIP seq analysis using UALCAN and LOLA webtool. Figure S4: (a) KEGG pathway enrichment analysis of DMRs. (b) Gene ontology analysis of DMRs. (c) KEGG pathway enrichment analysis of DhMRs. (d) Gene ontology analysis of DhMRs. Figure S5: (a) Survival analysis of TXNL1 in breast cancer patients using UALCAN webtool. (b) Survival analysis of CNIH3 in breast cancer patients using UALCAN webtool. (c) Survival analysis of BNIPL in breast cancer patients using UALCAN webtool. File S1: (a) Significant differentially hydroxymethylated regions (DhMRs) of breast tumor and normal samples. (b) Significant differentially methylated regions (DMRs) of breast tumor and normal samples. File S2: Significant hyper- and hypo-5-mC and 5-hmC regions (a) Hyper-hmC gene list (b) hypo-hmC gene list (c) hyper-mC gene list (d) hypo-mC gene list. File S3: Gene-specific analysis using gprofiler (a) Hyper-hmC genes dataset (b) Hypo-hmC genes dataset (c) Hyper-mC genes dataset (d) Hypo-mC genes dataset. File S4: qPCR primers used in the study. Table S1: (a) Genomic features of 5-mC (b) Genomic features of 5-hmC (paired normal (PN), invasive ductal carcinoma (IDC), ductal carcinoma in situ (DCIS), and apparent normal (AN) samples).

Author Contributions

Conceptualization, S.M.; data curation, D.R.; formal analysis, D.R. and A.K.D.M.R.; funding acquisition, S.M.; investigation, A.K.D.M.R. and S.M.; methodology, D.R., A.K.D.M.R., M.B. and A.V.R.; project administration, S.M.; resources, S.S. and S.V.; supervision, R.T. and S.M.; validation, D.R.; visualization, D.R. and A.K.D.M.R.; writing—original draft, D.R.; writing—review & editing, A.K.D.M.R., R.T. and S.M. All authors have read and agreed to the published version of the manuscript.

Funding

The study was fully funded by a financial grant sanctioned by the Science and Engineering Research Board, Department of Science and Technology (DST), Government of India (EMR/2015/001319), and the student fellowship was sanctioned by the DST-INSPIRE fellowship (IF190144), Government of India.

Institutional Review Board Statement

This study was approved by the Ethics Committees of Cancer Institute (WIA), Regional Cancer Centre, Chennai—600036 (IEC/2016/05). The study was conducted according to the principles expressed in the Declaration of Helsinki.

Informed Consent Statement

Informed consent was obtained from all individual participants included in the study.

Data Availability Statement

Raw sequencing data are available in Sequence Read Archive Hosted by National Centre for Biotechnology Information (NCBI) search database with accession number PRJNA769519.

Conflicts of Interest

All authors declare that they have no conflict of interest.

Abbreviations

5-mC5-methylcytosine
5-hmC5-hydroxymethylcytosine
TETTen eleven translocases
IDCInvasive ductal carcinoma
DCISDuctal carcinoma in situ
FCFold change
pp-value
padjp-adjusted value
FDRFalse discovery rate
L2FClog-2-fold change
TPMTranscript per million
RRHPReduced representation hydroxymethylation profiling
WGSWhole-genome sequencing
MeDIPMethylated DNA immunoprecipitation
BCBreast cancer
LRTLikelihood ratio test
TSSstranscription start sites
TFBStranscription factor binding sites

References

  1. Kulis, M.; Esteller, M. DNA Methylation and Cancer. Adv. Genet. 2010, 70, 27–56. [Google Scholar] [CrossRef] [PubMed]
  2. Moore, L.D.; Le, T.; Fan, G. DNA Methylation and Its Basic Function. Neuropsychopharmacology 2013, 38, 23–38. [Google Scholar] [CrossRef] [PubMed]
  3. Münzel, M.; Globisch, D.; Carell, T. 5-Hydroxymethylcytosine, the Sixth Base of the Genome. Angew. Chem. Int. Ed. 2011, 50, 6460–6468. [Google Scholar] [CrossRef]
  4. Dahl, C.; Grønbæk, K.; Guldberg, P. Advances in DNA methylation: 5-hydroxymethylcytosine revisited. Clin. Chim. Acta 2011, 412, 831–836. [Google Scholar] [CrossRef] [PubMed]
  5. Bachman, M.; Uribe-Lewis, S.; Yang, X.; Williams, M.; Murrell, A.; Balasubramanian, S. 5-Hydroxymethylcytosine is a predominantly stable DNA modification. Nat. Chem. 2014, 6, 1049–1055. [Google Scholar] [CrossRef] [PubMed]
  6. Scourzic, L.; Mouly, E.; Bernard, O.A. TET proteins and the control of cytosine demethylation in cancer. Genome Med. 2015, 7, 9. [Google Scholar] [CrossRef] [PubMed]
  7. Rasmussen, K.D.; Helin, K. Role of TET enzymes in DNA methylation, development, and cancer. Genes Dev. 2016, 30, 733–750. [Google Scholar] [CrossRef]
  8. Li, W.; Liu, M. Distribution of 5-Hydroxymethylcytosine in Different Human Tissues. J. Nucleic Acids 2011, 2011, 870726. [Google Scholar] [CrossRef]
  9. Jin, S.-G.; Wu, X.; Li, A.X.; Pfeifer, G.P. Genomic mapping of 5-hydroxymethylcytosine in the human brain. Nucleic Acids Res. 2011, 39, 5015–5024. [Google Scholar] [CrossRef]
  10. Yildirim, O.; Li, R.; Hung, J.-H.; Chen, P.B.; Dong, X.; Ee, L.-S.; Weng, Z.; Rando, O.J.; Fazzio, T.G. Mbd3/NURD Complex Regulates Expression of 5-Hydroxymethylcytosine Marked Genes in Embryonic Stem Cells. Cell 2011, 147, 1498–1510. [Google Scholar] [CrossRef] [Green Version]
  11. Mellén, M.; Ayata, P.; Dewell, S.; Kriaucionis, S.; Heintz, N. MeCP2 Binds to 5hmC Enriched within Active Genes and Accessible Chromatin in the Nervous System. Cell 2012, 151, 1417–1430. [Google Scholar] [CrossRef] [PubMed]
  12. Haffner, M.C.; Chaux, A.; Meeker, A.K.; Esopi, D.M.; Gerber, J.; Pellakuru, L.G.; Toubaji, A.; Argani, P.; Iacobuzio-Donahue, C.; Nelson, W.G.; et al. Global 5-hydroxymethylcytosine content is significantly reduced in tissue stem/progenitor cell compartments and in human cancers. Oncotarget 2011, 2, 627–637. [Google Scholar] [CrossRef] [PubMed]
  13. Liu, C.; Liu, L.; Chen, X.; Shen, J.; Shan, J.; Xu, Y.; Yang, Z.; Wu, L.; Xia, F.; Bie, P.; et al. Decrease of 5-Hydroxymethylcytosine Is Associated with Progression of Hepatocellular Carcinoma through Downregulation of TET1. PLoS ONE 2013, 8, e62828. [Google Scholar] [CrossRef] [PubMed]
  14. Murata, A.; Baba, Y.; Ishimoto, T.; Miyake, K.; Kosumi, K.; Harada, K.; Kurashige, J.; Iwagami, S.; Sakamoto, Y.; Miyamoto, Y.; et al. TET family proteins and 5-hydroxymethylcytosine in esophageal squamous cell carcinoma. Oncotarget 2015, 6, 23372–23382. [Google Scholar] [CrossRef]
  15. Chen, Z.; Shi, X.; Guo, L.; Li, Y.; Luo, M.; He, J. Decreased 5-hydroxymethylcytosine levels correlate with cancer progression and poor survival: A systematic review and meta-analysis. Oncotarget 2016, 8, 1944–1952. [Google Scholar] [CrossRef]
  16. Fernandez, A.F.; Bayón, G.F.; Sierra, M.; Urdinguio, R.G.; Toraño, E.G.; García, M.G.; Carella, A.; Lopez, V.; Santamarina, P.; Pérez, R.F.; et al. Loss of 5hmC identifies a new type of aberrant DNA hypermethylation in glioma. Hum. Mol. Genet. 2018, 27, 3046–3059. [Google Scholar] [CrossRef]
  17. Eyres, M.; Lanfredini, S.; Xu, H.; Burns, A.; Blake, A.; Willenbrock, F.; Goldin, R.; Hughes, D.; Hughes, S.; Thapa, A.; et al. TET2 Drives 5hmc Marking of GATA6 and Epigenetically Defines Pancreatic Ductal Adenocarcinoma Transcriptional Subtypes. Gastroenterology 2021, 161, 653–668e16. [Google Scholar] [CrossRef]
  18. Zhang, J.; Han, X.; Gao, C.; Xing, Y.; Qi, Z.; Liu, R.; Wang, Y.; Zhang, X.; Yang, Y.-G.; Li, X.; et al. 5-Hydroxymethylome in Circulating Cell-free DNA as A Potential Biomarker for Non-small-cell Lung Cancer. Genom. Proteom. Bioinform. 2018, 16, 187–199. [Google Scholar] [CrossRef]
  19. Matthias, W.; Liou, W.; Pulverer, W.; Singer, C.F.; Rappaport-Fuerhauser, C.; Kandioler, D.; Egger, G.; Weinhäusel, A. Cytosine 5-Hydroxymethylation of the LZTS1 Gene Is Reduced in Breast Cancer. Transl. Oncol. 2013, 6, 715–721. [Google Scholar] [CrossRef]
  20. Wilkins, O.M.; Johnson, K.C.; Houseman, E.A.; King, J.E.; Marsit, C.J.; Christensen, B.C. Genome-wide characterization of cytosine-specific 5-hydroxymethylation in normal breast tissue. Epigenetics 2019, 15, 398–418. [Google Scholar] [CrossRef] [Green Version]
  21. Cai, J.; Chen, L.; Zhang, Z.; Zhang, X.; Lu, X.; Liu, W.; Shi, G.; Ge, Y.; Gao, P.; Yang, Y.; et al. Genome-wide mapping of 5-hydroxymethylcytosines in circulating cell-free DNA as a non-invasive approach for early detection of hepatocellular carcinoma. Gut 2019, 68, 2195–2205. [Google Scholar] [CrossRef] [PubMed]
  22. Ficz, G.; Gribben, J.G. Loss of 5-Hydroxymethylcytosine in Cancer: Cause or Consequence? Genomics 2014, 104, 352–357. [Google Scholar] [CrossRef] [PubMed]
  23. Carella, A.; Tejedor, J.R.; García, M.G.; Urdinguio, R.G.; Bayón, G.F.; Sierra, M.; Lopez, V.; García-Toraño, E.; Santamarina-Ojeda, P.; Pérez, R.F.; et al. Epigenetic downregulation of TET3 reduces genome-wide 5hmC levels and promotes glioblastoma tumorigenesis. Int. J. Cancer 2019, 146, 373–387. [Google Scholar] [CrossRef] [PubMed]
  24. Tsai, K.-W.; Li, G.-C.; Chen, C.-H.; Yeh, M.-H.; Huang, J.-S.; Tseng, H.-H.; Fu, T.-Y.; Liou, H.-H.; Pan, H.-W.; Huang, S.-F.; et al. Reduction of global 5-hydroxymethylcytosine is a poor prognostic factor in breast cancer patients, especially for an ER/PR-negative subtype. Breast Cancer Res. Treat. 2015, 153, 219–234. [Google Scholar] [CrossRef]
  25. Wu, S.-L.; Zhang, X.; Chang, M.; Huang, C.; Qian, J.; Li, Q.; Yuan, F.; Sun, L.; Yu, X.; Cui, X.; et al. Genome-wide 5-Hydroxymethylcytosine Profiling Analysis Identifies MAP7D1 as A Novel Regulator of Lymph Node Metastasis in Breast Cancer. Genom. Proteom. Bioinform. 2021, 19, 64–79. [Google Scholar] [CrossRef]
  26. Beck, D.; Ben Maamar, M.; Skinner, M.K. Genome-wide CpG density and DNA methylation analysis method (MeDIP, RRBS, and WGBS) comparisons. Epigenetics 2021, 17, 518–530. [Google Scholar] [CrossRef]
  27. Peng, J.; Xia, B.; Jinying, P. Single-base resolution analysis of DNA epigenome via high-throughput sequencing. Sci. China Life Sci. 2016, 59, 219–226. [Google Scholar] [CrossRef]
  28. Chandrashekar, D.S.; Bashel, B.; Balasubramanya, S.A.H.; Creighton, C.J.; Ponce-Rodriguez, I.; Chakravarthi, B.V.S.K.; Varambally, S. UALCAN: A portal for facilitating tumor subgroup gene expression and survival analyses. Neoplasia 2017, 19, 649–658. [Google Scholar] [CrossRef]
  29. Lian, C.G.; Xu, Y.; Ceol, C.; Wu, F.; Larson, A.; Dresser, K.; Xu, W.; Tan, L.; Hu, Y.; Zhan, Q.; et al. Loss of 5-Hydroxymethylcytosine Is an Epigenetic Hallmark of Melanoma. Cell 2012, 150, 1135–1146. [Google Scholar] [CrossRef]
  30. Oishi, N.; Vuong, H.G.; Mochizuki, K.; Kondo, T. Loss of 5-Hydroxymethylcytosine is an Epigenetic Hallmark of Thyroid Carcinomas with TERT Promoter Mutations. Endocr. Pathol. 2020, 31, 359–366. [Google Scholar] [CrossRef]
  31. Zhang, L.-Y.; Li-Ying, Z.; Wang, T.-Z.; Zhang, X.-C. Prognostic values of 5-hmC, 5-mC and TET2 in epithelial ovarian cancer. Arch. Gynecol. Obstet. 2015, 292, 891–897. [Google Scholar] [CrossRef] [PubMed]
  32. An, J.; Rao, A.; Ko, M. TET family dioxygenases and DNA demethylation in stem cells and cancers. Exp. Mol. Med. 2017, 49, e323. [Google Scholar] [CrossRef] [PubMed]
  33. Taiwo, O.; Wilson, G.A.; Morris, T.; Seisenberger, S.; Reik, W.; Pearce, D.; Beck, S.; Butcher, L.M. Methylome analysis using MeDIP-seq with low DNA concentrations. Nat. Protoc. 2012, 7, 617–636. [Google Scholar] [CrossRef] [PubMed]
  34. Li, D.; Zhang, B.; Xing, X.; Wang, T. Combining MeDIP-seq and MRE-seq to investigate genome-wide CpG methylation. Methods 2014, 72, 29–40. [Google Scholar] [CrossRef] [PubMed]
  35. Clark, C.; Palta, P.; Joyce, C.J.; Scott, C.; Grundberg, E.; Deloukas, P.; Palotie, A.; Coffey, A.J. A Comparison of the Whole Genome Approach of MeDIP-Seq to the Targeted Approach of the Infinium HumanMethylation450 BeadChip® for Methylome Profiling. PLoS ONE 2012, 7, e50233. [Google Scholar] [CrossRef]
  36. Cui, X.-L.; Nie, J.; Ku, J.; Dougherty, U.; West-Szymanski, D.C.; Collin, F.; Ellison, C.K.; Sieh, L.; Ning, Y.; Deng, Z.; et al. A human tissue map of 5-hydroxymethylcytosines exhibits tissue specificity through gene and enhancer modulation. Nat. Commun. 2020, 11, 6161. [Google Scholar] [CrossRef]
  37. Wu, Y.; Chen, X.; Zhao, Y.; Wang, Y.; Li, Y.; Xiang, C. Genome-wide DNA methylation and hydroxymethylation analysis reveal human menstrual blood-derived stem cells inhibit hepatocellular carcinoma growth through oncogenic pathway suppression via regulating 5-hmC in enhancer elements. Stem Cell Res. Ther. 2019, 10, 151. [Google Scholar] [CrossRef]
  38. Xu, W.; Wang, S.; Chen, Q.; Zhang, Y.; Ni, P.; Wu, X.; Zhang, J.; Qiang, F.; Li, A.; Røe, O.D.; et al. TXNL1-XRCC1 pathway regulates cisplatin-induced cell death and contributes to resistance in human gastric cancer. Cell Death Dis. 2014, 5, e1055. [Google Scholar] [CrossRef]
Figure 1. Global 5-mC, 5-hmC, and TET gene expression levels in breast cancer. (a) Global levels of 5-hmC in IDC (n = 15), PN (n = 15), DCIS (n = 5), and AN (n = 5) tissues. (b) Global levels of 5-mC in IDC (n = 15), PN (n = 15), DCIS (n = 5), and AN (n = 5) tissues. (c) Gene expression analysis of TET1 among the IDC, PN, AN, and DCIS samples. (d) Gene expression analysis of TET2 among the IDC, PN, AN, and DCIS samples (e) Gene expression analysis of TET3 among the IDC, PN, AN, and DCIS samples. (f) Correlation analysis of global 5-hmC levels of PN samples with relative mRNA expression of TET1 gene. (g) Correlation analysis of global 5-hmC levels of PN samples with relative mRNA expression of TET2 gene. (h) Correlation analysis of global 5-hmC levels of PN samples with relative mRNA expression of TET3 gene. (i) Correlation analysis of global 5-hmC levels of breast tumour samples with relative mRNA expression of TET1 gene. (j) Correlation analysis of global 5-hmC levels of breast tumour samples with relative mRNA expression of TET2 gene. (k) Correlation analysis of global 5-hmC levels of breast tumour samples with relative mRNA expression of TET3 gene. The Wilcoxon signed-rank test was applied to test the statistical significance of paired analysis. The Mann–Whitney U test was used to evaluate unpaired or grouped analysis (*** p < 0.0001; ** p < 0.001, and * p < 0.05). Abbreviations: PN = paired normal; IDC = invasive ductal carcinoma; AN = apparent normal breast tissues; DCIS = ductal carcinoma in situ.
Figure 1. Global 5-mC, 5-hmC, and TET gene expression levels in breast cancer. (a) Global levels of 5-hmC in IDC (n = 15), PN (n = 15), DCIS (n = 5), and AN (n = 5) tissues. (b) Global levels of 5-mC in IDC (n = 15), PN (n = 15), DCIS (n = 5), and AN (n = 5) tissues. (c) Gene expression analysis of TET1 among the IDC, PN, AN, and DCIS samples. (d) Gene expression analysis of TET2 among the IDC, PN, AN, and DCIS samples (e) Gene expression analysis of TET3 among the IDC, PN, AN, and DCIS samples. (f) Correlation analysis of global 5-hmC levels of PN samples with relative mRNA expression of TET1 gene. (g) Correlation analysis of global 5-hmC levels of PN samples with relative mRNA expression of TET2 gene. (h) Correlation analysis of global 5-hmC levels of PN samples with relative mRNA expression of TET3 gene. (i) Correlation analysis of global 5-hmC levels of breast tumour samples with relative mRNA expression of TET1 gene. (j) Correlation analysis of global 5-hmC levels of breast tumour samples with relative mRNA expression of TET2 gene. (k) Correlation analysis of global 5-hmC levels of breast tumour samples with relative mRNA expression of TET3 gene. The Wilcoxon signed-rank test was applied to test the statistical significance of paired analysis. The Mann–Whitney U test was used to evaluate unpaired or grouped analysis (*** p < 0.0001; ** p < 0.001, and * p < 0.05). Abbreviations: PN = paired normal; IDC = invasive ductal carcinoma; AN = apparent normal breast tissues; DCIS = ductal carcinoma in situ.
Cells 11 02939 g001
Figure 2. Genome-wide distribution of 5-mC and 5-hmC in breast cancer. (a) Heatmap representing DMRs in breast cancer. (b) Heatmap showing DhMRs in breast cancer (Z-score ranges from −2 (white) to +2 (red)). (c) Chromosomal distribution of differentially methylated regions (DMRs) in breast cancer. (d) Chromosomal distribution of differentially hydroxymethylated regions (DhMRs) in breast cancer. (e) Genomic features of DMRs and DhMRs in breast cancer. (f) Relative peak count frequency of DhMR from Transcription Start Sites (TSSs). (g) Relative peak count frequency of DMR from TSSs.
Figure 2. Genome-wide distribution of 5-mC and 5-hmC in breast cancer. (a) Heatmap representing DMRs in breast cancer. (b) Heatmap showing DhMRs in breast cancer (Z-score ranges from −2 (white) to +2 (red)). (c) Chromosomal distribution of differentially methylated regions (DMRs) in breast cancer. (d) Chromosomal distribution of differentially hydroxymethylated regions (DhMRs) in breast cancer. (e) Genomic features of DMRs and DhMRs in breast cancer. (f) Relative peak count frequency of DhMR from Transcription Start Sites (TSSs). (g) Relative peak count frequency of DMR from TSSs.
Cells 11 02939 g002
Figure 3. Validation of 5-hmC specific loci in breast tumour and normal samples. (a) Ideogram representing DMRs (blue) and DhMRs (red) across all chromosomes and the candidate loci of hyper-mC, hypo-mC, hyper-hmC, and hypo-hmC groups. (b) Integrative genome viewer representing the gain of 5-hmC in the distal regulatory region of TXNL1 in tumour, paired normal and apparent normal samples. (c) Validation of 5-hmC levels of TXNL1 (d) Validation of 5-hmC levels of CNIH3 (e) Validation of 5-hmC levels of BNIPL. (f) Validation of 5-hmC levels of A4GALT. (g) Validation of 5-hmC levels of CBLN4. (h) Validation of 5-hmC levels of CHODL. (i) Validation of 5-hmC levels of ZBTB16. (j) Validation of 5-hmC levels of HIC2. (k) Validation of 5-hmC levels of SP8. The p-value of <0.05 was considered statistically significant (** p < 0.001, and * p < 0.05).
Figure 3. Validation of 5-hmC specific loci in breast tumour and normal samples. (a) Ideogram representing DMRs (blue) and DhMRs (red) across all chromosomes and the candidate loci of hyper-mC, hypo-mC, hyper-hmC, and hypo-hmC groups. (b) Integrative genome viewer representing the gain of 5-hmC in the distal regulatory region of TXNL1 in tumour, paired normal and apparent normal samples. (c) Validation of 5-hmC levels of TXNL1 (d) Validation of 5-hmC levels of CNIH3 (e) Validation of 5-hmC levels of BNIPL. (f) Validation of 5-hmC levels of A4GALT. (g) Validation of 5-hmC levels of CBLN4. (h) Validation of 5-hmC levels of CHODL. (i) Validation of 5-hmC levels of ZBTB16. (j) Validation of 5-hmC levels of HIC2. (k) Validation of 5-hmC levels of SP8. The p-value of <0.05 was considered statistically significant (** p < 0.001, and * p < 0.05).
Cells 11 02939 g003
Figure 4. Gene expression analysis of 5-mC and 5-hmC specific loci in breast tumour and normal samples. (af) Gene expression analysis of hyper- and hypo-mC candidates. (gp); Gene expression analysis of hyper- and hypo-hmC candidates using UALCAN webtool (*** p < 0.0001; ** p < 0.001, and * p < 0.05).
Figure 4. Gene expression analysis of 5-mC and 5-hmC specific loci in breast tumour and normal samples. (af) Gene expression analysis of hyper- and hypo-mC candidates. (gp); Gene expression analysis of hyper- and hypo-hmC candidates using UALCAN webtool (*** p < 0.0001; ** p < 0.001, and * p < 0.05).
Cells 11 02939 g004
Table 1. Loci-specific candidates of 5-mC and 5-hmC in breast cancer.
Table 1. Loci-specific candidates of 5-mC and 5-hmC in breast cancer.
Hyper-hmC Regions
Gene SymbolEntrez Gene NameAnnotated RegionsLog2-Fold Changep-Value
CBLN4Cerebellin 4 precursorDistal intergenic1.5000.00075
BNIPLBCL2 interacting protein-likePromoter1.5630.00108
CNIH3Cornichon family AMPA receptor auxiliary protein 3Intron1.5420.00187
TXNL1Thioredoxin like -1Distal intergenic1.7200.00217
A4GALTalpha 1,4-galactosyltransferase (P blood group)Promoter1.5200.00848
GALCGalactosylceramidaseDistal intergenic4.9150.0073
Hypo-hmC Regions
Gene SymbolEntrez Gene NameAnnotated RegionsLog2-Fold Changep-Value
SP8Sp8 transcription factorDistal intergenic−1.3125.3 × 10−5
CHODLChondrolectinIntron−1.5700.00015
HIC2HIC ZBTB transcriptional repressor 2Intron−1.5020.00355
RAPGEF2Rap guanine nucleotide exchange factor 2 [Homo sapiens (human)]Distal intergenic−1.5690.00587
ZBTB16Zinc finger and BTB domain containing 16Intron−1.2700.00753
Hypermethylated Regions
Gene SymbolEntrez Gene NameAnnotated RegionsLog2-Fold Changep-Value
NXPH1Neurexophilin 1Intron2.8107.1 × 10−36
SIM2SIM bHLH transcription factor 2Promoter3.0502.6 × 10−31
WT1-ASWT1 antisense RNAPromoter2.7921.0 × 10−25
CCDC181Coiled-coil domain containing 181Promoter3.8643.1 × 10−14
ID4Inhibitor of DNA binding 4, HLH proteinPromoter2.7893.1 × 10−12
OR4F29Olfactory receptor family 4 subfamily F member 29Distal intergenic3.0010.00014
Hypomethylated Regions
Gene SymbolEntrez Gene NameAnnotated RegionsLog2-Fold Changep-Value
MKI67Marker of proliferation Ki-67Distal intergenic−1.2665.9 × 10−11
SPOCK1SPARC (osteonectin), cwcv and kazal-like domains proteoglycan 1Intron−1.4133.4 × 10−6
OPCMLOpioid binding protein/cell adhesion molecule likeIntron−1.6271.9 × 10−6
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ramasamy, D.; Rao, A.K.D.M.; Balaiah, M.; Vittal Rangan, A.; Sundersingh, S.; Veluswami, S.; Thangarajan, R.; Mani, S. Locus-Specific Enrichment Analysis of 5-Hydroxymethylcytosine Reveals Novel Genes Associated with Breast Carcinogenesis. Cells 2022, 11, 2939. https://doi.org/10.3390/cells11192939

AMA Style

Ramasamy D, Rao AKDM, Balaiah M, Vittal Rangan A, Sundersingh S, Veluswami S, Thangarajan R, Mani S. Locus-Specific Enrichment Analysis of 5-Hydroxymethylcytosine Reveals Novel Genes Associated with Breast Carcinogenesis. Cells. 2022; 11(19):2939. https://doi.org/10.3390/cells11192939

Chicago/Turabian Style

Ramasamy, Deepa, Arunagiri Kuha Deva Magendhra Rao, Meenakumari Balaiah, Arvinden Vittal Rangan, Shirley Sundersingh, Sridevi Veluswami, Rajkumar Thangarajan, and Samson Mani. 2022. "Locus-Specific Enrichment Analysis of 5-Hydroxymethylcytosine Reveals Novel Genes Associated with Breast Carcinogenesis" Cells 11, no. 19: 2939. https://doi.org/10.3390/cells11192939

APA Style

Ramasamy, D., Rao, A. K. D. M., Balaiah, M., Vittal Rangan, A., Sundersingh, S., Veluswami, S., Thangarajan, R., & Mani, S. (2022). Locus-Specific Enrichment Analysis of 5-Hydroxymethylcytosine Reveals Novel Genes Associated with Breast Carcinogenesis. Cells, 11(19), 2939. https://doi.org/10.3390/cells11192939

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop