Next Article in Journal
A Missense Mutation in the KLF7 Gene Is a Potential Candidate Variant for Congenital Deafness in Australian Stumpy Tail Cattle Dogs
Next Article in Special Issue
Alternative Splicing of Pre-mRNA in the Control of Immune Activity
Previous Article in Journal
Cloning, Characterization, and RNA Interference Effect of the UDP-N-Acetylglucosamine Pyrophosphorylase Gene in Cnaphalocrocis medinalis
Previous Article in Special Issue
BAF57/SMARCE1 Interacting with Splicing Factor SRSF1 Regulates Mechanical Stress-Induced Alternative Splicing of Cyclin D1
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Landscape of Chimeric RNAs in Non-Cancerous Cells

1
School of Basic Medical Sciences, Academy of Medical Sciences, Zhengzhou University, Zhengzhou 450001, Henan Province, China
2
Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
3
Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
4
UVA Cancer Center, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
*
Authors to whom correspondence should be addressed.
Genes 2021, 12(4), 466; https://doi.org/10.3390/genes12040466
Submission received: 22 February 2021 / Revised: 18 March 2021 / Accepted: 19 March 2021 / Published: 24 March 2021
(This article belongs to the Special Issue Alternative Splicing in Human Physiology and Disease)

Abstract

:
Gene fusions and their products (RNA and protein) have been traditionally recognized as unique features of cancer cells and are used as ideal biomarkers and drug targets for multiple cancer types. However, recent studies have demonstrated that chimeric RNAs generated by intergenic alternative splicing can also be found in normal cells and tissues. In this study, we aim to identify chimeric RNAs in different non-neoplastic cell lines and investigate the landscape and expression of these novel candidate chimeric RNAs. To do so, we used HEK-293T, HUVEC, and LO2 cell lines as models, performed paired-end RNA sequencing, and conducted analyses for chimeric RNA profiles. Several filtering criteria were applied, and the landscape of chimeric RNAs was characterized at multiple levels and from various angles. Further, we experimentally validated 17 chimeric RNAs from different classifications. Finally, we examined a number of validated chimeric RNAs in different cancer and non-cancer cells, including blood from healthy donors, and demonstrated their ubiquitous expression pattern.

1. Introduction

Chimeric fusion RNAs and their encoded proteins were once thought to be features unique to cancer, some of which have been successfully used as cancer diagnostic markers and therapeutic targets [1,2,3,4]. For instance, BCR-ABL in chronic myelogenous leukemia (CML) [5], TMPRSS2-ETS [6,7], SLC45A3-ELK4 [8,9,10,11], and D2HGDH-GAL3ST2 in prostate cancer [12], LHX6-NDUFA8 and SLC2A11-MIF in cervical cancer [13], RRM2-C2orf48 in colorectal cancer (CRC) [14], and ASTN2-PAPPAas in esophageal cancer [15]. Compared to non-cancer tissues and cells, most of these chimeric RNAs are significantly over-expressed in cancer. Without doubt, they represent effective markers for clinical diagnosis/prognosis, and/or drug targets.
However, recent studies have revealed the existence of chimeric RNAs in non-pathological situations [16,17,18,19,20]. Not only have RNA-Sequencing analyses from normal margins of cancer patients revealed that chimeric RNAs can exist in histologically non-neoplasia tissues [16,17,21,22,23,24,25], an increasing number of paired-end RNA sequencing libraries [26,27,28], such as the Genotype-Tissue Expression (GTEx) dataset [29,30], have also allowed for high throughput discovery of non-cancerous chimeric RNA in normal tissues/cells [20,31]. For instance, JAZF1-JJAZ1 is observed in endometrial stromal cells and is derived from RNA trans-splicing instead of chromosome rearrangement, which occurs in endometrial stromal sarcoma [32]. ADCK4-NUMBL, C15orf57-CBX3, ARL10-HIGD2A, and other fusions are detected in GTEx samples [31] and knockdown of ADCK4-NUMBL and C15orf57-CBX decrease cell proliferation and/or cell motility in non-cancerous cells.
In this study, we used three different cell types (HEK-293T, HUVEC, and LO2 cell lines) as models. By interrogating RNA sequencing data via the bioinformatic software tool SOAPfuse, we identified close to four hundred chimeric RNAs. We characterized these chimeras based on various features of the fusions and their parental genes. We then selected 31 candidate chimeric RNAs from different classifications and experimentally validated 17 of these chimeric RNAs as true events in these cell lines. We also explored four of the validated chimeric RNAs in other cells and tissues, including esophageal and prostate cancer and non-cancerous cell lines, as well as healthy blood samples. Additionally, we investigated the location of the transcripts by fractioning cell nucleus and cytoplasm.

2. Materials and Methods

2.1. Cell Culture

Human embryonic kidney cell line (HEK-293T), non-cancer esophageal cell lines (HEEC, HET-1A, and SHEE) prostate cancer cell lines (LNCaP and 22Rv1), and non-cancer prostate epithelial cell lines (WPMY-1 and RWPE-1) were purchased from Chinese Academy of Sciences (Shanghai, China). Human umbilical vein endothelial cell line, HUVEC, human liver cell line, LO2, and esophageal cancer cell lines, KYSE-140, EC1, Eca-109, were gifts from other laboratories in School of Basic Medical Sciences of Zhengzhou University. HEK-293T, HUVEC, LO2, HEEC, and HET-1A cell lines were maintained in DMEM/HIGH GLUCOSE medium containing 4500 mg/L Glucose and 4.0 mM L-Glutamine (HyCloneTM, Logan, UT, USA). KYSE-140, EC1, Eca-109, LNCaP, 22Rv1 and WPMY-1 cell lines were maintained in RPMI1640 medium (HyCloneTM, USA). SHEE was maintained in MEM medium (HyCloneTM, USA). All mediums mentioned above were supplemented with 10% fetal bovine serum (LONSERA, Ciudad de la Costa Canelones, Uruguay) and 1% penicillin and streptomycin (HyCloneTM, USA). RWPE-1 cell line was maintained in K-SFM medium (Gibco, USA) without additives. The cells were incubated at 37 °C in a humidified chamber at 5% CO2, with medium changes every other day. All cells were digested by 0.25% trypsin with 1 g/L EDTA (HyCloneTM, USA).

2.2. RNA Extraction and qRT-PCR

RNA was extracted from each cell line with BEI-BEI BIOTECH Total RNA Isolation Kit (Zhengzhou, China) and reverse-transcribed by TIANGEN FastKing cDNA Kit (Beijing, China) according to the manufacturer’s instructions. TaKaRa TB Green™ Premix Ex Taq™ (Japan) was used to perform SYBR Green based qRT-PCR experiments. Relative RNA levels were calculated using 2−(∆∆Ct) method. GAPDH was used as internal control.

2.3. Real-Time PCR (RT-PCR) and Sanger Sequencing

Candidate chimeric RNAs were validated by RT-PCR. Specific primer pairs for the candidates were designed, with each primer flanking the junction site. All primers used for detection are listed in Supplementary Table S1. RT-PCR experiments were performed with TaKaRa Taq™ Version 2.0 plus dye (Shiga, Japan). Following RT-PCR and gel electrophoresis, Axygen® AxyPrep DNA Gel Extraction Kit (Corning, NY, USA) was used for DNA purification and followed by Sanger sequencing at Sangon Biotech.

2.4. Cell Fractionation

HEK-293T cells were digested with 0.25% trypsin, washed once with PBS, and then separated into two fractions using NE-PER nuclear and cytoplasmic extraction reagents (Thermo Fisher, Waltham, MA, USA) following the manufacturer’s instructions. RNA from each fraction was extracted separately with BEI-BEI BIOTECH Total RNA Isolation Kit and followed by qRT-PCR with MALAT1 and GAPDH as controls.

2.5. Preparation of Peripheral Blood Buffy Coat

Whole blood samples were obtained from healthy volunteers and stored in EDTA anticoagulant tubes. The blood samples were centrifuged at 1000× g for 10 min at 4 °C, and the supernatant plasma above the buffy coat layer was removed and discarded. About 500 µL of buffy coat layer, comprised of white blood cells and platelets, was extracted from erythrocytes and plasma. Total RNA was extracted from the buffy coat with SIMGEN Blood Total RNA Kit (USA) and followed by reverse-transcription and qRT-PCR as described above.

2.6. RNA-Seq and Bioinformatics Analysis

RNA-seq was implemented by Novogene (Beijing, China). Common eukaryotic transcriptome library was constructed by the magnetic bead enrichment method. mRNA with Poly(A+) was enriched by magnetic beads with Oligo (dT), then the RNA was broken into 250–300 bp short fragments, which were synthesized into cDNA. The purified double stranded cDNA was repaired, and A tail was added. PCR amplification and purification were performed for the construction of the library. Illumina PE150 sequencing (Pair end 150 bp) was performed. Total reads used for comparative analysis were 59,722,918 in HEK-293T, 59,065,200 in HUVEC, and 71,624,388 in LO2, respectively. The total mapped rate to hg19 was 97.07% in in HEK-293T, 97.24% in HUVEC, and 95.86% in LO2, respectively. Then, we used the bioinformatic software tool, SOAPfuse, to analyze the RNA-seq data and obtain putative chimera lists.

3. Results

3.1. Discovery of the Chimeric RNAs in HEK-293T, HUVEC, and LO2 Cell Lines

To discover chimeric RNAs in non-cancerous cells, we selected three cell lines, representing different cell types. HEK-293T is established from human embryonic kidney, HUVEC is a human umbilical vein endothelial cell line, and LO2 is a human hepatocyte line. RNAs extracted from HEK-293T, HUVEC, and LO2 cells were processed and followed by transcriptome sequencing. To detect and identify novel chimeric RNA events, we used the bioinformatic software tool SOAPfuse [33] to analyze the paired-end RNA-seq data and applied the process outlined in Figure 1A. A total of 156, 77, and 120 unique chimeric fusion transcripts were identified from HEK-293T, HUVEC, and LO2, respectively (Figure 1B,C). The Venn diagram shows the similarities/differences of chimeric RNAs among the cell lines. The overlaps between lines are small, suggesting that most fusion transcripts identified here are cell type specific (Figure 1B). Illustrated by Circos plots, it is apparent that the majority of the fusions are intrachromosomal (Figure 1C). Similar to our previous publication [34], we categorized chimeric RNAs according to junction localization relative to parental genes: both sides being known exon/intron boundaries (E/E), both sides falling into the middle of exons (M/M), and one side being exon/intron boundary and the other not (E/M or M/E). In order to reduce the false discovery rate, we filtered out the M/M-type fusion transcripts because of their lower validation rate [8]. After applying this filter, 118, 32, and 68 unique fusions were left in HEK-293T, HUVEC, and LO2, respectively.

3.2. Classifications of Chimeric RNAs

We subsequently examined the landscape of these chimeric RNAs from three angles, and at three different levels, in all three lines (Figure 2A–C). First, we characterized fusions according to chromosomal location of their parental genes: parental genes located on different chromosomes (INTERCHR), neighboring genes on the same strand (Read-Through), and non-neighboring and/or opposite strand genes on the same chromosome (INTRA-Others). For all of the fusions, we found INTERCHR to be the largest group (60% in HUVEC and 46% in LO2) and INTRA-Others to be the least prominent group (18% in HUVEC and 27% in LO2). However, in HEK-293T cells, Read-Through is the most prominent group (57%) and INTERCHR is the least common group (19%). When the M/M-type fusions were filtered out, the INTERCHR group shrunk (1% in HEK-293T, 25% in HUVEC, and 16% in LO2), and Read-Through and INTRA-Others became more abundant (70% and 29% in HEK-293T, 47% and 28% in HUVEC, and 41% and 43% in LO2, respectively).
Secondly, as described above, we categorized chimeric RNAs into M/M, E/M, M/E, and E/E types based on junction location relative to exons of parental genes (Figure 2A–C). Among all the fusions, the most prominent category was E/E fusions: 64% in HEK-293T, while M/M (25%), M/E (8%), and E/M (3%) were much less common in HEK-293T. However, M/M types are more prominent in the other two cell lines (59% in HUVEC and 43% in LO2), while E/E (25% in HUVEC and 41% in LO2), E/M (10% in HUVEC and 8% in LO2), and M/E (6% in HUVEC and 8% in LO2) were much less common. After filtering out the M/M-type, E/E fusions were significantly enriched (85% in HEK-293T, 59% in HUVEC, and 72% in LO2). Interestingly, M/E fusions are always more abundant than E/M fusions in this population of three cell lines.
Lastly, we categorized chimeras according to their reading frames (Figure 2A–C): the known reading frame of the 3′ gene is the same as the 5′ gene (in-frame), the known protein coding sequence of the 3′ gene uses a different reading frame than the 5′ gene (frame-shift), no predicted effect on the reading frame of parental genes (NA) (this category includes chimeric RNAs whose junction sequence fall into an untranslated region, or one or both parental genes is a lncRNA), or a very small population of fusions fell into the “both” category, which could be in-frame or frame-shift, depending on the alternative splicing isoforms of the parental genes. When all the fusions were examined, we found the most common classification to be NA (47% in HEK-293T, 71% in HUVEC, and 65% in LO2), while frame-shift (36% in HEK-293T, 8% in HUVEC, and 23% in LO2) and in-frame (15% in HEK-293T, 21% in HUVEC, and 10% in LO2) fusions were much less common, while the “both” category contained only 2% in HEK-293T and LO2. Removal of M/M-type fusions seemed to enrich in-frame (18% in HEK-293T, 22% in HUVEC, and no change in LO2) and frame-shift (41% in HEK-293T, 13% in HUVEC, and 26% in LO2) classifications, while reducing fusions with NA classification (39% in HEK-293T, 65% in HUVEC, and 63% in LO2).

3.3. Validation of the Chimeric RNAs in HEK-293T, HUVEC, and LO2 Cell Lines

Based on the fusion types (Read-Through, INTRA-Others, and INTERCHR), we chose 18, 8, and 5 candidate chimeric RNAs from Read-Through, INTRA-Others, and INTERCHR groups, respectively, for validation (Table 1). To do so, primers annealing to parental genes and flanking the fusion junction site were designed. Sanger sequencing was used after RT-PCR to confirm the sequence on both sides of the junction site in three cell lines (Figure 3A–D and Supplementary Figures S1 and S2). Ten and six candidates were validated from Read-Through and INTRA-Others, respectively (Table 1, and Supplementary Table S2). Of these, two forms for RAPH1-OLA1 (Figure 3C and Figure S2), RAD51AP1-DYRK4 (Figure 3D and Figure S1), and TPD5212-DNAJC5 were validated in both HEK-293T and HUVEC (Figure 3A). However, only one chimeric RNA, MLLT1-PFKP, was confirmed from the INTERCHR group (Figure 3D). Similar to our previous research, the validation rate of chimeric RNAs from INTERCHR group is lower than that of chimeric RNAs from Read-Through and INTRA-Others groups.

3.4. Expression of the Candidate Chimeric RNAs in HEK-293T, HUVEC, and LO2 Cell Lines

To quantify the chimeric RNAs discovered, we subsequently measured the expression of six novel chimeric RNAs among these three cell lines by qRT-PCR (Figure 4A). We chose three Read-Through chimeric RNAs (SUMO3-UBE2G2, MSANTD3-TMEFF1, and RAD51AP1-DYRK4), two INTRA-Others (TPD5112-DNAJC5 and RAPH1-OLA1), and one INTERCHR (MLLT1-PFKP). All of the chimeric RNAs from the Read-Through and INTRA-Others groups were detected in these three cells, with SUMO3-UBE2G2 and MSANTD3-TMEFF1 expressed significantly higher in HEK-293T and RAPH1-OLA1 expressed significantly higher in HUVEC. The INTERCHR chimeric RNA, MLLT1-PFKP, was not found in HEK-293T or HUVEC (Figure 4B).
To gain more insight into the protein coding role of the candidate chimeric RNAs, a fractionation experiment was performed to separate nuclear from cytoplasmic RNAs in HEK-293T. Different from regular protein-coding mRNAs, it has been reported that many lncRNAs reside in the nucleus to regulate transcription [35,36,37]. Since expression of RAPH1-OLA1 is relatively low, we focused on the examination of the other four fusions. Based on bioinformatics analysis, MSANTD3-TMEFF1 is in-frame, RAD51AP1-DYRK4, and TPD5112-DNAJC5 are frame-shift, and SUMO3-UBE2G2 is NA, according to their reading frames which are described above. As seen in Figure 4C, a classic protein-coding gene, GAPDH, was enriched in the cytoplasmic fraction, while a classic long non-coding RNA, MALAT1, was found predominately in the nuclear fraction. All of these four fusions showed close to one ratio of nuclear vs. cytoplasmic fraction, making them candidates for both categories (Figure 4C). These findings also suggest that assessing protein-coding potential based on the bioinformatic prediction Illumina platform short sequencing may be error-prone. Such prediction would be more reliable with full length sequencing platforms.

3.5. Expression of the Candidate Chimeric RNAs in Other Cells/Tissues

To examine the candidate chimeric RNAs in other cells, we performed qRT-PCR for the four fusions in multiple esophageal and prostate cell lines. All four chimeric RNAs were detected in LNCaP and 22Rv1 (prostate cancer cell lines), and RWPE-1 and WPMY-1 (non-cancer prostate epithelial cell lines). They were also detected in KYSE-140, EC1, and Eca-109 (esophageal cancer cell lines), and HEEC, HET-1A, and SHEE (non-cancer esophageal cell lines) (Figure 5A). No significant differences were found between cancer and non-cancer cells.
Lastly, we also examined the expression of these four chimeric RNAs in the buffy coat of three healthy donors. Buffy coat contains various white blood cells and platelets [38]. We extracted RNA from the buffy coat, and conducted qRT-PCR. SUMO3-UBE2G2, MSANTD3-TMEFF1, RAD51AP1-DYRK4, and TPD5112-DNAJC5 were found in all samples (Figure 5B), supporting that the expression of these chimeric RNAs are more ubiquitous.

4. Discussion and Conclusions

Chimeric RNAs caused by chromosomal rearrangements are well known cancer-causing genetic events and are actively used in clinical cancer diagnoses. Some fusion products, such as BCR-ABL and EML-ALK4, have also shown to be effective targets of directed therapy [39,40]. However, there has been increasing evidence of chimeric RNAs in non-cancer tissues and cells [18,20,31,41]. Recent work on RNA trans-splicing [18,32] and cis-splicing between adjacent genes [8,9,42] have defined a new exemplification for alternative splicing mechanisms, which can also generate chimeric RNAs.
The wide application of RNA-Sequencing has resulted an explosion of newly identified chimeric RNAs. However, it is important to realize that not all chimeric RNAs detected in cancer cells or tissues are cancer specific. Here, we examined the chimeric RNA profile in three different types of non-cancer cells, and uncovered hundreds of chimeric RNAs. Our findings are consistent with the recent realization that chimeric RNAs are not exclusive to cancer cells, and sound alarms for the assumption that all chimeric RNAs identified in cancer are potentially cancer specific biomarkers. This is important as more and more chimeric RNAs are being deposited into Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer.
In this study, we selected SOAPfuse software to perform paired-end RNA sequencing data analysis because of its high validation rate [8,43]. From the software analysis outcomes, we found small overlaps between three cell lines, supporting that the majority of fusions tend to exist in unique cell types. We performed analyses on the landscape of these chimeric RNAs on three levels and from three angles. After filtering out M/M fusions, the frequency of category INTERCHR shrank significantly. Our previous study also showed that this category of chimeric RNAs tends to have a lower validation rate. Therefore, we decided to omit them from experimental verifications. We randomly selected 31 chimeric candidates from different fusion types (Read-Through, INTRA-Others, and INTERCHR) and confirmed 17 of them. Indeed, we had higher validation for Read-Through and INTRA-Others, with only one INTERCHR chimeric RNA, MLLT1-PFKP, verified in LO2. Given the lower validation rate of INTERCHR, it is possible that a subset of them are false positives. Based on bioinformatics analysis, we chose six fusions with significant differences in three cell lines. We noticed that the expression detected by qRT-PCR is not always consistent with the bioinformatic analysis from RNA-Seq data, highlighting the different sensitivity of PCR and sequencing, and the limitation of bioinformatic tools.
In conclusion, instead of being unique features of cancer, chimeric RNAs are widely spread in non-cancer cells. Their ubiquitous expression in many different cells/tissues support them as a set of functional entities involved in fundamental cellular mechanisms common to many cells. Furthermore, tissue specific chimeric RNAs may carry unique functions relevant to the physiological role of the tissue. We believe that chimeric RNAs provide an additional way for the functional genome to expand without an increase in the number of genes. Further study on their function is needed to support this idea.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/genes12040466/s1, Table S1. All primers used for RT-PCR. Table S2. Read counts for the validated chimeric RNAs. Figure S1. Identification of chimeric RNA candidates. Figure S2. Identification of chimeric RNA candidates.

Author Contributions

F.Q., Y.T., and H.L. designed the study, and interpreted the data. C.C. and F.Q. performed, analyzed, and interpreted the data. S.H. generated circos plots. C.C., F.Q., Y.T., and H.L. contributed to the writing and editing of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

Y.T. is supported by Key R&D and Promotion Project (No.202102310097) in Henan Province, China. H.L. is supported by NIGMS grant GM132138.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article and Supplementary Material.

Acknowledgments

We thank Amy Yang and Sarah Lynch for English editing of the manuscript.

Conflicts of Interest

The authors declare no conflict of interests.

References

  1. Asmann, Y.W.; Necela, B.M.; Kalari, K.R.; Hossain, A.; Baker, T.R.; Carr, J.M.; Davis, C.; Getz, J.E.; Hostetter, G.; Li, X.; et al. Detection of redundant fusion transcripts as biomarkers or disease-specific therapeutic targets in breast cancer. Cancer Res. 2012, 72, 1921–1928. [Google Scholar] [CrossRef] [PubMed]
  2. Cools, J.; DeAngelo, D.J.; Gotlib, J.; Stover, E.H.; Legare, R.D.; Cortes, J.; Kutok, J.; Clark, J.; Galinsky, I.; Griffin, J.D.; et al. A tyrosine kinase created by fusion of the PDGFRA and FIP1L1 genes as a therapeutic target of imatinib in idiopathic hypereosinophilic syndrome. N. Engl. J. Med. 2003, 348, 1201–1214. [Google Scholar] [CrossRef] [PubMed]
  3. Løvf, M.; Nome, T.; Bruun, J.; Eknaes, M.; Bakken, A.C.; Mpindi, J.P.; Kilpinen, S.; Rognum, T.O.; Nesbakken, A.; Kallioniemi, O.; et al. A novel transcript, VNN1-AB, as a biomarker for colorectal cancer. Int. J. Cancer 2014, 135, 2077–2084. [Google Scholar] [CrossRef] [PubMed]
  4. Yang, Z.; Yu, L.; Wang, Z. PCA3 and TMPRSS2-ERG gene fusions as diagnostic biomarkers for prostate cancer. Chin. J. Cancer Res. 2016, 28, 65–71. [Google Scholar] [PubMed]
  5. Rowley, J.D. A new consistent chromosomal abnormality in chronic myelogenous leukaemia identified by quinacrine fluorescence and Giemsa staining. Nature 1973, 243, 290–293. [Google Scholar] [CrossRef]
  6. Tomlins, S.A.; Laxman, B.; Varambally, S.; Cao, X.; Yu, J.; Helgeson, B.E.; Cao, Q.; Prensner, J.R.; Rubin, M.A.; Shah, R.B.; et al. Role of the TMPRSS2-ERG Gene Fusion in Prostate Cancer. Neoplasia 2008, 10, 177–188, IN1–IN9. [Google Scholar] [CrossRef]
  7. Tomlins, S.A.; Rhodes, D.R.; Perner, S.; Dhanasekaran, S.M.; Mehra, R.; Sun, X.W.; Varambally, S.; Cao, X.; Tchinda, J.; Kuefer, R.; et al. Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. Science 2005, 310, 644–648. [Google Scholar] [CrossRef]
  8. Qin, F.; Song, Z.; Babiceanu, M.; Song, Y.; Facemire, L.; Singh, R.; Adli, M.; Li, H. Discovery of CTCF-sensitive Cis-spliced fusion RNAs between adjacent genes in human prostate cells. PLoS Genet. 2015, 11, e1005001. [Google Scholar]
  9. Zhang, Y.; Gong, M.; Yuan, H.; Park, H.G.; Frierson, H.F.; Li, H. Chimeric Transcript Generated by cis-Splicing of Adjacent Genes Regulates Prostate Cancer Cell Proliferation. Cancer Discov. 2012, 2, 598–607. [Google Scholar] [CrossRef] [PubMed]
  10. Kumar-Sinha, C.; Kalyana-Sundaram, S.; Chinnaiyan, A.M. SLC45A3-ELK4 Chimera in Prostate Cancer: Spotlight on cis-Splicing: Figure 1. Cancer Discov. 2012, 2, 582–585. [Google Scholar] [CrossRef]
  11. Qin, F.; Zhang, Y.; Liu, J.; Li, H. SLC45A3-ELK4 functions as a long non-coding chimeric RNA. Cancer Lett. 2017, 404, 53–61. [Google Scholar] [CrossRef] [PubMed]
  12. Qin, F.; Song, Z.; Chang, M.; Song, Y.; Frierson, H.; Li, H. Recurrent cis-SAGe chimeric RNA, D2HGDH-GAL3ST2, in prostate cancer. Cancer Lett. 2016, 380, 39–46. [Google Scholar] [CrossRef]
  13. Wu, P.; Yang, S.; Singh, S.; Qin, F.; Kumar, S.; Wang, L.; Ma, D.; Li, H. The Landscape and Implications of Chimeric RNAs in Cervical Cancer. EBioMedicine 2018, 37, 158–167. [Google Scholar] [CrossRef] [PubMed]
  14. Wu, H.; Singh, S.; Xie, Z.; Li, X.; Li, H. Landscape characterization of chimeric RNAs in colorectal cancer. Cancer Lett. 2020, 489, 56–65. [Google Scholar] [CrossRef]
  15. Wang, L.; Xiong, X.; Yao, Z.; Zhu, J.; Lin, Y.; Lin, W.; Li, K.; Xu, X.; Guo, Y.; Chen, Y.; et al. Chimeric RNA ASTN2-PAPPAas aggravates tumor progression and metastasis in human esophageal cancer. Cancer Lett. 2021, 501, 1–11. [Google Scholar] [CrossRef]
  16. Chase, A.; Ernst, T.; Fiebig, A.; Collins, A.; Grand, F.; Erben, P.; Reiter, A.; Schreiber, S.; Cross, N.C. TFG, a target of chromosome translocations in lymphoma and soft tissue tumors, fuses to GPR128 in healthy individuals. Haematologica 2009, 95, 20–26. [Google Scholar] [CrossRef] [PubMed]
  17. Wu, C.-S.; Yu, C.-Y.; Chuang, C.-Y.; Hsiao, M.; Kao, C.-F.; Kuo, H.-C.; Chuang, T.-J. Integrative transcriptome sequencing identifies trans-splicing events with important roles in human embryonic stem cell pluripotency. Genome Res. 2014, 24, 25–36. [Google Scholar] [CrossRef] [PubMed]
  18. Yuan, H.; Qin, F.; Movassagh, M.; Park, H.; Golden, W.L.; Xie, Z.; Zhang, P.; Sklar, J.; Li, H. A Chimeric RNA Characteristic of Rhabdomyosarcoma in Normal Myogenesis Process. Cancer Discov. 2013, 3, 1394–1403. [Google Scholar] [CrossRef] [PubMed]
  19. Ma, L.; Yang, S.; Zhao, W.; Tang, Z.; Zhang, T.; Li, K. Identification and analysis of pig chimeric mRNAs using RNA sequencing data. BMC Genom. 2012, 13, 429. [Google Scholar] [CrossRef]
  20. Babiceanu, M.; Qin, F.; Xie, Z.; Jia, Y.; Lopez, K.; Janus, N.; Facemire, L.; Kumar, S.; Pang, Y.; Qi, Y.; et al. Recurrent chimeric fusion RNAs in non-cancer tissues and cells. Nucleic Acids Res. 2016, 44, 2859–2872. [Google Scholar] [CrossRef] [PubMed]
  21. Ren, G.; Zhang, Y.; Mao, X.; Liu, X.; Mercer, E.; Marzec, J.; Ding, D.; Jiao, Y.; Qiu, Q.; Sun, Y.; et al. Transcription-Mediated Chimeric RNAs in Prostate Cancer: Time to Revisit Old Hypothesis? OMICS J. Integr. Biol. 2014, 18, 615–624. [Google Scholar] [CrossRef]
  22. Yoshihara, K.; Wang, Q.; Torres-Garcia, W.; Zheng, S.; Vegesna, R.; Kim, H.; Verhaak, R.G.W. The landscape and therapeutic relevance of cancer-associated transcript fusions. Oncogene 2015, 34, 4845–4854. [Google Scholar] [CrossRef] [PubMed]
  23. Stransky, N.; Cerami, E.; Schalm, S.; Kim, J.L.; Lengauer, C. The landscape of kinase fusions in cancer. Nat. Commun. 2014, 5, 4846. [Google Scholar] [CrossRef]
  24. Finta, C.; Zaphiropoulos, P.G. Intergenic mRNA molecules resulting from trans-splicing. J. Biol. Chem. 2002, 277, 5882–5890. [Google Scholar] [CrossRef] [PubMed]
  25. Balamurali, D.; Gorohovski, A.; Detroja, R.; Palande, V.; Raviv-Shay, D.; Frenkel-Morgenstern, M. ChiTaRS 5.0: The compre-hensive database of chimeric transcripts matched with druggable fusions and 3D chromatin maps. Nucleic Acids Res. 2002, 48, D825–D834. [Google Scholar] [CrossRef]
  26. Levin, J.Z.; Berger, M.F.; Adiconis, X.; Rogov, P.; Melnikov, A.; Fennell, T.; Nusbaum, C.; Garraway, L.A.; Gnirke, A. Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts. Genome Biol. 2009, 10, R115. [Google Scholar] [CrossRef]
  27. Maher, C.A.; Kumar-Sinha, C.; Cao, X.; Kalyana-Sundaram, S.; Han, B.; Jing, X.; Sam, L.; Barrette, T.; Palanisamy, N.; Chinnaiyan, A.M. Transcriptome sequencing to detect gene fusions in cancer. Nat. Cell Biol. 2009, 458, 97–101. [Google Scholar] [CrossRef]
  28. Zhao, Q.; Caballero, O.L.; Levy, S.; Stevenson, B.J.; Iseli, C.; de Souza, S.J.; Galante, P.A.; Busam, D.; Leversha, M.A.; Chadalavada, K.; et al. Transcriptome-guided characterization of genomic re-arrangements in a breast cancer cell line. Proc. Natl. Acad. Sci. USA 2009, 106, 1886–1891. [Google Scholar] [CrossRef]
  29. The GTEx Consortium. The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans. Science 2015, 348, 648–660. [Google Scholar] [CrossRef] [PubMed]
  30. The GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 2013, 45, 580–585. [Google Scholar] [CrossRef]
  31. Singh, S.; Qin, F.; Kumar, S.; Elfman, J.; Lin, E.; Pham, L.-P.; Yang, A.; Li, H. The landscape of chimeric RNAs in non-diseased tissues and cells. Nucleic Acids Res. 2020, 48, 1764–1778. [Google Scholar] [CrossRef] [PubMed]
  32. Li, H.; Wang, J.; Mor, G.; Sklar, J. A Neoplastic Gene Fusion Mimics Trans-Splicing of RNAs in Normal Human Cells. Science 2008, 321, 1357–1361. [Google Scholar] [CrossRef]
  33. Jia, W.; Qiu, K.; He, M.; Song, P.; Zhou, Q.; Zhou, F.; Yu, Y.; Zhu, D.; Nickerson, M.L.; Wan, S.; et al. SOAPfuse: An algorithm for identifying fusion transcripts from paired-end RNA-Seq data. Genome Biol. 2013, 14, R12. [Google Scholar] [CrossRef] [PubMed]
  34. Zhu, D.; Singh, S.; Chen, X.; Zheng, Z.; Huang, J.; Lin, T.; Li, H. The landscape of chimeric RNAs in bladder urothelial carcinoma. Int. J. Biochem. Cell Biol. 2019, 110, 50–58. [Google Scholar] [CrossRef]
  35. Brown, C.J.; Hendrich, B.D.; Rupert, J.L.; Lafrenière, R.G.; Xing, Y.; Lawrence, J.; Willard, H.F. The human XIST gene: Analysis of a 17 kb inactive X-specific RNA that contains conserved repeats and is highly localized within the nucleus. Cell 1992, 71, 527–542. [Google Scholar] [CrossRef]
  36. Hutchinson, J.N.; Ensminger, A.W.; Clemson, C.M.; Lynch, C.R.; Lawrence, J.B.; Chess, A. A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains. BMC Genom. 2007, 8, 39. [Google Scholar] [CrossRef]
  37. Sone, M.; Hayashi, T.; Tarui, H.; Agata, K.; Takeichi, M.; Nakagawa, S. The mRNA-like noncoding RNA Gomafu constitutes a novel nuclear domain in a subset of neurons. J. Cell Sci. 2007, 120, 2498–2506. [Google Scholar] [CrossRef] [PubMed]
  38. Teetson, W.; Cartwright, C.; Dreiling, B.J.; Steinberg, M.H. The Leukocyte Composition of Peripheral Blood Buffy Coat. Am. J. Clin. Pathol. 1983, 79, 500–501. [Google Scholar] [CrossRef]
  39. Druker, B.J. Current treatment approaches for chronic myelogenous leukemia. Cancer J. 2001, 7, 14–18. [Google Scholar]
  40. Shaw, A.T.; Ou, S.-H.I.; Bang, Y.-J.; Camidge, D.R.; Solomon, B.J.; Salgia, R.; Riely, G.J.; Varella-Garcia, M.; Shapiro, G.I.; Costa, D.B.; et al. Crizotinib in ROS1-Rearranged Non–Small-Cell Lung Cancer. N. Engl. J. Med. 2014, 371, 1963–1971. [Google Scholar] [CrossRef]
  41. Jividen, K.; Li, H. Chimeric RNAs generated by intergenic splicing in normal and cancer cells. Genes Chromosom. Cancer 2014, 53, 963–971. [Google Scholar] [CrossRef] [PubMed]
  42. Chwalenia, K.; Qin, F.; Singh, S.; Li, H. A cell-based splicing reporter system to identify regulators of cis-splicing between adjacent genes. Nucleic Acids Res. 2019, 47, e24. [Google Scholar] [CrossRef] [PubMed]
  43. Liu, S.; Tsai, W.-H.; Ding, Y.; Chen, R.; Fang, Z.; Huo, Z.; Kim, S.; Ma, T.; Chang, T.-Y.; Priedigkeit, N.M.; et al. Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data. Nucleic Acids Res. 2016, 44, e47. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The landscape of chimeric RNAs in HEK-293T, HUVEC, and LO2 cell lines. (A) The pipeline for discovering chimeric RNAs. Our own RNA sequencing data was used for analysis. (B) Venn diagram summarizing the shared and unique chimeric RNAs that were discovered in HEK-293T, HUVEC, or LO2 cell lines. (C) Circos plot depicting chimeric RNAs discovered from three cell lines. The fused genes are illustrated here as a line that connects two parental genes.
Figure 1. The landscape of chimeric RNAs in HEK-293T, HUVEC, and LO2 cell lines. (A) The pipeline for discovering chimeric RNAs. Our own RNA sequencing data was used for analysis. (B) Venn diagram summarizing the shared and unique chimeric RNAs that were discovered in HEK-293T, HUVEC, or LO2 cell lines. (C) Circos plot depicting chimeric RNAs discovered from three cell lines. The fused genes are illustrated here as a line that connects two parental genes.
Genes 12 00466 g001
Figure 2. Classifications of the chimeric RNAs in HEK-293T, HUVEC, and LO2 cell lines. Pie charts showing the different distribution of chimeric RNAs in types of chimeras of HEK-293T (A), HUVEC (B), and LO2 (C) cell lines based on parental gene location (left), E/M categories (middle), and fusion protein coding potential (right). The distribution of chimeric RNAs was examined at two stages along our filtering pipeline: all chimeric RNAs and after removal of M/M (Non-M/M). When the criteria of “non-M/M” was applied, more E/E, Read-Through, and in-frame fusions were enriched.
Figure 2. Classifications of the chimeric RNAs in HEK-293T, HUVEC, and LO2 cell lines. Pie charts showing the different distribution of chimeric RNAs in types of chimeras of HEK-293T (A), HUVEC (B), and LO2 (C) cell lines based on parental gene location (left), E/M categories (middle), and fusion protein coding potential (right). The distribution of chimeric RNAs was examined at two stages along our filtering pipeline: all chimeric RNAs and after removal of M/M (Non-M/M). When the criteria of “non-M/M” was applied, more E/E, Read-Through, and in-frame fusions were enriched.
Genes 12 00466 g002
Figure 3. Identification and profiling of Read-Through, INTRA-Others, and INTERCHR chimeric RNA candidates. RT-PCR and Sanger sequencing for candidate chimeric RNAs. (A) Gel images of RT-PCR products of eight fusions in HEK-293T, six in HUVEC, and four in LO2 are shown. Structures of one Read-Through, one INTRA-Others and one INTERCHR chimeric RNA and Sanger sequencing validations in HEK-293T (B), HUVEC (C), and LO2 (D). Blocks indicate exons while lines indicate introns and the intergenic region. The fusion junction sites are highlighted as red lines.
Figure 3. Identification and profiling of Read-Through, INTRA-Others, and INTERCHR chimeric RNA candidates. RT-PCR and Sanger sequencing for candidate chimeric RNAs. (A) Gel images of RT-PCR products of eight fusions in HEK-293T, six in HUVEC, and four in LO2 are shown. Structures of one Read-Through, one INTRA-Others and one INTERCHR chimeric RNA and Sanger sequencing validations in HEK-293T (B), HUVEC (C), and LO2 (D). Blocks indicate exons while lines indicate introns and the intergenic region. The fusion junction sites are highlighted as red lines.
Genes 12 00466 g003
Figure 4. Expression of candidate chimeric RNAs in HEK-293T, HUVEC, and LO2 cell lines. (A) qRT-PCR measuring five chimeric RNAs in HEK-293T, HUVEC, and LO2 cells. The levels of various transcripts were normalized to that of GAPDH, and further normalized to that in HEK-293T. (B) Gel image of RT-PCR products of MLLT1-PFKP in 13 cell lines. (C) qRT-PCR measuring the expression of four chimeric RNAs in cell nuclei and cytoplasm. HEK-293T cells were fractioned into nuclear and cytoplasmic parts. GAPDH and known long non-coding RNA MALAT1 were used as controls. Ratios of expression in nuclear and cytoplasm parts were plotted.
Figure 4. Expression of candidate chimeric RNAs in HEK-293T, HUVEC, and LO2 cell lines. (A) qRT-PCR measuring five chimeric RNAs in HEK-293T, HUVEC, and LO2 cells. The levels of various transcripts were normalized to that of GAPDH, and further normalized to that in HEK-293T. (B) Gel image of RT-PCR products of MLLT1-PFKP in 13 cell lines. (C) qRT-PCR measuring the expression of four chimeric RNAs in cell nuclei and cytoplasm. HEK-293T cells were fractioned into nuclear and cytoplasmic parts. GAPDH and known long non-coding RNA MALAT1 were used as controls. Ratios of expression in nuclear and cytoplasm parts were plotted.
Genes 12 00466 g004
Figure 5. Expression of candidate chimeric RNAs in other cells/tissues. (A) qRT-PCR measuring the expression of four chimeric RNAs in various esophageal and prostate cell lines. Blue bars indicate cancer cell lines and green bars indicate non-cancer cell lines. The levels of various transcripts were normalized to that of GAPDH and further normalized to SHEE. (B) qRT-PCR measuring the expression of four chimeric RNAs in three blood samples (BC-1, BC-2, BC-3). The levels of various transcripts were normalized to that of GAPDH and further normalized to BC-1.
Figure 5. Expression of candidate chimeric RNAs in other cells/tissues. (A) qRT-PCR measuring the expression of four chimeric RNAs in various esophageal and prostate cell lines. Blue bars indicate cancer cell lines and green bars indicate non-cancer cell lines. The levels of various transcripts were normalized to that of GAPDH and further normalized to SHEE. (B) qRT-PCR measuring the expression of four chimeric RNAs in three blood samples (BC-1, BC-2, BC-3). The levels of various transcripts were normalized to that of GAPDH and further normalized to BC-1.
Genes 12 00466 g005
Table 1. Identification of the fusions. Summary of 31 candidate chimeric RNAs. There are two forms for chimeric RNA RAPH1-OLA1 and RAD51AP1-DYRK4. Sanger sequencing confirmed 17 chimeric RNAs noted with *.
Table 1. Identification of the fusions. Summary of 31 candidate chimeric RNAs. There are two forms for chimeric RNA RAPH1-OLA1 and RAD51AP1-DYRK4. Sanger sequencing confirmed 17 chimeric RNAs noted with *.
Fusion GenesSanger SequencingNeighboring GenesInterchromosomal
ARL16-OXLD1NYN
RAD51AP1-DYRK4-1 *YYN
D2HGDHNYN
SUMO3-UBE2G2 *YYN
DMKN-KRTDAP *YYN
DPM2-PIP5KL1NYN
MORF4L2-AS1-TMEM31 *YYN
APEH-RNF123NYN
MFGE8-HAPLN3NYN
TBC1D23-NIT2 *YYN
ZHX1-C8orf76 *YYY
TPD52L2-DNAJC5 *YNN
TBC1D24-ATP6VOC *YYN
MSANTD3-TMEFF1 *YYY
SMOX-LINC012433 *YYN
CLTC-VMP1 *YNN
RAPH1-OLA1-1 *YNN
RAPH1-OLA1-2 *YNN
RAD51AP1-DYRK4-2 *YYY
TLCD-RWDD3NYN
FARSA-SUCE2 *YYN
VAMP1-CD27-AS1NYN
ZNF674-AS1-CHST7NYN
SUMF1-BHLHE40-AS1 *YNN
TPCN2-SMIM38NNN
UBXN2A-MFSD2BNNN
MLLT1-PFKP *YNY
ASB16-AS1-PHF13NNY
POLDIP2-TNRC18NNY
CCDC32-CBX3NNY
BMERB1-CHP1NNY
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, C.; Haddox, S.; Tang, Y.; Qin, F.; Li, H. Landscape of Chimeric RNAs in Non-Cancerous Cells. Genes 2021, 12, 466. https://doi.org/10.3390/genes12040466

AMA Style

Chen C, Haddox S, Tang Y, Qin F, Li H. Landscape of Chimeric RNAs in Non-Cancerous Cells. Genes. 2021; 12(4):466. https://doi.org/10.3390/genes12040466

Chicago/Turabian Style

Chen, Chen, Samuel Haddox, Yue Tang, Fujun Qin, and Hui Li. 2021. "Landscape of Chimeric RNAs in Non-Cancerous Cells" Genes 12, no. 4: 466. https://doi.org/10.3390/genes12040466

APA Style

Chen, C., Haddox, S., Tang, Y., Qin, F., & Li, H. (2021). Landscape of Chimeric RNAs in Non-Cancerous Cells. Genes, 12(4), 466. https://doi.org/10.3390/genes12040466

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop