Next Article in Journal
Non-Coding RNAs in Rheumatoid Arthritis: Implications for Biomarker Discovery
Previous Article in Journal
Identification of Two Exosomal miRNAs in Circulating Blood of Cancer Patients by Using Integrative Transcriptome and Network Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Expanded Landscape of Unusually Short RNAs in 11 Samples from Six Eukaryotic Organisms

1
CHU de Québec-Université Laval Research Center/CHUL Pavilion, 2705 Blvd. Laurier, Quebec City, QC G1V 4G2, Canada
2
Department of Microbiology, Infectious Diseases and Immunology, Faculty of Medicine, Université Laval, Quebec City, QC G1V 0A6, Canada
3
Department of Plant Sciences, Faculty of Agricultural and Food Sciences, Université Laval, Quebec City, QC G1V 0A6, Canada
*
Author to whom correspondence should be addressed.
Non-Coding RNA 2022, 8(3), 34; https://doi.org/10.3390/ncrna8030034
Submission received: 22 February 2022 / Revised: 9 May 2022 / Accepted: 16 May 2022 / Published: 19 May 2022
(This article belongs to the Section Small Non-Coding RNA)

Abstract

:
Small RNA sequencing (sRNA-Seq) approaches unveiled sequences derived from longer non-coding RNAs, such as transfer RNA (tRNA) and ribosomal RNA (rRNA) fragments, known as tRFs and rRFs, respectively. However, rRNAs and RNAs shorter than 16 nt are often depleted from library preparations/sequencing analyses, although they may be functional. Here, we sought to obtain a complete repertoire of small RNAs by sequencing the total RNA from 11 samples of 6 different eukaryotic organisms, from yeasts to human, in an extended 8- to 30-nt window of RNA length. The 8- to 15-nt window essentially contained fragments of longer non-coding RNAs, such as microRNAs, PIWI-associated RNAs (piRNAs), small nucleolar RNAs (snoRNAs), tRNAs and rRNAs. Notably, unusually short RNAs < 16 nt were more abundant than those >16 nt in bilaterian organisms. A new RT-qPCR method confirmed that two unusually short rRFs of 12 and 13 nt were more overly abundant (~3-log difference) than two microRNAs. We propose to not deplete rRNA and to reduce the lower threshold of RNA length to include unusually short RNAs in sRNA-Seq analyses and datasets, as their abundance and diversity support their potential role and importance as biomarkers of disease and/or mediators of cellular function.

1. Introduction

Studies in the 1990s related to small non-coding RNAs (sRNAs) [1,2] were scarce and often defined them as junk RNAs, non-functional RNAs, or degradation products [3]. This dogma was firmly anchored in scientific debates, leading to a span of half a century between their first observation and the discovery of functional 19 to 24 nucleotide (nt) microRNAs (miRNAs) [4,5]. The latter discoveries have prompted researchers to look beyond and to challenge currently accepted beliefs and dogma.
Nowadays, several sRNA families, such as PIWI-associated RNAs (piRNAs) [6], small nucleolar RNAs (snoRNAs) [7], or cytoplasmic RNAs (Y RNAs) [8] have been reported and their functions deciphered. Among them, the control of gene expression [9], the induction of RNA chemical modifications [10], or the maintenance of genome integrity [11] have been demonstrated.
Recently, a plethora of studies, made possible by the use and the development of high-throughput sequencing (HTS) [12], have identified small RNA fragments derived from longer RNAs [13,14]. Accumulating evidence suggests that these RNA fragments are themselves physiologically relevant in health and disease [15,16]. Those were named according to the RNA from which they derived: transfer RNA (tRNA) derived fragments (tRFs), snoRNA-derived RNAs (sdRNAs) [17], and ribosomal RNA fragments (rRFs) [18]. Although nothing is known regarding Y-RNA fragments (YsRNA) function [19], tRFs have been described as regulators of gene expression [20,21], whereas sdRNAs are involved in the regulation of alternative splicing events [22], messenger RNA (mRNA) stability [23], or translation [24].
Small RNA sequencing (sRNA-Seq) approaches are designed to determine sRNA sequences and their relative abundance, which depend on the method used for library generation [25,26]. For the past decade, sRNA library preparation has involved a routine, standardized size selection of sRNAs by gel excision according to their length, and it essentially centered on that of microRNAs, i.e., ranging from 16 to 30 nt [27]. sRNAs shorter than 16 nt are thus systematically excluded from sRNA-Seq studies, the reason being that these sequences are too short to be functional or mapped reliably to the genome, and they unnecessarily increase the background noise, blurring downstream analyses of more important sequences.
Although some sequences shorter than 16 nt are degradation products, several might still be functional. Our laboratory previously reported the serendipitous discovery of a 12 nt semi-microRNA (smiRNAs) able to prevent the repression induced by the microRNA from which it originated [28], suggesting that unusually short RNAs may conceal interesting functionalities and that their study may shed light on new regulatory sequences. In addition, by-products of stable degradation may be useful as diagnostic or prognostic markers in pathologies, as with YsRNAs in cancer for instance [8,19].
The limitations we are facing in small RNA research are those we impose on ourselves and the technologies we are using rather than the technologies themselves. To challenge the current dogma of restraining our view and interest to RNA species longer than 16 nt, we lowered the 16-nt threshold to 8-nt, which is still arbitrary, while keeping the upper 30-nt limit.
The focus of the present study was to gather evidence about the differential enrichment of the small RNAs in the 8- to 30-nt range and to further highlight the existence of 8- to 15-nt RNA species, by sRNA-Seq analysis of 11 different biological samples from 6 different species, from yeast to human, thereby providing an unprecedented view of unusually short RNAs in eukaryotes.
We have pursued the analysis of some of the results we have obtained and recently reported the identification of a new family of unusually short RNAs mapping to ribosomal RNA 5.8S, which we named dodecaRNAs (doRNAs), according to the number of core nucleotides (12 nt) their members contain [29].
We hope that sharing our approach, findings, and complete datasets of eukaryotic RNA species between 8 and 30 nt in length, especially those in the 8- to 15-nt window, will stimulate research in that new area of unusually short RNAs.

2. Results and Discussion

2.1. Identification of Highly Abundant Unusually Short RNAs

Analysis of the reads distributed by sequence length unveiled new RNA abundance peaks below the standard 16-nt threshold. We observed a major peak at 12 and/or 13 nt, in all analyzed samples, except in A. thaliana (Figure 1A–E). In human samples, the average of 70,000 RPM was observed for the 12-nt RNAs and 350,000 RPM for the 13-nt RNAs, so the 13-nt RNAs were ~5 times more abundant than those of 12 nt. The same abundance difference was seen in yeast samples (Figure 1A,E). Equivalent abundance of 12- and 13-nt sequences was observed in mouse (Figure 1B) and plant (Figure 1D) samples. The opposite trend was seen in D. melanogaster, in which 12-nt RNAs (437,760 RPM) were more abundant than 13-nt RNAs (155,602 RPM; Figure 1C).
Moreover, we noted small peaks of 16-nt plant sequences (25,412 RPM; Figure 1D), of 18-nt mouse sequences (Figure 1B), and major peaks of 17- and 18-nt in human samples (Figure 1A). These latter peaks remained 2.4-fold smaller than 13-nt RNAs. In addition, human 12-nt RNAs were (from 20% to 80%, depending on the sample) less abundant than 17- or 18-nt RNAs, except in HUVEC, where they were 1.5 times more abundant than the 17-nt RNAs (Figure 1A). Although 13-nt RNAs in HUVEC and HEK293 cells were 6.7 and 5.1 times more abundant than 17-nt RNAs, those 17-nt RNAs were more abundant than 13-nt RNAs in human PMN (Figure 1A).
The 17- to 19-nt peak (Figure 1) observed in human samples is also present in mouse samples, albeit at a much lower level. These 17- to 19-nt long RNAs mainly comprise sdRNAs, tRFs, and rRFs (Figure 2 and Figure 3). In human samples, the 17-nt RNAs seem to be constituted of a unique sequence originating from SNORD30 (Table 1). It was demonstrated in 2009 that snoRNA-derived RNAs (sdRNAs) from C/D snoRNAs exhibit a bimodal length distribution at ~17–19 nt and >27 nt and predominantly originate from the 5′ end [30]. A possible function of sdRNAs would be to act as a novel source of miRNAs, as a sdRNA originating from the snoRNA ACA45 was found to co-immunoprecipitate with AGO1 and AGO2 in human embryonic kidney 293 cells (HEK293) [23]. SnoRNAs are also a reported source of piRNAs [31], but their mechanism of action and other of their functions remain unclear. Our data are consistent with 17- to 19-nt RNA species being produced by a processing mechanism preserved in humans and in mice, but not in other organisms, which warrants further investigation.
Assessment of RNA sequence diversity through the analysis of unique sequences revealed that the relatively abundant 12- and 13-nt sequences were not correspondingly diverse, as shown by their relatively lower content in unique sequences (Supplementary Figure S1). Indeed, the number of unique 12- and 13-nt RNA sequences, yet overly abundant, in human, mouse, fly, and budding yeast (Supplementary Figure S1A–E), was lower than that of 18- and 22-nt RNAs, except in fission yeast.
The presence of two flattened peaks centered around 13- and 22-nt unique RNAs in fission yeast S. pombe (Supplementary Figure S1E), which are sharper in mouse (Supplementary Figure S1B) and human (Supplementary Figure S1A) samples, is worth noting from an evolutionary perspective.

2.2. RNAs Shorter Than 16- nt Are More Abundant in Bilaterian Organisms

We performed our analyses by comparing sRNAs obtained in the standard window of RNA length (i.e., 16- to 30-nt) to unusually short sRNAs found in the 8- to 15-nt window, both in terms of sequence and relative abundance. Even if the first window (i.e., 16- to 30-nt) was twice as large as the smaller (i.e., 8- to 15-nt) and contained the well-known family of microRNAs, we found a higher abundance of RNAs in the 8- to 15-nt window in mouse and Drosophila samples (Figure 1G,H). This was not the case for every human sample, except for HUVEC; in HEK293 cells, the reads abundance was almost equally distributed between the two windows, whereas in PMN the ratio of reads within the 8- to 15-nt window was slightly lower (Figure 1F). Finally, a higher proportion of reads within the standard window was found in Arabidopsis thaliana and Saccharomyces cerevisiae (Figure 1I,J), while an equal distribution between the two windows was observed in Schizosaccharomyces pombe (Figure 1J).
Although the number of samples we analyzed is small, it seems that in bilaterian organisms unusually short RNAs are particularly conserved and more abundant than sRNAs found in the window of 16- to 30-nt standard-length, while the opposite trend was observed in yeasts and plant. However, we did not observe this pattern in terms of diversity, as represented by the number of unique reads (Figure 1F–H vs. Supplementary Figure S1F–H).
We then performed analyses of conserved RNA sequences between the bilaterian species (human, mouse, and fly samples), as they might give some indications of their functionality. We obtained 181, 46, and 28 sequences shorter than 15-nt from conserved rRNAs, tRNAs, and microRNAs, respectively (Supplementary Table S1). The conserved cleavage and the generation of these sequences makes them particularly interesting, supporting the functionality of RNAs shorter than 15-nt.

2.3. Small Non Coding RNA Distribution upon Biotypes

Sequences obtained by sRNA-Seq were first annotated by mapping against downloaded databases of defined sRNA classes, including, among others, rRFs, tRFs, snoRNAs, microRNAs, and the piRNAs, providing an unprecedented, systematic analysis of RNA species from six eukaryotic organisms and between 8- and 30-nt in length.

2.3.1. MicroRNAs Are Less Abundant Than Unusually Short RNAs

As observed in the nearly 2000 small RNA libraries listed by Li et al. [32], analysis of small RNA length distribution in the standard 16- to 30-nt window consistently showed the greatest abundance of RNAs in the 19- to 24-nt range, usually associated to microRNAs in human, mouse, or fly [33,34,35]. However, when reducing the lower threshold of the window to 8 nt, the 19- to 24-nt RNAs became a minority of all sequences and were largely dominated by shorter sequences (Figure 1A–D), suggesting that 19- to 24-nt sequences are not the most abundant sRNAs.
However, a high level of RNA diversity was observed in the 19- to 24-nt range, as reflected by the large number of unique sequences found among all species in which microRNAs are found (Supplementary Figure S1A–D). Thus, broadening of the standard 16- to 30-nt window to the 8- to 30-nt range appears to have reduced the relative abundance of 19- to 24-nt RNAs, while retaining their relative diversity.
Our data suggest that, although the preparation of an 8- to 30-nt library may not be the best to study only 19- to 24-nt RNAs, such as microRNAs, it allows a comparison of their relative abundance versus the other sRNAs detected, including their derived fragments, such as semi-microRNAs for microRNAs. Moreover, as these sequences are more abundant than microRNAs, it may facilitate their detection and use as biomarkers. However, their length poses a challenge to their rapid detection and their accurate quantitation.

2.3.2. MicroRNA Fragments May Be Detected in the 8- to 15-nt Window

The microRNA and their fragment reads were almost not detectable in the 8- to 15-nt window (Figure 2A–C), while they constituted a significant part of 16- to 30-nt RNAs (Figure 2F–H).
Length distribution plotting of the reads mapping to microRNA sequences unveiled a large abundance of 19- to 24-nt RNAs (Figure 3A) that was associated with a high diversity (Supplementary Figure S3A), corresponding to the known and well-defined class of microRNAs. Nevertheless, a high diversity of 8- to 11-nt sequences (Supplementary Figure S3A, in blue) of relatively high abundance (Figure 3A) was also observed, which may be interesting, since they may be functional such as semi-microRNA [28]. Interestingly, our current analyses revealed the 10 most abundant of 8- to 15-nt RNA fragments derived microRNAs in all samples (Supplementary Table S2). The origin of each microRNA fragment seems to be conserved between different human samples and between mouse samples. Most of the microRNA fragment sequences are conserved between human and mouse samples, whereas a minority of sequences are conserved between human, mouse, and fly samples (Supplementary Table S1).
In total, we discovered between 886 and 3705 unique microRNA-derived sequences in the 8- to 15-nt window, depending on the sample (Supplementary Figure S2A–D). Listing the most abundant microRNA-derived fragments (Supplementary Table S2), we have discovered a fragment derived from miR-625 that can suppress the inflammatory response [36] in human PMN and HEK293 cells, but also in HUVEC with a lower RPM number. We also reported semi-microRNAs from miR-207, miR-1b, or miR-382 in mice, from miR-12136 involved in drug resistance [37], and from miR-221 or miR-1283 in humans (Supplementary Table S2). The major pathogenic role of microRNAs in cancer and inflammatory diseases has been widely described, as well as their use, on a clinical level, either for diagnosis or prognosis in pathologies [38]. In this context, miRNA fragments may have relevant gene regulatory functions, in addition to their potential utility as biomarkers, which is why the study and the monitoring of their expression need to be intensified to challenge the dogma about their uselessness. Indeed, even though the inhibitory principle of smiRNAs has been demonstrated only for a smiRNA-223, we speculate that smiRNA modulates the function of the microRNA from which they derive. Thus, these sequences may offer alternative therapeutic options for the treatment of pathologies [39].

2.3.3. piRNA Fragments Are as Abundant and Diverse as piRNAs in Bilaterian Organisms

piRNAs are a class of small non-coding RNAs of 24- to 30-nt initially identified as repressors of germ cell transposition [40,41], and subsequently detected in somatic cells of many animals [42]. Recently, 20-nt long piRNA have been detected [43]. Interestingly, in this study, RNA shorter than 20-nt long piRNA were also detected and were ever more abundant than those of 20- and 24-nt in all samples. Since the functionality of piRNAs as biomarkers for early diagnosis in pathologies has been increasingly pointed out in recent years [41], as well as the function of small non-coding RNA from longer RNAs (e.g., tRFs, rRFs) [18,44], we aimed to identify piRNA-derived sequences in the 8- to 30-nt window.
Initially, the piRNAs and the piRFs class get the most abundant part in Mus musculus (Figure 2B,G), while the second species which has shown the most abundant piRNAs and piRFs is D. melanogaster (Figure 2C,H). Finally, in human samples the share of piRNAs is the lowest (Figure 2A,F). It seems that the share of piRNAs in the RNAs found in the 8 to 30-nt window seems to decrease from Drosophila to Human. In contrast to their abundance, we noticed that the fly piRNAs/piRFs were very diverse, since they have the largest share of unique sequences in our analyses (Supplementary Figure S2). They account for more than 27.5% and 9.4% of unique sequences of unusually short RNAs (8–15 nt) and standard RNAs (16–30 nt), respectively, in human, mouse, and fly samples (Supplementary Figure S2A–C,F–H).
Moreover, the length distribution and the abundance clustering of the piRNA and piRF reads (Figure 3B) highlighted a blue cluster corresponding to the highest abundance RNAs. One part of this cluster has grouped RNAs from 22- to 24-nt, which is the usual length of piRNAs, while another part contains sequences of 12- to 19-nt in length (Figure 3B). This last RNAs range, which was likely missed in most other studies, suggests that piRFs may be as abundant as authentic piRNAs.
Analysis of piRNA/piRF diversity showed a large number of unique sequences in the 11- to 25-nt window, with a particular enrichment in 13-nt sequences across samples (Supplementary Figure S3B).
Then, we generated a list of the most abundant piRNAs and piRFs found in our samples (Table 2). We observed that piRF ranging from 23- to 28-nt in length were the most abundant (particularly those originating from piR-hsa-145507 in human samples and piR-mmu-25873647 in mice), as well as 13- to 22-nt piRFs. Interestingly, these piRFs always derived from the 5′ end of the original piRNA (whenever this information was available). For instance, we identified several piRFs of 15- or 18-nt, all originating from the 5′ end of piR-mmu-10912946, reminding the generation type of the tRF-5 [45].
Our findings support the existence of RNA fragments derived from piRNAs, similar to the many classes of RNAs derived from longer reference precursors. While current research is documenting the presence of piRNAs in somatic cells and trying to understand their role [41,43,46], we propose to take into account the possible existence of piRFs, which may bear some functionality and warrant further investigation. Furthermore, it would be highly informative to investigate whether these piRFs are derived from conserved piRNA clusters.

2.3.4. Discovery of Highly Abundant, Diverse, and Unusually Short sdRNAs

sdRNAs are preferentially produced upon cleavage of the 5′ or 3′ end of snoRNAs [47], and their length varies between 20- and 30-nt [13,48]. Here, we quantitated reads mapping to annotated snoRNAs and assembled the sdRNA transcriptome of our samples in the 8- to 30-nt window (Figure 3C). As for piRNAs, sdRNA length distribution analysis identified three clusters of varying abundance, including sequences of 17- and 29-nt, the characteristic length of sdRNAs from C/D snoRNAs [30], which were particularly abundant (Figure 3C). In addition, we identified highly abundant 9- to 12-nt sdRNA sequences (Figure 3C) that also displayed the greatest diversity, with the largest number of unique sequences (Supplementary Figure S3C, cluster in blue).
Surprisingly, a single sdRNA derived from the 5′ end of snoR58C (or snoR58 in mouse) dominates the 29-nt peak, constituting between 45% and 94% of the reads (Table 1). Similarly, a single sdRNA sequence accounted for 99.9% of the reads within the 17-nt peak in all human samples, and another for 96% of the 17-nt reads in mouse samples (Table 1). These two sdRNAs derive from the 5′ end of human SNORD30 or mouse SNORD83B, respectively. Although a specific cleavage of the 5′ end of SNORD30 giving rise to a ~22-nt sdRNAs has been reported previously [13], the 17-nt SNORD30-derived sdRNA is more abundant than the 22-nt derivative. Thus, as SNORD30 expression has been correlated to tumor progression in several studies [49,50], the 17-nt derivative may constitute a better marker of immune cancers than the 22-nt form.
The three most abundant 9- to 12-nt sdRNAs detected in our samples are listed in Table 3. Intriguingly, these sdRNAs share a common ATGA motif at their 3′ end (Table 3).
Although these analyses do not confirm these sdRNAs to be functional or more than stable by-products, the preponderance of discrete unusually short sdRNAs in all of our samples, from yeasts to human, as well as their origin (e.g., the 5′ end of snoRNAs) in mouse, human, fly, and plant, or specific motif (e.g., ATGA), support some specificity in the biogenesis process and/or a conservation mechanism, both of which deserve further investigations.

2.3.5. Detection of tRFs Shorter Than 16-nt

tRFs are a rapidly growing class of non-coding RNAs, as several studies have contributed to improve our knowledge of their expression [51], biogenesis [52], and function [53]. Notably, their discovery has paved the way to dismantle the dogma on the functionality of RNAs resulting from the degradation of longer non-coding RNAs, in addition to the rise of studies on rRFs [18]. Again, tRFs are being used to overcome dogmatic limits, here length, as a recent study in Arabidopsis reported a strong accumulation of 13- to 16-nt tRFs [51].
Our study expands on the latter by reporting the presence of tRFs in all of our eukaryotic samples across the 8- to 30-nt window of RNA length (Figure 2). tRFs are particularly abundant in the 15- to 18-nt range (Figure 3D), with a concomitantly high diversity of unique sequences in the same range (Supplementary Figure S3D).
To get further insight into their family affiliation, biogenesis or mode of degradation, we grouped tRFs according to their sequence and aligned them on the shortest, common seed sequence (Supplementary Tables S3–S13, tRFs). Within these files, we can observe what seems to be the normal process of tRNA degradation or very with low abundant sequences of decreasing size, in “staircase”. However, within these groups of tRNAs that appear to be digested, we observe very abundant tRFs. The latter could represent either by-products of degradation of stable tRNAs or tRFs. We also determine the most abundant tRFs detected in our sRNA-Seq analyses (Table 4). As for the longer tRFs already listed, we observed that tRFs shorter than 16 nt were mainly from the 5′ or 3′ end of tRNAs or from their loop [45,51]. Together, these clues suggest that tRFs shorter than 16 nt may not be mere degradation products, but functional tRFs that deserve to be studied further.

2.3.6. rRFs Are Overly Abundant in Bilaterian Organisms

rRFs formed the most abundant class of RNA found in our sRNA-Seq datasets, representing between 16% and 95% of the reads in the 8- to 15-nt window, and between 9% and 70% of the reads in the 16- to 30-nt window (Figure 2). Expansion of the sequencing window to include 8- to 15-nt RNA species considerably increased the share of rRFs, as compared to the standard 16- to 30-nt window, at the expense of all other biotypes in bilateral organisms (Figure 2A–C,F–H). The excessive abundance of rRFs, which has been observed by other groups [54,55] and discussed previously by us [3], suggests that they originate from rRNA, which constitutes 80% of the RNAs in cells [56].
As we did with tRFs, we gathered all rRFs having a common motif or sequence together (Supplementary Tables S3–S13, rRFs). Thus, a staircase-like profile with a correspondingly low read count for each rRF sequence emerged, supporting their active degradation, most often from 5′ to 3′, possibly involving exonuclease(s). sRNA-Seq analysis in an extended 8- to 30-nt window may thus provide, besides a more global, unbiased view of small RNAs, a means to better characterize a small RNA degradome and to study the mechanisms involved [12,57].
Some rRF sequences exhibited a peculiar profile of abnormally high read counts (Supplementary Tables S3–S13, rRFs, highlighted in blue), which did not correspond to the shortest reads, arguing against rRFs accumulating as degradation end-products of a stepwise nucleotide excision process. Listing of the most abundant rRFs revealed that the human and mouse sequences (except for one) shared the same minimal 12-nt sequence, with identical 5′ extension of mainly 1 (a C) or 6 nucleotides (TCGTAC) (except for one).
Heatmap analysis of rRF abundance according to their length confirmed the high abundance of 12-, 13- and 18-nt sequences (Figure 3E, clusters in blue). Each of these peaks was almost exclusively composed of a single sequence identical to those listed in Table 5: the sequence GACTCTTAGCGG represented 98% of the 12-nt reads, CGACTCTTAGCGG 95% of the 13-nt reads, and TCGTACGACTCTTAGCGG ~90% of the 18-nt reads in mouse and human samples (5′ extensions are underlined).

2.4. RT-qPCR Validation of Two Unusually Short rRFs of 12 and 13 nt

Analysis of our sRNA-Seq data revealed that in HEK293 cells the specific 12- and 13-nt rRFs represented 57,921 RPM and 338,156 RPM, respectively (~5 times more 13-nt sequences than the 12-nt), while miR-25 and miR-30a represented only 257 and 96 RPM, a ~3-log difference (Figure 4A). In mouse neuronal N2a cells, the two rRFs exhibited similar levels of expression, with 391,832 RPM for the 12-nt rRF and 349,502 RPM for the 13-nt rRF, while miR-25 and miR-30a accounted for 1220 and 731 RPM, respectively. The other human and mouse samples showed a similar relative distribution (Figure 4A). The predominance of the specific 12- and 13-nt rRFs among our sRNA reads prompted us to validate their existence by an alternate method (e.g., RT-qPCR), which would also allow a comparison of their abundance relative to better-known microRNAs.
We have thus developed a specific and sensitive RT-qPCR method based on splinted 5′ ligation to detect two particularly abundant rRFs of 12 and 13 nt that differ by only a single nucleotide at the 5′ extremity [58]. We used our new method to validate the existence of the two 12- and 13-nt rRFs, and quantitate them relative to two microRNAs: miR-25, which is highly expressed, and miR-30a, which is less expressed in the same human and mouse samples.
RT-qPCR analyses of the same human and mouse samples that were previously analyzed by sRNA-Seq confirmed the existence of the 12- and the 13-nt rRFs, and their relative abundance to each other; the 13-nt rRF being 5 times more abundant than the 12-nt rRF in human samples (Figure 4B). More importantly, our RT-qPCR data confirmed our sRNA-Seq showing the far greater abundance of the 12- and the 13-nt rRFs relative to microRNAs miR-25 and miR-30a (Figure 4B). A comparison between unusually short, 12- to 13-nt rRFs and 19- to 24-nt microRNAs, either in number of reads, height of the peaks or copy numbers, suggests that rRFs are overly more abundant than microRNAs. These findings were confirmed by RT-qPCR analysis of samples from different cultures of human HEK293 and mouse N2a cells in which, again, the 13-nt rRF was 5 times more abundant than the 12-nt rRF in human samples, and 12- and 13-nt rRF copy numbers proved to be ~3 logs higher than the individual microRNAs miR-25 and miR-30a (Figure 4C).
Together, the results obtained through the use of our new RT-qPCR detection method, tailored to monitor unusually short RNA species, allowed us to confirm our sequencing results, document the differential enrichment of 12- and 13-nt rRFs between human and mouse samples, and highlight their ~3-log higher levels of expression compared to individual microRNAs.

2.5. Conclusions

Although the cellular role and function of the newly discovered, unusually short RNAs in this study have yet to be elucidated, our results support their presence as genuine, as confirmed and quantitated by RT-qPCR and as suggested by the accumulation of rRFs of discrete length in all the samples analyzed. Regardless as to whether these sRNAs are stable degradation by-products or fulfill a specific function within cells, their systematic removal from sRNA-Seq analyses definitely hampers their study and their characterization, and it contributes to perpetuating the concept that unusually short RNAs are devoid of interest and utility.
However, it is necessary to relativize our results with respect to the potential bias introduced during the construction of the sequencing library. Indeed, although some unusually short sequences seem to represent a majority share in our results (e.g., sdRNA, tRF, or rRF sequences represent 90% of the abundance of the peak length; Table 1 and Table 3), the presence of post-transcriptional modifications of RNA may hinder the detection of some RNA species by HTS, and thus bias the sequencing results. For example, tRNAs/tRFs are highly modified post-transcriptionally. These modifications impair their reverse transcription and thus the ability of tRNA-derived sequences to be sequenced [59]. Thus, these sequencing biases may result from the inability to detect specific sequences. In particular, this could explain our results in Figure 2, in which a low share of tRFs is observed in most organisms in the 16- to 30-nt window, while it is widely known that these are highly abundant small RNA species in these organisms. Similarly, it has been widely reported that human tRFs derived from tRNAGly are the most abundant tRFs in most HTS analyses (Table 4), but Northern blotting performed in a previous study revealed that they are not particularly more abundant than other tRFs [60]. This is, in part, because tRNAGly is less modified than other tRNAs.
As our studies are based on RNA sequencing without prior demethylation, it is likely that we mainly documented small RNAs from longer RNAs that do not show post-transcriptional modifications. Thus, if the shorter RNAs are not post-transcriptionally modified, they will be more likely to be sequenced, leading to the misinterpretation that they are more abundant than the 16- to 30-nt sequences. It would be interesting to perform a comparative analysis with sequencing from a constructed library taking into account this possibly major bias. Nevertheless, this bias does not call into question the discovery of unusually short sequences shorter than 15-nt. On the contrary, we may even discover a greater diversity of them by sequencing RNAs in the 8- to 30-nt window from previously demethylated RNAs.
Finally, since the discovery that several sRNAs derive from longer RNAs, their usefulness in the diagnosis and prognosis of various pathologies has been established [8,13,44,54], albeit without precisely knowing their function, making their potential use in targeted medical research promising. We, therefore, call upon the scientific community to reconsider the dogma at play—as we all did when our research led to the discovery of microRNAs—and to reduce the lower limit of RNA length to include these unusually short RNAs in sRNA-Seq analyses and datasets. Their abundance and their diversity support their potential role and importance as biomarkers of disease and/or mediators of cellular function [29]. As a matter of fact, the study by Li et al. [61], who described the presence of unusually short 17-nt RNAs that can modulate human gene expression, has laid the foundation for the sRNA functionality length threshold to be enlarged. Today, we again set an arbitrary limit at 8 nt, even knowing that future studies may end up revealing that RNAs shorter than 8 nt also have biological relevance.

3. Materials and Methods

3.1. Ethical Statement

3.1.1. Human Blood Samples

Collection of venous blood from healthy volunteers (adult Caucasians of both sexes from the immediate region of Quebec City) was approved by our institutional human ethics committee (B14-08-2103). The participants provided their written informed consent to participate in this study, in accordance with the Declaration of Helsinki.

3.1.2. Mouse Tissue Samples

This study was carried out in accordance with the guidelines, regulations, and requirements of the Canadian Council of Animal Care for Animals Used for Scientific Purposes. Experiments were performed in accordance with the latest guidelines and using a protocol approved by the Université Laval Animal Welfare Committee.

3.2. Biological Samples

3.2.1. Primary and Cultured Human Cells

Human blood polymorphonuclear leukocytes (PMN) were isolated from venous blood collected from four healthy donors and pooled, as described in Laffont et al. [62]. Human umbilical vein endothelial cells (HUVEC; Stem Cell Technologies, Vancouver, BC, Canada) were cultured in endothelial growth medium (Lonza, Basel, Switzerland) supplemented with bovine brain extract (Lonza, Basel, Switzerland) and maintained at 37 °C under 5% CO2 and used between passages 2 to 6. Cultured human embryonic kidney 293 (HEK293; ATCC, Manassas, VA, USA) were maintained in Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 10% (v/v) fetal bovine serum (FBS), 1 mM sodium pyruvate, 100 units/mL penicillin, 100 μg/mL streptomycin, and 2 mM L-glutamine in a humidified incubator under 5% CO2 at 37 °C.

3.2.2. Primary and Cultured Mouse Cells and Tissues

Mouse blood PMN were isolated from four healthy 12- to 15-week-old mice, as described in Duchez et al. [63], and pooled. The brain cortex (cerebellum) was collected, after PBS washing, from exsanguinated 24-month-old C57BL/6 mice and flash frozen in liquid nitrogen before storage at −80 °C. Neuronal N2a and NIH/3T3 fibroblast cell lines used in this study were obtained from ATCC (Manassas, VA, USA) and maintained in culture according to ATCC’s recommendation.

3.2.3. Drosophila melanogaster

Adult flies were purchased from the University of California in San Diego (UCSD) Drosophila Species Stock Center (San Diego, CA, USA) and snap-frozen in liquid nitrogen before storage at −80 °C.

3.2.4. Arabidopsis thaliana

Arabidopsis thaliana, accession Colombia (Col-0) was obtained from the ABRC (Arabidopsis Biological Resource Centre, Ohio State University, Columbus, OH, USA). Seeds were sown in autoclaved Pro-Mix BX potting mix (Premier Horticulture, Rivière-du-Loup, QC, Canada), watered with sterile distilled water (300 mL/pot/week), and placed in a growth chamber (16 h light at 22 °C, 8 h dark at 20 °C, 60–70% humidity). After 7 days, seedlings of uniform size were transferred to pots containing Pro-Mix BX potting mix and grown for 14 days prior to being harvested for RNA extraction.

3.3. Total RNA Isolation

Total RNA samples were prepared using TRIzol® reagent or TRIzol LS® reagent for liquid samples (Invitrogen Life Technologies, Carlsbad, CA, USA), following the manufacturer’s recommendations. Contaminating DNA was degraded using DNase I (M0303S, New England Biolabs, Ipswich, MA, USA) treatment.

3.4. Small RNA Library and Sequencing

For primary blood PMN samples involving cell isolation, we used a pooling strategy involving equal mixing of total RNA samples derived from independent biological samples. Pooling of small RNA samples is effective in reducing data variability, and it reduces the number of replicates, hence lowering the cost for subsequent steps [64]. For each of the other samples, a unique total RNA specimen was analyzed.
The quality and the concentration of the total RNA samples were verified using a NanoDrop ND-1000 (Thermo Fisher Scientific, Waltham, MA, USA) and gel separation (Supplementary Table S15). The sRNA-Seq libraries, containing RNA species between 8 and 30 nt in length were prepared as previously described [65]. Samples were diluted to a final concentration of 8 pM, denatured as single-stranded DNA, and cluster generation was performed on the Illumina cBot using a TruSeq Rapid SR cluster kit (GD-402-4001, Illumina, San Diego, CA, USA). Afterward, the clusters were sequenced for 51 cycles on Illumina HiSeq 2000 using TruSeq Rapid SBS Kits (FC-402-4002, Illumina, San Diego, CA, USA), as per the manufacturer’s instructions.

3.5. Analysis Workflow

Clean reads matching the quality standards were processed to remove the adaptor sequence, leading to sRNA trimmed reads. All reports displayed here were generated through the standard analysis pipeline of Arraystar Inc., (Rockville, MD, USA) (https://www.arraystar.com/, accessed on 17 May 2022) and refined using R (Free Software Foundation). Only the reads that were identical, both in length and in sequence, were considered as a unique read. TPM normalizations were not based on cell types—the increased number of chromosome/gene copies may contribute to increase the abundance of sRNAs in the cancerous HEK293 and N2a cell lines compared to primary cells.
After adapter trimming, low-quality filtering, and contamination checking, the clean sequencing data were mapped to the downloaded sRNA databases of the corresponding organisms, as listed in Supplementary Table S16. The mapped tags were then used to identify and to profile sncRNAs, including miRNAs (and semi-microRNAs), rRFs, tRFs, sdRNAs, piRNAs (and piRFs), and other sncRNAs based on the constructed libraries (Supplementary Table S16).
Post-transcriptional modifications of RNA can lead to errors in reverse transcription, whereas the presence of single nucleotide polymorphism (SNP) may hamper proper identification. Since the sequenced sRNAs may contain mismatches, and to allow detection of modified RNA species, such as those derived from tRNAs, rRNAs or piRNAs, we proceeded with our analyses by accepting one mismatch. Many piRNA sequences are actually derived from tRNA, or rRNA genes [66]. To avoid misclassification of small RNAs derived from piRNA/tRFs/rRFs, we prioritized the search and the classification of sequences originating from tRNAs or rRNAs in the class of tRFs or rRNAs, before that of piRNA sequences. For tRF/rRF identification, the reads aligned to precursor tRNA/rRNA genes and mature tRNA/rRNA sequences with the same strand as the source tRNA/rRNA were used for tRF/rRF analysis. In addition, sequences that did not perfectly match to any database were listed as undefined. No mismatch or gap was tolerated in our classification. miRNA precursor sequences and other known small non-coding RNA sequences were retrieved from the latest miRBase database [67] (release v22.0).

3.6. Adapter-Ligated RT-qPCR Method

This specific quantification method was set up, experimentally validated, and described in Lambert et al. [58]. The splint and the adaptor were annealed together prior to being added to 300 ng of total RNA in a ligation reaction intended to extend two unusually short rRF sequences (named doRNA and C-doRNA) at their 5′ extremity. Two µL of total ligated RNA were used for reverse transcription using the miRCURY Locked Nucleic-Acid (LNA)-modified microRNA PCR Assay (Qiagen Inc., Toronto, ON, Canada). After cDNA dilution (1/10), qPCR was performed using miRCURY LNA SYBR® Green PCR Kits (Qiagen Inc., Toronto, ON, Canada) in 96-well plates using the CFX96 Touch™ Real-Time PCR Detection System (Bio-Rad, Mississauga, ON, Canada) and specific Custom LNA Oligonucleotides (Qiagen Inc., Toronto, ON, Canada). doRNA, C-doRNA, microRNA-25 (miR-25), and microRNA-30a (miR-30a) copy numbers were determined with a standard curve built using synthetic RNAs (Integrated DNA Technologies, Inc., Coralville, IA, USA). In compliance with the MIQE guidelines, the small nucleolar RNA U6 was used as a reference gene for normalization [68].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ncrna8030034/s1, Figure S1: Unique sequence distribution of small RNAs in human, mouse, fly, plant and yeast cells in the 8- to 30-nucleotide window; Figure S2: Biotype distribution of unique small RNA sequences in human, mouse, fly, plant and yeast cells in the 8- to 15-nt and 16- to 30-nucleotide windows; Figure S3: Length distribution of the small RNA biotype unique sequence in human, mouse, fly, plant and yeast cells in the 8 to 30-nucleotide window. Table S1: RNA sequences conserved in bilaterians species; Table S2: Most abundant miRNA derived fragment identified in the unusually, 8 to 15-nt window by sRNA-Seq analysis; Table S3: rRFs and tRFs precursors classification of RNA sequencing data from HUVEC; Table S4: rRFs and tRFs precursors classification of RNA sequencing data from PMN; Table S5: rRFs and tRFs precursors classification of RNA sequencing data from HEK293; Table S6: rRFs and tRFs precursors classification of RNA sequencing data from Cerebellum; Table S7: rRFs and tRFs precursors classification of RNA sequencing data from PMN; Table S8: rRFs and tRFs precursors classification of RNA sequencing data from NIH/3T3; Table S9: rRFs and tRFs precursors classification of RNA sequencing data from N2a; Table S10: rRFs and tRFs precursors classification of RNA sequencing data from D. melanogaster; Table S11: rRFs and tRFs precursors classification of RNA sequencing data from A. thaliana; Table S12: rRFs and tRFs precursors classification of RNA sequencing data from S. cerevisiae; Table S13: rRFs and tRFs precursors classification of RNA sequencing data from S. pombe; Table S14: Statistical analyses of the quantitation of 12-nt and 13-nt rRF sequences and the miR-30a and miR-25 levels; Table S15: RNA Quality control and Quality Assessment of Sequencing; Table S16: Databases used for the sRNA-Seq workflow analyses.

Author Contributions

M.L. and P.P. conceived and designed research project. M.L., S.G., J.H., A.B. and B.L. performed the experiments. R.B. prepared the plant material and isolated and purified RNA from Arabidopsis thaliana. M.L. interpreted the results, prepared the figures, and drafted the manuscript. M.L., R.B. and P.P. revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

M.L. received PhD studentship award no. 289637 from the Fonds de Recherche du Québec—Nature et Technologies (FRQ-NT). This work and the article processing charge (APC) were funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant number RGPIN-2019-06502.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by by our CHU de Quebec-Université Laval Research human ethics committee (Approval Code: 2015-2103, B14-08-2103; Approval Date: 17 October 2017). The animal study protocol was approved by the Université Laval Animal Welfare Committee (Approval Code: 2014056-4; Approval Date: 11 May 2017).

Data Availability Statement

All raw small RNA-seq data generated in this study have been submitted to the NCBI Gene Expression Omnibus under accession number GSE179677. Instructions for Editors and Reviewers: Go to https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE179677 (accessed on 17 May 2022). Enter token mtyhesicpnunlyv into the box.

Acknowledgments

We thank Karl Ekwall (Karolinska Institutet, Huddinge, Sweden) for generously providing total RNA from Saccharomyces cerevisiae and Schizosaccharomyces pombe.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

DMEMDulbecco’s modified Eagle’s medium
FBSfetal bovine serum
HEK293human embryonic kidney 293
HTShigh-throughput sequencing
HUVEChuman umbilical vein endothelial cells
mRNAmessenger RNA
ntnucleotide
piRNAPIWI-associated RNA
PMNpolymorphonuclear leukocyte
rRFribosomal RNA fragment
rRNAribosomal RNA
RT-qPCRreverse transcription—quantitative polymerase chain reaction
sdRNAsmall nucleolar RNA-derived RNA
sncRNAsmall non-coding RNA
snoRNAsmall nucleolar RNA
sRNAsmall RNA
sRNA-Seqsmall RNA sequencing
tiRNAstRNA halves derived from tRNA
tRFtransfer RNA (tRNA)-derived fragment
tRNAtransfer RNA
Y RNAcytoplasmic RNA
YsRNAY RNA-derived small RNA
siRNAsmall interfering RNA
rasiRNArepeat-associated short interfering RNA
snRNAsmall nuclear RNA
NIH/3T3mouse embryonic fibroblast cells
N2amouse neuroblastoma cells
miRNAmicroRNA
smiRNAsemi-microRNA
piRFspiRNA-derived fragments
siRNAsmall interfering RNA
RPMreads per million.

References

  1. Reddy, R.; Henning, D.; Busch, H. Substitutions, insertions, and deletions in two highly conserved U3 RNA species. J. Biol. Chem. 1980, 255, 7029–7033. [Google Scholar] [CrossRef]
  2. Mizuno, T.; Chou, M.Y.; Inouye, M. A unique mechanism regulating gene expression: Translational inhibition by a complementary RNA transcript (micRNA). Proc. Natl. Acad. Sci. USA 1984, 81, 1966–1970. [Google Scholar] [CrossRef] [Green Version]
  3. Lambert, M.; Benmoussa, A.; Provost, P. Small Non-Coding RNAs Derived from Eukaryotic Ribosomal RNA. Non-Coding RNA 2019, 5, 16. [Google Scholar] [CrossRef] [Green Version]
  4. Castellano, L.; Stebbing, J. Deep sequencing of small RNAs identifies canonical and non-canonical miRNA and endogenous siRNAs in mammalian somatic tissues. Nucleic Acids Res. 2013, 41, 3339–3351. [Google Scholar] [CrossRef] [Green Version]
  5. Lee, R.C.; Feinbaum, R.L.; Ambros, V. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell 1993, 75, 843–854. [Google Scholar] [CrossRef]
  6. Houwing, S.; Kamminga, L.M.; Berezikov, E.; Cronembold, D.; Girard, A.; van den Elst, H.; Filippov, D.V.; Blaser, H.; Raz, E.; Moens, C.B.; et al. A role for Piwi and piRNAs in germ cell maintenance and transposon silencing in Zebrafish. Cell 2007, 129, 69–82. [Google Scholar] [CrossRef] [Green Version]
  7. ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 2012, 489, 57–74. [Google Scholar] [CrossRef]
  8. Guglas, K.; Kołodziejczak, I.; Kolenda, T.; Kopczyńska, M.; Teresiak, A.; Sobocińska, J.; Bliźniak, R.; Lamperska, K. YRNAs and YRNA-Derived Fragments as New Players in Cancer Research and Their Potential Role in Diagnostics. Int. J. Mol. Sci. 2020, 21, 5682. [Google Scholar] [CrossRef]
  9. Shigematsu, M.; Kirino, Y. tRNA-Derived Short Non-coding RNA as Interacting Partners of Argonaute Proteins. Gene Regul. Syst. Biol. 2015, 9, 27–33. [Google Scholar] [CrossRef]
  10. Lui, L.; Lowe, T. Small nucleolar RNAs and RNA-guided post-transcriptional modification. Essays Biochem. 2013, 54, 53–77. [Google Scholar] [CrossRef] [Green Version]
  11. Moyano, M.; Stefani, G. piRNA involvement in genome stability and human cancer. J. Hematol. Oncol. 2015, 8, 38. [Google Scholar] [CrossRef] [Green Version]
  12. Zheng, Y. Chapter 1—Introduction to Non-coding RNAs and High Throughput Sequencing. In Computational Non-Coding RNA Biology; Zheng, Y., Ed.; Academic Press: Cambridge, MA, USA, 2019; pp. 3–31. [Google Scholar] [CrossRef]
  13. Chow, R.D.; Chen, S. Sno-derived RNAs are prevalent molecular markers of cancer immunity. Oncogene 2018, 37, 6442–6462. [Google Scholar] [CrossRef]
  14. Wang, Y.; Li, H.; Sun, Q.; Yao, Y. Characterization of Small RNAs Derived from tRNAs, rRNAs and snoRNAs and Their Response to Heat Stress in Wheat Seedlings. PLoS ONE 2016, 11, e0150933. [Google Scholar] [CrossRef] [Green Version]
  15. Abel, Y.; Rederstorff, M. SnoRNAs and the emerging class of sdRNAs: Multifaceted players in oncogenesis. Biochimie 2019, 164, 17–21. [Google Scholar] [CrossRef]
  16. Green, D.; Fraser, W.D.; Dalmay, T. Transfer RNA-derived small RNAs in the cancer transcriptome. Pflug. Arch. 2016, 468, 1041–1047. [Google Scholar] [CrossRef] [Green Version]
  17. Mleczko, A.M.; Machtel, P.; Walkowiak, M.; Wasilewska, A.; Pietras, P.J.; Bąkowska-Żywicka, K. Levels of sdRNAs in cytoplasm and their association with ribosomes are dependent upon stress conditions but independent from snoRNA expression. Sci. Rep. 2019, 9, 18397. [Google Scholar] [CrossRef] [Green Version]
  18. Guan, L.; Grigoriev, A. Computational meta-analysis of ribosomal RNA fragments: Potential targets and interaction mechanisms. Nucleic Acids Res. 2021, 49, 4085–4103. [Google Scholar] [CrossRef]
  19. Kowalski, M.P.; Krude, T. Functional roles of non-coding Y RNAs. Int. J. Biochem. Cell Biol. 2015, 66, 20–29. [Google Scholar] [CrossRef] [Green Version]
  20. Xie, Y.; Yao, L.; Yu, X.; Ruan, Y.; Li, Z.; Guo, J. Action mechanisms and research methods of tRNA-derived small RNAs. Signal Transduct. Target. Ther. 2020, 5, 109. [Google Scholar] [CrossRef]
  21. Gebetsberger, J.; Wyss, L.; Mleczko, A.M.; Reuther, J.; Polacek, N. A tRNA-derived fragment competes with mRNA for ribosome binding and regulates translation during stress. RNA Biol. 2017, 14, 1364–1373. [Google Scholar] [CrossRef] [Green Version]
  22. Kishore, S.; Khanna, A.; Zhang, Z.; Hui, J.; Balwierz, P.J.; Stefan, M.; Beach, C.; Nicholls, R.D.; Zavolan, M.; Stamm, S. The snoRNA MBII-52 (SNORD 115) is processed into smaller RNAs and regulates alternative splicing. Hum. Mol. Genet. 2010, 19, 1153–1164. [Google Scholar] [CrossRef] [Green Version]
  23. Ender, C.; Krek, A.; Friedländer, M.R.; Beitzinger, M.; Weinmann, L.; Chen, W.; Pfeffer, S.; Rajewsky, N.; Meister, G. A human snoRNA with microRNA-like functions. Mol. Cell 2008, 32, 519–528. [Google Scholar] [CrossRef]
  24. Plewka, P.; Szczesniak, M.W.; Stepien, A.; Zywicki, M.; Pacak, A.; Colombo, M.; Makalowska, I.; Ruepp, M.; Raczynska, K.D. FUS controls the processing of snoRNAs into smaller RNA fragments that can regulate gene expression. bioRxiv 2018. [Google Scholar] [CrossRef]
  25. Giraldez, M.D.; Spengler, R.M.; Etheridge, A.; Godoy, P.M.; Barczak, A.J.; Srinivasan, S.; De Hoff, P.L.; Tanriverdi, K.; Courtright, A.; Lu, S.; et al. Comprehensive multi-center assessment of small RNA-seq methods for quantitative miRNA profiling. Nat. Biotechnol. 2018, 36, 746–757. [Google Scholar] [CrossRef]
  26. Raabe, C.A.; Tang, T.-H.; Brosius, J.; Rozhdestvensky, T.S. Biases in small RNA deep sequencing data. Nucleic Acids Res. 2014, 42, 1414–1426. [Google Scholar] [CrossRef] [Green Version]
  27. Olivares, D.; Perez-Hernandez, J.; Perez-Gil, D.; Chaves, F.J.; Redon, J.; Cortes, R. Optimization of small RNA library preparation protocol from human urinary exosomes. J. Transl. Med. 2020, 18, 132. [Google Scholar] [CrossRef]
  28. Plante, I.; Plé, H.; Landry, P.; Gunaratne, P.H.; Provost, P. Modulation of microRNA Activity by Semi-microRNAs. Front. Genet. 2012, 3, 99. [Google Scholar] [CrossRef] [Green Version]
  29. Lambert, M.; Benmoussa, A.; Diallo, I.; Ouellet-Boutin, K.; Dorval, V.; Majeau, N.; Joly-Beauparlant, C.; Droit, A.; Bergeron, A.; Têtu, B.; et al. Identification of Abundant and Functional dodecaRNAs (doRNAs) Derived from Ribosomal RNA. Int. J. Mol. Sci. 2021, 22, 9757. [Google Scholar] [CrossRef]
  30. Taft, R.J.; Glazov, E.A.; Lassmann, T.; Hayashizaki, Y.; Carninci, P.; Mattick, J.S. Small RNAs derived from snoRNAs. RNA 2009, 15, 1233–1240. [Google Scholar] [CrossRef] [Green Version]
  31. He, X.; Chen, X.; Zhang, X.; Duan, X.; Pan, T.; Hu, Q.; Zhang, Y.; Zhong, F.; Liu, J.; Zhang, H.; et al. An Lnc RNA (GAS5)/SnoRNA-derived piRNA induces activation of TRAIL gene by site-specifically recruiting MLL/COMPASS-like complexes. Nucleic Acids Res. 2015, 43, 3712–3725. [Google Scholar] [CrossRef]
  32. Feng, L.; Zhang, F.; Zhang, H.; Zhao, Y.; Meyers, B.C.; Zhai, J. An Online Database for Exploring Over 2,000 Arabidopsis Small RNA Libraries. Plant Physiol. 2020, 182, 685–691. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Ludwig, N.; Leidinger, P.; Becker, K.; Backes, C.; Fehlmann, T.; Pallasch, C.P.; Rheinheimer, S.; Meder, B.; Stähler, C.; Meese, E.; et al. Distribution of miRNA expression across human tissues. Nucleic Acids Res. 2016, 44, 3865–3877. [Google Scholar] [CrossRef] [PubMed]
  34. Axtell, M.J. ShortStack: Comprehensive annotation and quantification of small RNA genes. RNA 2013, 19, 740–751. [Google Scholar] [CrossRef] [Green Version]
  35. Chen, C.; Ridzon, D.A.; Broomer, A.J.; Zhou, Z.; Lee, D.H.; Nguyen, J.T.; Barbisin, M.; Xu, N.L.; Mahuvakar, V.R.; Andersen, M.R.; et al. Real-time quantification of microRNAs by stem-loop RT-PCR. Nucleic Acids Res. 2005, 33, e179. [Google Scholar] [CrossRef]
  36. Qian, F.-H.; Deng, X.; Zhuang, Q.-X.; Wei, B.; Zheng, D.-D. miR-625-5p suppresses inflammatory responses by targeting AKT2 in human bronchial epithelial cells. Mol. Med. Rep. 2019, 19, 1951–1957. [Google Scholar] [CrossRef] [Green Version]
  37. Yuan, J.; Xiao, C.; Lu, H.; Yu, H.; Hong, H.; Guo, C.; Wu, Z. Effect of miR-12136 on Drug Sensitivity of Drug-Resistant Cell Line Michigan Cancer Foundation-7/Doxorubicin by Regulating ATP Binding Cassette Subfamily B Member 1. J. Biomater. Tissue Eng. 2020, 10, 1431–1435. [Google Scholar] [CrossRef]
  38. Jadideslam, G.; Ansarin, K.; Sakhinia, E.; Babaloo, Z.; Abhari, A.; Ghahremanzadeh, K.; Khalili, M.; Radmehr, R.; Kabbazi, A. Diagnostic biomarker and therapeutic target applications of miR-326 in cancers: A systematic review. J. Cell. Physiol. 2019, 234, 21560–21574. [Google Scholar] [CrossRef]
  39. Valastyan, S.; Weinberg, R.A. miR-31: A crucial overseer of tumor metastasis and other emerging roles. Cell Cycle 2010, 9, 2124–2129. [Google Scholar] [CrossRef] [Green Version]
  40. Gainetdinov, I.; Skvortsova, Y.; Kondratieva, S.; Funikov, S.; Azhikina, T. Two modes of targeting transposable elements by piRNA pathway in human testis. RNA 2017, 23, 1614–1625. [Google Scholar] [CrossRef]
  41. Wu, X.; Pan, Y.; Fang, Y.; Zhang, J.; Xie, M.; Yang, F.; Yu, T.; Ma, P.; Li, W.; Shu, Y. The Biogenesis and Functions of piRNAs in Human Diseases. Mol. Ther. Nucleic Acids 2020, 21, 108–120. [Google Scholar] [CrossRef]
  42. Grimson, A.; Srivastava, M.; Fahey, B.; Woodcroft, B.; Chiang, H.R.; King, N.; Degnan, B.; Rokhsar, D.; Bartel, D.P. Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals. Nature 2008, 455, 1193–1197. [Google Scholar] [CrossRef] [PubMed]
  43. Perera, B.P.U.; Tsai, Z.T.-Y.; Colwell, M.; Jones, T.R.; Goodrich, J.M.; Wang, K.; Sartor, M.A.; Faulk, C.; Dolinoy, D.C. Somatic expression of piRNA and associated machinery in the mouse identifies short, tissue-specific piRNA. Epigenetics 2019, 14, 504–521. [Google Scholar] [CrossRef] [PubMed]
  44. Jia, Y.; Tan, W.; Zhou, Y. Transfer RNA-derived small RNAs: Potential applications as novel biomarkers for disease diagnosis and prognosis. Ann. Transl. Med. 2020, 8, 1092. [Google Scholar] [CrossRef] [PubMed]
  45. Venkatesh, T.; Suresh, P.S.; Tsutsumi, R. tRFs: miRNAs in disguise. Gene 2016, 579, 133–138. [Google Scholar] [CrossRef] [PubMed]
  46. Yan, Z.; Hu, H.; Jiang, X.; Maierhofer, V.; Neb, E.; He, L.; Hu, Y.; Hu, H.; Li, N.; Chen, W.; et al. Widespread expression of piRNA-like molecules in somatic tissues. Nucleic Acids Res. 2011, 39, 6596–6607. [Google Scholar] [CrossRef]
  47. Pundhir, S.; Gorodkin, J. Differential and coherent processing patterns from small RNAs. Sci. Rep. 2015, 5, 12062. [Google Scholar] [CrossRef] [Green Version]
  48. Scott, M.S.; Ono, M.; Yamada, K.; Endo, A.; Barton, G.J.; Lamond, A.I. Human box C/D snoRNA processing conservation across multiple cell types. Nucleic Acids Res. 2012, 40, 3676–3688. [Google Scholar] [CrossRef] [Green Version]
  49. López-Corral, L.; Mateos, M.V.; Corchete, L.A.; Sarasquete, M.E.; de la Rubia, J.; De Arriba, F.; Lahuerta, J.-J.; Garcia-Sanz, R.; Miguel, J.S.; Gutierrez, N.; et al. Genomic analysis of high-risk smoldering multiple myeloma. Haematologica 2012, 97, 1439–1443. [Google Scholar] [CrossRef] [Green Version]
  50. Chuang, T.-D.; Xie, Y.; Yan, W.; Khorram, O. Next-generation sequencing reveals differentially expressed small noncoding RNAs in uterine leiomyoma. Fertil. Steril. 2018, 109, 919–929. [Google Scholar] [CrossRef]
  51. Ma, X.; Liu, C.; Kong, X.; Liu, J.; Zhang, S.; Liang, S.; Luan, W.; Cao, X. Extensive profiling of the expressions of tRNAs and tRNA-derived fragments (tRFs) reveals the complexities of tRNA and tRF populations in plants. Sci. China Life Sci. 2021, 64, 495–511. [Google Scholar] [CrossRef]
  52. Zhang, Y.; Li, M.; Hu, C. Exosomal transfer of miR-214 mediates gefitinib resistance in non-small cell lung cancer. Biochem. Biophys. Res. Commun. 2018, 507, 457–464. [Google Scholar] [CrossRef] [PubMed]
  53. Magee, R.; Rigoutsos, I. On the expanding roles of tRNA fragments in modulating cell behavior. Nucleic Acids Res. 2020, 48, 9433–9448. [Google Scholar] [CrossRef] [PubMed]
  54. Cherlin, T.; Magee, R.; Jing, Y.; Pliatsika, V.; Loher, P.; Rigoutsos, I. Ribosomal RNA fragmentation into short RNAs (rRFs) is modulated in a sex- and population of origin-specific manner. BMC Biol. 2020, 18, 38. [Google Scholar] [CrossRef] [PubMed]
  55. Fowler, E.K.; Mohorianu, I.; Smith, D.T.; Dalmay, T.; Chapman, T. Small RNA populations revealed by blocking rRNA fragments in Drosophila melanogaster reproductive tissues. PLoS ONE 2018, 13, e0191966. [Google Scholar] [CrossRef] [Green Version]
  56. Lodish, H.; Berk, A.; Zipursky, S.L. Molecular Cell Biology. In Processing of rRNA and tRNA, 4th ed.; WH Freeman: New York, NY, USA, 2000. [Google Scholar]
  57. Jackowiak, P.; Nowacka, M.; Strozycki, P.M.; Figlerowicz, M. RNA degradome—Its biogenesis and functions. Nucleic Acids Res. 2011, 39, 7361–7370. [Google Scholar] [CrossRef] [Green Version]
  58. Lambert, M.; Benmoussa, A.; Provost, P. A New Specific and Sensitive RT-qPCR Method Based on Splinted 5’ Ligation for the Quantitative Detection of RNA Species Shorter than microRNAs. Non-Coding RNA 2021, 7, 59. [Google Scholar] [CrossRef]
  59. Zheng, G.; Qin, Y.; Clark, W.C.; Dai, Q.; Yi, C.; He, C.; Lambowitz, A.M.; Pan, T. Efficient and quantitative high-throughput tRNA sequencing. Nat. Methods 2015, 12, 835–837. [Google Scholar] [CrossRef]
  60. Wright, C.; Rajpurohit, A.; Burke, E.E.; Williams, C.; Collado-Torres, L.; Kimos, M.; Brandon, N.J.; Cross, A.J.; Jaffe, A.E.; Weinberger, D.R.; et al. Comprehensive assessment of multiple biases in small RNA sequencing reveals significant differences in the performance of widely used methods. BMC Genom. 2019, 20, 513. [Google Scholar] [CrossRef] [Green Version]
  61. Li, Z.; Kim, S.W.; Lin, Y.; Moore, P.S.; Chang, Y.; John, B. Characterization of Viral and Human RNAs Smaller than Canonical MicroRNAs. J. Virol. 2009, 83, 12751–12758. [Google Scholar] [CrossRef] [Green Version]
  62. Laffont, B.; Corduan, A.; Rousseau, M.; Duchez, A.C.; Lee, C.H.C.; Boilard, E.; Provost, P. Platelet microparticles reprogram macrophage gene expression and function. Thromb. Haemost. 2016, 115, 311–323. [Google Scholar] [CrossRef]
  63. Duchez, A.-C.; Boudreau, L.H.; Naika, G.S.; Bollinger, J.; Belleannée, C.; Cloutier, N.; Laffont, B.; Mendoza-Villarroel, R.E.; Levesque, T.; Rollet-Labelle, E.; et al. Platelet microparticles are internalized in neutrophils via the concerted activity of 12-lipoxygenase and secreted phospholipase A2-IIA. Proc. Natl. Acad. Sci. USA 2015, 112, E3564–E3573. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Assefa, A.T.; Vandesompele, J.; Thas, O. On the utility of RNA sample pooling to optimize cost and statistical power in RNA sequencing experiments. BMC Genom. 2020, 21, 312. [Google Scholar] [CrossRef] [PubMed]
  65. Benmoussa, A.; Laugier, J.; Beauparlant, C.J.; Lambert, M.; Droit, A.; Provost, P. Complexity of the microRNA transcriptome of cow milk and milk-derived extracellular vesicles isolated via differential ultracentrifugation. J. Dairy Sci. 2020, 103, 16–29. [Google Scholar] [CrossRef] [PubMed]
  66. Keam, S.P.; Young, P.E.; McCorkindale, A.L.; Dang, T.H.; Clancy, J.L.; Humphreys, D.; Preiss, T.; Hutvagner, G.; Martin, D.I.; Cropley, J.E.; et al. The human Piwi protein Hiwi2 associates with tRNA-derived piRNAs in somatic cells. Nucleic Acids Res. 2014, 42, 8984–8995. [Google Scholar] [CrossRef] [PubMed]
  67. Kozomara, A.; Birgaoanu, M.; Griffiths-Jones, S. miRBase: From microRNA sequences to function. Nucleic Acids Res. 2019, 47, D155–D162. [Google Scholar] [CrossRef] [PubMed]
  68. Bustin, S.A.; Benes, V.; Garson, J.A.; Hellemans, J.; Huggett, J.; Kubista, M.; Mueller, R.; Nolan, T.; Pfaffl, M.W.; Shipley, G.L.; et al. The MIQE Guidelines: Minimum Information for Publication of Quantitative Real-Time PCR Experiments. Clin. Chem. 2009, 55, 611–622. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Opening of the 8- to 15-nt window revealed a high abundance of unusually short RNAs in human, mouse and fly samples. (AE) Length distribution of the small RNA reads (in nucleotides, nt) from human (A), mouse (B), D. melanogaster (C), A. thaliana (D), S. cerevisiae and S. pombe (E) samples in the 8- to 15-nt and the standard 16- to 30-nt windows of RNA length. (FJ) Relative proportion of the reads in either of the two windows of RNA length in each sample.
Figure 1. Opening of the 8- to 15-nt window revealed a high abundance of unusually short RNAs in human, mouse and fly samples. (AE) Length distribution of the small RNA reads (in nucleotides, nt) from human (A), mouse (B), D. melanogaster (C), A. thaliana (D), S. cerevisiae and S. pombe (E) samples in the 8- to 15-nt and the standard 16- to 30-nt windows of RNA length. (FJ) Relative proportion of the reads in either of the two windows of RNA length in each sample.
Ncrna 08 00034 g001
Figure 2. Most unusually short RNAs detected in the 8- to 15-nt window derive from rRNA in human, mouse, and fly samples. (AE) Biotype distribution of the small RNAs (percentage of RPM) from human (A), mouse (B), D. melanogaster (C), A. thaliana (D), S. cerevisiae and S. pombe (E) samples in the 8- to 15-nt window. (FJ) The small RNA biotype distribution (percentage of RPM) of the corresponding samples in the standard 16- to 30-nt window.
Figure 2. Most unusually short RNAs detected in the 8- to 15-nt window derive from rRNA in human, mouse, and fly samples. (AE) Biotype distribution of the small RNAs (percentage of RPM) from human (A), mouse (B), D. melanogaster (C), A. thaliana (D), S. cerevisiae and S. pombe (E) samples in the 8- to 15-nt window. (FJ) The small RNA biotype distribution (percentage of RPM) of the corresponding samples in the standard 16- to 30-nt window.
Ncrna 08 00034 g002
Figure 3. High abundance of fragments derived from longer, authentic non-coding RNAs detected by sRNA-Seq analysis in the 8- to 30-nt window. (AE) Heatmap of length distribution abundance (log10 of RPM) of miRNA-miRNA (A), piRNA-piRF (B), sdRNA (C), tRF (D), and rRF (E) biotypes from human, mouse, D. melanogaster, A. thaliana, S. cerevisiae and S. pombe samples in the 8- to 30-nt window. Gray boxes display conditions with zero reads. For each biotype, a k-mean clustering was generated according to the small RNA length abundance distribution (Euclidean distance, n = 3 clusters; in blue the most abundant, while the green is the lesser abundant group and in beige were grouped RNAs with a middle abundance).
Figure 3. High abundance of fragments derived from longer, authentic non-coding RNAs detected by sRNA-Seq analysis in the 8- to 30-nt window. (AE) Heatmap of length distribution abundance (log10 of RPM) of miRNA-miRNA (A), piRNA-piRF (B), sdRNA (C), tRF (D), and rRF (E) biotypes from human, mouse, D. melanogaster, A. thaliana, S. cerevisiae and S. pombe samples in the 8- to 30-nt window. Gray boxes display conditions with zero reads. For each biotype, a k-mean clustering was generated according to the small RNA length abundance distribution (Euclidean distance, n = 3 clusters; in blue the most abundant, while the green is the lesser abundant group and in beige were grouped RNAs with a middle abundance).
Ncrna 08 00034 g003
Figure 4. A 12-nt and a 13-nt rRF sequences are more abundant than microRNAs miR-25 and miR-30a in human and mouse samples. (A,B) Heatmap comparing the levels of 12-nt and 13-nt rRFs versus microRNAs miR-25 and miR-30a expressed either as reads per million (RPM) detected by sRNA-Seq (A) or in copy number detected by RT-qPCR (B). (C) Quantitation of 12 and 13-nt rRF levels by splinted ligation RT-qPCR analysis of total RNA extracted from HEK293 and N2a cells, in parallel to that of microRNAs miR-25 and miR-30a (n = 3 independent experiments). The detailed results of the statistical analyses are shown in Supplementary Table S14 (two-way ANOVA with Holm Sidak’s post-hoc test).
Figure 4. A 12-nt and a 13-nt rRF sequences are more abundant than microRNAs miR-25 and miR-30a in human and mouse samples. (A,B) Heatmap comparing the levels of 12-nt and 13-nt rRFs versus microRNAs miR-25 and miR-30a expressed either as reads per million (RPM) detected by sRNA-Seq (A) or in copy number detected by RT-qPCR (B). (C) Quantitation of 12 and 13-nt rRF levels by splinted ligation RT-qPCR analysis of total RNA extracted from HEK293 and N2a cells, in parallel to that of microRNAs miR-25 and miR-30a (n = 3 independent experiments). The detailed results of the statistical analyses are shown in Supplementary Table S14 (two-way ANOVA with Holm Sidak’s post-hoc test).
Ncrna 08 00034 g004
Table 1. Most abundant sdRNA sequences identified in the standard, 16- to 30-nt window by sRNA-Seq analysis.
Table 1. Most abundant sdRNA sequences identified in the standard, 16- to 30-nt window by sRNA-Seq analysis.
Length (nt)SequenceReads *Origin
H.
sapiens
HUVEC17GTTTGTGATGACTTACA99.95′ end of SNORD30
29TTGCTGTGATGACTATCTTAGGACACCTT94.45′ end of SNORD58C
29CTGCAGTGATGACTTTCTTAGGACACCTT4.45′ end of SNORD58A
PMN17GTTTGTGATGACTTACA99.95′ end of SNORD30
29TTGCTGTGATGACTATCTTAGGACACCTT54.25′ end of SNORD58C
29CTGCAGTGATGACTTTCTTAGGACACCTT29.75′ end of SNORD58A
HEK29317GTTTGTGATGACTTACA99.95′ end of SNORD30
29TTGCTGTGATGACTATCTTAGGACACCTT94.65′ end of SNORD58C
29CTGCAGTGATGACTTTCTTAGGACACCTT4.75′ end of SNORD58A
M.
musculus
Cerebellum17GTTCTGTGATGAGGCTC965′ end of SNORD83B, without the 3 first nt
29TTGCTGTGATGACTATCTTAGGACACCTT645′ end of SNORD58, without the 3 first nt
29CTGCAGTGATGACTATCTTAGGACACCTT175′ end of SNORD58, without the 3 first nt
PMN17GTTCTGTGATGAGGCTC985′ end of SNORD83B, without the 3 first nt
29TTGCTGTGATGACTATCTTAGGACACCTT735′ end of SNORD58, without the 3 first nt
29CTGCAGTGATGACTATCTTAGGACACCTT135′ end of SNORD58, without the 3 first nt
NIH17GTTCTGTGATGAGGCTC995′ end of SNORD83B, without the 3 first nt
29TTGCTGTGATGACTATCTTAGGACACCTT475′ end of SNORD58, without the 3 first nt
29CTGCAGTGATGACTATCTTAGGACACCTT355′ end of SNORD58, without the 3 first nt
N2a17GTTCTGTGATGAGGCTC995′ end of SNORD83B, without the 3 first nt
29TTGCTGTGATGACTATCTTAGGACACCTT455′ end of SNORD58, without the 3 first nt
29CTGCAGTGATGACTATCTTAGGACACCTT265′ end of SNORD58, without the 3 first nt
* % of reads from sequences having the same length. Nucleotide substitutions are in red.
Table 2. Most abundant piRNAs and piRFs identified by sRNA-Seq analysis.
Table 2. Most abundant piRNAs and piRFs identified by sRNA-Seq analysis.
Length (nt)SequenceOrigin (piRBase Name)
H. sapiensHUVEC15GACCAATGATGTGAApiR-hsa-4433698 5′ end
23TCCTGTACTGAGCTGCCCCGAGApiR-hsa-145507
23TCCTGTACTGAGCTGCCCCGAGTpiR-hsa-145507
PMN15TACAACTTTTGGCAApiR-hsa-7695930 3′ end
14ACAACTTTTGGCAApiR-hsa-7695930 3′ end
23TATTGCACTTGTCCCGGCCTGTApiR-hsa-137098
HEK29314GATGGGTGACCGCCpiR-hsa-741077 fragment
13ATGGGTGACCGCCpiR-hsa-741077 fragment
23TCCTGTACTGAGCTGCCCCGAGApiR-hsa-145507
M. musculusCerebellum15GCATTGGTGGTTCAGpiR-mmu-10912946 5′ end
18GCATTGGTGGTTCAGTGGpiR-mmu-10912946 5′ end
23AACCCGTAGATCCGAACTTGTGApiR-mmu-29307247 5′ end
23TCCTGTACTGAGCTGCCCCGAGApiR-mmu-25873647 5′ end
PMN15GCATTGGTGGTTCAGpiR-mmu-10912946 5′ end
16AGCGGAGTAGAGCAGTpiR-mmu-23655655 5′ end
23TCCTGTACTGAGCTGCCCCGAGApiR-mmu-25873647 5′ end
23TCCTGTACTGAGCTGCCCCGAGTpiR-mmu-25873647 5′ end
22CCTGTACTGAGCTGCCCCGAGApiR-mmu-25873647 5′ end
23GTACCCTGTAGATCCGAATTTGTpiR-mmu-11542414
NIH/3T316AGCGGAGTAGAGCAGTpiR-mmu-23655655 5′ end
22CCTGTACTGAGCTGCCCCGAGApiR-mmu-25873647 5′ end
23TCCTGTACTGAGCTGCCCCGAGApiR-mmu-25873647 5′ end
24GTCCTGTACTGAGCTGCCCCGAGApiR-mmu-25873647 5′ end
N2a12TCGCTGTGATGApiR-mmu-24106721
23CACCCGTAGAACCGACCTTGCGTpiR-mmu-31228201 5′ end
27GGCTCTGTGGCGCAATGGATAGCGCATpiR-mmu-5102689
28TGGCCAAGGATGAGAACTCTAACCTGACpiR-mmu-7884931
D.
melanogaster
13GAGGAAACTCTGGpiR-dme-108681 5′ end
15AAGGGAAGGGTATTGpiR-dme-5048778 5′ end
16AAAGGGAAGGGTATTGpiR-dme-5048778 5′ end
18CTGGGTCGGCCGGGGCGCpiR-dme-34359551 fragment
20TAGGGACGGTCGGGGGCATCpiR-dme-40694119 3′ end
21ATAGGGACGGTCGGGGGCATCpiR-dme-40694119 3′ end
Table 3. Most abundant sdRNA sequences identified in the 8- to 10-nt cluster by sRNA-Seq analysis.
Table 3. Most abundant sdRNA sequences identified in the 8- to 10-nt cluster by sRNA-Seq analysis.
Length (nt)SequenceReads *Origin
H. sapiensHUVEC9GGCTAATGA97.95′ end of SNORD-like-snoRNA,alias:ZL45,
ID:snoID_0724, without the 3 first nt
12TCGCTATGATGA36.95′ end of SNORD14B
10GGACCAATGA96.05′ end of SNORD114-12
PMN9GGCTAATGA99.75′ end of SNORD-like-snoRNA,alias:ZL45,
ID:snoID_0724, without the 3 first nt
12TCGCTATGATGA23.95′ end of SNORD14B
11CCCGTCTGACC22.03′ end of SNORD13
HEK2939GGCTAATGA100,05′ end of SNORD-like-snoRNA,alias:ZL45,
ID:snoID_0724, without the 3 first nt
10TGGCTAATGA45,25′ end of SNORD-like-snoRNA,alias:ZL45,
ID:snoID_0724, without the 2 first nt
11GTAAGTATATT41,4Middle of SNORA24L2
M. musculusCerebellum11CGCTGTGATGA32.65′ end of SNORD14C, without the first nt
9ATTGAGGAC7.9CD_40-1_ (chr16) 20684238,20684314
12AATTGTGGTAAC13.6Middle of SCARNA10
PMN11CGCTGTGATGA37.95′ end of SNORD14C, without the first nt
12AATTGTGGTAAC8.3Middle of SCARNA10
11ATTGTGGTAAC11.3Middle of SCARNA10
NIH/3T311CGCTGTGATGA19.55′ end of SNORD14C, without the first nt
11AGAGAGGTGAG18.1Middle of SNORA17
12TGCTGTGATGAC39.65′ end of SNORD58C, without the first nt
N2a11CGCTGTGATGA80.45′ end of SNORD14C, without the first nt
12AGGGATTGTGGG28.25′ end of SNORA71
10GCGGGTGTGG24.1SNORA74B
D.
melanogaster
12GTGGAGGTAAAG98.05′ end snoRNA:Psi18S-525f
9ATAGGGACG71.3snoRNA:Psi18S-525k (Dmel_CR34569)
10TTATAAACTG43.7PsiU2-38.40.42 (scaRNA:PsiU2-38.40.42)
A. thaliana 11AGATATGATGA95.15′ end of SnoR18a
10AATATTGAAA31.4Middle of SnoR96
11TAATATTGAAA1.4Middle of SnoR96
S. cerevisae 10CCTTCTGAAA22.1SnoRNA86
11TCCTTCTGAAA26.6SnoRNA86
10TCGGGGCTGA11.4SnoRNA86
S. pombe 9TCAACTGTA28.0SnR70
10TGTTCTGATG35.5SnR81
9TGTCTGATC6.7Snr41
* % of reads from sequence with the same length. Nucleotide substitutions are in red. The common motif at the 3′ end is highlighted in green.
Table 4. Most abundant tRF sequences identified by sRNA-Seq analysis.
Table 4. Most abundant tRF sequences identified by sRNA-Seq analysis.
Length (nt)SequenceOrigin
H. sapiensHUVEC18GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
18GCATGGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
15GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
PMN18GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
15GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
14TAGAATTCTCGCCTMiddle of tRNA-Gly-CCC-1-1
HEK29315GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
18GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
18GCATGGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
M. musculusCerebellum18GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
15GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
14CTTCGTGGTCGCCAPartial 3035a trf-3
PMN18GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
17CATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
15GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
NIH/3T318GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
15GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
N2a18GCATTGGTGGTTCAGTGG5′ end of tRNA-Gly-GCC-3-1
15GCATTGGTGGTTCAG5′ end of tRNA-Gly-GCC-3-1
D. melanogaster 30CATCGGTGGTTCAGTGGTAGAATGCTCGCC5′ end of tRNA-Gly-GCC-3-1
28GCATCGGTGGTTCAGTGGTAGAATGCTC5′ end of tRNA-Gly-GCC-3-1
17CCCGGGTTTCGGCACCA3023 trf-3
A. thaliana 15GGCTAGGTAACATAAPT-261581 tRF-5
16GGGGATGTAGCTCATA5′ end of tRNA-Ala-CGC-2-1
16GGCGGATGTAGCCAAGPT-218828 tRF-5
S. cerevisae 13GCGGATTTAGCTCtrna9-PheGAA
13GCTTCAGTAGCTCtrna19-MetCAT
28TCCTTAGTTCGATCCTGAGTGCGAGCTCtRNA-Cys-GCA-1-1
29TCCGTGATAGTTTAATGGTCAGAATGGGCtrna1-AspGTC
S. pombe 8GCTTCAGTtrna49-LeuCAG
8GCGGATTTtrna17-SerGCT
10CCCTGGGTTCtrna15-AlaTGC
Nucleotide substitutions are in red.
Table 5. Most abundant rRF sequences identified by sRNA-Seq analysis.
Table 5. Most abundant rRF sequences identified by sRNA-Seq analysis.
Length (nt)Sequence
H. sapiensHUVEC18TCGTACGACTCTTAGCGG
19CTCGTACGACTCTTAGCGG
18TCGTACGACTCTTAGCGG
12GACTCTTAGCGG
13CGACTCTTAGCGG
PMN12GACTCTTAGCGG
13CGACTCTTAGCGG
18TCGTACGACTCTTAGCGG
HEK29312GACTCTTAGCGG
13CGACTCTTAGCGG
18TCGTACGACTCTTAGCGG
M. musculusCerebellum12GACTCTTAGCGG
13CGACTCTTAGCGG
25CAAACGAGAACTTTGAAGGCCGAAG
PMN12GACTCTTAGCGG
13CGACTCTTAGCGG
18CGATACGACTCTTAGCGG
NIH12GACTCTTAGCGG
13CGACTCTTAGCGG
18CGATACGACTCTTAGCGG
N2a12GACTCTTAGCGG
13CGACTCTTAGCGG
18CGATACGACTCTTAGCGG
D. melanogaster 11ACTCTAAGCGG
12AACTCTAAGCGG
30TGCTTGGACTACATATGGTTGAGGGTTGTA
A. thaliana 12GAGTCTGGTAAT
14GGGATGGGTCGGCC
18TAGGATAGTGGCCTACCA
S. cerevisae 13TTGACCTCAAATC
18TATCTGGTTGATCCTGCC
19GCGGCTGTCTGATCAGGCA
S. pombe 13TAAAACTTTCAGC
13TTGACCTCAAATC
24TTTGACCTCAAATCAGGTAGGACT
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lambert, M.; Guellal, S.; Ho, J.; Benmoussa, A.; Laffont, B.; Bélanger, R.; Provost, P. An Expanded Landscape of Unusually Short RNAs in 11 Samples from Six Eukaryotic Organisms. Non-Coding RNA 2022, 8, 34. https://doi.org/10.3390/ncrna8030034

AMA Style

Lambert M, Guellal S, Ho J, Benmoussa A, Laffont B, Bélanger R, Provost P. An Expanded Landscape of Unusually Short RNAs in 11 Samples from Six Eukaryotic Organisms. Non-Coding RNA. 2022; 8(3):34. https://doi.org/10.3390/ncrna8030034

Chicago/Turabian Style

Lambert, Marine, Sara Guellal, Jeffrey Ho, Abderrahim Benmoussa, Benoit Laffont, Richard Bélanger, and Patrick Provost. 2022. "An Expanded Landscape of Unusually Short RNAs in 11 Samples from Six Eukaryotic Organisms" Non-Coding RNA 8, no. 3: 34. https://doi.org/10.3390/ncrna8030034

APA Style

Lambert, M., Guellal, S., Ho, J., Benmoussa, A., Laffont, B., Bélanger, R., & Provost, P. (2022). An Expanded Landscape of Unusually Short RNAs in 11 Samples from Six Eukaryotic Organisms. Non-Coding RNA, 8(3), 34. https://doi.org/10.3390/ncrna8030034

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop