Next Article in Journal
Expression Pattern of the SARS-CoV-2 Entry Genes ACE2 and TMPRSS2 in the Respiratory Tract
Next Article in Special Issue
Comparative Analysis of the Circular and Highly Asymmetrical Marseilleviridae Genomes
Previous Article in Journal
Viral Related Tools against SARS-CoV-2
Previous Article in Special Issue
The Unconventional Viruses of Ichneumonid Parasitoid Wasps
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Diversity of tRNA Clusters in the Chloroviruses

by
Garry A. Duncan
1,
David D. Dunigan
1,2 and
James L. Van Etten
1,2,*
1
Nebraska Center for Virology, University of Nebraska-Lincoln, Lincoln, NE 68583-0900, USA
2
Department of Plant Pathology, University of Nebraska-Lincoln, Lincoln, NE 68583-0833, USA
*
Author to whom correspondence should be addressed.
Viruses 2020, 12(10), 1173; https://doi.org/10.3390/v12101173
Submission received: 18 September 2020 / Revised: 12 October 2020 / Accepted: 12 October 2020 / Published: 16 October 2020
(This article belongs to the Collection Unconventional Viruses)

Abstract

:
Viruses rely on their host’s translation machinery for the synthesis of their own proteins. Problems belie viral translation when the host has a codon usage bias (CUB) that is different from an infecting virus due to differences in the GC content between the host and virus genomes. Here, we examine the hypothesis that chloroviruses adapted to host CUB by acquisition and selection of tRNAs that at least partially favor their own CUB. The genomes of 41 chloroviruses comprising three clades, each infecting a different algal host, have been sequenced, assembled and annotated. All 41 viruses not only encode tRNAs, but their tRNA genes are located in clusters. While differences were observed between clades and even within clades, seven tRNA genes were common to all three clades of chloroviruses, including the tRNAArg gene, which was found in all 41 chloroviruses. By comparing the codon usage of one chlorovirus algal host, in which the genome has been sequenced and annotated (67% GC content), to that of two of its viruses (40% GC content), we found that the viruses were able to at least partially overcome the host’s CUB by encoding tRNAs that recognize AU-rich codons. Evidence presented herein supports the hypothesis that a chlorovirus tRNA cluster was present in the most recent common ancestor (MRCA) prior to divergence into three clades. In addition, the MRCA encoded a putative isoleucine lysidine synthase (TilS) that remains in 39/41 chloroviruses examined herein, suggesting a strong evolutionary pressure to retain the gene. TilS alters the anticodon of tRNAMet that normally recognizes AUG to then recognize AUA, a codon for isoleucine. This is advantageous to the chloroviruses because the AUA codon is 12–13 times more common in the chloroviruses than their host, further helping the chloroviruses to overcome CUB. Among large DNA viruses infecting eukaryotes, the presence of tRNA genes and tRNA clusters appear to be most common in the Phycodnaviridae and, to a lesser extent, in the Mimiviridae.

1. Introduction

Viruses rely on most or all of their host’s translation machinery to synthesize their proteins. A conflict for viruses occurs when they have a codon usage bias (CUB) different from their host. However, some viruses have genes that help adapt to the host’s CUB in favor of their own CUB by encoding tRNAs. These include some large dsDNA viruses infecting eukaryotic organisms (see Morgado and Vicente, 2019 [1], for an extensive list) and bacteriophages [1,2]. Recently, tRNA-encoding genes have also been reported in some small ssDNA and ssRNA viruses [1].
Among the group of tRNA-encoding viruses are large viruses that infect algae [3,4,5,6], including the chloroviruses (family Phycodnaviridae) with a lytic lifestyle [7,8,9,10,11,12,13]. The chloroviruses have genomes that are 290 to 370 kb in size and are predicted to encode up to 400 proteins (CDSs) and 16 tRNAs [14]. They infect certain chlorella-like green algae that live in a symbiotic relationship with protists and metazoans (referred to as zoochlorellae), forming the holobiont. There are four known clades of chloroviruses based on the host they infect: viruses that infect Chlorella variabilis NC64A (referred to as NC64A viruses), viruses that only infect Chlorella variabilis Syngen 2–3 (referred to as Osy viruses), viruses that infect Chlorella heliozoae SAG 3.83 (referred to as SAG viruses), and viruses that infect Micractinium conductrix Pbi (referred to as Pbi viruses).
The genomes of more than 40 chloroviruses, representing 3 of the 4 clades, have been sequenced, assembled and annotated [13], and references cited therein. (The Osy-virus clade was not included in these analyses because at the time of this analysis few had been sequenced and annotated.) The genome GC content of the NC64A viruses ranges from 40–41%; the GC content of the Pbi viruses ranges from 44–47%, while the GC content of the SAG viruses ranges from 48–52% [13]. Their hosts, C. variabilis NC64A [15] and M. conductrix Pbi [16], both have nuclear genome GC contents of 67%, and it is assumed that C. heliozoae, which has not been sequenced, has a similar GC content.
Two NC64A chloroviruses, PBCV-1 and CVK2, were reported to have the interesting property of clustered tRNA genes [8,9]. Clusters of tRNA genes are found in a small percentage of organisms from all three domains of life [1,17,18,19]. tRNA-gene clusters have also been reported to occur in mitochondria and plastids [2,20,21]. In terms of viruses, Morgado and Vicente [1] recently reported that tRNA genes were in ~14% of the 13,200 virus genomes that they examined and that the tRNAs occurred in clusters in 1.7% of the tRNA encoding viruses, many of which were bacteriophage. Therefore, clusters of tRNAs encoded by viruses that infect eukaryotic organisms are not very common.
This current in silico investigation had several purposes: (i) To evaluate chlorovirus tRNA genes with respect to clustering. (ii) To compare and contrast the tRNA genes within and among the three clades of chloroviruses. (iii) To determine if there was a CUB in the algal host C. variabilis NC64A, and likewise, if there was a CUB among the chloroviruses. iv) To evaluate whether the chlorovirus clades acquired their tRNA gene clusters before or after their divergences from their most recent common ancestor (MRCA). (v) To investigate the role that tRNA isoleucine lysidine synthase, an enzyme encoded by almost all chloroviruses, might play in overcoming CUB. (vi) To examine the incidence of tRNA genes and gene clustering within other members of the Phycodnaviridae and in some other large DNA viruses.

2. Materials and Methods

2.1. Genomic Sequence Data

In total, the genomes of 41 sequenced, assembled (in some cases to draft genomes) and annotated chlorovirus genomes that represent three of the four known chlorovirus clades were used in this study: 14 NC64A viruses, 13 SAG viruses, and 14 Pbi viruses ([13] and references cited therein). Table S1 provides the accession numbers, as well as the collection source of the viruses. The genome of C. variabilis NC64A, which is the host of the NC64A viruses, was previously sequenced and annotated [15] (project accession number ADIC00000000). M. conductrix, the host of the Pbi virus, has also recently been sequenced and annotated [16], but it was not used in this study.

2.2. Identification and Localization of the tRNA Genes

A tRNA gene cluster was previously reported for the chlorovirus PBCV-1 [9,22], as well as chlorovirus CVK2 [8]. The other 40 chloroviruses were examined herein to determine if they had tRNA gene clusters, and their tRNA genes and gene order were also documented. The tRNA genes for each virus were identified by entering the genome accession numbers into the Nucleotide database at NCBI (ncbi.nlm.nih.gov). The Graphics link was used to identify the tRNA genes and their locations, as well as the surrounding non-tRNA genes. The putative tRNA sequences were verified using tRNAscan-SE [23,24]. Commonalities among the three clades of chloroviruses, as well as their differences, were also recorded. tRNA gene information for other nuclear cytoplasmic large DNA viruses (NCLDVs) was obtained from the NCBI database and from Table S2 from Morgado and Vincente [1].
For the chloroviruses examined herein, the non-tRNA genes immediately 5′ and 3′ to the tRNA clusters were sought in order to give insight as to whether the 41 chloroviruses acquired their tRNA gene clusters prior to or after their divergence into the three clades. The source of the data was the same NCBI website, and the data were processed as described above.

2.3. CUB in Chloroviruses and Their Host

Two NC64A viruses, PBCV-1 and AN69C, were selected to compare their codon usage to that of their host, C. variabilis NC64A using Geneious 11.1.5 (Biomatters Ltd., Auckland, New Zealand, https://www.geneious.com).

2.4. Phylogenetic Tree Construction of tRNA Genes

Three concatenated tRNA genes common to all three chlorovirus clades were analyzed by phylogenetic analysis: tRNATyr, tRNAGly, and tRNAArg. The following chloroviruses were not included in this analysis because they lacked one of the three tRNA genes that were concatenated: PBCV-1, NE-JV-1, AR158, NE-JV-4, GM0701.1, and MN0810.1. The tRNA sequences were retrieved from NCBI. The concatenated sequences were aligned by MUSCLE and curated by the Gblocks method; phylogenies were constructed using three tree building algorithms that included 100 bootstraps each: maximum likelihood, parsimony and BioNJ, a distance method (http://phylogeny.fr). The trees were saved in Newick format and MEGAX was used to present the final trees [25]. Phylogenetic trees using each of the three individual tRNA genes were also constructed by the procedures described above.

3. Results and Discussion

3.1. Chlorovirus tRNA Gene Clusters

All 41 chloroviruses encode tRNA genes, and the tRNA genes were located in clusters, usually with intergenic spacers of 1 to ~30 nucleotides (Table 1, Table 2 and Table 3). However, in a few cases non-tRNA genes were also present within the tRNA clusters, resulting in larger spacers between contiguous tRNA genes. Collectively, the 41 chloroviruses encoded a total of 410 tRNAs of which there were 17 different tRNAs for 14 different amino acids (3 synonymous codons).
The 14 NC64A viruses had from 7 (virus AR158) to 14 (viruses MA-1E, CvsA1, and CviK1) tRNA genes in their clusters (Table 1); the 13 SAG viruses had from 7 (NTS-1) to 13 (OR0704.3, Can0619SP, NE-JV-2) tRNA genes in their clusters (Table 2); and, the 14 Pbi viruses had from 3 (NE-JV-1) to 11 (Fr5L) genes in their tRNA gene clusters (Table 3). If one assumes that the original clusters are the sum of all the tRNAs genes in a clade, the NC64A chlorovirus cluster would consist of 14 tRNA genes; indeed, 3 of the NC64A viruses had all 14 tRNA genes. However, the sum of the SAG viruses was 18 tRNA genes, but the largest number of tRNA genes in any one extant virus was 13; the sum of the Pbi viruses was 15 tRNA genes, but the largest number of tRNA genes of any one extant virus was 11 tRNA genes. Some of the tRNA genes likely represent gene duplications. For example, all 41 chloroviruses had 2–4 tRNAAsn genes, in which the encoded tRNAs recognize the same codon AAC.
The order of the chlorovirus tRNA genes within a cluster is also reported in Table 1, Table 2 and Table 3. In general, there was synteny of tRNA genes among the viruses within a clade, but not between clades. (Exceptions to synteny within a clade are noted in the footnotes to Table 1, Table 2 and Table 3.) Seven tRNA genes were common to one or more members of all three chlorovirus clades, although not present in every virus isolate within any of the three clades (Table 4). Two tRNA genes were unique to the NC64A viruses, 3 tRNA genes were unique to the Pbi viruses and 4 tRNA genes were unique to the SAG viruses. Three additional tRNA genes were common to one or more members of the NC64A and SAG clades. The tRNAArg gene was found in all 41 chloroviruses, while 39/41 chloroviruses had the tRNAGly gene.
Due to the short intergenic spacers between clustered chlorovirus tRNA genes, we suspected that the tRNA gene clusters were transcribed as one transcript. Indeed, Nishida et al. [8] reported that chlorovirus CVK2 transcribed its tRNA gene cluster of 14 tRNAs into one transcript; furthermore, the RNA transcript was precisely processed into individual tRNA species by either some unknown virus-encoded or host-encoded RNase. In this regards, PBCV-1 encodes a functional RNase III enzyme [26], but its role in virus replication is unknown.
Differences and similarities in tRNA gene content were observed among individual chloroviruses within each of the three clades of chloroviruses. In some chloroviruses, not all of the tRNA genes within a gene cluster were immediately contiguous to one another, leading to interrupted tRNA-gene clusters in which one or several non-tRNA genes were interspersed within the tRNA-gene cluster. This was especially true for the Pbi viruses in which at least four member viruses had interrupted tRNA-gene clusters, as indicated by large nt intergenic spacer numbers in Table 3.
Additionally, all of the chlorovirus-encoded tRNAs lacked the 3′ terminal three nucleotides (CCA) necessary to be aminoacylated. Since fully functional tRNAs were reported for the NC64A chlorovirus CVK2 [8], the chlorovirus tRNAs, like tRNAs from cellular organisms [27], must either use an unidentified virus enzyme(s) or the host tRNA nucleotidytransferase to add the CCA nucleotides prior to tRNA aminoacylation.

3.2. Genome Location of the Chlorovirus tRNA Clusters

The tRNA clusters in the NC64A and Pbi viruses were located near the center of their genomes except for the Pbi virus NE-JV-1, which was located in the last third of its genome; NE-JV-1 is unusual in many other aspects including the fact that it only has 3 tRNA genes [13]. The tRNA clusters in all of the SAG viruses were located in the first third of their genomes. It is interesting to note that all 13 SAG viruses had a tRNAThr gene located ~30 kb beyond the 3′ end of the tRNA gene cluster, placing the tRNAThr genes near the center of their respective genomes.
One possible explanation is that the tRNA cluster in the SAG viruses was translocated in the 5′ direction but without the tRNAThr gene in the SAG clade’s MRCA. Indeed, if a translocation event did take place, then the original location of the cluster in the ancestral SAG virus would have been more towards the center of its genome. In support of this notion, all of the Pbi viruses have a tRNAThr at the 3’ end of their respective clusters.

3.3. Potential tRNA Transcription Promoters

As noted above, Nishida et al. [8] reported that the tRNA gene cluster from chlorovirus CVK2 was transcribed as a single RNA, which was then processed to the individual tRNAs. RNA polymerase III utilizes type 2 (Class II) promoters to begin transcription of tRNA genes, individually or in clusters [28,29,30]. Type 2 promoters consist of two 11-mer intragenic sequences referred to as Box A and B, which are first recognized by transcription factor IIIC (TFIIIC) that begins the cascade of events leading to the attachment of RNA polymerase III. The chlorovirus tRNA genes have these two boxes, as do the 13 orphaned tRNAThr genes in all 13 SAG viruses located approximately 30 kb downstream of their respective tRNA clusters (Figure S1). This suggests that the tRNAThr genes in the SAG viruses are transcribed independently of their respective tRNA clusters.

3.4. Presence of CUB Differences in Host and Viruses

Because nuclear genomic codon usage data are available for C. variabilis NC64A and its viruses, we were able to compare their respective codon usages (Table 5). Codon usage was available for 13/14 NC64A viruses, but only two representative NC64A viruses, PBCV-1 and AN69C, were chosen for the comparative study. CUB favoring codons with high GC content were noted in the host alga C. variabilis NC64A whose genome is 67% GC (Table 5), while the two NC64A viruses have GC contents of 40% [13] and a CUB favoring AU (Table 5). For example, in the standard universal code there are four codons for the amino acid alanine, differing only in the third (3′) base of the codon. Additionally, in the standard universal code there are four codons for the amino acid glycine, differing only in the third (3′) base of the codon. In C. variabilis NC64A, the two codons whose third base was C or G (GGC and GGG) were the most common, while the two most common in PBCV-1 and AN69C ended in A and U (GGA and GGU) (Table 5). A parallel example occurs in the usage of the two synonymous codons for glutamic acid; GAG was almost exclusively used by C. variabilis NC64A, while GAA was the most common in the two NC64A viruses. The same was true for all amino acids encoded by two synonymous codons (asparagine, aspartic acid, cysteine, glutamine, histidine, lysine, phenylalanine and tyrosine), as well as other amino acids with more than two synonymous codons. Furthermore, the isoleucine codon AUA (AU-rich) occurred in 2.44% of all PBCV-1 codons, while it occurred in only 0.20% of codons in C. variabilis NC64A, a 12-fold difference (Table 5); hence, in the virus, natural selection appears to have favored the retention of the gene that encodes the cognate tRNA for this codon. Likewise, the lysine codon AAA (AU rich) occurred in 4.7% of all PBCV-1 codons, but in only 0.25% of host codons, nearly a 20-fold difference. The leucine codon UUA was the rarest codon used by C. variabilis NC64A (0.06% of all codons), while it was a moderately common codon used by PBCV-1 (1.39% of all codons) (Table 5). As a final example, there are six synonymous codons for the amino acid arginine, but only one codon that had one guanine or cytosine (AGA), while the other five codons had a minimum of two guanines and/or cytosines. 1.43% of all viral codons were AGA for arginine, but only 0.25% of C. variabilis NC64A codons were AGA. Indeed, the two arginine codons with three guanines and cytosines (CGC and CGG) were the two most common in C. variabilis NC64A (3.22% and 2.11%, respectively), while the same two codons in the two viruses were 0.75% and 0.70%, respectively.

3.5. Not All Chlorovirus tRNAs Assist in Overcoming CUB

The CUB of the host is overcome by some but not all tRNA members in the viral tRNA clusters. For example, PBCV-1 encodes 11 tRNAs but only 7 help it to overcome CUB; likewise, AN69C encodes 10 tRNAs but only 8 favor its own CUB (Table 5). Thus, we speculate that the cluster as a unit is under natural selection, rather than each individual tRNA gene, because some of the tRNA genes in the clusters are neutral and do not bestow any positive selective advantage. A reasonable explanation for these observations is that the neutral tRNA genes are unvetted by natural selection, acting as hitchhikers due to their close linkage to tRNA genes that help the viruses overcome the CUB of the host. To illustrate this point, all three clades of chloroviruses had 2–4 tRNAAsn genes whose tRNAs only recognize the codon AAC; none of the 41 chloroviruses encode a tRNAAsn that recognizes the alternate codon AAU, which was 10 times more common in the two viruses than in C. variabilis (Table 5), and which would presumably enhance viral protein translation. Of no surprise, the most common of the two Asn codons in C. variabilis is AAC.
The presence of the AAC codon cognate tRNAAsn genes, which appears to be neutral in benefit to all 41 chloroviruses, suggests that they were acquired in the MRCA tRNA cluster due to an evolutionary accident. That is, their presence appears to be an artifact of evolutionary history by which some neutral tRNA genes were acquired in the tRNA cluster along with selectively advantageous genes by some mechanism, such as horizontal gene transfer (HGT). (This kind of event is similar to the frozen accident concept first proposed by Crick [31].) In the case at hand, neutral tRNAAsn genes may be maintained in the tRNA gene clusters seemingly due to their tight linkage to selectively advantageous viral tRNA genes in the cluster. This seems particularly conceivable if the tRNA cluster is encoded as a single RNA transcript, as it is with chlorovirus CVK2 [8]. The same argument may explain the presence in PBCV-1 of the tRNALys gene for the cognate codon AAG. PBCV-1 encodes both of the tRNAs genes that recognize the two lysine codons, AAG and AAA. However, the AAA codon was >20 times more common in PBCV-1 than in C. variabilis, while AAG was the most common lysine codon in C. variabilis; hence, the former appears to be maintained by positive selection, while the latter appears to provide a neutral benefit to PBCV-1.
Although the tRNA analysis software (http://lowelab.ucsc.edu/tRNAscan-SE/) predicts that all of the encoded chlorovirus tRNAs are functional, with the noted exceptions of several tRNA genes whose mutations have rendered them as pseudogenes, appropriate laboratory-based verification of functionality was beyond the scope of this study. However, Nishida et al. (1999) did demonstrate that some of the tRNAs encoded by CVK2, an NC64A virus, are functional. While we think it is unlikely, we cannot rule out the possibility that some or all of the viral-encoded tRNAs discussed herein are nonfunctional.

3.6. Chloroviruses Encode a Putative tRNA Isoleucine Lysidine Synthase (TilS)

Thirty-nine of the 41 chloroviruses encode another putative enzyme involved in codon usage, TilS. The methionine codon AUG is normally recognized by its cognate tRNAMet with the 3′ UAC 5′ anticodon. The TilS enzyme ligates lysine to the cytidine in the 5′ position of the tRNAMet anticodon; this modified cytidine becomes lysidine, which is complementary to adenine in the 3′ position of the codon, rather than guanine (Figure 1). As such, this modified tRNA then behaves as a tRNAIle and recognizes the isoleucine AUA codon [32,33]. The AUA codon is 12–13 times more common in the two NC64A viruses than in the host, C. variabilis (Table 5). Thus, we suspect this enzyme provides one additional mechanism that helps the viruses overcome CUB of the host by enabling more efficient viral protein synthesis, diminishing the chances of a ribosome stall during elongation when AUA codons are encountered. A previous PBCV-1 transcription RNA-seq study reported that the tils gene was transcribed very early during virus infection, detected by 7 min post infection [34].
All chloroviruses also encode a homolog of translational elongation factor 3 (EF-3) [13]. EF-3 plays a role in optimizing the accuracy of mRNA decoding at the ribosomal acceptor site during protein synthesis in fungi [35,36]; EF-3 has been reported recently in algae [37]. The role this putative enzyme plays in chlorovirus translation is unknown.

3.7. Acquisition of Chlorovirus tRNA Clusters

One question we addressed was: Did the three chlorovirus clades independently acquire their tRNA clusters, e.g., by HGT, or was the tRNA gene cluster acquired by the MRCA, i.e., the last common ancestor prior to splitting into the three clades? We used several lines of inquiry to address this question. First, as described above, the three clades of chloroviruses had 7 tRNA genes in common with one another (Table 4), including 2–4 tRNAAsn genes, in which the encoded tRNAs recognize the codon AAC. There were, however, considerable differences in composition and order between clades, unlike the similar composition and synteny within each of the three clades (Table 1, Table 2 and Table 3).
The second line of inquiry focused on the protein-encoding genes that border the 5′ and 3′ sides of the tRNA clusters. Supplementary Tables S2–S4 report the commonness of protein-encoding genes (orthologous genes) within each of the three clades, but the protein-encoding genes surrounding the tRNA clusters of each clade were not orthologous across the three clades.
A third line of inquiry was to examine phylogenetic tree constructs in which the three tRNA genes common to all three clades of chloroviruses (tRNAGly, tRNAArg, and tRNATyr) were concatenated (Figure 2). The Pbi and SAG clades clustered together, as expected. The NC64A viruses, however, formed two distinct clades. The tree is supported by the results shown in Table 1, in which there is a pattern of presence/absence of tRNAs genes that distinguishes two groups of NC64A viruses.
Phylogenetic trees were also constructed for each of the three individual tRNA genes, tRNATyr, tRNAGly, and tRNAArg (Figures S2–S4, respectively). For each of the three individual tRNA gene analyses, most of the chloroviruses tended to cluster within their clade, but there were a number of exceptions, and it was the exceptions that support a single HGT event hypothesis in the MRCA. For example, the tRNATyr genes of four of the NC64A viruses were more similar in nucleotide sequence to the SAG viruses than to other NC64A viruses (Figure S2). Likewise, for the tRNAGly gene, half of the NC64A viruses were more similar to SAG viruses, while the other half of the NC64A viruses were more similar to Pbi viruses (Figure S3); furthermore, there were two Pbi and two SAG viruses that were more similar to a subclade of NC64A viruses than to other members of their own clade.
A fourth line of inquiry focused on the predicted intron in the tRNATyr gene found in 34/41 chloroviruses representing all three clades. We examined the locations and sequences of the introns and found that the introns were located in identical positions in all of the tRNATyr genes, one nucleotide from the anticodon (3′ direction). While the intron lengths and sequences were nearly identical within a clade, there were slight differences in intron length between the three clades: NC64A 13–14 nt; Pbi 13 nt; SAG 10–11 nt. Because introns within tRNATyr genes are relatively rare, it is unlikely that all of these tRNATyr genes with introns were acquired independently. Rather, from an Occam’s razor line of inference, the evidence supports a single origin for the chlorovirus tRNATyr gene and, therefore gene clusters, prior to the divergence of the three clades from the MRCA.
It is also noted that all 410 tRNA genes of all 41 chloroviruses are on the same identical DNA strand and none on the alternate DNA strand. Separate acquisition events over time would likely have resulted in at least some tRNA genes being acquired on the alternate DNA strand.

3.8. Generation of Chlorovirus tRNAs

The ability of tRNA genes to proliferate is thought to be similar to the mechanism by which mobile elements can lead to intragenomic gene duplications [38]. Duplications and losses are clearly evident among and within all three chlorovirus clades; for example, the Pbi virus CZ-2 had four tRNAAsn genes that all recognized the same ACC codon, while almost all of the other Pbi viruses had just two copies. Likewise, the NC64A virus MA-1E had three tRNALys genes that all recognized the same AAG codon, while others in the same clade had just one, and virus AR158 had none. There are other similar examples among the three chlorovirus clades.
Pope et al. [39] implicated a homing endonuclease (HNH) in the generation of tRNA genes in mycobacteriophages. While not immediately adjacent to the tRNA gene clusters, all of the chloroviruses encoded at least two putative HNHs (e.g., orthologs of A087R and A422R in PBCV-1). A second source of HNH or other endonucleases could be from co-infecting viruses with such genes in their repertoire that generated tRNA gene duplications, losses, and translocations within the chloroviruses. Previously, we proposed three potential sources of HGT for the chlorovirus protein-encoding genes that might also explain the tRNA clusters in the chloroviruses [13]: (i) viral host(s), although there are only a few NC64A chlorovirus genes that have probably been acquired from C. variabilis NC64A; however, the viruses probably had at least one other host through evolutionary time; (ii) bacteria, because some of the chlorovirus genes appear to be of bacterial origin; and, (iii) from other host-competing viral species. Plastids and mitochondria might also have contributed to the viruses via HGT, at least for the tRNA gene clusters, because those organelles also have tRNA gene clusters [2,20,21]. These results are consistent with the analyses and conclusions of Fan et al. [20] who sequenced the mitochondria and plastids of the three chlorovirus hosts, C. variabilis, C. heliozoae, and M. conductrix. Morgado and Vicente [1] suggested that viruses with tRNA clusters might be the source of dissemination of such clustered structures in the three domains of life.

3.9. tRNA Gene Clusters in other Phycodnavirues

Most of the other viruses in the Phycodnaviridae family also encode tRNAs that are in gene clusters (Table 6). Micromonas pusilla virus 12T (MpV12T) encodes six tRNA genes, five of which are clustered. The sixth tRNA gene, tRNAThr, is an orphan ~30 kb beyond the tRNALeu gene, the 3′ member of the tRNA cluster. The position of these two tRNA genes and the distance between them is the same as in the chlorovirus SAG clade that also has a 30 kb gap between the genes that encode tRNALeu and tRNAThr. MpV12T also encodes a tRNATyr, as do the chloroviruses, but unlike the chloroviruses, MpV12T does not have an intron in its tRNATyr gene. On the other hand, Ostreococcus lucimarinus virus 7 (OlV7) [6], which has five tRNA genes, four of which are clustered, encodes a tRNATyr that does have a 15 nt intron located in the same position as the chloroviruses. In fact, the tRNA genes in the Micromonas and Ostreococcus virus clusters are similar to some of the tRNA gene clusters in the NC64A viruses, suggesting the two groups of viruses might have a common evolutionary ancestor. The only phycodnavirus that lacks tRNA encoding genes is Ectocarpus siliculosus virus 1 (EsV1) (Table 6); however, EsV1 is unusual because it is the only phycodnavirus that has a lysogenic life style.

3.10. tRNA Genes and Gene Clusters in other Large DNA Viruses

Viruses in the Phycodnaviridae family are proposed to have an ancient, common evolutionary ancestry with some other large dsDNA viruses in the Poxviridae, Iridoviridae, Ascoviridae, Asfarviridae, Mimiviridae, Marseillivirdae, and Pithoviridae families that infect eukaryotes. However, many more large DNA viruses are rapidly being discovered, including Pandoraviruses, Faustoviruses, Mollivirus, Kaumoebavirus, Cetratvirus, Pacmanvirus, and Orpheoviruses, and the evolutionary relationships among these viruses are just starting to be analyzed [40,41]. Collectively, these giant viruses are referred to as NCLDVs [42,43,44].
Therefore, we examined a few representatives from all of these groups to see if they encoded tRNAs and, if so, did the tRNA genes occur in clusters (Table 7). All of the Mimiviridae viruses that were examined encoded tRNA genes and 8/9 had tRNA gene clusters. Most of the remaining NCLDVs, with few exceptions, encoded 0–3 tRNA genes, and none of them had 2 or more tRNA genes in clusters. A few baculoviruses and herpes viruses were also examined for tRNA-encoding genes. Nine of the baculoviruses examined encoded 1 tRNA gene. The 6 herpes viruses that were examined contained from 1 to 19 tRNA genes, but only one of the viruses had clustered tRNA genes (Table 7). Thus, the Phycodnaviridae viruses, along with many viruses in the Mimiviridae family, appear to be the exceptions among the large DNA viruses by having clustered tRNA genes.

4. Conclusions

The chloroviruses overcome CUB of their algal hosts by encoding clusters of tRNA-encoding genes, with most of the tRNAs favoring the viruses. However, the evolutionary events associated with the tRNA clusters differed among the viruses, even within the same clade, because very few of the tRNAs were conserved among all of the chloroviruses. In addition, 39/41 chloroviruses encode a putative enzyme that modifies a tRNA to recognize a codon that is 12–13 times more prominent in the chloroviruses than their host, further acting to curb CUB. As well, it has not escaped our attention that Wobble, which has been extensively investigated and updated, may assist viral protein synthesis. For example, the two codons for tyrosine, UAU and UAC, can be recognized by the same tRNATyr due to Wobble at the 5′ G of the tRNA, which can base pair with the U or C at the 3′ position of the codon. Wobble can also facilitate some of the other encoded amino acids including the codons for glutamic acid, cysteine, histidine, phenylalanine, etc. All the chloroviruses examined to date have clustered tRNA genes, as do many of their Phycodnaviridae relatives. With the exception of Mimiviridae viruses, most other large DNA viruses not only have few, if any, tRNA genes, but when a virus encodes two or more tRNA genes, they are not clustered.
Finally, we are aware that tRNAs are turning out to play other roles in cells besides protein synthesis [45], so it is always possible that the viral encoded tRNAs serve some other functions.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/12/10/1173/s1, Figure S1: Alignment of tRNAThr genes from 13 SAG viruses, Figure S2: Phylogenetic tree of tRNATyr genes from 35 chloroviruses representing all three clades, NC64A, Pbi and SAG, Figure S3: Phylogenetic tree of tRNAGly genes from 39 chloroviruses representing all three clades, NC64A, Pbi and SAG, Figure S4: Phylogenetic tree of tRNAArg genes from all 41 chloroviruses representing all three clades, NC64A, Pbi and SAG. Table S1: Accession numbers for 41 chloroviruses grouped into three clades that have three different hosts, Table S2: NC64A viruses: 5’ and 3’ genes closest to the tRNA gene cluster, Table S3: SAG viruses: 5’ and 3’ genes closest to the tRNA gene cluster, Table S4: Pbi viruses: 5’ and 3’ genes closest to the tRNA gene cluster.

Author Contributions

Conceptualization, G.A.D. and J.L.V.E.; methodology, G.A.D.; software, G.A.D.; validation, G.A.D., D.D.D., and J.L.V.E.; formal analysis, G.A.D., D.D.D., and J.L.V.E.; investigation, G.A.D.; resources, J.L.V.E.; data curation, G.A.D. and D.D.D.; writing—original draft preparation, G.A.D. and J.L.V.E.; writing—review and editing, G.A.D., D.D.D., J.L.V.E.; visualization, G.A.D. and D.D.D.; supervision, J.L.V.E.; project administration, J.L.V.E.; funding acquisition, J.L.V.E. and D.D.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Science Foundation under Grant No. 1736030 and the University of Nebraska-Lincoln Agricultural Research Division and the Office of Research and Economic Development.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Morgado, S.M.; Vicente, A.C.P. Global in-silico scenario of tRNA genes and their organization in virus genomes. Viruses 2019, 11, 180. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Abe, T.; Inokuchi, H.; Yamada, Y.; Muto, A.; Iwasaki, Y.; Ikemura, T. tRNADB-CE: tRNA gene database well-timed in the era of big sequence data. Front. Genet. 2014, 5. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Michely, S.; Toulza, E.; Subirana, L.; John, U.; Cognat, V.; Maréchal-Drouard, L.; Grimsley, N.; Moreau, H.; Piganeau, G. Evolution of codon usage in the smallest photosynthetic eukaryotes and their giant viruses. Genome Biol. Evol. 2013, 5, 848–859. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Pagarete, A.; Lanzén, A.; Puntervoll, P.; Sandaa, R.A.; Larsen, A.; Larsen, J.B.; Allen, M.J.; Bratbak, G. Genomic sequence and analysis of EhV-99B1, a new coccolithovirus from the Norwegian fjords. Intervirology 2013, 56, 60–66. [Google Scholar] [CrossRef] [PubMed]
  5. Santini, S.; Jeudy, S.; Bartoli, J.; Poirot, O.; Lescot, M.; Abergel, C.; Barbe, V.; Wommack, K.E.; Noordeloos, A.A.M.; Brussaard, C.P.D.; et al. Genome of Phaeocystis globosa virus PgV-16T highlights the common ancestry of the largest known DNA viruses infecting eukaryotes. Proc. Natl. Acad. Sci. USA 2013, 110, 10800–10805. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Derelle, E.; Monier, A.; Cooke, R.; Worden, A.Z.; Grimsley, N.H.; Moreau, H. Diversity of viruses infecting the green microalga Ostreococcus lucimarinus. J. Virol. 2015, 89, 5812–5821. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Li, Y.; Lu, Z.; Sun, L.; Ropp, S.; Kutish, G.F.; Rock, D.L.; Van Etten, J.L. Analysis of 74 kb of DNA located at the right end of the 330-kb chlorella virus PBCV-1 genome. Virology 1997, 237, 360–377. [Google Scholar] [CrossRef] [Green Version]
  8. Nishida, K.; Kawasaki, T.; Fujie, M.; Usami, S.; Yamada, T. Aminoacylation of tRNAs encoded by chlorella virus CVK2. Virology 1999, 263, 220–229. [Google Scholar] [CrossRef] [Green Version]
  9. Lee, D.Y.; Graves, M.V.; Van Etten, J.L.; Choi, T.J. Functional implication of the tRNA genes encoded in the chlorella virus PBCV-1 genome. Plant. Pathol. J. 2005, 21, 334–342. [Google Scholar] [CrossRef] [Green Version]
  10. Fitzgerald, L.A.; Graves, M.V.; Li, X.; Feldblyum, T.; Nierman, W.C.; Van Etten, J.L. Sequence and annotation of the 369-kb NY-2A and the 345-kb AR158 viruses that infect Chlorella NC64A. Virology 2007, 358, 472–484. [Google Scholar] [CrossRef] [Green Version]
  11. Fitzgerald, L.A.; Graves, M.V.; Li, X.; Hartigan, J.; Pfitzner, A.J.; Hoffart, E.; Van Etten, J.L. Sequence and annotation of the 288-kb ATCV-1 virus that infects an endosymbiotic chlorella strain of the heliozoon Acanthocystis turfacea. Virology 2007, 362, 350–361. [Google Scholar] [CrossRef] [Green Version]
  12. Fitzgerald, L.A.; Graves, M.V.; Li, X.; Feldblyum, T.; Hartigan, J.; Van Etten, J.L. Sequence and annotation of the 314-kb MT325 and the 321-kb FR483 viruses that infect Chlorella Pbi. Virology 2007, 358, 459–471. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Jeanniard, A.; Dunigan, D.D.; Gurnon, J.R.; Agarkova, I.V.; Kang, M.; Vitek, J.; Duncan, G.; McClung, O.W.; Larsen, M.; Claverie, J.-M.; et al. Towards defining the chloroviruses: A genomic journey through a genus of large DNA viruses. BMC Genom. 2013, 14, 1–14. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Van Etten, J.L.; Agarkova, I.V.; Dunigan, D.D. Chloroviruses. Viruses 2020, 12, 20. [Google Scholar] [CrossRef] [Green Version]
  15. Blanc, G.; Duncan, G.; Agarkova, I.; Borodovsky, M.; Gurnon, J.; Kuo, A.; Lindquist, E.; Lucas, S.; Pangilinan, J.; Polle, J.; et al. The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex. Plant. Cell 2010, 22, 2943–2955. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Arriola, M.B.; Velmurugan, N.; Zhang, Y.; Plunkett, M.H.; Hondzo, H.; Barney, B.M. Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium conductrix SAG 241.80: Implications to maltose excretion by a green alga. Plant J. 2018, 93, 566–586. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Bermudez-Santana, C.; Attolini, C.S.O.; Kirsten, T.; Engelhardt, J.; Prohaska, S.J.; Steigele, S.; Stadler, P.F. Genomic organization of eukaryotic tRNAs. BMC Genom. 2010, 11, 270. [Google Scholar] [CrossRef] [Green Version]
  18. Morgado, S.M.; Vicente, A.C.P. Beyond the limits: tRNA array units in Mycobacterium genomes. Front. Microbiol. 2018, 9. [Google Scholar] [CrossRef]
  19. Morgado, S.M.; Vicente, A.C.P. Exploring tRNA gene cluster in archaea. Mem. Inst. 2019, 114. [Google Scholar] [CrossRef]
  20. Fan, W.; Guo, W.; Van Etten, J.L.; Mower, J.P. Multiple origins of endosymbionts in Chlorellaceae with no reductive effects on the plastid or mitochondrial genomes. Sci. Rep. 2017, 7, 10101. [Google Scholar] [CrossRef] [Green Version]
  21. Friedrich, A.; Jung, P.P.; Hou, J.; Neuvéglise, C.; Schacherer, J. Comparative mitochondrial genomics within and among yeast species of the Lachancea genus. PLoS ONE 2012, 7, e47834. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Dunigan, D.D.; Cerny, R.L.; Bauman, A.T.; Roach, J.C.; Lane, L.C.; Agarkova, I.V.; Wulser, K.; Yanai-Balser, G.M.; Gurnon, J.R.; Vitek, J.C.; et al. Paramecium bursaria Chlorella virus 1 proteome reveals novel architectural and regulatory features of a giant virus. J. Virol. 2012, 86, 8821–8834. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Lowe, T.M.; Eddy, S.R. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997, 25, 955–964. [Google Scholar] [CrossRef]
  24. Lowe, T.M.; Chan, P.P. tRNAscan-SE On-line: Integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016, 44, W54–W57. [Google Scholar] [CrossRef]
  25. Stecher, G.; Tamura, K.; Kumar, S. Molecular evolutionary genetics analysis (MEGA) for macOS. Mol. Biol. Evol. 2020, 37, 1237–1239. [Google Scholar] [CrossRef] [PubMed]
  26. Zhang, Y.; Calin-Jageman, I.; Gurnon, J.R.; Choi, T.J.; Adams, B.; Nicholson, A.W.; Van Etten, J.L. Characterization of a chlorella virus PBCV-1 encoded ribonuclease III. Virology 2003, 317, 73–83. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Rak, R.; Dahan, O.; Pilpel, Y. Repertoires of tRNAs: The couplers of genomics and proteomics. Annu. Rev. Cell Dev. Biol. 2018, 34, 239–264. [Google Scholar] [CrossRef]
  28. Diebel, K.W.; Claypool, D.J.; van Dyk, L.F. A conserved RNA polymerase III promoter required for gammaherpesvirus TMER transcription and microRNA processing. Gene 2014, 544, 8–18. [Google Scholar] [CrossRef] [Green Version]
  29. Arimbasseri, A.G.; Maraia, R.J. RNA polymerase III advances: Structural and tRNA functional views. Trends Biochem. Sci. 2016, 41, 546–559. [Google Scholar] [CrossRef] [Green Version]
  30. Tatosyan, K.A.; Stasenko, D.V.; Koval, A.P.; Gogolevskaya, I.K.; Kramerov, D.A. TATA-like boxes in RNA polymerase III promoters: Requirements for nucleotide sequences. Int. J. Mol. Sci. 2020, 21, 3706. [Google Scholar] [CrossRef]
  31. Crick, F.H.C. The origin of the genetic code. J. Mol. Biol. 1968, 38, 367–379. [Google Scholar] [CrossRef]
  32. Nakanishi, K.; Fukai, S.; Ikeuchi, Y.; Soma, A.; Sekine, Y.; Suzuki, T.; Nureki, O. Structural basis for lysidine formation by ATP pyrophosphatase accompanied by a lysine-specific loop and a tRNA-recognition domain. Proc. Natl. Acad. Sci. USA 2005, 102, 7487–7492. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Suzuki, T.; Miyauchi, K. Discovery and characterization of tRNAIle lysidine synthetase (TilS). FEBS Lett. 2010, 584, 272–277. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Blanc, G.; Mozar, M.; Agarkova, I.V.; Gurnon, J.R.; Yanai-Balser, G.M.; Rowe, J.M.; Xia, Y.; Riethoven, J.-J.; Dunigan, D.D.; Van Etten, J.L. Deep RNA sequencing reveals hidden features and dynamics of early gene transcription in Paramecium bursaria chlorella virus 1. PLoS ONE 2014, 9, e90989. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Belfield, G.P.; Tuite, M.F. Translation elongation factor 3: A fungus-specific translation factor? Mol. Microbiol. 1993, 9, 411–418. [Google Scholar] [CrossRef] [PubMed]
  36. Belfield, G.P.; Ross-Smith, N.J.; Tuite, M.F. Translation elongation factor-3 (EF-3): An evolving eukaryotic ribosomal protein? J. Mol. Evol. 1995, 41, 376–387. [Google Scholar] [CrossRef]
  37. Mateyak, M.K.; Pupek, J.K.; Garino, A.E.; Knapp, M.C.; Colmer, S.F.; Kinzy, T.G.; Dunaway, S. Demonstration of translation elongation factor 3 activity from a non-fungal species, Phytophthora infestans. PLoS ONE 2018, 13, e0190524. [Google Scholar] [CrossRef] [Green Version]
  38. Velandia-Huerto, C.A.; Berkemer, S.J.; Hoffmann, A.; Retzlaff, N.; Romero Marroquín, L.C.; Hernández-Rosales, M.; Stadler, P.F.; Bermúdez-Santana, C.I. Orthologs, turn-over, and remolding of tRNAs in primates and fruit flies. BMC Genom. 2016, 17, 617. [Google Scholar] [CrossRef] [Green Version]
  39. Pope, W.H.; Anders, K.R.; Baird, M.; Bowman, C.A.; Boyle, M.M.; Broussard, G.W.; Chow, T.; Clase, K.L.; Cooper, S.; Cornely, K.A.; et al. Cluster M mycobacteriophages Bongo, PegLeg, and Rey with unusually large repertoires of tRNA isotypes. J. Virol. 2014, 88, 2461–2480. [Google Scholar] [CrossRef] [Green Version]
  40. Koonin, E.V.; Yutin, N. Multiple evolutionary origins of giant viruses. F1000Research 2018, 7, 1840. [Google Scholar] [CrossRef] [Green Version]
  41. Koonin, E.V.; Yutin, N. Evolution of the large nucleocytoplasmic DNA viruses of eukaryotes and convergent origins of viral gigantism. Adv. Virus Res. 2019, 103, 167–202. [Google Scholar] [CrossRef] [PubMed]
  42. Iyer, L.M.; Aravind, L.; Koonin, E.V. Common origin of four diverse families of large eukaryotic DNA viruses. J. Virol. 2001, 75, 11720–11734. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Iyer, L.M.; Balaji, S.; Koonin, E.V.; Aravind, L. Evolutionary genomics of nucleo-cytoplasmic large DNA viruses. Virus Res. 2006, 117, 156–184. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Koonin, E.V.; Yutin, N. Origin and evolution of eukaryotic large nucleo-cytoplasmic DNA viruses. Intervirology 2010, 53, 284–292. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Lyons, S.M.; Fay, M.M.; Ivanov, P. The role of RNA modifications in the regulation of tRNA cleavage. FEBS Lett. 2018, 592, 2828–2844. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Simplified diagram of tRNA anticodon base pairing with the mRNA codon. The tRNAMet anticodon 3′ UAC 5′ recognizes the codon 5′ AUG 3′ (left figure). The enzyme tRNA isoleucine lysidine synthase (TilS) attaches lysine to the 5′ cytosine of the tRNA, which becomes lysidine. Lysidine base pairs with adenine. As such, the modified tRNA now recognizes the isoleucine codon AUA.
Figure 1. Simplified diagram of tRNA anticodon base pairing with the mRNA codon. The tRNAMet anticodon 3′ UAC 5′ recognizes the codon 5′ AUG 3′ (left figure). The enzyme tRNA isoleucine lysidine synthase (TilS) attaches lysine to the 5′ cytosine of the tRNA, which becomes lysidine. Lysidine base pairs with adenine. As such, the modified tRNA now recognizes the isoleucine codon AUA.
Viruses 12 01173 g001
Figure 2. Phylogram of three concatenated tRNA genes—tRNAGly, tRNATyr, and tRNAArg—using the maximum likelihood tree building algorithm with 100 bootstraps using the online website: phylogeny.fr. The concatenated sequences were aligned by MUSCLE; the alignment was curated by the Gblocks program. Parsimony and distance (BioNJ) phylogenetic tree-building algorithms were also used (100 bootstraps) and produced similar tree branching topologies. The numbers on the tree represent supported bootstrap values (i.e., percentage of bootstrap replicates that produced the same tree branch). The following chloroviruses were not included in the analysis because they lacked one of the three tRNA genes that were concatenated: PBCV-1, NE-JV-1, AR158, NE-JV-4, GM0701.1, and MN0810.1.
Figure 2. Phylogram of three concatenated tRNA genes—tRNAGly, tRNATyr, and tRNAArg—using the maximum likelihood tree building algorithm with 100 bootstraps using the online website: phylogeny.fr. The concatenated sequences were aligned by MUSCLE; the alignment was curated by the Gblocks program. Parsimony and distance (BioNJ) phylogenetic tree-building algorithms were also used (100 bootstraps) and produced similar tree branching topologies. The numbers on the tree represent supported bootstrap values (i.e., percentage of bootstrap replicates that produced the same tree branch). The following chloroviruses were not included in the analysis because they lacked one of the three tRNA genes that were concatenated: PBCV-1, NE-JV-1, AR158, NE-JV-4, GM0701.1, and MN0810.1.
Viruses 12 01173 g002
Table 1. The order of clustered tRNA genes in NC64A chloroviruses 1.
Table 1. The order of clustered tRNA genes in NC64A chloroviruses 1.
Leu-1 UUGIle AUAAsn-1 AACLeu-2 UUAArg-1 AGAAsn-2 AACGly GGAAsn-3 AACLys-1 AAGGln CAGLys-2 AAGTyr UAC aLys-3 AAALys-4 AAGArg-2 AGAAsp GACVal GUUTotal tRNAs
MA-1E 25 23 24 3 23 3 27 3 22 23 23 30 1 14
CvsA1 25 23 24 3 23 3 27 3 22 23 23 30 1 14
CviK1 25 23 24 3 23 3 27 3 22 23 23 30 1 14
KS1B 3 23 109 3 23 3 27 3 22 23 23 30 1 12
PBCV-1 25 23 23 3b24 3 24 22 23 33 11
IL-3A 25 23 24 3 23 3 25c3 84 22 23 10
MA-1D 25 23 24 5 15 3 25 3 22 23 33 12
NE-JV-4 25 23 24 3 23 3 72c3 23 33 11
AN69C 25 23 24 3 23 3 12 22 23 10
NY-2B 25 23 23 3 23 22 2 25 8
IL-5-2s1 25 23 23 3 23 22 2 25 8
NY-2A 142 23 3 24 22 2 25 8
NYs-1 25 25 229 3 1058 2 25 8
AR158 25 25 51 23 3 24 7
1 All tRNAs genes are within a single cluster for each of the 14 NC64A viruses. The heading of each column identifies the cognate amino acid and codon for each encoded tRNA. Green color indicates same tRNA gene as other members in the column; orange indicates a pseudogene; red indicates a tRNA gene substitution; white indicates absence of tRNA gene. The numbers in the white columns specify the number of nt between two adjacent tRNAs; a missing number in a column is due to a missing tRNA gene. a tRNATyr contains an intron; b tRNALys substitution for tRNAGly; c tRNAAsn substitution for tRNAGln.
Table 2. The order of clustered tRNA genes in SAG chloroviruses 1.
Table 2. The order of clustered tRNA genes in SAG chloroviruses 1.
SAG VirusesCodons and Cognate Amino Acids Recognized by SAG Virus tRNAs
Ile-1 AUASer AGUArg AGAAsn-1 AACGly GGAIle-2 AUUAsn-2 AACMet AUGAsp-1 GACVal-1 GUUVal-2 GUUAsn-3 AACTyr UAC aLys AAGAsn-4 AACAsp-2 GACLeu-1 UUAAsn-5 AACLeu-2 UUGThr ACU bTotal tRNAs
Can0610SP 4 25 22 22 25 23 22 22 2 21 4 36k 13
OR0704.3 4 25 22 22 25 23 22 22 2 21 148 32k 13
NE-JV-2 22 4 25 22 22 22 23 22 2 21 148 29k 13
NE-JV-3 4 25 22 22 22 23 22 2 21 148 31k 12
ATCV-1 5 25 22 23 22 22 22 2 22 148 c31k 11
WI0606 4 25 22 23 22 22 2 22 148 31k 11
MO0605SPH 4 25 22 23 22 22 2 22 148 30k 11
GM0701.1 25 1 22 24 23 148 77 4 35k 10
Br0604L 4 25 22 25 23 2 242 32k 9
TN603.4.2 25 22 24 23 2 22 227 32k 9
Canal-1 158 5 25 22 65 23 24 54 29k 9
MN0810.1 23 24 1 22 22 22 4 35k 9
NTS-1 23 25 22 21 23 33k 7
1 All tRNAs genes are within a single cluster for each of the 13 SAG viruses, with the exception of tRNAThr, which is an orphan tRNA 29–36kb downstream of its respective tRNA cluster. The heading of each column identifies the cognate amino acid and codon for each encoded tRNA. Green color indicates the same tRNA gene as other members in the column; orange indicates a pseudogene; white indicates the absence of the tRNA gene. The numbers in the white columns specify the number of nt between two adjacent tRNAs; a missing number in a column is due to a missing tRNA gene. a tRNATyr contains an intron; b This tRNA gene is 30,000–31,502 nt downstream of the tRNA cluster.
Table 3. The order of clustered tRNA genes in Pbi chloroviruses 1.
Table 3. The order of clustered tRNA genes in Pbi chloroviruses 1.
Pbi VirusesCodons and Cognate Amino Acids Recognized by Pbi Virus tRNAs
Ile AUALeu UUAPhe UUCArg AGAGly GGAAsn-1 AACTyr-1 UAC aLys-1 AAGAsn-2 AACAsn-3 AACAsn-4 AACTyr-2 UAC aLys-2 AAGThr-1 ACGThr-2 ACGTotal tRNAs
Fr5L 23 3 22 22 2 25 22 22 2 245 11
CZ-2 23 3 22 23 23 22 22 2 243 10
MT325 24 24 23 3 23 22 22 2 161 10
Can18-4 24 24 23 3 23 22 22 2 1041 10
CVB-1 24 24 26 985 24 23 22 2 161 10
FR483 22 24 3 23 23 22 2 161 9
CVG-1 132 23 3 23 23 22 2 161 9
CVR-1 132 23 3 23 23 22 2 159 9
CVA-1 132 23 3 23 23 22 2 159 9
AP110A 132 23 3 23 23 23 2 159 9
CVM-1 132 23 3 24 96 22 2 161 9
NW665.2 1142 3 23 23 22 2 161 8
OR0704.2.2 3 23 22 22 2 159 7
NE-JV-1 132 1416 3
1 All tRNAs genes are within a single cluster for each of the 14 Pbi viruses. The heading of each column identifies the cognate amino acid and codon for each encoded tRNA. Green color indicates the same tRNA gene as other members in the column; white indicates the absence of the tRNA gene. The numbers in the white columns specify the number of nt between two adjacent tRNAs; a missing number in a column is due to a missing tRNA gene. a tRNATyr contains an intron.
Table 4. Chlorovirus tRNA genes common to one another and unique to each chlorovirus clade 1.
Table 4. Chlorovirus tRNA genes common to one another and unique to each chlorovirus clade 1.
tRNACodonNC64ASAGPbi
Ile-1AUAccc
Leu-1UUAccc
Asn-1AACccc
Gly-1GGAccc
Lys-1AAGccc
Tyr-1UACccc
Arg-1AGAccc
Asp-1GACdd
Val-1GUUdd
Leu-2UUGdd
Gln-1CAGu
Lys-2AAAu
Ser-1AGU u
Ile-2AUU u
Met-1AUG u
Thr-1ACU u
Phe-1UUC u
Thr-1ACG u
Thr-2ACG u
1 (c) means that some members in all three clades of the chloroviruses have the gene. (d) means that some members in the NC64A and SAG chlorovirus clades have the gene. (u) means that the gene is unique to some viruses in one of the three clades of chloroviruses.
Table 5. Codon frequency use comparison of two chloroviruses, PBCV-1 and AN69C, to Chlorella host.
Table 5. Codon frequency use comparison of two chloroviruses, PBCV-1 and AN69C, to Chlorella host.
Codon AAPBCV-1AN69CC. variabilisRatio 1Codon AAPBCV-1AN69CC. variabilisRatio 1
GCA A1.921.152.420.63AAC N2.622.281.481.66
GCC A0.860.645.540.14AAU N3.162.780.309.9
GCG A1.290.845.460.20CCA P1.421.241.081.23
GCU A1.300.821.720.62CCC P1.070.912.550.39
UGC C0.671.081.770.49CCG P0.960.972.300.42
UGU C1.231.880.324.86CCU P1.330.890.961.16
GAC D1.971.313.040.54CAA Q1.842.090.593.33
GAU D3.021.911.06 4.56CAG Q0.871.064.930.2
GAA E3.692.470.545.70AGA R1.431.710.256.28
GAG E1.261.084.920.24AGG R0.680.840.920.83
UUC F2.532.401.551.59CGA R0.661.480.442.43
UUU F2.943.490.953.38CGC R0.660.843.220.23
GGA G1.711.280.761.97CGG R0.450.942.110.32
GGC G0.610.615.880.10CGU R1.051.380.442.76
GGG G1.120.922.070.52AGC S0.710.823.030.25
GGU G2.101.420.672.63AGU S1.261.240.274.63
CAC H0.891.211.890.56UCA S1.481.780.433.79
CAU H1.261.870.523.01UCC S0.981.341.330.87
AUA I2.442.640.2012.70UCG S1.101.381.051.18
AUC I2.001.801.681.13UCU S1.881.690.513.5
AUU I2.872.880.446.53ACA T2.011.870.613.18
AAA K4.703.760.2516.92ACC T1.321.381.980.68
AAG K2.481.912.530.87ACG T1.621.571.291.24
CUA L0.850.930.273.30ACU T1.571.310.393.69
CUC L1.261.091.470.80GUA V1.801.540.256.68
CUG L0.851.086.850.14GUC V1.351.251.161.12
CUU L1.651.670.502.00GUG V1.571.374.230.35
UUA L1.391.810.0626.67GUU V2.342.280.405.78
UUG L1.762.060.563.41UGG W1.091.241.610.72
AUG M2.761.971.842.04UAC Y1.521.461.421.05
UAU Y 2.242.660.386.45
1 Ratio of the average codon usage of the two viruses divided by codon usage of the host C. variabilis. Ratios above 1 favor the virus and ratios less than 1 favor the host. Codons in bold font are recognized by tRNAs encoded by one or both chloroviruses.
Table 6. Total tRNA genes and clusters of representative Phycodnaviridae viruses a.
Table 6. Total tRNA genes and clusters of representative Phycodnaviridae viruses a.
Phycodnaviridae VirusesAccession NumberTotal tRNA GenestRNA Genes in Cluster b
Bathycoccus sp. RCC1105 virus BpV1HM004432.122
Ectocarpus siliculosus virus 1AF204951.200
Emiliania huxleyi virus 86 isolate EhV86AJ890364.142
Heterosigma akashiwo virus 01 NC_03855332
Micromonas pusilla virus 12TNC_020864.165
Micromonas sp. RCC1109 virus MpV1HM004429.163
Only Syngen Nebraska virus 5KX857749.11414
Ostreococcus lucimarinus virus OlV1HM004431.144
Ostreococcus lucimarinus virus 2 isolate Olv2KP874736.154
Ostreococcus lucimarinus virus 7 isolate OlV7KP874737.144
Ostreococcus mediterraneus virus 1 isolate OmV1KP874735.133
Ostreococcus tauri virus 2FN600414.133
Ostreococcus tauri virus OtV5EU304328.233
Yellowstone lake phycodnavirus 1LC015647.172 and 5 c
Yellowstone lake phycodnavirus 2LC015648.144
Yellowstone lake phycodnavirus 3LC015649.143
a Data for this table were obtained from Morgado and Vincenta [1]. b Definition of a cluster: no intervening genes; 2 or more to be considered as a cluster. c There are 2 tRNA clusters, one with 2 and the other with 5; tRNA genes 2 and 3 are separated by 360 nt that encodes a putative protein.
Table 7. Total tRNA genes and clusters of representative large DNA viruses a,b.
Table 7. Total tRNA genes and clusters of representative large DNA viruses a,b.
AscoviridaeAccession NumberTotal tRNA GenestRNA Genes in Cluster c
Heliothis virescens ascovirus 3f isolate LD135790KJ755191.110
Heliothis virescens ascovirus 3gJX491653.1 10
Trichoplusia ni ascovirus 2cDQ517337.130
Asfarviridae
African swine fever virus strain Ken06.Bus NC_04494600
African swine fever virus strain BA71V NC_00165900
African swine fever virus isolate ASFV Belgium 2018/1 LR536725.100
Faustoviruses
Faustovirus strain E9 MT33575500
Faustovirus strain D3 KU55680300
Faustovirus strain E24 KU702948.1 00
Iridoviridae
Aedes taeniorhynchus iridescent virus,DQ643392.110
Lymphocystis disease virus—isolate ChinaAY380826.110
Scale drop disease virus isolate C4575KR139659.110
Singapore grouper iridovirusAY521625.110
Kaumeobavirus
Kaumeobavirus isolate Sc NC_034249.1 00
Kaumoebavirus isolate Sc KX552040.1 00
Marseilliviridae
Brazilian marseillevirus strain BH2014 NC_029692.1 00
Golden Marseillevirus NC_031465.1 00
Marseillevirus strain T19 NC_01375600
Mimiviridae
Acanthamoeba polyphaga mimivirusHQ336222.262
Acanthamoeba polyphaga moumouvirusJX962719.122
Aureococcus anophagefferens virus isolate BtV-01KJ645900.175
Cafeteria roenbergensis virus BV-PW1GU244497.12222
Chrysochromulina ericina virus isolate CeV-01BKT820662.1123 and 7 d
Megavirus chiliensisJN258408.130
Mimivirus terra2KF527228.162
Phaeocystis globosa virus strain 16TKC662249.187
Tetraselmis virus 1KY322437.11010
Molliviruses
Mollivirus kamchatka strain KronotskyMN812837.100
Mollivirus sibericum isolate P1084-T NC_027867.1 20
Mollivirus sibericum isolate P1084-TKR921745.130
Orpheovirus
Orpheovirus IHUMI-LCC2 NC_036594.1 00
Pacmanvirus
Pacmanvirus A23 NC_034383.1 00
Pandoraviruses
Pandoravirus macleodensisNC_03766510
Pandoravirus neocaledoniaNC_03766630
Pandoravirus quercusNC_03766710
Pithoviridae
Brazilian cedratvirus IHUMI strain IHUMI-27.7 LT99465100
Cedratvirus kamchatka isolate P4 MN873693.1 00
Pithovirus sp. LC8 LT59883600
Poxviridae
Parapoxvirus red deer/HL953 strain HL953KM502564.110
Squirrelpox virus Berlin_2015 (unverified)MF503315.110
Baculoviridae
Anticarsia gemmatalis multicapsid nucleopolyhedrovirus isolate AgMNPV-37KR815466.110
Anticarsia gemmatalis nucleopolyhedrovirusDQ813662.210
Autographa californica nucleopolyhedrovirus clone C6L22858.110
Neodiprion lecontei NPVAY349019.110
Antheraea pernyi nucleopolyhedrovirusDQ486030.310
Helicoverpa armigera granulovirusEU255577.110
Mamestra brassicae MNPV strain K1JQ798165.110
Mamestra configurata NPV-A strain 90/2U59461.210
Xestia c-nigrum granulovirusAF162221.110
Herpesviridae
Bovine herpesvirus type 1.1AJ004801.120
Cercopithecine herpesvirus 16 strain X313DQ149153.120
Columbid alphaherpesvirus 1 strain HLJKX589235.1190
Falconid herpesvirus 1 strain S-18KJ668231.1190
Macropodid herpesvirus 1 isolate MaHV1.3076/08KT594769.110
Murine herpesvirus 68 strain WUMSU97553.274 and 2 e
a Data for this table were obtained from Morgado and Vincente [1] and NCBI. b Table does not include the Phycodnaviridae. c Definition of a cluster: no intervening genes; 2 or more to be considered as a cluster. d There are 2 tRNA clusters, one with 3 and the other with 7. e There are 2 tRNA clusters, one with 4 and the other with 2.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Duncan, G.A.; Dunigan, D.D.; Van Etten, J.L. Diversity of tRNA Clusters in the Chloroviruses. Viruses 2020, 12, 1173. https://doi.org/10.3390/v12101173

AMA Style

Duncan GA, Dunigan DD, Van Etten JL. Diversity of tRNA Clusters in the Chloroviruses. Viruses. 2020; 12(10):1173. https://doi.org/10.3390/v12101173

Chicago/Turabian Style

Duncan, Garry A., David D. Dunigan, and James L. Van Etten. 2020. "Diversity of tRNA Clusters in the Chloroviruses" Viruses 12, no. 10: 1173. https://doi.org/10.3390/v12101173

APA Style

Duncan, G. A., Dunigan, D. D., & Van Etten, J. L. (2020). Diversity of tRNA Clusters in the Chloroviruses. Viruses, 12(10), 1173. https://doi.org/10.3390/v12101173

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop