Next Article in Journal
Breeding for Virus Resistance and Its Effects on Deformed Wing Virus Infection Patterns in Honey Bee Queens
Previous Article in Journal
Porcine Lymphotropic Herpesviruses (PLHVs) and Xenotranplantation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Full-Genome Sequences of Alphacoronaviruses and Astroviruses from Myotis and Pipistrelle Bats in Denmark

by
Christina M. Lazov
1,
Graham J. Belsham
2,
Anette Bøtner
2 and
Thomas Bruun Rasmussen
3,*
1
Department of Biotechnology and Biomedicine, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
2
Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
3
Department of Virus and Microbiological Special Diagnostics, Statens Serum Institut, 2300 Copenhagen, Denmark
*
Author to whom correspondence should be addressed.
Viruses 2021, 13(6), 1073; https://doi.org/10.3390/v13061073
Submission received: 27 April 2021 / Revised: 28 May 2021 / Accepted: 2 June 2021 / Published: 4 June 2021
(This article belongs to the Section General Virology)

Abstract

:
Bat species worldwide are receiving increased attention for the discovery of emerging viruses, cross-species transmission, and zoonoses, as well as for characterizing virus infections specific to bats. In a previous study, we investigated the presence of coronaviruses in faecal samples from bats at different locations in Denmark, and made phylogenies based on short, partial ORF1b sequences. In this study, selected samples containing bat coronaviruses from three different bat species were analysed, using a non-targeted approach of next-generation sequencing. From the resulting metagenomics data, we assembled full-genome sequences of seven distinct alphacoronaviruses, three astroviruses, and a polyomavirus, as well as partial genome sequences of rotavirus H and caliciviruses, from the different bat species. Comparisons to published sequences indicate that the bat alphacoronaviruses belong to three different subgenera—i.e., Pedacovirus, Nyctacovirus, and Myotacovirus—that the astroviruses may be new species in the genus Mamastrovirus, and that the polyomavirus could also be a new species, but unassigned to a genus. Furthermore, several viruses of invertebrates—including two Rhopalosiphum padi (aphid) viruses and a Kadipiro virus—present in the faecal material were assembled. Interestingly, this is the first detection in Europe of a Kadipiro virus.

1. Introduction

Bat species (order Chiroptera) are some of the most taxonomically diverse mammals, and with over 1400 recognized species (out of a total of around 6500 mammalian species worldwide), they are exceeded only by the number of rodent species [1]. Bats are found widely around the world, and fulfil important roles in ecosystems such as pollination, seed dispersal, regulation of insect populations, and fertilization [2]. In Europe, 51 species of bats have been described; all of them are protected by law, and 17 of these species in the family Vespertilionidae have ranges that include Denmark, in Scandinavia [3]. Bats have, in recent years, been the focus of research for the discovery of new viruses, since it has been found that spillover events from bats form the basis of many emerging infectious diseases in animals and humans [4]. Interestingly, many viruses are seemingly sustained in bat populations without causing disease. Reasons for this have been attributed to the unique immune system of bats, in which constitutively high levels of interferons are expressed, slowing down viral infections, while low antibody titers may be permissive of persistent infections with intermittent shedding [5,6]. Naturally, studies of bat viruses typically focus on those that have emerged as human pathogens, including coronaviruses, lyssaviruses, paramyxoviruses, filoviruses, and reoviruses [7,8]; this is reflected in the distribution of known virus sequences from bats [9]. Another approach has been to sequence and taxonomically assign the entire bat virome, e.g., all nucleic acids present in faecal samples [10,11,12,13,14,15]. Typically, these studies discover many viruses, but have low coverage of individual viruses, and mammalian, invertebrate, fungal, and bacterial viruses will all be present in the data.
Coronaviruses (CoVs), of the family Coronaviridae in the order Nidovirales, are enveloped viruses with positive-sense, single-stranded RNA genomes of around 27–31 kb [16]. Bats are regarded as a natural reservoir of coronaviruses in the genera Alphacoronavirus and Betacoronavirus [17]. Within these two genera, a number of important pathogens of animals and humans are known, e.g., porcine epidemic diarrhoea virus (PEDV), murine hepatitis virus (MHV), Middle East respiratory syndrome (MERS)-related coronavirus, severe acute respiratory syndrome (SARS) coronavirus and, most notably, the SARS-CoV-2 virus [18]. This virus, responsible for the COVID-19 pandemic, may have originated from bats [19,20,21,22,23], although the precise origin and possible intermediate hosts have not yet been identified. Likewise, the four endemic human coronaviruses (HCoVs)—OC43, 229E, NL63, and HKU1—have possible origins in bats and rodents [24,25,26].
Reports of coronaviruses in bats from the Nordic countries are limited. Previously, partial coronavirus ORF1b sequences from five different bat species in Denmark were reported [27] and, more recently, full-genome sequences of two separate coronaviruses—from Myotis daubentonii and Myotis brandtii—as well as other partial bat coronavirus (BtCoV) sequences, have been determined [28]. Still missing are full-genome sequences of coronaviruses from other bat species in the Nordic countries. In this study, we present assembled whole-genome sequences (termed full-genome or full-length sequences) of coronaviruses from Myotis daubentonii, Myotis dasycneme, and Pipistrellus pygmaeus in Denmark, tentatively placing them within three different subgenera within the genus Alphacoronavirus.
As a consequence of the non-targeted sequencing approach taken to obtain the coronavirus data, we gained additional information about the genome sequences of other viruses present within the bat faecal samples. This enabled us to assemble whole-genome sequences of three astroviruses and a polyomavirus, together with partial genome sequences of caliciviruses and rotaviruses. In addition to these viruses of vertebrate hosts, genomes were assembled of three virus taxa infecting arthropods—namely, Rhopalosiphum padi virus, Rosy apple aphid virus, and Kadipiro virus—as well as two picornavirus genomes of unknown host range, provisionally named Basavirus and bat picornavirus. The genome sequences of the dicistrovirus Rhopalosiphum padi virus [29] and the reovirus Kadipiro virus [30] were selected for further characterization, based on the overwhelming number of sequence reads derived from the samples. The astroviruses were selected for characterization due to the novelty of the genome sequences. The family Astroviridae comprises the genera Mamastrovirus (infecting mammals) and Avastrovirus (infecting birds) [31]. Astroviruses are highly prevalent—but understudied—infectious agents of bats [32], and few full-length sequences are publicly available.

2. Materials and Methods

The sample material for sequencing was nucleic acid extracted from selected bat faecal samples collected in 2018, or from samples collected earlier (2014–2016), from different regions in Denmark, as described previously [27], which had scored positive in pan-CoV RT-qPCR assays (Table 1). Selection was based on the level of coronavirus RNA in the samples (with Ct values as low as possible) and the distinct bat species and year of collection. Briefly, bat faecal samples were homogenized in Eagle’s medium, and the nucleic acids were extracted using a MagNA Pure 96 system (Roche, Basel, Switzerland) and then tested using up to three different pan-CoV real-time RT-qPCR assays, as previously described [27].
The nucleic acid samples (as described above) were treated with DNAse I (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s protocol, and double-stranded cDNA was generated using a Superscript III first-strand synthesis kit (Invitrogen) and a NEBNext mRNA second-strand synthesis kit (New England Biolabs, Ipswich, MA, USA). The reactions were purified using the GeneJET PCR purification kit (Thermo Scientific, Vilnius, Lithuania) with elution in 2 aliquots of nuclease-free water (30 µL each) at 60 °C.
The samples were sequenced at the DTU Multi-Assay Core (DMAC) in Kongens Lyngby, Denmark. In brief, the samples were dried in a Savant SpeedVac™ vacuum concentrator (Thermo Scientific, Waltham, MA, USA) and then resuspended in 5 μL nuclease-free water in order to increase their concentration. Libraries were generated using the Nextera XT DNA library Preparation Kit (Illumina Inc, San Diego, CA, USA) and checked on the 2100 Bioanalyzer system with the High-Sensitivity DNA kit (Agilent Technologies, Glostrup, Denmark). Note that the DNA content was often too low to pass the quality control (Supplementary Table S1), but the analyses were still continued. Finally, the libraries were run on the MiSeq platform using the Reagent Kit v3 600 bp (Illumina).
The raw reads were submitted to the online taxonomic investigation tool Kaiju version 1.7.3 [33], and the results were checked for the presence and number of reads matching coronaviruses and other viruses.
The raw read data were checked for quality using FastQC version 0.11.9 [34] and imported into Geneious Prime version 2019.2.3 to 2021.1.1 [35] for data processing. The reads were trimmed using the BBDuk plugin version 1.0 [36], and the trimmed reads were rechecked with FastQC (Supplementary Table S1).
The trimmed reads were assembled de novo in Geneious Prime using SPAdes version 3.13.0 [37] for metagenomic data with error correction and automatic k-mer choice, or with the MIRA plugin version 1.1.1 and automatic settings for small datasets below 500,000 reads. On the basis of the Kaiju analysis, the resulting contig sequences were mapped to reference sequences from GenBank [38] for bat coronaviruses and the other viruses assembled in the study. If one full-genome-length contig was identified, the trimmed reads were mapped to it, and a consensus sequence was extracted. If several partial-genome-length contigs were present, a reference sequence was used as a scaffold for the contigs, the trimmed reads were mapped to it, and a consensus sequence was generated. For mapping the reads, 5–10 iterations were used, with varying stringency—e.g., number of mismatches, ambiguities and length of gaps—allowed depending on the data, judged by visual inspection of the resulting contig. Occasionally, two or more substantially different de novo contigs would overlap, or multiple reads with identical nucleotide (nt) differences would map to the same de novo contig. In both cases, these observations indicated the presence of multiple virus variants in the sample. For these cases of mixed, closely-related virus populations, very stringent mappings were performed in order to assemble the predominant virus genome, followed by mapping of the unused reads in the dataset to assemble the variant genome(s). A similar strategy was applied when no suitable reference sequence was available and full-genome de novo contigs were not generated, either due to low coverage or other reasons. The reads were mapped to a partial genome de novo contig, and the unused reads were repeatedly mapped to the consensus sequence (Supplementary Table S2).
The consensus sequences were aligned with the de novo contig sequences and annotated using the closest related reference sequences from the ICTV Master Species List [39], in order to establish and appropriately trim the 5′ and 3′ termini of the newly assembled genomes, and to identify the ORFs.
Alignments were made with MUSCLE version 3.8.425 [40] or the progressive Mauve plugin version 1.1.3 [41] in Geneious Prime using type species reference sequences, and with the closest matching sequences identified using BLASTn. MUSCLE alignments of full-genome sequences, specific genes, or individual genome segments were used for generation of either maximum likelihood (ML) or neighbour-joining (NJ) phylogenetic trees, according to the normal practice for the particular virus family. ML trees based on Le–Gascuel and discrete gamma models (LG + G) were constructed from initial BioNJ trees with 100 bootstrap replicates, estimated proportions of invariable sites, and optimizations for tree topology, branch length, and rate using the PhyML plugin version 2.2.4 [42]. Geneious NJ phylogenetic trees based on the Jukes–Cantor genetic distance model were constructed with 1000 bootstrap replicates.

3. Results

3.1. Bat Coronaviruses

Bat coronavirus sequence reads (ranging from 1800 to 140,000 reads) were found in all seven samples selected for this study (see Table 1 and Table 2). Full-length genomes were assembled from the three different species of bats: five genomes were from four samples from M. daubentonii (two different viruses were obtained from sample 21164-6 in dataset D32-7), one genome from M. dasycneme, and one full-length plus one partial genome from P. pygmaeus (Table 3). The BtCoV genomes from M. daubentonii and M. dasycneme were assembled de novo using SPAdes, which, with one exception, produced a single contig covering the entire genome. For the M. daubentonii sample, containing two different BtCoVs, three overlapping partial genome contigs were produced by SPAdes, containing two different spike genes—this was the most divergent gene between the two BtCoVs in the sample. For the P. pygmaeus samples, one genome was assembled de novo into one full-length contig using MIRA, while the other genome was only partially assembled using an iterative reference mapping strategy due to the relatively low number of BtCoV reads (i.e., 1800) in this dataset (Supplementary Table S2).
The full-length Danish BtCoV genomes were aligned and compared to one another, and to their closest BLASTn matches, for distance investigations between the nucleotide sequences (Figure 1). It was apparent that the BtCoVs from the three different bat species had low similarity between species (from 57.4 to 58.8% identity, corresponding to 12,277–11,870 nucleotide differences between the complete genomes). The BtCoV from M. dasycneme was most similar to the reference sequence NC_028811 from M. pilosus (also called M. ricketti), with 79.2% identity and 5891 nucleotide differences, while the full-length BtCoV from P. pygmaeus was most similar to the genome from P. kuhlii (MH938448), with 78.1% identity and 6163 nucleotide differences. A much higher degree of similarity was found between the five BtCoV sequences from M. daubentonii, with a minimum of 93.0% and a maximum of 96.4% identity corresponding to 1974 and 1026 nucleotide differences, respectively. Interestingly, the lowest identity was found between the two BtCoVs from sample 21164-6. The Finnish CoV sequence (MG923574) from M. daubentonii was very similar to the Danish M. daubentonii CoV sequences (between 92.7% and 95.0% identity corresponding to 2055 and 1418 nucleotide differences, respectively).
The Danish BtCoV sequences were also aligned with the closest resembling reference sequences from BLAST using the progressive Mauve plugin in Geneious. The Mauve alignment showed no evidence for recombination or rearrangements.
To further characterize the Danish BtCoVs, the ORF1ab and Spike protein coding sequences were translated in silico. For the Spike protein, an amino acid alignment was made with reference sequences including SARS-CoV-1, SARS-CoV-2, and PEDV. As for the PEDV Spike protein, no furin cleavage sites (motif RXXR) were detected in the bat coronaviruses, except in the cytoplasmic tail of the S2 (with the motif RGPR located at a position corresponding to a.a. 1367–1370 of PEDV Acc. No. NC_003436). For the ORF1ab sequences, an alignment was made of concatenated, conserved regions of certain non-structural proteins (nsps)—namely, nsp5, nsp12, and nsp13—together with concatenated ICTV exemplar species of each subgenus and other published coronavirus sequences (kindly provided by A. Gorbalenya). These conserved regions are currently used in coronavirus taxonomy for species demarcation with DEmARC software, and for the construction of phylogenetic trees as described in ICTV proposal materials [44]—as described recently for classifying the pandemic SARS-CoV-2 [18]. A maximum likelihood phylogenetic tree was constructed from this alignment (Figure 2). The four genera—i.e., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus—make up the four main clades of the tree. The Danish BtCoV sequences are found within the Alphacoronavirus genus, where they form three clusters, consistent with the full-length nucleotide comparison shown in Figure 1. The M. daubentonii sequences and the P. pygmaeus sequence (from sample 7542-55) form the M. daubentonii cluster, which also includes the Scotophilus bat coronavirus 512 and porcine epidemic diarrhoea virus—the ICTV exemplar species of the subgenus Pedacovirus. In the P. pygmaeus sample B40-5 cluster, the exemplar species Pipistrellus kuhlii coronavirus 3398 and Nyctalus velutinus alphacoronavirus SC-2013 of the subgenus Nyctacovirus are present. Finally, the M. dasycneme cluster contains the Myotis ricketti alphacoronavirus Sax-2011, the single exemplar species of the subgenus Myotacovirus.

3.2. Bat Astroviruses

Bat astrovirus (BtAstV) reads were identified in the two NGS datasets generated from samples 13585-58 and 21164-6 collected from M. daubentonii bats (Table 1 and Table 2). Astroviruses are non-enveloped viruses with a positive-sense single-stranded RNA genome, generally of 6.4–7.9 kb in length [31]. A full-length BtAstV genome for sample 13585-58 of 6557 nucleotides was assembled by mapping to a single de novo generated contig. For sample 21164-6, the SPAdes de novo assembler generated 17 overlapping partial-genome-length astrovirus contigs (Supplementary Table S2). By mapping the reads to these contigs and visually inspecting the assemblies, it was evident that two distinct astroviruses were present in the sample. Through iterative mappings, 2 almost-complete BtAstV genomes of 6619 (A) and 6558 nucleotides (B) with 92.2% identity (corresponding to 516 nucleotide differences between them) were assembled, with different stringency to the de novo contigs. The astrovirus genome is divided into a 5′-untranslated region (UTR), three open reading frames (ORFs)—termed ORF1a, ORF1b, and ORF2—and a 3′-UTR with a poly (A) tail [31]. Although the two BtAstVs from sample 21164-6 were of equal or longer length than the other BtAstV identified, it was not possible to identify the 5′-end of ORF1 or the 3′-end of ORF2 in order to establish the complete coding regions of the genome in sequences A and B. The only full-length reference sequences of the Mamastrovirus genus resembling these were the bat astroviruses of 6650 nt (accession number (Acc. No.) MT734809) and 5310 nt (Acc. No. MW249010), and a mouse astrovirus of 6543 nt (Acc. No. JF755422). From alignments with these sequences, it is expected that only a handful of codons are missing from either end of the A sequence. Sequence B is slightly shorter than A and, furthermore, 4 predicted nucleotides at positions 3100–3103 are undefined (marked NNNN) due to low coverage in this region.
BLASTn searches using the 3 assembled genomes (Table 4) showed high similarity (90.2–99.5% identity) to partial ORF1b sequences of 379–381 nt from BtAstVs from M. daubentonii bats in Germany [45]. This could also be seen in a phylogenetic tree based on nucleotide sequences of the same partial ORF1b region, as targeted by the most commonly used RT-PCR assay for astrovirus detection [45] (Supplementary Figure S1).
The assembled nucleotide sequences were aligned with the progressive Mauve plugin to search for recombination or other rearrangements between the astrovirus genomes of Danish bats and resembling sequences from BLASTn, as well as ICTV exemplars and additional isolate sequences for each astrovirus of the Mamastrovirus genus. The program identified ORF1ab as a common local collinear block (LCB) for the three sequences from Danish bats, although with substantial differences, while ORF2 belonged to different LCBs for the two sequenced samples. No recombination or other rearrangements were detected.
The ORF2 coding sequences from the three assembled genomes of BtAstV were translated in silico and aligned with ICTV reference sequences as well as the few, most closely related, astroviruses with full-length ORF2 sequences from GenBank identified by BLAST. A neighbour-joining tree with 1000 bootstrap replicates was constructed with these, in accordance with the proposal from the ICTV [47]. In this tree (Figure 3), the BtAstV from sample 13585-58 forms a cluster with Mamastroviruses 14, 15, and 16, as well as the bat astroviruses and the mouse astrovirus from GenBank. The closest node is shared with Mamastrovirus 14 from Miniopterus magnater, but the branch length distance is shorter to Mamastrovirus 15 from Taphozous melanopogon. The two BtAstVs from sample 21164-6 are in a different cluster together with Mamastroviruses 17, 18, and 19. The closest node (with a bootstrap value of only 53) is shared with Mamastrovirus 18 from Miniopterus pusillus, but again the branch length is shorter to Mamastrovirus 19 from Taphozous melanopogon.

3.3. Rhopalosiphum Padi Viruses

Many reads (around 780,000) corresponding to sequences from Rhopalosiphum padi virus (RhPV) were identified within the dataset from sample OV-157 from M. daubentonii (Table 2). RhPV is a member of the family Dicistroviridae within the Picornavirales order, and has a small monopartite genome of positive-sense single-stranded RNA. The two aphid-infecting dicistroviruses of the genus Cripavirus—RhPV, and aphid lethal paralysis virus—both employ plants as passive vectors for their arthropod hosts [48]. A single de novo contig was produced from the RhPV reads in the dataset, but when the reads were mapped to this, it was again evident that two virus variants were present. It was possible to separate the reads of the two RhPVs by strict mapping of all reads to the de novo contig sequence, followed by a less strict mapping of the unused reads (Supplementary Table S2).
As with the other dicistroviruses, the genome of RhPV contains two ORFs preceded by internal ribosomal entry sites (IRESs) directing cap-independent initiation of translation [49]. For the IRES located in the intergenic region (IGR) between ORF1 and ORF2, translation is initiated from a non-AUG codon [50]. ORF1 and ORF2 encode non-structural and structural protein precursors, respectively. The 2 Danish RhPV sequences of 10,009 and 10,010 nucleotides were aligned with a closely related sequence (MF535298) from a bat faecal sample from Hungary identified by BLAST, and with the ICTV exemplar species isolate of RhPV (NC_001874). Nucleotide sequence comparisons were made for each UTR and ORF, as well as for the predicted amino acid (a.a.) sequences encoded by the two ORFs (Table 5). Overall, the similarity of the Danish sequences to the references was high, with 96.5% and 96.4% identity to NC_001874, and 99.5% and 99.2% identity to MF535298, respectively. The greatest difference to the reference sequences was observed in the 5′UTR, where there was 91.9% and 91.7% identity to NC_001874, respectively, but 100% identity to MF535298, although the latter sequence was incomplete.
Interestingly, the 2 Danish nucleotide sequences resemble the Hungarian sequence more than each other (98.9% identity), but both of the Danish sequences have a premature stop codon in the ORF2 due to a frameshift mutation (insertion of U) after nucleotide 9460, which results in the deletion of 30 amino acids (residues 789 to 817). This means that one of the three major structural proteins of RhPV—the 30 kDa capsid protein VP1 [29,50]—will be around 26 kDa instead. A similar premature stop codon in this region has been described in RhPV from wild aphids in Sweden, with no apparent difference in infectivity [52].

3.4. Kadipiro Viruses

More than two-thirds (around 2 million) of the trimmed reads in dataset D32-7 from sample 21164-6 from M. daubentonii were identified as Kadipiro virus (KDV) reads (Table 2). Kadipiro virus is one of three currently recognized species of virus, along with Banna virus and Liao ning virus, within the genus Seadornavirus in the Reoviridae family [53]; recently proposed members of this genus are the Balaton and Mangshi viruses [54,55]. Mosquito species are vectors for seadornaviruses, which have been isolated from ticks, encephalitic human patients, pigs, and cattle (Banna virus) [56,57], or are able to infect and kill mice in experimental settings (Liao ning virus) [58]. The genome is double-stranded RNA divided into 12 segments numbered according to size, with a total length of around 21 kbp [53]. The 12 genome segments of KDV were assembled by mapping the reads to the 322 overlapping de novo KDV contigs on a scaffold of the KDV strain JKT-7075 reference sequence from Java, Indonesia, and extracting a consensus sequence. Typically, one long de novo contig covered the majority of the genome segments, while multiple shorter, very varied, de novo contigs mapped to the 5′ and 3′ ends (Supplementary Table S2). For the purposes of this study, it was not necessary to dissect the variants found in the reads and contigs further (Supplementary Material, SPAdes KDV de novo contigs).
The 12 KDV genome segments were subjected to BLASTn searches and compared with the other full-length KDV genome nucleotide sequences available in GenBank (Table 6). The level of identity to the first BLASTn hit varied from 83% (segment 5) to 93% (segment 12, the most conserved segment of KDV), with the closest reference sequence being either strain QTM27331 (marked in red) or strain SDKL1625 (marked in blue) from China.
The VP1 gene, genome segment 1, encodes the RNA-dependent RNA polymerase (RdRp). The predicted amino acid sequence of the RdRp from the assembled KDV was aligned with the amino acid sequences of the closest BLAST matches and ICTV reference sequences, and a maximum likelihood tree with 100 bootstrap replicates was generated (Supplementary Figure S2). Here, it can be seen that the translated Danish KDV shares the closest node with the JKT-7075 reference sequence from Java, but the branch length distances to the Chinese strains are shorter.

3.5. Other Viral and Microbial Agents Present in the Data

The online metagenomics analysis tool Kaiju listed many different organisms—including bacteria, viruses, fungi, and parasites—whose nucleic acid sequences available in NCBI databases matched the sequences present in the NGS datasets (Table 2), and some of the virus genomes were fully or partially assembled (Table 3 and Supplementary Table S2).
A full-genome sequence of a polyomavirus from the M. daubentonii sample 21164-6 was de novo assembled into a single contig. Polyomaviruses are small, non-enveloped viruses with a circular, double-stranded DNA genome encoding early expressed regulatory genes (encoding large tumour antigen (LTAg) and small tumour antigen (STAg)) and late expressed protein genes [61]. From a phylogenetic analysis of the predicted amino acid sequence of the LTAg (Supplementary Figure S3), the sequence from the Danish bat is most related to two other polyomaviruses identified in bats—Pomona leaf-nosed-bat-associated polyomavirus (Acc. No. NC_033737) [62], and KSA403 polyomavirus isolate from Eidolon helvum in Saudi Arabia (Acc. No. NC_040600) [63]—but it is still clearly separate from these, and sequence comparison shows that the Danish sequence only shares 42% and 47% amino acid identity of the LTAg and 57% and 58% full-length genome nucleotide identity with them, respectively. The three bat-derived sequences are situated within a diverse clade of fish polyomaviruses that are unassigned to genera [61,62,63].
Rotavirus H reads were identified in the M. dasycneme sample data, and partial genome sequences of 9 out of 11 genome segments were assembled. These resembled sequences from M. daubentonii rotavirus H detected in bats in Switzerland in 2019 (e.g., Acc. No. MT815963) [64], with around 93% identity.
The sample from M. daubentonii, OV-157, contained calicivirus reads that were de novo assembled into an almost-complete coding sequence, resembling Acc. No. KU712497, with 93% nucleotide identity to a partial-genome bat calicivirus from the same bat species previously described from Hungary [65]; calicivirus reads were also identified in the M. daubentonii sample 21164-6, but the three assembled short partial-genome sequences only shared 70–95% identity with the Hungarian reference sequence.
Several different picornaviruses and picorna-like viruses were identified in the samples from M. daubentonii, resembling the findings of other metagenomics studies on bats [66]. A picornavirus from sample 13585-58 was partially assembled, and shared 62% nucleotide identity with Washington bat picornavirus from a metagenomics data set (Acc. No. KX580885). This reference sequence is possibly related to a picornavirus identified in a spider [67]. From the M. daubentonii sample 21164-6, a full-length genome sequence of the provisionally named Basavirus (bat stool-associated RNA virus) was de novo assembled, with a genome organization resembling the order Picornavirales, but not clearly belonging to any of the five established families, therefore inhabiting its own group of Posa and Posa-like viruses (PPLV) [68]. The Danish Basavirus sequenced shared 64% nucleotide identity with the Basavirus 7 isolate 16715_71 from a Scotophilus kuhlii bat in Vietnam (Acc. No. KX673238) [68].
The M. daubentonii samples also contained many reads from different aphid viruses, e.g., rosy apple aphid virus and Aphis glycines virus 2 [69]. A full-length sequence of rosy apple aphid virus was de novo assembled from sample 13585-58, sharing 82% nucleotide identity with the reference sequence (Acc. No. DQ286292) [70]. Furthermore, many reads matched with unclassified RNA viruses described in a large study of newly discovered invertebrate viruses [43].

4. Discussion

The faecal samples investigated in this study were initially identified as coronavirus positive by real-time RT-qPCR assays and Sanger sequencing, as used previously by Lazov et al. [27]. The alphacoronaviruses detected in these samples from different bat species were compared using phylogenetic analysis based on short regions (130 or 208 nt) of the ORF1ab coding sequence. Using the nucleic acid samples for NGS sequencing generated 8800–140,000 coronavirus reads for each sample (Table 2). This enabled the generation of 7 full-length BtCoV genome sequences of 27,943–28,192 nucleotides—two of which were from the same sample. The single exception was a sample from P. pygmaeus with only 1800 BtCoV reads, which could not be assembled into a full genome, but the partial ORF1ab sequences needed for including it in a standard phylogeny were obtained (Figure 2). Comparison of full-length genome sequences (Figure 1), and the phylogenetic tree of the concatenated conserved predicted amino acid sequences (Figure 2), showed the same species-specific clustering as described in our previous study based on much shorter sequences [27]. The BtCoVs sequenced in this study from M. daubentonii are highly similar to one another, and also to a full-length genome sequence from M. daubentonii from Finland [28]. The Finnish sample was from Mustasaari on the west coast of Finland, about 1000 km from the sampling site of Mønsted in Denmark. The authors reported that this sequence clustered together with sequences from Central and Southern Europe in a phylogenetic tree of partial ORF1ab nucleotide sequences, which is in line with our previous findings [27]. There is a lack of full-genome coronavirus sequences (or even sequences for the concatenated conserved 1200 amino acid long sequences), from other M. daubentonii in GenBank, which prevents more detailed comparisons. The sequences found most closely related to the M. daubentonii cluster were from an M. brandtii from Finland and the partial-genome sequence from P. pygmaeus from 2014, with only around 79.5% nucleotide identity to the first full-length genome sequence (Figure 1). Together, these sequences are part of a larger cluster, which includes the reference sequences for Scotophilus bat coronavirus 512 and the porcine epidemic diarrhoea virus (PEDV), suggesting that they all belong to the same subgenus, termed Pedacovirus. The M. dasycneme BtCoV sequence in this study was assembled from data generated from a sample from Mønsted, as with the M. daubentonii samples, but this BtCoV was found to be most closely related—with 79.2% nucleotide identity—to the Chinese reference sequence NC_028811 of the virus species Myotis ricketti alphacoronavirus Sax-2011. These sequences also cluster together in the phylogenetic tree (Figure 2), suggesting that the Danish sequence belongs to the subgenus Myotacovirus. We saw in our previous study that there were two different types of BtCoV from P. pygmaeus in the Danish samples. One of these was the Pedacovirus-like partial-genome sequence (as described above). The other type (represented in this study by sample B40-5) is in the same cluster in the phylogeny (Figure 2) as the two species exemplars of the Nyctacovirus subgenus: Nyctalus velutinus alphacoronavirus SC-2013 and Pipistrellus kuhlii coronavirus 3398. Further analysis should establish whether these viruses are prototypes for new coronavirus species, or belong to already established species within the subgenera.
Cell receptor binding is important when it comes to assessing the host species and tissue specificity of potentially zoonotic viruses. The six Pedacovirus-like BtCoV sequences described in this study are phylogenetically related to both PEDV and Scotophilus bat coronavirus 512. PEDV has been shown to bind to aminopeptidase-N (APN), but not necessarily as the only receptor, while Scotophilus bat coronavirus 512 is able to infect a wide range of cultured cell types without the use of APN [71]. To our knowledge, no cell receptors have been described for the Alphacoronavirus subgenera Nyctacovirus and Myotacovirus, but for other subgenera in this genus, APN, angiotensin-converting enzyme 2 (ACE2), N-glycolylneuraminic acid (Neu5Gc), and N-acetylneuraminic acid (Neu5Ac) are known receptors [72]. However, in order to predict the cell receptor specificity of the BtCoVs, sequence similarity studies must be combined with functional studies, as it has previously been shown that only a few amino acid differences in the receptor binding domains can change the receptor specificity completely [73].
This study started out as an attempt to expand the knowledge of coronaviruses in Danish bats. This goal was accomplished and, furthermore, information about various other viruses was obtained as well, and some of their whole-genome sequences could be assembled. Among these were three bat astrovirus genome sequences from two different M. daubentonii bats.
Astroviruses of the Mamastrovirus genus have been described in many different mammalian species and in many different bats [32]. However, the reference sequences available for the seven species of bat astrovirus currently recognized—namely, Mamastrovirus 12, 14, 15, 16, 17, 18, and 19—are only partial genome sequences, mainly covering the ORF1b and ORF2 [39]. In 2020 and 2021, three complete or nearly complete BtAstV genomes (Acc. No. MT734809, MW249010, and MG693176) were released in GenBank, but there is still a lack of full-length sequences from bat astroviruses. The most common approach for sequence comparison and phylogenetic analysis of bat astroviruses is based on the sequence of the PCR product produced by a diagnostic assay [46]. Comparison of the target region of the diagnostic assay within the sequences generated by this study shows that these Danish viruses are most closely related to bat astroviruses detected in bats from Germany (Table 4 and Supplementary Figure S1). The two sequences were found to closely resemble (98.2–99.5% identity) two different sequences of a common cluster of many M. daubentonii BtAstVs with different regional origins in Germany [45]. In Figure 3, it can be seen that these two Danish BtAstVs do not clearly belong to either of the species Mamastrovirus 18 or 19, but are distantly related to both. The third astrovirus was 93% identical to a German sequence in a distinct cluster with only two sequences [45]. This Danish BtAstV was found to be separate from the rest of the reference sequences (Figure 3), and is distantly related to Mamastrovirus 14, 15, and 16, another bat astrovirus (Acc. No. NC_043101), and mouse astrovirus (Acc. No. JF55422). We therefore propose that this BtAstV is a new, separate species in the Mamastrovirus genus. Bat astroviruses are generally regarded as non-pathogenic for bats, but a recent study showed that Jamaican fruit bats, Artibeus jamaicensis, experience shifts in their gut microbiome towards increased pathogen-containing microbial genera, and may be more likely to contract co-infections after astrovirus infection [74]. Infections with two or more distinct astroviruses, as seen in one of our bat samples (21164-6), have been documented before—e.g., in a recent Polish study, where 5 out of 12 bats harboured multiple astroviruses, providing an opportunity for the recombination and emergence of new viruses [75]. Cross-species transmission and recombination between astroviruses of different species poses a risk to human and animal health, and is more common than previously thought [76,77].
One of the M. daubentonii bats generated reads that could be assembled into a full-length genome of a polyomavirus. More than 100 species of polyomavirus are currently recognized from humans and a wide range of animals, including bats. They are considered to be host-specific, often establishing persistent infections in healthy hosts, and are associated with disease in immunocompromised individuals—e.g., Merkel cell polyomavirus causing Merkel cell carcinoma in humans [61]. However, it is not known whether polyomaviruses can cause disease in bats. Interestingly, the polyomavirus assembled in this study was only distantly related to other previously described viruses, phylogenetically grouping with fish and bat polyomaviruses unassigned to genera (Supplementary Figure S3). The virus was probably present at a high level in the faeces sample, as it was possible to assemble the whole genome in spite of the DNAse I treatment of the purified nucleic acids during the preparation steps for sequencing.
Calicivirus reads were identified from two M. daubentonii samples, and one long partial genome and two short sequences were assembled (Table 3). These sequences were identified as belonging to the Sapovirus genus of the Caliciviridae family, resembling a virus previously described from M. daubentonii in Hungary (Acc. No. KU712497) [65]. Sapoviruses cause enteric infections in both humans and animals, and recombination events between sapoviruses of different species have been shown to drive viral evolution [78]. Another important infectious agent is rotavirus, responsible for acute gastroenteritis cases in humans and animals [56]. In this study, we identified reads of rotavirus H in the sample from M. dasycneme (Table 2), and assembled partial genome sequences of 9 out of 11 genome segments (Table 3) that most closely resembled rotavirus H sequences from M. daubentonii in Switzerland [64].
In addition to the mentioned viruses of vertebrates, we detected numerous reads from viruses specific to invertebrate hosts (Table 2). These viruses included dicistrovirus RhPV, which is capable of infecting aphid species [79]. Other dicistroviruses of the Cripavirus genus included Aphid lethal paralysis virus, Cricket paralysis virus, Drosophila C virus, and a novel dicistrovirus related to Drosophila C virus that has been detected in the blood of fruit bats [80]. The phylogenetic analysis and nucleotide comparison of the two genomes of RhPV from one Danish bat showed that they are very closely related to an RhPV from Hungary described by Zana et al. [51], which was also identified in bat faeces. The genome contains two large ORFs, but is incomplete at the 5′ terminus, where one of the two IRES structures is located. The Danish RhPV sequences supply this missing information, as the 5′ termini of these viruses are not truncated.
Another virus of invertebrates that we detected in the read data in large numbers, presumably following the ingestion of arthropods, was Kadipiro virus. This virus has not been detected in Europe before, but has previously been detected in and isolated from mosquitoes in Asia, and three full-genome sequences are currently available in GenBank from China and Indonesia [30,43,81,82]. Kadipiro virus was detected in a plasma sample from a febrile adult in Kenya, but the results were possibly due to contamination, and could not be verified [59,60]. It is not known whether KDV is able to infect humans or other mammals via mosquito vectors, as seen with Banna virus from the same Seadornavirus genus [56]. The 12 segments that make up the viral genome were assembled as consensus sequences, but multiple variants were present in the 2 million reads (Supplementary Material, SPAdes KDV de novo contigs). The assembled genome sequences were compared with the existing reference sequences (Table 6), and the level of nucleotide identity varied from 83 to 93% for segments of the Chinese KDV reference strains QTM27331 and SDKL1625. Phylogenetic comparison of predicted amino acid sequences for the VP1s (RdRPs) showed that the Danish KDV was related to both the Chinese strains and the JKT-7075 isolate from Java, Indonesia, but not clearly to one more than the others (Supplementary Figure S2).
Numerous viruses have been isolated from bat species [9], but some of the most common virus detections in faecal samples from European bats are of astroviruses, coronaviruses, adenoviruses, paramyxoviruses, herpesviruses, and picornaviruses [45,83], with varying prevalence reported—e.g., between 5 and 46% for astroviruses [45,46,84,85], and 2–20% for coronaviruses [25,27,84,85,86]. It is therefore not surprising that individual bats may test positive for several of these viruses, and co-infection of bats with, for example, coronaviruses and astroviruses has previously been reported in 6% of sampled bats in Hong Kong [46] and in one P. pygmaeus bat (out of 6 sampled) in a surveillance study in Hungary [85]. Furthermore, out of 17 Hipposideros cervinus bats positive for BtCoV, 15 were also found positive for BtAsV in a study in Borneo [84]. Co-infection with different viruses, but especially with positive-sense RNA viruses of the same species, can provide opportunities for recombination [87]. For coronaviruses, recombination has been associated with several documented switches in host and tissue tropism, and is considered to be one of the driving forces of coronavirus evolution [16,24]. In this study, we found multiple examples of virus co-infections, but no evidence of virus recombination in the Myotis bats (Table 2 and Table 3). The five bat samples of this genus were included in the study because they scored positive for coronaviruses. Two of the bat samples contained astroviruses as well, and one of the samples—21164-6—actually had two different strains of astrovirus, two different strains of coronavirus, two different strains of calicivirus at low levels, and polyomavirus (Table 1, Table 2 and Table 3). Furthermore, this sample had viral reads from Basavirus and very high levels of KDV reads. However, it is not known whether these viral reads only represented virus ingested with the invertebrate food sources, whether they represented the virome of commensal or parasitic organisms, or whether there was an actual infection in the bat. No other samples—e.g., serum or organs—are available from the bats for further investigations into this potential arbovirus reservoir [88]. Likewise, one of the M. daubentonii bat faecal samples contained a high level of RhPV RNA, which may be of little relevance to the bat, but may give insight into the ecological role of bats as disseminators of viral infections of insects [13].
Finding several virus strains in the same sample, some of which were previously uncharacterized—such as the BtAstVs—highlights one of the strengths of the semi-metagenomic sequencing strategy employed in this study, as opposed to amplicon sequencing or other targeted methods. Usually, however, this comes at the cost of lower sequencing coverage of specific targets. Nevertheless, with the exception of one of the P. pygmaeus BtCoVs, we have still been able to obtain full-genome sequences of the BtCoV RNA present in six other nucleic acid samples investigated and, thus, the value of this approach is demonstrated.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/v13061073/s1: Figure S1: Phylogenetic tree of the partial ORF1b of the bat astroviruses from Denmark; Figure S2: Phylogenetic tree of the Kadipiro virus sequence from Denmark; Figure S3: Phylogenetic tree of the polyomavirus sequence from Denmark; Table S1: Quality control of libraries and reads; Table S2: Additional assembly data; File: SPAdes KDV de novo contigs.

Author Contributions

Conceptualization, C.M.L., G.J.B., A.B. and T.B.R.; formal analysis, C.M.L.; investigation, C.M.L., G.J.B., A.B. and T.B.R.; data curation, C.M.L.; writing—original draft preparation, C.M.L.; writing—review and editing, C.M.L., G.J.B., A.B. and T.B.R.; visualization, C.M.L.; supervision, G.J.B., T.B.R. and A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This study was conducted as part of a surveillance program for coronaviruses and lyssaviruses in Danish bats funded by the Danish Veterinary and Food Administration. The funders had no role in the design of the study, in the collection, analyses, or interpretation of data, in the writing of the manuscript, or in the decision to publish the results.

Ethical approval

Ethical review and approval were waived for this study, since the only animal samples used were faecal samples collected without harm to the bats.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in the NCBI GenBank at https://www.ncbi.nlm.nih.gov/, Bioproject reference number PRJNA720531.

Acknowledgments

Collection of bat faeces samples was performed by Hans J. Baagøe and Esben Fjederholt. Preparation of sequencing libraries and sequencing runs were performed by Neslihan Bicen and Marlene D. Dalgaard at DMAC at the Technical University of Denmark, DTU. Johannes E. Hartby assisted with de novo assemblies. An unpublished alignment of concatenated conserved regions of coronavirus ORF1ab-predicted amino acid sequences was kindly provided by Alexander Gorbalenya, chair of the ICTV Nidovirales Study Group.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Burgin, C.J.; Colella, J.P.; Kahn, P.L.; Upham, N.S. How many species of mammals are there? J. Mammal. 2018, 99, 1–14. [Google Scholar] [CrossRef] [Green Version]
  2. Kunz, T.H.; de Torrez, E.B.; Bauer, D.; Lobova, T.; Fleming, T.H. Ecosystem services provided by bats. Ann. N. Y. Acad. Sci. 2011, 1223, 1–38. [Google Scholar] [CrossRef] [PubMed]
  3. Baagøe, H.J. Flagermus, Chiroptera. In Dansk Pattedyratlas, 1st ed.; Baagøe, H.J., Jensen, T.S., Eds.; Gyldendal: Copenhagen, Denmark, 2007; pp. 38–98. [Google Scholar]
  4. Letko, M.; Seifert, S.N.; Olival, K.J.; Plowright, R.K.; Munster, V.J. Bat-Borne virus diversity, spillover and emergence. Nat. Rev. Microbiol. 2020, 18, 461–471. [Google Scholar] [CrossRef] [PubMed]
  5. Subudhi, S.; Rapin, N.; Misra, V. Immune system modulation and viral persistence in bats: Understanding viral spillover. Viruses 2019, 11, 192. [Google Scholar] [CrossRef] [Green Version]
  6. Schountz, T.; Baker, M.L.; Butler, J.; Munster, V. Immunological control of viral infections in bats and the emergence of viruses highly pathogenic to humans. Front. Immunol. 2017, 8. [Google Scholar] [CrossRef] [PubMed]
  7. Shi, Z.L. Emerging infectious diseases associated with bat viruses. Sci. China Life Sci. 2013, 56, 678–682. [Google Scholar] [CrossRef] [Green Version]
  8. Wong, S.; Lau, S.; Woo, P.; Yuen, K.Y. Bats as a continuing source of emerging infections in humans. Rev. Med. Virol. 2007, 17, 67–91. [Google Scholar] [CrossRef]
  9. Chen, L.; Liu, B.; Yang, J.; Jin, Q. DBatVir: The database of bat-associated viruses. Database 2014, 2014, bau021. [Google Scholar] [CrossRef] [Green Version]
  10. Li, L.; Victoria, J.G.; Wang, C.; Jones, M.; Fellers, G.M.; Kunz, T.H.; Delwart, E. Bat guano virome: Predominance of dietary viruses from insects and plants plus novel mammalian viruses. J. Virol. 2010, 84, 6955–6965. [Google Scholar] [CrossRef] [Green Version]
  11. Wu, Z.; Ren, X.; Yang, L.; Hu, Y.; Yang, J.; He, G.; Zhang, J.; Dong, J.; Sun, L.; Du, J.; et al. Virome analysis for identification of novel mammalian viruses in bat species from Chinese provinces. J. Virol. 2012, 86, 10999–11012. [Google Scholar] [CrossRef] [Green Version]
  12. Hu, D.; Zhu, C.; Wang, Y.; Ai, L.; Yang, L.; Ye, F.; Ding, C.; Chen, J.; He, B.; Zhu, J.; et al. Virome analysis for identification of novel mammalian viruses in bats from Southeast China. Sci. Rep. 2017, 7, 10917. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Donaldson, E.F.; Haskew, A.N.; Gates, J.E.; Huynh, J.; Moore, C.J.; Frieman, M.B. Metagenomic analysis of the viromes of three North American bat species: Viral diversity among different bat species that share a common habitat. J. Virol. 2010, 84, 13004–13018. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Li, Y.; Altan, E.; Reyes, G.; Halstead, B.; Deng, X.; Delwart, E. Virome of bat guano from nine Northern California roosts. J. Virol. 2021, 95. [Google Scholar] [CrossRef] [PubMed]
  15. Wu, Z.; Yang, L.; Ren, X.; He, G.; Zhang, J.; Yang, J.; Qian, Z.; Dong, J.; Sun, L.; Zhu, Y.; et al. Deciphering the bat virome catalog to better understand the ecological diversity of bat viruses and the bat origin of emerging infectious diseases. ISME J. 2016, 10, 609–620. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. De Groot, R.J.; Baker, S.C.; Baric, R.; Enjuanes, L.; Gorbalenya, A.E.; Holmes, K.V.; Perlman, S.; Poon, L.; Rottier, P.J.M.; Talbot, P.J.; et al. Family: Coronaviridae. In Virus Taxonomy, Ninth Report of the International Committee on Taxonomy of Viruses; King, A.M.Q., Adams, M.J., Carstens, E.B., Lefkowitz, E.J., Eds.; Elsevier: Amsterdam, The Netherlands, 2012; pp. 806–828. [Google Scholar] [CrossRef]
  17. Woo, P.C.Y.; Lau, S.K.P.; Lam, C.S.F.; Lau, C.C.Y.; Tsang, A.K.L.; Lau, J.H.N.; Bai, R.; Teng, J.L.L.; Tsang, C.C.C.; Wang, M.; et al. Discovery of seven novel mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavi. J. Virol. 2012, 86, 3995–4008. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Gorbalenya, A.E.; Baker, S.C.; Baric, R.S.; de Groot, R.J.; Drosten, C.; Gulyaeva, A.A.; Haagmans, B.L.; Lauber, C.; Leontovich, A.M.; Neuman, B.W.; et al. The species severe acute respiratory syndrome-related coronavirus: Classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 2020, 5, 536–544. [Google Scholar] [CrossRef] [Green Version]
  19. Zhou, P.; Yang, X.L.; Wang, X.G.; Hu, B.; Zhang, L.; Zhang, W.; Si, H.R.; Zhu, Y.; Li, B.; Huang, C.L.; et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 2020, 579, 270–273. [Google Scholar] [CrossRef] [Green Version]
  20. Zhou, H.; Chen, X.; Hu, T.; Li, J.; Song, H.; Liu, Y.; Wang, P.; Liu, D.; Yang, J.; Holmes, E.C.; et al. A novel bat coronavirus closely related to SARS-CoV-2 contains natural insertions at the S1/S2 cleavage site of the spike protein. Curr. Biol. 2020, 30, 2196–2203. [Google Scholar] [CrossRef]
  21. Murakami, S.; Kitamura, T.; Suzuki, J.; Sato, R.; Aoi, T.; Fujii, M.; Matsugo, H.; Kamiki, H.; Ishida, H.; Takenaka-Uema, A.; et al. Detection and characterization of bat sarbecovirus phylogenetically related to SARS-CoV-2, Japan. Emerg. Infect. Dis. 2020, 26, 3025–3029. [Google Scholar] [CrossRef]
  22. Wacharapluesadee, S.; Tan, C.W.; Maneeorn, P.; Duengkae, P.; Zhu, F.; Joyjinda, Y.; Kaewpom, T.; Chia, W.N.; Ampoot, W.; Lim, B.L.; et al. Evidence for SARS-CoV-2 related coronaviruses circulating in bats and pangolins in Southeast Asia. Nat. Commun. 2021, 12. [Google Scholar] [CrossRef]
  23. Hul, V.; Delaune, D.; Karlsson, E.A.; Hassanin, A.; Tey, P.O.; Baidaliuk, A.; Gámbaro, F.; Tu, V.T.; Keatts, L.; Mazet, J.; et al. A novel SARS-CoV-2 related coronavirus in bats from Cambodia. bioRxiv 2021. [Google Scholar] [CrossRef]
  24. Corman, V.M.; Muth, D.; Niemeyer, D.; Drosten, C. Hosts and sources of endemic human coronaviruses. Adv. Virus Res. 2018, 100, 163–188. [Google Scholar] [CrossRef] [PubMed]
  25. Pfefferle, S.; Oppong, S.; Drexler, J.F.; Gloza-Rausch, F.; Ipsen, A.; Seebens, A.; Müller, M.A.; Annan, A.; Vallo, P.; Adu-Sarkodie, Y.; et al. Distant relatives of severe acute respiratory syndrome coronavirus and close relatives of human coronavirus 229E in bats, Ghana. Emerg. Infect. Dis. 2009, 15, 1377–1384. [Google Scholar] [CrossRef]
  26. Wang, W.; Lin, X.D.; Guo, W.P.; Zhou, R.H.; Wang, M.R.; Wang, C.Q.; Ge, S.; Mei, S.H.; Li, M.H.; Shi, M.; et al. Discovery, diversity and evolution of novel coronaviruses sampled from rodents in China. Virology 2015, 474, 19–27. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Lazov, C.M.; Chriél, M.; Baagøe, H.J.; Fjederholt, E.; Deng, Y.; Kooi, E.A.; Belsham, G.J.; Bøtner, A.; Rasmussen, T.B. Detection and characterization of distinct alphacoronaviruses in five different bat species in Denmark. Viruses 2018, 10, 486. [Google Scholar] [CrossRef] [Green Version]
  28. Kivistö, I.; Tidenberg, E.-M.; Lilley, T.; Suominen, K.; Forbes, K.M.; Vapalahti, O.; Huovilainen, A.; Sironen, T. First report of coronaviruses in Northern European bats. Vector Borne Zoonotic Dis. 2019, 20, 155–158. [Google Scholar] [CrossRef] [PubMed]
  29. Moon, J.S.; Domier, L.L.; McCoppin, N.K.; D’Arcy, C.J.; Jin, H. Nucleotide sequence analysis shows that Rhopalosiphum padi virus is a member of a novel group of insect-infecting RNA viruses. Virology 1998, 243, 54–65. [Google Scholar] [CrossRef] [Green Version]
  30. Attoui, H.; Billoir, F.; Biagini, P.; De Micco, P.; De Lamballerie, X. Complete sequence determination and genetic analysis of Banna virus and Kadipiro virus: Proposal for assignment to a new genus (Seadornavirus) within the family Reoviridae. J. Gen. Virol. 2000, 81, 1507–1515. [Google Scholar] [CrossRef] [PubMed]
  31. Bosch, A.; Guix, S.; Krishna, N.K.; Méndez, E.; Monroe, S.S.; Pantin-Jackwood, M.; Schultz-Cherry, S. Family: Astroviridae. In Virus Taxonomy, Ninth Report of the International Committee on Taxonomy of Viruses; King, A.M.Q., Adams, M.J., Carstens, E.B., Lefkowitz, E.J., Eds.; Elsevier: Amsterdam, The Netherlands, 2012; pp. 953–959. [Google Scholar] [CrossRef]
  32. Fischer, K.; dos Reis, V.P.; Balkema-Buschmann, A. Bat astroviruses: Towards understanding the transmission dynamics of a neglected virus family. Viruses 2017, 9, 34. [Google Scholar] [CrossRef]
  33. Menzel, P.; Ng, K.L.; Krogh, A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat. Commun. 2016, 7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Andrews, S. FastQC v. 0.11.8. Babraham Institute Bioinformatics Group. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 25 April 2021).
  35. Geneious Prime v. 2021.1.1 Biomatters Ltd. Available online: https://www.geneious.com (accessed on 25 April 2021).
  36. Bushnell, B. BBDuk Trimmer v. 1.0. Biomatters Ltd. Available online: https://www.geneious.com/plugins/bbduk/#quick-start (accessed on 25 April 2021).
  37. Nurk, S.; Meleshko, D.; Korobeynikov, A.; Pevzner, P.A. MetaSPAdes: A new versatile metagenomic assembler. Genome Res. 2017, 27, 824–834. [Google Scholar] [CrossRef] [Green Version]
  38. Sayers, E.W.; Beck, J.; Bolton, E.E.; Bourexis, D.; Brister, J.R.; Canese, K.; Comeau, D.C.; Funk, K.; Kim, S.; Klimke, W.; et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2021, 49, D10–D17. [Google Scholar] [CrossRef]
  39. ICTV Master Species List 2019.v1. Available online: https://talk.ictvonline.org/files/master-species-lists/m/msl/9601 (accessed on 25 April 2021).
  40. Edgar, R.C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Darling, A.C.E.; Mau, B.; Blattner, F.R.; Perna, N.T. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004, 14, 1394–1403. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Guindon, S.; Dufayard, J.F.; Lefort, V.; Anisimova, M.; Hordijk, W.; Gascuel, O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 2010, 59, 307–321. [Google Scholar] [CrossRef] [Green Version]
  43. Shi, M.; Lin, X.D.; Tian, J.H.; Chen, L.J.; Chen, X.; Li, C.X.; Qin, X.C.; Li, J.; Cao, J.P.; Eden, J.S.; et al. Redefining the invertebrate RNA virosphere. Nature 2016, 540, 539–543. [Google Scholar] [CrossRef] [PubMed]
  44. Reorganization and Expansion of the Order Nidovirales at the Family and Sub-Order Ranks, Code 2017.015S. Available online: https://talk.ictvonline.org/ictv/proposals/2017.012_015S.A.v1.Nidovirales.zip (accessed on 25 April 2021).
  45. Fischer, K.; Zeus, V.; Kwasnitschka, L.; Kerth, G.; Haase, M.; Groschup, M.H.; Balkema-Buschmann, A. Insectivorous bats carry host specific astroviruses and coronaviruses across different regions in Germany. Infect. Genet. Evol. 2016, 37, 108–116. [Google Scholar] [CrossRef]
  46. Chu, D.K.W.; Poon, L.L.M.; Guan, Y.; Peiris, J.S.M. Novel astroviruses in insectivorous bats. J. Virol. 2008, 82, 9107–9114. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. In the Genus Mamastrovirus, Family Astroviridae, Create 14 Species, Abolish One Species and Rename 5 Species, Code 2010.018a-cV. Available online: https://talk.ictvonline.org/ictv/proposals/2010.018a-cV.A.v4.Mamastrovirus.pdf (accessed on 25 April 2021).
  48. Jones, R.A.C. Plant and insect viruses in managed and natural environments: Novel and neglected transmission pathways. In Advances in Virus Research, 1st ed.; Elsevier Inc.: Amsterdam, The Netherlands, 2018. [Google Scholar] [CrossRef]
  49. Terenin, I.M.; Dmitriev, S.E.; Andreev, D.E.; Royall, E.; Belsham, G.J.; Roberts, L.O.; Shatsky, I.N. A cross-kingdom internal ribosome entry site reveals a simplified mode of internal ribosome entry. Mol. Cell. Biol. 2005, 25, 7879–7888. [Google Scholar] [CrossRef] [Green Version]
  50. Valles, S.M.; Chen, Y.; Firth, A.E.; Guérin, D.M.A.; Hashimoto, Y.; Herrero, S.; de Miranda, J.R.; Ryabov, E. ICTV virus taxonomy profile: Dicistroviridae. J. Gen. Virol. 2017, 98, 355–356. [Google Scholar] [CrossRef]
  51. Zana, B.; Kemenesi, G.; Urbán, P.; Földes, F.; Görföl, T.; Estók, P.; Boldogh, S.; Kurucz, K.; Jakab, F. Metagenomic analysis of bat guano samples revealed the presence of viruses potentially carried by insects, among others by Apis mellifera in Hungary. Acta Vet. Hung. 2018, 66, 151–161. [Google Scholar] [CrossRef] [PubMed]
  52. Ban, L.; Didon, A.; Jonsson, L.M.V.; Glinwood, R.; Delp, G. An improved detection method for the Rhopalosiphum padi virus (RhPV) allows monitoring of its presence in aphids and movement within plants. J. Virol. Methods 2007, 142, 136–142. [Google Scholar] [CrossRef]
  53. Walker, P.J.; Siddell, S.G.; Lefkowitz, E.J.; Mushegian, A.R.; Adriaenssens, E.M.; Dempsey, D.M.; Dutilh, B.E.; Harrach, B.; Harrison, R.L.; Hendrickson, R.C.; et al. Changes to virus taxonomy and the statutes ratified by the International Committee on Taxonomy of Viruses (2020). Arch. Virol. 2020, 165, 2737–2748. [Google Scholar] [CrossRef]
  54. Reuter, G.; Boros, Á.; Delwart, E.; Pankovics, P. Novel seadornavirus (family Reoviridae) related to Banna virus in Europe. Arch. Virol. 2013, 158, 2163–2167. [Google Scholar] [CrossRef] [PubMed]
  55. Wang, J.; Li, H.; He, Y.; Zhou, Y.; Meng, J.; Zhu, W.; Chen, H.; Liao, D.; Man, Y. Isolation and genetic characterization of Mangshi virus: A newly discovered Seadornavirus of the Reoviridae family found in Yunnan Province, China. PLoS ONE 2015, 10, e0143601. [Google Scholar] [CrossRef]
  56. Attoui, H.; Mertens, P.P.C.; Becnel, J.; Belaganahalli, S.; Bergoin, M.; Brussaard, C.P.; Chappell, J.D.; Ciarlet, M.; del Vas, M.; Dermody, T.S.; et al. Family: Reoviridae. In Virus Taxonomy, Ninth Report of the International Committee on Taxonomy of Viruses; King, A.M.Q., Adams, M.J., Carstens, E.B., Lefkowitz, E.J., Eds.; Elsevier: Amsterdam, The Netherlands, 2012; pp. 541–637. [Google Scholar] [CrossRef]
  57. Tao, S.J.; Chen, B.Q. Studies of coltivirus in China. Chin. Med. J. 2005, 118, 581–586. [Google Scholar] [PubMed]
  58. Attoui, H.; Jaafar, F.M.; Belhouchet, M.; Tao, S.; Chen, B.; Liang, G.; Tesh, R.B.; de Micco, P.; de Lamballerie, X. Liao ning virus, a new Chinese seadornavirus that replicates in transformed and embryonic mammalian cells. J. Gen. Virol. 2006, 87, 199–208. [Google Scholar] [CrossRef]
  59. Ngoi, C.N.; Siqueira, J.; Li, L.; Deng, X.; Mugo, P.; Graham, S.M.; Price, M.A.; Sanders, E.J.; Delwart, E. The plasma virome of febrile adult Kenyans shows frequent parvovirus B19 infections and a novel arbovirus (Kadipiro virus). J. Gen. Virol. 2016, 97, 3359–3367. [Google Scholar] [CrossRef] [Green Version]
  60. Ngoi, C.N.; Siqueira, J.; Li, L.; Deng, X.; Mugo, P.; Graham, S.M.; Price, M.A. Corrigendum to: The plasma virome of febrile adult Kenyans shows frequent parvovirus B19 infections and a novel arbovirus (Kadipiro virus). J. Gen. Virol. 2017, 98, 517. [Google Scholar] [CrossRef] [Green Version]
  61. Moens, U.; Calvignac-Spencer, S.; Lauber, C.; Ramqvist, T.; Feltkamp, M.C.W.; Daugherty, M.D.; Verschoor, E.J.; Ehlers, B. ICTV virus taxonomy profile: Polyomaviridae. J. Gen. Virol. 2017, 98, 1159–1160. [Google Scholar] [CrossRef]
  62. Cantalupo, P.G.; Buck, C.B.; Pipas, J.M. Complete genome sequence of a polyomavirus recovered from a pomona leaf-nosed bat (Hipposideros pomona) metagenome data set. Genome Announc. 2017, 5, e01053-16. [Google Scholar] [CrossRef] [Green Version]
  63. Mishra, N.; Fagbo, S.F.; Alagaili, A.N.; Nitido, A.; Williams, S.H.; Ng, J.; Lee, B.; Durosinlorun, A.; Garcia, J.A.; Jain, K.; et al. A viral metagenomic survey identifies known and novel mammalian viruses in bats from Saudi Arabia. PLoS ONE 2019, 14, e0214227. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Hardmeier, I.S. Virome of Swiss Bats. Ph.D. Thesis, University of Zurich, Zurich, Switzerland, 2021. [Google Scholar] [CrossRef]
  65. Kemenesi, G.; Gellért, Á.; Dallos, B.; Görföl, T.; Boldogh, S.; Estók, P.; Marton, S.; Oldal, M.; Martella, V.; Bányai, K.; et al. Sequencing and molecular modeling identifies candidate members of Caliciviridae family in bats. Infect. Genet. Evol. 2016, 41, 227–232. [Google Scholar] [CrossRef]
  66. Yinda, C.K.; Zell, R.; Deboutte, W.; Zeller, M.; Conceição-Neto, N.; Heylen, E.; Maes, P.; Knowles, N.J.; Ghogomu, S.M.; Van Ranst, M.; et al. Highly diverse population of Picornaviridae and other members of the Picornavirales, in Cameroonian fruit bats. BMC Genom. 2017, 18, 249. [Google Scholar] [CrossRef] [Green Version]
  67. Debat, H.J. An RNA virome associated to the golden orb-weaver spider Nephila clavipes. Front. Microbiol. 2017, 8, 2097. [Google Scholar] [CrossRef] [Green Version]
  68. Oude Munnink, B.B.; Phan, M.V.T.; Simmonds, P.; Koopmans, M.P.G.; Kellam, P.; van der Hoek, L.; Cotten, M. Characterization of Posa and Posa-like virus genomes in fecal samples from humans, pigs, rats, and bats collected from a single location in Vietnam. Virus Evol. 2017, 3, vex022. [Google Scholar] [CrossRef] [Green Version]
  69. Liu, S.; Vijayendran, D.; Chen, Y.; Bonning, B.C. Aphis glycines Virus 2, a novel insect virus with a unique genome structure. Viruses 2016, 8, 315. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  70. Ryabov, E.V.; Keane, G.; Naish, N.; Evered, C.; Winstanley, D. Densovirus induces winged morphs in asexual clones of the rosy apple aphid, Dysaphis plantaginea. Proc. Natl. Acad. Sci. USA 2009, 106, 8465–8470. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Chen, Y.N.; Hsu, H.C.; Wang, S.W.; Lien, H.C.; Lu, H.T.; Peng, S.K. Entry of Scotophilus bat coronavirus-512 and severe acute respiratory syndrome coronavirus in human and multiple animal cells. Pathogens 2019, 8, 259. [Google Scholar] [CrossRef] [Green Version]
  72. De Wilde, A.H.; Snijder, E.J.; Kikkert, M.; van Hemert, M.J. Host factors in coronavirus replication. In Roles of Host Gene and Non-Coding RNA Expression in Virus Infection; Tripp, R.A., Tompkins, S.M., Eds.; Springer International Publishing: Cham, Switzerland, 2018; Volume 419, pp. 1–42. [Google Scholar] [CrossRef] [Green Version]
  73. Li, F. Structure, function, and evolution of coronavirus spike proteins. Annu. Rev. Virol. 2016, 3, 237–261. [Google Scholar] [CrossRef] [Green Version]
  74. Uddin, W.; Brändel, S.D.; Tschapka, M.; Page, R.; Rasche, A.; Corman, V.M.; Drosten, C.; Sommer, S. Astrovirus infections induce age-dependent dysbiosis in gut microbiomes of bats. ISME J. 2018, 12, 2883–2893. [Google Scholar] [CrossRef]
  75. Orłowska, A.; Smreczak, M.; Potyrało, P.; Bomba, A.; Trębas, P.; Rola, J. First detection of bat astroviruses (BtAstVs) among bats in Poland: The genetic BtAstVs diversity reveals multiple co-infection of bats with different strains. Viruses 2021, 13, 158. [Google Scholar] [CrossRef] [PubMed]
  76. Wohlgemuth, N.; Honce, R.; Schultz-Cherry, S. Infection, genetics and evolution astrovirus evolution and emergence. Infect. Genet. Evol. 2019, 69, 30–37. [Google Scholar] [CrossRef]
  77. Donato, C.; Vijaykrishna, D. The broad host range and genetic diversity of mammalian and avian astroviruses. Viruses 2017, 9, 102. [Google Scholar] [CrossRef] [Green Version]
  78. Barry, A.F.; Durães-Carvalho, R.; Oliveira-Filho, E.F.; Alfieri, A.A.; Van der Poel, W.H.M. High-Resolution phylogeny providing insights towards the epidemiology, zoonotic aspects and taxonomy of sapoviruses. Infect. Genet. Evol. 2017, 56, 8–13. [Google Scholar] [CrossRef]
  79. Gildow, F.E.; D’Arcy, C.J. Cytopathology and experimental host range of Rhopalosiphum padi virus, a small isometric RNA virus infecting cereal grain aphids. J. Invertebr. Pathol. 1990, 55, 245–257. [Google Scholar] [CrossRef]
  80. Bennett, A.J.; Bushmaker, T.; Cameron, K.; Ondzie, A.; Niama, F.R.; Parra, H.J.; Mombouli, J.V.; Olson, S.H.; Munster, V.J.; Goldberg, T.L. Diverse RNA viruses of arthropod origin in the blood of fruit bats suggest a link between bat and arthropod viromes. Virology 2019, 528, 64–72. [Google Scholar] [CrossRef]
  81. Attoui, H.; Charrel, R.N.; Billoir, F.; Cantaloube, J.F.; De Micco, P.; De Lamballerie, X. Comparative sequence analysis of American, European and Asian isolates of viruses in the genus Coltivirus. J. Gen. Virol. 1998, 79, 2481–2489. [Google Scholar] [CrossRef] [Green Version]
  82. Zhang, W.; Li, F.; Liu, A.; Lin, X.; Fu, S.; Song, J.; Liu, G.; Shao, N.; Tao, Z.; Wang, Q.; et al. Identification and genetic analysis of Kadipiro virus isolated in Shandong province, China. Virol. J. 2018, 15, 64. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  83. Kohl, C.; Kurth, A. European bats as carriers of viruses with zoonotic potential. Viruses 2014, 6, 3110–3128. [Google Scholar] [CrossRef] [Green Version]
  84. Seltmann, A.; Corman, V.M.; Rasche, A.; Drosten, C.; Czirják, G.; Bernard, H.; Struebig, M.J.; Voigt, C.C. Seasonal fluctuations of astrovirus, but not coronavirus shedding in bats inhabiting human-modified tropical forests. Ecohealth 2017, 14, 272–284. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  85. Kemenesi, G.; Dallos, B.; Gorfol, T.; Boldogh, S.; Estok, P.; Kurucz, K.; Kutas, A.; Foldes, F.; Oldal, M.; Nemeth, V.; et al. Molecular survey of RNA viruses in hungarian bats: Discovering novel astroviruses, coronaviruses, and caliciviruses. Vector Borne Zoonotic Dis. 2014, 14, 846–855. [Google Scholar] [CrossRef] [PubMed]
  86. Gloza-Rausch, F.; Ipsen, A.; Seebens, A.; Göttsche, M.; Panning, M.; Drexler, J.F.; Petersen, N.; Annan, A.; Grywna, K.; Müller, M.; et al. Detection and prevalence patterns of group I coronaviruses in bats, Northern Germany. Emerg. Infect. Dis. 2008, 14, 626–631. [Google Scholar] [CrossRef]
  87. Bentley, K.; Evans, D.J. Mechanisms and consequences of positive-strand RNA virus recombination. J. Gen. Virol. 2018, 99, 1345–1356. [Google Scholar] [CrossRef]
  88. Fagre, A.C.; Kading, R.C. Can bats serve as reservoirs for Arboviruses? Viruses 2019, 11, 215. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Comparison of full-length genomes of bat coronaviruses (BtCoVs) to one another, and to the closest matching full-length sequences from GenBank, after aligning the sequences with MUSCLE. The grouping of sequences corresponds to the clustering seen in the phylogenetic tree (Figure 2). The numbers in the upper-right part of the table are the percentage nucleotide identities between the sequences, while the lower-left part shows the actual number of nucleotide differences between the sequences. The heat map shading indicates degree of identity, with increased similarity shown by darker colours. The incomplete genome sequence BtCoV/7542-55/P.pyg/DK/2014 was not included in this analysis. The new sequences reported here are marked in bold text.
Figure 1. Comparison of full-length genomes of bat coronaviruses (BtCoVs) to one another, and to the closest matching full-length sequences from GenBank, after aligning the sequences with MUSCLE. The grouping of sequences corresponds to the clustering seen in the phylogenetic tree (Figure 2). The numbers in the upper-right part of the table are the percentage nucleotide identities between the sequences, while the lower-left part shows the actual number of nucleotide differences between the sequences. The heat map shading indicates degree of identity, with increased similarity shown by darker colours. The incomplete genome sequence BtCoV/7542-55/P.pyg/DK/2014 was not included in this analysis. The new sequences reported here are marked in bold text.
Viruses 13 01073 g001
Figure 2. Phylogenetic tree of bat coronavirus sequences from Denmark. The sequences reported here (highlighted in orange) were compared to the closest matching sequences from GenBank and at least one ICTV exemplar isolate of each species of the Orthocoronavirinae subfamily. Bat coronavirus sequences are indicated with a bat symbol. The tree was generated using the PhyML maximum likelihood method with 100 bootstrap iterations on MUSCLE-aligned concatenated amino acid sequences predicted from the partial ORF1ab nt sequences (conserved regions of nsp5, nsp12, and nsp13), with a total length of 1224 amino acids. Only bootstrap values above 50 are displayed on the nodes. The evolutionary model for the tree was LG + G. The ICTV reference sequences have all been assigned to subgenera within the genera Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus. Clustering of the Danish bat coronavirus sequences indicates that they belong to the same subgenera as the reference sequences in the genus Alphacoronavirus, within the subgenera Pedacovirus, Nyctacovirus, and Myotacovirus (highlighted in green).
Figure 2. Phylogenetic tree of bat coronavirus sequences from Denmark. The sequences reported here (highlighted in orange) were compared to the closest matching sequences from GenBank and at least one ICTV exemplar isolate of each species of the Orthocoronavirinae subfamily. Bat coronavirus sequences are indicated with a bat symbol. The tree was generated using the PhyML maximum likelihood method with 100 bootstrap iterations on MUSCLE-aligned concatenated amino acid sequences predicted from the partial ORF1ab nt sequences (conserved regions of nsp5, nsp12, and nsp13), with a total length of 1224 amino acids. Only bootstrap values above 50 are displayed on the nodes. The evolutionary model for the tree was LG + G. The ICTV reference sequences have all been assigned to subgenera within the genera Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus. Clustering of the Danish bat coronavirus sequences indicates that they belong to the same subgenera as the reference sequences in the genus Alphacoronavirus, within the subgenera Pedacovirus, Nyctacovirus, and Myotacovirus (highlighted in green).
Viruses 13 01073 g002
Figure 3. Phylogenetic tree of bat astrovirus sequences from Denmark. The Danish sequences (indicated in bold text) were analysed with the closest matching sequences with full ORF2 coding sequences available in GenBank, and at least one ICTV exemplar isolate of each species of the genus Mamastrovirus in the Astroviridae family, with an exemplar isolate of the genus Avastrovirus chosen as an outgroup. Bat astrovirus sequences are indicated with a bat symbol. The tree was generated using the neighbour-joining method, with 1000 bootstrap iterations on MUSCLE-aligned predicted amino acid sequences from the complete ORF2 gene coding sequence, with a total length of 1144 amino acids. Bootstrap values above 50 are displayed.
Figure 3. Phylogenetic tree of bat astrovirus sequences from Denmark. The Danish sequences (indicated in bold text) were analysed with the closest matching sequences with full ORF2 coding sequences available in GenBank, and at least one ICTV exemplar isolate of each species of the genus Mamastrovirus in the Astroviridae family, with an exemplar isolate of the genus Avastrovirus chosen as an outgroup. Bat astrovirus sequences are indicated with a bat symbol. The tree was generated using the neighbour-joining method, with 1000 bootstrap iterations on MUSCLE-aligned predicted amino acid sequences from the complete ORF2 gene coding sequence, with a total length of 1144 amino acids. Bootstrap values above 50 are displayed.
Viruses 13 01073 g003
Table 1. Overview of bat faecal sample material used in the study. Previously published data mentioned with GenBank accession numbers consist of partial bat coronavirus ORF1b sequences of 208 or 130 nucleotides in length.
Table 1. Overview of bat faecal sample material used in the study. Previously published data mentioned with GenBank accession numbers consist of partial bat coronavirus ORF1b sequences of 208 or 130 nucleotides in length.
Sample IDSpeciesCollection TimeLocationPrevious DataVirus Sequences Assembled in This Study
13585-35M. dauOctober 2014MønstedLR025670, LR025709BtCoV
13585-58M. dauOctober 2014MønstedLR025680, LR025712BtCoV, BtAstV, BtPV, RAAV
21164-6M. dauNovember 2015MønstedLR025732BtCoV, BtAstV, BtCV, BtPyV, KDV, BaV
OV-157M. dauOctober 2018Mønsted-BtCoV, BtCV, RhPV
18802-1M. dasNovember 2016MønstedLR025723BtCoV, RVH
B40-5P. pygAugust 2013SollerupLR025698, LR025733BtCoV
7542-55P. pygJune 2014VadumLR025676, LR025706BtCoV
Abbreviations of bat species names are: Myotis daubentonii (M. dau); Myotis dasycneme (M. das); and Pipistrellus pygmaeus (P. pyg). Abbreviations of virus names are: bat coronavirus (BtCoV); bat astrovirus (BtAstV); bat calicivirus/sapovirus (BtCV), rotavirus H (RVH); bat polyomavirus (BtPyV); bat picornavirus (BtPV); Basavirus (BaV); Kadipiro virus (KDV); Rhopalosiphum padi virus (RhPV); and rosy apple aphid virus (RAAV).
Table 2. Overview of the output from Kaiju metagenomics analysis, with a focus on reads assigned to selected viruses. Read numbers highlighted in bold indicate that viral genome sequences were assembled.
Table 2. Overview of the output from Kaiju metagenomics analysis, with a focus on reads assigned to selected viruses. Read numbers highlighted in bold indicate that viral genome sequences were assembled.
NGS DatasetD28-7D28-9D32-7D32-9D29-6 + D32-10D32-6D32-4
Bat speciesM. dauM. dauM. dauM. dauM. dasP. pygP. pyg
Sample no.13585-3513585-5821164-6OV-15718802-1B40-57542-55
Biosample ac. no.SAMN
18653548
SAMN
18653642
SAMN
18661215
SAMN
18680226
SAMN
18665528
SAMN
18672062
SAMN
18672089
Assigned reads
(% of raw data)
296,356
(23%)
1,469,272
(42%)
2,234,894
(63%)
2,155,628
(62%)
1,495,375
(49%)
55,361
(19%)
182,572
(50%)
Archaea22215031323522078207360
Bacteria189,6021,121,10228,900164,5521,447,06039,417148,103
Eukaryota22,941144,898886818,81318,25124389956
Viruses:72,440189,0142,195,3081,961,910899810,1872871
Unclassified vir.13,181 17865 1 3742 2497,671 1683
Unclas. RNA vir. [43]44,631 340,180 36584 4654,384 32644015
Unclas. bacterial vir.262997-1322820
Retro-transcribing vir.386443166276113115
dsDNA viruses1012208233 597156584188
ssDNA viruses-23--167-
dsRNA viruses458 6292,113,874 7770 890 9--
ssRNA viruses:9980136,49270,512734,899670895012517
Unclassified vir.227242 10-306---
Coronavirinae6381117,97268,83852,185593472731173
Astroviridae857547442---
Tymovirales87281154157191-
Nodaviridae8-357-6--
Picornavirales:3166 1094 435 13681,091 41182514
Picornaviridae-902 12-42-2-
Iflaviridae443137-5063 15121795 14-
Dicistroviridae2679 111748675,984 11261910
Caliciviridae--28276---
1 Predominant species: Aphis glycines virus 2; 2 predominant species: Basavirus sp.; 3 predominant species: Hubei permutotetra-like virus 4; 4 predominant species: Hubei partiti-like virus 22; 5 predominant species: Pomona leaf-nosed-bat-associated polyomavirus; 6 predominant species: unclassified Birnaviridae; 7 predominant species: Kadipiro virus; 8 predominant species: Totiviridae; 9 predominant species: rotavirus; 10 predominant species: rosy apple aphid virus; 11 predominant species: Rhopalosiphum padi virus; 12 predominant species: Washington bat picornavirus; 13 predominant species: Pow Burn virus; 14 predominant species: iflavirus; 15 predominant species: Brevicoryne brassicae picorna-like virus. Abbreviations used: viruses (vir.); unclassified (unclas.); double-stranded (ds); single-stranded (ss).
Table 3. List of all full-length (highlighted in bold) and partial genome sequences assembled in this study and deposited in GenBank.
Table 3. List of all full-length (highlighted in bold) and partial genome sequences assembled in this study and deposited in GenBank.
Virus DescriptionVirus Strain NameMapped ReadsAverage
Coverage
Length nt.Accession
Number
AlphacoronavirusBtCoV/13585-35/M.dau/DK/201488005028,096MN535731
BtCoV/13585-58/M.dau/DK/2014140,000100028,140MN535732
BtCoV/21164-6/M.dau/DK/201570,00037028,117MN543743
BtCoV/21164-6-alt/M.dau/DK/201520,00012028,092MZ218052
BtCoV/OV-157/M.dau/DK/201813,00010228,192MN535733
BtCoV/18802-1/M.das/DK/201698004228,013MN535734
BtCoV/B40-5/P.pyg/DK/201392005927,943MN482242
BtCoV/7542-55/P.pyg/DK/20141800827,786 1MZ218060
MamastrovirusBtAstV/13585-58/M.dau/DK/201417,0004656557MN832787
BtAstV/21164-6-A/M.dau/DK/2015630156619MZ218053
BtAstV/21164-6-B/M.dau/DK/2015540136558 2MZ218054
CalicivirusBtCV/OV-157/M.dau/DK/201835287486 3MZ218056
BtCV/21164-6-A/M.dau/DK/201521.5396MZ218057
BtCV/21164-6-B/M.dau/DK/201561.4706MZ218058
Rotavirus HRVH/18802-1/M.das/DK/2016982.258–2169 4,5MZ218062-
MZ218070
PolyomavirusBtPyV/21164-6/M.dau/DK/2015710174758MZ218055
PicornavirusBtPV/13585-58/M.dau/DK/2014870169300MZ218061
BasavirusBaV/21164-6/M.dau/DK/201585601509155MW929926
Kadipiro virusKDV/21164-6/M.dau/DK/20151,960,0003700–16,500754–3750 6MN543741,
MN543742,
MN543744-
MN543753
Rhopalosiphum padi virusRhPV/OV-157/M.dau/DK/2018440,000649010,009MN535735
RhPV/OV-157-alt/M.dau/DK/2018340,000460010,010MZ218059
Rosy apple aphid virusRAAV/13585-58/M.dau/DK/201458601029742MW929927
Some sequences contain NNNs to indicate unknown nucleotides. Numbers of NNNs are: 1 2037; 2 4; 3 70; 4 2558. 5 A total of 9 out of 11 partial genome segments, with a total length of 5542 nt (excluding NNNs), were assembled into gapped sequences for rotavirus H using both reference assemblies and de novo assembly. 6 All 12 genome segments of Kadipiro virus were assembled with a total length of 20,781 nt.
Table 4. The closest matching sequences to the bat astroviruses found in this study are not complete genomes, but partial ORF1ab gene sequences generated by sequencing of RT-PCR products (around 381 nucleotides in length without primers) from the most commonly used assay for detection of astroviruses in bats [46]. This table summarizes the top three identity BLASTn search hits of these short sequences as accession numbers, with % identity/length in parentheses. The 1st BLASTn hits were described previously [45]. Abbreviations used for German regions: North Rhine-Westphalia (NRW); and Mecklenburg-Western Pomerania (MV).
Table 4. The closest matching sequences to the bat astroviruses found in this study are not complete genomes, but partial ORF1ab gene sequences generated by sequencing of RT-PCR products (around 381 nucleotides in length without primers) from the most commonly used assay for detection of astroviruses in bats [46]. This table summarizes the top three identity BLASTn search hits of these short sequences as accession numbers, with % identity/length in parentheses. The 1st BLASTn hits were described previously [45]. Abbreviations used for German regions: North Rhine-Westphalia (NRW); and Mecklenburg-Western Pomerania (MV).
BtAstV Genomes1st BLASTn Hit1st BLASTn Hit Source2nd BLASTn Hit3rd BLASTn Hit
BtAstV/13585-58/M.dau/DK/2014KT894882
(93.7/379)
M. daubentonii, D15,
September 2014, NRW
EU847161
(77.5/240)
EU847160
(77.5/240)
BtAstV/21164-6-A/M.dau/DK/2015KT894889
(98.2/381)
M. daubentonii, D13,
September 2014, NRW
KT894883
(91.9/381)
KT894893
(92.0/375)
BtAstV/21164-6-B/M.dau/DK/2015KT894894
(99.5/381)
M. daubentonii, D10,
July 2013, MV
KT894896
(99.2/381)
KT894895
(99.2/381)
Table 5. Rhopalosiphum padi virus comparisons to one another, to the reference RhPV sequence NC_001874, and to the closely related sequence MF535298 from Hungary [51].
Table 5. Rhopalosiphum padi virus comparisons to one another, to the reference RhPV sequence NC_001874, and to the closely related sequence MF535298 from Hungary [51].
RhPV/OV-157/M.dau/DK/2018RhPV/OV-157-alt/M.dau/DK/2018
Nucleotide Comparison(10,009 nt)(10,010 nt)
Nt. DifferencesPercent IdentityNt. DifferencesPercent Identity
NC_001874 (10,011 nt)35196.535796.4
- 5′-end (579 nt)4791.94891.7
- ORF1 (5997 nt)23096.223196.2
- IGR (533 nt)898.51197.9
- ORF2 (2451 nt)5997.65997.6
- 3′-end (451 nt)798.4898.2
MF535298 (9966 nt)5099.57699.2
- 5′-end (219 nt)01000100
- ORF1 (5997 nt)1999.74899.2
- IGR (533 nt)299.6399.4
- ORF2 (2451 nt)2399.12199.1
- 3′-end (451 nt)698.7499.1
OV-157-alt (10,010 nt)10698.9
- 5′-end (579 and 580 nt)299.7
- ORF1 (5997 nt)5999.0
- IGR (533 nt)599.1
- ORF2 (2361 nt)3198.7
- 3′-end (539 nt)998.3
Amino acid comparison: ORF1(1988 a.a.)(1988 a.a.)
A.a. differencesPercent identityA.a. differencesPercent identity
NC_001874 (1988 a.a.)2398.92898.6
MF535298 (1988 a.a.)399.9499.8
OV-157-alt (1988 a.a.)799.7
Amino acid comparison: ORF2(788 a.a.)(788 a.a.)
A.a. differencesPercent identityA.a. differencesPercent identity
NC_001874 (818 a.a.)1198.61398.4
MF535298 (818 a.a.)699.2699.2
OV-157-alt (788 a.a.)499.5
Table 6. Assembled Kadipiro virus genome segments listed with the first three BLASTn search hits as accession numbers, with percent identities in parentheses. KX884650–KX884661: Kadipiro virus strain QTM27331 from China 2013; MG590140–MG590148, MG590150–MG590151: Kadipiro virus strain SDKL1625 from China 2016; AF133429, AF133509–AF133513, AF052019–AF052023: Kadipiro virus strain JKT-7075, an isolate originally from Java, Indonesia [30]; KX247778: Kadipiro virus isolate Kenya2 [59,60]; FJ159105: Kadipiro virus strain YN0557 from China 2005.
Table 6. Assembled Kadipiro virus genome segments listed with the first three BLASTn search hits as accession numbers, with percent identities in parentheses. KX884650–KX884661: Kadipiro virus strain QTM27331 from China 2013; MG590140–MG590148, MG590150–MG590151: Kadipiro virus strain SDKL1625 from China 2016; AF133429, AF133509–AF133513, AF052019–AF052023: Kadipiro virus strain JKT-7075, an isolate originally from Java, Indonesia [30]; KX247778: Kadipiro virus isolate Kenya2 [59,60]; FJ159105: Kadipiro virus strain YN0557 from China 2005.
Genome SegmentSegment LengthAccession Number1st BLASTn Hit2nd BLASTn Hit3rd BLASTn Hit
Segment 13799MN543741KX884650 (86)MG590148 (86)AF133429 (84)
Segment 23016MN543742MG590140 (85)KX884651 (84)AF134509 (82)
Segment 32353MN543744MG590141 (84)KX884652 (84)AF134510 (81)
Segment 42121MN543745KX884653 (86)AF134511 (84)KX247778 (74)
Segment 51895MN543746MG590142 (83)KX884654 (83)AF134512 (81)
Segment 61659MN543747KX884655 (87)AF134513 (86)MG590143 (86)
Segment 71232MN543748MG590144 (92)KX884656 (92)AF052023 (90)
Segment 81083MN543749KX884657 (88)AF052022 (85)MG590145 (86)
Segment 91041MN543750KX884658 (91)MG590150 (90)AF052021 (88)
Segment 10914MN543751KX884659 (87)MG590151 (86)AF052020 (84)
Segment 11875MN543752KX884660 (85)AF052019 (83)MG590146 (83)
Segment 12754MN543753MG590147 (93)FJ159105 (90)KX884661 (89)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lazov, C.M.; Belsham, G.J.; Bøtner, A.; Rasmussen, T.B. Full-Genome Sequences of Alphacoronaviruses and Astroviruses from Myotis and Pipistrelle Bats in Denmark. Viruses 2021, 13, 1073. https://doi.org/10.3390/v13061073

AMA Style

Lazov CM, Belsham GJ, Bøtner A, Rasmussen TB. Full-Genome Sequences of Alphacoronaviruses and Astroviruses from Myotis and Pipistrelle Bats in Denmark. Viruses. 2021; 13(6):1073. https://doi.org/10.3390/v13061073

Chicago/Turabian Style

Lazov, Christina M., Graham J. Belsham, Anette Bøtner, and Thomas Bruun Rasmussen. 2021. "Full-Genome Sequences of Alphacoronaviruses and Astroviruses from Myotis and Pipistrelle Bats in Denmark" Viruses 13, no. 6: 1073. https://doi.org/10.3390/v13061073

APA Style

Lazov, C. M., Belsham, G. J., Bøtner, A., & Rasmussen, T. B. (2021). Full-Genome Sequences of Alphacoronaviruses and Astroviruses from Myotis and Pipistrelle Bats in Denmark. Viruses, 13(6), 1073. https://doi.org/10.3390/v13061073

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop