Next Article in Journal
Current Dengue Virus Vaccine Developments and Future Directions
Previous Article in Journal
Novel Oronasal Drainage for Long COVID: Proposed Mechanisms—Case Report
Previous Article in Special Issue
Evolutionary Insights from Association Rule Mining of Co-Occurring Mutations in Influenza Hemagglutinin and Neuraminidase
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complete Genome Classification System of Rotavirus alphagastroenteritidis: An Updated Analysis

by
Ricardo Gabriel Díaz Alarcón
1,2,
Karina Salvatierra
3,
Emiliano Gómez Quintero
1,
Domingo Javier Liotta
1,4,
Viviana Parreño
5 and
Samuel Orlando Miño
1,6,*
1
Laboratory of Applied Molecular Biology (LaBiMAp), Faculty of Exacts, Chemical and Natural Sciences, National University of Misiones (UNaM), Posadas CP3300, Misiones, Argentina
2
National Council for Scientific and Technical Research (CONICET), Av. Mariano Moreno 1375, Lab 105, Posadas CP3300, Misiones, Argentina
3
Laboratory “MADAR”, National University of Misiones (UNaM), Ruta 12, Km 7 y ½, Posadas CP3300, Misiones, Argentina
4
National Institute of Tropical Medicine (INMeT)—ANLIS “Dr. Carlos Malbrán”, Puerto Iguazú CP3370, Misiones, Argentina
5
National Institute of Agricultural Technology (INTA), IncuINTA, De Las Cabañas y De los Reseros s/n, Hurlingham CP1816, Buenos Aires, Argentina
6
National Institute of Agricultural Technology (INTA), EEA Cerro Azul, National Route 14, Km 836, Cerro Azul CP3313, Misiones, Argentina
*
Author to whom correspondence should be addressed.
Viruses 2025, 17(2), 211; https://doi.org/10.3390/v17020211 (registering DOI)
Submission received: 12 December 2024 / Revised: 21 January 2025 / Accepted: 27 January 2025 / Published: 31 January 2025
(This article belongs to the Special Issue Virus Bioinformatics 2024)

Abstract

:
Rotavirus alphagastroenteritidis is the major causative agent of acute gastroenteritis in both children under the age of 5 and young mammals and birds globally. RVAs are non-enveloped viruses with a genome comprising 11 double-stranded RNA segments. In 2008, the Rotavirus Classification Working Group pioneered a comprehensive and complete RVA genome classification system, establishing a specific threshold, which measures the genetic distances between homologous genes. The aim of this study was to perform an updated systematic analysis of the genetic variability across all RVA genes. Our investigation involved assessing the established cutoff values for each RVA genome segment and determining the need for any updates. To achieve this objective, multiple sequence alignments were constructed for all 11 genes and one for each genotype with discrepancies. Also, pairwise distances along with their cutoff values were evaluated. The analyses provided insights into the current relevance of cutoff values, which remain applicable for the majority of genotypes. In conclusion, this study fortifies the current classification system by highlighting its robustness and accurate genotyping of Rotavirus alphagastroenteritidis.

1. Introduction

Rotavirus alphagastroenteritidis (RVA) is a major pathogen associated with acute gastroenteritis in children, young mammals, and birds worldwide [1]. RVA has a segmented double-stranded RNA genome which consists of 11 genome segments enclosed in a triple-layered icosahedral capsid [2]. The 11 genome segments encode six viral structural proteins (VP1 to VP4, VP6, and VP7) and six nonstructural proteins (NSP1 to NSP6) [3]. Each genome segment, with the exception of gene 11 that encodes two proteins (NSP5 and NSP6), codes for a single viral protein [4]. The inner layer of the rotavirus virion is mainly composed of VP2, which encases VP1; the viral RNA-dependent RNA polymerase, VP3; the viral capping enzyme; and the viral genome [2]. The middle layer of the virion comprises VP6 trimmers exclusively; VP7 and spikes of VP4 compose the outer layer [2]. VP7 and VP4 were the initial basis of a binary classification system defining the G types (glycoprotein) and P types (protease sensitive), respectively. VP7 and VP4 are capable of independently eliciting neutralizing antibodies, initially used to define rotavirus G and P serotypes [4].
For human strains, three genogroups have been established as follows: two major genogroups represented by the reference Wa and DS-1 strains and one minor genogroup represented by reference AU-1 strain [5]. These genogroups were historically determined by hybridization techniques [5], and they were later confirmed by sequencing, establishing particular “genotype constellations” in line with these genogroups. Sequencing rotavirus genomes and phylogenetic and phylodynamic analyses are critical for understanding the patterns of virus evolution. One method that is frequently used to study the genetic distances between virus strains consists of pairwise sequence identity comparisons [6]. Studying evolutionary patterns, the main generator of diversity in rotavirus appears to be point mutations. These occur continuously due to the high error rate of the RVA polymerase [7]. In addition, viral genome reassortments occur between co-infecting strains, often involving zoonotic transmission [1]. During reassortment events certain RVA genes apparently co-segregate, suggesting an important degree of gene linkage [8]. Furthermore, when host restriction has been observed, only RVAs of particular gene constellations can efficiently infect certain hosts. In fact, a detailed look at RVA genome constellations shows a restricted gene pattern in different animal species, with only a few promiscuous genotypes able to infect multiple species [9,10,11].
In the past decades, specific rotavirus strains were associated with specific animal species; however, after the implementation of the new classification system, the host species’ descriptions were improved [9]. Human RVA strains that possess genes commonly found in animal rotaviruses have been isolated from infected children [9]. Strains such as G3 (found commonly in species such as cats, dogs, monkeys, pigs, mice, rabbits, and horses), G5 (pigs), G6 and G8 (cattle), G9 (pigs), and G10 (cattle) have been isolated from the human population throughout the world [9,12,13,14]. On the other hand, the most common RVA genotypes circulating in humans worldwide are G1P[8], G2P[4], G3P[8], G4P[8], G9P[8], and G12P[8] [15]. In a recent review of RVA distribution in animals, it was observed that reassortment occurs frequently. Many genotypes combine constantly and new reassortants continue to appear [9]. These new reassortants may be transferred to other species, leading to unpredictable outcomes and opening the possibility for the emergence of new virulent variants with unforeseen impacts [9].
Nucleotide composition is the simplest way of characterizing genomes [16], and it is essential to the study of viral evolution, particularly the interplay between viruses and host cells [17]. On the other hand, mutation rate estimates vary considerably, even for the same virus [18,19]. Since viral mutation rates have implications for epidemiological surveillance, pathogenesis [20,21], vaccine development [22,23] antiviral therapy [24], and disease management [25,26], it is important to have accurate data at molecular level [27].
To properly study the evolution of rotaviruses, the establishment of a classification system in which individual genes fall into defined clusters/genotypes based on reliable percentage identity cutoff values is crucial [6]. In recent decades, multiple rotavirus strains have been analyzed and compared to one another by the partial or complete sequencing of all 11 gene segments as this approach has allowed researchers to determine direct genetic relationships [6]. The introduction of a new classification system and the creation of the Rotavirus Classification Working Group (RCWG) marked a significant milestone. This system relied on the nucleotide sequence identity cutoff percentages of each of the 11 RVA genome segments [28]. Genetic identity refers to the degree of similarity between two genetic sequences.
To denote the different encoding genes, the proposed notation was employed (Table 1), providing a valuable tool for studying complete rotavirus genomes, also known as genome constellations, that infect various animal species and humans. This approach greatly enhanced our understanding of host restriction, interspecies transmission events, the emergence of reassortant strains post-vaccination, and the overall evolution of RVA.
The identity cutoff percentages were initially determined based on all available strains in 2008. However, since then, a multitude of new strains has been sequenced, leading to the discovery of new genotypes (Table 1). These genotypes have a reference, usually the first reported. The reference strains were listed by Matthijnssens et al. in 2008 [28] and then the RCWG continued reporting new reference strains in their webpage [29]. The updated list of the reference strains is presented in Table 2.
In this study, our objective was to explore the rigor of the classification system for all 11 RVA genome segments by conducting phylogenetic analyses and constructing pairwise sequence identity profiles following the RCWG recommendations. More specifically, we aimed to test the current classification system using the cutoff values established in 2008.

2. Materials and Methods

2.1. Matrix Construction and Alignment

In February 2020, we obtained nucleotide sequences of RVA genes from the National Center for Biotechnology Information (www.ncbi.nlm.nih.gov/genbank/ accessed on 28 February 2020). The search was conducted using the words “rotavirus A” and “VP7 gene”, for example, to download all the VP7 strains sequences available, and we then used the search terms “rotavirus A” and “VP4 gene”, to download all the VP4 strain sequences available, and so on. To ensure the integrity of our dataset, we made a “depuration process”, which means we meticulously manually filtered out partial sequences lacking a complete open reading frame (ORF), as well as those containing repeated sequences and uncertainties such as ambiguous nucleotides. In instances where only limited data (only a few or even one sequence) were available for specific genotypes, sequences lacking a complete ORF were still included to maintain a comprehensive representation.
To enhance the accuracy and reliability of our analyses, matrix editions (alignments of all sequences, selection of the ORF, etc.) were carried out using AliView v1.26 [22] and Bioedit v7.2.5 [23]. Sequences were meticulously labeled to include genotype, accession number, host, country of origin, and year of collection, providing a detailed context for each sequence.
Subsequently, multiple sequence alignment (MSA) was constructed for each gene using the online server MAFT (https://mafft.cbrc.jp/alignment/server/index.html accessed on 4 March 2020) with default settings. The resulting matrices underwent further refinement and manual editing using AliView and BioEdit to ensure precise alignment. This comprehensive approach to matrix construction and alignment serves as a robust foundation for our subsequent analyses and contributes to the overall reliability of our findings.
This matrix compares the genetic identities between all possible pairs of sequences of a given genotype. Thus, for each genotype, we obtain the distances between all sequences that compound it. In an ideal case, each identity corresponding to a pair of sequences belonging to the same genotype should be greater than the cutoff value, as described in Table 1 for each gene. For the p-distance analysis, the criterion we used to consider a single strain to be “out of the cutoff” is that a genetic distance greater than the cutoff value was obtained with any other strain inside that genotype. However, for divergent strains, the decision to classify it as a new genotype should be analyzed by the expert committee. On the other hand, for the K2P analysis we consider the cutoff value only to determine if they are closely related (if the genetic distance is smaller than the cutoff value) or if they are diverging (if the genetic distance is greater than the cutoff value). In other words, the cutoff value in the K2P analysis only involves evolutionary interpretation and is not used for classification purposes.

2.2. Genetic Variation

Pairwise genetic distances were analyzed with a p-distance model using MegaX software (version 10, 64 bits for Windows) [30] with default settings. To assess genetic similarity, we calculated similarities both within genotypes (intra-genotype) and between genotypes (inter-genotype). These results were used to construct frequency histograms of identities, where the x-axis denotes the identities between each pair of sequences, while the y-axis represents the number of measured sequence pairs. Additionally, we calculated the best-fitting evolutionary model with the data to construct the phylogenetic trees and determine the pairwise genetic distances, including base and substitutions frequencies (Table A1).

2.2.1. Data Quality: Evolution Model, Nucleotide Frequency, Nucleotide Substitution, and Phylogenetic Information Estimation

Before conducting phylogenetic analyses, a thorough assessment of the dataset information was carried out. The quality analysis revealed key parameters, including the base frequency and nucleotide substitution rate, as outlined in Table A1. The results of this analysis guided the selection of the evolutionary model, with the General Time Reversible (GTR) model featuring the empirical base frequencies (+F) and the reversibility (R) identified as the most suitable [25], as detailed in Table A1.
Furthermore, the presence of a phylogenetic signal was systematically assessed using the likelihood mapping method [28], implemented in IQ-Tree [29]. This methodological step ensured the reliability of the dataset for subsequent phylogenetic analyses by confirming the presence of informative signals in the genetic data.

2.2.2. Phylogenetic Analysis

The construction of phylogenetic trees was executed through the maximum likelihood method utilizing IQ-Tree. To enhance the accuracy of our analyses, we selected the models that best fit each of the eleven datasets, as detailed in Table A1. This approach ensured that the chosen models were tailored to the specific characteristics of each dataset, contributing to a more precise representation of evolutionary relationships.
In assessing the robustness of the phylogenetic tree branches, we employed 10,000 ultrafast bootstrap replicates [30]. This statistical support method provides a reliable estimation of branch confidence, offering a thorough and statistically rigorous evaluation of the inferred phylogenetic relationships.
By incorporating these methodologies, our phylogenetic tree construction process not only utilized advanced computational techniques but also ensured that the selected models and statistical support measures were optimized for the unique features of each dataset. This comprehensive approach enhances the reliability and significance of the phylogenetic inferences drawn from the analyses conducted with IQ-Tree.
Finally, the constructed phylogenetic trees were visually presented using Figtree software version v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/), offering an intuitive and comprehensive visualization of the evolutionary relationships inferred from the dataset. This meticulous approach to dataset assessment and the subsequent phylogenetic analysis contributes to the robustness and reliability of the evolutionary insights derived from the study.

3. Results

3.1. Classification System

The full classification analyses were carried out based on a total of 26,537 nucleotide sequences which fulfilled the inclusion criteria, including at least 3172 strains. These sequences were obtained from the GenBank database. We included the nucleotide sequences of VP7 (3172), VP4 (2771), VP6 (2463), VP1 (1495), VP2 (2754), VP3 (1507), NSP1 (2208), NSP2 (2358), NSP3 (2294), NSP4 (3183), and NSP5 (2332) (Figure 1). Table 1 shows the increment of strains between 2008 and the moment this work was performed (February 2020). The column “Current Genotype” indicates the number of genotypes at the moment this paper is written (May 2024); it shows that its number increases quickly. The sum of all genotypes included in this study is 299.
The relative abundance of genotypes for each gene were also calculated (Figure A1). This figure depicts the dominance of genotypes 1 and 2 over the rest of the genotypes in all genes but VP7 and VP4, where a major diversity can be observed.
The genotyping system was tested with the following two models: the previously established model which uses p-distance and is used by the RCWG to classify the new strains, and the model suggested by the software that better fit our dataset (mainly K2P). The histograms of the pair identities were constructed with p-distance and the histograms of the pair identities and phylogenetic trees with the K2P model. We will analyze the results separately.

3.2. p-Distance Analysis

The p-distance histograms reveal a clear differentiation between intra- and inter-genotype distances (Figure 2 and Figure 3). It can be observed that the inter- and intra-genotype distances are clearly differentiated and there is very little overlap. This is evidenced when we observed that the number of strains which falls “outside of the cutoff value” (Table 3) was only 169 for the p-distance model. This represents 0.63% (169/26,537) of the total analyzed pairwise identities. Of these, 44.9% (76/169) represents NSP4 outliers. On the other hand, VP7 has only one strain out of the cutoff value (D86277) corresponding to the G3 genotype. Meanwhile, P[8] has 10 strains out of the cutoff value (representing 0.64% of the strains included in these genotypes). The P[15] genotype has only one strain out of the cutoff value and VP1, VP3, and NSP1 have no strains outside of the cutoff. On the other hand, for NSP4, the E3 genotype has 19.5% (15/77) strains out of the cutoff value, and for NSP5 H10 (5/5), H14 (2/2), and H15 (2/2), 100% of the strains are out of the cutoff value, although only a few strains were analyzed for these genotypes.

3.3. K2P Analysis: Outer Capsid Proteins (VP7, VP4, and VP6)

Among the genotypes described for VP7, 89% (32/36) of the genotypes showed that their strains are closely related according to our analysis. Moreover, 11% (4/36 genotypes) exhibited strains with identities ranging from 0.80 to 0.62. These genotypes were G3 with 17% (66/384) of strains, G6 with 73% (64/87), G8 with 42% (63/148), and G10 with 22% (20/88). All genotypes, except for G3, maintain a monophyletic origin within the phylogenetic tree (Figure A2).
Among the genotypes described for VP4, 92% (47/51 genotypes) showed a consistent distance according to our analysis (Figure 2). However, 8% (4/51 genotypes) exhibited strains with identities ranging from 0.79 to 0.73. These genotypes were P[1] with 26% (9/34) of the strains, P[3] with 83% (20/24), P[13] with 35% (5/14), and P[14] with 12% (6/49). Evolutionary relationships between groups are shown with a middle-point root of VP4, where all groups are clearly defined (Figure A3). P[4], P[6], and P[8], all described in humans, are grouped together on the tree and the rest of the genotypes represent a heterogeneous group with a broad range of hosts. Concerning P[13], Figure A3 shows that it is a polyphyletic group. The strains mainly correspond to oriental porcine strains, but one human strain from Belgium and one porcine strain from Spain were also reported.
Among the genotypes described for VP6, 88% (23/26) showed a consistent distance according to our analysis (Figure 2). Only 12% (3 out of 26 genotypes) exhibited strains with identities ranging from 0.851 to 0.78. These genotypes were I2 with 9% (72/773) of the strains, I3 with 28% (12/43), and I12 with 67% (11/17), as shown in Figure A1. The VP6 phylogenetic tree with a middle-point root in Figure A4 shows all genotypes have monophyletic origins.

3.4. K2P Analysis: Inner Capsid Proteins (VP1, VP2, and VP3)

Among the genotypes described for VP1, 87% (19/22 genotypes) showed a consistent distance with the classification method (Figure 2). The other 13% (3/22 genotypes) exhibited strains with low identities ranging from 0.831 to 0.79. These genotypes were R1 with 7% (65/830) of the strains out the cutoff value, R2 with 31% (181/571), and R3 with 35% (16/45). The VP1 phylogenetic tree is shown in Figure A5. Two major groups are observed as follows: R4, R6, R14, R17, R20, and R21 are together in one branch and the rest of the genotypes are in another branch.
Among the genotypes described for VP2, 90% (18/20) showed a consistent distance according to our analysis (Figure 2). The other 10% (2/20 genotypes) exhibited strains with low identities in a range from 0.841 to 0.69 (Table A1). These genotypes were C2 with 22% (195/868) of the strains and C4 with 33% (4/12). We also observed that strain RVA/Cow-wt/SVN/SI-B17/2004/G6P[11] (access number JX402792) showed distances higher than the cutoff value for most of C2 strains. The VP2 phylogenetic tree showed two branches, one containing C1, C2, C3, C5, C9, C11, and C12 and the other branch containing the rest of the genotypes (Figure A6).
Among the genotypes described for VP3, 90% (18/20 genotypes) showed a consistent distance according to our analysis (Figure 2). The other 10% (2/20 genotypes) exhibited strains with low identities in a range from 0.811 to 0.76 (Table A1). These genotypes were M2 with 25% (137/557) of the strains and M3 with 39% (28/71). The phylogenetic tree shows M16 as a separate distant group from the rest of the genotypes in Figure A7.

3.5. Other Model Analysis: Non-Structural Proteins

Among the genotypes described for NSP1, 71% (25/31 genotypes) showed a consistent distance according to our analysis (Figure 3). The other 29% (6/31 genotypes) exhibited strains with low identities in a range from 0.791 to 0.57. These genotypes were A1 with 28% (395/1405) of the strains, A3 with 51% (43/84), A8 with 52% (38/73), A9 with 48% (16/33), A11 with 20% (5/25), and A19 with 100% (6/6). The phylogenetic tree shows six branches. The A27 genotype includes only one sequence, which is far away from the other genotypes (Figure A8).
Among the genotypes described for NSP2, 77% (17/22 genotypes) showed a consistent distance according to our analysis (Figure 3). Only 23% (5/22 genotypes) exhibited strains with low identities ranging from 0.84 to 0.65. These genotypes were N1 with 36% (587/1600) of the strains, N2 with 95% (616/643), N3 with 62% (23/37), N4 with 14% (2/14), and N10 with 75% (6/8). The phylogenetic tree shows that most N-genotypes are monophyletic branches (Figure A9), while N2, N3, and N10 are polyphyletic groups. N2 shows the following two lineages: N2-A and N2-B. The N9 genotype is clustering as a branch within the N2-A lineage. N3 has the following two lineages: N3-A and N3-B, with N5 genotype as a related branch to these two lineages. N10 has the following two lineages: N10-A and N10-B, with N6 as a related branch these two lineages (Figure A9).
Among the genotypes described for NSP3, 73% (16/22 genotypes) showed a consistent cutoff value according to our analysis (Figure 3). The 27% (6/22 genotypes) exhibited strains with low identities in a range from 0.851 to 0.66. These genotypes were T1 with 8% (127/1496) of the strains, T3 with 77% (54/70), T4 with 60% (6/10), T6 with 80% (65/81), T7 with 4% (2/48, strains RVA/Pig-wt/ESP/F456/2017/G5P[13] (MH238143) and RVA/dog-tc/CHN/SCCD-A/2017/G9P[23] (MH910071)), and T14 with 60% (3/5). In the phylogenetic tree, T4, T8, T16, and T21 form a branch. On the other hand, T18 were in a branch separate from all other genotypes (Figure A10).
Among the genotypes described for NSP4, 74% (20/27 genotypes) showed a consistent distance according to our analysis (Figure 3). The other 26% (7/27 genotypes) exhibited strains with low identities in a range from 0.851 to 0.63. These genotypes were E1 with 11% (216/1952) of the strains, E2 with 87% (772/881), E3 with 48% (37/76), E4 with 33% (2/6, strains RVA/dove-wt/JPN/PO-13/2001/G18P[17] (AB009627) and RVA/VelvetScoter-tc/JPN/RK1/1989/G18P[17] (LC088105)), E7 with 4% (1/24, strain RVA/mice/USA/EC/1997/GxP[x] (U96337)), E9 with 9% (1/11, strain RVA/Pig-wt/ESP/F438/2017/G5P[19] (MH238162)), and E11 with 1% (1/103, strain RVA/Chicken-tc/xxx/BRS-115/xxx/G7P[35] (KJ725026)). Some other points to mentions are that for NSP4 genotypes which are in the E18 and E20 genotypes, there were only two strains in each genotype, and the distance between them is out of the cutoff value (strains RVA/Rat-wt/ITA/Rat14/2015/G3P[3] (KX398368) and RVA/Rat-wt/GER/KS-11-573/2011/G3P[3] (KJ879457) for E18 and RVA/Bat-wt/CRC/KCR10-93/2010/G20P[47] (MN551607) and RVA/Human-wt/SUR/2014735512/2013/G20P[28] (KX257412) for E20). We also observed that RVA/cat-wt/JPN/FRV70/2001/G3P[3] (AB048196) and RVA/cat-wt/JPN/FRV303/2001/G3P[3] (AB048199), which belong to the E3 genotype, showed low identities across all E3 strains. The phylogenetic tree shows polyphilia in the following strains: between E1 and E9; between E12 and E15; between E4 and E19, E21, and E26; between E3 and E13, E16, E17, and E24 (Figure A11).
Among the genotypes described for NSP5, 77% (17/22 genotypes) showed a consistent distance according to our analysis (Figure 3). The other 23% (5/22 genotypes) exhibited strains with low identities in a range from 0.911 to 0.45. These genotypes were H1 with 12% (194/1580) of the strains, H2 with 1% (7/514), H3 with 28% (41/147), H4 with 45% (5/11), H6 with 45% (5/11), and H10 with 100% (5/5). We observed that the H14 and H15 genotypes had only two strains each and showed low identities. We also found that strain KP258408, which belongs to the H3 genotype, showed distances above the cutoff value with all H3 strains. In the phylogenetic tree, we observed one branch which contains H4, H8, H14, H16 and H21, with H18 in a branch separate from the others, and the rest of genotypes in another branch (Figure A12).

3.6. Comparison Between the p-Distance and K2P

We compared the performance of the p-distance and K2P models. The results are summarized in Table 4. We used the cutoff value established by Matthijnssens et al. in 2008 to determine whether the K2P results were consistent.

4. Discussion

In this study, we tested the classification system for all 11 RVA genome segments by constructing pairwise sequence identity profiles using the p-distance and K2P models and then conducting phylogenetic analyses using the information available up to 2021. Our objective was to determine whether the current cutoff values remain up to date. With this purpose we selected a dataset of 26,537 strain nucleotide sequences belonging to 299 genotypes obtained from the GenBank database. We found that most genotypes (99.4%) for each gene segment exhibited high consistency in their cutoff value based on the analysis and only a small percentage of genotypes showed strains with identities out of the cutoff value (0.63% of strains).

4.1. p-Distance Model vs. K2P

The established genotyping system demonstrated high efficacy in classifying rotavirus strains. A total of 93.4% of the genotypes (20/299) exhibited cutoff values consistent with those analyzed using the p-distance model, suggesting that the current classification framework remains applicable for most RVA genes. Additionally, 6.6% of the strains had 99% of their values aligned with the established cutoffs. Table 4 summarizes the performance of the p-distance model for classification purposes. This model effectively distinguishes nearly all RVA strains in their respective genotypes, even with the limited sequence information this model uses. All the evidence we found indicates that the p-distance model is more suitable for classification compared to other models (K2P), as the classification performance based on the p-distance model was superior to that of K2P, as shown in Figure 2 and Figure 3.

4.2. K2P Model

The strains analyzed with K2P and the other models indicate the presence of genetic variations, which could have originated from point mutations or other mechanisms [31,32]. The phylogenetic analysis provided additional insights into the evolutionary relationships of the genotypes [6]. In some cases, the phylogenetic trees confirmed the monophyletic nature of the genotypes [31,33,34], while in others, the trees revealed polyphyletic or heterogeneous clustering, suggesting the existence of distinct lineages within those genotypes and genotypes that fall inside other genotypes. This is an interesting effect of the evolutionary process because strains represent living organisms. These organisms are evolving, even as we observe them, and the phylogenetic tree is like a snapshot of their current state. In this snapshot, we can see the direction in which the strains are evolving, where they come from, and where they are heading in evolutionary terms. This explains the presence of lineages within strains, and it is natural for it to be this way. However, we agree that classification is a convention that is not necessarily consistent with evolutionary relationships. It is supported by using the simplest genetic distance model (p-distance), which does not include evolutive information for genotype classification but is the model that best discriminates genotypes based on their nucleotide sequence. The best example for this is G3; the phylogenetic analysis of G3 reveals a complex evolutionary structure, with this strain divided into five distinct lineages as follows: A, B, C, D, and E. Adding to this complexity, G13, G14, G16, and G29 are positioned intermediately among these lineages. This intermediate placement suggests overlapping evolutionary relationships, indicating the possible emergence of new strains, probably due to G3 diverging to produce the other strains. These highlight the dynamic nature of strain evolution, where distinct lineages can still share close genetic ties with multiple related strains. The evolutionary relationships between strains P[13] and P[22] are another example, as P[13] is located in a well-supported branch with a bootstrap value of 100. This branch is divided into the following three lineages: A, B, and C. P[22] is positioned between lineages A and B of P[13], indicating that P[22] shares more similarity with this strain than with the rest of the VP4 strains. The placement of P[22] highlights the dynamic nature of these processes and the importance of using phylogenetic trees as snapshots to interpret evolutionary pathways. On the other hand, we have the NSP2 strains. N2 is divided into two lineages, A and B, with N9 positioned between them, suggesting a closer evolutionary relationship with both lineages. Similarly, N3 is also divided into two lineages, A and B, and N5 is located in the middle of these two groups. Finally, N10 is split into two distinct lineages, A and B, with N6 occupying an intermediate position between them. These findings highlight consistent patterns of lineage division and the intermediate placement of related strains, offering further insights into the evolutionary dynamics within these groups. These examples show us the need to study how different genotypes evolve, but these questions are beyond the scope of this work.
Other genotypes phylogenetically closely related are E1 and E9, E12 and E15; E3 and E13, E8, E16, E17 and E24; E4 with E19, E21 and E26. And not only phylogenetically, as in the case of G3 with G14, which are more closely related to each other serologically than different genotypes usually are [35]. Antigenic analysis, as a virus neutralization test, could be carried out in order to response to these questions, at least for VP7 and VP4.
The viral proteins (VPs) and the non-structural proteins (NPSs) will be discussed in order, from simplest to most complex to analyze.

4.2.1. Proteins Forming the Triple Layer Particles

Only three VP6 genotypes demonstrated low identities as follows: I2, I3, and I12. Many studies use phylogeny to identify the genotype of VP6 strains in combination with their pair identities, and high identities were observed [36,37,38,39,40,41,42,43,44].
VP1, VP2, and VP3 show close relationships with the strains belonging to each genotype. The phylogenetic tree shows that no polyphyletic groups are present.
VP7 presents only four genotypes (G3, G6, G8, and G10) with low identities. We identify G3 as a polyphyletic group, in accordance with previous studies [42,44,45]. A molecular analysis to examine the genetic variation within the VP7 of 27 G3 human and animal rotavirus strains was performed more than three decades ago by Nishikawa et al., 1989 and showed an overall sequence identity of 85% or higher [33]. A higher degree of overall VP7 sequence similarity was observed among strains from the same animal species when compared to strains from different animal species, suggesting that these were VP7 species-specific sequences. Moreover, VP7 had serotype-specific regions, where genetic variations identified among strains of different serotypes were highly conserved among G3 from the same species [33]. Furthermore, Nishikawa et al. found that the varying reactivities of anti-VP7 monoclonal antibodies with the 27 strains studied were consistent with the occurrence of antigenic variation among serotype 3 strains. The corresponding phylogenetic tree suggested that G3 rotavirus strains from different animal species were more closely related to each other than to rotavirus strains of different G genotypes [33]. This finding poses the query regarding the classification of these strains. Phylogenetic analysis is a useful supporting tool for the classification of all genotypes except G3, which is the most variable genotype.
Notably, prior research identified three lineages within G5 [46] using 28 strains, while our study, encompassing 48 strains, reaffirms the monophyletic nature of G5. Various studies have explored VP7 genotypes and strains (G6, G5, G3, G8, G9, G12), primarily for classification purposes [36,38,40,43,45,47,48,49]. Interestingly, a previous study tentatively designated some G3 strains exceeding the cutoff value as G16 [6]. However, in our findings, G16 strains were appropriately grouped within the correct distance to G3 strains, suggesting that the discrepancies in previous studies may be due to the limited number of strains analyzed. Additionally, our results revealed that G6 exhibited substantial intra-genotype diversity in prior studies, with five distinct lineages identified. Furthermore, we observed that G15 was classified as a lineage of G6 rather than a separate G-type [39].
Concerning VP4, only four genotypes (P[1], P[3], P[13], and P[14]) possess strains with low identities. These strains share a monophyletic origin within the phylogenetic tree, with the exception of P[13]. Other studies that focused on P[3] [40,44] and P[13] [39,42,45] did not find any subgroups or polyphyletic origin for these genotypes. The exclusion of identities between closely related P[4] and P[8] genotypes—consistent with the methodology employed by Matthijnssens in 2008 [4], who find identities between P[8] and P[4] ranged from 84% to 89%, completely above the 80% cutoff value—reinforces the idea that P[8] and P[4] are not only closely related P genotypes but are also subtypes corresponding to the distinct serotypes P1A and P1B, respectively, which were initially thought to be part of a single P serotype (P1) [50,51,52]. This exclusion significantly influences the accurate classification of new segments. Prior research has extensively investigated P[4] and P[8] genotypes, along with their respective alleles, contributing valuable insights to the understanding of these genomic elements [36,37,40,43,49].

4.2.2. Nonstructural Proteins

NSP5 exhibits the highest cutoff among all RVA genes, which shows that it is the most conserved gene. We found that 5 out of 22 genotypes possess strains with low identities. Specifically, genotypes H1 and H2 exhibited 12% and 1% of their strains, respectively, with high genetic distances. H10 needs further analysis because the identities that exhibit its sequences are lower than 0.78. Previous studies did not find a polyphyletic origin or groups within this genotype [37,38,39,40,42,43,44].
Concerning NSP3, six genotypes (27%) showed low identities. These included T3 with 77% (54/70), T4 with 60% (6/10), T6 with 80% (65/81). Further evolutionary analysis is needed to determine why many strains of this gene showed low identities. Previous studies did not find a polyphyletic origin or groups within this genotype, possibly because the low number of strains used [37,38,39,40,42,43,44].
In the case of NSP1, we found a large peak of inter-genotypes pair identities (between 0.62 and 0.80 in Figure 3). NSP1’s cutoff stands at 0.79, the lowest among all genes, being the most variable of the 11 genes. This gene also needs deeper evolutionary analysis. Previous studies were conducted on NSP1 to compare the local strains with other strains, but no polyphyletic groups were observed [37,38,39,40,42,43,44,49].
Previous studies used the classification system for NSP2 but did not find a polyphyletic origin or groups within this genotype [37,38,39,40,42,43,44]. This could be due to these works having classification purposes, with just a few strains being used.
Concerning NSP4, three genotypes, E1, E2, and E3, comprise numerous strains with low identities. This gene presents high variability and the phylogenetic trees show the following three polyphyletic groups: E3, E10, and E12. This shows that these strains are closely related and the genotypes are changing over time. Previous studies conducted on NSP4 have found that it is a monophyletic group [36,37,38,39,40,41,42,43,44].
Overall, as new strains are sequenced and more genetic data become available, it is important to regularly check and update the classification system used for RVA classification. It is crucial to reassess the cutoff values and refine the classification system to accurately reflect the genetic diversity and evolutionary relationships of the rotavirus strains when needed [6].
Additional investigations are imperative to explore strains with high genetic distances and assess whether these strains are diverging into novel subtypes (lineages). A more in-depth analysis of the evolutionary path is needed for specific strains, such as G3 with G13, G14, G16 and G29; P[13] with P[22]; N2 with N9; N3 with N5; N10 with N6, E1 and E9; E12 and E1; E3 with E13, E8, E16, E17 and E24; E4 with E19, E21 and E26. The rationale behind this inquiry is rooted in the observation that all these genotypes harbor strains with low identities, and the phylogenetic analysis indicates their inclusion within other genotypes. This nuanced exploration will contribute to a deeper understanding of the genetic diversity and relationships between these strains, potentially leading to the identification of novel genotypic distinctions or subtypes.
Additionally, ongoing surveillance and sequencing efforts will continue to contribute to our understanding of rotavirus evolution and the impact of vaccination on strain diversity [6,15,53,54,55]. This study employed a large dataset of rotavirus nucleotide sequences to investigate the genotype diversity and evolutionary relationships of various gene segments. The analysis revealed variations in genotype diversity among different genes, with VP7 and VP4 showing the highest diversity, as expected for the neutralizing antigens of the virus. There is remarkable consistency with the established cutoff values that have served as an excellent tool for RVA classification over the last fifteen years.

5. Conclusions

In conclusion, the application of the rotavirus classification system to numerous rotavirus strains (more than 3170 strains and 26,500 sequences) allowed the accurate differentiation of all genotypes within each gene, showing the robustness of the system. Furthermore, the system facilitates the detection of multiple reassortment events and transmissions between species, allowing them to trace the origins of this variability and to identify the parental strains of emerging variants. Our study provides valuable information that promotes the utilization of the rotavirus genotype classification system by testing its robustness. Finally, our phylogenetic analyses show the complexity of the evolutionary relationships within this viral species and urge us to continue studying the evolution of rotaviruses to better understand its dynamic and evolutionary process.

Author Contributions

Conceptualization, S.O.M. and V.P.; Methodology, E.G.Q., K.S. and S.O.M.; Validation, R.G.D.A.; Formal Analysis, S.O.M. and R.G.D.A.; Investigation, E.G.Q. and R.G.D.A.; Data Curation, E.G.Q. and R.G.D.A.; Writing—Original Draft Preparation, R.G.D.A. and K.S; Writing—Review and Editing, S.O.M., D.J.L., R.G.D.A. and V.P.; Visualization, S.O.M., D.J.L. and R.G.D.A.; Supervision, V.P., S.O.M. and D.J.L.; Project Administration, S.O.M. and V.P.; Funding Acquisition, D.J.L. and S.O.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by a CONICET grant, Instituto Nacional de Tecnología Agrope-cuaria (INTA) projects PE-I044 and RIST-111; the VIRUSES journal granted the APC waiver.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to extend their thanks to Jonathan Aguilera and Analía Sanchez for their collaboration in the data analysis and to Jelle Matthijnssens, Ulrich Desselberger, and Max Ciarlet for reviewing the draft.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Base frequency and substitution rate of all sequences included in this study for each protein and the average values. The substitution rate for each pair of nucleotides was calculated. The evolution model used to construct the trees and the distance matrix. The gamma parameter used to correct the variation in the rate of substitution were estimated with bootstrap resampling with 1000 replicates to assess statistical robustness.
Table A1. Base frequency and substitution rate of all sequences included in this study for each protein and the average values. The substitution rate for each pair of nucleotides was calculated. The evolution model used to construct the trees and the distance matrix. The gamma parameter used to correct the variation in the rate of substitution were estimated with bootstrap resampling with 1000 replicates to assess statistical robustness.
Base Frequency and Substitution Rate of All Genes and Evolution Models Suggested by MegaX
GeneBase Frequency %Nucleotide Substitution FrequenciesEvolution Model *Gamma Parameter
GATCG-TG-AG-CA-TA-CT-CTreeDistance Matrix
VP73136181519.21.00.81.611.1GTR + F + R8Kimura 2-parameter4
VP43037181519.51.20.71.912.2GTR + F + R7Kimura 2-parameter4
VP630341917112.60.70.92.016.3GTR + F + R5Kimura 2-parameter4
VP130381715114.00.90.62.020.7GTR + F + R10Kimura 2-parameter4
VP229381815114.30.90.82.222.3GTR + F + R8Kimura 2-parameter4
VP332381614114.31.00.52.020.7GTR + F + R10Kimura 2-parameter4
NSP137173313110.01.10.52.114.4GTR + F + R10Tamura 3-parameter0.88
NSP237183114112.00.70.71.916.5GTR + F + R10Tamura 3-parameter0.48
NSP338193112110.91.00.62.017.4GTR + F + R10Tamura 3-parameter0.56
NSP43919271518.00.50.61.010.6GTR + F + R10Tamura-Nei0.72
NSP53519291718.80.80.91.610.4GTR + F + R10Tamura 3-parameter0.51
Average33.528.523.414.71.011.20.90.71.915.7
* The evolutionary model used to construct the phylogenetic tree or the distance matrix.
Figure A1. Genotype composition of each gene. This shows that genotypes 1 and 2 are often the most numerous, except for VP7 and VP4.
Figure A1. Genotype composition of each gene. This shows that genotypes 1 and 2 are often the most numerous, except for VP7 and VP4.
Viruses 17 00211 g0a1
Figure A2. VP7 middle-point rooted phylogenetic tree. G3 shows two principal branches, G3-A with close phylogenetic relationships with G16 and G29 and G3-B related to G13 and G14.
Figure A2. VP7 middle-point rooted phylogenetic tree. G3 shows two principal branches, G3-A with close phylogenetic relationships with G16 and G29 and G3-B related to G13 and G14.
Viruses 17 00211 g0a2
Figure A3. VP4 middle-point rooted phylogenetic tree. P[13] shows close relationships with P[22] and P[41].
Figure A3. VP4 middle-point rooted phylogenetic tree. P[13] shows close relationships with P[22] and P[41].
Viruses 17 00211 g0a3
Figure A4. VP6 middle-point rooted phylogenetic tree.
Figure A4. VP6 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a4
Figure A5. VP1 middle-point rooted phylogenetic tree.
Figure A5. VP1 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a5
Figure A6. VP2 middle-point rooted phylogenetic tree.
Figure A6. VP2 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a6
Figure A7. VP3 middle-point rooted phylogenetic tree.
Figure A7. VP3 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a7
Figure A8. NSP1 middle-point rooted phylogenetic tree.
Figure A8. NSP1 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a8
Figure A9. NSP2 middle-point rooted phylogenetic tree. N2 shows a close relationship with N9. N3 shows a close relationship with N5. N10 shows a close relationship with N6.
Figure A9. NSP2 middle-point rooted phylogenetic tree. N2 shows a close relationship with N9. N3 shows a close relationship with N5. N10 shows a close relationship with N6.
Viruses 17 00211 g0a9
Figure A10. NSP3 middle-point rooted phylogenetic tree.
Figure A10. NSP3 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a10
Figure A11. NSP4 middle-point rooted phylogenetic tree. E1 shows a close relationship with E9. E3 shows a close relationship with E13, E16, E17, and E24. E4 shows a close relationship with E19, E21, and E23.
Figure A11. NSP4 middle-point rooted phylogenetic tree. E1 shows a close relationship with E9. E3 shows a close relationship with E13, E16, E17, and E24. E4 shows a close relationship with E19, E21, and E23.
Viruses 17 00211 g0a11
Figure A12. NSP5 middle-point rooted phylogenetic tree.
Figure A12. NSP5 middle-point rooted phylogenetic tree.
Viruses 17 00211 g0a12

References

  1. Desselberger, U. Rotaviruses. Virus Res. 2014, 190, 75–96. [Google Scholar] [CrossRef]
  2. Knipe, D.M.; Howley, P.M.; Griffin, D.E.; Lamb, A.R.; Martin, M.A.; Roizman, B.; Straus, S.E. Rotavirus. In Fields Virology; Knipe, D.M., Howley, P.M., Eds.; Lippincott Williams & Wilkins: Philadelphia, PA, USA, 2013; pp. 1347–1401. [Google Scholar]
  3. Guglielmi, K.; Matthijnssens, J.; Dormitzer, P.; Ciarlet, M.; Patton, J. Genus Rotavirus: Type Species A. In Virus Taxonomy: Classification and Nomenclature of Viruses; Fauquet, C., Mayo, M., Maniloff, J., Desselberger, U., Ball, A., Eds.; Elsevier Academic Press: Amsterdam, The Netherlands, 2011; pp. 484–499. [Google Scholar]
  4. Ciarlet, M.; Estes, M. Rotaviruses: Basic Biology, Epidemiology and Methodologies. In Encyclopedia of Environmental Microbiology; Bitton, G., Ed.; John Wiley & Sons: New York, NY, USA, 2002; pp. 2753–2773. [Google Scholar]
  5. Nakagomi, O.; Nakagomi, T.; Akatani, K.; Ikegami, N. Identification of Rotavirus Genogroups by RNA-RNA Hybridization. Mol. Cell. Probes 1989, 3, 251–261. [Google Scholar] [CrossRef]
  6. Matthijnssens, J.; Ciarlet, M.; Heiman, E.; Arijs, I.; Delbeke, T.; McDonald, S.M.; Palombo, E.A.; Iturriza-Gómara, M.; Maes, P.; Patton, J.T.; et al. Full Genome-Based Classification of Rotaviruses Reveals a Common Origin between Human Wa-Like and Porcine Rotavirus Strains and Human DS-1-Like and Bovine Rotavirus Strains. J. Virol. 2008, 82, 3204–3219. [Google Scholar] [CrossRef]
  7. Blackhall, J.; Fuentes, A.; Magnusson, G. Genetic Stability of a Porcine Rotavirus RNA Segment during Repeated Plaque Isolation. Virology 1996, 225, 181–190. [Google Scholar] [CrossRef]
  8. Miño, S.; Barrandeguy, M.; Parreño, V.; Parra, G.I. Genetic Linkage of Capsid Protein-Encoding RNA Segments in Group A Equine Rotaviruses. J. Gen. Virol. 2016, 97, 912–921. [Google Scholar] [CrossRef]
  9. Díaz Alarcón, R.G.; Liotta, D.J.; Miño, S. Zoonotic RVA: State of the Art and Distribution in the Animal World. Viruses 2022, 14, 2554. [Google Scholar] [CrossRef]
  10. Martella, V.; Bányai, K.; Matthijnssens, J.; Buonavoglia, C.; Ciarlet, M. Zoonotic Aspects of Rotaviruses. Vet. Microbiol. 2010, 140, 246–255. [Google Scholar] [CrossRef]
  11. Cook, N.; Bridger, J.; Kendall, K.; Gomara, M.I.; El-Attar, L.; Gray, J. The Zoonotic Potential of Rotavirus. J. Infect. 2004, 48, 289–302. [Google Scholar] [CrossRef]
  12. Papp, H.; Matthijnssens, J.; Martella, V.; Ciarlet, M.; Bányai, K. Global Distribution of Group A Rotavirus Strains in Horses: A Systematic Review. Vaccine 2013, 31, 5627–5633. [Google Scholar] [CrossRef]
  13. Papp, H.; László, B.; Jakab, F.; Ganesh, B.; De Grazia, S.; Matthijnssens, J.; Ciarlet, M.; Martella, V.; Bányai, K. Review of Group A Rotavirus Strains Reported in Swine and Cattle. Vet. Microbiol. 2013, 165, 190–199. [Google Scholar] [CrossRef]
  14. Ghosh, S.; Kobayashi, N. Exotic Rotaviruses in Animals and Rotaviruses in Exotic Animals. Virusdisease 2014, 25, 158–172. [Google Scholar] [CrossRef]
  15. Dóró, R.; László, B.; Martella, V.; Leshem, E.; Gentsch, J.; Parashar, U.; Bányai, K. Review of Global Rotavirus Strain Prevalence Data from Six Years Post Vaccine Licensure Surveillance: Is There Evidence of Strain Selection from Vaccine Pressure? Infect. Genet. Evol. 2014, 28, 446–461. [Google Scholar] [CrossRef]
  16. Simón, D.; Cristina, J.; Musto, H. Nucleotide Composition and Codon Usage Across Viruses and Their Respective Hosts. Front. Microbiol. 2021, 12, 646300. [Google Scholar] [CrossRef] [PubMed]
  17. Shackelton, L.A.; Parrish, C.R.; Holmes, E.C. Evolutionary Basis of Codon Usage and Nucleotide Composition Bias in Vertebrate DNA Viruses. J. Mol. Evol. 2006, 62, 551–563. [Google Scholar] [CrossRef] [PubMed]
  18. Duffy, S.; Shackelton, L.A.; Holmes, E.C. Rates of Evolutionary Change in Viruses: Patterns and Determinants. Nat. Rev. Genet. 2008, 9, 267–276. [Google Scholar] [CrossRef]
  19. Sanjuán, R.; Nebot, M.R.; Chirico, N.; Mansky, L.M.; Belshaw, R. Viral Mutation Rates. J. Virol. 2010, 84, 9733–9748. [Google Scholar] [CrossRef]
  20. Pfeiffer, J.K.; Kirkegaard, K. Increased Fidelity Reduces Poliovirus Fitness and Virulence under Selective Pressure in Mice. PLoS Pathog. 2005, 1, e11. [Google Scholar] [CrossRef] [PubMed]
  21. Vignuzzi, M.; Stone, J.K.; Arnold, J.J.; Cameron, C.E.; Andino, R. Quasispecies Diversity Determines Pathogenesis through Cooperative Interactions in a Viral Population. Nature 2006, 439, 344–348. [Google Scholar] [CrossRef] [PubMed]
  22. Vignuzzi, M.; Wendt, E.; Andino, R. Engineering Attenuated Virus Vaccines by Controlling Replication Fidelity. Nat. Med. 2008, 14, 154–161. [Google Scholar] [CrossRef]
  23. Weeks, S.A.; Lee, C.A.; Zhao, Y.; Smidansky, E.D.; August, A.; Arnold, J.J.; Cameron, C.E. A Polymerase Mechanism-Based Strategy for Viral Attenuation and Vaccine Development. J. Biol. Chem. 2012, 287, 31618–31622. [Google Scholar] [CrossRef]
  24. Anderson, J.P.; Daifuku, R.; Loeb, L.A. Viral Error Catastrophe by Mutagenic Nucleosides. Annu. Rev. Microbiol. 2004, 58, 183–205. [Google Scholar] [CrossRef] [PubMed]
  25. Holmes, E.C. Evolutionary History and Phylogeography of Human Viruses. Annu. Rev. Microbiol. 2008, 62, 307–328. [Google Scholar] [CrossRef] [PubMed]
  26. Woolhouse, M.; Gaunt, E. Ecological Origins of Novel Human Pathogens. Crit. Rev. Microbiol. 2007, 33, 231–242. [Google Scholar] [CrossRef] [PubMed]
  27. Combe, M.; Sanjuán, R. Variation in RNA Virus Mutation Rates across Host Cells. PLoS Pathog. 2014, 10, e1003855. [Google Scholar] [CrossRef]
  28. Matthijnssens, J.; Ciarlet, M.; Rahman, M.; Attoui, H.; Estes, M.K.; Gentsch, J.R.; Iturriza-gómara, M.; Kirkwood, C.; Mertens, P.P.C.; Nakagomi, O.; et al. Recommendations for the Classification of Group A Rotaviruses Using All 11 Genomic RNA Segments. Arch. Virol. 2008, 153, 1621–1629. [Google Scholar] [CrossRef]
  29. Rotavirus Classification Working Group: RCWG Home Page. Newly Assigned Genotypes, List of Accepted Genotypes. Available online: https://rega.kuleuven.be/cev/viralmetagenomics/virus-classification/rcwg (accessed on 5 February 2024).
  30. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  31. Lemey, P.; Salemi, M.; Vandamme, A.-M. The Phylogenetic Handbook, 2nd ed.; Cambridge University Press: Cambridge, UK, 2009; Volume 6, ISBN 9786021018187. [Google Scholar]
  32. Bonkoungou, I.J.O.; Damanka, S.; Sanou, I.; Tiendrebeogo, F.; Coulibaly, S.O.; Bon, F.; Haukka, K.; Traore, A.S.; Barro, N.; Armah, G.E. Genotype Diversity of Group A Rotavirus Strains in Children With Acute Diarrhea in Urban Burkina Faso, 2008–2010. J. Med. Virol. 2011, 83, 1485–1490. [Google Scholar] [CrossRef]
  33. Nlshikawa, K.; Hoshino, Y.; Taniguchi, K.; Green, K.Y.; Greenberg, H.B.; Kapikian, A.; Chanock, R.M.; Gorziglia, M. Rotavirus VP7 Neutralization Epitopes of Serotype 3 Strains. Virology 1989, 171, 503–515. [Google Scholar] [CrossRef]
  34. Ciarlet, M.; Hoshino, Y.; Liprandi, F. Single Point Mutations May Affect the Serotype Reactivity of Serotype G11 Porcine Rotavirus Strains: A Widening Spectrum? J. Virol. 1997, 71, 8213–8220. [Google Scholar] [CrossRef]
  35. Ciarlet, M.; Reggeti, F.; Pina, C.I.; Liprandil, F. Equine Rotaviruses with G14 Serotype Specificity Circulate among Venezuelan Horses. J. Clin. Microbiol. 1994, 32, 2609–2612. [Google Scholar] [CrossRef]
  36. Morozova, O.V.; Sashina, T.A.; Epifanova, N.V.; Zverev, V.V.; Kashnikov, A.U.; Novikova, N.A. Phylogenetic Comparison of the VP7, VP4, VP6, and NSP4 Genes of Rotaviruses Isolated from Children in Nizhny Novgorod, Russia, 2015–2016, with Cogent Genes of the Rotarix and RotaTeq Vaccine Strains. Virus Genes 2018, 54, 225–235. [Google Scholar] [CrossRef] [PubMed]
  37. Tatte, V.S.; Maran, D.; Walimbe, A.M.; Gopalkrishna, V. Rotavirus G9P[4], G9P[6] and G1P[6] Strains Isolated from Children with Acute Gastroenteritis in Pune, Western India, 2013–2015: Evidence for Recombination in Genes Encoding VP3, VP4 and NSP1. J. Gen. Virol. 2019, 100, 1605–1630. [Google Scholar] [CrossRef] [PubMed]
  38. Sircar, S.; Malik, Y.S.; Kumar, P.; Ansari, M.I.; Bhat, S.; Shanmuganathan, S.; Kattoor, J.J.; Vinodhkumar, O.R.; Rishi, N.; Touil, N.; et al. Genomic Analysis of an Indian G8P[1] Caprine Rotavirus-A Strain Revealing Artiodactyl and DS-1-Like Human Multispecies Reassortment. Front. Vet. Sci. 2021, 7, 606661. [Google Scholar] [CrossRef] [PubMed]
  39. Ghosh, S.; Navarro, R.; Malik, Y.S.; Willingham, A.L.; Kobayashi, N. Whole Genomic Analysis of a Porcine G6P[13] Rotavirus Strain. Vet. Microbiol. 2015, 180, 286–298. [Google Scholar] [CrossRef]
  40. Ianiro, G.; Di Bartolo, I.; De Sabato, L.; Pampiglione, G.; Ruggeri, F.M.; Ostanello, F. Detection of Uncommon G3P[3] Rotavirus A (RVA) Strain in Rat Possessing a Human RVA-like VP6 and a Novel NSP2 Genotype. Infect. Genet. Evol. 2017, 53, 206–211. [Google Scholar] [CrossRef]
  41. Tamim, S.; Matthijnssens, J.; Heylen, E.; Zeller, M.; Van Ranst, M.; Salman, M.; Hasan, F. Evidence of Zoonotic Transmission of VP6 and NSP4 Genes into Human Species A Rotaviruses Isolated in Pakistan in 2010. Arch. Virol. 2019, 164, 1781–1791. [Google Scholar] [CrossRef]
  42. Yan, N.; Yue, H.; Wang, Y.; Zhang, B.; Tang, C. Genomic Analysis Reveals G3P[13] Porcine Rotavirus A Interspecific Transmission to Human from Pigs in a Swine Farm with Diarrhoea Outbreak. J. General. Virol. 2021, 102, jgv001532. [Google Scholar] [CrossRef]
  43. Peng, R.; Li, D.; Wang, J.; Xiong, G.; Wang, M.; Liu, D.; Wei, Y.; Pang, L.; Sun, X.; Li, H.; et al. Reassortment and Genomic Analysis of a G9P[8]-E2 Rotavirus Isolated in China. Virol. J. 2023, 20, 135. [Google Scholar] [CrossRef]
  44. Pathak, A.; Gulati, B.R.; Maan, S.; Mor, S.; Kumar, D.; Soman, R.; Punia, S.; Chaudhary, D.; Khurana, S.K. Complete Genome Sequencing Reveals Unusual Equine Rotavirus A of Bat Origin from India. J. Virol. 2022, 96, 20. [Google Scholar] [CrossRef]
  45. Jing, Z.; Zhang, X.; Shi, H.; Chen, J.; Shi, D.; Dong, H.; Feng, L. A G3P[13] Porcine Group A Rotavirus Emerging in China Is a Reassortant and a Natural Recombinant in the VP4 Gene. Transbound. Emerg. Dis. 2018, 65, e317–e328. [Google Scholar] [CrossRef]
  46. da Silva, M.F.M.; Tort, L.F.L.; Goméz, M.M.; Assis, R.M.S.; Volotão, E.d.M.; de Mendonça, M.C.L.; Bello, G.; Leite, J.P.G. VP7 Gene of Human Rotavirus A Genotype G5: Phylogenetic Analysis Reveals the Existence of Three Different Lineages Worldwide. J. Med. Virol. 2011, 83, 357–366. [Google Scholar] [CrossRef] [PubMed]
  47. Gupta, S.; Gauhar, M.; Bubber, P.; Ray, P. Phylogenetic Analysis of VP7 and VP4 Genes of the Most Predominant Human Group A Rotavirus G12 Identified in Children with Acute Gastroenteritis in Himachal Pradesh, India during 2013–2016. J. Med. Virol. 2021, 93, 6200–6209. [Google Scholar] [CrossRef] [PubMed]
  48. Degiuseppe, J.I.; Martelli, A.; Barrios Mathieur, C.; Stupka, J.A. Genetic Diversity of Rotavirus A in Argentina during 2019-2022: Detection of G6 Strains and Insights Regarding Its Dissemination. Arch. Virol. 2023, 168, 251. [Google Scholar] [CrossRef] [PubMed]
  49. Takatsuki, H.; Agbemabiese, C.A.; Nakagomi, T.; Pun, S.B.; Gauchan, P.; Muto, H.; Masumoto, H.; Atarashi, R.; Nakagomi, O.; Pandey, B.D. Whole Genome Characterisation of G11P[25] and G9P[19] Rotavirus A Strains from Adult Patients with Diarrhoea in Nepal. Infect. Genet. Evol. 2019, 69, 246–254. [Google Scholar] [CrossRef]
  50. Estes, M.K.Z. Rotaviruses and Their Replication. In Fields Virology; Fields, B.N., Knipe, D.M., Howley, P.M., Griffin, D.E., Lamb, R.A., Martin, M.A., Roizman, B., Straus, S.E., Eds.; Lippincott, Williams and Wilkins: Philadelphia, PA, USA, 2007; pp. 1917–1974. [Google Scholar]
  51. Hoshino, Y.; Jones, R.W.; Chanock, R.M.; Kapikian, A.Z. Generation and Characterization of Six Single VP4 Gene Substitution Reassortant Rotavirus Vaccine Candidates: Each Bears a Single Human Rotavirus VP4 Gene Encoding P Serotype 1A[8] or 1B[4] and the Remaining 10 Genes of Rhesus Monkey Rotavirus MMU18006 or Bovine Rotavirus UK. Vaccine 2002, 20, 3576–3584. [Google Scholar] [CrossRef]
  52. Gorziglia, M.; Larralde, G.; Kapikian, A.Z.; Chanock, R.M. Antigenic Relationships among Human Rotaviruses as Determined by Outer Capsid Protein VP4 (Rotavirus VP4 Expression/Rotavirus VP4 Serotypes/Rotavirus VP8/Rotavirus VP5). Proc. Nat. Acad. Sci. USA 1990, 87, 7155–7159. [Google Scholar] [CrossRef]
  53. Chen, J.; Grow, S.; Iturriza-Gómara, M.; Hausdorff, W.P.; Fix, A.; Kirkwood, C.D. The Challenges and Opportunities of Next-Generation Rotavirus Vaccines: Summary of an Expert Meeting with Vaccine Developers. Viruses 2022, 14, 2565. [Google Scholar] [CrossRef]
  54. Dhama, K.; Saminathan, M.; Karthik, K.; Tiwari, R.; Shabbir, M.Z.; Kumar, N.; Malik, Y.S.; Singh, R.K. Avian Rotavirus Enteritis—An Updated Review. Vet. Q. 2015, 35, 142–158. [Google Scholar] [CrossRef]
  55. Louge Uriarte, E.L.; Badaracco, A.; Spetter, M.J.; Miño, S.; Armendano, J.I.; Zeller, M.; Heylen, E.; Späth, E.; Leunda, M.R.; Moreira, A.R.; et al. Molecular Epidemiology of Rotavirus A in Calves: Evolutionary Analysis of a Bovine G8P[11] Strain and Spatio-Temporal Dynamics of G6 Lineages in the Americas. Viruses 2023, 15, 2115. [Google Scholar] [CrossRef]
Figure 1. Strain selection criteria and number of strains included in the study.
Figure 1. Strain selection criteria and number of strains included in the study.
Viruses 17 00211 g001
Figure 2. Histograms of pair identities for each Viral Protein (VP). In the ordinates are the identities between each pair of sequences, and the abscissa represents the number of pairs of sequences measured. p-distance histograms are on the left; intra-genotype identities are represented in red and inter-genotype identities in green. K2P histograms are on the right; intra-genotype identities are represented in gray and inter-genotype identities in light blue. Dotted lines represent the cutoff value proposed by Matthijnssens et al. in 2008 [28].
Figure 2. Histograms of pair identities for each Viral Protein (VP). In the ordinates are the identities between each pair of sequences, and the abscissa represents the number of pairs of sequences measured. p-distance histograms are on the left; intra-genotype identities are represented in red and inter-genotype identities in green. K2P histograms are on the right; intra-genotype identities are represented in gray and inter-genotype identities in light blue. Dotted lines represent the cutoff value proposed by Matthijnssens et al. in 2008 [28].
Viruses 17 00211 g002
Figure 3. Histograms of the pair identities for each Non-Structural Protein (NSP). In the ordinates are the identities, and in the abscise, the number of pairs of sequences measured are shown. p-distance histograms are on the left; intra-genotype identities are represented in red and inter-genotype identities in green. K2P histograms are on the right; intra-genotype identities are represented in gray and inter-genotype identities in light blue. Dotted lines represent the cutoff value proposed by Matthijnssens et al. in 2008 [28].
Figure 3. Histograms of the pair identities for each Non-Structural Protein (NSP). In the ordinates are the identities, and in the abscise, the number of pairs of sequences measured are shown. p-distance histograms are on the left; intra-genotype identities are represented in red and inter-genotype identities in green. K2P histograms are on the right; intra-genotype identities are represented in gray and inter-genotype identities in light blue. Dotted lines represent the cutoff value proposed by Matthijnssens et al. in 2008 [28].
Viruses 17 00211 g003
Table 1. Information about the encoded proteins of RVA. The classification notations used, the number of sequences analyzed in a previous work (2008), the present work (2024), and the current number of genotypes [29].
Table 1. Information about the encoded proteins of RVA. The classification notations used, the number of sequences analyzed in a previous work (2008), the present work (2024), and the current number of genotypes [29].
Comparison Between Previous and Present Work
GeneEncoded ProteinNotationCutoff Value2008 *This Work (2024)Current Genotype
StrainsGenotypesStrainsGenotypes
VP7GlycosylatedG8010001531723642
VP4Protease-sensitiveP801902727715158
VP6Inner capsidI851421024632632
VP1RNA-polymeraseR8358414952228
VP2Core proteinC8458527542024
VP3MethyltransferaseM8167615072023
NSP1Interferon AntagonistA791001422083139
NSP2NTPaseN8571523582228
NSP3Translation enhancerT8577722942228
NSP4EnterotoxinE85100631832732
NSP5pHosphoproteinH91113623322228
* Work of 2008, where the cutoff values were established [4].
Table 2. Reference strains of all genes. The highlighted genotypes represent the reference for that genotype. The first 41 strains were taken from Matthijnssens et al. in 2008 [28] and the others from the RCWG homepage, accessed in July 2024 [29].
Table 2. Reference strains of all genes. The highlighted genotypes represent the reference for that genotype. The first 41 strains were taken from Matthijnssens et al. in 2008 [28] and the others from the RCWG homepage, accessed in July 2024 [29].
Reference Strains for Each Genotypes
Strain NameVP7VP4VP6VP1VP2VP3NSP1NSP2NSP3NSP4NSP5
RVA/Human-tc/USA/WaCS/1974/G1P[8]G1P[8]I1R1C1M1A1N1T1E1H1
RVA/Human-tc/USA/DS-1/1976/G2P[4]G2P[4]I2R2C2M2A2N2T2E2H2
RVA/Human-tc/JPN/AU-1/1982/G3P[9]G3P[9]I3R3C3M3A3N3T3E3H3
RVA/Human-tc/GBR/ST3/1975/G4P2A[6]G4P[6]I1R1C1M1A1N1T1E1H1
RVA/Pig-tc/USA/Gottfried/1983/G4P[6]G4P[6]I1R1C1M1A8N1T1E1H1
RVA/Pig-tc/USA/OSU/1975/G5P[7]G5P[7]I5R1C1M1A1N1T1E1H1
RVA/Cow-tc/USA/NCDV-Lincoln/1971/G6P[1]G6P[1]I2R2C2M2-N2T6E2-
RVA/Human-wt/HUN/Hun5/1997/G6P[14]G6P[14]I2R2C2M2A11N2T6E2H3
RVA/Turkey-tc/IRL/Ty-3/1979/G7P[35]G7P[35]I4R4C4M4A16N4T4E11H14
RVA/Human-wt/COD/DRC86/2003/G8P[6]G8P[6]I2R2C2M2A2N2T2E2H2
RVA/Human-tc/USA/WI61/1983/G9P1A[8]G9P[8]I1R1C1M1A1N1T1E1H1
RVA/Human/IND/I321/1996/G10P[11]G10P[11]I2---A1N2T1E2-
RVA/Human-wt/BGD/Dhaka6/2001/G11P[25]G11P[25]I1R1C1M1A1N1T1E1H1
RVA/Human-tc/PHL/L26/1987/G12P[4]G12P[4]I2R2C2M2A2N1T2E2H1
RVA/Human-tc/THA/T152/1998/G12P[9]G12P[9]I3R3C3M3A12N3T3E3H6
RVA/Horse-tc/GBR/L338/1991/G13P[18]G13P[18]I6R9C9M6A6N9T12E14H11
RVA/Horse-tc/USA/FI23/1981/G14P[12]G14P[12]I2R2C2M3A10N2T3E2H7
RVA/Cow/IND/Hg18/XXXX/G15P[21]G15P[21]-------E2-
RVA/Mouse-tc/USA/EDIM/XXXX/G16P[16]G16P[16]I7R7C7M8A7N7T10E7H9
RVA/Turkey-tc/IRL/Ty-1/1979/G17P[38]G17P[38]I4R4C4M4A16N4T4E4H4
RVA/Pigeon-tc/PO-13/1983/G18P[17]G18P[17]I4R4C4M4A4N4T4E4H4
RVA/Chicken-tc/DEU/06V0661/2006/G19P[31]G19P[31]I11-------H8
RVA/Chicken-tc/IRL/Ch-1/1979/G19P[30]G19P[17]I10-------H10
RVA/Simian-tc/ZAF/SA11-H96/1958/G3P[2]G3P[2]I2R2C5M5A5N5T5E2H5
RVA/Cat-tc/AUS/Cat97/1984/G3P[3]G3P[3]I3R3C2M3A9N2T3E3H6
RVA/Cow-tc/USA/WC3/1981/G6P[5]G6P[5]I2R2C2M2A3N2T6E2H3
RVA/Human-tc/GBR/ST3/1975/G4P2A[6]G4P[6]I1R1C1M1A1N1T1E1H1
RVA/Cat-tc/AUS/Cat2/1984/G3P[9]G3P[9]I3R3C2M3A3N1T6E3H3
RVA/Human-tc/IDN/69M/1980/G8P4[10]G8P[10]I2R2C2M2A2N2T2E2H2
RVA/Cow-tc/THA/A5-13/1988/G8P[1]G8P[1]----A14----
RVA/Cow-tc/USA/B223/1983/G10P[11]G10P[11]I2R2C2M2A13N2T6E2H3
RVA/Horse-tc/GBR/H-2/1976/G3P[12]G3P[12]I6R2C2M3A10N2T3E2H7
RVA/Human-wt/IND/HP140/1987/G6P[13]G6P[13]I2------E1H1
RVA/Sheep-tc/ESP/OVR762/2002/G8P[14]G8P[14]I2R2C2M2A11N2T6E2H3
RVA/Ovine/CHN/Lp14/1981/G10P[15]G10P[15]I2------E2H3
RVA/Human/IND/RMC321/1989/G9P[19]G9P[19]I5---A1N1T1E1H1
RVA/Mouse/Brazil/EHP/1981/G16P[20]G16P[20]----A7--E7-
RVA/Rabbit/ITA/160-01/2002/G3P[22]G3P[22]-------E5-
RVA/Pig-wt/ESP/34461-4/2003/G2P[23]G2P[23]I5------E1H1
RVA/Rhesus-wt/USA/TUCH/2002/G3P[24]G3P[24]I9R3C3M3A9N1T3E3H6
RVA/Human-wt/NPL/KTM368/2004/G11P[25]G11P[25]I12R1C1M1A1N1T1E1H1
RVA/Human-tc/ITA/PA260-97/1997/G3P[3]G3P[3]I3R3C3M3A15N2T3E3H6
RVA/Cow-wt/ARG/B383/1998/G15P[11]G15P[11]I2R5C2M2A13N2T6E12H3
RVA/Horse-wt/ARG/E30/1993/G3P[12]G3P[12]I6R2C2M3A10N2T3E12H7
RVA/Pig/ITA/134/04-15/2003/G5P[26]G5P[26]I5------E1-
RVA/Dog-tc/ITA/RV198-95/1995/G3P[3]G3P[3]-------E8-
RVA/Pig/THA/CMP034/2000/G20P[27]G2P[27]I5------E9H1
RVA/Human-wt/ECU/Ecu534/2006/G20P[28]G20P[28]I13R13C13M12A23N13T15E20H15
RVA/Cow-wt/JPN/Azuk-1/2006/G21P[29]G21P[29]I2R2C2M2A13N2T9E2H3
RVA/Turkey-tc/DEU/03V0002E10/2003/G22P[35]G22P[35]I4R4C4M4A16N4T4E11H4
RVA/Chicken-tc/DEU/02V0002G3/2002/G19P[30]G19P[30]I11R6C6M7A16N6T8E10H8
RVA/Pig-wt/IRL/61-07-ire/2007/G2P[32]G2P[32]---------
RVA/Pheasant-wt/HUN/Phea14246/2008/G23P[x]G23----------
RVA/Pig-wt/CAN/CE-M-06-0003/2005/G2P[27]G2P[27]I14--------
RVA/Cow-tc/JPN/Dai-10/2007/G24P[33]G24P[33]I2R2C2M2A13N2T9E2H3
RVA/Cow-wt/JPN/Azuk-1/2006/G21P[29]G21P[29]I2R2C2M2A13N2T9E2H3
RVA/Mouse-tc/USA/ETD_822/XXXX/G16P[16]G16P[16]I7R7C7M8A7N7T10E7H9
RVA/Bat-wt/KEN/KE4852/07/2007/G25P[6]G25P[6]I15-C8--N8T11E2H10
RVA/Pig-wt/JPN/FGP51/2009/G4P[34]G4P[34]---------
RVA/Human-tc/KEN/B10/1987/G3P[2]G3P[2]I16R8C5M5A5N5T5E13H5
RVA/Pig-wt/JPN/TJ4-1/2010/G26P[X]G26----------
RVA/SugarGlider-tc/JPN/SG385/2012/G27P[36]G27P[36]I19R10C10M9A20N11T13E17H12
RVA/Camel-wt/KUW/21s/2010/G10P[15]G10P[15]I1R1C2--N2T2E15H3
RVA/vicugna-wt/ARG/C75/2010/G8P[14]G8P[14]I2R2C2M2-N2T6E16-
RVA/Rabbit-tc/CHN/N5/1992/G3P[14]G3P[14]I17R3C3M3A9N1T1E3H2
RVA/Alpaca-wt/PER/356/2010/G3P[14]G3P[14]I2R5C3M3A17N3T6E3H3
RVA/Pheasant-tc/GER/10V0112H5/2010/G23P[37]G23P[37]I4R4C4M4A16N10T4E4H4
RVA/Camel-wt/SDN/MRC-DPRU447/2004/G8P[11]G8P[11]I2R2C2M2A18N2T6E2H3
RVA/Human-wt/BRA/QUI-35-F5/2010/G3P[9]G3P[9]I18R3C3M3A19N3T3E3H6
RVA/VelvetScoter-tc/JPN/RK1/1989/G18P[17]G18P[17]I4R4C4M4A21N4T4E4H4
RVA/Rat-wt/GER/KS-11-573/2011/G3P[3]G3P[3]I20R11C11M10A22N2T14E18H13
RVA/Fox-wt/ITA/288356/2011/G18P[17]G18P[17]I4R4C4M4A16N4T4E19H4
RVA/Turkey-tc/IRL/Ty-3/1979/G7P[17]G7P[17]I4R4C4M4A16N4T4E11H14
RVA/Human-wt/ITA/ME848-12/2012/G12P[8]G12P[8]I17R12C12M11A12N12T7E6H2
RVA/Human-wt/SUR/2014735512/2013/G20P[28]G20P[28] R13C13M12A23N13T15E20H15
RVA/Common_Gull-wt/JPN/Ho374/2013/G28P[39]G28P[39]I21R14C14M13A24N14T16E21H16
RVA/Alpaca-tc/PER/SA44/2014/G3P[40]G3P[40]I8R3C3M3A9N3T3E3H6
RVA/Human-wt/BEL/BEF06018/2014/G29P[41]G29P[41]I2R2C2M2A3N2T6E2H3
Strain nameVP7VP4VP6VP1VP2VP3NSP1NSP2NSP3NSP4NSP5
RVA/Bat-wt/CMR/BatLi09/2014/G30P[42]G30P[42]I22R15C15M14A25N15T17E22H17
RVA/Bat-wt/CMR/BatLi08/2014/G31P[42]G31P[42]I22R15C15M14A25N15T17E22H17
RVA/Bat-wt/CMR/BatLi10/2014/G30P[42]G30P[42]I23R15C15M14A25N15T17E22H17
RVA/Bat-wt/CMR/BatLy03/2014/G25P[43]G25P[43]I15R16C8M15A26N8T11E23H10
RVA/Bat-wt/NLD/NPpipi1/2014/GxP[44]-P[44]I23R17C16M16A27N16T18-H18
RVA/Rat-wt/CHN/RA116/2013/G3P[45]G3P[45]I3R3C3M10A22N3T3E3H13
RVA/Shrew-wt/CHN/LW9/2013/G32P[46]G32P[46]I24R18C17M17A28N17T19E24H19
RVA/Bat-wt/CMR/BatLy17/2014/G30P[47]G30P[47]I22R15C15M14A25N15T17E22H17
RVA/Rat-wt/ITA/Rat14/2015/G3P[3]G3P[3]I1R11C11M10A22N18T14E18H13
RVA/Bat-wt/CHN/GLRL1/2005/G33P[48]G33P[48]I25R19C18M18-N19T20E25H20
RVA/Bat-wt/CHN/YSSK5/2015/G3P[3]G3P[3]I8R20C2M1A9N3T3E3H6
RVA/Bat-wt/CHN/BSTM70/2015/G3P[3]G3P[3]I8R3C3M3A29N3T3E3H6
RVA/Raccoon-wt/JPN/Rac-311/2011/G34P[17]G34P[17]I26R21C19M19A30N20T21E26H21
RVA/Pig-wt/BGD/214016006/2014/G9P[49]G9P[49]---------
RVA/Alpaca-wt/PER/Alp11B/2010/G35P[50]G35P[50]I13------E16H6
RVA/Bat-wt/ZMB/ZFB14-126/2014/GxP[x]--I22----N21T17E27-
RVA/Bat-wt/KEN/BATp39/2015/G36P[51]G36P[51]I16R22C20M20A31N22T22E27H22
RVA/Bat/CRC/KCR10-93/2010/G20P[47]G20P[47]I13R13C13M12A32N13T23E20-
RVA/Bat/GAB/GKS-929/2009/G3P[2]G3P[2]I30R8C5M5A36N23T5E28H5
RVA/Bovine-wt/UMN-VDL/2018/G37P[52]G37P[52]---------
RVA/common-shrew/KS-11-2281/2011/GXP[X]--I27R23------H23
RVA/Bat-wt/KEN/11/2008/G30P[53]G30P[53]----A33---H24
RVA/Bat-wt/GTM/56/2010/G38P[54]G38P[54]I28R24C21M21A34N24T24E29H25
RVA/Bat-wt/NGA/59/2011/GXP[2]-P[2]I16R8C3M5A35N3T3--
RVA/Bat-wt/GTM/53/2009/GxPx--I29R25---N25T25--
RVA/Shrew-wt/GER/KS14-269/2014/G39P[55]G39P[55]-R26C22M22A37N26T26E30H26
RVA/JungleCrow-wt/JPN/JC-105/2019/G40P[56]G40P[56]I26R21C19M19A30N20T21E26H21
RVA/MultimammateMouse-wt/ZMB/MpR12/2012/G41P[57]G41P[57]I31R27C23M23A38N27T27E31H27
RVA/Shrew-wt/GER/KS11-0893/2010/G42P[58]G42P[58]I32R28C24L24A39N28T28E32H28
Table 3. Strains out of the cutoff value calculated with the p-distance model. Here, we compare the number of stains out of the cutoff value for each genotype, the number of strains out of the cutoff value for each gene, and the total of strains analyzed included in each particular gene.
Table 3. Strains out of the cutoff value calculated with the p-distance model. Here, we compare the number of stains out of the cutoff value for each genotype, the number of strains out of the cutoff value for each gene, and the total of strains analyzed included in each particular gene.
Strains Out of the Cutoff Value with p-Distance Model
GeneGenotypes OutliersStrains OutliersTotal Strains In
Genotype
Rate
VP7G313840.3%
VP4P[8]1015660.64%
P[15]1147.1%
VP6I2137731.7%
I1222010.0%
VP1none-14950%
VP2C1717890.4%
C258690.6%
VP3None- 0%
NSP1None-22080%
NSP2N11416030.9%
N266440.9%
NSP3T1814960.5%
T36708.6%
NSP4E1519520.3%
E2568816.4%
E3157719.5%
NSP5H1315800.2%
H225140.4%
H361474.1%
H1055100%
H1422100%
H1522100%
TOTAL169
Table 4. Compared results of the RVA strain cutoff analysis. We compared the number of sequences analyzed in 2008 with the two models used in this study. In the columns are the number of strains out of the cutoff value. In the case of 2008, the number of strains analyzed are also included.
Table 4. Compared results of the RVA strain cutoff analysis. We compared the number of sequences analyzed in 2008 with the two models used in this study. In the columns are the number of strains out of the cutoff value. In the case of 2008, the number of strains analyzed are also included.
Comparison Between Models
Gene2008 *This Work
Strains AnalyzedStrains Analyzedp-Distance Outside the CutoffK2P Outside the Cutoff
VP7100031721215
VP419027711145
VP614224631595
VP1581495-262
VP258275412199
VP3671507-165
NSP11002208-502
NSP2712358201234
NSP377229414255
NSP41003183761025
NSP5113233220252
* Work of 2008 [4].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Díaz Alarcón, R.G.; Salvatierra, K.; Gómez Quintero, E.; Liotta, D.J.; Parreño, V.; Miño, S.O. Complete Genome Classification System of Rotavirus alphagastroenteritidis: An Updated Analysis. Viruses 2025, 17, 211. https://doi.org/10.3390/v17020211

AMA Style

Díaz Alarcón RG, Salvatierra K, Gómez Quintero E, Liotta DJ, Parreño V, Miño SO. Complete Genome Classification System of Rotavirus alphagastroenteritidis: An Updated Analysis. Viruses. 2025; 17(2):211. https://doi.org/10.3390/v17020211

Chicago/Turabian Style

Díaz Alarcón, Ricardo Gabriel, Karina Salvatierra, Emiliano Gómez Quintero, Domingo Javier Liotta, Viviana Parreño, and Samuel Orlando Miño. 2025. "Complete Genome Classification System of Rotavirus alphagastroenteritidis: An Updated Analysis" Viruses 17, no. 2: 211. https://doi.org/10.3390/v17020211

APA Style

Díaz Alarcón, R. G., Salvatierra, K., Gómez Quintero, E., Liotta, D. J., Parreño, V., & Miño, S. O. (2025). Complete Genome Classification System of Rotavirus alphagastroenteritidis: An Updated Analysis. Viruses, 17(2), 211. https://doi.org/10.3390/v17020211

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop