Next Article in Journal
Novel Salmonella Phage, vB_Sen_STGO-35-1, Characterization and Evaluation in Chicken Meat
Previous Article in Journal
Special Issue “Enterococci for Probiotic Use: Safety and Risk”: Editorial
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Use and Limitations of the 16S rRNA Sequence for Species Classification of Anaplasma Samples

by
Mitchell T. Caudill
1,2 and
Kelly A. Brayton
1,*
1
Program in Genomics, Department of Veterinary Microbiology and Pathology, Washington State University, Pullman, WA 99164, USA
2
Center for One Health Research, Virginia-Maryland College of Veterinary Medicine, Blacksburg, VA 24060, USA
*
Author to whom correspondence should be addressed.
Microorganisms 2022, 10(3), 605; https://doi.org/10.3390/microorganisms10030605
Submission received: 6 February 2022 / Revised: 4 March 2022 / Accepted: 8 March 2022 / Published: 12 March 2022
(This article belongs to the Topic Veterinary Infectious Diseases)

Abstract

:
With the advent of cheaper, high-throughput sequencing technologies, the ability to survey biodiversity in previously unexplored niches and geographies has expanded massively. Within Anaplasma, a genus containing several intra-hematopoietic pathogens of medical and economic importance, at least 25 new species have been proposed since the last formal taxonomic organization. Given the obligate intracellular nature of these bacteria, none of these proposed species have been able to attain formal standing in the nomenclature per the International Code of Nomenclature of Prokaryotes rules. Many novel species’ proposals use sequence data obtained from targeted or metagenomic PCR studies of only a few genes, most commonly the 16S rRNA gene. We examined the utility of the 16S rRNA gene sequence for discriminating Anaplasma samples to the species level. We find that while the genetic diversity of the genus Anaplasma appears greater than appreciated in the last organization of the genus, caution must be used when attempting to resolve to a species descriptor from the 16S rRNA gene alone. Specifically, genomically distinct species have similar 16S rRNA gene sequences, especially when only partial amplicons of the 16S rRNA are used. Furthermore, we provide key bases that allow classification of the formally named species of Anaplasma.

1. Introduction

The genus Anaplasma contains obligate intracellular bacteria capable of colonizing both mammalian and arthropod cells. Currently, the genus consists of seven formally named species: A. marginale, A. centrale, A. ovis, A. bovis, A. phagocytophilum, A. platys and A. caudatum [1,2]; however, since the last taxonomic organization of the genus [1] at least 25 new species have been proposed in the literature. These proposed species range from clinical isolates of infected animals to unique sequence variants detected in broad multi-species-based 16S rRNA metagenomic studies. Some, such as the proposed “A. capra”, represent an important source of human and animal disease [3,4,5], while others appear as single, orphan isolates from unique animal hosts.
The nature and variety of evidence for the claim of a novel species vary greatly in the reports, but a unique 16S rRNA sequence identity appears to be the most prevalent evidence. The intracellular nature of these bacteria and the consequent inability to produce pure cultures make taxonomic studies and new species descriptions in line with the International Code of Nomenclature of Prokaryotes (ICNP) difficult to impossible [6]. Historically, molecular study of the genus has been limited by technical challenges posed by the intracellular lifestyle despite several species being of either public health or economic importance [7,8,9]. The expanded availability of high-throughput sequencing has provided the ability to acquire new genetic samples without the need for associated mammalian or tick-cell culture. While this has provided a boon for the ability to conduct new analyses, the scattershot application of gene sequencing studies lacking a standardized methodology or reference point has led to a proliferation of proposed novel species without the ability to rigorously and systematically analyze these claims en masse.
Clear species definitions in bacteria are fraught but at the onset of the genomic era of bacterial taxonomy, it was suggested and broadly accepted that bacteria defined as the same species should share at least 70% DNA cross-hybridization (DNA–DNA hybridization or DDH) [10]. DDH assays proved difficult to implement and as genome sequences became available, it was shown that an average nucleotide identity (ANI) of ~95–96% could serve as a proxy for 70% DDH [11]. Following the work of Carl Woese and others, variation in sequence identity of the 16S rRNA gene has been broadly correlated with differences in genomic ANI in prokaryotes, and thus the percentage of DNA–DNA cross-hybridization [12]. While 16S rRNA is a common sequence for metagenomic survey and taxonomic comparison, its utility in resolving isolates to the species level can vary greatly across various groups of bacteria, especially when only partial regions of the genes are used as the point of comparison [13]. In this paper, we evaluate the utility of near-complete and partial 16S rRNA sequences to resolve Anaplasma isolates to the species level and test the relationship between 16S rRNA and average nucleotide identity (ANI) for the genus Anaplasma.
We demonstrate that 16S rRNA sequences must be used with caution when attempting to define species within the genus Anaplasma, as bona fide species as confirmed by whole genome analysis possess 16S rRNA gene identities suggestive of same species status. Nevertheless, the prevalence of unique 16S rRNA sequences across the literature represents a large measure of uncaptured genetic diversity within the current taxonomy and suggests numerous potential novel species within the genus that deserve deeper geno- and phenotypic characterization. Finally, we present an analysis of the 16S rRNA sequences derived from the complete genomes of the genus to determine key bases that may aid in distinguishing the known, formally named species in the absence of a complete genome.

2. Materials and Methods

2.1. Sequences and Alignments

All 16S rRNA gene sequences were obtained from NCBI Genbank. Initial searches were conducted using several key phrases including “16S”, “Anaplasma”, as well as earlier names such as “Ehrlichia equi” that were utilized prior to the reorganization of the order in 2001 [1]. The 737 sequences obtained were compiled and their accession numbers are listed in Supplementary Table S1. A few instances of identical entries were removed, and only sequences over 1300 bp in length were deemed sufficient for inclusion. Additional sequences not captured in the above searches were added following review of the literature for proposed novel Anaplasma species. Sequences were aligned utilizing ClustalW within the Bioedit platform (Version 7.0.5.3) [14] according to species designation. Identical sequences were removed to leave a core set of unique 16S rRNA gene sequence variants for each species (see Supplementary Table S2) to construct the similarity tables. Consensus sequences were generated for each species using Bioedit. Ambiguous nucleotides assigned by Bioedit to consensus sequences were examined and kept if the single-nucleotide polymorphism (SNP) was observed in the locus three or more times. A SNP frequency below this threshold was considered within the range of sequence error. A few sequences (AB211164, JQ839010, KP062964–KP062966, AB588977, KX817983, and AF283007) that were classified in Genbank as A. centrale were reassigned to A. capra (for an explanation, see Khumalo et al., 2018 [15]).

2.2. ANI and Sequence Identity Matrix

OrthoANI values were calculated in a pairwise fashion from whole genome sequences using the EZBio platform (https://www.ezbiocloud.net/tools/ani; accessed on 7 March 2022) [16]. The 16S rRNA gene alignments were used to generate sequence identity matrices using the Bioedit platform (Version 7.0.5.3).

2.3. Phylogenetic Comparison

The phylogenetic tree arising from the consensus 16S rRNA gene sequences was constructed using the Phylogeny.fr platform (www.phylogeny.fr; accessed 7 March 2022) [17,18,19,20,21,22,23]. This platform aligns sequences with MUSCLE (v3.8.31), and removes ambiguous regions with Gblocks (v.091b) using these parameters: block after gap cleaning has minimum length of ten, segments with contiguous non-conserved positions larger than eight are rejected, and flank positions require 85% minimum sequences. The phylogenetic tree was constructed using a maximum likelihood method using PhyML (v3.1/3.0 aLRT) with a HKY85 substitution model with an estimated proportion of invariant sites (of 0.704) and four gamma-distributed rate categories. The gamma shape parameter was estimated directly from the data (gamma = 0.546). Reliability for the internal branch was assessed using the aLRT test (SH-Like). Graphical representation of the phylogenetic tree was performed with TreeDyn (v198.3).

3. Results

3.1. Correlation of 16S rRNA with ANI within the Genus Anaplasma

An initial analysis was conducted to assess the relationship between 16S rRNA gene sequence identity and genome nucleotide identity within the genus Anaplasma. Across the Anaplasma species for which genomes are available, sequence similarity in 16S rRNA below 98.7% corresponded to an ANI percentage difference (i.e., < 96%) indicating a distinct species (Table 1).
This preliminarily analysis confirms the correlation between 16S rRNA identities and ANI for the genus Anaplasma. Interestingly, A. marginale, A. centrale, and A. ovis share 16S rRNA sequence identity above 98.7% but nevertheless have distinctive ANI percentages. This indicates that high 16S rRNA sequence similarity may not indicate species status within the genus Anaplasma. As expected, within A. phagocytophilum and A. marginale, the two species for which multiple genomes have been sequenced, there is high intraspecies correlation of both the 16S rRNA identity and the ANI.

3.2. 16S rRNA Sequence Similarity across Anaplasma Sequences

Table 2 lists twenty-five proposed species documented in the literature, with the associated NCBI accession numbers of the 16S rRNA sequences. Seventeen of these species had sequences of sufficient length to be used in this study. Of note, none of these proposed species have available whole genome sequences. Additionally, few other genes have been sequenced from these organisms, and a consistent set of genes is not available from each organism to be able to study the utility of a larger gene set for species classification.
Comparison of the 16S rRNA gene sequences from the proposed and validated Anaplasma organisms shows that several proposed species have identities greater than the 98.7% cutoff that usually would indicate that two organisms belong to the same species (Table 3). Thirteen of the named species, “Candidatus Anaplasma boleense”, ”Candidatus Anaplasma pangolinii”, “Candidatus Anaplasma testudinis”, Anaplasma sp. Saso, Anaplasma sp. Hadesa, Anaplasma sp. Dedessa, A. capra, A. odocoilei, Anaplasma sp. SA dog, Anaplasma sp. ZAM dog, Anaplasma sp. Ar. walkerae, Anaplasma sp. O. moubata and Anaplasma sp. Shizhu, resolve below a 98.7% identity threshold from a formally named species of Anaplasma. “Candidatus Anaplasma testudinis”, “Candidatus Anaplasma pangolinii”, Anaplasma sp. Ar. walkerae, Anaplasma sp. O. moubata, Anaplasma sp. Shizhu and A. capra each form independent, distinct, profiles while Anaplasma sp. SA dog, and Anaplasma sp. ZAM dog group together. Anaplasma sp. Saso and Anaplasma sp. Hadesa group together as do Anaplasma sp. Dedessa and “Candidatus Anaplasma boleense”. Interestingly, the putative A. odocoilei has less than 98.7% sequence identity with A. platys, but greater than 98.7% to other proposed species that group with A. platys (i.e., Anaplasma sp. Mymensingh, “Candidatus Anaplasma camelii”, and Anaplasma sp. Omatijenne). In the absence of more sequence data, or a complete genome, it is unknown whether the sequences from these organisms represent truly separate species or are perhaps strain variants of A. platys. Notably, two sequences that have been identified as A. platys (KU585989 and KU586001) are distinct from the other A. platys sequences and may represent yet an additional species.
The relationships among these organisms can be seen in the phylogenetic tree based on 16S rRNA gene sequences (Figure 1). The phylogenetic tree forms two clades, both anchored by three validly named species. The clade anchored by A. marginale, A. centrale, and A. ovis primarily infects erythrocytes. While the other, anchored by A. bovis, A. phagocytophilum and A. platys, primarily infects white blood cells and platelets. To visualize the totality of the sequences used in this study, non-redundant sequences were put into an identity matrix with identities ≥98.7 highlighted in dark blue (Figure 2), with those below 95% in white and intermediate identities in light blue. With this coloration as a guide, several samples appear to not be placed correctly with their most similar counterparts, or they do not appear to have a best fit position in the matrix. In other words, this analysis highlights misplaced organisms/species, such as some A. marginale better aligning with A. capra. It also shows that some of the putative species, such as “Candidatus Anaplasma boleense” and “Candidatus Anaplasma testudinis” appear to be unique.

3.3. 16S rRNA Variable Region Sequence Analysis across Anaplasma Sequences

Several studies have relied on partial sequences composed of only a few hypervariable regions of the 16S rRNA (see, for instance, [36]) (Figure 2A). As such, we tested the degree to which hypervariable regions corresponded to results arising from the near-complete 16S rRNA sequence. In analyzing these partial sequences, we determined that hypervariable regions 2 and 6 (V2 and V6) were the most variable for Anaplasma genotypes. We examined the identity matrices of the near-full-length 16S rRNA sequences (Figure 2B), V3–V4 (Figure 2C) and concatenated V2 and V6 (Figure 2D), to determine the precision and accuracy of specific hypervariable region sequences to classify Anaplasma sequences. Using concatenated sequences of V2 and V6 usually resulted in the highest correlation with near-complete sequences (i.e., highest accuracy); however, in a few instances, samples were misclassified into an incorrect species (relatively low precision).

3.4. 16S rRNA Single-Nucleotide Polymorphisms across Closely Related Anaplasma Species

Since several distinct species share greater than 98.7% 16S rRNA sequence identity, we also examined the distinct single-nucleotide polymorphisms that might allow classification of a limited sample. In examining the closely related ruminant-infecting Anaplasma clade (A. marginale, A. centrale, A. ovis and the proposed species Anaplasma sp. Mongolia), only six nucleotides discriminate these species (Table 4). Similarly, when examining the 16S rRNA sequences for the organisms that appear to be closely related to A. platys, there are up to thirteen nucleotides that are differentiating, with a smaller number depending on the exact pair/set of sequences being examined (Table 5). Given the lack of genome sequences, an analysis of the ANI relationships could not be performed.

4. Discussion

A clear species definition for bacteria has proved a vexing and, indeed, philosophical problem [65,66]. Nevertheless, as a practical matter, the consensus opinion in bacterial taxonomy has been that species should be delineated based on the degree of genetic overlap. The technical test for this genetic overlap has progressed from the degree of DNA–DNA hybridization to average nucleotide identity (ANI), to the percentage of 16S rRNA sequence identity. From the analysis of available sequence samples for the genus Anaplasma, we show that classification of samples to the species level remains difficult based on 16S rRNA sequences alone and that ancillary data are likely required to clearly define separate species.
Among the formally named Anaplasma, the 16S rRNA sequences from A. phagocytophilum are easily distinguishable from those of A. marginale, A. centrale and A. ovis; however, single-nucleotide polymorphisms must be used to distinguish this latter group from each other (Table 4), despite significant whole genome ANI and syntenty differences between these species that confirm their designation as unique entities (Table 1). Conversely, while 16S rRNA sequence comparisons showing identities below a 98.7% threshold are broadly used to suggest that two bacteria are separate species, as shown in Figure 2B for the formally named Anaplasma there are numerous sequences in which the samples have been classified as a given species of Anaplasma despite the 16S rRNA sequence having less than 98.7% shared identity to a plurality of sequences within that clade. It is possible that these sequences which diverge from the consensus could be novel species erroneously classified as a known species or, alternatively, it may represent a high degree of intraspecies population variance in the 16S rRNA sequence across the Anaplasma. Interestingly, the variance appears more prominent in the clade of Anaplasma infecting white cells and platelets than the clade infecting erythrocytes. Given the inconsistent sample size of sequence deposits across the various species of Anaplasma, and the lack of corroborating information for many deposited sequences, the intraspecies population heterogeneity for the 16S rRNA sequence for each of the Anaplasma remains impossible to determine in a rigorous manner, but these tentative findings deserve further investigation.
When the sequence identity of near-complete 16S rRNA sequences from the putative species in the literature are examined, eleven resolve below 98.7% identity with samples from validated species indicating greater diversity within the genus Anaplasma than appreciated in the currently accepted taxonomy (Table 3). Several of these also show a low degree of intraspecies variation, though this may result from low sample size and sampling bias in the field study methodology. “Candidatus Anaplasma boleense”,”Candidatus Anaplasma pangolinii”, “Candidatus Anaplasma testudinis”, the two species from argasid ticks and A. capra all group as unique entities (Figure 1) and represent likely candidates for independent species. The status and relationships of A. odocoilei, Anaplasma sp. SA dog, and Anaplasma sp. ZAM dog are more complicated. The similarity of 16S rRNA sequences between A. odocoilei and several putative species, which themselves have high similarity with A. platys may indicate one or even several new species, since 16S rRNA sequence identities above 98.7% can occur among unique Anaplasma species. The same is true for Anaplasma sp. SA dog and ZAM dog, which group together but are distinct from other Anaplasma spp. These may represent variants of a single species (as suggested by Kolo et al., 2020), or two unique entities; additional sequence data or whole genome sequences are needed to resolve this question. Anaplasma sp. Dedessa groups with “Candidatus Anaplasma boleense” and may represent a variant strain. The same may be true for Anaplasma sp. Hadesa and Anaplasma sp. Saso, which group together but apart from other Anaplasma spp. Collectively, they may represent variance within an independent species.
The other proposed species for which 16S rRNA sequences are available but do not resolve at a species level in our analysis are Anaplasma sp. Mymensingh, “Candidatus Anaplasma camelii”, Anaplasma sp. Omatjenne and Anaplasma sp. Mongolia. Anaplasma sp. Mymensingh, “Candidatus Anaplasma camelii”, Anaplasma sp. Omatjenne all group with A. platys, while Anaplasma sp. Mongolia groups closely with A. ovis. For Anaplasma sp. Mymensingh, “Candidatus Anaplasma camelii” and Anaplasma sp. Mongolia the similarity in 16S rRNA sequence identity with a validly named species were noted in their descriptions and differences in the groEL gene sequence was used to justify the creation of a new taxon. For Anaplasma sp. Omatjenne [58], the reasoning for novel nomenclature is not clear. The authors noted that Anaplasma sp. Omatjenne has 99.5% sequence similarity with A. platys but did not elaborate on why they assigned a new name. Presumably it is due to the non-standard host species—Anaplasma sp. Omatjenne was found in sheep, while A. platys is traditionally found in dogs, but this remains speculative on our part.
Our analysis revealed that sequences covering only a few hypervariable regions of the 16S rRNA gene can lead to misclassification of species (Figure 2) and should be avoided in favor of complete or near-complete 16S rRNA sequences. As such, 16S rRNA sequences alone remain poorly suited for species assignment for the genus Anaplasma. This should particularly be noted by those attempting to discriminate Anaplasma species when performing microbiome analysis of ticks or other arthropods. Our findings largely comport to recent studies of the use of groEL for taxonomic placement and strongly indicate that gene specific knowledge is required for the creation of accurate phylogenies [67]. While groEL and gltA, along with various Anaplasma specific msp genes, were additionally considered for our analysis, ultimately too few sequences were available, especially when attempting to match species or strain specific sequences (e.g., pairing a 16S rRNA sequence and groEL sequence arising from the same bacterial isolate). There is, at the present time, no rigorous correlation between differences in sequence identity ratios of 16S rRNA and other housekeeping genes within an Anaplasma species or among the validly named Anaplasma species.
It bears emphasizing that Anaplasma species remain difficult to culture in laboratory settings as they require intracellular replication in either mammalian or arthropod cells. This dramatically limits the ability to generate high-quality genetic material suitable for whole genome sequences, particularly for the clinically observed isolates or those arising from metagenomic analyses. While some third-generation sequencing technologies may aid in relieving this barrier, the inability to generate whole genome sequences remains the greatest bottleneck in mapping the diversity of the genus Anaplasma.
From our analyses it is clear that genetic diversity in Anaplasma is greater than currently captured by the formally named species in the genus Anaplasma, but a more definitive description of the exact number and relationship of species requires further genetic and genome sequencing. We would furthermore urge caution in denoting a new species on the basis of only two genes as in the case of Anaplasma sp. Mymensingh, “Candidatus Anaplasma camelii”, and Anaplasma sp. Mongolia, or particularly for naming putative species (i.e., “Candidatus Anaplasma corsicanum”, “Candidatus Anaplasma ivorensis”, and “Candidatus Anaplasma mediterraneum”) on the basis of 23S rRNA sequences, for which there are only a handful of sequences deposited even for validly named Anaplasma species. Multilocus sequencing may represent a valuable way forward to overcome some of the current issues in Anaplasma taxonomy, as well as a larger number of whole genome sequences for all of the known species of Anaplasma. Accessing samples that are known not to be mixed (i.e., containing more than one species of Anaplasma) with sufficiently high concentrations of target DNA remains challenging.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/microorganisms10030605/s1, Table S1: All 16S rRNA sequence accessions gathered for Anaplasma. Table S2: List of the representative sequences used in analysis, and identical sequences that were removed for Figure 2. Table S3: Identity matrices of the 16S rRNA gene regions.

Author Contributions

M.T.C.: data curation, formal analysis, investigation, methodology, visualization, and writing—original draft, review and editing. K.A.B.: conceptualization, methodology, funding acquisition, project administration, supervision, validation, visualization, and writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded in part by NIH NIAID grant R01AI136832. M.T.C. was supported by an Achievement Rewards for College Scientists (ARCS) Fellowship from the ARCS Foundation.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available data were analyzed in this study. All data were downloaded from NCBI (www.ncbi.nlm.nih.gov/nuccore/) on 17 September 2021. Specific accession numbers are listed in Supplementary Table S1.

Acknowledgments

We thank Agatha Kolo for discussions leading to this line of enquiry.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Dumler, J.S.; Barbet, A.F.; Bekker, C.P.J.; Dasch, G.A.; Palmer, G.H.; Ray, S.; Rikihisa, Y.; Rurangirwa, F.R. Reorganization of genera in the families Rickettsiaceae and Anaplasmataceae in the order Rickettsiales: Unification of some species of Ehrlichia with Anaplasma, Cowdria with Ehrlichia and Ehrlichia with Neorickettsia, descriptions of six new species combinations and designation of Ehrlichia equi and ‘HGE agent’ as subjective synonyms of Ehrlichia phagocytophila. Int. J. Syst. Evol. Microbiol. 2001, 51, 2145–2165. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Validation of the Publication of New Names and New Combinations Previously Effectively Published Outside the IJSB: List No. 57. Int. J. Syst. Bacteriol. 1996, 46, 625–626. [CrossRef] [PubMed] [Green Version]
  3. Guo, W.-P.; Huang, B.; Zhao, Q.; Xu, G.; Liu, B.; Wang, Y.-H.; Zhou, E.-M. Human-pathogenic Anaplasma spp., and Rickettsia spp. in animals in Xi’an, China. PLOS Negl. Trop. Dis. 2018, 12, e0006916. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Yang, J.; Liu, Z.; Niu, Q.; Mukhtar, M.U.; Guan, G.; Liu, G.; Luo, J.; Yin, H. A novel genotype of “Anaplasma capra” in wildlife and its phylogenetic relationship with the human genotypes. Emerg. Microbes Infect. 2018, 7, 1–4. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Peng, Y.; Wang, K.; Zhao, S.; Yan, Y.; Wang, H.; Jing, J.; Jian, F.; Wang, R.; Zhang, L.; Ning, C. Detection and Phylogenetic Characterization of Anaplasma capra: An Emerging Pathogen in Sheep and Goats in China. Front. Cell. Infect. Microbiol. 2018, 8, 283. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Parker, C.T.; Tindall, B.J.; Garrity, G.M. International Code of Nomenclature of Prokaryotes. Int. J. Syst. Evol. Microbiol. 2019, 69, S1–S111. [Google Scholar] [CrossRef]
  7. Chávez, A.S.O.; Herron, M.J.; Nelson, C.M.; Felsheim, R.F.; Oliver, J.D.; Burkhardt, N.Y.; Kurtti, T.J.; Munderloh, U.G. Mutational analysis of gene function in the Anaplasmataceae: Challenges and perspectives. Ticks Tick Borne Dis. 2019, 10, 482–494. [Google Scholar] [CrossRef] [PubMed]
  8. Dumler, J.S.; Choi, K.-S.; Garcia-Garcia, J.C.; Barat, N.S.; Scorpio, D.G.; Garyu, J.W.; Grab, D.J.; Bakken, J.S. Human Granulocytic Anaplasmosis and Anaplasma phagocytophilum. Emerg. Infect. Dis. 2005, 11, 1828–1834. [Google Scholar] [CrossRef] [PubMed]
  9. Aubry, P.; Geale, D.W. A Review of Bovine Anaplasmosis. Transbound. Emerg. Dis. 2010, 58, 1–30. [Google Scholar] [CrossRef] [PubMed]
  10. Wayne, L.G.; Brenner, D.J.; Colwell, R.R.; Grimont, P.A.D.; Kandler, O.; Krichevsky, M.I.; Moore, L.H.; Moore, W.E.C.; Murray, R.G.E.; Stackebrandt, E.; et al. Report of the Ad Hoc Committee on Reconciliation of Approaches to Bacterial Systematics. Int. J. Syst. Evol. Microbiol. 1987, 37, 463–464. [Google Scholar] [CrossRef] [Green Version]
  11. Konstantinidis, K.T.; Tiedje, J.M. Genomic insights that advance the species definition for prokaryotes. Proc. Natl. Acad. Sci. USA 2005, 102, 2567–2572. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Kim, M.; Oh, H.-S.; Park, S.-C.; Chun, J. Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes. Int. J. Syst. Evol. Microbiol. 2014, 64, 346–351. [Google Scholar] [CrossRef] [PubMed]
  13. Johnson, J.S.; Spakowicz, D.J.; Hong, B.-Y.; Petersen, L.M.; Demkowicz, P.; Chen, L.; Leopold, S.R.; Hanson, B.M.; Agresta, H.O.; Gerstein, M.; et al. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat. Commun. 2019, 10, 5029. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Hall, T.A. BioEdit: A User-Friendly Biological Sequence Alignment Editor and Analysis Program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 1999, 41, 95–98. [Google Scholar]
  15. Khumalo, Z.T.H.; Catanese, H.N.; Liesching, N.; Hove, P.; Collins, N.; Chaisi, M.E.; Gebremedhin, A.H.; Oosthuizen, M.; Brayton, K.A. Characterization of Anaplasma marginale subsp. centrale Strains by Use of msp1aS Genotyping Reveals a Wildlife Reservoir. J. Clin. Microbiol. 2016, 54, 2503–2512. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Yoon, S.H.; Ha, S.M.; Lim, J.; Kwon, S.; Chun, J. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Van Leeuwenhoek. 2017, 10, 1281–1286. [Google Scholar] [CrossRef] [PubMed]
  17. Dereeper, A.; Audic, S.; Claverie, J.-M.; Blanc, G. BLAST-EXPLORER helps you building datasets for phylogenetic analysis. BMC Evol. Biol. 2010, 10, 8. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Dereeper, A.; Guignon, V.; Blanc, G.; Audic, S.; Buffet, S.; Chevenet, F.; Dufayard, J.-F.; Guindon, S.; Lefort, V.; Lescot, M.; et al. Phylogeny.fr: Robust phylogenetic analysis for the non-specialist. Nucleic Acids Res. 2008, 36, W465–W469. [Google Scholar] [CrossRef] [PubMed]
  19. Edgar, R.C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Castresana, J. Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic Analysis. Mol. Biol. Evol. 2000, 17, 540–552. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Guindon, S.; Gascuel, O. A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. Syst. Biol. 2003, 52, 696–704. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Anisimova, M.; Gascuel, O. Approximate Likelihood-Ratio Test for Branches: A Fast, Accurate, and Powerful Alternative. Syst. Biol. 2006, 55, 539–552. [Google Scholar] [CrossRef] [PubMed]
  23. Chevenet, F.; Brun, C.; Bañuls, A.-L.; Jacq, B.; Christen, R. TreeDyn: Towards dynamic graphics and annotations for analyses of trees. BMC Bioinform. 2006, 7, 439. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Dahmani, M.; Davoust, B.; Tahir, D.; Raoult, D.; Fenollar, F.; Mediannikov, O. Molecular investigation and phylogeny of Anaplasmataceae species infecting domestic animals and ticks in Corsica, France. Parasites Vectors 2017, 10, 302. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Ehounoud, C.B.; Yao, K.P.; Dahmani, M.; Achi, Y.L.; Amanzougaghene, N.; N’Douba, A.K.; N’Guessan, J.D.; Raoult, D.; Fenollar, F.; Mediannikov, O. Multiple Pathogens Including Potential New Species in Tick Vectors in Côte d’Ivoire. PLOS Negl. Trop. Dis. 2016, 10, e0004367. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Dahmani, M.; Davoust, B.; Sambou, M.; Bassene, H.; Scandola, P.; Ameur, T.; Raoult, D.; Fenollar, F.; Mediannikov, O. Molecular investigation and phylogeny of species of the Anaplasmataceae infecting animals and ticks in Senegal. Parasites Vectors 2019, 12, 1–15. [Google Scholar] [CrossRef] [Green Version]
  27. Guo, W.-P.; Tian, J.-H.; Lin, X.-D.; Ni, X.-B.; Chen, X.-P.; Liao, Y.; Yang, S.-Y.; Dumler, J.S.; Holmes, E.C.; Zhang, Y.-Z. Extensive genetic diversity of Rickettsiales bacteria in multiple mosquito species. Sci. Rep. 2016, 6, 38770. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Fernandes, S.D.J.; Matos, C.A.; Freschi, C.R.; Ramos, I.A.D.S.; Machado, R.Z.; André, M.R. Diversity of Anaplasma species in cattle in Mozambique. Ticks Tick Borne Dis. 2019, 10, 651–664. [Google Scholar] [CrossRef] [PubMed]
  29. Bastos, A.D.; Mohammed, O.B.; Bennett, N.; Petevinos, C.; Alagaili, A. Molecular detection of novel Anaplasmataceae closely related to Anaplasma platys and Ehrlichia canis in the dromedary camel (Camelus dromedarius). Veter. Microbiol. 2015, 179, 310–314. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Sharifiyazdi, H.; Jafari, S.; Ghane, M.; Nazifi, S.; Sanati, A. Molecular investigation of Anaplasma and Ehrlichia natural infections in the dromedary camel (Camelus dromedarius) in Iran. Comp. Clin. Pathol. 2017, 26, 99–103. [Google Scholar] [CrossRef]
  31. Vanstreels, R.E.T.; Yabsley, M.J.; Parsons, N.J.; Swanepoel, L.; Pistorius, P.A. A novel candidate species of Anaplasma that infects avian erythrocytes. Parasites Vectors 2018, 11, 525. [Google Scholar] [CrossRef] [PubMed]
  32. Koh, F.X.; Kho, K.L.; Panchadcharam, C.; Sitam, F.T.; Tay, S.T. Molecular detection of Anaplasma spp. in pangolins (Manis javanica) and wild boars (Sus scrofa) in Peninsular Malaysia. Veter. Parasitol. 2016, 227, 73–76. [Google Scholar] [CrossRef] [PubMed]
  33. Parola, P.; Cornet, J.-P.; Sanogo, Y.O.; Miller, R.S.; Van Thien, H.; Gonzalez, J.-P.; Raoult, D.; Telford, S.R.; Wongsrichanalai, C. Detection of Ehrlichia spp., Anaplasma spp., Rickettsia spp., and Other Eubacteria in Ticks from the Thai-Myanmar Border and Vietnam. J. Clin. Microbiol. 2003, 41, 1600–1608. [Google Scholar] [CrossRef] [Green Version]
  34. Crosby, F.L.; Wellehan, J.F.; Pertierra, L.; Wendland, L.D.; Lundgren, A.M.; Barbet, A.F.; Brown, M.B. Molecular characterization of “Candidatus Anaplasma testudinis”: An emerging pathogen in the threatened Florida gopher tortoise (Gopherus polyphemus). Ticks Tick Borne Dis. 2021, 12, 101672. [Google Scholar] [CrossRef] [PubMed]
  35. Calchi, A.C.; Vultão, J.G.; Alves, M.H.; Yogui, D.R.; Desbiez, A.L.J.; De Santi, M.; de Souza Santana, M.; Da Silva, T.M.V.; Werther, K.; Teixeira, M.M.G.; et al. Ehrlichia spp. and Anaplasma spp. in Xenarthra mammals from Brazil, with evidence of novel ‘Candidatus Anaplasma spp.’. Sci. Rep. 2020, 10, 12615. [Google Scholar] [CrossRef] [PubMed]
  36. Uilenberg, G.; Van Vorstenbosch, C.J.; Perié, N.M. Blood parasites of sheep in the Netherlands. I. Anaplasma mesaeterum sp.n. (Rickettsiales, Anaplasmataceae). Tijdschr. Voor Diergeneeskd. 1979, 104, 14–22. [Google Scholar]
  37. Nakamura, Y.; Kawazu, S.-I.; Minami, T. Antigen profiles of Anaplasma ovis and A. mesaeterum and cross infection trials with them and A. marginale. Veter. Microbiol. 1993, 37, 19–30. [Google Scholar] [CrossRef]
  38. Sun, X.-F.; Zhao, L.; Wen, H.-L.; Luo, L.-M.; Yu, X.-J. Anaplasma species in China. Lancet Infect. Dis. 2015, 15, 1263–1264. [Google Scholar] [CrossRef] [Green Version]
  39. Li, H.; Zheng, Y.-C.; Ma, L.; Jia, N.; Jiang, B.-G.; Jiang, R.-R.; Huo, Q.-B.; Wang, Y.-W.; Liu, H.-B.; Chu, Y.-L.; et al. Human infection with a novel tick-borne Anaplasma species in China: A surveillance study. Lancet Infect. Dis. 2015, 15, 663–670. [Google Scholar] [CrossRef]
  40. Guo, W.-P.; Zhang, B.; Wang, Y.-H.; Xu, G.; Wang, X.; Ni, X.; Zhou, E.-M. Molecular identification and characterization of Anaplasma capra and Anaplasma platys-like in Rhipicephalus microplus in Ankang, Northwest China. BMC Infect. Dis. 2019, 19, 434. [Google Scholar] [CrossRef] [PubMed]
  41. Kawahara, M.; Rikihisa, Y.; Lin, Q.; Isogai, E.; Tahara, K.; Itagaki, A.; Hiramitsu, Y.; Tajima, T. Novel Genetic Variants of Anaplasma phagocytophilum, Anaplasma bovis, Anaplasma centrale, and a Novel Ehrlichia sp. in Wild Deer and Ticks on Two Major Islands in Japan. Appl. Environ. Microbiol. 2006, 72, 1102–1109. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Kawahara, M.; Tajima, T.; Torii, H.; Yabutani, M.; Ishii, J.; Harasawa, M.; Isogai, E.; Rikihisa, Y. High prevalence of an Ehrlichia sp. closely related to Ehrlichia chaffeensis and three Anaplasma spp. infection in deer from Nara Park, Japan. Genbank, 2008; submitted. [Google Scholar]
  43. Sato, M.; Nishizawa, I.; Fujihara, M.; Nishimura, T.; Matsubara, K.; Harasawa, R. Phylogenetic Analysis of the 16S rRNA Gene of Anaplasma Species Detected from Japanese Serows (Capricornis crispus). J. Veter. Med. Sci. 2009, 71, 1677–1679. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Masuzawa, T.; Uchishima, Y.; Fukui, T.; Okamoto, Y.; Muto, M.; Koizumi, N.; Yamada, A. Detection of Anaplasma phagocytophilum from Wild Boars and Deer in Japan. Jpn. J. Infect. Dis. 2011, 64, 333–336. [Google Scholar] [PubMed]
  45. Inokuma, H.; Terada, Y.; Kamio, T.; Raoult, D.; Brouqui, P. Analysis of the 16S rRNA gene sequence of Anaplasma centrale and its phylogenetic relatedness to other ehrlichiae. Clin. Diagn. Lab. Immunol. 2001, 8, 241–244. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Zhou, Z.; Nie, K.; Tang, C.; Wang, Z.; Zhou, R.; Hu, S.; Zhang, Z. Phylogenetic analysis of the genus Anaplasma in Southwestern China based on 16S rRNA sequence. Res. Veter. Sci. 2010, 89, 262–265. [Google Scholar] [CrossRef]
  47. Liu, Z.; Ma, M.; Wang, Z.; Wang, J.; Peng, Y.; Li, Y.; Guan, G.; Luo, J.; Yin, H. Molecular Survey and Genetic Identification of Anaplasma Species in Goats from Central and Southern China. Appl. Environ. Microbiol. 2012, 78, 464–470. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Ge, Y.; Yin, H.; Rikihisa, Y.; Pan, W.; Yin, H. Molecular Detection of Tick-Borne Rickettsiales in Goats and Sheep from Southeastern China. Vector Borne Zoonotic Dis. 2016, 16, 309–316. [Google Scholar] [CrossRef]
  49. Zhuang, L.; Du, J.; Cui, X.-M.; Li, H.; Tang, F.; Zhang, P.-H.; Hu, J.-G.; Tong, Y.-G.; Feng, Z.-C.; Liu, W. Identification of tick-borne pathogen diversity by metagenomic analysis in Haemaphysalis longicornis from Xinyang, China. Infect. Dis. Poverty 2018, 7, 1–8. [Google Scholar] [CrossRef]
  50. Lu, M.; Tian, J.-H.; Yu, B.; Guo, W.-P.; Holmes, E.C.; Zhang, Y.-Z. Extensive diversity of rickettsiales bacteria in ticks from Wuhan, China. Ticks Tick Borne Dis. 2017, 8, 574–580. [Google Scholar] [CrossRef]
  51. Amer, S.; Kim, S.; Yun, Y.; Na, K.-J. Novel variants of the newly emerged Anaplasma capra from Korean water deer (Hydropotes inermis argyropus) in South Korea. Parasites Vectors 2019, 12, 1–9. [Google Scholar] [CrossRef] [PubMed]
  52. Miranda, E.A.; Han, S.-W.; Cho, Y.-K.; Choi, K.-S.; Chae, J.-S. Co-Infection with Anaplasma Species and Novel Genetic Variants Detected in Cattle and Goats in the Republic of Korea. Pathogens 2021, 10, 28. [Google Scholar] [CrossRef]
  53. Staji, H.; Yousefi, M.; Ghaffari Khaligh, S.; Keyvanloo, M.; Ashrafi Tamai, I. No Title. Genbank, 2021; submitted. [Google Scholar]
  54. Inokuma, H.; Oyamada, M.; Kelly, P.J.; Jacobson, L.A.; Fournier, P.-E.; Itamoto, K.; Okuda, M.; Brouqui, P. Molecular detection of a new Anaplasma species closely related to Anaplasma phagocytophilumin canine blood from South Africa. J. Clin. Microbiol. 2005, 43, 2934–2937. [Google Scholar] [CrossRef] [Green Version]
  55. Roy, B.C.; Krücken, J.; Ahmed, J.S.; Majumder, S.; Baumann, M.P.; Clausen, P.-H.; Nijhof, A.M. Molecular identification of tick-borne pathogens infecting cattle in Mymensingh district of Bangladesh reveals emerging species of Anaplasmaand and Babesia. Transbound. Emerg. Dis. 2017, 65, e231–e242. [Google Scholar] [CrossRef] [PubMed]
  56. Kolo, A.O.; Collins, N.E.; Brayton, K.A.; Chaisi, M.; Blumberg, L.; Frean, J.; Gall, C.A.; Wentzel, J.M.; Wills-Berriman, S.; De Boni, L.; et al. Anaplasma phagocytophilum and Other Anaplasma spp. in Various Hosts in the Mnisi Community, Mpumalanga Province, South Africa. Microorganisms 2020, 8, 1812. [Google Scholar] [CrossRef] [PubMed]
  57. Tate, C.M.; Howerth, E.W.; Mead, D.G.; Dugan, V.G.; Luttrell, M.P.; Sahora, A.I.; Munderloh, U.G.; Davidson, W.R.; Yabsley, M.J. Anaplasma odocoilei sp. nov. (family Anaplasmataceae) from white-tailed deer (Odocoileus virginianus). Ticks Tick Borne Dis. 2013, 4, 110–119. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Allsopp, M.T.E.P.; Visser, E.S.; Du Plessis, J.L.; Vogel, S.W.; Allsopp, B.A. Different organisms associated with heartwater as shown by analysis of 16S ribosomal RNA gene sequences. Veter. Parasitol. 1997, 71, 283–300. [Google Scholar] [CrossRef]
  59. Debeila, E.; Oosthuizen, M.; Collins, N. Occurrence of Anaplasma and Ehrlichia species in African buffalo (Syncerus caffer) in Kruger National Park and Hluhluwe-iMfolozi Park in South Africa. Genbank, 2012; submitted. [Google Scholar]
  60. Fischer, T.; Myalkhaa, M.; Krücken, J.; Battsetseg, G.; Batsukh, Z.; Baumann, M.P.O.; Clausen, P.; Nijhof, A.M. Molecular detection of tick-borne pathogens in bovine blood and ticks from Khentii, Mongolia. Transbound. Emerg. Dis. 2020, 67, 111–118. [Google Scholar] [CrossRef] [PubMed]
  61. Vlahakis, P.A.; Chitanga, S.; Simuunza, M.C.; Simulundu, E.; Qiu, Y.; Changula, K.; Chambaro, H.M.; Kajihara, M.; Nakao, R.; Takada, A.; et al. Molecular detection and characterization of zoonotic Anaplasma species in domestic dogs in Lusaka, Zambia. Ticks Tick Borne Dis. 2018, 9, 39–43. [Google Scholar] [CrossRef] [PubMed]
  62. Laloy, E.; Petit, E.; Boulouis, H.-J.; Lacroux, C.; Corbiere, F.; Schelcher, F.; Bonnet, S.; Maillard, R. First detection of Anaplasma phagocytophilum-like DNA in the French izard Rupricapra pyrenaica. Clin. Microbiol. Infect. 2009, 15, 26–27. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Hailemariam, Z.; Krücken, J.; Baumann, M.; Ahmed, J.S.; Clausen, P.-H.; Nijhof, A.M. Molecular detection of tick-borne pathogens in cattle from Southwestern Ethiopia. PLoS ONE 2017, 12, e0188248. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Qiu, Y.; Simuunza, M.; Kajihara, M.; Chambaro, H.; Harima, H.; Eto, Y.; Simulundu, E.; Squarre, D.; Torii, S.; Takada, A.; et al. Screening of tick-borne pathogens in argasid ticks in Zambia: Expansion of the geographic distribution of Rickettsia lusitaniae and Rickettsia hoogstraalii and detection of putative novel Anaplasma species. Ticks Tick Borne Dis. 2021, 12, 101720. [Google Scholar] [CrossRef]
  65. Novick, A.; Doolittle, W.F. ‘Species’ without species. Stud. Hist. Philos. Sci. Part A 2021, 87, 72–80. [Google Scholar] [CrossRef] [PubMed]
  66. Baquero, F.; Coque, T.M.; Galán, J.C.; Martinez, J.L. The Origin of Niches and Species in the Bacterial World. Front. Microbiol. 2021, 12, 1–13. [Google Scholar] [CrossRef] [PubMed]
  67. Ben Said, M.; Belkahia, H.; Selmi, R.; Messadi, L. Computational selection of minimum length groESL operon required for Anaplasma species attribution and strain diversity analysis. Mol. Cell. Probes 2019, 48, 101467. [Google Scholar] [CrossRef]
Figure 1. Phylogenetic tree of the genus Anaplasma with putative species. Validly named species are represented by their consensus sequences and are highlighted in bold. Ehrlichia ruminantium serves as an outgroup for comparison of species distance. Sequences used to construct the tree were 16S rRNA gene regions V2–V7 for each species/putative species and were approximately 1200 bp in length. Phylogeny constructed using a modification of the likelihood-ratio test via the PhyML algorithm with an HKY85 evolutionary model [21,22,23].
Figure 1. Phylogenetic tree of the genus Anaplasma with putative species. Validly named species are represented by their consensus sequences and are highlighted in bold. Ehrlichia ruminantium serves as an outgroup for comparison of species distance. Sequences used to construct the tree were 16S rRNA gene regions V2–V7 for each species/putative species and were approximately 1200 bp in length. Phylogeny constructed using a modification of the likelihood-ratio test via the PhyML algorithm with an HKY85 evolutionary model [21,22,23].
Microorganisms 10 00605 g001
Figure 2. Comparisons of Anaplasma 16S rRNA sequences. (A) A map of the variable regions of 16S rRNA sequence A. marginale St. Maries strain. The start of each variable region was determined by the beginning of the conserved portion of the given variable region. Panels (BD) are colorized representations of sequence identity matrices of regions of the 16SrRNA gene for known and putative Anaplasma species. (B) Comparison of the near-full-length (V2–V9 regions) 16S rRNA sequence identity for 294 sequences of Anaplasma. (C) Comparison of the V3–V4 regions of 16S rRNA sequence identity 294 Anaplasma sequences. (D) Comparison of concatenated V2 and V6 regions of 16S rRNA sequence identity 294 Anaplasma samples. In (BD), each box represents a single 16S rRNA sequence of the indicated species of Anaplasma. Dark blue shading represents shared identity above or equal to 98.7% with white representing shared identity below 95%. Light blue shading represents identities between 95–98.7%. Coding is as follows: Ac: A. centrale; Am: A. marginale; Ao: A. ovis; Mon: Anaplasma sp. Mongolia; Boo: “Candidatus A. boleense”; Ab: A. bovis; Aca: A. capra; Aph: A. phagocytophilum; Apl: A. platys; Omat: Anaplasma sp. Omatjenne; Apan: “Candidatus A. pangolinii”; Ded: Anaplasma sp. Dedessa; Saso: Anaplasma sp. Saso; Had: Anaplasma sp. Hadesa; Arw: Anaplasma sp. Ar. walkerae; Omo: Anaplasma sp. O. moubata; Aod: A. odocoilei; Ate: “Candidatus A. testudines”; SA: Anaplasma sp. SA dog; ZAM: Anaplasma sp. ZAM dog; Acam: “Candidatus A. camelii”; Shi: Anaplasma sp. Shizhu. My: Anaplasma sp. Mymensingh. For explicit sample coding (accession numbers), see Supplementary Table S3.
Figure 2. Comparisons of Anaplasma 16S rRNA sequences. (A) A map of the variable regions of 16S rRNA sequence A. marginale St. Maries strain. The start of each variable region was determined by the beginning of the conserved portion of the given variable region. Panels (BD) are colorized representations of sequence identity matrices of regions of the 16SrRNA gene for known and putative Anaplasma species. (B) Comparison of the near-full-length (V2–V9 regions) 16S rRNA sequence identity for 294 sequences of Anaplasma. (C) Comparison of the V3–V4 regions of 16S rRNA sequence identity 294 Anaplasma sequences. (D) Comparison of concatenated V2 and V6 regions of 16S rRNA sequence identity 294 Anaplasma samples. In (BD), each box represents a single 16S rRNA sequence of the indicated species of Anaplasma. Dark blue shading represents shared identity above or equal to 98.7% with white representing shared identity below 95%. Light blue shading represents identities between 95–98.7%. Coding is as follows: Ac: A. centrale; Am: A. marginale; Ao: A. ovis; Mon: Anaplasma sp. Mongolia; Boo: “Candidatus A. boleense”; Ab: A. bovis; Aca: A. capra; Aph: A. phagocytophilum; Apl: A. platys; Omat: Anaplasma sp. Omatjenne; Apan: “Candidatus A. pangolinii”; Ded: Anaplasma sp. Dedessa; Saso: Anaplasma sp. Saso; Had: Anaplasma sp. Hadesa; Arw: Anaplasma sp. Ar. walkerae; Omo: Anaplasma sp. O. moubata; Aod: A. odocoilei; Ate: “Candidatus A. testudines”; SA: Anaplasma sp. SA dog; ZAM: Anaplasma sp. ZAM dog; Acam: “Candidatus A. camelii”; Shi: Anaplasma sp. Shizhu. My: Anaplasma sp. Mymensingh. For explicit sample coding (accession numbers), see Supplementary Table S3.
Microorganisms 10 00605 g002
Table 1. ANI to 16S rRNA Gene Percent Identities for Anaplasma species.
Table 1. ANI to 16S rRNA Gene Percent Identities for Anaplasma species.
A. phagocytophilumA. marginaleA. centraleA. ovis
StrainHZHZ2Norway V2JMSt. MariesFloridaIsraelHaibei
HZ 99.98–100%96.51–99.8%99.69–100%67.89–96.3%68.10–96.3%68.17–96.3%68.32–96.2%
HZ299.98–100% 96.57–99.8%99.63–100%67.78–96.3%68.38–96.3%68.20–96.3%68.22–96.2%
Norway V296.51–99.8%96.57–99.8% 96.43–99.8%68.16–96.3%68.44–96.3%68.26–96.3%68.13–96.2%
JM99.69–100%99.30–100%96.43–99.8% 68.10–96.3%68.27–96.3%67.96–96.3%67.70–96.2%
St. Maries67.89–96.3%67.78–96.3%68.16–96.3%68.10–96.3% 99.02–99.9%87.56–99.3%84.87–99.3%
Florida68.1–96.3%68.38–96.3%68.44–96.3%68.27–96.3%99.02–99.9% 87.81–99.2%85.28–99.3%
Israel68.17–96.3%68.20–96.3%68.26–96.3%67.96–96.3%87.56–99.3%87.81–99.2% 81.46–99.5%
Haibei68.32–96.2%68.22–96.2%68.13–96.2%67.70–96.2%84.87–99.3%85.28–99.3%81.46–99.5%
White text on black background indicates species that conform to the ANI to 16S rRNA gene percent identity for grouping in a species. The black text on a gray background highlights strains that share high 16S rRNA gene identity with another species, but low ANI values. Black text on a white background shows values for which both ANI and 16S percent indicate different species classification.
Table 2. Putative Anaplasma spp., host source, 16S rRNA sequence accession numbers and references.
Table 2. Putative Anaplasma spp., host source, 16S rRNA sequence accession numbers and references.
Putative SpeciesHostAccession #Ref
Candidatus A. corsicanum”SheepNone[24]
Candidatus A. ivorensis”Tick (Amblyomma variegatum)None[25]
Candidatus A. mediterraneum”SheepNone[24]
Candidatus A. africae”Sheep, Cattle, GoatsMN317253–MN317255 *[26]
Candidatus A. boleense”Mosquitos, CattleKU585969, KU586025[27]
KU586041, KU586162[27]
KU586164, KU586169[27]
KU586177, KU586180[27]
KU586182[27]
MH169152 *[28]
Candidatus A. camelii”CamelsKX765882[29]
KF843823–KF843825[30]
Candidatus A. rodmosense”MosquitosKU586127 *, KU586148 *[27]
KU586144–KU586146 *[27]
KU586134–KU586136 *[27]
KU586141 *[27]
Candidatus A. sphenisci”Penguin (Spheniscus demersus)MG748724 *[31]
Candidatus A. pangolinii”Pangolin (Manis javanica),KU189193[32]
Tick (Amblyoma javanense)AF497580 *[33]
Candidatus A. testudines”Tortise (Gopherus polyphemus)MT62341-MT62345[34]
Cadidatus A. amazonensis”SlothsNone[35]
Candidatus A. brasiliensis”AnteatersNone[35]
A. mesaeterumSheepNone[36,37]
A. capraHuman,KR261618–KR261622[38]
domestic and wild ruminants,KP314237–KP314238[38]
DogsKM206273[39]
MG869526–MG869594[3]
MG869482–MG869510[3]
MH762071–MH762077[40]
AB211164[41]
AB454075[42]
AB509223[43]
AB588977[44]
AF283007[45]
EU709493[46]
FJ389574, FJ389576[46]
JN558820, JN558827[47]
KP062964–KP062966[48]
KP314241[38]
KX817983[49]
KX987331[50]
LC432092–LC432126[51]
MT798599–MT798604[52]
MW721591[53]
A. sp. SA dogDogsAY570538–AY570540[54]
A. sp. MymensinghTicks (Rhipicephalus microplus;MF576175.1[55]
Haemaphysalis bispinosa)MK815558-MK814449[56]
A. odocoileiDeer (Odocoileus virginianus)NR_118489, JX876644[57]
A. sp. OmatjenneSheep, Cattle, GoatsU54806[58]
KC189853[59]
A. sp. MongoliaSheepMK575506[60]
A. sp. ZAM dogDogsLC269823[61]
A. sp. Izard agentIzard (Rupricapra pyrenaica)EU857675 *[62]
A. sp. HadesaCattleKY924884[63]
A. sp. SasoCattleKY924885[63]
A. sp. DedessaCattleKY924886[63]
A. sp. O. moubataTick (Ornithodoros moubata)LC558313[64]
A. sp. Ar. walkeraeTick (Argas walkerae)LC558314[64]
A. sp. ShizhuGoatsFJ389575[46]
* These are partial or fragmented sequences that were not included in the analyses. “Accession #” refers to the Genbank sequence accession number.
Table 3. Sequence identity matrix for 16S rRNA gene “consensus” sequences for Anaplasma spp.
Table 3. Sequence identity matrix for 16S rRNA gene “consensus” sequences for Anaplasma spp.
centmargovisMoncaprabovisphagplatysMymOmatcamodocSAZAMSasoHadeboleDedepangtestwalkmoubShiz
A. centraleID99.799.299.398.194.695.595.796.095.995.995.795.895.994.394.396.296.596.396.496.896.197.2
A. marginale99.7ID99.099.098.194.595.595.896.095.995.995.695.795.894.094.096.196.496.196.496.896.197.0
A. ovis99.299.0ID99.597.894.595.695.695.995.995.995.995.795.894.394.396.096.396.096.596.896.296.9
A. sp. Mongolia99.399.099.5ID97.994.695.695.796.095.995.996.096.096.194.494.496.296.596.396.796.896.497.0
A. capra98.198.197.897.9ID93.995.295.595.995.595.895.495.795.893.993.995.896.195.695.496.595.997.2
A. bovis94.694.594.594.693.9ID94.994.995.495.195.595.395.495.592.992.994.694.796.493.294.294.596.3
A. phagocytophilum95.595.595.695.695.294.9ID96.997.096.897.096.897.897.994.094.096.796.996.494.495.595.295.9
A. platys95.795.895.695.795.594.996.9ID99.098.999.098.297.297.293.993.996.696.896.494.595.695.295.9
A. sp. Mymensingh96.096.095.996.095.995.497.099.0ID99.599.998.997.998.094.194.197.197.297.094.796.095.796.3
A. sp. Omatjenne95.995.995.995.995.595.196.898.999.5ID99.598.897.597.694.094.096.997.196.794.795.995.496.0
Can A. camelii95.995.995.995.995.895.597.099.099.999.5ID98.897.897.994.094.097.097.296.994.895.995.896.3
A. odocoilei95.795.695.996.095.495.396.898.298.998.898.8ID97.597.694.194.196.897.096.894.995.995.796.1
A. sp. SA Dog95.895.795.796.095.795.497.897.297.997.597.897.5ID99.994.694.697.497.697.294.895.895.896.5
A. sp. ZAM dog95.995.895.896.195.895.597.997.298.097.697.997.699.9ID94.794.797.597.797.294.995.895.996.6
A. sp. Saso94.394.094.394.493.992.994.093.994.194.094.094.194.694.7ID10094.494.594.893.694.093.794.3
A. sp. Hadesa94.394.094.394.493.992.994.093.994.194.094.094.194.694.7100ID94.494.594.893.694.093.794.3
Can A. boleense96.296.196.096.295.894.696.796.697.196.997.096.897.497.594.494.4ID99.696.695.296.495.396.4
A. sp. Dedessa96.596.496.396.596.194.796.996.897.297.197.297.097.697.794.594.599.6ID96.795.496.595.496.7
Can A. pangolini96.396.196.096.395.696.496.496.497.096.796.996.897.297.294.894.896.696.7ID94.795.696.696.5
Can A. testudinis96.496.496.596.795.493.294.494.594.794.794.894.994.894.993.693.695.295.494.7ID96.295.394.9
A. sp. Ar. walkerae96.896.896.896.896.594.295.595.696.095.995.995.995.895.894.094.096.496.595.696.2ID95.996.4
A. sp. O. moubata96.196.196.296.495.994.595.295.295.795.495.895.795.895.993.793.795.395.496.695.395.9ID95.7
A. sp. Shizhu97.297.096.997.097.296.395.995.996.396.096.396.196.596.694.394.396.496.796.594.996.495.7ID
White text on black background indicates organisms with sequence identity of >98.7%. The species epithet is shown in full on the left side. The species/putative species are listed from left to right with abbreviated names along the top, and full names from top to bottom at the left side in the same order. Can = Candidatus.
Table 4. Differentiating bases for the ruminant clade of Anaplasma species.
Table 4. Differentiating bases for the ruminant clade of Anaplasma species.
Base Number *
1441562202652741250
A. centraleAATTGT
A. marginaleAGTTGT
A. ovisGRYCTT
Anaplasma sp. MongoliaGACCGC
* Numbering based on Anaplasma marginale St. Maries strain sequence.
Table 5. Differentiating bases for Anaplasma platys and closely related species.
Table 5. Differentiating bases for Anaplasma platys and closely related species.
Base Number *
213224262289693696878879885886890105213091358
A. platysATTTNTRCGTTRYC
Anaplasma sp. MymensinghATTTCTACGTTACC
Anaplasma sp. OmatjenneACTTCTRCGTTGCT
“Candidatus Anaplasma camelii”ATTTCTACGTTATC
A. odocoileiGAGCAAGTACCGCC
* Numbering based on Anaplasma platys S3 strain genome sequence. The consensus sequence of A. platys contains a “C” between bases at position 555–556 and a “T” between bases at position 1030–1031. These insertions are not present in all A. platys strains and are absent in the S3 strain.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Caudill, M.T.; Brayton, K.A. The Use and Limitations of the 16S rRNA Sequence for Species Classification of Anaplasma Samples. Microorganisms 2022, 10, 605. https://doi.org/10.3390/microorganisms10030605

AMA Style

Caudill MT, Brayton KA. The Use and Limitations of the 16S rRNA Sequence for Species Classification of Anaplasma Samples. Microorganisms. 2022; 10(3):605. https://doi.org/10.3390/microorganisms10030605

Chicago/Turabian Style

Caudill, Mitchell T., and Kelly A. Brayton. 2022. "The Use and Limitations of the 16S rRNA Sequence for Species Classification of Anaplasma Samples" Microorganisms 10, no. 3: 605. https://doi.org/10.3390/microorganisms10030605

APA Style

Caudill, M. T., & Brayton, K. A. (2022). The Use and Limitations of the 16S rRNA Sequence for Species Classification of Anaplasma Samples. Microorganisms, 10(3), 605. https://doi.org/10.3390/microorganisms10030605

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop