Evolution of BACON Domain Tandem Repeats in crAssphage and Novel Gut Bacteriophage Lineages
Abstract
1. Introduction
2. Materials and Methods
2.1. Data
2.2. Identification of BACON Domains
2.3. Clustering of BACON Domain-Containing Viral Contigs
2.4. Analysis of Novel Contig Clusters
2.5. Examination of BACON ORF Genetic Neighbourhoods
2.6. Phylogenetic Analysis of BACON Domains
3. Results and Discussion
3.1. Construction of a Specific Profile HMM of the crAss-Like Phage BACON Domain
3.2. BACON Domains Are Found in Diverse Phages
3.3. BACON Domains Have Diverse Configurations in Phages
3.4. Recurrent Evolution of BACON Domain Tandem Repeats in Phages
4. Conclusions
Supplementary Materials
Author Contributions
Funding
Conflicts of Interest
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
de Jonge, P.A.; von Meijenfeldt, F.A.B.; van Rooijen, L.E.; Brouns, S.J.J.; Dutilh, B.E. Evolution of BACON Domain Tandem Repeats in crAssphage and Novel Gut Bacteriophage Lineages. Viruses 2019, 11, 1085. https://doi.org/10.3390/v11121085