Next Article in Journal
Regulatory Effect of Methylation of the Porcine AQP3 Gene Promoter Region on Its Expression Level and Porcine Epidemic Diarrhea Virus Resistance
Next Article in Special Issue
Unlocking Survival Mechanisms for Metal and Oxidative Stress in the Extremely Acidophilic, Halotolerant Acidihalobacter Genus
Previous Article in Journal
OneStopRNAseq: A Web Application for Comprehensive and Efficient Analyses of RNA-Seq Data
Previous Article in Special Issue
Determinants of Copper Resistance in Acidithiobacillus Ferrivorans ACH Isolated from the Chilean Altiplano
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome Mining of the Genus Streptacidiphilus for Biosynthetic and Biodegradation Potential

1
Department of Microbiology and Molecular Biology, Chungnam National University, Daejeon 34134, Korea
2
Institute of Intelligence Informatics Technology, Sangmyung University, Seoul 03016, Korea
*
Author to whom correspondence should be addressed.
Genes 2020, 11(10), 1166; https://doi.org/10.3390/genes11101166
Submission received: 18 July 2020 / Revised: 26 September 2020 / Accepted: 29 September 2020 / Published: 3 October 2020
(This article belongs to the Special Issue Genetics and Genomics of Acidophiles)

Abstract

:
The genus Streptacidiphilus represents a group of acidophilic actinobacteria within the family Streptomycetaceae, and currently encompasses 15 validly named species, which include five recent additions within the last two years. Considering the potential of the related genera within the family, namely Streptomyces and Kitasatospora, these relatively new members of the family can also be a promising source for novel secondary metabolites. At present, 15 genome data for 11 species from this genus are available, which can provide valuable information on their biology including the potential for metabolite production as well as enzymatic activities in comparison to the neighboring taxa. In this study, the genome sequences of 11 Streptacidiphilus species were subjected to the comparative analysis together with selected Streptomyces and Kitasatospora genomes. This study represents the first comprehensive comparative genomic analysis of the genus Streptacidiphilus. The results indicate that the genomes of Streptacidiphilus contained various secondary metabolite (SM) producing biosynthetic gene clusters (BGCs), some of them exclusively identified in Streptacidiphilus only. Several of these clusters may potentially code for SMs that may have a broad range of bioactivities, such as antibacterial, antifungal, antimalarial and antitumor activities. The biodegradation capabilities of Streptacidiphilus were also explored by investigating the hydrolytic enzymes for complex carbohydrates. Although all genomes were enriched with carbohydrate-active enzymes (CAZymes), their numbers in the genomes of some strains such as Streptacidiphilus carbonis NBRC 100919T were higher as compared to well-known carbohydrate degrading organisms. These distinctive features of each Streptacidiphilus species make them interesting candidates for future studies with respect to their potential for SM production and enzymatic activities.

1. Introduction

The family Streptomycetaceae within the phylum Actinobacteria is one of the most diverse and ubiquitous soil bacterial groups [1] represented by three genera, classified as Streptomyces, Kitasatospora and Streptacidiphilus, which are closely related on phylogenetic and phenotypic aspects [2]. The representatives of this family are well-known for their roles in biodegradation (e.g., lignocellullose, chitin), secondary metabolite (SM) production and plant growth-promoting potential [3]. Among these three genera, Streptacidiphilus includes acidophilic species, and is phylogenetically more related with Kitasatospora as compared to Streptomyces [4,5]. They form branched substrate mycelia and aerial hyphae which are differentiated into long straight to flexuous chains of smooth surfaced spores. The whole-organism hydrolysates of Streptacidiphilus cell wall peptidoglycan contain LL-diaminopimelic acid (LL-DAP), a feature also observed in Streptomyces [5]. The cell wall peptidoglycan of Kitasatospora, on the other hand, contains either LL-DAP or meso-DAP in different parts of the mycelial structures [6].
Actinobacteria in general have been a great source of novel antibiotics, but the majority of these microbial compounds, such as cycloheximide [7], avermectins [8], hygromycin [9], and many others with known biological implications, have been derived from the family Streptomycetaceae, especially from Streptomyces. Similarly, the genus Kitasatospora has also been the producer of several important SMs, such as setamycin, phosalacine, and endophenaside antibiotics that are mainly active against eukaryotic organisms [10,11,12,13]. Representatives of Streptomycetaceae are also considered as the key organisms in carbon recycling because of their potential to degrade biopolymers [3]. However, only Streptomyces has been the focus of study in most of these studies [14,15]. Nevertheless, the potential of Kitasatospora in the degradation of biopolymers, chitosan in soil for example, has been demonstrated in a limited number of studies [16,17]. Recently, a group of tripeptides designated acidiphilamides were discovered as the first known SMs from a representative of Streptacidiphilus, namely Streptacidiphilus rugosus AM-16T [18]. These acidiphilamides significantly inhibited cellular autophagy and highlighted the fact that under-investigated rare actinobacterial genera could serve as a good resource for novel bioactive natural products.
With the advancement in the genome sequencing technology, it was revealed that microorganisms still represent an essential source for novel natural products that need to be explored [19]. For example, the genome information of two well-studied species, Streptomyces coelicolor A3(2) [3] and Streptomyces avermitilis [20], revealed the presence of several cryptic biosynthetic gene clusters (BGCs), thereby highlighting their potential to produce new metabolites in addition to the existing ones from these organisms [21]. Since the complete genome sequence of Str. coelicolor A3(2) was first reported [3], a large number of genomes for this genus have become available in the public databases (e.g., https://www.ncbi.nlm.nih.gov/genome/). Similarly, the genomes of other two under-represented genera, viz, Kitasatospora and Streptacidiphilus, are also available in these databases, albeit to a limited extent. The genus Streptacidiphilus currently includes 15 validly named (http://www.bacterio.net/streptacidiphilus.html) acidophilic species, which in general grow in the pH range between 3.5 and 6.5, with optimum values around pH 4.5–5.5 [5].
Similar to Streptomyces and Kitasatospora, the distribution of Streptacidiphilus has also been reported from diverse terrestrial habitats such coal mine waste, acid mine drainage, and acidic forest soils [22,23]. Acidophilic actinobacteria are believed to be promising sources of compounds with antifungal activity and acid-stable enzymes [24,25]. Because soil acidophilic actinobacteria have to face competition with fungi in natural habitats, it is anticipated that they may have higher potential than their neutrophilic counterparts in some activities, e.g., antifungal activity [26,27]. Considering the rapid increase in the reports of new species in recent years, Streptacidiphilus may become a main source of novel metabolites together with the two neighboring genera.
Currently, 15 Streptacidiphilus genomes representing 11 species are available in public databases, which makes the genus the least studied one among the three genera of Streptomycetaceae, especially at the genome level. This is supported by the fact that no detailed genomic investigations have been carried out for this genus to date, although there are several brief reports for a limited number of Streptacidiphilus providing basic levels of analysis [4,28,29,30]. This study is a first comprehensive comparative genomic analysis of Streptacidiphilus to get a better understanding of their phylogeny, SM production and biodegradation capabilities. These Streptacidiphilus genomes were further compared with the representative Kitasatospora and Streptomyces species.

2. Materials and Methods

2.1. Streptacidiphilus Strains for Genome Analysis

Three representatives of Streptacidiphilus, Streptacidiphilus albus JL83T (KCTC 9910T), Streptacidiphilus rugosus AM-16T (KCTC 19279T) and Streptacidiphilus oryzae TH49T (KCTC 19220T) were subjected to high quality genome sequencing, as these three species were described in earlier years and represented three different geographic locations. The biomass of the strains was obtained from the cultures grown at 30 °C for 3 days with shaking in ISP (International Streptomyces Project) medium 2 broth (glucose 0.4%, yeast extract 0.4%, malt extract 1%).

2.2. Genome Sequencing, Assembly and Annotation

A high quality genomic DNA for each strain was prepared using a DNA prep kit (Solgent, Daejeon, Korea), and DNA quantification was by PicoGreen dsDNA reagent kit (Thermo Fisher Scientific Korea, Seoul, Korea). Genome library construction, sequencing, and assembly were performed at the Joint Genome Institute (JGI, Berkeley, CA, USA) using PacBio RS [31] sequencing platform. The hierarchical genome assembly process (HGAP v2.2.0.p1) [32] was used to assemble the raw reads. Gene predictions were performed using Prodigal v2.5 [33] software. All coding sequences (CDS) were manually curated and verified by comparison with the publicly available NCBI database. Orthologous groups were assigned by using online functional annotation tool eggNOG-mapper (http://eggnog-mapper.embl.de/) [34], and also mapped to KEGG pathways by using the KEGG Automatic Annotation Server (KAAS: https://www.genome.jp/kegg/kaas/) [35]. Standalone antiSMASH v4.2.0 [36] with default parameters was used to predict the gene clusters that may have potential for the production of SMs. dbCAN2 (http://bcb.unl.edu/dbCAN2/) meta server [37] was used to annotate the carbohydrate active enzymes (CAZymes) [38]. Hits with an e-value threshold of ≤10−5 were considered only. Sequence similarity networks (SSN) were generated by using option C available at Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST: https://efi.igb.illinois.edu/efi-est/) [39] and visualized in Cytoscape v3.7.0 [40].

2.3. Comparative Genomic and Phylogenetic Analysis

For comparative analysis, the genomes of three Streptacidiphilus strains determined in this study, and an additional set of 8 Streptacidiphilus genomes retrieved from the NCBI Genome Database (https://www.ncbi.nlm.nih.gov/genome) were used. Only one genome of a type strain for each species was used in case there were two or more genomes for a given species. In addition, the genome data of 3 representatives each from Kitasatospora and Streptomyces were also retrieved from the NCBI Genome Database for comparative analysis. The partial 16S rRNA sequences of all Streptacidiphilus species were retrieved from public databases and aligned with ClustalW tool in the MEGA X [41] software. A neighbor-joining tree was constructed with a bootstrap test using 1000 resampled dataset, and evolutionary distances were computed by using the Jukes–Cantor method. Additionally, a whole genome based phylogeny was inferred using the Type (Strain) Genome Server (TYGS: https://tygs.dsmz.de/) webserver [42]. The average nucleotide identity (ANI) values across all seventeen (including 6 Kitasatospora and Streptomyces) genomes were calculated using orthoANIu [43]. The Bacterial Pan Genome Analysis (BPGA v1.3) [44] and bacterial pan-genome profile (PanGP v1.0.1) [45] pipelines were used to carry out the pan-genome analysis of all eleven Streptacidiphilus strains at 40% sequence identity cut-off.

3. Results and Discussion

3.1. General Features of Streptacidiphilus Genomes

The high-quality genome sequences of three Streptacidiphilus (S. albus JL83T, S. rugosus AM-16T, and S. oryzae TH49T) species were obtained for which the number of contigs ranged between 1 and 6. Among them, the genome of S. albus JL83T was the largest (9.91 Mb) with a single contig. The DNA G+C content of this strain was 71.8 mol%, which is similar to S. rugosus AM-16T of comparable genome size (9 Mb). The genome sequence of S. rugosus AM-16T consisted of four contigs and the contig length of N50 was 7,115,445 bp. In contrast, the genome of S. oryzae TH49T was smaller (7.81 Mb) with six contigs and N50 of 6,858,095 bp [28]. Besides these three Streptacidiphilus genomes, there are eight additional Streptacidiphilus strains for which genome data are available in the NCBI genome database. There were two entries for S. albus and Streptacidiphilus jiangxiensis, but only the entry with higher level of assembly was selected from each species. Among other Streptacidiphilus species retrieved from NCBI, only Streptacidiphilus bronchialis DSM 106435T represented the high-quality genome, whereas the number of contigs/scaffolds for other genomes ranged between 96 and 281. The G+C content of all Streptacidiphilus strains was observed to be within the range of 70–72%, as described earlier [5], except S. oryzae TH49T (strain with smallest genome size), in which the G+C content was observed as 73.4% and represented the highest among all these strains. Table 1 provides the summary of genomic features of all 11 Streptacidiphilus strains along with selected Streptomyces and Kitasatospora genomes. It can be noted that the number of pseudogenes is higher in both Kitasatospora and Streptacidiphilus than the Streptomyces genomes, although a higher number of pseudogenes has also been reported from Streptomyces as well [46]. Nevertheless, the function of these bacterial pseudogenes is debatable, and they are known to have short retention times as they undergo the process of degradation and removal by the accumulation of mutations [47].
The orthoANIu values shared between the 11 Streptacidiphilus genomes ranged between 76.28% (S. oryzae TH49T and S. jiangxiensis NBRC 100920T) and 91.14% (Streptacidiphilus neutrinimicus NBRC 100921T and Streptacidiphilus melanogenes NBRC 103184T) (Table S1). S. oryzae TH49T exhibited the least similarity (76.28 to 77.31%) with other representatives of Streptacidiphilus. The 16S rRNA gene sequences from all 15 validly described species were used as phylogenetic markers to confirm the taxonomic grouping into the genus Streptacidiphilus (Figure 1), in which each of the three genera were clearly separated from one another. Although the 16S rRNA-based phylogeny offers a handy method for deriving the relationships between species, it does not provide adequate taxonomic resolution for precise identification [48]. In contrast, the phylogenomic analysis of the strains used in this study and additional close neighbors of the query genomes predicted by TYGS webserver gave different views on the relationship among the three genera, as Streptomyces and Streptacidiphilus were paraphyletic while Kitasatospora remained monophyletic (Figure S1). In particular, S. bronchialis DSM 106435T and S. griseoplanus IFO 12779T as well as S. oryzae TH49T fell outside the Streptacidiphilus group (Figure S1). The ANI indicated closer relationship of these species to Kitasatospora, but their reclassification does not seem straightforward as S. oryzae TH49T was placed out of both Streptacidiphilus and Kitasatospora clades in the genome tree, and S. bronchialis DSM 106435T and S. griseoplanus IFO 12779T were reported to contain LL-diaminopimelic acid in the cell wall [49], which is not consistent with that of Kitasatospora. Further taxonomic studies would clarify the correct taxonomic affiliation of these taxa within the family.

3.2. Functional Annotation

Assignment of orthologous groups and functional annotation revealed that on average about 11(±0.55)% of the proteins in each of the eleven Streptacidiphilus strains were assigned to the transcription (K) category, followed by 8.55(±0.59)% to amino acid transport and metabolism (E), and 6.12(±1.26)% to carbohydrate transport and metabolism (G) categories, respectively. Although all of these Streptacidiphilus strains exhibited more or less similar COG profiles, significant variation in the G category was observed. For example, the percentage of proteins annotated to carbohydrate transport and metabolism for S. carbonis NBRC 100919T (8.37%), Streptacidiphilus jeojiense NRRL B-24555T (7.66%) and S. oryzae TH49T (7.59%) was much higher compared to other strains (Figure 2A). Overall, about 87% of the S. oryzae TH49T and S. bronchialis DSM 106435T proteins were observed to have orthologs in the COG database. For the remaining Streptacidiphilus strains, the percentage ranged between 80 (S. albus JL83T) and 86% (S. jiangxiensis NBRC 100920T). These figures include about 19% of the proteins in each genome belonging to unknown functional (S) categories. When the COG profile of these Streptacidiphilus strains was compared with the related representatives of Streptomycetaceae, both Streptacidiphilus and Kitasatospora were found to have higher number of proteins with unknown functions by about 5% than Streptomyces. This suggests that Streptacidiphilus along with Kitasatospora are the under-investigated representatives/genera of the family Streptomycetaceae.
Moreover, several enzymes belonging to various transposase families were also detected in each of the Streptacidiphilus genomes, especially in higher numbers in the high-quality (≤10 contigs) draft genomes. Studies have reported a highly variable number of transposase genes in prokaryotic genomes that can range from anywhere between 0 to several thousand per genome [58]. The most abundant transposase identified in the Streptacidiphilus genomes represented IS5/IS1182 family transposases. Transposases are mobile genetic elements and are believed to be crucial for the plasticity of bacterial genomes as well as host adaptations [59]. These transposases were also detected in the Kitasatospora and Streptomyces species used in this study. Similarly, 14, 7 and 7 CRISPR-associated (Cas) proteins were detected in the genomes of S. albus JL83T, S. neutrinimicus NBRC 100921T and S. bronchialis DSM 106435T. Most of these Cas proteins were detected in the unique genomes of these three individual species, whereas no such proteins were annotated in other Streptacidiphilus genomes. The CRISPR-Cas systems have already been employed for the discovery and characterization of biosynthetic compounds from Streptomyces [60,61].
Large numbers of proteins from all strains were mapped to different types of metabolism-related KEGG pathways and exhibited a similar profile in all 17 strains. However, the top pathway in which the maximum number of proteins were mapped in all strains was “ABC transporter” pathway of KEGG database. Currently, the ABC transporter represents the largest protein family and is known to transport a wide range of molecules across the cellular membranes [62].

3.3. Pan-Genome Analysis

The pan-genome analysis of 11 Streptacidiphilus genomes resulted in 41,602 genes in the accessory/dispensable genome and 1736 sequences assigned to the core genome. The accessory genome of individual Streptacidiphilus strains ranged between 1915 (S. bronchialis DSM 106435T) and 4678 (S. anmyonensis NBRC 103185T) genes. Similarly, a total of 13,297 unique or genome-specific genes (singletons) were observed in Streptacidiphilus ranging between 554 (S. melanogenes NBRC 103184T) and 2066 (S. albus JL83T) genes. This pan-genome is slightly larger as compared to the pan-genome of 17 Streptomyces [63] and is expected to increase as the genomes of more Streptacidiphilus strains are sequenced. The Streptacidiphilus pan-genome shows the characteristics of an “open” pan-genome [64], the size of which increases with the sequential addition of new genomes (Figure 3A). The core genome profile also shows the expected gradual decrease with the sequential addition of new genomes. Similarly, the number of new genes does not converge to 0 as new genomes are added (Figure 3B). These observations are supported by the power law regression analysis, which shows that the pan-genome of Streptacidiphilus is in fact “open” with Bpan = 0.53. Such open pan-genomes are often found in bacterial species dwelling in diverse ecological habitats with complicated lifestyles, and show predisposition towards horizontal gene transfer (HGT) [64,65]. The Streptacidiphilus pan-genome was further discussed in terms of core, accessory and unique genes.

3.3.1. Core Genome

The core genome of 11 Streptacidiphilus strains consisted of 1736 protein-coding genes (CDS). About 20% of these core genes were assigned to an unknown functional (S) category by COG analysis, whereas no orthologs were detected for approximately 4% of core CDS. Among the remaining core genome sequences, the enriched functional categories included amino acid metabolism and transport (E), transcription (K), translation, ribosomal structure and biogenesis (J), energy production and conversion (C), nucleotide metabolism and transport (F), carbohydrate metabolism and transport (G), and coenzyme metabolism and transport (H) (Figure 2B). Some of these categories in the core genome showed a higher percentage as compared to the average of full proteomes of each individual genome (Figure 2E). Coding sequences that belong to these orthologous groups are essential for basic cellular functions and survival [66], and also provides the major phenotypic traits [64]. The core genes were also mapped to about 191 KEGG pathways including ribosome, purine metabolism, and oxidative phosphorylation as the top three pathways with 52, 44 and 36 KEGG orthologs mapped to each of these pathways, respectively (Table S2A). Several other pathways implicated in amino acid and carbohydrate-related metabolism were also over-represented. These results are consistent with an earlier study on the related taxonomic group, Streptomyces [63], and highlight the fact that several core genes encoding transcriptional regulators and sigma factors may be a characteristic of this family and is in accordance with their sophisticated transcriptional regulatory system that impacts their morphological and physiological differentiation [67]. Overall, 967 unique KEGG orthologs were identified and assigned to 1105 coding sequences which represent about 64% of the core genome. This also means that some orthologs were assigned to more than one core gene. For example, at least 10 coding sequences from the core genome were mapped to KEGG ortholog “K01990”, an ABC-2 type transport system ATP-binding protein. In bacteria, such transporters aid the secretion of antibiotics through the cell membrane besides contributing towards self-resistance to the synthesized antibiotics [68].

3.3.2. Accessory Genome

The accessory components of Streptacidiphilus genomes ranged between 1915 (S. bronchialis DSM 106435T) and 4678 (S. anmyonensis NBRC 103185T) CDS (Table 1). On average, a slightly higher percentage (~21%) of accessory genomes for each Streptacidiphilus strain were assigned to an unknown functional category (S). In fact, this percentage is highest among the core, full as well as the unique genomes (Figure 2E), and this may suggest that certain specific functional roles of Streptacidiphilus are probably performed by these accessory genes, which nevertheless remains to be established experimentally. In addition, a slightly higher number of accessory CDS were assigned to K, G, cell wall/membrane/envelop biogenesis (M), and secondary metabolites biosynthesis, transport and catabolism (Q) categories as compared to the CDS of other genomes (Figure 2C). Generally, the accessory genome offers diversity within a species, and may perform functions and pathways that are secondary for bacterial growth but may be otherwise advantageous to overcome adverse environmental conditions [64]. Furthermore, the accessory genome was mapped to 273 (higher than the core genome) KEGG pathways with 119, 74 and 42 orthologs detected for ABC transporters, two-component system, and amino sugar and nucleotide sugar metabolism pathways, respectively (Table S2B). About 13,376 (32%) accessory sequences were mapped to 1588 KEGG orthologs. The topmost ortholog, to which 249 sequences from the representative accessory genome were mapped, was “K00059” with 3-oxoacyl-[acyl-carrier protein] reductase (fabG, OAR1) activity. In addition to their roles in fatty acid metabolism, these reductases are also involved in the biosynthesis of prodigiosin and antibiotics (https://www.genome.jp/dbget-bin/www_bget?ko:K00059). Such enzymes are also attractive targets for the design of new antimicrobial compounds [69]. The second most abundant KEGG ortholog found in the accessory genome was again ABC-2 type transport system ATP-binding protein (“K01990”), to which sequences were mapped. The other ortholog to which as many as 149 accessory sequences were mapped was “K12132”, which is a eukaryotic-like serine/threonine-protein kinase (prkC, stkP). Studies have shown that such kinases are present in several prokaryotes and regulate key processes related to cell division, morphogenesis and development, although their substrates and mechanisms of action vary from one bacterial species to another [70]. The large number of stkP orthologs observed in the accessory genome of Streptacidiphilus could be attributed to the fact that several eukaryotic-like serine/threonine-protein kinase (ESTPK) encoding genes have been reported from related Streptomyces. For example, the genome sequence of Str. coelicolor A3(2) led to the identification of at least 34 putative ESTPK genes [71].

3.3.3. Unique Genome

A maximum of 2066 singletons were observed in S. albus JL83T, whereas S. melanogenes NBRC 103184T contained only 554 singletons (Table 1). Comparison of the species-specific COG profiles highlighted the presence of a large number of singletons from S. oryzae TH49T annotated for carbohydrate transport and metabolism (G) category (Figure 2D), which included those for several proteins with CAZyme domains.
Similar to the accessory genome, the average number of CDS annotated in Q category of COG was slightly higher in all strain-specific genes as compared to the core and full genome CDS (Figure 2E). The number of strain-specific NRPS proteins ranged between 1 (S. neutrinimicus NBRC 100921T) and 9 (S. bronchialis DSM 106435T). Similarly, at least 18 PKS singletons were detected in S. bronchialis DSM 106435T genome, which is almost three times the number of PKS singletons identified in the genomes of S. albus JL83T and S. anmyonensis NBRC 103185T. Moreover, strains such as S. carbonis NBRC 100919T, S. jeojiense NRRL B-24555T, and S. oryzae TH49T, which have shown a higher number of proteins annotated in G category, exhibited no or a limited number of NRPS and PKS proteins in their unique genomes. The top two KEGG pathways detected in the unique genomes were similar to those of accessory genomes (i.e., ABC transporters, two-component system). However, the third top KEGG pathway identified within the unique genomes was pyruvate metabolism (Table S2C). Additionally, the most abundant KEGG orthologs with a maximum of 62 and 46 orthologs were “K00059” and “K12132”, respectively, as described above for the accessory genomes.
Overall, there were about 179 KEGG orthologs shared commonly between core, accessory and unique genomes (Figure 2F). Individually, the number of orthologs mapped to the unique genomes was much lower (365) as compared to the orthologs identified for core (531) and accessory (775) genomes. These data are consistent with the COG analysis in which the unique genome was least annotated (56.56%, ±17.77) as compared to other genomes, thereby again suggesting the potential novel/unique functions carried out by the CDS of unique genomes.

3.4. Genes Related to Morphological Properties

Considering the fact that Streptacidiphilus shares key chemotaxonomic and morphological properties with related Streptomyces and Kitasatospora [3], the genes involved in the development of aerial mycelium and cell wall peptidoglycan were investigated. Several orthologs of the proteins encoded by bld cascade genes such as bldA, bldB, bldC, bldD, bldG, bldH, bldKA, bldM, bldN, and amfC that are involved in the formation of aerial mycelium [72,73] were observed in the genomes of all Streptacidiphilus strains. However, some of these genes were not identified straightforwardly and required local BLAST search. For example, the amino acid sequence (AAA79120) of bldB gene from Str. coelicolor A3(2) was used as a query and scanned against the proteomes of individual Streptacidiphilus species, and exhibited about 40–55% sequence identity with the potential bldB orthologs from all Streptacidiphilus strains. All bldB protein orthologs including the amino acid sequence from Str. coelicolor A3(2) were assigned to the category of unknown function (DUF397: http://pfam.xfam.org/family/PF04149) in the Pfam [74] database. In contrast, the amino acid sequence (NP_627022) encoded by bldH (also known as AdpA homologue) gene from Str. coelicolor A3(2) exhibited about 80-86% sequence identity with the potential bldH protein sequences of almost all strains. The only exception to this higher sequence identity was observed in the cases of S. albus JL83T, S. carbonis NBRC 100919T and S. melanogenes NBRC 103184T, in which the amino acid sequence identity for this gene was observed to be 49.37 (WP_034091671), 50.15 (WP_042407209) and 48.23% (WP_042383536), respectively. The bldH protein regulated by bldA is an essential intermediate transcription regulator in the bld cascade [75]. Similar to bld developmental master regulators, several whi family genes (e.g., whiA, whiB, whiG, whiH) that are involved in regulating sporulation [73] were also identified in all Streptacidiphilus genomes. Mutational studies have shown that bld mutants fail to develop aerial hyphae and lack the typical “fuzzy” appearance of the wild type, therefore showing a shiny, “bald” phenotype. In contrast, whi mutants are capable of forming aerial hyphae, but cannot complete their life cycle by forming mature spore chains [76,77].
The cell wall peptidoglycan of Streptacidiphilus resembles that of Streptomyces and also contains LL-diaminopimelic acid (LL-DAP) as the major diamino acid [5]. The murE genes encode enzymes that catalyze the formation of the UDP-N-acetylmuramic acid (UDP-MurNAc) tripeptide in the biosynthesis of bacterial peptidoglycan [78]. Two murE genes in the genomes of all three Streptacidiphilus species were identified, which is akin to most of the Streptomyces and Kitasatospora genomes. The first murE gene encodes a protein with UDP-N-acetylmuramoyl-L-alanyl-D-glutamate-2,6-diaminopimelate ligase activity, and the length of this enzyme ranged between 500 and 580 amino acid residues in 11 Streptacidiphilus strains as well as in Streptomyces and Kitasatospora. However, an additional homologue (490 amino acids) of this first murE gene was identified in Kitasatospora azatica KCTC 9699T and can be seen as an outlier in Figure S2A. In contrast, the second murE gene encodes a shorter protein with approximately 400 amino acids in Streptacidiphilus and other strains of Streptomycetaceae. This shorter murE protein contains a domain of an unknown function (DUF1727) which is present towards the C terminal region of bacterial proteins that include UDP-N-acetylmuramyl tripeptide synthase and the related Mur ligase (http://pfam.xfam.org/family/PF08353.10). On the other hand, only one copy of dapF gene, the product of which is involved in the isomerization of LL-DAP into meso-DAP [79,80] was identified in most Streptacidiphilus genomes, except for Streptacidiphilus anmyonensis NBRC 103185T and S. neutrinimicus NBRC 100921T, in which two copies of this gene were present in both genomes (Figure S2B). This is consistent with earlier studies where the occurrence of more than one dapF genes was reported in some species of Kitasatospora and Streptomyces [81].

3.5. Predicted BGCs of Streptacidiphilus

Table 1 gives the overview of the number of BGCs identified in each genome. Overall, 506 BGCs were predicted in the 17 genomes used in this study, including 305 from 11 Strepacidiphilus. The number of BGCs for Streptacidiphilus ranged between 15 (S. oryzae TH49T) and 38 (S. neutrinimicus NBRC 100921T), which corresponds to 1.92 and 4.15 BGCs per Mb, respectively. The average number of BGCs per Mb was 3.17(±0.85).
The distribution of most abundant types of BGCs for SMs detected in each Streptacidiphilus genome is summarized in Table 2. From the table, it can be seen that the BGCs for siderophores and terpenes are the only clusters that are commonly present in all 11 Streptacidiphilus genomes as well as representative Kitasatospora and Streptomyces. The number of non-ribosomal peptide synthetase (NRPS) gene clusters in Streptacidiphilus ranged between 1 (S. bronchialis DSM 106435T and S. melanogenes NBRC 103184T) and 5 (S. jiangxiensis NBRC 100920T). A few NRPS clusters were also present as a part of hybrid clusters, whereas no NRPS cluster (including hybrids) was observed in S. oryzae TH49T. None of the Streptacidiphilus strains with high quality genomes (≤10 contigs) possessed type 1 polyketide synthase (T1PKS) gene clusters, except as a component of hybrid clusters. A broad variety of potential structures predicted from these BGCs are shown in Table S3. Some of the important types of BGCs that may be involved in the synthesis of these potential compounds are discussed below.
Since 7 out of 11 Streptacidiphilus genomes have several contigs, it is expected to have a number of BGCs to be located at contig edges, giving some level of redundancy in the number of BGCs. For example, the number of BGCs predicted at contig edges in draft Streptacidiphilus genomes (genomes with number of contigs between 96–281) ranged between 4 in S. jeojiense NRRL B-24555T (144 contigs) and 22 in S. neutrinimicus NBRC 100921T (184 contigs). In contrast, only two (one each in S. albus JL83T and S. oryzae TH49T) BGCs were identified in strains with high-quality (≤10 contigs) genomes. Similarly, only one BGC in the case of Str. albus DSM 41398T was detected on the contig edge among all the representative Streptomyces and Kitasatospora genomes. Although the number of BGCs from draft genomes may be overpredicted, they would offer a worthy initial measure of the potential biosynthetic diversity [82].

3.5.1. Type 1 Polyketide Synthase (T1PKS) BGCs

The T1PKS with 36 such BGCs distributed among 11 Streptacidiphilus species was one of the highly abundant types of SM producing gene clusters. However, as mentioned above, none of the four high-quality Streptacidiphilus genomes contained T1PKS clusters except as a part of hybrid clusters. Of all the T1PKS clusters, 11 were detected in S. neutrinimicus NBRC 100921T, but most of these clusters were small and consisted of only one to five genes, except in the case of clusters 9, 14, 27 and 31. Similarly, most of the T1PKS type BGCs from S. carbonis NBRC 100919T consisted of a few genes only, except for cluster 15, which consisted of several PKS enzymes with well-defined modular structure and exhibited 30% similar genes with geldanamycin BGC from Streptomyces hygroscopicus NRRL 3602 in MIBiG (https://mibig.secondarymetabolites.org/) [83,84] database. In contrast, five out of the six BGCs from S. melanogenes NBRC 103184T consisted of multiple genes with well-defined modular structures of their PKS enzymes. Among the T1PKS clusters from S. melanogenes NBRC 103184T, cluster 4 exhibited a maximum of 39% of similar genes with ECO-02301 BGC from Streptomyces aizunensis, the compound known as an antifungal agent [85]. The low similarities with the known BGCs suggest that the potential T1PKS biosynthetic compounds predicted by antiSMASH from Streptacidiphilus species (Table S3) may be novel. Some of these BGCs did not show any similarity with the known BGCs available in the databases. Overall, among all the T1PKS BGCs, cluster 17 from S. anmyonensis NBRC 103185T shared the maximum gene content similarity (60% similar genes) with the known ebelactone A BGC from Kitasatospora aburaviensis [86].

3.5.2. Non-Ribosomal Peptide Synthetase (NRPS) BGCs

Overall, 28 NRPS BGCs were identified from 11 Streptacidiphilus strains, which include 7 clusters from 4 (S. albus JL83T, S. oryzae TH49T, S. rugosus AM-16T and S. bronchialis DSM 106435T) high quality draft genomes. Among these, four NRPS clusters were detected from S. albus JL83T, three (clusters 9, 26 and 30) out of which consisted of NRPS proteins with well-defined modular structures (Figure S3). Although a core structure was predicted for all 4 NRPS clusters (Table S3), the gene content similarities with existing BGCs were significantly low (<25%), except for cluster 30, in which 50% of genes showed similarity with cyanopeptin BGC from Planktothrix agardhii, a filamentous cyanobacterium [87]. However, the amino acid identities between the individual NRPS sequences from P. agardhii and S. albus JL83T were around 40%, in addition to the varying modular structures and domain organization. In the case of S. rugosus AM16T, only cluster 18 consisted of NRPS proteins, having a well-defined modular structure. Only 28% of genes from this cluster exhibited similarity with a known BGC for erythrochelin, a siderophore. A single NRPS cluster (cluster 17) was detected in the genome of strain S. bronchialis DSM 106435T, which shared only 8% gene content similarity with the BGC for cyclomarin, an antimycobacterial and antimalarial cyclopeptide.
Overall, each of these abovementioned NRPS clusters ranged in size approximately between 58 (cluster 9 of S. albus JL83T) and 82 Kbp (cluster 17 of S. bronchialis DSM 106435T). A high degree of variation in the type, number and organization of various NRPS domains among all these clusters was also observed (Figure S3). For example, in addition to the varying number of classical condensation (C), adenylation (A) and thiolation (T, also known as peptidyl carrier protein (PCP) domain) domains, certain tailoring domains such as epimerisation (E) or β-ketoreductase (KR) domains were also present in some strains. Additionally, each of these clusters also included genes responsible for transcription regulation and transport.
Similarly, those of the other seven Streptacidiphilus strains (>90 contigs) also exhibited low gene content similarities with the known BGCs except in the case of cluster 8 from S. jeojiense NRRL B-24555T that showed 100% similarity with scabichelin BGC from Streptomyces scabiei 87.22 [88]. The scabichelin BGC contains a core gene that encodes a putative NRPS/siderophore biosynthesis protein in addition to other transport-related genes. These putative NRPS proteins from S. jeojiense NRRL B-24555T and Streptomyces scabiei 87.22 show very similar domain architecture with five C-A-T domains and a single E domain, and exhibited about 70% sequence identity over a coverage of 45%. In spite of such similarities, a difference in the overall predicted structures from these two clusters was observed (Figure S4A). One of the main differences between these two species is that in the case of Streptomyces scabiei 87.22, a second central n-methylation (nMT) domain as well as an additional domain of about 135 amino acid residues is also found towards the C-terminal end. The scabichelin BGC was also predicted in the genomes of K. azatica KCTC 9699T and Kitasatospora setae KM-6054T, although exhibiting only 20% gene content similarity to the known scabichelin BGC, and the putative NRPS/siderophore biosynthesis proteins in both these cases were smaller in length with no predicted domain organization.

3.5.3. T1PKS-NRPS Hybrid Gene Clusters

About 51 hybrid BGCs were detected from the 10 Streptacidiphilus genomes, 17 of which were found in S. albus JL83T. These hybrid clusters were formed by the combination of two or more different types of BGCs and could be as simple as commonly observed T1PKS-NRPS hybrids or as complex as butyrolactone-T2PKS-terpene-siderophore BGC observed in S. jiangxiensis NBRC 100920T. At least one such hybrid cluster was observed in all Streptacidiphilus strains except S. oryzae TH49T. The most abundant type of hybrid cluster was T1PKS-NRPS, and there were seven such clusters from S. bronchialis DSM 106435T (cluster 9, 14 and 15), S. albus JL83T (cluster 5), S. carbonis NBRC 100919T (cluster 14), S. jiangxiensis NBRC 100920T (cluster 26) and S. neutrinimicus NBRC 100921T (cluster 23). The three T1PKS-NRPS clusters from S. bronchialis DSM 106435T shared 10%, 64% and 50% gene content similarities with thioviridamide, neocarzilin and abyssomicin BGCs, respectively. Although thioviridamide is synthesized by ribosomally synthesized and post-translationally modified peptide (RiPP) family BGC [89], there are some common genes present in T1PKS-NRPS (cluster 9) and the known thioviridamide BGC from Streptomyces olivoviridis [89]. For example, LuxR family transcriptional regulators which are involved in quorum-sensing (QS) mechanisms [90] were detected in both BGCs. Thioviridamide and neocarzilin are known to exhibit antitumor activities, and abyssomicin to exhibit antibacterial activities. In contrast, the lone T1PKS-NRPS hybrid cluster from S. albus JL83T consisted of genes that were 100% similar with the BGC of antimycin, a fish poison from Streptomyces sp. S4 [91]. The gene content similarity between the NRPS sequence of both these organisms was 60%, whereas PKS sequences exhibited 66% sequence identity with ~100% coverage. In addition to high amino acid sequence similarities observed between NRPS/PKS sequences of these two strains, their modular organizations were also very similar (Figure S4B). However, one of the main differences between these two clusters was that the T1PKS-NRPS cluster in S. albus JL83T genome was much bigger (~135 Kbp), almost twice the size of antimycin BGC of Streptomyces sp. S4, and consisted of additional genes including two NRPS, one PKS, and other tailoring genes. Among the additional NRPS genes, one showed 57% gene content similarity with the NRPS protein sequence (WP_067134481) from Microtetraspora malaysiensis, whereas the other NRPS sequence shared 52% amino acid sequence identity with adenylation domain-containing protein (WP_123974827) from Streptomyces sp. Ag109_O5-1. In contrast, the amino acid sequence of additional PKS exhibited 54% identity with T1PKS (WP_095581742) from Streptomyces albireticuli. This cluster in fact was also probably the largest T1PKS-NRPS cluster as compared to three other such hybrid clusters from S. bronchialis DSM 106435T that ranged between 76 (cluster 9) and 111 Kbp (cluster 15). Therefore, these data indicate that in spite of high gene content similarities shared between the T1PKS-NRPS cluster (cluster 5) from S. albus JL83T and antimycin BGC from Streptomyces sp. S4, the potential SM produced by S. albus JL83T from this cluster could be novel.
In contrast, cluster 14 of S. carbonis NBRC 100919T showed 24% gene content similarity with the himastatin (BGC0001117) BGC, whereas only 10% of the genes from cluster 23 of S. neutrinimicus NBRC 100921T exhibited similarity with the enduracidin (BGC0000341) BGC of MIBiG database.

3.5.4. Other Hybrid Gene Clusters

In addition to T1PKS-NRPS clusters, the other hybrid clusters either involved a PKS or an NRPS cluster in combination with other types (bacteriocin-T2PKS, butyrolactone-T1PKS-otherks, T2PKS-oligosaccharide-otherKS, terpene-butyrolactone etc). A total of 33 such hybrid clusters were detected, 9 of which belonging to S. albus JL83T, whereas no hybrid cluster was detected in S. oryzae TH49T (Table 2 and Table S2). Some of these clusters from S. albus JL83T contained well-defined modular structures of PKS or NRPS domains, for example clusters 7 (Terpene-T1PKSs), 20 (T3PKS-NRPS) and 22 (T3PKS-lanthipeptide-otherks-NRPS) while exhibiting limited gene content similarities (between 21–42%) with known BGCs. In contrast, 51% genes of T3PKS-T1PKS hybrid cluster (cluster 16) from strain DSM 106435 showed similarity with the known BGCs for PM100117/PM100118, which are antitumor substances from Streptomyces caniferus [92].

3.5.5. Lanthipeptide Gene Clusters

At least 28 (including hybrids) lanthipeptide BGCs were identified from 11 Streptacidiphilus strains, 7 out of which were present in S. albus JL83T. Based on antiSMASH analysis, the seven lanthipeptide clusters from S. albus JL83T had potential for class I (clusters 10, 21 and 23) or class II (clusters 6 and 24) lanthipeptides. In addition to these seven lanthipeptide clusters, one T3PKS-lanthipeptide-otherks-NRPS hybrid cluster (cluster 22) was also observed in S. albus JL83T. While only one lanthipeptide was predicted for most clusters, multiple class II core peptides were predicted for clusters 22 and 24 (Table S4). No core peptides were predicted for clusters 13 and 33. In contrast, three lanthipeptide BGCs (clusters 8, 10 and 12) from S. bronchialis DSM 106435T were classified as those for class I lanthipeptides and predicted to have a core peptide associated with each cluster. Similarly, a core lanthipeptide belonging to class I was predicted for S. carbonis NBRC 100919T (cluster 22), S. jeojiense NRRL B-24555T (cluster 18), and S. jiangxiensis NBRC 100920T (cluster 15). Apart from S. albus JL83T, S. anmyonensis NBRC 103185T (cluster 9: hybrid lanthipeptide-T1PKS), S. melanogenes NBRC 103184T (cluster 8: hybrid T2PKS-lanthipeptide), and Streptacidiphilus pinicola KCTC 49008T (cluster 27) were also predicted to have two, three, and two class II core peptides, respectively (Table S4). Class I lanthipeptide clusters contain a lanthipeptide dehydrogenase (LanB) and a cyclase (LanC), both of which are required for the enzymatic synthesis of lanthipeptides. Class II lanthipeptides, on the other hand, are altered by a bifunctional “LanM” enzyme which comprises an N-terminal dehydratase domain and a C-terminal LanC-like cyclase domain [93]. Although all class I and class II lanthipeptides from 11 Streptacidiphilus genomes contained essential core enzymes for the biosynthesis of lanthipeptides, the number and types of tailoring enzymes varied, which may offer additional modifications specific for each lanthipeptide and ameliorate their activities and/or stability [94].

3.5.6. Terpene Gene Clusters

The biosynthesis of terpenes is in general investigated by determining the products of an individual enzyme known as terpene synthase or commonly referred to as terpene cyclase [95]. However, the synthesized compound may require additional modifications by the action of additional genes to attain biological activity. Therefore, it is essential to identify the underlying gene clusters, which in turn is necessary for the discovery of unique metabolic pathways and their potential industrial applications [96]. In Streptacidiphilus, a total of 53 terpene BGCs were identified, which were found as the most abundant type of BGCs in this genus. S. oryzae TH49T, with eight such clusters, was the top strain, followed by six each for S. jiangxiensis NBRC 100920T and S. carbonis NBRC 100919T. These terpene clusters shared limited similarities with the known BGCs, and hopene was one of the most common matches. In addition to core biosynthetic terpene/phytoene synthases/cyclases, some clusters contained additional biosynthetic enzymes such as phytoene desaturases, dehydrogenases, and glycosyl transferases, etc. Several regulatory and transport genes were also observed in these clusters. Such genes are required for the biosynthesis of terpenes [97], but the number and types of these genes varied across all terpene BGCs. Although knowledge on the terpene production from bacteria is limited, some terpene or terpenoid structures have been reported in addition to few characterized biosynthetic pathways [98]. For example, phenalinolactone (a terpene glycoside), produced by Streptomyces sp. Tü6071 [79], is encoded by a 35-gene cluster and comprises all the biosynthetic as well as regulatory genes required for its production. Similarly, terpenes such as terpenticin and brasilicardin A from other bacterial species are also produced by BGCs with partially characterized biosynthetic pathways [99,100].

3.5.7. Distribution of Known BGCs in Streptomycetaceae

As mentioned above, terpene is one of the most abundant BGCs found in the Streptacidiphilus strains, and the number still remains higher and reaches 78 if the representatives from Streptomyces and Kitasatospora are included (Table 2). The most abundant known BGC present in all these strains was hopene (Figure 4), a terpene-type BGC. Each of the 17 species used in this study consisted of 1 hopene BGC and showed a similarity within a range of 38 (some Streptacidiphilus strains) to 100% (Str. coelicolor) with the reference hopene BGC (BGC0000663) in the MIBiG database. Hopenes play roles in maintaining membrane fluidity and stability [101], and their BGCs are one of the most abundant BGCs observed in Streptomyces along with those for ectoine [102]. Ectoine BGCs are highly conserved in Streptomyces and prevent osmotic stress [103]. In contrast, ectoine BGC was only observed in the genomes of S. jiangxiensis NBRC 100920T and S. oryzae TH49T with 23 and 100% gene content similarities to the reference BGCs in MIBiG database. No ectoine BGC was observed in the Kitasatospora genomes. These results indicate that Streptacidiphilus and Kitasatospora may have developed alternate mechanisms to carry out the functions otherwise performed by ectoines.
The other most abundant type of known BGC observed in Streptacidiphilus was that for macrotetrolide, a siderophore-type SM. Except S. jiangxiensis NBRC 100920T and S. bronchialis DSM 106435T, each Streptacidiphilus species contained at least 1 (2 in the case of S. albus JL83T) macrotetrolide BGC and showed very low gene content similarities (25–33%) to the reference BGC (BGC0000244). Among the six representative Streptomyces and Kitasatospora, a macrotetrolide BGC was present in Streptomyces albus DSM 41398T only, albeit as a part of hybrid butyrolactone-otherKS instead of a siderophore, as was found in the case of Streptacidiphilus.
Although macrotetrolides have been reported from Streptomyces [104], an extended search against an additional 129 complete Streptomyces genomes and 6 high-quality Kitasatospora genomes identified their presence in only seven Streptomyces genomes. No macrotetrolide BGC was detected in any Kitasatospora genome (unpublished data). The macrotetrolides are involved in a wide array of biological activities including antibacterial, antifungal, antitumor, antiprotozoan, antiparasitic, insecticidal and immunosuppressive activities [104].
Another most abundant known BGCs observed among Streptacidiphilus were those for echoside, which were present in 7 out of 11 genomes and exhibited limited gene content similarity (17–35%) with a reference MIBiG entry BGC0000340. No echoside BGC was observed in six representative species of other genera used in this study. When the genomes of 129 Streptomyces were scanned, only 17 such clusters were detected, and none of the additional Kitasatospora genomes contained echoside BGC (unpublished data). Echosides belong to para-terphenyl natural products and show inhibitory activities against DNA topoisomerase I and IIα, in addition to several other range of biological activities [105]. The majority of such compounds have been isolated from fungi, although a limited number of echosides have also been reported from some Streptomyces. Therefore, the presence of echoside BGCs in Streptacidiphilus may offer a rich source of potential novel compounds belonging to this family of natural products.
Despite the fact that the number of Streptacidiphilus genomes are limited at present, the overall distribution of known BGCs was different from Streptomyces. For example, BGCs such as those for albaflavenone and melanin which are otherwise abundant in Streptomyces [102] were not identified in any of the Streptacidiphilus species. On the other hand, only one melanin type BGC was found within the Kitasatospora (Kitasatospora mediocidica KCTC 9733T) genome.
Furthermore, there were at least seven known BGCs that were exclusively identified in Streptacidiphilus genomes. Two (lankacidin and cacibiocin) out of these seven known BGCs were detected in S. pinicola KCTC 49008T and exhibited limited similarity with their corresponding reference BGCs in MIBiG database (Table 3). Overall, these results indicate that in addition to the core (present in all the species of three taxa) SM producing BGCs such as terpene and siderophore, there are some noticeable differences in the overall distribution of these BGCs within the family Streptomycetaceae.

3.6. Diversity of Carbohydrate-Active Enzymes

CAZymes play an essential role in the degradation as well as the biosynthesis of complex carbohydrates [38]. A wide range of organisms, including several species of Streptomyces, are known to produce these CAZymes [14] that can have environmental and industrial significance [106]. Recently, studies on the biodegradation potential of Kitasatospora have started to gain attention [16,17]. At present, there is only one Streptacidiphilus strain (S. bronchialis DSM 106435T) for which CAZyme annotations are available at CAZy database [38]. Therefore, in order to explore the diversity and distribution of CAZymes within Streptacidiphilus, the amino acid sequences of all strains belonging to any potential CAZy family were annotated by using the dbCAN2 meta server. On average, each strain of Streptacidiphilus contained 354 (±49.83) CAZy genes with one or more CAZy domains. A maximum number of 440 CAZy genes that encode various CAZymes were observed in S. jeojiense NRRL B-24555T genome, whereas only 268 CAZyme encoding genes were present in S. bronchialis DSM 106435T. However, in terms of percentage, S. carbonis NBRC 100919T was highest, with 5.98% of its genes containing one or more CAZy domains. Other species which contained ≥5% of CAZy genes included S. jeojiense NRRL B-24555T (5.66%), S. pinicola KCTC 49008T (5.24%), and S. oryzae TH49T (5%). These numbers are impressive and highest among all the three types of taxonomic groups (Table 4) including a well-known cellulolytic strain Streptomyces sp. SirexAA-E [107]. A large number of these CAZymes consisted of glycoside hydrolase (GH) domains that represented a diverse set of CAZy families. This indicates that the genus Streptacidiphilus, akin to other representatives of the family Streptomycetaceae such as Streptomyces, has a potential to degrade biomass containing cellulose, chitin, etc. in acidic environments.
Specifically, the set of enzymes required for the degradation of at least three different types of complex carbohydrates, viz. cellulose, hemi-cellulose and chitin, were explored in more detail.

3.6.1. Cellulose Degrading CAZymes or Cellulases

Cellulases are distributed in at least 11 different CAZy families, including GH5, GH6, GH7, GH8, GH9, GH12, GH44, GH45, GH48, GH61 and GH74 [108]. All 11 Streptacidiphilus strains contained at least one potential endoglucanase from GH5 and GH6 families. Specifically, GH5 enzymes were more abundant as compared to GH6 family, and could be as many as 11 in S. carbonis NBRC 100919T or 9 in S. pinicola KCTC 49008T. These numbers are much higher in comparison to GH6 enzymes found in Streptomyces and Kitasatospora (Table 5). However, the distribution of endoglucanases from other GH families (e.g., GH8, GH9, GH12 and GH44) differed among strains and ranged between 0 and 2. Among all the species S. bronchialis DSM 106435T possessed one additional endoglucanase from GH44 family. All Kitasatospora strains possessed one GH48 family, cellobiohydrolase, whereas some species of Streptacidiphilus and Str. albus DSM 41398T lacked this family enzyme. Additionally, at least one endoglucanase or cellobiohydrolase from the GH6 (having both endoglucanase as well as cellobiohydrolase activity) family was present in all three taxonomic groups. None of these strains possessed endoglucanase- or cellobiohydrolase-encoding genes from GH45 and GH7 families. A large number (10–17) of β-glucosidases from GH1 and GH3 families were found in all Streptacidiphilus strains, and the number of β-glucosidases from family GH3 were higher as compared to GH1 (Table 5). For Streptomyces, the number of β-glucosidases ranged between 8 (Str. avermitilis MA-4680T) and 14 (Str. coelicolor A3(2)). Kitasatospora, on the other hand, consisted of a limited number of these enzymes with the exception of K. azatica KCTC 9699T, in which 12 β-glucosidases from GH1 and GH3 families were observed.

3.6.2. Hemi-Cellulose Degrading CAZymes

The distribution of main hemi-cellulose degrading enzymes [109] in 11 Streptacidiphilus strains is summarized in Table 5. Notably, no endo-β-1,4-xylanase enzymes from families GH10, GH11 and GH30 were found in S. albus JL83T, while at least 10 genes that encode GH16 (xyloglycosyltransferase activity) family enzymes were present in its genome. This number is equal to the number of GH16 enzymes observed in K. mediocidica KCTC 9733T. Overall, the number of GH16 in Kitasatospora was higher than Streptacidiphilus and Streptomyces. In contrast, several endo-β-1,4-xylanase encoding genes including other xylan-degrading enzymes from different GH families were observed for S. bronchialis DSM 106435T and S. carbonis NBRC 100919T (Table 5). S. carbonis NBRC 100919T genome contained a maximum of 45 potential hemicellulase degrading enzymes (belonging to all xylan degrading GH families), which is highest among all the species used in this study. The only other species in which all the potential xylan-degrading enzymes were observed was K. azatica KCTC 9699T, with 35 such genes.
Both cellulases and hemicellulases may contain additional carbohydrate binding modules (CBMs) that may bind to the substrates [109]. Additionally, other CAZymes including redox enzymes such as lytic polysaccharide monooxygenases belonging to family AA10, xylan esterases (CE), and polysaccharide lyases (PL) may also be involved in the degradation of cellulose and hemicellulose [110]. The AA10 family enzymes were abundant in Streptomyces, especially in Str. coelicolor A3(2) and Streptomyces sp. SirexAA, both well known for their glycan degrading activities. However, it can be noted from Table 5 that among Streptacidiphilus, only S. bronchialis DSM 106435T, a host associated strain [30], possessed three AA10 family CAZy enzymes. In the case of Streptomyces, it has been reported that host-associated strains may have higher cellulolytic activity [14]. Similarly, only K. setae KM-6054T possessed enzymes belonging to AA10 family among the genus.

3.6.3. Chitin Degrading CAZymes

Based on amino acid sequence similarities, chitin-degrading enzymes are divided into three CAZy families (GH18, GH19 and GH20) [111]. All these enzymes were observed in all strains with the exception of S. oryzae TH49T, Str. avermitilis MA-4680T, and K. azatica KCTC 9699T, which seemed to lack chitinase from the GH19 family. Specifically, 132 GH18 family chitinases were distributed among the 11 Streptacidiphilus strains, and the number was highest in S. jeojiense NRRL B-24555T (17) and S. jiangxiensis NBRC 100920T (16). Within the genomes of known carbohydrate degraders Str. coelicolor A3(2) and Streptomyces sp. SirexAA, 12 and 10 GH18 family enzymes were present. In contrast, K. setae KM-6054T contained a maximum number of 19 GH18 enzymes among all species. When the amino acid sequences of 132 putative GH18 chitinases from Streptacidiphilus were compared with 80 chitinases from Streptomyces and Kitasatospora, the DXE catalytic motif [112] was well conserved in all these sequences despite a high degree of sequence variation (Figure S5). Further analysis of these 212 GH18 family sequences revealed that several of these enzymes could be grouped into 8 clusters based on their sequence identity (Figure S6). One sequence from Str. coelicolor A3(2) did not cluster with any of the sequences in the network and contained GH18 EndoS-like domain. This GH18 family endo-β-N-acetylglucosaminidase S (EndoS) is secreted by Streptococcus pyogenes and has the potential to hydrolyze the chitobiose core of asparagine-linked glycan on immunoglobulin G (IgG) [113]. The sequences in the well-defined clusters (clusters with 3 or more nodes) contained one or more domains in addition to the GH18 family domain and each cluster exhibited over-representation of certain domains. For example, several sequences of cluster 1 contained a fibronectin type 3 (FN3), or a chitin-binding domain of chitinase C (ChiC_BD) domains in addition to CBM_4_9 (annotated as CBM16 family by dbCAN) superfamily modules. In contrast, cluster 2 exhibited predominance of domains such as GH18_PF-ChiA-like, ricin-type β-trefoil lectin (ricin_B_lectin), and cellulose binding (CBM_2) domains. PF-ChiA is a chitinase present in the hyperthermophilic archaeon Pyrococcus furiosus with a GH18 catalytic activity. Although the catalytic residues are conserved in PF-ChiA, they mainly differ from the related GH18 family chitinases in their catalytic mechanism [114]. In addition to clusters 1 and 2, some sequences of cluster 3 also contained the FN3 domain, but the sequences in cluster 3 contained two FN3 domains instead of only one, as observed in the first two clusters. The FN3 domains are generally present between the catalytic and CBM modules [115].

4. Conclusions

This study provides the first analysis of high-quality genome sequences for Streptacidiphilus, and also the first comprehensive comparative genome analysis of eleven Streptacidiphilus and representative Kitasatospora and Streptomyces species, which highlighted the potential of this genus in terms of biosynthetic as well as biodegradation capabilities. In summary, we observe that hopene represents the core BGC in the family Streptomcetaceae since it was identified in the genome of each strain used in this study. Some BGCs such as macrotetrolide were more abundant in Streptacidiphilus as compared to Streptomyces and Kitasatospora. In contrast, albaflavenone and melanin, which are highly abundant in Streptomyces, were absent in Streptacidiphilus. Based on the similarities with existing BGCs, there is a high probability that the potential SMs produced by these Streptacidiphilus strains may be novel and the expected bioactivities may cover a wide range, including antibacterial, antifungal, antimalarial and antitumor activities. Similarly, some Streptacidiphilus species such as S. carbonis NBRC 100919T, S. jeojiense NRRL B-24555T, S. pinicola KCTC 49008T, and S. oryzae TH49T exhibited a higher number of genes that may have implications in biodegradation. Moreover, our pan-genome analysis suggests an open pan-genome for Streptacidiphilus and highlights that the unique genome is the least annotated as compared to the core and accessory genome. The biosynthetic gene clusters or enzymes (especially those representing the unique genome) highlighted in this study would provide a starting point to explore the genus for its full potential. With the availability of more high-quality genomes from this genus, new insights regarding the developmental processes or the biosynthesis of secondary metabolites in Streptomycetaceae might be revealed in more detail.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/11/10/1166/s1, Figure S1. Whole genome-based tree of the family Streptomycetaceae generated with TYGS. Streptomyces, Kitasatospora and Streptacidiphilus are indicated as red, green and blue respectively. The tree was rooted at the midpoint, and GBDP bootstrap support values > 50% are indicated as black circles. Figure S2. Phylogeny of the amino acid sequences involved in the biosynthesis of cell wall peptidoglycan. (A) Enzymes encoded by two different types of MurE genes. (B) Amino acid sequences encoded by dapF genes in each strain. The trees were constructed with 1000 replicates of bootstrap test and distance estimates were calculated based on the Jones-Taylor-Thornton (JTT) model. Figure S3. NRPS type BGCs with well-defined modular structure as detected by antiSMASH in 3 high-quality (≤10 contigs) Streptacidiphilus genomes. Figure S4. Comparison of the antiSMASH predicted modular structures. (A) Scabichelin producing NRPS enzymes from S. jeojiense NRRL B-24555T and Streptomyces scabiei 87.22. (B) hybrid T1PKS-NRPS cluster 5 from Streptacidiphilus albus JL83T and the antimycin producing BGC from Streptomyces sp. S4. The predicted structures and potential monomers are also shown. Figure S5. Multiple sequence alignment between GH18 family chitinases from the representatives of family Streptomycetaceae showing highly conserved DXE catalytic motif. (X = any amino acid). Figure S6. Sequence similarity network of GH18 enzymes found in Streptacidiphilus, Kitasatospora and Streptomyces. In addition to GH18 domain, several other types of domains were observed in these enzymes. Based on their sequence similarity, GH18 enzymes were grouped into multiple clusters. Each node represents a GH18 enzyme and edges represent the sequence identity between the two nodes. Two nodes were connected if they shared ≥ 40% sequence identity. List of over-represented domains is provided under each cluster. Table S1. Average nucleotide identity (ANI) scores (%) observed between 11 Streptacidiphilus genomes and selected representatives of Streptomyces and Kitasatospora. Table S2. Pan-genome of Streptacidiphilus mapped to KEGG pathways. (A) Core genes. (B) Accessory genes. (C) Unique genes. Table S3. Diversity of core structures predicted by antiSMASH in Streptacidiphilus. Structures from BGCs which consisted of 5 or more genes are shown only. Table S4. Predicted core lanthipeptides detected by antiSMASH in Streptacidiphilus. (A) Class I lanthipeptides. (B) Class II lanthipeptides.

Author Contributions

Conceptualization, A.M. and S.B.K.; Formal Analysis, Y.R.K. and A.M.; Writing—Original Draft Preparation, A.M.; Writing—Review & Editing, S.B.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Ministry of Science and ICT (MSIT) of Korean government (No. 2020R1F1A1076440).

Acknowledgments

The genome analysis of S. albus JL83T, S. oryzae TH49T and S. rugosus AM16T was conducted by the Joint Genome Institute (JGI), a DOE Office of Science User Facility, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hodgson, D.A. Primary metabolism and its control in streptomycetes: A most unusual group of bacteria. Adv. Microb. Physiol. 2000, 42, 47–238. [Google Scholar] [PubMed]
  2. Girard, G.; Willemse, J.; Zhu, H.; Claessen, D.; Bukarasam, K.; Goodfellow, M.; van Wezel, G.P. Analysis of novel kitasatosporae reveals significant evolutionary changes in conserved developmental genes between Kitasatospora and Streptomyces. Antonie Van Leeuwenhoek 2014, 106, 365–380. [Google Scholar] [CrossRef] [PubMed]
  3. Bentley, S.D.; Chater, K.F.; Cerdeno-Tarraga, A.M.; Challis, G.L.; Thomson, N.R.; James, K.D.; Harris, D.E.; Quail, M.A.; Kieser, H.; Harper, D.; et al. Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature 2002, 417, 141–147. [Google Scholar] [CrossRef] [PubMed]
  4. Roh, S.G.; Kim, M.K.; Park, S.; Yun, B.R.; Park, J.; Kim, M.J.; Kim, Y.S.; Kim, S.B. Streptacidiphilus pinicola sp. nov., isolated from pine grove soil. Int. J. Syst. Evol. Microbiol. 2018, 68, 3149–3155. [Google Scholar] [CrossRef] [PubMed]
  5. Kim, S.B.; Lonsdale, J.; Seong, C.N.; Goodfellow, M. Streptacidiphilus gen. nov., acidophilic actinomycetes with wall chemotype I and emendation of the family Streptomycetaceae (Waksman and Henrici (1943) AL) emend. Rainey et al. 1997. Antonie Van Leeuwenhoek 2003, 83, 107–116. [Google Scholar] [CrossRef]
  6. Takahashi, Y. Genus Kitasatospora, taxonomic features and diversity of secondary metabolites. J. Antibiot. 2017, 70, 506–513. [Google Scholar] [CrossRef] [Green Version]
  7. Lyons, A.J.; Pridham, T.G. Streptomyces griseus (Krainsky) Waksman and Henrici: A Taxonomic Study of Some Strains; United States Department of Agriculture Economic Research Service: Washington, DC, USA, 1966; Available online: http://ageconsearch.umn.edu/record/171447/files/tb1360.pdf (accessed on 17 July 2020).
  8. Burg, R.W.; Miller, B.M.; Baker, E.E.; Birnbaum, J.; Currie, S.A.; Hartman, R.; Kong, Y.L.; Monaghan, R.L.; Olson, G.; Putter, I.; et al. Avermectins, new family of potent anthelmintic agents: Producing organism and fermentation. Antimicrob. Agents Chemother. 1979, 15, 361–367. [Google Scholar] [CrossRef] [Green Version]
  9. Habib el, S.E.; Scarsdale, J.N.; Reynolds, K.A. Biosynthetic origin of hygromycin A. Antimicrob. Agents Chemother. 2003, 47, 2065–2071. [Google Scholar] [CrossRef] [Green Version]
  10. Omura, S.; Otoguro, K.; Nishikiori, T.; Oiwa, R.; Iwai, Y. Setamycin, a new antibiotic. J. Antibiot. 1981, 34, 1253–1256. [Google Scholar] [CrossRef] [Green Version]
  11. Takahashi, Y.; Iwai, Y.; S, O. Two new species of the genus Kitasatosporia, Kitasatosporia phosalacinea sp. nov. and Kitasatosporia griseola sp. nov. J. Gen. Appl. Microbiol. 1984, 30, 377–387. [Google Scholar] [CrossRef]
  12. Omura, S.; Murata, M.; Hanaki, H.; Hinotozawa, K.; Oiwa, R.; Tanaka, H. Phosalacine, a new herbicidal antibiotic containing phosphinothricin. Fermentation, isolation, biological activity and mechanism of action. J. Antibiot. 1984, 37, 829–835. [Google Scholar] [CrossRef] [PubMed]
  13. Wu, C.; van Wezel, G.P.; Hae Choi, Y. Identification of novel endophenaside antibiotics produced by Kitasatospora sp. MBT66. J. Antibiot. 2015, 68, 445–452. [Google Scholar] [CrossRef] [PubMed]
  14. Book, A.J.; Lewin, G.R.; McDonald, B.R.; Takasuka, T.E.; Wendt-Pienkowski, E.; Doering, D.T.; Suh, S.; Raffa, K.F.; Fox, B.G.; Currie, C.R. Evolution of high cellulolytic activity in symbiotic Streptomyces through selection of expanded gene content and coordinated gene expression. PLoS Biol. 2016, 14, e1002475. [Google Scholar] [CrossRef] [PubMed]
  15. Lacombe-Harvey, M.E.; Brzezinski, R.; Beaulieu, C. Chitinolytic functions in actinobacteria: Ecology, enzymes, and evolution. Appl. Microbiol. Biotechnol. 2018, 102, 7219–7230. [Google Scholar] [CrossRef] [Green Version]
  16. Sawaguchi, A.; Ono, S.; Oomura, M.; Inami, K.; Kumeta, Y.; Honda, K.; Sameshima-Saito, R.; Sakamoto, K.; Ando, A.; Saito, A. Chitosan degradation and associated changes in bacterial community structures in two contrasting soils. Soil Sci. Plant Nutr. 2015, 61, 471–480. [Google Scholar] [CrossRef] [Green Version]
  17. Zitouni, M.; Viens, P.; Ghinet, M.G.; Brzezinski, R. Diversity of family GH46 chitosanases in Kitasatospora setae KM-6054. Appl. Microbiol. Biotechnol. 2017, 101, 7877–7888. [Google Scholar] [CrossRef] [Green Version]
  18. Hwang, S.; Yun, Y.; Choi, W.H.; Kim, S.B.; Shin, J.; Lee, M.J.; Oh, D.C. Acidiphilamides A-E, modified peptides as autophagy inhibitors from an acidophilic actinobacterium, Streptacidiphilus rugosus. J. Nat. Prod. 2019, 82, 341–348. [Google Scholar] [CrossRef]
  19. Newman, D. Screening and identification of novel biologically active natural compounds. F1000Research 2017, 6, 783. [Google Scholar] [CrossRef] [Green Version]
  20. Omura, S.; Ikeda, H.; Ishikawa, J.; Hanamoto, A.; Takahashi, C.; Shinose, M.; Takahashi, Y.; Horikawa, H.; Nakazawa, H.; Osonoe, T.; et al. Genome sequence of an industrial microorganism Streptomyces avermitilis: Deducing the ability of producing secondary metabolites. Proc. Natl. Acad. Sci. USA 2001, 98, 12215–12220. [Google Scholar] [CrossRef] [Green Version]
  21. Udwary, D.W.; Zeigler, L.; Asolkar, R.N.; Singan, V.; Lapidus, A.; Fenical, W.; Jensen, P.R.; Moore, B.S. Genome sequencing reveals complex secondary metabolome in the marine actinomycete Salinispora tropica. Proc. Natl. Acad. Sci. USA 2007, 104, 10376–10381. [Google Scholar] [CrossRef] [Green Version]
  22. Cho, S.H.; Han, J.H.; Seong, C.N.; Kim, S.B. Phylogenetic diversity of acidophilic sporoactinobacteria isolated from various soils. J. Microbiol. 2006, 44, 600–606. [Google Scholar] [PubMed]
  23. Cho, S.H.; Han, J.H.; Ko, H.Y.; Kim, S.B. Streptacidiphilus anmyonensis sp. nov., Streptacidiphilus rugosus sp. nov. and Streptacidiphilus melanogenes sp. nov., acidophilic actinobacteria isolated from Pinus soils. Int. J. Syst. Evol. Microbiol. 2008, 58, 1566–1570. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Williams, S.T.; Khan, M.R. Antibiotics—A soil microbiologist’s viewpoint. Postepy Hig. I Med. Dosw. 1974, 28, 395–408. [Google Scholar]
  25. Williams, S.T.; Robinson, C.S. The role of streptomycetes in decomposition of chitin in acidic soils. J. Gen. Microbiol. 1981, 127, 55–63. [Google Scholar] [CrossRef] [Green Version]
  26. Iuv, A.; Zenova, G.M. Antagonistic activity of soil acidophilic actinomycetes. Izv. Akad. Nauk. Seriia Biol. 2007, 4, 402–405. [Google Scholar]
  27. Poomthongdee, N.; Duangmal, K.; Pathom-aree, W. Acidophilic actinomycetes from rhizosphere soil: Diversity and properties beneficial to plants. J. Antibiot. 2015, 68, 106–114. [Google Scholar] [CrossRef]
  28. Kim, Y.R.; Park, S.; Kim, T.S.; Kim, M.K.; Han, J.H.; Joung, Y.; Kim, S.B. Draft genome sequence of Streptacidiphilus oryzae TH49T, an acidophilic actinobacterium isolated from soil. Genome Announc. 2015, 3. [Google Scholar] [CrossRef] [Green Version]
  29. Komaki, H.; Ichikawa, N.; Oguchi, A.; Hamada, M.; Tamura, T.; Fujita, N. Draft genome sequences of six type strains of the genus Streptacidiphilus. Genome Announc. 2015, 3. [Google Scholar] [CrossRef] [Green Version]
  30. Arthur, R.A.; Gulvik, C.A.; Humrighouse, B.W.; Lasker, B.A.; Batra, D.; Rowe, L.A.; Igual, J.M.; Nouioui, I.; Klenk, H.P.; McQuiston, J.R. Complete genome sequence of Streptacidiphilus sp. strain 15-057A, obtained from bronchial lavage fluid. Microbiol. Resour. Announc. 2018, 7. [Google Scholar] [CrossRef] [Green Version]
  31. Eid, J.; Fehr, A.; Gray, J.; Luong, K.; Lyle, J.; Otto, G.; Peluso, P.; Rank, D.; Baybayan, P.; Bettman, B.; et al. Real-time DNA sequencing from single polymerase molecules. Science 2009, 323, 133–138. [Google Scholar] [CrossRef]
  32. Chin, C.S.; Alexander, D.H.; Marks, P.; Klammer, A.A.; Drake, J.; Heiner, C.; Clum, A.; Copeland, A.; Huddleston, J.; Eichler, E.E.; et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 2013, 10, 563–569. [Google Scholar] [CrossRef] [PubMed]
  33. Hyatt, D.; Chen, G.L.; Locascio, P.F.; Land, M.L.; Larimer, F.W.; Hauser, L.J. Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 2010, 11, 119. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Huerta-Cepas, J.; Forslund, K.; Coelho, L.P.; Szklarczyk, D.; Jensen, L.J.; von Mering, C.; Bork, P. Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol. Bio. Evol. 2017, 34, 2115–2122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Moriya, Y.; Itoh, M.; Okuda, S.; Yoshizawa, A.C.; Kanehisa, M. KAAS: An automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007, 35, W182–W185. [Google Scholar] [CrossRef] [Green Version]
  36. Blin, K.; Wolf, T.; Chevrette, M.G.; Lu, X.; Schwalen, C.J.; Kautsar, S.A.; Suarez Duran, H.G.; de Los Santos, E.L.C.; Kim, H.U.; Nave, M.; et al. antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res. 2017, 45, W36–W41. [Google Scholar] [CrossRef]
  37. Zhang, H.; Yohe, T.; Huang, L.; Entwistle, S.; Wu, P.; Yang, Z.; Busk, P.K.; Xu, Y.; Yin, Y. dbCAN2: A meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2018, 46, W95–W101. [Google Scholar] [CrossRef] [Green Version]
  38. Lombard, V.; Golaconda Ramulu, H.; Drula, E.; Coutinho, P.M.; Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014, 42, D490–D495. [Google Scholar] [CrossRef] [Green Version]
  39. Gerlt, J.A.; Bouvier, J.T.; Davidson, D.B.; Imker, H.J.; Sadkhin, B.; Slater, D.R.; Whalen, K.L. Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST): A web tool for generating protein sequence similarity networks. Biochim. Biophys. Acta 2015, 1854, 1019–1037. [Google Scholar] [CrossRef] [Green Version]
  40. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
  41. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  42. Meier-Kolthoff, J.P.; Göker, M. TYGS is an automated high-throughput platform for state-of-the-art genome-based taxonomy. Nat. Commun. 2019, 10, 2182. [Google Scholar] [CrossRef] [PubMed]
  43. Yoon, S.H.; Ha, S.M.; Lim, J.; Kwon, S.; Chun, J. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Leeuwenhoek 2017, 110, 1281–1286. [Google Scholar] [CrossRef] [PubMed]
  44. Chaudhari, N.M.; Gupta, V.K.; Dutta, C. BPGA—An ultra-fast pan-genome analysis pipeline. Sci. Rep. 2016, 6, 24373. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Zhao, Y.; Jia, X.; Yang, J.; Ling, Y.; Zhang, Z.; Yu, J.; Wu, J.; Xiao, J. PanGP: A tool for quickly analyzing bacterial pan-genome profile. Bioinformatics 2014, 30, 1297–1299. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Huang, X.; Kong, F.; Zhou, S.; Huang, D.; Zheng, J.; Zhu, W. Streptomyces tirandamycinicus sp. nov., a novel marine sponge-derived actinobacterium with antibacterial potential against Streptococcus agalactiae. Front. Microbiol. 2019, 10, 482. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Kuo, C.H.; Ochman, H. The extinction dynamics of bacterial pseudogenes. PLoS Genet. 2010, 6, e1001050. [Google Scholar] [CrossRef] [PubMed]
  48. Labeda, D.P.; Dunlap, C.A.; Rong, X.; Huang, Y.; Doroghazi, J.R.; Ju, K.S.; Metcalf, W.W. Phylogenetic relationships in the family Streptomycetaceae using multi-locus sequence analysis. Antonie Van Leeuwenhoek. 2017, 110, 563–583. [Google Scholar] [CrossRef]
  49. Nouioui, I.; Klenk, H.-P.; Igual, J.M.; Gulvik, C.A.; Lasker, B.A.; McQuiston, J.R. Streptacidiphilus bronchialis sp. nov., a ciprofloxacin-resistant bacterium from a human clinical specimen; reclassification of Streptomyces griseoplanus as Streptacidiphilus griseoplanus comb. nov. and emended description of the genus Streptacidiphilus. Int. J. Syst. Evol. Microbiol. 2019, 69, 1047–1056. [Google Scholar] [CrossRef]
  50. Huang, Y.; Cui, Q.; Wang, L.; Rodriguez, C.; Quintana, E.; Goodfellow, M.; Liu, Z. Streptacidiphilus jiangxiensis sp. nov., a novel actinomycete isolated from acidic rhizosphere soil in China. Antonie Van Leeuwenhoek 2004, 86, 159–165. [Google Scholar] [CrossRef]
  51. Wang, L.; Huang, Y.; Liu, Z.; Goodfellow, M.; Rodríguez, C. Streptacidiphilus oryzae sp. nov., an actinomycete isolated from rice-field soil in Thailand. Int. J. Syst. Evol. Microbiol. 2006, 56, 1257–1261. [Google Scholar] [CrossRef] [Green Version]
  52. Nouioui, I.; Carro, L.; García-López, M.; Meier-Kolthoff, J.P.; Woyke, T.; Kyrpides, N.C.; Pukall, R.; Klenk, H.P.; Goodfellow, M.; Göker, M. Streptomyces albus (Rossi Doria 1891) Waksman and Henrici 1943 emend. Genome-Based Taxonomic Classification of the Phylum Actinobacteria. Front. Microbiol. 2018, 9, 2007. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Kim, S.B.; Goodfellow, M. Streptomyces avermitilis sp. nov., nom. rev., a taxonomic home for the avermectin-producing streptomycetes. Int. J. Syst. Evol. Microbiol. 2002, 52, 2011–2014. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Hodgson, D.A. Carbohydrate Utilisation in Streptomyces coelicolor A3(2). Ph.D. Thesis, University of East Anglia, Norwich, UK, 1980. Available online: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.258081 (accessed on 17 July 2020).
  55. Zhang, Z.; Wang, Y.; Ruan, J. A proposal to revive the genus Kitasatospora (Omura, Takahashi, Iwai, and Tanaka 1982). Int. J. Syst. Bacteriol. 1997, 47, 1048–1054. [Google Scholar] [CrossRef] [PubMed]
  56. Labeda, D.P. Kitasatosporia mediocidica sp. nov. Int. J. Syst. Bacteriol. 1988, 38, 287–290. [Google Scholar] [CrossRef] [Green Version]
  57. Omura, S.; Takahashi, Y.; Iwai, Y.; Tanaka, H. Kitasatosporia, a new genus of the order Actinomycetales. J. Antibiot. 1982, 35, 1013–1019. [Google Scholar] [CrossRef] [Green Version]
  58. Vigil Stenman, C.T. Effects and Dynamics of Insertion Sequences in the Evolution of Cyanobacteria. Ph.D. Thesis, Stockholm University, Stockholm, Sweden, 2015. [Google Scholar]
  59. Vigil-Stenman, T.; Ininbergs, K.; Bergman, B.; Ekman, M. High abundance and expression of transposases in bacteria from the Baltic Sea. ISME J. 2017, 11, 2611–2623. [Google Scholar] [CrossRef] [Green Version]
  60. Zhang, M.M.; Wong, F.T.; Wang, Y.; Luo, S.; Lim, Y.H.; Heng, E.; Yeo, W.L.; Cobb, R.E.; Enghiad, B.; Ang, E.L.; et al. CRISPR-Cas9 strategy for activation of silent Streptomyces biosynthetic gene clusters. Nat. Chem. Biol. 2017. [Google Scholar] [CrossRef]
  61. Tao, W.; Yang, A.; Deng, Z.; Sun, Y. CRISPR/Cas9-based editing of Streptomyces for discovery, characterization, and production of natural products. Front. Microbiol. 2018, 9, 1660. [Google Scholar] [CrossRef] [Green Version]
  62. Wilkens, S. Structure and mechanism of ABC transporters. F1000Prime Rep. 2015, 7, 14. [Google Scholar] [CrossRef]
  63. Kim, J.N.; Kim, Y.; Jeong, Y.; Roe, J.H.; Kim, B.G.; Cho, B.K. Comparative genomics reveals the core and accessory genomes of Streptomyces species. J. Microbiol. Biotechnol. 2015, 25, 1599–1605. [Google Scholar] [CrossRef] [Green Version]
  64. Medini, D.; Donati, C.; Tettelin, H.; Masignani, V.; Rappuoli, R. The microbial pan-genome. Curr. Opin. Genet. Dev. 2005, 15, 589–594. [Google Scholar] [CrossRef] [PubMed]
  65. De Maayer, P.; Chan, W.Y.; Rubagotti, E.; Venter, S.N.; Toth, I.K.; Birch, P.R.; Coutinho, T.A. Analysis of the Pantoea ananatis pan-genome reveals factors underlying its ability to colonize and interact with plant, insect and vertebrate hosts. BMC Genom. 2014, 15, 404. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Callister, S.J.; McCue, L.A.; Turse, J.E.; Monroe, M.E.; Auberry, K.J.; Smith, R.D.; Adkins, J.N.; Lipton, M.S. Comparative bacterial proteomics: Analysis of the core genome concept. PLoS ONE 2008, 3, e1542. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Bibb, M.J. Regulation of secondary metabolism in streptomycetes. Curr. Opin. Microbiol. 2005, 8, 208–215. [Google Scholar] [CrossRef] [PubMed]
  68. Méndez, C.; Salas, J.A. The role of ABC transporters in antibiotic-producing organisms: Drug secretion and resistance mechanisms. Res. Microbiol. 2001, 152, 341–350. [Google Scholar] [CrossRef]
  69. Campbell, J.W.; Cronan, J.E., Jr. Bacterial fatty acid biosynthesis: Targets for antibacterial drug discovery. Annu. Rev. Microbiol. 2001, 55, 305–332. [Google Scholar] [CrossRef]
  70. Manuse, S.; Fleurie, A.; Zucchini, L.; Lesterlin, C.; Grangeasse, C. Role of eukaryotic-like serine/threonine kinases in bacterial cell division and morphogenesis. FEMS Microbiol. Rev. 2016, 40, 41–56. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Petříčková, K.; Petříček, M. Eukaryotic-type protein kinases in Streptomyces coelicolor: Variations on a common theme. Microbiology 2003, 149, 1609–1621. [Google Scholar] [CrossRef] [Green Version]
  72. Claessen, D.; de Jong, W.; Dijkhuizen, L.; Wosten, H.A. Regulation of Streptomyces development: Reach for the sky! Trends Microbiol. 2006, 14, 313–319. [Google Scholar] [CrossRef] [Green Version]
  73. Den Hengst, C.D.; Tran, N.T.; Bibb, M.J.; Chandra, G.; Leskiw, B.K.; Buttner, M.J. Genes essential for morphological development and antibiotic production in Streptomyces coelicolor are targets of BldD during vegetative growth. Mol. Microbiol. 2010, 78, 361–379. [Google Scholar] [CrossRef] [Green Version]
  74. El-Gebali, S.; Mistry, J.; Bateman, A.; Eddy, S.R.; Luciani, A.; Potter, S.C.; Qureshi, M.; Richardson, L.J.; Salazar, G.A.; Smart, A.; et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2019, 47, D427–D432. [Google Scholar] [CrossRef] [PubMed]
  75. Chater, K.F.; Chandra, G. The evolution of development in Streptomyces analysed by genome comparisons. FEMS Microbiol. Rev. 2006, 30, 651–672. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Flardh, K.; Buttner, M.J. Streptomyces morphogenetics: Dissecting differentiation in a filamentous bacterium. Nat. Rev. Microbiol. 2009, 7, 36–49. [Google Scholar] [CrossRef] [PubMed]
  77. McCormick, J.R.; Flardh, K. Signals and regulators that govern Streptomyces development. FEMS Microbiol. Rev. 2012, 36, 206–231. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  78. Lin, X.; Li, N.; Kudo, H.; Zhang, Z.; Li, J.; Wang, L.; Zhang, W.; Takechi, K.; Takano, H. Genes sufficient for synthesizing peptidoglycan are retained in Gymnosperm genomes, and MurE from Larix gmelinii can rescue the albino phenotype of Arabidopsis MurE mutation. Plant Cell Physiol. 2017, 58, 587–597. [Google Scholar] [CrossRef] [PubMed]
  79. Richaud, C.; Higgins, W.; Mengin-Lecreulx, D.; Stragier, P. Molecular cloning, characterization, and chromosomal localization of dapF, the Escherichia coli gene for diaminopimelate epimerase. J. Bacteriol. 1987, 169, 1454–1459. [Google Scholar] [CrossRef] [Green Version]
  80. Ichikawa, N.; Oguchi, A.; Ikeda, H.; Ishikawa, J.; Kitani, S.; Watanabe, Y.; Nakamura, S.; Katano, Y.; Kishi, E.; Sasagawa, M.; et al. Genome sequence of Kitasatospora setae NBRC 14216T: An evolutionary snapshot of the family Streptomycetaceae. DNA Res. 2010, 17, 393–406. [Google Scholar] [CrossRef]
  81. Hwang, J.Y.; Kim, S.H.; Oh, H.R.; Kwon, E.; Nam, D.H. Analysis of a draft genome sequence of Kitasatospora cheerisanensis KCTC 2395 producing bafilomycin antibiotics. J. Microbiol. 2015, 53, 84–89. [Google Scholar] [CrossRef]
  82. Blin, K.; Pascal Andreu, V.; de Los Santos, E.L.C.; Del Carratore, F.; Lee, S.Y.; Medema, M.H.; Weber, T. The antiSMASH database version 2: A comprehensive resource on secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. 2019, 47, D625–D630. [Google Scholar] [CrossRef] [Green Version]
  83. Rascher, A.; Hu, Z.; Viswanathan, N.; Schirmer, A.; Reid, R.; Nierman, W.C.; Lewis, M.; Hutchinson, C.R. Cloning and characterization of a gene cluster for geldanamycin production in Streptomyces hygroscopicus NRRL 3602. FEMS Microbiol. Lett. 2003, 218, 223–230. [Google Scholar] [CrossRef] [Green Version]
  84. Kautsar, S.A.; Blin, K.; Shaw, S.; Navarro-Muñoz, J.C.; Terlouw, B.R.; van der Hooft, J.J.J.; van Santen, J.A.; Tracanna, V.; Suarez Duran, H.G.; Pascal Andreu, V.; et al. MIBiG 2.0: A repository for biosynthetic gene clusters of known function. Nucleic Acids Res. 2020, 48, D454–D458. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  85. McAlpine, J.B.; Bachmann, B.O.; Piraee, M.; Tremblay, S.; Alarco, A.M.; Zazopoulos, E.; Farnet, C.M. Microbial genomics as a guide to drug discovery and structural elucidation: ECO-02301, a novel antifungal agent, as an example. J. Nat. Prod. 2005, 68, 493–496. [Google Scholar] [CrossRef] [PubMed]
  86. Wyatt, M.A.; Ahilan, Y.; Argyropoulos, P.; Boddy, C.N.; Magarvey, N.A.; Harrison, P.H. Biosynthesis of ebelactone A: Isotopic tracer, advanced precursor and genetic studies reveal a thioesterase-independent cyclization to give a polyketide β-lactone. J. Antibiot. 2013, 66, 421–430. [Google Scholar] [CrossRef] [PubMed]
  87. Rounge, T.B.; Rohrlack, T.; Tooming-Klunderud, A.; Kristensen, T.; Jakobsen, K.S. Comparison of cyanopeptolin genes in Planktothrix, Microcystis, and Anabaena strains: Evidence for independent evolution within each genus. Appl. Environ. Microbiol. 2007, 73, 7322–7330. [Google Scholar] [CrossRef] [Green Version]
  88. Kodani, S.; Bicz, J.; Song, L.; Deeth, R.J.; Ohnishi-Kameyama, M.; Yoshida, M.; Ochi, K.; Challis, G.L. Structure and biosynthesis of scabichelin, a novel tris-hydroxamate siderophore produced by the plant pathogen Streptomyces scabies 87.22. Org. Biomol. Chem. 2013, 11, 4686–4694. [Google Scholar] [CrossRef] [Green Version]
  89. Izawa, M.; Kawasaki, T.; Hayakawa, Y. Cloning and heterologous expression of the thioviridamide biosynthesis gene cluster from Streptomyces olivoviridis. Appl. Environ. Microbiol. 2013, 79, 7110–7113. [Google Scholar] [CrossRef] [Green Version]
  90. Santos, C.L.; Correia-Neves, M.; Moradas-Ferreira, P.; Mendes, M.V. A walk into the LuxR regulators of Actinobacteria: Phylogenomic distribution and functional diversity. PLoS ONE 2012, 7, e46758. [Google Scholar] [CrossRef] [Green Version]
  91. Seipke, R.F.; Hutchings, M.I. The regulation and biosynthesis of antimycins. Beilstein J. Org. Chem. 2013, 9, 2556–2563. [Google Scholar] [CrossRef] [Green Version]
  92. Perez, M.; Schleissner, C.; Fernandez, R.; Rodriguez, P.; Reyes, F.; Zuniga, P.; de la Calle, F.; Cuevas, C. PM100117 and PM100118, new antitumor macrolides produced by a marine Streptomyces caniferus GUA-06-05-006A. J. Antibiot. 2016, 69, 388–394. [Google Scholar] [CrossRef]
  93. Zhang, Q.; Yu, Y.; Velasquez, J.E.; van der Donk, W.A. Evolution of lanthipeptide synthetases. Proc. Natl. Acad. Sci. USA 2012, 109, 18361–18366. [Google Scholar] [CrossRef] [Green Version]
  94. Repka, L.M.; Chekan, J.R.; Nair, S.K.; van der Donk, W.A. Mechanistic understanding of lanthipeptide biosynthetic enzymes. Chem. Rev. 2017, 117, 5457–5520. [Google Scholar] [CrossRef] [PubMed]
  95. Oldfield, E.; Lin, F.Y. Terpene biosynthesis: Modularity rules. Angew. Chem. Int. Ed. Engl. 2012, 51, 1124–1137. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  96. Hay, R. Investigation of the Role of Gene Clusters in Terpene Biosynthesis in Sorghum Bicolor. Theses 2018. Available online: https://irl.umsl.edu/thesis/320 (accessed on 3 February 2020).
  97. Durr, C.; Schnell, H.J.; Luzhetskyy, A.; Murillo, R.; Weber, M.; Welzel, K.; Vente, A.; Bechthold, A. Biosynthesis of the terpene phenalinolactone in Streptomyces sp. Tu6071: Analysis of the gene cluster and generation of derivatives. Chem. Biol. 2006, 13, 365–377. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  98. Yamada, Y.; Kuzuyama, T.; Komatsu, M.; Shin-Ya, K.; Omura, S.; Cane, D.E.; Ikeda, H. Terpene synthases are widely distributed in bacteria. 2015. Proc. Natl. Acad. Sci. USA 2015, 112, 857–862. [Google Scholar] [CrossRef] [Green Version]
  99. Dairi, T.; Hamano, Y.; Kuzuyama, T.; Itoh, N.; Furihata, K.; Seto, H. Eubacterial diterpene cyclase genes essential for production of the isoprenoid antibiotic terpentecin. J. Bacteriol. 2001, 183, 6085–6094. [Google Scholar] [CrossRef] [Green Version]
  100. Hayashi, Y.; Matsuura, N.; Toshima, H.; Itoh, N.; Ishikawa, J.; Mikami, Y.; Dairi, T. Cloning of the gene cluster responsible for the biosynthesis of brasilicardin A, a unique diterpenoid. J. Antibiot. 2008, 61, 164–174. [Google Scholar] [CrossRef] [Green Version]
  101. Kannenberg, E.L.; Poralla, K. Hopanoid biosynthesis and function in bacteria. Naturwissenschaften 1999, 86, 168–176. [Google Scholar] [CrossRef]
  102. Ward, A.C.; Allenby, N.E. Genome mining for the search and discovery of bioactive compounds: The Streptomyces paradigm. FEMS Microbiol. Lett. 2018, 365. [Google Scholar] [CrossRef]
  103. Sadeghi, A.; Soltani, B.M.; Nekouei, M.K.; Jouzani, G.S.; Mirzaei, H.H.; Sadeghizadeh, M. Diversity of the ectoines biosynthesis genes in the salt tolerant Streptomyces and evidence for inductive effect of ectoines on their accumulation. Microbiol. Res. 2014, 169, 699–708. [Google Scholar] [CrossRef]
  104. Smith, W.C.; Xiang, L.; Shen, B. Genetic localization and molecular characterization of the nonS gene required for macrotetrolide biosynthesis in Streptomyces griseus DSM40695. Antimicrob. Agents Chemother. 2000, 44, 1809–1817. [Google Scholar] [CrossRef] [Green Version]
  105. Zhu, J.; Chen, W.; Li, Y.Y.; Deng, J.J.; Zhu, D.Y.; Duan, J.; Liu, Y.; Shi, G.Y.; Xie, C.; Wang, H.X.; et al. Identification and catalytic characterization of a nonribosomal peptide synthetase-like (NRPS-like) enzyme involved in the biosynthesis of echosides from Streptomyces sp. LZ35. Gene 2014, 546, 352–358. [Google Scholar] [CrossRef] [PubMed]
  106. Andre, I.; Potocki-Veronese, G.; Barbe, S.; Moulis, C.; Remaud-Simeon, M. CAZyme discovery and design for sweet dreams. Curr. Opin. Chem. Biol. 2014, 19, 17–24. [Google Scholar] [CrossRef] [PubMed]
  107. Adams, A.S.; Jordan, M.S.; Adams, S.M.; Suen, G.; Goodwin, L.A.; Davenport, K.W.; Currie, C.R.; Raffa, K.F. Cellulose-degrading bacteria associated with the invasive woodwasp Sirex noctilio. ISME J. 2011, 5, 1323–1331. [Google Scholar] [CrossRef] [PubMed]
  108. Lakhundi, S.; Siddiqui, R.; Khan, N.A. Cellulose degradation: A therapeutic strategy in the improved treatment of Acanthamoeba infections. Parasit. Vectors 2015, 8, 23. [Google Scholar] [CrossRef] [Green Version]
  109. Lopez-Mondejar, R.; Zuhlke, D.; Becher, D.; Riedel, K.; Baldrian, P. Cellulose and hemicellulose decomposition by forest soil bacteria proceeds by the action of structurally variable enzymatic systems. Sci. Rep. 2016, 6, 25279. [Google Scholar] [CrossRef]
  110. Koeck, D.E.; Pechtl, A.; Zverlov, V.V.; Schwarz, W.H. Genomics of cellulolytic bacteria. Curr. Opin. Biotechnol. 2014, 29, 171–183. [Google Scholar] [CrossRef]
  111. Swiontek Brzezinska, M.; Jankiewicz, U.; Burkowska, A.; Walczak, M. Chitinolytic microorganisms and their possible application in environmental protection. Curr. Microbiol. 2014, 68, 71–81. [Google Scholar] [CrossRef] [Green Version]
  112. Synstad, B.; Gaseidnes, S.; Van Aalten, D.M.; Vriend, G.; Nielsen, J.E.; Eijsink, V.G. Mutational and computational analysis of the role of conserved residues in the active site of a family 18 chitinase. Eur. J. Biochem. 2004, 271, 253–262. [Google Scholar] [CrossRef] [Green Version]
  113. Collin, M.; Olsén, A. Effect of SpeB and EndoS from Streptococcus pyogenes on human immunoglobulins. Infect. Immun. 2001, 69, 7187–7189. [Google Scholar] [CrossRef] [Green Version]
  114. Nakamura, T.; Mine, S.; Hagihara, Y.; Ishikawa, K.; Uegaki, K. Structure of the catalytic domain of the hyperthermophilic chitinase from Pyrococcus furiosus. Acta Cryst. Sect. F Struct. Biol. Cryst. Commun. 2007, 63, 7–11. [Google Scholar] [CrossRef]
  115. Manjeet, K.; Purushotham, P.; Neeraja, C.; Podile, A.R. Bacterial chitin binding proteins show differential substrate binding and synergy with chitinases. Microbiol. Res. 2013, 168, 461–468. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Neighbor-joining tree based on 16S rRNA gene sequences showing the relationships between 15 currently recognized species of Streptacidiphilus. Numbers at the nodes represent the bootstrap support (%), and the scale bar the substitutions rates per nucleotide position.
Figure 1. Neighbor-joining tree based on 16S rRNA gene sequences showing the relationships between 15 currently recognized species of Streptacidiphilus. Numbers at the nodes represent the bootstrap support (%), and the scale bar the substitutions rates per nucleotide position.
Genes 11 01166 g001
Figure 2. Distribution of COG categories in Streptacidiphilus genomes. (A) Full proteomes of Streptacidiphilus species, and for additional comparison representative strains from Kitasatospora and Streptomyces are added. (B) Core genome, (C) accessory genome, (D) unique genome, (E) average comparison, and (F) KEGG orthologs. The COG categories indicate RNA processing and modification (A), chromatin structure and dynamics (B), energy production and conversion (C), cell cycle control, cell division, chromosome partitioning (D), amino acid metabolism and transport (E), nucleotide metabolism and transport (F), carbohydrate metabolism and transport (G), coenzyme metabolism and transport (H), lipid metabolism and transport (I), translation, ribosomal structure and biogenesis (J), transcription (K), replication, recombination and repair (L), cell wall/membrane/envelop biogenesis (M), post-translational modification, protein turnover, chaperone functions (O), inorganic ion transport and metabolism (P), secondary metabolites biosynthesis, transport and catabolism (Q), function unknown (S), signal transduction mechanisms (T), intracellular trafficking, secretion, and vesicular transport (U), and defense mechanisms (V), respectively.
Figure 2. Distribution of COG categories in Streptacidiphilus genomes. (A) Full proteomes of Streptacidiphilus species, and for additional comparison representative strains from Kitasatospora and Streptomyces are added. (B) Core genome, (C) accessory genome, (D) unique genome, (E) average comparison, and (F) KEGG orthologs. The COG categories indicate RNA processing and modification (A), chromatin structure and dynamics (B), energy production and conversion (C), cell cycle control, cell division, chromosome partitioning (D), amino acid metabolism and transport (E), nucleotide metabolism and transport (F), carbohydrate metabolism and transport (G), coenzyme metabolism and transport (H), lipid metabolism and transport (I), translation, ribosomal structure and biogenesis (J), transcription (K), replication, recombination and repair (L), cell wall/membrane/envelop biogenesis (M), post-translational modification, protein turnover, chaperone functions (O), inorganic ion transport and metabolism (P), secondary metabolites biosynthesis, transport and catabolism (Q), function unknown (S), signal transduction mechanisms (T), intracellular trafficking, secretion, and vesicular transport (U), and defense mechanisms (V), respectively.
Genes 11 01166 g002aGenes 11 01166 g002b
Figure 3. The pan-genome of Streptacidiphilus based on 11 species. (A) Core vs. pan developmental plot of eleven Streptacidiphilus genomes. The blue line indicates an increase in the number of genes in a pan-genome on sequential addition of genomes, and suggests an open pan-genome for this genus. In contrast, the number of core genes shared across these genomes decreases with the sequential addition of genomes, indicated by the red line. (B) Plot representing the number of new unique genes as a function of the number of strains added sequentially.
Figure 3. The pan-genome of Streptacidiphilus based on 11 species. (A) Core vs. pan developmental plot of eleven Streptacidiphilus genomes. The blue line indicates an increase in the number of genes in a pan-genome on sequential addition of genomes, and suggests an open pan-genome for this genus. In contrast, the number of core genes shared across these genomes decreases with the sequential addition of genomes, indicated by the red line. (B) Plot representing the number of new unique genes as a function of the number of strains added sequentially.
Genes 11 01166 g003
Figure 4. Most abundant known BGCs in Streptacidiphilus. Bar color indicates the classification of each cluster type in MIBiG database. Numbers above the bars indicate the range of percent similarity shown the clusters against known BGCs.
Figure 4. Most abundant known BGCs in Streptacidiphilus. Bar color indicates the classification of each cluster type in MIBiG database. Numbers above the bars indicate the range of percent similarity shown the clusters against known BGCs.
Genes 11 01166 g004
Table 1. Features of 11 Streptacidiphilus genomes and their comparison with representatives from Streptomyces and Kitasatospora. Species: 1, Streptacidiphilus albus JL83T; 2, Streptacidiphilus anmyonensis NBRC 103185T; 3, Streptacidiphilus bronchialis DSM 106435T; 4, Streptacidiphilus carbonis NBRC 100919T; 5, Streptacidiphilus jeojiense NRRL B-24555T; 6, Streptacidiphilus jiangxiensis NBRC 100920T; 7, Streptacidiphilus melanogenes NBRC 103184T; 8, Streptacidiphilus neutrinimicus NBRC 100921T; 9, Streptacidiphilus oryzae TH49T; 10, Streptacidiphilus pinicola KCTC 49008T; 11, Streptacidiphilus rugosus AM-16T; 12, Streptomyces albus DSM 41398T; 13, Streptomyces avermitilis MA-4680T; 14, Streptomyces coelicolor A3(2); 15, Kitasatospora azatica KCTC 9699T; 16, Kitasatospora mediocidica KCTC 9733T; 17, Kitasatospora setae KM-6054T.
Table 1. Features of 11 Streptacidiphilus genomes and their comparison with representatives from Streptomyces and Kitasatospora. Species: 1, Streptacidiphilus albus JL83T; 2, Streptacidiphilus anmyonensis NBRC 103185T; 3, Streptacidiphilus bronchialis DSM 106435T; 4, Streptacidiphilus carbonis NBRC 100919T; 5, Streptacidiphilus jeojiense NRRL B-24555T; 6, Streptacidiphilus jiangxiensis NBRC 100920T; 7, Streptacidiphilus melanogenes NBRC 103184T; 8, Streptacidiphilus neutrinimicus NBRC 100921T; 9, Streptacidiphilus oryzae TH49T; 10, Streptacidiphilus pinicola KCTC 49008T; 11, Streptacidiphilus rugosus AM-16T; 12, Streptomyces albus DSM 41398T; 13, Streptomyces avermitilis MA-4680T; 14, Streptomyces coelicolor A3(2); 15, Kitasatospora azatica KCTC 9699T; 16, Kitasatospora mediocidica KCTC 9733T; 17, Kitasatospora setae KM-6054T.
SpeciesBioProject AccessionSize (Mb)No. of ContigsGC Content (%)CDSPseudogenestRNA (rRNA)Accessory GenesUnique GenesNo. of Biosynthetic Gene Clusters (BGCs)Reference
1PRJNA2347869.91171.88,16636684 (27)3,4582,06635[5]
2PRJDB32979.3815372794145373 (3)467875930[23]
3PRJNA4385327.09172.70568124766 (24)1915166519[49]
4PRJDB32978.4513971.3706245472 (3)3591116232[5]
5PRJNA2385349.1514471.5776925773 (19)3990137618-
6PRJDB32979.52967282857669 (5)4545124730[50]
7PRJDB32978.7710771.9748139074 (3)454755431[23]
8PRJDB32978.4118471.9711945971 (3)419356738[5]
9PRJNA2347887.81673.4652627263 (23)2447188415[51]
10PRJNA4754528.4328171.8744031168 (8)413784635[4]
11PRJNA2347789471.8775035674 (27)4101117122[23]
12PRJNA2716258.38172.6073305865 (18)NANA35[52]
13PRJNA1899.11270.687446068 (18)NANA37[53]
14PRJNA2429.0517281526065 (18)NANA29[54]
15PRJNA2348628.27371.6684429274 (29)NANA30[55]
16PRJNA2347818.68771.9710325471 (33)NANA32[56]
17PRJDA199518.78174.207182074 (27)NANA38[57]
Table 2. Distribution of biosynthetic gene clusters (BGCs) in the genomes of Streptacidiphilus and their comparison with representative Kitasatospora and Streptomyces. Cluster type with an overall aggregate of 10 or above are shown only. Species: 1, S. albus JL83T; 2, S. anmyonensis NBRC 103185T; 3, S. bronchialis DSM 106435T; 4, S. carbonis NBRC 100919T; 5, S. jeojiense NRRL B-24555T; 6, S. jiangxiensis NBRC 100920T; 7, S. melanogenes NBRC 103184T; 8, S. neutrinimicus NBRC 100921T; 9, S. oryzae TH49T; 10, S. pinicola KCTC 49008T; 11, S. rugosus AM-16T; 12, Str. albus DSM 41398T; 13, Str. avermitilis MA-4680T; 14, Str. coelicolor A3(2); 15, K. azatica KCTC 9699T; 16, K. mediocidica KCTC 9733T; 17, K. setae KM-6054T.
Table 2. Distribution of biosynthetic gene clusters (BGCs) in the genomes of Streptacidiphilus and their comparison with representative Kitasatospora and Streptomyces. Cluster type with an overall aggregate of 10 or above are shown only. Species: 1, S. albus JL83T; 2, S. anmyonensis NBRC 103185T; 3, S. bronchialis DSM 106435T; 4, S. carbonis NBRC 100919T; 5, S. jeojiense NRRL B-24555T; 6, S. jiangxiensis NBRC 100920T; 7, S. melanogenes NBRC 103184T; 8, S. neutrinimicus NBRC 100921T; 9, S. oryzae TH49T; 10, S. pinicola KCTC 49008T; 11, S. rugosus AM-16T; 12, Str. albus DSM 41398T; 13, Str. avermitilis MA-4680T; 14, Str. coelicolor A3(2); 15, K. azatica KCTC 9699T; 16, K. mediocidica KCTC 9733T; 17, K. setae KM-6054T.
Cluster Type/Species1234567891011121314151617Total
Terpene3436564485527533578
T1PKS06081161103034125152
NRPS4214351204224303242
Siderophore2212223212224322236
Lantipeptide7132210211120331434
Bacteriocin0112101123111230323
Butyrolactone2110011112210021622
T2PKS0200011110001220112
T3PKS0101111102001110112
Lassopeptide0100011101013000211
T1PKS_NRPS1031010100011010010
Thiopeptide201000000211002009
T1PKS_OtherKS000000000002121118
Butyrolactone_OtherKS000001100101100106
Other hybrids9641156602185348675
Others0010000111253313223
Total3530193218303138153522353729303238506
Table 3. List of known BGCs observed only in Streptacidiphilus genomes.
Table 3. List of known BGCs observed only in Streptacidiphilus genomes.
Known BGCOrganism NameCluster No.Cluster TypeSimilarity (%)MIBiG Reference
CacibiocinS. pinicola KCTC 49008T10Aminocoumarin64BGC0001154
CyanopeptinS. albus JL83T30NRPS50BGC0000331
Eicoseicosapentaenoic acidS. bronchialis DSM 106435T18Otherks10BGC0000865
ErythrochelinS. rugosus AM-16T18NRPS28BGC0000349
FrenolicinS. jiangxiensis NBRC 100920T9T2PKS50BGC0000225
LankacidinS. pinicola KCTC 49008T25Other26BGC0001100
MicromonolactamS. neutrinimicus NBRC 100921T2T1PKS100BGC0000095
Table 4. Distribution of carbohydrate-active enzymes (CAZymes) and their various families in Streptacidiphilus and their comparison with representative Streptomyces and Kitasatospora species. For comparison, the CAZy profile of a well-known cellulose degrading Streptomyces sp. SirexAA strain is also shown. Values in parentheses (columns 3–8) represent different types of CAZy families. GH = glycoside hydrolase, GT = glycosyl transferase, CE = carbohydrate esterases, PL = polysaccharide lyases, CBM = carbohydrate-binding modules, AA = auxiliary activities.
Table 4. Distribution of carbohydrate-active enzymes (CAZymes) and their various families in Streptacidiphilus and their comparison with representative Streptomyces and Kitasatospora species. For comparison, the CAZy profile of a well-known cellulose degrading Streptomyces sp. SirexAA strain is also shown. Values in parentheses (columns 3–8) represent different types of CAZy families. GH = glycoside hydrolase, GT = glycosyl transferase, CE = carbohydrate esterases, PL = polysaccharide lyases, CBM = carbohydrate-binding modules, AA = auxiliary activities.
Organism NameGenes (%)GHGTCEPLCBMAA
S. albus JL83T366 (4.48)145 (46)84 (14)76 (7)9 (6)120 (16)15 (5)
S. anmyonensis NBRC 103185T339 (4.27)134 (42)79 (14)71 (7)4 (2)108 (15)24 (6)
S. bronchialis DSM 106435T268 (4.72)132 (49)56 (13)60 (8)4 (3)86 (20)11 (4)
S. carbonis NBRC 100919T422 (5.98)235 (69)77 (13)77 (8)9 (5)152 (26)15 (6)
S. jeojiense NRRL B-24555T440 (5.66)236 (60)93 (12)74 (8)17 (5)129 (21)22 (6)
S. jiangxiensis NBRC 100920T370 (4.47)171 (56)88 (14)70 (8)1 (1)100 (18)29 (6)
S. melanogenes NBRC 103184T329 (4.40)133 (43)80 (14)64 (8)4 (3)96 (16)27 (6)
S. neutrinimicus NBRC 100921T313 (4.40)130 (47)75 (14)62 (7)5 (4)83 (16)25 (6)
S. oryzae TH49T326 (5.00)154 (53)83 (15)64 (8)4 (4)48 (16)20 (5)
S. pinicola KCTC 49008T390 (5.24)190 (61)87 (13)59 (9)9 (7)131 (20)24 (5)
S. rugosus AM-16T334 (4.31)147 (47)81 (14)66 (8)4 (4)99 (16)25 (6)
Str. albus DSM 41398T262 (3.57)120 (48)55 (11)54 (8)3 (3)30 (12)21 (7)
Str. avermitilis MA-4680T347 (4.52)169 (55)75 (13)60 (9)12 (9)75 (18)25 (7)
Str. coelicolor A3(2)383 (4.70)187 (60)68 (14)71 (9)15 (11)98 (19)23 (7)
Streptomyces sp. sirexAA297 (4.67)137 (50)63 (13)63 (9)8 (5)80 (21)17 (5)
K. azatica KCTC 9699T364 (5.34)172 (56)70 (15)67 (8)2 (2)160 (21)18 (5)
K. mediocidica KCTC 9733T333 (4.69)128 (48)84 (14)71 (9)8 (2)96 (18)15 (4)
K. setae KM-6054T308 (4.29)122 (41)78 (13)62 (8)3 (2)100 (19)21 (5)
Table 5. Distribution of key CAZymes involved in the degradation of complex carbohydrates among Streptacidiphilus and their comparison with representative Streptomyces and Kitasatospora.
Table 5. Distribution of key CAZymes involved in the degradation of complex carbohydrates among Streptacidiphilus and their comparison with representative Streptomyces and Kitasatospora.
Enzyme ClassCAZy FamilyMain ActivityS. albus JL83TS. anmyonensis NBRC 103185TS. carbonis NBRC 100919TS. jeojiense NRRL B-24555TS. jiangxiensis NBRC 100920TS. melanogenes NBRC 103184TS. neutrinimicus NBRC 100921TS. oryzae TH49TS. pinicola KCTC 49008TS. rugosus AM-16TS. bronchialis DSM 106435TStr. albus DSM 41398TStr. avermitilis MA-4680TStr. coelicolor A3(2)Streptomyces sp. SirexAAK. azatica KCTC 9699TK. mediocidica KCTC 9733TK. setae KM-6054T
Cellulose degrading CAZymesGH1β-Glucosidase, β-galactosidase435336334643356524
GH3β-Glucosidases, β-D-xylopyranosidase878914108711976597724
GH5Endo-β-1,4-glucanase, β-mannosidase2551157561951353362
GH6Cellobiohydrolase, endo-β-1,4-glucanase314141111122431323
GH8Endo-β-1,4-glucanase120012220220000011
GH9Endo-β-1,4-glucanase202111110100021012
GH12Endo-β-1,4-glucanase001221000110321100
GH44Endoglucanase001000000000000000
GH48Cellobiohydrolase101111001110111111
GH74Endoglucanase131442223340431211
Subtotal222128323531222221332612233021221618
Hemicellulose degrading CAZymesGH2β-Galactosidase, β-glucuronidase112441106224674110
GH10Endo-β-1,4-xylanase003311023210221323
GH11Endo-β-1,4-xylanase002100001000021101
GH16Xyloglycosyltransferase10538958436743569107
GH26Endo-β-1,4-mannanase020335330301111100
GH30Endo-β-1,4-xylanase033322321202311211
GH31α-Glucosidases, α-xylosidase001441005102224400
GH39α-L-iduronidase, β-xylosidase300221112020200100
GH42β-Galactosidase022645212620423210
GH43α-l-arabinofuranosidase, β-xylosidase113451004232666612
GH53Endo-β-1,4-galactanase100743243810000540
Subtotal161419453825201730321815292827352014
Chitin and chitosan degrading CAZymesGH18Chitinase14126121716121071214581210141219
GH19Chitinase122122110111023012
GH20Exo-β-N-acetylglucosaminidase242335424445342222
GH46Endo-β-1,4-chitosanase101110001002223103
Subtotal181811172323171312171913132018171526
Oxidative enzymesAA10Lytic polysaccharide monooxygenase (LPMO)003000000001476005

Share and Cite

MDPI and ACS Style

Malik, A.; Kim, Y.R.; Kim, S.B. Genome Mining of the Genus Streptacidiphilus for Biosynthetic and Biodegradation Potential. Genes 2020, 11, 1166. https://doi.org/10.3390/genes11101166

AMA Style

Malik A, Kim YR, Kim SB. Genome Mining of the Genus Streptacidiphilus for Biosynthetic and Biodegradation Potential. Genes. 2020; 11(10):1166. https://doi.org/10.3390/genes11101166

Chicago/Turabian Style

Malik, Adeel, Yu Ri Kim, and Seung Bum Kim. 2020. "Genome Mining of the Genus Streptacidiphilus for Biosynthetic and Biodegradation Potential" Genes 11, no. 10: 1166. https://doi.org/10.3390/genes11101166

APA Style

Malik, A., Kim, Y. R., & Kim, S. B. (2020). Genome Mining of the Genus Streptacidiphilus for Biosynthetic and Biodegradation Potential. Genes, 11(10), 1166. https://doi.org/10.3390/genes11101166

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop