Next Article in Journal
Implementation of Exome Sequencing in Clinical Practice for Neurological Disorders
Next Article in Special Issue
Mitochondrial Heteroplasmy and PCR Amplification Bias Lead to Wrong Species Delimitation with High Confidence in the South American and Antarctic Marine Bivalve Aequiyoldia eightsii Species Complex
Previous Article in Journal
Genome-Wide Analysis of DREB Family Genes and Characterization of Cold Stress Responses in the Woody Plant Prunus nana
Previous Article in Special Issue
Using Next-Generation Sequencing to Disentangle the Diet and Incidence of Intestinal Parasites of Falkland Flightless Steamer Duck Tachyeres brachypterus and Patagonian Crested Duck Lophonetta specularioides Sharing a South Atlantic Island
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Brief Report

Metagenomics Provides a Deeper Assessment of the Diversity of Bacterial Communities in Polar Soils Than Metabarcoding

by
Burkhard Becker
* and
Ekaterina Pushkareva
Institute for Plant Sciences, University of Cologne, 50923 Cologne, Germany
*
Author to whom correspondence should be addressed.
Genes 2023, 14(4), 812; https://doi.org/10.3390/genes14040812
Submission received: 9 January 2023 / Revised: 23 February 2023 / Accepted: 7 March 2023 / Published: 28 March 2023
(This article belongs to the Special Issue Polar Genomics)

Abstract

:
The diversity of soil bacteria was analyzed via metabarcoding and metagenomic approaches using DNA samples isolated from the biocrusts of 12 different Arctic and Antarctic sites. For the metabarcoding approach, the V3-4 region of the 16S rRNA was targeted. Our results showed that nearly all operational taxonomic units (OTUs = taxa) found in metabarcoding analyses were recovered in metagenomic analyses. In contrast, metagenomics identified a large number of additional OTUs absent in metabarcoding analyses. In addition, we found huge differences in the abundance of OTUs between the two methods. The reasons for these differences seem to be (1) the higher sequencing depth in metagenomics studies, which allows the detection of low-abundance community members in metagenomics, and (2) bias of primer pairs used to amplify the targeted sequence in metabarcoding, which can change the community composition dramatically even at the lower taxonomic levels. We strongly recommend using only metagenomic approaches when establishing the taxonomic profiles of whole biological communities.

1. Introduction

The ecosystems of the polar regions belong to the most extreme environments on Earth. While they are only sparsely inhibited by humans, they represent a fifth of our planet and comprise the majority of the world’s cryosphere. The polar regions still represent vast white wildernesses. However, even these remote regions are under anthropogenic influence, with climate change significantly altering these icy environments in the near future. Current estimates predict these cold regions to decrease by at least 16% by 2099 according to the Global Change Assessment Model RCP4.5 [1]. In addition, freeze/thaw cycles will be more common in huge areas of the polar regions [2]. These will result in massive changes to biodiversity and ecosystem functioning [3].
While the macrofauna and -flora (plants and algae) of the polar regions have been studied quite extensively, we know much less about microbial organisms and communities. This is especially the case for the terrestrial ecosystems, which are nearly completely (Antarctica) or mainly (Arctic) dominated by lichens [4,5] and biological soil crust communities [6] and will, due to climate change, undergo dramatic changes. Historically, identification of members of these communities required biochemical (bacteria) or microscopical (cyanobacteria, micro algae, micro fauna) approaches, often after establishing cultures for the organisms of interest [7]. However, it has been known since the middle of the 1980s that only a small percentage (estimated to be 0.1 to 5%) can be cultured [7]. With the advent of DNA-sequencing techniques, identification was more commonly based on sequence information of some marker molecules (most often 16S rRNA for bacteria and 18S rRNA for eukaryote sequences) isolated from cultured material. More recently, metabarcoding has been introduced to establish the composition of complete microbial communities [8,9]. Metabarcoding requires the isolation of DNA from environmental samples followed by the amplification (by PCR) of a specific biomarker (often certain variable regions of the rRNA) which serves as a barcode for the identification of the presence of an organism. The frequency with which an RNA sequence is found in a sample is a measure of the abundance of an RNA sequence. Individual RNA sequences are commonly referred to as amplicon sequence variant (ASV) or operational taxonomical unit (OTU) and represent a species. Metabarcoding has been widely used in the last years to study community composition of soil samples (e.g., [9,10,11]).
A major problem with metabarcoding is the PCR step [12,13,14]. Universal primer pairs targeting all organisms have not been found so far. Therefore, often more specific primer pairs, which target only some organismal groups (bacteria [15] and eukaryotes [16] but also specific primers for lower taxonomic groups e.g., Cercozoa [17], Xantophyceae [18]) are used. This results in difficulties in establishing taxonomic profiles of whole communities.
To overcome this problem, metagenomic approaches have been proposed. Metagenomics is the random sequencing of DNA isolated from environmental samples (e.g., soil), thus without a PCR step. Over the last years sophisticated bioinformatic tools were developed to establish, in principle, most if not all genomes present in microbial communities. This huge sequence information can be used to investigate the biodiversity but also to analyze the functions of individual members in the community (see [19] for a comprehensive discussion of metabarcoding and metagenomics in soil science). Currently, the success of this approach seems only to be impeded by sequencing depth and computer speed and time.
However, a real comparison of metabarcoding and metagenomics are rare [20,21]. We therefore decided to use the same DNA samples isolated from Svalbard (the Arctic) and East Antarctica for a comparison of metabarcoding and metagenomic approaches using the same bioinformatic pipeline. We will not discuss any ecological aspects of the results, which have been already published for the Arctic samples [11] and will be published soon for the Antarctic samples (Pushkareva et al., submitted).

2. Material and Methods

2.1. Sampling Sites

The sampling sites for the Arctic and Antarctic samples are described in [11] and Pushkareva et al. (submitted), respectively. In brief, the Arctic samples S1, S3, S7, and S8 were collected in South Svalbard, while S11 was taken in the Longyearbyen (Svalbard) surroundings and NA3 near Ny-Alesund (Svalbard). Antarctic sample Pad2 was collected on Padda Island, and Amu8 and Amu14 were sampled at the Amundsen Bay in East Antarctica. Lang_37 and Skar18 were collected at Langhovde Hills and Skarvsnes Foreland, respectively, both located at Lützow-Holm Bay, East Antarctica. Syo6 was sampled close to Syowa Station at Ongul Island, East Antarctica. All samples are different types of biological soil crusts.

2.2. DNA Isolation, Sequencing, and Bioinformatic Analyses

DNA was isolated as described in [11]. Metabarcoding sequencing of the V3-4 region of rDNA of 4 replicates was performed as described in [11]. The raw reads for the Arctic and Antarctic samples are available at the Sequence Read Archive (SRA) under projects PRJNA881983 and PRJNA936101, respectively. For metagenomics, one replicate of DNA samples used for metabarcoding (replicate 1 of Pad2 and S1 sites, replicate 2 of S11 site and replicate 3 of Amu14 site) was sequenced by Eurofins (Ebersberg, Germany) and Biomarker Technologies (Münster, Germany). Additional replicates were sequenced only by Eurofins (replicate 1 for the Lang_37, NA3_R2, S3c, S7c, S8, and Skar18 sites; replicate 3 for the Amu8 site; and replicate 4 for the Syo6 site). The raw reads were submitted to SRA under project PRJNA945601 for the Eurofins data set and PRJNA948668 for the Biomarker data set.
Raw reads were demultiplexed and primers and poor sequences removed using the trimmomatic version (standard settings) [22] in the Omicsbox software package [23]. Taxonomic classification of the reads obtained via both metabarcoding (16S rDNA gene targeted) and metagenomic sequencing was performed using the Kraken 2 software [24] included in Omicsbox with standard settings. Briefly, Kraken 2 assigns taxonomic labels to DNA short reads by examining the k-mers within a read and querying a database with information on those k-mers for different species (see [24]). The current Kraken database 2022_8 contains information of 23862 genomes (mainly for bacteria (49%) and viruses (48%)). Differential abundance analyses to identify OTUs that differ significantly between two or more samples were performed using edgeR [25] using the standard settings as provided in Omicsbox.

3. Results and Discussion

DNA was isolated for six Arctic and six Antarctic sites (see Material and Methods for details). The same DNA samples were used for metagenomic and metabarcoding approaches. For the metabarcoding, the 16S rRNA gene was targeted. All reads were analyzed with the same software, Kraken 2, as implemented in the Omicsbox software using standard settings. Table S1 gives an overview about the number of reads classified, the estimated diversity within the samples using the Shannon and Simpson indices, and the percentage of classified reads at various systematic levels.
The percentage of classified reads was much higher for metabarcoding than for the metagenomic analyses. Kraken 2 classified 17–32% of all reads (average 22% ± 4%, n = 16) in analyses of the metagenomic data sets. In contrast, in the metabarcoding data sets between 77–97% of all reads (average 90% ± 11%, n = 36) were taxonomically classified. The higher classification rate within the metabarcoding data set was probably due to the targeted rDNA sequence in metabarcoding, as we attempted to amplify only the 16S rDNA. Generally, between 77% and 97% of the reads could be classified in metabarcoding. Only for three replicates was a lower classification value than 80% obtained (Table S1). In contrast, using the same software, only 17–25% of the reads in the metagenomics dataset could be classified. This probably reflects the Kraken 2 database used for classification. The current database of Kraken 2 is rich in bacterial sequences but contains only few eukaryotic sequences. Therefore, nearly the whole eukaryotic diversity in our metagenomic data set was missing, while the metabarcoding data sets contained nearly only bacterial sequences.
Another major difference between both approaches was, that generally the percentage of reads classified was much higher at various taxonomic levels in the metagenomic data sets than in the metabarcoding data sets (Table S1). For example, at the phylum level in the metagenomic and metabarcoding data sets, we classified between 70 and almost 100% and 51–89% reads, respectively. For 9 out of 16 metagenomic data sets, we obtained classification level of more than 80%. In contrast, only 6 out of 36 metabarcoding data sets reached the same percentage for taxonomic classification at the phylum level (Table S1). We observed similar difference for the lower taxonomic levels. At the species level, the observed classification level ranges for metagenomic and metabarcoding were 7–34% and 3–24%, respectively.
Although the overall percentage of reads classified in our metagenomic data sets was low, the observed biodiversity was much higher than in the metabarcoding. For example, the calculated Shannon indices were >3.8 in the metagenomic analyses, while the values obtained in metabarcoding were much lower and in the range of 1.6 to 3.1 (with one exception; Table 1). Figure 1 compares the Shannon indices for all analyzed Antarctic and Arctic samples. The Shannon indices for the metagenomic samples (Figure 1 blue) were always higher than the Shanon indices for the metabarcoding samples (Figure 1, orange). There was only one exception. For the Lang37 site, both types of data sets showed similar Shannon indices. Lang37 was the only site where metagenomic and metabarcoding data were very similar (see Table S1).
The difference between the Shannon indices of the metagenomic and metabarcoding data sets was due to a higher number of OTUs recognized by the Kraken 2 software in the metagenomic datasets. Many OTUs could not be found at all in the metabarcoding data sets. Even after removal of the low abundant OTUs, 10 to 60% (depending on the site and probably the number of replicates used in metagenome analyses) of the OTUs were enriched in abundance (present only or significant fold change) in the metagenomic data sets (Table 1). Only a small number of OTUs (always less than 1% of the filtered taxonomical tags) were enriched in the metabarcoding datasets. Thus, at all investigated sites, a single metagenome recovered more than 99% (often 99.9%) of the diversity as revealed by metabarcoding using up to four replicates.
In agreement with this, we observed remarkable differences between the taxa composition of the two approaches. At all taxonomic levels, the composition is clearly very different for metagenomic and metabarcoding data. As an example, Figure 2 shows the taxonomic composition for the Pad2 site at two different taxonomic levels (Figure 2A,B). At the phylum level (Figure 2A), metagenomic data sets were dominated by Acidobacteria followed by Proteobacteria and Cyanobacteria (with Acidobacteria representing nearly 60% of the classified reads).
In contrast, the metabarcoding data sets were dominated by Cyanobacteria (representing about 50% of the classified reads except for replicate 2) followed by Acidobacteria and Proteobacteria (Figure 2A). Similar results were obtained for all other sites. For all data sets we observed considerable variation between metabarcoding replicates similar to the one shown for the Pad2 site in Figure 2. These results highlight the importance of replication in metabarcoding studies.
Having established that the quantitative results were very different for metabarcoding and metagenomic analyses, we wondered whether the taxon composition at lower levels would be also very different. As we are mostly interested in phototrophs, we decided to look in more detail into the presence of cyanobacteria at the investigated sites. Table 2 shows the cyanobacterial composition at the genus level for the Pad2 site. Similar to the results reported above, metagenomics identified more genera than metabarcoding. Genera, present with a small read number in the metagenome data set, were not present at all in the metabarcoding data set. However, surprisingly, we also observed major quantitative differences between metagenomic and metabarcoding data. The cyanobacterial reads of the metagenomic data sets were dominated by Nostoc (approximately 26%), Chroococcodiopsis (approximately 6.5%), Calothrix (approximately 2.5%), Scytonema (approximately 1.5%), and Leptolyngbya (approximately 1.1). In contrast, Leptolyngbya was the dominant cyanobacterium in the metabarcoding replicates (20 to 60% in the different replicates), followed by Nostoc, with approximately 2% in all replicates (Table 2). Similar results were also obtained for the other sites investigated.

4. General Discussion

In this manuscript, we provide a comparison of metagenomic and metabarcoding approaches for studying the biodiversity of the soil environment. Both are commonly used to assess the microbial diversity in soil samples [7,19]. As we used the same DNA samples and the same bioinformatic pipeline for both molecular methods, the results should be directly comparable to each other for both approaches. The differences were surprisingly high. The bacterial biodiversity analyzed using the Kraken 2 software was much higher in all metagenomic analyses. The reason for this is mainly the detection of minor community members in metagenomic analyses which are missing in the metabarcoding data sets. This is probably due to the higher sequencing depth in metagenome studies. To allow the assembly of complete genomes for the major community members, the sequencing depth is much higher in metagenomic studies than in metabarcoding studies, which apparently benefits the detection of minor community members. This could probably also be compensated by higher sequencing depth in metabarcoding analyses.
However, a more troubling result was the big quantitative differences at every taxonomic level, e.g., Actinobacteria and Cyanobacteria (at the phylum level) and Nostoc and Leptolynbya (at the genus level). While we cannot exclude other possibilities completely, it seems most likely that the PCR amplification is causing these differences when targeting a marker gene sequence like the 16S rDNA by PCR amplification [14,19]. Different PCR amplification rates for the same gene in different species are well known, and, apparently, not only effect the composition of OTUs in metabarcoding studies at higher taxonomic levels (Acidobacteria vs. Cyanobacteria) but also at lower levels (e.g., genus: Nostoc vs. Leptolynbya).
While overall the results for different replicates for metabarcoding and metagenomics were very similar, the completely different taxon composition of for example the Pad2 replicate 2 in the metabarcoding study highlights the importance of replication. Single replicates can clearly be very misleading when analyzing the biodiversity of soil samples. However, our metagenomic analyses for of single replicate recovered more than 99% of the biodiversity observed with two to four replicates in our metabarcoding analyses, even if very different replicates were encountered in metabarcoding as for the Pad2 site. Nevertheless, the differences between both methods became even more pronounced when more replicates for both methods were included in our analyses (e.g., in Table 1, overrepresentation rates correlate with replicate number).
Finally, a major advantage of metagenomics is the ability to assemble nearly complete genomes for members of the communities. For example, a total of 321,299 reads were identified for Nostoc in the Metagenome 2 data set for Pad2. The reads represent 0.27% of the total reads of this data set and a total of 48 MB, which is approximately five times the size of a typical Nostoc genome (N. punctiforme: 8.9 MB [26]).

5. Conclusions

In this study, we demonstrate that metagenomic analyses are clearly better suited than metabarcoding for investigating the biodiversity of terrestrial soil samples. The number of observed species is much higher and quantitative analyses are not hindered by amplification biases in the PCR step for metabarcoding. In addition, metagenomes will allow to compare functional aspects for different communities. The only drawback of metagenome analysis is the larger data sets which requires more time for computer analyses.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14040812/s1, Supplementary Table S1: Overall comparison of the bacterial biodiversity of terrestrial soil samples from polar soils using metagenomics and metabarcoding approaches. All reads were analyzed using the Kraken 2 software as implemented in Omicsbox. Metagenomics: Replicates 1 and 2 represent technical replicates sequenced by two different companies. Metabarcoding: All replicates were sequenced by a single company. Bold numbers indicate the same replicate DNA sample that was used for metagenomics and metabarcoding.

Author Contributions

Conceptualization, B.B. and E.P.; methodology, B.B. and E.P.; writing—original draft preparation, B.B.; writing—review and editing, E.P.; visualization, B.B.; funding acquisition, B.B. All authors have read and agreed to the published version of the manuscript.

Funding

The work presented is supported by a DFG grant BE1779/19-1 within the priority program SPP1158 (Antarctic Research) to BB. We acknowledge support for the article processing charge from the DFG (491454339).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw reads of the metabarcoding study for the Arctic and Antarctic samples are available at the Sequence Read Archive (SRA) under projects PRJNA881983 and PRJNA936101, respectively. The raw reads for the metagenome study were submitted to SRA under project PRJNA945601 for the Eurofins data set and PRJNA948668 for the Biomarker data set.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Thomson, A.M.; Calvin, K.v.; Smith, S.J.; Kyle, G.P.; Volke, A.; Patel, P.; Delgado-Arias, S.; Bond-Lamberty, B.; Wise, M.A.; Clarke, L.E.; et al. RCP4.5: A Pathway for Stabilization of Radiative Forcing by 2100. Clim. Chang. 2011, 109, 77. [Google Scholar] [CrossRef] [Green Version]
  2. Henry, H.A.L. Soil Freezing Dynamics in a Changing Climate: Implications for Agriculture. In Plant and Microbe Adaptations to Cold in a Changing World; Springer: Berlin, Germany, 2013. [Google Scholar]
  3. Balser, T.C.; Gutknecht, J.L.M.; Liang, C. How Will Climate Change Impact Soil Microbial Communities? In Soil Microbiology and Sustainable Crop Production; Springer: Berlin, Germany, 2010. [Google Scholar]
  4. Printzen, C. Uncharted Terrain: The Phylogeography of Arctic and Boreal Lichens. Plant Ecol. Divers. 2008, 1, 265–271. [Google Scholar] [CrossRef]
  5. Friedmann, E.I. Lichens of Antarctica and South Georgia: A Guide to Their Identification and Ecology. Arct. Antarct. Alp. Res. 2003, 35, 409–410. [Google Scholar] [CrossRef] [Green Version]
  6. Williams, L.; Borchhardt, N.; Colesie, C.; Baum, C.; Komsic-Buchmann, K.; Rippin, M.; Becker, B.; Karsten, U.; Büdel, B. Biological Soil Crusts of Arctic Svalbard and of Livingston Island, Antarctica. Polar Biol. 2017, 40, 399–411. [Google Scholar] [CrossRef] [Green Version]
  7. Torsvik, V.; Øvreås, L. Microbial Diversity and Function in Soil: From Genes to Ecosystems. Curr. Opin. Microbiol. 2002, 5, 240–245. [Google Scholar] [CrossRef] [PubMed]
  8. Ruppert, K.M.; Kline, R.J.; Rahman, M.S. Past, Present, and Future Perspectives of Environmental DNA (EDNA) Metabarcoding: A Systematic Review in Methods, Monitoring, and Applications of Global EDNA. Glob. Ecol. Conserv. 2019, 17, e00547. [Google Scholar] [CrossRef]
  9. Joos, L.; Beirinckx, S.; Haegeman, A.; Debode, J.; Vandecasteele, B.; Baeyen, S.; Goormachtig, S.; Clement, L.; de Tender, C. Daring to Be Differential: Metabarcoding Analysis of Soil and Plant-Related Microbial Communities Using Amplicon Sequence Variants and Operational Taxonomical Units. BMC Genom. 2020, 21, 733. [Google Scholar] [CrossRef]
  10. Rippin, M.; Lange, S.; Sausen, N.; Becker, B. Biodiversity of Biological Soil Crusts from the Polar Regions Revealed by Metabarcoding. FEMS Microbiol. Ecol. 2018, 94, fiy036. [Google Scholar] [CrossRef] [PubMed]
  11. Pushkareva, E.; Elster, J.; Holzinger, A.; Niedzwiedz, S.; Becker, B. Biocrusts from Iceland and Svalbard: Does Microbial Community Composition Differ Substantially? Front. Microbiol. 2022, 13, 1048522. [Google Scholar] [CrossRef]
  12. Alberdi, A.; Aizpurua, O.; Gilbert, M.T.P.; Bohmann, K. Scrutinizing Key Steps for Reliable Metabarcoding of Environmental Samples. Methods Ecol. Evol. 2018, 9, 134–147. [Google Scholar] [CrossRef] [Green Version]
  13. Liu, M.; Clarke, L.J.; Baker, S.C.; Jordan, G.J.; Burridge, C.P. A Practical Guide to DNA Metabarcoding for Entomological Ecologists. Ecol. Entomol. 2020, 45, 373–385. [Google Scholar] [CrossRef] [Green Version]
  14. Schloss, P.D.; Gevers, D.; Westcott, S.L. Reducing the Effects of PCR Amplification and Sequencing Artifacts on 16s RRNA-Based Studies. PLoS ONE 2011, 6, e27310. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Lange, A.; Jost, S.; Heider, D.; Bock, C.; Budeus, B.; Schilling, E.; Strittmatter, A.; Boenigk, J.; Hoffmann, D. AmpliconDuo: A Split-Sample Filtering Protocol for High-Throughput Amplicon Sequencing of Microbial Communities. PLoS ONE 2015, 10, e0141590. [Google Scholar] [CrossRef] [PubMed]
  16. Stoeck, T.; Bass, D.; Nebel, M.; Christen, R.; Jones, M.D.M.; Breiner, H.W.; Richards, T.A. Multiple Marker Parallel Tag Environmental DNA Sequencing Reveals a Highly Complex Eukaryotic Community in Marine Anoxic Water. Mol. Ecol. 2010, 19, 21–31. [Google Scholar] [CrossRef] [PubMed]
  17. Fiore-Donno, A.M.; Rixen, C.; Rippin, M.; Glaser, K.; Samolov, E.; Karsten, U.; Becker, B.; Bonkowski, M. New Barcoded Primers for Efficient Retrieval of Cercozoan Sequences in High-Throughput Environmental Diversity Surveys, with Emphasis on Worldwide Biological Soil Crusts. Mol. Ecol. Resour. 2018, 18, 229–239. [Google Scholar] [CrossRef]
  18. Rybalka, N.; Wolf, M.; Andersen, R.A.; Friedl, T. Congruence of Chloroplast- and Nuclear-Encoded DNA Sequence Variations Used to Assess Species Boundaries in the Soil Microalga Heterococcus (Stramenopiles, Xanthophyceae). BMC Evol. Biol. 2013, 13, 39. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Semenov, M.V. Metabarcoding and Metagenomics in Soil Ecology Research: Achievements, Challenges, and Prospects. Biol. Bull. Rev. 2021, 11, 40–53. [Google Scholar] [CrossRef]
  20. Armbrecht, L.; Eisenhofer, R.; Utge, J.; Sibert, E.C.; Rocha, F.; Ward, R.; José, J.; Karlusich, P.; Tirichine, L.; Norris, R.; et al. Paleo-Diatom Composition from Santa Barbara Basin Deep-Sea Sediments: A Comparison of 18S-V9 and Diat-RbcL Metabarcoding vs Shotgun Metagenomics. ISME Commun. 2021, 1, 66. [Google Scholar] [CrossRef]
  21. Rieder, J.; Kapopoulou, A.; Bank, C.; Adrian-Kalchhauser, I. Metagenomics and Metabarcoding Experimental Choices and Their Impact on Microbial Community Characterization in Freshwater Recirculating Aquaculture Systems. Environ. Microbiome 2023, 18, 8. [Google Scholar] [CrossRef]
  22. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [Green Version]
  23. BioBam Bioinformatics OmicsBox—Bioinformatics Made Easy. Available online: https://www.biobam.com/omicsbox (accessed on 8 January 2023).
  24. Wood, D.E.; Salzberg, S.L. Kraken: Ultrafast Metagenomic Sequence Classification Using Exact Alignments. Genome Biol. 2014, 15, R46. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Robinson, M.D.; McCarthy, D.J.; Smyth, G.K. EdgeR: A Bioconductor Package for Differential Expression Analysis of Digital Gene Expression Data. Bioinformatics 2009, 26, 139–140. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Meeks, J.C.; Elhai, J.; Thiel, T.; Potts, M.; Larimer, F.; Lamerdin, J.; Predki, P.; Atlas, R. An Overview of the Genome of Nostoc Punctiforme, a Multicellular, Symbiotic Cyanobacterium. Photosynth. Res. 2001, 70, 85–106. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Shannon indices for all analyzed samples. Samples to the left—Amu8 to Syo6—are from East Antarctica, whereas samples to the right—NA3_R2 to S11a—are from Svalbard (the Arctic).
Figure 1. Shannon indices for all analyzed samples. Samples to the left—Amu8 to Syo6—are from East Antarctica, whereas samples to the right—NA3_R2 to S11a—are from Svalbard (the Arctic).
Genes 14 00812 g001
Figure 2. Biodiversity of the Pad2 site. The metagenomic and the metabarcoding datasets were analyzed using the Kraken 2 software and Omicsbox tools. Shown are stacked bar charts of the abundance (x-axis) of classified reads. For simplicity taxonomic groups with less than 1% abundance are grouped together as others. The same DNA samples (replicate 1 of the four replicates shown) were used for metabarcoding 1 and the metagenomes 1 and 2. (A) Phyla; (B) classes.
Figure 2. Biodiversity of the Pad2 site. The metagenomic and the metabarcoding datasets were analyzed using the Kraken 2 software and Omicsbox tools. Shown are stacked bar charts of the abundance (x-axis) of classified reads. For simplicity taxonomic groups with less than 1% abundance are grouped together as others. The same DNA samples (replicate 1 of the four replicates shown) were used for metabarcoding 1 and the metagenomes 1 and 2. (A) Phyla; (B) classes.
Genes 14 00812 g002
Table 1. Differential abundance of taxonomic tags (species taxonomic level) in metagenomic and metabarcoding studies of terrestrial soil samples from polar regions. Sequences were classified with the Kraken 2 software and differential abundance analyses performed using edgeR and standard settings. The percentage of filtered taxonomic tags over- or underrepresented in metagenomes is given in brackets.
Table 1. Differential abundance of taxonomic tags (species taxonomic level) in metagenomic and metabarcoding studies of terrestrial soil samples from polar regions. Sequences were classified with the Kraken 2 software and differential abundance analyses performed using edgeR and standard settings. The percentage of filtered taxonomic tags over- or underrepresented in metagenomes is given in brackets.
SiteTotal TagsTags after Filtering (1)Diff. Abundant TagsOverrepresented in MetagenomesUnderrepresented in Metagenomes
Amu86410571614091402 (24.5%)7 (0.1%)
Amu147055378822772254 (59.5%)23 (0.6%)
Lang3754874014739734 (18.2%)5 (0.1%)
Pad27482355821962187 (61.5%)9 (0.1%)
Skar1850314496753737 (16.4%)16 (0.3%)
Syo653424333615585 (13.5%)30 (0.7%)
NA3_
R2C
n/an/an/an/an/a
S17103355220402015 (56.7%)25 (0.7%)
S352065206509497 (9.5%)12 (0.2%)
S7 (2)n/an/an/an/an/a
S85443544316811661 (30.5%)20 (0.4%)
S116338358516581641 (45.7%)17 (0.5%)
(1) OTUs had to be present in at least 2 samples with more than 4 reads/million reads to be included in the analysis. (2) Not enough replicates for differential abundance analysis.
Table 2. Comparison of the abundance of selected cyanobacterial genera in metagenomes and metabarcoding analyses of the Pad2 site. Bold numbers indicate the replicate that was used for metagenomics and metabarcoding.
Table 2. Comparison of the abundance of selected cyanobacterial genera in metagenomes and metabarcoding analyses of the Pad2 site. Bold numbers indicate the replicate that was used for metagenomics and metabarcoding.
Metagenome 1Metagenome 2Metabarcoding 1Metabarcoding 2Metabarcoding 3Metabarcoding 4
Total cyanobacterial reads96575912798132080914401956814344
Percentage of cyanobacterial reads
Anabaena0.220.25n.dn.d0.01n.d
Calothrix2.342.600.030.140.050.09
Chroococcidiopsis6.426.920.11n.dn.dn.d
Fischerella0.510.57n.d.n.d.n.d.n.d.
Gloeobacter0.070.120.100.070.090.10
Gloeocapsa0.530.610.010.140.010.01
Gloeomargarita0.020.03n.d.n.d.n.d.n.d.
Gloeothece0.030.06n.d.n.d.n.d.n.d.
Leptolyngbya1.091.2636.2766.8126.1447.11
Microcystis0.090.10n.d.n.d.n.d.n.d.
Nostoc25.9025.112.031.462.291.46
Oscillatoria0.200.220.02n.d.0.010.01
Parasynechococcusp.p.n.d.n.d.n.d.n.d.
Phormidium0.020.03n.d.n.d.n.d.n.d.
Pseudanabaena0.040.05n.d.n.d.n.d.0.01
Scytonema1.381.500.09n.d.0.12n.d.
Synechococcus0.320.500.00n.d.0.04n.d.
Synechocystis0.040.05n.d.n.d.n.d.n.d.
Thermoleptolyngbya0.090.11n.d.n.d.n.d.n.d.
Thermosynechococcus0.050.08n.d.n.d.n.d.n.d.
n.d. = not detected. p. = present.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Becker, B.; Pushkareva, E. Metagenomics Provides a Deeper Assessment of the Diversity of Bacterial Communities in Polar Soils Than Metabarcoding. Genes 2023, 14, 812. https://doi.org/10.3390/genes14040812

AMA Style

Becker B, Pushkareva E. Metagenomics Provides a Deeper Assessment of the Diversity of Bacterial Communities in Polar Soils Than Metabarcoding. Genes. 2023; 14(4):812. https://doi.org/10.3390/genes14040812

Chicago/Turabian Style

Becker, Burkhard, and Ekaterina Pushkareva. 2023. "Metagenomics Provides a Deeper Assessment of the Diversity of Bacterial Communities in Polar Soils Than Metabarcoding" Genes 14, no. 4: 812. https://doi.org/10.3390/genes14040812

APA Style

Becker, B., & Pushkareva, E. (2023). Metagenomics Provides a Deeper Assessment of the Diversity of Bacterial Communities in Polar Soils Than Metabarcoding. Genes, 14(4), 812. https://doi.org/10.3390/genes14040812

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop