Next Article in Journal
Fine Mapping and Candidate Gene Analysis of Rice Grain Length QTL qGL9.1
Previous Article in Journal
Low-Dose Non-Targeted Effects and Mitochondrial Control
Previous Article in Special Issue
In Which Way Do the Flower Properties of the Specialist Orchid Goodyera repens Meet the Requirements of Its Generalist Pollinators?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Ancient Horizontal Gene Transfers from Plastome to Mitogenome of a Nonphotosynthetic Orchid, Gastrodia pubilabiata (Epidendroideae, Orchidaceae)

1
Division of Life Sciences, Korea University, Seoul 02841, Republic of Korea
2
International Biological Material Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon 34141, Republic of Korea
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(14), 11448; https://doi.org/10.3390/ijms241411448
Submission received: 29 June 2023 / Revised: 8 July 2023 / Accepted: 11 July 2023 / Published: 14 July 2023
(This article belongs to the Special Issue Orchid Biochemistry)

Abstract

:
Gastrodia pubilabiata is a nonphotosynthetic and mycoheterotrophic orchid belonging to subfamily Epidendroideae. Compared to other typical angiosperm species, the plastome of G. pubilabiata is dramatically reduced in size to only 30,698 base pairs (bp). This reduction has led to the loss of most photosynthesis-related genes and some housekeeping genes in the plastome, which now only contains 19 protein coding genes, three tRNAs, and three rRNAs. In contrast, the typical orchid species contains 79 protein coding genes, 30 tRNAs, and four rRNAs. This study decoded the entire mitogenome of G. pubilabiata, which consisted of 44 contigs with a total length of 867,349 bp. Its mitogenome contained 38 protein coding genes, nine tRNAs, and three rRNAs. The gene content of G. pubilabiata mitogenome is similar to the typical plant mitogenomes even though the mitogenome size is twice as large as the typical ones. To determine possible gene transfer events between the plastome and the mitogenome individual BLASTN searches were conducted, using all available orchid plastome sequences and flowering plant mitogenome sequences. Plastid rRNA fragments were found at a high frequency in the mitogenome. Seven plastid protein coding gene fragments (ndhC, ndhJ, ndhK, psaA, psbF, rpoB, and rps4) were also identified in the mitogenome of G. pubilabiata. Phylogenetic trees using these seven plastid protein coding gene fragments suggested that horizontal gene transfer (HGT) from plastome to mitogenome occurred before losses of photosynthesis related genes, leading to the lineage of G. pubilabiata. Compared to species phylogeny of the lineage of orchid, it was estimated that HGT might have occurred approximately 30 million years ago.

1. Introduction

The advancement in sequencing technology has led to a surge in research on nucleotide sequences of flowering plants and their use for evolutionary and phylogenetic studies [1,2]. However, these studies have mainly focused on the plastome due to the cost and time constraints in sequencing. As of February 2023, 7588 species of angiosperm plastome sequences have been deposited in the NCBI, whereas only 347 mitogenome sequences have been deposited in the NCBI. Research using plastome sequences has been conducted on various flowering plants, such as Amborella [3], magnolids [4], monocots [5], and rosid [6]. Despite challenges in sequencing, nuclear genomes of some flowering plants have been decoded, including Amborella [7], Oryza [8], Phalaenopsis [9], and Triticum [10].
Both the mitochondria and chloroplast are essential organelles in plant function. The plastome is the most widely used genome in plant evolutionary study because it has relatively stable genome structure, size, and rich evolutionary information. In contrast to plastome, the mitogenome vary widely in size and structure across taxa even among different populations of same species [11,12]. The smallest mitogenome is 66 kb for Viscum [13], while the longest mitogenome is 11.3 mb for Silene [14]. It has been suggested that these differences result from genomic rearrangements, repetitive sequences, and/or foreign DNA insertions [15,16,17]. Due to many repeats, long length, and rearrangement of the mitogenome, it is difficult to decipher the overall structure of the mitogenome in majority of plant species. Frequent gene transfers have been reported in the mitogenome of Valeriana of photosynthetic Dipsacales [18]. Intracellular gene transfer (IGT) and horizontal gene transfer (HGT) have been reported in the parasitic plant Aeginetia [19]. Gene transfer has been reported during the symbiotic relationship between fungi and orchids, which are closely related to saprophytic lifestyles of the Orchidaceae [20]. The trace of gene transfer has been studied in a few orchid mitochondria. These imported sequences might be due to past biological contact [12,20,21,22].
The Orchidaceae are one of the largest families of flowering plants, containing approximately 25,000 species [23]. Recent studies have divided this family into five subfamilies: Apostasioideae, Vanilloideae, Orchidoideae, Cypripedioideae, and Epidendroideae [24]. Following the first complete plastome sequencing report of Phalaenopsis aphrodite [25], plastome sequencing reports have been published for all major lineages within Orchidaceae, such as Corallorhiza [26,27], Cymbidium [28], Dendrobium [29], Holcoglossum [30], Neottia [31], and Vanilla [32]. To date, at least 321 orchid plastome sequences have been deposited in the NCBI (as of February 2023). The growing number of reported orchid plastome sequences has led to various studies of the Orchidaceae family, focusing on phylogenetics, evolution, and plastome structure [5,32,33,34,35,36,37,38]. These studies often concentrate on specific lineages such as epiphytic or non-photosynthetic orchids. There are a total of 43 genera that contain non-photosynthetic mycoheterotrophs [39]. Several plastome sequences of these types of orchids have been reported, such as Aphyllorchis [31], Corallorhiza [40], Cyrtosia [35], Epipogium [41], Gastrodia [42], Hexalectris [37], Lecanorchis [43] and Rhizantella [33]. Typical photosynthetic orchid plastomes contain 83 protein coding genes [25]. However, non-photosynthetic species such as Epipogium roseum have significantly reduced plastomes, containing only 17 protein coding genes [41]. This contraction of plastomes results in low levels of phylogenetically informative sites. Genes retained in these plastomes often exhibit elevated evolutionary rates [44], making it difficult to construct accurate phylogenetic trees and interpret their results [5,43].
For the Orchidaceae family, the mitogenome sequence has only been reported for Gastrodia elata [42] and nuclear genome sequences have only been reported for Cymbidium [45], Dendrobium [46], Gastrodia [42], and Phalaenopsis [9]. Despite active research into phylogenetic and evolutionary aspects of Orchidaceae, phylogenetic trees from different organelles are often inconsistent [43,47,48]. This might be due to factors such as incomplete lineage sorting, ancient plastome capture, and/or sampling issues [43]. To address these incongruences, more mitochondrial and nuclear genome sequence data are needed.
The species Gastrodia pubilabiata belongs to the tribe Gastrodieae of the subfamily Epidendroideae of the family Orchidaceae [24]. It is widely distributed in Korea, Japan, and Taiwan [49,50], but it is a rare and endangered species. For its closely related species, G. elata, plastome sequences along with nuclear and mitogenome sequences have been reported [42,43]. The plastome of G. elata is greatly reduced compared to that of a typical photosynthetic orchid. Flowers and stems of G. pubilabiata are pale greenish in color. In a previous study on Cymbidium macrorhizon [51] which also has pale greenish stems and flowers, the plastome was almost intact condition, although it is known as a non-photosynthetic species. Only a few ndh genes were missing from its plastome. Therefore, the extreme reduction in plastome sequence in G. elata raises the question of what type of plastome sequence will be present in G. pubilabiata which has pale greenish stems and flowers.
The current state of research on the plastome and mitogenome of non-photosynthetic orchids presents several difficulties, including a lack of parsimonious informative sites in the plastome and incongruence in phylogenetic trees based on different genomes. To address these issues, this study aimed to decode both of the plastome and mitogenome of the non-photosynthetic orchid G. pubilabiata. Through this research, organelle genomes of non-photosynthetic orchids will be better understood and IGT or HGT between different organelle genomes will be documented if it existed. In addition, any potential foreign nucleotide sequences in organelle genomes will be identified if it presented. Through this study, if any evidence of externally imported nucleotide sequences is found, the source of sequences will be investigated and the relationship with the taxon will be determined. Results of this study will contribute to the further understanding of the evolutionary modes of mitogenome in non-photosynthetic orchid species.

2. Results

2.1. Organelle Characteristics and Comparative Analysis

The plastome sequence of G. pubilabiata was a 30,698 bp in length (Table 1 and Figure 1). The GC content of the plastome was 24.9% with a depth coverage of 236.0×. Unlike other reported orchid plastome sequences such as Cypripedium or Habenaria, the plastome of G. pubilabiata was much shorter in length. In addition, it lacked the typical quadripartite structure of angiosperm plastomes, which typically consist of a large single copy (LSC) and a small single copy (SSC) separated by two inverted repeats (IRs). Instead, the plastome of G. pubilabiata, like that of G. elata, lacked IRs. As a non-photosynthetic, mycoheterotrophic orchid without any leaves, G. pubilabiata lost all photosynthetic related genes, including ndh, pet, psa, psb, and rpo as well as several housekeeping genes, atp genes, rrn5, ccsA, cemA, infA, matK, rbcL, ycf3, and ycf4. Finally, the plastome of G. pubilabiata was annotated. It had only 19 protein coding genes, three tRNA genes, and three rRNA genes.
As a result of de novo assembly, 44 mitochondrial contigs of G. pubilabiata were obtained (Table 1). The goal was to obtain a complete, master circle of the mitogenome of G. pubilabiata. However, it was proved to be challenging due to the presence of AT-rich regions, rearrangements, and the recombination. These 44 mitochondrial contigs were considered as the complete mitogenome with a total length of, 867,349 bp (Table 1 and Figure 2). The coverage depth was 150.0×, with the highest coverage being 627.2× (14,604 bp) and the lowest coverage being 40.1× (4510 bp). The GC content of the mitogenome was 42.1%. With these 44 mitochondrial contigs, a total of, 38 protein coding genes, nine tRNA genes, and three rRNA genes were annotated. Compared to other monocot mitogenomes, three genes (rpl10, sdh3, and sdh4), nad1 intron1, and nad1 intron2 were missing in the mitogenome of G. pubilabiata (Figure 2).
Repeat analysis was conducted to compare the mitogenome and plastome of G. pubilabiata. Three additional organelles were also included (the mitogenome of G. elata, the plastome of G. elata, and the plastome of G. longistyla) in this analysis. The analysis revealed a total of 437 repeats (Supplementary Table S1). Of these, 279 repeats were found in the mitogenome of G. pubilabiata, 79 repeats were shared between the mitogenomes of G. pubilabiata and G. elata, 44 repeats were shared among the three Gastrodia plastomes, 34 repeats were identified within the mitogenome of G. elata, and one repeat was shared between the G. elata plastome and mitogenome (Figure 3 and Supplementary Table S1). Additionally, 89 simple sequences repeats (SSRs) were found within the plastome of G. pubilabiata, including 16 mononucleotide SSRs, five dinucleotide SSRs, four trinucleotide SSRs, 13 tetranucleotide SSRs, and 51 pentanucleotide SSRs (Supplementary Table S2).

2.2. Foreign Sequences in the Organelle

BLASTN and BLASTP searches were performed for the plastome of G. pubilabiata to identify exotic regions. Results of the BLASTN showed that most regions had the highest matches with plastome sequences of green plants. However, there were also matches with algae, bacteria, and animal sequences in regions corresponding to rRNAs (rrn4.5, rrn5, rrn18, and rrn23). Results of the BLASTP for open reading frames (ORFs) in the G. pubilabiata plastome confirmed that most ORFs were of green plant origin. In some cases, ORFs located near or within rpl16, rrn16, rrn23, and rps12 showed a high proportion of matches with bacterial sequences in BLASTP search results.
During annotation of protein coding genes in the plastome of G. pubilabiata, no debris from other mitogenome or nuclear genomes was found. However, during annotation of mitogenome, sequences like plastome derived psbF genes were detected, which was 119 bp in length. Regarding this regions, similar regions were found in other previously reported mitogenomes. These sequences were aligned and used to construct a maximum likelihood tree (Supplementary Figures S1 and S2). In this ML tree, all mitochondrial psbF-like regions formed a monophyletic group, indicating that mitochondrial and plastome psbF genes evolved independently.
The plastome and mitogenome of G. pubilabiata were searched using local BLASTN against two databases of published angiosperm plastomes and mitogenomes. As a result, 23 blocks of the G. pubilabiata plastome were found to match with the mitogenome database. Of these 23 blocks, 17 were related to rrn16 and rrn23, while the remaining were related to clpP, rpl2, rpl16, and ycf1. In the G. pubilabiata mitogenome, 118 blocks were identified as having plastome-related sequences, 82 of which had a higher degree of similarity to mitogenomes than to the plastome database. Thirty-six blocks were found to be plastome-related blocks, largely located in intergenic spacers or rRNAs. Six regions were found to be coding regions in the plastome (ndhC, ndhJ, ndhK, psaA, rpoB, and rps4) (Supplementary Table S3). A phylogenetic tree was constructed using these regions after performing MAFFT alignment (Figure 4).

2.3. Phylogenetic Position of G. pubilabiata

The phylogenetic position of G. pubilabiata was determined using 79 protein coding regions and four rRNAs in its plastome. Results showed that G. pubilabiata, G. elata, and G. longystyla formed a monophyletic group. All nodes in the Gastrodia clade were supported by 100% bootstrap values (Figure 5). General phylogenetic position of the Orchidaceae was found to be Apostasioideae [Vanilloideae [Cypripedioideae [Orchidoideae, Epidendroideae]]].
As previously mentioned, studies decoding mitogenomes of orchids are limited. Thus, a phylogenetic tree was constructed using previously reported angiosperm mitogenomes. Results showed that G. pubilabiata and G. elata formed a monophyletic group and the genus Gastrodia formed a sister relationship with Allium cepa and Asparagus officinalis. Most species were supported by high bootstrap values (above 90%). However, species such as Beta, Daucus, Fagus, Geranium, Luffa, Nelumbo, and Prunus showed relatively lower bootstrap values. The phylogenetic tree based on the mitogenomes was in line with previously reported phylogenetic trees based on plastomes or nuclear genomes (Figure 6).

3. Discussion

3.1. Organelle Genome Evolution

The plastome of G. pubilabiata had a distinct feature in that it has extremely reduced genome size and consists of only a single copy region. This same plastome structure has previously been also reported in the same genus, G. elata [42]. Other taxa without IR include Orobanchaceae [52], Cactaceae [53], Geraniaceae [54,55], Fabaceae [56], and Gymnosperms [57,58]. In Fabaceae, a representative group of taxa lacking IR, deletions of genes or introns, rearrangements of the plastome structure, gene transfer, and hypermutations have been reported [59]. The lack of IR in the legume plastome might be a result of plastome rearrangement mediated by repetitive DNA [60]. These repeats are thought to have facilitated rearrangements by enabling recombination [55].
The plastome of G. pubilabiata contains simple sequence repeats (SSRs) located around rps2-rps14, rps2-rps3, and ycf2 regions (Supplementary Table S2). The gene trnQ-UUG is located between rps2 and rps14, and the trnC-GCA gene is situated near rps2 and rps3. Recombination of repeats and tRNAs might be the primary mechanism behind genomic rearrangements in Trachelium [61]. In a typical orchid plastome, rps2 is found near the LSC-IRa junction and rps3 is located near the LSC-IRb junction. Rps2 and rps14 are 20 kb apart. The change in position of rps2, rps3, and rps14 in G. pubilabiata is believed to be due to the presence of repeats or tRNAs in the intergenic spacer (IGS) near the relevant region. However, it is uncertain whether gene reduction, in which protein coding regions between corresponding genes are lost, precedes genome rearrangement and reduction [62]. Some studies have suggested that repetitive sequences contribute to genome rearrangement in the Geraniaceae [55]. Accumulation of SSRs and repeats in the plastome of G. pubilabiata might have also contributed to gene loss or plastome size reduction.
Several changes such as ndh gene contents, IR boundary shifts, and IR absence have been reported in plastomes of various orchid taxa [5,32,42,43,62]. For non-photosynthetic taxa, changes have also been reported in gene groups other than just ndh genes [43]. This level of change is consistent with previous reports for non-photosynthetic flowering plants [63]. In the case of G. pubilabiata, most genes have disappeared. The plastome of G. pubilabiata belongs to the fourth stage of plastome degradation pattern (Supplementary Figure S3). Other orchid taxa that belong to stages 4 of plastome degradation, including Epipogium [41], Gastrodia [42], Rhizanthella [33], and Lecanorchis [43], are also non-photosynthetic mycoheterotrophs with greatly altered plastomes. These taxa have only retained some housekeeping genes.
Flowers and stems of G. pubilabiata have a green color, which sets them apart from the other fourth stages of orchid with red, white, or yellow flowers and stems. In contrast to the other green stemmed mycoheterotrophs C. macrorhizon, which only lacks ndh genes in its plastome [51], all photosynthesis related genes have disappeared from the confirmed plastome of G. pubilabiata (Figure 1 and Supplementary Figure S3). Despite this, low level photosynthesis has been observed in flowers and stems of C. macrorhizon due to residual chlorophyll [64,65,66]. Given this and the presence of residual plastids in other non-photosynthetic taxa, it is hypothesized that there might be an alternative pathway for photosynthesis in G. pubilabiata. Further research such as RNA-seq is needed to confirm this.
Mitogenomes of flowering plants are known to be variable in structure due to the presence of repeats, rearrangements, and submolecules [67]. Mitogenomes of monocots can vary greatly in length and genetic composition (Table 2 and Supplementary Figure S4). In the genus Gastrodia, multiple mitogenomes have been reported [42]. The mitogenome of G. pubilabiata was found to have 44 mitogenomes (Table 1 and Figure 2). Unlike the plastome, which displays a significant difference in genetic composition between photosynthetic and non-photosynthetic taxa, it is believed that gene composition of the mitogenome is not significantly impacted by photosynthesis or the lack thereof.

3.2. Putative Gene Transfer

BLAST search was performed to detect any foreign sequences in the plastome and mitogenome of G. pubilabiata. The search uncovered regions with high plastome frequency having mitochondrial features and regions with high mitochondrial frequency having plastome characteristics (Figure 7). Regions with high frequency of mitogenome even in the plastome were mostly made up of rRNA regions (Supplementary Table S3). Other protein coding regions in the plastome (accD, clpP, rpl16, and rpl2) did show BLASTN results for the mitogenome. However, it was confirmed that there were many more search results for the plastome (Supplementary Table S3). The plastome might have infiltrated to the mitogenome, or there was a coincidental similarity of nucleotide sequences, rather than the presence of corresponding regions derived from the mitogenome. Reports of the presence of mitochondrial plastome sequences (MTPTs) have been documented in flowering plants [15]. A recent study has also reported the presence of plastome originating sequences in the mitogenome of Mangifera species [68].
BLASTN results for the mitogenome were like those for the plastome. Many results show the high frequency for the plastome contained regions having rRNA, such as 16 s rRNA, and 18 s rRNA, suggesting that there might have been an exchange between rRNAs of the mitogenome and the plastome. However, for some regions, BLASTN results were found for plastome regions (ndhC, ndhJ, ndhK, psaA, psbF, rpoB, and rps4), but not for mitogenome references. This suggests that gene transfer from the plastome to the mitogenome might have occurred. The mitogenome contains many photosynthesis-related genes from the plastome (ndh, psb, psa, rpo), even though G. pubilabiata is a non-photosynthetic plant and all photosynthesis related genes have been deleted from the current plastome (Figure 1 and Supplementary Figure S3). This suggests that transfer of photosynthesis related genes to the mitogenome might have occurred before the loss of photosynthetic function in plastome. Therefore, this gene transfer might have occurred a longer time ago than previously expected. Except for a few obligate mycoheterotrophic taxa, a previous study has shown that the divergence of mycoheterotrophs might have occurred earlier than 30 mya [43]. This suggests that photosynthesis-related genes might have disappeared after that time. Gene transfer might have occurred a longer than 30 mya. However, as pointed out in that previous study, research results on the plastome and mitogenome of related obligated mycoheterotrophic taxa will be needed to verify that hypothesis.
A phylogenetic tree was constructed to determine the origin of gene fragments (ndh, psa, psb, and rpo) in the mitogenome of G. pubilabiata with other major flowering plants. Results are shown in Figure 4. In the case of psbF, G. pubilabiata formed a monophyletic relationship with other mitochondrial psbF. It tended to be distinguished between the plastome and the mitogenome. This suggests that these gene fragments might have been transferred from the plastome to the mitogenome in the distant past. In the case of psaA, the mitogenome derived psaA displayed a long branch, which was believed to result from the absence of other mitogenome derived sequences with the same cause as the psbF phylogenetic tree. The phylogenetic tree using rpoB region was distinct from trees using psaA and psbF gene fragments. It did not show a long branch and rpoB sequences of G. pubilabiata from a monophyletic branch with Euonymus of Celastrales, which was supported by a bootstrap value of 100. This suggests that G. pubilabiata has a close relationship with Celastraceae plants in its vicinity. Similar relationships have been reported for parasitic plants such as Aeginetia with nearby Poaceae plants [19]. Although it is difficult to directly compare mycoheterotrophic orchid to parasitic plants, this orchid might be related to surrounding plants. It was confirmed that Celastrus and Euonymus lived some distance away, not nearby.
Unlike photosynthesis-related genes found in the mitogenome, the rps4 gene is classified as a housekeeping gene. The presence of this gene in both the plastome and mitogenome of G. pubilabiata was confirmed and a phylogenetic tree was constructed to examine its origin. The phylogenetic tree showed that both plastome and mitogenome derived sequences of the rps4 gene formed a monophyly with other orchids. However, the plastome derived sequence showed a long branch and formed a monophyly, while the mitogenome derived sequence was at the base of the clade. This difference in evolutionary rates between the two sequences was likely due to relaxed selection in degraded plastomes of mycoheterotrophic plants, which tended to drive the acceleration of gene evolution [69]. Moreover, mitochondrial datasets show much slower evolutionary rates than the plastome datasets in the previous study [70]. Thus, it could be concluded that the rps4 gene in the plastome evolved at a faster evolutionary rate than the gene in the mitogenome.
It is known that ndh genes are typically the first to be lost in the plastome degradation process of non-photosynthetic species, followed by other photosynthesis related genes [52]. This study found that ndhC, J, and K genes in the mitogenome of G. pubilabiata formed a monophyly with other orchids, unlike photosynthesis-related gene rpo, psa, or psb, which showed relationships with other taxa of flowering plants (Figure 4). This suggested that the degradation of G. pubilabiata’s plastome was more recent or that alternative pathways were involved. Reference guided assembly analysis only found a few base sequences for ndhC, J, and K in the mitogenome. No base sequence was found to support the possibility of transfer to the nuclear genome. Hence, it was assumed that ndhC, J, K genes disappeared more recently from the plastome or that the degradation of the plastome was more recent. While it was believed that non-photosynthesis in Orchidaceae developed within the past 30 mya [43], further experiments such as transcriptome analysis are needed to explore the possibility of alternative pathways.

3.3. Phylogenetic Implications

Phylogenetic relationships within the Orchidaceae are presented as Apostasioideae [Vanilloideae [Cypripedioideae [Orchidoideae, Epidendroideae]]] based on the analysis of protein coding regions and rRNAs of orchid plastomes. This finding aligns with a previous study [5]. Three species of the genus Gastrodia, G. elata, G. longyphyla, and G. pubilabiata, formed a monophyletic group and the tribe Nervilieae was the sister group to Gastrodieae (Figure 5). In the case of Gastrodieae, Nervilieae, Diurideae, and Vanilleae, containing non-photosynthetic orchids, a long branch was formed, which might be due to reduced number of genes used in the analysis or elevated mutation rates in mycoheterotrophic species. A phylogenetic tree based on mitochondrial genes indicated that G. pubilabiata was part of a monophyletic group along with other Asparagales species (Figure 6). Unlike the plastome phylogenetic tree, non-photosynthetic species exhibited limited levels of long branches.

4. Materials and Methods

4.1. Plant Material and DNA Extraction

Plant samples were collected from Jeju Island, Republic of Korea. To protect plant habitat, only a few flowers were taken from individuals of G. pubilabiata. Fresh samples were ground into powder using liquid nitrogen in a mortar. Ground samples were then used to extract genomic DNA using a G-spin II Genomic DNA extraction Kit (Intron, Seoul, Republic of Korea) following the manufacturer’s manual. The quality of the extracted DNA was checked with a UV/VIS spectrophotometer. DNA was then stored in Plant DNA bank in Republic of Korea (PDBK2020-0086). Due to the limitations of sample collection, the voucher specimen was replaced with photographs of G. pubilabiata’s in its habitat. These photographs were deposited in Korea University Herbarium (KUS) under the voucher-specimen number of KUS2020-0086 (Supplementary Table S4). Overall analysis steps and procedures adopted in this study are summarized in supplemental Figure S5.

4.2. Sequencing, Assembly and Annotations

Extracted genomic DNA of G. pubilabiata was used for sequencing using two different NGS platforms: Illumina NovaSeq (Illumina, San Diego, CA, USA) and Pacbio Sequel (Pacbio, Menlo Park, CA, USA). A total of 224,926,562 NovaSeq reads and 746,492 Pacbio Sequel reads were produced. The quality of sequencing reads was examined using FastQC v.0.11.9 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc, accessed on 29 June 2023) [71]. Raw reads from Illumina NovaSeq were about 50 gb. They were trimmed with BBduk 37.64, as implemented in Geneious 11.1.5 (length: 27 kmer) [72]. BBNorm 37.64 was used to normalize trimmed reads (target coverage level: 30; minimum depth: 12). Raw reads from Pacbio Sequel were about 5 gb. They were corrected and polished using trimmed NOVAseq reads with CANU v.1.9 [73].
Error corrected Pacbio reads were used for de novo assembly using FALCON v.0.3.0 [74,75] and CANU v.1.9. Mitogenome of Allium cepa (NC030100) and 18 mitochondrial sequences of G. elata (MF070084~MF070102) were used as references to verify putative mitochondrial contigs among de novo assembly results obtained using FALCON and CANU. Trimmed NovaSeq reads and selected contigs from FALCON and CANU results were used for hybrid de novo assembly using SPAdes v.3.14.1 [76]. Mitochondrial contigs were selected from SPAdes results by performing local BLASTN [77] and using BANDAGE v.0.8.0. [78]. The final selection of mitogenome was based on their coverage, which was obtained using hybridSPAdes as part of SPAdes v.3.14.1 [79]. The depth-coverage of mitogenomes of G. pubilabiata is represented in Supplementary Table S5.
Plastome sequences of G. elata (NC037409) and Habenaria radiata (NC035834) were used as references to identify plastome contigs from results of FALCON and CANU. Selected contigs were assembled using Geneious assembler in Geneious 11.1.5 to obtain complete plastome sequence. Trimmed NovaSeq reads were then mapped to the complete plastome sequence of G. pubilabiata’s using Geneious assembler, and the mapped NovaSeq reads were extracted. Extracted NovaSeq reads were used to perform de novo assembly using Geneious assembler to validate G. pubilabiata’s complete plastome sequence.
The complete plastome sequence of G. pubilabiata was annotated using BLASTN, tRNAscan-SE 2.0 [80], ORF finder, and the find annotation function in Geneious 11.1.5. Plastome sequences of G. elata and H. radiata were used as references. Alternative start codons (ACG and TTG) were also considered in ORF finder. Pseudogene judgement was performed based on the criteria described previously [43]. ORFs that were larger than 150 bp were translated into protein sequences. These sequences were used to perform a psi-blast search [81] using several reported Orchidaceae plastome sequences as references. A circular plastome map was generated using the OGdraw web server [82].

4.3. Phylogenetic Analysis

Ninety-two plastome sequences were downloaded from the NCBI for phylogenetic analysis (Supplementary Table S6), including 87 sequences for the Orchidaceae, four for the Asparagales, and one for Liliales. Seventy-nine protein coding genes and four rRNA genes were extracted from these plastome sequences. Each extracted gene was aligned using MAFFT v.7.450 [83]. Alignments were manually checked. All alignments were concatenated to a length of 85,749 bp and subjected to jModeltest in the CIPRES Science Gateway to determine the best fit model [84,85]. The best fit model was determined to GTR + I + G. Missing genes were treated as missing data. A maximum likelihood (ML) tree was constructed using RaxML-HPC2 on XSEDE in CIPRES Science Gateway with 100 bootstrap replicates [86]. The result ML tree was visualized using Treegraph 2.15.0-887 [87].
Thirty-eight mitogenome sequences were downloaded from the NCBI (Supplementary Table S6). Thirty-five coding genes and three rRNA genes were extracted from these mitogenomes. Each gene was aligned using MAFFT v.7.450. All alignments were manually checked. Alignments were concatenated to a total length of 51,862 bp and then subjected to jModeltest in CIPRES Science Gateway to determine the best model. An ML tree was constructed using RaxML-HPC2 on XSEDE in CIPRES Science Gateway with a GTR + I + G model and 100 bootstrap replicates. The obtained ML tree was graphically represented using Treegraph 2.15.0-887.
Eighteen mitochondrial sequences of G. elata (MF070084~MF070102) and two plastome sequences of G. elata (NC037409) and G. longistyla (MW879162) were used to compare organelles among the genus Gastrodia. Simple sequence repeats (SSRs) were identified using Phobos 3.3.12 implemented in Geneious 11.1.5 [88]. REPuter (https://bibiserv.cebitec.uni-bielefeld.de/reputer, accessed on 28 June 2023) was used to distinguish palindromic repeats among organelles with default options [89]. The repeat finder in Geneious 11.1.5 was used to find repeats larger than 50 bp with a perfect match option. Repeat information was summarized and presented graphically using Circos v.0.69-9 [90].
A total of 5199 reported angiosperm plastome sequences and 219 mitogenome sequences were downloaded from the NCBI and used as a local BLAST database. Plastome and mitogenome sequences of G. pubilabiata were divided into 150 bp fragments, which were then used to perform local BLASTN to identify potential instances of IGT or HGT. Results were summarized in terms of chloroplast counts and mitochondria counts based on the number of results generated. Summarized data were visualized using ggplot2 in R [91]. Mitogenome regions skewed toward to plastome in BLASTN results were carefully evaluated, and also plastome regions skewed toward to mitogenome were evaluated (Supplementary Table S3). Based on these data, mitochondrial plastome (MTPT) sequences were extracted to construct a phylogenetic tree. Fifty-three plastome sequences of the main angiosperm lineage were used as references for this phylogenetic tree. For psaA and rpoB genes, a total of 301 Malpighiales and Celastrales plastome sequences were used as references to construct the ML tree. All obtained ML trees were visualized using Treegraph2. In order to estimate the HGT time, the divergence time of the Gastodea lineage were adopted from the previous study [43].

5. Conclusions

In this research we decode the entire plastome and mitogenome from a non-photosynthetic and mycoheterotrophic orchid species, G. pubilabiata. As expected, the plastome is dramatically reduced in size to be only 30,698 bp. This reduction has led to the loss of most photosynthesis-related genes and some housekeeping genes in the plastome, which now only contains 19 protein coding genes, three tRNAs, and three rRNAs. Therefore, the chloroplast lost its function and the genome faced strong relaxed selection pressure. In contrast, the mitogenome of G. pubilabiata is 867,349 bp in length and contained 38 protein coding genes, nine tRNAs, and three rRNAs. The gene content of the G. pubilabiata mitogenome is similar to the typical plant mitogenomes, even though the genome size is twice larger than the typical ones. Plastid rRNA fragments were found at a high frequency in the mitogenome. In addition, seven plastid protein coding gene fragments (ndhC, ndhJ, ndhK, psaA, psbF, rpoB, and rps4) were also identified in the mitogenome of G. pubilabiata. This horizontal gene transfer (HGT) from plastome to mitogenome occurred before losses of photosynthesis related genes from the chloroplast of G. pubilabiata. The HGT have occurred approximately 30 million years ago. This is the first documentation of the HGT between two cellular organelles of the orchid family. Furthermore, this is the first report for the HGT time from the orchid family. This finding will expand our knowledge for understanding of the orchid genome evolution. We expect more similar results from the orchid family which contains a wide range of nonphotosynthetic species.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms241411448/s1.

Author Contributions

Conceptualization, K.-J.K.; methodology, Y.-K.K.; software, Y.-K.K.; validation, Y.-K.K. and K.-J.K.; formal analysis, Y.-K.K., S.J., S.-H.C. and J.-R.H.; investigation, Y.-K.K.; resources, K.-J.K.; data curation, Y.-K.K., S.J., S.-H.C. and J.-R.H.; writing—original draft, Y.-K.K.; writing—review and editing, K.-J.K.; supervision, K.-J.K.; project administration, K.-J.K.; funding acquisition, Y.-K.K. and K.-J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by National Research Foundation of Korea, NRF-2021R1A2C1013731 to K.-J.K. and NRF-2020R1A6A3A01100512 to Y.-K.K.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The newly sequenced plastome and mitogenome of G. pubilabiata were submitted to GenBank and the accession number are OR004100-OR004143 for mitogenome and OR031839 for plastome, respectively.

Acknowledgments

We thank two anonymous reviewers for their valuable comment for the manuscript improvement. We also thank PDBK (Plant DNA Bank of Korea) for DNA curation and KUS (Korea University Herbarium) for voucher information curation, respectively.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

LSCLarge Single Copy
SSCSmall Single Copy
IRInverted Repeat
MLMaximum Likelihood
HGTHorizontal Gene Transfer
IGTIntracellular Gene Transfer
SSRSimple Sequence Repeat
MTPTMitochondrial Plastome sequences

References

  1. Paajanen, P.; Kettleborough, G.; López-Girona, E.; Giolai, M.; Heavens, D.; Baker, D.; Lister, A.; Cugliandolo, F.; Wilde, G.; Hein, I.; et al. A critical comparison of technologies for a plant genome sequencing project. Gigascience 2019, 8, giy163. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Shirasawa, K.; Harada, D.; Hirakawa, H.; Isobe, S.; Kole, C. Chromosome-level de novo genome assemblies of over 100 plant species. Breed. Sci. 2021, 71, 117–124. [Google Scholar] [CrossRef] [PubMed]
  3. Goremykin, V.V.; Hirsch-Ernst, K.I.; Wölfl, S.; Hellwig, F.H. Analysis of the Amborella trichopoda chloroplast genome sequence suggests that Amborella is not a basal angiosperm. Mol. Biol. Evol. 2003, 20, 1499–1505. [Google Scholar] [CrossRef] [PubMed]
  4. Cai, Z.; Penaflor, C.; Kuehl, J.V.; Leebens-Mack, J.; Carlson, J.E.; DePamphilis, C.W.; Boore, J.L.; Jansen, R.K. Complete plastid genome sequences of Drimys, Liriodendron, and Piper: Implications for the phylogenetic relationships of magnoliids. BMC Evol. Biol. 2006, 6, 77. [Google Scholar] [CrossRef] [Green Version]
  5. Givnish, T.J.; Spalink, D.; Ames, M.; Lyon, S.P.; Hunter, S.J.; Zuluaga, A.; Iles, W.J.D.; Clements, M.A.; Arroyo, M.T.K.; Leebens-Mack, J.; et al. Orchid phylogenomics and multiple drivers of their extraordinary diversification. Proc. R. Soc. B Biol. Sci. 2015, 282, 20151553. [Google Scholar] [CrossRef]
  6. Sun, M.; Naeem, R.; Su, J.X.; Cao, Z.Y.; Burleigh, J.G.; Soltis, P.S.; Soltis, D.E.; Chen, Z.D. Phylogeny of the Rosidae: A dense taxon sampling analysis. J. Syst. Evol. 2016, 54, 363–391. [Google Scholar] [CrossRef] [Green Version]
  7. DePamphilis, C.W.; Palmer, J.D.; Rounsley, S.; Sankoff, D.; Schuster, S.C.; Ammiraju, J.S.S.; Barbazuk, W.B.; Chamala, S.; Chanderbali, A.S.; Determann, R.; et al. The Amborella genome and the evolution of flowering plants. Science 2013, 342, 1241089. [Google Scholar] [CrossRef]
  8. Chen, J.; Huang, Q.; Gao, D.; Wang, J.; Lang, Y.; Liu, T.; Li, B.; Bai, Z.; Luis Goicoechea, J.; Liang, C.; et al. Whole-genome sequencing of Oryza brachyantha reveals mechanisms underlying Oryza genome evolution. Nat. Commun. 2013, 4, 1595. [Google Scholar] [CrossRef] [Green Version]
  9. Liu, Z.J. The genome sequence of the orchid Phalaenopsis equestris. Nat. Genet. 2015, 47, 65–72. [Google Scholar] [CrossRef] [Green Version]
  10. Mayer, K.F.X.; Marcussen, T.; Sandve, S.R.; Heier, L.; Pfeifer, M.; Kugler, K.G.; Zhan, B.; Spannagl, M.; Pfeifer, M.; Jakobsen, K.S.; et al. A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome Ancient hybridizations among the ancestral genomes of bread wheat Genome interplay in the grain transcriptome of hexaploid bread wheat Structural and functional pa. Science 2014, 345, 1250092. [Google Scholar]
  11. Wu, Z.; Sloan, D.B. Recombination and intraspecific polymorphism for the presence and absence of entire chromosomes in mitochondrial genomes. Heredity 2019, 122, 647–659. [Google Scholar] [CrossRef]
  12. Lin, Y.; Li, P.; Zhang, Y.; Akhter, D.; Pan, R.; Fu, Z.; Huang, M.; Li, X.; Feng, Y. Unprecedented organelle genomic variations in morning glories reveal independent evolutionary scenarios of parasitic plants and the diversification of plant mitochondrial complexes. BMC Biol. 2022, 20, 49. [Google Scholar] [CrossRef]
  13. Skippingtona, E.; Barkmanb, T.J.; Ricea, D.W.; Palmera, J.D. Miniaturized mitogenome of the parasitic plant viscum scurruloideum is extremely divergent and dynamic and has lost all nad genes. Proc. Natl. Acad. Sci. USA 2015, 112, E3515–E3524. [Google Scholar] [CrossRef]
  14. Sloan, D.B.; Alverson, A.J.; Chuckalovcak, J.P.; Wu, M.; McCauley, D.E.; Palmer, J.D.; Taylor, D.R. Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates. PLoS Biol. 2012, 10, e1001241. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Sloan, D.B.; Wu, Z. History of plastid DNA insertions reveals weak deletion and AT mutation biases in angiosperm mitochondrial genomes. Genome Biol. Evol. 2014, 6, 3210–3221. [Google Scholar] [CrossRef] [PubMed]
  16. Wynn, E.L.; Christensen, A.C. Repeats of unusual size in plant mitochondrial genomes: Identification, incidence and evolution. G3 Genes Genomes Genet. 2019, 9, 549–559. [Google Scholar] [CrossRef] [Green Version]
  17. Gandini, C.L.; Garcia, L.E.; Abbona, C.C.; Sanchez-Puerta, M.V. The complete organelle genomes of Physochlaina orientalis: Insights into short sequence repeats across seed plant mitochondrial genomes. Mol. Phylogenet. Evol. 2019, 137, 274–284. [Google Scholar] [CrossRef]
  18. Kim, H.T.; Kim, J.S. Structural mutations in the organellar genomes of Valeriana sambucifolia f. dageletiana (Nakai. ex Maekawa) hara show dynamic gene transfer. Int. J. Mol. Sci. 2021, 22, 3770. [Google Scholar] [CrossRef]
  19. Choi, K.S.; Park, S. Complete plastid and mitochondrial genomes of Aeginetia indica reveal intracellular gene transfer (IGT), horizontal gene transfer (HGT), and cytoplasmic male sterility (cms). Int. J. Mol. Sci. 2021, 22, 6143. [Google Scholar] [CrossRef]
  20. Sinn, B.T.; Barrett, C.F. Ancient Mitochondrial Gene Transfer between Fungi and the Orchids. Mol. Biol. Evol. 2020, 37, 44–57. [Google Scholar] [CrossRef]
  21. Rice, D.W.; Alverson, A.J.; Richardson, A.O.; Young, G.J.; Sanchez-Puerta, M.V.; Munzinger, J.; Barry, K.; Boore, J.L.; Zhang, Y.; DePamphilis, C.W.; et al. Horizontal transfer of entire genomes via mitochondrial fusion in the angiosperm Amborella. Science 2013, 342, 1468–1473. [Google Scholar] [CrossRef] [Green Version]
  22. Sanchez-Puerta, M.V.; Edera, A.; Gandini, C.L.; Williams, A.V.; Howell, K.A.; Nevill, P.G.; Small, I. Genome-scale transfer of mitochondrial DNA from legume hosts to the holoparasite Lophophytum mirabile (Balanophoraceae). Mol. Phylogenet. Evol. 2019, 132, 243–250. [Google Scholar] [CrossRef] [Green Version]
  23. Christenhusz, M.J.M.; Byng, J.W. The number of known plants species in the world and its annual increase. Phytotaxa 2016, 261, 201–217. [Google Scholar] [CrossRef] [Green Version]
  24. Chase, M.W.; Cameron, K.M.; Freudenstein, J.V.; Pridgeon, A.M.; Salazar, G.; van den Berg, C.; Schuiteman, A. An updated classification of Orchidaceae. Bot. J. Linn. Soc. 2015, 177, 151–174. [Google Scholar] [CrossRef] [Green Version]
  25. Chang, C.C.; Lin, H.C.; Lin, I.P.; Chow, T.Y.; Chen, H.H.; Chen, W.H.; Cheng, C.H.; Lin, C.Y.; Liu, S.M.; Chang, C.C.; et al. The chloroplast genome of Phalaenopsis aphrodite (Orchidaceae): Comparative analysis of evolutionary rate with that of grasses and its phylogenetic implications. Mol. Biol. Evol. 2006, 23, 279–291. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Barrett, C.F.; Davis, J.I. The plastid genome of the mycoheterotrophic Corallorhiza striata (Orchidaceae) is in the relatively early stages of degradation. Am. J. Bot. 2012, 99, 1513–1523. [Google Scholar] [CrossRef] [PubMed]
  27. Barrett, C.F.; Wicke, S.; Sass, C. Dense infraspecific sampling reveals rapid and independent trajectories of plastome degradation in a heterotrophic orchid complex. New Phytol. 2018, 218, 1192–1204. [Google Scholar] [CrossRef] [Green Version]
  28. Yang, J.-B.; Tang, M.; Li, H.-T.; Zhang, Z.-R.; Li, D.-Z. Complete chloroplast genome of the genus Cymbidium: Lights into the species identification, phylogenetic implications and population genetic analyses. BMC Evol. Biol. 2013, 13, 84. [Google Scholar] [CrossRef] [Green Version]
  29. Niu, Z.; Xue, Q.; Zhu, S.; Sun, J.; Liu, W.; Ding, X. The Complete Plastome Sequences of Four Orchid Species: Insights into the Evolution of the Orchidaceae and the Utility of Plastomic Mutational Hotspots. Front. Plant Sci. 2017, 8, 715. [Google Scholar] [CrossRef] [Green Version]
  30. Li, Z.H.; Ma, X.; Wang, D.Y.; Li, Y.X.; Wang, C.W.; Jin, X.H. Evolution of plastid genomes of Holcoglossum (Orchidaceae) with recent radiation. BMC Evol. Biol. 2019, 19, 63. [Google Scholar] [CrossRef]
  31. Feng, Y.L.; Wicke, S.; Li, J.W.; Han, Y.; Lin, C.S.; Li, D.Z.; Zhou, T.T.; Huang, W.C.; Huang, L.Q.; Jin, X.H. Lineage-specific reductions of plastid genomes in an orchid tribe with partially and fully mycoheterotrophic species. Genome Biol. Evol. 2016, 8, 2164–2175. [Google Scholar] [CrossRef] [PubMed]
  32. Lin, C.S.; Chen, J.J.W.; Huang, Y.T.; Chan, M.T.; Daniell, H.; Chang, W.J.; Hsu, C.T.; Liao, D.C.; Wu, F.H.; Lin, S.Y.; et al. The location and translocation of ndh genes of chloroplast origin in the Orchidaceae family. Sci. Rep. 2015, 5, 9040. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Delannoy, E.; Fujii, S.; Colas Des Francs-Small, C.; Brundrett, M.; Small, I. Rampant Gene loss in the underground orchid Rhizanthella gardneri highlights evolutionary constraints on plastid genomes. Mol. Biol. Evol. 2011, 28, 2077–2086. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Kim, J.S.; Kim, H.T.; Kim, J.-H. The largest plastid genome of monocots: A novel genome type containing AT residue repeats in the slipper orchid Cypripedium japonicum. Plant Mol. Biol. Rep. 2015, 33, 1210–1220. [Google Scholar] [CrossRef]
  35. Kim, Y.K.; Jo, S.; Cheon, S.H.; Joo, M.J.; Hong, J.R.; Kwak, M.H.; Kim, K.J. Extensive losses of photosynthesis genes in the plastome of a mycoheterotrophic orchid, Cyrtosia septentrionalis (Vanilloideae: Orchidaceae). Genome Biol. Evol. 2019, 11, 565–571. [Google Scholar] [CrossRef] [Green Version]
  36. Freudenstein, J.V.; Chase, M.W.M. Phylogenetic relationships in Epidendroideae (Orchidaceae), one of the great flowering plant radiations: Progressive specialization and diversification. Ann. Bot. 2015, 115, 665–681. [Google Scholar] [CrossRef] [Green Version]
  37. Barrett, C.F.; Kennedy, A.H. Plastid genome degradation in the endangered, mycoheterotrophic, North American orchid Hexalectris warnockii. Genome Biol. Evol. 2018, 10, 1657–1662. [Google Scholar] [CrossRef]
  38. Guo, Y.Y.; Yang, J.X.; Bai, M.Z.; Zhang, G.Q.; Liu, Z.J. The chloroplast genome evolution of Venus slipper (Paphiopedilum): IR expansion, SSC contraction, and highly rearranged SSC regions. BMC Plant Biol. 2021, 21, 248. [Google Scholar] [CrossRef]
  39. Merckx, V.S.F.T.; Freudenstein, J.V.; Kissling, J.; Christenhusz, M.J.M.; Stotler, R.E.; Crandall-Stotler, B.; Wickett, N.; Rudall, P.J.; Maas-van de Kamer, H.; Maas, P.J.M. Taxonomy and Classification BT—Mycoheterotrophy: The Biology of Plants Living on Fungi; Merckx, V., Ed.; Springer: New York, NY, USA, 2013; pp. 19–101. ISBN 978-1-4614-5209-6. [Google Scholar]
  40. Barrett, C.F.; Freudenstein, J.V.; Li, J.; Mayfield-Jones, D.R.; Perez, L.; Pires, J.C.; Santos, C. Investigating the path of plastid genome degradation in an early-transitional clade of heterotrophic orchids, and implications for heterotrophic angiosperms. Mol. Biol. Evol. 2014, 31, 3095–3112. [Google Scholar] [CrossRef] [Green Version]
  41. Schelkunov, M.I.; Shtratnikova, V.Y.; Nuraliev, M.S.; Selosse, M.A.; Penin, A.A.; Logacheva, M.D. Exploring the limits for reduction of plastid genomes: A case study of the mycoheterotrophic orchids Epipogium aphyllum and Epipogium roseum. Genome Biol. Evol. 2015, 7, 1179–1191. [Google Scholar] [CrossRef] [Green Version]
  42. Yuan, Y.; Jin, X.; Liu, J.; Zhao, X.; Zhou, J.; Wang, X.; Wang, D.; Lai, C.; Xu, W.; Huang, J.; et al. The Gastrodia elata genome provides insights into plant adaptation to heterotrophy. Nat. Commun. 2018, 9, 1615. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Kim, Y.K.; Jo, S.; Cheon, S.H.; Joo, M.J.; Hong, J.R.; Kwak, M.; Kim, K.J. Plastome Evolution and Phylogeny of Orchidaceae, With 24 New Sequences. Front. Plant Sci. 2020, 11, 22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Lam, V.K.Y.; Darby, H.; Merckx, V.S.F.T.; Lim, G.; Yukawa, T.; Neubig, K.M.; Abbott, J.R.; Beatty, G.E.; Provan, J.; Soto Gomez, M.; et al. Phylogenomic inference in extremis: A case study with mycoheterotroph plastomes. Am. J. Bot. 2018, 105, 480–494. [Google Scholar] [CrossRef] [Green Version]
  45. Yang, F.X.; Gao, J.; Wei, Y.L.; Ren, R.; Zhang, G.Q.; Lu, C.Q.; Jin, J.P.; Ai, Y.; Wang, Y.Q.; Chen, L.J.; et al. The genome of Cymbidium sinense revealed the evolution of orchid traits. Plant Biotechnol. J. 2021, 19, 2501–2516. [Google Scholar] [CrossRef]
  46. Zhang, G.-Q.; Xu, Q.; Bian, C.; Tsai, W.-C.; Yeh, C.-M.; Liu, K.-W.; Yoshida, K.; Zhang, L.-S.; Chang, S.-B.; Chen, F. The Dendrobium catenatum Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution. Sci. Rep. 2016, 6, 19029. [Google Scholar] [CrossRef] [Green Version]
  47. Zhang, G.Q.; Chen, G.Z.; Chen, L.J.; Zhai, J.W.; Huang, J.; Wu, X.Y.; Li, M.H.; Peng, D.H.; Rao, W.H.; Liu, Z.J.; et al. Phylogenetic incongruence in Cymbidium orchids. Plant Divers. 2021, 43, 452–461. [Google Scholar] [CrossRef] [PubMed]
  48. Li, Y.X.; Li, Z.H.; Schuitman, A.; Chase, M.W.; Li, J.W.; Huang, W.C.; Hidayat, A.; Wu, S.S.; Jin, X.H. Phylogenomics of Orchidaceae based on plastid and mitochondrial genomes. Mol. Phylogenet. Evol. 2019, 139, 106540. [Google Scholar] [CrossRef] [PubMed]
  49. Flora of Korea Editorial Committee; Park, C.-W. The Genera of Vascular Plants of Korea; Academy Publisher: Seoul, Republic of Korea, 2007; ISBN 9788976163806. [Google Scholar]
  50. Chen, X.Q.; Liu, Z.J.; Zhu, G.H.; Lang, K.Y.; Ji, Z.H.; Luo, Y.B.; Jin, X.H.; Cribb, P.J.; Wood, J.J.; Gale, S.W. Flora of China: Orchidaceae. Chen XQ Wood JJ 2009, 175, 477. [Google Scholar]
  51. Kim, H.T.; Shin, C.H.; Sun, H.; Kim, J.H. Sequencing of the plastome in the leafless green mycoheterotroph Cymbidium macrorhizon helps us to understand an early stage of fully mycoheterotrophic plastome structure. Plant Syst. Evol. 2018, 304, 245–258. [Google Scholar] [CrossRef]
  52. Wicke, S.; Muller, K.F.; de Pamphilis, C.W.; Quandt, D.; Wickett, N.J.; Zhang, Y.; Renner, S.S.; Schneeweiss, G.M. Mechanisms of Functional and Physical Genome Reduction in Photosynthetic and Nonphotosynthetic Parasitic Plants of the Broomrape Family. Plant Cell 2013, 25, 3711–3725. [Google Scholar] [CrossRef] [Green Version]
  53. Sanderson, M.J.; Copetti, D.; Burquez, A.; Bustamante, E.; Charboneau, J.L.M.; Eguiarte, L.E.; Kumar, S.; Lee, H.O.; Lee, J.; McMahon, M.; et al. Exceptional reduction of the plastid genome of saguaro cactus (Carnegiea gigantea): Loss of the ndh gene suite and inverted repeat. Am. J. Bot. 2015, 102, 1115–1127. [Google Scholar] [CrossRef] [Green Version]
  54. Guisinger, M.M.; Kuehl, J.V.; Boore, J.L.; Jansen, R.K. Extreme reconfiguration of plastid genomes in the angiosperm family Geraniaceae: Rearrangements, repeats, and codon usage. Mol. Biol. Evol. 2011, 28, 583–600. [Google Scholar] [PubMed] [Green Version]
  55. Blazier, J.C.; Jansen, R.K.; Mower, J.P.; Govindu, M.; Zhang, J.; Weng, M.L.; Ruhlman, T.A. Variable presence of the inverted repeat and plastome stability in Erodium. Ann. Bot. 2016, 117, 1209–1220. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Palmer, J.D.; Thompson, W.F. Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost. Cell 1982, 29, 537–550. [Google Scholar] [CrossRef] [PubMed]
  57. Guo, W.; Grewe, F.; Cobo-Clark, A.; Fan, W.; Duan, Z.; Adams, R.P.; Schwarzbach, A.E.; Mower, J.P. Predominant and substoichiometric isomers of the plastid genome coexist within Juniperus plants and have shifted multiple times during cupressophyte evolution. Genome Biol. Evol. 2014, 6, 580–590. [Google Scholar] [CrossRef]
  58. Raubeson, L.A.; Jansen, R.K. A rare chloroplast-DNA structural mutation is shared by all conifers. Biochem. Syst. Ecol. 1992, 20, 17–24. [Google Scholar] [CrossRef]
  59. Moghaddam, M.; Ohta, A.; Shimizu, M.; Terauchi, R.; Kazempour-Osaloo, S. The complete chloroplast genome of Onobrychis gaubae (Fabaceae-Papilionoideae): Comparative analysis with related IR-lacking clade species. BMC Plant Biol. 2022, 22, 75. [Google Scholar]
  60. Wu, S.; Chen, J.; Li, Y.; Liu, A.; Li, A.; Yin, M.; Shrestha, N.; Liu, J.; Ren, G. Extensive genomic rearrangements mediated by repetitive sequences in plastomes of Medicago and its relatives. BMC Plant Biol. 2021, 21, 421. [Google Scholar] [CrossRef]
  61. Haberle, R.C.; Fourcade, H.M.; Boore, J.L.; Jansen, R.K. Extensive rearrangements in the chloroplast genome of Trachelium caeruleum are associated with repeats and tRNA genes. J. Mol. Evol. 2008, 66, 350–361. [Google Scholar] [CrossRef]
  62. Kim, H.T.; Kim, J.S.; Moore, M.J.; Neubig, K.M.; Williams, N.H.; Whitten, W.M.; Kim, J.H. Seven new complete plastome sequences reveal rampant independent loss of the ndh gene family across orchids and associated instability of the inverted repeat/small single-copy region boundaries. PLoS ONE 2015, 10, e0142215. [Google Scholar] [CrossRef]
  63. Wicke, S.; Müller, K.F.; dePamphilis, C.W.; Quandt, D.; Bellot, S.; Schneeweiss, G.M. Mechanistic model of evolutionary rate variation en route to a nonphotosynthetic lifestyle in plants. Proc. Natl. Acad. Sci. USA 2016, 113, 9045–9050. [Google Scholar] [CrossRef] [PubMed]
  64. Suetsugu, K.; Ohta, T.; Tayasu, I. Partial mycoheterotrophy in the leafless orchid Cymbidium macrorhizon. Am. J. Bot. 2018, 105, 1595–1600. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Blumenfeld, H. Beitrage zur Physiologie des Wurzepilzes von Limodorum abortivum (L.) Sw; Universitat Basel: Basel, Switzerland, 1935. [Google Scholar]
  66. Girlanda, M.; Selosse, M.A.; Cafasso, D.; Brilli, F.; Delfine, S.; Fabbian, R.; Ghignone, S.; Pinelli, P.; Segreto, R.; Loreto, F.; et al. Inefficient photosynthesis in the Mediterranean orchid Limodorum abortivum is mirrored by specific association to ectomycorrhizal Russulaceae. Mol. Ecol. 2006, 15, 491–504. [Google Scholar] [CrossRef] [PubMed]
  67. Sloan, D.B. One ring to rule them all? Genome sequencing provides new insights into the “master circle” model of plant mitochondrial DNA structure. New Phytol. 2013, 200, 978–985. [Google Scholar] [CrossRef]
  68. Niu, Y.; Gao, C.; Liu, J. Complete mitochondrial genomes of three Mangifera species, their genomic structure and gene transfer from chloroplast genomes. BMC Genom. 2022, 23, 147. [Google Scholar] [CrossRef]
  69. Kim, Y.; Cheon, S.; Hong, J.; Kim, K. Evolutionary Patterns of the Chloroplast Genome in Vanilloid Orchids (Vanilloideae, Orchidaceae). Int. J. Mol. Sci. 2023, 24, 3808. [Google Scholar]
  70. Fonseca, L.H.M.; Lohmann, L.G. Exploring the potential of nuclear and mitochondrial sequencing data generated through genome-skimming for plant phylogenetics: A case study from a clade of neotropical lianas. J. Syst. Evol. 2020, 58, 18–32. [Google Scholar] [CrossRef]
  71. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 28 June 2023).
  72. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef] [Green Version]
  73. Koren, S.; Walenz, B.P.; Berlin, K.; Miller, J.R.; Bergman, N.H.; Phillippy, A.M. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017, 27, 722–736. [Google Scholar] [CrossRef] [Green Version]
  74. Chin, C.-S.; Peluso, P.; Sedlazeck, F.J.; Nattestad, M.; Concepcion, G.T.; Clum, A.; Dunn, C.; O’Malley, R.; Figueroa-Balderas, R.; Morales-Cruz, A. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 2016, 13, 1050–1054. [Google Scholar] [CrossRef] [Green Version]
  75. Chin, C.-S.; Alexander, D.H.; Marks, P.; Klammer, A.A.; Drake, J.; Heiner, C.; Clum, A.; Copeland, A.; Huddleston, J.; Eichler, E.E. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 2013, 10, 563–569. [Google Scholar] [PubMed]
  76. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [PubMed] [Green Version]
  77. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [PubMed]
  78. Wick, R.R.; Schultz, M.B.; Zobel, J.; Holt, K.E. Bandage: Interactive visualization of de novo genome assemblies. Bioinformatics 2015, 31, 3350–3352. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  79. Antipov, D.; Korobeynikov, A.; McLean, J.S.; Pevzner, P.A. HybridSPAdes: An algorithm for hybrid assembly of short and long reads. Bioinformatics 2016, 32, 1009–1015. [Google Scholar] [CrossRef] [Green Version]
  80. Lowe, T.M.; Chan, P.P. tRNAscan-SE On-line: Integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016, 44, W54–W57. [Google Scholar] [CrossRef]
  81. Altschul, S.F.; Madden, T.L.; Schäffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar]
  82. Lohse, M.; Drechsel, O.; Bock, R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 2007, 52, 267–274. [Google Scholar] [CrossRef]
  83. Katoh, K.; Misawa, K.; Kuma, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef] [Green Version]
  84. Darriba, D.; Taboada, G.L.; Doallo, R.; Posada, D. jModelTest 2: More models, new heuristics and parallel computing. Nat. Methods 2012, 9, 772. [Google Scholar] [CrossRef] [Green Version]
  85. Miller, M.A.; Pfeiffer, W.; Schwartz, T. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. In Proceedings of the 2010 Gateway Computing Environments Workshop (GCE), New Orleans, LA, USA, 14 November 2010; IEEE: New York, NY, USA, 2010; pp. 1–8. [Google Scholar]
  86. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  87. Stöver, B.C.; Müller, K.F. TreeGraph 2: Combining and visualizing evidence from different phylogenetic analyses. BMC Bioinformatics 2010, 11, 7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  88. Mayer, C. A Tandem Repeat Search Program, Phobos Version 3.3.12. Available online: http://www.rub.de/ecoevo/cm/cm_phobos.htm (accessed on 28 June 2023).
  89. Kurtz, S. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001, 29, 4633–4642. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  90. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  91. Wickham, H. ggplot2. Wiley Interdiscip. Rev.-Comput. Stat. 2011, 3, 180–185. [Google Scholar] [CrossRef]
Figure 1. The complete plastome of G. pubilabiata. The plastome is 30.698 bp in length and lack IR region. It contains 19 protein coding genes, three tRNA genes, and three rRNA genes.
Figure 1. The complete plastome of G. pubilabiata. The plastome is 30.698 bp in length and lack IR region. It contains 19 protein coding genes, three tRNA genes, and three rRNA genes.
Ijms 24 11448 g001
Figure 2. The complete mitogenome of G. pubilabiata. It is composed of 44 mitochondrial contigs. A total length of mitogenome is 867,349 bp in length and the genome include 38 protein coding genes, nine tRNA genes, and three rRNA genes.
Figure 2. The complete mitogenome of G. pubilabiata. It is composed of 44 mitochondrial contigs. A total length of mitogenome is 867,349 bp in length and the genome include 38 protein coding genes, nine tRNA genes, and three rRNA genes.
Ijms 24 11448 g002
Figure 3. Results of repeat analysis between Gastrodia organelle genomes. The blue lines represent repeats between mitogenomes of different species, the green lines represent repeats between plastomes of different species, the grey lines indicate repeats between mitogenomes of the same species, and the red lines indicate repeats between plastomes and mitogenomes, respectively.
Figure 3. Results of repeat analysis between Gastrodia organelle genomes. The blue lines represent repeats between mitogenomes of different species, the green lines represent repeats between plastomes of different species, the grey lines indicate repeats between mitogenomes of the same species, and the red lines indicate repeats between plastomes and mitogenomes, respectively.
Ijms 24 11448 g003
Figure 4. A maximum likelihood (ML) phylogenetic tree of six regions found to be plastid-origin coding regions in the mitogenome (ndhC, ndhJ, ndhK, psaA, rpoB, and rps4). The position of G. pubilabiata based on mitogenome is highlighted in red. The position of G. pubilabiata based on plastome is highlighted in green.
Figure 4. A maximum likelihood (ML) phylogenetic tree of six regions found to be plastid-origin coding regions in the mitogenome (ndhC, ndhJ, ndhK, psaA, rpoB, and rps4). The position of G. pubilabiata based on mitogenome is highlighted in red. The position of G. pubilabiata based on plastome is highlighted in green.
Ijms 24 11448 g004
Figure 5. A maximum likelihood (ML) phylogenetic tree inferred from 92 species (87 species from Orchidaceae, four species from Asparagales, and one species from Liliales were used). The nucleotide sequences of 79 protein coding genes and four rRNA genes were aligned independently and concatenated to be a single data matrix. The alignment was 85,749 bp and the tree was constructed by RaxML with the GTR + I + G model (−621,773.060310 of ML value). The scientific name of G. pubilabiata is highlighted with red color. The numbers above or below the nodes are the bootstrap values. Asterisk (*) mark above or below the nodes indicates the bootstrap value of 100.
Figure 5. A maximum likelihood (ML) phylogenetic tree inferred from 92 species (87 species from Orchidaceae, four species from Asparagales, and one species from Liliales were used). The nucleotide sequences of 79 protein coding genes and four rRNA genes were aligned independently and concatenated to be a single data matrix. The alignment was 85,749 bp and the tree was constructed by RaxML with the GTR + I + G model (−621,773.060310 of ML value). The scientific name of G. pubilabiata is highlighted with red color. The numbers above or below the nodes are the bootstrap values. Asterisk (*) mark above or below the nodes indicates the bootstrap value of 100.
Ijms 24 11448 g005
Figure 6. A maximum likelihood (ML) phylogenetic tree inferred from 38 species. The nucleotide sequences of 35 protein coding genes and three rRNA genes were aligned independently and concatenated to be a single data matrix. The alignment was 51,862 bp and the tree was constructed by RaxML with the GTR + I + G model (−318,584.844612 of ML value). The scientific name of G. pubilabiata is highlighted with red color. The numbers above or below the nodes are the bootstrap values and the asterisk (*) mark above or below the nodes indicate the bootstrap value of 100.
Figure 6. A maximum likelihood (ML) phylogenetic tree inferred from 38 species. The nucleotide sequences of 35 protein coding genes and three rRNA genes were aligned independently and concatenated to be a single data matrix. The alignment was 51,862 bp and the tree was constructed by RaxML with the GTR + I + G model (−318,584.844612 of ML value). The scientific name of G. pubilabiata is highlighted with red color. The numbers above or below the nodes are the bootstrap values and the asterisk (*) mark above or below the nodes indicate the bootstrap value of 100.
Ijms 24 11448 g006
Figure 7. A dot-plot based on local BLASTN search results. Blue triangular dots represent plastome-origin sequences. Red circular dots indicate mitogenome-origin sequences. Irregular search results are highlighted with gray boxes.
Figure 7. A dot-plot based on local BLASTN search results. Blue triangular dots represent plastome-origin sequences. Red circular dots indicate mitogenome-origin sequences. Irregular search results are highlighted with gray boxes.
Ijms 24 11448 g007
Table 1. The general features of G. pubilabiata’s plastome and mitogenome.
Table 1. The general features of G. pubilabiata’s plastome and mitogenome.
PlastomeMitogenome
Contigs144
Total Length (bp)30,698867,349
GC (%)24.942.1
Depth Coverage (×)236.0150.0
Protein Coding Genes1938
tRNAs39
rRNAs33
Table 2. The general features of reported mitogenomes of monocotyledon.
Table 2. The general features of reported mitogenomes of monocotyledon.
Scientific NameAccesionsContigsTotal Length
(bp)
Protein
Coding
Genes
tRNAsrRNAs
Gastrodia pubilabiataOR004100–OR00414344867,3493893
Gastrodia elataMF070084~
MF070102
191,339,82538143
Allium cepaNC0301001316,3632453
Asparagus officinalisNC0536421492,06236143
Butomus umbellatusNC0213991450,82628124
Oryza sativaNC0110331490,52031153
Phoenix dactyliferaNC0167401715,00137163
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kim, Y.-K.; Jo, S.; Cheon, S.-H.; Hong, J.-R.; Kim, K.-J. Ancient Horizontal Gene Transfers from Plastome to Mitogenome of a Nonphotosynthetic Orchid, Gastrodia pubilabiata (Epidendroideae, Orchidaceae). Int. J. Mol. Sci. 2023, 24, 11448. https://doi.org/10.3390/ijms241411448

AMA Style

Kim Y-K, Jo S, Cheon S-H, Hong J-R, Kim K-J. Ancient Horizontal Gene Transfers from Plastome to Mitogenome of a Nonphotosynthetic Orchid, Gastrodia pubilabiata (Epidendroideae, Orchidaceae). International Journal of Molecular Sciences. 2023; 24(14):11448. https://doi.org/10.3390/ijms241411448

Chicago/Turabian Style

Kim, Young-Kee, Sangjin Jo, Se-Hwan Cheon, Ja-Ram Hong, and Ki-Joong Kim. 2023. "Ancient Horizontal Gene Transfers from Plastome to Mitogenome of a Nonphotosynthetic Orchid, Gastrodia pubilabiata (Epidendroideae, Orchidaceae)" International Journal of Molecular Sciences 24, no. 14: 11448. https://doi.org/10.3390/ijms241411448

APA Style

Kim, Y. -K., Jo, S., Cheon, S. -H., Hong, J. -R., & Kim, K. -J. (2023). Ancient Horizontal Gene Transfers from Plastome to Mitogenome of a Nonphotosynthetic Orchid, Gastrodia pubilabiata (Epidendroideae, Orchidaceae). International Journal of Molecular Sciences, 24(14), 11448. https://doi.org/10.3390/ijms241411448

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop