Next Article in Journal
Prognostic Role of Specific KRAS Mutations Detected in Aspiration and Liquid Biopsies from Patients with Pancreatic Cancer
Next Article in Special Issue
Deciphering Carotenoid and Flowering Pathway Gene Variations in Eastern and Western Carrots (Daucus carota L.)
Previous Article in Journal
Novel Variant in ANO5 Muscular Dystrophy: Identification by Whole Genome Sequencing and Quad Analysis
Previous Article in Special Issue
Screening Wheat Genotypes for Specific Genes Linked to Drought Tolerance
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Complete Chloroplast Genome of Meconopsis simplicifolia and Its Genetic Comparison to Other Meconopsis Species

1
Institute of Advanced Study, Chengdu University, Chengdu 610106, China
2
Engineering Research Center of Sichuan-Tibet Traditional Medicinal Plant, Chengdu 610000, China
3
School of Food and Biological Engineering, Chengdu University, Chengdu 610106, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2024, 15(10), 1301; https://doi.org/10.3390/genes15101301
Submission received: 12 September 2024 / Revised: 3 October 2024 / Accepted: 4 October 2024 / Published: 6 October 2024
(This article belongs to the Collection Feature Papers: 'Plant Genetics and Genomics' Section)

Abstract

:
Background: Chloroplasts, due to their high conservation and lack of recombination, serve as important genetic resources for the classification and evolutionary analysis of closely related species that are difficult to distinguish based on their morphological features. Meconopsis simplicifolia (M. simplicifolia), an endangered herb within the Meconopsis genus, has demonstrated therapeutic potential in treating various diseases. However, the highly polymorphic morphology of this species poses a challenge for accurate identification. Methods: In this study, the complete chloroplast genome of M. simplicifolia was sequenced and assembled using Illumina sequencing technology. Simple sequence repeats (SSRs) and repetitive sequences were characterized. In addition, a comparative analysis was conducted with the chloroplast genomes of six other Meconopsis species. Results: The chloroplast genome of M. simplicifolia has a quadripartite circular structure with a total length of 152,772 bp. It consists of a large single-copy region of 83,824 bp and a small single-copy region of 17,646 bp, separated by a pair of inverted repeat sequences (IRa and IRb, 25,651 bp). The genome contains 131 genes, 33 SSRs, and 27 long repetitive sequences. Comparative analysis with six other chloroplast genomes of Meconopsis revealed that M. simplicifolia is closely related to M. betonicifolia and that the rpl2 (ribosomal protein L2) gene in the IRb region has been deleted. This deletion is of significant importance for future taxonomic studies of M. simplicifolia. Conclusions: This study provides a valuable reference for the identification of M. simplicifolia and contributes to a deeper understanding of the phylogeny and evolution of the Meconopsis genus.

1. Introduction

Chloroplasts are unique organelles in plants that play a crucial role in photosynthesis, growth and development, signal transduction under various stresses, and the biosynthesis of important metabolites [1,2]. Decoding the chloroplast genome is fundamental to understanding chloroplast function and its biological processes, which are of great significance for exploring the mechanisms of plant growth and stress response. Previous studies have shown that chloroplast deoxyribonucleic acid (DNA) typically has a quadripartite circular structure, a linear structure, or a multi-branch linear structure [3,4,5,6]. However, due to differences in plant species, cell developmental stages, and tissue types, it remains unclear which form of chloroplast DNA is more prevalent [7,8]. The size of chloroplast genomes ranges from 15,553 bp (Asarum minus) to 521,168 bp (Floydiella terrestris), and are typically divided into two regions: a large single-copy (LSC) region and a small single-copy (SSC) region [9,10], separated by two inverted repeats (IRa and IRb). The chloroplast genome generally contains between 101 and 118 genes, including approximately 80 protein-coding genes, four ribosomal ribonucleic acids (RNAs), and 30 transfer RNAs [11,12,13]. With advances in sequencing technology, obtaining a complete chloroplast genome has become more accessible and cost-effective, providing a valuable opportunity to explore the role of chloroplasts in plant biology.
Chloroplasts are maternally inherited organelles, and their structure is relatively conserved [10]. Compared with other plastids, such as nuclear and mitochondrial genomes, chloroplast genomes are characterized by their small size, simple structure, and moderate evolutionary rate [14,15]. Thus, chloroplasts not only play an important role in plant biology but also serve as valuable resources in the study of plant systematics and genetic relationships [16,17,18]. For example, chloroplast genome sequencing was performed on Oreomecon nudicaulis, a species with an unclear classification (originally assigned to the Papaver genus, it is now classified under Oreomecon). The analysis revealed that it is closely related to Meconopsis within the Papaveraceae family but does not form a clade with the Papaver genus, which is consistent with the revised classification [19]. Similarly, phylogenetic analysis of the chloroplast genome of the medicinal plant Hypecoum erectum L. showed its association with H. zhukanum, both of which belong to the Hypecoideae subfamily, a monophyletic group [20]. Moreover, structural variations in the chloroplast genome, such as gene deletions, large inversions, and the contraction or elongation of inverted repeat (IR) regions, provide important genetic information for the identification of specific plants [11]. Ren et al. found that, compared with other plants in the Papaveroideae subfamily, five typical genes located in the SSC region in the chloroplast genomes of Corydalis saxicola and Corydalis tomentella migrated to the IR region, resulting in IR elongation and gene duplication [21]. Additionally, polymorphisms were observed in the gap regions of seven genes, and coding polymorphisms were detected in three genes in the chloroplast genomes of Papaveroideae plants, indicating their potential to serve as molecular markers for phylogenetic and species identification studies [22]. In summary, chloroplast genome information provides an important basis for inferring evolutionary relationships in species and specific taxa.
Meconopsis is a genus of herbaceous plants, belonging to the Papaveraceae subfamily [23]. Globally, 49 known species of Meconopsis have been identified, several of which have demonstrated therapeutic efficacy [24]. These plants are mainly distributed across the Qinghai-Tibet Plateau, Hengduan Mountains, and Himalayan region at altitudes ranging between 2000 and 5800 m [25]. Geographic isolation and natural selection have promoted speciation, resulting in diverse Meconopsis species, making it challenging to classify them solely based on their phenotypic characteristics [26]. The complete chloroplast genome provides a powerful tool for the accurate identification and classification of species [27]. In this study, the chloroplast genome of the endangered plant M. simplicifolia was sequenced using second-generation sequencing technology and a complete chloroplast genome information of 152,772 bp containing 131 genes was obtained. A comparative analysis was conducted to determine the phylogenetic relationship between M. simplicifolia and other Meconopsis species; it was confirmed that M. simplicifolia and M. betonicifolia have a closer phylogenetic relationship. The regions LSC and SSC are highly variable in the chloroplast genomes of Meconopsis species. This study is of profound significance for the identification of M. simplicifolia, the exploration of its chloroplast function, and improvement in the phylogenetic understanding within the Meconopsis genus.

2. Materials and Methods

2.1. Plant Material

Considering the distribution of M. simplicifolia, one individual was collected from the wild at Lingzhi, Tibet, China (38.9784° N, 105.9035° E) in July 2022. A voucher specimen (voucher number: NMU00912) was deposited at the Herbarium of North Minzu University (Figure 1). The collection of plant material adhered to the relevant institutional, national, and international guidelines and legislation, and we obtained permission to collect the M. simplicifolia.

2.2. Sequencing, Assembly, and Annotation of the M. simplicifolia cp Genome

Genomic DNA was extracted from the fresh leaves using the DNeasy Plant Mini Kit (Qiagen, Redwood City, CA, USA), following the manufacturer’s protocol. A paired-end library with an insertion size of 350 bp was constructed according to the manufacturer’s instructions and sequenced using the Illumina HiSeq 2500 platform. Approximately 5.57 Gb of raw reads were generated and assembled into non-redundant contigs using NOVOPlasty [28], a de novo sequence assembly software package, with k = 39 and a genome range of 120,000–200,000. Initial gene annotation was performed using Plann [29] with the chloroplast genome of M. racemosa (GenBank accession number: MK533649) as the reference genome [30], and the annotation was refined using Geneious software (https://www.geneious.com/, accessed on 1 January 2024) [31].

2.3. SSRs and Repeated Sequences

The Perl script MISA v2.1 (http://pgrc.ipk-gatersleben.de/misa/misa.html, accessed on 3 January 2024) was applied to detect simple sequence repeats (SSRs) in the cp genome with the settings as follows: 10 for mono-, 5 for di-, 4 for tri-, and 3 for tetra-, penta- and hexanucleotide. Repeats (forward, palindrome, complement, and reverse sequences) were identified using online REPuter software (https://bibiserv.cebitec.uni-bielefeld.de/reputer?id=reputer_view_submission, accessed on 3 January 2024), with the default settings.

2.4. Comparative Analysis of cp Genome Structure

Whole-genome comparative analysis was conducted on the cp genomes of the following seven Meconopsis species: M. simplicifolia (this study, NC_070211), M. horridula (MK533646) [30], M. integrifolia (MK533647) [30], M. punicea (MK533648) [30], M. racemosa (MK533649) [30], M. henrici (MN488591) [32], and M. quintuplinervia (MK801686) [33]. Comparative genomic analysis of M. simplicifolia and six other Meconopsis species was performed using BLAST Ring Image Generator (BRIG) software [34]. Meanwhile, the comparison and variation in the cp genomes’s architecture were visualized using the Shuffle-LAGAN mode of the mVISTA software (https://genome.lbl.gov/vista/index.shtml, accessed on 3 January 2024) [35]. The IRscope tool (https://irscope.shinyapps.io/irapp/, accessed on 3 January 2024) was used to compare and illustrate the IR border regions in the seven Meconopsis species [36].

2.5. Phylogenetic Analysis

Phylogenetic relationships were reconstructed based on the 10 cp genomes, using Papaver orientale (NC_037832) [37] from the NCBI database as the ‘outgroup’. The coding sequences of the protein-coding genes present in all the cp genomes of the Papaveraceae species were aligned using MAFFT-LINSI v7.313 [38]. The optimal trees were inferred by maximum likelihood phylogenetic analysis using RAxML v8.2.11 [39] with the GTRGAMMA model and 500 bootstrap replicates.

3. Results

3.1. Features of the cp Genome of Meconopsis simplicifolia

The assembled cp genome of M. simplicifolia has been deposited in GenBank with the accession number NC_070211. The cp DNA of M. simplicifolia measured 152,772 bp in length and exhibited the typical quadripartite structure (Table 1 and Figure 2). It consisted of a pair of inverted repeat regions (IRa and IRb) spanning 25,651 bp each, along with separate single-copy regions, including the SSC of 17,646 bp and the LSC of 83,824 bp. The M. simplicifolia cp genome contained 131 predicted functional genes, comprising 84 protein-coding genes, 37 tRNA genes, 8 rRNA genes, and 2 pseudogenes (rps15 and rps19) (Table 2). There were 21 intron-containing genes, of which 13 were protein-coding genes and 8 were tRNA genes. Out of the intron-containing genes, 18 contained a single intron, while 3 (clpP, rps12, and ycf3) contained two introns.
The overall GC content of the M. simplicifolia cp genome was 38.74%. Regarding the protein-coding regions, the GC content of the first, second, and third codons was 45.91%, 38.17%, and 31.15%, respectively. The 84 protein-coding genes encoded 25,454 codons, with leucine (L) the most frequently used amino acid (10.36%), and cysteine (C) the least frequently used (1.19%).

3.2. SSRs and Long-Repeat Analysis

SSRs, also known as microsatellites, are tandem repeat sequences consisting of 1–6 nucleotide repeat units. They are widely distributed in cp genomes and often used as genetic markers in population genetics and evolutionary studies due to their high intraspecific variability [40]. In this study, we analyzed SSRs in the cp genomes of seven Meconopsis species. A total of 33 SSRs were identified in the M. simplicifolia cp genome. Similarly, M. horridula, M. integrifolia, M. punicea, M. racemose, M. henrici, and M. quintuplinervia contained 38, 33, 34, 40, 23, and 35 SSRs, respectively (Figure 3A, Table S1). Among all chloroplast genomes, mononucleotide repeats were the most frequent, ranging between 8 and 24, accounting for 34.78% (8/23) to 66.67% (24/33) of all SSRs, followed by dinucleotide, ranging between 4 (12.12%, 4/33) and 8 (23.35%, 8/34), tetranucleotide, ranging between 3 (9.09%, 3/33) and 5 (14.29%, 5/35), trinucleotide, ranging between 1 (2.86%, 1/35) and 3 (9.09%, 3/33), pentanucleotide, ranging between 0 and 2 (8.70%, 2/23), and pentanucleotide, ranging between 0 and 1 (4.35%, 1/23). In M. simplicifolia, all mononucleotide repeats (100%) and the majority of dinucleotide repeats (75%) consisted of A/T nucleotides (Figure 3B).
Repeat sequences play a crucial role in phylogenetic research and genome reorganization. In the cp genome of M. simplicifolia, 27 dispersed repeats were identified, including 10 forward repeats and 17 palindromic repeats (Figure 4A). This pattern is consistent with the other six Meconopsis cp genomes, with the number of repeats ranging between 29 in M. punicea and 50 in M. integrifolia. Palindromic repeats were the most prevalent repeat type among the seven Meconopsis species (Figure 4B,C). Most of these repeats ranged between 30 bp and 44 bp in length.

3.3. Comparative Analysis of cp Genomes of Meconopsis Species

A comparative analysis of cp genomes provided valuable insights into intricate evolutionary relationships. In this study, we compared the cp genomes of M. simplicifolia and other six Meconopsis species. The size of the seven Meconopsis cp genomes ranged between 151,864 (M. integrifolia) and 154,997 bp (M. quintuplinervia). Notably, within the genus Meconopsis, the genome of M. simplicifolia exhibited a higher degree of conservation and could be accurately mapped (Figure 5). The sequence consistency of Meconopsis cp genomes was further evaluated using mVISTA software. The results revealed that the IR regions exhibited fewer differences compared to the LSC and SSC regions (Figure 6). Non-coding regions displayed more variability than coding regions, with significant changes observed in the intergenic spacers among the seven cp genomes. These highly divergent regions included trnH-psbA, matK, rps16-psbK, atpH-atpI, rpoC2, psbM-petN, trnE-trnT, trnT-psbD, psaA-ycf3, trnF-ndhJ, ndhK, ndhC-trnV, atpB-rbcL, accD, ycf4-cemA, petA-psbL, psbE-petL, clpP-psbB, rpl16, ndhF-rpl32-ccsA, and ycf1, among others.
A detailed comparison of the binding regions between the inverted repeats (IR/LSC and IR/SSC) was performed among the seven Meconopsis species (Figure 7). In all species, the rpl22 gene was located within the LSC region. Variations in gene content and order were observed, such as the presence of the ycf1 gene at the SSC/IRa junction in M. simplicifolia, M. horridula, M. punicea, M. racemose, M. henrici, and M. quintuplinervia, while M. integrifolia had a missing ycf1 gene in the SSC/IRa junction. Expansion and contraction of the inverted repeat region were observed. For example, the rps19 gene was found within the LSC region in M. racemose, while in the other six Meconopsis species, it was located 67–158 bp away, spanning the LSC and IRb binding regions. Except for M. simplicifolia, the rpl2 gene did not extend into the LSC region in the other species. Overall, there were only minor variations in the IR boundary regions among the cp genomes of these seven Meconopsis species.

3.4. Phylogenetic Analysis of M. simplicifolia and Related Meconopsis species cp Genomes

To determine the phylogenetic position of M. simplicifolia within the Papaveraceae family, we utilized the cp genomes of ten Papaveraceae members, including M. simplicifolia. A species tree was constructed based on the alignment of 75 shared protein-coding genes from the cp genomes. The analysis revealed that M. simplicifolia clustered together with M. betonicifolia (Figure 8). All Meconopsis plants formed a well-supported branch, indicating the high potential of cp genomes for species differentiation within the order Papaveraceae.

4. Discussion

The cp genome serves as a valuable resource for interspecific differentiation and various biotechnological applications [41]. In this study, we sequenced and assembled the complete cp genome of M. simplicifolia. The cp genome of M. simplicifolia has a typical quadripartite circular structure with a GC content of 38.74%, similar to the chloroplast genome characteristics of most Meconopsis species [1]. The size of the M. simplicifolia cp genome was 152,772 bp, consistent with other Meconopsis species ranging between 151 and 154 kb [30,32,33]. All these complete cp genomes displayed a GC content of 38%, which is in line with the low GC content observed in the cp genomes of other angiosperms [13].
SSRs have been extensively used in phylogenetic relationships, genetic diversity studies, and species identification. The SSRs identified in the cp genomes of seven Meconopsis were mostly composed of mononucleotide repeats consisting mainly of T, which is consistent with previous studies on Meconopsis species [30]. The availability of genomic resources in Meconopsis plants can enhance the understanding of population patterns and the identification of gene regions associated with important medicinal and environmental adaptive traits [30]. SSRs provide effective marker resources for species identification and genetic diversity studies of Meconopsis and related species. Repeat sequences have the potential to promote chloroplast genome rearrangement and increase population genetic diversity, and are widely used to identify mutation hotspots and establish phylogenetic relationships [42]. In this study, we identified 27 repeat sequences in M. simplicifolia, which is fewer compared to other Meconopsis species. Most of these repeats were located in genes, indicating that the cp genome of M. simplicifolia retains a significant amount of genetic material.
The analysis of protein-coding genes revealed that M. simplicifolia shares 75 genes with the other six Meconopsis species. Similar to other Meconopsis species, M. simplicifolia lacks the rpl2 gene, which plays a crucial role in chloroplast development during early leaf development [43]. It is hypothesized that the Msrpl2 gene has either been functionally transferred to the nucleus or replaced by a eukaryotic gene. Comparative analysis of cp genomes using BRIG and mVISTA showed a high sequence identity among all Meconopsis species. However, Meconopsis cp genomes have also undergone gene duplication, gene/intron loss, insertion/deletion, pseudogenization, and varying expansion/contraction of the inverted repeat region. These genomic events are consistent with observations in other angiosperms [12,13]. In M. simplicifolia, the pseudogenization of rps19 in the IR regions is consistent with a previous study [30]. The sequence and content of the SC regions show less similarity compared to the IR regions in Meconopsis species. The most highly divergent non-coding regions were identified in the intergenic regions of trnT-psbD, ndhC-trnV, and ndhF-rpl32-ccsA, which have also been recognized as molecular markers in many land plants [44,45].
Taxonomy and phylogeny of Meconopsis have been extensively studied at the genus level [46,47]. Previous studies on the evolutionary relationships among different Meconopsis species utilized internal transcribed spacer and cpDNA sequences, as well as amplified fragment length polymorphisms (AFLPs) [46,47,48]. However, complete genome sequencing offers a more comprehensive perspective [49]. In the case of M. simplicifolia, limited information was available. The phylogenetic position of M. simplicifolia within Meconopsis was determined using cp genomes and 75 protein-coding genes among nine Meconopsis species. Phylogenetic analysis revealed that all Meconopsis species formed a monophyletic clade with 100% bootstrap support. M. simplicifolia was found to be closely related to M. betonicifolia, supporting previous morphological and molecular data [46].

5. Conclusions

This study characterized the complete cp genome of M. simplicifolia and conducted a comparative analysis of other six Meconopsis cp genomes, revealing the conserved genome structure and organization across seven Meconopsis species. The most divergent regions among these Meconopsis cp genomes were identified in three non-coding intergenic spacer (IGS) regions (trnT-psbD, ndhC-trnV, and ndhF-rpl32-ccsA) and three genic regions (matK, rpoC2, and ycf1). Moreover, the genetic resources such as SSRs and repetitive sequences discovered in the cp genomes can serve as valuable molecular markers for the identification of Meconopsis species. Phylogenetic analysis demonstrated that M. simplicifolia is closely related to M. betonicifolia. The comprehensive cp genome provides essential resources for genetic and biological studies of Meconopsis and other species within the Papaveraceae family.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15101301/s1, Table S1. Types and numbers of SSRs in the chloroplast genomes of M. simplicifolia, M. horridula, M. integrifolia, M. punicea, M. racemose, M. henrici, and M. quintuplinervia.

Author Contributions

All authors contributed to the study’s conception and design. R.L. conceived and supervised the project. M.S. and Z.Z. analyzed the data and wrote the manuscript. All of the authors have read and approved the final manuscript and have agreed to be accountable for all aspects of the work. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Tibet Autonomous Region Science and Technology Plan Project (Key Research and Development Program) (No. XZ202201ZY0031G).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

The authors mentioned in the manuscript have agreed with its authorship and have read and approved the manuscript. Authors give consent for the publication of identifiable details, which can include photograph(s)/tables and/or details within the text to be published in the “Genes” Journal.

Data Availability Statement

The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov/ (accessed on 15 February 2023), under the accession no. NC_070211. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA935484, SRR23491443, and SAMN33317127, respectively.

Acknowledgments

The authors thank Lei Zhang from North Minzu University (contact person: Lei Zhang, [email protected]) for helping with plant material collection, and thank Mingcheng Wang from Chengdu University for his useful suggestions in data analysis.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Daniell, H.; Lin, C.S.; Yu, M.; Chang, W.J. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biol. 2016, 17, 134. [Google Scholar] [CrossRef] [PubMed]
  2. Zhang, Y.; Zhang, A.; Li, X.; Lu, C. The Role of Chloroplast Gene Expression in Plant Responses to Environmental Stress. Int. J. Mol. Sci. 2020, 21, 6082. [Google Scholar] [CrossRef] [PubMed]
  3. Lilly, J.W.; Havey, M.J.; Jackson, S.A.; Jiang, J. Cytogenomic analyses reveal the structural plasticity of the chloroplast genome in higher plants. Plant Cell 2001, 13, 245–254. [Google Scholar] [CrossRef] [PubMed]
  4. Bendich, A.J.; Smith, S.B. Moving pictures and pulsed-field gel electrophoresis show linear DNA molecules from chloroplasts and mitochondria. Curr. Genet. 1990, 17, 421–425. [Google Scholar] [CrossRef]
  5. Shaver, J.M.; Oldenburg, D.J.; Bendich, A.J. The structure of chloroplast DNA molecules and the effects of light on the amount of chloroplast DNA during development in Medicago truncatula. Plant Physiol. 2008, 146, 1064–1074. [Google Scholar] [CrossRef]
  6. Scharff, L.B.; Koop, H.-U. Linear molecules of tobacco ptDNA end at known replication origins and additional loci. Plant Mol. Biol. 2006, 62, 611–621. [Google Scholar] [CrossRef] [PubMed]
  7. Mower, J.P.; Vickrey, T.L. Structural Diversity Among Plastid Genomes of Land Plants. Plastid Genome Evolution; Elsevier: Amsterdam, The Netherlands, 2018; pp. 263–292. ISBN 9780128134573. [Google Scholar]
  8. Dobrogojski, J.; Adamiec, M.; Luciński, R. The chloroplast genome: A review. Acta Physiol. Plant 2020, 42, 155. [Google Scholar] [CrossRef]
  9. Qu, X.-J.; Zou, D.; Zhang, R.-Y.; Stull, G.W.; Yi, T.-S. Progress, challenge and prospect of plant plastome annotation. Front. Plant Sci. 2023, 14, 1166140. [Google Scholar] [CrossRef] [PubMed]
  10. Zhang, T.; Li, M.; Zhu, X.; Li, S.; Guo, M.; Guo, C.; Shu, Y. Comparative Chloroplast Genomes Analysis Provided Adaptive Evolution Insights in Medicago ruthenica. Int. J. Mol. Sci. 2024, 25, 8689. [Google Scholar] [CrossRef] [PubMed]
  11. Kim, S.-C.; Ha, Y.-H.; Park, B.K.; Jang, J.E.; Kang, E.S.; Kim, Y.-S.; Kimspe, T.-H.; Kim, H.-J. Comparative analysis of the complete chloroplast genome of Papaveraceae to identify rearrangements within the Corydalis chloroplast genome. PLoS ONE 2023, 18, e0289625. [Google Scholar] [CrossRef] [PubMed]
  12. Jansen, R.K.; Ruhlman, T.A. Plastid Genomes of Seed Plants. In Genomics of Chloroplasts and Mitochondria; Bock, R., Knoop, V., Eds.; Springer: Dordrecht, The Netherlands, 2012; pp. 103–126. ISBN 978-94-007-2919-3. [Google Scholar]
  13. Wicke, S.; Schneeweiss, G.M.; Depamphilis, C.W.; Müller, K.F.; Quandt, D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol. Biol. 2011, 76, 273–297. [Google Scholar] [CrossRef]
  14. Cao, Z.; Zhao, W.; Xin, Y.; Shen, W.; Wang, F.; Li, Q.; Tu, Y.; Zhang, H.; Dong, Z.; Xin, P. Characteristics of the Complete Chloroplast Genome of Pourthiaea (Rosaceae) and Its Comparative Analysis. Horticulturae 2022, 8, 1144. [Google Scholar] [CrossRef]
  15. Lan, Z.; Shi, Y.; Yin, Q.; Gao, R.; Liu, C.; Wang, W.; Tian, X.; Liu, J.; Nong, Y.; Xiang, L.; et al. Comparative and phylogenetic analysis of complete chloroplast genomes from five Artemisia species. Front. Plant Sci. 2022, 13, 1049209. [Google Scholar] [CrossRef] [PubMed]
  16. Xiao, S.; Xu, P.; Deng, Y.; Dai, X.; Zhao, L.; Heider, B.; Zhang, A.; Zhou, Z.; Cao, Q. Comparative analysis of chloroplast genomes of cultivars and wild species of sweetpotato (Ipomoea batatas L. Lam). BMC Genom. 2021, 22, 262. [Google Scholar] [CrossRef]
  17. Cao, Z.; Yang, L.; Xin, Y.; Xu, W.; Li, Q.; Zhang, H.; Tu, Y.; Song, Y.; Xin, P. Comparative and phylogenetic analysis of complete chloroplast genomes from seven Neocinnamomum taxa (Lauraceae). Front. Plant Sci. 2023, 14, 1205051. [Google Scholar] [CrossRef]
  18. Wang, J.; Liao, X.; Gu, C.; Xiang, K.; Li, S.; Tembrock, L.R.; Wu, Z.; He, W. The Asian lotus (Nelumbo nucifera) pan-plastome: Diversity and divergence in a living fossil grown for seed, rhizome, and aesthetics. Ornam. Plant Res. 2022, 2, 1–10. [Google Scholar] [CrossRef]
  19. Zhan, Q.; Huang, Y.; Xue, X.; Chen, Y. Comparative chloroplast genomics and phylogenetic analysis of Oreomecon nudicaulis (Papaveraceae). BMC Genom. Data 2024, 25, 49. [Google Scholar] [CrossRef]
  20. Mano, H.; Boltenkov, E.V.; Marchuk, E.A.; Nakamura, K.; Yoichi, W. The complete chloroplast genome sequence of Hypecoum erectum L. (Papaveraceae). Mitochondrial DNA B Resour. 2024, 9, 1010–1014. [Google Scholar] [CrossRef]
  21. Ren, F.; Wang, L.; Li, Y.; Zhuo, W.; Xu, Z.; Guo, H.; Liu, Y.; Gao, R.; Song, J. Highly variable chloroplast genome from two endangered Papaveraceae lithophytes Corydalis tomentella and Corydalis saxicola. Ecol. Evol. 2021, 11, 4158–4171. [Google Scholar] [CrossRef] [PubMed]
  22. Wang, L.; Li, F.; Wang, N.; Gao, Y.; Liu, K.; Zhang, G.; Sun, J. Characterization of the Dicranostigma leptopodum chloroplast genome and comparative analysis within subfamily Papaveroideae. BMC Genom. 2022, 23, 794. [Google Scholar] [CrossRef] [PubMed]
  23. Grierson, A.J.C.; Long, D.G. Flora of Bhutan, Including a Record of Plants from Sikkim. Royal Botanic Gardens Edinburgh: Edinburgh, UK; Volume 1, Parts 1 and 2; 1983. [Google Scholar]
  24. Shi, N.; Naudiyal, N.; Wang, J.; Gaire, N.P.; Wu, Y.; Wei, Y.; He, J.; Wang, C. Assessing the Impact of Climate Change on Potential Distribution of Meconopsis punicea and Its Influence on Ecosystem Services Supply in the Southeastern Margin of Qinghai-Tibet Plateau. Front. Plant Sci. 2022, 12, 830119–830135. [Google Scholar] [CrossRef] [PubMed]
  25. Zhao, S.; Gao, X.; Yu, X.; Yuan, T.; Zhang, G.; Liu, C.; Li, X.; Wei, P.; Li, X.; Liu, X. Comparative Analysis of Chloroplast Genome of Meconopsis (Papaveraceae) Provides Insights into Their Genomic Evolution and Adaptation to High Elevation. Int. J. Mol. Sci. 2024, 25, 2193. [Google Scholar] [CrossRef] [PubMed]
  26. Wu, S.; Wang, Y.; Wang, Z.; Shrestha, N.; Liu, J. Species divergence with gene flow and hybrid speciation on the Qinghai-Tibet Plateau. New Phytol. 2022, 234, 392–404. [Google Scholar] [CrossRef]
  27. Shi, W.; Song, W.; Chen, Z.; Cai, H.; Gong, Q.; Liu, J.; Shi, C.; Wang, S. Comparative chloroplast genome analyses of diverse Phoebe (Lauraceae) species endemic to China provide insight into their phylogeographical origin. PeerJ 2023, 11, e14573. [Google Scholar] [CrossRef]
  28. Nicolas, D.; Patrick, M.; Guillaume, S. NOVOPlasty: De novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 2017, 45, e18–e26. [Google Scholar] [CrossRef]
  29. Huang, D.L.; Cronk, Q.C. Plann: A command-line application for annotating plastome sequences. Appl. Plant Sci. 2015, 3, 1500026–1500028. [Google Scholar] [CrossRef]
  30. Li, X.; Tan, W.; Sun, J.; Du, J.; Zheng, C.; Tian, X.; Zheng, M.; Xiang, B.; Wang, Y. Comparison of four complete chloroplast genomes of medicinal and ornamental Meconopsis species: Genome organization and species discrimination. Sci. Rep. 2019, 9, 10567–10578. [Google Scholar] [CrossRef] [PubMed]
  31. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef] [PubMed]
  32. Zhu, Y.; Zhang, D. Complete chloroplast genome sequences of two species used for Tibetan medicines, Meconopsis punicea vig. and M. henrici vig. (Papaveraceae). Mitochondrial DNA Part B 2020, 5, 48–50. [Google Scholar] [CrossRef]
  33. Xu, B.; Kang, H.; Luo, S.; Yang, Q.; Liu, Z.L. Complete chloroplast genome of the Meconopsis quintuplinervia (Papaveraceae), a traditional medicine of Tibetan. Mitochondrial DNA Part B 2019, 4, 2335–2336. [Google Scholar] [CrossRef] [PubMed]
  34. Alikhan, N.-F.; Petty, N.K.; Ben Zakour, N.L.; Beatson, S.A. BLAST Ring Image Generator (BRIG): Simple prokaryote genome comparisons. BMC Genom. 2011, 12, 402–411. [Google Scholar] [CrossRef] [PubMed]
  35. Frazer, K.A.; Lior, P.; Alexander, P.; Rubin, E.M.; Inna, D. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004, 32, 273–279. [Google Scholar] [CrossRef]
  36. Amiryousefi, A.; Hyvönen, J.; Poczai, P. IRscope: An online program to visualize the junction sites of chloroplast genomes. Bioinformatics 2018, 34, 3030–3031. [Google Scholar] [CrossRef] [PubMed]
  37. Zhou, J.; Cui, Y.; Chen, X.; Li, Y.; Xu, Z.; Duan, B.; Li, Y.; Song, J.; Yao, H. Complete Chloroplast Genomes of Papaver rhoeas and Papaver orientale: Molecular Structures, Comparative Analysis, and Phylogenetic Analysis. Molecules 2018, 23, 437. [Google Scholar] [CrossRef]
  38. Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef]
  39. Stamatakis, A. RAxML-VI-HPC: Maximum Likelihood-Based Phylogenetic Analyses with Thousands of Taxa and Mixed Models. Bioinformatics 2006, 22, 2688–2690. [Google Scholar] [CrossRef]
  40. Ellis, J.R.; Burke, J.M. EST-SSRs as a resource for population genetic analyses. Heredity 2007, 99, 125–132. [Google Scholar] [CrossRef]
  41. Bock, R. Engineering plastid genomes: Methods, tools, and applications in basic research and biotechnology. Annu. Rev. Plant Biol. 2015, 66, 211–241. [Google Scholar] [CrossRef] [PubMed]
  42. Jiang, D.; Cai, X.; Gong, M.; Xia, M.; Xing, H.; Dong, S.; Tian, S.; Li, J.; Lin, J.; Liu, Y.; et al. Complete chloroplast genomes provide insights into evolution and phylogeny of Zingiber (Zingiberaceae). BMC Genom. 2023, 24, 30. [Google Scholar] [CrossRef]
  43. Lan, J.; Lin, Q.; Zhou, C.; Liu, X.; Miao, R.; Ma, T.; Chen, Y.; Mou, C.; Jing, R.; Feng, M. Young Leaf White Stripe encodes a P-type PPR protein required for chloroplast development. J. Integr. Plant Biol. 2023, 67, 1687–1720. [Google Scholar] [CrossRef] [PubMed]
  44. Chiapella, J.O.; Barfuss, M.H.J.; Xue, Z.-Q.; Greimler, J. The plastid genome of Deschampsia cespitosa (Poaceae). Molecules 2019, 24, 216. [Google Scholar] [CrossRef] [PubMed]
  45. Zhang, Z.; An, M.; Miao, J.; Gu, Z.; Liu, C.; Zhong, B. The Antarctic sea ice alga Chlamydomonas sp. ICE-L provides insights into adaptive patterns of chloroplast evolution. Bmc Plant Biol. 2018, 18, 53–64. [Google Scholar] [CrossRef] [PubMed]
  46. Wei, X. Active. Molecular Systematics of Meconopsis Vig. (Papaveraceae): Taxonomy, Polyploidy Evolution, and Historical Biogeography from a Phylogenetic Insight. Ph.D. Thesis, University of Texas at Austin, Austin, TX, USA, December 2013. [Google Scholar]
  47. Xiao, W.; Simpson, B.B. A New Infrageneric Classification of Meconopsis (Papaveraceae) Based on a Well-Supported Molecular Phylogeny. Syst. Bot. 2017, 42, 226–233. [Google Scholar] [CrossRef]
  48. Valtuea, F.J.; Preston, C.D.; Kadereit, J.W. Evolutionary significance of the invasion of introduced populations into the native range of Meconopsis cambrica. Mol. Ecol. 2011, 20, 4318–4331. [Google Scholar] [CrossRef] [PubMed]
  49. Li, D.-M.; Zhao, C.-Y.; Liu, X.-F. Complete chloroplast genome sequences of Kaempferia galanga and Kaempferia elegans: Molecular structures and comparative analysis. Molecules 2019, 24, 474. [Google Scholar] [CrossRef]
Figure 1. A photograph of M. simplicifolia in the flowering stage.
Figure 1. A photograph of M. simplicifolia in the flowering stage.
Genes 15 01301 g001
Figure 2. A map of the chloroplast genome of M. simplicifolia. The genes inside and outside the circle are transcribed in a clockwise and counterclockwise direction, respectively. Genes belonging to different functional groups are shown in different colors. The darker and lighter grey in the inner circle each represent GC and AT content.
Figure 2. A map of the chloroplast genome of M. simplicifolia. The genes inside and outside the circle are transcribed in a clockwise and counterclockwise direction, respectively. Genes belonging to different functional groups are shown in different colors. The darker and lighter grey in the inner circle each represent GC and AT content.
Genes 15 01301 g002
Figure 3. An analysis of simple sequence repeats (SSRs) in the cp genomes of the seven Meconopsis species. (A) Number of different types of SSRs; (B) frequency of identified SSR motifs in different repeat classes.
Figure 3. An analysis of simple sequence repeats (SSRs) in the cp genomes of the seven Meconopsis species. (A) Number of different types of SSRs; (B) frequency of identified SSR motifs in different repeat classes.
Genes 15 01301 g003
Figure 4. An analysis of the repeated sequences in the cp genomes of the seven Meconopsis species. (A) Frequency of palindromic and forward repeats; (B) frequency of forward repeats by length; (C) frequency of palindromic repeats by length.
Figure 4. An analysis of the repeated sequences in the cp genomes of the seven Meconopsis species. (A) Frequency of palindromic and forward repeats; (B) frequency of forward repeats by length; (C) frequency of palindromic repeats by length.
Genes 15 01301 g004
Figure 5. BRIG ring comparison of the cp genomes of the seven Meconopsis species. The innermost rings represent the GC content (black) and GC skew (purple/green). The seven outer circles depict the similarity results compared to the reference genome (M. simplicifolia. NC_070211).
Figure 5. BRIG ring comparison of the cp genomes of the seven Meconopsis species. The innermost rings represent the GC content (black) and GC skew (purple/green). The seven outer circles depict the similarity results compared to the reference genome (M. simplicifolia. NC_070211).
Genes 15 01301 g005
Figure 6. Alignment of the cp genomes of the seven Meconopsis species. The white regions show the sequence differences among all analyzed chloroplast genomes. The horizontal axis shows the positions within the chloroplast genome, and the vertical scale indicates the identity percentage, ranging between 50% and 100%.
Figure 6. Alignment of the cp genomes of the seven Meconopsis species. The white regions show the sequence differences among all analyzed chloroplast genomes. The horizontal axis shows the positions within the chloroplast genome, and the vertical scale indicates the identity percentage, ranging between 50% and 100%.
Genes 15 01301 g006
Figure 7. A comparison of the border regions of LSC, SSC, and IR regions among the cp genomes of the seven Meconopsis species.
Figure 7. A comparison of the border regions of LSC, SSC, and IR regions among the cp genomes of the seven Meconopsis species.
Genes 15 01301 g007
Figure 8. A species tree based on the coding sequences of 75 protein-coding genes shared by the cp genomes of 10 Papaveraceae members including M. simplicifolia (NC_070211; this study), M. horridula (MK533646) [30], M. integrifolia (MK533647) [30], M. punicea (MK533648) [30], M. racemosa (MK533649) [30], M. henrici (MN488591) [32], M. quintuplinervia (MK801686) [33], M. pseudohorridula (NC_061608; unpublished), M. betonicifolia (OK349678; unpublished), and the outgroup species Papaver orientale (NC_037832) [37]. All nodes received 100% bootstrap support. Orange represents M Simplicifolia chloroplast genome.
Figure 8. A species tree based on the coding sequences of 75 protein-coding genes shared by the cp genomes of 10 Papaveraceae members including M. simplicifolia (NC_070211; this study), M. horridula (MK533646) [30], M. integrifolia (MK533647) [30], M. punicea (MK533648) [30], M. racemosa (MK533649) [30], M. henrici (MN488591) [32], M. quintuplinervia (MK801686) [33], M. pseudohorridula (NC_061608; unpublished), M. betonicifolia (OK349678; unpublished), and the outgroup species Papaver orientale (NC_037832) [37]. All nodes received 100% bootstrap support. Orange represents M Simplicifolia chloroplast genome.
Genes 15 01301 g008
Table 1. A summary of the complete chloroplast genomes of seven Meconopsis species.
Table 1. A summary of the complete chloroplast genomes of seven Meconopsis species.
FeatureM. simplicifoliaM. horridulaM. integrifoliaM. puniceaM. racemosaM. henriciM. quintuplinervia
Accession numberNC_070211MK533646MK533647MK533648MK533649MN488591MK801686
Genome size (bp)152,772153,785151,864153,259153,816153,388154,997
IR length (bp)25,65151,98851,30651,54851,98826,10725,984
SSC length (bp)17,64617,89817,74917,72917,89817,82217,876
LSC length (bp)83,82483,89982,80983,98283,93083,69885,153
No. of total genes131127127127127112129
No. of protein-coding genes84909090907884
No. of tRNA genes39373737373037
No. of rRNA genes8888838
Overall GC content (%)38.738.838.838.538.738.538.5
Table 2. A list of the genes in the M. simplicifolia chloroplast genome.
Table 2. A list of the genes in the M. simplicifolia chloroplast genome.
CategoryGene GroupGene Name
Self-replicationRibosomal protein (large subunit) (9)rpl14, rpl16 a, rpl20, rpl22, rpl23 b, rpl32, rpl33, rpl36
Ribosomal protein (small subunit) (16)rps2, rps3, rps4, rps7 b, rps8, rps11, rps12 a,b, rps14, rps15, rps16 a, rps18, rps19
DNA-dependent RNA polymerase (4)rpoA, rpoB, rpoC1 a, rpoC2
rRNA genes (8)rrn16 b, rrn23 b, rrn4.5 b, rrn5 b,
tRNA genes (37)trnH-GUG, trnK-UUU a, trnQ-UUG, trnS-GCU, trnG-UCC a, trnR-UCU, trnC-GCA, trnD-GUC, trnY-GUA, trnE-UUC, trnT-GGU, trnS-UGA, trnG-GCC, trnS-GGA, trnT-UGU, trnL-UAA a, trnF-GAA, trnV-UAC a, trnfM-CAU b, trnW-CCA, trnP-UGG, trnI-CAU b, trnL-CAA b, trnV-GAC b, trnI-GAU a,b, trnA-UGC a,b, trnR-ACG b, trnN-GUU b, trnL-UAG
PhotosynthesisPhotosystem I (5)psaA, psaB, psaC, psaI, psaJ
Photosystem II (15)psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
NADH dehydrogenase (11)ndhA a, ndhB a,b, ndhC, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Cytochrome b/f complex (6)petA, petB a, petD a, petG, petL, petN
ATP synthase (6)atpA, atpB, atpE, atpF a, atpH, atpI
Large subunit of rubisco (1)rbcL
Other genesTranslational initiation factor (1)infA
ATP-dependent protease subunit p gene (1)clpP a
Maturase (1)matK
Envelope membrane protein (1)cemA
Unknown Subunit of acetyl-CoA-carboxylase (1)accD
C-type cytochrome synthesis gene (1)ccsA
Conserved hypothetical chloroplast ORF (7)ycf1 b, ycf2 b, ycf3 a, ycf4, ycf15,
Pseudogene (2)rps15, rps19
Notes: a Genes containing introns; b two gene copies in IR.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sun, M.; Zhu, Z.; Li, R. The Complete Chloroplast Genome of Meconopsis simplicifolia and Its Genetic Comparison to Other Meconopsis Species. Genes 2024, 15, 1301. https://doi.org/10.3390/genes15101301

AMA Style

Sun M, Zhu Z, Li R. The Complete Chloroplast Genome of Meconopsis simplicifolia and Its Genetic Comparison to Other Meconopsis Species. Genes. 2024; 15(10):1301. https://doi.org/10.3390/genes15101301

Chicago/Turabian Style

Sun, Min, Zhidan Zhu, and Rui Li. 2024. "The Complete Chloroplast Genome of Meconopsis simplicifolia and Its Genetic Comparison to Other Meconopsis Species" Genes 15, no. 10: 1301. https://doi.org/10.3390/genes15101301

APA Style

Sun, M., Zhu, Z., & Li, R. (2024). The Complete Chloroplast Genome of Meconopsis simplicifolia and Its Genetic Comparison to Other Meconopsis Species. Genes, 15(10), 1301. https://doi.org/10.3390/genes15101301

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop