Next Article in Journal
A Revised Model of Anatomically Modern Human Expansions Out of Africa through a Machine Learning Approximate Bayesian Computation Approach
Next Article in Special Issue
Genetic Structure and Core Collection of Olive Germplasm from Albania Revealed by Microsatellite Markers
Previous Article in Journal
Distinct Effects of Inflammation on Cytochrome P450 Regulation and Drug Metabolism: Lessons from Experimental Models and a Potential Role for Pharmacogenetics
Previous Article in Special Issue
Genetic Resources of Olea europaea L. in the Garda Trentino Olive Groves Revealed by Ancient Trees Genotyping and Parentage Analysis of Drupe Embryos
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Resolving the Phylogeny of the Olive Family (Oleaceae): Confronting Information from Organellar and Nuclear Genomes

1
Laboratoire Evolution & Diversité Biologique (EDB, UMR 5174), CNRS/IRD/Université Toulouse III, 118 Route de Narbonne, 31062 Toulouse, France
2
Claude E. Phillips Herbarium, Delaware State University, 1200 N. Dupont Hwy, Dover, DE 19901-2277, USA
3
Institut de Systématique Evolution Biodiversité (ISYEB), Muséum National d’Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, 57 rue Cuvier, CP39, 75005 Paris, France
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2020, 11(12), 1508; https://doi.org/10.3390/genes11121508
Submission received: 25 September 2020 / Revised: 12 November 2020 / Accepted: 11 December 2020 / Published: 16 December 2020
(This article belongs to the Special Issue Oleaceae Genetics)

Abstract

:
The olive family, Oleaceae, is a group of woody plants comprising 28 genera and ca. 700 species, distributed on all continents (except Antarctica) in both temperate and tropical environments. It includes several genera of major economic and ecological importance such as olives, ash trees, jasmines, forsythias, osmanthuses, privets and lilacs. The natural history of the group is not completely understood yet, but its diversification seems to be associated with polyploidisation events and the evolution of various reproductive and dispersal strategies. In addition, some taxonomical issues still need to be resolved, particularly in the paleopolyploid tribe Oleeae. Reconstructing a robust phylogenetic hypothesis is thus an important step toward a better comprehension of Oleaceae’s diversity. Here, we reconstructed phylogenies of the olive family using 80 plastid coding sequences, 37 mitochondrial genes, the complete nuclear ribosomal cluster and a small multigene family encoding phytochromes (phyB and phyE) of 61 representative species. Tribes and subtribes were strongly supported by all phylogenetic reconstructions, while a few Oleeae genera are still polyphyletic (Chionanthus, Olea, Osmanthus, Nestegis) or paraphyletic (Schrebera, Syringa). Some phylogenetic relationships among tribes remain poorly resolved with conflicts between topologies reconstructed from different genomic regions. The use of nuclear data remains an important challenge especially in a group with ploidy changes (both paleo- and neo-polyploids). This work provides new genomic datasets that will assist the study of the biogeography and taxonomy of the whole Oleaceae.

1. Introduction

The olive family (Oleaceae) is a medium-sized group of woody plants comprising 28 genera and ca. 700 species, distributed on all continents (except Antarctica) in both temperate and tropical environments [1]. Most species are trees, but there are also one herbaceous plant (Dimetra craibiana), small shrubs (e.g., Menodora spp.) and a few lianas (e.g., Jasminum spp., Chionanthus macrobotrys). Many Oleaceae species are of economic importance, for the production of oil and fruits (olive), timber (e.g., ash trees), as well as ornaments and fragrances (e.g., jasmines, osmanthuses, lilacs, etc). Moreover, Oleaceae are important components of temperate and tropical ecosystems, with, for example, several species producing drupes and palatable leaves as a common food source to wild animals. Individual species can also support a large number of other organisms, for example, nearly 1000 species (e.g., fungi, insects, birds) are known to be associated with Fraxinus excelsior [2].
Oleaceae is currently divided into five tribes, Myxopyreae, Jasmineae, Forsythieae, Fontanesieae and Oleeae, the latter being subdivided into four subtribes (Oleinae, Fraxininae, Ligustrinae, and Schreberinae) [3]. The natural history of the group is not completely understood yet, but its diversification seems to be associated with a few events of polyploidisation (in particular a major event of whole genome duplication in the ancestor of Oleeae) [4,5,6,7,8,9] and the evolution of various reproductive and dispersal strategies [3]. This family thus presents a substantial diversity of flowers (e.g., [1,10,11]) and fruits (e.g., capsules, samaras, drupes) [3,12,13,14]; associated with different vectors of pollination and seed dispersal. Variable breeding systems were also described, from hermaphroditism to dioecy, with several stages often considered as intermediate such as polygamy and androdioecy (e.g., [10,15,16,17,18]). In addition, a di-allelic self-incompatibility system, associated with distyly in some groups, has been reported for a number of species belonging to different tribes (i.e., Myxopyreae [15], Jasmineae [19,20], Forsythieae [21], and Oleeae [1,22,23]).
To better understand trait evolution and patterns of diversification in this group, or to resolve any lingering taxonomical issues, as in the case of the paleopolyploid tribe Oleeae [3,24,25,26], a robust phylogenetic hypothesis is then required. Oleaceae’s systematics has evolved from being based on morphological, cytological, and biochemical traits (e.g., [4,14,27]), to the use of molecular phylogenies during the last two decades. Such advances, though, have mainly consisted in studies that focused on specific groups or partially resolved phylogenetic trees [11,26,28,29,30,31,32,33,34,35,36], and not on the whole family (but see [3,37]). This has been mainly due to difficulties to sample all main Oleaceae lineages and to take into account variable ploidy levels [31]. Recent developments in genomics and museomics present new opportunities to tackle such obstacles, though. Several Oleaceae nuclear and cytoplasmic genomes, as well as transcriptomes have been released [31,38,39,40,41,42,43]. Also herbarium samples, previously deemed unusable, are now accessible allowing for a more comprehensive sampling, and the inclusion of rare, or recently extinct species [31,44,45,46].
Another product of the recent advances in genomics is the possibility to use various, independent genomic regions to reconstruct phylogenies of plant groups. The genome skimming approach, for instance, has allowed for cost-effective sequencing of high-copy fractions of total genomic DNA, such as organellar genomes, and nuclear ribosomal DNA, but it can also generate data sets for low-copy nuclear genes [47,48]. With such a diversity of genomic datasets, one can compare the phylogenetic hypothesis estimated using plastid, mitochondrial and nuclear data and increment the estimation of species trees from gene trees. Recent studies that did such comparisons include determining the origin of wild octoploid species [49], the placement of the Celastrales-Oxalidales-Malpighiales (COM) clade within Rosidae [50], and the origin and evolution of species in Ludwigia sect. Macrocarpon of Onagraceae [51].
Here, we reconstructed phylogenies of the olive family using protein-coding sequences for 80 plastid coding sequences, 37 mitochondrial genes, the nuclear ribosomal cluster and one small multigene family encoding phytochromes (phyB and phyE) of 61 representative species to document any patterns of incongruence between datasets, and discuss these in the context of the evolution of Oleaceae. The use of nuclear data remains, however, an important challenge especially in a group with frequent ploidy changes (both paleo- and neo-polyploids). Due to paleo-events of polyploidisation, the basic chromosome number varies among tribes (i.e., x = 23 in Oleeae, 14 in Forsythieae, 13 in Fontanesieae, 11 to 13 in Jasmineae, and 11 in Myxopyreae [3,4,6]. As the consequence of whole genome duplication(s), some nuclear genes could be duplicated in polyploid lineages (e.g., Oleeae), and their orthology has to be verified before using them for inferring species phylogenies. In addition, gene duplicates as well as their pseudogenes may inform us on the polyploids ancestors. Reconstructing the phylogeny of multigene families is the first step to identify gene orthologs that could be used for species phylogenetic reconstruction. Here, we chose the closely related phytochrome genes phyB and phyE, because these low-copy genes have been frequently used for inferring phylogenetic relationships in several plant families (e.g., [52,53]). All these new datasets will not only assist on the study of Oleaceae’s taxonomy, but also its biogeography.

2. Materials and Methods

2.1. Taxon Sampling and Sequencing

In this study, we sampled a total of 65 species: 61 belonging to the ingroup (Table S1) and four representing outgroups (Table S2). The ingroup included species representing all currently recognized tribes, subtribes and genera in Oleaceae. For such list, we followed the current checklist of accepted taxa in Oleaceae that has been reviewed by the staff at Royal Botanic Gardens (Kew), as part of the project “World Checklist of Selected Plant Families” [54], and the most recent literature (e.g., [32,55]). The outgroup comprised two species also in the Lamiales order, Avicennia marina (Acanthaceae) and Sesamum indicum (Pedaliaceae), and two species in the Solanales order, Capsicum annuum and Solanum lycopersicum (both in Solanaceae).
Whole genome sequences (’genome skims’) were obtained for the 61 Oleaceae species. Twenty-two samples were removed from herbarium collections specimens (Table S1). Forty-one accessions were already characterized from previous works [31,42], and we newly analyzed 20 species belonging to Jasmineae (six species, three genera), Myxopyreae (four species, three genera), Forsythieae (Abeliophyllum), and Oleeae (two accessions of Ligustrum, one of Chengiodendron, Chionanthus, Haenianthus, Syringa, Priogymnanthus, Noronhia, and Comoranthus). For these samples, total genomic DNA was extracted from ca. 5–10 mg of dried leaves. We grounded the samples in 2-mL tubes with three metal beads using a TissueLyser (Qiagen Inc., Texas). We then extracted the DNA following the BioSprint 15 DNA Plant Kit protocol (Qiagen Inc.), and eluted the extracted DNA in 200 µL of AE buffer. Shotgun sequencing (genome skimming approach) was done at the Genopole platform of Toulouse as described in Olofsson et al. [31]. Briefly, 10 to 200 ng of double stranded DNA was used to construct sequencing libraries with the Illumina TruSeq Nano HT Sample kit (Illumina), following the manufacturer’s instructions. DNA was fragmented by sonication, except for extracts from herbarium specimens, which were already highly degraded. Each sample was paired-end sequenced (150 bp) on 1/24th of an Illumina HiSeq3000 lane and multiplexed with samples from the same or different projects.

2.2. Assembly of Cytoplasmic and Nuclear DNA Regions

2.2.1. Assembly of Plastome and Nuclear Ribosomal DNA (nrDNA) Cluster

We assembled full plastomes and the nrDNA cluster following the methods of Bianconi et al. [56]. Sequencing depth in these genomic regions was superior to 100× for all investigated species. We generated a consensus sequence for both regions for each accession, and mapped reads onto them with GENEIOUS v9.0.5 [57] for manually checking the assembly quality and assessing the sequencing depth. Then, assembled plastomes and nrDNA clusters were annotated in GENEIOUS by transferring annotations from the olive tree (GenBank accessions NC013707.2 and LR031475.1 for plastid and ribosomal cluster, respectively). Finally, we generated independent alignments for the two regions using the MUSCLE algorithm [58] with default options as implemented in GENEIOUS.

2.2.2. Assembly of Mitochondrial Genes

We adopted a reference-based iterative assembly approach to retrieve a set of 37 mitochondrial protein-coding genes for each sampled species (excluding Olea europaea, Capsicum annuum, and Solanum lycopersicum, for which annotated mitochondrial genomes are already available in GenBank; Table S2). Genes located in regions homologous to plastomes (for which plastid reads mapped on; so called “mtpt” regions) were excluded. Using the reference sequence of the olive tree mitochondrial genes (MG372119.1), an initial set of homologous reads were identified by mapping using Bowtie2 v2.3.5.1 [59] in local mode (all other parameters to default values). These reads were used as the input of a de novo assembly using SPAdes v3.14.1 [60] with default parameters. The resulting contigs were then used as reference for the next round of homologous read search and assembly. After three iterations, obtained contigs for each gene were aligned using MAFFT v7.313 [61] with defaults options. Sequencing depth of mitochondrial genes was superior to 30× for all investigated species. The alignment was then inspected and annotated in GENEIOUS by transferring annotations from the olive tree and extremities were trimmed to the annotated coding-sequence.

2.2.3. Assembly of Genes Encoding Phytochromes

Finally, we analyzed phylogenetic relationships within the Oleaceae using a few nuclear phytochrome genes. Their coding part (cds) is relatively long (>3000 bp; 4 exons) and can be aligned on most of their sequence. A reference-guided approach was used to assemble genomic regions containing genes encoding phytochromes B and E (phyB, phyE), as described in [62,63]. Briefly, raw genomic data sets were filtered using the NGSQC Toolkit v.2.3.3 [64] to retain only high-quality reads (i.e., >80% of the bases with Phred quality score >20), and to remove adaptor contamination and reads with ambiguous bases. The retained reads were subsequently trimmed from the 3’ end to remove bases with Phred score <20. We mapped cleaned paired-end reads on references for genes encoding phytochromes B and E using GENEIOUS. First, exons of phyB (two genes, see Phylogenetic analyses below) and phyE genes of the ash tree (Fraxinus excelsior; GenBank accessions LR983955 to LR983957 [40]) were used as seeds to reconstruct full phyB and phyE genes of 15 Oleaceae accessions for which nuclear genome sequencing depth was superior or equal to 5× (i.e., Dimetra craibiana, Nyctanthes arbor-tristis, Abeliophyllum distichum, Forsythia × mandschurica, Fontanesia fortunei, Jasminum didymum, Jasminum pauciflorum, ChrysoJasminum fruticans, Olea europaea subsp. laperrinei, Noronhia emarginata, Ligustrum ovalifolium, Syringa pubescens, Schrebera swietenioides, Comoranthus obconicus, and Fraxinus ornus). These species are representative of all main Oleaceae lineages (tribes and subtribes) as defined by Wallander and Albert [3]. We carefully checked that phy sequences were not chimeric between related paralogs (especially between phyB-1a and phyB-1b) by a manual verification of reads phasing on gene assemblies. Then, our newly assembled genes were used to assemble exons in other species by using gene sequences of reference from the same tribe or subtribe. Partial or complete consensus coding sequences of phyB and phyE were thus obtained for the remaining 46 Oleaceae species. Consensus phy sequences of Ny. arbor-tristis and Ch. ligustrinus showed a relatively high rate of ambiguities on all genes [on average 2.38% (2.26–2.50%) and 2.11% (1.6–2.7%), respectively]. A manual checking of these gene assemblies reveals the presence of more than two distinct homologs suggesting we collapsed sequences of recently duplicated genes on these species. Finally, a few paralogs with lower homology to our references were also detected in some accessions and were further considered when their assembly covered more than 1000 bp of the coding sequence. These additional (pseudo)genes were assembled in nine distantly related species (i.e., Nor. emarginata, Chionanthus rupicolus, Ch. trichotomus, Fore. angustifolia, Sc. swietenioides, J. didymum, A. distichum, Fors. mandschurica, and Fon. fortunei). Gene sequences covering more than 90% of the coding region were annotated and deposited in GenBank (Table S3). Genes were considered as potentially non-functional when coding sequences were truncated or presented in-frame stop codon.

2.3. Phylogenetic Analyses

2.3.1. Phylogeny of Oleaceae Using Organellar DNA

All protein-coding sequences were extracted from the full plastomes and aligned separately as codons using PRANK v170427 [65] (default options for translated alignments of protein-coding DNA sequences). We then estimated a tree using the maximum likelihood (ML) algorithm in IQ-Tree2 v2.0.6 [66]. We used a concatenation approach with an edge-linked proportional partition model, using ModelFinder [67], and assessed branch support with 1000 ultrafast bootstrap (UFB) replicates [68]. The best partition scheme for each dataset was determined with PartitionFinder v2.1.1 [69] and the best fitted evolutionary model for each partition was selected according to the best BIC score with ModelFinder, as implemented in IQ-Tree2. An ML phylogenetic tree for the mitochondrial alignment was also estimated, as described above.

2.3.2. Phylogeny of Oleaceae Using nrDNA

In previous studies on the Oleeae tribe, the nrDNA cluster rendered questionable results with the unexpected phylogenetic clustering of tropical lineages (e.g., Schreberinae subtribe embedded in an Oleinae lineage including genera Chionanthus, Priogymanthus, Haenianthus, Noronhia, and Olea [30,31,46]). A strong phylogenetic bias was attributed to the highly variable GC content in the external and internal transcribed spacers (ETS and ITS) of the Oleeae tribe [31] and nrDNA was thus deemed unreliable for phylogenetic inference in this group. However, it has been suggested that a purine-pyrimidine only coding (usually referred to as RY-coding) can effectively reduce the influence of biased GC-content [70]. Before using the nrDNA dataset on the phylogenetic analyses, we thus transformed the data from regular nucleotide-coding to a RY-coding alignment. An ML phylogenetic tree was finally estimated as described above splitting the ribosomal cluster into seven partitions: 5’ETS, 18S, ITS1, 5.8S, ITS2, 26S, and 3’ETS.

2.3.3. Phylogenetic Analyses of the Nuclear phy Gene Family

Coding regions of all phy sequences were aligned together in a matrix using MAFFT (alignment provided in Supplementary Materials). We then estimated a tree for the phyB+phyE gene family by using the ML algorithm in IQ-Tree2. In this case, we estimated the best substitution model for the whole region using ModelFinder [67], and assessed branch support with 1000 UFB replicates. This analysis allowed us to infer ancestral duplications involved in the diversification of the gene family, and then identify orthologs that can be used for reconstructing phylogeny of Oleaceae. Two nuclear genes (phyB-1 and phyE-1), putatively encoding functional enzymes in most analyzed accessions, were finally selected for the phylogenetic inference of the Oleaceae family. In Oleeae, two paralogs (phyB-1a and phyB-1b) were kept, with phyB-1a arbitrarily aligned to the phyB-1 copies of other Oleaceae tribes. An ML phylogenetic tree was finally estimated as described above allowing one partition per gene.

2.3.4. Phylogenetic Inference of Family Tree Using Data from Mixed Origin

We then estimated an ML phylogeny for Oleaceae combining nuclear and organellar information and assessed congruence between the datasets by using the algorithm for concordance factors calculations implemented in IQ-Tree2. We quantified the concordance between this phylogeny and each dataset by calculating the gene concordance factor (gCF) and the site concordance factor (sCF) for each branch of the reference tree [71]. The gCF represents the fraction of individual trees (here, species tree obtained with one of the datasets) that is concordant with a given branch, and the sCF shows the proportion of alignment sites that support that branch. It thus allows us to quantify the presence of sites inside each dataset supporting the combined topology, even if the topology obtained with one individual dataset shows an alternative topology.

3. Results

3.1. Phylogenetic Reconstructions Based on Chloroplast and Mitochondrial Genes

Using chloroplastic gene data (consisting of 77,676 sites including 10,059 parsimony-informative sites), we obtained a fully-resolved tree of the family (Figure 1). Oleaceae division into five tribes (Myxopyreae, Jasmineae, Forsythieae, Fontanesieae, and Oleeae) is strongly supported. In this dataset, Myxopyreae forms a monophyletic tribe (with the Myxopyrum genus sister to Dimetra+Nyctanthes) and is the sister lineage to all other groups in Oleaceae. Jasmineae appears as sister group to Oleeae. Schreberinae are represented as the sister clade (and subtribe) to the rest of the clades in the monophyletic tribe Oleeae, and Schrebera is paraphyletic. Within the Oleeae subtribe Ligustrinae, the genus Syringa also forms a paraphyletic group. Within Oleinae, the tree consists of short branches with a few polyphyletic genera (i.e., Chionanthus, Olea, Osmanthus, and Nestegis). Branch length was particularly long in tribe Jasmineae (notably in Menodora) and at a lesser extent in the core Ligustrinae and Dimetra+Nyctanthes, suggesting an increase of the evolutionary rate of plastid genes in these clades.
In comparison to the chloroplastic DNA phylogeny, the phylogeny based on mitochondrial data (60,747 sites, 3509 parsimony-informative sites) exhibits a highly-congruent albeit less supported topology (Figure 2). We only stress one significant difference, regarding the branching order in the deepest nodes of the family, in this topology, Forsythieae is positioned as the sister clade to all other Oleaceae (and not Myxopyreae as in the chloroplast tree). Again, Jasmineae (especially Menodora) and Dimetra+Nyctanthes show longer branches suggesting an increase of the evolutionary rate in these two clades.

3.2. Phylogeny Based on the Nuclear Ribosomal Cluster

Compared to phylogenetic reconstructions based on cytoplasmic genes, the analysis of the nrDNA cluster (7008 sites, 837 parsimony-informative sites) resulted in a less-supported and quite different topology (Figure 3). Myxopyreae+Fontanesieae+Forsythieae are resolved as sister to the tribes Jasmineae and Oleeae. Myxopyreae are not monophyletic, with Myxopyrum sister to Forsythia+Fontanesia but this topology is poorly supported (UFB:64). Jasmineae is here again reported as sister to Oleeae but includes a different branching of Menodora (sister to Jasminum+Chrysojasminum). This topology presents a first strongly-supported split in Oleeae between Schreberinae and Fraxininae+Ligustrinae+Oleinae (UFB:97). Within this grouping, Fraxininae and Ligustrinae form monophyletic lineages sister to Oleinae but are not supported. Longer branches are still observed in Jasmineae and Dimetra+Nyctanthes (especially in Dimetra).

3.3. Phylogeny Based on Nuclear phy Gene Family

A second nuclear DNA phylogeny was reconstructed using phy genes. We first investigated the phylogenetic tree of the phy family in order to select the most informative orthologs. A condensed phylogenetic tree of genes encoding phytochromes E and B is shown in Figure 4 (the detailed tree is provided in Figure S1). As expected, the main distinction of two genes, phyE and phyB, was recovered.
For phyE, one supposedly functional gene (phyE-1) was detected in most Oleaceae species, although a second functional gene (phyE-2) was also assembled in tribes Forsythieae and Fontanesieae. phyE-2 is sister to a clade formed by phyE-1 and phyE of Avicennia and Sesamum (recovered from GenBank). This topology suggests an ancestral gene duplication (giving birth to phyE-1 and phyE-2) in the ancestor of Lamiales, after its divergence from Solanales. A likely pseudogenic phyE-1 paralog (namely phyE-1b) was detected in Schrebera swietenioides (Oleeae). Its phylogenetic position remains unresolved due to a polytomy with phyE-1 clades of Oleeae (namely phyE-1a) and Jasmineae. phyE-1b likely testifies to a gene duplication in the Oleeae ancestor [3,4], followed by a rapid pseudogenization of this duplicate. Interestingly, we also detected putative pseudogenes of phyE-2 in distantly related species of Jasmineae and Oleeae. Two putatively pseudogenic lineages were detected in Oleeae (phyE-2a and phyE-2b), another evidence of (pseudo)gene duplication in the ancestor of this tribe [3,4]. Based on this topology, only phyE-1 was selected for our phylogenetic analyses of species relationships because this ortholog was detected in all analyzed Oleaceae accessions, and phylogenetic relationships based on this gene support the main taxonomic lineages (i.e., tribes and subtribes) as defined by Wallander and Albert [3]. Putitatively pseudogenized copies (i.e., presence of frame shifts and/or stop codons) of phyE-1a were detected in eight species (Figure S1).
For phyB, first, two functional duplicates were detected in Solanales, Acanthaceae (Avicennia) and Pedaliaceae (Sesamum). Two main gene lineages (phyB-1 and phyB-2) were also detected in Oleaceae, but phyB-2 was detected only in Forsythieae (Forsythia and Abeliophyllum). This gene is sister to the phyB genes of Acanthaceae and Pedaliaceae. On the other hand, phyB-1 was detected in all Oleaceae species. Two closely related genes (phyB-1a and phyB-1b) were assembled in all Oleeae species, again testifying to an event of gene duplication in the ancestor of this tribe [3,4]. Based on this topology, phyB-1 was selected for species relationships analyses because this gene was detected in all analyzed accessions, and the phylogeny allowed us to retrieve all Oleaceae lineages [3]. Putatively pseudogenic copies (i.e., presence of frame shifts and/or stop codons or complete deletion of exon) of phyB-1a and phyB-1b were detected in two and four species, respectively (Figure S1).
The phylogenetic tree based on concatenated phyB-1 (a and b) and phyE-1 genes (10,438 sites, 3282 parsimony-informative sites) is shown in Figure 5. Again, the topology supports the distinction of all taxonomic units defined by Wallander and Albert [3], with tribe Myxopyreae recognized as sister to the rest of Oleaceae. As in other topologies showed above, tribes Jasmineae and Oleeae as well as subtribes Oleinae and Fraxininae are sister groups. In contrast, a major incongruence with both cytoplasmic datasets is the placement of subtribe Ligustrinae as sister to the remaining of Oleeae. This topology was recovered with phyB-1a and phyE-1a, but not with phyB-1b that supports Schreberinae as sister to the other subtribes (Figure 4 and Figure S1). Longer branches are observed in Jasmineae and Dimetra.

3.4. Phylogenetic Reconstruction Combining the Four Genomic Datasets

The combination of nuclear and cytoplasmic datasets allowed the reconstruction of a well-supported phylogeny of Oleaceae (Figure 6). All datasets broadly supported the same phylogenetic hypothesis with five strongly supported monophyletic tribes Myxopyreae, Fontanesieae, Forsythieae, Jasmineae and Oleeae. The position of Myxopyreae as sister to the rest of the family is supported by the majority of data as concordance factors attest. The branching order of Forsythieae and Fontanesieae is however difficult to decide on. For these two tribes, the topology of the species tree obtained from the combined dataset is not well-supported. The branching node of Forsythieae, despite a bootstrap support of 100, exhibits high uncertainty based on the concordance factors (gCF: 50%; sCF: 51.4%, Figure S2). The represented branching of Fontanesieae is even less supported (UFB: 64; gCF: 25%; sCF: 29.1%, Figure S2). In both cases, concordance factors show that the reported topology is not supported by most sites. Similar sCF and gCF values suggest this is due to genuine discordant signal in the trees probably due to incomplete lineage sorting. In contrast, we set Jasmineae as the sister tribe of Oleeae with confidence (UFB and gCF values of 100). The topology within Jasmineae confirms the recent reevaluation of the genus Jasminum in two distinct genera Chrysojasminum and Jasminum [36,37,54]. The other major uncertainty resides within the Oleeae tribe on the branching order of Ligustrinae and Schreberinae. Although bootstrap support and concordance factors values sustain the represented branching (Schreberinae as sister to other Oleeae subtribes), the concordance factors (especially sCF) are less decisive for the Ligustrinae split.

4. Discussion

We gathered molecular information from several genomic compartments (chloroplastic, mitochondrial and nuclear) for 61 Oleaceae species representative of all currently recognized tribes, subtribes and genera in Oleaceae. Both plastid and mitochondrial DNA datasets as well as the nrDNA cluster are based on relatively high sequencing depth (>30×) and thus of a high quality [30,41]. In contrast, low-copy nuclear genes are more difficult to assemble from genome skimming data and their use in phylogenetics is still a challenge due to lower coverage and recurrent whole genome duplications [31,48]. Here, we explored the utility of a single nuclear gene family (phyB and phyE genes) for investigating the phylogeny of the whole Oleaceae family. The obtained dataset allowed us to tackle the complex history of nuclear gene duplication and subsequent pseudogenization indicating the necessity to control for gene orthology before proposing a phylogenetic hypothesis for the whole family. By combining and confronting our datasets, we were able to establish a well-resolved phylogeny of Oleaceae although a few discordances were revealed when comparing phylogenies based on cytoplasmic and nuclear genomic regions. Overall, tribes and subtribes were strongly supported by all phylogenetic reconstructions and only very few relationships between tribes/subtribes were not fully resolved.

4.1. Taxonomy of Oleaceae

Our phylogenetic analyses confirm the divisions of Oleaceae in five tribes and four subtribes as defined by Wallander and Albert [3]. Given the amount of data we analyzed, we achieved a greater resolution and support in our phylogenetic inference of the whole family, including all currently recognized genera and considering several accessions from distant areas in the largest groups (e.g., Chionanthus, Olea, Fraxinus, Syringa, Jasminum). First, our results validated the grouping of Nyctanthes, Dimetra and Myxopyrum in Myxopyreae [3,72] and overall supported this clade as sister to all other lineages in the family. We were also able to corroborate some of the less-reliable nodes and in particular the sister tribes Jasmineae and Oleeae. We resolved the relationships between Forsythieae and Fontanesieae as being distinct and non-sister tribes. We also put into question the idea that Ligustrinae is sister to all other lineages in Oleeae [3,35,36] favoring the alternative hypothesis of Schreberinae being the one (as in [31], where the whole plastid genome and single-nucleotide polymorphisms datasets gathered from more than 11,000 nuclear genes were used). Finally, we were also able to better define the relationships within Oleeae wherein some genera appeared as polyphyletic (i.e., Chionanthus, Olea, Osmanthus, Nestegis) or paraphyletic (i.e., Schrebera, Syringa) confirming previous reports from the literature [26,28,30,31,32].
A relatively high congruence was obtained between phylogenies based on plastid and mitochondrial DNA datasets (Figure 1 and Figure 2), as expected for maternally inherited genomes [42]. We obtained the best resolution with the chloroplastic dataset as it contains more informative sites. Topologies based on phy genes and cytoplasmic genomes were also quite congruent although the relative placement of Ligustrinae and Schreberinae as well as Forsythieae and Fontanesieae differ according to phy genes (Figure 4 and Figure 5). In contrast, the nrDNA cluster provided less reliable information than organellar genomes and phy nuclear genes (Figure 3). Phylogenetic biases related to GC content and incomplete concerted evolution have been already reported in Oleaceae for the nrDNA marker (e.g., [10,31,46]), which thus needs to be interpreted with caution. Yet, the RY-coding seems to have greatly improved the topology since all Oleeae subtribes were retrieved in contrast to previous analyses [31,46] (see Figure S3 for the ML phylogeny from the original alignment).

4.2. Nuclear Gene Orthology and Polyploidization Events in Oleaceae

The analysis of a small multigene family revealed other aspects on the Oleaceae history, related to past whole genome duplications and different tempo of pseudogenization. First, two divergent functional paralogs were revealed on phyE and phyB, but only in Fontanesieae and Forsythieae. The duplication of these genes (possibly due to whole genome duplication) is ancient, likely preceding the divergence of Lamiales, and the pseudogenisation of phy-B2 and phy-E2 in tribes Myxopyreae, Jasmineae and Oleeae may have occurred rapidly after their divergence. Only pseudo-phy-E2 was still detected in Jasmineae and Oleeae. More interestingly, the detection of two closely related paralogs of phyB-1, phyE-1 and pseudo-phyE-2 in all Oleeae species is highly congruent with the reported event of polyploidization in their common ancestor [3,4]. As we decided to collapse highly homologous sequences of phy genes, we were not able to investigate the fate of these genes in neopolyploids, but we detected a relatively high level of ambiguities in the tetraploid Ny. arbor-tristis [73] as well as in Ch. ligustrinus for which the chromosome number is unknown.

5. Concluding Remarks and Future Directions in Oleaceae Phylogenomics

Our work provided a more robust phylogenetic history of Oleaceae than previous works, a crucial prerequisite to study the diversification process of this family. A complex history of gene duplication and pseudogenization was also revealed, and these aspects need to be evaluated before using nuclear data in the reconstruction of phylogenies, especially in a plant family with paleopolyploids such as Oleaceae. Moreover, our prospective study also demonstrated the limits of using phy genes to estimate a tree due to the variable levels of gene retention and the presence of non-functional sequences. With the higher accessibility of genomic data, some of these caveats can be circumvented with the use of new methodologies such as the analyses of UCE (Ultra Conserved Elements) or universal single-copy orthologs (e.g., [74,75,76]). Although, in the light of the complicated history of evolution of plants (e.g., multiple reported events of whole genome duplication), we stress the importance of taking gene orthology into account when estimating species trees.
When it comes to our current and future goals with the study of the phylogenomics of Oleaceae, the complete sequencing of nuclear genomes (with at least 30–50× coverage) is in progress in our lab. We are mainly focusing on low heterozygous diploid species, and avoiding neo-polyploids and hybrids. In addition, since this study confirmed that cytoplasmic and nuclear ribosomal DNA sequences can be easily assembled independent of species ploidy, we are using those genomic regions on a comprehensive sampling to reconstruct a fossil-calibrated phylogeny of the family. Finally, with this large phylogeny of Oleaceae we will explore the causes of variable evolutionary rates among genomes, considering factors as generation time (e.g., short living species exhibit particularly long branches in phylogenetic reconstructions) [77], gene duplication, genome inheritance, and recombination rate [77,78,79].

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/11/12/1508/s1. Table S1. List of Oleaceae accessions analyzed in our study, with their taxonomy, accession number and origin. Table S2. List of species used as outgroups in our phylogenetic analyses. Table S3. GenBank no of genomic regions for each accession. Figure S1. Full representation of the midpoint-rooted maximum likelihood phylogenetic tree of the phy gene family in Oleaceae. Figure S2. Maximum likelihood topology of Oleaceae estimated from the partitioned analysis of the four datasets with corresponding concordance factors of nodes. Figure S3. Maximum likelihood phylogenetic tree of Oleaceae based on the non-transformed nrDNA cluster alignment. Materials S1 to S11. Sequence alignments used for phylogenetic reconstructions, and tree files.

Author Contributions

Conceptualization: J.D. and G.B.; Plant sampling: J.D., C.H.-W., M.G., and G.B.; Lab work: J.D., S.M., and G.B.; Data analyses: J.D., P.R., and G.B.; Manuscript writing: J.D, P.R., and G.B.; Funding acquisition: J.D. and G.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the FruitFul grant (H2020-MSCA-IF-2018-842234), the ERA-NET BiodivERsA project INFRAGECO (ANR-16-EBI3-0014), and by the grant GeneRes (Occitanie-France Olive). In addition, J.D., P.R., S.M. and G.B. are members of the EDB laboratory, which is supported by the excellence projects Labex CEBA (ANR-10-LABX-25-01) and Labex TULIP (ANR-10-LABX-0041), managed by the French ANR.

Acknowledgments

We are grateful to the Genotoul bioinformatics platform Toulouse Occitanie (Bioinfo Genotoul, doi:10.15454/1.5572369328961167E12) for providing computing and storage resources, and to Céline Van de Paer for help in gene annotation of plastomes. We also thank three anonymous reviewers for their constructive comments.

Conflicts of Interest

The authors declare no competing interests. Funders played no role in the study design; data collection, analysis or interpretation; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Green, P.S. Oleaceae. In The Families and Genera of Vascular Plants, Flowering Plants, Dicotyledons; Kubitzki, K., Kadereit, J.W., Eds.; Springer: New York, NY, USA, 2004; Volume 7, pp. 296–306. [Google Scholar]
  2. Mitchell, R.J.; Beaton, J.K.; Bellamy, P.E.; Broome, A.; Chetcuti, J.; Eaton, S.; Ellis, C.J.; Gimona, A.; Harmer, R.; Hester, A.J.; et al. Ash dieback in the UK: A review of the ecological and conservation implications and potential management options. Biol. Conserv. 2014, 175, 95–109. [Google Scholar] [CrossRef]
  3. Wallander, E.; Albert, V.A. Phylogeny and classification of Oleaceae based on rps16 and trnL-F sequence data. Am. J. Bot. 2000, 87, 1827–1841. [Google Scholar] [CrossRef] [PubMed]
  4. Taylor, H. Cyto-taxonomy and phylogeny of the Oleaceae. Brittonia 1945, 5, 337–367. [Google Scholar] [CrossRef]
  5. Briggs, B.G. Some chromosome numbers in the Oleaceae. Contrib. N. S. W. Natl. Herb. 1970, 4, 126–129. [Google Scholar]
  6. George, K.; Geethamma, S. Cytology and evolution of jasmines. Cytologia 1992, 57, 27–32. [Google Scholar] [CrossRef] [Green Version]
  7. Besnard, G.; Garcia-Verdugo, C.; Rubio de Casas, R.; Treier, U.A.; Galland, N.; Vargas, P. Polyploidy in the olive complex (Olea europaea): Evidence from flow cytometry and nuclear microsatellite analyses. Ann. Bot. 2008, 101, 25–30. [Google Scholar] [CrossRef]
  8. Lattier, J.D.; Contreras, R.N. Ploidy and genome size in lilac species, cultivars, and interploid hybrids. J. Am. Soc. Hortic. Sci. 2017, 142, 355–366. [Google Scholar] [CrossRef] [Green Version]
  9. Whittemore, A.T.; Cambell, J.J.N.; Zheng-Lian, X.; Carlson, C.H.; Atha, D.; Olsen, R.T. Ploidy variation in Fraxinus L. (Oleaceae) of eastern North America: Genome size diversity and taxonomy in a suddenly endangered genus. Int. J. Plant Sci. 2018, 179, 377–389. [Google Scholar] [CrossRef]
  10. Wallander, E. Systematics of Fraxinus (Oleaceae) and evolution of dioecy. Plant Syst. Evol. 2008, 273, 25–49. [Google Scholar] [CrossRef]
  11. Hinsinger, D.D.; Bask, J.; Gaudeul, M.; Cruaud, C.; Bertolino, P.; Frascaria-Lacoste, N.; Bousquet, J. The phylogeny and biogeographic history of ashes (Fraxinus, Oleaceae) highlight the roles of migration and vicariance in the diversification of temperate trees. PLoS ONE 2013, 8, e80431. [Google Scholar] [CrossRef]
  12. Rohwer, J.G. A preliminary survey of the fruits and seeds of the Oleaceae. Bot. Jahrb. Syst. 1993, 115, 271–291. [Google Scholar]
  13. Rohwer, J.G. Fruit and seed structures in Menodora (Oleaceae): A comparison with Jasminum. Bot. Acta 1995, 108, 163–168. [Google Scholar] [CrossRef]
  14. Rohwer, J.G. Die Frucht- und Samenstrukturen der Oleaceae. Bibl. Bot. 1996, 148, 1–177. [Google Scholar]
  15. Kiew, R. Preliminary pollen study of the Oleaceae in Malesia. Gard. Bull. 1984, 37, 225–230. [Google Scholar]
  16. Lepart, J.; Dommée, B. Is Phillyrea angustifolia L. (Oleaceae) an androdioecious species? Bot. J. Linn. Soc. 1992, 108, 375–387. [Google Scholar] [CrossRef]
  17. Green, P.S. A revision of Olea L. (Oleaceae). Kew Bull. 2002, 57, 91–140. [Google Scholar] [CrossRef]
  18. Saumitou-Laprade, P.; Vernet, P.; Dowkiw, A.; Bertrand, S.; Billiard, S.; Albert, B.; Gouyon, P.H.; Dufay, M. Polygamy or subdioecy? The impact of diallelic self-incompatibility on the sexual system in Fraxinus excelsior(Oleaceae). Proc. R. Soc. B Biol. Sci. 2018, 285, 20180004. [Google Scholar] [CrossRef] [Green Version]
  19. Thompson, J.D.; Dommée, B. Morph-specific patterns of variation in stigma height in natural populations of distylous Jasminum fruticans. New Phytol. 2000, 148, 303–314. [Google Scholar] [CrossRef]
  20. Olesen, J.M.; Dupont, Y.L.; Ehlers, B.K.; Valido, A.; Hansen, D.M. Heterostyly in the Canarian endemic Jasminum odoratissimurn (Oleaceae). Nord. J. Bot. 2005, 23, 537–539. [Google Scholar] [CrossRef]
  21. Ryu, T.Y.; Yeam, D.Y.; Kim, Y.J.; Kim, S.J. Studies on heterostyly incompatibility of Abeliophyllum distichum. Seoul Natl. Univ. Coll. Agric. Bull. 1976, 1, 113–120. [Google Scholar]
  22. Saumitou-Laprade, P.; Vernet, P.; Vassiliadis, C.; Hoareau, Y.; de Magny, G.; Dommée, B.; Lepart, J. A self-incompatibility system explains high male frequencies in an androdioecious plant. Science 2010, 327, 1648–1650. [Google Scholar] [CrossRef] [PubMed]
  23. Vernet, P.; Lepercq, P.; Billiard, S.; Bourceaux, A.; Lepart, J.; Dommée, B.; Saumitou-Laprade, P. Evidence for the long-term maintenance of a rare self-incompatibility system in Oleaceae. New Phytol. 2016, 210, 1408–1417. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Johnson, L.A.S. A review of the family Oleaceae. Contrib. N. S. W. Natl. Herb. 1957, 2, 395–418. [Google Scholar]
  25. Stearn, W.T. Union of Chionanthus and Linociera (Oleaceae). Ann. Mo. Bot. Gard. 1976, 63, 355–357. [Google Scholar] [CrossRef]
  26. Li, J.; Alexander, J.H.; Zhang, D. Paraphyletic Syringa (Oleaceae): Evidence from sequences of nuclear ribosomal DNA ITS and ETS regions. Syst. Bot. 2002, 27, 592–597. [Google Scholar]
  27. Harborne, J.B.; Green, P.S. A chemotaxonomic survey of flvonoids in leaves of the Oleaceae. Bot. J. Linn. Soc. 1980, 81, 155–167. [Google Scholar] [CrossRef]
  28. Besnard, G.; Rubio de Casas, R.; Christin, P.A.; Vargas, P. Phylogenetics of Olea (Oleaceae) based on plastid and nuclear ribosomal DNA sequences: Tertiary climatic shifts and lineage differentiation times. Ann. Bot. 2009, 104, 143–160. [Google Scholar] [CrossRef] [Green Version]
  29. Yuan, W.J.; Zhang, W.R.; Han, Y.J.; Dong, M.F.; Shang, F.D. Molecular phylogeny of Osmanthus (Oleaceae) based on non-coding chloroplast and nuclear ribosomal internal transcribed spacer regions. J. Syst. Evol. 2010, 48, 482–489. [Google Scholar] [CrossRef]
  30. Hong-Wa, C.; Besnard, G. Intricate patterns of phylogenetic relationships in the olive family as inferred from multi-locus plastid and nuclear DNA sequence analyses: A close-up on Chionanthus and Noronhia (Oleaceae). Mol. Phylogenet. Evol. 2013, 67, 367–378. [Google Scholar] [CrossRef]
  31. Olofsson, J.K.; Cantera, I.; Van de Paer, C.; Hong-Wa, C.; Zedane, L.; Dunning, L.T.; Alberti, A.; Christin, P.A.; Besnard, G. Phylogenomics using low-depth whole genome sequencing: A case study with the olive tribe. Mol. Ecol. Resour. 2019, 19, 877–892. [Google Scholar] [CrossRef]
  32. Li, Y.F.; Zhang, M.; Wang, X.R.; Sylvester, S.P.; Xiang, Q.B.; Li, X.; Li, M.; Zhu, H.; Zhang, C.; Chen, L.; et al. Revisiting the phylogeny and taxonomy of Osmanthus (Oleaceae) including description of the new genus Chengiodendron. Phytotaxa 2020, 436, 283–292. [Google Scholar] [CrossRef]
  33. Kim, K.J.; Jansen, R.K. A chloroplast DNA phylogeny of lilacs (Syringa, Oleaceae): Plastome groups show a strong correlation with crossing groups. Am. J. Bot. 1998, 85, 1338–1351. [Google Scholar] [CrossRef] [PubMed]
  34. Lee, H.L.; Jansen, R.K.; Chumley, T.W.; Kim, K.J. Gene relocations within chloroplast genomes of Jasminum and Menodora (Oleaceae) are due to multiple, overlapping inversions. Mol. Biol. Evol. 2007, 24, 1161–1180. [Google Scholar] [CrossRef] [Green Version]
  35. Kim, D.; Kim, J. Molecular phylogeny of tribe Forsythieae (Oleaceae) based on nuclear ribosomal DNA internal transcribed spacers and plastid DNA trnL-F and matK gene sequences. J. Plant Res. 2011, 124, 339–347. [Google Scholar] [CrossRef]
  36. Ha, Y.H.; Kim, C.; Choi, K.; Kim, J.H. Molecular phylogeny and dating of Forsythieae (Oleaceae) provide insight into the Miocene history of Eurasian temperate shrubs. Front. Plant Sci. 2018, 9, 99. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Jeyarani, J.N.; Yohannan, R.; Vijayavalli, D.; Dwivedi, M.D.; Pandey, A.K. Phylogenetic analysis and evolution of morphological characters in the genus Jasminum L. (Oleaceae) in India. J. Genet. 2018, 97, 1225–1239. [Google Scholar] [CrossRef]
  38. Cruz, F.; Julca, I.; Gómez-Garrido, J.; Loska, D.; Marcet-Houben, M.; Cano, E.; Galán, B.; Frias, L.; Ribeca, P.; Derdak, S.; et al. Genome sequence of the olive tree, Olea europaea. GigaScience 2016, 5, 29. [Google Scholar] [CrossRef]
  39. Unver, T.; Wu, Z.; Sterck, L.; Turktas, M.; Lohaus, R.; Li, Z.; Yang, M.; He, L.; Deng, T.; Escalante, F.J.; et al. Genome of wild olive and the evolution of oil biosynthesis. Proc. Natl. Acad. Sci. USA 2017, 114, E9413–E9422. [Google Scholar] [CrossRef] [Green Version]
  40. Sollars, E.S.A.; Harper, A.L.; Kelly, L.J.; Sambles, C.M.; Ramirez-Gonzalez, R.H.; Swarbreck, D.; Kaithakottil, G.; Cooper, E.D.; Uauy, C.; Havlickova, L.; et al. Genome sequence and genetic diversity of European ash trees. Nature 2017, 541, 212–216. [Google Scholar] [CrossRef]
  41. Kelly, L.J.; Plumb, W.J.; Carey, D.W.; Mason, M.E.; Cooper, E.D.; Crowther, W.; Whittemore, A.T.; Rossiter, S.J.; Kock, J.L.; Buggs, R.J.A. Convergent molecular evolution among ash species resistant to the emerald ash borer. Nat. Ecol. Evol. 2020, 4, 1116–1128. [Google Scholar] [CrossRef]
  42. Van de Paer, C.; Bouchez, O.; Besnard, G. Prospects on the evolutionary mitogenomics of plants: A case study on the olive family (Oleaceae). Mol. Ecol. Resour. 2018, 18, 409–423. [Google Scholar] [CrossRef] [PubMed]
  43. Zhang, C.; Zhang, T.; Luebert, F.; Xiang, Y.; Huang, C.H.; Hu, Y.; Rees, M.; Frohlich, M.W.; Qi, J.; Weigend, M.; et al. Asterid phylogenomics/phylotranscriptomics uncover morphological evolutionary histories and support phylogenetic placement for numerous whole-genome duplications. Mol. Biol. Evol. 2020, 37, 3188–3210. [Google Scholar] [CrossRef] [PubMed]
  44. Bieker, V.C.; Martin, M.D. Implications and future prospects for evolutionary analyses of DNA in historical herbarium collections. Bot. Lett. 2018, 165, 409–418. [Google Scholar] [CrossRef] [Green Version]
  45. Van de Paer, C.; Hong-Wa, C.; Jeziorski, C.; Besnard, G. Mitogenomics of Hesperelaea, an extinct genus of Oleaceae. Gene 2016, 594, 197–202. [Google Scholar] [CrossRef] [PubMed]
  46. Zedane, L.; Hong-Wa, C.; Murienne, J.; Jeziorski, C.; Baldwin, B.G.; Besnard, G. Museomics illuminate the history of an extinct, paleoendemic plant lineage (Hesperelaea, Oleaceae) known from an 1875 collection from Guadalupe Island, Mexico. Biol. J. Linn. Soc. 2016, 117, 44–57. [Google Scholar] [CrossRef] [Green Version]
  47. Straub, S.C.K.; Parks, M.; Weitemier, K.; Fishbein, M.; Cronn, R.C.; Liston, A. Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics. Am. J. Bot. 2012, 99, 349–364. [Google Scholar] [CrossRef] [Green Version]
  48. Berger, B.A.; Han, J.; Sessa, E.B.; Gardner, A.G.; Shepherd, K.A.; Ricigliano, V.A.; Jabaily, R.S.; Howarth, D.G. The unexpected depths of genome-skimming data: A case study examining Goodeniaceae floral symmetry genes. Appl. Plant Sci. 2017, 5, 1700042. [Google Scholar] [CrossRef]
  49. Govindarajulu, R.; Parks, M.; Tennessen, J.A.; Liston, A.; Ashman, T.L. Comparison of nuclear, plastid, and mitochondrial phylogenies and the origin of wild octoploid strawberry species. Am. J. Bot. 2015, 102, 544–554. [Google Scholar] [CrossRef] [Green Version]
  50. Sun, M.; Soltis, D.E.; Soltis, P.S.; Zhu, X.; Burleigh, J.G.; Chen, Z. Deep phylogenetic incongruence in the angiosperm clade Rosidae. Mol. Phylogenet. Evol. 2015, 83, 156–166. [Google Scholar] [CrossRef]
  51. Liu, S.H.; Edwards, C.E.; Hoch, P.C.; Raven, P.H.; Barber, J.C. Genome skimming provides new insight into the relationships in Ludwigia section Macrocarpon, a polyploid complex. Am. J. Bot. 2018, 105, 875–887. [Google Scholar] [CrossRef]
  52. Mathews, S.; Lavin, M.; Sharrock, R.A. Evolution of the phytochrome gene family and its utility for phylogenetic analyses of angiosperms. Ann. Mo. Bot. Gard. 1995, 82, 296–321. [Google Scholar] [CrossRef]
  53. Mathews, S.; Tsai, R.C.; Kellogg, E.A. Phylogenetic structure in the grass family (Poaceae): Evidence from the nuclear gene phytochrome B. Am. J. Bot. 2000, 87, 96–107. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. WCSP. World Checklist of Selected Plant Families. Facilitated by the Royal Botanic Gardens, Kew. 2020. Published on the Internet. Available online: http://wcsp.science.kew.org/ (accessed on 4 June 2020).
  55. Banfi, E. Chrysojasminum, a new genus for Jasminum sect. Alternifolia (Oleaceae, Jasmineae). Nat. Hist. Sci. 2014, 1, 3–6. [Google Scholar] [CrossRef]
  56. Bianconi, M.; Hackel, J.; Vorontsova, M.S.; Alberti, A.; Arthan, W.; Burke, S.V.; Duvall, M.R.; Kellogg, E.A.; Lavergne, S.; McKain, M.; et al. Continued adaptation of C4 photosynthesis after an initial burst of changes in the Andropogoneae grasses. Syst. Biol. 2020, 69, 445–461. [Google Scholar] [CrossRef] [PubMed]
  57. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. GENEIOUS Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef]
  58. Edgar, R.C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [Green Version]
  59. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Meth. 2012, 9, 357–359. [Google Scholar] [CrossRef] [Green Version]
  60. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D.; et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [CrossRef] [Green Version]
  61. Katoh, K.; Misawa, K.; Kuma, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef] [Green Version]
  62. Besnard, G.; Christin, P.A.; Malé, P.J.; Coissac, E.; Lhuillier, E.; Lauzeral, C.; Vorontsova, M.S. From museums to genomics: Old herbarium specimens shed light on a C3 to C4 transition. J. Exp. Bot. 2014, 65, 6711–6721. [Google Scholar] [CrossRef]
  63. Besnard, G.; Bianconi, M.E.; Hackel, J.; Manzi, S.; Vorontsova, M.S.; Christin, P.A. Herbarium genomics retrace the origins of C4-specific carbonic anhydrase in Andropogoneae (Poaceae). Bot. Lett. 2018, 165, 419–433. [Google Scholar] [CrossRef]
  64. Patel, R.K.; Jain, M. NGS QC Toolkit: A toolkit for quality control of next generation sequencing data. PLoS ONE 2012, 7, e30619. [Google Scholar] [CrossRef] [PubMed]
  65. Löytynoja, A. Phylogeny-aware alignment with PRANK. Meth. Mol. Biol. 2014, 1079, 155–170. [Google Scholar]
  66. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Meth. 2017, 14, 587–589. [Google Scholar] [CrossRef] [Green Version]
  68. Hoang, D.T.; Chernomor, O.; von Haeseler, A.; Minh, B.Q.; Vinh, L.S. UFBoot2: Improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 2018, 35, 518–522. [Google Scholar] [CrossRef] [PubMed]
  69. Lanfear, R.; Frandsen, P.B.; Wright, A.M.; Senfeld, T.; Calcott, B. PartitionFinder 2: New methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol. Biol. Evol. 2017, 34, 772–773. [Google Scholar] [CrossRef] [Green Version]
  70. Phillips, M.J.; Delsuc, F.; Penny, D. Genome-scale phylogeny and the detection of systematic biases. Mol. Biol. Evol. 2004, 21, 1455–1458. [Google Scholar] [CrossRef] [Green Version]
  71. Minh, B.Q.; Hahn, M.W.; Lanfear, R. New methods to calculate concordance factors for phylogenomic datasets. Mol. Biol. Evol. 2020, 37, 2727–2733. [Google Scholar] [CrossRef]
  72. Kiew, R.; Baas, P. Nyctanthes is a member of Oleaceae. Proc. Ind. Acad. Sci. 1984, 93, 349–358. [Google Scholar]
  73. George, K.; Geethamma, S. Cytological and other evidences for the taxonomic position of Nyctanthes arbor-tristis. Curr. Sci. 1984, 53, 439–441. [Google Scholar]
  74. Huang, C.H.; Zhang, C.; Liu, M.; Hu, Y.; Gao, T.; Qi, J.; Ma, H. Multiple polyploidization events across Asteraceae with two nested events in the early history revealed by nuclear phylogenomics. Mol. Biol. Evol. 2016, 33, 2820–2835. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  75. Waterhouse, R.M.; Seppey, M.; Simão, F.A.; Manni, M.; Ioannidis, P.; Klioutchnikov, G.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 2018, 35, 543–548. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Zhang, F.; Ding, Y.; Zhu, C.D.; Zhou, X.; Orr, M.C.; Scheu, S.; Luan, Y.X. Phylogenomics from low-coverage whole-genome sequencing. Meth. Ecol. Evol. 2019, 10, 507–517. [Google Scholar] [CrossRef]
  77. Smith, S.A.; Donoghue, M.J. Rates of molecular evolution are linked to life history in flowering plants. Science 2008, 322, 86–89. [Google Scholar] [CrossRef] [Green Version]
  78. Larracuente, A.M.; Sackton, T.B.; Greenberg, A.J.; Wong, A.; Singh, N.D.; Sturgill, D.; Zhang, Y.; Oliver, B.; Clark, A.G. Evolution of protein-coding genes in Drosophila. Trends Genet. 2008, 24, 114–123. [Google Scholar] [CrossRef]
  79. Yang, L.; Gaut, B.S. Factors that contribute to variation in evolutionary rate among Arabidopsis genes. Mol. Biol. Evol. 2011, 28, 2359–2369. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Maximum likelihood phylogenetic tree of Oleaceae based on concatenated coding sequences of 80 plastid genes. The tree was rooted on the split with Solanaceae. The scale is in substitution per site. Ultrafast bootstrap support values are indicated on nodes when inferior to 100.
Figure 1. Maximum likelihood phylogenetic tree of Oleaceae based on concatenated coding sequences of 80 plastid genes. The tree was rooted on the split with Solanaceae. The scale is in substitution per site. Ultrafast bootstrap support values are indicated on nodes when inferior to 100.
Genes 11 01508 g001
Figure 2. Maximum likelihood phylogenetic tree of Oleaceae based on the concatenation of 37 mitochondrial genes. The tree was rooted on the split with Solanaceae. The scale is in substitution per site. Ultrafast bootstrap support values are indicated on nodes when inferior to 100.
Figure 2. Maximum likelihood phylogenetic tree of Oleaceae based on the concatenation of 37 mitochondrial genes. The tree was rooted on the split with Solanaceae. The scale is in substitution per site. Ultrafast bootstrap support values are indicated on nodes when inferior to 100.
Genes 11 01508 g002
Figure 3. Maximum likelihood phylogenetic tree of Oleaceae based on the complete RY-coded nrDNA cluster alignment. The tree was rooted on the split with Solanaceae. The scale is in substitution per site.
Figure 3. Maximum likelihood phylogenetic tree of Oleaceae based on the complete RY-coded nrDNA cluster alignment. The tree was rooted on the split with Solanaceae. The scale is in substitution per site.
Genes 11 01508 g003
Figure 4. Reduced representation of the midpoint-rooted maximum likelihood phylogenetic tree of the phy gene family in Oleaceae. Only ultrafast bootstrap (UFB) values inferior to 100 are indicated on nodes. Putative pseudogenes are denoted by dashed lines.
Figure 4. Reduced representation of the midpoint-rooted maximum likelihood phylogenetic tree of the phy gene family in Oleaceae. Only ultrafast bootstrap (UFB) values inferior to 100 are indicated on nodes. Putative pseudogenes are denoted by dashed lines.
Genes 11 01508 g004
Figure 5. Maximum likelihood phylogenetic tree of Oleaceae based on phyB-1 (a and b) and phyE-1 nuclear genes. Oleeae phyB-1a was arbitrarily aligned with phyB-1 of other tribes. The tree was rooted on the split with Solanaceae. The scale is in substitution per site. Ultrafast bootstrap support values are indicated on nodes when inferior to 100.
Figure 5. Maximum likelihood phylogenetic tree of Oleaceae based on phyB-1 (a and b) and phyE-1 nuclear genes. Oleeae phyB-1a was arbitrarily aligned with phyB-1 of other tribes. The tree was rooted on the split with Solanaceae. The scale is in substitution per site. Ultrafast bootstrap support values are indicated on nodes when inferior to 100.
Genes 11 01508 g005
Figure 6. Maximum likelihood topology of Oleaceae family estimated from the partitioned concatenation of 80 plastid coding sequence, 37 mitochondrial genes, the complete nuclear ribosomal DNA cluster and three nuclear genes encoding phytochromes (phyE-1, phyB-1a, phyB-1b). Concordance factors were calculated in relation to the species trees inferred for each partitioned dataset. Gene concordance factors are represented by the green/purple pie charts (left), site concordance factors by the blue/orange ones (right). UFB support values are indicated near their respective nodes.
Figure 6. Maximum likelihood topology of Oleaceae family estimated from the partitioned concatenation of 80 plastid coding sequence, 37 mitochondrial genes, the complete nuclear ribosomal DNA cluster and three nuclear genes encoding phytochromes (phyE-1, phyB-1a, phyB-1b). Concordance factors were calculated in relation to the species trees inferred for each partitioned dataset. Gene concordance factors are represented by the green/purple pie charts (left), site concordance factors by the blue/orange ones (right). UFB support values are indicated near their respective nodes.
Genes 11 01508 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Dupin, J.; Raimondeau, P.; Hong-Wa, C.; Manzi, S.; Gaudeul, M.; Besnard, G. Resolving the Phylogeny of the Olive Family (Oleaceae): Confronting Information from Organellar and Nuclear Genomes. Genes 2020, 11, 1508. https://doi.org/10.3390/genes11121508

AMA Style

Dupin J, Raimondeau P, Hong-Wa C, Manzi S, Gaudeul M, Besnard G. Resolving the Phylogeny of the Olive Family (Oleaceae): Confronting Information from Organellar and Nuclear Genomes. Genes. 2020; 11(12):1508. https://doi.org/10.3390/genes11121508

Chicago/Turabian Style

Dupin, Julia, Pauline Raimondeau, Cynthia Hong-Wa, Sophie Manzi, Myriam Gaudeul, and Guillaume Besnard. 2020. "Resolving the Phylogeny of the Olive Family (Oleaceae): Confronting Information from Organellar and Nuclear Genomes" Genes 11, no. 12: 1508. https://doi.org/10.3390/genes11121508

APA Style

Dupin, J., Raimondeau, P., Hong-Wa, C., Manzi, S., Gaudeul, M., & Besnard, G. (2020). Resolving the Phylogeny of the Olive Family (Oleaceae): Confronting Information from Organellar and Nuclear Genomes. Genes, 11(12), 1508. https://doi.org/10.3390/genes11121508

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop