Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid

Zhang, Xiaoshen; Deng, Minjie; Fan, Guoqiang

doi:10.3390/ijms15035079

Open AccessArticle

Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid

by

Xiaoshen Zhang

,

Minjie Deng

and

Guoqiang Fan

^*

Institute of Paulownia, Henan Agricultural University, 95 Wenhua Road, Jinsui District, Zhengzhou 450002, Henan, China

^*

Author to whom correspondence should be addressed.

Int. J. Mol. Sci. 2014, 15(3), 5079-5093; https://doi.org/10.3390/ijms15035079

Submission received: 13 December 2013 / Revised: 17 February 2014 / Accepted: 18 February 2014 / Published: 21 March 2014

(This article belongs to the Section Biochemistry)

Download

Browse Figures

Versions Notes

Abstract

:

Paulownia fortunei is an ecologically and economically important tree species that is widely used as timber and chemical pulp. Its autotetraploid, which carries a number of valuable traits, was successfully induced with colchicine. To identify differences in gene expression between P. fortunei and its synthesized autotetraploid, we performed transcriptome sequencing using an Illumina Genome Analyzer IIx (GAIIx). About 94.8 million reads were generated and assembled into 383,056 transcripts, including 18,984 transcripts with a complete open reading frame. A conducted Basic Local Alignment Search Tool (BLAST) search indicated that 16,004 complete transcripts had significant hits in the National Center for Biotechnology Information (NCBI) non-redundant database. The complete transcripts were given functional assignments using three public protein databases. One thousand one hundred fifty eight differentially expressed complete transcripts were screened through a digital abundance analysis, including transcripts involved in energy metabolism and epigenetic regulation. Finally, the expression levels of several transcripts were confirmed by quantitative real-time PCR. Our results suggested that polyploidization caused epigenetic-related changes, which subsequently resulted in gene expression variation between diploid and autotetraploid P. fortunei. This might be the main mechanism affected by the polyploidization. Our results represent an extensive survey of the P. fortunei transcriptome and will facilitate subsequent functional genomics research in P. fortunei. Moreover, the gene expression profiles of P. fortunei and its autopolyploid will provide a valuable resource for the study of polyploidization.

Keywords:

Paulownia fortunei; transcriptome; polyploidy; de novo assembly; next-generation sequencing

1. Introduction

Paulownia is a genus native to China. The two species most often cultivated in China are Paulownia fortunei and Paulownia elongata. Paulownia is a fast-growing, short-rotation timber crop that is also valuable for the production of chemical pulp [1,2]. Additionally, its wood exhibits rot resistance, dimensional stability and a high ignition point; thus, Paulownia is widely used for making furniture, aircraft, plywood, toys and musical instruments [3]. The tolerance of Paulownia to drought and soil extremes makes it commercially important for use in the reclamation of surface-mined land [4]. Indeed, this genus has been suggested for the reforestation of nutrient-poor soils [5].

Polyploidy is the heritable condition of possessing more than two complete sets of chromosomes [6]. Generally, polyploids are divided into autopolyploids arising by chromosome doubling of a single species and allopolyploids arising via interspecific hybridization and subsequent chromosome doubling. They play an important role in plant evolution. Recent estimates indicate that 15% of angiosperm and 31% of fern speciation events are accompanied by an increase in ploidy [7]. Many major crops, including wheat, cotton, oat, coffee, potato and oilseed rape, are polyploids [8,9]. Polyploids have also been induced by experimental treatments, such as heat shock and colchicines [10]. Rapid genomic and gene expression changes have been demonstrated in many synthesized and natural polyploid plants, including Arabidopsis [11,12], Brassica [13,14], Gossypium [15], Spartina [16] and Tragopogon [17].

Polyploidy provides genome “buffering” by increasing allelic diversity and heterosis, and it facilitates the creation of novel phenotypic variation and asexual reproduction [6,18], which may be valuable in plant breeding. With the goals of enriching the germplasm of Paulownia and increasing the desired traits, our lab successfully induced an autopolyploid of P. fortunei with the desirable wood properties using colchicine [19]. We were unable to characterize the genetic differences between diploid and autotetraploid P. fortunei on a grand scale, due to a lack of genomic sequence data, until the development of high-throughput next-generation sequencing (NGS). This technology allowed us to perform a short read-based transcriptome analysis of non-model organisms for whom the genomic sequence is unknown [20]. In the present study, we sequenced the transcriptome of diploid and autotetraploid P. fortunei using the Illumina Genome Analyzer IIx (GAIIx), an NGS platform, assembled and annotated the sequence data and analyzed the gene expression changes caused by polyploidization. These assembled transcriptome sequences and annotations will be useful for future functional genomic analyses.

2. Results

2.1. Illumina Paired-End Sequencing and De Novo Assembly

The two cDNA libraries, which were respectively constructed using the leaves of diploid P. fortunei (PF₂) and its autotetraploid (PF₄), were sequenced using the Illumina GAIIx sequencing platform. We obtained a total of 96.4 million 100-bp raw reads from the two libraries (51.2 million for PF₂; 45.2 million for PF₄), encompassing a total of 10.4 Gbp. After a stringent quality assessment and data filtering, about 94.8 million high-quality reads of both ploidies with a base quality score >20 were deposited in the National Center for Biotechnology Information (NCBI) Short Read Archive (accession number: SRP032166) and used to assemble the transcriptome of P. fortunei with the Trinity program [21]. The complete read dataset was assembled into 383,056 transcripts totaling 218,296,008 bp. The size of the transcripts ranged from 201 to 10,538 bp, with a mean length of 570 bp (Figure 1a). Furthermore, a total of 18,984 assembled transcripts with a complete open reading frame (ORF) and their corresponding protein sequences were predicted using TransDecoder in the Trinity package. The length of the complete transcripts with a total of 22,724,258 bp varied from 319 to 7406 bp, with an average of 1197 bp (Figure 1b).

2.2. Annotation of the Predicted Complete Transcripts

We aligned 18,984 protein sequences corresponding to the ORFs of the complete transcripts with sequences from public protein databases using the BLASTp program (E-value cut-off: 1.0 × 10⁻⁵). The results showed that 16,004 (84.3%), 10,730 (56.5%), 10,100 (53.2%), 4697 (24.7%) and 5076 (26.7%) transcripts had homologous sequences in the NCBI non-redundant (nr) (ftp://ftp.ncbi.nih.gov/blast/db/FASTA/nr.gz), UniProtKB/Swiss-Prot ( http://www.uniprot.org/), Eukaryotic Orthologous Groups (KOG) ( http://www.ncbi.nlm.nih.gov/COG/), Pfam ( http://pfam.sanger.ac.uk/) and Kyoto Encyclopedia of Genes and Genomes (KEGG) ( http://www.genome.jp/kegg/) databases, respectively. For the nr annotations, most transcripts had significant similarity to sequences from Vitis vinifera (6976; 43.6%), Ricinus communis (2287; 14.3%) and Populus trichocarpa (2118; 13.2%) (Figure 2).

2.3. Functional Classification Using GO, KOG and KEGG

First, the proteins corresponding to complete transcripts were annotated using the Pfam database. Using the gene ontology (GO) terms associated with the Pfam annotations, we classified 4697 transcripts into 33 functional groups under three main divisions: biological processes, cellular components and molecular functions (Figure S1). In the molecular function category, a significant percentage of transcripts were assigned to “binding” (2332; 51.7%) and “catalytic activity” (1784; 39.6%). In the cellular components category, a high percentage of transcripts were assigned to “cell” (566; 26.8%), whereas many transcripts were assigned to “metabolic processes” (1950; 42.4%) and “cellular processes” (1574; 34.2%) for the functional class biological processes.

After alignment to the KOG database, the functions of 10,100 transcripts were assigned to 24 categories (Figure S2). “Posttranslational modification, protein turnover and chaperones” represented the largest group (1097; 10.8%). However, categories with no concrete assignment, such as “function unknown” (598; 5.9%) and “general function prediction only” (2072; 20.5%) accounted for a large fraction of the transcripts.

To reconstruct the metabolic pathways in P. fortunei, 5076 transcripts having enzyme commission numbers were assigned to 235 KEGG pathways (Table S1). In terms of metabolism, the greatest numbers of transcripts were matched to “metabolic pathways” (1413; 27.8%), “biosynthesis of secondary metabolites” (735; 14.5%) and “microbial metabolism in diverse environments” (394; 7.8%). In the KEGG database, the top five pathways, including the most transcripts, were “RNA transport” (185; 3.6%), “protein processing in endoplasmic reticulum” (159; 3.1%), “spliceosome” (157; 3.1%), “glycolysis/gluconeogenesis” (156; 3.1%) and “starch and sucrose metabolism” (148; 2.9%).

2.4. Analysis of Differentially Expressed Transcripts between Diploid and Autotetraploid P. fortunei

A total of 1158 out of 18,984 (6.09%) complete transcripts were significantly differentially expressed between diploid and autotetraploid P. fortunei. Six hundred fifty eight were upregulated and 500 were downregulated in autotetraploid P. fortunei when compared with the diploid sample. For upregulated transcripts, differences ranged between 2.17- and 10.65-fold; for downregulated transcripts, differences ranged between 2.59–10.89-fold. Four hundred and eighty-three transcripts were only detected in the autotetraploid sample, and three hundred and seventy-eight transcripts were only detected in the diploid sample. A total of 983, 624, 317 and 317 transcripts were annotated in the NCBI nr, KOG, Pfam and KEGG databases, respectively.

2.5. Differentially Expressed Transcripts Related to Energy Metabolism

We mapped differentially expressed transcripts (DETs) to terms in the KEGG database and compared them with the whole transcriptome, with a focus on finding genes involved in metabolic pathways that were significantly enriched. Up to 16 KEGG pathways were significantly enriched (Table 1), with “pyruvate metabolism” (map00620), “carbon fixation in photosynthetic organisms” (map00710) and “oxidative phosphorylation” (map00190) as the top three pathways. Notably, these three pathways, “sulfur metabolism” (map00920) and “photosynthesis-antenna proteins” (map00196) are parts of energy metabolism. In the pathway “oxidative phosphorylation”, four transcripts corresponding to four V-type (vacuolar or vesicular proton pump) H⁺-transporting ATPase subunits (K02155, K02147, K02154 and K02145) were upregulated. Seven upregulated and two downregulated transcripts corresponding to five enzymes (K00025, K01006, K00873, K00029 and K01595) were involved in the pathway “carbon fixation in photosynthetic organisms”; while these five enzymes also play roles in the pathway “pyruvate metabolism” belonging to carbohydrate metabolism (Table 2). Two transcripts, m.50116 and m.50118, also were involved in the pathway “carbon fixation pathways in prokaryotes” (map00720).

2.6. Transcriptomic Changes Related to Genetic Information Storage and Processing

In the KOG database, one hundred and thirty-five DETs were classified to the main category “information storage and processing”, represented by five functional classes (Figure S3 and Table S2). The category containing the most number of DETs (49) was “RNA processing and modification”, including splicing factor (10), RNA helicase (12), RNA methylase (three), the subunit of mRNA cleavage and polyadenylation factor (four). Thirty-seven DETs were assigned to the category “Translation, ribosomal structure and biogenesis”; 14 were upregulated, and 23 were downregulated. Fifteen differentially expressed transcription factors (three upregulated GATA transcription factors) and two coactivators were included in the category “transcription”. Twelve upregulated and five downregulated DETs, such as five-fold upregulated mismatch repair ATPase MSH6 (m.22798) and four-fold exonuclease HKE1/RAT1 (m.56286), were divided into the category “replication, recombination and repair”. Eight DETs (six upregulated and two downregulated) were allocated to the category “chromatin structure and dynamics”, for example, the subunit CPS60/ASH2/BRE2 of the histone H3 (Lys4) methyltransferase complex and a component, SWI2, of chromatin remodeling complex SWI/SNF. These results suggested the transmission pipeline of genetic information might change during the shift from di- to tetra-ploid.

2.7. Verification of DETs by Quantitative Real-Time PCR

Twenty-two DETs were selected for quantitative real-time PCR (qRT-PCR) verification with specific primers (Tables 2, 3 and S3). Twelve transcripts, including two (m.8309 and m.32221) that were downregulated in PT4 vs. PT2, expressed at a higher level, and seven transcripts expressed at a lower level in autotetraploid plants than that in diploid plants; whereas there was almost no difference in the expression of the other three transcripts in diploid and autotetraploid P. fortunei (Figure 3). Twelve upregulated transcripts indicated that the energy and carbohydrate metabolism level of autotetraploid P. fortunei was probably higher than its diploid progenitor. Eight upregulated transcripts related to carbon fixation in autotetraploid plants were confirmed, which help us to understand our previous report that the wood density and fiber length of autotetraploid P. fortunei increased compared with its diploid progenitor [22]. Seven downregulated transcripts confirmed the variation of chromatin remodeling, the mRNA process and transcript regulation during the polyploidization of P. fortunei. In addition, for seventeen of twenty-two transcripts, their expression change trends (up- or down-regulation) determined by the qRT-PCR were in agreement with those predicted by the bioinformatic tool, which suggested that our transcriptome data were reliable.

3. Discussion

In the area of genomics research, NGS technology offers higher throughput and a lower cost than Sanger sequencing. The Illumina GAIIx (Illumina Inc., San Diego, CA, USA), Roche/454 Genome Sequencer (Roche Diagnostics Corp., Basel, Switzerland) and ABI SOLiD System (Life Technologies Corp., Carlsbad, NM, USA) are the three most widely used NGS platforms for genome sequencing, genome resequencing, transcriptome sequencing, miRNA expression profiling and DNA methylation analysis [23,24]. NGS technology can also be used for de novo transcriptome sequencing of non-model organisms, thereby facilitating the study of organisms with an unknown reference genome on a large scale [20]. In the present study, an Illumina GAIIx was used for de novo transcriptome sequencing of P. fortunei, because of its low cost and ability to generate large numbers of reads. About 94.8 million high-quality reads were generated and assembled de novo to 383,056 transcripts, including 18,984 transcripts with a complete ORF. Compared with the de novo transcriptome of P. tomentosa × P. fortunei in our previous work, in our present study, the number of complete transcripts was less, and the mean length of complete transcripts was shorter, which suggested that the total number of reads probably affected the assembly quality [25]. A total of 16,004 complete transcripts were successfully annotated using the NCBI nr database, suggesting that their functions were relatively conserved.

Most plant transcriptome studies have assembled sequence data from different tissues [26–28] or used mixed cDNAs from different tissues as a sequencing sample [29–31]. In this case, transcriptomic data could be acquired, but since alternative splicing exists in different tissue types [32], assembly was difficult. A few experiments have used tissue-specific transcriptomic sequencing and assembly [33,34], which can produce more accurate data than the former strategy. Tissue-specific transcriptomic data will supply a good reference set for gene expression studies, especially in non-model plants.

Full coding sequence cDNAs are useful in functional studies of genes and gene products and in genome assembly [35]. Though the prediction of coding sequence regions in eukaryotic genomes is complicated by the interruption of introns and the low proportion of protein coding sequences in the genome, previous studies in other species have produced full-length cDNA sets [36,37]. Up to now, few full-length cDNAs were available in public databases for Paulownia. We herein attempted to identify transcripts with full-length cDNA sequences from the reads using the Trinity program, with the aim of providing a reference for the future identification of coding sequences in the lab. In addition, we selected transcripts with an ORF as representatives for differentially expressed gene profiling between diploid and autotetraploid P. fortunei to decrease the inference of non-coding sequences and the occurrence of false-positive results.

The molecular basis of plant polyploids is probably correlated with genomic sequence and gene expression changes. Large-scale gene expression changes that resulted from the combination of hybridization and genome doubling were observed in allopolyploids [38]; meanwhile, there was only a small percentage of transcriptome alteration in autotetraploids [39], even no differences [40,41], compared with their diploid progenitors. A low level (6.09%) of gene expression alteration between P. fortunei diploid and autotetraploid is similar to the transcriptome data previously reported for Arabidopsis, Isatis indigotica, Eragrostis curvula and Siraitia grosvenorii [42–45]. The changes of genes related to metabolism process were significant between S. grosvenorii diploid and tetraploid [45]. In our study, differentially expressed transcripts related to energy metabolism and carbon fixation were enriched; most of them were upregulated.

Recent studies of polyploid plants have shown that genome-wide changes in gene expression may be associated with the inter-related epigenetic mechanisms (DNA methylation with histone covalent modifications and small RNAs) [46]. Salmon et al. [16] suggested that significant changes of DNA methylation patterns could explain the morphological plasticity and larger ecological amplitude of Spartina allopolyploids. Transcriptome alterations of A. thaliana autotetraploid Col-0 lines were related to DNA methylation, which worked with other DNA modifications [11]. Several siRNAs correlated with repeat sequences or transposons in A. suecica varied significantly between the two progenitor species, A. thaliana and A. arenosa [47]. In our results, the predicted functions of some DETs were connected to DNA or histone methyl transfer, RNA processing and chromosome remodeling; this indirectly indicated epigenetic mechanisms altered by polyploidization. On one hand, epigenetic changes may alter the expression of one gene; on the other hand, these changes acting on one transcription factor/repressor could alter the expression of a number of target and downstream genes without any further change of their epigenetic state, which caused the differential transmission of genetic information, as well as physiological, biochemical and phenotypic variation between diploid and tetraploid plants.

4. Experimental Section

4.1. Tissue Collection and RNA Isolation

Leaves were respectively collected from twenty (10 for PF₂; 10 for PF₄) healthy tissue culture seedlings grown for 30 days. All seedlings were incubated at 25 ± 2 °C under a 16-h/8-h light/dark photoperiod with light supplied by cool-white fluorescent lamps at an intensity of 130 μmol·m⁻²·s⁻¹. Equal amounts of the leaves from ten diploid (or autopolyploid) seedlings were mixed as one sample. Total RNA was respectively isolated from two mixed leaf samples using a Plant RNA Isolation Kit (AutoLab, Beijing, China), followed by RNA purification using an RNeasy MiniElute Cleanup Kit (Qiagen, Valencia, CA, USA), according to the manufacturer’s protocol.

4.2. cDNA Library Preparation, Sequencing and De Novo Assembly

Two paired-end libraries were constructed using a TruSeq RNA Sample Preparation Kit (Illumina, San Diego, CA, USA), according to the manufacturer’s instructions. The high-throughput sequencing was conducted using an Illumina GAIIx platform. A primary analysis of the data and base calling were performed using the software built into the Illumina instrument.

The raw image data were transformed by base calling into sequence data, which were called raw reads. Before transcriptome assembly, a stringent raw reads filtering process using the software package SolexaQA (DynamicTrim.pl, p = 0.05; LengthSort.pl, min length = 25) was employed to acquire clean reads [48]. Reads in which >10% of the bases had a quality score of Q < 20, ambiguous sequences represented as “N” and adaptor contamination were removed. We used the Trinity program (version: trinityrnaseq-r2013-02-25, the options: -seqType fq -min_contig_length 200 -group_pairs_distance 250 -min_kmer_cov 2) to de novo assemble the clean reads [21]. TransDecoder in the Trinity package (the options: -m 100 -G universal -C complete ORFs only -T 500) was used to predict the open reading frames (ORFs) of the assembled transcripts and their corresponding protein sequences.

4.3. Functional Annotation and Categorization of the Transcripts

Protein sequences corresponding to the ORFs of the complete transcripts were subjected to a similarity search against several public databases, including the NCBI non-redundant protein sequence database (nr) (ftp://ftp.ncbi.nih.gov/blast/db/FASTA/nr.gz), UniProtKB/Swiss-Prot ( http://www.uniprot.org/), Eukaryotic Orthologous Groups (KOG) ( http://www.ncbi.nlm.nih.gov/COG/), the Kyoto Encyclopedia of Genes and Genomes (KEGG) ( http://www.genome.jp/kegg/) [49] and Pfam ( http://pfam.sanger.ac.uk/), using BLASTp (NCBI, Bethesda, MD, USA) (version: 2.2.22, the options: -F F -e 1e-5 -p BLASTp) [50]. The complete transcripts aligned to the KOG database ( http://www.ncbi.nlm.nih.gov/COG/) were classified according to their possible functions. The resulting hits from the Pfam database ( http://pfam.sanger.ac.uk/) were processed by Blast2GO software (Instituto Valenciano de Investigaciones Agrarias, Moncada, Spain) to obtain gene ontology (GO) annotations of the complete transcripts [51], and then, WEGO software (Zhejiang University, Hangzhou, China) was used to perform GO functional classifications [52,53]. To summarize the pathways in P. fortunei, we mapped the annotated sequences to all pathways in the KEGG database ( http://www.genome.jp/kegg/).

4.4. Expression Abundance Analysis

To analyze the expression levels of the complete transcripts, we first used the Bowtie aligner (version: 0.12.7, the options used are in the Supplementary information) to align the reads from P. fortunei and its synthesized autotetraploid back to the assembled reference transcriptome [54]. We then used RSEM (version: 1.2.2, the options used are in the Supplementary information) built in the Trinity package to compute fragments per kilobase per million reads (FPKM) values [55] and applied RSEM-coupled EBseq (version: 1.1.5) (University of Wisconsin, Madison, WI, USA) to calculate transcript abundance differences between the two samples. The fold change for each transcript between the samples were computed as the ratio of the FPKM values. Transcripts with an absolute value of a log₂ fold change >2 and a p-value <0.01 were regarded as significantly differentially expressed transcripts.

4.5. Functional Analysis of DETs

Those DETs with a complete ORF were classified based on their KOG annotations, while the transcripts were mapped to all pathways in the KEGG database ( http://www.genome.jp/kegg/). KEGG pathway enrichment analyses for these transcripts were performed by conducting hypergeometric tests with the assembled reference transcriptome set as the background. For the enrichment analysis, all p-values were adjusted using the Bonferroni correction. A corrected p-value of <0.05 was selected as the threshold for determining the significant enrichment of the transcript sets.

4.6. Quantitative Real-Time PCR Analysis

Total RNA extracted from the leaves of diploid P. fortunei and its autotetraploids was reverse transcribed into single-stranded cDNA with an iScript cDNA Synthesis Kit (Bio-Rad, Hercules, CA, USA). The SsoFast EvaGreen Supermix (Bio-Rad, Hercules, CA, USA) was used for qRT-PCR, starting with 1 μL cDNA template in a standard 20-μL reaction. The qRT-PCR cycle was as follows: 95 °C for 2 min, 40 cycles of 95 °C for 15 s and annealing at 57 °C for 15 s. The reactions were performed on a CFX96™ Real-Time PCR Detection System (Bio-Rad, Hercules, CA, USA), according to the manufacturer’s instructions. Two independent biological replicates for each sample and three technical replicates of each biological replicate were performed. The relative expression levels were calculated using the delta-delta Ct method with normalization to the internal control 18SrRNA.

5. Conclusions

The present study investigated the transcriptome profiles of P. fortunei and its synthesized autopolyploid in an attempt to identify alterations in gene expression between them. The de novo characterization of the P. fortunei transcriptome will provide valuable information for functional genomics studies of P. fortunei, especially for the discovery of functional genes and protein expression. The detection of 1158 differentially expressed transcripts demonstrated that the gene expression changed after autopolyploidization, which would certainly facilitate further research into genetic and epigenetic mechanisms of P. fortunei polyploidization.

Supplementary Information

ijms-15-05079-s001.pdf

Acknowledgments

This work was financially supported by the National Natural Science Foundation of China (grant no. 30271082, 30571496, U1204309), by the Outstanding Talents Project of Henan Province (grant no. 122101110700) and by Science and Technology Innovation Team Project of Zhengzhou City (grant no. 121PCXTD515).

Conflicts of Interest

The authors declare no conflict of interest.

References

Caparrósa, S.; Díaza, M.J.; Arizaa, J.; Lópeza, F.; Jiménezb, L. New perspectives for Paulownia fortunei L valorisation of the autohydrolysis and pulping processes. Bioresour. Technol. 2008, 99, 741–749. [Google Scholar]
Rai, A.K.; Singh, S.P.; Luxmi, C.; Savita, G. Paulownia fortunei—A new fiber source for pulp and paper. Indian Pulp Pap. Tech. Assoc. 2000, 12, 51–56. [Google Scholar]
Ipekci, Z.; Gozukirmizi, N. Direct somatic embryogenesis and synthetic seed production from Paulownia elongata. Plant Cell Rep. 2003, 22, 16–24. [Google Scholar]
Tang, R.C.; Carpenter, S.B.; Wittwer, R.F.; Graves, D.H. Paulownia—A crop tree for wood products and reclamation of surface mined land. South J. Appl. For. 1980, 4, 19–24. [Google Scholar]
Melhuish, J.H.; Gentry, C.E.; Beckjord, P.R. Paulownia tomentosa seedling growth at differing levels of ph nitrogen and phosphorus. J. Environ. Hort. 1990, 8, 205–207. [Google Scholar]
Comai, L. The advantages and disadvantages of being polyploid. Nat. Rev. Genet. 2005, 6, 836–846. [Google Scholar]
Wood, T.E.; Takebayashi, N.; Barker, M.S.; Mayrose, I.; Greenspoon, P.B.; Rieseberg, L.H. The frequency of polyploid speciation in vascular plants. Proc. Natl. Acad. Sci. USA 2009, 106, 13875–13879. [Google Scholar]
Cifuentes, M.; Grandont, L.; Moore, G.; Chevre, A.M.; Jenczewski, E. Genetic regulation of meiosis in polyploid species: New insights into an old question. New Phytol. 2010, 186, 29–36. [Google Scholar]
Higgins, J.; Magusin, A.; Trick, M.; Fraser, F.; Bancroft, I. Use of mRNA-seq to discriminate contributions to the transcriptome from the constituent genomes of the polyploid crop species Brassica napus. BMC Genomics 2012, 13, 247. [Google Scholar]
Kaensaksiri, T.; Soontornchainaksaeng, P.; Soonthornchareonnon, N.; Prathanturarug, S. In vitro induction of polyploidy in Centella asiatica (L) Urban. Plant Cell Tiss. Organ Cult. 2011, 107, 187–194. [Google Scholar]
Yu, Z.; Haberer, G.; Matthes, M.; Rattei, T.; Mayer, K.F.; Gierl, A.; Torres-Ruiz, R.A. Impact of natural genetic variation on the transcriptome of autotetraploid Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 2010, 107, 17809–17814. [Google Scholar]
Wang, J.; Tian, L.; Madlung, A.; Lee, H.S.; Chen, M.; Lee, J.J.; Watson, B.; Kagochi, T.; Comai, L.; Chen, Z.J. Stochastic and epigenetic changes of gene expression in Arabidopsis polyploids. Genetics 2004, 167, 1961–1973. [Google Scholar]
Harper, A.L.; Trick, M.; Higgins, J.; Fraser, F.; Clissold, L.; Wells, R.; Hattori, C.; Werner, P.; Bancroft, I. Associative transcriptomics of traits in the polyploid crop species Brassica napus. Nat. Biotechnol. 2012, 30, 798–802. [Google Scholar]
Jiang, J.; Shao, Y.; Du, K.; Ran, L.; Fang, X.; Wang, Y. Use of digital gene expression to discriminate gene expression differences in early generations of resynthesized Brassica napus and its diploid progenitors. BMC Genomics 2013, 14, 72. [Google Scholar]
Hovav, R.; Udall, J.A.; Chaudhary, B.; Hovav, E.; Flagel, L.; Hu, G.; Wendel, J.F. The evolution of spinnable cotton fiber entailed prolonged development and a novel metabolism. PLoS Genet. 2008, 4, e25. [Google Scholar]
Salmon, A.; Ainouche, M.L.; Wendel, J.F. Genetic and epigenetic consequences of recent hybridization and polyploidy in Spartina (Poaceae). Mol. Ecol. 2005, 14, 1163–1175. [Google Scholar]
Buggs, R.J.; Doust, A.N.; Tate, J.A.; Koh, J.; Soltis, K.; Feltus, F.A.; Paterson, A.H.; Soltis, P.S.; Soltis, D.E. Gene loss and silencing in Tragopogon miscellus (Asteraceae): Comparison of natural and synthetic allotetraploids. Heredity (Edinb.) 2009, 103, 73–81. [Google Scholar]
Udall, J.A.; Wendel, J.F. Polyploidy and crop improvement. Crop Sci. 2006, 46, S3–S14. [Google Scholar]
Fan, G.Q.; Cao, Y.C.; Zhao, Z.L.; Yang, Z.Q. Induction of autotetraploid of Paulownia fortunei. Sci. Silv. Sin. 2007, 43, 31–35. [Google Scholar]
Collins, L.J.; Biggs, P.J.; Voelckel, C.; Joly, S. An approach to transcriptome analysis of non-model organisms using short-read sequences. Genome Inform. 2008, 21, 3–14. [Google Scholar]
Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q.; et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644–652. [Google Scholar]
Zhai, X.Q.; Zhang, X.S.; Zhao, Z.L.; Deng, M.J.; Fan, G.Q. Study on wood physical properties of tetraploid Paulownia fortunei. J. Henan Agric. Univ. 2012, 46, 651–654. [Google Scholar]
Mardis, E.R. The impact of next-generation sequencing technology on genetics. Trends Genet. 2008, 24, 133–141. [Google Scholar]
Morozova, O.; Marra, M.A. Applications of next-generation sequencing technologies in functional genomics. Genomics 2008, 92, 255–264. [Google Scholar]
Liu, R.; Dong, Y.; Fan, G.; Zhao, Z.; Deng, M.; Cao, X.; Niu, S. Discovery of genes related to witches’ broom disease in Paulownia tomentosa × Paulownia fortunei by a de novo assembled transcriptome. PLoS One 2013, 8, e80238. [Google Scholar]
Garg, R.; Patel, R.K.; Jhanwar, S.; Priya, P.; Bhattacharjee, A.; Yadav, G.; Bhatia, S.; Chattopadhyay, D.; Tyagi, A.K.; Jain, M. Gene discovery and tissue-specific transcriptome analysis in chickpea with massively parallel pyrosequencing and web resource development. Plant Physiol. 2011, 156, 1661–1678. [Google Scholar]
Wang, Y.; Zeng, X.; Iyer, N.J.; Bryant, D.W.; Mockler, T.C.; Mahalingam, R. Exploring the switchgrass transcriptome using second-generation sequencing technology. PLoS One 2012, 7, e34225. [Google Scholar]
Liu, S.; Chen, C.; Chen, G.; Cao, B.; Chen, Q.; Lei, J. RNA-sequencing tag profiling of the placenta and pericarp of pungent pepper provides robust candidates contributing to capsaicinoid biosynthesis. Plant Cell Tissue Organ. Cult. 2012, 110, 111–121. [Google Scholar]
Hsiao, Y.Y.; Chen, Y.W.; Huang, S.C.; Pan, Z.J.; Fu, C.H.; Chen, W.H.; Tsai, W.C.; Chen, H.H. Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids. BMC Genomics 2011, 12, 360. [Google Scholar]
Parchman, T.L.; Geist, K.S.; Grahnen, J.A.; Benkman, C.W.; Buerkle, C.A. Transcriptome sequencing in an ecologically important tree species: Assembly annotation and marker discovery. BMC Genomics 2010, 11, 180. [Google Scholar]
Huang, L.L.; Yang, X.; Sun, P.; Tong, W.; Hu, S.Q. The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. PLoS One 2012, 7, e38653. [Google Scholar]
Barash, Y.; Calarco, J.A.; Gao, W.; Pan, Q.; Wang, X.; Shai, O.; Blencowe, B.J.; Frey, B.J. Deciphering the splicing code. Nature 2010, 465, 53–59. [Google Scholar]
Barrero, R.A.; Chapman, B.; Yang, Y.; Moolhuijzen, P.; Keeble-Gagnere, G.; Zhang, N.; Tang, Q.; Bellgard, M.I.; Qiu, D. De novo assembly of Euphorbia fischeriana root transcriptome identifies prostratin pathway related genes. BMC Genomics 2011, 12, 600. [Google Scholar]
Zhou, Y.; Gao, F.; Liu, R.; Feng, J.; Li, H. De novo sequencing and analysis of root transcriptome using 454 pyrosequencing to discover putative genes associated with drought tolerance in Ammopiptanthus mongolicus. BMC Genomics 2012, 13, 266. [Google Scholar]
Harhay, G.P.; Sonstegard, T.S.; Keele, J.W.; Heaton, M.P.; Clawson, M.L.; Snelling, W.M.; Wiedmann, R.T.; van Tassell, C.P.; Smith, T.P. Characterization of 954 bovine full-CDS cDNA sequences. BMC Genomics 2005, 6, 166. [Google Scholar]
Chen, F.; Lee, Y.; Jiang, Y.; Wang, S.; Peatman, E.; Abernathy, J.; Liu, H.; Liu, S.; Kucuktas, H.; Ke, C.; et al. Identification and characterization of full-length cDNAs in channel catfish (Ictalurus punctatus) and blue catfish (Ictalurus furcatus). PLoS One 2010, 5, e11546. [Google Scholar]
Andreassen, R.; Lunner, S.; Hoyheim, B. Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar). BMC Genomics 2009, 10, 502. [Google Scholar]
Jackson, S.; Chen, Z.J. Genomic and expression plasticity of polyploidy. Curr. Opin. Plant Biol. 2010, 13, 153–159. [Google Scholar]
Anssour, S.; Krugel, T.; Sharbel, T.F.; Saluz, H.P.; Bonaventure, G.; Baldwin, I.T. Phenotypic genetic and genomic consequences of natural and synthetic polyploidization of Nicotiana attenuata and Nicotiana obtusifolia. Ann. Bot. 2009, 103, 1207–1217. [Google Scholar]
Stupar, R.M.; Bhaskar, P.B.; Yandell, B.S.; Rensink, W.A.; Hart, A.L.; Ouyang, S.; Veilleux, R.E.; Busse, J.S.; Erhardt, R.J.; Buell, C.R.; et al. Phenotypic and transcriptomic changes associated with potato autopolyploidization. Genetics 2007, 176, 2055–2067. [Google Scholar]
Albertin, W.; Brabant, P.; Catrice, O.; Eber, F.; Jenczewski, E.; Chevre, A.M.; Thiellement, H. Autopolyploidy in cabbage (Brassica oleracea L) does not alter significantly the proteomes of green tissues. Proteomics 2005, 5, 2131–2139. [Google Scholar]
Cervigni, G.D.; Paniego, N.; Pessino, S.; Selva, J.P.; Diaz, M.; Spangenberg, G.; Echenique, V. Gene expression in diplosporous and sexual Eragrostis curvula genotypes with differing ploidy levels. Plant Mol. Biol. 2008, 67, 11–23. [Google Scholar]
Wang, J.; Tian, L.; Lee, H.S.; Wei, N.E.; Jiang, H.; Watson, B.; Madlung, A.; Osborn, T.C.; Doerge, R.W.; Comai, L.; et al. Genomewide nonadditive gene regulation in Arabidopsis allotetraploids. Genetics 2006, 172, 507–517. [Google Scholar]
Lu, B.; Pan, X.; Zhang, L.; Huang, B.; Sun, L.; Li, B.; Yi, B.; Zheng, S.; Yu, X.; Ding, R.; et al. A genome-wide comparison of genes responsive to autopolyploidy in Isatis indigotica using Arabidopsis thaliana Affymetrix genechips. Plant Mol. Biol. Rep. 2006, 24, 197–204. [Google Scholar]
Fu, W.; Ma, X.; Tang, Q.; Mo, C. Karyotype analysis and genetic variation of a mutant in Siraitia grosvenorii. Mol. Biol. Rep. 2012, 39, 1247–1252. [Google Scholar]
Salmon, A.; Ainouche, M.L. Polyploidy and DNA methylation: New tools available. Mol. Ecol. 2010, 19, 213–215. [Google Scholar]
Ha, M.; Lu, J.; Tian, L.; Ramachandran, V.; Kasschau, K.D.; Chapman, E.J.; Carrington, J.C.; Chen, X.; Wang, X.J.; Chen, Z.J. Small RNAs serve as a genetic buffer against genomic shock in Arabidopsis interspecific hybrids and allopolyploids. Proc. Natl. Acad. Sci. USA 2009, 106, 17835–17840. [Google Scholar]
Cox, M.P.; Peterson, D.A.; Biggs, P.J. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinform. 2010, 11, 485. [Google Scholar]
Kanehisa, M.; Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28, 27–30. [Google Scholar]
Altschul, S.F.; Madden, T.L.; Schaffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar]
Conesa, A.; Gotz, S.; Garcia-Gomez, J.M.; Terol, J.; Talon, M.; Robles, M. Blast2GO: A universal tool for annotation visualization and analysis in functional genomics research. Bioinformatics 2005, 21, 3674–3676. [Google Scholar]
Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology The Gene Ontology Consortium. Nat. Genet. 2000, 25, 25–29. [Google Scholar]
Ye, J.; Fang, L.; Zheng, H.; Zhang, Y.; Chen, J.; Zhang, Z.; Wang, J.; Li, S.; Li, R.; Bolund, L.; et al. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34, W293–W297. [Google Scholar]
Langmead, B.; Trapnell, C.; Pop, M.; Salzberg, S.L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10, R25. [Google Scholar]
Li, B.; Dewey, C.N. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinforma. 2011, 12, 323. [Google Scholar]

Figure 1. Overview of P. fortunei transcriptome assembly. (a) The size distribution of the transcripts obtained from de novo assembly of high-quality clean reads; (b) the size distribution of the transcripts with a complete open reading frame (ORF).

Figure 2. Species distribution of the standard protein-protein BLAST (BLASTp) matches of P. fortunei transcripts against the non-redundant (nr) database (E-value cut-off 1.0 × 10⁻⁵).

Figure 3. Quantitative real-time PCR (qRT-PCR) analysis of differentially expressed transcripts involved in energy metabolism. PF₂, diploid P. fortunei; PF₄, autotetraploid P. fortunei. Bars represent the mean (±SD).

Table 1. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways significantly enriched for differentially expressed transcripts between diploid and tetraploid P. fortune.

**Table 1.** Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways significantly enriched for differentially expressed transcripts between diploid and tetraploid P. fortune.
Pathway entry	Pathway name	Number of DETs a	Corrected p-value
map00620	Pyruvate metabolism	20	0.008
map00710	Carbon fixation in photosynthetic organisms	17	0.011
map00190	Oxidative phosphorylation	11	0.023
map00720	Carbon fixation pathways in prokaryotes	9	0.009
map00860	Porphyrin and chlorophyll metabolism	9	0.009
map00906	Carotenoid biosynthesis	6	0.020
map00592	alpha-Linolenic acid metabolism	5	0.038
map00920	Sulfur metabolism	5	0.034
map00591	Linoleic acid metabolism	5	0.015
map00670	One carbon pool by folate	5	0.015
map00061	Fatty acid biosynthesis	4	0.021
map00590	Arachidonic acid metabolism	3	0.015
map00902	Monoterpenoid biosynthesis	3	0.012
map00196	Photosynthesis-antenna proteins	2	0.038
map00785	Lipoic acid metabolism	2	0.014
map00253	Tetracycline biosynthesis	2	0.016

^aDETs means differentially expressed transcripts.

Table 2. KEGG annotations of 14 differentially expressed transcripts involved in the top three enriched metabolism pathways.

**Table 2.** KEGG annotations of 14 differentially expressed transcripts involved in the top three enriched metabolism pathways.
Transcript ID	KEGG orthology (KO) number	KEGG descriptions	E-value	KEGG pathway no.a
m.14097	K02155	V-type H⁺-transporting ATPase 16 kDa proteolipid subunit	7.0 × 10⁻⁶⁹	map00190
m.54501	K02147	V-type H⁺-transporting ATPase subunit B	1.0 × 10⁻⁴⁵	map00190
m.32555	K02154	V-type H⁺-transporting ATPase subunit I	1.0 × 10⁻⁴⁸	map00190
m.33871	K02145	V-type H⁺-transporting ATPase subunit A	1.0 × 10⁻⁴⁸	map00190
m.30899 *	K02144	V-type H⁺-transporting ATPase 54 kD subunit	7.0 × 10⁻⁴⁸	map00190
m.8309 *	K00029	malate dehydrogenase (oxaloacetate-decarboxylating) (NADP⁺)	1.0 × 10⁻³⁴	map00620, map00710
m.32221 *	K00029	malate dehydrogenase (oxaloacetate-decarboxylating) (NADP⁺)	8.0 × 10⁻⁴³	map00620, map00710
m.28729	K00025	malate dehydrogenase	6.0 × 10⁻⁵⁴	map00620, map00710
m.37547	K01006	pyruvate, orthophosphate dikinase	6.0 × 10⁻⁴⁶	map00620, map00710
m.37548	K01006	pyruvate, orthophosphate dikinase	1.0 × 10⁻⁵⁴	map00620, map00710
m.41758	K00873	pyruvate kinase	2.0 × 10⁻²⁶	map00620, map00710
m.43095	K00873	pyruvate kinase	4.0 × 10⁻³¹	map00620, map00710
m.50116	K01595	phosphoenolpyruvate carboxylase	7.0 × 10⁻⁷⁹	map00620, map00710
m.50118	K01595	phosphoenolpyruvate carboxylase	9.0 × 10⁻⁴⁰	map00620, map00710

^*The downregulated transcript;

^amap00620, pyruvate metabolism; map00710, carbon fixation in photosynthetic organisms; map00190, oxidative phosphorylation.

Table 3. Annotations of differentially expressed transcripts involved in genetic information storage and processing.

**Table 3.** Annotations of differentially expressed transcripts involved in genetic information storage and processing.
Transcript ID	Function descriptions	E-value
m.56286	5′-3′ exonuclease HKE1/RAT1	9.0 × 10⁻¹¹
m.59998	Chromatin remodeling complex SWI/SNF, component SWI2 and related ATPases (DNA/RNA helicase superfamily)	4.0 × 10⁻²⁷
m.17815	Chromatin remodeling protein HARP/SMARCAL1, DEAD-box superfamily	8.0 × 10⁻⁷
m.48610	Polyadenylate-binding protein (RRM superfamily)	7.0 × 10⁻⁶
m.38370	Translation initiation factor 3, subunit c (eIF-3c)	3.0 × 10⁻³⁴
m.58566	mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF subunit)	1.0 × 10⁻¹¹⁰
m.24433	RNA Helicase	9.0 × 10⁻⁶
m.12316	Transcription factor containing NAC and translation elongation factor EF-Ts, N-terminal domain (TS-N) domains	3.0 × 10⁻⁸

© 2014 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Zhang, X.; Deng, M.; Fan, G. Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid. Int. J. Mol. Sci. 2014, 15, 5079-5093. https://doi.org/10.3390/ijms15035079

AMA Style

Zhang X, Deng M, Fan G. Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid. International Journal of Molecular Sciences. 2014; 15(3):5079-5093. https://doi.org/10.3390/ijms15035079

Chicago/Turabian Style

Zhang, Xiaoshen, Minjie Deng, and Guoqiang Fan. 2014. "Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid" International Journal of Molecular Sciences 15, no. 3: 5079-5093. https://doi.org/10.3390/ijms15035079

APA Style

Zhang, X., Deng, M., & Fan, G. (2014). Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid. International Journal of Molecular Sciences, 15(3), 5079-5093. https://doi.org/10.3390/ijms15035079

Article Menu

Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid

Abstract

1. Introduction

2. Results

2.1. Illumina Paired-End Sequencing and De Novo Assembly

2.2. Annotation of the Predicted Complete Transcripts

2.3. Functional Classification Using GO, KOG and KEGG

2.4. Analysis of Differentially Expressed Transcripts between Diploid and Autotetraploid P. fortunei

2.5. Differentially Expressed Transcripts Related to Energy Metabolism

2.6. Transcriptomic Changes Related to Genetic Information Storage and Processing

2.7. Verification of DETs by Quantitative Real-Time PCR

3. Discussion

4. Experimental Section

4.1. Tissue Collection and RNA Isolation

4.2. cDNA Library Preparation, Sequencing and De Novo Assembly

4.3. Functional Annotation and Categorization of the Transcripts

4.4. Expression Abundance Analysis

4.5. Functional Analysis of DETs

4.6. Quantitative Real-Time PCR Analysis

5. Conclusions

Supplementary Information

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI