- freely available
Int. J. Mol. Sci. 2014, 15(3), 5079-5093; doi:10.3390/ijms15035079
Published: 21 March 2014
Abstract: Paulownia fortunei is an ecologically and economically important tree species that is widely used as timber and chemical pulp. Its autotetraploid, which carries a number of valuable traits, was successfully induced with colchicine. To identify differences in gene expression between P. fortunei and its synthesized autotetraploid, we performed transcriptome sequencing using an Illumina Genome Analyzer IIx (GAIIx). About 94.8 million reads were generated and assembled into 383,056 transcripts, including 18,984 transcripts with a complete open reading frame. A conducted Basic Local Alignment Search Tool (BLAST) search indicated that 16,004 complete transcripts had significant hits in the National Center for Biotechnology Information (NCBI) non-redundant database. The complete transcripts were given functional assignments using three public protein databases. One thousand one hundred fifty eight differentially expressed complete transcripts were screened through a digital abundance analysis, including transcripts involved in energy metabolism and epigenetic regulation. Finally, the expression levels of several transcripts were confirmed by quantitative real-time PCR. Our results suggested that polyploidization caused epigenetic-related changes, which subsequently resulted in gene expression variation between diploid and autotetraploid P. fortunei. This might be the main mechanism affected by the polyploidization. Our results represent an extensive survey of the P. fortunei transcriptome and will facilitate subsequent functional genomics research in P. fortunei. Moreover, the gene expression profiles of P. fortunei and its autopolyploid will provide a valuable resource for the study of polyploidization.
Paulownia is a genus native to China. The two species most often cultivated in China are Paulownia fortunei and Paulownia elongata. Paulownia is a fast-growing, short-rotation timber crop that is also valuable for the production of chemical pulp [1,2]. Additionally, its wood exhibits rot resistance, dimensional stability and a high ignition point; thus, Paulownia is widely used for making furniture, aircraft, plywood, toys and musical instruments . The tolerance of Paulownia to drought and soil extremes makes it commercially important for use in the reclamation of surface-mined land . Indeed, this genus has been suggested for the reforestation of nutrient-poor soils .
Polyploidy is the heritable condition of possessing more than two complete sets of chromosomes . Generally, polyploids are divided into autopolyploids arising by chromosome doubling of a single species and allopolyploids arising via interspecific hybridization and subsequent chromosome doubling. They play an important role in plant evolution. Recent estimates indicate that 15% of angiosperm and 31% of fern speciation events are accompanied by an increase in ploidy . Many major crops, including wheat, cotton, oat, coffee, potato and oilseed rape, are polyploids [8,9]. Polyploids have also been induced by experimental treatments, such as heat shock and colchicines . Rapid genomic and gene expression changes have been demonstrated in many synthesized and natural polyploid plants, including Arabidopsis [11,12], Brassica [13,14], Gossypium , Spartina  and Tragopogon .
Polyploidy provides genome “buffering” by increasing allelic diversity and heterosis, and it facilitates the creation of novel phenotypic variation and asexual reproduction [6,18], which may be valuable in plant breeding. With the goals of enriching the germplasm of Paulownia and increasing the desired traits, our lab successfully induced an autopolyploid of P. fortunei with the desirable wood properties using colchicine . We were unable to characterize the genetic differences between diploid and autotetraploid P. fortunei on a grand scale, due to a lack of genomic sequence data, until the development of high-throughput next-generation sequencing (NGS). This technology allowed us to perform a short read-based transcriptome analysis of non-model organisms for whom the genomic sequence is unknown . In the present study, we sequenced the transcriptome of diploid and autotetraploid P. fortunei using the Illumina Genome Analyzer IIx (GAIIx), an NGS platform, assembled and annotated the sequence data and analyzed the gene expression changes caused by polyploidization. These assembled transcriptome sequences and annotations will be useful for future functional genomic analyses.
2.1. Illumina Paired-End Sequencing and De Novo Assembly
The two cDNA libraries, which were respectively constructed using the leaves of diploid P. fortunei (PF2) and its autotetraploid (PF4), were sequenced using the Illumina GAIIx sequencing platform. We obtained a total of 96.4 million 100-bp raw reads from the two libraries (51.2 million for PF2; 45.2 million for PF4), encompassing a total of 10.4 Gbp. After a stringent quality assessment and data filtering, about 94.8 million high-quality reads of both ploidies with a base quality score >20 were deposited in the National Center for Biotechnology Information (NCBI) Short Read Archive (accession number: SRP032166) and used to assemble the transcriptome of P. fortunei with the Trinity program . The complete read dataset was assembled into 383,056 transcripts totaling 218,296,008 bp. The size of the transcripts ranged from 201 to 10,538 bp, with a mean length of 570 bp (Figure 1a). Furthermore, a total of 18,984 assembled transcripts with a complete open reading frame (ORF) and their corresponding protein sequences were predicted using TransDecoder in the Trinity package. The length of the complete transcripts with a total of 22,724,258 bp varied from 319 to 7406 bp, with an average of 1197 bp (Figure 1b).
2.2. Annotation of the Predicted Complete Transcripts
We aligned 18,984 protein sequences corresponding to the ORFs of the complete transcripts with sequences from public protein databases using the BLASTp program (E-value cut-off: 1.0 × 10−5). The results showed that 16,004 (84.3%), 10,730 (56.5%), 10,100 (53.2%), 4697 (24.7%) and 5076 (26.7%) transcripts had homologous sequences in the NCBI non-redundant (nr) (ftp://ftp.ncbi.nih.gov/blast/db/FASTA/nr.gz), UniProtKB/Swiss-Prot (http://www.uniprot.org/), Eukaryotic Orthologous Groups (KOG) (http://www.ncbi.nlm.nih.gov/COG/), Pfam (http://pfam.sanger.ac.uk/) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.genome.jp/kegg/) databases, respectively. For the nr annotations, most transcripts had significant similarity to sequences from Vitis vinifera (6976; 43.6%), Ricinus communis (2287; 14.3%) and Populus trichocarpa (2118; 13.2%) (Figure 2).
2.3. Functional Classification Using GO, KOG and KEGG
First, the proteins corresponding to complete transcripts were annotated using the Pfam database. Using the gene ontology (GO) terms associated with the Pfam annotations, we classified 4697 transcripts into 33 functional groups under three main divisions: biological processes, cellular components and molecular functions (Figure S1). In the molecular function category, a significant percentage of transcripts were assigned to “binding” (2332; 51.7%) and “catalytic activity” (1784; 39.6%). In the cellular components category, a high percentage of transcripts were assigned to “cell” (566; 26.8%), whereas many transcripts were assigned to “metabolic processes” (1950; 42.4%) and “cellular processes” (1574; 34.2%) for the functional class biological processes.
After alignment to the KOG database, the functions of 10,100 transcripts were assigned to 24 categories (Figure S2). “Posttranslational modification, protein turnover and chaperones” represented the largest group (1097; 10.8%). However, categories with no concrete assignment, such as “function unknown” (598; 5.9%) and “general function prediction only” (2072; 20.5%) accounted for a large fraction of the transcripts.
To reconstruct the metabolic pathways in P. fortunei, 5076 transcripts having enzyme commission numbers were assigned to 235 KEGG pathways (Table S1). In terms of metabolism, the greatest numbers of transcripts were matched to “metabolic pathways” (1413; 27.8%), “biosynthesis of secondary metabolites” (735; 14.5%) and “microbial metabolism in diverse environments” (394; 7.8%). In the KEGG database, the top five pathways, including the most transcripts, were “RNA transport” (185; 3.6%), “protein processing in endoplasmic reticulum” (159; 3.1%), “spliceosome” (157; 3.1%), “glycolysis/gluconeogenesis” (156; 3.1%) and “starch and sucrose metabolism” (148; 2.9%).
2.4. Analysis of Differentially Expressed Transcripts between Diploid and Autotetraploid P. fortunei
A total of 1158 out of 18,984 (6.09%) complete transcripts were significantly differentially expressed between diploid and autotetraploid P. fortunei. Six hundred fifty eight were upregulated and 500 were downregulated in autotetraploid P. fortunei when compared with the diploid sample. For upregulated transcripts, differences ranged between 2.17- and 10.65-fold; for downregulated transcripts, differences ranged between 2.59–10.89-fold. Four hundred and eighty-three transcripts were only detected in the autotetraploid sample, and three hundred and seventy-eight transcripts were only detected in the diploid sample. A total of 983, 624, 317 and 317 transcripts were annotated in the NCBI nr, KOG, Pfam and KEGG databases, respectively.
2.5. Differentially Expressed Transcripts Related to Energy Metabolism
We mapped differentially expressed transcripts (DETs) to terms in the KEGG database and compared them with the whole transcriptome, with a focus on finding genes involved in metabolic pathways that were significantly enriched. Up to 16 KEGG pathways were significantly enriched (Table 1), with “pyruvate metabolism” (map00620), “carbon fixation in photosynthetic organisms” (map00710) and “oxidative phosphorylation” (map00190) as the top three pathways. Notably, these three pathways, “sulfur metabolism” (map00920) and “photosynthesis-antenna proteins” (map00196) are parts of energy metabolism. In the pathway “oxidative phosphorylation”, four transcripts corresponding to four V-type (vacuolar or vesicular proton pump) H+-transporting ATPase subunits (K02155, K02147, K02154 and K02145) were upregulated. Seven upregulated and two downregulated transcripts corresponding to five enzymes (K00025, K01006, K00873, K00029 and K01595) were involved in the pathway “carbon fixation in photosynthetic organisms”; while these five enzymes also play roles in the pathway “pyruvate metabolism” belonging to carbohydrate metabolism (Table 2). Two transcripts, m.50116 and m.50118, also were involved in the pathway “carbon fixation pathways in prokaryotes” (map00720).
2.6. Transcriptomic Changes Related to Genetic Information Storage and Processing
In the KOG database, one hundred and thirty-five DETs were classified to the main category “information storage and processing”, represented by five functional classes (Figure S3 and Table S2). The category containing the most number of DETs (49) was “RNA processing and modification”, including splicing factor (10), RNA helicase (12), RNA methylase (three), the subunit of mRNA cleavage and polyadenylation factor (four). Thirty-seven DETs were assigned to the category “Translation, ribosomal structure and biogenesis”; 14 were upregulated, and 23 were downregulated. Fifteen differentially expressed transcription factors (three upregulated GATA transcription factors) and two coactivators were included in the category “transcription”. Twelve upregulated and five downregulated DETs, such as five-fold upregulated mismatch repair ATPase MSH6 (m.22798) and four-fold exonuclease HKE1/RAT1 (m.56286), were divided into the category “replication, recombination and repair”. Eight DETs (six upregulated and two downregulated) were allocated to the category “chromatin structure and dynamics”, for example, the subunit CPS60/ASH2/BRE2 of the histone H3 (Lys4) methyltransferase complex and a component, SWI2, of chromatin remodeling complex SWI/SNF. These results suggested the transmission pipeline of genetic information might change during the shift from di- to tetra-ploid.
2.7. Verification of DETs by Quantitative Real-Time PCR
Twenty-two DETs were selected for quantitative real-time PCR (qRT-PCR) verification with specific primers (Tables 2, 3 and S3). Twelve transcripts, including two (m.8309 and m.32221) that were downregulated in PT4 vs. PT2, expressed at a higher level, and seven transcripts expressed at a lower level in autotetraploid plants than that in diploid plants; whereas there was almost no difference in the expression of the other three transcripts in diploid and autotetraploid P. fortunei (Figure 3). Twelve upregulated transcripts indicated that the energy and carbohydrate metabolism level of autotetraploid P. fortunei was probably higher than its diploid progenitor. Eight upregulated transcripts related to carbon fixation in autotetraploid plants were confirmed, which help us to understand our previous report that the wood density and fiber length of autotetraploid P. fortunei increased compared with its diploid progenitor . Seven downregulated transcripts confirmed the variation of chromatin remodeling, the mRNA process and transcript regulation during the polyploidization of P. fortunei. In addition, for seventeen of twenty-two transcripts, their expression change trends (up- or down-regulation) determined by the qRT-PCR were in agreement with those predicted by the bioinformatic tool, which suggested that our transcriptome data were reliable.
In the area of genomics research, NGS technology offers higher throughput and a lower cost than Sanger sequencing. The Illumina GAIIx (Illumina Inc., San Diego, CA, USA), Roche/454 Genome Sequencer (Roche Diagnostics Corp., Basel, Switzerland) and ABI SOLiD System (Life Technologies Corp., Carlsbad, NM, USA) are the three most widely used NGS platforms for genome sequencing, genome resequencing, transcriptome sequencing, miRNA expression profiling and DNA methylation analysis [23,24]. NGS technology can also be used for de novo transcriptome sequencing of non-model organisms, thereby facilitating the study of organisms with an unknown reference genome on a large scale . In the present study, an Illumina GAIIx was used for de novo transcriptome sequencing of P. fortunei, because of its low cost and ability to generate large numbers of reads. About 94.8 million high-quality reads were generated and assembled de novo to 383,056 transcripts, including 18,984 transcripts with a complete ORF. Compared with the de novo transcriptome of P. tomentosa × P. fortunei in our previous work, in our present study, the number of complete transcripts was less, and the mean length of complete transcripts was shorter, which suggested that the total number of reads probably affected the assembly quality . A total of 16,004 complete transcripts were successfully annotated using the NCBI nr database, suggesting that their functions were relatively conserved.
Most plant transcriptome studies have assembled sequence data from different tissues [26–28] or used mixed cDNAs from different tissues as a sequencing sample [29–31]. In this case, transcriptomic data could be acquired, but since alternative splicing exists in different tissue types , assembly was difficult. A few experiments have used tissue-specific transcriptomic sequencing and assembly [33,34], which can produce more accurate data than the former strategy. Tissue-specific transcriptomic data will supply a good reference set for gene expression studies, especially in non-model plants.
Full coding sequence cDNAs are useful in functional studies of genes and gene products and in genome assembly . Though the prediction of coding sequence regions in eukaryotic genomes is complicated by the interruption of introns and the low proportion of protein coding sequences in the genome, previous studies in other species have produced full-length cDNA sets [36,37]. Up to now, few full-length cDNAs were available in public databases for Paulownia. We herein attempted to identify transcripts with full-length cDNA sequences from the reads using the Trinity program, with the aim of providing a reference for the future identification of coding sequences in the lab. In addition, we selected transcripts with an ORF as representatives for differentially expressed gene profiling between diploid and autotetraploid P. fortunei to decrease the inference of non-coding sequences and the occurrence of false-positive results.
The molecular basis of plant polyploids is probably correlated with genomic sequence and gene expression changes. Large-scale gene expression changes that resulted from the combination of hybridization and genome doubling were observed in allopolyploids ; meanwhile, there was only a small percentage of transcriptome alteration in autotetraploids , even no differences [40,41], compared with their diploid progenitors. A low level (6.09%) of gene expression alteration between P. fortunei diploid and autotetraploid is similar to the transcriptome data previously reported for Arabidopsis, Isatis indigotica, Eragrostis curvula and Siraitia grosvenorii [42–45]. The changes of genes related to metabolism process were significant between S. grosvenorii diploid and tetraploid . In our study, differentially expressed transcripts related to energy metabolism and carbon fixation were enriched; most of them were upregulated.
Recent studies of polyploid plants have shown that genome-wide changes in gene expression may be associated with the inter-related epigenetic mechanisms (DNA methylation with histone covalent modifications and small RNAs) . Salmon et al.  suggested that significant changes of DNA methylation patterns could explain the morphological plasticity and larger ecological amplitude of Spartina allopolyploids. Transcriptome alterations of A. thaliana autotetraploid Col-0 lines were related to DNA methylation, which worked with other DNA modifications . Several siRNAs correlated with repeat sequences or transposons in A. suecica varied significantly between the two progenitor species, A. thaliana and A. arenosa . In our results, the predicted functions of some DETs were connected to DNA or histone methyl transfer, RNA processing and chromosome remodeling; this indirectly indicated epigenetic mechanisms altered by polyploidization. On one hand, epigenetic changes may alter the expression of one gene; on the other hand, these changes acting on one transcription factor/repressor could alter the expression of a number of target and downstream genes without any further change of their epigenetic state, which caused the differential transmission of genetic information, as well as physiological, biochemical and phenotypic variation between diploid and tetraploid plants.
4. Experimental Section
4.1. Tissue Collection and RNA Isolation
Leaves were respectively collected from twenty (10 for PF2; 10 for PF4) healthy tissue culture seedlings grown for 30 days. All seedlings were incubated at 25 ± 2 °C under a 16-h/8-h light/dark photoperiod with light supplied by cool-white fluorescent lamps at an intensity of 130 μmol·m−2·s−1. Equal amounts of the leaves from ten diploid (or autopolyploid) seedlings were mixed as one sample. Total RNA was respectively isolated from two mixed leaf samples using a Plant RNA Isolation Kit (AutoLab, Beijing, China), followed by RNA purification using an RNeasy MiniElute Cleanup Kit (Qiagen, Valencia, CA, USA), according to the manufacturer’s protocol.
4.2. cDNA Library Preparation, Sequencing and De Novo Assembly
Two paired-end libraries were constructed using a TruSeq RNA Sample Preparation Kit (Illumina, San Diego, CA, USA), according to the manufacturer’s instructions. The high-throughput sequencing was conducted using an Illumina GAIIx platform. A primary analysis of the data and base calling were performed using the software built into the Illumina instrument.
The raw image data were transformed by base calling into sequence data, which were called raw reads. Before transcriptome assembly, a stringent raw reads filtering process using the software package SolexaQA (DynamicTrim.pl, p = 0.05; LengthSort.pl, min length = 25) was employed to acquire clean reads . Reads in which >10% of the bases had a quality score of Q < 20, ambiguous sequences represented as “N” and adaptor contamination were removed. We used the Trinity program (version: trinityrnaseq-r2013-02-25, the options: -seqType fq -min_contig_length 200 -group_pairs_distance 250 -min_kmer_cov 2) to de novo assemble the clean reads . TransDecoder in the Trinity package (the options: -m 100 -G universal -C complete ORFs only -T 500) was used to predict the open reading frames (ORFs) of the assembled transcripts and their corresponding protein sequences.
4.3. Functional Annotation and Categorization of the Transcripts
Protein sequences corresponding to the ORFs of the complete transcripts were subjected to a similarity search against several public databases, including the NCBI non-redundant protein sequence database (nr) (ftp://ftp.ncbi.nih.gov/blast/db/FASTA/nr.gz), UniProtKB/Swiss-Prot (http://www.uniprot.org/), Eukaryotic Orthologous Groups (KOG) (http://www.ncbi.nlm.nih.gov/COG/), the Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.genome.jp/kegg/)  and Pfam (http://pfam.sanger.ac.uk/), using BLASTp (NCBI, Bethesda, MD, USA) (version: 2.2.22, the options: -F F -e 1e-5 -p BLASTp) . The complete transcripts aligned to the KOG database (http://www.ncbi.nlm.nih.gov/COG/) were classified according to their possible functions. The resulting hits from the Pfam database (http://pfam.sanger.ac.uk/) were processed by Blast2GO software (Instituto Valenciano de Investigaciones Agrarias, Moncada, Spain) to obtain gene ontology (GO) annotations of the complete transcripts , and then, WEGO software (Zhejiang University, Hangzhou, China) was used to perform GO functional classifications [52,53]. To summarize the pathways in P. fortunei, we mapped the annotated sequences to all pathways in the KEGG database (http://www.genome.jp/kegg/).
4.4. Expression Abundance Analysis
To analyze the expression levels of the complete transcripts, we first used the Bowtie aligner (version: 0.12.7, the options used are in the Supplementary information) to align the reads from P. fortunei and its synthesized autotetraploid back to the assembled reference transcriptome . We then used RSEM (version: 1.2.2, the options used are in the Supplementary information) built in the Trinity package to compute fragments per kilobase per million reads (FPKM) values  and applied RSEM-coupled EBseq (version: 1.1.5) (University of Wisconsin, Madison, WI, USA) to calculate transcript abundance differences between the two samples. The fold change for each transcript between the samples were computed as the ratio of the FPKM values. Transcripts with an absolute value of a log2 fold change >2 and a p-value <0.01 were regarded as significantly differentially expressed transcripts.
4.5. Functional Analysis of DETs
Those DETs with a complete ORF were classified based on their KOG annotations, while the transcripts were mapped to all pathways in the KEGG database (http://www.genome.jp/kegg/). KEGG pathway enrichment analyses for these transcripts were performed by conducting hypergeometric tests with the assembled reference transcriptome set as the background. For the enrichment analysis, all p-values were adjusted using the Bonferroni correction. A corrected p-value of <0.05 was selected as the threshold for determining the significant enrichment of the transcript sets.
4.6. Quantitative Real-Time PCR Analysis
Total RNA extracted from the leaves of diploid P. fortunei and its autotetraploids was reverse transcribed into single-stranded cDNA with an iScript cDNA Synthesis Kit (Bio-Rad, Hercules, CA, USA). The SsoFast EvaGreen Supermix (Bio-Rad, Hercules, CA, USA) was used for qRT-PCR, starting with 1 μL cDNA template in a standard 20-μL reaction. The qRT-PCR cycle was as follows: 95 °C for 2 min, 40 cycles of 95 °C for 15 s and annealing at 57 °C for 15 s. The reactions were performed on a CFX96™ Real-Time PCR Detection System (Bio-Rad, Hercules, CA, USA), according to the manufacturer’s instructions. Two independent biological replicates for each sample and three technical replicates of each biological replicate were performed. The relative expression levels were calculated using the delta-delta Ct method with normalization to the internal control 18SrRNA.
The present study investigated the transcriptome profiles of P. fortunei and its synthesized autopolyploid in an attempt to identify alterations in gene expression between them. The de novo characterization of the P. fortunei transcriptome will provide valuable information for functional genomics studies of P. fortunei, especially for the discovery of functional genes and protein expression. The detection of 1158 differentially expressed transcripts demonstrated that the gene expression changed after autopolyploidization, which would certainly facilitate further research into genetic and epigenetic mechanisms of P. fortunei polyploidization.
This work was financially supported by the National Natural Science Foundation of China (grant no. 30271082, 30571496, U1204309), by the Outstanding Talents Project of Henan Province (grant no. 122101110700) and by Science and Technology Innovation Team Project of Zhengzhou City (grant no. 121PCXTD515).
Conflicts of Interest
The authors declare no conflict of interest.
- Caparrósa, S.; Díaza, M.J.; Arizaa, J.; Lópeza, F.; Jiménezb, L. New perspectives for Paulownia fortunei L valorisation of the autohydrolysis and pulping processes. Bioresour. Technol. 2008, 99, 741–749.
- Rai, A.K.; Singh, S.P.; Luxmi, C.; Savita, G. Paulownia fortunei—A new fiber source for pulp and paper. Indian Pulp Pap. Tech. Assoc. 2000, 12, 51–56.
- Ipekci, Z.; Gozukirmizi, N. Direct somatic embryogenesis and synthetic seed production from Paulownia elongata. Plant Cell Rep. 2003, 22, 16–24.
- Tang, R.C.; Carpenter, S.B.; Wittwer, R.F.; Graves, D.H. Paulownia—A crop tree for wood products and reclamation of surface mined land. South J. Appl. For. 1980, 4, 19–24.
- Melhuish, J.H.; Gentry, C.E.; Beckjord, P.R. Paulownia tomentosa seedling growth at differing levels of ph nitrogen and phosphorus. J. Environ. Hort. 1990, 8, 205–207.
- Comai, L. The advantages and disadvantages of being polyploid. Nat. Rev. Genet. 2005, 6, 836–846.
- Wood, T.E.; Takebayashi, N.; Barker, M.S.; Mayrose, I.; Greenspoon, P.B.; Rieseberg, L.H. The frequency of polyploid speciation in vascular plants. Proc. Natl. Acad. Sci. USA 2009, 106, 13875–13879.
- Cifuentes, M.; Grandont, L.; Moore, G.; Chevre, A.M.; Jenczewski, E. Genetic regulation of meiosis in polyploid species: New insights into an old question. New Phytol. 2010, 186, 29–36.
- Higgins, J.; Magusin, A.; Trick, M.; Fraser, F.; Bancroft, I. Use of mRNA-seq to discriminate contributions to the transcriptome from the constituent genomes of the polyploid crop species Brassica napus. BMC Genomics 2012, 13, 247.
- Kaensaksiri, T.; Soontornchainaksaeng, P.; Soonthornchareonnon, N.; Prathanturarug, S. In vitro induction of polyploidy in Centella asiatica (L) Urban. Plant Cell Tiss. Organ Cult. 2011, 107, 187–194.
- Yu, Z.; Haberer, G.; Matthes, M.; Rattei, T.; Mayer, K.F.; Gierl, A.; Torres-Ruiz, R.A. Impact of natural genetic variation on the transcriptome of autotetraploid Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 2010, 107, 17809–17814.
- Wang, J.; Tian, L.; Madlung, A.; Lee, H.S.; Chen, M.; Lee, J.J.; Watson, B.; Kagochi, T.; Comai, L.; Chen, Z.J. Stochastic and epigenetic changes of gene expression in Arabidopsis polyploids. Genetics 2004, 167, 1961–1973.
- Harper, A.L.; Trick, M.; Higgins, J.; Fraser, F.; Clissold, L.; Wells, R.; Hattori, C.; Werner, P.; Bancroft, I. Associative transcriptomics of traits in the polyploid crop species Brassica napus. Nat. Biotechnol. 2012, 30, 798–802.
- Jiang, J.; Shao, Y.; Du, K.; Ran, L.; Fang, X.; Wang, Y. Use of digital gene expression to discriminate gene expression differences in early generations of resynthesized Brassica napus and its diploid progenitors. BMC Genomics 2013, 14, 72.
- Hovav, R.; Udall, J.A.; Chaudhary, B.; Hovav, E.; Flagel, L.; Hu, G.; Wendel, J.F. The evolution of spinnable cotton fiber entailed prolonged development and a novel metabolism. PLoS Genet. 2008, 4, e25.
- Salmon, A.; Ainouche, M.L.; Wendel, J.F. Genetic and epigenetic consequences of recent hybridization and polyploidy in Spartina (Poaceae). Mol. Ecol. 2005, 14, 1163–1175.
- Buggs, R.J.; Doust, A.N.; Tate, J.A.; Koh, J.; Soltis, K.; Feltus, F.A.; Paterson, A.H.; Soltis, P.S.; Soltis, D.E. Gene loss and silencing in Tragopogon miscellus (Asteraceae): Comparison of natural and synthetic allotetraploids. Heredity (Edinb.) 2009, 103, 73–81.
- Udall, J.A.; Wendel, J.F. Polyploidy and crop improvement. Crop Sci. 2006, 46, S3–S14.
- Fan, G.Q.; Cao, Y.C.; Zhao, Z.L.; Yang, Z.Q. Induction of autotetraploid of Paulownia fortunei. Sci. Silv. Sin. 2007, 43, 31–35.
- Collins, L.J.; Biggs, P.J.; Voelckel, C.; Joly, S. An approach to transcriptome analysis of non-model organisms using short-read sequences. Genome Inform. 2008, 21, 3–14.
- Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q.; et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644–652.
- Zhai, X.Q.; Zhang, X.S.; Zhao, Z.L.; Deng, M.J.; Fan, G.Q. Study on wood physical properties of tetraploid Paulownia fortunei. J. Henan Agric. Univ. 2012, 46, 651–654.
- Mardis, E.R. The impact of next-generation sequencing technology on genetics. Trends Genet. 2008, 24, 133–141.
- Morozova, O.; Marra, M.A. Applications of next-generation sequencing technologies in functional genomics. Genomics 2008, 92, 255–264.
- Liu, R.; Dong, Y.; Fan, G.; Zhao, Z.; Deng, M.; Cao, X.; Niu, S. Discovery of genes related to witches’ broom disease in Paulownia tomentosa × Paulownia fortunei by a de novo assembled transcriptome. PLoS One 2013, 8, e80238.
- Garg, R.; Patel, R.K.; Jhanwar, S.; Priya, P.; Bhattacharjee, A.; Yadav, G.; Bhatia, S.; Chattopadhyay, D.; Tyagi, A.K.; Jain, M. Gene discovery and tissue-specific transcriptome analysis in chickpea with massively parallel pyrosequencing and web resource development. Plant Physiol. 2011, 156, 1661–1678.
- Wang, Y.; Zeng, X.; Iyer, N.J.; Bryant, D.W.; Mockler, T.C.; Mahalingam, R. Exploring the switchgrass transcriptome using second-generation sequencing technology. PLoS One 2012, 7, e34225.
- Liu, S.; Chen, C.; Chen, G.; Cao, B.; Chen, Q.; Lei, J. RNA-sequencing tag profiling of the placenta and pericarp of pungent pepper provides robust candidates contributing to capsaicinoid biosynthesis. Plant Cell Tissue Organ. Cult. 2012, 110, 111–121.
- Hsiao, Y.Y.; Chen, Y.W.; Huang, S.C.; Pan, Z.J.; Fu, C.H.; Chen, W.H.; Tsai, W.C.; Chen, H.H. Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids. BMC Genomics 2011, 12, 360.
- Parchman, T.L.; Geist, K.S.; Grahnen, J.A.; Benkman, C.W.; Buerkle, C.A. Transcriptome sequencing in an ecologically important tree species: Assembly annotation and marker discovery. BMC Genomics 2010, 11, 180.
- Huang, L.L.; Yang, X.; Sun, P.; Tong, W.; Hu, S.Q. The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. PLoS One 2012, 7, e38653.
- Barash, Y.; Calarco, J.A.; Gao, W.; Pan, Q.; Wang, X.; Shai, O.; Blencowe, B.J.; Frey, B.J. Deciphering the splicing code. Nature 2010, 465, 53–59.
- Barrero, R.A.; Chapman, B.; Yang, Y.; Moolhuijzen, P.; Keeble-Gagnere, G.; Zhang, N.; Tang, Q.; Bellgard, M.I.; Qiu, D. De novo assembly of Euphorbia fischeriana root transcriptome identifies prostratin pathway related genes. BMC Genomics 2011, 12, 600.
- Zhou, Y.; Gao, F.; Liu, R.; Feng, J.; Li, H. De novo sequencing and analysis of root transcriptome using 454 pyrosequencing to discover putative genes associated with drought tolerance in Ammopiptanthus mongolicus. BMC Genomics 2012, 13, 266.
- Harhay, G.P.; Sonstegard, T.S.; Keele, J.W.; Heaton, M.P.; Clawson, M.L.; Snelling, W.M.; Wiedmann, R.T.; van Tassell, C.P.; Smith, T.P. Characterization of 954 bovine full-CDS cDNA sequences. BMC Genomics 2005, 6, 166.
- Chen, F.; Lee, Y.; Jiang, Y.; Wang, S.; Peatman, E.; Abernathy, J.; Liu, H.; Liu, S.; Kucuktas, H.; Ke, C.; et al. Identification and characterization of full-length cDNAs in channel catfish (Ictalurus punctatus) and blue catfish (Ictalurus furcatus). PLoS One 2010, 5, e11546.
- Andreassen, R.; Lunner, S.; Hoyheim, B. Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar). BMC Genomics 2009, 10, 502.
- Jackson, S.; Chen, Z.J. Genomic and expression plasticity of polyploidy. Curr. Opin. Plant Biol. 2010, 13, 153–159.
- Anssour, S.; Krugel, T.; Sharbel, T.F.; Saluz, H.P.; Bonaventure, G.; Baldwin, I.T. Phenotypic genetic and genomic consequences of natural and synthetic polyploidization of Nicotiana attenuata and Nicotiana obtusifolia. Ann. Bot. 2009, 103, 1207–1217.
- Stupar, R.M.; Bhaskar, P.B.; Yandell, B.S.; Rensink, W.A.; Hart, A.L.; Ouyang, S.; Veilleux, R.E.; Busse, J.S.; Erhardt, R.J.; Buell, C.R.; et al. Phenotypic and transcriptomic changes associated with potato autopolyploidization. Genetics 2007, 176, 2055–2067.
- Albertin, W.; Brabant, P.; Catrice, O.; Eber, F.; Jenczewski, E.; Chevre, A.M.; Thiellement, H. Autopolyploidy in cabbage (Brassica oleracea L) does not alter significantly the proteomes of green tissues. Proteomics 2005, 5, 2131–2139.
- Cervigni, G.D.; Paniego, N.; Pessino, S.; Selva, J.P.; Diaz, M.; Spangenberg, G.; Echenique, V. Gene expression in diplosporous and sexual Eragrostis curvula genotypes with differing ploidy levels. Plant Mol. Biol. 2008, 67, 11–23.
- Wang, J.; Tian, L.; Lee, H.S.; Wei, N.E.; Jiang, H.; Watson, B.; Madlung, A.; Osborn, T.C.; Doerge, R.W.; Comai, L.; et al. Genomewide nonadditive gene regulation in Arabidopsis allotetraploids. Genetics 2006, 172, 507–517.
- Lu, B.; Pan, X.; Zhang, L.; Huang, B.; Sun, L.; Li, B.; Yi, B.; Zheng, S.; Yu, X.; Ding, R.; et al. A genome-wide comparison of genes responsive to autopolyploidy in Isatis indigotica using Arabidopsis thaliana Affymetrix genechips. Plant Mol. Biol. Rep. 2006, 24, 197–204.
- Fu, W.; Ma, X.; Tang, Q.; Mo, C. Karyotype analysis and genetic variation of a mutant in Siraitia grosvenorii. Mol. Biol. Rep. 2012, 39, 1247–1252.
- Salmon, A.; Ainouche, M.L. Polyploidy and DNA methylation: New tools available. Mol. Ecol. 2010, 19, 213–215.
- Ha, M.; Lu, J.; Tian, L.; Ramachandran, V.; Kasschau, K.D.; Chapman, E.J.; Carrington, J.C.; Chen, X.; Wang, X.J.; Chen, Z.J. Small RNAs serve as a genetic buffer against genomic shock in Arabidopsis interspecific hybrids and allopolyploids. Proc. Natl. Acad. Sci. USA 2009, 106, 17835–17840.
- Cox, M.P.; Peterson, D.A.; Biggs, P.J. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinform. 2010, 11, 485.
- Kanehisa, M.; Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28, 27–30.
- Altschul, S.F.; Madden, T.L.; Schaffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402.
- Conesa, A.; Gotz, S.; Garcia-Gomez, J.M.; Terol, J.; Talon, M.; Robles, M. Blast2GO: A universal tool for annotation visualization and analysis in functional genomics research. Bioinformatics 2005, 21, 3674–3676.
- Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology The Gene Ontology Consortium. Nat. Genet. 2000, 25, 25–29.
- Ye, J.; Fang, L.; Zheng, H.; Zhang, Y.; Chen, J.; Zhang, Z.; Wang, J.; Li, S.; Li, R.; Bolund, L.; et al. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34, W293–W297.
- Langmead, B.; Trapnell, C.; Pop, M.; Salzberg, S.L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10, R25.
- Li, B.; Dewey, C.N. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinforma. 2011, 12, 323.
|Table 1. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways significantly enriched for differentially expressed transcripts between diploid and tetraploid P. fortune.|
|Pathway entry||Pathway name||Number of DETs a||Corrected p-value|
|map00710||Carbon fixation in photosynthetic organisms||17||0.011|
|map00720||Carbon fixation pathways in prokaryotes||9||0.009|
|map00860||Porphyrin and chlorophyll metabolism||9||0.009|
|map00592||alpha-Linolenic acid metabolism||5||0.038|
|map00591||Linoleic acid metabolism||5||0.015|
|map00670||One carbon pool by folate||5||0.015|
|map00061||Fatty acid biosynthesis||4||0.021|
|map00590||Arachidonic acid metabolism||3||0.015|
|map00785||Lipoic acid metabolism||2||0.014|
aDETs means differentially expressed transcripts.
|Table 2. KEGG annotations of 14 differentially expressed transcripts involved in the top three enriched metabolism pathways.|
|Transcript ID||KEGG orthology (KO) number||KEGG descriptions||E-value||KEGG pathway no.a|
|m.14097||K02155||V-type H+-transporting ATPase 16 kDa proteolipid subunit||7.0 × 10−69||map00190|
|m.54501||K02147||V-type H+-transporting ATPase subunit B||1.0 × 10−45||map00190|
|m.32555||K02154||V-type H+-transporting ATPase subunit I||1.0 × 10−48||map00190|
|m.33871||K02145||V-type H+-transporting ATPase subunit A||1.0 × 10−48||map00190|
|m.30899 *||K02144||V-type H+-transporting ATPase 54 kD subunit||7.0 × 10−48||map00190|
|m.8309 *||K00029||malate dehydrogenase (oxaloacetate-decarboxylating) (NADP+)||1.0 × 10−34||map00620, map00710|
|m.32221 *||K00029||malate dehydrogenase (oxaloacetate-decarboxylating) (NADP+)||8.0 × 10−43||map00620, map00710|
|m.28729||K00025||malate dehydrogenase||6.0 × 10−54||map00620, map00710|
|m.37547||K01006||pyruvate, orthophosphate dikinase||6.0 × 10−46||map00620, map00710|
|m.37548||K01006||pyruvate, orthophosphate dikinase||1.0 × 10−54||map00620, map00710|
|m.41758||K00873||pyruvate kinase||2.0 × 10−26||map00620, map00710|
|m.43095||K00873||pyruvate kinase||4.0 × 10−31||map00620, map00710|
|m.50116||K01595||phosphoenolpyruvate carboxylase||7.0 × 10−79||map00620, map00710|
|m.50118||K01595||phosphoenolpyruvate carboxylase||9.0 × 10−40||map00620, map00710|
*The downregulated transcript;amap00620, pyruvate metabolism; map00710, carbon fixation in photosynthetic organisms; map00190, oxidative phosphorylation.
|Table 3. Annotations of differentially expressed transcripts involved in genetic information storage and processing.|
|Transcript ID||Function descriptions||E-value|
|m.56286||5′-3′ exonuclease HKE1/RAT1||9.0 × 10−11|
|m.59998||Chromatin remodeling complex SWI/SNF, component SWI2 and related ATPases (DNA/RNA helicase superfamily)||4.0 × 10−27|
|m.17815||Chromatin remodeling protein HARP/SMARCAL1, DEAD-box superfamily||8.0 × 10−7|
|m.48610||Polyadenylate-binding protein (RRM superfamily)||7.0 × 10−6|
|m.38370||Translation initiation factor 3, subunit c (eIF-3c)||3.0 × 10−34|
|m.58566||mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF subunit)||1.0 × 10−110|
|m.24433||RNA Helicase||9.0 × 10−6|
|m.12316||Transcription factor containing NAC and translation elongation factor EF-Ts, N-terminal domain (TS-N) domains||3.0 × 10−8|
© 2014 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).