Pseudo-Reference-Based Assembly of Vertebrate Transcriptomes
AbstractHigh-throughput RNA sequencing (RNA-seq) provides a comprehensive picture of the transcriptome, including the identity, structure, quantity, and variability of expressed transcripts in cells, through the assembly of sequenced short RNA-seq reads. Although the reference-based approach guarantees the high quality of the resulting transcriptome, this approach is only applicable when the relevant reference genome is present. Here, we developed a pseudo-reference-based assembly (PRA) that reconstructs a transcriptome based on a linear regression function of the optimized mapping parameters and genetic distances of the closest species. Using the linear model, we reconstructed transcriptomes of four different aves, the white leg horn, turkey, duck, and zebra finch, with the Gallus gallus genome as a pseudo-reference, and of three primates, the chimpanzee, gorilla, and macaque, with the human genome as a pseudo-reference. The resulting transcriptomes show that the PRAs outperformed the de novo approach for species with within about 10% mutation rate among orthologous transcriptomes, enough to cover distantly related species as far as chicken and duck. Taken together, we suggest that the PRA method can be used as a tool for reconstructing transcriptome maps of vertebrates whose genomes have not yet been sequenced. View Full-Text
- Supplementary File 1:
Supplementary (DOCX, 104 KB)
Share & Cite This Article
Nam, K.; Jeong, H.; Nam, J.-W. Pseudo-Reference-Based Assembly of Vertebrate Transcriptomes. Genes 2016, 7, 10.
Nam K, Jeong H, Nam J-W. Pseudo-Reference-Based Assembly of Vertebrate Transcriptomes. Genes. 2016; 7(3):10.Chicago/Turabian Style
Nam, Kyoungwoo; Jeong, Heesu; Nam, Jin-Wu. 2016. "Pseudo-Reference-Based Assembly of Vertebrate Transcriptomes." Genes 7, no. 3: 10.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.