This article is- freely available
- re-usable
Article
Discovery of Single Nucleotide Polymorphisms in Complex Genomes Using SGSautoSNP
Michał T. Lorenc 1 
,
Satomi Hayashi 2 
,
Jiri Stiller 3 
,
Hong Lee 1 
,
Sahana Manoli 1 
,
Pradeep Ruperao 1,4 
,
Paul Visendi 1 
,
Paul J. Berkman 3 
,
Kaitao Lai 1 
,
Jacqueline Batley 2 
and
David Edwards 1,*

1
Australian Centre for Plant Functional Genomics, School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072, Australia
2
Centre for Integrative Legume Research, School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072, Australia
3
CSIRO Plant Industry, Brisbane, QLD 4072, Australia
4
Centre of Excellence in Genomics (CEG), International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru 502324, Andhra Pradesh, India
* Author to whom correspondence should be addressed.
Received: 12 July 2012; in revised form: 9 August 2012 / Accepted: 10 August 2012 / Published: 27 August 2012
Abstract: Single nucleotide polymorphisms (SNPs) are becoming the dominant form of molecular marker for genetic and genomic analysis. The advances in second generation DNA sequencing provide opportunities to identify very large numbers of SNPs in a range of species. However, SNP identification remains a challenge for large and polyploid genomes due to their size and complexity. We have developed a pipeline for the robust identification of SNPs in large and complex genomes using Illumina second generation DNA sequence data and demonstrated this by the discovery of SNPs in the hexaploid wheat genome. We have developed a SNP discovery pipeline called SGSautoSNP (Second-Generation Sequencing AutoSNP) and applied this to discover more than 800,000 SNPs between four hexaploid wheat cultivars across chromosomes 7A, 7B and 7D. All SNPs are presented for download and viewing within a public GBrowse database. Validation suggests an accuracy of greater than 93% of SNPs represent polymorphisms between wheat cultivars and hence are valuable for detailed diversity analysis, marker assisted selection and genotyping by sequencing. The pipeline produces output in GFF3, VCF, Flapjack or Illumina Infinium design format for further genotyping diverse populations. As well as providing an unprecedented resource for wheat diversity analysis, the method establishes a foundation for high resolution SNP discovery in other large and complex genomes.
Keywords: single nucleotide polymorphisms; wheat; autoSNP; genome diversity; genotyping by sequencing; haplotype
Article Statistics
Click here to load and display the download statistics.
Notes: Multiple requests from the same IP address are counted as one view.
Cite This Article
MDPI and ACS Style
Lorenc, M.T.; Hayashi, S.; Stiller, J.; Lee, H.; Manoli, S.; Ruperao, P.; Visendi, P.; Berkman, P.J.; Lai, K.; Batley, J.; Edwards, D. Discovery of Single Nucleotide Polymorphisms in Complex Genomes Using SGSautoSNP. Biology 2012, 1, 370-382.
AMA Style
Lorenc MT, Hayashi S, Stiller J, Lee H, Manoli S, Ruperao P, Visendi P, Berkman PJ, Lai K, Batley J, Edwards D. Discovery of Single Nucleotide Polymorphisms in Complex Genomes Using SGSautoSNP. Biology. 2012; 1(2):370-382.
Chicago/Turabian Style
Lorenc, Michał T.; Hayashi, Satomi; Stiller, Jiri; Lee, Hong; Manoli, Sahana; Ruperao, Pradeep; Visendi, Paul; Berkman, Paul J.; Lai, Kaitao; Batley, Jacqueline; Edwards, David. 2012. "Discovery of Single Nucleotide Polymorphisms in Complex Genomes Using SGSautoSNP." Biology 1, no. 2: 370-382.