- freely available
Int. J. Mol. Sci. 2012, 13(10), 13203-13211; doi:10.3390/ijms131013203
Abstract: We described and characterized 11 expressed sequence tag (EST)-derived simple sequence repeats (SSR) and seven genomic (G)-derived SSRs in Coreoperca whiteheadi Boulenger. The EST-SSRs comprised 62.2% di-nucleotide repeats, 32.2% tri-nucleotide repeats and 5.5% tetra-nucleotide repeats, whereas the majority of the G-SSRs were tri-nuleotide repeats (81.4%). The number of alleles for the 18 loci ranged from 3 to 6, with a mean of 3.8 alleles per locus. The observed (Ho) and expected heterozygosities (He) values ranged from 0.375 to 1.000, and 0.477 to 0.757, respectively. The polymorphic information content (PIC) values ranged from 0.466 to 0.706. The mean values number of alleles, Ho, He, and PIC of EST-SSRs were higher than those of the G-SSRs. Four microsatellite loci deviated significantly from Hardy-Weinberg equilibrium (HWE) after Bonferroni correction and no significant deviations in linkage disequilibrium (LD) were observed. These loci are the first to be characterized in C. whiteheadi and should be useful in the investigation of a genetic evaluation for conservation. Compared with 11 loci in C. whiteheadi, 37 potential polymorphic EST-SSRs were found in Siniperca chuatsi (Basilewsky), which will provide a valuable tool for mapping studies and molecular breeding programs in S. chuatsi.
Coreoperca whiteheadi Boulenger, one of the lower percoid fishes, is found in south China and the Red River of North Vietnam [1,2]. Due to anthropogenic disturbances, such as over-exploitation and environment pollution, the wild resource of C. whiteheadi has markedly declined in recent years . Because of its dire conservation status, the genetic conservation of C. whiteheadi is becoming essential for the sustainable management of natural resources and increasing the production of this species. Hence, robust genetic markers are needed for information on the population dynamics of this species, including genetic connectivity, in order to inform conservation efforts.
Microsatellites or simple sequence repeats (SSRs) have become a useful marker system in population genetics analysis, genetic mapping and marker-assisted selection (MAS) of many kinds of fish species because of their co-dominant nature, high allelic polymorphism, high reproducibility and transferability across species [4,5]. However, widespread use of these markers is often limited by the time and cost involved in their development . The recent development of library enrichment techniques and automated sequencing has made production of these markers simple, rapid, and cost effective [7,8]. SSR markers can be classified in genomic SSRs and EST-SSRs . However, until now, no microsatellite markers have been reported for C. whiteheadi.
In this study, we characterized 11 EST-SSRs and seven G-SSRs as a tool to support genetic conservation and breeding programs in C. whiteheadi. 37 potential polymorphic EST-SSRs were also found in a sample of 12 wild Siniperca chuatsi (Basilewsky) individuals. A total of 122 SSRs from ESTs in our transcriptome sequencing database and 49 ones from genomic sequences of enriched libraries were selected for designing microsatellite primers and tested using PCR amplification for C. whiteheadi. And all the 122 microsatellites from ESTs were also tested for S. chuatsi.
2. Results and Discussion
A microsatellite-enriched library was constructed from the genomic DNA of C. whiteheadi. Developmental steps for the construction of the enriched library and its characteristic features are summarized in Table 1. A total of 90 putative recombinant clones were picked from the enriched library, sequenced, and analyzed for presence of SSRs. Sequence analysis revealed that 30 clones (33.33%) were redundant clones. Of the remaining 60 unique clones (66.67%), 49 (81.67% of the unique clones) were found to harbor SSR sequences (GenBank Accession number: JX449105–JX449153), and 49 can be finally used for primer design. Sequence analysis of all the SSR-containing clones indicated that tri-nucleotide SSRs were found to be more frequent (81.4%) than di-nucleotide SSRs (18.6%), and no tetra/penta/hexa nucleotide SSRs was identified in the library. Among 49 tri-nucleotide SSRs, the CCT/GGA class of repeat motif was the most frequent (50% of total tri-nucleotide microsatellites), followed by the GAG/CTC class (31%).
In this study, we characterized a set of EST-SSR and G-SSR markers as a tool to support genetic conservation in C. whiteheadi. A total of 37 microsatellites from ESTs in our transcriptome and 31 from genomic sequences of enriched libraries were selected successfully for designing microsatellite primers and tested using PCR amplification. 11 EST-SSRs and seven G-SSRs were found to be polymorphic, and evaluated the performance in genetic analysis using 32 individuals randomly selected from a wild population (Table 2). The number of alleles for the 18 loci ranged from 3 to 6, with a mean of 3.8 alleles per locus. The Ho and He values ranged from 0.375 to 1.000, and 0.477 to 0.757, respectively. PIC values ranged from 0.466 to 0.706 (Table 2). From the results, we could find the mean values of number of alleles, Ho, He, and PIC of EST-SSRs were higher than those of the G-SSRs. Four microsatellite loci deviated significantly from HWE (p < 0.0029) after Bonferroni correction (Table 2), which would be due to the limited sample size, sampling strategy and null alleles . Analysis with MICROCHECKER  indicated that no significant LD was observed. Analysis of the nucleotide sequences of the 37 EST- and 31 G-SSRs showed that, the EST-SSRs comprised 62.2% di-nucleotide repeats, 32.3% tri-nucleotide repeats, and 5.5% tetra-nucleotide repeats. In contrast, the G-SSRs were composed mainly of tri-nucleotide repeats (81.4%).
Meanwhile, 37 potential polymorphic EST-SSRs were found in a sample of 12 wild S. chuatsi individuals (Table 3). Among 92 successfully amplified EST-SSRs, 37 loci (40.2% of the designed primers) showed probably polymorphism in S. chuatsi, compared with 11 loci (12.0% of the designed primers) in C. whiteheadi. The number of alleles for the 37 loci in S. chuatsi ranged from 3 to 8, with a mean of 4.3 alleles per locus. It is easy to explain the difference in cross-species amplification as the transcriptome in this study was from F1 interspecies hybrids between S. chuatsi and S. schezeri. The 11 loci in C. whiteheadi were fully contained in 37 potential polymorphic EST-SSRs in S. chuatsi. And the mean values of number of alleles of the 37 loci were higher than those of the 11 loci (Table 3). Only a small subset of the EST-SSRs in our transcriptome was tested in this study, but high levels of polymorphism in S. chuatsi indicate that the pairs of primers described here may be suitable for assessments of genetic diversity, the construction of high-density linkage map, and MAS in S. chuatsi, but still required further study due to the limited sample size.
3. Experimental Section
De novo transcriptome sequencing of F1 hybrids between S. chuatsi (♀) and S. scherzeri (♂) was performed and a total of 118,218 unigenes were identified. The processes of library preparation for transcriptome analysis and sequence assembly were as described in [12,13]. This unigene set was used for mining EST-SSR markers using the default parameters of the BatchPrimer3 v1.0 software . The primers for these SSR loci were designed using NCBI/Primer-BLAST (Available online: http://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome; accessed on 18 June 2012). In this study, a subset of 92 EST-SSR markers was screened on 12 wild S. chuatsi and 32 wild C. whiteheadi, respectively.
A microsatellite-enriched genomic library was constructed using the fast isolation by amplified fragment length polymorphism (AFLP) of sequences containing repeats (FIASCO) protocol . High quality genomic DNA was fragmented using a restriction enzyme, MseI. The fragmented DNAs were ligated to specific adapters (5′-GACGATGAGTCCTGAG-3′ and 5′-TACTCAGGACTCAT-3′). The polymerase chain reaction (PCR) products were size selected to preferentially obtain small fragments (300–1000 bp), which were hybridized to one streptavidin-biotinylated oligo simple sequence repeat complexes: (CCT/GGA)15. The enriched DNAs were cloned into the pGEM-T vector (Promega, Madison, WI, USA) and then transformed into competent DH5a strain (Promega, USA). White colonies were randomly picked from the primary transformation plates, and then the isolated Plasmid DNA was sequenced using ABI 3730 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). Identification of SSR clones was screened using the SSRHUNTER program, which was designed to find regions containing SSRs . For all types of SSRs, a minimum length criterion of 12 bp was selected, and only perfect SSR were considered. Primers flanking SSR were designed using the PRIMER PREMIER 5.0 program (PREMIER Biosoft International, Palo Alto, CA, USA). The positive clones were identified by PCR using MseI-N primers and M13 primers. Of the 90 colonies, 60 were sequenced using ABI 3730 Genetic Analyzer (Applied Biosystems), 49 of which contained microsatellites (GenBank Accession number: JX449105–JX449153). In this study, a subset of 31 G-SSR markers was screened on 32 C. whiteheadi, respectively.
Total genomic DNA was extracted from fin clips using the TIANamp Genomic DNA Kit (Tiangen, Beijing, China) following the manufacturer’s instructions. Polymerase chain reaction (PCR) conditions were optimized for each pair of primers. PCRs were performed in 25 μL reaction volumes containing 2.5 μL of 10 × PCR buffer, 1.0–3.0 mM MgCl2, 50 μM dNTPs, 0.4 μM of each primer, 1 U Taq polymerase (Takara, Dalian, China) and 50 ng genomic DNA. PCR conditions were as follows: initial denaturation at 94 °C for 3 min followed by 30 cycles at 94 °C for 30 s, the optimized annealing temperature (Table 2, 3) for 30 s, 72 °C for 30 s, and then a final extension step at 72 °C for 10 min. PCR products were separated on a 8% non-denaturing polyacrylamide gel electrophoresis and visualized by silver staining. A denatured pBR322 DNA/MspI molecular weight marker (Tiangen, Beijing, China) was used as a size standard to identify alleles.
The number of alleles (Na), the observed (Ho) and expected heterozygosities (He) were estimated using POPGENE version 1.32 . The polymorphic information content (PIC) was calculated using the Formula 1:
where n is the number of alleles, and qi, qj is the ith and jth allele frequency, respectively . Deviations from HWE and LD were tested using the online version of GENEPOP (Available online: http://genepop.curtin.edu.au/, accessed on 11 January 2011) . All results were adjusted for multiple simultaneous comparisons using a sequential Bonferroni correction . Genotyping errors due to null alleles, stutter bands, or allele dropout were analyzed using the software Micro-checker 2.2.3 (The University of Hull: Hull, UK, 2005).
In this study, 18 novel polymorphic SSR markers, 11 EST-SSRs and seven G-SSRs were successfully characterized in C. whiteheadi. These loci are the first to be characterized in C. whiteheadi and should be useful in the investigation of a genetic evaluation for conservation. Meanwhile, 37 potential polymorphic EST-SSRs were found in a sample of 12 wild S. chuatsi individuals, which will provide a valuable tool for mapping studies and molecular breeding programs of S. chuatsi.
This work was financially supported by the National Natural Science Foundation of China (31172420), the National Basic Research Program of China (2009CB118702), the Fundamental Research Funds for the Central Universities (2010PY010, 2011PY030) and Huazhong Agricultural University Scientific & Technological Self-innovation Foundation (2012SC24).
- Zhou, C.W.; Yang, Q.; Cai, D.L. On the classification and distribution of the sinipercinae fishes (family serranidae). Zool. Res 1988, 9, 113–125. [Google Scholar]
- Liu, H.Z.; Chen, Y.Y. Phylogeny of the sinipercine fishes with some taxonomic notes. Zool. Res 1994, 15, 1–12. [Google Scholar]
- Xiong, H.L.; Dai, J.J.; Chen, L. Study on anatomy structure of the digestive system of Coreoperca whiteheadi. J. Anhui Agri. Sci 2011, 39, 4015–4017. [Google Scholar]
- Hamada, H.; Petrino, M.G.; Kakunaga, T. A novel repeated element with Z-DNA-forming potential is widely found in evolutionarily diverse eukaryotic genomes. Proc. Natl. Acad. Sci. USA 1986, 79, 6465–6469. [Google Scholar]
- Saha, M.C.; Cooper, J.D.; Rouf Mian, M.A.; Chekhovskiy, K.; May, G.D. Tall fescue genomic SSR markers: Development and transferability across multiple grass species. Theor. Appl. Genet 2006, 113, 1449–1458. [Google Scholar]
- Zhao, W.G.; Lee, G.A.; Kwon, S.W.; Ma, K.H.; Lee, M.C.; Park, Y.J. Development and use of novel SSR markers for molecular genetic diversity in Italian millet (Setaria italica L.). Genes Genomics 2012, 34, 51–57. [Google Scholar]
- Zane, L.; Bargelloni, L.; Patarnello, T. Strategies for microsatellite isolation: A review. Mol. Ecol 2002, 11, 1–16. [Google Scholar]
- Li, Y.C.; Korol, A.B.; Fahima, T.; Nevo, E. Microsatellites within genes: Structure, function, and evolution. Mol. Biol. Evol 2004, 21, 991–1007. [Google Scholar]
- Lu, X.; Wang, H.X.; Dai, P.; Liu, B.Z. Characterization of EST-SSR and genomic-SSR markers in the clam, Meretrix meretrix. Conserv. Genet. Resour 2011, 3, 655–658. [Google Scholar]
- Miao, G.D.; Liu, H.W.; Ma, H.Y.; Chen, S.L.; Fan, T.J. Isolation and characterization of 54 polymorphic microsatellite loci for half-smooth Tongue Sole, Cynoglossus semilaevis. J. World Aquacult. Soc 2011, 42, 462–467. [Google Scholar]
- Van Oosterhout, C.; Hutchinson, W.F.; Wills, D.P.M.; Shipley, P. MICRO-CHECKER: Software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. Notes 2004, 4, 535–538. [Google Scholar]
- Wang, X.W.; Luan, J.B.; Li, J.M.; Bao, Y.Y.; Zhang, C.X.; Liu, S.S. De novo characterization of a whitefly transcriptome and analysis of its gene expression during development. BMC Genomics 2010, 11. [Google Scholar] [CrossRef]
- Qu, C.M.; Liang, X.F.; Huang, W.; Cao, L. Isolation and characterization of 46 novel polymorphic EST-SSR markers in two sinipercine fishes (Siniperca) and cross-species amplification. Int. J. Mol. Sci 2012, 13, 9534–9544. [Google Scholar]
- You, F.M.; Huo, N.; Gu, Y.Q.; Luo, M.C.; Ma, Y.; Hane, D.; Lazo, G.R.; Dvorak, J.; Anderson, O.D. BatchPrimer3: A high throughput web application for PCR and sequencing primer design. BMC Bioinf 2008, 9. [Google Scholar] [CrossRef]
- Li, Q.; Wan, J.M. SSRHUNTER: Development of a local searching software for SSR sites. Yichuan 2005, 27, 808–810. [Google Scholar]
- Yeh, F.C.; Boyle, T.J.B. Population genetic analysis of co-dominant and dominant markers and quantitative traits. Belg. J. Bot 1997, 129, 157. [Google Scholar]
- Botstein, D.; White, R.L.; Skolnick, M.; Davis, R.W. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet 1980, 32, 314–331. [Google Scholar]
- Raymond, M.; Rousset, F. GENEPOP (version 1.2): Population genetic software for exact tests and ecumenicism. J. Hered 1995, 86, 248–249. [Google Scholar]
- Rice, W.R. Analyzing tables of statistical tests. Evolution 1989, 43, 223–225. [Google Scholar]
|Screening step||Number and percentage|
|Redundant clones||30 (33.33%)|
|Unique clones||60 (66.67%)|
|SSR clones||49 (81.67%)|
|Locus||Gene Bank||Repeat motif||Primer sequence(5′-3′)||Na (size range, bp)||Ta (°C)||HO||He||PIC||P-HWE|
|CW399||JX503464||(CA)10N(CA)10||F: ATTTGCATTCGAGGTCATC||4 (104–136)||55||1.0000||0.7118||0.6464||0.4512|
|CW403||JX503460||(GT)5N(TG)6||F: CTGAATTTGAATTGTCTGGA||5 (180–263)||55||1.0000||0.7574||0.7063||0.0002 *|
|CW411||JX503452||(TG)12||F: CAACTAATCATCACAGCCCA||4 (86–116)||58||0.9688||0.7039||0.6383||0.0101|
|CW416||JX503447||(AC)13N(CA)8(CT)8||F: GACCAAAGATTTCAAGGAGT||5 (185–250)||55||0.5000||0.6736||0.6155||0.3571|
|CW372||JX503491||(AC)11N(AC)5||F: ATCACACACTCGCAACATTC||4 (164–225)||60||0.9688||0.7257||0.6620||0.0029|
|CW376||JX503487||(CAG)9||F: CTTGGACCTGAACCTGGA||3 (250–283)||58||0.9063||0.6066||0.5299||0.9911|
|CW460||JX503403||(GT)9N(TC)5||F: AATGGGACCACCCCTCTTGT||3 (232–268)||55||1.0000||0.6344||0.5540||0.0000 *|
|CW426||JX503437||(GT)13(GA)18||F: TTTGAATCCTCCACTTGTA||3 (139–171)||54||0.8125||0.6652||0.5813||0.9815|
|CW461||JX503402||(TC)14||F: CGCGTCCGATCTGTTTTACC||3 (178–194)||55||0.9375||0.6647||0.5799||0.0041|
|CW745||JX503388||(TG)18||F: AAGAGTGCGGAAGACGAGG||4 (103–137)||55||1.0000||0.7257||0.6632||0.0120|
|CW467||JX503396||(TCA)5||F: TGGTCTCCGCACAACATCAT||5 (238–300)||55||1.0000||0.6920||0.6182||0.0000 *|
|FC124||JX449121||(TCC)6||F: CCAACGCAGCAACATTTCTA||4 (185–222)||60||0.5313||0.6344||0.5755||0.5555|
|FC133||JX449126||(TCC)9||F: GAGTCAGCAGAAGGGAACCA||4 (101–121)||62||1.0000||0.7431||0.6829||0.0034|
|FC137||JX449130||(TG)20||F: CCCTCTCTGTTGTCTTGGCT||6 (65–108)||62||0.6563||0.6205||0.5841||0.2640|
|FC139||JX449132||(AGG)5||F: GTGCTCCAACCAGACAGAGG||4 (74–92)||60||0.9063||0.6141||0.5422||0.0003 *|
|FC154||JX449140||(CCT)8||F: AGCCACGACTGTTTAGATGA||3 (197–241)||60||0.3750||0.6146||0.5242||0.0421|
|FC161||JX449144||(GGA)4||F: CAGGAGATGAGGGTGACA||3 (308–472)||60||0.4250||0.4766||0.4658||0.0261|
|FC163||JX449146||(GAG)6||F: GTGTTGAAGGTGTGGAGGTG||3 (187–230)||58||0.8438||0.6592||0.5743||0.0109|
Na, number of alleles; Ta, annealing temperature; Ho, observed heterozygosity; He, expected heterozygosity; PIC, polymorphic information content;*indicates significant deviation from HWE after Bonferroni correction.
|Locus||Repeat motif||Primer sequence (5′-3′)||Expected size (bp)||Ta (°C)||Na||Gene Bank|
For each locus the information of number of alleles, in the left refers to S. chuatsi and the right refers to C. whiteheadi. Ta corresponds to annealing temperature; Na is number of alleles; no polymorphism for each locus is denoted by “--”.
© 2012 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).