Int. J. Mol. Sci. 2012, 13(8), 9893-9899; doi:10.3390/ijms13089893

Article
Development of Microsatellite Markers Using Pyrosequencing in Galium trifidum (Rubiaceae), a Rare Species in Central Europe
Monika Szczecińska 1,*, Mirosław Kwaśniewski 2, Jakub Sawicki 1, Karolina Chwiałkowska 2, Kamil Szandar 3 and Włodzimierz Pisarek 4
1
Department of Botany and Nature Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 1, Olsztyn 10-727, Poland; E-Mail: mailto:jakub.sawicki@uwm.edu.pl
2
Department of Genetics, University of Silesia, Jagiellońska 28, Katowice 40-032, Poland; E-Mails: kwasniew@us.edu.pl (M.K.); karolina.chwialkowska@us.edu.pl (K.C.)
3
Department of Genetics, University of Warmia and Mazury in Olsztyn, Oczapowskiego 1A, Olsztyn 10-719, Poland; E-Mail: kamil.szandar@uwm.edu.pl
4
Regional Directorate of Environmental Protection in Olsztyn, Dworcowa 60, Olsztyn 10-437, Poland; E-Mail: wlodzimierz.pisarek.olsztyn@rdos.gov.pl
*
Author to whom correspondence should be addressed; E-Mail: monika.szczecinska@uwm.edu.pl; Tel.: +48-89-524-5190; Fax: +48-89-523-3546.
; in revised form: 23 July 2012 / Accepted: 25 July 2012 /
Published: 8 August 2012

Abstract

: We identify a large number of microsatellites from Galium trfidum, a plant species considered rare and endangered in Central and Western Europe. Using a combination of a total enriched genomic library and small-scale 454 pyrosequencing, we determined 9755 contigs with a length of 100 to 6192 bp. Within this dataset, we identified 153 SSR motifs in 144 contigs. Here, we tested 14 microsatellite loci in 2 populations of G. trifidum. The number of alleles and expected heterozygosity were 1–8 (mean 3.2) and 0.00–0.876 (0.549 on average), respectively. The markers described in this study will be useful for evaluating genetic diversity within and between populations, and gene flow between G. trifidum populations. These markers could also be applied to investigate the biological aspects of G. trifidum, such as the population dynamics and clonal structure, and to develop effective conservation programs for the Central European populations of this species.
Keywords:
SSR markers; Galium trifidum; 454 pyrosequencing

1. Introduction

Galium trifidum (Rubiaceae) is a long-lived, perennial plant species that can be further divided into four geographically distant subspecies. Three subspecies, G. trifidum ssp. columbianum (Rydb) Hult, G. trifidum ssp. subbiflorum (Wieg.) Puff and G. trifidum ssp. halophilum (Fern. et Wieg.) Puff, grow in North America and in the Pacific islands. The fourth typical subspecies, G. trifidum, can be found in the boreal zone of Europe, Asia and America, where both climate and habitat conditions are optimal for its growth [1,2]. In Central and Western Europe, G. trifidum is a rare and endangered species. A few scattered localities of G. trifidum have been reported from the Polish Lowland (four localities in the Mazury Region), high Austrian mountains—1700 m a.s.l. (four localities in the Styrian Alps), France (one locality in the Pyrenees) and Turkey (one locality in the Pontic Mountains) [36].

In Northern Europe, G. trifidum occurs along ecotones of different types of water bodies (lakes, rivers, streams) characterized by changeable physical and chemical properties of water. G. trifidum grows there in a wide range of habitats: abundant and highly dynamic populations can be found in those providing the best growth conditions, whereas the populations reported from less optimal habitats are smaller and less vigorous. In Central Europe, G. trifidum prefers eutrophic moist lowland bogs and the shores of high-mountain lakes [3,4]. Due to their ecotone character, the habitats of G. trifidum are affected by water level fluctuations which often exert an adverse effect on the species, leading to a decline in population size or to the disappearance of entire populations.

However, studies have been conducted to investigate the ecology and population structure of G. trifidum. Long-term protection and management plans aiming at preserving G. trifidum populations should involve habitat and environmental monitoring as well as quantification of genetic diversity within and among populations. Microsatellite markers, also called simple sequence reprat (SSR) markers, are widely used in ecological studies and can also be used for investigating the genetic diversity of populations. The popularity of those markers stems from their near ubiquity, a high level of polymorphism, codominance and multi-allelic variation [7]. SSRmarkers can also be used for investigating the genetic diversity of Galium trifidum. In this study, we developed nuclear microsatellite markers for G. trifidum by using GS Junior next generation sequencing (Roche 454 Life Sciences, Branford, CT, USA). In comparison with the conventional method Sanger sequencing, pyrosequencing supports the acquisition of larger amounts of data within a shorter time.

High-throughput next generation sequencing enables the identification of even several hundred polymorphic loci in a single run [8]. To date, this technique has been successfully applied in animal, plant and bryophyte researches [915]. Fourteen microsatellite markers developed for G. trifidum are characterized in this paper. The polymorphism of SSR markers was tested in two G. trifidum populations of 19 and 20 individuals, respectively.

2. Results and Discussion

We tested PCR amplification and the level of polymorphism of the designed SSR motifs. The sequences of the SSR fragments were deposited in the GenBank (accession numbers from JX273032 to JX273045).

A single run of Galium trifidum DNA library sequencing in the GS Junior pyrosequencing system resulted in 144,275 reads with an average read length of 426 bp. Sequence assembling and mapping to the chloroplast genome of Coffea arabica allowed the alignment of 1708 reads and their contigs to the reference genome. The Galium trifidum sequences obtained in the analysis covered the chloroplast genome of C. arabica in approximately 72%, at an average depth of 2.8. The remaining reads were de novo assembled into 9755 contigs with a length of 100 to 6192 bp.

Analysis of the obtained sequences with the msatcommandersoftware identified 153 SSR motifs in 144 contigs. Di- (107) and tri-nucleotide (35) repeats dominated among the discovered microsatellite motifs. Longer repeat motifs included seven tetra-, three penta- and two hexa-nucleotide motifs.

Among identified SSR motifs we designed primers for 60 of them. The 14 microsatellite loci identified in the study showed a clear, single peak for each allele These 14 loci were subsequently used to screen 39 individuals collected from two populations of G. trifidum, one from northern Finland and one from Austria. In the studied populations, 12 loci showed polymorphism, while two loci (Gal10, Gal 12) were monomorphic (Table 1).

The number of alleles per locus ranged from one to eight, with an average of 2.3 in the Austrian population and 2.6 in the Finnish population. The expected (HE) heterozygosities ranged from 0.000 to 0.876 (0.549 on average) (Table 2). The values of coefficient HE were similar in the Austrian and Finnish populations, reaching HE = 0.438 and HE = 0.431 respectively. Significant deviations (p < 0.05) from Hardy-Weinberg equilibrium (HWE) were detected for locus Gal02 and Gal07 in the Finnish population, which suggests the presence of null alleles. Significant LD were noted between three pairs of loci: Gal02/Gal13, Gal04/Gal14 and Gal07/Gal11.

3. Experimental Section

3.1. DNA Extraction

Total genomic DNA was extracted from the leaf tissue of 20 and 19 individuals from one Finnish and one Austrian population, respectively using the DNeasy® Plant Mini Kit (Qiagen, Hilden, Germany). Stems were ground with silica beads in a MiniBead-Beater tissue disruptor for 50 s, and they were subsequently processed using the manufacturer’s protocols. DNA quantity was estimated with the Qubit fluorometer system (Invitrogen, Carlsbad, NM, USA) using the Quant-IT ds-DNA BR Assay Kit (Invitrogen).

3.2. DNA Library Preparation and Sequencing

Eight hundred nanograms of DNA was sheared by nebulization, purified with the MinElute PCR Purification Kit (Qiagen), and subsequently processed according to the GS Rapid Library Preparation Kit Method Manual (Roche/454 Life Sciences). The quality of the DNA library was assessed by gel electrophoresis in the FlashGel System (Lonza). DNA fragments were clonally amplified using the GS Junior Titanium emPCR Lib-L Kit (Roche/454 Life Sciences). Sequencing was performed using the GS Junior pyrosequencing system, according to the Sequencing Method Manual (Roche/454 Life Sciences).

Pyrosequencing data were assembled using GS Reference Mapper software (Roche/454 Life Sciences). A two- step assembly was performed. First, the obtained sequences were assembled using the chloroplast genome of Coffea arabica (GenBank: NC008535) to separate chloroplast reads from nuclear reads. The remaining reads were assembled using the GS Newbler de novo assembler (Roche/454 Life Sciences).

The obtained contigs were searched for microsatellite motifs using MSTATCOMMANDER with default settings [16]. This program was also used for primer design. To avoid designing primers for any potential SSR locus twice, the contigs containing the same motif were compared in Bioedit 7.0.5 [17].

3.3. Genotyping Test

PCR reactions were performed in 20 μL of a reaction mixture containing 40 ng genomic DNA, 1.0 μM of each primer, 1.5 mM MgCl2, 200 μM dNTP (dATP, dGTP, dCTP, dTTP), 1× PCR buffer, 1 μL BSA and 1 U Genomic RedTaq polymerase (Sigma, St. Louis, MO, USA). The temperature profile was) initial denaturation—5 min at 94 °C, followed by 35 cycles of (2) 1 min at 94 °C, 1 min at 53 °C and 1 min at 72 °C. Final elongation was performed for 7 min at 72 °C. PCR products were separated on a QIAxcel capillary electrophoresis system, using the Qiaxcel High Resolution Kit with the alignment marker 15–500 bp (Qiagen) and the DNA size marker 25–500 bp (Qiagen). Standard OM700 settings were used as the electrophoresis program [18].

Randomly selected 14 amplicons were resequenced using amplification primers. Purified PCR products were sequenced in both directions using the ABI BigDye 1.1 Terminator Cycle Kit (Applied Biosystems, Foster City, CA, USA), and were visualized using an ABI Prism 3130 Automated DNA Sequencer (Applied Biosystems).

3.4. Data Analysis

Genetic diversity measures were estimated using GenAIEx 6.41 software [19]. Deviations from the Hardy-Weinberg equilibrium (HWE) and linkage disequilibrium between loci were tested using FSTAT software version 2.9.3 [20]. Significance levels were adjusted using Bonferroni correction for multiple testing.

4. Conclusions

The 14 microsatellite loci described here will be useful for evaluating genetic diversity within and between populations, and gene flow between G. trifidum populations. These markers could also be applied to investigate the biological aspects of G. trifidum, such as the population dynamics and clonal structure, and to develop effective conservation programs for the Central European populations of this species.

Acknowledgments

This study was made possible through financial support from the Polish Ministry of Science and Education (grant N304 364438).

References

  1. Puff, C. The Galium trifidum group (Galium sect. Aparinoides, Rubiaceae). Can. J. Bot 1976, 54, 1911–1925.
  2. Puff, C. Morphologie, biologie und abgrenzung von Galium L. sect. Aparinoides (Jord.) Gren. (Rubiaceae). Beitr. Biol. Pflanzen 1975, 51, 17–40.
  3. Meusel, H.; Jäger, E.J. Vergleichende Chorologie der Zentraleurpäischen Flora. Band III; G. Fischer Verlag: Jena, Stuttgart, Germany; New York, NY, USA, 1992; pp. 198–199.
  4. Pisarek, W. The University of Gdański: Gdańsk, Poland, 1998.
  5. Pisarek, W. KaŸmierczakowa, R., Zarzycki, K., Eds.; P.A.N.: Kraków, Poland, 2001.
  6. Mrkvicka, A.Ch.; Vitek, E. Galium trifidum (Rubiaceae)-weitere fundorte in mitteleuropa und in der Turkei. Ann. Naturhist. Mus. Wien. 2008, 109, 319–322.
  7. Oliveira, E.J.; Padua, J.G.; Zucchi, M.I.; Vencovsky, R.; Carneiro Vieira, M.L. Origin, evolution and genome distribution of microsatellite data. Genet. Mol. Biol 2006, 29, 294–307.
  8. Parchman, T.L.; Geist, K.S.; Grahen, J.A.; Benkman, C.W.; Buerkle, C.A. Transcriptome sequencing in an ecologically important tree species: Assembly, annotation, and markerdiscovery. BMC Genom 2010, 11, doi:10.1186/1471-2164-11-180.
  9. Abderkim, J.; Robertson, B.; Stanton, J.A.; Gemmell, N. Fast, cost-effective development of species-specific microsatellite markers by genomic sequencing. BioTechniques 2009, 46, 185–192.
  10. Duran, C.; Appleby, N.; Edwards, D.; Batley, J. Molekular genetics markers: Discovery, applications, data storage and visualization. Curr. Bioinforma 2009, 4, 16–27.
  11. Schwarz, D.; Robertson, H.M.; Feder, J.L.; Varala, K.; Hudson, M.E.; Ragland, G.J.; Hahn, D.A.; Berlocher, S.H. Sympatric ecological speciation meets pyrosequencing: Sampling the transcriptome of the apple maggot Rhagoletis pomonella. BMC Genom 2009, 10, 633.
  12. Csencsics, D.; Brodbeck, S.; Holderegger, R. Cost-effective, species-specific microsatellite development for the endangered Dwarf Bulrush (Typha minima) using next-generation sequencing technology. J. Hered 2010, 101, 789–793.
  13. Somme, L.; Raabova, J.; Jacquemart, A.L.; Raspe, O. Development and multiplexing of microsatellite markers using pyrosequencing in the clonal plant Comarum palustre (Rosaceae). Mol. Ecol. Res 2012, 12, 91–97.
  14. Sakaguchi, S.; Uchiyama, K.; Ueno, S.; Ujino-Ihara, T.; Tsumura, Y.; Prior, L.D.; Bowman, D.M.; Crisp, M.D.; Isagi, Y. Isolation and characterization of 52 polymorphic EST-SSRmarkers for Callitris columellaris (Cupressaceae). Am. J. Bot 2011, 98, e363–e368.
  15. Sawicki, J.; Kwaśniewski, M.; Szczecińska, M.; Chwiałkowska, K.; Milewicz, M.; Plášek, V. Isolation and characterization of Simple Sequence Repeats (SSR) markers from the moss genus Orthotrichum using a small throughput pyrosequencing machine. Int. J. Mol. Sci 2012, 13, 7586–7593.
  16. Faircloth, B.C. MSATCOMMANDER: Detection of microsatellite repeat arrays and automated, locus-specific primer design. Mol. Ecol. Res 2008, 8, 92–94.
  17. Hall, T.A. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser 1999, 41, 95–98.
  18. Wang, X.; Rinehart, T.A.; Wadl, P.A.; Spiers, J.M.; Hadziabdic, D.; Windham, M.T.; Trigiano, R.N. A new electrophoresis technique to separate microsatellite alleles. Afr. J. Biotechnol 2009, 8, 2432–2436.
  19. Peakall, R.; Smouse, P.E. GenAlEx 6: Genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 2006, 6, 288–295.
  20. Goudet, J. FSTAT: A computer program to calculate F-statistics. J. Hered 1995, 86, 485–486.
Table 1. Characteristics of 14 compound microsatellite loci for Galium trifidum.

Click here to display table

Table 1. Characteristics of 14 compound microsatellite loci for Galium trifidum.
LocusRepeat motifSize range (bp)Primer sequenceTa (°C)No. of total allelesGenBank Accession No.
Gal01(GT)10336–341F-AGGAAGAATTTAATCACGCTGG
R-AGCTGAACAGGTTTGCTTG
493JX273032
Gal02(GT)8268–272F-TTCGCGTTTCCTGCAATCC
R-AAACGACGGACAATGGCTC
514JX273033
Gal03(AT)7306–337F-TTTGGCAGGATAAAGGGCG
R-AAACGGAAGAGTATAACAAAGGG
512JX273034
Gal04(AC)6280–291F-CACGACCAGCTGCATTTCC
R-AGCTTCCTAAAGAGGCCAG
512JX273035
Gal05(AT)6407–414F-ATGGCGATTAGTGGGTCGC
R-ATACGAGTCCAAAGGCAGG
533JX273036
Gal06(AAG)5423–427F-AACGGGTTGGTCAATCTCC
R-GCCCTGGCGTAGAGGTG
544JX273037
Gal07(AT)6419–428F-TATGGGCGGATCGGTTGAG
R-ACAAGCACTTTGATGAGACCG
535JX273038
Gal08(AT)6390–398F-TTTCTTGGGCGAAACAGGG
R-GGTTAATACCAAGTTCT
554JX273039
Gal09(ATC)6148–151F-GGCTACTATTCGCCGCTTC
R-AGCAAGTGAAGATTGTCATCG
533JX273040
Gal10(CCT)5206F-GACTCCGCCAAATCATCCC
R-CAGAGAGTGCCACATCAGC
531JX273041
Gal11(AT)8240–260F-CATCCAACGGCAGCATAGAC
R-TCCGATGAAGGTGTTGCATTC
535JX273042
Gal12(AC)5(TC)5235F-TCTCGTTCCCGCTCTGAAG
R-GGCAGACAGTTCATGCCTC
531JX273043
Gal13(AC)7425–442F-TCTGAACTACAGCATGTTGCC
R-TCTGAACTACAGCATGTTGCC
535JX273044
Gal14(AAT)9423–455F-CTGTACGCTGAGTAGCACAAG
R-ACACAGCTCTTATGTAAGGCAC
538JX273045
Table 2. Results of initial Simple Sequence Repeats (SSR) polymorphism analysis in two populations of G. trifidum.

Click here to display table

Table 2. Results of initial Simple Sequence Repeats (SSR) polymorphism analysis in two populations of G. trifidum.
Austria N = 19Finland N = 20


LocusNAARHEFNAARHEF
Gal0131.8100.4171.00032.3740.6891.000
Gal0232.3510.6740.63632.2230.633 *0.655
Gal0311.0000.0000.00021.8830.5331.000
Gal0421.6730.3891.00021.6240.333−0.250
Gal0521.4050.2221.00031.7370.3781.000
Gal0632.4420.7221.00021.8450.5001.000
Gal0742.7550.7920.69031.5680.289 *0.263
Gal0842.3070.611−0.67032.0890.550−0.695
Gal0921.9080.5561.00031.7370.3781.000
Gal1011.0000.0000.00011.0000.0000.000
Gal1121.4050.2221.00031.7370.3781.000
Gal1211.0000.0000.00011.0000.0000.000
Gal1342.5270.667−0.54321.8830.5331.000
Gal1452.9940.8611.00052.9480.8440.690
MEAN2.31.8980.4380.7402.61.6360.4310.633

Number of alleles (NA), allechic richness (AR), expected heterozygosites (HE) and fixation index (F). A significant deviation from Hardy-Weinberg equilibrium expectations is indicated by * (p < 0.05).

Int. J. Mol. Sci. EISSN 1422-0067 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert