Next Article in Journal
Phenolic-Rich Extracts of Galenia africana and Tulbaghia violacea Accelerate Keratinocyte Migration and Mitigate Oxidative Stress to Enhance Wound Healing
Previous Article in Journal
Cannabidiol’s Antioxidant Properties in Skin Care Products and Legislative Regulations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Development and Characterization of 169 gSSR Markers in the Invasive Plant Xanthium strumarium L.

College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110065, China
*
Authors to whom correspondence should be addressed.
These authors contribute equally to this work.
Plants 2025, 14(22), 3522; https://doi.org/10.3390/plants14223522
Submission received: 29 September 2025 / Revised: 10 November 2025 / Accepted: 14 November 2025 / Published: 18 November 2025
(This article belongs to the Section Plant Molecular Biology)

Abstract

Xanthium strumarium L. is a plant species native to North America; however, it has become a serious invasive threat in northern China due to its great environmental adaptability in the colonized regions. Therefore, elucidating its genetic traits is crucial to understanding its adaptive success. Simple sequence repeats (SSRs) comprise 1–6 nucleotides within plant genomes, which are available for evaluating the level of plant genetic diversity. However, the comprehensive analysis of high-coverage SSR markers in Xanthium is limited. This study identified 450,847 SSR loci in the X. strumarium genome. The number of SSR loci decreased with increasing SSR length within the range of 10–100 bp. Dinucleotide repeats constituted the majority (49.81%), totaling 221,154, with AT/TA motifs being the most frequent (66.62%). We developed 169 gSSR markers covering all X. strumarium chromosomes, with 5–15 markers per chromosome. Moreover, the number of different alleles (Na), number of effective alleles (Ne), Shannon’s information index (I), observed heterozygosity (Ho), expected heterozygosity (He), and polymorphism information content (PIC) were varied from 1.2 to 3.3, 1.077 to 2.385, 0.087 to 0.903, 0 to 1, 0.056 to 0.558, and 0.161 to 0.853, respectively. This marks the first systematic development of high-coverage SSR markers in the genus Xanthium, which increases the number of available SSR markers and reveals the molecular foundation of adaptation to invasion.

1. Introduction

Xanthium strumarium L. is native to North America but has become one of the most successful invasive species in many parts of the world, including northern China [1]. X. strumarium has key physiological differences to its native Chinese counterpart, Xanthium sibiricum L., which enable it to acquire nitrates at a substantially higher rate, allowing for higher growth plasticity [2,3,4,5]. Additionally, this enhanced ability of X. strumarium influences local legume plants to rely more heavily on nitrate nitrogen source [6]. This superior ability of X. strumarium to uptake and use nitrate allows it to outcompete native species for resources and available growth space. These findings provide a molecular explanation for the resource utilization hypothesis [3,5]. Research into its intricate underlying mechanism, beyond nitrogen resource utilization, remains largely unexplored. Furthermore, the colonization of northern China by the invasive species of X. strumarium has complicated traditional taxonomic classifications, making the agreed number of Xanthium species in China controversial [7,8]. Beyond traditional classification methods, utilizing molecular approaches to analyze the complex genetic background of Xanthium species offers a promising alternative.
Simple sequence repeats (SSRs), which consist of 1–6 nucleotides and exhibit high polymorphism, are ubiquitous in plant genomes [9,10]. SSRs have been widely used in various fields, including phylogenetic analysis, genetic diversity studies, marker-assisted selection, gene identification, and germplasm characterization [1,11,12,13,14,15,16]. In recent decades, SSR technology has been increasingly utilized in both staple and cash crops, serving as a pivotal tool for genetic improvement and species conservation [13,14,15,16].
SSRs in plants are currently categorized into genomic SSRs (gSSRs) and expression sequence tag SSRs (EST-SSRs) based on the sequences from which they are derived [14,17]. gSSRs, developed from genomic DNA, cover both coding and non-coding regions. They are highly polymorphic and widely distributed across the genome. Comparatively, EST-SSRs are derived exclusively from regions of expressed genes that have clear functional annotations. Each type of SSR has its strengths and limitations: gSSRs offer comprehensive genomic coverage and strong versatility, particularly suitable for wild or non-model species, but at a higher cost [14,18,19]. EST-SSRs can be linked to functional genes but rely on transcriptome data for development. They are less polymorphic but are comparatively more cost-effective [16,20].
Previous studies have developed eight pairs of inter-simple sequence repeat (ISSR) markers to assess genetic diversity and differentiation analyses in X. italicum Moretti [1]. Raman et al. [8] assembled the chloroplast genome of X. spinosum L. and identified 701 chloroplast SSRs. The quantity of these SSRs was relatively low, indicating that they were in an early stage of development and validation. In this study, we systematically developed and validated 169 gSSR markers using data from our unpublished sources on X. strumarium. These gSSRs provide full chromosomal coverage, overcoming limitations of organelle-specific or low-density markers. Moreover, developing an SSR marker system for the invasive species of X. strumarium holds dual significance. Firstly, identifying and understanding the genetic background and molecular bases of X. strumarium’s success help researchers uncover general mechanisms of invasion adaptation in plants. This is crucial for developing management and prevention strategies for invasive species. Secondly, compared to its native congener X. sibiricum, X. strumarium exhibits specific invasive traits, notably an enhanced capacity for nitrate uptake [2,3,4,5]. An SSR marker system can help identify the genes that contribute to these physiological differences. Our objectives in this study included (1) systematically identifying genome-wide SSR loci in X. strumarium; (2) developing and validating a set of SSR markers covering all chromosomes; and (3) evaluating their utility in genetic diversity analysis and molecular classification within X. strumarium, with particular emphasis on applications in invasion genetics.

2. Results

2.1. Analysis of SSR Repetition Frequency and Length

A total of 450,847 SSRs were isolated in the genome of X. strumarium. These SSRs had an average distance of 6.91 kb and an occurrence frequency of 0.24%. Chromosome 0 harbored the highest number of SSRs (33,645, accounting for 7.46% of the total), whereas Chromosome 9 had the lowest number (18,346, accounting for 4.07%) (Table S1 and Figure S1).
The average length of the SSR motifs was 21.32 bp, with lengths ranging from 12 to 2994 bp. Almost half of the SSRs (49.11%), a total of 221,425, were found to have a length of within 20 bp (Figure 1). The least abundant SSRs (0.06%) were those between 91 and 100 bp in length, totaling only 288. SSRs longer than 100 bp represented 21.53% of the overall total, yet their distribution varied significantly across different lengths. Generally, the number of SSR loci decreased with increasing SSR length within the 10–100 bp range (Figure 1 and Table 1).

2.2. Characteristics of SSRs Predicted in the Genome of X. strumarium

We further analyzed the predicted SSR characteristics in the X. strumarium genome. Among the various types, dinucleotide repeats exhibited the smallest average distribution distance of 10.17 kb, while hexanucleotide repeats had the largest distance of 442.90 kb. Dinucleotide repeats represented the highest proportion, accounting for 49.81% of all repeat types, with trinucleotide repeats following at 42.43%. The remaining repeat types together accounted for only 7.76% (Table 2). Therefore, dinucleotide and trinucleotide repeats were the most abundant genome of X. strumarium.
Regarding SSR motif types, within dinucleotide repeats, the AT/TA motif was the most frequent (66.62%), followed by AC/TG (12.15%) and GT/TC (8.22%), showing a relatively uneven distribution (Figure 2A). For trinucleotide repeats, AAT/TTA was the most prevalent motif (15.29%), followed by TAT/ATT (10.99%) and TAA/ATA (9.71%) (Figure 2B). Among tetranucleotide repeats, AAAT/TTTA was the most frequent (19.30%), followed by ATAC/TTAT (10.76%) and ATTT/TATT (8.27%) (Figure 2C). In pentanucleotide and hexanucleotide repeats, TTATA/ATATA (17.01%) and TGGTTA/AAGAAT (3.5%) were the most abundant, respectively (Figure 2D,E). Overall, the AT/TA motif within dinucleotide repeats was the most prevalent in X. strumarium, and this motif became the subject of further analysis and validation.

2.3. Identification and Characterization of 169 gSSR Markers with Polymorphism Across 18 Chromosomes

To further substantiate the predicted SSRs in the X. strumarium genome, we selected 100 SSR loci from each chromosome for verification (a total of 1800 loci). A total of 169 loci were isolated across 18 chromosomes, for which corresponding SSR markers were developed (Figure S2 and Table S2). The majority of chromosomes (excluding chromosomes 1, 2, and 3) harbored more than 8 SSR loci, with a range of 5 to 15 per chromosome (Figure 3A). The most abundant motifs found in the validated SSR sequences were AT and TA. The number of AT and TA motifs counted was 44 and 26, respectively, while other motifs were found in numbers less than 19 (Figure 3B and Table S3). Dinucleotide repeats were the most common among all repeat types, amounting to 161, followed by trinucleotide and tetranucleotide repeats, which had only 7 and 1, respectively (Figure 3B and Table S3), which was consistent with our previous predictions (Table 2 and Figure 2). These screened SSR lengths ranged from 12 bp to 54 bp, SSR motif repetition numbers were between 4 and 27, and overall SSR loci lengths were between 80 bp and 214 bp (Table S2).

2.4. gSSR Marker Assay and Their Informativeness

To further validate the polymorphisms associated with 169 gSSR markers, we randomly selected one pair of SSR primers from each chromosome to examine 160 X. strumarium species and 40 X. sibiricum species. All 18 primer pairs were successfully amplified. Table 3 lists the genetic diversity indices calculated from the 18 SSR markers for the 200 species. The number of different alleles (Na) ranged from 1.2 to 3.3, with an average value of 2.006. The number of effective alleles (Ne) ranged from 1.077 to 2.385, averaging 1.730. Shannon’s information index (I) ranged from 0.087 to 0.903, with an average value of 0.567. Observed heterozygosity (Ho) ranged from 0 to 1, averaging 0.672. Expected heterozygosity (He) ranged from 0.056 to 0.558, averaging 0.377. Polymorphic information content (PIC) ranged from 0.161 to 0.853, averaging 0.620. These results demonstrate that the 18 SSR markers can be used to effectively evaluate polymorphisms within the X. strumarium and X. sibiricum species.

3. Discussion

Previous studies have shown that SSR repeat types primarily include mononucleotide, dinucleotide, and trinucleotide repeats, and their relative abundance varies significantly among different species and even within different genomic regions of the same species [11,12,17,19,21,22]. An analysis of X. strumarium identified a total of 164,154 dinucleotide repeats, accounting for 49.81% of the total (Table 2). From our findings, dinucleotide repeats were the most predominant type in X. strumarium, which is consistent with the reports on modern sugarcane cultivars, Brassica spp., and Weigela cultivars [11,12,19]. This has also been observed in other Asteraceae species such as Chrysanthemum sensu lato [10,14], but differs from findings from Chimonanthus praecox L., Elymus sibiricus L., and Elymus breviaristatus (Keng) Keng f. [17,21,22]. Among these dinucleotide repeats, TA/AT motif repeats were the most common, accounting for 66.62% (Figure 2A), which aligns with observations made in Brassica spp. and Phaseolus vulgaris L. [19,23]. Additionally, subsequent validation of SSR motifs further confirmed this preponderance (Tables S2 and S3). AT base pairs are linked by two hydrogen bonds, while GC base pairs are bonded by three [24]. This suggests that TA/AT motif repeats might be energetically less demanding for organisms, although some species were also identified to have AG/GA and AG/CT as the primary dinucleotide repeats [11,12]. In X. strumarium, the prevalence of TA/AT motifs potentially signifies an evolutionary balance between energy efficiency and mutational stability. The reduced energy requirement for DNA replication in AT-rich regions, due to a lower number of hydrogen bonds [24,25], could confer an advantage for X. strumarium in rapidly colonizing environments. Additionally, trinucleotide repeats constitute approximately 46% in X. strumarium, with AAT/TTA identified as the most prevalent motif sequence (Figure 2B), which is different from AAG/CTT in Chrysanthemum × morifolium Ramat. and CCG/CGG in C. praecox. These differences may arise from species-specific evolutionary selection processes affecting SSR sequences [26].
SSRs within plant genomes are widely used in plant genetic analysis due to their high abundance, genetic diversity, and high variability. They are valuable for identifying genetic differences and are applied in gene mapping, identifying genetic diversity, and germplasm identification [8,15,16,17,27,28,29]. Previous studies in species such as C. × morifolium and C. praecox reported EST-SSR loci ranging from thousands to tens of thousands, with successfully developed SSR markers in the single to double digits [14,16,17]. In contrast, gSSRs provide broader coverage (both coding and non-coding regions). Non-conserved regions exhibit less conservation and greater variation across species within the same genus compared to conserved regions. This high variability makes gSSR markers more polymorphic and better for distinguishing closely related individuals or populations within a species [11,12]. In this study, 450,847 SSRs were identified in X. strumarium, covering its entire genome (Table S1 and Figure S1). Furthermore, we successfully developed 169 gSSR markers for the first time, spanning all 18 chromosomes (Figure S2 and Figure 3A, and Table S2). The polymorphic rate in both X. strumarium and X. sibiricum was approximately 9.4% (169/1800), comparable to the 9.3% found in C. praecox [17] but lower than 32.62% in 12 sugarcane materials [30].
We observed that the PIC values in our study reached up to 0.853 (Table 3), showing high genetic diversity in X. strumarium. This value is comparatively higher than in Asteraceae: invasive Ambrosia artemisiifolia L. (0.21–0.82) [31]; non-invasive C. × morifolium (0.38–0.50) [32]; and Gerbera hybrida Hort. (0.13–0.80) [33]. The substantial variation in SSR polymorphic rates and PIC values among species may have resulted from multiple factors, including species evolutionary history (phylogenetic relationship, reproductive strategies), genomic characteristics (repeat unit stability, ploidy level), and SSR screening methods (primer specificity, sample selection) [17,30,34,35]. The invasive biology of X. strumarium can be linked to its high PIC value. During long-distance colonization, invasive plants frequently retain high genetic diversity to adapt to heterogeneous environments. Furthermore, we speculate that during its expansion, X. strumarium probably encountered strong selective pressures and heterogeneous environments frequently, as shown by its high PIC value. High PIC values indicate that a genetically diverse population offers broader raw materials for natural selection, which enabled X. strumarium to rapidly adapt to abiotic stresses (e.g., low nitrogen) and biotic interactions (e.g., competition), ultimately facilitating successful invasion.
Conclusively, the development of 169 gSSR markers in this study signifies a significant advancement over earlier research on the genus Xanthium. Firstly, it achieves full chromosomal coverage in X. strumarium for the first time, overcoming the limitation of previous chloroplast genome markers in X. spinosum, which could only reflect uniparental genetic information [8]. Secondly, compared to the limited SSR markers used in earlier studies on population genetic differentiation in Xanthium [1], this marker system has the potential to significantly enhance the resolution of fine genetic variations among closely related species (for example, X. sibiricum and X. strumarium, or other Xanthium species) by utilizing high-density loci. Thirdly, it provides key technical support for the analysis of population genetic structure, inference of invasion routes, and the genetic mapping of invasion-related traits (for example, rapid growth and resource competition) of the invasive species of X. strumarium. These markers can facilitate future comparisons of genetic differences between invasive and native populations and help identify gene loci linked to invasion adaptability.

4. Materials and Methods

4.1. Plant Materials and gDNA Extraction

X. strumarium was collected from Kangping, Liaoning, China (123°17′41.92″ E, 42°42′57.25″ N). SSR markers were developed using both X. strumarium and the native congeneric species X. sibiricum, which were conserved by the biological invasion team at Shenyang Agricultural University. Following a 4 °C vernalization for 2–3 days, seeds of X. strumarium and X. sibiricum were then germinated in a growth chamber for 2–3 days at 28 °C with a light intensity of 200 μmol/m2·s, maintained under a 12 h light/12 h dark cycle. Total genomic DNA was extracted from germinated seedlings of both species using the CTAB method [36]. The extracted DNA was visualized via agarose gel electrophoresis to confirm normal banding patterns, and DNA concentrations were then measured using a NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). The DNA concentrations of X. strumarium and X. sibiricum were both adjusted to 80 ng/µL.

4.2. SSR Screening

We conducted an exhaustive genome-wide exploration for SSR within our unpublished genome of X. strumarium utilizing the Microsatellite Identification Tool (MISA) software (web-based version) [37]. The initial screening criteria were carefully formulated as follows: a minimal SSR repeat unit length of 2 nucleotides, a maximal length constraint of 6 nucleotides, a minimum SSR sequence length threshold of 12 nucleotides, a flanking sequence length of 100 base pairs, and a minimum inter-SSR distance of 12 base pairs [38,39,40]. The mismatch threshold was set to 0, and flanking sequences with low-complexity regions or N bases were excluded (Figure 4). All remaining parameters were standardized to their operational software-default settings to ensure consistency and rigor in our methodological approach.

4.3. Primer Design and PCR Reaction

Primers were designed for the flanking sequences of SSR loci identified in the X. strumarium genome using Primer 3.0 software [41]. Primer design followed specific parameters set as: an optimal length of 23 bp, an annealing temperature range of 56–65 °C, a GC content between 40% and 60%, a maximum 3′ stability with ΔG ≤ −9 kcal/mol, and a dimer score of ≤3.0 (Figure 4). During the process of selecting flanking regions for primer design, we ensured that the sequences were unique within the genome to avoid cross-reactivity with other genomic regions. Additionally, secondary structures that could disrupt primer binding and PCR efficiency, such as hairpins or self-dimers, were avoided. The remaining parameters were configured to the default values of the software. Selected SSR primers from each chromosome were used for PCR analysis with four samples from two X. strumarium and two X. sibiricum species. PCR reactions were performed according to the manufacturer’s instructions for EasyTaq® DNA Polymerase (AP111-01) (TransGen Biotech, Beijing, China). The PCR program consisted of an initial denaturation at 95 °C for 5 min, followed by 35 cycles of PCR (95 °C for 15 s, 58 °C for 15 s, 72 °C for 1 min), and a final extension at 72 °C for 5 min (Figure 4).

4.4. Polyacrylamide Gel Electrophoresis (PAGE) Protocol

An 8% resolving gel (acrylamide:bis-acrylamide = 29:1) containing 5× TBE buffer was prepared and polymerized with the addition of 10% APS and TEMED, followed by overlaying with a 5% stacking gel. PCR products were mixed with formamide loading buffer (10×) at a ratio of 1:10 and centrifuged at 4000 rpm. Subsequently, 2 μL of the mixture of PCR products was loaded onto the gel and electrophoresed at a constant voltage of 120 V for 3 h in 1× TBE buffer. Silver staining was performed for 20 min using a 0.1% solution, then it was immersed in a 3% NaOH solution containing 1% formaldehyde for another 20 min. The reaction was terminated by adding a small amount of boric acid [11,42,43]. Finally, the gel was placed in ddH2O and photographed using a Canon EOS RP camera (Canon Inc., Tokyo, Japan) for documentation (Figure 4).

4.5. Analysis of gSSR Markers

The genetic diversity of 160 X. strumarium species and 40 X. sibiricum species was reflected by newly developed gSSR markers. These species were amplified by the selective gSSR primers. The genetic diversity indices, including the number of different alleles (Na), the number of effective alleles (Ne), Shannon’s information index (I), observed heterozygosity (Ho), and expected heterozygosity (He), were computed using GenAlEx 6.5 software [44]. Polymorphism information content (PIC) was calculated by PowerMarker 3.25 software [45].

5. Conclusions

A total of 169 gSSR markers were developed from an invasive plant species of X. strumarium for the first time, in contrast to the previously limited number of SSR markers. These SSR markers were distributed across all chromosomes, with a distribution of 5–15 markers per chromosome. These 169 gSSR markers can potentially enable marker-assisted taxonomy to resolve ambiguities among morphologically similar X. strumarium or X. sibiricum species. They can help elucidate population genetic structure and infer invasion routes of X. strumarium. Additionally, they can facilitate investigation into the genetic basis of adaptive evolution, particularly by identifying loci under selection associated with invasiveness traits like rapid growth and stress tolerance in future research. In general, they effectively contribute to the molecular identification of taxonomically ambiguous groups and address critical scientific issues such as hybridization, introgression, and adaptive evolution in Xanthium.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants14223522/s1, Figure S1. Number of predicted SSRs with different chromosome in X. strumarium. The x-axis represents Chromosome, and the y-axis represents Number of predicted SSRs. The line chart displays the fluctuating trend of predicted SSR numbers across different chromosomes. Figure S2. The 169 SSR markers with different chromosomes in X. strumarium. Chr + a number represents different chromosomes. Xst and Xsi represent Xanthium strumarium and Xanthium sibiricum, respectively. Table S1. The number and proportion of SSRs on different chromosomes of X. strumarium. Number represents the total count of SSRs identified on each specific chromosome, and Percentage indicates the proportion of SSRs on that chromosome relative to the total SSRs. Table S2. 169 SSR loci information of X. strumarium. SSR motif: The repeating nucleotide sequence (e.g., TG, GA, AT) that defines the SSR. SSR length: The total length of the SSR sequence. SSR repeat number: The number of times the SSR motif is repeated within the SSR sequence. Location: The genomic position of the SSR locus. Sequence: The DNA sequence containing the SSR motif. SSR Name: A unique identifier for each SSR locus. Left primer/Right primer: The forward and reverse PCR primers designed for amplifying the SSR locus. SSR amplicon size: The expected size of the PCR product when the locus is amplified using the specified primers. Table S3. Distribution of the verified 169 SSR motif types with different numbers. Rows represent specific SSR motifs (e.g., AT, TA, AG), and columns indicate the number of repeats (ranging from 4 to 27). Values in the table represent the count of each SSR motif type with a given repeat number.

Author Contributions

J.Y., Q.B., and Y.M. performed the experiment. J.Y., H.M., C.Z., and Y.S. analyzed the data. X.Z. and Y.F. wrote the manuscript and designed the experiments. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported by the National Key R&D Program (2023YFC2604500 and 2023YFC2604502), the National Natural Science Foundation of China (32301467, 32402609, and 32171666), and the Natural Science Foundation of Liaoning Province (2023-BSBA-279).

Data Availability Statement

The data presented in this study are available in the text and Supplementary Materials. The data presented in this study are available upon request from the corresponding author.

Acknowledgments

The authors thank 51edit.com for the English editing service for this manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Tang, J.S.; Ma, M. Genetic diversity and genetic differentiation of invasive weed Xanthium italicum in China. C. R. Biol. 2020, 343, 63–72. [Google Scholar] [CrossRef]
  2. Sun, S.M.; Chen, J.X.; Feng, W.W.; Zhang, C.; Huang, K.; Guan, M.; Sun, J.K.; Liu, M.C.; Feng, Y.L. Plant Strategies for Nitrogen Acquisition and Their Effects on Exotic Plant Invasions. Biodivers. Sci. 2021, 29, 72–80. [Google Scholar] [CrossRef]
  3. Luo, J.J.; Gao, Y.M.; Feng, W.W.; Liu, M.C.; Qu, B.; Zhang, C.; Feng, Y.L. Stronger Ability to Absorb Nitrate and Associated Transporters in the Invasive Plant Xanthium strumarium Compared with Its Native Congener. Environ. Exp. Bot. 2022, 198, 104851. [Google Scholar] [CrossRef]
  4. Zhang, X.; Bai, S.; Min, H.; Cui, Y.; Sun, Y.; Feng, Y. Evolutionary dynamics of nitrate uptake, assimilation, and signalling in plants: Adapting to a changing environment. Physiol. Plant. 2025, 177, e70069. [Google Scholar] [CrossRef] [PubMed]
  5. Zhu, W.; Liu, H.; Zhang, Q.; Gao, N.; Min, H.; Sun, Y.; Zhang, X.; Feng, Y. Genome-wide identification of cytochrome b5 gene family reveals their potential roles in nitrate response in Xanthium strumarium. Plant. Physiol. Biochem. 2025, 229, 110544. [Google Scholar] [CrossRef]
  6. Han, M.; Zhang, H.; Liu, M.; Tang, J.; Guo, X.; Ren, W.; Zhao, Y.; Yang, Q.; Guo, B.; Han, Q.; et al. Increased dependence on nitrogen-fixation of a native legume in competition with an invasive plant. Plant. Divers. 2024, 46, 510–518. [Google Scholar] [CrossRef] [PubMed]
  7. Löve, D.; Dansereau, P. Biosystematic studies on Xanthium: Taxonomic appraisal and ecological status. Can. J. Bot. 1959, 37, 173–208. [Google Scholar] [CrossRef]
  8. Raman, G.; Park, K.T.; Kim, J.H.; Park, S.J. Correction to: Characteristics of the completed chloroplast genome sequence of Xanthium Spinosum: Comparative analyses, identification of mutational hotspots and phylogenetic implications. BMC Genom. 2020, 22, 97. [Google Scholar] [CrossRef]
  9. Srivastava, S.; Avvaru, A.K.; Sowpati, D.T.; Mishra, R.K. Patterns of microsatellite distribution across eukaryotic genomes. BMC Genom. 2019, 20, 153. [Google Scholar] [CrossRef]
  10. Wang, H.; Jiang, J.; Chen, S.; Qi, X.; Peng, H.; Li, P.; Song, A.; Guan, Z.; Fang, W.; Liao, Y. Next-Generation Sequencing of the Chrysanthemum Nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery. PLoS ONE 2013, 8, e62293. [Google Scholar] [CrossRef]
  11. Xu, Y.; Chen, S.; Chen, S.; Wei, X.; Shang, H.; Zhang, Q.; Zhang, J. Genome-wide development of SSR molecular markers for modern sugarcane cultivars. Front. Plant Sci. 2025, 16, 1573967. [Google Scholar] [CrossRef]
  12. Hamm, T.P.; Boggess, S.L.; Kandel, J.S.; Staton, M.E.; Huff, M.L.; Hadziabdic, D.; Shoemaker, D.; Adamczyk, J.J., Jr.; Nowicki, M.; Trigiano, R.N. Development and Characterization of 20 Genomic SSR Markers for Ornamental Cultivars of Weigela. Plants 2022, 11, 1444. [Google Scholar] [CrossRef] [PubMed]
  13. Hemasai, B.; Kumbha, D.K.; Modem, V.N.; Gannavarapu, S.K.; Bommaka, R.R.; Mallapuram, S.; Chintala, S.; Sreevalli, M.D.; Ramireddy, E.; Vemireddy, L.R. Development of miRNA-SSR and target-SSR markers from yield-associate genes and their applicability in the assessment of genetic diversity and association mapping in rice (Oryza sativa L.). Mol. Breed. 2024, 44, 30. [Google Scholar] [CrossRef] [PubMed]
  14. Shi, Z.; Zhao, W.; Li, Z.; Kang, D.; Ai, P.; Ding, H.; Wang, Z. Development and validation of SSR markers related to flower color based on full-length transcriptome sequencing in Chrysanthemum. Sci. Rep. 2022, 12, 22310. [Google Scholar] [CrossRef]
  15. Park, H.; Sa, K.J.; Hyun, D.Y.; Lee, S.; Lee, J.K. Identifying SSR Markers Related to Seed Fatty Acid Content in Perilla Crop (Perilla frutescens L.). Plants 2021, 10, 1404. [Google Scholar] [CrossRef]
  16. Chen, M.; Nie, G.; Li, X.; Yang, L.; Cai, Y.; Zhang, Y. Development of EST-SSR markers based on transcriptome sequencing for germplasm evaluation of 65 Lilies (Lilium). Mol. Biol. Rep. 2023, 50, 3259–3269. [Google Scholar] [CrossRef]
  17. Liu, B.; Wu, H.F.; Cao, Y.Z.; Yang, X.M.; Sui, S.Z. Establishment of Novel Simple Sequence Repeat (SSR) Markers from Chimonanthus praecox Transcriptome Data and Their Application in the Identification of Varieties. Plants 2024, 13, 2131. [Google Scholar] [CrossRef] [PubMed]
  18. Squirrell, J.; Hollingsworth, P.M.; Woodhead, M.; Russell, J.; Lowe, A.J.; Gibby, M.; Powell, W. How much effort is required to isolate nuclear microsatellites from plants? Mol. Ecol. 2003, 12, 1339–1348. [Google Scholar] [CrossRef]
  19. Sun, N.; Chen, J.; Wang, Y.; Hussain, I.; Lei, N.; Ma, X.; Li, W.; Liu, K.; Yu, H.; Zhao, K.; et al. Development and utility of SSR markers based on Brassica sp. whole-genome in triangle of U. Front. Plant Sci. 2024, 14, 1259736. [Google Scholar] [CrossRef]
  20. Jiang, H.; Waseem, M.; Wang, Y.; Basharat, S.; Zhang, X.; Li, Y.; Liu, P. Development of simple sequence repeat markers for sugarcane from data mining of expressed sequence tags. Front. Plant Sci. 2023, 14, 1199210. [Google Scholar] [CrossRef]
  21. Zhang, Z.; Xie, W.; Zhao, Y.; Zhang, J.; Wang, N.; Ntakirutimana, F.; Yan, J.; Wang, Y. EST-SSR marker development based on RNA-sequencing of E. sibiricus and its application for phylogenetic relationships analysis of seventeen Elymus species. BMC Plant Biol. 2019, 19, 235. [Google Scholar] [CrossRef]
  22. Li, J.; Zhang, C.; Chen, S.; Jiang, K.; Guan, H.; Liu, W. Characterization and Application of EST-SSR Markers Developed from Transcriptome Sequences in Elymus breviaristatus (Poaceae: Triticeae). Genes 2023, 14, 302. [Google Scholar] [CrossRef]
  23. Blair, M.W.; Buendía, H.F.; Giraldo, M.C.; Métais, I.; Peltier, D. Characterization of AT-Rich Microsatellites in Common Bean (Phaseolus vulgaris L.). Theor. Appl. Genet. 2008, 118, 91–103. [Google Scholar] [CrossRef]
  24. Lin, Y.; Wang, H.; Wu, Y.; Gao, S.; Schaefer, H.F., 3rd. Proton-transfer in hydrogenated guanine-cytosine trimer neutral species, cations, and anions embedded in B-form DNA. Phys. Chem. Chem. Phys. 2014, 16, 6717–6725. [Google Scholar] [CrossRef]
  25. Subramanian, H.; Gatenby, R.A. Evolutionary advantage of anti-parallel strand orientation of duplex DNA. Sci. Rep. 2020, 10, 9883. [Google Scholar] [CrossRef] [PubMed]
  26. Shi, J.; Huang, S.; Fu, D.; Yu, J.; Wang, X.; Hua, W.; Liu, S.; Liu, G.; Wang, H. Evolutionary Dynamics of Microsatellite Distribution in Plants: Insight from the Comparison of Sequenced Brassica, Arabidopsis and Other Angiosperm Species. PLoS ONE 2013, 8, e59988. [Google Scholar] [CrossRef] [PubMed]
  27. Zheng, X.; Pan, C.; Diao, Y.; You, Y.; Yang, C.; Hu, Z. Development of microsatellite markers by transcriptome sequencing in two species of Amorphophallus (Araceae). BMC Genom. 2013, 19, 490. [Google Scholar] [CrossRef] [PubMed]
  28. Yang, J.; Zhang, J.; Han, R.; Zhang, F.; Mao, A.; Luo, J.; Dong, B.; Liu, H.; Tang, H.; Zhang, J.; et al. Target SSR-Seq: A Novel SSR Genotyping Technology Associate With Perfect SSRs in Genetic Analysis of Cucumber Varieties. Front. Plant Sci. 2019, 10, 531. [Google Scholar] [CrossRef]
  29. Wu, F.; Zhang, S.; Gao, Q.; Liu, F.; Wang, J.; Wang, X. Genetic diversity and population structure analysis in a large collection of Vicia amoena in China with newly developed SSR markers. BMC Plant. Biol. 2021, 21, 544. [Google Scholar] [CrossRef]
  30. Xiao, N.; Wang, H.; Yao, W.; Zhang, M.; Ming, R.; Zhang, J. Development and Evaluation of SSR Markers Based on Large Scale Full-Length Transcriptome Sequencing in Sugarcane. Trop. Plant. Biol. 2020, 13, 343–352. [Google Scholar] [CrossRef]
  31. Meyer, L.; Causse, R.; Pernin, F.; Scalone, R.; Bailly, G.; Chauvel, B.; Délye, C.; Le Corre, V. New gSSR and EST-SSR markers reveal high genetic diversity in the invasive plant Ambrosia artemisiifolia L. and can be transferred to other invasive Ambrosia species. PLoS ONE 2017, 12, e0176197. [Google Scholar] [CrossRef]
  32. Samarina, L.S.; Malyarovskaya, V.I.; Reim, S.; Yakushina, L.G.; Koninskaya, N.G.; Klemeshova, K.V.; Shkhalakhova, R.M.; Matskiv, A.O.; Shurkina, E.S.; Gabueva, T.Y.; et al. Transferability of ISSR, SCoT and SSR Markers for Chrysanthemum × Morifolium Ramat and Genetic Relationships Among Commercial Russian Cultivars. Plants 2021, 10, 1302. [Google Scholar] [CrossRef] [PubMed]
  33. Gong, L.; Deng, Z. EST-SSR markers for gerbera (Gerbera hybrida). Mol. Breed. 2010, 26, 125–132. [Google Scholar] [CrossRef]
  34. Ellegren, H. Microsatellites: Simple sequences with complex evolution. Nat. Rev. Genet. 2004, 5, 435–445. [Google Scholar] [CrossRef]
  35. Selkoe, K.A.; Toonen, R.J. Microsatellites for ecologists: A practical guide to using and evaluating microsatellite markers. Ecol. Lett. 2006, 9, 615–629. [Google Scholar] [CrossRef]
  36. Doyle, J.J.; Jane, L.D. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
  37. Beier, S.; Thiel, T.; Münch, T.; Scholz, U.; Mascher, M. MISA-web: A web server for microsatellite prediction. Bioinformatics 2017, 33, 2583–2585. [Google Scholar] [CrossRef] [PubMed]
  38. Thiel, T.; Michalek, W.; Varshney, R.K.; Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 2003, 106, 411–422. [Google Scholar] [CrossRef] [PubMed]
  39. Zhao, H.; Sun, H.; Li, L.; Lou, Y.; Li, R.; Qi, L.; Gao, Z. Transcriptome-based investigation of cirrus development and identifying microsatellite markers in rattan (Daemonorops jenkinsiana). Sci. Rep. 2017, 7, 46107. [Google Scholar] [CrossRef]
  40. Bonthala, B.; Abdin, M.Z.; Arya, L.; Pandey, C.D.; Sharma, V.; Yadav, P.; Verma, M. Genome-wide SSR markers in bottle gourd: Development, characterization, utilization in assessment of genetic diversity of National Genebank of India and synteny with other related cucurbits. J. Appl. Genet. 2022, 63, 237–263. [Google Scholar] [CrossRef]
  41. Rozen, S.; Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 2000, 132, 365–386. [Google Scholar] [CrossRef] [PubMed]
  42. Haider, N.; Nabulsi, I.; MirAli, N. Comparison of the efficiency of A–PAGE and SDS–PAGE, ISSRs and RAPDs in resolving genetic relationships among Triticum and Aegilops species. Genet. Resour. Crop Evol. 2010, 57, 1023–1039. [Google Scholar] [CrossRef]
  43. Sen, A.; Alikamanoglu, S. Analysis of drought-tolerant sugar beet (Beta vulgaris L.) mutants induced with gamma radiation using SDS-PAGE and ISSR markers. Mutat. Res. 2012, 738–739, 38–44. [Google Scholar] [CrossRef] [PubMed]
  44. Peakall, R.; Smouse, P.E. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research—An update. Bioinformatics 2012, 28, 2537–2539. [Google Scholar] [CrossRef]
  45. Liu, K.; Muse, S.V. PowerMarker: An integrated analysis environment for genetic marker analysis. Bioinformatics 2005, 21, 2128–2129. [Google Scholar] [CrossRef]
Figure 1. SSR length (10–100 bp) distribution information in Xanthium strumarium.
Figure 1. SSR length (10–100 bp) distribution information in Xanthium strumarium.
Plants 14 03522 g001
Figure 2. Distribution of motif types in the whole genomes of X. strumarium. SSR motif types include the following five categories: Dinucleotide (A), Trinucleotide (B), Tetranucleotide (C), Pentanucleotide (D), and Hexanucleotide (E). The differently colored boxes in the pie graph represent the relative sizes of the proportions of different motifs.
Figure 2. Distribution of motif types in the whole genomes of X. strumarium. SSR motif types include the following five categories: Dinucleotide (A), Trinucleotide (B), Tetranucleotide (C), Pentanucleotide (D), and Hexanucleotide (E). The differently colored boxes in the pie graph represent the relative sizes of the proportions of different motifs.
Plants 14 03522 g002
Figure 3. Distribution information of 169 SSRs locus in X. strumarium. (A) The number of SSR loci with different chromosomes. (B) The different number of SSR motifs.
Figure 3. Distribution information of 169 SSRs locus in X. strumarium. (A) The number of SSR loci with different chromosomes. (B) The different number of SSR motifs.
Plants 14 03522 g003
Figure 4. A schematic workflow of the gSSR marker development process for X. strumarium and X. sibiricum.
Figure 4. A schematic workflow of the gSSR marker development process for X. strumarium and X. sibiricum.
Plants 14 03522 g004
Table 1. Distribution of SSR length (450,847) in the genome of X. strumarium.
Table 1. Distribution of SSR length (450,847) in the genome of X. strumarium.
Length (bp)NumberPercentage
10–20221,42549.11%
21–3057,03712.65%
31–4033,4187.41%
41–5021,2874.72%
51–6012,5572.79%
61–7048581.08%
71–8020340.45%
81–908330.18%
91–1002880.06%
100+97,11021.54%
Table 2. (a) Distribution of predicted SSR motif types with different numbers (Part 1). (b) Distribution of the predicted SSR motif types with different numbers (Part 2).
Table 2. (a) Distribution of predicted SSR motif types with different numbers (Part 1). (b) Distribution of the predicted SSR motif types with different numbers (Part 2).
(a)
  No. of Repeats45678910111213
 No. 
SSR Motif Types  
Dinucletides0036,58021,92816,15211,5988174612250204492
Trinucleotides88,73424,4119725421320871334970831770703
Tetranucleotides12,48230118233081578247351716
Pentanucleotides346371320598715840343323
Hexanucleotides2609620226107603520191211
Total107,28828,75547,55926,65418,52713,1079251704158525245
Percentage32.55%8.72%14.43%8.09%5.62%3.98%2.81%2.14%1.78%1.59%
(b)
  No. of Repeats1415161718192020+TotalPercentageAverage Physical Distance
 No. 
SSR Motif Types  
Dinucletides425041594051397438093823374926,27316,415449.81%10.17 kb
Trinucleotides6416365945315154614542212139,82242.43%12.13 kb
Tetranucleotides11564310517,0135.16%80.38 kb
Pentanucleotides2110813946748161.46%332.88 kb
Hexanucleotides912321021937671.14%442.9 kb
Total493248224662452443374289421128,516329,572100%6.91 kb
Percentage1.50%1.46%1.41%1.37%1.32%1.30%1.28%8.65%100//
Table 3. (a) Characteristics and diversity statistics for the 18 SSR markers evaluated in Xanthium plants (200 individuals) (Part 1). (b) Characteristics and diversity statistics for the 18 SSR markers evaluated in Xanthium plants (200 individuals) (Part 2).
Table 3. (a) Characteristics and diversity statistics for the 18 SSR markers evaluated in Xanthium plants (200 individuals) (Part 1). (b) Characteristics and diversity statistics for the 18 SSR markers evaluated in Xanthium plants (200 individuals) (Part 2).
(a)
 LocusINVZ-191INVZ-456INVZ-705INVZ-864INVZ-1104INVZ-1274INVZ-1501INVZ-1652INVZ-1925INVZ-2051
Types 
No. of Different Alleles (Na)1.200 ± 0.3021.700 ± 0.4832.400 ± 0.3692.900 ± 0.9201.500 ± 0.5063.200 ± 1.2072.400 ± 0.7692.400 ± 0.5002.100 ± 1.2821.500 ± 0.608
No. of Effective Alleles (Ne)1.066 ± 0.1201.282 ± 0.3032.185 ± 0.4032.395 ± 0.4601.350 ± 0.3512.261 ± 0.3411.933 ± 0.3262.101 ± 0.2721.992 ± 1.1641.465 ± 0.596
Shannon’s Information
Index (I)
0.076 ± 0.0560.250 ± 0.2250.767 ± 0.1960.890 ± 0.2120.265 ± 0.2520.885 ± 0.1930.669 ± 0.2100.763 ± 0.1360.702 ± 0.4140.474 ± 0.235
Observed Heterozygosity (Ho)0.000 ± 0.0010.242 ± 0.1280.800 ± 0.2211.000 ± 0.0010.326 ± 0.1460.950 ± 0.1130.816 ± 0.2610.924 ± 0.1390.700 ± 0.3460.590 ± 0.364
Expected Heterozygosity (He)0.046 ± 0.0350.156 ± 0.1540.507 ± 0.1190.561 ± 0.0650.181 ± 0.1740.544 ± 0.0540.445 ± 0.1320.510 ± 0.0630.424 ± 0.2230.340 ± 0.169
Polymorphism Information Content (PIC)0.215 ± 0.1170.343 ± 0.1150.801 ± 0.0660.778 ± 0.0780.839 ± 0.1380.853 ± 0.0450.612 ± 0.1480.691 ± 0.0650.852 ± 0.1950.774 ± 0.162
(b)
 LocusINVZ-2330INVZ-2478INVZ-2645INVZ-2906INVZ-3183INVZ-3458INVZ-3563INVZ-3764Average
Types 
No. of Different Alleles (Na)1.200 ± 0.8121.8 ± 0.4522.300 ± 0.4832.000 ± 0.0011.200 ± 0.7392.600 ± 1.1291.700 ± 0.8302.000 ± 0.0012.006
No. of Effective Alleles (Ne)1.077 ± 0.7441.8 ± 0.4522.090 ± 0.2022.000 ± 0.0011.200 ± 0.7391.851 ± 0.5571.104 ± 0.3282.000 ± 0.0011.730
Shannon’s Information
Index (I)
0.332 ± 0.2880.624 ± 0.1570.756 ± 0.1160.693 ± 0.0010.416 ± 0.2560.712 ± 0.2200.239 ± 0.2020.693 ± 0.0010.567
Observed Heterozygosity (Ho)0.410 ± 0.3640.900 ± 0.2260.945 ± 0.1121.000 ± 0.0010.600 ± 0.3700.714 ± 0.3850.170 ± 0.1511.000 ± 0.0010.672
Expected Heterozygosity (He)0.222 ± 0.1930.450 ± 0.1130.515 ± 0.0420.500 ± 0.0010.300 ± 0.1850.448 ± 0.1270.138 ± 0.1200.500 ± 0.0010.377
Polymorphism Information Content (PIC)0.678 ± 0.1700.491 ± 0.1580.623 ± 0.0630.644 ± 0.0430.375 ± 0.2030.812 ± 0.1530.161 ± 0.0900.614 ± 0.0530.620
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yin, J.; Bai, Q.; Mao, Y.; Min, H.; Zhang, C.; Sun, Y.; Zhang, X.; Feng, Y. Genome-Wide Development and Characterization of 169 gSSR Markers in the Invasive Plant Xanthium strumarium L. Plants 2025, 14, 3522. https://doi.org/10.3390/plants14223522

AMA Style

Yin J, Bai Q, Mao Y, Min H, Zhang C, Sun Y, Zhang X, Feng Y. Genome-Wide Development and Characterization of 169 gSSR Markers in the Invasive Plant Xanthium strumarium L. Plants. 2025; 14(22):3522. https://doi.org/10.3390/plants14223522

Chicago/Turabian Style

Yin, Junshuang, Qingyao Bai, Yiting Mao, Hui Min, Chunsha Zhang, Yibo Sun, Xiaojia Zhang, and Yulong Feng. 2025. "Genome-Wide Development and Characterization of 169 gSSR Markers in the Invasive Plant Xanthium strumarium L." Plants 14, no. 22: 3522. https://doi.org/10.3390/plants14223522

APA Style

Yin, J., Bai, Q., Mao, Y., Min, H., Zhang, C., Sun, Y., Zhang, X., & Feng, Y. (2025). Genome-Wide Development and Characterization of 169 gSSR Markers in the Invasive Plant Xanthium strumarium L. Plants, 14(22), 3522. https://doi.org/10.3390/plants14223522

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop