Next Article in Journal
Serum Amyloid A-Dependent Inflammasome Activation and Acute Injury in a Mouse Model of Experimental Stroke
Next Article in Special Issue
Omics Technologies in Molecular Biology
Previous Article in Journal
The Endogenous Metabolite TDCA Ameliorates LPS-Driven Liver Injury via Modulation of Caspase-11/GSDMD-Mediated Pyroptosis
Previous Article in Special Issue
Untargeted Serum Proteomics in the Fontan Circulation Reveals Three Distinct Molecular Signatures of Fontan Physiology with CYB5R3 Among Key Proteins
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessing Genetic Structure and Diversity Using Multiple Molecular Markers to Guide Conservation Management of Saussurea involucrata in the Eastern Tianshan Mountains

1
College of Forestry and Landscape Architecture, Xinjiang Agricultural University, Urumqi 830011, China
2
State Key Laboratory of Desert and Oasis Ecology, Key Laboratory of Ecological Safety and Sustainable, Development in Arid Lands, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Xinjiang 830011, China
3
Turpan Eremophytes Botanical Garden, Chinese Academy of Sciences, Turpan 838008, China
4
Xinjiang Key Lab of Conservation and Utilization of Plant Gene Resources, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2026, 27(5), 2274; https://doi.org/10.3390/ijms27052274
Submission received: 26 January 2026 / Revised: 19 February 2026 / Accepted: 24 February 2026 / Published: 28 February 2026
(This article belongs to the Special Issue Omics Technologies in Molecular Biology)

Abstract

Saussurea involucrata is a rare perennial herb endemic to the alpine zone of the Tianshan Mountains, possessing significant medicinal value yet facing severe threats from overharvesting and habitat fragmentation. The core distribution area and recognized genetic differentiation center of this species are located in the Bayinbuluke region of the Western Tianshan Mountains. In contrast, the genetic distinctiveness and conservation status of populations in the Eastern Tianshan Mountains have remained unclear. To clarify the genetic relationships, we conducted an integrated analysis using nuclear microsatellite (SSR) markers as well as chloroplast (cpDNA) and nuclear ribosomal DNA (nrDNA) sequences on 16 populations (5 from the Eastern Tianshan Mountains and 11 from the Bayinbuluke region). The results showed that the Eastern Tianshan Mountains populations exhibited higher genetic diversity (mean He = 0.5568). Bayesian clustering and principal coordinate analysis (PCoA) clearly separated all populations into two genetic groups corresponding to the two geographical regions. Notably, private haplotypes (cpDNA H1 and nrDNA H7) were identified exclusively in the Eastern Tianshan populations, and no recent genetic bottleneck was detected, indicating historical demographic stability. These findings demonstrate significant genetic differentiation and a unique evolutionary trajectory in the Eastern Tianshan Mountains populations, likely resulting from long-term geographical isolation and local adaptation to arid environments. Therefore, we propose that these populations be managed as an independent Management Unit (MU) to preserve their unique genetic legacy. This study provides critical genetic evidence for refining conservation strategies and promoting the sustainable use of this endangered species, while the established molecular marker system also offers a reliable framework for its geographical traceability.

1. Introduction

The conservation of endangered species represents a critical challenge that necessitates a multidisciplinary approach. In this endeavor, the fields of conservation genetics and population ecology have become indispensable, providing the scientific foundation for diagnosing threats and formulating effective recovery strategies [1]. Conservation genetics specifically aims to reveal the genetic mechanisms underlying population decline and extinction risk in endangered species. By elucidating patterns of genetic diversity, structure, and gene flow, this discipline offers a theoretical framework for developing scientifically sound and actionable protection plans [2]. Assessing the genetic diversity of an endangered species constitutes the fundamental first step in evaluating its current conservation status [3,4,5]. Such an assessment not only informs our understanding of its present viability but also aids in predicting potential future evolutionary trajectories and in designing interventions to prevent further population decline. Ultimately, genetic diversity serves as the essential substrate for species survival, adaptation, and long-term evolutionary resilience.
Saussurea involucrata (Kar. & Kir.) Sch. Bip., a perennial herb within the Asteraceae family, exemplifies a species facing severe conservation threats. Its distribution is primarily confined to the alpine zones of the Tianshan Mountains, with a major concentration in the western ranges. Valued for centuries in traditional medicine, S. involucrata demonstrates notable pharmacological effects against inflammation, oxidative stress, fatigue, and rheumatic diseases [6]. This very utility, however, has led to its severe overexploitation through excessive collection from the wild. Compounding this anthropogenic pressure are the species’ intrinsic biological constraints. S. involucrata requires extremely stringent environmental conditions, with its distribution tightly restricted to narrow habitats adjacent to the alpine snowline. Furthermore, global climate change is causing rapid glacial retreat and snowline elevation in its mountain habitat [7], leading to continual diminishment and fragmentation of its already limited niche. In recent decades, the synergistic effects of overharvesting and climate-driven habitat degradation have resulted in a significant decline in natural populations, drastically elevating its risk of extinction [8]. The species is further hampered by slow growth and limited natural regeneration capacity, making population recovery from disturbance exceptionally difficult. In recognition of its precarious state, S. involucrata is listed as a nationally protected plant of Class II in China.
Research on S. involucrata spans several areas, including phytochemistry, pharmacology [9,10], and the development of artificial cultivation techniques [11,12]. Significant attention has also been directed toward its conservation genetics. For instance, an early study by Shi analyzed five populations from the Western Tianshan Mountains, identifying 12 cpDNA and 5 nrDNA haplotypes [13]. This work revealed high genetic diversity and found no evidence of recent population bottlenecks or rapid expansion. Expanding the geographical scope, Hu examined nine populations across the Tianshan and Altay Mountains, discovering 12 chloroplast and 7 nuclear haplotypes that delineated two separate lineages [14]. This study proposed the Bayinbuluke area as a potential center of origin and genetic divergence for the species. A subsequent, finer-scale analysis by Hu, utilizing 18 SSR markers on 112 individuals from 11 Bayinbuluke populations, further confirmed high genetic diversity within this key region [15].
Despite these advancements, a significant geographical bias persists in the genetic research on S. involucrata. Previous studies have predominantly focused on Central and Western Tianshan populations, paying insufficient attention to those in the Eastern Tianshan Mountains. Consequently, genetic information for these Eastern Tianshan Mountains populations remains scarce, creating a critical knowledge gap that limits the development of comprehensive, range-wide conservation strategies. To address this gap, our study focuses on the genetic diversity and structure of S. involucrata populations in the Eastern Tianshan Mountains, using the well-characterized Bayinbuluke region as a reference. Our findings aim to provide concrete suggestions for the targeted protection of these neglected Eastern Tianshan Mountains populations.
A robust framework for translating genetic data into conservation action involves defining the Management Unit (MU) [16]. Evolutionarily Significant Unit (ESU) are traditionally defined as populations that exhibit reciprocal monophyly in organellar DNA lineages (e.g., mtDNA or cpDNA) and significant divergence in nuclear allele frequencies, reflecting historically isolated lineages with long-term evolutionary independence according to Moritz [8,17,18]. Expanding on this framework, Funk [19] advocate that the ideal delineation of ESU should incorporate evidence of adaptive divergence, rather than relying solely on neutral genetic markers, which may capture historical isolation without necessarily implying adaptive significance. In contrast, MUs are defined on a contemporary timescale based on significant differences in allele frequencies, reflecting restricted gene flow and serving as practical units for immediate management interventions. Integrating these concepts allows for the identification of lineages requiring priority conservation from an evolutionary standpoint while enabling tailored management for specific populations on an ecological timescale.
Advances in molecular biology have profoundly enhanced our ability to delineate such units. Genetic markers have evolved from morphological and biochemical traits to sophisticated molecular tools [20]. Among these, co-dominant microsatellite (SSR) markers are widely used in conservation genetics due to their high polymorphism, reproducibility, and utility in analyzing contemporary genetic diversity and population structure [21]. For example, SSR markers [22] have successfully revealed population differentiation in the endangered Taxus chinensis, informing the delineation of priority conservation units [23]. To complement this contemporary snapshot, sequences from non-coding chloroplast DNA (cpDNA) regions (e.g., trnL-trnF), which are maternally inherited and evolve at a moderate rate [24], are invaluable for reconstructing phylogeographic history and inferring historical population dynamics. A synergistic approach combining fast-evolving SSR markers (reflecting recent demographic processes) with more conserved cpDNA and nrDNA sequences (revealing deeper evolutionary history) provides a comprehensive basis for defining independent conservation units.
The primary purpose of this study is to elucidate the genetic relationship between the understudied populations of S. involucrata in the Eastern Tianshan Mountains and the populations in the Bayinbuluke area. We employ an integrated molecular approach using eighteen polymorphic SSR markers [15], three cpDNA regions (trnL-F, matK, ndhF-rpl32), and the nrDNA ITS1-4 region. Specifically, our objectives are to (1) assess and compare the levels of genetic diversity in Eastern Tianshan Mountains and Bayinbuluke populations; (2) analyze the population genetic structure and differentiation across the study area; (3) assess the relationship between genetic differentiation and geographical environment was inferred; and (4) based on the combined genetic evidence, evaluate the conservation status of the Eastern populations and propose specific protection strategies. The results will provide critical genetic evidence to guide both in situ and ex situ conservation efforts, establishing a solid scientific foundation for the sustainable preservation of this valuable alpine medicinal plant.

2. Results

2.1. Genetic Diversity

Based on the analysis of genetic diversity parameters in S. involucrata, the results are summarized in Table 1. Among the 16 populations examined, the average number of samples per locus per population (N) ranged from 3.667 to 20.278, with a mean of 9.660. To comprehensively assess allelic variation, this study analyzed metrics including the mean number of observed alleles (Na) and the mean effective number of alleles (Ne). Na refers to the average number of alleles observed per locus, reflecting allelic richness, whereas Ne takes allele frequencies into account, estimating the equivalent number of alleles under equal frequency conditions and thus indicating the evenness of allele distribution. A significant difference between Na and Ne often suggests the presence of rare alleles or uneven allele frequencies, which may result from genetic drift or recent population bottlenecks. Na values ranged from 2.278 to 6.944, and Ne ranged from 1.838 to 3.591. The overall means of Na and Ne across all populations were 3.479 and 2.459, respectively. Population 12 from the Eastern Tianshan Mountains exhibited the highest values for N, Na, and Ne. Observed heterozygosity (Ho) across populations varied from 0.306 to 0.534, with an average of 0.402. Expected heterozygosity (He) ranged from 0.315 to 0.647, with an overall mean of 0.488. The mean Shannon’s Information Index (I) was 0.905, which integrates both allelic richness and evenness and serves as another common indicator of genetic diversity. Regionally, the average genetic diversity indices were higher in populations from the Eastern Tianshan Mountains than in those from the Bayinbuluke area, indicating a relatively higher level of genetic diversity. In contrast, populations 10 and 11 displayed lower values across these metrics, suggesting more limited genetic diversity within these groups.
AMOVA analysis (Table 2) revealed that only 2% of the total genetic variance was attributed to differences among populations, while 60% was found among individuals within populations, and the remaining 38% resided within individuals. This result indicates a low level of genetic differentiation among the studied populations of S. involucrata. Overall, the genetic diversity of S. involucrata was maintained both within and between the populations of the Eastern Tianshan Mountains and the Bayinbuluke area.
Nei’s genetic distance and pairwise FST values (Table A1 and Table A2) are key parameters for evaluating genetic differentiation among populations (Figure 1). Both metrics are derived from changes in allele frequencies, and they effectively reveal patterns of genetic divergence and relatedness. Analysis of the genetic distance matrix indicates that the distance between population 4 and population 5 was the smallest, reflecting the lowest level of genetic divergence between any two populations. In contrast, the genetic distance between population 10 and population 13 was the largest, indicating the greatest degree of differentiation among all pairwise comparisons. Consistent with this, population 16 exhibited the largest genetic distances when compared to most other populations, suggesting that it is the most genetically distinct group overall. Furthermore, comparisons between populations from the Eastern Tianshan Mountains (populations 12–16) and those from the Bayinbuluke area revealed generally larger genetic distances, highlighting a pattern of genetic differentiation between these two regional groups.
This study assessed recent population bottlenecks in the S. involucrata populations by examining the distribution of allele frequencies using the sign test, the Wilcoxon signed-rank test, and a mode-shift indicator. The results revealed distinct distribution patterns among the populations (Table 3). Based on the one-tailed probability for heterozygosity excess under both the Two-Phase Model (T.P.M.) and the Stepwise Mutation Model (S.M.M.), as well as the shape of the allele frequency distribution, the populations were clearly categorized into two groups: those exhibiting a normal L-shaped distribution and those displaying a shifted distribution mode. Analysis indicated that six populations (No. 1, 3, 7, 9, 12, 13) maintained a normal L-shaped distribution, as their test results did not provide significant evidence for a recent bottleneck. Conversely, the allele frequency distribution was shifted in the remaining ten populations (No. 2, 4, 5, 6, 8, 10, 11, 14, 15, 16). Within the latter group, the results further varied. For five populations (No. 2, 11, 14, 15, 16), the sign test and Wilcoxon test p-values under both mutation models were greater than 0.05, yet their allele frequency distributions were classified as shifted. A different pattern was observed for four populations (No. 4, 5, 6, 8). For these, the p-values for heterozygosity excess under the T.P.M. and S.M.M. were significant (p < 0.05), the mode-shift indicator showed a shifted distribution, and while the sign test p-value was >0.05, the Wilcoxon test p-value was <0.05.

2.2. Genetic Structure Analysis

Based on microsatellite data, population genetic structure was investigated using principal coordinate analysis (PCoA; Figure 2) and a Bayesian clustering approach implemented in STRUCTURE. The first three principal coordinates from the PCoA collectively explained 6.24% of the total genetic variation, with individual contributions of 3.17% (PC1), 1.59% (PC2), and 1.46% (PC3). The PCoA plot revealed a generally clustered distribution, with individuals from most populations overlapping broadly in the multivariate space. Although the 16 populations collectively formed one main cluster, a few individuals from several populations were positioned away from their respective group centroids. No clear separation was observed according to geographic source populations, further supporting the presence of extensive gene flow across the sampled range.
To infer population genetic structure, a Bayesian clustering analysis was performed using Structure 2.3.4 software. The ΔK method indicated that the optimal number of genetic clusters (K) was 2 (Figure 3). ΔK = 2, and the 16 populations were clearly divided into two major groups (Figure 4): Group I comprised populations 1–11 from the Bayinbuluke area, and Group II consisted of populations 12–16 from the Eastern Tianshan Mountains. This result reveals that the genetic structure of S. involucrata is associated with geographical distribution, reflecting broad-scale divergence between the two regions. At the same time, the individual assignment plots show a degree of shared ancestry or admixture between groups, indicating that historical or ongoing gene flow has occurred across this geographical divide.
To assess historical phylogeographic patterns, haplotype networks were constructed for both nrDNA (ITS1–4, 558 bp) and cpDNA (combined three non-coding regions, 2964 bp) sequences (Figure 5 and Figure 6). A total of 11 cpDNA haplotypes (H1–H11) and 7 nrDNA haplotypes (H1–H7) were identified. The analysis revealed distinct geographic structuring. In the cpDNA network, several private haplotypes were found in the Eastern Tianshan Mountains populations. While haplotype H5 occupied a central position in the network topology, it was geographically restricted to the Bayinbuluke area. Similarly, haplotype H1 was private to the Eastern Tianshan Mountains populations. This pattern of geographically restricted, private haplotypes suggests a history of prolonged isolation and independent evolutionary trajectories between the regions. The presence of shared haplotype H3 may indicate historical gene flow or retention of ancestral polymorphism. The nrDNA haplotype distribution was consistent with this pattern, with haplotype H7 being exclusive to the Eastern Tianshan Mountains. Overall, haplotype richness was markedly higher in the Bayinbuluke area, where the widespread haplotype H3 was also the most frequent. The combined evidence from both markers demonstrates significant differences in cpDNA and nrDNA haplotype composition between the Eastern Tianshan Mountains populations and the Bayinbuluke region. The network structure, with central haplotype H5 giving rise to the private Eastern Tianshan Mountains haplotype H1, supports a scenario of genetic divergence following long-term geographical isolation. The shared occurrence of haplotype H3 likely reflects either ancestral shared ancestry or limited historical gene exchange. The present genetic structure confirms that populations from the two regions have undergone significant genetic differentiation.
To further evaluate the regional genetic structure, populations from the Eastern Tianshan Mountains and the Bayinbuluke area were grouped and subjected to principal coordinate analysis (PCoA) (Figure 7). The results revealed a clear spatial separation along the primary axes. Populations from the Bayinbuluke area exhibited a broad distribution in the PCoA plot, indicating higher genetic variation within this region. In contrast, populations from the Eastern Tianshan Mountains clustered more tightly and were positioned distinctly below those from Bayinbuluke along the major axis of variation. This spatial segregation in the PCoA provides independent support for the genetic structure inferred from both the SSR marker analysis and the sequence-based (cpDNA and nrDNA) haplotype networks. Collectively, all lines of evidence confirm that S. involucrata populations from the Eastern Tianshan Mountains and the Bayinbuluke area possess a significant and reproducible genetic structure.
To investigate the association between environmental factors and genetic diversity in S. involucrata, this study employed Mantel tests to assess the correlation between genetic differentiation and geographic–environmental variables, to explore evolutionary hypotheses, and to analyze multivariate data. The Mantel test results indicated that the genetic diversity indices derived from SSR markers in S. involucrata were correlated with environmental factors. A significant correlation was observed between observed heterozygosity (Ho) and the Normalized Difference Vegetation Index (NDVI), which reveals a direct link between the genetic diversity of S. involucrata and the productivity and health status of its ecosystem. The study also found significant correlations between the mean number of alleles (Na) and the effective number of alleles (Ne) with precipitation of the wettest month (Bio13). Additionally, precipitation of the driest month (Bio14) showed significant correlations (p < 0.01) with Ne, expected heterozygosity (He), and Shannon’s Information Index (I) (Figure 8).
To investigate both direct and indirect relationships among the variables, redundancy analysis (RDA) was further employed to explore the associations between genetic variation and specific environmental factors. The RDA results indicated that precipitation (Bio13 and Bio14) had a highly significant effect on the genetic variation of S. involucrata, which is consistent with the findings from the Mantel tests. The analysis also revealed that longitude, slope (Podu), and river proximity exerted influences on genetic variation (Figure 9).

3. Discussion

Human activities are recognized as a primary driver of species population decline and extinction [25]. For many species, anthropogenic pressures such as habitat fragmentation and overexploitation have precipitated sharp reductions in wild population sizes, often confining remaining individuals to small, isolated fragments. This demographic trajectory frequently triggers a cascade of negative genetic consequences, including the erosion of genetic diversity, increased inbreeding depression, and the intensification of genetic drift, which collectively elevate extinction risk [26,27]. This study employed a combination of SSR markers and Sanger sequencing of cpDNA and nrDNA regions to systematically evaluate the genetic diversity and population structure of S. involucrata in the Eastern Tianshan Mountains. In our study, we analyzed the association between population genetic differentiation and environmental factors in S. involucrata. Our integrated analysis reveals a distinct genetic signature for these eastern populations. Therefore, based on the cumulative genetic evidence presented here, we propose that the populations in the Eastern Tianshan Mountains should be delineated and managed as an independent Management Unit.

3.1. Genetic Diversity of S. involucrata in Tianshan Mountains

Analysis of molecular variance (AMOVA) and population genetic parameters indicate that the majority of genetic variation in S. involucrata resides among and within individuals, with only minimal differentiation observed between populations. This pattern suggests historically frequent gene flow across its range. The average expected heterozygosity (He) was 0.557 in the Eastern Tianshan Mountains populations and 0.457 in the Bayinbuluke area populations. These values are comparatively high for an alpine specialist; for instance, the congeneric Saussurea medusa exhibits a species-level He of only 0.276 [28]. This relatively high diversity suggests that S. involucrata may not have undergone severe historical genetic bottlenecks, possibly due to persistence in regional refugia during past glacial cycles [29].
However, genetic diversity is not uniformly distributed across the landscape. Our results demonstrate that populations in the Eastern Tianshan Mountains harbor particularly high genetic diversity, which may reflect stronger genetic adaptability. This disparity could be explained if the Eastern Tianshan Mountains acted as a major glacial refugium for the species, allowing for the accumulation and preservation of ancestral genetic variation. The long-term geographical isolation between the Eastern Tianshan and Bayinbuluke populations, potentially driven by pronounced climatic gradients, may have further promoted differentiation. As noted by Hu [14], the Western Tianshan and Altai Mountains receive greater moisture via westerly airflows, whereas the Eastern Tianshan is comparatively arid. This proposed link between genetic patterns and environmental drivers, specifically precipitation, is robustly supported by our direct statistical assessments. To explicitly investigate the association between environmental factors and genetic diversity, this study employed Mantel tests and redundancy analysis (RDA). The Mantel tests confirmed significant correlations between multiple genetic diversity indices (e.g., Ne, He, I) and key precipitation variables, including precipitation of the driest month (Bio14). Furthermore, the significant correlation between observed heterozygosity (Ho) and the Normalized Difference Vegetation Index (NDVI) reveals a direct link between genetic diversity and ecosystem productivity. The RDA further substantiated that precipitation exerts a highly significant effect on genetic variation, consistent with the Mantel results, and also highlighted influences of longitude, slope, and river proximity. These findings collectively confirm that the relatively high genetic diversity in the Eastern Tianshan region may reflect adaptability to this harsher environment, as well as to the potentially more complex habitat types found there. Habitat heterogeneity is a key driver for maintaining genetic diversity, as varied environmental conditions can select for different adaptive genotypes, thereby preserving a broader genetic base.
Analysis of recent demographic history using bottleneck detection methods revealed that Eastern Tianshan populations 12 and 13 exhibit a characteristic L-shaped distribution of allele frequencies [30,31]. This pattern provides strong evidence that these populations have maintained relatively large and stable effective population sizes over time, with minimal impacts from genetic drift. The absence of a genetic bottleneck signature in these groups is particularly significant, as it indicates an enhanced capacity to retain low-frequency alleles and a high degree of genetic resilience. Consequently, populations 12 and 13 should be considered vital reservoirs of the species and genetic diversity and prioritized as core source populations in conservation strategies. In contrast, the shifted allele distribution observed in three other Eastern Tianshan Mountains populations (Nos. 14, 15, 16) should be interpreted with caution. These results may arise from statistical artifacts or sampling error rather than genuine demographic decline [32]. Small sample sizes can reduce the accuracy of allele frequency estimation, potentially leading to false signals of a bottleneck. Further sampling would be needed to clarify the demographic history of these particular groups.

3.2. Populations’ Genetic Structure and Evolutionary History

A deeper level of genetic structure was revealed through principal coordinate analysis (PCoA), Bayesian clustering, and haplotype network reconstruction. In the principal coordinate analysis (PCoA), the population of the Eastern Tianshan Mountains forms an independent cluster, which is located below the cluster in the Bayinbulak area. This spatial segregation indicates that genetic variation within the Eastern Tianshan Mountains populations is structured and differentiated from the Bayanbulak area. Simultaneously, our findings also indicate that longitude, slope, and river proximity have an influence on genetic variation. Therefore, we propose that this genetic differentiation likely arose from a combination of topographic complexity and environmental–climatic factors. This pattern suggests that the Eastern Tianshan Mountains population lineage may represent a locally specialized genetic group, potentially derived from an ancestral population that originally differentiated in the Western Tianshan Mountains region. Such heterogeneous genetic structuring is common in narrowly distributed alpine plants and is often associated with shifts in flowering phenology driven by temperature gradients, habitat fragmentation imposed by complex mountain terrain, and naturally limited gene flow [33]. The major geological history of the region provides a critical backdrop for interpreting these patterns. The multi-stage deformation in the Tianshan area is widespread in the Qinghai–Tibet Plateau and its surrounding areas. It is likely to be a synchronous response to the multi-stage uplift process of the Qinghai–Tibet Plateau [34]. This uplift not only reshaped the topography of the Tianshan but also intensified aridity in the surrounding regions [35]. Such profound geological events have had lasting impacts on the genetic architecture of regional flora [36]. Furthermore, habitat fragmentation, a consequence of such topographic changes, can reduce within-population genetic diversity while amplifying genetic differences between populations [37]. The increased aridity in the Eastern Tianshan Mountains populations compared to the west, itself influenced by plateau uplift, may be a key driver shaping the observed patterns of genetic diversity and structure in S. involucrata.
Importantly, the cpDNA and nrDNA haplotype analyses provide direct evidence for this evolutionary history. The presence of private haplotypes in the Eastern Tianshan Mountains (H1 in cpDNA and H7 in nrDNA) highlights their genetic distinctiveness. The shared cpDNA haplotype H3 may indicate either limited historical gene exchange or the retention of an ancestral polymorphism following population subdivision. The contrasting spatial patterns of cpDNA and nrDNA lineages suggest that the Tianshan populations have experienced prolonged geographical isolation leading to independent evolution. The mountains and valleys of the region likely act as effective barriers to gene flow. It is noteworthy that cpDNA primarily captures recent, localized maternal expansion events mediated by seed dispersal, whereas nrDNA more strongly reflects deeper phylogenetic structure shaped by long-term geographic isolation [38]. Therefore, this study infers infer that long-term, though not complete, geographical isolation exists between the Eastern Tianshan Mountains populations and Bayinbuluke populations, resulting in restricted gene flow. This isolation can be attributed to the limited seed dispersal capacity of S. involucrata [39] coupled with the significant geographic barriers presented by the terrain separating the two regions [40]. Together, these factors have likely enabled the formation and maintenance of the observed population genetic structure.

3.3. Association Between Genetic Patterns and Reproductive Ecology

The genetic patterns revealed by cpDNA and nuclear markers can be explained by the reproductive biology of S. involucrata. This species is a perennial, insect-pollinated, hermaphroditic (perfect-flowered) herb with a predominantly facultative outcrossing mating system. Although self-compatible, it relies heavily on pollinators for effective pollination [41]. Gene flow in S. involucrata is mediated by both seeds and pollen, which differentially shape the maternal cpDNA and nrDNA markers, resulting in distinct marker-dependent genetic structures.
In angiosperms, cpDNA is typically strictly maternally inherited; its haplotype distribution directly records the history of seed dispersal and maternal lineage migration. The pronounced east–west cpDNA phylogeographic break observed in our study, along with private haplotypes in the eastern populations, indicates limited seed-mediated gene flow over evolutionary timescales. This aligns with the species’ reproductive ecology. Although the seeds have crown hairs, successful seedling establishment in harsh, fragmented alpine scree habitats is extremely low [42,43]. Long-distance colonization across vast unsuitable terrain is even more constrained. Thus, the strong cpDNA divergence likely reflects historical isolation among glacial refugia, followed by independent evolution of the Eastern Tianshan populations’ maternal lineages driven largely by genetic drift under restricted seed dispersal.
In contrast, the shallow population differentiation and admixed signals revealed by SSR markers suggest that pollen-mediated gene flow—though limited—acts as a more effective homogenizing force than seed dispersal. Bombus spp., Calliphora uralensis, Pontia callidice, Clossiana euphrosane, and Aglais urticae are the primary pollinating insects of S. involucrata [41]. These pollinators facilitate pollen transfer at local to regional scales, partly counteracting genetic differentiation. However, their movement is strongly constrained by the complex topography and climate of the Tianshan Mountains, leading to an isolation-by-resistance pattern. Pollen flow declines not only with distance but is also impeded by deep valleys, ridges, and extensive unsuitable habitat between eastern and western regions. Consequently, the observed nuclear genetic structure reflects a dynamic balance among historical isolation, contemporary restricted pollen flow, and genetic drift. In summary, the genetic architecture of S. involucrata may result from the interplay between its reproductive traits and geographical setting. Restricted seed dispersal has led to strong a phylogeographic structure in cpDNA, while limited but ongoing pollen flow has buffered genetic differentiation in the genome.

3.4. Management Unit Division

In the integrated analysis, the congruent patterns of significant genetic differentiation observed in cpDNA and nrDNA sequence data are further corroborated by the independent SSR analysis. However, the data support that the sampled populations are independent Management Units, but do not meet the ESU conceptual standard proposed by Funk (Evolutionarily Significant Unit: a group of conspecific populations that has substantial reproductive isolation, which has led to adaptive differences so that the populations represent a significant evolutionary component of the species) [19]. The nuclear differentiation and admixture observed in our SSR and Sanger sequencing data are more consistent with the criteria for a Management Unit (MU). Our study data indicate that although the Eastern Tianshan populations possess private cpDNA and nrDNA haplotypes and exhibit significant differences in genetic diversity compared to the Bayanbulak area (significant FST and clear clustering into two groups in STRUCTURE analysis), they lack reciprocal monophyly in cpDNA, exhibit relatively shallow nuclear genome differentiation (SSR shows obvious admixture and PCoA overlap), and fail to provide evidence for reproductive isolation. Therefore, it does not meet the criteria for ESU designation in any respect. Meanwhile, the observed allele frequency differences, private alleles and haplotypes, and genetic structure under geographic isolation fully satisfy the criteria for MU designation. This consistency across multiple genetic datasets underscores a distinct, independent evolutionary history for the Eastern Tianshan Mountains populations’ lineage. Complementing this evolutionary perspective, the contemporary genetic structure inferred from SSR markers and the Bayesian clustering analysis (STRUCTURE) provides the basis for practical conservation management. We therefore propose the establishment of a distinct Management Unit (MU) for the Eastern Tianshan Mountains populations. This unit, characterized by its high genetic diversity, represents a crucial reservoir of adaptive potential for the species and should be the focus of targeted conservation strategies.

3.5. Protection Recommendations

This study posits that the Eastern Tianshan Mountains populations should be managed as an independent Management Unit. It is the main measure to protect the evolutionary trajectory of the species. Priority protection should be given to populations 12 and 13 with high genetic diversity and special haplotype populations. At present, the focus of protection is to implement preventive in situ protection, strictly maintain the existing population size and habitat integrity, and focus on protecting the unique cpDNA haplotypes to prevent genetic erosion. In order to fully maintain the adaptive evolutionary potential of S. involucrata in the Eastern Tianshan Mountains, it is necessary to implement a comprehensive protection strategy covering the entire distribution area: (1) Seed bank construction. The collection of germplasm resources should systematically cover the eastern part of the Tianshan Mountains to ensure the representativeness of all evolutionary lineages of the species. The germplasm resources in this area should be systematically collected, and a safe ecological backup should be established. (2) For the Management Unit of Tianshan Mountains, a protection strategy with habitat protection and genetic monitoring as the core should be implemented to prevent the decline in its diversity. It is recommended to establish permanent genetic monitoring plots to regularly evaluate the genetic diversity, inbreeding level, and gene flow changes of key populations, enabling us to scientifically evaluate the effect of protection measures and dynamically adjust the management plan. (3) We suggest strengthening the efficiency of law enforcement preventing illegal collection. Standardizing tourism and animal husbandry activities and minimize the interference of trampling and overeating are also recommened. (4) By transforming local herdsmen into ecological guardians and integrating them into the main body of conservation action, the establishment of community-based collaborative management mechanisms can be promoted. (5) At the same time, in order to realize the overall protection and sustainable protection of the genetic diversity of S. involucrate, we need to establish a collaborative protection mechanism for S. involucrata throughout the Tianshan Mountains.

4. Materials and Methods

All plant materials used in this study were collected from fresh leaf samples of the endangered species S. involucrata obtained from its natural populations in Xinjiang, China. A total of 168 individual specimens were sampled from 16 geographically distinct populations. Detailed locality information for each population is provided in Table 4. Given that this species is rare, with a fragmented habitat and a highly restricted distribution, conducting field surveys and collecting samples posed significant challenges. Therefore, our sampling protocol was designed to obtain a genetically representative snapshot of the existing populations while minimizing ecological disturbance. This principle guided our overall strategy for determining inter-plant spacing and sample size. To ensure genetic independence among samples and reduce potential biases from spatial autocorrelation or clonal reproduction, a minimum distance of at least 10 m was maintained between any two sampled individuals within each population. This measure helped lower the likelihood of collecting genetically identical individuals. When a suspected clonal cluster was encountered, only one representative individual was sampled from it. To ensure that the sampled units represented distinct population units, a minimum distance of >5 km was maintained between the sampling sites of different populations. The sample size per population (n = 5–21 individuals) was determined based on factors such as the endangered status of the plants and the difficulty of sampling. Fresh leaf tissue was collected from each selected plant, immediately dried in silica gel, and stored at −20 °C. DNA was subsequently extracted from the silica-dried leaves using a modified CTAB method. Key environmental parameters were recorded at each sampling site.
Amplification using SSR markers was conducted with 18 polymorphic primer pairs that produced clear and stable amplification, designed in accordance with Hu [15]. Additionally, data from Hu’s study, comprising 112 individuals across 11 populations distributed in the Bayinbuluke region, were utilized. The PCR amplification reaction system had a total volume of 25 μL, which included 1 μL of template DNA, 1 μL of upstream primer, 1 μL of downstream primer, 12.5 μL of 2× Easy Taq PCR Super Max, and 9.5 μL of ddH2O. The amplification method included three stages: initial pre-denaturation at 93 °C for 3 min, followed by 35 cycles of denaturation at 94 °C for 30 s and annealing at 54 °C for 30 s (adjusted according to the specific Tm value of each primer combination), followed by an extension step at 65 °C for 90 s, and a final extension at 65 °C for 5 min. Successful amplification was confirmed by 2% agarose gel electrophoresis. PCR products were identified through capillary electrophoresis and fluorescent labeling. The amplification reaction system and methods were conducted in accordance with Hu [15]. Sanger sequencing was conducted on three regions of chloroplast DNA (trnL-trnF, matK, and ndhF-rpl32). Primers were obtained from [13,14]. The ITS1-4 locus of the nuclear genome was amplified. The amplification reaction mixture and thermal cycling protocol were used in accordance with Shi [13].
For SSR data tesults, GenAlEx 6.51b2 [44] was employed to identify the populations’ genetic parameters for each primer pair, including the number of alleles (Na), observed heterozygosity (Ho), expected heterozygosity (He), and effective number of alleles (Ne) [45]. Principal component analysis (PCoA) and AMOVA were also conducted using GenAlEx 6.51b2. Bayesian analysis was performed using Structure 2.3.4 software [46] to determine the optimal number of genetic clusters (K) for all populations. This study employed the Admixture model, which allows individual genomes to originate from multiple ancestral populations, as it is suitable for scenarios involving gene flow or recent population admixture. To assist with clustering under weak population structure, this study employed the LOCPRIOR model, incorporating the sampling location information of individuals as prior data. The parameters were set as follows: 10,000 iterations for the Length of the Burn-in Period and 100,000 iterations for the Number of MCMC Replications after Burn-in. The K-value was tested from 1 to 19, with 20 independent runs for each K-value. The results from all runs were then compressed and imported into the online platform Structure Harvester for analysis to determine the most suitable K-value. Finally, the Structure 2.3.4 clustering analysis plot was generated. We used Bottleneck v. 1.2.02 [31,47] to evaluate whether the S. involucrata populations have recently experienced a bottleneck event. The computational models used were the Stepwise Mutation Model (SMM) and the Two-Phase Mutation Model (TPM), both developed based on the number of alleles. The TPM, which is more suitable for SSR data analysis, was configured with the following parameters: SSM at 90%, a mutation rate of 30%, and 1 × 103 repetitions. The sign test and Wilcoxon signed-rank test were employed on the 11 populations to evaluate whether excess heterozygosity was statistically significant within each population [48]. This study employed Origin 2024 software to create genetic diversity index maps for the populations of the Eastern Tianshan Mountains and Western Tianshan Mountains, using box plots. The Nei genetic distance and Fst value matrix among populations were computed using GenAlEx 6.51b2 software. The resulting matrix data were imported into Origin 2024 software to produce the basic heat map, modify the scale based on the data range, activate populations labels and numerical annotations, and refine the settings of the coordinate axes and grid lines.
The obtained DNA sequences were imported into Geneious Prime 2024.0.5 [49] for quality assessment and assembly. Following visual inspection of chromatograms to confirm base calls, sequences were manually trimmed to remove low-quality ends and vector contamination. Overlapping sequence fragments were then assembled to generate consensus sequences for each individual. Haplotype identification, along with calculations of haplotype diversity (Hd) and nucleotide diversity (π), was performed using DnaSP v. 5 [50]. The sampling points and their corresponding haplotype data were mapped using ArcMap 9.3 software to generate a haplotype geographic distribution map. Finally, Popart-1.7 [51] was used to construct a median-joining haplotype network, which visualizes the phylogenetic relationships among haplotypes.
This study considered 20 environmental factors (Table A3 and Table A4). Climatic factor data were derived from the WorldClim database [52], while topographic factors such as slope and aspect were calculated from 30 m resolution Digital Elevation Model (DEM) data obtained from the Geospatial Data Cloud platform. The distance to rivers was computed as the Euclidean distance based on the 1:1,000,000 national river dataset downloaded from the Geospatial Data Cloud. The China Annual Maximum Normalized Difference Vegetation Index (NDVI) dataset, with a spatial resolution of 30 m and spanning 2000–2020, was provided by the Land Use and Global Change Remote Sensing Team of the Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences. To evaluate the relationship between genetic differentiation and geographic–environmental variables [53], this study employed an integrated analytical framework combining Mantel tests and redundancy analysis (RDA) based on microsatellite (SSR) markers. Geographic (Euclidean) distance and environmental distance matrices were generated from standardized bioclimatic variables as well as longitude and latitude data. Mantel tests were performed using the R package linkET [54] with 9999 permutation tests to examine the correlations between the genetic distance matrix and each environmental distance matrix. The results were visualized using ggplot2 [55], where edge width corresponds to the Mantel’s r value [56] and color indicates statistical significance. Subsequently, redundancy analysis was conducted using the vegan package to investigate the multivariate influence of environmental factors on within-population genetic diversity indices, including observed heterozygosity, expected heterozygosity, mean number of alleles, effective number of alleles, and Shannon’s Information Index. Key environmental predictors were identified through forward selection (α = 0.05). Finally, quadratic regression models were fitted to characterize nonlinear relationships between the significant environmental variables identified by RDA and each genetic diversity index, thereby elucidating the relative contribution of environmental factors to the genetic diversity pattern in S. involucrata. All analyses were conducted in the R statistical environment [57].
To delineate conservation units, SSR molecular markers were combined with Sanger sequencing data to methodically identify key evolutionary units and Management Units. In this approach [17,18], Evolutionarily Significant Unit (ESU) [58] are defined based on pronounced phylogeographic differentiation, reflecting prolonged evolutionary isolation. Subsequently, within each MU, significant differences in allele frequencies among populations were evaluated using SSR data indicative of current gene flow, complemented by Bayesian analysis [16]. Independent Management Units (MUs) with limited gene flow were identified. This systematic approach ensures that the defined units accurately represent the distinctiveness of deep evolutionary history while also informing conservation management focused on current populations.

5. Conclusions

This study clarifies the genetic relationship between S. involucrata populations in the Eastern Tianshan Mountains and those in the Bayinbuluke region. It further identifies key factors shaping the genetic structure of the Eastern Tianshan populations and proposes targeted conservation measures. Our results indicate that the Eastern Tianshan populations maintain significantly higher genetic diversity than those in Bayinbuluke. More importantly, they form a distinct genetic cluster characterized by private haplotypes, reflecting a history of prolonged geographical isolation. This clear genetic demarcation supports an independent evolutionary trajectory for the eastern lineage. The topographic complexity of the Tianshan range, combined with the distinctive environmental and climatic conditions of the eastern region, appears to have played a key role in driving this genetic differentiation. These heterogeneous abiotic factors likely interact to shape gene flow, influence genetic drift, and promote local adaptation, collectively structuring the population genetics of S. involucrata.
Analysis of the relationship between genetic diversity and environmental factors provides direct evidence for these drivers. Multiple genetic diversity indices, particularly parameters related to heterozygosity and allelic richness, showed significant correlations with key environmental variables such as vegetation index (NDVI) and precipitation. Redundancy analysis (RDA) further confirmed that precipitation is the dominant environmental factor driving genetic variation, with topographic and geographic factors also exerting significant influence.
Given the high genetic diversity and observed evidence of adaptive differentiation in the Eastern Tianshan populations, this study suggests that this region be designated as an independent Management Unit (MU). Prioritizing the conservation of this unique genetic diversity is crucial to prevent its erosion due to ongoing habitat fragmentation. In addition to in situ protection, we recommend establishing a germplasm bank to serve as an ecological safety backup for these genetic resources. In summary, this study provides critical genetic evidence to inform and refine conservation strategies, highlighting the urgent need to protect the unique evolutionary potential and adaptive genetic variation of this endangered alpine species. Future research incorporating genomic approaches and ecological niche modeling is recommended to further elucidate the molecular basis of local adaptation.

Author Contributions

Conceptualization, W.S. and J.L. (Jiayi Lu); methodology, W.S.; software, J.L. (Jiayi Lu); validation, W.S.; formal analysis, J.L. (Jiayi Lu); investigation, J.W.; resources, W.S., J.W., D.Z. and D.T.; data curation, J.L. (Jiayi Lu); writing—original draft preparation, J.L. (Jiayi Lu) and K.M.; writing—review and editing, W.S.; visualization, J.L. (Jiayi Lu) and J.L. (Jingdian Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 32170386).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

The authors would like to express their heartfelt thanks to the superintendent of the HugeJili Station of the Forestry and Grassland Bureau of Hejing County for his unwavering support of the field investigation work in this paper. Meanwhile, at the same time, they sincerely thank Comrade Panwei Zhao, Secretary of the Communist Youth League Committee of Huangtian Farm, Xinxing City, the 13th Division, for his strong support of our work.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Pairwise Fst values of S. involucrata populations.
Table A1. Pairwise Fst values of S. involucrata populations.
Pop *12345678910111213141516
10.000
20.2110.000
30.0940.0740.000
40.1150.1470.0950.000
50.0950.1390.0730.0370.000
60.1710.1130.1160.1370.1290.000
70.1590.0870.0880.1180.0920.0580.000
80.1270.1860.1280.1130.1100.1060.1260.000
90.1270.1430.0880.0830.0550.1220.0950.0670.000
100.2300.1280.1280.1890.1700.2130.1340.2710.2000.000
110.1990.0860.1060.1320.0990.1470.0670.2020.1260.0860.000
120.2060.2030.1750.1670.1700.2090.1990.1830.1840.2890.2290.000
130.2110.2200.1890.1830.1930.2000.2130.1810.1930.3250.2550.0660.000
140.2030.2710.2020.1800.1990.2340.2460.2020.2030.3210.2730.1700.1550.000
150.1910.2520.1920.1800.1950.2040.2250.1910.2060.2950.2650.1570.1350.0740.000
160.2440.2850.2330.2290.2490.2440.2730.2430.2590.3560.3020.1880.1670.1080.0790.000
* 1–11 are populations distributed in the Bayinbuluke area; 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region.
Table A2. Pairwise matrix of Nei genetic distance of S. involucrata populations.
Table A2. Pairwise matrix of Nei genetic distance of S. involucrata populations.
Pop *12345678910111213141516
10.000
20.4880.000
30.1390.0910.000
40.2000.2670.1690.000
50.1490.2490.1080.0260.000
60.3680.1760.2050.2470.2260.000
70.3310.1130.1260.1980.1250.0660.000
80.2600.4220.2910.2090.2060.1880.2610.000
90.2400.2640.1710.1340.0630.2310.1450.0880.000
100.4420.1410.1540.3150.2620.3600.1580.5910.3310.000
110.4110.0860.1560.2170.1200.2310.0640.4300.2040.0580.000
120.8810.8820.8240.7320.7341.0110.9350.7940.7811.1631.0040.000
130.7800.8590.7790.6970.7450.8080.8840.6820.6991.2181.0190.2170.000
140.5640.9520.6760.4890.5670.7790.8540.5840.6030.8780.9030.6180.4330.000
150.5700.9500.6920.5750.6600.6960.8520.6110.6750.9140.9540.6330.4260.0030.000
160.8751.0970.9220.7880.8770.8741.0760.8550.9031.1291.1340.7310.5120.0930.0290.000
* 1–11 are populations distributed in the Bayinbuluke area; 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region.
Table A3. Topographic factor information.
Table A3. Topographic factor information.
Variable CodeUnitDescription
Latitude°Latitude
Longitude°Longitude
Podu°Gradient
Poxiang-Aspect
Soil_ph-Soil PH value (×10)
Clay%Percentage of clay in soil
Clcd-China Land Cover Dataset
Ndvi-Normalized Difference Vegetation Index
River-Euclidean distance from the river
Table A4. Bioclimatic factor information.
Table A4. Bioclimatic factor information.
Variable CodeUnitDescription
Bio2°CAverage daily temperature range (annual average)
Bio4-Temperature seasonality (standard deviation × 100)
Bio9°CThe average temperature of the driest quarter
Bio13mmWet month precipitation
Bio14mmPrecipitation in the driest month
Bio15-Precipitation season (coefficient of variation)
Srad_01kJ m−2 d−1Solar radiation in January
Srad_07kJ m−2 d−1Solar radiation in July
Wind_01M·s−1January wind speed
Wind_07M·s−1July wind speed
Vapr_01kPaJanuary water vapor pressure

References

  1. Chen, K.Y.; Liu, Q.; Wang, Y.; Li, D.H.; Zhang, J.I. Genetic Diversity Analysis of Rhododendron vialii Based on Microsatellite Marker. J. Yunnan Agric. Univ (Nat. Sci.) 2020, 35, 818–825. [Google Scholar] [CrossRef]
  2. Li, A.; Ge, S. Research progress of plant protection genetics. Biodivers. Sci. 2002, 1, 61–71. [Google Scholar] [CrossRef]
  3. Allendorf, F.W. Genetics and the Conservation of Natural Populations: Allozymes to Genomes. Mol. Ecol. 2017, 26, 420–430. [Google Scholar] [CrossRef]
  4. Xu, Y.; Zang, R.G. Conservation of Rare and Endangered Plant Species in China. iScience 2023, 26, 106008. [Google Scholar] [CrossRef]
  5. Wang, S.Q. Genetic Diversity and Population Structure of the Endangered Species Paeonia decomposita Endemic to China and Implications for Its Conservation. BMC Plant Biol. 2020, 20, 510. [Google Scholar] [CrossRef] [PubMed]
  6. Jagdeep, K.; Mahima, P. Phytochemistry and Pharmacology of Saussurea genus (Saussurea lappa, Saussurea costus, Saussurea obvallata, Saussurea involucrata). Mater. Today Proc. 2022, 56, 1173–1181. [Google Scholar] [CrossRef]
  7. Wang, J.W.; Tang, Z.G.; Deng, G.; Hu, G.J.; Sang, K.C. Remote Sensing Monitoring of Snowline Height at the End of Snowmelt in Tianshan Mountains from 1991 to 2021. Arid Zone Res. 2022, 39, 1385–1397. [Google Scholar] [CrossRef]
  8. Fan, W.X.; Yang, W.P.; Liu, H.S. Research Progress on Culture Technologies, Chemical Components, and Pharmacological Activities of Saussurea involucrata Cells. China J. Chin. Mater. Med. 2021, 46, 3522–3528. [Google Scholar] [CrossRef]
  9. Houchi, S.; Messasma, Z. Exploring the Inhibitory Potential of Saussurea costus and Saussurea involucrata Phytoconstituents against the Spike Glycoprotein Receptor Binding Domain of SARS-CoV-2 Delta (B.1.617.2) Variant and the Main Protease (Mpro) as Therapeutic Candidates, Using Molecular Docking, DFT, and ADME/Tox Studies. J. Mol. Struct. 2022, 1263, 133032. [Google Scholar] [CrossRef]
  10. Nadir, A.; Eni, K. The Planting Technology of Saussurea involucrata. Anim. Husb. Xinjiang 2018, 33, 45–46. [Google Scholar] [CrossRef]
  11. Li, J.P. Artificial Cultivation Techniques of Saussurea involucrata in Tianshan Mountains. Xinjiang Farm Res. Sci. Technol. 2008, 02, 21–22. [Google Scholar]
  12. Bi, Y.J. Artificial Cultivation Techniques of Saussurea involucrata in Tianshan Mountains. Xinjiang Agric. Sci. Technol. 2002, 03, 1007–3574. [Google Scholar] [CrossRef]
  13. Shi, W.; Yang, W.K.; Wang, X.Y.; Hou, Y.G. High Genetic Diversity in an Endangered Medicinal Plant, Saussurea involucrata (Saussurea, Asteraceae), in Western Tianshan Mountains, China. Conserv. Genet. 2017, 18, 1435–1447. [Google Scholar] [CrossRef]
  14. Hu, L.; Lu, T.; Wang, X.Y.; Wang, J.C.; Shi, W. Conservation Priorities and Demographic History of Saussurea involucrata in the Tianshan Mountains and Altai Mountains. Life 2023, 13, 2209. [Google Scholar] [CrossRef]
  15. Hu, L.; Wang, J.C.; Wang, X.Y.; Zhang, D.Y.; Sun, Y.; Lu, T.; Shi, W. Development of SSR Markers and Evaluation of Genetic Diversity of Endangered Plant Saussurea involucrata. Biomolecules 2024, 14, 1010. [Google Scholar] [CrossRef] [PubMed]
  16. Palsbøll, P.J.; Bérubé, M.; Allendorf, F.W. Identification of Management Units Using Population Genetic Data. Trends Ecol. Evol. 2007, 22, 11–16. [Google Scholar] [CrossRef]
  17. Fraser, D.J.; Bernatchez, L. Adaptive Evolutionary Conservation: Towards a Unified Concept for Defining Conservation Units. Mol. Ecol. 2001, 10, 2741–2752. [Google Scholar] [CrossRef] [PubMed]
  18. Moritz, C.C. Defining ‘Evolutionarily Significant Units’ for Conservation. Trends Ecol. Evol. 1994, 9, 373–375. [Google Scholar] [CrossRef]
  19. Funk, W.C.; McKay, J.K.; Hohenlohe, P.A.; Allendorf, F.W. Harnessing genomics for delineating conservation units. Trends Ecol. Evol. 2012, 27, 489–496. [Google Scholar] [CrossRef] [PubMed]
  20. Ouborg, N.J.; Pertoldi, C.; Loeschcke, V.; Bijlsma, R.; Hedrick, P.W. Conservation Genetics in Transition to Conservation Genomics. Trends Genet. 2010, 26, 177–187. [Google Scholar] [CrossRef] [PubMed]
  21. Wang, Z.F.; Peng, S.L. Plant Conservation Genetics. Acta Ecol. Sin. 2003, 01, 158–172. [Google Scholar]
  22. Selkoe, K.A.; Toonen, R.J. Microsatellites for Ecologists: A Practical Guide to Using and Evaluating Microsatellite Markers. Ecol. Lett. 2006, 9, 615–629. [Google Scholar] [CrossRef]
  23. Du, Y.Q.; Jiang, L.Y.; Wang, Y.H. SSR-Based Genetic Diversity Analysis of Taxus chinensis var. mairei in the Taihang Mountain Area. Biotechnol. Bull. 2025, 41, 214–224. [Google Scholar] [CrossRef]
  24. Shaw, J.; Lickey, E.B.; Schilling, E.E.; Small, R.L. Comparison of Whole Chloroplast Genome Sequences to Choose Noncoding Regions for Phylogenetic Studies in Angiosperms: The Tortoise and the Hare III. Am. J. Bot. 2007, 94, 275–288. [Google Scholar] [CrossRef] [PubMed]
  25. Tao, T.Z. Conservation Genetics of Thuja sutchuenensis. Master’s Thesis, Sichuan University, Chengdu, China, 2023. (In Chinese) [Google Scholar]
  26. Frankham, R. Genetics and Extinction. Biol. Conserv. 2005, 126, 131–140. [Google Scholar] [CrossRef]
  27. Spielman, D.; Brook, B.W.; Frankham, R. Most Species Are Not Driven to Extinction Before Genetic Factors Impact Them. Proc. Natl. Acad. Sci. USA 2004, 101, 15261–15264. [Google Scholar] [CrossRef] [PubMed]
  28. Wang, J.; Dai, W.; Chen, J.; Ye, K.; Lai, Q.; Zhao, D. Assessment of Genetic Diversity and Genetic Structure of Saussurea medusa (Asteraceae), a “Sky Island” Plant in the Qinghai–Tibet Plateau, Using SRAP Markers. Plants 2023, 12, 2463. [Google Scholar] [CrossRef]
  29. Hewitt, G.M. Genetic Consequences of Climatic Oscillations in the Quaternary. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2004, 359, 183–195; discussion 195. [Google Scholar] [CrossRef] [PubMed]
  30. Excoffier, L.; Heckel, G. Computer Programs for Population Genetics Data Analysis: A Survival Guide. Nat. Rev. Genet. 2006, 7, 745–758. [Google Scholar] [CrossRef]
  31. Nei, M.; Maruyama, T.; Chakraborty, R. The Bottleneck Effect and Genetic Variability in Populations. Evolution 1975, 29, 1–10. [Google Scholar] [CrossRef]
  32. Chikhi, L.; Sousa, V.C.; Luisi, P.; Goossens, B.; Beaumont, M.A. The Confounding Effects of Population Structure, Genetic Diversity and the Sampling Scheme on the Detection and Quantification of Population Size Changes. Genetics 2010, 186, 983–995. [Google Scholar] [CrossRef]
  33. Hirao, A.S.; Kudo, G. Landscape Genetics of Alpine-Snowbed Plants: Comparisons Along Geographic and Snowmelt Gradients. Heredity 2004, 93, 290–298. [Google Scholar] [CrossRef]
  34. Jing, H.; Wang, W.; Zhang, P.; Wang, Y.; Pang, J.; Zhang, Y.; Li, Z.; Yan, Y.; Zhang, H.; Zheng, D. Multi-stage Uplift and Propagation of the Chinese East Tianshan during the Cenozoic. Tectonics 2025, 44, e2024TC008666. [Google Scholar] [CrossRef]
  35. Su, B.H.; Jiang, D.W. Simulation of Air-Sea Coupling Model of Tibetan Plateau Uplift and Asian Inland Aridification. Quat. Sci. 2016, 36, 935–944. [Google Scholar] [CrossRef]
  36. Ding, W.; Ree, R.H.; May, M.R.; Brun, P.; Hagen, O.; Karger, D.N.; Skeels, A. The Asynchronous Rise of Northern Hemisphere Alpine Floras Reveals General Responses of Biotic Assembly to Orogeny and Climate Change. Sci. Adv. 2025, 11, 1888. [Google Scholar] [CrossRef]
  37. Aguilar, R.; Quesada, M.; Ashworth, L.; Herrerias-Diego, Y.; Lobo, J. Genetic Consequences of Habitat Fragmentation in Plant Populations: Susceptible Signals in Plant Traits and Methodological Approaches. Mol. Ecol. 2008, 17, 5177–5188. [Google Scholar] [CrossRef] [PubMed]
  38. Schaal, B.A.; Olsen, K.M. Gene Genealogies and Population Variation in Plants. Proc. Natl. Acad. Sci. USA 2000, 97, 7024–7029. [Google Scholar] [CrossRef] [PubMed]
  39. Peng, X.C.; Pan, Y.; Zhu, X.Y. Morphological Characters and Wind Dispersal Property of Pappose Seeds for Seven Common Compositae Weeds in Yunnan, China. J. Yunnan Univ. (Nat. Sci. Ed.) 2018, 40, 1024–1033. [Google Scholar] [CrossRef]
  40. Lu, H.F.; King, Y.V.; Jia, D. Late Cenozoic Basin-Mountain Coupling Mechanism Between Tarim Basin and Tianshan Mountains. J. Geol. Coll. Univ. 2005, 11, 493–503. [Google Scholar] [CrossRef]
  41. Dai, P.F. Reproductive Ecology of Saussurea involucrata. Master’s Thesis, Xin Jiang Agricultural University, Urumqi, China, 2008. (In Chinese) [Google Scholar]
  42. Huang, J.H. Biological Characteristics of Saussurea involucrata. Master’s Thesis, Xinjiang Agricultural University, Urumqi, China, 2004. (In Chinese) [Google Scholar]
  43. Tan, D.Y.; Zhu, J.W.; Yao, F.; Tian, Y.W. Study on reproductive ecology of Saussurea involucrata I. Habitat, Botany and Phenological Characteristics. J. Xinjiang Agric. Univ. 1998, 01, 3–7. [Google Scholar] [CrossRef]
  44. Peakall, R.; Smouse, P.E. Genalex 6: Genetic Analysis in Excel. Population Genetic Software for Teaching and Research. Mol. Ecol. Notes 2006, 6, 288–295. [Google Scholar] [CrossRef]
  45. Ramakrishnan, M.; Antony Ceasar, S.; Duraipandiyan, V.; Al-Dhabi, N.A.; Ignacimuthu, S. Assessment of genetic diversity, population structure and relationships in Indian and non-Indian genotypes of finger millet (Eleusine coracana (L.) Gaertn) using genomic SSR markers. SpringerPlus 2016, 5, 120. [Google Scholar] [CrossRef]
  46. Li, Y.L.; Liu, J.X. StructureSelector: A web-based software to select and visualize the optimal number of clusters using multiple methods. Mol. Ecol. Resour. 2018, 18, 176–177. [Google Scholar] [CrossRef] [PubMed]
  47. Cornuet, J.M.; Luikart, G. Description and Power Analysis of Two Tests for Detecting Recent Population Bottlenecks from Allele Frequency Data. Genetics 1996, 144, 2001–2014. [Google Scholar] [CrossRef]
  48. Spencer, C.C.; Neigel, J.E.; Leberg, P.L. Experimental Evaluation of the Usefulness of Microsatellite DNA for Detecting Demographic Bottlenecks. Mol. Ecol. 2000, 9, 1517–1528. [Google Scholar] [CrossRef] [PubMed]
  49. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef]
  50. Librado, P.; Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009, 25, 1451–1452. [Google Scholar] [CrossRef]
  51. Clement, M.; Snell, Q.; Walke, P.; Posada, D.; Crandall, K. TCS: Estimating Gene Genealogies. In Proceedings of the Proceedings 16th International Parallel and Distributed Processing Symposium 7, Washington, DC, USA, 15–19 April 2002; IEEE: Fort Lauderdale, FL, USA, 2002. [Google Scholar]
  52. Fick, S.E.; Hijmans, R.J. WorldClim 2: New 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 2017, 37, 4302–4315. [Google Scholar] [CrossRef]
  53. Wei, X.; Jiang, M. Meta-analysis of genetic representativeness of plant populations under ex situ conservation in contrast to wild source populations. Conserv. Biol. 2020, 35, 12–23. [Google Scholar] [CrossRef]
  54. Huang, H. LinkET: Everything Is Linkable. R Package, Version 0.0.3; Github Repository: Shenzhen, China, 2021; Available online: https://github.com/Hy4m/linkET (accessed on 4 September 2024).
  55. Wickham, H.; Chang, W.; Wickham, M.H. Package ‘ggplot2’. Create elegant data visualisations using the grammar of graphics. Version 2016, 2, 1–89. Available online: https://github.com/tidyverse/ggplot2 (accessed on 4 September 2024).
  56. Laliberté, E.; Legendre, P. A distance-based framework for measuring functional diversity from multiple traits. Ecology 2010, 91, 299–305. [Google Scholar] [CrossRef] [PubMed]
  57. Imdadullah, M.; Aslam, M. mctest: An R Package for Detection of Collinearity among Regressors. R J. 2016, 8, 495–505. [Google Scholar] [CrossRef]
  58. Molinari, J. A Bare-Bones Scheme to Choose Between the Species, Subspecies, and ‘Evolutionarily Significant Unit’ Categories in Taxonomy and Conservation. J. Nat. Conserv. 2023, 72, 126335. [Google Scholar] [CrossRef]
Figure 1. (a) Pairwise population matrix of Fst values of S. involucrata populations; (b) pairwise population matrix of Nei genetic distance of S. involucrata populations. 1–11 are populations distributed in the Bayinbuluke area; 12–16 are populations of the Eastern Tianshan Mountains. 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in Hami region.
Figure 1. (a) Pairwise population matrix of Fst values of S. involucrata populations; (b) pairwise population matrix of Nei genetic distance of S. involucrata populations. 1–11 are populations distributed in the Bayinbuluke area; 12–16 are populations of the Eastern Tianshan Mountains. 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in Hami region.
Ijms 27 02274 g001
Figure 2. Principal component analysis (PCoA). 1–11 are populations distributed in the Bayinbuluke area; 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region.
Figure 2. Principal component analysis (PCoA). 1–11 are populations distributed in the Bayinbuluke area; 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region.
Ijms 27 02274 g002
Figure 3. ΔK values with different K parameters, where the optimal number of genetic clusters is determined via the ΔK method. The blue dots show the Delta K values corresponding to different K values. The red vertical line marks the peak of Delta K at K = 2.
Figure 3. ΔK values with different K parameters, where the optimal number of genetic clusters is determined via the ΔK method. The blue dots show the Delta K values corresponding to different K values. The red vertical line marks the peak of Delta K at K = 2.
Ijms 27 02274 g003
Figure 4. (a) ΔK = 2; (b) ΔK = 3. Bayesian analysis (STRUCTURE) clustering diagram of S. involucrata populations. 1–112 represents populations 1–11, while 113–168 denotes populations 13–16. 1–11 are populations distributed in Bayinbuluke area; 12–16 are populations of Eastern Tianshan Mountains. Each color represents a hypothetical ancestral population, with the proportions reflecting the ancestral composition. The colors themselves have no fixed meaning and are arbitrarily assigned by the software.
Figure 4. (a) ΔK = 2; (b) ΔK = 3. Bayesian analysis (STRUCTURE) clustering diagram of S. involucrata populations. 1–112 represents populations 1–11, while 113–168 denotes populations 13–16. 1–11 are populations distributed in Bayinbuluke area; 12–16 are populations of Eastern Tianshan Mountains. Each color represents a hypothetical ancestral population, with the proportions reflecting the ancestral composition. The colors themselves have no fixed meaning and are arbitrarily assigned by the software.
Ijms 27 02274 g004
Figure 5. The geographic distributions and frequencies of the nrDNA gene regions detected in the S. involucrata populations in the Tianshan Mountains, with their haplotype networks constructed by using Popart-1.7 TCS. The image within the black frame is a close-up view. The red arrow indicates the magnified area.
Figure 5. The geographic distributions and frequencies of the nrDNA gene regions detected in the S. involucrata populations in the Tianshan Mountains, with their haplotype networks constructed by using Popart-1.7 TCS. The image within the black frame is a close-up view. The red arrow indicates the magnified area.
Ijms 27 02274 g005
Figure 6. The geographic distributions and frequencies of the cpDNA gene regions detected in the S. involucrata populations in the Tianshan Mountains, with their haplotype networks constructed by using Popart-1.7 TCS. The image within the black frame is a close-up view. The red arrow indicates the magnified area.
Figure 6. The geographic distributions and frequencies of the cpDNA gene regions detected in the S. involucrata populations in the Tianshan Mountains, with their haplotype networks constructed by using Popart-1.7 TCS. The image within the black frame is a close-up view. The red arrow indicates the magnified area.
Ijms 27 02274 g006
Figure 7. Principal component analysis (PCoA). 1 represents the Bayinbuluke area and 2 represents the Eastern Tianshan Mountains.
Figure 7. Principal component analysis (PCoA). 1 represents the Bayinbuluke area and 2 represents the Eastern Tianshan Mountains.
Ijms 27 02274 g007
Figure 8. Mantel test of S. involucrata based on SSR molecular markers. An asterisk (*) indicates statistical significance: * represents statistical significance at the 95% confidence level (p-value < 0.05), ** at the 99% confidence level (p-value < 0.01), and *** at the 99.9% confidence level (p-value < 0.001).
Figure 8. Mantel test of S. involucrata based on SSR molecular markers. An asterisk (*) indicates statistical significance: * represents statistical significance at the 95% confidence level (p-value < 0.05), ** at the 99% confidence level (p-value < 0.01), and *** at the 99.9% confidence level (p-value < 0.001).
Ijms 27 02274 g008
Figure 9. Redundancy analysis of S. involucrata based on SSR molecular markers. On the horizontal axis, 84.74% of the data variance is explained, while on the vertical axis, 94.77% of the variance is revealed. Red represents geographical environmental factors. Black represents the population number. Blue represents genetic diversity parameters.
Figure 9. Redundancy analysis of S. involucrata based on SSR molecular markers. On the horizontal axis, 84.74% of the data variance is explained, while on the vertical axis, 94.77% of the variance is revealed. Red represents geographical environmental factors. Black represents the population number. Blue represents genetic diversity parameters.
Ijms 27 02274 g009
Table 1. Genetic diversity parameters of 16 populations.
Table 1. Genetic diversity parameters of 16 populations.
Pop *N ** NaNeIHoHe
110.0563.0562.3700.8020.3420.457
27.4442.8892.2420.7670.4220.440
38.6673.5562.5750.9330.3590.518
49.5562.9442.3080.8550.4090.505
59.4442.9442.3140.8510.3780.501
69.1113.2222.6680.8580.3570.477
79.1673.1672.2300.8300.3580.472
89.7783.0002.3010.8240.3160.481
910.6673.2782.2710.8410.3660.467
109.5562.2781.8380.5370.3490.315
118.9442.6671.9540.6800.3060.396
1220.2786.9443.5911.3770.4810.647
1319.6116.2223.0051.2170.5340.598
144.1673.3892.7190.9290.4590.499
154.4443.1112.4420.9300.5030.537
163.6673.0002.5250.8800.4880.503
Total
Mean9.6603.4792.4590.8820.4020.488
* 1–11 are populations distributed in the Bayinbuluke area; 12–16 are populations of the Eastern Tianshan Mountains. 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region. ** Number of alleles (N); effective number of alleles (Na); effective number of alleles (Ne); Shannon diversity index (I); observed heterozygosity (Ho); expected heterozygosity (He).
Table 2. Analysis of molecular variance within and among populations.
Table 2. Analysis of molecular variance within and among populations.
SourcedfSSMSEst. Var.%
Among Pops15278.38518.5590.2212%
Among Indiv1522124.26613.9755.31760%
Within Indiv168561.5003.3423.34238%
Total3352964.152 8.880100%
Table 3. Detection of bottleneck effect in S. involucrata populations.
Table 3. Detection of bottleneck effect in S. involucrata populations.
Pop *Sign Test Wilcoxon TestMode-Shift
One Tail for H Excess
T.P.MS.M.MT.P.MS.M.M
10.109490.11730.003810.00775normal L-shaped distribution
20.236690.432350.11560.18773shifted mode
30.573170.414470.201880.33885normal L-shaped distribution
40.009340.00970.007750.00912shifted mode
50.042970.045520.009120.00912shifted mode
60.014080.014910.006230.01077shifted mode
70.522730.500030.373330.48161normal L-shaped distribution
80.018520.022270.004670.01008shifted mode
90.435790.422290.444980.5normal L-shaped distribution
100.21380.23040.033690.06152shifted mode
110.145230.33190.21660.24771shifted mode
12000.99720.99974normal L-shaped distribution
130.006710.00150.997630.99974normal L-shaped distribution
140.544650.539770.576540.59802shifted mode
150.487750.50420.415860.44929shifted mode
160.186610.193580.036960.04672shifted mode
* 1–11 are populations distributed in the Bayinbuluke area; 12–16 are populations of the Eastern Tianshan Mountains. 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region.
Table 4. Information of 16 populations.
Table 4. Information of 16 populations.
Number *PopLatitudeLongitude
1B1 (11) **43.03° N83.50° E
2B2 (11)43.04° N83.59° E
3B3 (9)42.65° N84.62° E
4C1–C10 (10)42.57° N84.82° E
5C11–C20 (10)42.57° N84.90° E
6D1–D10 (10)42.84° N82.92° E
7D11–D20 (10)42.82° N83.01° E
8E1–E10 (10)42.87° N84.62° E
9E11–E21 (11)42.84° N84.73° E
10F1–F10 (10)42.65° N83.61° E
11F11–F20 (10)42.55° N83.58° E
12Z1–Z20 (21)43.84° N88.29° E
13T1–T20 (20)43.31° N89.68° E
14L1–L5 (5)43.01° N94.33° E
15M1–M5 (5)43.02° N94.34° E
16O1–O5 (5)43.00° N94.25° E
* 1–11 are populations distributed in the Bayinbuluke area; 12–16 are populations of the Eastern Tianshan Mountains. 12 is a population distributed in the Tianchi area; 13 is a population distributed in the Turpan area; 14–16 are populations distributed in the Hami region. ** The number in parentheses is the sample size.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lu, J.; Maimaiti, K.; Liu, J.; Zhang, D.; Tan, D.; Wang, J.; Shi, W. Assessing Genetic Structure and Diversity Using Multiple Molecular Markers to Guide Conservation Management of Saussurea involucrata in the Eastern Tianshan Mountains. Int. J. Mol. Sci. 2026, 27, 2274. https://doi.org/10.3390/ijms27052274

AMA Style

Lu J, Maimaiti K, Liu J, Zhang D, Tan D, Wang J, Shi W. Assessing Genetic Structure and Diversity Using Multiple Molecular Markers to Guide Conservation Management of Saussurea involucrata in the Eastern Tianshan Mountains. International Journal of Molecular Sciences. 2026; 27(5):2274. https://doi.org/10.3390/ijms27052274

Chicago/Turabian Style

Lu, Jiayi, Kelimunur Maimaiti, Jingdian Liu, Daoyuan Zhang, Dunyan Tan, Jiancheng Wang, and Wei Shi. 2026. "Assessing Genetic Structure and Diversity Using Multiple Molecular Markers to Guide Conservation Management of Saussurea involucrata in the Eastern Tianshan Mountains" International Journal of Molecular Sciences 27, no. 5: 2274. https://doi.org/10.3390/ijms27052274

APA Style

Lu, J., Maimaiti, K., Liu, J., Zhang, D., Tan, D., Wang, J., & Shi, W. (2026). Assessing Genetic Structure and Diversity Using Multiple Molecular Markers to Guide Conservation Management of Saussurea involucrata in the Eastern Tianshan Mountains. International Journal of Molecular Sciences, 27(5), 2274. https://doi.org/10.3390/ijms27052274

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop