Next Article in Journal
Whole Genome Re-Sequencing Reveals Insights into the Genetic Diversity and Fruit Flesh Color of Guava
Previous Article in Journal
Comparative Identification of LsWRKY Transcription Factors and Transcriptional Response to Abiotic and Biotic Stresses in Lagenaria siceraria
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genetic Diversity and Association Analysis of Dioscorea polystachya Germplasm Resources Based on Phenotypic Traits and SSR Markers

School of Advanced Agriculture and Bioengineering, Yangtze Normal University, Chongqing 408100, China
*
Authors to whom correspondence should be addressed.
Horticulturae 2025, 11(10), 1193; https://doi.org/10.3390/horticulturae11101193
Submission received: 2 July 2025 / Revised: 19 September 2025 / Accepted: 20 September 2025 / Published: 3 October 2025
(This article belongs to the Section Genetics, Genomics, Breeding, and Biotechnology (G2B2))

Abstract

Dioscorea polystachya (Chinese yam) is a crop valued for both medicinal and edible purposes, and exhibits rich genetic diversity. However, research into its germplasm resources remains understudied, and molecular breeding efforts lag behind. To bridge this gap, this study employed an integrated approach, combining the analysis of 23 phenotypic traits (17 qualitative and 6 quantitative) with genotyping using 19 polymorphic SSR markers. This combined strategy was applied to 53 accessions collected across 16 Chinese provinces to assess genetic diversity, population structure, and marker–trait associations. Phenotypic analysis revealed high diversity, with the Shannon diversity index (I) ranging from 0.09 to 1.15 for qualitative traits and from 1.45 to 1.79 for quantitative traits. Tuber traits exhibited the highest variability (with a CV up to 71.45%), indicating significant potential for yield improvement. Principal component analysis distilled phenotypic variation into eight principal components (accounting for 73.13% of the cumulative variance), and elite germplasm (e.g., DP24, DP52) was selected for breeding based on this analysis. Stepwise regression prioritized eight core evaluation traits (e.g., flowering rate, tuber length). SSR markers amplified 80 alleles (mean 4.211/locus), showing moderate genetic diversity (He = 0.529, PIC = 0.585). Population structure analysis divided accessions into two subpopulations, correlated with geographic origins: Group 1 (northern/southwestern China) and Group 2 (central/eastern China), reflecting adaptation to local climates and human selection. Association analysis identified 10 SSR loci significantly linked (p < 0.01) to key traits, including YM07_2 (flowering, R2 = 13.94%), YM37_2 (leaf margin color, R2 = 19.03%), and YM19_3 (leaf width, R2 = 19.34%). This study establishes a comprehensive genetic framework for Chinese yam, offering molecular tools for marker-assisted breeding and strategies to conserve high-diversity germplasm, thereby enhancing the utilization of this orphan crop.

1. Introduction

Yam (Dioscorea spp.) is an annual or perennial entwined vine plant of the Dioscorea genus in the Dioscoreaceae family [1]. It is a significant medicinal and edible crop with considerable nutritional and economic value. Its tuber is rich in starch, protein, choline, saponin, mucopolysaccharide, and other medicinal ingredients and minerals [2,3]. The origin of yams can be traced back to the Late Cretaceous period, with more than 650 species currently recognized within nine genera of the Dioscoreaceae family [4]. Yam (Dioscorea spp.) originated in Asia, Africa, and the Americas, where prolonged domestication established three distinct cultivation centers [5,6]. Among these, Asia serves as the primary distribution center, cultivating species including Chinese yam, D. alata, D. esculenta, D. japonica, D. bulbifera, D. hispida, and D. quinqueloba [7]. As a primary center of yam origin and domestication, China has documented Dioscorea species since antiquity—most notably in the pre-Qin era Classic of Mountains and Seas (Shanhaijing) [8]. The country harbors 93 native species, with Chinese yam, D. alata, D. esculenta, D. fordii, and D. persimilis being the most widely cultivated [9]. Chinese yam is endemic to China and uniquely adapted to temperate climates, exhibiting greater cold tolerance than other Dioscorea species [10]. Commonly known as ‘Huai Shan Yao’, it has been used in traditional medicine for over 2000 years to treat diarrhea, diabetes, and asthma. Recognized in the Chinese Pharmacopoeia, it remains a vital medicinal resource [11]. Chinese yam is extensively cultivated across China (excluding Qinghai and Xizang), with major production in Henan, Shandong, Jiangsu, Guangxi, Hunan, and Shanxi provinces [12]. According to statistics presented at the First National Yam Industry Development Conference (2023), the total cultivation area reached approximately 32,600 ha, yielding 450,000 t with an output value of CNY 2.7 billion.
Despite its significant nutritional and medicinal value, yam receives limited research attention and investment, resulting in its longstanding classification as an “orphan crop”—particularly exemplified by Chinese yam, which is well-known in China yet critically understudied globally [10]. Presently, research on Chinese yam is primarily concentrated on its medicinal value and the advancement of medicinal components, with the relatively limited investigation into resource types, evaluation, and genetic diversity [13,14,15]. Germplasm resources function as both repositories of genetic information and fundamental substrates for genetic improvement, while also serving as the essential basis for novel material development and cultivar breeding [16]. Genetic diversity research serves as a pivotal tool for assessing and leveraging germplasm resources [17]. Conducting genetic diversity analysis and comprehensive evaluation of Chinese yam germplasm resources enables enhanced conservation and characterization, reveals germplasm richness and genetic relationships, and provides elite materials to improve breeding quality and efficiency [18]. This constitutes a critical step toward advancing scientific innovation in Chinese yam germplasm utilization.
Phenotypic traits, isoenzymes, karyotype analysis, and DNA diversity have been used to study the genetic diversity of yam germplasm [9,19,20,21]. Phenotypic diversity represents the external expression of genetic diversity and constitutes the most fundamental methodology for the selection of germplasm resources and genetic background research [18,19,22]. The study of phenotypic genetic diversity in Chinese yam is primarily concerned with the examination of morphological traits, including leaves, stems, tubers, seeds, flowers, and roots [23,24,25,26,27,28]. Phenotypic traits are relatively straightforward to quantify; however, they are constrained by numerous factors and are particularly susceptible to environmental influences. Molecular markers are not susceptible to environmental influences and have been extensively employed in the fields of plant classification, breeding, and germplasm resource assessment [29]. Molecular markers such as Random Amplified Polymorphic DNA (RAPD), Inter-Simple Sequence Repeat (ISSR), Sequence-Related Amplified Polymorphism (SRAP), Amplified Fragment Length Polymorphism (AFLP), and Simple Sequence Repeats (SSR) have been extensively used in the investigation of the genetic diversity of yam [30,31,32]. However, most relevant studies have focused on D. alata, with scarce research reported on Chinese yam [18,30,31]. SSR molecular markers have the advantages of co-dominance, reproducibility, and high polymorphism, and have been widely used for the identification of germplasm resources and analysis of genetic diversity [32]. They are also one of the ideal markers for association analysis [33,34]. Association analysis leverages linkage disequilibrium to statistically assess associations between target traits and genetic markers in natural populations [35]. This approach has been widely adopted in plant genetics and breeding to identify genes and marker loci linked to agronomically important traits [36].
Chinese yam is dioecious and rarely flowers, and its seeds rarely develop to maturity, which makes cross-breeding for specific traits challenging [37]. Furthermore, it has long relied on asexual reproduction using tubers as planting material, leading to extensive viral accumulation, degradation of tuber quality, and significant reductions in both yield and quality. These are urgent issues that need to be addressed in Chinese yam agricultural production. However, studying the quantitative trait loci (QTL) associated with Chinese yam traits via linkage mapping is highly difficult and time consuming. Additionally, due to the late start of related research, the genome of Chinese yam has not yet been published, and the development of SNP molecular markers requires large-scale sequencing and incurs high costs. Therefore, using SSR molecular markers in combination with phenotypic traits to identify molecular markers associated with important phenotypic traits represents an effective approach to assist in the molecular breeding of Chinese yam. However, few such studies have been reported.
To clarify the genetic diversity of Chinese yam, identify molecular markers associated with important phenotypic traits, and provide a theoretical basis and material support for germplasm identification, innovation, and variety breeding, 53 germplasm resources of Chinese yam were collected from 16 provinces in the current study. Their genetic diversity and population structure were comprehensively characterized and evaluated by combining phenotypic traits with SSR molecular markers. Meanwhile, association analysis was performed to correlate quantitative traits with SSR molecular markers, aiming to identify marker loci associated with phenotypic traits. The findings of this study can provide a theoretical foundation and marker resources for molecular marker-assisted breeding in Chinese yam.

2. Materials and Methods

2.1. Plant Materials

A total of 53 Chinese yam germplasm resources were collected from major production regions across 16 provinces in China, encompassing both landraces and wild accessions (Table S1 and Figure 1A). All samples were planted in the Yam Germplasm Resource Garden of Jiangxi Agricultural University (Nanchang City, Jiangxi Province) in 2019. Tubers (80–120 g in weight) were used as seedings, with a row spacing of 20 cm × 1.2 m. Each plant was supported by a 2 m high bamboo pole for twining and was regularly managed with standard fertilization, weeding, irrigation, and pest control (Figure 1B). Each accession was planted with 10 individual plants. To ensure representativeness, 6 plants were randomly selected from the 10 for phenotypic trait determination, with three replicates conducted for each trait.

2.2. Determination of Phenotypic Traits

Twenty-three phenotypic traits of leaves, stems, flowers, aerial tubers, tubers, and roots were investigated, including seventeen qualitative traits and six quantitative traits (Table 1). The investigation of leaf, stem, aerial tuber, and flower traits was conducted from June to September 2019 during the growing season, while that of tuber and root traits was conducted from October to November 2019 after harvest. The phenotyping methodology adhered to published standards [38,39,40]. All traits were evaluated using the descriptor-based grading system detailed in Table 1.

2.3. SSR Molecular Marker Analysis

Genomic DNA was extracted from leaves using the Takara MiniBest Plant Genomic DNA Extraction Kit (Takara, Beijing, China). The DNA concentration was measured with a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA), and integrity was confirmed using 1.0% (w/v) ethidium bromide-stained agarose gels. Samples were diluted to 20 ng/μL and stored at −20 °C. Nineteen SSR markers, which exhibited polymorphic bands across all accessions, were selected from the initial 53 amplified markers for further analysis (Table S2). Primer synthesis was performed by Sangon Biotech Co., Ltd. (Shanghai, China). The PCR amplification reaction mixture and amplification protocol for SSR markers were conducted according to Cao et al. [41].

2.4. Data Analysis

Phenotypic trait data were organized and calculated using Microsoft Excel 2020. Statistical analyses, including principal component analysis, correlation analysis, stepwise regression analysis, and calculation of descriptive statistics (maximum, minimum, mean, standard deviation (SD), and coefficient of variation (CV)), were performed using SPSS 24.0. The formula for calculating the Shannon–Wiener diversity index (I) is: I = −∑ (pi × ln pi), where pi is the percentage of materials in the i-th level of a certain trait relative to the total number of materials [21]. Quantitative trait data were divided into 10 grades, ranging from the first grade [xi < (x − 2σ)] to the tenth grade [xi > (x + 2σ)] with an interval of 0.5σ for each grade, and the Shannon–Wiener diversity index (I) was calculated [42].
SSR bands were scored as present (1) or absent (0) based on electrophoretic banding patterns to construct a binary matrix. The number of alleles, allele frequencies, gene diversity, and polymorphism information content (PIC) index were calculated using PowerMarker V3.25 [43]. Observed Heterozygosity (Ho), Expected Heterozygosity (He), Gene Diversity (H), and Polymorphic Information Content (PIC) are defined as: Ho = H/N, where H is the number of individuals with heterozygous genotypes in the population, and N is the total number of individuals; He = 1 − ∑pi2, where pi is the frequency of the i-th allele and k is the total number of alleles at the locus; H = 1/n∑(1 − ∑pi2), where n is the number of loci; PIC = 1 − ∑pi2 − ∑∑2pi2pj2, where pi and pj represent frequencies of distinct alleles.
The population structure of all Chinese yam accessions was analyzed using STRUCTURE Version 2.3 software based on a Bayesian model [44]. The number of populations (K) was set from 1 to 10. For each K value, five replicates were run with a burn-in period of 100,000 iterations followed by 100,000 MCMC iterations. The optimal K value was determined using the maximum likelihood method and ∆K value, which was taken as the number of population subgroups. A bar plot was generated based on the Q value (the probability that the genomic variation of the i-th accession originates from the k-th population). Association analysis was performed using TASSEL 5.0 with both the General Linear Model (GLM) and Mixed Linear Model (MLM). For GLM, the population structure Q-matrix of the 53 accessions was used as a covariate in regression analyses between 19 SSR markers and eight phenotypic traits (FL, AT, LAX, LMC, LL, LW, LWR, TL) to identify significant loci and calculate their explained phenotypic variation (R2). For MLM, combined covariates of the Q-matrix and Kinship matrix were applied to the same traits to determine associated loci and their R2 contributions [45]. The regression equations for GLM and MLM were Y = μ + βx + Qα + ε and Y = μ + βx + Qα + u + ε, respectively, where Y is the phenotypic trait vector, μ represents the population phenotypic mean, β denotes the marker genotype effect value, x is the marker genotype vector, Q indicates the population structure matrix, α stands for the covariate coefficients of population structure, ε refers to the random residual error, and u represents the individual random effect vector modeled by the Kinship matrix. The Kinship matrix was calculated using the Identity by State (IBS) algorithm with the formula K = (1/L)∑(number of shared alleles/2), where L is the total number of SSR markers (19) and K denotes the kinship coefficient between accessions i and j. SSR loci significantly associated with target traits were identified using a threshold of p ≤ 0.01.

3. Results

3.1. Analysis of Phenotypic Genetic Diversity of Chinese Yam Germplasm Resources

Significant phenotypic variation was observed in tuber, leaf, stem, and flower characteristics across the Chinese yam accessions (Figure 1C–F). The 17 qualitative traits from 53 germplasm accessions were categorized, with frequency distributions and diversity indices calculated for each trait (Table 2). The Shannon diversity index (I) for qualitative traits ranged from 0.09 to 1.15 (mean = 0.59). The genetic diversity indices for flowering, leaf shape, petiole color, and tuber skin color all exceeded 1. LS showed the highest diversity (I = 1.15), while the tuber secondary color of upper bulbils (TSCUB), and flesh color color (FC) demonstrated the lowest diversity (I = 0.09).

3.2. Variation Analysis of Quantitative Traits

The six quantitative traits across 53 Chinese yam accessions exhibited substantial variation, with Shannon diversity indices (I) ranging from 1.45 to 1.79 (all > 1; Table 3). The variation in the six quantitative traits among the 53 accessions differed, with coefficients of variation ranging from 15.80% to 71.45%. The coefficients of variation, in descending order, were for tuber flesh weight, tuber diameter, tuber length, leaf width, leaf length, and length-to-width ratio. Specifically, the coefficients of variation for tuber length, tuber diameter, and tuber flesh weight were 31.10%, 44.36%, and 71.45%, respectively, indicating that tubers have great potential for genetic improvement. Furthermore, the frequency distributions of the six quantitative traits exhibited continuous normal distributions, which meet the phenotypic data requirements for association analysis (Figure 2).

3.3. Comprehensive Evaluation of Phenotypic Traits of Germplasm Resources

3.3.1. Principal Component Analysis of Phenotypic Traits

Principal component analysis of the 23 phenotypic traits revealed that eight new independent comprehensive indices with eigenvalues greater than 1 could be extracted, with a cumulative contribution rate of 73.125% (Table 4). The eigenvalues of these eight principal components are 3.67, 3.03, 2.49, 2.02, 1.73, 1.41, 1.36, and 1.11, respectively, with corresponding contribution rates of 15.95%, 13.16%, 10.83%, 8.80%, 7.50%, 6.14%, 5.91%, and 4.83%, respectively. For the first principal component, the eigenvector of LV has the highest absolute value, followed by those of LW, LL, LC, and LAX, indicating that the first principal component is a comprehensive index related to leaf factors. For the second principal component, the eigenvector of LWR has the highest absolute value, followed by those of LW, LC, PRT, and FL. For the third principal component, the eigenvectors of TSCUB and FC have the highest absolute values, both at 0.746. The highest absolute values of the eigenvectors for the fourth, fifth, sixth, seventh, and eighth principal components correspond to YPP, AT, TD, LVC, and RHD, respectively.

3.3.2. Comprehensive Evaluation of Germplasm Resources

The data for the 23 phenotypic traits of the 53 accessions were standardized, and the factor score coefficients of the eight principal components mentioned above were substituted to obtain principal component scores. The comprehensive scoring formula for accessions was derived based on the values of D1–D8 and the contribution rate weights of each principal component: D = 0.22D1 + 0.18D2 + 0.15D3 + 0.12D4 + 0.10D5 + 0.08D6 + 0.08D7 + 0.07D8. This formula was used to evaluate the phenotypic traits of the 53 Chinese yam germplasm accessions. The top ten accessions with the highest comprehensive scores were DP24, DP52, DP22, DP5, DP23, DP42, DP39, DP1, DP2, and DP47 (Table 5). Correlation analysis between the phenotypic trait D value and the 23 phenotypic traits is shown in Figure 3. The D value was highly significantly correlated with six phenotypic traits: it was negatively correlated with LWR and positively correlated with LC, LV, LW, TD, and YPP (Figure 3).
To screen the optimal evaluation indicators for Chinese yam germplasm resources, an optimal regression equation was constructed using 23 phenotypic traits and the comprehensive score D value. Stepwise linear regression was performed with the 23 phenotypic traits as independent variables and D values as dependent variables, yielding the optimal equation: y = 0.562 − 0.025x1 − 0.028x2 + 0.045x5 + 0.040x7 − 0.016x17 + 0.02x18 − 0.081x19 + 0.001x20 (r = 0.776, R2 = 0.602, F = 2.064, p = 0.033), where x1, x2, x5, x7, x17, x18, x19, and x20 represent FL, AT, LAX, LMC, LL, LW, LWR, and TL, respectively. The correlation coefficient (r) and coefficient of determination (R2) of the equation were 0.776 and 0.602, respectively, indicating that the eight independent variables can explain 60.2% of the total variation in the D value. With an F value of 2.064 and a p value of 0.033, the equation reached a significant level, confirming its suitability for evaluating the comprehensive performance of Chinese yam germplasm. Thus, the phenotypic traits FL, AT, LAX, LMC, LL, LW, LWR, and TL were the main indicators for the comprehensive evaluation of Chinese yam germplasm resources.

3.4. Genetic Diversity Analysis of SSR Molecular Markers

3.4.1. Analysis of SSR Molecular Marker Polymorphism and Genetic Diversity

As shown in Table 6, 19 SSR markers amplified a total of 80 alleles in the 53 germplasm accessions, with an average of 4.211 alleles per marker. The primer YM32 yielded the highest number of alleles (eight). The average number of effective alleles was 2.846; the highest was observed for YM06 (6.531), and the lowest for YM30 (1.110). Shannon’s diversity index (I) ranged from 0.205 to 1.970, with an average of 1.004. The average observed heterozygosity (Ho), expected heterozygosity (He), and Nei’s gene diversity (H) were 0.556, 0.529, and 0.523, respectively. The polymorphism information content (PIC) index ranged from 0.15 to 0.942, with a mean of 0.585.

3.4.2. Analysis of Population Genetic Structure

The genetic structure of the Chinese yam population was analyzed using SSR data based on a Bayesian clustering model. The optimal number of groups (K) was estimated according to Evanno et al. [46], with the highest ΔK value for the 53 individuals observed at K = 2 (Figure 4A). At K = 2, the 53 germplasm accessions were divided into two groups: the first group (11 accessions) is represented in red, and the second group (42 accessions) is represented in green. These two groups are denoted by red and green bars in the bar plot, containing 11 and 42 germplasm accessions, respectively (Figure 4B). Additionally, Group 1 exhibited a lower leaf width ratio, shorter and thicker tubers, along with higher yield, whereas Group 2 showed contrasting characteristics but demonstrated greater phenotypic diversity (Table S3).
The phylogenetic tree constructed using the neighbor-joining method is shown in Figure 4C; the 53 germplasm accessions were clustered into two populations (red and green), consistent with the results of the population structure analysis. Specifically, accessions DP41, DP47, DP51, DP15, DP53, DP52, DP38, DP33, DP50, DP31, and DP34 were grouped together, with the remaining accessions forming the second group.
The results of principal component analysis (PCA) were consistent with those of the phylogenetic tree and population structure analysis, as the 53 germplasm accessions were also divided into two groups (Figure 4D).

3.5. Linkage Disequilibrium Analysis

Linkage disequilibrium analysis between SSR loci is shown in Figure 5. Among the 4005 locus combinations of the 19 SSR loci, a considerable proportion of loci were in linkage disequilibrium. A high degree of linkage disequilibrium (D > 0.5) was observed in 1734 loci combinations, accounting for 43.30%. Additionally, 183 combinations showed highly significant differences (p < 0.01), with a proportion of 4.67%.

3.6. Association Analysis of SSR Markers and Phenotypic Traits of Chinese Yam

The general linear model (GLM) and mixed linear model (MLM) were used to conduct regression analysis of the main phenotypic data and genotype data to identify loci associated with phenotypic traits. A total of 82 marker loci showed significant associations with FL, AT, LAX, LMC, LL, LW, LWR, and TL at the p < 0.05 level. The results from GLM and MLM analyses indicated that seven marker loci were significantly correlated with the phenotypic traits of FL, AT, LAX, LMC, LL, LW, LWR, and TL at the p < 0.01 level (Table 7). The marker gene significantly associated with AT was YM06_3 (350 bp) with explanation rates of 18.03%. The marker loci associated with LMC were YM13_3 (400 bp), YM37_2 (350 bp), and YM37_7 (100 bp), with explanatory rates of 18.72%, 19.03%, and 13.73%, respectively. The marker loci Da1D08_4 (300 bp) and YM06_1 (500 bp) were significantly correlated with LL and LWR, with explanatory rates of 16.46% and 13.79%, respectively.
MLM analysis revealed nine loci that are significantly associated with FL, AT, LMC, LL, LWR, and LW. The marker loci significantly correlated with FL and AT were YM07-2 (400 bp) and YM13_3 (400 bp), with an explanatory rate of 13.94% and 12.96%, respectively. The MLM analysis identified loci associated with LMC, consistent with the GLM results. The MLM analysis also identified loci significantly associated with LL, LWR, and LW, including YM19_3 (130 bp), YM06_1 (500 bp), and YM19_3 (130 bp) with explanatory rates of 15.94%, 16.70%, and 19.34%, respectively. Both GLM and MLM analyses identified six loci significantly associated with AT, LMC, LL, and LWR.

4. Discussion

Phenotypic trait analysis is a fundamental and intuitive method for studying plant genetic diversity, as it can reveal plant diversity, explore genetic resources, and provide an important basis for researching the mechanisms of complex traits [47,48,49]. For Chinese yam, the investigation and analysis of phenotypic traits in germplasm resources represent the primary work in germplasm research, the foundational work in breeding, and a simple yet effective approach. This method can reveal the genetic diversity of yam, explore genetic resources, and provide an important basis for studying complex traits. Currently, there are few relevant reports on the comprehensive evaluation of Chinese yam phenotypic characteristics. However, conclusions drawn from phenotypic markers have certain limitations; thus, it is more scientific and accurate to evaluate the genetic diversity of germplasm resources by combining phenotypic traits with molecular markers. Furthermore, association analysis based on population structure was conducted to identify significant loci and favorable alleles associated with target traits, providing genetic resources and molecular markers for Chinese yam breeding.

4.1. Comprehensive Evaluation of Phenotypic Characteristics of Chinese Yam Germplasm Resources

Principal component analysis is a multivariate statistical analysis technique, which is conducive to improving breeding levels and parent selection effects in multi-objective breeding [50,51]. PCA can convert multiple indicators into a few principal components, which are not correlated with each other and can reflect most of the information of the original data. This method has been widely used in the comprehensive evaluation of germplasm resources, but few studies have been reported on Chinese yam. In this study, 23 phenotypic traits were simplified into eight independent principal components through principal component analysis, with a cumulative contribution rate of 73.13%, representing most of the information on phenotypic traits. The F value of each resource was calculated by constructing the function expression of principal components, and the germplasm with excellent comprehensive characters, such as DP24, DP52, DP22, and DP5, was selected, which could be used for germplasm innovation and genetic improvement of Chinese yam. A mathematical model for phenotype evaluation of Chinese yam germplasm was established through comprehensive correlation analysis and stepwise regression analysis, and the key indicators for comprehensive evaluation were selected as FL, AT, LAX, LMC, LL, LW, LWR, and TL to provide a reference for genetic breeding research of Chinese yam.
PCA can convert multiple indicators into a few uncorrelated principal components, which collectively retain most of the information from the original data. This method has been widely used in the comprehensive evaluation of germplasm resources, but few studies have been reported for Chinese yam. In this study, 23 phenotypic traits were simplified into eight independent principal components via PCA, with a cumulative contribution rate of 73.13%, thus capturing most of the phenotypic trait information. By constructing a function expression for these principal components, we calculated the comprehensive score (D value) for each germplasm resource and identified accessions with excellent comprehensive traits, such as DP24, DP52, DP22, and DP5. These can be used for germplasm innovation and genetic improvement of Chinese yam. Through comprehensive correlation analysis and stepwise regression analysis, a mathematical model for phenotypic evaluation of Chinese yam germplasm was established. The key indicators selected for comprehensive evaluation were FL, AT, LAX, LMC, LL, LW, LWR, and TL, providing a reference for genetic breeding research on Chinese yam.

4.2. Analysis of Genetic Diversity of Yam Germplasm Based on Phenotypic Traits and SSR Molecular Markers

Genetic diversity levels can be reflected by the coefficient of variation and Shannon–Wiener index, where a high diversity index indicates rich diversity. The coefficient of variation reflects the variation characteristics of phenotypic traits, while the Shannon–Wiener index reflects their diversity [52]. The Shannon–Wiener index can also reflect the range of variation and the distribution of genotype frequencies, making it widely used to evaluate phenotypic trait diversity [53]. In this study, the phenotypic diversity of 53 Chinese yam germplasm accessions was analyzed using 17 qualitative traits and 6 quantitative traits. High genetic diversity was observed for several phenotypic traits of Chinese yam, including FL, LS, PC, and TSC (I > 1), which is consistent with our previous analysis of phenotypic genetic diversity in five Dioscorea species [41].
SSR markers, characterized by high abundance, stability, and polymorphism, have been widely used in evaluating genetic diversity, constructing genetic maps, and determining species lineages across many plants. For the 19 pairs of SSR primers selected in this study, the average values of Na, Ne, I, Ho, He, H, and PIC were 4.211, 2.846, 1.004, 0.556, 0.529, 0.523, and 0.585, respectively—similar to our previous results on genetic diversity in five yam species [41]. Zhang et al. (2025) developed SSR markers via high-throughput sequencing and selected 42 polymorphic primer pairs to analyze the genetic diversity in 101 yam accessions [36]. Their reported mean values for Na, I, and PIC were 5.409, 0.551, and 0.586, respectively. Compared with the present study, their Na was higher, I was lower, and PIC was similar. These discrepancies may be attributed to differences in primer selection and plant materials. As a geographically restricted endemic to China, Chinese yam remains understudied, with few reports on SSR marker-based genetic diversity analyses. Zhou et al. analyzed the genetic diversity of 28 Chinese yam cultivars using ISSR markers and reported an I value of 0.3191, which is lower than that in our study [54]. Wang et al. analyzed the genetic diversity of 51 Chinese yam accessions from six species using ISSR, ISAP, SRAP, and SCAR markers, with reported H and I values of 0.4950 and 0.6762, respectively [55]. These differences may stem from variations in the type and quantity of test materials, as well as the types of molecular markers used. Additionally, primers YM02, YM06, YM07, YM09, YM32, YM35, YM37, Da1D08, and SSR-17—with PIC > 0.7 and clear amplification profiles—were identified as optimal for evaluating the genetic diversity of Chinese yam germplasm resources.

4.3. Analysis of Population Structure of Chinese Yam Germplasm Materials

Population structure analysis based on SSR markers revealed a clear genetic division of the 53 Chinese yam germplasm accessions into two groups (K = 2), supported by consistent results from PCA and neighbor-joining tree analyses (Figure 4B–D). This genetic divergence may be attributed to multiple factors, including geographical isolation, historical domestication events, and artificial selection.
The two genetic clusters showed a weak but noticeable correlation with geographical origins. For example, the first group (11 accessions) included materials from northern China (e.g., Hebei, Shandong, and Shanxi) and southwestern regions (e.g., Yunnan and Sichuan), while the second group (42 accessions) was dominated by accessions from central and eastern China (e.g., Henan, Jiangxi, and Hubei). This pattern aligns with the historical cultivation centers of Chinese yam, where northern varieties adapted to temperate climates (e.g., DP39 “Baiyu Yam” from Shandong) and southwestern accessions evolved under subtropical conditions (e.g., DP52 “Honglong Yam” from Yunnan). The presence of admixed individuals (e.g., DP41 “Chenji Tiegun Yam” from Shandong) suggests gene flow between regions due to human migration or trade.
Phenotypic divergence between the two genetic clusters may reflect adaptation to local selective pressures. Group 1’s combination of shorter tubers and higher flesh weight suggests selection for yield optimization in northern regions (e.g., Shanxi), whereas Group 2’s elongated tubers align with traditional shape preferences in central China. These ecotypic adaptations were consolidated via PCA, which linked tuber morphology and leaf architecture to primary axes of variation (Table 4).
The genetic structure provides a foundation for rational germplasm conservation and breeding. Group 1 contains rare wild accessions (e.g., DP52 from Yunnan), which should be prioritized for in situ conservation to preserve adaptive alleles. Group 2, dominated by cultivated varieties, offers potential for trait-specific breeding (e.g., high-yield lines from Henan). Future studies should integrate environmental variables (e.g., altitude, temperature) to disentangle the roles of natural selection and human intervention in shaping the population structure.

4.4. Correlation Analysis Between SSR Markers and Phenotypic Traits of Chinese Yam

Association analysis between SSR markers and phenotypic traits is pivotal for linking genetic diversity to agronomic performance, providing a foundation for marker-assisted selection (MAS) in breeding programs [56,57]. Hansen et al. identified two AFLP markers associated with genes influencing the vernalization requirement before bolting in Beta vulgaris ssp. maritima [58]. This study represented the first reported application of association analysis at the whole-genome level, which subsequently became a widely investigated approach. However, relatively few studies on association analysis for Chinese yam have been reported to date.
In this study, 10 SSR loci were significantly associated with eight key traits (e.g., flowering, leaf morphology, and tuber characteristics) at p < 0.01 using GLM and MLM (Table 7). It is noteworthy that the GLM and MLM models detected seven and nine associated loci, respectively, with an overlap of six common loci, highlighting their distinct detection patterns and complementary nature. In the GLM analysis, only the Q-matrix derived from subpopulation membership was employed, whereas the MLM analysis—by incorporating both population structure and kinship—typically achieves a significant reduction in false-positive loci [59]. In this study, the AT-associated locus YM13_3 showed significant association exclusively in GLM but was undetectable via MLM, suggesting potential false positives due to population stratification. The loci consistently identified by both models (e.g., YM37_7 associated with leaf vein color, LMC) likely represent the most reliable associations, with the MLM model explaining 20.16% of the phenotypic variance. Future studies should prioritize further validation of loci identified through the MLM framework. These association results highlight the potential of SSR markers in elucidating the genetic basis of complex traits and accelerating genetic improvement in Chinese yam. The locus YM07_2 (400 bp) showed a significant association with flowering traits (FL) (p = 0.0093, R2 = 13.94%), suggesting that it may be related to regulatory genes controlling flowering. Delayed or absent flowering in Chinese yam is a critical bottleneck for seed-based breeding due to its dioecious nature and reliance on vegetative propagation. This marker could serve as a diagnostic tool to screen early-flowering germplasm, facilitating hybridization and genetic studies. The marker YM19_3 (350 bp) explained 19.34% of the variation in leaf width (LW), indicating that this trait may be under strong genetic control. Leaf morphology influences photosynthetic efficiency and stress adaptation, and this marker could guide the selection of germplasm with optimal leaf architecture for specific environments. Consistent with the present study, Zhang et al. identified 15 loci associated with stem diameter, leaf shape, leaf length, leaf width, petiole length, petiole color, sex type, and single-head weight using 44 SSR markers [49]. In summary, conducting significant correlation analysis between phenotypic traits and SSR markers provides insights into the genetic basis of agronomically important traits in Chinese yam, thus enhancing breeding efficiency and the success rate of cultivar development.

5. Conclusions

This study aimed to analyze the genetic diversity, population structure, and trait-associated molecular markers of 53 Chinese yam germplasm accessions using phenotypic traits and SSR markers, providing a theoretical basis for germplasm identification, innovation, and breeding. Phenotypic analysis revealed high genetic diversity in Chinese yam, with Shannon’s diversity index (I) > 1 for traits such as flowering (FL), leaf shape (LS), petiole color (PC), and tuber skin color (TSC). Tuber traits (e.g., tuber flesh weight) exhibited a coefficient of variation up to 71.45%, indicating significant potential for yield improvement. These results systematically characterize the diversity level of the germplasm, fulfilling the goal of evaluating genetic variability. SSR-based population structure analysis divided the 53 accessions into two subpopulations, corresponding to geographical origins in northern/southwestern China and central/eastern China, respectively. This pattern reflects the combined effects of climatic adaptation and human selection, providing a basis for tracing germplasm origins and targeted conservation. Association analysis identified 10 SSR loci significantly linked (p < 0.01) to key traits, including YM07_2 (flowering, R2 = 13.94%), YM37_2 (leaf margin color, R2 = 19.03%), and YM19_3 (leaf width, R2 = 19.34%). These markers offer direct tools for marker-assisted breeding, addressing the objective of mining functional genetic loci. In summary, by integrating phenotypic and molecular data, this study establishes a comprehensive evaluation system for Chinese yam germplasm. It not only reveals genetic diversity and population differentiation but also identifies elite accessions (e.g., DP24, DP52) and functional markers for breeding, laying a foundation for germplasm conservation, genetic improvement, and industrial utilization of this species.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/horticulturae11101193/s1, Table S1: List of yam accessions and their origin; Table S2: SSR Primers; Table S3: Phenotypic traits and Classes.

Author Contributions

Methodology, D.T. and Y.Y.; Investigation, R.T. and G.Y.; Resources, M.H. and M.T.; Writing—original draft, P.D.; Writing—review & editing, T.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Chongqing Natural Science Foundation Project (2023NSCQ-MSX1502), the Science and Technology Research Program of Chongqing Municipal Education Commission (KJZD-M202401401), and the Chongqing Social Science Planning Project (2024NDYB050).

Data Availability Statement

All the databases are available by request for public research. All requests must be addressed to College of Modern Agriculture and Bioengineering, Yangtze Normal University (sixu5678000@163.com), Chongqing.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  1. Pei, J.; Ding, Z.Z. Flora of China; Science Press: Beijing, China, 1985; Volume 16, pp. 103–105. [Google Scholar]
  2. Huang, D.P.; Mo, J.G.; Lao, W.Y.; Huang, Y.F.; He, J.Z.; He, B.Z. Analysis and comparison of sixteen kinds of elements in guangshanyao and huaishanyao. Guangdong Trace Elem. Sci. 2002, 9, 47–50. [Google Scholar]
  3. Xue, J.P.; Zhu, H.L.; Rao, J.Y. Study on nutrient components of irons tick yam and bubil. Sci. Technol. Food Ind. 2008, 8, 268–269. [Google Scholar]
  4. Maurin, O.; Muasya, M.; Catalan, P.; Shongwe, E.Z.; Viruel, J.; Wilkin, P.; van der Bank, M. Diversification into novel habitats in the Africa clade of Dioscorea (Dioscoreaceae): Erect habit and elephant’s foot tubers. BMC Evol. Biol. 2016, 16, 238. [Google Scholar] [CrossRef] [PubMed]
  5. Coursey, D.G. The origins and domestication of yams in Africa. In Origin of African Plant Domesticates; Harlan, J.R., De Wet, J.M.J., Stemler, A.B.L., Eds.; De Gruyter Mouton Press: The Hague, Netherlands; Paris, France, 1976; pp. 383–408. [Google Scholar]
  6. Martin, F.W.; Sadik, S. Tropical Yams and Their Potential. Part 4. Dioscorea rotundata and Dioscorea cayenensis; USDA Agriculture Handbook 36; United States Department of Agriculture (USDA), Agricultural Research Service (ARS): Washington, DC, USA, 1977. [Google Scholar]
  7. Gong, Q.P.; Cheng, G.; Yuan, L.G. Standardized Cultivation Techniques of Chinese Yam; China Agricultural Press: Beijing, China, 2004. [Google Scholar]
  8. Yuan, K. Proofreading and Annotation of the Classic of Mountains and Rivers; Shanghai Ancient Books Publishing House: Shanghai, China, 1980; p. 135. [Google Scholar]
  9. Cao, T.X.; Zhu, Q.L.; Chen, X.; Wang, P.T.; Shan, N.; Zhou, Q.H.; Huang, Y.J. The complete chloroplast genome sequence of the Dioscorea persimilis Prain et Burkill (Dioscoreaceae). Mitochondrial DNA B 2020, 5, 451–452. [Google Scholar] [CrossRef]
  10. Epping, J.; Laibach, N. An underutilized orphan tuber crop-Chinese yam: A review. Planta 2020, 252, 58. [Google Scholar] [CrossRef]
  11. Peng, B.; Zhang, Y.M.; Sun, X.Q.; Li, M.; Huang, Y. Genetic relationship and identifcation of Dioscorea polystachya cultivars accessed by ISAP and SCAR markers. Arch. Biol. Sci. 2017, 69, 277–284. [Google Scholar] [CrossRef]
  12. Pei, C.; Ting, C.T.; Chin, H.C.; Su, P.; Tang, S.Y.; Chang, H.C. A preliminary systematic study of Dioscorea L. sect. Stenophora Uline. J. Syst. Evol. 1979, 17, 61–72. [Google Scholar]
  13. Li, J.; Qin, L.; Li, C.; Liu, M.D.; Zhang, Y.S. Comparative transcriptome analysis identifies putative genes involved in dioscin biosynthesis in Dioscorea zingiberensis. Molecules 2018, 23, 454. [Google Scholar] [CrossRef] [PubMed]
  14. Lebot, V.; Malapa, R.; Molisalé, T. Development of HP-TLC method for rapid quantification of sugars, catechins, phenolic acids and saponins to assess Yam (Dioscorea spp.) tuber flour quality. Plant Genet. Resour. 2018, 17, 62–72. [Google Scholar] [CrossRef]
  15. Cheng, J.; Chen, J.; Liu, X.; Li, X.; Zhang, W.; Dai, Z.; Lu, L.; Zhou, X.; Cai, J.; Zhang, X.; et al. The origin and evolution of diosgenin biosynthetic pathway in Yam. Plant Commun. 2020, 2, 100079. [Google Scholar] [CrossRef] [PubMed]
  16. Sun, Y.T.; Long, W.J.; Li, Y.; Liu, T.P.; Zhao, G.L.; Ding, G.X.; Ni, X.L. Genetic diversity analysis of 45 Glutinous sorghum germplasms based on major agronomic traits and SSR markers. Crops 2024, 40, 57–64. [Google Scholar]
  17. Zhou, S.J.; Chen, X.J.; Zhu, Y.Q.; Chen, L.P.; Zhang, P. Research progress and suggestion on germplasms of ash gourd and chieh-qua. J. Plant Genet. Resour. 2014, 15, 211–214. [Google Scholar]
  18. Mignouna, H.; Mank, R.; Ellis, T. A genetic linkage map of water yam (Dioscorea alata L.) based on AFLP markers and QTL analysis for anthracnose resistance. Theor. Appl. Genet. 2002, 105, 726–735. [Google Scholar] [CrossRef]
  19. Sartie, A.; Asiedu, R.; Franco, J. Genetic and phenotypic diversity in a germplasm working collection of cultivated tropical yams (Dioscorea spp.). Genet. Resour. Crop. Evol. 2012, 59, 1753–1765. [Google Scholar] [CrossRef]
  20. Nemorin, A.; David, J.; Maledon, E.; Nudol, E.; Dalon, J.; Arnau, G. Microsatellite and flow cytometry analysis to help understand the origin of Dioscorea alata polyploids. Ann. Bot. 2013, 112, 811–819. [Google Scholar] [CrossRef]
  21. Kouam, E.; Avana-Tientcheu, M.; Lekeumo, V.; Akitio, H.; Khasa, D.; Pasquet, R. Agro-ecological distribution of the phenotypic diversity of aerial yam (Dioscorea bulbifera L.) in Cameroon using multivariate analysis: Prospect for germplasm conservation and improvement. Open Agric. 2018, 3, 190–206. [Google Scholar] [CrossRef]
  22. Zhang, W.J.; Chen, J.Y.; Liu, B.C.; Zhao, Y.Q.; Huang, Y.Z.; Chen, Y. Genetic variations on major traits of 37 Chinese Yam germplasms. Fujian J. Agric. Sci. 2019, 34, 1246–1254. [Google Scholar]
  23. Wu, Z.G.; Wei, Y.H.; Leng, C.H.; Tao, Z.M.; Li, L.; Ji, Z.X. Study on the phenotypic trait diversity of Wenzhou yam. J. Zhejiang Agric. Sci. 2010, 4, 735–738. [Google Scholar]
  24. Huang, Y.X.; Wang, F.Q.; Du, J.F.; Hua, S.M.; Lei, F.G.; Xu, X.M.; Liang, K.J.; Zhang, Z.Y. Numerical analysis of morphological variation of germplasm resources of Dioscorea. China J. Chin. Mater. Medica 2013, 38, 318–324. [Google Scholar]
  25. Qin, W.Z.; Wei, B.H.; Gan, X.Q.; Wei, M.Z.; Tang, X.H. Genetic diversity analysis of main agronomic traits in yam germplasm resources. J. South. Agric. 2014, 45, 1726–1733. [Google Scholar]
  26. Li, L.H.; Hua, S.M.; Chen, Z.H.; Lei, F.G.; Cao, Y.Y. Phenotypic genetic diversity research on Dioscorea Landraces of Fujian province. J. Yunnan Agric. Univ. (Nat. Sci.) 2016, 30, 257–262. [Google Scholar]
  27. Wu, J.P.; Jiao, Z.B.; Guo, F.L.; Chen, L.F.; Zeng, F.S.; Qiu, Z.M. Study on germplasm genetic diversity of yam based on morphological markers. Chin. Wild Plant Resour. 2017, 36, 11–15. [Google Scholar]
  28. Zhang, Y.J.; Sun, Y.; Ge, F.L.; Piao, J.Z.; Li, N.; Zhang, X.F.; Huang, T.H.; Xue, C.M.; Cao, J.L. Resources investigation and morphological identification for yam. Beijing J. Tradit. Chin. Med. 2018, 37, 463–468. [Google Scholar]
  29. Naval, M.D.M.; Zuriaga, E.; Pecchioli, S.; Llácer, G.; Giordani, E.; Badenes, M.L. Analysis of genetic diversity among persimmon cultivars using microsatellite markers. Tree Genet. Genomes 2010, 6, 677–687. [Google Scholar] [CrossRef]
  30. Mignouna, H.D.; Abang, M.M.; Fagbemi, S.A. A comparative assessment of molecular marker assays (AFLP, RAPD and SSR) for white yam (Dioscorea ratundata) germplasm characterization. Ann. Appl. Biol. 2003, 142, 269–276. [Google Scholar] [CrossRef]
  31. Silva, L.; Mezette, T.; Nascimento, W.; Silva, E.; Veasey, E. Spatially structured morphological and molecular diversity among Dioscorea cayenensis and D. rotundata yam accessions. Plant Genet. Resour. 2017, 15, 296–309. [Google Scholar] [CrossRef]
  32. Singh, R.K.; Jena, S.N.; Khan, S.; Yadav, S.; Banarjee, N.; Raghuvanshi, S.; Bhardwaj, V.; Dattamajumder, S.K.; Kapur, R.; Solomon, S.; et al. Development, cross-species/genera transferability of novel EST-SSR markers and their utility in revealing population structure and genetic diversity in sugarcane. Gene 2013, 524, 309–329. [Google Scholar] [CrossRef] [PubMed]
  33. Kalia, R.K.; Rai, M.K.; Kalia, S.; Singh, R.; Dhawan, A.K. Microsatellite markers: An overview of the recent progress in plants. Euphytica 2011, 177, 309–334. [Google Scholar] [CrossRef]
  34. Zhao, J.; Tong, Y.Q.; Ge, T.M.; Ge, J.W. Genetic diversity estimation and core collection construction of Sinojackia huangmeiensis based on novel microsatellite markers. Biochem. Syst. Ecol. 2016, 16, 74–80. [Google Scholar] [CrossRef]
  35. Zhang, T.F. Localization of QTL for Protein and Oil Content in Soybean Using Chain and Association Analysis. Master’s Thesis, Chinese Academy of Agricultural Sciences, Beijing, China, 2021. [Google Scholar]
  36. Zhang, Q.; Tian, H.L.; Guo, S.H.; Wang, Y.Q.; Pei, S.S.; Wu, C.J.; Wang, Q.B.; Zuo, X.Q. Development of high-throughput simple sequence repeat (SSR) markers for Chinese yam (Dioscorea opposita Thunb.) and genetic diversity and association analysis using phenotypic and genotypic traits. Genet. Resour. Crop Evol. 2025, 72, 7465–7482. [Google Scholar] [CrossRef]
  37. Bressan, E.; Neto, T.B.; Zucchi, M.I.; Rabello, R.J.; Veasey, E.A. Morphological variation and isozyme diversity in Dioscorea alata L. landraces from Vale do Ribeira, Brazil. Sci. Agric. 2011, 68, 494–502. [Google Scholar] [CrossRef]
  38. IPGRI/IITA. Descriptors for Yam (Dioscorea spp.); International Institute of Tropical Agriculture/Rome and International Plant Genetic Resources Institute: Ibadan, Nigeria; Rome, Italy, 1997. [Google Scholar]
  39. Huang, D.Y.; Huang, X.L. Descriptors and Data Quality Control Criterion for Dioscorea opposita Thunb; Science Press: Beijing, China, 2013. [Google Scholar]
  40. Wang, H.P.; Shen, D. Descriptors and Data Standard for Yam (Dioscorea spp.); China Agriculture Press: Beijing, China, 2014. [Google Scholar]
  41. Cao, T.X.; Sun, J.Y.; Shan, N.; Chen, X.; Wang, P.T.; Zhu, Q.L.; Xiao, Y.; Zhang, H.Y.; Zhou, Q.H.; Huang, Y.J. Uncovering the genetic diversity of yams (Dioscorea spp.) in China by combining phenotypic trait and molecular marker analyses. Ecol. Evol. 2021, 11, 9970–9986. [Google Scholar] [CrossRef]
  42. Zhai, C.J.; Ge, L.J.; Cheng, Y.J.; Qiu, L.; Wang, X.Q.; Liu, S.D. Genetic diversity analysis of wax gourd and chieh-qua germplasm resources based on phenotypic traits and ssr markers. Sci. Agric. Sin. 2024, 57, 3440–3457. [Google Scholar]
  43. Liu, K.; Muse, S.V. PowerMarker: An integrated analysis environment for genetic marker analysis. Bioinformatics 2005, 21, 2128–2129. [Google Scholar] [CrossRef] [PubMed]
  44. Pritchard, J.K.; Stephens, M.J.; Donnelly, P.J. Inference of population structure using multilocus genotype data. Genetics 2000, 155, 945–959. [Google Scholar] [CrossRef] [PubMed]
  45. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef]
  46. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef]
  47. Chen, W.; Wang, L.R.; Zhu, G.R.; Fang, W.C.; Chen, C.W.; Cao, K.; Zhang, S.L. Comparison of Genetic Diversity among Peach Cultivars based on Biological Traits and SSR Markers. J. Plant Genet. Resour. 2009, 10, 86–90. [Google Scholar]
  48. Sun, Z.Z.; Li, Q.Y.; Wang, X.K.; Zhao, W.T.; Xue, Y.; Feng, J.Y.; Liu, X.F.; Liu, M.Y.; Jiang, D. Comprehensive evaluation and phenotypic diversity analysis of germplasm resources in Mandarin. Sci. Agric. Sin. 2017, 50, 4362–4372. [Google Scholar]
  49. Campbell, D.R. Using phenotypic manipulations to study multivariate selection of floral trait associations. Ann. Bot. 2009, 103, 1557–1566. [Google Scholar] [CrossRef]
  50. Zhao, L.Y.; Xu, J.Q.; Xu, B.; Hu, A.P.; Zhang, Y.H. Genetic diversity analysis of agronomic traits and elite accession selection in 263 cucumber germplasm resources. J. South. Agric. 2021, 52, 145–154. [Google Scholar]
  51. Sun, D.L.; Bian, N.F.; Chen, Z.D.; Xing, X.H.; Xu, Z.J.; Qi, Y.J.; Wang, X.; Wang, X.J.; Wang, W. Comprehensive evaluation and index screening of phenotypic traits in peanut germplasm resources. J. Plant Genet. Resour. 2018, 19, 865–874. [Google Scholar]
  52. Huang, Y.; Xie, Y.Q.; Li, Z.Z.; Wang, K.L.; Lin, P.; Yao, X.H. Analysis of Camellia meiocarpa phenotype diversity. J. Plant Genet. Resour. 2014, 15, 270–278. [Google Scholar]
  53. Jiang, X.B.; Gong, B.C.; Liu, Q.Z.; Chen, X.; Wu, K.Y.; Deng, Q.E.; Tang, D. Phenotypic diversity of important agronomic traits of local cultivars of Chinese chestnut. Acta Hortic. Sin. 2014, 41, 641–652. [Google Scholar]
  54. Zhou, Y.Q.; Zhou, Z.N.; Yao, H.L.; Liu, Y.J.; Tu, R.T. Application of ISSR markers in detection of genetic variation among Chinese yam (Dioscorea opposita Thunb) cultivars. Life Sci. J. 2008, 5, 6–12. [Google Scholar]
  55. Wang, Y.W.; Zixia, G.; Min, C.; Zhang, Y.M.; Sun, X.Q.; Zhou, Y.F.; Bin, P. Genetic relationship and species identification of Dioscorea polystachya Turcz. in Yams determined by ISSR, ISAP, SRAP and SCAR markers. Genet. Resour. Crop Evol. 2022, 69, 1953–1964. [Google Scholar]
  56. Wang, R.H.; Wang, T.Y.; Li, Y. Application of association analysis in molecular evaluation of crop germplasm resources. J. Plant Genet. Resour. 2007, 8, 366–372. [Google Scholar]
  57. Liu, J.Y.; Zhang, Y.W.; Han, X.; Zuo, J.F.; Zhang, Z.; Shang, H.; Song, Q.; Zhang, Y.M. An evolutionary population structure model reveals pleiotropic effects of GmPDAT for traits related to seed size and oil content in soybean. J. Exp. Bot. 2020, 71, 6988–7002. [Google Scholar] [CrossRef]
  58. Hansen, M.; Kraft, T.; Ganestam, S.S.Ä.; Li, T.; Nilsson, N.O. Linkage disequilibrium mapping of the bolting gene in sea beet using AFLP markers. Genet. Res. 2001, 77, 61–66. [Google Scholar] [CrossRef]
  59. Tan, L.Z.; Zhao, Y.Q. Principle, optimization and application of mixed models in genomewide association study. Scientia Agric. Sin. 2024, 57, 3440–3457. [Google Scholar]
Figure 1. Geographical distribution and resource abundance of Chinese yam germplasm resources in China (A), a field trial plot (B), and representative morphological features of leaves (C), stems (D), flowers (E), and tubers (F) from different germplasm accessions. Numbers indicate the total number of resources collected per province, with solid circles in different colors representing distinct yam varieties. Scale bars: 1 cm (C), 10 cm (F).
Figure 1. Geographical distribution and resource abundance of Chinese yam germplasm resources in China (A), a field trial plot (B), and representative morphological features of leaves (C), stems (D), flowers (E), and tubers (F) from different germplasm accessions. Numbers indicate the total number of resources collected per province, with solid circles in different colors representing distinct yam varieties. Scale bars: 1 cm (C), 10 cm (F).
Horticulturae 11 01193 g001
Figure 2. Frequency distribution of leaf length (A), leaf width (B), leaf length-to-width ratio (C), tuber length (D), tuber diameter (E), and tuber flesh weight (F) in Chinese yam germplasm.
Figure 2. Frequency distribution of leaf length (A), leaf width (B), leaf length-to-width ratio (C), tuber length (D), tuber diameter (E), and tuber flesh weight (F) in Chinese yam germplasm.
Horticulturae 11 01193 g002
Figure 3. Heatmap of correlations between the comprehensive value (D-value) and 23 phenotypic traits. Red indicates positive correlations, and blue indicates negative correlations; the color bar on the right represents the correlation coefficient. “*” and “**” denote significant correlations at the 0.05 and 0.01 probability levels, respectively.
Figure 3. Heatmap of correlations between the comprehensive value (D-value) and 23 phenotypic traits. Red indicates positive correlations, and blue indicates negative correlations; the color bar on the right represents the correlation coefficient. “*” and “**” denote significant correlations at the 0.05 and 0.01 probability levels, respectively.
Horticulturae 11 01193 g003
Figure 4. Relationships between the number of genetic groups (K) and the estimated ΔK based on the SSR dataset (A), population genetic structure of 53 Chinese yam germplasm accessions at K = 2 based on the Bayesian clustering model (B), neighbor-joining tree of 53 Chinese yam germplasm accessions based on Nei’s genetic distances (C), and principal component analysis of SSR data for 53 Chinese yam individuals (D).
Figure 4. Relationships between the number of genetic groups (K) and the estimated ΔK based on the SSR dataset (A), population genetic structure of 53 Chinese yam germplasm accessions at K = 2 based on the Bayesian clustering model (B), neighbor-joining tree of 53 Chinese yam germplasm accessions based on Nei’s genetic distances (C), and principal component analysis of SSR data for 53 Chinese yam individuals (D).
Horticulturae 11 01193 g004
Figure 5. Analysis of linkage disequilibrium (LD) distribution across 19 SSR markers in 53 Chinese yam germplasms. The upper right triangle displays pairwise D’ values, while the lower left triangle represents LD confidence metrics (p-values) for each locus pair. For all marker pairs, LD strength is visualized using a white (0.0) to red (1.0) color gradient in the heatmap’s upper triangle, with deeper red indicating stronger disequilibrium. Corresponding p-values range from non-significant (p > 0.01; white) to highly significant (p < 0.0001; red).
Figure 5. Analysis of linkage disequilibrium (LD) distribution across 19 SSR markers in 53 Chinese yam germplasms. The upper right triangle displays pairwise D’ values, while the lower left triangle represents LD confidence metrics (p-values) for each locus pair. For all marker pairs, LD strength is visualized using a white (0.0) to red (1.0) color gradient in the heatmap’s upper triangle, with deeper red indicating stronger disequilibrium. Corresponding p-values range from non-significant (p > 0.01; white) to highly significant (p < 0.0001; red).
Horticulturae 11 01193 g005
Table 1. Standardized description and classification of phenotypic traits in Chinese yam germplasm.
Table 1. Standardized description and classification of phenotypic traits in Chinese yam germplasm.
TraitsClasses (Codes)
12345
Flowering (FL)AbsentMaleFemale
Aerial tubers (ATs)Absent Present
Leaf shape (LS)Heart triangleTriangular ovateLanceolateRoundShape of halberd
Leaf color (LC)Yellow-greenGreenish grayDark green
Leaf apex shape (LAX)ObtuseAcute
Distance between lobes (DBLs)IntermediateVery distant
Leaf margin color (LMC)GreenPurple
Petiole color (PC)PurpleGreenGreenish purplePurplish red
Leaf vein color (LVC)Yellow-greenGreenPurple
Leaf vein (LV)FiveSevenNine
Stem color (SC)GreenGreen with purpleBrownish greenPurple
Tuber shape (TS)OvalCylindricalIrregular
Roots hair density (RHD)SparseDense
Place of roots on the tuber (PRT)AllUpper and Middle
Tuber skin color (TSC)BrownBlackGray
Tuber skin color under bark (TSCUB)BeigePurple
Flesh color (FC)WhiteYellow
Leaf length (LL)Average leaf length of six mature leaves (cm)
Leaf width (LW)Average leaf width of six mature leaves (cm)
Length to width ratio (LWR)Average leaf length/average leaf width(cm/cm)
Tuber length (TL)Average tuber length of six plants (cm)
Tuber diameter (TD)Average tuber diameter of six plants (mm)
Tuber flesh weight (TFW)Average yield of six plants (g)
Table 2. Frequency distribution and Shannon diversity index (I) analysis of qualitative traits in Chinese yam germplasm.
Table 2. Frequency distribution and Shannon diversity index (I) analysis of qualitative traits in Chinese yam germplasm.
TraitsClasses (Codes)I
12345
Flowering (FL)0.230.280.49 1.04
Aerial tubers (ATs)0.250.76 0.56
Leaf shape (LS)0.020.490.060.080.401.15
Leaf color (LC)0.080.060.87 0.48
Leaf apex shape (LAX)0.450.55 0.69
Distance between lobes (DBLs)0.660.34 0.64
Leaf margin color (LMC)0.260.74 0.58
Petiole color (PC)0.190.190.040.59 1.07
Leaf vein color (LVC)0.680.280.04 0.74
Leaf vein (LV)0.040.940.02 0.25
Stem color (SC)0.040.550.020.38 0.90
Tuber shape (TS)0.930.020.06 0.31
Roots hair density (RHD)0.790.21 0.51
Place of roots on the tuber (PRT)0.890.11 0.35
Tuber skin color (TSC)0.450.360.19 1.04
Tuber skin color under bark (TSCUB)0.980.02 0.09
Flesh color (FC)0.980.02 0.09
Mean 0.62
Table 3. Variation statistics and genetic diversity analysis of quantitative traits in Chinese yam germplasm resources.
Table 3. Variation statistics and genetic diversity analysis of quantitative traits in Chinese yam germplasm resources.
TraitsMaximumMinimumRangeMeanSDCVI
Leaf length (cm)12.825.886.938.481.3916.411.50
Leaf width (cm)8.982.856.135.961.2120.281.62
Length-to-width ratio2.201.101.101.460.2315.801.57
Tuber length (cm)77.5017.5060.0040.4612.5831.101.77
Tuber diameter (mm)15.381.7513.636.652.9544.361.79
Tuber flesh weight (g)1316.0061.501254.50350.72250.6071.451.45
SD: std. deviation; CV: coefficient of variation (%); I: genetic diversity index.
Table 4. The proportion of the morphological variation and traits of Chinese yam germplasm contribution explained by the first eight principal components.
Table 4. The proportion of the morphological variation and traits of Chinese yam germplasm contribution explained by the first eight principal components.
TraitPrincipal Component
12345678
FL0.01 0.47 0.14 −0.42 0.18 0.02 0.14 −0.06
AT0.00 0.08 −0.38 −0.55 0.49 0.12 −0.14 0.00
LS−0.51 0.05 0.39 0.37 0.17 0.02 0.15 0.04
LC0.57 0.48 0.14 0.28 −0.13 0.08 0.12 0.03
LAX−0.57 −0.17 0.03 −0.06 0.04 0.13 0.08 0.41
DBL−0.41 −0.12 −0.11 0.34 0.41 −0.28 0.45 −0.02
LMC−0.45 0.40 0.22 0.24 −0.09 −0.24 0.37 −0.03
PC0.38 −0.13 −0.02 0.07 −0.35 −0.39 0.35 0.32
LVC−0.16 −0.14 −0.22 −0.18 0.27 0.36 0.48 0.38
LV0.65 0.01 0.46 −0.22 0.31 0.17 0.18 −0.01
SC−0.33 0.44 0.26 0.18 0.10 −0.11 −0.27 0.16
TS0.29 −0.43 0.08 0.47 −0.09 0.15 0.27 −0.21
RHD0.27 −0.06 −0.31 0.39 0.28 −0.05 0.03 −0.49
PRT−0.04 −0.47 −0.41 −0.05 0.42 0.11 0.27 −0.14
TSC−0.08 0.43 0.18 −0.36 −0.37 0.21 0.41 −0.04
TSCUB0.39 −0.43 0.75 −0.04 0.24 0.01 −0.04 0.07
FC0.39 −0.43 0.75 −0.04 0.24 0.01 −0.04 0.07
LL0.59 0.09 −0.39 0.13 0.20 −0.43 −0.11 0.39
LW0.65 0.50 −0.21 0.08 0.30 −0.24 0.06 0.18
LWR−0.36 −0.78 −0.04 0.04 −0.10 −0.18 −0.24 0.21
TL−0.44 0.45 0.29 0.07 0.38 −0.10 −0.18 −0.13
TD0.32 0.07 −0.27 0.43 −0.19 0.64 −0.11 0.16
YPP−0.13 0.33 0.03 0.55 0.31 0.37 −0.19 0.27
Eigenvalue3.67 3.03 2.49 2.02 1.73 1.41 1.36 1.11
Contribution rate (%)15.95 13.16 10.83 8.80 7.50 6.14 5.91 4.83
Accumulated contribution rate (%)15.95 29.11 39.94 48.74 56.24 62.39 68.30 73.13
Table 5. The ranking of Chinese yam germplasm resources by calculating quality-related traits.
Table 5. The ranking of Chinese yam germplasm resources by calculating quality-related traits.
AccessionsD1D2D3D4D5D6D7D8D ValueRanking
DP240.4821.0000.2760.6240.7580.9230.2810.7000.6261
DP521.0000.0001.0000.3250.8530.3710.3940.6410.5972
DP220.5820.6690.0870.6560.8420.5840.9710.2590.5693
DP50.5440.8530.2490.5660.4300.5650.4170.9010.5624
DP230.5280.9580.2490.5810.8750.1960.3580.4710.5615
DP420.3630.6440.3220.9990.5690.4190.2851.0000.5466
DP390.6460.7710.1920.4580.3540.5080.6550.7400.5447
DP10.5520.8030.2370.4630.3980.6800.5300.7100.5448
DP20.5030.8250.1950.4570.7310.2410.7470.4870.5299
DP470.6010.8110.1750.3690.3640.6020.4670.7970.52610
DP410.6010.7790.1700.3330.4660.7350.4950.5450.52211
DP270.5740.6200.0540.4181.0000.3420.5020.7630.51712
DP370.7500.3600.0990.9350.3031.0000.4250.1730.51613
DP380.4340.5820.2400.4930.5400.2931.0000.9140.51514
DP300.8480.6580.0250.5090.4400.2240.1980.9590.51115
DP280.5160.8960.2190.5970.9260.1080.2860.0740.51016
DP490.5510.4750.2780.7770.3300.2480.9320.5160.50417
DP210.6110.6860.1580.3030.3020.6360.4780.9700.50318
DP250.3490.8720.3450.5990.7190.6130.1760.0900.50219
DP200.4410.8460.2780.3890.6660.2560.5370.4770.50120
DP400.4920.7390.2530.2860.4940.3500.9570.4860.50121
DP480.6730.7510.1480.3870.3980.1250.5660.7970.50022
DP290.8660.6480.0000.5510.5830.0000.2500.6600.49523
DP530.5290.7260.2480.4500.4280.3570.3860.7210.49024
DP440.5570.8000.2270.2920.4300.2680.7450.3530.48425
DP450.5750.6800.1550.5000.6050.1700.7330.2240.48126
DP190.6990.2330.0671.0000.4360.3940.8870.0000.47327
DP430.5250.8200.3080.2370.3370.3740.6640.2640.47328
DP80.6640.7080.1640.3810.3990.2410.3900.5680.47329
DP320.7650.6300.0610.5570.5810.1240.1440.5050.47130
DP30.4250.4680.2560.6080.4320.2720.7640.7810.46831
DP360.5530.7740.2090.1600.3350.5770.5150.4110.46232
DP100.4970.7540.2810.4790.4630.3190.2430.3630.46133
DP180.4980.7750.2900.3890.4670.4750.2550.2090.46034
DP160.5880.7700.2200.3950.3870.2210.2890.3970.45535
DP510.6520.7930.2080.1110.2970.0880.6820.3830.44736
DP150.4380.7390.2640.2900.6740.1560.5840.2110.44637
DP140.5880.6090.1570.1300.5220.4910.3890.5910.44238
DP40.4590.6220.2810.3220.4050.7120.2680.3770.44039
DP70.5960.6840.2180.2890.3950.1700.4050.3860.43340
DP460.4400.6980.3300.1330.1700.3590.7920.5160.43241
DP130.6290.6660.1820.2230.4250.4750.2540.2370.43042
DP90.5380.7020.2700.3430.3980.0820.3290.4500.42943
DP500.6160.5290.2050.3410.0000.8860.3980.1470.41744
DP340.6650.5270.1370.4270.4010.3270.2520.1360.40945
DP310.6980.4380.1080.3880.2750.2370.1800.7220.40446
DP330.7240.3540.0110.4050.5210.2320.4110.1830.38947
DP260.5690.6250.2380.2860.1960.0560.4170.3710.38948
DP60.4160.5960.2580.0000.3490.2890.3130.4740.35349
DP110.2510.2770.1440.7610.1260.0790.4000.7630.32050
DP170.3790.009−0.144−0.2291.1170.7100.4430.6790.28851
DP350.2560.1050.2130.1920.5050.3640.5370.4820.28652
DP120.0000.3260.2640.6980.3690.1420.0000.4300.26153
Table 6. Genetic diversity of 19 SSR markers in 53 Chinese yam germplasm resources.
Table 6. Genetic diversity of 19 SSR markers in 53 Chinese yam germplasm resources.
MarkersNaNeIHoHeHPIC
YM0252.2971.0330.3020.5710.5650.703
YM0321.9350.6760.5310.4880.4830.318
YM0676.5311.9700.7250.8580.8470.927
YM0764.6811.6460.5580.7960.7860.907
YM0976.1791.8640.6190.8480.8380.942
YM1221.6790.5940.5210.4090.4040.402
YM1341.2880.4930.2440.2260.2240.392
YM1721.9130.6700.7870.4830.4770.294
YM1942.7201.1310.9350.6390.6320.533
YM2121.6410.5790.5320.3950.3900.374
YM2421.9390.6770.6000.4900.4840.473
YM3021.1100.2050.0630.1000.0990.150
YM3285.5631.8480.7730.8300.8200.942
YM3331.1900.3250.1350.1610.1600.259
YM3553.4581.3370.8910.7190.7110.819
YM3774.0801.6410.8860.7640.7550.904
YM4121.1560.2610.1460.1370.1350.218
Da1D0852.7411.2010.7390.6420.6350.843
SSR-1751.9770.9270.5750.5000.4940.717
Total8054.07819.07810.56210.0569.93911.117
Mean4.2112.8461.0040.5560.5290.5230.585
Na: observed number of alleles; Ne: effective number of alleles; I: Shannon’s diversity index; Ho: observed heterozygosity; He: expected heterozygosity; H: Nei’s gene diversity; PIC: polymor-phism information content index.
Table 7. SSR marker sites were significantly associated with major phenotypic traits (p < 0.01) and their explanation rates for phenotypic variation.
Table 7. SSR marker sites were significantly associated with major phenotypic traits (p < 0.01) and their explanation rates for phenotypic variation.
TraitMarker SiteGeneral Linear Model (GLM)Mixed Linear Model
(MLM)
pR2 (%)pR2 (%)
FLYM07_2 (400 bp)--0.009313.94
ATYM06_3 (350 bp)0.001618.030.003618.58
ATYM13_3 (400 bp)0.008512.96--
LMCYM13_3 (400 bp)0.001318.720.003518.75
LMCYM37_2 (350 bp)0.001219.030.006316.25
LMCYM37_7 (100 bp)0.006813.730.002620.16
LLDa1D08_4 (300 bp)0.002816.460.005516.77
LLYM19_3 (130 bp)--0.006715.94
LWRYM06_1 (500 bp)0.006213.790.005816.70
LWYM19_3 (130 bp)--0.003219.34
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tan, D.; Tang, R.; Yang, G.; Yang, Y.; Hu, M.; Tang, M.; Cao, T.; Du, P. Genetic Diversity and Association Analysis of Dioscorea polystachya Germplasm Resources Based on Phenotypic Traits and SSR Markers. Horticulturae 2025, 11, 1193. https://doi.org/10.3390/horticulturae11101193

AMA Style

Tan D, Tang R, Yang G, Yang Y, Hu M, Tang M, Cao T, Du P. Genetic Diversity and Association Analysis of Dioscorea polystachya Germplasm Resources Based on Phenotypic Traits and SSR Markers. Horticulturae. 2025; 11(10):1193. https://doi.org/10.3390/horticulturae11101193

Chicago/Turabian Style

Tan, Dan, Rong Tang, Ge Yang, Yinfang Yang, Miao Hu, Min Tang, Tianxu Cao, and Ping Du. 2025. "Genetic Diversity and Association Analysis of Dioscorea polystachya Germplasm Resources Based on Phenotypic Traits and SSR Markers" Horticulturae 11, no. 10: 1193. https://doi.org/10.3390/horticulturae11101193

APA Style

Tan, D., Tang, R., Yang, G., Yang, Y., Hu, M., Tang, M., Cao, T., & Du, P. (2025). Genetic Diversity and Association Analysis of Dioscorea polystachya Germplasm Resources Based on Phenotypic Traits and SSR Markers. Horticulturae, 11(10), 1193. https://doi.org/10.3390/horticulturae11101193

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop