Next Article in Journal
Genome-Wide Identification and Functional Characterization of the BAHD Acyltransferase Gene Family in Brassica napus L.
Previous Article in Journal
Pectins as Brakes? Their Potential Implication in Adjusting Mesophyll Conductance Under Water Deficit and Salt Stresses
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genetic Diversity and Construction of Salt-Tolerant Core Germplasm in Maize (Zea mays L.) Based on Phenotypic Traits and SNP Markers

1
Key Laboratory of Germplasm Enhancement, Physiology and Ecology of Food Crops in Cold Region, Engineering Technology Research Center of Maize Germplasm Resources Innovation on Cold Land of Heilongjiang Province, Northeast Agricultural University, Harbin 150030, China
2
Institute of Forage and Grassland Sciences Heilongjiang Academy of Agricultural Science, Harbin 150086, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2025, 14(14), 2182; https://doi.org/10.3390/plants14142182
Submission received: 14 June 2025 / Revised: 7 July 2025 / Accepted: 11 July 2025 / Published: 14 July 2025
(This article belongs to the Special Issue Maize Landraces: Conservation, Characterization and Exploitation)

Abstract

Maize is an essential staple food, and its genetic diversity plays a central role in breeding programs aimed at developing climate-adapted cultivars. Constructing a representative core germplasm set is necessary for the efficient conservation and utilization of maize genetic resources. In this study, we analyzed 588 cultivated maize accessions using agronomic traits such as plant morphology and yield traits such as ear characteristics and single-nucleotide polymorphisms (SNPs) to assess molecular diversity and population structure and to construct a core collection. Nineteen phenotypic traits were evaluated, revealing high genetic diversity and significant correlations among most quantitative traits. The optimal sampling strategy was identified as “Mahalanobis distance + 20% + deviation sampling + flexible method.” Whole-genome genotyping was conducted using the Maize6H-60K liquid phase chip. Population structure analysis, principal component analysis, and cluster analysis divided the 588 accessions into six subgroups. A core collection of 172 accessions was selected based on both phenotypic and genotypic data. These were further evaluated for salt–alkali tolerance during germination, and cluster analysis classified them into five groups. Sixty-five accessions demonstrated salt–alkali tolerance, including 18 with high resistance. This core collection serves as a valuable foundation for germplasm conservation and utilization strategies.

1. Introduction

Maize (Zea mays L.) is a globally significant crop used for food, feed, and industrial purposes [1]. China ranks third worldwide in saline–alkali soil area, with more than 90% of this land located inland, significantly threatening national food security [2,3]. This is especially critical in northern cold regions, where both low temperatures and severe soil salinization limit crop productivity. Evaluating salt–alkali tolerance in maize germplasm and breeding resistant varieties is therefore essential for improving land use efficiency and maintaining food supply.
Genetic diversity is a cornerstone of maize breeding and improvement. It plays an important role in germplasm utilization and conservation. However, a narrow genetic base remains a limiting factor in breeding progress [4,5]. Germplasm resources are national strategic resources, playing a vital role in sustainable agricultural development [6]. In recent years, with the increasing emphasis on food security in China, research on maize germplasm resources has attracted growing attention. As the basis for germplasm evaluation and utilization, genetic diversity provides critical information for the exploration of gene resources [7].
With advancements in biotechnology, Single-Nucleotide Polymorphisms (SNPs) have become widely used in studies of genetic diversity, QTL mapping, fingerprinting, population structure, and marker-assisted selection [8,9,10]. Core collections aim to reduce redundancy while preserving the genetic diversity of the full germplasm set, facilitating more efficient conservation and utilization [11].
Core collections are typically constructed using either phenotypic data (e.g., agronomic or morphological traits) or molecular data. Several core germplasms of local Chinese maize varieties have been developed based on geographic origin and phenotypic evaluations. Phenotypic diversity assessments of germplasm in China’s National Germplasm Bank have revealed substantial regional variation in diversity indices and traits [12,13,14,15]. Core collections have also been established for crops like wheat, sweet potato, and rice, playing a key role in conserving and using genetic resources [16,17,18,19,20,21]. Previous studies in maize have focused on morphological evaluations and diversity assessments. For example, Cosmos Magorolosho constructed a core collection using phenotypic and genotypic data [22]. The Chinese Academy of Agricultural Sciences (CASA) also developed core germplasms from over 13,000 local varieties and 3000 self-pollinated lines [8].
Other crops have been evaluated for stress tolerance using core collections. For example, sorghum core germplasm has been assessed for salt tolerance during germination [23,24], and soybean core germplasm has been classified into five drought-tolerance levels based on adult-stage assessments [25]. Despite numerous studies on maize genetic diversity and core germplasm development in China, few have focused on germplasm from northern cold regions, and few studies systematically integrated phenotypic and SNP-based assessments of salt tolerance [26,27]. Moreover, few studies integrate both phenotypic and SNP data in constructing specialized salt-tolerant core collections. Heilongjiang Province ranks first in China in terms of maize planting area, production, and commercialization rate. It is also home to the country’s only national medium-term genebank for cold-region crops. With the continuous expansion of germplasm resource collections, challenges have emerged in resource management and utilization. Issues such as insufficient exploration of genetic characteristics and low utilization efficiency have become prominent. Salt–alkali stress, a critical abiotic stress factor in spring maize production areas of northern China, is often accompanied by high pH stress, imposing complex mixed salt–alkali stress on plants and severely impacting food security [28]. Conducting genetic diversity assessment and constructing a core germplasm collection represent effective solutions to bridge the gap between resource conservation and efficient utilization.
In this study, we conducted comprehensive phenotypic and genotypic analyses of a large maize germplasm set. A core collection was established to reduce germplasm redundancy while preserving diversity. Subsequently, salt–alkali tolerance was evaluated during the germination stage to construct a specialized core collection suited for cold-region saline–alkali environments. This curated set will enable more efficient germplasm management and provide a foundation for inbred line development, artificial population construction, and future breeding programs aimed at improving maize stress tolerance.

2. Results

2.1. Phenotypic and Correlation Analysis

Descriptive statistics and genetic diversity analysis were performed on the 19 phenotypic traits across 588 maize accessions (Table 1). All traits exhibited skewness and kurtosis close to ±1, indicating near-normal distributions and suitability for further statistical analysis (Supplementary Figure S1). The coefficients of variation (CV) ranged from 4.43% to 56.61%, with plot yield exhibiting the highest variability (56.61%) and the growth period showing the lowest (4.43%). Except for seed emergence rate, all traits had CVs above 10%, suggesting substantial variability.
Genetic diversity index (GDI) values ranged from 0.728 to 1.597, with the highest index observed for hundred-grain weight and the lowest for growth period, confirming considerable diversity across the accessions and supporting their value as a diverse germplasm resource.
Correlation analysis revealed complex interrelationships among traits. Most traits were significantly correlated, though a few showed no association (p < 0.05 or p < 0.01) (Figure 1). PCA was conducted to reduce dimensionality while retaining critical trait information. Five principal components with eigenvalues greater than 1.0 were extracted, accounting for a cumulative variance contribution of 62.667%, indicating that the majority of the trait variability was captured (Table 2).

2.2. Phenotypic Core Collection Construction

Eight core collections were constructed using different systematic clustering methods and evaluated based on trait divergence percentages. The ranking of clustering methods was as follows: Average linkage > flexible method > Ward’s method > complete linkage > centroid method > average linkage > single linkage > unweighted pair-group method using arithmetic averages (Supplementary Table S1). For most traits, the CVs in the core collections were higher than in the original population, indicating preserved or enhanced diversity (Supplementary Table S2).
Evaluation of five core collections, constructed using different sampling proportions (10–30%), showed that each met the criteria of >80% extreme variance compliance and <20% mean difference (Table 3). Based on representativeness, the ranking of sampling proportions was 20% > 15% > 10% > 25% > 30% (Supplementary Table S3).
A total of 24 different sampling strategies were tested by combining two genetic distance types, the top two sampling ratios, two sampling methods, and the two best clustering approaches (Supplementary Table S4). Among them, all 24 strategies had an MD < 20%, with the highest at 5.26%. The VD ranged from 21.05% to 89.47%. The CR ranged from 88.27% to 100%, with eight strategies achieving full (100%) representation. The VR ranged from 108.22% to 124.59%.
The GJ23 strategy, which used Mahalanobis distance, a 20% sampling ratio, deviation sampling, and the flexible method, was identified as the optimal approach for constructing the phenotypic core collection. This strategy achieved >95.00% consistency in capturing unique germplasm. Compared with the original population, the core set demonstrated increased CVs across most traits, reflecting strong heterogeneity (Supplementary Table S5).
PCA comparison between the core and original populations showed that the core collection had a higher cumulative variance (69.76%) compared with the original population (62.67%), suggesting more efficient trait capture and potential reduction in genetic redundancy (Supplementary Table S6). Three-dimensional PCA distribution plots revealed that the original population exhibited dense clustering and high overlap among accessions, indicating genetic similarity among them. In contrast, the core collection showed a more uniform distribution, reduced overlap, and better separation between accessions (Figure 2). These findings confirm that the core collection not only preserves overall diversity but also minimizes genetic duplication.

2.3. Genetic Diversity and Population Structure Analysis of Maize Germplasm

A total of 61,214 putative SNPs were identified across the 588 maize accessions. After filtering, 7439 high-quality SNP markers were retained for genetic diversity and population structure analysis. These SNPs were unevenly distributed across the 10 maize chromosomes, with the highest number (1218) located on chromosome 1 and the fewest (476) on chromosome 10 (Figure 3A,B). The SNP density across the genome was higher at the distal ends of the chromosome and lower near the centromeres, indicating reduced diversity and higher sequence conversion near centromeric regions.
PCA principal component analysis showed that the first and second principal components accounted for 6.619% and 4.461% of phenotypic variance, respectively, cumulatively explaining 10.77% (Figure 3C).
Population structure analysis was performed using Structure v2.3.4 based on SNP data. The ΔK value peaked at K = 6, indicating that the 588 maize accessions were optimally grouped into six distinct subpopulations (Figure 4A,B). Neighbor-joining cluster analysis also confirmed this grouping pattern (Figure 4C). Among the accessions, the largest group (Branch II) included 376 accessions, while the smallest group (Branch I) contained 19.
The distribution of heterotic groups among these accessions was as follows: Lancaster (Lan), 308 accessions (52.38%); Ludahonggu (LRC), 53 (9.02%); Reid (BSSS), 114 (19.39%); Tangsipingtou (TSPT), 70 (11.90%); P group, 21 (3.57%); and X group, 22 (3.74%) (Figure 4B). Representative inbred lines from each group include B73 and Ye478 (Reid), Dan 340 (LRC), Huangzaosi (TSPT), Qi319 (P), Mo17 (Lan), and Jing 724 (X). These results indicate substantial structure and genetic stratification among northern Chinese maize germplasm, underscoring their breeding potential.
Genetic similarity coefficient (GSC) analysis revealed wide variability, with 50% of accessions falling in the 0.3 < GSC ≤ 0.4 range and the other 50% in the 0.9 < GSC ≥ 1.0 range. The lowest GSC was 0.3098 (between T106 and 196), while the average GSC across all accessions was 0.5595 (Table 4). These results affirm the rich genetic diversity within the studied germplasm population.

2.4. Core Collection Construction and Evaluation Based on SNP Markers

Genetic diversity analysis revealed that the observed heterozygosity (Ho) was 0.0316, and the expected heterozygosity (He) was 0.4024, indicating limited heterozygosity within individual accessions (Table 5). These findings suggest that the germplasm exhibits substantial genetic variability, providing a strong foundation for background selection and evolutionary analysis.
Genotypic core collection screening was performed across the 588 maize accessions. Sampling ratios ranging from 10% to 25% captured 99.39% to 99.83% of the total genetic diversity. Diversity indices for these core sets showed Shannon’s information index ranging from 0.5788 to 0.6009 (Table 5).
Based on these genetic diversity results, a 15% sampling ratio was identified as optimal for representing the population’s diversity, followed in rank by 20%, 25%, 30%, and 10%.
To verify whether the 15% core collection adequately represented the initial population, principal coordinate analysis (PCA) based on genetic distance was conducted using GenAlex 6.5 software. The initial population (blue) showed dense clustering, while the core collection (orange) exhibited a more uniform distribution. This indicates that the 15% core set effectively captured the genetic variability of the full population while reducing redundancy (Figure 5A).

2.5. Final Integration of Phenotypic and Genotypic Core Collections

A phenotypic core collection of 117 accessions and a genotypic core collection of 88 accessions were independently constructed. Among them, 33 accessions overlapped between both groups. These 33 shared accessions were integrated to establish a final, non-redundant core collection comprising 172 accessions (Table 6). This combined collection retained comprehensive genetic representation while minimizing duplication, thereby enhancing practical utility.
The final 172 accessions encompassed all six heterotic groups: Lan (97), LRC (13), BSSS (32), TSPT (18), P (6), and X (6) (Figure 5B). The dominance of Lan and the BSSS reflects their prevalence in northern Chinese maize breeding programs.The genetic diversity indices for 172 accessions showed Shannon’s information index is 0.5803, Ho is 0.0367, and He is 0.3957 (Supplementary Table S7). The core collection we finally constructed showed that various genetic diversity indices were highly similar to those of the original germplasm. This core collection serves as a representative subset for future research, breeding, and resource management (Supplementary Table S8).

2.6. Identification of Salt–Alkali Tolerant Germplasm in the Core Collection

Salt–alkali tolerance was evaluated in all 172 core germplasm accessions during germination. Membership function values were calculated across six physiological indicators. Correlation analysis revealed complex interrelationships among traits. Most traits were significantly correlated, though a few showed no association (p < 0.05 or p < 0.01) (Figure 5C) Cluster analysis grouped the accessions into five categories based on salt–alkali response (Figure 6, Supplementary Table S7). Group I, comprising four accessions (2.33%), was classified as highly tolerant. Group II included 14 accessions (8.14%) and was categorized as tolerant. Group III, containing 47 accessions (27.33%), exhibited moderate tolerance. Group IV consisted of 64 accessions (37.20%) and was identified as salt–alkali sensitive, while Group V, comprising 43 accessions (25%), was considered highly sensitive to salt–alkali stress.
In total, 65 accessions were classified as salt–alkali tolerant, including 18 that demonstrated strong tolerance. These accessions serve as valuable genetic resources for improving maize performance in saline–alkaline soils and enhancing the utility of marginal lands in northern regions (Supplementary Table S9).

3. Discussion

3.1. Genetic Diversity Analysis of Maize Germplasm Resources

Crop germplasm resources are strategic assets critical to sustainable agriculture, food security, ecological stability, energy development, and seed industry innovation. The development of new germplasm plays a crucial role in increasing maize yield, enhancing stress resistance, and improving agronomic performance. The processes of germplasm collection, preservation, evaluation, and utilization are foundational to the advancement of high-quality maize breeding programs [29].
A comprehensive understanding of the genetic diversity and variation within breeding populations is crucial for their effective conservation and use [30,31]. Genetic diversity assessment is a fundamental step in breeding research, with morphological variation providing an important dimension of genetic differentiation. In tree species, for instance, broader geographic distribution is often correlated with greater genetic and phenotypic variation, particularly in leaf and physiological traits, underscoring the environmental impact on genetic structure [32].
Studies have consistently demonstrated that landraces exhibit significantly greater genetic diversity compared with elite cultivars [33,34]. Maize, having undergone extensive domestication and selection across diverse regions, has accumulated considerable genetic variation. However, a relatively narrow genetic base continues to constrain modern maize breeding in China. Therefore, expanding and innovating the germplasm base remains a core challenge in maize improvement [35].
Numerous studies have evaluated the genetic diversity and population structure across various maize germplasms, environments, and marker systems. For example, certain landraces contain unexplored genetic variation and selection footprints across regions, much of which remains underutilized in current breeding programs [36,37]. Research on the genetic diversity of 126 inbred lines from the Shaanxi A and Shaanxi B populations revealed six distinct subgroups and low relatedness among lines, reinforcing the genetic richness of regional germplasms [38].
DNA-based molecular markers have proven effective in detecting genetic diversity and in supporting the development of novel cultivars through marker-assisted selection (MAS). This approach not only shortens breeding timelines but also enables targeted selection of parent lines based on their desirable traits. For instance, studies assessing temperate and tropical germplasm have categorized materials based on pedigree, selection history, and endosperm color, providing deeper insight into genetic variation and breeding potential [39,40].
A study on 141 sweet maize inbred lines detected 16,383 high-quality SNP loci, which were grouped into four clusters through PHYLIP analysis, confirming the presence of rich genetic diversity and complex kinship structures [27].
In summary, our findings provide a comprehensive view of the genetic diversity and relationships among 588 maize germplasm resources. This knowledge is valuable for germplasm collection strategies, resource characterization, and breeding innovations in maize.

3.2. Population Structure and Genetic Relationships

Analyzing population structure is a crucial first step in genome-wide association studies (GWAS) and in understanding the genetic landscape of breeding populations. Currently, SNP chip technology is widely applied in the division of heterotic groups in maize. Heterosis, defined as the phenomenon where hybrid offspring exhibit superior performance in traits such as growth vigor, yield, and adaptability compared with their parental lines, serves as a fundamental principle in maize breeding. Previous studies have used this technology to classify different types of maize germplasm into four to seven heterotic groups, which provides an important theoretical basis for maize variety improvement and heterosis utilization [41,42]. In our study, structure analysis, PCA, and kinship assessments revealed the presence of six subpopulations within the maize collection, highlighting significant genetic differentiation. Such insights are essential for guiding genetic improvement strategies and understanding evolutionary relationships among accessions.
Genetic differentiation among populations is typically evaluated using fixation indices such as FST [43,44]. Previous studies on 269 widely used temperate maize inbred lines in China, dating back to the 1970s, revealed historical shifts in hybrid utilization patterns and diversity [45]. Similarly, Dube et al. used Evanno’s method to determine an optimal K value of 3, identifying three distinct subgroups [46]. A broader population analysis integrating 237 germplasm samples from seven taxonomic groups and 507 inbred lines produced a high-resolution variation map, offering new insights into maize systematics and genetic diversity [47].
In our PCA, the first and second principal components explained 6.619% and 4.461% of the variance, respectively, accounting for a combined 10.77% of the total variance. STRUCTURE software analysis confirmed K = 6 as the most likely number of subpopulations, consistent with the results of hierarchical clustering. The average GSC among accessions was 0.5595. These findings indicate that the selected maize resources possess unique genetic backgrounds and can serve as a valuable foundation for understanding genetic structure, evolutionary history, and guiding selective breeding efforts.

3.3. Linking Genotype and Phenotype in a Newly Constructed Maize Core Collection

While large-scale germplasm collections offer abundant raw material for genetic research and breeding, their sheer size poses challenges in management, evaluation, and practical utilization. To address this, the concept of a “core collection” was introduced, providing a targeted strategy to reduce redundancy while preserving genetic diversity [48,49,50,51].
Since its inception, the core collection framework has been extensively studied worldwide, with efforts focused on construction methodologies, sampling strategies, and analytical techniques to ensure representativeness. Recent advancements in molecular marker technologies, such as SSRs (Simple Sequence Repeats), SNPs, and resequencing, have significantly enhanced the accuracy of core collection assembly by enabling precise assessments of genetic diversity [52,53,54,55]. This approach has already been successfully applied in multiple plant species [56,57,58]. Well-constructed core collections can capture more than 90% of a germplasm bank’s allelic diversity while representing just 10–20% of its accessions [59,60,61,62].
In our study, we integrated phenotypic diversity, genotypic variation, and trait associations to construct a comprehensive maize core collection. Ultimately, 172 accessions (29.25% of the total) were selected, capturing a broad genetic base. Salt–alkali tolerance was evaluated for all 172 core germplasm accessions during germination: 65 accessions were classified as salt–alkali tolerant, including 18 with strong tolerance. These accessions serve as invaluable genetic resources for improving maize performance in saline–alkaline soils, particularly for enhancing the utilization of marginal lands in northern regions. They also represent prime breeding material for gene function studies, marker-assisted selection (MAS), and new variety development. The establishment of this unified core collection provides a practical model for balancing genetic diversity with efficient germplasm use in maize breeding, offering both theoretical insights into genetic conservation and tangible applications for crop improvement.

4. Materials and Methods

4.1. Plant Materials

This study involved 588 maize (Zea mays L.) inbred lines planted in Heilongjiang Province, China, during the 2023 and 2024 growing seasons. Field traits were assessed from August to September each year. The accessions were collected from diverse sources, including 148 lines from northern maize breeding institutions, 36 European germplasm resources, 34 self-developed inbred lines, and 370 parental inbred lines of commercially important hybrids from eastern and northern China (Figure 7) (Supplementary Table S10).

4.2. Phenotypic and Physiological Trait Assessment

All materials were planted at the experimental base of Northeast Agricultural University, located in Xiangyang Township, Xiangfang District, Harbin. Plots consisted of three rows per line, with each row measuring 5 m. For each inbred line, morphological data were collected from 10 replicate plants in each experimental plot. The field trials were conducted in two replications, following standard agronomic practices. Three plants were randomly selected from the middle row of each plot to assess phenological stages. According to the standard process for maize phenotype identification, 19 indicators, including plant architecture, ear morphology, grain characteristics, and yield-related traits, totaling 19 phenotypic indicators [46]. These traits were quantitatively evaluated across both growing seasons.
Trait means were tested for normal distribution and visualized through distribution plots. Correlation analyses among traits were conducted using R 4.3.1. A weighted affiliation function was used to evaluate the 19 traits across all accessions. Microsoft Excel 2016 was used to record data and compute mean values. Trait standardization was performed using Z-scores in SPSS v20.0 to ensure comparability across traits. One-way analysis of variance (ANOVA) and Duncan’s new multiple range test were performed in SPSS v20.0 (IBM, Armonk, NY, USA), with statistical significance set at p < 0.05. All results are presented as mean ± standard deviation (SD), based on three biological replicates.
Data visualizations were generated using Origin Pro 2019 (OriginLab Corporation, Northampton, MA, USA), RStudio v3.6.1, and GraphPad Prism 8.0.0 to ensure clarity and reproducibility.

4.3. Construction of Core Collection Based on Phenotypic Data

Core collection construction based on phenotypic traits was conducted using QGA Station 2.0. Euclidean distance was selected as the similarity metric, with an initial sampling ratio of 10% applied. Random sampling was combined with eight distinct systematic clustering approaches, including shortest distance, longest distance, intermediate distance, centroid, class averaging, variable class averaging, flexible method, and sum-of-squares of deviations. Methods were evaluated based on their ability to represent the full diversity of the original population [63,64,65]. Subsequently, core collections were generated using Euclidean distance with five different sampling ratios (10%, 15%, 20%, 25%, and 30%). These were assessed using different combinations of sampling methods and clustering algorithms [55]. The best-performing methods were selected based on a combination of statistical metrics. The mean difference (MD) was less than 20%, with smaller values indicating better performance. The coincidence rate of range difference (CR), variance difference (VD), and coefficient of variation (VR) had greater genetic representativeness, the higher their value.
Student’s t-tests and F-tests were used to compare the core collection with the original population in terms of trait means and variances. A higher variance, greater coefficient of variation, and minimal mean differences were taken as indicators of an optimally representative core set.

4.4. DNA Extraction and SNP Genotyping

Ten seeds were selected from each germplasm and planted in the seedling tray. Genomic DNA was extracted from fresh leaf tissue using a plant genomic DNA extraction kit (Tiangen, Beijing, China). DNA quality and concentration were confirmed by 1% agarose gel electrophoresis [20].
Genotyping of the 588 maize inbred lines was performed using the Maize6H-60K chip. Genotyping was outsourced to Heilongjiang Kenfeng Agricultural Technology Co., Ltd. (Harbin, China). A total of 61,214 SNP loci were initially identified. Using TASSEL 5.0 software, these were filtered based on a minimum allelic variation frequency (MAF) > 0.01 and missing data proportion (PMS) < 0.01, yielding 7439 high-quality SNPs for downstream analysis. These thresholds were chosen to ensure that rare alleles and loci of poor quality were excluded according to previous criteria.

4.5. Genetic Diversity Analysis

Genotypic data were analyzed, and genetic distances among maize germplasm resources were estimated using TASSEL software [66]. The expected heterozygosity (He) and observed heterozygosity (Ho) estimate the probability of heterozygosity at a given locus. Lastly, Shannon’s Information Index (I) quantified overall genetic diversity.
All calculations were performed using GenAlEx v6.501 software [67]. Additional parameters, including gene diversity (D) and allele number (N), were analyzed using Power Marker v3.25 [68].
Cluster analysis was performed using the Neighbor-Joining method, and phylogenetic trees were constructed in MEGA 7. A genetic distance heatmap and pairwise similarity matrix were generated using R, which also calculated SNP density and generated corresponding visualizations [26,69].

4.6. Population Structure and PCA

Population structure was inferred using a Bayesian model-based clustering algorithm in STRUCTURE v2.3.4 [70]. The admixture model was applied with a burn-in period of 100,000 iterations followed by 100,000 Markov Chain Monte Carlo (MCMC) replications. Values of K (the assumed number of subpopulations) ranging from 1 to 10 were tested across three independent runs. The optimal number of subpopulations was determined using the ΔK method via STRUCTURE HARVESTER [71,72].
PCA was conducted using TASSEL v5.0 based on the 7439 high-quality SNP markers [66].

4.7. Core Collection Construction

To develop the core collection, the genetic distance matrix was input into Core-Hunter 3. Sampling was conducted at 16 different proportions (5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, and 80%) to control sampling density and assess representativeness [73,74].
The final core collection was established by combining selections based on both genotypic and phenotypic data. Accessions selected solely by phenotypic analysis but not present in the molecular-based subset were added to ensure full representation. Germplasms not included in the final selection were designated as the reserve set [55].

4.8. Evaluation of Salinity Tolerance in the Core Collection

To evaluate salt–alkali tolerance, seeds from the core collection were soaked for six hours in a mixed saline–alkaline solution containing 150 mmol/L of sodium ions. The solution was prepared using a 1:9:9:1 ratio of NaCl, Na2SO4, NaHCO3, and Na2CO3.
Post-soaking, seeds were transferred to filter paper within culture dishes for germination. Germination potential was assessed on Day 4, while germination rate and seedling length were measured on Day 7 [75].

5. Conclusions

Population structure analysis, principal component analysis, and cluster analysis divided 588 accessions into six subgroups. A core collection of 172 accessions was selected based on both phenotypic and genotypic data. Among them, 65 accessions were identified as salt–alkali tolerant, including 18 with strong tolerance during the germination stage.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants14142182/s1, Supplementary Figure S1: Frequency distribution of 19 phenotypic traits. Supplementary Table S1: Percentage differences between core collections and the initial population constructed using different clustering methods. Supplementary Table S2: Differences between core collections and the initial population constructed using different sampling proportions. Supplementary Table S3: Summary of core collection construction methods. Supplementary Table S4: Percentage differences between core collections and the initial population constructed using different construction methods. Supplementary Table S5: Comparison of differences between core collections and the initial population constructed using different construction methods. Supplementary Table S6: Principal component analysis between the core collections and the initial collections. Supplementary Table S7: The genetic diversity indices between 172 core collection and initial collection. Supplementary Table S8: Comprehensive evaluation and ranking of 172 maize core germplasm accessions for salinity tolerance at the germination stage. Supplementary Table S9: Strong salinity-tolerant accessions identified from the core germplasm collection during the germination-stage screening. Supplementary Table S10: Names of 588 Chinese maize germplasm resources and their heterosis group classification.

Author Contributions

The conception or design of the work: C.L. and H.D. Performed the experiment by Y.S., J.W. (Jiahao Wang) and Y.M. Analyzed the data by J.W. (Jiaxin Wang)., L.B., H.L. and J.F., Provided writing ideas by L.Z., Y.Z., X.Z., D.S. and Z.W. Wrote the manuscript by Y.S. and J.W. (Jiahao Wang). All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Key Research and Development Project of Heilongjiang Province (2022ZX02B01), the Program for Young Talents of Basic Research in Universities of Heilongjiang Province (YQJH2024021), and Open Project Funding from the Key Laboratory of Germplasm Enhancement, Physiology, and Ecology of Food Crops in Cold Regions, Ministry of Education (CXSTOP202405).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Rouf Shah, T.; Prasad, K.; Kumar, P.; Yildiz, F. Maize-A potential source of human nutrition and health: A review. Cogent Food Agric. 2016, 2, 1166995. [Google Scholar] [CrossRef]
  2. Mao, G.; Zhang, D.; Mao, D.; Wang, F.; Dai, Q.; Wang, Z.; Feng, N.; Wang, C. Research Progress and Prospect of Saline-alkali Tolerant Rice. Hybrid Rice 2024, 39, 1–10. [Google Scholar]
  3. Yuan, L. The Strategy for Hybrid Rice Developmen. Hybrid Rice 2018, 33, 1–2. [Google Scholar]
  4. Li, C.L.; Li, J.L.; Lin, H.; Zhang, X.M.; Wang, Y.; Liu, S.T. Analysis on genetic diversity of maize germplasm. Norhteast Agric. Univ. 2010, 41, 6–14. [Google Scholar]
  5. Duan, C.-X.; Dong, H.-Y.; Li, X.; Li, H.; Li, C.-H.; Sun, S.-L.; Zhu, Z.-D.; Wang, X.-M. A large-scale screening of maize germplasm for resistance to multiple diseases in multiplot demonstration for several years under natural condition. Crops J. 2020, 46, 1135–1145. [Google Scholar]
  6. Li, S.; Li, H.; Li, X.; Zhang, C.; Li, X.; Jiang, L.; Li, X. Genetic Diversity Analysis of Phenotypic Traits and Selection of Superior Germplasm in 205 Maize Germplasm Resources. J. Maize Sci. 2023, 31, 1–10. [Google Scholar]
  7. Campbell, D.R. Using phenotypic manipulations to study multivariate selection of floral trait associations. Ann. Bot. 2009, 103, 1557–1566. [Google Scholar] [CrossRef]
  8. Li, Y.; Shi, Y.; Cao, Y.; Wang, T. Establishment of a core collection for maize germplasm preserved in Chinese National Genebank using geographic distribution and characterization data. Genet. Resour. Crop Evol. 2005, 51, 845–852. [Google Scholar] [CrossRef]
  9. Frascaroli, E.; Schrag, T.A.; Melchinger, A.E. Genetic diversity analysis of elite European maize (Zea mays L.) inbred lines using AFLP, SSR, and SNP markers reveals ascertainment bias for a subset of SNPs. Theor. Appl. Genet. 2013, 126, 133–141. [Google Scholar] [CrossRef]
  10. Lu, M.; Zhang, Z.; Zheng, S.; Yue, Y.; Liu, H.; Wang, M.; Liu, W. Genetic Analysis on Jidan 50 Based on SNP Marker in Maize. J. Maize Sci. 2016, 24, 1–7. [Google Scholar]
  11. Frankel, O. Genetic perspectives of germplasm conservation. Genet. Manip. Impact Man Soc. 1984, 161, 161–170. [Google Scholar]
  12. Dong, X.; Li, S.; Yang, H.; Zhang, P.; Li, S.; Qi, Z.; Yu, X.; Yi, H.; Fu, Z. Phenotypic diversity analysis of local varieties of maize in Chongqing. J. Plant Genet. Resour. 2019, 20, 861–870. [Google Scholar]
  13. Malik, H.N.; Malik, S.I.; Hussain, M.; Chughtai, S.R.; Javed, H.I. Genetic correlation among various quantitative characters in maize (Zea mays L.) hybrids. J. Agric. Soc. Sci. 2005, 3, 262–265. [Google Scholar]
  14. Li, Y.; Shi, Y.; Cao, Y.; Wang, T. A phenotypic diversity analysis of maize germplasm preserved in China [Zea mays L.]. Maydica 2002, 47, 107–114. [Google Scholar]
  15. Cai, Y.; Liu, Z.; Wang, T.; Li, Y.; Tan, H.; Wang, G.; Sun, H.; Wang, J. Phenotypic diversity analysis of quality and agronomic traits in some local varieties of maize in China. J. Plant Genet. Resour. 2011, 12, 31–36. [Google Scholar]
  16. Balfourier, F.; Bouchet, S.; Robert, S.; De Oliveira, R.; Rimbert, H.; Kitt, J.; Choulet, F.; Consortium, I.W.G.S.; Consortium, B.; Paux, E. Worldwide phylogeography and history of wheat genetic diversity. Sci. Adv. 2019, 5, eaav0536. [Google Scholar] [CrossRef]
  17. Kumar, A.; Kumar, S.; Singh, K.B.; Prasad, M.; Thakur, J.K. Designing a mini-core collection effectively representing 3004 diverse rice accessions. Plant Commun. 2020, 1, 100049. [Google Scholar] [CrossRef]
  18. Su, W.; Wang, L.; Lei, J.; Chai, S.; Liu, Y.; Yang, Y.; Yang, X.; Jiao, C. Genome-wide assessment of population structure and genetic diversity and development of a core germplasm set for sweet potato based on specific length amplified fragment (SLAF) sequencing. PLoS ONE 2017, 12, e0172066. [Google Scholar] [CrossRef]
  19. Bai, Q.; Cai, Y.; He, B.; Liu, W.; Pan, Q.; Zhang, Q. Core set construction and association analysis of Pinus massoniana from Guangdong province in southern China using SLAF-seq. Sci. Rep. 2019, 9, 13157. [Google Scholar] [CrossRef]
  20. Guo, Q.; Liu, J.; Li, J.; Cao, S.; Zhang, Z.; Zhang, J.; Zhang, Y.; Deng, Y.; Niu, D.; Su, L. Genetic diversity and core collection extraction of Robinia pseudoacacia L. germplasm resources based on phenotype, physiology, and genotyping markers. Ind. Crops Prod. 2022, 178, 114627. [Google Scholar] [CrossRef]
  21. Li, X.; Cui, L.; Zhang, L.; Huang, Y.; Zhang, S.; Chen, W.; Deng, X.; Jiao, Z.; Yang, W.; Qiu, Z. Genetic diversity analysis and core germplasm collection construction of radish cultivars based on structure variation markers. Int. J. Mol. Sci. 2023, 24, 2554. [Google Scholar] [CrossRef] [PubMed]
  22. Magorokosho, C. Genetic Diversity and Performance of Maize Varieties from Zimbabwe. Ph.D. Thesis, Texas A&M University, College Station, TX, USA, 2006. [Google Scholar]
  23. Wang, C.; Chen, J.; Pang, Z.; Li, N.; Dong, H.; Li, D. The Response and Screening of Germplasm Tolerant to Mixed Saline-Alkali Stress in Sweet Sorghum. Crops 2016, 1, 56–61. [Google Scholar]
  24. Xu, X.; Sun, F.; Xiao, M.; Zhang, R.; Zhou, Y.; Huang, R. Screening and identification of salt tolerance during the germination period of sorghum varieties. Seed 2020, 39, 6–11. [Google Scholar]
  25. Ren, H.; Zhang, F.; Han, X.; Hong, H.; Zhu, X.; Wang, G.; Qiu, L. Drought Tolerance Evaluation of Soybean Mini Core Collection. Crop 2023, 6, 94–100. [Google Scholar] [CrossRef]
  26. Gao, S.; Yan, X.; Wang, N.; Zhang, M.; Li, Y.; Diao, W.; Duan, H. Genetic Diversity Analysis of 255 MaizeGermplasm ResourcesBased on 10K SNP Chip. J. Agric. Sci. Technol. 2024, 26, 20–33. [Google Scholar]
  27. Chen, J.; Zhang, H.; Wang, T.; Li, X.; Wu, Z.; Lv, G. Genetic Diversity and Population Genetic Structure Analysis of Sweet CornInbred Lines. Mol. Plant Breed. 2022, 20, 1547–1549. [Google Scholar]
  28. Hu, J.; Zhou, D.; Guan, S.; Zhang, D. The Improvement and Utilization of Bare Saline-alkali Soil in the Songnen Plain by Sand-coverin. Chin. Agric. Sci. Bull. 2021, 37, 85–94. [Google Scholar]
  29. Boczkowska, M.Ł.B.; Kordulasińska, I.; Dostatny, D.F.; Czembor, J.H. Promoting the use of common oat genetic resources through diversity analysis and core collection construction. PLoS ONE 2016, 11, e0167855. [Google Scholar] [CrossRef]
  30. Basey, A.C.; Fant, J.B.; Kramer, A.T. Producing native plant materials for restoration: 10 rules to collect and maintain genetic diversity. Nativ. Plants J. 2015, 16, 37–53. [Google Scholar] [CrossRef]
  31. Turchetto, C.; Segatto, A.L.A.; Mäder, G.; Rodrigues, D.M.; Bonatto, S.L.; Freitas, L.B. High levels of genetic diversity and population structure in an endemic and rare species: Implications for conservation. AoB Plants 2016, 8, plw002. [Google Scholar] [CrossRef]
  32. Jiang, R.; Liu, J.; Jiang, J.; Kong, C.; Li, Y. Geographical variation in main phenotypic traits and seedling traits of Machilus thunbergii. J. Northeast For. Univ. 2011, 39, 9–11. [Google Scholar]
  33. Wingen, L.U.; West, C.; Leverington-Waite, M.; Collier, S.; Orford, S.; Goram, R.; Yang, C.-Y.; King, J.; Allen, A.M.; Burridge, A. Wheat landrace genome diversity. Genetics 2017, 205, 1657–1676. [Google Scholar] [CrossRef] [PubMed]
  34. Moore, G. Strategic pre-breeding for wheat improvement. Nat. Plants 2015, 1, 15018. [Google Scholar] [CrossRef]
  35. Li, M.; Xie, C.; Zhang, S. Technical approaches and strategies to improve the efficiency of maize breeding. Crops 2007, 1, 4–7. [Google Scholar]
  36. Carolina, S.; Jorge, F.; Bruno, S.; Lawrence, P.; Sukhwinder, S.; Cesar, P.; Jaime, C.; Kate, D.; Thomas, P.; David, M.; et al. Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints. Nat. Commun. 2020, 11, 4572. [Google Scholar]
  37. Wingen, L.U.; Orford, S.; Goram, R.; Leverington-Waite, M.; Bilham, L.; Patsiou, T.S.; Ambrose, M.; Dicks, J.; Griffiths, S. Establishing the A. E. Watkins landrace cultivar collection as a resource for systematic gene discovery in bread wheat. TAG. Theoretical and applied genetics. Theor. Und Angew. Genet. 2014, 127, 1831–1842. [Google Scholar] [CrossRef]
  38. Li, T.; Qu, J.; Wang, Y.; Chang, L.; He, K.; Guo, D.; Zhang, X.; Xu, S.; Xue, J. Genetic characterization of inbred lines from Shaan A and B groups for identifying loci associated with maize grain yield. BMC Genet. 2018, 19, 63. [Google Scholar] [CrossRef]
  39. Boakyewaa Adu, G.; Badu-Apraku, B.; Akromah, R.; Garcia-Oliveira, A.L.; Awuku, F.J.; Gedil, M. Genetic diversity and population structure of early-maturing tropical maize inbred lines using SNP markers. PLoS ONE 2019, 14, e0214810. [Google Scholar] [CrossRef]
  40. Luqman, T.; Qamar, Z.-u.; Tabasum, A.; El-Kallawy, W.H.; Nazir, T.; Attacha, S.; Fiaz, S.; Nadeem, M.A.; Hameed, A.; Maryum, Z. Genetic characterization of coarse and basmati rice (Oryza sativa L.) through microsatellite markers and morpho-agronomic traits. Genet. Resour. Crop Evol. 2023, 70, 2307–2320. [Google Scholar] [CrossRef]
  41. Fan, J.; Xu, H.; Xu, X.; Zhang, H.; Chen, W.; Qi, L.; Jiang, Y. Exploring the Genetic Diversity and Population Structure of Guangxi Waxy Maize Landraces Based on SNP Markers. J. Plant Genet. Resour. 2023, 24, 661–670. [Google Scholar]
  42. Wang, J.; Song, W.; Wang, R.; Tang, H.; Li, X.; Wang, F.; Zhao, J.; Wang, R. Heterotic Grouping of 51 Maize Inbred Lines by SNP Markers. J. Maize Sci. 2014, 22, 29–34. [Google Scholar]
  43. Tehseen, M.M.; Istipliler, D.; Kehel, Z.; Sansaloni, C.P.; da Silva Lopes, M.; Kurtulus, E.; Muazzam, S.; Nazari, K. Genetic diversity and population structure analysis of Triticum aestivum L. landrace panel from Afghanistan. Genes 2021, 12, 340. [Google Scholar] [CrossRef] [PubMed]
  44. Luo, Z.; Brock, J.; Dyer, J.M.; Kutchan, T.; Schachtman, D.; Augustin, M.; Ge, Y.; Fahlgren, N.; Abdel-Haleem, H. Genetic diversity and population structure of a Camelina sativa spring panel. Front. Plant Sci. 2019, 10, 184. [Google Scholar] [CrossRef] [PubMed]
  45. Zhang, R.; Xu, G.; Li, J.; Yan, J.; Li, H.; Yang, X. Patterns of genomic variation in Chinese maize inbred lines and implications for genetic improvement. Theor. Appl. Genet. 2018, 131, 1207–1221. [Google Scholar] [CrossRef] [PubMed]
  46. Dube, S.P.; Sibiya, J.; Kutu, F. Genetic diversity and population structure of maize inbred lines using phenotypic traits and single nucleotide polymorphism (SNP) markers. Sci. Rep. 2023, 13, 17851. [Google Scholar] [CrossRef]
  47. Chen, L.; Luo, J.; Jin, M.; Yang, N.; Liu, X.; Peng, Y.; Li, W.; Phillips, A.; Cameron, B.; Bernal, J.S. Genome sequencing reveals evidence of adaptive variation in the genus Zea. Nat. Genet. 2022, 54, 1736–1745. [Google Scholar] [CrossRef]
  48. Brown, A. Core collections: A practical approach to genetic resources management. Genome 1989, 31, 818–824. [Google Scholar] [CrossRef]
  49. Charmet, G.; Balfourier, F.; Ravel, C.; Denis, J.-B. Genotype x environment interactions in a core collection of French perennial ryegrass populations. Theor. Appl. Genet. 1993, 86, 731–736. [Google Scholar] [CrossRef]
  50. Xu, J.; Wang, L.; Wang, H.; Mao, C.; Kong, D.; Chen, S.; Zhang, H.; Shen, Y. Development of a core collection of six-rowed hulless barley from the Qinghai-Tibetan plateau. Plant Mol. Biol. Rep. 2020, 38, 305–313. [Google Scholar] [CrossRef]
  51. Liu, J.; Gao, S.; Xu, Y.; Wang, M.; Ngiam, J.J.; Rui Wen, N.C.; Yi, J.J.J.; Weng, X.; Jia, L.; Salojärvi, J. Genetic diversity analysis of Sapindus in China and extraction of a core germplasm collection using EST-SSR markers. Front. Plant Sci. 2022, 13, 857993. [Google Scholar] [CrossRef]
  52. Sun, Q.; Bai, L.; Ke, L.; Xiang, X.; Zhao, J.; Ou, L. Developing a core collection of litchi (Litchi chinensis Sonn.) based on EST-SSR genotype data and agronomic traits. Sci. Hortic. 2012, 146, 29–38. [Google Scholar] [CrossRef]
  53. Hoshikawa, K.; Lin, Y.-P.; Schafleitner, R.; Shirasawa, K.; Isobe, S.; Nguyen, D.C.; Ohsawa, R.; Yoshioka, Y. Genetic diversity analysis and core collection construction for Amaranthus tricolor germplasm based on genome-wide single-nucleotide polymorphisms. Sci. Hortic. 2023, 307, 111428. [Google Scholar] [CrossRef]
  54. Zhu, S.; Niu, E.; Wang, W.; Shi, A. Identification and evaluation of SNP core loci for olive germplasm (Olea europaea L.). Mol. Plant Breed. 2020, 18, 1548–1557. [Google Scholar]
  55. Chen, X.; Liu, Y.-Y.; Zheng, F.-S.; Cheng, G.-X.; Guo, M.; Li, J.-S.; Wang, X.-M. Construction of a core collection of tomato (Solanum lycopersicum) germplasm based on phenotypic traits and SNP markers. Sci. Hortic. 2025, 339, 113855. [Google Scholar] [CrossRef]
  56. Xue, H.; Yu, X.; Fu, P.; Liu, B.; Zhang, S.; Li, J.; Zhai, W.; Lu, N.; Zhao, X.; Wang, J. Construction of the Core Collection of Catalpa fargesii f. duclouxii (Huangxinzimu) based on molecular markers and phenotypic traits. Forests 2021, 12, 1518. [Google Scholar] [CrossRef]
  57. Le, L.; Yang, X.; Xie, X.; Zhang, W.; Wang, G.; Cao, F. Construction of the core germplasm of yellowhorn (Xanthoceras sorbifolium Bunge) using physiological traits and SSR markers. Sci. Hortic. 2024, 323, 112556. [Google Scholar] [CrossRef]
  58. Lee, H.-Y.; Ro, N.-Y.; Jeong, H.-J.; Kwon, J.-K.; Jo, J.; Ha, Y.; Jung, A.; Han, J.-W.; Venkatesh, J.; Kang, B.-C. Genetic diversity and population structure analysis to construct a core collection from a large Capsicum germplasm. BMC Genet. 2016, 17, 142. [Google Scholar] [CrossRef]
  59. Wu, X.; Li, Y.; Li, X.; Li, C.; Shi, Y.; Song, Y.; Zheng, Z.; Li, Y.; Wang, T. Analysis of genetic differentiation and genomic variation to reveal potential regions of importance during maize improvement. BMC Plant Biol. 2015, 15, 256. [Google Scholar] [CrossRef]
  60. Zhang, M.; Zhang, R.; Yang, J.; Luo, P.; Zhang, M.; Zhang, R.; Yang, J.; Luo, P. Identification of a new QTL for Fusarium head blight resistance in the wheat genotype “Wang shui-bai”. Mol. Biol. Rep. 2010, 37, 1031–1035. [Google Scholar] [CrossRef]
  61. Li, Z.; Zhang, H.; Zeng, Y.; Yang, Z.; Shen, S.; Sun, C.; Wang, X. Studies on sampling schemes for the establishment of corecollection of rice landraces in Yunnan, China. Genet. Resour. Crop Evol. 2002, 49, 67–74. [Google Scholar] [CrossRef]
  62. Van Treuren, R.; Tchoudinova, I.; Van Soest, L.; van Hintum, T.J. Marker-assisted acquisition and core collection formation: A case study in barley using AFLPs and pedigree data. Genet. Resour. Crop Evol. 2006, 53, 43–52. [Google Scholar] [CrossRef]
  63. Zeuli, P.S.; Qualset, C. Evaluation of five strategies for obtaining a core subset from a large genetic resource collection of durum wheat. Theor. Appl. Genet. 1993, 87, 295–304. [Google Scholar] [CrossRef] [PubMed]
  64. Diwan, N.; McIntosh, M.; Bauchan, G. Methods of developing a core collection of annual Medicago species. Theor. Appl. Genet. 1995, 90, 755–761. [Google Scholar] [CrossRef]
  65. Liu, S.; Zheng, C.; Xiang, W.; Yi, Z.; Xiao, L. A sampling strategy to develop a primary core collection of Miscanthus spp. in China based on phenotypic traits. Agronomy 2022, 12, 678. [Google Scholar] [CrossRef]
  66. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef]
  67. Peakall, R.; Smouse, P.E. GENALEX 6: Genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 2006, 6, 288–295. [Google Scholar] [CrossRef]
  68. Liu, K.; Muse, S.V. PowerMarker: An integrated analysis environment for genetic marker analysis. Bioinformatics 2005, 21, 2128–2129. [Google Scholar] [CrossRef]
  69. Chen, S.; Zhou, Y.; Chen, Y.; Huang, T.; Liao, W.; Xu, Y.; Li, Z.; Gu, J. Gencore: An efficient tool to generate consensus reads for error suppressing and duplicate removing of NGS data. BMC Bioinform. 2019, 20, 606. [Google Scholar] [CrossRef]
  70. Pritchard, J.K.; Stephens, M.; Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 2000, 155, 945–959. [Google Scholar] [CrossRef]
  71. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef]
  72. Earl, D.A.; VonHoldt, B.M. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 2012, 4, 359–361. [Google Scholar] [CrossRef]
  73. Yang, H.; Zhang, R.; Wang, B.; Xu, Z.; Zhou, Z. Construction of core collection of Schima superba based on SSR molecular markers. Sci. Silvae Sin. 2017, 53, 37–46. [Google Scholar]
  74. Beukelaer, H.D.; Smýkal, P.; Davenport, G.F.; Fack, V. Core Hunter II: Fast core subset selection based on multiple genetic diversity measures using Mixed Replica search. BMC Bioinform. 2012, 13, 312. [Google Scholar] [CrossRef] [PubMed]
  75. Hu, Y.; Li, C.; Zhou, R.; Song, Y.; Lv, Z.; Wang, Q.; Dong, X.; Liu, S.; Feng, C.; Zhou, Y. The transcription factor ZmNAC89 gene is involved in salt tolerance in maize (Zea mays L.). Int. J. Mol. Sci. 2023, 24, 15099. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Correlation analysis of the phenotypic traits in maize inbred lines. Note: “*” indicates significance at p < 0.05, “** and ***” indicate significance at p < 0.01. XZ1-XZ19 represent growth period, number of male spike branches, plant height, ear height, spike stalk length, spike length, spike diameter, spike rows, number of grains in rows, spike type, grain length, grain width, grain thickness, grain type, 100-grain weight, total grain per panicle, dry weight per panicle, seed yield, and plot yield.
Figure 1. Correlation analysis of the phenotypic traits in maize inbred lines. Note: “*” indicates significance at p < 0.05, “** and ***” indicate significance at p < 0.01. XZ1-XZ19 represent growth period, number of male spike branches, plant height, ear height, spike stalk length, spike length, spike diameter, spike rows, number of grains in rows, spike type, grain length, grain width, grain thickness, grain type, 100-grain weight, total grain per panicle, dry weight per panicle, seed yield, and plot yield.
Plants 14 02182 g001
Figure 2. PCA distribution map constructed using phenotypic core germplasm. (A) PCA distribution of phenotypic data from the original maize germplasm resources; (B) PCA distribution of phenotypic data from the core maize germplasm resources.
Figure 2. PCA distribution map constructed using phenotypic core germplasm. (A) PCA distribution of phenotypic data from the original maize germplasm resources; (B) PCA distribution of phenotypic data from the core maize germplasm resources.
Plants 14 02182 g002
Figure 3. Genomic SNP density of maize germplasms. (A) Chromosomal position distribution of genotype data; (B) Number of SNPs distributed across each chromosome; (C) PCA scatter plot based on SNP genotype data.
Figure 3. Genomic SNP density of maize germplasms. (A) Chromosomal position distribution of genotype data; (B) Number of SNPs distributed across each chromosome; (C) PCA scatter plot based on SNP genotype data.
Plants 14 02182 g003
Figure 4. Genetic diversity and population structure analysis of maize germplasm resources. (A) Determination of Δ K value; (B) Population structure analysis of maize inbred lines; (C) Cluster analysis of maize inbred lines.
Figure 4. Genetic diversity and population structure analysis of maize germplasm resources. (A) Determination of Δ K value; (B) Population structure analysis of maize inbred lines; (C) Cluster analysis of maize inbred lines.
Plants 14 02182 g004
Figure 5. Principal coordinate analysis comparing the core collection with the original population. (A) The principal coordinate analysis of core collection and other populations, blue represents the core collection, and red represents the other population. (B) The correlation analysis of the phenotypic traits in the core collection. “***” indicate significance at p < 0.01.; (C) The distribution of heterotic groups of initial population and core collection.
Figure 5. Principal coordinate analysis comparing the core collection with the original population. (A) The principal coordinate analysis of core collection and other populations, blue represents the core collection, and red represents the other population. (B) The correlation analysis of the phenotypic traits in the core collection. “***” indicate significance at p < 0.01.; (C) The distribution of heterotic groups of initial population and core collection.
Plants 14 02182 g005
Figure 6. Cluster analysis of core collection accessions for salinity tolerance at the germination stage. Group I: the highly salt–alkaline-tolerant accessions; Group II: the salt–alkaline-tolerant accessions; Group III: the medium salt–alkaline-tolerant accessions; Group IV: the salt–alkaline-sensitive accessions; Group V: the high salt–alkaline-sensitive accessions.
Figure 6. Cluster analysis of core collection accessions for salinity tolerance at the germination stage. Group I: the highly salt–alkaline-tolerant accessions; Group II: the salt–alkaline-tolerant accessions; Group III: the medium salt–alkaline-tolerant accessions; Group IV: the salt–alkaline-sensitive accessions; Group V: the high salt–alkaline-sensitive accessions.
Plants 14 02182 g006
Figure 7. Distribution of maize germplasm resources across China.
Figure 7. Distribution of maize germplasm resources across China.
Plants 14 02182 g007
Table 1. Statistical analysis and genetic diversity index of 19 phenotypic traits.
Table 1. Statistical analysis and genetic diversity index of 19 phenotypic traits.
TraitAverage ValueMedianVarianceSkewnessKurtosisMinimumMaximumAverage ValueStandard DeviationCoefficientGenetic Diversity Index
growth period 14815343.183−1.592.03234123148.476.5714.43%0.728
tassel branch number7618.610.9931.4242707.784.31455.45%1.204
plant height203.3572212.33821.345−0.083−0.094166.66110.67203.3628.65914.09%1.556
ear position height76.216275390.3710.084−0.38106.6725.3376.2219.75825.92%1.673
ear stem length8.946814.680.5650.8522308.953.83242.82%0.812
ear length16.566317.68.6160.1010.56323.25.4316.572.93517.71%1.005
ear thickness4.58244.60.265−0.0511.2873.982.424.580.51511.24%1.566
ear row number14146.4170.5460.795161014.82.53317.11%1.41
row grain number 272735.8640.2360.23737927.055.98922.14%0.881
ear type220.6920.281−0.574312.120.83239.25%1.205
grain length10.24310.481.664−0.1460.5248.836.2710.241.2912.60%1.519
grain width8.63088.70.880.1040.3895.945.498.630.93810.87%1.283
grain thickness4.940450.3990.4871.1965.52.54.940.63212.79%0.827
grain type220.5380.262−1.109211.840.73339.84%1.052
hundred grain weight28.597132.7333.5850.0470.32337.8712.128.65.79520.26%1.597
total grain per ear99.1884117.331620.0060.8131.88275.51299.1940.24940.58%1.441
dry weight per ear125.4884140.332542.5620.8862.683397.6719125.4950.42440.18%1.4
seed emergence rate0.79220.840.006−1.5974.5590.530.40.790.0749.37%1.059
plot yield1546.99881056766819.9161.1361.6325310.991481547875.68356.61%1.178
Table 2. Principal component analysis of phenotypic traits.
Table 2. Principal component analysis of phenotypic traits.
IndicesComponent 1Component 2Component 3Component 4Component 5
growth period 0.1660.2850.241−0.033−0.682
tassel branch number0.0690.1910.492−0.2670.283
plant height0.7030.0340.169−0.3280.155
ear position height0.5130.0750.35−0.40.51
ear stem length0.2920.048−0.252−0.197−0.052
ear length0.6890.185−0.189−0.422−0.074
ear thickness0.6740.2540.4160.22−0.074
ear row number0.44−0.2910.5960.225−0.152
row grain number 0.678−0.252−0.218−0.367−0.087
ear type−0.249−0.2190.3550.4580.185
grain length0.639−0.12−0.0030.40.223
grain width0.0130.702−0.2330.1680.193
grain thickness−0.1070.614−0.030.184−0.058
grain type0.406−0.4140.0950.25−0.019
hundred grain weight0.4410.461−0.2870.4270.272
total grain per ear0.871−0.071−0.180.1770.029
dry weight per ear0.8730.059−0.0740.163−0.03
seed emergence rate0.024−0.554−0.4860.0990.233
plot yield0.747−0.094−0.2040.13−0.345
Eigenvalue5.3772.0151.7151.5521.248
Contribution (%)28.30210.6049.0298.1676.566
Cumulative contribution28.30238.90647.93556.10162.667
Table 3. Percentage difference between the core collections and the initial population constructed using different sampling proportions.
Table 3. Percentage difference between the core collections and the initial population constructed using different sampling proportions.
Construction ProportionMean Difference PercentagePercentage of Variance DifferenceCoincidence Rate of RangeVariable Rate of Coefficient of Variation
10%5.2678.9591.92127.74
15%5.2678.9593.71121.3
20%5.2678.9596.49118.28
25%5.2673.6896.92116.34
30%10.5373.6896.92114.93
Table 4. Germplasm pairs with the largest and smallest genetic similarity coefficients.
Table 4. Germplasm pairs with the largest and smallest genetic similarity coefficients.
Germplasm 1Germplasm 2GSCGermplasm 1Germplasm 2GSC
T1061960.3366Dan6263C2601.0000
D5801Jiuyi1150.3368M60P22371.0000
zheng58Jiuyi1150.337A22Jiang1341.0000
5311Jiuyi1150.337241125022 (B)1.0000
Qing795Jiuyi1150.33744112Ji41121.0000
Jing388Jiuyi1150.345022 (B)Ji41121.0000
Ming84Jiuyi1150.34084112N528-1 (1284)1.0000
T106KWCB10.3425022 (B)N528-1 (1284)1.0000
T106Yuanfuhuang0.3424Ji4112N528-1 (1284)1.0000
P138Jiuyi1150.3425Mo113M01131
Note: GSC = Genetic similarity coefficient.
Table 5. Comparison of genetic parameters between the core collection and the initial population under different sampling proportions.
Table 5. Comparison of genetic parameters between the core collection and the initial population under different sampling proportions.
Subset (%)Number of CultivarsShannon’s Information IndexCorrect_Shannon’s Information IndexHoHe
Core_5480.60090.15520.01530.4142
Core_10960.57880.12680.03320.3953
Core_151450.58870.11830.0280.4036
Core_201930.58440.11110.0380.4001
Core_252420.58670.10690.03620.4019
Core_302900.58750.10360.03510.4025
Core_353380.58720.10080.02910.4024
Core_403870.58480.09810.03140.4004
Core_454340.59340.09770.03250.4075
Core_504830.58620.09480.02870.4015
Core_555320.58570.09330.03610.401
Core_605800.5850.09190.03240.4005
Core_656280.58920.09140.03120.404
Core_706770.58560.08990.03180.401
Core_757240.58710.08910.03420.4022
Core_807730.58620.08810.0320.4015
Core_1005880.58680.08530.03210.402
Note: Ho = observed heterozygosity; He = expected heterozygosity.
Table 6. Number of phenotypic and genotypic accessions included in the core collection.
Table 6. Number of phenotypic and genotypic accessions included in the core collection.
Final Core CollectionPhenotypic TraitsSNP Marker
Reserved NumberPercentage (%)Reserved NumberPercentage (%)Reserved NumberPercentage (%)
17229.2511719.908814.97
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Song, Y.; Wang, J.; Ma, Y.; Wang, J.; Bao, L.; Sun, D.; Lin, H.; Fan, J.; Zhou, Y.; Zeng, X.; et al. Genetic Diversity and Construction of Salt-Tolerant Core Germplasm in Maize (Zea mays L.) Based on Phenotypic Traits and SNP Markers. Plants 2025, 14, 2182. https://doi.org/10.3390/plants14142182

AMA Style

Song Y, Wang J, Ma Y, Wang J, Bao L, Sun D, Lin H, Fan J, Zhou Y, Zeng X, et al. Genetic Diversity and Construction of Salt-Tolerant Core Germplasm in Maize (Zea mays L.) Based on Phenotypic Traits and SNP Markers. Plants. 2025; 14(14):2182. https://doi.org/10.3390/plants14142182

Chicago/Turabian Style

Song, Yongfeng, Jiahao Wang, Yingwen Ma, Jiaxin Wang, Liangliang Bao, Dequan Sun, Hong Lin, Jinsheng Fan, Yu Zhou, Xing Zeng, and et al. 2025. "Genetic Diversity and Construction of Salt-Tolerant Core Germplasm in Maize (Zea mays L.) Based on Phenotypic Traits and SNP Markers" Plants 14, no. 14: 2182. https://doi.org/10.3390/plants14142182

APA Style

Song, Y., Wang, J., Ma, Y., Wang, J., Bao, L., Sun, D., Lin, H., Fan, J., Zhou, Y., Zeng, X., Wang, Z., Zhang, L., Li, C., & Di, H. (2025). Genetic Diversity and Construction of Salt-Tolerant Core Germplasm in Maize (Zea mays L.) Based on Phenotypic Traits and SNP Markers. Plants, 14(14), 2182. https://doi.org/10.3390/plants14142182

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop