1. Introduction
Garlic (
Allium sativum L.) is an herbaceous plant belonging to the
Allium genus of the
Liliaceae family. It is not only an important condiment vegetable [
1] but also a valuable medicinal plant [
2,
3]. Previous studies have shown that garlic originated in Central Asia. Its ancestor was domesticated, and then introduced to the Mediterranean Basin, India, and China before spreading to other parts of the world [
4]. During the Han Dynasty of China, garlic was introduced to the Guanzhong region of China and subsequently cultivated on a large scale. Today, garlic cultivation has expanded to all continents except Antarctica [
5]. As a typical vegetable with both medicinal and nutritional value, the bulb is the main edible organ of garlic. In addition to rich vitamin, carbohydrate, and mineral elements, garlic contains allicin, an important functional component, and can also accumulate high levels of selenium and germanium. These properties give garlic important medical and health effects, such as anti-bacterial, anti-inflammatory, blood sugar- and lipid-lowering, anti-cancer, heavy metal-protective, and age-delaying properties [
6,
7,
8].
The global ex situ holdings of garlic accessions are abundant, with a total of over 5000 accessions according to incomplete statistics. China, India, the Czech Republic, Poland, the United States, Germany, and Russia are major countries with rich garlic germplasm resources, and each country conserves more than 400 accessions [
9,
10]. In China, ex situ conservation and ultra-low temperature conservation are the main methods for preserving garlic germplasm resources [
9].
Garlic mainly reproduces asexually through bulbs, which greatly limits the breeding of new varieties. Although previous studies have reported fertile garlic resources in the northern region of the Tianshan Mountains and Central Asia that produce seeds through selfing or hybridization and that purple anthers are the main morphological characteristic of most male-fertile garlic resources [
11,
12], no new garlic varieties bred through sexual hybridization have been reported. During the flowering period of garlic, the aerial bulbs expand rapidly, causing extrusion and nutrient competition. Pollen mother cells gradually dry out during pollen grain formation, leading to complete drying of anthers before flowering. Even under artificial optimal conditions, garlic anthers fail to germinate pollen tubes. Additionally, high-frequency chromosomal structural variations in pollen mother cells collectively result in sterile pollen grains, preventing the breeding of new varieties through sexual hybridization [
13,
14].
The concept of a core collection was first proposed by Frankel in 1984 [
15]. It refers to a subset of accessions selected from a collected resource population, with a limited quantity (usually 5–20% of the total accessions) that represents the genetic diversity of the entire collection. The construction of one or more core collections compressed the number of original germplasm accessions by removing redundant materials with similar genetic backgrounds without reducing genetic diversity, reduced the cost of germplasm conservation and reproduction renewal, and improved the efficiency of germplasm conservation [
16,
17]. Moreover, the construction of core collection should be carried out before in-depth evaluation of germplasm accessions [
18]. Since the core collection contains the main genetic variation of the germplasm accession, it provides breeders with more detailed information, and can be used to quickly screen parent materials related to target traits, reduce the blindness of breeding, and accelerate the breeding process of new variety. Therefore, core collection plays an important role in genetic diversity analysis and utilization, germplasm resource conservation, and crop genetic improvement, and is utilized in many horticultural crops [
19,
20,
21], such as soybean [
22], ginger [
23], pea [
24], peanut [
25], tomato [
26], and banana [
27]. Early data used for constructing core collections of horticultural crops were phenotypic data, and core collection was constructed by detailed investigation of agronomic traits, although the data were susceptible to environmental factors and investigators. Molecular markers, especially Simple Sequence Repeats (SSRs), which have distinct advantages such as high polymorphism, broad genomic coverage, and stable results, are widely used [
28,
29,
30,
31], and may become an important technology for the construction of crop core collection.
Current research on garlic germplasm accessions mainly focuses on the comprehensive evaluation of agronomic traits, screening of elite resources, genetic diversity analysis using molecular markers, and population genetic structure analysis [
32,
33,
34,
35,
36,
37,
38]. Although there have been reports on the construction of garlic core collections, a universally accepted method has not been established [
39,
40,
41,
42]. Therefore, to screen germplasm resources with rich variation types, high genetic diversity, and strong heterogeneity, this study investigated and analyzed 21 agronomic traits of 366 garlic germplasm resources. Three sampling methods (random, priority, and deviation) were used to construct core collections, and their representativeness was evaluated. The results provide a scientific basis for the establishment of a germplasm nursery, efficient conservation and utilization of germplasm resources, and the breeding of elite varieties of garlic.
2. Results
2.1. Genetic Diversity Analysis of the Agronomic Traits of Garlic
Among the 21 agronomic traits of garlic, 7 are qualitative, and their genetic diversity is shown in
Table 1. The 7 qualitative traits have 23 variation types, with different distribution frequencies among traits. The range of the genetic diversity index (
H′) was 0.48–1.11. Leaf color had the highest
H′ (1.11), with rich variation types, mainly green, accounting for 58.20%. Clove uniformity had the lowest
H′ (0.48), with uniform accounting for 84.97% of variation types. Plant type had an
H′ of 0.71, with semi-erect accounting for 76.23%. Leaf straightness had an
H′ of 0.55, with drooping accounting for 84.15%. Bulb shape had an
H′ of 0.90, with oblate spheroid accounting for 56.28%. Clove arrangement had an
H′ of 0.77, with four variations: regular two-whorled accounting for 65.03%, regular single-whorled accounting for 31.69%, and no single-headed type. Bulb disc position had an
H′ of 0.95, with flat accounting for 59.56% and concave 22.13%.
The
H′ values of the 14 quantitative traits of the 366 garlic germplasm accessions are listed in
Table 2. The coefficient of variation (CV) values of the quantitative traits ranged from 12.30 to 37.49%. Leaf number per plant had the smallest CV (12.30%), while weight per bulb had the largest (37.49%).
H′ ranged from 3.08 to 5.80. Leaf number per plant had the smallest
H′ (3.08), and bulb transverse diameter had the largest (5.80). Ten quantitative traits had
H′ values exceeding 5.00, indicating that these germplasm accessions have rich genetic diversity compared to previous studies [
32,
35].
2.2. Correlation Analysis of the Agronomic Traits of Garlic
The correlation analysis of the 14 quantitative traits of garlic showed that all were significantly or extremely significantly correlated (
Table 3). Except for the significant correlation between clove number per bulb and plant height, and between clove number per bulb and clove back width, all other indices showed extremely significant correlations. Aerial pseudostem height, aerial pseudostem diameter, bulb height, bulb transverse diameter, and weight per bulb were the five traits with high breeding attention.
Among them, the correlation coefficient between aerial pseudostem height and plant height was the highest (0.726), followed by that between aerial pseudostem height and aerial pseudostem diameter (0.673). The top three indices with the highest correlation coefficients with aerial pseudostem diameter were leaf width (0.804), leaf length (0.769), and clove height (0.754). The top three indices with the highest correlation coefficients with bulb height were weight per bulb (0.889), clove height (0.875), and bulb transverse diameter (0.808). The top three indices with the highest correlation coefficients with bulb transverse diameter were weight per bulb (0.953), clove height (0.826), and bulb height (0.808). The top three indices with the highest correlation coefficients with weight per bulb were bulb transverse diameter (0.953), bulb height (0.889), and clove height (0.873).
The correlation coefficients between the aboveground parts (leaf length and leaf width) and belowground parts (bulb transverse diameter, weight per bulb, and bulb disc diameter) were all above 0.700. The correlation coefficients between aerial pseudostem diameter and the three underground traits (bulb height, weight per bulb, and clove height) were also above 0.700, indicating that leaf size and aerial pseudostem diameter significantly affect underground agronomic traits.
2.3. Principal Component Analysis of the Agronomic Traits of Garlic
Principal component analysis was performed on 21 agronomic traits of all accessions (
Table 4). The results showed that the cumulative contribution rate of the first four principal components was 70.19%, indicating that these four principal components can represent most of the genetic information of the 21 traits. Among them, the first principal component was chiefly related to leaf, aerial pseudostem, and bulb traits and had an eigenvalue of 9.340, a contribution rate of 44.475%, and mainly included eight indices, namely, leaf length, leaf width, aerial pseudostem diameter, bulb height, bulb transverse diameter, weight per bulb, clove height, and bulb disc diameter, with eigenvector values all above 0.800. The second principal component was chiefly related to bulb shape and plant type traits and had an eigenvalue of 2.988, a contribution rate of 14.230%, and included four indices, namely, bulb shape, clove arrangement, plant type, and leaf straightness. The third principal component had an eigenvalue of 1.292, a contribution rate of 6.150%, and included three indices, namely, clove number per bulb, clove uniformity, and bulb disc position. The fourth principal component had an eigenvalue of 1.121, a contribution rate of 5.337%, and corresponded to leaf color.
2.4. Cluster Analysis of the Agronomic Traits of Garlic
The 366 garlic germplasm accessions were divided into five categories by cluster analysis based on 21 agronomic traits (
Figure 1). Category I contained 97 accessions, accounting for 26.50% of the total resources, and this category had high values in indices such as plant height, plant breadth, aerial pseudostem height, aerial pseudostem diameter, bulb height, weight per bulb, clove height, and clove back width, indicating vigorous aboveground growth and presumably high bulb yield. The representative resource was 3R052. Category II contained 63 accessions, accounting for 17.21% of the total, and these resources had moderate values in indices such as leaf number per plant, plant height, aerial pseudostem height, bulb height, and bulb transverse diameter, but had wider leaves than those represented by category I resources. The representative resource was 3R135. Category III contained 18 accessions, accounting for 4.92% of the total. This category was characterized by compact and short plants, with generally low bulb indices, indicating small garlic bulbs. The bulb shape was mainly high spheroid, with irregular clove arrangements and straight leaves. The representative resource was 3R166. Category IV contained 86 accessions, accounting for 23.50% of the total. These resources had more leaves per plant, with larger leaves (leaf length and width indices), moderate plant height and breadth, and higher aerial pseudostem diameter, bulb transverse diameter, and bulb disc diameter indices. The plant type was mainly semi-erect, the bulb shape was mainly oblate spheroid, and the leaf color was mainly yellow-green. The representative resource was 3R228. Category V contained 102 accessions, accounting for the highest proportion (27.87% of the total). These resources had moderate plant size and bulb indices, clove arrangements that were mainly regular single-whorled, and bulb disc positions that were mainly concave. The representative resource was 3R343.
2.5. Construction and Evaluation of Garlic Core Collections
Genetic distance was calculated according to Euclidean distance using QGA Station 2.0 software, and clustering was performed by the unweighted pair group method with arithmetic mean (UPGMA). Three methods, namely, random (R), priority (P), and deviation (D), were applied to construct core collections for sampling at a 10% ratio, and their representativeness was evaluated. R1, P1, and D1 contained 37, 36, and 33 accessions, respectively. Among them, the number of repeated accessions was six between R1 and P1, four between R1 and D1, and seven between P1 and D1. Therefore, we constructed 90 garlic core collections in this study.
The percentage difference between core collections and the original population are showed in
Table 5. Compared with the original population, the mean difference percentages (MD) of R1, P1, and D1 were 0.37%, 1.48%, and 13.04%, respectively, all below 20%. The range coincidence rates (CR) of R1 and P1 were 79.82% and 100%, respectively, indicating that R1 and P1 met the principles of core collection construction, can represent the genetic diversity of the original population, and reflect the genetic diversity of the original germplasm accessions. The mean t-test was used to examine differences in agronomic trait means between the original population and core collections, and the variance
F-test was used to analyze the homogeneity of agronomic trait variation between the two groups [
19]. Comparisons of mean and variance differences between core collections and original population are listed in
Table 6. Core collection R1 had 13 traits with higher variance and 13 traits with higher coefficients of variation than those of the original population. Core collection P1 had 15 traits with significant or extremely significant variance differences from those in the original population and 20 traits with higher coefficients of variation. Core collection D1 had 16 traits with significant or extremely significant mean differences from those of the original population, indicating that the three core collections have heterogeneity, and P1 has the largest genetic variation.
5. Conclusions
In this study, 21 agronomic traits of 366 garlic germplasm accessions were investigated and statistically analyzed, including by genetic diversity, correlation, principal component, and cluster analyses, revealing that these resources have rich genetic diversity. Three core collections (R1, P1, and D1) were constructed using random (R), priority (P), and deviation (D) sampling. Among the constructed core collections, R1 and P1 met the necessary requirements and reflect the genetic diversity of the original population. D1 had 16 traits with significant or very significant mean differences from those of the original population, indicating that the three core collections have good heterogeneity, and P1 had the largest genetic variation. To retain as much genetic diversity of the original population as possible, 90 garlic core collections were constructed after removing redundant resources, ultimately accounting for 24.59% of the original population. These core collections lay an important foundation for the conservation of garlic genetic diversity, the exchange and utilization of germplasm accessions between countries, and the establishment of a germplasm nursery for garlic.