2.1. Morphological and Grain Quality Traits
Durum wheat collection effectively preserved and documented at the ESS were characterized by their different morphological, phenological and grain quality traits. A heatmap was generated for a simplified representation of the morphological trait distribution (
Figure 1;
Table S1).
Overall, these traits allowed us to group the germplasm in five main clusters, with 50% of landraces/cultivars belonging to cluster 1 and 2, while 5, 4, 3, and 3 genotypes belonged clusters 3, 4, 5, and 6, respectively. Interestingly, ten traits showing high variability among the samples, as determined by their clusters—six traits (morpho 2, morpho 11, morpho 19, morpho 23, morpho 24, and morpho 27;
Table S1) were mainly due to the genotypic variation; while three traits were highly influenced by environment (morpho 3, morpho 4, and morpho 17;
Table S1). In addition, flag leaves (morpho 2) played a very important role in grain filling and yield, together with ear emergence (morpho 3), ear cross shape (morpho 23), and density (morpho 24).
The reduction of plant height, together with a higher grain yield and early maturation, were the main breeder aims in the last century. Our historical Sicilian wheat collection, investigated here, showed mostly a growth erect or semi erect (data not shown), and the plant height ranged from 95 and 135 cm. Nineteen percent of the landraces showed a plant height below 120 cm (
Table S1). Based on the data collected, the time of ear emergence, counted as days from sowing to flowering, ranged from early (120 days) to medium (125 days) in more than 70% of the collection (
Table S1). Among landraces, it was possible to recognize genotypes belonging to the ‘
syriacum typicum’, a group utilized to select and release shorter and earlier-flowering new cultivars in Italy, during 1920–1950 [
27]. These results supported previous reports [
28], confirming the key role of these genetic resources for a potential re-introduction of the historical durum wheat landraces in cultivation.
Mains comparison and post-hoc test of grain and wholemeal flour quality-related (commercial) traits of wheat genotypes are reported in
Table 1 (one-way ANOVA is reported in
Table S2).
In durum wheat that have not undergone weathering, test weight (TW) is an excellent predictor of the semolina milling potential. Indeed, this parameter indicated the specific seed weight per unit volume, depending on the filling degree of the seed, was used to indicate the likely milling performance of wheat, and was considered a more reliable predictor of the flour/semolina yield [
29,
30]. Here, there were no significant differences between the highest value recorded for cv. Claudio and those measured in the two landraces (val2gl and tri2). The same was applicable for the other five landraces (urr1, tre2, trin, sam3, and bivc), when compared to the value recorded for cv. Simeto. The thousand-kernel-weight (TKW) ranged from 33.5 to 59.2 g recorded in “tim” and “val2gl”, respectively. Interestingly, thirteen out of the twenty-seven landraces recorded significant TKW higher values, as compared to cv. Claudio, furthermore, seven landraces also showed a higher TKW value than the modern cv. Simeto. These performances might be due to earlier flowering of modern wheat cultivars, as previously reported [
31]. These results indicate that the durum wheat landraces from Sicily might possess a higher yield potential than the modern cultivars, when cultivated under drought stress conditions.
Non-vitreous kernels (starchy), characterized by an opaque area, have a significant impact on the characteristics of kernels during milling, producing a detrimental effect to the end-use quality of durum wheat, and decreasing the semolina extract [
32]. The incidence of starchy kernels, recorded on the Sicilian wheat collection, ranged from 0 (sco4) to 96% (tri2), with the landraces showing a higher percentage, compared to the modern cultivars, as reported in Gallo et al. [
33].
Wholemeal flour quality traits, protein and gluten contents are the most important features useful for characterizing durum wheat landraces/cultivars. Indeed, it is well-documented that the protein content and the endosperm storage protein composition have a decisive impact on the wheat processing quality [
34,
35]. High protein level in semolina will usually yield a product with uniform particle size, with a minimum number of starchy particles, although it has been shown that other traits, like such specific γ-gliadins and gluten viscoelasticity, together with vitreousness and yellow semolina color, play a pivotal role in quality, processed wheat products [
36,
37].
Frequently, modern durum wheat cultivars show lower grain protein content, compared to the older ones [
38,
39,
40], as the wheat breeding programs are mainly focused on increasing the grain yield [
41]. Our result seems to confirm these previous reports. Indeed, a significantly higher protein content (>16.0%) was observed in the six Sicilian landraces (bd3, sco4, reg1, cic1, gig1, and ing2) compared to the modern cultivars (<15%), whereas the lowest value (8.5%) was found in tri2, due to the high percentage of starchy kernels (
Table 1).
The values of wet and dry gluten content, important for the pasta industry, showed a comparable trend (r
2 = 0.97) with a high protein content and a wide variability, as already reported [
42,
43]. The historical cv. Capp1, recently registered again in the Italian National Register of Varieties (INRV) for its reintroduction in the wheat cultivation areas of Italy, showed a very high value of wet gluten content (38.1%). With regard to the gluten index, often correlated with gluten quality, most of the ancient wheat landraces showed low values, ranging from 18.8 (fsa1) to 59.5 (cot1) (
Table 1;
Figure S1). By contrast, “mla1” landrace showed a significant higher value (91.12), compared to the testers (86.3 and 83.1 in Claudio and Simeto, respectively). These data confirmed the presence of a weaker gluten in Sicilian wheat landraces, as previously reported [
44,
45,
46,
47], compared to the modern cultivars [
48,
49,
50].
Durum wheat grain color might affect the quality of the end products, depending on genetic background of a genotype, as well as the environment and the technological processes. The anthocyanins are a class of pigments, which characterize the durum wheat aleurone or pericarp. The high-level of these pigments observed in many Sicilian landraces represents an important trait for breeding programs aimed at improving the nutritional value of grain and its end products [
51]. The yellow index (b*) measured in semolina was significantly higher in the cv. Simeto (20.06), compared to those measured in the Sicilian landraces, demonstrating the effect of selection on this important nutritional trait. The historical cv. Cappelli (capp1), together with two landraces, “fsa1” and “rea4” were characterized by a high value of yellow index (b*), according to Digesù et al. [
52]. As expected, the bread wheat landrace (cuc1) showed the significant lowest yellow index (8.74).
The Principal Component Analysis (PCA) performed on all the quality-related grain traits showed wide differences among landraces (
Figure 2). The first two components explained 60.1% of the total variance, the second ones were able to discriminate cuc1 (
T. aestivum L.) from the landraces belonging to
T. turgidum L.
ssp.
durum, and the last components were able to discriminate between historical and modern cultivars (
Figure 2). Protein content and the parameters related to gluten showed a strong influence on both components (cos2 = 0.75), separating 50% of landraces (16) from the others. Finally,
T. aestivum L. was characterized by a higher number of starchy kernels (cos
2 = 0.75), strongly related to the first component (
Figure 2). These evidences were confirmed by Pearson correlations, showing a positive correlation (
p < 0.05) among water binding in wet gluten, and dry and wet gluten, with ae higher positive correlation coefficients (0.96) for the last two components and a negative correlation (
p < 0.05) between starchy kernels and protein content (−0.64) (
Figure S2). Likewise, the red, brown, and yellow index were positive correlated (
p < 0.05).
2.2. SNP Analysis and Genetic Characterization
Genetic diversity can be evaluated by biochemical and morphological markers or by the use of pedigree. Unfortunately, these approaches can be influenced by environment or can be erroneous and incomplete, causing frequent misclassifications among the genotypes. By contrast, molecular markers allow the assessment of relatedness at the DNA level, making them necessary for identification of genetic variation among and within landraces/populations, due to the influence of the environment on many traits under polygenic control. Different molecular markers can be used for comparative genomic, phylogenetic relationships, and diversity studies [
53,
54]. More recently, next generation sequencing (NGS) led to set up different SNP panels that were successfully used for genetic diversity analyses, also in wheat [
21,
55]. The high-throughput wheat 90k SNP array was used to investigate the genetic relationships across the Sicilian ancient germplasm, using two modern cultivars (Claudio e Simeto) as references, and Cuccitta, a bread wheat landrace, as the outgroup. After the SNP-dataset filtering, 5,594 loci (7%) did not amplify among all genotypes and 41,926 (51%) were monomorphic (
Table 2). The final dataset resulted in 18,170 loci, after removing the SNPs with a number of NC (not-call) higher than 20%, among which 13,528 loci (74%) were polymorphic with an overall Minor Allele Frequency (MAF) value of 0.232 (
Table 2).
To investigate the genetic relationships among cultivars, based on the SNP-data, phylogenetic analysis and PCoA were carried out. Cluster analysis based on Nei (1978) genetic coefficient and the UPGMA algorithm, generated a dendrogram underlining four main clusters across the durum wheat germplasm (
Figure 3). Two modern and the historical cultivars (Claudio and Simeto; Trinakria and Cappelli, respectively) grouped in Cluster A, together with three landraces (brc-b1, mar2, and bd3), while the bread wheat landrace Cuccitta was the outgroup, as expected (
Figure 3). Cluster B included 16 out of 27 durum wheat landraces (60%), while cluster C and D grouped only three and five landraces, respectively. Many bootstrap values (80%) ranged from 99 to 100% in the most important nodes, avoiding any misclassifications (
Figure 3). Cluster A was closer to Cluster B (NEI = 0.1552), while Cluster D appeared more different from the other three (
Table S3). As expected, the outgroup Cuccitta (cuc1) showed the highest values of genetic distance from the other groups (
Table S3).
PCoA was also carried out to properly describe the clusters reported above, as expected, the first two PCs showed four main clusters, grouping the different genotypes in agreement with cluster analysis (
Figure S3). Genetic distances among genotypes were confirmed, with five landraces belong to cluster D (
Figure 3) showing the highest genetic variability, and the landraces grouped in cluster B and C showing a common genetic background. As expected, cuc1 landrace was separated from the others. Unlike cluster analysis, brc-b1 appeared similar to cuc1, between the samples belonging to cluster A and D, without a clear assignment (
Figure S3).
The differences among cultivars by fast STRUCTURE analysis were further confirmed. Indeed, although the groups highlighted by the two approaches were different, this analysis confirmed cluster and PCoA results, and was able to separate the ancient cultivars from the others. The optimum number of genetic clusters (
K) within the collection was determined as
K = 7 (
Figure 4). The Cuc1 hexaploid landrace belonged to a private group (red pool; outgroup in the cluster analysis) and the other two pools, green and gold, overlapped the clusters B defined in the UPGMA analysis (clustering), with ing2, cot1, rea4, and urr1 varieties being grouped in private branches. Likewise, the blue pool of the structure analysis overlapped with cluster D. By contrast, cluster A in the UPGMA, harbored the purple pool, except the cultivar Claudio which shared a different genetic pool (light blue) with tri2, in the structure analysis (
Figure 4). However, cla and tri2 samples were outside their main clusters, and genetically closer to each other, in comparison to the samples belonging to groups B and D. Thirty out of the 32 genotypes (94%) classified into one of the seven pools, using an 80% cut-off ancestry. Thus, many varieties showed 100% membership to their group (
K), except bivc, sco4, and tre2. Among landraces, only brc-b1 and bia1 showed a high admixture profile between the gold and green, the purple and blue pools, respectively (
Figure 4).
To prioritize germplasm within the collection, a genetic distinctness and a genetic redundancy analyses were run using R/AveDissR. In the distinctness analysis, the average dissimilarity (AD) values ranged from 0.31 to more than 0.57, the most distinct individual being cuc1 (
Table S4). Selecting the 15 samples with higher AD in the sixth step of the iteration resulted in a percentage of variation (PVa) of 0.13 among the populations, as compared to the initial (0.10). Any further selection with the chosen step of 0.1 would substantially reduce the PVa (
Table S5), hindering the representativeness of the most distinct genotypes. Genetic redundancy was assayed to identify the least unique samples. PVa dropped from 0.10 to 0.08, when removing the three samples with the lower AD values (
Table S6), indicating a slight reduction from the overall genetic differentiation of the collection. The outcome of R/AveDissR analysis might be used to guide the selection of the most distinct genotype, which, in combination with passport information and further phenotypic characterization, might contribute to breeding programs willing to employ these genetic resources. By contrast, the identification of samples characterized by lower AD values would allow to further characterize the redundant genotypes putatively derived from the shared ancestry or even from the duplications in the seed bank.