1. Introduction
Durum wheat (
Triticum turgidum L. subsp.
durum (Desf.) Husn.) is one of the world’s most important cereal crops and is highly valued by the food industry for pasta and semolina production [
1]. Despite the economic importance of durum wheat, most modern cultivars belong to the spring growth habit, whereas the development of winter durum wheat is relatively recent. The breeding of winter-hardy durum forms began only in the second half of the 20th century and remains limited compared with the long-established improvement of winter bread wheat (
Triticum aestivum) [
2,
3].
Our previous analyses of winter durum wheat have identified substantial allelic diversity at
Glu-1 loci affecting grain quality, indicating that this germplasm holds considerable potential for quality improvement [
4]. However, the full realization of this potential, coupled with consistent yield performance, is significantly hindered by sensitivity to low temperatures. Because of a shorter breeding history, winter durum germplasm still exhibits considerable variability in cold tolerance, and the physiological and genetic mechanisms underlying its winter survival are not yet well characterized. For example, spring frosts are well known for causing sterility in spikes and damaging reproductive organs, which directly reduces grain growth process [
5].
The creation of frost-resistant winter durum varieties is therefore of high strategic importance: winter forms can exploit the full vegetation period, achieve higher yield potential, and contribute to the diversification of durum wheat cultivation in colder continental regions [
6,
7,
8]. Owing to generally lower frost tolerance compared with bread wheat, durum wheat is considered more sensitive to cold stress [
8,
9]. Therefore, improving frost tolerance in durum wheat is a critical breeding objective to ensure stable productivity and to expand cultivation into colder environments.
The ability of wheat plants to withstand subzero temperatures is governed by cold acclimation—a process whereby gradual cooling triggers metabolic reprogramming and accumulation of protective compounds, thereby enhancing subsequent frost tolerance [
10]. Frost tolerance is a quantitative, polygenic trait controlled by numerous loci whose effects interact with the environment. Major loci of frost resistance reside on chromosome 5. In particular, Frost Resistance-1 (
Fr-1) and Frost Resistance-2 (
Fr-2) on the long arm of 5A are key contributors:
Fr-1 is linked to growth habit (winter vs. spring) by vernalization genes such as
Vrn-A1, whereas
Fr-2 encompasses the CBF (C-repeat Binding Factor genes) gene cluster and exerts the strongest influence on frost and winter survival [
11].
Consistent with this, Gupta et al. (2020) demonstrated that although
Vrn-A1 and photoperiod genes
Ppd-A1/
Ppd-B1 are major determinants of flowering in durum wheat, they explain only part of the phenotypic variation, implying the existence of additional polygenic regulators within the vernalization network that may also influence cold adaptation [
12]. Indeed, the
Fr loci contribution to frost tolerance is strongly influenced by developmental stage and environmental conditions, thereby contributing to the developmental and environmental dependence of
Fr-1 loci effects. Ferrante et al. (2021) demonstrated that in winter wheat, the
Fr-1 locus confers high frost tolerance during the vegetative phase (below −12 °C), but its regulatory effectiveness declines after vernalization as plants enter the reproductive stage [
13]. The
Fr-2 region, in turn, exhibits strong linkage disequilibrium and limited allelic diversity, with only two major haplotypes identified in modern winter wheat [
11,
14]. The superior haplotype provides approximately 15% higher winter survival compared to the alternative one, suggesting that most of the favorable variation within this cluster has already been exploited. As a result, further improvement in frost tolerance through
Fr loci alone may reach a plateau, emphasizing the importance of identifying additional QTL operating independently of the
Fr-mediated pathway to achieve complementary genetic gains and enhance the resilience of durum wheat to frost stress.
Genome-wide association studies (GWAS) are widely used to dissect the genetic architecture of complex traits in crops, and frost tolerance is no exception. Despite considerable progress in bread wheat, comparable data for durum wheat remain scarce. The objective of this study was to identify genomic loci associated with frost tolerance in winter durum wheat and to develop molecular markers for selecting highly tolerant genotypes in breeding programs, including loci beyond the well-characterized
Fr-regions to broaden the genetic basis of frost tolerance. Following the methodological framework established in our previous study on spring durum wheat [
15], we conducted a GWAS to assess the survival of winter durum wheat under frost conditions. The analysis revealed a novel genomic loci associated with frost resistance. Subsequently, Kompetitive Allele-Specific PCR (KASP) markers were developed for the identified allelic variant, and their efficiency for marker-assisted selection was validated.
3. Discussion
Frost tolerance in winter durum wheat is a complex, multilayered trait. Despite the binary nature of the freezing-survival phenotype, the loci identified in this study demonstrated strong and stable signals across all post-GWAS analyses tailored to binary outcomes. Their significance, effect direction, and robustness were further reinforced by KASP validation for three loci, confirming that the associations reflect true underlying polymorphisms rather than artifacts of phenotypic resolution.
This study identified four genomic regions on chromosomes 1B, 5A, 5B, and 7B that collectively underpin frost tolerance in winter durum wheat. These loci represent interconnected layers of the cold stress response—encompassing signal perception, regulatory control, cellular protection, and translational maintenance. Rather than functioning independently, these processes likely operate as a coordinated network where the efficiency of one layer enhances or constrains another.
The variant
1B_41099587 lies in an intergenic region and was identified with a minor allele frequency (MAF) of 0.43, an effect size of 0.573, and phenotypic variance explained (PVE) of 9.4%. This SNP is in tight linkage with a cluster of genes encoding histidine-containing phosphotransfer (HPt) proteins. In
Arabidopsis thaliana HPt proteins serve as intermediaries shuttling phosphoryl groups between sensor histidine kinases (e.g., AHKs) and response regulators (ARRs), thereby propagating environmental stress signals [
17]. In the context of cold stress, such multistep two-component pathways are known to activate CBF transcription factors that govern cold-responsive genes [
18].
The favorable variant A of
5B_517276534 is located in an intergenic region on chromosome 5B (MAF 0.28, effect size −0.861, PVE 7.6%), in proximity to genes encoding ornithine decarboxylase (ODC1B) and phospholipase D delta (PLDδ). Both enzymes are functionally linked to stress adaptation, especially under low temperatures [
19,
20]. Experimental studies in
Arabidopsis have shown that altering PLDδ levels can significantly change frost tolerance: knockout of PLDδ renders plants more sensitive to frost, whereas overexpression of PLDδ enhances frost survival [
21].
The SNP
7B_598228866 resides in an intergenic region adjacent to genes encoding the 40S ribosomal protein S14 (RPS14) and other ribosomal subunit proteins. It was detected at a lower frequency (MAF 0.09) but with a relatively large effect size (1.634) and PVE of 12.9%. Ribosomal proteins (RPs) have traditionally been viewed as structural components of the translation machinery; however, emerging evidence indicates that certain ribosomal subunits take on regulatory functions during stress adaptation [
22].
The variant
5A_487200180 lies in the 5′ untranslated region of a gene encoding an E3 ubiquitin–protein ligase known as BIG BROTHER. E3 ubiquitin ligases are key regulators in plant stress response pathways, often mediating the turnover of critical signaling proteins [
23]. It was identified at a MAF of 0.12 with a large effect size of 2.094 and phenotypic variance explained (PVE) of 21.5%. This locus had the highest effect size, underlining its biological significance, although direct KASP marker conversion proved challenging due to the tandem-repeat nature of the region. Such challenges are not uncommon in polyploid crops, where multi-copy genes and tandem repeats can hinder locus-specific amplification [
24]. Notably, the
5A_487200180 locus maps approximately 30 Mb proximal to the canonical
Fr-A2 region (516–523 Mb) harboring the CBF gene cluster [
11]. The E3 ubiquitin ligase BIG BROTHER identified at this position may modulate CBF pathway activity through protein degradation, as E3 ligases are known regulators of CBF stability and cold acclimation [
25].
Evidence for such interdependence was provided by the detected strong, but non-significant interaction between the
5B_517276534 and
7B_598228866 loci (β = −0,833,
p = 0.120). Recent studies indicate that the ODC pathway is closely linked to translational regulation, as polyamines derived from ODC activity stabilize ribosomes and enhance protein synthesis efficiency under stress conditions [
26,
27]. However, the relatively large magnitude of the interaction coefficient suggests the presence of a potential biological effect that may become evident with increased sample size.
Currently, only one genome-wide association study has investigated frost tolerance in winter durum wheat, namely the work of Sieber et al. (2016), which identified the
Fr-A2 and
Fr-B2 loci as the major determinants of frost tolerance [
28]. Their results demonstrated that frost tolerance in winter durum is governed by the same molecular mechanisms as in bread wheat, particularly those centered on the CBF-dependent cold-responsive pathway and its regulatory connection to vernalization and photoperiod genes.
Given the limited number of studies on frost tolerance in winter durum wheat, findings from bread wheat, which exhibits highly similar genomic organization and cold-response pathways, can be considered representative for comparative analysis of corresponding loci. Comparative genomics between
Triticum species provides a robust framework for transferring genomic information from one ploidy level to another, particularly between tetraploid durum wheat and hexaploid bread wheat. Due to their shared A-genome ancestry and the high degree of sequence conservation along homologous chromosomes, many trait-associated loci can be directly aligned between these species. Marcotuli et al. (2022) conducted a meta-analysis of GWAS in
Triticum turgidum ssp.
durum, identifying stable QTL hotspots and confirming strong collinearity between durum and bread wheat genomes through comparative mapping [
29]. El Baidouri et al. (2017) further demonstrated that the A subgenome of
T. aestivum is highly syntenic with the A genome of
T. turgidum, reflecting their recent shared ancestry and structural conservation [
30]. These findings justify direct positional comparison between loci on chromosome 5A of both species.
Concordance with
Fr-regions is observed in the present study. The
5A_487200180 locus (E3 ubiquitin ligase BIG BROTHER) lies proximally to the major bread wheat frost-tolerance QTL reported by Soleimani et al. (2022) on 5A (
QTL_5A_2, 516.45–523.45 Mb), which contains a dense cluster of CBF genes specifically mapping to the
Fr-2 region [
31]. Similarly, Chen et al. (2019) located the similar QTL for winter survival on chromosome 5A in bread wheat (
QWs.ugw-5A.1) [
32]. Specifically,
QWs.ugw-5A.1 is positioned at about 499.66 Mb, only about 12 Mb away from
5A_487200180 locus. These loci likely belong to the same broader genomic interval on the long arm of chromosome 5A, which consistently harbors cold-responsive genes, including the CBF cluster at
Fr-2.
On chromosome 5B, the
5B_517276534 locus (adjacent to ODC1B and PLDδ) is positioned ~25–30 Mb distal to the
Fr-B2 cluster (
QTL_5B, 486.35–493.35 Mb) reported by Soleimani et al. (2022), which harbors multiple CBF homologs [
31]. Genes involved in polyamine biosynthesis and phospholipid signaling may operate within the same regulatory framework as the CBF transcriptional network. Studies in tomato (
Solanum lycopersicum) leaves have shown that these pathways exhibit coordinated but partly antagonistic expression patterns under cold stress, jointly contributing to membrane stabilization and cellular protection during frost conditions [
33].
Although several plausible candidate genes were identified near the associated loci, mechanistic characterization of cold-response pathways requires transcriptomic or gene-editing approaches. The three KASP markers validated here directly support marker-assisted breeding for winter hardiness, while functional dissection of the underlying genes remains a promising direction for future research.
4. Materials and Methods
4.1. Plant Materials, Overall Design of Experiments and Phenotyping
All wheat cultivars were provided by the breeding program of the National Center of Grain named after P.P. Lukyanenko (Krasnodar, Russia). The segregating multi-family F2 panel of 270 wheat plants, representing a combined mixed population, was developed for genome-wide association analysis and validation of candidate loci associated with frost tolerance. The highly frost-tolerant cultivar Tsel and the frost-susceptible cultivar Senora participated as recurrent parental components. These cultivars were successively crossed with a diverse set of landraces (Odari, Andromeda, Zernograd, Leukurum, Pributkova, L2870VILLOSA, 4812h53, 4249h103, I-627494, 4598h48, 4291h83, 3902h3-18-3, I-627613, 3552h59-18-7, KN-21-130, 4754h37, 4743h81), forming a multi-parental structure that captured a broad range of frost-resistance and agronomic traits. Hybrid combinations included Tsel × landraces, Senora × landraces, and Tsel × Senora × landraces progenies, reflecting the diversity of germplasm. A total of plants were randomly selected from the segregating hybrid families for genotyping and phenotypic evaluation.
The evaluation of plants for frost tolerance was conducted using the direct frost method according to the Russian national standard for the evaluation of winter hardiness (Methodology for State Variety Trials of Agricultural Crops. Issue II: Cereal, Grain Legume, Maize, and Forage Crops, Moscow, 1989). The experiment was carried out in the C-816 frost chambers (Canada) installed in the phytotron complex of the National Center of Grain named after P.P. Lukyanenko (45°02′ N, 38°58′ E, Krasnodar, Russia).
The individual plants intended for frost were grown in wooden boxes (38 × 26 × 12 cm) filled with a soil–sand mixture (3:1). Sowing was performed in October 2023, corresponding to the optimal regional sowing date, using dry seeds. Each box contained seven rows spaced 5 cm apart, with 25 seeds per row, sown to a depth of 2 cm (
Figure 6). Emergence occurred 7 days later. The fourth row in each box was sown with the check variety Krupinka, known for its optimal frost tolerance among winter durum wheats (at a frost temperature of −15 °C, the average survival rate exceeds 50%) [
34].
The entire experimental workflow consisted of three consecutive phases: outdoor acclimation, controlled freezing, and post-thaw recovery (
Figure 7). Prior to freezing, plants underwent natural cold acclimation under field-like outdoor conditions during November and December. After acclimation, the freezing procedure was carried out in the C-816 programmable frost chamber. A 24 h pre-hardening stage at −2 to −5 °C preceded the main freezing cycle. The chamber temperature was then gradually lowered from −5 °C to −15 °C at a rate of approximately 1 °C per hour, after which the target semi-lethal temperature of −15 °C was maintained for 24 h to impose severe frost stress.
Following the freezing treatment, the temperature was increased stepwise from −15 °C to +5 °C over approximately two days to ensure gradual thawing and to avoid thermal shock. After thawing, plants were transferred to phytotron recovery conditions (20 °C day/16 °C night, 16 h photoperiod). Survival was recorded 7 and 14 days after freezing.
Finally, each individual plant within the F2 population was evaluated separately for survival following frost. Plants were scored as surviving (“1”) if they produced at least one new green leaf or tiller 7–14 days post-thaw; otherwise, they were scored as dead (“0”). Among the 270 individuals, 104 plants survived (38.5%) and 166 did not survive (61.5%). These binary survival data were subsequently used for association analysis to identify loci linked to frost tolerance. Among the survivors, most individuals were derived from Tsel (66.3%), while 21.2% originated from Senora and 12.5% had mixed ancestry (Tsel + Senora). Among the non-surviving plants, 52.4% were of Tsel origin, 34.3% from Senora, and 13.3% had mixed ancestry.
4.2. Genotyping and SNP Calling
Genomic DNA was extracted from dried leaf tissue using the “MagnoPrime GMO” kit following the manufacturer’s protocol. GBS libraries were prepared following the protocol described by Elshire et al. (2011), using the restriction enzymes MspI and PstI, and sequenced on an Illumina NovaSeq 6000 platform [
35]. Raw reads were aligned to the durum wheat reference genome (Svevo v2.0 [
36]) using bowtie 2 v2.3.5 with standard parameters [
37].
Variant calling was performed using ChoCallate (
https://github.com/alermol/ChoCallate; accessed 6 June 2025), an automated, high-performance ensemble framework developed by our group for consensus-based variant discovery, integrating multiple variant callers to generate robust, high-confidence SNVs and INDELs. The raw VCF was processed using PLINK v2.00a6LM [
38]. Only biallelic variants were retained, resulting in 278,727 SNPs. Quality control was applied in several stages: variants with a missing rate above 20% were excluded, leaving 70,796 variants. Variants with minor allele frequency (MAF) < 0.05 were filtered out, yielding 12.108 sites, and loci with heterozygosity > 30% were removed, leaving 9,529 variants. Missing genotypes were imputed using Beagle, followed by a secondary MAF ≥ 0.05 filter, resulting in 9.446 high-quality polymorphic variants (8,388 SNPs and 1,058 indels) used for GWAS.
4.3. Genome-Wide Association Analysis
Genome-wide association analysis (GWAS) was performed using the GAPIT v3 package [
39]. The Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK) model was applied to identify significant associations between SNP markers and the frost tolerance phenotype [
40]. To account for population structure, the first five principal components (PCs) derived from the genotype data were included as covariates in the model.
Significance thresholds were determined using a false discovery rate (FDR) 10% threshold to control for multiple testing. Manhattan and quantile–quantile (Q-Q) plots were generated using GAPIT’s visualization functions. Effect sizes, standard errors, and the percentage of phenotypic variance explained (PVE) were extracted for all significant SNPs. Candidate genes were identified based on the physical position of significant SNPs in the Svevo v2.0. SNP annotation was performed using SnpEff with default parameters [
41].
4.4. Designing KASP Markers
Specific KASP markers were developed for the significant SNPs identified in the GWAS to validate the detected associations. Primer sets were designed according to LGC Genomics guidelines, flanking sequences retrieved from the Triticum durum Svevo v2.0 reference genome. Each assay included two allele-specific forward primers containing the universal FAM or HEX tails at the 5′ end, and a single common reverse primer. All KASP reactions were carried out in 5 μL volume using CFX96 Real-Time PCR System ( Bio-Rad, Hercules, CA, USA) under the following cycling conditions: 94 °C for 15 min, followed by 10 touchdown cycles (94 °C for 20 s; annealing starting at 61 °C and decreasing by 0.6 °C per cycle), and 26 cycles at 94 °C for 20 s, 55 °C for 60 s. Fluorescence was measured at the end of each annealing step, and endpoint genotyping clusters were analyzed using Bio-Rad CFX Manager software (v3.1).
4.5. Post-GWAS Analysis
Because each F2 plant represents a unique genotype subjected to a destructive freezing event, frost tolerance could only be recorded as a binary survival response. To ensure that this limited phenotypic resolution did not bias inference, we applied a series of complementary statistical procedures beyond the primary GWAS, including logistic regression, likelihood-ratio testing, comparison of alternative inheritance models, and assessment of potential epistasis. In addition, significant SNPs were converted to KASP markers to provide an independent validation layer and confirm the robustness of genotype–phenotype associations.
Post-GWAS validation was conducted to evaluate the contribution of individual SNPs to survival probability and to assess the combined predictive power of significant loci. All analyses were performed in R (v4.2.2) using the packages stats, broom (v1.0.6), dplyr (v1.1.4), tidyr (v1.3.1), ggplot2 (v3.5.1), and pROC (v1.18.5).
Binary survival outcomes were modeled using logistic regression (GLM with binomial link). For each SNP, the genotype was coded as the minor-allele dosage (0, 1, 2). The full multivariate model included all significant loci identified in the GWAS.
4.5.1. Inheritance Model Comparison
For each SNP, three alternative genetic models were fitted: additive (dosage-related), dominant (carrier vs. non-carrier), and genotypic (3-level categorical). Each model was compared against the intercept-only model using LRTs, and pseudo-R2 values were calculated to assess the best-fitting mode of inheritance.
4.5.2. Integrated Evaluation of Joint and Locus-Specific Effects
The predictive value of the SNP set was first assessed by contrasting the full additive logistic model (including all SNP terms) with an intercept-only model using a global likelihood-ratio test. Model improvement was summarized by McFadden’s pseudo-R
2 [
42].
Locus-specific contributions within the multivariable framework were then quantified by type-II likelihood-ratio tests (LRTs) [
43], and for each SNP a partial pseudo-R
2 was obtained by refitting the model after removing the focal term. Effect sizes were reported as logit coefficients (β) with standard errors (SEs), Wald
p-values, odds ratios (OR = eβ), and 95% confidence intervals derived via broom package. Multiplicity for per-marker tests was controlled using the Benjamini–Hochberg false discovery rate.
To test for non-additive (pairwise) interactions between loci, an extended model including all two-way SNP × SNP terms was fitted and compared to the additive model via LRT. The change in model deviance and pseudo-R2 was used to quantify epistatic effects.
4.5.3. Model Discrimination
Discriminatory performance was assessed using receiver operating characteristic (ROC) analysis implemented in the pROC package [
44]. The area under the ROC curve (AUC) and its 95% confidence interval were calculated using DeLong’s method [
45].
4.6. Generative AI Statement
No generative artificial intelligence (GenAI) was used in the creation of this manuscript.