Next Article in Journal
A Metabolically Stable Apelin-13 Analog Acting as a Potent ITo Potassium Current Blocker with Potential Benefits for Brugada Syndrome
Next Article in Special Issue
Harnessing Light Wavelengths to Enrich Health-Promoting Molecules in Tomato Fruits
Previous Article in Journal
The Combination of Physiological and Transcriptomic Approaches Reveals New Insights into the Molecular Mechanisms of Leymus chinensis Growth Under Different Shading Intensities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Uncovering Key Genes Associated with the Short-Winged Trait in Faba Bean (Vicia faba L.) Through Re-Sequencing and Genome-Wide Association Studies (GWASs)

1
Institute of Food Crops, Yunnan Academy of Agricultural Science, Kunming 650205, China
2
Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada
3
Qujing Academy of Agricultural Sciences, Qujing 655000, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(6), 2733; https://doi.org/10.3390/ijms26062733
Submission received: 17 February 2025 / Revised: 10 March 2025 / Accepted: 13 March 2025 / Published: 18 March 2025

Abstract

:
Faba bean (Vicia faba L.) is a globally significant legume valued for its applications in food, vegetables, and green manure, yet its high outcrossing rate (30–80%) poses challenges for production development. A rare short-winged trait identified in Yunnan, China, offers promise for developing low-outcrossing varieties, reducing outcrossing rates to below 5%. Scanning electron microscopy (SEM) and transmission electron microscopy (TEM) analyses revealed that the epidermal cells of normal wing petals are conical, while those of short-wing petals are tubular. This study examined 200 F2 lines from crosses between ‘K0692’ (short-winged) and ‘Yundou 1183’, as well as ‘Yundoulvxin 1’ (short-winged) and ‘Yundou 1183’. The GWASs identified 10 SNP loci across chromosomes 2, 3, 4, and 5, with SNP_chr4::1013887633 explaining 22.20% of the wing trait variation. Key candidate genes were identified, such as VFH_III145120, which influences floral identity; and VFH_III149200, associated with epidermal differentiation. GO enrichment analysis demonstrated significant gene involvement in RNA localization, ribosome biogenesis, and preribosome metabolism, while KEGG analysis linked these genes to pathways in amino acid, nucleotide, and purine metabolism; ubiquitin-mediated proteolysis; and protein processing in the endoplasmic reticulum. These findings lay a foundation for breeding low-outcrossing faba bean varieties and enhancing sustainable faba bean cultivation.

1. Introduction

Faba bean (Vicia faba L.) is a globally significant legume known for its high nutritional value and environmental benefits. It serves multiple purposes, including human food, livestock feed, and green manure, which enhances soil fertility through nitrogen fixation [1]. This dual functionality aligns with sustainable agricultural practices, supporting crop rotation systems and reducing reliance on synthetic fertilizers. The adaptability of faba bean to diverse climatic conditions further underscores its importance in improving food security, particularly in regions vulnerable to climate change [1,2].
Despite these advantages, faba bean cultivation faces challenges, particularly due to high outcrossing rates ranging from 7% to 82% [3]. This outcrossing can introduce unwanted genetic variability, complicating breeding programs aimed at developing stable, high-yielding varieties and increasing susceptibility to diseases and pests. Thus, managing outcrossing rates is essential for enhancing the agronomic traits of faba bean.
Recent efforts have focused on identifying genetic resources to mitigate high outcrossing rates. A rare germplasm from Yunnan, China, exhibits a short-winged trait (Figure 1A–E) that significantly reduces its outcrossing rate to below 5% [4]. Normal-wing petals (Figure 1H(b)) are longer than the keel petal (Figure 1H(c)) and serve as a landing platform for honeybee foraging activity [5] (Figure 1K). In contrast, the short-winged petal (Figure 1C(b)) is identical in shape and length to the keel petal (Figure 1C(c)), eliminating a stable landing site for honeybees and restricting their ability to access nectar [4,6]. Pollinators, particularly honeybees, typically extract nectar from the junction of the standard petal and the wing petal, using the wing as support while foraging (Figure 1K), which is recognized as an effective mechanism for nectar transmission among faba bean plants [5]. However, the short-winged trait poses a disadvantage for insect-mediated foraging as bees cannot land on the short-winged petals, thereby reducing cross-pollination.
Based on nearly 40 years of breeding records, the short-winged faba bean has exhibited minimal variation in the field, making it an excellent germplasm for maintaining a low outcrossing rate and ensuring production stability [4,6]. While the high outcrossing rate in faba bean has been widely documented for its positive effects—such as increased heterozygosity, improved plant vigor, and enhanced yield [7]—it also presents challenges. The coexistence of allogamous and autogamous pollination complicates breeding strategies, as conventional methods used for either self-pollinated crops or highly outcrossing species like maize are not directly applicable to faba bean [8]. The short-winged trait offers a unique opportunity to develop low-outcrossing faba bean varieties through molecular breeding techniques, enabling the creation of more genetically stable germplasm and facilitating breeding strategies suitable for self-pollinated crop improvement in faba bean.
Floral morphology evolution plays a critical role in plant reproductive strategies and has been extensively studied in botany [9]. Among floral structures, petals significantly influence pollinator attraction and reproductive success [10]. Petal morphology can vary widely among species, adapting to specific ecological niches and pollination mechanisms [11]. Furthermore, the cellular structure of petals impacts their overall functionality; the arrangement, size, and shape of epidermal cells can affect light reflection, water retention, and the flower’s ability to attract pollinators [12,13]. Previous studies have shown significant differences in epidermal structures among flower types, underscoring the importance of cellular architecture in floral design [14]. Bailes et al. (2018) employed transmission electron microscopy (TEM) and scanning electron microscopy (SEM) techniques to investigate the phenotypic diversity of epidermal cells in faba bean petals. Their findings suggest that conical (papillate) epidermal cells provide better anchoring for pollinating insects, thereby enhancing their residence time on such petals [15].
Genetic research has emerged as a rapid and effective approach for molecular breeding [16]. Studies on other leguminous crops, such as the early investigation in 1923, indicated that the wing phenotype in pea is controlled by a single gene, designated “k” [17], and the mapping of this gene has been reported [18]. In Lotus japonicus, the KEW1/kew1 gene regulates petal wing formation and influences epidermal cell development [19]. This gene is expressed in both wing and standard petals, with expression levels in wing petals being significantly higher, particularly during late floral development [20].
Currently, there is a lack of reports on short-winged faba bean germplasms and the regulation of short-winged petals in this species. The large nuclear genome of faba bean, approximately 13 Gb, has contributed to delays in whole-genome sequencing and high-throughput genetic studies compared to other major crops [21]. Recent advancements, such as the comprehensive genomic information reported by Jayacodi et al. (2023), provide a valuable reference for molecular genetic studies, including the construction of genetic maps using hybrid populations [22].
To further investigate morphological differences, this study employed SEM and TEM to examine the epidermal cells of short-winged and normal-winged varieties. A total of 200 F2 lines were derived from crosses between short-winged and normal-winged varieties. Utilizing advanced re-sequencing techniques and genome-wide association studies (GWASs), several single nucleotide polymorphism (SNP) loci were mapped across chromosomes. This research not only identified these SNP loci, but also provided a comprehensive analysis of candidate genes associated with them, revealing a complex genetic landscape related to wing morphology. These findings underscore the significance of the identified SNPs in regulating wing morphology and their potential utility in marker-assisted selection for breeding programs.

2. Results

2.1. Analysis of Segregation of the Wing Trait in F2 Populations

The segregation ratios for the wing trait in the two F2 populations were 1389:483 for normal-winged to short-winged in the ‘Yundoulvxin 1’ (short-winged) × ‘Yundou 1183’ cross, as well as 1624:531 for the ‘K0692’ (short-winged) × ‘Yundou 1183’ cross (Table 1). The observed ratios of the normal-winged to short-winged individuals were 2.88:1, 3.06:1, and 2.97:1 for ‘Yundoulvxin 1’ × ‘Yundou 1183’, ‘K0692’ × ‘Yundou 1183’, and the combined populations, respectively, as shown in Table 2. These ratios conform to the expected segregation ratio of 3:1.

2.2. Microscopic and Genetic Basis of the Wing Trait

2.2.1. Scanning Electron Microscopy (SEM) and Transmission Electron Microscopy (TEM) of Petal Epidermal Cells

SEM analysis revealed that the epidermal cells of the wing petal in the short-wing variant were tabular (Figure 2A,B), whereas the normal-wing variant featured conical cells (Figure 2E,F). These observations were further confirmed by TEM, which showed tabular cells in the short-wing variant (Figure 3A,B) and conical cells in the normal-wing variant (Figure 3C,D). In contrast, the epidermal cells of the keel petal were consistent across both variants, exhibiting tabular cell morphology (Figure 2C,D,G,H).

2.2.2. Genome-Wide Association Study on the Wing Trait

Population Characteristic Analysis and LD Analysis

To identify the genes associated with the short-winged trait, we sequenced 200 individuals from the F2 populations, comprising 50 short-winged and 150 normal-winged individuals, which were consistent with a 3:1 ratio. A total of 1072.16 G reads were generated, with 96.94% of these reads achieving Q30 quality or higher and a GC content of 41.31%.
Using the Hedin2 genome v1 [22] as the reference, we identified 32,339,886 SNPs and 1,126,648 InDels. After applying filtering criteria (missing data < 30% and minor allele frequency [MAF] > 5%), a total of 488,121 SNPs and 14,931 InDels remained for analysis. The highest density of SNPs was observed on chromosome 5, while the lowest density of SNPs was detected on chromosome 7, with an average marker density of 41 SNPs per Mb (Figure 4A).
To explore the population structure and genotype characteristics of the 200 tested F2 individuals, we employed ADMIXTURE software Version 1.3.0. This analysis utilized an unsupervised maximum likelihood estimation model and involved cross-validation to determine the optimal number of subpopulations (k = 1–20). The structural simulation results indicated that k = 9 minimized the cross-validation (CV) error (Figure 4B,C). Consequently, we selected a k value of 9 to assess the genetic structure of the 200 F2 individuals. In the principal component analysis (PCA) of these individuals, the first two principal components explained 14.4% of the total genetic variation (Figure 4D).
Significant genetic variation was observed among the 200 F2 individuals in this study. To further investigate the genetic differentiation among genotypes, we performed clustering based on the neighbor joining (NJ) method. Consistent with the ADMIXTURE results, the stratified NJ tree revealed distinct subclusters, despite all of the individuals belonging to a larger overall group (Figure 4E). Nine subclusters were identified in the NJ tree, aligning with the nine major subgroups from the ADMIXTURE analysis (Figure 4B,C). The resulting Q matrix at k = 9 was used for subsequent marker-trait association mapping.
All of the high-quality SNP markers were employed to estimate the linkage disequilibrium (LD) within the F2 population. In our analysis, the maximum observed r2 value was 0.45, and the half-decay distance would, theoretically, correspond to the physical distance, at which r2 decayed to half of this value (0.225). However, even at the maximum calculated distance of 500 kb, the r2 did not decay to 0.225, indicating a relatively slow LD decay rate in our population. Therefore, we used 500 kb as the LD half-decay distance. For visualization and interpretation, we referred to an r2 threshold of 0.34, which reflects the approximate background level of LD across the genome under the given distance constraints (Supplementary Figure S1).

Genome-Wide Association Analysis

To identify the key genetic loci and candidate genes associated with the short-winged trait, we conducted genome-wide association studies (GWASs) using three models: GLM, MLM, and FarmCPU. The GLM was included as a baseline model without controlling for the population structure, while MLM accounted for both the population structure and kinship to reduce false positives. FarmCPU was selected for its ability to effectively control confounding while maintaining statistical power through using a multi-locus mixed model approach. The rMVP software (Version 1.0.6) package was used to analyze the associations between SNP markers and the short-winged trait in the F2 population, enabling detection of significant marker-trait associations. A GWAS threshold of −log (P) = 7 was used in declaring significant marker-trait associations (MTAs).
  • GLM Model
Using the GLM model, a Lambda value of 1.269 was obtained with three principal components (nPC = 3) (Figure 5A). A total of 4958 SNPs across seven chromosomes and 45 InDels were identified. SNPs were distributed as follows: 361 on chromosome 1; 25 on chromosome 2; 118 on chromosome 3; 4390 on chromosome 4; 23 on chromosome 5; 15 on chromosome 6; and 26 on chromosome 7 (Figure 5D). Among these, 53 InDels and 2319 SNPs explained over 10% of the trait variation, while a subset—5 InDels, 105 SNPs on chromosome 1, 5 on chromosome 3, and 522 on chromosome 4—accounted for more than 20% of variation. Notably, 4 SNPs on chromosome 1 and 14 on chromosome 4 explained over 30% of variation, with the SNP at position SNP_chr4::1013887633 explaining the highest variation at 49.37%.
  • MLM Model
With the MLM model, the Lambda value was 0.863 with three principal components (nPC = 3) (Figure 5B). A total of 716 SNPs and 11 InDels showed significant association with the wing trait in faba bean. The SNP distribution included 86 on chromosome 1; 5 on chromosome 2; 61 on chromosome 3; 542 on chromosome 4; 2 on chromosome 5; 9 on chromosome 6; and 11 on chromosome 7 (Figure 5E). A subset of 277 SNPs from chromosomes 1 and 4, and sca659 and sca1426, explained over 10% of trait variation. Key loci on chromosome 1 (SNP_chr1::386258298, SNP_chr1::386258299, SNP_chr1::386258476, and SNP_chr1::386330156) and chromosome 4 (SNP_chr4::1013887633, SNP_chr4::1038779197, and SNP_chr4::1060355967) explained over 20% of variation, with SNP_chr4::1013887633 reaching 26.85%.
  • FarmCPU Model
Using the FarmCPU model, the Lambda value was 1.064 with three principal components (nPC = 3) (Figure 5C). Ten SNPs significantly associated with the short-winged trait were identified (Figure 5F, Table 2). One SNP on chromosome 2 (SNP_chr2::1370936770) explained 9.29% of variation. Two SNPs on chromosome 3 (SNP_chr3::764672128 and SNP_chr3::1529700980) explained 9.29% and 6.18% of variation, respectively. Six SNPs on chromosome 4, including SNP_chr4::1013887633, which accounted for 22.20% of variation, also exhibited significant associations. The final SNP, located on chromosome 5 (SNP_chr5::1024268985), explained 11.78% of the variation in the wing trait.
Significant associations were identified between the SNP pairs on chromosomes 2 and 4 (Supplementary Table S1 and Supplementary Figure S2, respectively), with distinct odds ratios and contributions to the variance (R2). A notable association was observed between SNPs chr4::1013887633 and chr4::1068677594, showing an odds ratio of 533.85 and a p-value of 0.0261, explaining 2.03% of the phenotypic variance. These associations suggest potential loci interactions that may play a role in influencing the trait of interest.

Candidate Gene Analysis

The Q-Q plot generated from the FarmCPU model demonstrated that most points aligned along the diagonal, and the genomic inflation factor (λ) for FarmCPU was 1.064, which is close to 1, suggesting an appropriate balance between false positives and false negatives. This indicates a robust model fit for identifying genetic associations with the wing trait in the F2 population. Therefore, FarmCPU was determined to be the best model for identifying significant SNPs and performing candidate gene blasting. Utilizing this model, significant SNPs were analyzed, and a 500 kb confidence interval on each side of these SNPs in the Hedin2 genome v1 was used to search for candidate genes linked to the short-wing trait. In total, 52 candidate genes were identified (Table 3), although none were previously annotated in faba bean. Functional annotations for these candidate genes were assigned based on the Universal Protein Database (UniProt), providing insights into their potential roles in trait expression.
On chromosome 2, three novel candidate genes were identified, with gene-VFH_I185240 associated with DNA integration, metal ion binding, and ribonuclease activity; gene-VFH_I185160 linked to the regulation of organ growth and protein ubiquitination (protein ubiquitination:regulation of seed growth:ubiquitin protein ligase activity); and gene-VFH_I185200 had an unknown function.
Chromosome 3 yielded ten novel candidate genes, including gene-VFH_II126600, which was associated with transcription factor activity and DNA binding near SNP_chr3::764672128. Nine additional genes were identified around SNP_chr3::1529700980, including gene-VFH_II242840 and gene-VFH_II242880 with RNA–DNA hybrid ribonuclease activity; gene-VFH_II242760 was involved in vesicle-mediated transport; and gene-VFH_II126600, gene-VFH_II242920, and gene-VFH_II242960 were related to transcription regulation and DNA binding.
On chromosome 4, thirty-two candidate genes were identified across multiple intervals. Two genes near SNP_chr4::897638629, gene-VFH_III128960, and gene-VFH_III129000 were of unknown function. Eight genes in the region of SNP_chr4::1013887633 were also uncharacterized. In contrast, SNP_chr4::1025369220 contained gene-VFH_III147240, which was associated with post-translational modification; gene-VFH_III147280, which was linked to nucleotide transport and metabolism; and the other six genes were uncharacterized. Additionally, SNP_chr4::1038779197 included gene-VFH_III149200, which was found to be involved in signal transduction mechanisms and protein kinase activity. Within the candidate intervals of SNP_chr4::1068677594, ten genes were identified. Among them, gene-VFH_III153600, gene-VFH_III153640, gene-VFH_III153560, and gene-VFH_III153520 were linked to translation, ribosomal structure, and biogenesis, while gene-VFH_III153400 was associated with post-translational modification, protein turnover, and chaperone activity. Gene-VFH_III153480 was found to be involved in signal transduction mechanisms. Four other genes in this region—gene-VFH_III153320, gene-VFH_III153280, gene-VFH_III153440, and gene-VFH_III153360—have unknown functions. Additionally, three genes within the interval of SNP_chr4::1283122638—gene-VFH_III185920, gene-VFH_III185880, and gene-VFH_III185960 also lacked functional annotations.
Chromosome 5 analysis revealed four candidate genes within the interval of SNP_chr5::1024268985, with all of them having unknown functions in faba bean. However, gene-VFH_IV148120 and gene-VFH_IV148160 were predicted to encode subtilisin-like proteases in Arabidopsis thaliana, while gene-VFH_IV148200 likely encodes an inactive purple acid phosphatase. The function of gene-VFH_IV148240 remains unreported. These findings provide a basis for future exploration of the genetic basis of the short-wing trait and offer candidate genes that warrant further functional validation.

Gene Ontology (GO) Enrichment Analysis

The Gene Ontology (GO) enrichment analysis of the candidate genes identified significant enrichment in two primary functional categories (along with the preribosome compound metabolic process (Figure 6A)): RNA/rRNA and protein localization activity. Within these categories, two terms related to cellular components were identified: 90S preribosome (GO:0030686) and preribosome, small subunit precursor (GO:0030688). The biological process category included 18 terms related to rRNA processing and protein localization. These terms are as follows: cleavage involved in rRNA processing (GO:0000469); endonucleolytic cleavage in 5′-ETS of tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (GO:0000480); endonucleolytic cleavage in ITS1 to separate SSU-rRNA from 5.8S rRNA and LSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (GO:0000447); endonucleolytic cleavage involved in rRNA processing (GO:0000478); endonucleolytic cleavage of tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (GO:0000479); endonucleolytic cleavage to generate mature 5′-end of SSU-rRNA from (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (GO:0000472); maturation of 5.8S rRNA (GO:0000460); maturation of 5.8S rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (GO:0000466); ncRNA 5′-end processing (GO:0034471); protein-containing complex localization (GO:0031503); ribosomal small subunit export from nucleus (GO:0000056); ribosomal subunit export from nucleus (GO:0000054); ribosome localization (GO:0033750); RNA 5′-end processing (GO:0000966); RNA capping (GO:0036260); RNA phosphodiester bond hydrolysis (GO:0090501); RNA phosphodiester bond hydrolysis, endonucleolytic (GO:0090502); and rRNA 5′-end processing (GO:0000967).

Kyoto Encyclopedia of Genes and Genomes (KEGG) Annotation

The Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation indicated that the metabolic pathways of the candidate genes are primarily associated with key biochemical processes, specifically alanine, aspartate, and glutamate metabolism (ko00250); nucleotide metabolism (ko01232); purine metabolism (ko00230); ubiquitin-mediated proteolysis (ko04120); biosynthesis of cofactors (ko01240); and protein processing in the endoplasmic reticulum (ko04141) (Figure 6B).

3. Discussion

This study provides critical insights into the genetic basis of the short-winged trait in faba bean, highlighting its potential as a tool for reducing outcrossing rates and stabilizing crop performance. The segregation pattern observed in the F2 populations, with a 1:3 ratio of short-winged to normal-winged plants, suggests a single recessive gene controls the trait. This finding is consistent with previous studies on legumes such as Pisum sativum and Lotus japonicus, where similar short-wing traits are also governed by single recessive genes [17,19]. Given the 1:3 segregation ratio of short-winged to normal-winged plants in the F2 population, we selected 50 short-winged and 150 normal-winged individuals to reflect this natural inheritance pattern. While a more balanced case–control design could improve statistical power, our approach ensures adequate representation of the short-winged phenotype while maintaining the genetic diversity of the population. Future studies with larger sample sizes could further validate these findings and improve detection sensitivity.
Microscopic analyses using SEM and TEM revealed clear distinctions in epidermal cell morphology between the two wing types: short-winged variants showed tabular cells, while normal-winged variants exhibited conical cells. This morphological difference aligns with known floral adaptation mechanisms, as documented in Lotus japonicus studies, where the KEW1/kew1 gene influences wing petal morphology and cell development [19,20]. The consistency of tabular cell structure in keel petals across both wing variants suggests a conserved cellular architecture in these floral parts, possibly due to their functional roles.
The short-winged trait in the 200 selected F2 individuals, derived from two distinct crosses, was analyzed using a genome-wide association study (GWAS) approach. To account for potential population structure, we evaluated the selected individuals based on K values, which revealed nine distinct groups and is indicative of genetic variation within the population. This phylogenetic analysis supported the identified structure, aiding in the reduction in false positives in our GWAS results. Furthermore, our Q-Q plot analysis when using the FarmCPU model demonstrated a strong diagonal alignment of most data points, reinforcing the model’s effectiveness in addressing population structure [23]. The selection of GLM, MLM, and FarmCPU was based on their distinct advantages: GLM provides a simple linear association test, MLM accounts for kinship to reduce false positives, and FarmCPU enhances statistical power by integrating both fixed and random effects. To determine the most suitable model, we evaluated the genomic inflation factor (λ), where a value close to 1 indicates an appropriate balance between false positives and false negatives. The λ values for GLM (1.269) and MLM (0.836) suggested potential inflation and overcorrection, respectively, whereas the λ value for FarmCPU (1.064) was closest to 1, indicating well-controlled population structure effects. This, along with the well-calibrated Q-Q plot and clear signal resolution in the Manhattan plot, supports the choice of FarmCPU as the most reliable model for our analysis.
Our GWAS analysis identified 10 SNP loci that were significantly associated with the short-wing trait, with SNP_chr4::1013887633 accounting for 22.20% of the trait variance. Among the 49 candidate genes identified, most lacked prior annotations, and none had been previously associated with wing morphology in faba bean. Notable genes include VFH_III145120, which contains a NAM-related C-terminal domain linked to floral organ identity and lateral organ separation in model legumes such as Medicago truncatula. Microarray and real-time quantitative PCR analyses revealed that mutations in NAM down-regulate the expression of floral homeotic genes in M. truncatula [24]. These findings underscore the essential role of NAM in floral identity. NAM homologs in Petunia and Arabidopsis also play essential roles in floral patterning and differentiation, influencing the balance between cell division and expansion in reproductive organs [25,26]. Given its conserved function, VFH_III145120 in Vicia faba may similarly influence wing petal morphology by regulating genes involved in cell differentiation and expansion. Additionally, VFH_III149200, an inactive receptor kinase gene, likely contributes to epidermal differentiation, which was supported by the SEM and TEM observations of conical versus tabular cells between wing variants, paralleling findings in Pisum sativum (pea) and Arabidopsis thaliana [27]. Another significant candidate, VFH_I185160 on chromosome 2, corresponds to the E3 ubiquitin-protein ligase DA2 OS, which has been linked to floral development in Arabidopsis [28,29]. RNA-seq analysis identified over 200 proteins are present in the E3s mutant but absent in the wild type, while more than 300 proteins exhibited higher abundance in the mutant [29]. These findings highlight the crucial role of E3s. Given its association with SNP_chr4::1013887633, which explains 22.20% of trait variance, this gene may regulate key growth factors involved in petal expansion and final morphology.
The associations identified between the SNP pairs on chromosomes 2 and 4 suggest significant genetic interactions influencing the trait under study. Notably, the pair chr4::1013887633 and chr4::1068677594 showed a substantial positive association (OR = 533.85, p = 0.0261). While genes in the chr4::1013887633 region remain uncharacterized, those within the chr4::1068677594 region were associated with translation, ribosomal structure, biogenesis, post-translational modification, protein turnover, and chaperone activity. These findings indicate that the loci on chromosomes 2 and 4 may interact to modulate the trait, with some variants potentially acting as protective factors and others as risk enhancers. While high odds ratios like 533.85 are rare, they suggest that specific allelic combinations may have a pronounced effect. This extreme value may also be due to small sample sizes in certain genotype combinations, leading to inflated estimates. Further validation with larger sample sizes may help ensure the robustness of these findings.
The collective evidence suggests a hierarchical regulatory network in which NAM transcription factors (VFH_III145120) function upstream to orchestrate floral identity by regulating genes involved in cell differentiation and expansion. This regulation likely integrates with receptor kinase signaling (VFH_III149200) to fine tune the epidermal cell fate and structural modifications in wing petals. Additionally, post-translational mechanisms mediated by E3 ubiquitin ligases (VFH_I185160) contribute to protein turnover, ensuring precise control over floral organ growth. The interactions between loci on chromosomes 2 and 4 suggest a coordinated genetic framework, where some variants may enhance trait stability, while others introduce variability, shaping the final wing petal morphology in Vicia faba. However, further validation studies are required to confirm these associations and elucidate the underlying biological mechanisms.
Enrichment analysis when using GO and KEGG revealed key biological processes and metabolic pathways tied to the candidate genes, including RNA/rRNA processing, ribosome assembly, and protein localization. These processes are integral to morphological and developmental regulation, suggesting that the candidate genes identified may influence cellular architecture and wing morphology. Specifically, ribosome biogenesis and preribosome complex pathways, which are crucial for cellular growth and reproduction, emerged as significant. Mutations in ribosomal proteins (RPs) and ribosome biogenesis factors (RBFs) are known to impact plant growth and flowering times [30], aligning with our findings on the genetic regulation of floral traits.
Purine metabolism and ubiquitin-mediated proteolysis pathways, identified through KEGG analysis, are central to floral and cell differentiation. For instance, purine metabolism is critical for floral bud development, as shown in Citrullus lanatus studies, where gene expression in this pathway varies between male-fertile and sterile buds [31]. The ubiquitin-mediated proteolysis pathway plays vital roles in plant responses to environmental and developmental cues, including flooding, salinity, vernalization, and the functionality of the shoot apical meristem [28]. It is also involved in regulating plant responses to environmental factors and floral senescence, indicating the importance of protein turnover in floral organ development and maintenance [29,32]. The E3 ubiquitin ligase family, notably F-box genes like UFO (UNUSUAL FLORAL ORGANS) in Arabidopsis and its ortholog FIM (FIMBRIATA) in Antirrhinum, further illustrates the role of ubiquitin in flowering regulation; mutations in these genes can lead to various floral defects, indicating their multifaceted involvement in floral patterning [33,34,35,36]. Additionally, endoplasmic reticulum (ER)-associated protein degradation mechanisms, which are crucial for proper protein folding, may influence floral abscission and development [37], suggesting potential impacts on the wing morphology of faba bean.
The enriched GO terms highlight processes involved in RNA regulation and protein localization that could be pivotal for cell shape and differentiation. The significant enrichment of pathways related to amino acid and nucleotide metabolism, protein degradation, and cofactor biosynthesis further underscores these candidate genes’ regulatory roles in plant adaptation and morphology.

4. Materials and Methods

4.1. Plant Materials and Phenotypic Data Collection

Two F2 populations were created from crosses between ‘K0692’ (short-winged) and ‘Yundou 1183’ (n = 2156), as well as between ‘Yundoulvxin 1’ (short-winged) and ‘Yundou 1183’ (n = 1872). The two populations were used for segregation analysis. For the genome-wide association study, 200 F2 individuals from both populations were selected for re-sequencing (Supplementary Table S2). The parental lines ‘K0692’ and ‘Yundou 1183’ were also utilized for scanning electron microscopy (SEM) and transmission electron microscopy (TEM) analyses.
The F2 populations and the parental lines were planted at the Field Station of the Yunnan Academy of Agricultural Sciences in 2021. Wing type phenotypes were recorded at the flowering stage. Young leaf samples from all of the F2 individuals were collected at flowering and stored at −20 °C for future analysis. Fully opened flowers from the parental lines ‘K0692’ and ‘Yundou 1183’ were collected for SEM and TEM studies.

4.2. Scanning Electron Microscopy (SEM) of the Epidermal Cells on Wing and Keel Petals

To analyze the petal epidermis, SEM was performed on the wing and keel petals of the parental lines ‘K0692’ and ‘Yundou 1183’, following the protocol established by Xiao et al. [38]. Fresh petals from fully blooming flowers were selected, and central sections (approximately 0.5 × 0.5 mm) were cut and immersed in a 2.5% (v/v) glutaraldehyde solution (Biosharp, Hefei, China) at 4 °C overnight. The samples were then washed three times with phosphate buffer solution (pH 7.2, (Sigma-Aldrich, St. Louis, MO, USA)) for 10 min each. Subsequently, the petal samples underwent dehydration in a graded ethanol series (30%, 50%, 70%, 90%, 95%, and 100%), with each step lasting 10 min. The samples were treated twice with isoamyl acetate for 15 min each. After dehydration, samples were dried using a critical point dryer (EM CPD 300; BAL-TEC Inc., Balzers, Liechtenstein), with liquid carbon dioxide being used as the transitional fluid. Finally, the samples were mounted on metal stubs, sputter coated with gold, and examined using a scanning electron microscope (Hitachi SU-8100, Tokyo, Japan).

4.3. Transmission Electron Microscopy (TEM) of the Epidermal Cells on Wing Petal

Transmission electron microscopy (TEM) was conducted on the wing petals from the parental lines ‘K0692’ (short-winged) and ‘Yundou 1183’, according to the protocol by Tajuddin et al. [39]. Samples were initially fixed in a 2.5% glutaraldehyde solution (Electron Microscopy Sciences, Hatfield, PA, USA) prepared in 0.1 M phosphate-buffered saline (PBS) (Sigma-Aldrich) at pH 7.0 for two hours under vacuum to improve penetration. Following primary fixation, samples were rinsed with PBS multiple times, then subjected to secondary fixation with 2% osmium tetroxide (Electron Microscopy Sciences) for lipid preservation. For dehydration, samples were passed through a graded ethanol series, with each step lasting 15 min, and they were then transitioned to propylene oxide (Electron Microscopy Sciences). The samples were subsequently infiltrated with resin while being continuously rotated to ensure uniform penetration. After infiltration, samples were polymerized in epoxy resin at 70 °C overnight. The hardened resin blocks were sectioned to a thickness of 70–90 nanometers using an ultramicrotome (EM UC7, LEICA, Wetzlar, Germany). Sections were stained with uranyl acetate (Sigma-Aldrich) and lead citrate (Electron Microscopy Sciences) before examination under a transmission electron microscope (TECNAL SPIRIT G2, FEI, Hillsboro, OR, USA).

4.4. Genomic DNA Library Preparation and Sequencing

Total genomic DNA was extracted from the 200 selected F2 individual plants using the DNeasy Plant Pro Kit (Qiagen, Hilden, Germany), following the Quick-Start protocol (Qiagen.com/HB-2522) (Qiagen). DNA quantity and quality were evaluated using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and a Quantus™ Fluorometer (Promega, Madison, WI, USA). Only DNA samples meeting quality standards (OD260/280 = 1.8–2.0, OD260/230 ≥ 2.0, and concentration ≥ 25 ng/µL) were selected for sequencing library construction. DNA integrity and contamination were assessed via 1% agarose gel electrophoresis.
For library preparation, 0.5 µg of DNA per sample was used. Libraries were prepared as paired-end sequencing libraries for 10 × genome coverage using the Illumina NovaSeq™ X Plus platform (Illumina, San Diego, CA, USA). Library construction followed the TruSeq Nano DNA HT Sample Prep Kit protocol (Illumina, San Diego, CA, USA). Briefly, genomic DNA was sonicated to ~350 bp fragments, end-polished, A-tailed, and ligated to full-length adapters for sequencing on the NovaSeq X Plus. PCR amplification was performed, and the products were purified using the AMPure XP system. Library size distribution was assessed with an Agilent 2100 Bioanalyzer, and libraries were quantified by real-time PCR (3 nM). Paired-end sequencing was performed on the Illumina NovaSeq X Plus system at Shanghai Majorbio Bio-pharm Technology Co., Ltd (Shanghai, China).

4.5. Variant Discovery

Low-quality raw reads (mean Phred score < 20), i.e., reads with adapter contamination, and reads with unrecognizable nucleotides (N bases > 10) were trimmed or discarded using Fastp [40]. Trimmed reads were then mapped to the Hedin2 reference genome v1 [22] using BWA-MEME [41] with default parameters. Following a modified GATK Best Practices workflow [42], alignment BAM files were sorted using Samtools [43], and PCR duplicates were marked with MarkDuplicates. After base quality recalibration, germline variants (SNPs and InDels) were called across all samples using the Haplotyper and Gvcftyper programs in the Sentieon genomics tools [44]. Variants were filtered according to standard hard filtering parameters from the GATK Best Practices pipeline.
All variants were annotated with SnpEff [45]. SNPs and InDels were categorized based on chromosomal positions (intergenic regions, exons, introns, splicing sites, untranslated regions, and 1-kb upstream and downstream regions) and their functional effects (missense, start codon gain or loss, stop codon gain or loss, and splicing mutations). To minimize false positives in the SNP and genotype calling, several filtering steps were applied: (i) SNPs with more than two alleles were removed; (ii) SNPs with a mean depth across samples < 4 were excluded; (iii) SNPs with a minor allele frequency < 0.05 were discarded; (iv) SNPs were retained only if genotyped in at least 70% of samples; and (v) SNPs were pruned for population structure analysis based on linkage disequilibrium using Plink [46].

4.6. Phylogenetic Analyses

Maximum likelihood (ML) and neighbor joining (NJ) phylogenetic trees were constructed using IQ-TREE2 Version 2.1.2 [47] and FastTree Version 2.1.11 [48], respectively. The NJ tree was generated in FastTree with the GTR + Gamma model and 1000 bootstrap replicates, based on pruned SNP sites. ML analysis was conducted in IQ-TREE2 using the GTR + I + G4 model with 1000 bootstrap replicates.

4.7. Population Structure Analysis

Population structure was assessed using the unsupervised maximum-likelihood clustering algorithm in ADMIXTURE Version 1.3.0 [49] for k = 1 to k = 20 ancestral clusters with default parameters. Pruned SNPs were used for increased clustering accuracy.

4.8. Principal Component Analysis (PCA)

To visualize the genetic relationships among the 200 F2 lines, principal component analysis (PCA) was performed based on pruned SNPs using Plink Version 1.90b6.20 [46].

4.9. Population Diversity Indexes

Population divergence indexes were calculated using the populations program of Stacks Version 2.66.

4.10. Linkage Disequilibrium (LD) Analysis

To assess the LD decay across the genome, the squared correlation coefficient (r2) between loci pairs was calculated using PopLDdecay Version 3.42 [50], based on re-sequencing data. LD was estimated with a sliding window size of 50 kb, and a maximum inter-marker distance of 500 kb was set to calculate LD between SNP pairs. To minimize the bias caused by tightly linked loci, SNP pairs with r2 values greater than 0.5 were excluded from the analysis. The average r2 values were computed for pairwise SNPs within 500 kb bins, and these were then averaged across the genome to generate the LD decay curve.

4.11. Relatedness Analysis

The pairwise relatedness (kinship) between individuals was estimated using Plink Version 1.90b6.20 [51], based on SNP data.

4.12. Genome-Wide Association Analysis (GWAS)

Genome-wide association studies (GWASs) were performed to identify the loci associated with the short-winged trait using the R package ‘rMVP’ and by applying GLM, MLM, and FarmCPU models [52]. The first three principal components and a kinship matrix were incorporated as covariates in the models. Significance thresholds for p-values were set using a Bonferroni correction (0.05/n, where n is the total number of SNPs) to control the genome-wide type I error rate. Epistasis among significant loci was analyzed using Plink Version 1.90b6.20. To control multiple testing in the epistasis analysis, a false discovery rate (FDR) correction was applied to maintain a stringent genome-wide significance level. A GWAS threshold of − log (P) = 7 was used in declaring significant marker-trait associations (MTAs), which approximately corresponds to the Bonferroni-adjusted significance level given the total number of SNPs used in the analysis. This stringent threshold helps reduce false positives while ensuring robust association signals.
For loci identified as significant, a 500 kb flanking region on each side of the loci was extracted as the candidate region associated with the trait. Genes overlapping with these candidate regions were selected for further functional enrichment analysis.

5. Conclusions

This study identified the loci and candidate genes linked to the short-winged trait in faba bean, underscoring the genetic basis for this trait and its potential in reducing outcrossing rates. By integrating morphological observations with GWAS findings, this work provides a foundation for the molecular breeding of low-outcrossing faba bean varieties, promoting stable and reliable crop production. These insights contribute to the broader field of faba bean breeding and highlight the potential for developing sustainable, adaptable varieties under diverse environmental conditions.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms26062733/s1.

Author Contributions

H.Y.: methodology, conducting trials, drafting the manuscript, and providing funding support; F.Y. and C.H.: data collection; Q.W., Y.J., Z.D. and X.Y.: analysis and interpretation of data; J.C., A.Z., Y.W. and Q.L.: acquisition of data; and H.Y., Y.H. and M.L.: critical revision of the manuscript for important intellectual content. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by the National Natural Science Foundation—Regional Fund Project: Research on the genetic characteristics and mining of key genes for short-winged trait in rare germplasm resource (K0692) of faba bean (Vicia faba L.) (32360466); Yunnan Xingdian Talents Support Program-Youth Talent Project; the China Agriculture Research System—Food Legumes (CARS08-G07); and the Technology Talent and Platform Program—Yunnan Laboratory Construction Special Project: Demonstration of integrated innovation and transformation of agricultural seed industry achievements (202305AR340003).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Raw data were generated at the Institute of Food Crops, Yunnan Academy of Agricultural Science. Derived data supporting the findings of this study are available from the corresponding author, H.Y., on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Jensen, E.S.; Peoples, M.B.; Hauggaard-Nielsen, H. Faba Bean in Cropping Systems. Field Crops Res. 2010, 115, 203–216. [Google Scholar] [CrossRef]
  2. Köpke, U.; Nemecek, T. Ecological Services of Faba Bean. Field Crops Res. 2010, 115, 217–233. [Google Scholar] [CrossRef]
  3. Link, W. Autofertility and Rate of Cross-Fertilization: Crucial Characters for Breeding Synthetic Varieties in Faba Beans (Vicia faba L.). Theor. Appl. Genet. 1990, 79, 713–717. [Google Scholar] [CrossRef]
  4. Yu, H.; Wang, L.; Yang, F.; Lv, M.; Zong, X.; Yang, T. Identification and Utilization of Short Wing Petal Broad Bean (Vicia faba L.) Germplasm. J. Plant Genet. Resour. 2019, 20, 1334–1339. [Google Scholar]
  5. Cubero, J.I.; Hebblethwaite, P.D. The Faba Bean; CRC Press: Boca Raton, FL, USA, 1983. [Google Scholar]
  6. Yu, H.; Wang, L.; Lü, L.; Yang, T.; Yang, F.; Liu, R.; He, Y.; Zong, X. Advances on Related Factors and Applied Research of Outcrossing Rate in Faba Bean (Vicia faba L.). Crops 2020, 36, 9–15. [Google Scholar] [CrossRef]
  7. Musallam, I.W.; Haddad, N.J.; Tawaha, A.-R.M.; Migdadi, O.S. The Importance of Bee-Pollination in Four Genotypes of Faba Bean (Vicia faba L.). Int. J. Agric. Biol. 2004, 6, 9–12. [Google Scholar]
  8. Bishnoi, S.K.; Hooda, J.S.; Yadav, I.S.; Panchta, R. Advances in Heterosis and Hybrid Breeding in Faba Bean (Vicia faba L.). Forage Res. 2012, 38, 24–27. [Google Scholar]
  9. Zariman, N.A.; Omar, N.A.; Huda, A.N. Plant Attractants and Rewards for Pollinators: Their Significant to Successful Crop Pollination. Int. J. Life Sci. Biotechnol. 2022, 5, 270–293. [Google Scholar] [CrossRef]
  10. Irwin, R.E.; Adler, L.S.; Brody, A.K. The Dual Role of Floral Traits: Pollinator Attraction and Plant Defense. Ecology 2004, 85, 1503–1511. [Google Scholar] [CrossRef]
  11. Specht, C.D.; Howarth, D.G. Adaptation in Flower Form: A Comparative Evodevo Approach. New Phytol. 2015, 206, 74–90. [Google Scholar] [CrossRef]
  12. Buide, M.L. Pollination Ecology of Silene Acutifolia (Caryophyllaceae): Floral Traits Variation and Pollinator Attraction. Ann. Bot. 2006, 97, 289–297. [Google Scholar] [CrossRef] [PubMed]
  13. Costa, V.B.S.; Pimentel, R.M.M.; Chagas, M.G.S.; Alves, G.D.; Castro, C.C. Petal Micromorphology and Its Relationship to Pollination. Plant Biol. 2017, 19, 115–122. [Google Scholar] [CrossRef] [PubMed]
  14. Rands, S.A.; Glover, B.J.; Whitney, H.M. Floral Epidermal Structure and Flower Orientation: Getting to Grips with Awkward Flowers. Arthropod-Plant Interact. 2011, 5, 279–285. [Google Scholar] [CrossRef]
  15. Bailes, E.J.; Glover, B.J. Intraspecific Variation in the Petal Epidermal Cell Morphology of Vicia faba L. (Fabaceae). Flora 2018, 244–245, 29–36. [Google Scholar] [CrossRef]
  16. Ahmar, S.; Gill, R.A.; Jung, K.-H.; Faheem, A.; Qasim, M.U.; Mubeen, M.; Zhou, W. Conventional and Molecular Techniques from Simple Breeding to Speed Breeding in Crop Plants: Recent Advances and Future Outlook. Int. J. Mol. Sci. 2020, 21, 2590. [Google Scholar] [CrossRef]
  17. Pellew, C.; Sverdrup, A. New Observations on the Genetics of Peas (Pisum sativum). J. Genet. 1923, 13, 125–131. [Google Scholar] [CrossRef]
  18. Wang, Z.; Luo, Y.; Li, X.; Wang, L.; Xu, S.; Yang, J.; Weng, L.; Sato, S.; Tabata, S.; Ambrose, M.; et al. Genetic Control of Floral Zygomorphy in Pea (Pisum sativum L.). Proc. Natl. Acad. Sci. USA 2008, 105, 10414–10419. [Google Scholar] [CrossRef]
  19. Feng, X.; Zhao, Z.; Tian, Z.; Xu, S.; Luo, Y.; Cai, Z.; Wang, Y.; Yang, J.; Wang, Z.; Weng, L.; et al. Control of Petal Shape and Floral Zygomorphy in Lotus japonicus. Proc. Natl. Acad. Sci. USA 2006, 103, 4970–4975. [Google Scholar] [CrossRef] [PubMed]
  20. Xu, Z.; Cheng, K.; Li, X.; Yang, J.; Xu, S.; Cao, X.; Hu, X.; Xie, W.; Yuan, L.; Ambrose, M.; et al. Transcriptional and Post-Transcriptional Modulation of SQU and KEW Activities in the Control of Dorsal–Ventral Asymmetric Flower Development in Lotus japonicus. Mol. Plant 2016, 9, 722–736. [Google Scholar] [CrossRef]
  21. O’Sullivan, D.M.; Angra, D. Advances in Faba Bean Genetics and Genomics. Front. Genet. 2016, 7, 150. [Google Scholar] [CrossRef]
  22. Jayakodi, M.; Golicz, A.A.; Kreplak, J.; Fechete, L.I.; Angra, D.; Bednář, P.; Bornhofen, E.; Zhang, H.; Boussageon, R.; Kaur, S.; et al. The Giant Diploid Faba Genome Unlocks Variation in a Global Protein Crop. Nature 2023, 615, 652–659. [Google Scholar] [CrossRef]
  23. Burghardt, L.T.; Young, N.D.; Tiffin, P. A Guide to Genome-Wide Association Mapping in Plants. Curr. Protoc. Plant Biol. 2017, 2, 22–38. [Google Scholar] [CrossRef] [PubMed]
  24. Cheng, X.; Peng, J.; Ma, J.; Tang, Y.; Chen, R.; Mysore, K.S.; Wen, J. NO APICAL MERISTEM (MtNAM) Regulates Floral Organ Identity and Lateral Organ Separation in Medicago Truncatula. New Phytol. 2012, 195, 71–84. [Google Scholar] [CrossRef] [PubMed]
  25. Sablowski, R.W.M.; Meyerowitz, E.M. A Homolog of NO APICAL MERISTEM Is an Immediate Target of the Floral Homeotic Genes APETALA3/PISTILLATA. Cell 1998, 92, 93–103. [Google Scholar] [CrossRef] [PubMed]
  26. Souer, E.; van Houwelingen, A.; Kloos, D.; Mol, J.; Koes, R. The No Apical Meristem Gene of Petunia Is Required for Pattern Formation in Embryos and Flowers and Is Expressed at Meristem and Primordia Boundaries. Cell 1996, 85, 159–170. [Google Scholar] [CrossRef]
  27. Tanaka, H.; Watanabe, M.; Watanabe, D.; Tanaka, T.; Machida, C.; Machida, Y. ACR4, a Putative Receptor Kinase Gene of Arabidopsis Thaliana, That Is Expressed in the Outer Cell Layers of Embryos and Plants, Is Involved in Proper Embryogenesis. Plant Cell Physiol. 2002, 43, 419–428. [Google Scholar] [CrossRef]
  28. Holdsworth, M.J.; Vicente, J.; Sharma, G.; Abbas, M.; Zubrycka, A. The Plant N-Degron Pathways of Ubiquitin-Mediated Proteolysis. J. Integr. Plant Biol. 2020, 62, 70–89. [Google Scholar] [CrossRef]
  29. Lu, D.; Ni, W.; Stanley, B.A.; Ma, H. Proteomics and Transcriptomics Analyses of Arabidopsis Floral Buds Uncover Important Functions of ARABIDOPSIS SKP1-LIKE1. BMC Plant Biol. 2016, 16, 61. [Google Scholar] [CrossRef]
  30. Li, K.; Wang, P.; Ding, T.; Hou, L.; Li, G.; Zhao, C.; Zhao, S.; Wang, X.; Li, P. Mutation of an Essential 60S Ribosome Assembly Factor MIDASIN 1 Induces Early Flowering in Arabidopsis. Int. J. Mol. Sci. 2022, 23, 6509. [Google Scholar] [CrossRef]
  31. Zhang, G.; Wei, B.; Ding, Q. Identification of Differentially Expressed miRNAs between Male Sterile and Fertile Floral Buds in Watermelon (Citrullus lanatus L.) via High-Throughput Sequencing. 3 Biotech 2024, 14, 247. [Google Scholar] [CrossRef]
  32. Courtney, S.E.; Rider, C.C.; Stead, A.D. Changes in Protein Ubiquitination and the Expression of Ubiquitin-Encoding Transcripts in Daylily Petals during Floral Development and Senescence. Physiol. Plant. 1994, 91, 196–204. [Google Scholar] [CrossRef]
  33. Ingram, G.C.; Doyle, S.; Carpenter, R.; Schultz, E.A.; Simon, R.; Coen, E.S. Dual Role for Fimbriata in Regulating Floral Homeotic Genes and Cell Division in Antirrhinum. EMBO J. 1997, 16, 6521–6534. [Google Scholar] [CrossRef]
  34. Levin, J.Z.; Meyerowitz, E.M. UFO: An Arabidopsis Gene Involved in Both Floral Meristem and Floral Organ Development. Plant Cell 1995, 7, 529–548. [Google Scholar] [CrossRef]
  35. Moon, J.; Parry, G.; Estelle, M. The Ubiquitin-Proteasome Pathway and Plant Development. Plant Cell 2004, 16, 3181–3195. [Google Scholar] [CrossRef]
  36. Samach, A.; Klenz, J.E.; Kohalmi, S.E.; Risseeuw, E.; Haughn, G.W.; Crosby, W.L. The UNUSUAL FLORAL ORGANS Gene of Arabidopsis Thaliana Is an F-Box Protein Required for Normal Patterning and Growth in the Floral Meristem. Plant J. 1999, 20, 433–445. [Google Scholar] [CrossRef]
  37. Strasser, R. Protein Quality Control in the Endoplasmic Reticulum of Plants. Annu. Rev. Plant Biol. 2018, 69, 147–172. [Google Scholar] [CrossRef]
  38. Xiao, W.; Li, Z.; Chen, H.; Lv, F. Visualization of Micromorphology of Petal Epidermal Features of Waxy and Velvety Flowers in Phalaenopsis. EBSCOhost. Available online: https://openurl.ebsco.com/contentitem/doi:10.2306%2Fscienceasia1513-1874.2020.080?sid=ebsco:plink:crawler&id=ebsco:doi:10.2306%2Fscienceasia1513-1874.2020.080 (accessed on 17 February 2025).
  39. Tajuddin, S.; Mat, N.; Yunus, A.G. Anatomical Study of Stem, Petiole, Leaf, Tuber, Root and Flower of Dioscorea hispida Dennst. (Dioscoreaceae) by Using Optical Microscope, SEM and TEM. J. Agrobiotechnol. 2013, 4, 33–42. [Google Scholar]
  40. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An Ultra-Fast All-in-One FASTQ Preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  41. Jung, Y.; Han, D. BWA-MEME: BWA-MEM Emulated with a Machine Learning Approach. Bioinformatics 2022, 38, 2404–2413. [Google Scholar] [CrossRef]
  42. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce Framework for Analyzing next-Generation DNA Sequencing Data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef]
  43. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. 1000 Genome Project Data Processing Subgroup the Sequence Alignment/Map Format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed]
  44. Freed, D.; Aldana, R.; Weber, J.A.; Edwards, J.S. The Sentieon Genomics Tools—A Fast and Accurate Solution to Variant Calling from next-Generation Sequence Data. bioRxiv 2017, 115717. [Google Scholar] [CrossRef]
  45. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A Program for Annotating and Predicting the Effects of Single Nucleotide Polymorphisms, SnpEff. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef] [PubMed]
  46. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  47. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  48. Price, M.N.; Dehal, P.S.; Arkin, A.P. FastTree 2—Approximately Maximum-Likelihood Trees for Large Alignments. PLoS ONE 2010, 5, e9490. [Google Scholar] [CrossRef]
  49. Alexander, D.H.; Novembre, J.; Lange, K. Fast Model-Based Estimation of Ancestry in Unrelated Individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef]
  50. Zhang, C.; Dong, S.-S.; Xu, J.-Y.; He, W.-M.; Yang, T.-L. PopLDdecay: A Fast and Effective Tool for Linkage Disequilibrium Decay Analysis Based on Variant Call Format Files. Bioinformatics 2019, 35, 1786–1788. [Google Scholar] [CrossRef]
  51. Manichaikul, A.; Mychaleckyj, J.C.; Rich, S.S.; Daly, K.; Sale, M.; Chen, W.-M. Robust Relationship Inference in Genome-Wide Association Studies. Bioinformatics 2010, 26, 2867–2873. [Google Scholar] [CrossRef]
  52. Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, S.; Li, X.; et al. rMVP: A Memory-Efficient, Visualization-Enhanced, and Parallel-Accelerated Tool for Genome-Wide Association Study. Genom. Proteom. Bioinform. 2021, 19, 619–628. [Google Scholar] [CrossRef]
Figure 1. Comparison of short-winged flowers (AE) and regular flowers (FJ): (A,F) top view of the corolla; (B,G) side view of the corolla; (C,H) diagram of the corolla (standard petal (a), wing petal (b), and keel petal (c)); (D,I) keel petal; (E,J) wing petal; and (K) one of the honeybee’s nectar-gathering behaviors on faba bean flowers (facilitating effective cross-pollination), where the bee uses the wing petals as a foothold, landing between the two wing petals to collect nectar.
Figure 1. Comparison of short-winged flowers (AE) and regular flowers (FJ): (A,F) top view of the corolla; (B,G) side view of the corolla; (C,H) diagram of the corolla (standard petal (a), wing petal (b), and keel petal (c)); (D,I) keel petal; (E,J) wing petal; and (K) one of the honeybee’s nectar-gathering behaviors on faba bean flowers (facilitating effective cross-pollination), where the bee uses the wing petals as a foothold, landing between the two wing petals to collect nectar.
Ijms 26 02733 g001
Figure 2. Scanning electron microscope results of the epidermal cells: the wing petal (A,B,E,F) and keel petal (C,D,G,H) epidermal cells of the short-winged faba bean (AD) and normal-winged faba bean (EH). (E,F) Conical cells; others, tabular cells.
Figure 2. Scanning electron microscope results of the epidermal cells: the wing petal (A,B,E,F) and keel petal (C,D,G,H) epidermal cells of the short-winged faba bean (AD) and normal-winged faba bean (EH). (E,F) Conical cells; others, tabular cells.
Ijms 26 02733 g002
Figure 3. Transmission electron microscope scan results of the epidermal cells: wing petal epidermal cells of the short-winged faba bean (A,B) and normal-winged faba bean (C,D). (A,B) Tabular cells; (C,D) conical cells.
Figure 3. Transmission electron microscope scan results of the epidermal cells: wing petal epidermal cells of the short-winged faba bean (A,B) and normal-winged faba bean (C,D). (A,B) Tabular cells; (C,D) conical cells.
Ijms 26 02733 g003
Figure 4. The genome structure and genotyping analyses of 200 F2 individual populations. (A) Genome structure; (B) cross-validation diagram of the SNP dataset; (C) population structure analyzed by STRUCTURE at K = 8, 9, and 10; (D) principal component analysis of the population; and (E) phylogenetic tree of the population. The red point in (B) indicates the structural simulation results, showing that k = 9 minimized the cross-validation (CV) error.
Figure 4. The genome structure and genotyping analyses of 200 F2 individual populations. (A) Genome structure; (B) cross-validation diagram of the SNP dataset; (C) population structure analyzed by STRUCTURE at K = 8, 9, and 10; (D) principal component analysis of the population; and (E) phylogenetic tree of the population. The red point in (B) indicates the structural simulation results, showing that k = 9 minimized the cross-validation (CV) error.
Ijms 26 02733 g004
Figure 5. Loci related to the short-winged trait based on the GLM, MLM, and FarmCPU methods. (A,D) QQ and Manhattan plots of the GWAS results based on the GLM method; (B,E) QQ and Manhattan plots of the GWAS results based on the MLM method; and (C,F) QQ and Manhattan plots of the GWAS results based on the FarmCPU model. Red dots above the threshold line or deviating from the linear curve represent significantly associated loci.
Figure 5. Loci related to the short-winged trait based on the GLM, MLM, and FarmCPU methods. (A,D) QQ and Manhattan plots of the GWAS results based on the GLM method; (B,E) QQ and Manhattan plots of the GWAS results based on the MLM method; and (C,F) QQ and Manhattan plots of the GWAS results based on the FarmCPU model. Red dots above the threshold line or deviating from the linear curve represent significantly associated loci.
Ijms 26 02733 g005
Figure 6. GO (A) and KEGG (B) analyses of the genes associated with short-winged traits via the FarmCPU model. (A) GO Enrichment Analysis, candidate genes were enriched in biological processes and molecular functions; (B) Kyoto Encyclopedia of Genes and Genomes (KEGG) Annotation of candidate genes significantly involved in metabolic pathways.
Figure 6. GO (A) and KEGG (B) analyses of the genes associated with short-winged traits via the FarmCPU model. (A) GO Enrichment Analysis, candidate genes were enriched in biological processes and molecular functions; (B) Kyoto Encyclopedia of Genes and Genomes (KEGG) Annotation of candidate genes significantly involved in metabolic pathways.
Ijms 26 02733 g006
Table 1. Segregation of the wing traits in the F2 populations.
Table 1. Segregation of the wing traits in the F2 populations.
PopulationNormal-Winged (N)Short-Winged (S)TotalN:S Ratio aExp. N bExp. S cChi-Square dp-Value e
‘Yundoulvxin 1’ (short-winged) × ‘Yundou 1183’138948318723N:1S14044680.640.42
‘K0692’ (short-winged) × ‘Yundou 1183’162453121553N:1S1616.25538.750.150.70
Total3013101440273N:1S3020.251006.750.070.79
a Expected ratio of the normal-winged to short-winged individuals in the F2 populations. b Expected number of the normal-winged individuals. c Expected number of the short-winged individuals. d Chi-square results comparing the observed and expected segregation. e A p-value greater than 0.05 indicates no significant difference.
Table 2. Loci related to the short-winged trait based on the FarmCPU method.
Table 2. Loci related to the short-winged trait based on the FarmCPU method.
ChromosomePositionReference aAlternation bSE cp-Value dPVE e
chr21,370,936,770TC0.0190 1.58 × 10−90
chr3764,672,128TA0.0203 2.40 × 10−109.3%
chr31,529,700,980TG0.0365 2.15 × 10−96.2%
chr4897,638,629GA0.0297 3.35 × 10−85.2%
chr41,013,887,633CT0.0163 2.57 × 10−2522.2%
chr41,025,369,220AG0.0324 9.77 × 10−120
chr41,038,779,197CT0.0195 2.22 × 10−1817.0%
chr41,068,677,594AC0.0101 3.11 × 10−117.4%
chr41,283,122,638CA0.0206 2.25 × 10−80
chr51,024,268,985AG0.0896 1.20 × 10−911.8%
a Nucleotide of the reference genome. b Nucleotide mutations in the studied populations. c Standard errors for the percentage of variation explained (PVE) by each SNP. d Significance of the loci associated with phenotypic traits, where p-values less than 0.05 indicate significance. e Percentage of the phenotypic variation explained by the corresponding loci.
Table 3. Candidate gene list.
Table 3. Candidate gene list.
Gene IDChromosomeStrandStartEndAnnotation
gene-VFH_I185160chr2+1,370,573,6431,370,577,612E3 ubiquitin-protein ligase DA2 OS
gene-VFH_I185200chr2+1,370,577,9521,370,578,250--
gene-VFH_I185240chr2+1,370,586,1331,370,595,083--
gene-VFH_II126600chr3+764,359,337764,360,410Ethylene-responsive transcription factor CRF3 OS
gene-VFH_II242640chr3+1,529,265,5021,529,266,471--
gene-VFH_II242680chr3+1,529,957,9691,529,968,053Sterol 3-beta-glucosyltransferase UGT80A2 OS
gene-VFH_II242720chr3+1,530,074,6911,530,075,183--
gene-VFH_II242760chr3+1,530,076,0621,530,077,346Vesicle-associated membrane protein 722 OS
gene-VFH_II242760chr3+1,530,076,0621,530,077,346Vesicle-associated membrane protein 721 OS
gene-VFH_II242800chr3+1,530,088,2521,530,088,560--
gene-VFH_II242840chr3+1,530,088,5861,530,088,906--
gene-VFH_II242880chr3+1,530,089,9431,530,090,287Uncharacterized mitochondrial protein AtMg00310 OS
gene-VFH_II242920chr3+1,530,152,9891,530,153,702Zinc-finger homeodomain protein 2 OS
gene-VFH_II242960chr3+1,530,183,7341,530,184,447Zinc-finger homeodomain protein 2 OS
gene-VFH_III128960chr4+897,872,521897,872,763--
gene-VFH_III129000chr4+897,910,440897,912,825--
gene-VFH_III144960chr4+1,013,856,7891,013,857,154--
gene-VFH_III145000chr4+1,013,857,2431,013,861,427--
gene-VFH_III145040chr4+1,014,183,8551,014,184,466--
gene-VFH_III145080chr4+1,014,260,6871,014,262,444--
gene-VFH_III145120chr4+1,014,277,2011,014,278,223Glutathione S-transferase T3 OS
gene-VFH_III145120chr4+1,014,277,2011,014,277,560--
gene-VFH_III145160chr4+1,014,279,0081,014,279,562--
gene-VFH_III145200chr4+1,014,289,1831,014,290,059Late embryogenesis abundant protein D-34 OS
gene-VFH_III147000chr4+1,024,934,6261,024,935,012--
gene-VFH_III147040chr4+1,025,114,2721,025,114,726--
gene-VFH_III147080chr4+1,025,414,4931,025,417,411Vacuole membrane protein KMS1 OS
gene-VFH_III147120chr4+1,025,427,3051,025,427,901--
gene-VFH_III147160chr4+1,025,428,1281,025,429,181Putative ribonuclease H protein At1g65750 OS
gene-VFH_III147200chr4+1,025,429,4801,025,430,643Glutathione S-transferase T3 OS
gene-VFH_III147240chr4+1,025,451,8491,025,452,856Cullin-1 OS
gene-VFH_III147280chr4+1,025,778,1281,025,778,745Adenylosuccinate lyase OS
gene-VFH_III149160chr4+1,039,110,2581,039,110,459--
gene-VFH_III149200chr4+1,039,111,9951,039,112,210Probable inactive receptor kinase At1g48480 OS
gene-VFH_III153280chr4+1,068,451,7651,068,460,306Protein DWD HYPERSENSITIVE TO UV-B 1 OS
gene-VFH_III153320chr4+1,068,460,8341,068,461,079--
gene-VFH_III153360chr4+1,068,461,0991,068,461,438Snakin-1 OS = Solanum tuberosum OX
gene-VFH_III153400chr4+1,068,485,6231,068,487,425Thioredoxin domain-containing protein 9 homolog OS
gene-VFH_III153440chr4+1,068,772,6101,068,775,302Sister chromatid cohesion protein SCC2 OS
gene-VFH_III153480chr4+1,068,789,2491,068,792,353Serine/threonine-protein kinase EDR1 OS
gene-VFH_III153520chr4+1,068,929,5351,068,931,137Pumilio homolog 12 OS
gene-VFH_III153560chr4+1,068,967,4221,068,969,025Pumilio homolog 9 OS
gene-VFH_III153560chr4+1,068,967,4221,068,969,025Pumilio homolog 12 OS
gene-VFH_III153600chr4+1,069,082,2721,069,083,842Pumilio homolog 12 OS
gene-VFH_III153640chr4+1,069,110,2641,069,111,871Pumilio homolog 12 OS
gene-VFH_III185880chr4+1,283,109,9061,283,110,265--
gene-VFH_III185920chr4+1,283,164,2661,283,168,067Protein TIC 40, chloroplastic OS
gene-VFH_III185960chr4+1,283,221,5851,283,223,452Rust resistance kinase Lr10 OS
gene-VFH_IV148120chr5+1,024,025,2121,024,028,413Subtilisin-like protease SBT5.4 OS
gene-VFH_IV148160chr5+1,024,573,4011,024,592,701Subtilisin-like protease SBT5.4 OS
gene-VFH_IV148160chr5+1,024,573,4011,024,580,420Subtilisin-like protease SBT5.4 OS
gene-VFH_IV148200chr5+1,024,680,9061,024,685,005Probable inactive purple acid phosphatase 27 OS
gene-VFH_IV148240chr5+1,024,689,7661,024,690,207--
The positive and negative directions of the chain are indicated by the plus sign + and the minus sign −, respectively.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yu, H.; Hu, C.; Yang, X.; Li, Q.; Wang, Y.; Dai, Z.; Cun, J.; Zheng, A.; Jiang, Y.; Wang, Q.; et al. Uncovering Key Genes Associated with the Short-Winged Trait in Faba Bean (Vicia faba L.) Through Re-Sequencing and Genome-Wide Association Studies (GWASs). Int. J. Mol. Sci. 2025, 26, 2733. https://doi.org/10.3390/ijms26062733

AMA Style

Yu H, Hu C, Yang X, Li Q, Wang Y, Dai Z, Cun J, Zheng A, Jiang Y, Wang Q, et al. Uncovering Key Genes Associated with the Short-Winged Trait in Faba Bean (Vicia faba L.) Through Re-Sequencing and Genome-Wide Association Studies (GWASs). International Journal of Molecular Sciences. 2025; 26(6):2733. https://doi.org/10.3390/ijms26062733

Chicago/Turabian Style

Yu, Haitian, Chaoqin Hu, Xin Yang, Qiong Li, Yubao Wang, Zhengming Dai, Jie Cun, Aiqing Zheng, Yanhua Jiang, Qinfang Wang, and et al. 2025. "Uncovering Key Genes Associated with the Short-Winged Trait in Faba Bean (Vicia faba L.) Through Re-Sequencing and Genome-Wide Association Studies (GWASs)" International Journal of Molecular Sciences 26, no. 6: 2733. https://doi.org/10.3390/ijms26062733

APA Style

Yu, H., Hu, C., Yang, X., Li, Q., Wang, Y., Dai, Z., Cun, J., Zheng, A., Jiang, Y., Wang, Q., Lv, M., Yang, F., & He, Y. (2025). Uncovering Key Genes Associated with the Short-Winged Trait in Faba Bean (Vicia faba L.) Through Re-Sequencing and Genome-Wide Association Studies (GWASs). International Journal of Molecular Sciences, 26(6), 2733. https://doi.org/10.3390/ijms26062733

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop