Next Article in Journal
COP9 Signalosome’s Role in Plant Defense Mechanisms
Previous Article in Journal
Using Structural Equation Models to Interpret Genome-Wide Association Studies for Morphological and Productive Traits in Soybean [Glycine max (L.) Merr.]
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

WSF: A Transformer-Based Framework for Microphenotyping and Genetic Analyzing of Wheat Stomatal Traits

1
Sanya Institute, Nanjing Agricultural University, Sanya 572000, China
2
Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
3
National Technique Innovation Center for Regional Wheat Production/Key Laboratory of Crop Ecophysiology, Collaborative Innovation Centre for Modern Crop Production, Ministry of Agriculture and Rural Affairs/College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China
4
Shandong Hedian Agricultural Technology Co., Ltd., Jinan 250000, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2025, 14(19), 3016; https://doi.org/10.3390/plants14193016
Submission received: 21 August 2025 / Revised: 15 September 2025 / Accepted: 26 September 2025 / Published: 29 September 2025
(This article belongs to the Special Issue Machine Learning for Plant Phenotyping in Wheat)

Abstract

Stomata on the leaves of wheat serve as important gateways for gas exchange with the external environment. Their morphological characteristics, such as size and density, are closely related to physiological processes like photosynthesis and transpiration. However, due to the limitations of existing analysis methods, the efficiency of analyzing and mining stomatal phenotypes and their associated genes still requires improvement. To enhance the accuracy and efficiency of stomatal phenotype traits analysis and to uncover the related key genes, this study selected 210 wheat varieties. A novel semantic segmentation model based on transformer for wheat stomata, called Wheat Stoma Former (WSF), was proposed. This model enables fully automated and highly efficient stomatal mask extraction and accurately analyzes phenotypic traits such as the length, width, area, and number of stomata on both the adaxial (Ad) and abaxial (Ab) surfaces of wheat leaves based on the mask images. The model evaluation results indicate that coefficients of determination (R2) between the predicted values and the actual measurements for stomatal length, width, area, and number were 0.88, 0.86, 0.81, and 0.93, respectively, demonstrating the model’s high precision and effectiveness in stomatal phenotypic trait analysis. The phenotypic data were combined with sequencing data from the wheat 660 K SNP chip and subjected to a genome-wide association study (GWAS) to analyze the genetic basis of stomatal traits, including length, width, and number, on both adaxial and abaxial surfaces. A total of 36 SNP peak loci significantly associated with stomatal traits were identified. Through candidate gene identification and functional analysis, two genes—TraesCS2B02G178000 (on chromosome 2B, related to stomatal number on the abaxial surface) and TraesCS6A02G290600 (on chromosome 6A, related to stomatal length on the adaxial surface)—were found to be associated with stomatal traits involved in regulating stomatal movement and closure, respectively. In conclusion, our WSF model demonstrates valuable advances in accurate and efficient stomatal phenotyping for locating genes related to stomatal traits in wheat and provides breeders with accurate phenotypic data for the selection and breeding of water-efficient wheat varieties.

1. Introduction

Wheat holds a crucial position in China’s grain production and is one of the country’s main staple crops. The development of the wheat industry is directly related to national food security and social stability. Therefore, improving wheat yield is of significant importance for ensuring food security and promoting farmers’ income in China [1]. Stomata are the main channels for the exchange of substances between wheat and the external environment. Their morphological traits, such as size and density, directly affect physiological activities such as photosynthesis and transpiration in the leaves, determining the wheat’s material yield and drought resistance [2,3]. Therefore, analyzing the genetic mechanisms of wheat stomatal traits is of great scientific value for studying wheat drought resistance mechanisms and for drought-resistant breeding.
Unlike most dicotyledonous plants, where stomata can be directly observed by peeling off epidermal cells, monocotyledonous plants such as wheat have a thinner mesophyll layer, making the peeling of epidermal cells very difficult [4]. Therefore, appropriate fixatives are often used to preserve the stomatal morphology, followed by microscopic observation, or stomata can be directly observed under a microscope [5]. However, the process of identifying and counting stomata through microscope images is cumbersome. After capturing the images, manual identification and counting of stomata are required, which is time-consuming and labor-intensive. Long-term reliance on visual observation also increases the likelihood of errors. With advancements in technology, image processing software such as ImageJ-154 [6] has been used to semi-automatically identify stomata and measure their length and width, but this process still requires users to manually mark the boundaries of stomata and other points of interest [7]. This method has certain limitations in terms of accuracy, speed, throughput, and automation, and cannot meet the modern scientific demand for efficient stomatal data acquisition. Therefore, developing a non-destructive method capable of quickly collecting plant stomatal data will significantly improve the efficiency of subsequent research and drive progress in related fields.
With the rapid development of deep learning, the automatic detection and recognition technology for plant stomata has greatly advanced [8], significantly improving the efficiency of stomatal research. In stomatal studies, deep learning methods can be applied for automatic detection, counting, and complex feature extraction, providing researchers with more comprehensive and accurate data. Currently, deep learning methods have been applied in stomatal trait detection. Stomata Counter, a stomatal counting tool based on the R-CNN deep convolutional neural network, has achieved an accuracy of 0.98 for ginkgo stomata recognition [9]; in maize crop live stomata image recognition, a stomatal counting model based on the YOLO framework [10] achieved a precision of 0.89 [11,12]. However, the models mentioned above are all object detection models, which use bounding boxes to mark the regions where stomata are located. However, they still rely on subsequent image processing steps to extract the stomatal pixels within the bounding boxes. This traditional approach has certain computational errors, and for small targets such as stomata, the bounding boxes often fail to capture their details precisely, making it difficult to reflect the actual morphology of small targets. Semantic segmentation models [13], which can finely classify each pixel in an image, are capable of accurately extracting stomatal regions. These models not only delineate the contour boundaries of stomata but also generate high-quality masks, providing precise support for monitoring stomatal morphological changes, especially for target tracking and phenotypic quantification analysis in dynamic processes. The Mask RCNN model [14] has shown ideal results in stomatal complex segmentation and classification in multiple tree species [15,16]; the UNet model [17] has been used to extract contour coordinates of pore regions from wheat leaf microscopic images and to calculate the morphological characteristics of stomata and pores based on these coordinates [18]. However, both Mask-RCNN and UNet have issues in wheat stomatal segmentation, including high computational complexity, insufficient boundary accuracy, and weak long-range dependency modeling capability, especially when dealing with small targets. In contrast, SegFormer [19], through the self-attention mechanism, captures global contextual information, improves boundary precision for small targets, and provides higher computational efficiency and robustness, making it particularly suitable for fine segmentation of stomata in complex backgrounds, which is ideal for the stomatal segmentation task in this study.
Genome-wide association studies (GWAS) identify genetic variations closely related to specific traits by comparing genomic variations with phenotypic characteristics in different individuals [20]. Currently, GWAS has been widely applied in the development of linkage markers and gene mining for crops such as rice [21], maize [22], wheat [23], and barley [24]. However, in these GWAS studies, phenotypic traits are usually measured manually, which is time-consuming and prone to subjective errors. With the application of deep learning methods, stomatal boundaries or masks can be efficiently extracted, and accurate morphological traits related to stomata can be obtained. These traits reflect the plant’s growth status and adaptability under different environmental conditions [25]. Combining deep learning-extracted phenotypic data with GWAS not only improves the efficiency and accuracy of trait acquisition but also helps uncover the genetic mechanisms of complex agronomic traits. Through this deep learning-assisted research, researchers can study traditional traits such as disease resistance and yield, while simultaneously analyzing multiple traits of crops, including those that cannot be obtained through manual examination, leading to the identification of numerous significantly associated loci and the discovery of more related genes.
To address the current challenges in stomatal research related to efficiency and automation, this study aims to propose an innovative solution, focusing on non-destructive methods for acquiring stomatal images from wheat leaves of multiple varieties and developing a stomatal semantic segmentation method, Wheat Stoma Former (WSF), that is applicable to different species. This method integrates deep learning techniques to extract key phenotypic parameters and conducts genome-wide association studies based on phenotypic data. Specifically, the research objectives include the following three aspects: first, to construct a high-quality non-destructive wheat leaf image dataset, using high-precision imaging devices to capture stomatal samples from multiple varieties, providing a foundation for subsequent modeling; second, to design and implement a universal stomatal segmentation framework, WSF, applicable to multiple species, where the integrated Cross-Layer Feature Pyramid Module (CFPT) effectively suppresses background noise interference while enhancing edge structure extraction, thereby improving segmentation accuracy; finally, to conduct GWAS on 210 wheat varieties using the extracted stomatal phenotypic parameters, identify significant SNP loci closely related to stomatal traits, and recognize potential regulatory genes. This will provide theoretical support and data for understanding the genetic basis of stomatal traits and their applications in breeding. This research framework ensures both data acquisition efficiency and accuracy while balancing general applicability and practicality, offering the potential to advance the integration of crop phenotypic intelligent analysis and molecular breeding research.

2. Results

2.1. Model Performance Testing and Comparative Experiments

To verify the effectiveness of the proposed WSF model, we selected two mainstream semantic segmentation architectures, Deeplabv3+ and UNet, as baseline models for comparative experiments. All models were trained under consistent training configurations to ensure fairness, and segmentation performance was quantitatively assessed using the Precision and Mean Intersection over Union (MIoU) metrics (Table 1). The experimental results clearly show that the WSF model exhibits significant advantages in the stomatal mask segmentation task across multiple crops. Its performance in both accuracy and MIoU exceeds that of the comparison models, demonstrating stronger feature extraction capability and generalization performance.
This section presents systematic experiments to validate the practical application of the WSF model in the stomata segmentation of wheat (Figure 1a), highlighting the effectiveness of the proposed model in the application of wheat stomatal semantic segmentation. To further improve training efficiency and accelerate convergence, we adopted a dynamic learning rate optimization strategy. This strategy automatically adjusts the learning rate based on the progress of the current training. For example, when the loss decreases at a slower rate during a certain stage of training, the learning rate is moderately reduced, which prevents the model from getting stuck in local minima and helps maintain convergence stability. This dynamic adjustment of the learning rate improves the model’s performance in the later stages of training and prevents volatility caused by excessively large learning rates.
To better understand the model’s performance during training, we saved four key metrics at each training batch: training loss, validation loss, smoothed training loss, and smoothed validation loss. Training loss refers to the error of the model on the training set, reflecting how well the model fits the training data. Validation loss refers to the error of the model on the validation set, reflecting the model’s generalization ability, i.e., how well the model can adapt to unseen data. Smoothed training loss and smoothed validation loss are versions of the original loss curves that have been smoothed to observe the trend of the model more clearly and avoid interference from fluctuations between batches in assessing the model’s learning progress. The initial training epoch was set to 200, and by using the backpropagation (BP) algorithm to optimize the loss function, we ensured that the model could effectively learn the spatial features and morphological information of wheat stomata. To accelerate convergence and improve training efficiency, we combined the dynamic learning rate optimization strategy, which automatically adjusts the learning rate based on the loss values during the training process, ensuring more stable convergence in the later stages of training. During the training, we saved the training loss and validation loss for each batch in real time and visualized both the original and smoothed losses (Figure 1b) to intuitively monitor the model’s training progress and performance changes. With these optimization strategies, the model’s final training loss stabilized around 0.15, indicating that the model has adequately fit the training data and demonstrated strong generalization ability on the validation set.
The changes in precision and MIoU of the WSF model at each training epoch are shown below (Figure 1c,d). From the graphs, it can be clearly observed that the model not only shows faster convergence but also possesses significantly higher training stability. Specifically, the WSF model reaches a performance plateau after approximately 50 epochs, with the precision and mIoU curves stabilizing thereafter. In contrast, the SegFormer model requires around 80 epochs to reach relative stability, while Deeplabv3+ requires up to 150 epochs to converge. More notably, the traditional UNet model exhibits large fluctuations throughout the entire training process, showing clear instability.
This training advantage is attributed to the optimizations in the WSF model. First, in terms of model structure, it integrates a multi-scale feature extraction mechanism and a lightweight attention module, enabling it to accurately capture edge and texture features of stomatal regions at different scales, especially for complex multi-crop stomatal morphological backgrounds. Second, in terms of training optimization, we adopted dynamic learning rate adjustment and a hybrid loss function design, effectively reducing the risk of overfitting and accelerating the convergence of model parameters. Additionally, the introduction of data augmentation and noise robustness mechanisms further enhanced the model’s generalization ability under different lighting, resolution, and tissue structure backgrounds.
Overall, the WSF model outperforms existing mainstream models in terms of precision and MIoU and also exhibits excellent performance in training efficiency and stability. This performance demonstrates its broad applicability and potential for engineering deployment in high-throughput plant phenotyping scenarios, particularly in crop phenotyping research that requires high precision for stomatal localization and recognition.

2.2. Stomatal Trait Accuracy Assessment

This study validated and compared the stomata of 210 wheat varieties. The accuracy of the algorithm was evaluated using two metrics: coefficient of determination (R2) and root mean square error (RMSE). The manual measurement values were obtained using the ImageJ-154 tool software for assistance, and the comparison between the manual measurement values and system-estimated values of the phenotypic parameters is shown in Figure 2. For stomatal length (Figure 2a), the R2 for the system-estimated stomatal length on the adaxial and abaxial surfaces with manual measurements were 0.88 and 0.87, with RMSE of 2.77 pixels and 2.84 pixels, respectively. For stomatal width (Figure 2b), the R2 for the system-estimated stomatal width on the adaxial and abaxial surfaces with manual measurements were 0.84 and 0.86, with RMSE of 4.12 pixels and 4.04 pixels, respectively. For stomatal area (Figure 2c), the R2 for the system-estimated stomatal area on the adaxial and abaxial surfaces with manual measurements were 0.81 and 0.80, with RMSE of 148.25 pixels2 and 170.98 pixels2, respectively. For stomatal number (Figure 2d), the R2 for the system-estimated stomatal number on the adaxial and abaxial surfaces with manual measurements were 0.92 and 0.93, with RMSE of 0.81 and 0.74, respectively. The correlation between the system’s predictions and the traditional ImageJ manual measurements is highly significant, especially for stomatal number prediction, demonstrating the value of this method in agricultural phenomics research.

2.3. Analysis of Stomatal Differences Between the Adaxial and Abaxial Surfaces

Statistical analysis of the stomatal length, width, area, and number on the adaxial and abaxial surfaces of wheat leaves revealed that the frequency distributions of these traits followed a normal distribution (Figure 3). The average stomatal length on the adaxial surface was 123.70 pixels, while the average stomatal length on the abaxial surface was 120.70 pixels (Figure 3a). Although the mean values are similar, the stomatal length on the adaxial surface is higher, suggesting that the stomata on the adaxial surface may be slightly longer in terms of structural development. The average stomatal width on the adaxial surface was 66.10 pixels, while the average stomatal width on the abaxial surface was 69.30 pixels (Figure 3b). These results indicate that the short-axis dimension of the stomata on the abaxial surface is larger, possibly due to its greater physiological role in gas exchange or transpiration regulation. The maximum stomatal area on the adaxial surface was 9083 pixels, while the maximum stomatal area on the abaxial surface was 8627 pixels (Figure 3c). Although the length on the abaxial surface is slightly smaller, the width and area are slightly larger. This may be due to differences in the stomatal opening angle or structural differences, which result in a larger area measurement. In a field of 1600 × 1200 pixels, the average number (22.80) of stomata on the adaxial surface was more than the average number (18.80) of stomata on the abaxial surface (Figure 3d).
Violin plots illustrating significant differences (p < 0.001) in stomatal traits between adaxial and abaxial surfaces across 210 wheat cultivars are shown in Figure 3. The stomatal length on the adaxial surface of leaves was significantly longer than that on the abaxial surface (Figure 3e); the stomatal width on the adaxial surface was significantly smaller than that on the abaxial surface (Figure 3f); the stomatal area on the adaxial surface was significantly smaller than that on the abaxial surface (Figure 3g); and the stomatal number on the adaxial surface was significantly greater than that on the abaxial surface (Figure 3h). Overall, the stomatal density on the adaxial surface is higher than that on the abaxial surface, while the stomatal size on the adaxial surface is smaller than that on the abaxial surface, indicating significant differences in the stomatal distribution strategy between the two leaf surfaces.

2.4. Stomatal Trait Correlation Analysis

Correlation coefficient (r) displays the correlation relationships between eight stomatal traits at a significant level (p < 0.05), revealing the co-variation patterns of stomatal number, size, and area (Figure 4). The results indicate that the number of stomata on the adaxial surface is positively correlated with the number of stomata on the abaxial surface (r = 0.36) and the stomatal width on the abaxial surface (r = 0.18), suggesting a certain synchrony in the distribution of stomata on both sides of the leaf, with stomata on the abaxial surface tending to expand in high-density regions. Meanwhile, the number of stomata on the adaxial surface is negatively correlated with the stomatal length on both the adaxial surface (r = −0.31) and the abaxial surface (r = −0.21), reflecting a physiological trade-off relationship of “increased number—decreased length”. The number of stomata on the abaxial surface is also positively correlated with the number of stomata on the adaxial surface (r = 0.36) but negatively correlated with the stomatal length and area on both surfaces (r = −0.22~−0.30), suggesting that under high-density stomatal distribution on the abaxial surface, the structural development of individual stomata may be limited. Further analysis shows that the area of stomata on both surfaces is moderately positively correlated with the stomatal length and width on both surfaces (r = 0.68~0.75), indicating structural coordination between the stomata on both leaf surfaces. Additionally, the area of stomata on the abaxial surface is negatively correlated with the number of stomata on the abaxial surface (r = −0.22). Overall, the length and width of the stomata are closely related to the size of the stomata, while the morphological features of the stomata on the adaxial and abaxial surface have no direct correlation.

2.5. GWAS Results and Candidate Gene Identification

The results of the GWAS revealed multiple genetic loci significantly associated with wheat leaf stomatal traits, covering key phenotypic indicators such as stomatal length, width, area, and number. For stomatal length on the adaxial surface, two significant loci were identified, located on chromosomes 3B and 6A (Figure 5a). For stomatal length on the abaxial surface, four loci were associated, located on chromosomes 2B, 3A, 6A, and 7B, suggesting that these chromosomal regions may contain key genes regulating stomatal longitudinal growth (Figure 5b). Three significant loci associated with stomatal width on the adaxial surface were located on chromosomes 6A, 6B, and 6D (Figure 5c), while the genetic regulation of stomatal width on the abaxial surface was more complex, with a total of eight significant loci detected, widely distributed on chromosomes 1A, 2A, 2B, 3B, 4A, 5B, 5D, and 7A, indicating multi-locus and multi-pathway regulation of this trait (Figure 5d). For stomatal area, the adaxial stomatal area was associated with four loci, located on chromosomes 2B, 4B, 6B, and 7B (Figure 5e). The abaxial stomatal area was associated with only one locus, located on chromosome 7A, suggesting that stomatal area development on the two sides of the leaf may be controlled by different genetic mechanisms (Figure 5f). Furthermore, the number of stomata on the adaxial surface was significantly associated with two loci on chromosomes 1B and 6D (Figure 5g), while the number of stomata on the abaxial surface was associated with as many as twelve significant loci, covering chromosomes 1A, 2A, 2B, 3A, 3B, 4A, 4B, 4D, 5A, 5B, 5D, and 7A (Figure 5h), indicating the complexity and diversity of genetic regulation for this trait.
This study further investigated 36 candidate genes significantly associated with stomatal traits identified through GWAS, using the Dr.Tom multi-omics data mining platform from BGI Genomics for functional annotation and biological significance analysis, with the aim of identifying potential functional genes and elucidating their roles in stomatal development and regulation. The annotation results indicate that two genes may directly participate in the physiological regulation of stomatal behavior. One gene, TraesCS2B02G178000, located on chromosome 2B, is predicted to be involved in stomatal movement and may affect gas exchange and water loss by regulating stomatal opening and closing signaling pathways, contributing to the plant’s response mechanism to environmental changes. Another significant gene, TraesCS6A02G290600, located on chromosome 6A, is involved in stomatal closure regulation and is hypothesized to regulate the physiological response of guard cells to environmental stresses such as drought or heat, thereby influencing the maintenance and regulation of stomatal dynamics (Appendix A). The identification of these two genes provides strong evidence for revealing the genetic basis of wheat stomatal traits and offers important candidate targets for further research into the molecular mechanisms of stomatal functional traits and breeding applications.

3. Discussion

3.1. Model Performance Testing and Comparative Experiments

The WSF proposed in this study has distinct advantages compared to previous models. In the current application of deep learning models for wheat stomata, two main types of models are commonly used: object detection models and semantic segmentation models. For object detection models, the research by [26] enhanced a CNN-based detection model by incorporating an attention mechanism, achieving a stomata recognition accuracy of 98.30%. However, the study used a general-purpose dataset created under ideal conditions with destructive sampling, which cannot effectively handle images with significant background interference. In contrast, the data in this study was collected from field conditions using non-destructive sampling, which includes interference from lighting, background impurities, and other real-world environmental factors, thus posing a greater challenge to the model’s generalization ability. This gives our model an advantage over those using general-purpose datasets. Similarly, some studies employed the YOLO architecture for stomata detection [27], but the detection framework returned bounding boxes, and the stomata located within these boxes still required conventional image processing methods for extraction. These conventional methods often miss detections and are ineffective in handling overlapping stomata or those that are too small. For semantic segmentation models, the U-Net network is known for its ability to process details and capture features at multiple scales, making it suitable for dynamic video streams of stomatal behavior [28]. However, U-Net’s decoder relies on skip connections, which preserve image details but may lead to poor segmentation precision for small objects like minute stomata. Its performance on static images is often unsatisfactory. On the other hand, SegFormer, using a Transformer architecture, is better at capturing long-range dependencies and global information, making it more suitable for segmentation tasks in complex backgrounds, particularly in handling fine stomata. To address the issue of inaccurate stomatal edge segmentation, we designed a CFPT. This module, by fusing features from multiple layers and refining them, effectively improves the completeness and accuracy of edge segmentation.

3.2. Genetic Regulation of Stomatal Traits and Application Prospects of GWAS

The stomata on both sides of the wheat leaf exhibit diverse and continuous variations in morphological traits such as length, width, area, and number, following normal distribution. This aligns with the genetic patterns of quantitative traits, indicating that stomatal traits are typical complex phenotypes controlled by multiple genes. As these traits play key roles in environmental adaptation, physiological metabolism, and yield regulation, their genetic basis warrants further exploration. Therefore, this high-quality phenotypic data provides a solid foundation for conducting GWAS. Combined with high-throughput genotypic data, these findings can be used to identify key functional genes that regulate stomatal development, density, and size, further assisting breeding decisions and providing theoretical and practical support for the selection of wheat varieties with high resistance and efficient water use.
Stomata serve as the passage for plants to exchange carbon and water with the environment, thus playing a crucial role in regulating their physiological activities. Therefore, the regulation of stomatal traits (such as morphology, number, size, and positioning on the leaf surface) can significantly affect the plant’s water use efficiency and, consequently, its yield [29]. Although the loci and signaling pathways controlling stomatal traits are generally highly conserved [30,31,32], significant functional differences in stomatal genes have been identified in various lineages [33]. These stomatal genes can influence traits related to water use, thereby affecting the plant’s resistance to abiotic stresses. Thus, screening and identifying stomatal traits that are responsive to drought phenotypes and molecular responses can help improve crop tolerance. Researchers have also screened the stomatal index of 539 wheat varieties [34] and used a wheat 660 K SNP chip for genotyping. Through GWAS analysis, they identified 130 SNPs significantly associated with the stomatal index of wheat seedlings, many of which overlapped with genes. Among the 130 gene loci detected in the study, 69 were positively specific and 61 were negatively specific, with almost no overlap between the SNPs associated with the front and back of the leaf stomatal indices. This finding is consistent with previous reports on wheat [35]. These results suggest that the genetic control of the stomatal index on the front and back of the wheat leaf is distinct.
In this study, we conducted a GWAS of stomatal-related traits (length, width, area, and number) in 210 wheat varieties and identified 36 SNP loci significantly associated with these traits. We found that compared with the adaxial surface of the leaves, there were more significantly associated gene loci on the abaxial surface, with 11 loci on the adaxial surface and 25 loci on the abaxial surface, respectively, and there was minimal overlap between them. Two genes—TraesCS2B02G178000 (on chromosome 2B) and TraesCS6A02G290600 (on chromosome 6A)—were found to be associated with stomatal traits, involved in regulating stomatal movement and closure, respectively. By comparing these two candidate genes with their most clearly defined homologous genes, it was found that TraesCS2B02G178000 shares 82% identity with At1g22640 (SPCH) in Arabidopsis, a bHLH transcription factor that initiates stomatal lineage [36], and TraesCS6A02G02G290600 is 75% identical to At5g53210 (EPF1), whose secreted peptide negatively regulates stomatal density and suppresses stomatal clustering [37]. This result is consistent with prior research. Furthermore, functional annotation of candidate genes revealed two potential loci involved in regulating stomatal movement and closure, validating the effectiveness of our WSF-powered GWAS.

3.3. The Generalization Ability of the WSF Model to Other Plants

In this study, the WSF model was also tested for generalization performance on three plant species, namely peanut, chrysanthemum, and cotton (100 images each) (Figure 6). Two evaluation metrics, precision and Mean Intersection over Union (mIoU), were employed, and the results are presented in Table 2. Among these species, Cotton demonstrated the best performance, with both precision and mIoU exceeding 83%, indicating that the model possesses strong transferability when applied to plants with similar stomatal morphology or relatively simple backgrounds. Peanut ranked second, achieving relatively high accuracy and good regional overlap, which suggests that the model also adapts well to leaves with relatively regular structures or moderate interference. In contrast, the performance on Chrysanthemum declined slightly due to its smaller stomata and more complex leaf surface texture. Chrysanthemum exhibited the lowest accuracy and segmentation quality, largely because its stomatal morphology differs markedly from the training data and background noise is strong, reflecting the challenges the model faces in cross-species applications with pronounced structural differences.

3.4. Limitations of the WSF Model and Future Research Directions

The WSF semantic segmentation model shows great promise in wheat stomatal analysis, enabling automated identification and segmentation of stomata in large-scale microscopic images and thus substantially improving research efficiency. Nevertheless, the model still presents several limitations in practical applications. First, its performance strongly depends on the quality and diversity of training data. Since most datasets are derived from high-quality laboratory-acquired images, segmentation accuracy often decreases under field conditions, where noise, uneven illumination, and leaf folding are common, revealing inadequate generalization ability. Second, the model may encounter recognition errors when applied to different wheat varieties, developmental stages, or under pathological influences, which limits its broader agricultural applicability. Furthermore, its ability to capture fine boundary details remains insufficient, leading to inaccuracies in quantifying stomatal aperture, thereby hindering deeper investigations into stomatal dynamics and physiological processes.
On the application side, although preliminary image-processing software integrating the WSF model has been developed to automate stomatal analysis, its functionalities remain relatively limited. Existing software typically supports only single-image processing and lacks efficient tools for large-scale batch or fully automated analysis, which is crucial for field monitoring and high-throughput experiments. In addition, the user interfaces are not yet sufficiently intuitive, posing challenges for agricultural researchers without computational backgrounds. More importantly, current software largely focuses on segmentation and area measurement, without establishing strong connections with physiological parameters such as stomatal density, aperture dynamics, transpiration rate, or photosynthetic efficiency. This limits its role in supporting crop stress resistance evaluation or breeding selection. Future efforts should therefore prioritize enhancing the model’s robustness and cross-environment adaptability, while promoting the development of intelligent, automated software platforms that integrate stomatal traits with physiological analysis, ultimately serving wheat stomatal research and agricultural production practices.

4. Materials and Methods

4.1. Stomatal Data Collection for 210 Wheat Varieties

A total of 210 wheat varieties (the names of all varieties are listed in Appendix A), including major wheat varieties and core germplasm resources from the Huang-Huai-Hai wheat region, were collected. The experimental materials were sown in late October 2023 at the Baima Teaching and Research Base of Nanjing Agricultural University (31.61° N, 119.18° E), Lishui District, Nanjing City, Jiangsu Province. The fertilizers used in the experiment included urea as the nitrogen fertilizer, monoammonium phosphate as the phosphorus fertilizer, and potassium chloride as the potassium fertilizer. The nitrogen fertilizer was applied at a rate of 240 kg hm−2, divided into basal fertilizer and jointing fertilizer with a 5:5 ratio. Both phosphorus and potassium fertilizers were applied at a rate of 120 kg hm−2 as a single application of basal fertilizer. A total of 210 experimental plots were used. Each plot was 1.50 m × 1.25 m in size, containing six rows of wheat with a row spacing of 0.25 m and 175 seeds in each row. The spacing between adjacent plots was 0.50 m horizontally and 0.45 m vertically. Field management was carried out according to standard high-yield cultivation practices.
Sampling for this study was conducted during the wheat regreening stage on 22 February 2024. To avoid edge effects in the experiment, three wheat plants with similar growth and morphology were selected from the second row of each plot from bottom to top. The top leaves of the main stems were collected, and the stomatal images of the wheat leaves were captured using an Anyty portable microscope (3R-WM601 USB series, BeiJing, China). The microscope includes an image acquisition button and a size adjustment knob. After connecting the Anyty microscope to a computer, turning the adjustment knob to change the focal length of the microscope can ensure the appropriate magnification for observing the wheat leaf stomata. In this study, stomatal images of wheat leaves were captured using the Anyty microscope, with a magnification of 550 times. The leaf was slowly moved so that the microscope was aligned with the center of the leaf, and the stomata were arranged as parallel as possible in the field of view. After photographing the stomata on the adaxial surface, the leaf was flipped to capture the stomata on the abaxial surface. The stomatal images were saved in JPG format. A total of 210 wheat varieties were photographed, with six images taken per variety (three for the adaxial surface and three for the abaxial surface), resulting in a total of 1260 stomatal images.

4.2. Construction of the Wheat Stomata Dataset

This study focuses on image segmentation for three target categories: wheat stomata (wheat_stoma), pores (pore), and background (background). During the data preparation phase, the ISAT Segment Anything tool was used to accurately annotate masks and optimize details for 1260 high-resolution images (1600 × 1200 pixels). To enhance data diversity, the research team implemented various data augmentation strategies, including random rotation, scaling transformations, and deformation processing, which ultimately expanded the sample size to 12,600 image sets (each set consisting of the original image and its corresponding mask). The fundamental principle of data augmentation lies in generating diverse samples through operations such as rotation, flipping, scaling, and noise injection, thereby enhancing the model’s adaptability to varying environments and disturbances. This process reduces overfitting and improves generalization.
For data splitting, the augmented dataset was scientifically allocated into training, testing, and validation sets in a 7:2:1 ratio. Specifically, 8820 image sets were designated for training, 2520 sets for testing, and 1260 sets for validation. This distribution scheme effectively ensured sample balance at each stage of model development. All image data were matched strictly using a file naming mechanism to ensure correspondence.

4.3. WSF Model Architecture

In this section, based on the SegFormer framework, we propose a semantic segmentation model for wheat stomata, named WSF (Figure 7). Accurate recognition of the stomatal edges is a key factor affecting the overall accuracy. However, there are several issues with the segmentation of wheat stomatal edges. First, the boundary between the stomata and the surrounding cell walls may have low contrast due to lighting conditions, imaging factors, or sample preparation methods (such as the leaf surface wax layer), making it difficult for traditional segmentation models to distinguish the edges accurately. Additionally, wheat stomata exhibit significant variation in shape and size (e.g., in open/closed states, different developmental stages), leading to inconsistent edge structures. Furthermore, small pores, intercellular spaces, or noise (e.g., dust or imaging artifacts) around the stomata may be misinterpreted as the stomatal edges, resulting in segmentation errors. To address these issues, we introduce a CFPT module to resolve these challenges. This module captures the subtle features of stomatal edges through multi-scale feature fusion, utilizes spatial/channel attention mechanisms to enhance the weight of edge regions, and combines the long-range dependency modeling ability of Transformers to optimize edge continuity.
The WSF model is composed of a hierarchical Transformer encoder and a lightweight decoder consisting of pure MLP. In the encoding phase, the raw image features are first processed by the CFPT module to enable cross-layer information interaction and edge-sensitive feature enhancement. Then, the model performs an overlapping block embedding operation on the input image, where the convolution kernel size is 7 × 7, the stride is set to 4, and the padding is 3, thus acquiring richer local information. Subsequently, the input is processed sequentially by four Transformer modules with progressively reduced resolutions, corresponding to down sampling ratios of 1/4, 1/8, 1/16, and 1/32, with channel dimensions of 64, 128, 320, and 512, respectively, to construct multi-scale semantic feature maps. The decoder then extracts features from these four stages. First, a 1 × 1 convolution is used to unify the channel number to 256 for subsequent fusion. After the feature maps are up sampled and aligned to a unified spatial size, they are concatenated along the channel dimension, then sent into multiple lightweight MLP branches for feature mapping and context enhancement. These branches operate in parallel to mine multi-scale semantic information. Finally, the outputs of all branches are concatenated and subjected to a convolution operation to complete channel compression and information fusion, resulting in a globally fused feature map. At the model output, a convolution layer matching the number of categories is set to generate the final semantic segmentation mask, achieving pixel-level precise recognition of wheat stomata, pores, and background regions.
The CFPT module divides the feature maps from different levels of the backbone network into four groups, each differing in spatial scale and channel number, reflecting the network’s perception of different semantic levels. To fully exploit the information complementarity among these features, CFPT module first uses a horizontal connection strategy to upsample high-level semantic features to the same spatial resolution as low-level features and align them in the channel dimension, enabling the fusion of features from different layers. The fused features are then fed into the cross-layer attention module, which consists of two key substructures: Cross-Layer Channel Attention (CCA) and Cross-Layer Spatial Attention (CSA). CCA focuses on the channel dimension and allows each channel token to perceive global context information across the entire spatial domain through a multi-scale adjacent feature interaction mechanism. Before this process, features at different scales undergo Channel Reconstruction (CR) to ensure compatibility in the channel attention computation by unifying the spatial resolution. CSA, on the other hand, operates on the spatial dimension, interacting features corresponding to the same spatial locations at different layers, enabling each spatial token to fuse semantic features from the channel dimension, thereby establishing more robust local context representations. Through these two attention paths, CFPT module achieves cross-layer feature fusion in both channel and spatial dimensions, constructing a new, semantically rich pyramid feature structure. Finally, this new multi-layer pyramid is input into a shared detection head, which predicts category information for all scale features, outputting a segmentation tensor of size (H, W, K), where H and W are the spatial resolution, and K is the number of target categories.
All experiments were performed on a desktop system featuring an Intel Core i9-14900 K processor(Intel Corporation, Santa Clara, CA, USA), 64 GB of system memory, and an NVIDIA GeForce RTX 4090 GPU with 64 GB VRAM. In the training process of the Wheat Stoma Former semantic segmentation model, carefully designed hyperparameter configurations were adopted to ensure a balance between convergence efficiency and model performance. In the training of the SegFormer model, k-fold cross-validation (e.g., k = 5) is employed to mitigate overfitting. The dataset is divided into five subsets, with each subset serving as the validation set in turn, while the remaining subsets are used for training. This iterative process ensures stable performance across varying data partitions, enhances generalization capability, and facilitates the selection of optimal hyperparameters. Specifically, the initial learning rate was set to 6 × 10−5, with a learning rate decay mechanism applied, reducing the learning rate by a decay factor of 0.8 after each training epoch to accommodate fine-tuning of parameters in the later stages of training. The model used a batch size of 8 and underwent 200 epochs of iterative training, with accuracy evaluated on the validation set after each epoch to monitor the model’s generalization performance and adjust strategies in a timely manner. The AdamW optimizer was used, which is an improved version of Adam that incorporates a weight decay regularization term to effectively mitigate overfitting and enhance training stability.
To comprehensively evaluate model performance, this study introduced two primary metrics: overall precision (Pre) and mean Intersection over Union (MIoU). Precision measures the model’s prediction accuracy for all categories, i.e., the proportion of true positives among all predicted positive samples; while MIoU calculates the ratio of intersection to union between predicted and ground truth regions and averages this value across all categories, reflecting the overall quality of spatial region segmentation. The computation of these two metrics is represented by Equations (1) and (2), which serve as key indicators for evaluating the precise segmentation capability of Wheat Stoma Former in stomatal regions.
Pre   = True p True p + False p
MIoU   = 1 k + 1   i = 0 k Area m i Area gt i Area m i Area gt i
Specifically, True p and False p represent the number of correctly and incorrectly classified pixels, respectively; Area m Area gt and Area m Area gt represent the number of intersection and union pixels of the model’s predicted mask and the manually annotated mask, respectively.

4.4. Wheat Stomatal Trait Extraction

In the wheat stomatal semantic segmentation task, the mask images generated by the semantic segmentation model enable precise localization and region separation of stomatal targets. Based on the mask images of the adaxial and abaxial surfaces of each wheat variety, the following four key phenotypic parameters can be extracted: stomatal length, stomatal width, stomatal area, and stomatal number. First, the mask images output by the semantic segmentation model are converted into binary images, where the stomatal region is white and the background is black. Then, the contours of each stomatal target are extracted using the Canny operator-based contour extraction method. Afterward, the minimum enclosing rectangle of each contour is fitted to obtain the stomatal length and width. Next, the number of pixels in each stomatal contour within the mask region is counted and multiplied by the physical area represented by a single pixel to obtain the stomatal area. Finally, the number of detected contours is used as the estimated value for stomatal count.

4.5. Genotyping and GWAS of Stomatal Traits in Wheat

In this study, a panel of 210 wheat accessions was genotyped using the Axiom® Wheat660 high-density SNP array(Beijing, China), developed through a collaboration between the Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, and Affymetrix Inc. (Santa Clara, CA, USA) [38]. This genotyping platform features over 95% effective marker coverage and ensures even genome-wide distribution. It allows the detection of 630,517 genetic variants, including both SNPs and small insertions/deletions (InDels). Initial genotype data underwent stringent quality control using Axiom Analysis Suite version 5.2 (Affymetrix, Santa Clara, CA, USA). Filtering criteria excluded SNPs with more than 10% missing data and minor allele frequencies (MAF) below 0.05. After quality control, a dataset comprising 409,976 high-confidence SNPs remained for genome-wide association analysis. SNPs were physically anchored to the wheat reference genome IWGSC RefSeq v1.0.
To explore the genetic basis of stomatal morphological traits on the adaxial and abaxial leaf surfaces, GWAS was conducted using the rMVP v0.99.26 software package [39]. The FarmCPU (Fixed and Random Model Circulating Probability Unification) model was employed to control false positives and increase detection power. Significant marker-trait associations were visualized using Manhattan plots. Candidate genes located near significant loci were identified using the WheatOmics multi-omics database [40]. Further functional annotation and biological relevance assessments of these genes were conducted via the Dr.Tom platform (https://biosys.bgi.com, accessed on 1 July 2023) provided by BGI Genomics.

4.6. Statistical Analysis

One-way ANOVA followed by Fisher’s LSD test was used to examine significant differences in stomatal traits difference between the adaxial and abaxial surfaces. The Loss curves, Precision curves, and MIoU curves of model training were all visualized using Python3.9. All the statistical analyses were conducted using SPSS 25, and other related figures were drawn using Origin 2024.

5. Conclusions

In this study, an automatic and high-throughput analysis pipeline for wheat stomatal phenotypic traits was established. The developed WSF semantic segmentation network was used to extract stomatal features, and the edge detection effect was enhanced through the CFPT module. The WSF model showed high accuracy, with a precision of 93.60% and an MIoU of 91.20%. The predicted values were highly correlated with the measured values, with the R2 value for stomatal number exceeding 0.92 and the R2 values for length and width both exceeding 0.84. The analysis of stomatal phenotypes in 210 wheat varieties revealed that the adaxial surface exhibited longer stomatal length but reduced width, resulting in smaller overall stomatal area compared to the abaxial side. Conversely, stomatal number showed an inverse pattern with higher frequency on the adaxial surface. Subsequently, GWAS was performed using 660 K SNP chip data, which identified 36 significant SNP loci on multiple chromosomes. Candidate genes such as TraesCS2B02G178000 and TraesCS6A02G290600 are related to stomatal regulation. These findings offer valuable insights into the genetic control of stomatal traits and provide targets for improving water use efficiency and drought tolerance in wheat breeding.

Author Contributions

Conceptualization, H.Z. and Q.L.; methodology, H.Z. and H.M.; software, H.Z. and Q.L.; validation, H.Z., H.M., S.L., D.J., J.C. (Jiawei Chen), and Q.L.; formal analysis, H.Z., H.M., S.L., D.J., J.C. (Jiawei Chen), and Q.L.; investigation, H.Z., H.M., S.L., B.Q., Q.S., Z.P., Q.P., X.W., J.C. (Jian Cai), Q.Z., Y.Z., M.H., D.J., J.C. (Jiawei Chen), and Q.L.; resources, H.Z., H.M., S.L., D.J., J.C. (Jiawei Chen), and Q.L.; data curation, H.Z., H.M., S.L., D.J., J.C. (Jiawei Chen), and Q.L.; writing—original draft preparation, H.Z., H.M., and Q.L.; writing—review and editing, H.Z., H.M., S.L., B.Q., Q.S., Z.P., Q.P., X.W., J.C. (Jian Cai), Q.Z., Y.Z., M.H., D.J., J.C. (Jiawei Chen), and Q.L.; visualization, H.Z., H.M., S.L., B.Q., Q.S., Z.P., Q.P., X.W., J.C. (Jian Cai), Q.Z., Y.Z., M.H., D.J., J.C. (Jiawei Chen), and Q.L.; supervision, H.Z., H.M., S.L., B.Q., Q.S., Z.P., Q.P., X.W., J.C. (Jian Cai), Q.Z., Y.Z., M.H., D.J., J.C. (Jiawei Chen), and Q.L.; project administration, H.Z. and Q.L.; funding acquisition, D.J., J.C. (Jiawei Chen), and Q.L.; H.Z. and H.M. contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the projects of Theoretical Innovation and Application of High-Yield and High-Quality Cultivation of Xinjiang Wheat Variety ‘Aimi Zao’ and Relay Cropping Projects, Jiangsu Innovation Support Program for International Science and Technology Cooperation Project (BZ2023049), the Guidance Foundation of the Sanya Institute of Nanjing Agricultural University (NAUSY-MS31), Lhasa City Science and Technology Program Projects (LSKJ202520), the Key R&D Project of Shandong Province (2024TZXD070), the National Key Research and Development Program of China (2024YFD2301305, 2023YFD2300200), the National Natural Science Foundation of China (32021004), Shandong Province First-class Discipline Construction “811” Project, the earmarked fund for CARS-03, and the collaborative Innovation Center for Modern Crop Production co-sponsored by Province and Ministry (CIC-MCP).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data and materials are available upon request to QL (qingli@njau.edu.cn).

Conflicts of Interest

The authors declare no competing interests. Author Qiuxiao Pan was employed by the company Shandong Hedian Agricultural Technology Co., Ltd., Jinan 250000, China. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Wheat Varieties Used in the Study

NumberVariety NameNumberVariety NameNumberVariety NameNumberVariety NameNumberVariety Name
1Aikang 5843Jinan 1685Luomai 21127Shimai 22169Yannong 15
2Baomai 1044Jinan 1786Luomai 26128Shiyou 20170Yannong 173
3Beijing 1045Jining 1687Luomai 7 129Shunmai 1718171Yannong 19
4Boai 702346Ji 526588Machang 2130Sumai 188172Yannong 22
5Chuanmai 2247Jimai 1989Mianmai 37131Sumai 3173Yannong 836
6Chuanmai 3648Jimai 190Mianmai 39132Tai 10604174Yannong 999
7Chuanmai 4249Jimai 2691Xumai 32133Taikemai 31175Yangmai 10
8Dekang 96150Jinan 292Nannong 0686134Taikemai 33176Yangmai 13
9Dexuan 151Jinan 893Ningmai 13135Taimai 1918 177Yangmai 15
10Emai 1152Jinan Ai 694Ningmai 15136Tainong 19178Yangmai 158
11Emai 1253Jinhe 912395Ningmai 22137Taishan 1179Yangmai 16
12Emai 1554Jinmai 3196Ningmai 24138Taishan 27180Yangmai 18
13Emai 1955Jinmai 3397Ningmai 26139Taishan 28181Yangmai 1
14Emai 656Jinmai 4798Ningmai 9140Taishan 4182Yangmai 22
15Fan 657Jingdong 1799Ningnuomai 1141Taishan 5366183Yangmai 5
16Fengde Cunmai 158Jingdong 18100Nongda 211142Taitianmai 118184Yangmai 9
17Fengkang 859Jingdong 22101Qida 195143Tainong 24185Yumai 13
18Gaoyou 50360Jinghua 9102Qimai 2144Wanmai 0066186Yumai 18
19Guomai 30161Jingshuang 16103Qingnong 2145Wanmai 19187Yumai 2
20Han 708662Keyuan 088 104Rumai 1146Wanmai 33188Yunhan 2233
21Haoyou 201863Laizhou 137105Ruihuamai 520147Wanmai 50189Yunhan 618
22Henong 206364Laizhou 953106Ruihuamai 523148Wanxi Mai 0638190Xumai 31
23Henong 604965Lianmai 2107Shannong 12149Wanfeng 269191Chang 4738
24Henong 82566Lianmai 6108Shannong 15150Wannian 2192Zhemai 1
25Heng 13667Lianmai 7109Shannong 20151Wennong 14193Zhenmai 12
26Heng 439968Liangxing 66110Shannong 205152Wennong 5194Zhenmai 168
27Yangmai 2069Liangxing 77111Shannong 21153Wennong 17195Zhenmai 3
28Hengguan 3570Lin Y8012112Shannong 22154Xinmai 16196Zhenmai 4
29Hengza 10271Linmai 4113Shannong 24155Xinmai 18197Zhenmai 9
30Huaimai 2072Lukenmai 9114Shannong 28156Xinmai 20198Zhengmai 004
31Huaimai 2273Lumai 14115Shannong 29157Xinmai 9199Zhengmai 7698
32Huaimai 2874Lumai 15116Shengxuan 3158Xinmai 296200Zhengmai 9023
33Huaimai 2975Lumai 1117Shengxuan 6159Xingmai 18201Zhongmai 175
34Huaimai 3076Lumai 21118Shiluan 02-1160Xingmai 7202Zhongmai 9
35Huaimai 3377Lumai 22119Shimai 26161Xumai 30203Zhoumai 12
36Huaimai 678Lumai 5120Shimai 28162Xumai 33204Zhoumai 18
37Jimai 1979Luyuan 502121Shi 4185163Xuzhou 24205Zhoumai 23
38Jimai 2080Lunxuan 987122Shijiazhuang 407164Xuzhou 438206Zhoumai 24
39Jimai 2181Luohan 11123Shijiazhuang 8165Xuzhou 8207Zhoumai 26
40Jimai 2282Luohan 13124Shimai 12166Yanfu 188208Zhoumai 27
41Jimai 22983Luohan 2125Shimai 15167Yannong 0428209Zhoumai 32
42Jimai 2384Luohan 9126Shimai 18168Yannong 1212210Zinong 033

Appendix B. Significant Associations Markers Based on GWAS of Stomatal Traits

Trait Marker Loc Var −log10 SnpEffect_GeneID Gene Annotation
Length (Ad)SNP1Chr3B:24820702C->T3.50TraesCS3B02G048900oxidoreductase activity
Length (Ad)SNP2Chr6A:522154021G->A3.86TraesCS6A02G290600cellular response to calcium ion,
regulation of stomatal closure
Length (Ab)SNP1Chr2B:579569265C->A4.01TraesCS2B02G407900integral component of membrane
transmembrane transport
Length (Ab)SNP2Chr3A:15558194G->A3.90TraesCS3A02G030300integral component of membrane
Length (Ab)SNP3Chr6A:571935441A->C3.50TraesCS6A02G338600transaminase activity
cellular amino acid metabolic process
Length (Ab)SNP4Chr7B:52781G->A3.65TraesCS7B02G000200ATP binding, ATPase activity, coupled to transmembrane movement of substances
Width (Ad)SNP1Chr6A:609452855C->A3.63TraesCS6A02G400000iron–sulfur cluster binding, metal ion binding
Width (Ad)SNP2Chr6B:710007244C->T4.36TraesCS6B02G451100aspartic endopeptidase activity, intramembrane cleaving, membrane protein proteolysis
Width (Ad)SNP3Chr6D:468806891A->G3.67TraesCS6D02G396300RNA polymerase
Width (Ab)SNP1Chr1A:531677694T->C4.58TraesCS1A02G342900ATPase activity, coupled to transmembrane
movement of substances, ATP binding
Width (Ab)SNP2Chr2A:36021015G->T4.08TraesCS2A02G073500UDP-glycosyltransferase activity
Width (Ab)SNP3Chr2B:54532482A->G3.75TraesCS2B02G094100protein kinase activity, ubiquitin–protein transferase activity,
ATP binding, oxidoreductase activity
Width (Ab)SNP4Chr3B:705591855C->T3.84TraesCS3B02G462700integral component of membrane
Width (Ab)SNP5Chr4A:624015701T->G4.54TraesCS4A02G344100O-methyltransferase activity; protein dimerization activity;
P-nucleus; hormone-mediated signaling pathway; methylation;
Q-positive regulation of transcription, DNA-templated;
Width (Ab)SNP6Chr5B:457122696T->C3.64TraesCS5B02G271400RNA binding; intracellular membrane-bounded organelle; RNA modification
Width (Ab)SNP7Chr5D:549858298C->G4.23TraesCS5D02G536800plant-pathogen interaction; environmental adaptation; organismal Systems
Width (Ab)SNP8Chr7A:548367285T->C3.68TraesCS7A02G375500calcium ion binding; RNA binding; methyltransferase activity;
RNA methylation
Area (Ad)SNP1Chr2B:703976259G-> T5.22TraesCS2B02G507100ATP binding, ADP binding, defense response
Area (Ad)SNP2Chr4B:656816645G-> T4.41TraesCS4B02G370800cation transmembrane transporter activity
Area (Ad)SNP3Chr6B:32330944C->G4.19TraesCS6B02G068800raffinose alpha-galactosidase activity,
carbohydrate metabolic process
Area (Ad)SNP4Chr7B:641806970G->T5.41TraesCS7B02G377100extracellular region
Area (Ab)SNP1Chr7A:675287935G->T4.57TraesCS7A02G684200integral component of membrane
Number (Ad)SNP1Chr1B:667223816G->A3.50TraesCS1B02G447600RNA binding, metal ion binding; nucleus
Number (Ad)SNP2Chr6D:471016984G->A3.59TraesCS6D02G402800amRNA binding, DNA binding, metal ion binding;
poly(A)+mRNA export from nucleus
Number (Ab)SNP1Chr1A:503377978C->A4.75TraesCS1A02G311900DNA-binding transcription factor activity,
RNA polymerase II-specific
Number (Ab)SNP2Chr2A:52763048T->C4.28TraesCS2A02G102500other types of O-glycan biosynthesis, Glycan
biosynthesis and metabolism; oxidoreductase activity
Number (Ab)SNP3Chr2B:153128770A->G3.57TraesCS2B02G178000regulation of stomatal movement
Number (Ab)SNP4Chr3A:535189608G->T3.50TraesCS3A02G301400rRNA 5′-end processing, rRNA processing
Number (Ab)SNP5Chr3B:822681495C->T4.03TraesCS3B02G602700MAPK signaling pathway-plant
Number (Ab)SNP6Chr4A:590590535C->T3.82TraesCS4A02G283700amino sugar and nucleotide sugar metabolism
Number (Ab)SNP7Chr4B:21556722G->A4.30TraesCS4B02G029400amino sugar and nucleotide sugar metabolism
Number (Ab)SNP8Chr4D:11914487C->T4.29TraesCS4D02G026900amino sugar and nucleotide sugar metabolism
Number (Ab)SNP9Chr5A:682898348T->C3.75TraesCS5A02G521600magnesium ion binding, terpene synthase activity;
terpenoid biosynthetic process
Number (Ab)SNP10Chr5B:550156642C->T4.54TraesCS5B02G563800Aminoacyl-tRNA biosynthesis;
DNA binding, sigma factor activity
Number (Ab)SNP11Chr5D:450860102G->T3.89TraesCS5D02G380500ribosome biogenesis in eukaryotes;
GTPase activity, GTP binding
Number (Ab)SNP12Chr7A:116506563T->C3.66TraesCS7A02G160400MAPK signaling pathway-plant, Plant hormone signal transduction, Plant-pathogen interaction
Note: Candidate genes associated with stomatal regulation are marked in red.

References

  1. Qin, X.; Zhang, F.; Liu, C.; Yu, H.; Cao, B.; Tian, S.; Liao, Y.; Siddique, K.H. Wheat yield improvements in China: Past trends and future directions. Field Crops Res. 2015, 177, 117–124. [Google Scholar] [CrossRef]
  2. Lawson, T.; Matthews, J. Guard cell metabolism and stomatal function. Annu. Rev. Plant Biol. 2020, 71, 273–302. [Google Scholar] [CrossRef]
  3. Xie, J.; Fernandes, S.B.; Mayfield-Jones, D.; Erice, G.; Choi, M.; E Lipka, A.; Leakey, A.D. Optical topometry and machine learning to rapidly phenotype stomatal patterning traits for maize QTL mapping. Plant Physiol. 2021, 187, 1462–1480. [Google Scholar] [CrossRef]
  4. Danila, F.R.; Quick, W.P.; White, R.G.; Furbank, R.T.; von Caemmerer, S. The metabolite pathway between bundle sheath and mesophyll: Quantification of plasmodesmata in leaves of C3 and C4 monocots. Plant Cell 2016, 28, 1461–1471. [Google Scholar] [CrossRef]
  5. Kim, K.W. Methanol fixation for scanning electron microscopy of plants. Appl. Microsc. 2020, 50, 10. [Google Scholar] [CrossRef]
  6. Abràmoff, M.D.; Magalhães, P.J.; Ram, S.J. Image processing with ImageJ. Biophotonics Int. 2004, 11, 36–42. [Google Scholar]
  7. Cheng, Y.; Cao, L.; Wang, S.; Li, Y.; Wang, H.; Zhou, Y. Analyses of plant leaf cell size, density and number, as well as trichome number using cell counter plugin. Bio-Protocol 2014, 4, e1165. [Google Scholar] [CrossRef]
  8. Meeus, S.; Van den Bulcke, J.; Wyffels, F. From leaf to label: A robust automated workflow for stomata detection. Ecol. Evol. 2020, 10, 9178–9191. [Google Scholar] [CrossRef] [PubMed]
  9. Fetter, K.C.; Eberhardt, S.; Barclay, R.S.; Wing, S.; Keller, S.R. StomataCounter: A neural network for automatic stomata identification and counting. New Phytol. 2019, 223, 1671–1681. [Google Scholar] [CrossRef] [PubMed]
  10. Badgujar, C.M.; Poulose, A.; Gan, H. Agricultural object detection with You Only Look Once (YOLO) Algorithm: A bibliometric and systematic literature review. Comput. Electron. Agric. 2024, 223, 109090. [Google Scholar] [CrossRef]
  11. Pathoumthong, P.; Zhang, Z.; Roy, S.J.; El Habti, A. Rapid non-destructive method to phenotype stomatal traits. Plant Methods 2023, 19, 36. [Google Scholar] [CrossRef] [PubMed]
  12. Sultana, S.N.; Park, H.; Choi, S.H.; Jo, H.; Song, J.T.; Lee, J.-D.; Kang, Y.J. Optimizing the experimental method for stomata-profiling automation of soybean leaves based on deep learning. Plants 2021, 10, 2714. [Google Scholar] [CrossRef] [PubMed]
  13. Lei, L.; Yang, Q.; Yang, L.; Shen, T.; Wang, R.; Fu, C. Deep learning implementation of image segmentation in agricultural applications: A comprehensive review. Artif. Intell. Rev. 2024, 57, 149. [Google Scholar] [CrossRef]
  14. He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017. [Google Scholar]
  15. Costa, L.; Archer, L.; Ampatzidis, Y.; Casteluci, L.; Caurin, G.A.; Albrecht, U. Determining leaf stomatal properties in citrus trees utilizing machine vision and artificial intelligence. Precis. Agric. 2021, 22, 1107–1119. [Google Scholar] [CrossRef]
  16. Jayakody, H.; Petrie, P.; Boer, H.J.d.; Whitty, M. A generalised approach for high-throughput instance segmentation of stomata in microscope images. Plant Methods 2021, 17, 27. [Google Scholar] [CrossRef]
  17. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015. [Google Scholar]
  18. Zhu, C.; Hu, Y.; Mao, H.; Li, S.; Li, F.; Zhao, C.; Luo, L.; Liu, W.; Yuan, X. A deep learning-based method for automatic assessment of stomatal index in wheat microscopic images of leaf epidermis. Front. Plant Sci. 2021, 12, 716784. [Google Scholar] [CrossRef] [PubMed]
  19. Xie, E.; Wang, W.; Yu, Z.; Anandkumar, A.; Alvarez, J.M.; Luo, P. SegFormer: Simple and efficient design for semantic segmentation with transformers. Adv. Neural Inf. Process. Syst. 2021, 34, 12077–12090. [Google Scholar]
  20. Visscher, P.M.; Brown, M.A.; McCarthy, M.I.; Yang, J. Five years of GWAS discovery. Am. J. Hum. Genet. 2012, 90, 7–24. [Google Scholar] [CrossRef]
  21. Duan, P.; Xu, J.; Zeng, D.; Zhang, B.; Geng, M.; Zhang, G.; Huang, K.; Huang, L.; Xu, R.; Ge, S. Natural variation in the promoter of GSE5 contributes to grain size diversity in rice. Mol. Plant 2017, 10, 685–694. [Google Scholar] [CrossRef]
  22. Xiao, Y.; Liu, H.; Wu, L.; Warburton, M.; Yan, J. Genome-wide association studies in maize: Praise and stargaze. Mol. Plant 2017, 10, 359–374. [Google Scholar] [CrossRef]
  23. Saini, D.K.; Chopra, Y.; Singh, J.; Sandhu, K.S.; Kumar, A.; Bazzer, S.; Srivastava, P. Comprehensive evaluation of mapping complex traits in wheat using genome-wide association studies. Mol. Breed. 2022, 42, 1. [Google Scholar] [CrossRef]
  24. Maurer, A.; Draba, V.; Pillen, K. Genomic dissection of plant development and its impact on thousand grain weight in barley through nested association mapping. J. Exp. Bot. 2016, 67, 2507–2518. [Google Scholar] [CrossRef]
  25. Soltabayeva, A.; Ongaltay, A.; Omondi, J.O.; Srivastava, S. Morphological, physiological and molecular markers for salt-stressed plants. Plants 2021, 10, 243. [Google Scholar] [CrossRef]
  26. Gibbs, J.A.; Mcausland, L.; Robles-Zazueta, C.A.; Murchie, E.H.; Burgess, A.J. A deep learning method for fully automatic stomatal morphometry and maximal conductance estimation. Front. Plant Sci. 2021, 12, 780180. [Google Scholar] [CrossRef] [PubMed]
  27. Qur’ania, A.; Herdiyeni, Y.; Kusuma, W.A.; Wigena, A.H.; Suhesti, S. Detection and Calculation of Stomatal Density Using YOLOv5: A Study of High-Yielding Patchouli Varieties. J. Image Graph. 2024, 12, 228–238. [Google Scholar] [CrossRef]
  28. Wacker, T.S.; Smith, A.G.; Jensen, S.M.; Pflüger, T.; Hertz, V.G.; Rosenqvist, E.; Liu, F.; Dresbøll, D.B. Stomata morphology measurement with interactive machine learning: Accuracy, speed, and biological relevance? Plant Methods 2025, 21, 95. [Google Scholar] [CrossRef] [PubMed]
  29. Lawson, T.; Blatt, M.R. Stomatal size, speed, and responsiveness impact on photosynthesis and water use efficiency. Plant Physiol. 2014, 164, 1556–1570. [Google Scholar] [CrossRef]
  30. Caine, R.S.; Yin, X.; Sloan, J.; Harrison, E.L.; Mohammed, U.; Fulton, T.; Biswal, A.K.; Dionora, J.; Chater, C.C.; Coe, R.A. Rice with reduced stomatal density conserves water and has improved drought tolerance under future climate conditions. New Phytol. 2019, 221, 371–384. [Google Scholar] [CrossRef]
  31. Dunn, J.; Hunt, L.; Afsharinafar, M.; Meselmani, M.A.; Mitchell, A.; Howells, R.; Wallington, E.; Fleming, A.J.; Gray, J.E. Reduced stomatal density in bread wheat leads to increased water-use efficiency. J. Exp. Bot. 2019, 70, 4737–4748. [Google Scholar] [CrossRef]
  32. Hughes, J.; Hepworth, C.; Dutton, C.; Dunn, J.A.; Hunt, L.; Stephens, J.; Waugh, R.; Cameron, D.D.; Gray, J.E. Reducing stomatal density in barley improves drought tolerance without impacting on yield. Plant Physiol. 2017, 174, 776–787. [Google Scholar] [CrossRef]
  33. Wang, S.-X.; Zhu, Y.-L.; Zhang, D.-X.; Shao, H.; Liu, P.; Hu, J.-B.; Zhang, H.; Zhang, H.-P.; Chang, C.; Lu, J. Genome-wide association study for grain yield and related traits in elite wheat varieties and advanced lines using SNP markers. PLoS ONE 2017, 12, e0188662. [Google Scholar] [CrossRef] [PubMed]
  34. Li, S.; Yu, S.; Zhang, Y.; Zhu, D.; Li, F.; Chen, B.; Mei, F.; Du, L.; Ding, L.; Chen, L. Genome-wide association study revealed TaHXK3-2A as a candidate gene controlling stomatal index in wheat seedlings. Plant Cell Environ. 2022, 45, 2306–2323. [Google Scholar] [CrossRef] [PubMed]
  35. Schoppach, R.; Taylor, J.D.; Majerus, E.; Claverie, E.; Baumann, U.; Suchecki, R.; Fleury, D.; Sadok, W. High resolution mapping of traits related to whole-plant transpiration under increasing evaporative demand in wheat. J. Exp. Bot. 2016, 67, 2847–2860. [Google Scholar] [CrossRef]
  36. MacAlister, C.A.; Ohashi-Ito, K.; Bergmann, D.C. Transcription factor control of asymmetric cell divisions that establish the stomatal lineage. Nature 2007, 445, 537–540. [Google Scholar] [CrossRef]
  37. Hunt, L.; Gray, J.E. The signaling peptide EPF2 controls asymmetric cell divisions during stomatal development. Curr. Biol. 2009, 19, 864–869. [Google Scholar] [CrossRef]
  38. Sun, C.; Dong, Z.; Zhao, L.; Ren, Y.; Zhang, N.; Chen, F. The Wheat 660K SNP array demonstrates great potential for marker-assisted selection in polyploid wheat. Plant Biotechnol. J. 2020, 18, 1354–1360. [Google Scholar] [CrossRef]
  39. Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, S.; Li, X. rMVP: A memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genom. Proteom. Bioinform. 2021, 19, 619–628. [Google Scholar] [CrossRef]
  40. Ma, S.; Wang, M.; Wu, J.; Guo, W.; Chen, Y.; Li, G.; Wang, Y.; Shi, W.; Xia, G.; Fu, D. WheatOmics: A platform combining multiple omics data to accelerate functional genomics studies in wheat. Mol. Plant 2021, 14, 1965–1968. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Model segmentation performance testing and parameter variation of the WSF model during the training process. (The blue area is the stomatal mask segmented by the model.) (a) The segmentation results of wheat stomata. (b) Training and testing loss curve image. (c) Image of precision variations of different models. (d) Image of MIoU variations of different models.
Figure 1. Model segmentation performance testing and parameter variation of the WSF model during the training process. (The blue area is the stomatal mask segmented by the model.) (a) The segmentation results of wheat stomata. (b) Training and testing loss curve image. (c) Image of precision variations of different models. (d) Image of MIoU variations of different models.
Plants 14 03016 g001
Figure 2. Stomatal trait regression comparison graph. Each genotype had 3–4 replicates. (The red lines are the fitted line segments.) (a) Fitting result of stomatal length. (b) Fitting result of stomatal width. (c) Fitting result of stomatal area. (d) Fitting result of stomatal number.
Figure 2. Stomatal trait regression comparison graph. Each genotype had 3–4 replicates. (The red lines are the fitted line segments.) (a) Fitting result of stomatal length. (b) Fitting result of stomatal width. (c) Fitting result of stomatal area. (d) Fitting result of stomatal number.
Plants 14 03016 g002
Figure 3. Frequency distributions and violin plots of four stomatal traits, based on data from three replicates of 210 wheat varieties. (a) Frequency distribution diagram of stomatal length. (b) Frequency distribution diagram of stomatal width. (c) Frequency distribution diagram of stomatal area. (d) Frequency distribution diagram of stomatal number. (e) Violin plot of stomatal length. (f) Violin plot of stomatal width. (g) Violin plot of stomatal area. (h) Violin plot of stomatal number. Three asterisks (***) indicate statistically significant differences as determined by one-way ANOVA followed by Fisher’s LSD test (p < 0.001).
Figure 3. Frequency distributions and violin plots of four stomatal traits, based on data from three replicates of 210 wheat varieties. (a) Frequency distribution diagram of stomatal length. (b) Frequency distribution diagram of stomatal width. (c) Frequency distribution diagram of stomatal area. (d) Frequency distribution diagram of stomatal number. (e) Violin plot of stomatal length. (f) Violin plot of stomatal width. (g) Violin plot of stomatal area. (h) Violin plot of stomatal number. Three asterisks (***) indicate statistically significant differences as determined by one-way ANOVA followed by Fisher’s LSD test (p < 0.001).
Plants 14 03016 g003
Figure 4. Correlation analysis of the 8 stomatal traits of the 210 wheat cultivars. (* means p < 0.05).
Figure 4. Correlation analysis of the 8 stomatal traits of the 210 wheat cultivars. (* means p < 0.05).
Plants 14 03016 g004
Figure 5. Genome-wide association study for wheat SNP markers of 8 stomatal traits of 210 wheat cultivars. (ah) are Manhattan plots, in which the red arrows indicate significant associated loci.
Figure 5. Genome-wide association study for wheat SNP markers of 8 stomatal traits of 210 wheat cultivars. (ah) are Manhattan plots, in which the red arrows indicate significant associated loci.
Plants 14 03016 g005
Figure 6. Segmentation results of the WSF model under different plants. (Red boxes in the figure are magnified stomata, making it easier to observe the segmentation effect of the model.).
Figure 6. Segmentation results of the WSF model under different plants. (Red boxes in the figure are magnified stomata, making it easier to observe the segmentation effect of the model.).
Plants 14 03016 g006
Figure 7. Schematic of the WSF model architecture. (Different colors represent modules with different functions, and the arrows between modules indicate a progressive relationship).
Figure 7. Schematic of the WSF model architecture. (Different colors represent modules with different functions, and the arrows between modules indicate a progressive relationship).
Plants 14 03016 g007
Table 1. Performance comparison of the different methods.
Table 1. Performance comparison of the different methods.
Model NamePrecisionMIoU
WSF (our)93.6%91.2%
SegFormer91.7%90.1%
Deeplabv3+89.5%86.8%
UNet87.7%85.2%
Table 2. Performance comparison of the different plants.
Table 2. Performance comparison of the different plants.
Plant NamePrecisionMIoU
Peanut81.1%77.5%
Chrysanthemum78.2%72.3%
Cotton83.6%79.0%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, H.; Min, H.; Liang, S.; Qin, B.; Sun, Q.; Pei, Z.; Pan, Q.; Wang, X.; Cai, J.; Zhou, Q.; et al. WSF: A Transformer-Based Framework for Microphenotyping and Genetic Analyzing of Wheat Stomatal Traits. Plants 2025, 14, 3016. https://doi.org/10.3390/plants14193016

AMA Style

Zhou H, Min H, Liang S, Qin B, Sun Q, Pei Z, Pan Q, Wang X, Cai J, Zhou Q, et al. WSF: A Transformer-Based Framework for Microphenotyping and Genetic Analyzing of Wheat Stomatal Traits. Plants. 2025; 14(19):3016. https://doi.org/10.3390/plants14193016

Chicago/Turabian Style

Zhou, Honghao, Haijiang Min, Shaowei Liang, Bingxi Qin, Qi Sun, Zijun Pei, Qiuxiao Pan, Xiao Wang, Jian Cai, Qin Zhou, and et al. 2025. "WSF: A Transformer-Based Framework for Microphenotyping and Genetic Analyzing of Wheat Stomatal Traits" Plants 14, no. 19: 3016. https://doi.org/10.3390/plants14193016

APA Style

Zhou, H., Min, H., Liang, S., Qin, B., Sun, Q., Pei, Z., Pan, Q., Wang, X., Cai, J., Zhou, Q., Zhong, Y., Huang, M., Jiang, D., Chen, J., & Li, Q. (2025). WSF: A Transformer-Based Framework for Microphenotyping and Genetic Analyzing of Wheat Stomatal Traits. Plants, 14(19), 3016. https://doi.org/10.3390/plants14193016

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop