Next Article in Journal
Phytonutrient-Enriched Prebiotic Mixture Primes the Gut Environment to Enhance Probiotic Efficacy: Ex Vivo Screening and a Human Clinical Trial
Previous Article in Journal
Exploratory Machine Learning and Omics Integration in the Search for Biomarkers of Papillary Thyroid Cancer
Previous Article in Special Issue
Genome-Wide Identification and Expression Analysis of the CesA/Csl Superfamily in Madhuca pasquieri
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evolutionary Expansion and Diversification of the GDSL Gene Family in Grasses

1
Key Laboratory of National Forestry and Grassland Administration on Osmanthus fragrans, School of Nuclear Technology and Chemistry & Biology, Hubei University of Science and Technology, Xianning 437100, China
2
Xianning Forestry Academy of Sciences, Xianning 437100, China
3
Public Inspection and Testing Center, Xianning 437100, China
4
Land Reserve Center of Gaomi Bureau of Natural Resources and Planning, Gaomi 261500, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Biology 2026, 15(13), 1005; https://doi.org/10.3390/biology15131005 (registering DOI)
Submission received: 10 May 2026 / Revised: 21 June 2026 / Accepted: 22 June 2026 / Published: 25 June 2026
(This article belongs to the Special Issue Advances in Plant Genomics and Genome Editing)

Simple Summary

We explored a key class of functional genes across nine typical grass crops and the model plant Arabidopsis. We identified 1707 members of this gene group and found its obvious gene number growth in grasses, especially in polyploid wheat. Gene replication events mainly drove the expansion of this gene family, and most retained genes remained stable and conservative during evolution. Closely related grass species share highly similar gene distribution, and some gene subgroups uniquely formed in grasses. These genes carry abundant regulatory signals related to stress and hormone response, and many members show obvious expression changes under low temperature in rice and wheat. Our findings clarify the evolutionary characteristics and stress response roles of these genes in grasses, and provide a useful gene resource for cultivating stress-tolerant cereal crops in future agricultural breeding.

Abstract

The glycine-aspartic acid-serine-leucine (GDSL) esterase/lipase family is a functionally diverse group of hydrolytic enzymes involved in multiple plant biological processes, including stress adaptation and development. However, its evolutionary patterns, functional conservation, and stress-responsive mechanisms in grasses remain not fully elucidated. In this study, a comprehensive comparative genomic analysis was performed on the GDSL gene family across nine representative grass species and Arabidopsis thaliana. Genome-wide identification, phylogenetic analysis, duplication pattern detection, synteny analysis, cis-regulatory element prediction, protein–protein interaction (PPI) network construction, and RNA-seq-based expression profiling were employed. A total of 1707 GDSL genes were identified, with substantial expansion in grasses, especially hexaploid wheat. Whole-genome and segmental duplications were the major drivers of family expansion, with most duplicated genes under strong purifying selection. A grass-specific clade (C3-2) was identified, and extensive syntenic conservation was observed among closely related grasses. Promoter analysis revealed enrichment of stress- and hormone-responsive cis-elements, and RNA-seq showed dynamic GDSL expression under low-temperature stress in rice and wheat. These findings demonstrate that the expansion of the GDSL gene family in grasses is driven by polyploidization and lineage-specific duplication, accompanied by the emergence of a grass-specific clade (C3-2) and regulatory diversification, collectively shaping stress-responsive evolutionary innovation in Poaceae.

1. Introduction

Glycine-aspartic acid-serine-leucine (GDSL) proteins constitute a distinct subfamily within the serine-glycine-asparagine-histidine (SGNH) hydrolase superfamily, defined by a highly conserved Gly–Asp–Ser–Leu (GDSL) motif near the N-terminus [1,2,3]. This structural feature distinguishes them from classical lipases harboring the glycine-any amino acid-serine-any amino acid-glycine (GXSXG) motif, endowing GDSL with exceptional catalytic flexibility and broad substrate specificity to hydrolyze diverse lipid-derived and ester-containing compounds [1,4]. Owing to these versatile catalytic properties, GDSLs are evolutionarily conserved across bacteria, fungi, animals, and plants, participating in core biological and metabolic processes across kingdoms [1,3,5]. Recent studies have expanded the acyltransferase landscape in plants, confirming that GDSL enzymes cooperate with benzyl alcohol O-acetyltransferase/anthocyanin O-hydroxycinnamoyltransferase/hydroxycinnamoyl-CoA shikimate/quinate hydroxycinnamoyltransferase/dihydroflavonol 4-O-reductase (BAHD) acyltransferase and serine carboxypeptidase-like (SCPL) families to maintain lipid and metabolite homeostasis [6,7,8,9].
In plants, GDSLs have emerged as key regulators of fundamental physiological and developmental programs. Recent functional studies have validated their critical roles in seed germination, pollen development, cuticle formation, cell wall remodeling, secondary metabolism, and lipid homeostasis [10,11,12]. For instance, the rice BS1 (brittle leaf sheath1) gene encoding a GDSL modulates cell wall biosynthesis and mechanical strength [13], while Arabidopsis GDSLs govern cutin biosynthesis, epidermal patterning, and reproductive development [10,11]. Beyond development, GDSLs are central to plant stress resilience: their expression is strongly induced by drought, salinity, cold, pathogen invasion, and wounding [14,15,16,17]. Overexpression of specific GDSL genes enhances abiotic stress tolerance by regulating cuticular wax deposition, membrane lipid stability, and reactive oxygen species (ROS) scavenging [14,16], while other family members directly bolster plant immunity by activating systemic resistance and antimicrobial defenses [18,19,20,21]. Natural variation in TaGDSL-7D in wheat regulates grain weight and yield, and ZmSWL3 in maize regulates drought resistance through endoplasmic reticulum localization [21,22]. GDSL genes also improve saline–alkali resistance by enhancing lignin synthesis and ROS balance [8]. Collectively, these studies demonstrate that plant GDSLs have undergone extensive functional diversification, acting as critical molecular links between development, metabolism, and environmental adaptation [12,13].
The evolutionary expansion and functional divergence of the GDSL gene family are tightly coupled to plant genome evolution. With the proliferation of high-quality plant genome assemblies over the past five years, genome-wide analyses have confirmed that GDSL genes form large multigene families across angiosperms [12,23,24]. Marked variation in GDSL copy number across species indicates that lineage-specific expansion has shaped the family’s evolutionary trajectory [16,25]. Tandem duplication, segmental duplication, and whole-genome duplication (WGD) have collectively driven GDSL expansion in soybean, cotton, Rosaceae species, and other lineages [16,23,25]. Notably, duplicated GDSL paralogs frequently diverge in exon–intron architecture, conserved motif composition, and spatiotemporal expression, indicative of widespread subfunctionalization and neofunctionalization during evolution [16,23,24]. Cross-species annotation reveals that the GDSL hydrolase family has conserved motifs and low sequence similarity in plants [26]. GbGDSL in cotton and GDSL53 in Arabidopsis further confirm the functional differentiation of family members in cell elongation and polysaccharide acetylation modification [27,28].
Grasses (Poaceae) represent one of the most evolutionarily successful and economically vital plant families, encompassing staple cereal crops that sustain global food security [29,30]. Grass genomes have undergone recurrent polyploidization, chromosomal rearrangements, transposable element proliferation, and lineage-specific duplication events over the past decade, fostering extensive genomic diversity and functional innovation [29,31,32]. Triticeae species—including bread wheat and its diploid progenitors—experienced multiple rounds of polyploidization, a major driver of gene family expansion and adaptive evolution [29,32,33]. Comparative genomic analyses further reveal that grass genes derived from WGD or tandem duplication often evolve specialized functions or stress-responsive regulatory patterns [33,34], making grasses an exceptional model for dissecting how gene duplication fuels multigene family diversification. In Poaceae crops, AmGDSL1 in Agropyron mongolicum and haplotype differentiation of the OsGDSL family in rice are directly related to environmental adaptive evolution [35,36]. Genetic analysis of peduncle length in sorghum and yield traits in rapeseed also confirm the association between Poaceae gene family expansion and agronomic trait improvement [37,38].
Despite substantial progress in characterizing GDSL families in individual plant species since 2021, most research has focused on single-genome analyses or targeted gene functional studies [16,23,25]. Comprehensive comparative evolutionary analyses of GDSL genes across representative grasses remain scarce. Critical knowledge gaps persist regarding the phylogenetic relationships, duplication dynamics, selective pressures, and lineage-specific diversification of Poaceae GDSL genes. Moreover, the evolutionary conservation and divergence of GDSL genes between polyploid Triticeae and diploid grass lineages have not been systematically explored, hindering a unified understanding of their adaptive evolution in cereals.
In this study, we address this gap by systematically investigating the evolutionary mechanisms underlying the expansion and functional diversification of GDSL genes in Poaceae including Aegilops tauschii, Hordeum vulgare, Triticum aestivum, Triticum urartu, Oryza sativa, Brachypodium distachyon, Zea mays, Sorghum bicolor, and Setaria italica. Specifically, we focus on polyploidization-driven gene family expansion, the emergence of grass-specific evolutionary lineages, and regulatory divergence associated with environmental stress responses. We selected rice and wheat to explore GDSL expression under cold stress due to its agricultural importance and abundant related omics data, while other stresses and species will be studied later. Our analyses reveal that GDSL gene evolution in grasses is shaped by coordinated genomic and regulatory processes, providing new insights into the evolutionary innovation of this gene family in monocots.

2. Materials and Methods

2.1. Genome Data Collection and Identification of GDSL Genes

Genome assemblies and corresponding annotation files of eight representative grass species, including Brachypodium distachyon (v3.1), Oryza sativa (IRGSP-1.0), Sorghum bicolor (v3.1.1), Setaria italica (v2.2), Triticum aestivum (IWGSC RefSeq v1.1), Triticum urartu (v1.0), Aegilops tauschii (v1.0), and Hordeum vulgare (IBSC v2), were retrieved from Phytozome v13 and Ensembl Plants v54. The Arabidopsis thaliana genome (TAIR10) was downloaded from TAIR and used as a dicot outgroup for comparative evolutionary analysis [39,40,41].
For systematic identification of GDSL esterase/lipase genes, the hidden Markov model (HMM) profile of the conserved GDSL domain (PF00657) was obtained from InterPro [42]. HMMER v3.3.2 was used to identify candidate GDSL proteins using the Hidden Markov Model profile of the GDSL domain (PF00657), with an E-value threshold of 1 × 10−10 and a minimum domain coverage of ≥70% [43]. To improve identification sensitivity and reduce false negatives, BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi (accessed on 20 April 2026))searches were additionally performed using all annotated Arabidopsis thaliana GDSL proteins as queries, with an E-value cutoff of ≤1 × 10−10, sequence identity ≥ 30%, and coverage ≥ 50% [44]. All candidate sequences were further validated using Pfam-scan v1.3 to confirm the presence of the complete GDSL (PF00657) domain. Only proteins passing all filtering steps were retained for downstream analyses. For genes with multiple transcript isoforms, the longest protein isoform was retained using the R package seqfinder for downstream analyses [45]. Protein physicochemical properties (molecular weight, pI, instability index) were calculated using ProtParam (https://web.expasy.org/protparam/ (accessed on 20 April 2026)).

2.2. Phylogenetic Analysis of GDSL Proteins

Full-length GDSL protein sequences were aligned using MUSCLE v3.8 with default parameters, and poorly aligned regions were trimmed using TrimAl v1.2 (gap threshold = 0.2) [46]. A maximum-likelihood (ML) phylogenetic tree was constructed using IQ-TREE v2.0.3 with the best-fit model selected by ModelFinder. Branch support was assessed using 1000 ultrafast bootstrap replicates and SH-aLRT tests [47]. The tree was rooted using Arabidopsis GDSL proteins as outgroup and visualized with the R package ggtree v3.2.1. Orthogroup classification was performed using OrthoFinder v2.5.4 with default settings.

2.3. Chromosomal Distribution and Gene Duplication Analyses

Chromosomal positions of GDSL genes were extracted from GFF annotations and plotted onto chromosomes using gggenes (https://wilkox.org/gggenes/, (accessed on 21 April 2026)) [48]. Gene density and tandem clusters (distance ≤ 100 kb) were analyzed in each grass genome. Collinearity and duplication events were identified using MCScanX v1.0.0 (https://github.com/wyp1125/MCScanX, (accessed on 21 April 2026)) with BLASTP E-value ≤ 1 × 10−10. Inter- and intra-species synteny blocks were visualized using JCVI v1.2.7 and circlize v0.4.18 (https://github.com/jokergoo/circlize, (accessed on 21 April 2026)). Synonymous (Ks) and non-synonymous (Ka) substitution rates and Ka/Ks ratios were calculated using KaKs_Calculator 2.0 with the YN model. Duplicated pairs were classified as tandem, proximal, dispersed, or segmental; Ka/Ks < 1 indicates purifying selection, Ka/Ks = 1 neutral selection, and Ka/Ks > 1 positive selection.

2.4. Cis-Regulatory Element Analysis in the Promoters of GDSL Genes

To investigate potential regulatory divergence among GDSL genes, 2-kb genomic sequences upstream of the transcription start sites were extracted as putative promoter regions using custom Python v3.14 scripts. Cis-regulatory elements were predicted using the PlantCARE database [49]. Identified cis-elements associated with phytohormone responsiveness, stress response, light responsiveness, and developmental regulation were statistically summarized and classified into different functional categories. The distribution patterns of cis-elements were visualized using the R v4.5.3 packages ggtree and gggenes.

2.5. Protein–Protein Interaction Network Analysis of GDSL Proteins

AraNet2 is a functional gene network framework based on Arabidopsis interaction data and computational inference. It enables orthology-based prediction of conserved functional associations across species. Protein–protein interaction (PPI) networks of GDSL proteins were predicted using the AraNet2 database [50]. Functional annotation of proteins within the PPI network was conducted using EggNOG-mapper [51]. The resulting interaction networks were visualized using the R package ggNetView (https://github.com/Jiawang1209/ggNetView (accessed on 22 April 2026)) [50,51].

2.6. Expression Profiling Analysis of GDSL Genes Based on RNA-Seq Data

Publicly available RNA-seq datasets were downloaded from the NCBI Sequence Read Archive (SRA) database (https://www.ncbi.nlm.nih.gov/home/download/ (accessed on 24 April 2026)) under accession numbers PRJNA772921 and PRJNA787922. Raw sequencing reads were filtered and quality-controlled using fastp [52]. Clean reads were subsequently aligned to the corresponding reference genomes using HISAT2 v2.2.1 [53]. Gene expression abundance was quantified using featureCounts and normalized as transcripts per million (TPM) values [54]. Tissue-specific expression patterns of GDSL genes were analyzed across different developmental tissues and conditions. Expression patterns of GDSL genes across different tissues were visualized using the R v4.5.3 package pheatmap.

2.7. qRT-PCR Validation of Transcriptome Data

For japonica rice, seedlings were maintained at 17 °C and sampled at 1 d, 2 d, 3 d, and 4 d post cold exposure, alongside untreated control seedlings. For hexaploid wheat, leaf tissues were harvested under a series of gradient cold treatments ranging from 5 °C down to −25 °C (Tn-M5 to Tn-M25). All leaf tissues were immediately frozen in liquid nitrogen after collection and stored at −80 °C. Gene-specific primers targeting unique coding regions of candidate GDSLs were designed via Primer 5.0, with OsActin and TaActin chosen as internal reference genes for rice and wheat, respectively (Supplementary Data S1).
Total RNA was extracted using a commercial plant RNA isolation kit (Vazyme, Nanjing, China), followed by DNase I digestion to eliminate genomic DNA contamination. RNA integrity and purity were checked by agarose gel electrophoresis and NanoDrop spectrophotometry. Qualified RNA was reverse-transcribed into first-strand cDNA using the HiScript II RT kit (Vazyme). The 10 μL qRT-PCR reaction system contained 5 μL 2 × ChamQ SYBR qPCR Master Mix, 2 μL diluted cDNA template, 0.5 μL forward primer, 0.5 μL reverse primer and 2 μL RNase-free water. The amplification program included an initial denaturation step at 95 °C for 3 min, followed by 40 cycles of 95 °C for 10 s and 60 °C for 30 s; melting curve analysis was performed after amplification to confirm single specific amplicons. Each sample contained three biological replicates and three technical replicates.

3. Results

3.1. Genome-Wide Identification and Physicochemical Features of GDSL Proteins

Genome-wide analysis identified 1707 GDSL genes across nine representative grass species and Arabidopsis thaliana (Figure 1, Supplementary Data S2), with gene copy numbers ranging from 107 in A. thaliana to 522 in hexaploid Triticum aestivum. GDSL gene copy numbers varied substantially among species, with most diploid genomes containing 107–160 members. The remarkable expansion observed in wheat is likely associated with its allohexaploid origin and extensive gene retention following polyploidization, suggesting that polyploid evolution has played an important role in shaping GDSL family expansion in grasses. In contrast, diploid grass species displayed relatively similar GDSL gene numbers, indicating a generally conserved family size across Poaceae lineages.
Despite considerable variation in gene copy number, the physicochemical properties of GDSL proteins remained highly conserved among species. Protein lengths ranged from 113 to 1427 amino acids, whereas median lengths were consistently distributed between 367 and 383 amino acids. Similarly, molecular weights varied from approximately 12.8 to 157.8 kDa, while median values remained close to 40 kDa across all species. Hydrophobicity analysis showed that most GDSL proteins were weakly hydrophilic, with median GRAVY values ranging from −0.07 to 0.03. In contrast, theoretical isoelectric points (pI) exhibited broader variation, ranging from approximately 4.1 to 12.6. Notably, grass species showed wider pI distributions than A. thaliana, suggesting greater physicochemical variation in GDSL proteins in grasses. Overall, these findings suggest that expansion of the GDSL gene family occurred while maintaining relatively conserved core protein characteristics.

3.2. Phylogenetic Classification and Evolutionary Diversification of GDSL Proteins

To investigate the evolutionary relationships of GDSL proteins in grasses, a maximum-likelihood phylogenetic tree was constructed using GDSL protein sequences from nine representative Poaceae species, with Arabidopsis thaliana included as an outgroup (Figure 2). Based on phylogenetic topology and branch support, all GDSL proteins were classified into five major clades, designated C1, C2, C3-1, C3-2, and C4. The C3-2 clade was identified as a well-supported sub-lineage within the C3 group based on maximum-likelihood phylogenetic analysis. This clade is strongly supported by both ultrafast bootstrap and SH-aLRT values. All members of this clade are exclusively derived from grass species, and no Arabidopsis thaliana sequences are present, indicating a monocot-specific evolutionary lineage within the GDSL gene family.
Substantial differences in gene abundance were observed among the five clades. Among them, C3-1 represented the largest GDSL subgroup and accounted for the highest proportion of members in all analyzed grass species, particularly showing pronounced expansion in Triticum aestivum. In contrast, C2 consistently contained the fewest members across species, indicating that this lineage has remained relatively conserved but numerically restricted throughout evolution. Meanwhile, C1 and C4 maintained relatively high and stable gene numbers among species, suggesting that these clades may perform conserved biological functions. Compared with other clades, C3-2 exhibited the greatest variation in copy number among species, especially showing expansion trends in Triticeae species, implying lineage-specific gene amplification and retention during grass evolution.
Notably, expansion of the GDSL gene family in hexaploid Triticum aestivum was highly uneven across clades, with most duplicated genes concentrated in C3-1 and C4. Similar but less pronounced expansion patterns were also observed in Triticum urartu and Aegilops tauschii. These findings suggest that polyploidization and subsequent gene retention preferentially promoted the expansion of specific GDSL subgroups rather than uniformly increasing the size of the entire gene family. Collectively, the asymmetric distribution of GDSL proteins among clades highlights distinct evolutionary trajectories and selective pressures acting on individual GDSL lineages during grass evolution.

3.3. Chromosomal Distribution and Duplication Patterns

To investigate the evolutionary mechanisms underlying GDSL gene family expansion in grasses, gene duplication patterns and intra-species collinearity analyses were performed using MCScanX across the analyzed species (Figure 3A,B, Supplementary Figure S1A–H). Five major duplication categories were identified, including singleton, dispersed, proximal, tandem, and whole-genome duplication/segmental duplication (WGD/segmental). Among these, WGD/segmental duplication represented the predominant expansion mode in nearly all grass species, whereas tandem duplication contributed to local amplification of only a subset of GDSL genes (Supplementary Data S3). These results suggest that large-scale genome duplication events have played a central role in shaping the expansion and diversification of the GDSL family during grass evolution.
Clear differences in duplication complexity were observed among species with different genome sizes and evolutionary histories. Polyploid wheat (Triticum aestivum) exhibited the most extensive collinearity relationships and contained the largest number of duplicated GDSL gene pairs, consistent with its recent allohexaploid origin. Similar but less extensive WGD-derived duplication patterns were also detected in Triticum urartu, Aegilops tauschii, and Hordeum vulgare, indicating that recurrent genome duplication and chromosomal rearrangement events contributed substantially to GDSL family expansion within Triticeae species. In contrast, diploid grasses such as Oryza sativa, Brachypodium distachyon, and Setaria italica displayed comparatively fewer duplicated gene pairs and simpler collinearity structures, suggesting relatively moderate expansion histories.
The duplication types could be clearly distinguished based on gene distribution patterns and gene identifiers. Tandem duplication events were generally characterized by physically adjacent genes with highly similar gene IDs located on the same chromosome, such as TraesCS7A02G535700–TraesCS7A02G535800 and Seita.5G150300–Seita.5G150400, indicating recent local duplication events. In contrast, WGD/segmental duplicated gene pairs were typically distributed on different chromosomes or distant chromosomal regions and often exhibited larger gene ID divergence, such as TraesCS1A02G250600–TraesCS1B02G261200 and TraesCS7A02G388700–TraesCS7B02G290600, reflecting large-scale chromosomal duplication and subgenome conservation following polyploidization. Collectively, these findings indicate that WGD/segmental duplication has been the dominant evolutionary force driving GDSL gene family expansion in grasses, whereas tandem duplication contributed mainly to lineage-specific local diversification.
Ka/Ks analysis further revealed that most duplicated GDSL gene pairs exhibited Ka/Ks ratios below 1.0, (Figure 3C) regardless of duplication type, indicating that the majority of duplicated genes have undergone strong purifying selection during evolution. Although a few tandem duplicated gene pairs displayed relatively elevated Ka/Ks values, suggesting possible functional divergence or adaptive evolution, the overall evolutionary pattern remained highly conserved across grass species. These results imply that expansion of the GDSL family was accompanied by long-term functional conservation following duplication events.

3.4. Comparative Synteny and Evolutionary Conservation of GDSL Genes

To further elucidate the conservation and diversification patterns of the GDSL gene family during plant evolution, comparative synteny analyses were performed among different species (Figure 4A). Overall, GDSL genes exhibited extensive syntenic conservation among grass species; however, the number of syntenic gene pairs varied markedly across different evolutionary lineages and was closely associated with phylogenetic relationships (Supplementary Data S4). Closely related species generally retained substantially more syntenic GDSL gene pairs, whereas only limited collinearity was preserved between distantly related species, reflecting the important roles of genome rearrangement and gene loss in shaping GDSL family architecture during long-term evolution.
Among all species combinations, the highest number of syntenic gene pairs was detected between hexaploid wheat Triticum aestivum and its closely related ancestral species Triticum urartu, with a total of 321 collinear GDSL gene pairs identified. This pattern is highly consistent with the evolutionary history of Triticum urartu as the A-subgenome donor of wheat, indicating that a large proportion of ancestral GDSL loci were retained during wheat polyploidization. In addition, a relatively high number of syntenic gene pairs was also observed between Triticum aestivum and Hordeum vulgare (86 pairs), further supporting strong genomic conservation within Triticeae species. Among C4 grass crops, Zea mays and Sorghum bicolor shared 81 syntenic GDSL gene pairs, while Sorghum bicolor and Setaria italica retained 76 collinear pairs, indicating substantial conservation of GDSL loci among closely related grass species. In contrast, syntenic relationships between the dicot species Arabidopsis thaliana and grass species were dramatically reduced. For example, only six syntenic GDSL gene pairs were detected between Setaria italica and Arabidopsis thaliana, reflecting extensive genomic restructuring of GDSL loci following long-term divergence between monocots and dicots.
Further Ka/Ks analysis revealed that the majority of syntenic GDSL gene pairs exhibited Ka/Ks ratios lower than 1 (Figure 4B), suggesting that these genes have predominantly undergone strong purifying selection during grass evolution. The distributions of Ka and Ks values also differed among species combinations, with closely related species generally displaying lower Ks values, whereas more distantly related species exhibited greater sequence divergence. Collectively, these findings indicate that although the GDSL gene family experienced substantial expansion in grasses, its core functions remained highly conserved throughout evolution, while lineage-specific divergence was mainly associated with differential gene retention and sequence evolutionary rates.

3.5. Comparative Analysis of Cis-Regulatory Elements in GDSL Gene Promoters

To investigate the potential regulatory patterns of GDSL genes in different plant species, cis-regulatory elements within the promoter regions of GDSL genes were systematically analyzed (Figure 5). Based on functional annotation, these elements were mainly classified into four categories, including plant growth and development-related elements, light-responsive elements, plant hormone-responsive elements, and stress-responsive elements. Overall, although the composition of cis-elements was relatively conserved across species, substantial variation in element abundance was observed, which generally correlated with genome complexity and GDSL gene family size.
Among all analyzed species, the hexaploid wheat Triticum aestivum contained the highest number of cis-regulatory elements, showing remarkable enrichment across nearly all element categories. For example, the numbers of TATA-box and CAAT-box elements reached 9473 and 12,078, respectively, which were substantially higher than those observed in other species. In addition, light-responsive elements such as G-box, GT1-motif, and GATA-motif, together with hormone- and stress-related elements including ABRE, ARE, MBS, and LTR, also exhibited pronounced expansion in wheat. In contrast, Arabidopsis thaliana and several diploid grass species, which possess relatively compact genomes and fewer GDSL genes, generally showed lower cis-element abundance. Overall, species with larger and more complex genomes, particularly Triticum aestivum, Triticum urartu, and Aegilops tauschii, tended to contain substantially more promoter elements, suggesting that expansion of the GDSL gene family was accompanied by large-scale retention and accumulation of regulatory sequences during grass evolution.
From a functional perspective, light-responsive elements represented the most abundant cis-element category across all species, among which G-box, Box 4, and GT1-motif were particularly enriched, indicating that GDSL genes may be extensively involved in light-mediated regulatory processes. In addition, hormone-responsive elements such as ABRE, CGTCA-motif, and TGACG-motif were highly enriched in most grass species, implying close associations between GDSL genes and phytohormone signaling pathways, especially ABA- and jasmonate-related responses. Among stress-responsive elements, ARE, MBS, and as-1 were broadly abundant in grass species and showed marked expansion in Triticeae lineages, suggesting potential roles of GDSL genes in stress adaptation and environmental responsiveness. Collectively, the variation in cis-regulatory element composition among species indicates that expansion of the GDSL gene family not only retained core regulatory features but may also have enhanced environmental adaptability through regulatory diversification and promoter evolution.

3.6. PPI Network Analysis of GDSL Family Members

To explore the potential functional associations among GDSL proteins, protein–protein interaction (PPI) networks were constructed for GDSL family members in each analyzed species (Figure 6). The PPI networks in this study were predicted via AraNet2, which is built on Arabidopsis interaction data. We used this dicot-based platform because GDSL proteins have evolutionarily conserved structures and core functions in dicots and monocots, and their interacting partners and network patterns are largely conserved across plant species. All predicted networks are orthology-based in silico results and require subsequent experimental verification. Extensive interaction relationships were detected within the GDSL family across all species, indicating that GDSL proteins may function through highly interconnected regulatory or metabolic networks rather than acting independently. Most GDSL proteins formed dense interaction clusters with multiple neighboring proteins, suggesting widespread functional coordination among GDSL members during plant growth, development, and environmental responses.
Although the overall interaction architecture was generally conserved, clear differences in network complexity were observed among species. Polyploid and large-genome species, particularly Triticum aestivum, exhibited substantially more complex PPI networks, characterized by increased node density and higher interaction connectivity among GDSL proteins. In contrast, diploid species such as Arabidopsis thaliana, Brachypodium distachyon, and Oryza sativa displayed relatively smaller and less interconnected networks. Notably, several highly connected hub proteins were consistently identified in nearly all species, implying that certain GDSL members may play central roles in maintaining conserved biological functions. Collectively, these findings suggest that expansion of the GDSL gene family in grasses was accompanied by increased interaction complexity, while the core interaction framework of GDSL proteins remained evolutionarily conserved across plant lineages.
Nodes represent proteins, where pink circles indicate GDSL proteins and grey circles represent predicted interacting proteins. The size of each node represents the degree of the network. Edges indicate predicted functional associations derived from the AraNet2-based orthology inference framework.

3.7. Expression Profiling of GDSL Genes Based on RNA-Seq Analysis

To investigate the potential roles of GDSL genes in cold stress responses, publicly available RNA-seq datasets were analyzed in rice and wheat under low-temperature treatments (Figure 7). Expression profiling revealed that a substantial proportion of GDSL genes exhibited dynamic transcriptional responses following cold exposure, indicating that GDSL genes show transcriptional responsiveness under low-temperature conditions in grasses. Distinct expression clusters were observed across different treatment stages, with numerous GDSL genes showing either rapid induction or strong repression after cold treatment, suggesting transcriptional variation among GDSL members during stress responses.
In rice, multiple GDSL genes displayed pronounced transcriptional activation during prolonged low-temperature treatment, particularly at LT-3d and LT-4d stages, whereas several genes were transiently downregulated during the early stress period and subsequently recovered at later stages. Similarly, wheat GDSL genes also exhibited extensive expression reprogramming under decreasing temperature conditions, with subsets of genes specifically induced under severe cold treatments (e.g., Tn-M10 to Tn-M25), while others showed preferential expression under mild or recovery conditions. Notably, the transcriptional response patterns in wheat appeared more complex and heterogeneous than those observed in rice, likely reflecting the larger GDSL family size and increased regulatory diversification associated with polyploid wheat evolution. Collectively, these findings suggest that GDSL genes are transcriptionally responsive to cold stress and may be associated with low-temperature adaptation processes in grasses.
To verify the reliability of the above transcriptome results as proposed by the reviewer, eight rice and eight wheat representative GDSL genes with distinct cold-responsive trends were quantified via qRT-PCR (Supplementary Figure S2). All tested genes exhibited consistent up-/downregulation trends between qRT-PCR and RNA-seq data, which confirms the authenticity of the cold-induced transcriptional patterns of GDSL family genes.

4. Discussion

Plant GDSL esterase/lipase family represents one of the most functionally diverse and evolutionarily expanded groups of hydrolytic enzymes, with critical roles in plant development, lipid metabolism, pathogen defense, and abiotic stress adaptation [1,4,12,13]. In the present study, we systematically identified 1707 GDSL genes across nine representative grass species and Arabidopsis thaliana, revealing substantial and lineage-specific expansion of the GDSL family in Poaceae, especially in hexaploid wheat. Previous genome-wide surveys similarly documented remarkable GDSL expansion in polyploid crops such as soybean and wheat, supporting the view that recurrent whole-genome duplication (WGD) events are major drivers of the evolutionary diversification of this gene family [16,21]. Our results further demonstrated that despite dramatic copy-number divergence across species, core physicochemical properties—including protein length, molecular weight, and hydrophilicity—remained highly conserved. This evolutionary pattern indicates that functional conservation has been stably maintained during large-scale family expansion, whereas functional diversification likely arose primarily via regulatory evolution and expression divergence rather than drastic structural innovation. Analogous trends have been reported for other large stress-associated gene families in grasses, where polyploidization promoted gene retention while preserving catalytic domains essential for core stress resilience [55].
Phylogenetic reconstruction and duplication analyses further revealed that GDSL family expansion in grasses was highly asymmetric among evolutionary clades. Specifically, subgroup C3-1 represented the largest clade in all surveyed grass species, whereas subgroup C2 remained numerically constrained and evolutionarily conserved. More notably, the C3-2 clade was uniquely present in Poaceae and completely absent from Arabidopsis, implying the emergence of grass-specific GDSL lineages during monocot evolution, likely associated with adaptive innovations in stress responses and developmental specialization. Similar lineage-specific expansion patterns have been widely observed in grass-adaptive gene families linked to environmental responsiveness and niche adaptation [56]. MCScanX analysis further confirmed that WGD/segmental duplication was the predominant mechanism underlying GDSL expansion, especially in Triticeae. This observation aligns closely with the well-established evolutionary history of wheat and related grasses, whose genomes have undergone repeated rounds of polyploidization and chromosomal rearrangement [57]. Moreover, the majority of duplicated GDSL gene pairs exhibited Ka/Ks ratios < 1, indicating strong purifying selection has preserved most duplicated copies after expansion. Such evolutionary conservation suggests that most retained GDSL duplicates continue to perform essential biological functions, with only a limited subset undergoing neofunctionalization or adaptive divergence to fulfill specialized roles.
The analysis of nonsynonymous (Ka) and synonymous (Ks) substitution rates revealed that most duplicated GDSL gene pairs exhibit Ka/Ks < 1, indicating strong purifying selection acting on this gene family in grasses. This pattern is broadly conserved across all analyzed species, suggesting that functional constraints are maintained throughout GDSL gene evolution in Poaceae. However, lineage-specific variations were also observed. In particular, a subset of tandemly duplicated genes in polyploid wheat species exhibited relatively elevated Ka/Ks values, suggesting relaxed purifying selection or early-stage functional divergence following recent duplication events. These lineage-specific deviations highlight ongoing subfunctionalization processes in certain evolutionary contexts.
A growing body of evidence highlights the pivotal roles of GDSL proteins in abiotic stress responses, particularly under drought, salinity, and cold conditions [58,59]. Consistent with these prior studies, our promoter cis-element analysis identified substantial enrichment of stress- and hormone-responsive motifs, including ABRE, MBS, LTR, TGACG-motif, and CGTCA-motif, implicating GDSL genes in extensive ABA-, jasmonate-, and low-temperature-associated regulatory networks. Notably, Triticeae species displayed pronounced accumulation of these regulatory elements, implying enhanced regulatory complexity during grass evolutionary adaptation. Importantly, several of the identified cis-elements, particularly LTR (low-temperature responsive elements) and dehydration-responsive motifs, are known to be associated with upstream CBF/DREB-mediated cold signaling pathways in plants. This suggests that GDSL genes may be transcriptionally regulated by the CBF/DREB regulatory module under cold stress conditions, providing a mechanistic link between promoter architecture and canonical cold-responsive signaling networks in grasses. In parallel, RNA-seq analysis revealed that numerous GDSL genes in both rice and wheat exhibited dynamic transcriptional responses to low-temperature treatment, with several genes strongly induced under prolonged or severe cold stress. Wheat exhibited more heterogeneous and complex expression patterns than rice, likely reflecting the larger family size and greater regulatory diversification conferred by polyploid evolution. Previous research has established that cold-induced lipid remodeling and membrane stabilization are pivotal for plant freezing tolerance, and multiple GDSL lipases have been functionally linked to these adaptive processes [16]. Accordingly, the extensive transcriptional reprogramming of GDSL genes observed in this study strongly supports our working hypothesis that GDSL genes contribute to cold adaptation in grasses through diversified regulatory and metabolic functions. Independent qRT-PCR quantification of 8 rice and 8 wheat genes further verified the reliability of these transcriptomic expression results. Under drought stress, GDSL genes in Agropyron mongolicum and maize improve drought resistance by regulating lipid metabolism and stomatal structure. OsMbl1 in rice reduces blast susceptibility by inhibiting the lipase activity of OsGdsl1 [60], further verifying the conserved functions of the GDSL family in biotic/abiotic stresses.
Future research directions should prioritize three key areas to advance understanding of grass GDSL gene function and evolution. First, functional validation of grass-specific C3-2 clade genes via CRISPR/Cas9 mutagenesis and overexpression will clarify their specialized roles in monocot-specific stress adaptation. Second, integrative multi-omics analyses (transcriptomics, lipidomics, and metabolomics) under varying cold stress regimes will dissect the molecular mechanisms by which GDSL genes regulate lipid remodeling and membrane stability in temperate grasses. Studies on seed fatty acid reducer genes in soybean and potato tuber development provide methodological references for verifying GDSL downstream metabolic pathways [16,61], and lipidome association analysis can expand the substrate and function research of Poaceae GDSL [62,63]. Third, population genomic studies across natural grass populations will explore the adaptive significance of GDSL gene polymorphisms in cold-prone environments, facilitating the identification of elite alleles for molecular breeding of cold-tolerant cereal crops. Collectively, these efforts will bridge evolutionary patterns and functional mechanisms of GDSL genes, providing valuable genetic resources and theoretical foundations for improving abiotic stress resistance in staple grass crops.

5. Conclusions

In this study, a comprehensive comparative genomic analysis of the GDSL esterase/lipase gene family was performed across nine representative grass species and Arabidopsis thaliana. A total of 1707 GDSL genes were identified, revealing substantial expansion of the GDSL family in grasses, particularly in polyploid wheat. Phylogenetic and duplication analyses demonstrated that whole-genome duplication and segmental duplication were the major evolutionary forces driving GDSL family expansion, while most duplicated genes remained under strong purifying selection during grass evolution. Comparative synteny analysis further revealed extensive conservation of GDSL loci among closely related grass species, especially within Triticeae lineages. In addition, promoter analysis indicated widespread enrichment of stress-, hormone-, and light-responsive cis-elements, suggesting increasing regulatory complexity accompanying GDSL family expansion. PPI networks revealed extensive functional connectivity among GDSL proteins, whereas RNA-seq analyses demonstrated dynamic transcriptional responses of numerous GDSL genes under low-temperature stress in both rice and wheat. Collectively, our findings suggest that the evolution of the GDSL gene family in grasses is driven by coordinated genomic and regulatory mechanisms. Whole-genome duplication and polyploidization provide the primary source of gene expansion, while lineage-specific retention and regulatory divergence drive functional innovation. The emergence of a grass-specific clade (C3-2) further highlights monocot-specific evolutionary innovation in this gene family.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biology15131005/s1, Figure S1: Intra-species collinearity and duplication patterns of GDSL genes in grasses. Circos plots showing chromosomal distribution, duplication types, and intra-species collinearity relationships of GDSL genes in representative plant species. (A) Arabidopsis thaliana; (B) Setaria italica; (C) Sorghum bicolor; (D) Zea mays; (E) Brachypodium distachyon; (F) Hordeum vulgare; (G) Aegilops tauschii; (H) Triticum urartu. Figure S2: qRT-PCR quantification validates cold-responsive expression patterns of eight rice and eight wheat GDSL genes under low-temperature stress treatments. Supplementary Data S1: Primer sequences used for qRT-PCR. Supplementary Data S2: Analysis of physicochemical properties of GDSL gene family in 10 species. Supplementary Data S3: Duplicate gene pairs of GDSL gene family in 10 species. Supplementary Data S4: Synteny gene pairs of GDSL gene family in 10 species.

Author Contributions

Conceptualization, H.C. (Hongguo Chen). and X.Z.; methodology, Q.Z., L.G. and X.W.; software, Q.Z., H.M. and Y.W.; validation, X.G., Q.Z., L.G., X.W. and J.Z.; formal analysis, Z.L., Q.Z. and J.Y.; investigation, X.W., H.L. and X.C.; resources, X.G., Y.Z., H.M. and Z.L.; data curation, Y.Z., Q.Z., H.C. (Hongxi Chen) and L.S.; writing—original draft preparation, Q.Z.; writing—review and editing, X.Z. and H.C. (Hongguo Chen); visualization, Q.Z. and Y.W.; supervision, H.C. (Hongguo Chen) and X.Z.; project administration, H.C. (Hongguo Chen); funding acquisition, H.C. (Hongguo Chen). All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by grants from the Natural Science Foundation of Hubei Province (2026AFC0529, 2024AFB1057 and 2025AFB857), the National Natural Science Foundation of China (Grant Nos. 32271951, 32372754, 32101581 and 32401647), and a PhD grant from the Hubei University of Science and Technology (BK202327), the Open Research Fund of State Key Laboratory of Hybrid Rice (Wuhan University) KF202503.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article; further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Akoh, C.C.; Lee, G.C.; Liaw, Y.C.; Huang, T.H.; Shaw, J.F. GDSL family of serine esterases/lipases. Prog. Lipid Res. 2004, 43, 534–552. [Google Scholar] [CrossRef] [PubMed]
  2. Zhang, H.; Zhang, X.; Zhao, J.; Sun, L.; Wang, H.; Zhu, Y.; Xiao, J.; Wang, X. Genome-Wide Identification of GDSL-Type Esterase/Lipase Gene Family in Dasypyrum villosum L. Reveals That DvGELP53 Is Related to BSMV Infection. Int. J. Mol. Sci. 2021, 22, 12317. [Google Scholar] [CrossRef] [PubMed]
  3. Lv, J.; Dai, C.B.; Wang, W.F.; Sun, Y.H. Genome-wide identification of the tobacco GDSL family and apical meristem-specific expression conferred by the GDSL promoter. BMC Plant Biol. 2021, 21, 501. [Google Scholar] [CrossRef] [PubMed]
  4. Chepyshko, H.; Lai, C.P.; Huang, L.M.; Liu, J.H.; Shaw, J.F. Multifunctionality and diversity of GDSL esterase/lipase gene family in rice (Oryza sativa L. japonica) genome: New insights from bioinformatics analysis. BMC Genom. 2012, 13, 309. [Google Scholar] [CrossRef] [PubMed]
  5. Volokita, M.; Rosilio-Brami, T.; Rivkin, N.; Zik, M. Combining comparative sequence and genomic data to ascertain phylogenetic relationships and explore the evolution of the large GDSL-lipase family in land plants. Mol. Biol. Evol. 2011, 28, 551–565. [Google Scholar] [CrossRef] [PubMed]
  6. Ding, Y.; Xing, L.; Xu, J.; Jiang, T.; Tang, X.; Wang, Y.; Huang, S.; Hao, W.; Zhou, X.; Zhang, Y.; et al. Genome-wide exploration of the GDSL-type esterase/lipase gene family in rapeseed reveals several BnGELP proteins active during early seedling development. Front. Plant Sci. 2023, 14, 1139972. [Google Scholar] [CrossRef] [PubMed]
  7. Kim, H.G.; Kwon, S.J.; Jang, Y.J.; Nam, M.H.; Chung, J.H.; Na, Y.-C.; Guo, H.; Park, O.K. GDSL LIPASE1 modulates plant immunity through feedback regulation of ethylene signaling. Plant Physiol. 2013, 163, 1776–1791. [Google Scholar] [CrossRef] [PubMed]
  8. Wang, Z.; Wan, W.; Shi, M.; Ji, S.; Zhang, L.; Wang, X.; Zhang, L.; Cui, H.; Liu, X.; Sun, H.; et al. GDSL in Lilium pumilum (LpGDSL) Confers Saline-Alkali Resistance to the Plant by Enhancing the Lignin Content and Balancing the ROS. Int. J. Mol. Sci. 2024, 25, 9319. [Google Scholar] [CrossRef] [PubMed]
  9. Mallavergne, A.; Hilbert, J.L.; Gagneul, D. Completing the acylation landscape in plants: GDSL enzymes join the BAHD and SCPL acyltransferase families. J. Exp. Bot. 2026, 77, erag170. [Google Scholar] [CrossRef] [PubMed]
  10. Yeats, T.H.; Martin, L.B.; Viart, H.M.; Isaacson, T.; He, Y.; Zhao, L.; Matas, A.J.; Buda, G.J.; Domozyck, D.S.; Clausen, M.H.; et al. The identification of cutin synthase: Formation of the plant polyester cutin. Nat. Chem. Biol. 2012, 8, 609–611. [Google Scholar] [CrossRef] [PubMed]
  11. Hong, J.K.; Choi, H.W.; Hwang, I.S.; Kim, D.S.; Kim, N.H.; Choi, D.S.; Kim, Y.J.; Hwang, B.K. Function of a novel GDSL-type pepper lipase gene, CaGLIP1, in disease susceptibility and abiotic stress tolerance. Planta 2008, 227, 539–558. [Google Scholar] [CrossRef] [PubMed]
  12. Girard, A.-L.; Mounet, F.; Lemaire-Chamley, M.; Gaillard, C.; Elmorjani, K.; Vivancos, J.; Runavot, J.-L.; Quemener, B.; Petit, J.; Germain, V.; et al. Tomato GDSL1 is required for cutin deposition in the fruit cuticle. Plant Cell 2012, 24, 3119–3134. [Google Scholar] [CrossRef] [PubMed]
  13. Shen, G.; Sun, W.; Chen, Z.; Shi, L.; Hong, J.; Shi, J. Plant GDSL Esterases/Lipases: Evolutionary, Physiological and Molecular Functions in Plant Development. Plants 2022, 11, 468. [Google Scholar] [CrossRef] [PubMed]
  14. Ji, Z.; Wang, M.; Zhang, S.; Du, Y.; Cong, J.; Yan, H.; Guo, H.; Xu, B.; Zhou, Z. GDSL Esterase/Lipase GELP1 Involved in the Defense of Apple Leaves against Colletotrichum gloeosporioides Infection. Int. J. Mol. Sci. 2023, 24, 10343. [Google Scholar] [CrossRef] [PubMed]
  15. Ding, L.-N.; Hu, Y.-H.; Li, T.; Li, M.; Li, Y.-T.; Wu, Y.-Z.; Cao, J.; Tan, X.-L. A GDSL motif-containing lipase modulates Sclerotinia sclerotiorum resistance in Brassica napus. Plant Physiol. 2024, 196, 2973–2988. [Google Scholar] [CrossRef] [PubMed]
  16. Su, H.G.; Zhang, X.H.; Wang, T.T.; Wei, W.L.; Wang, Y.X.; Chen, J.; Zhou, Y.B.; Chen, M.; Ma, Y.Z.; Xu, Z.S.; et al. Genome-Wide Identification, Evolution, and Expression of GDSL-Type Esterase/Lipase Gene Family in Soybean. Front. Plant Sci. 2020, 11, 726. [Google Scholar] [CrossRef] [PubMed]
  17. Shakiba, M.H.; Ali, M.S.M.; Rahman, R.N.Z.R.A.; Salleh, A.B.; Leow, T.C. Cloning, expression and characterization of a novel cold-adapted GDSL family esterase from Photobacterium sp. strain J15. Extremophiles 2016, 20, 44–55. [Google Scholar] [CrossRef] [PubMed]
  18. Gao, M.; Yin, X.; Yang, W.; Lam, S.M.; Tong, X.; Liu, J.; Wang, X.; Li, Q.; Shui, G.; He, Z. GDSL lipases modulate immunity through lipid homeostasis in rice. PLoS Pathog. 2017, 13, e1006724. [Google Scholar] [CrossRef] [PubMed]
  19. Kwon, S.J.; Jin, H.C.; Lee, S.; Nam, M.H.; Chung, J.H.; Kwon, S.I.; Ryu, C.-M.; Park, O.K. GDSL lipase-like 1 regulates systemic resistance associated with ethylene signaling in Arabidopsis. Plant J. 2009, 58, 235–245. [Google Scholar] [CrossRef] [PubMed]
  20. Han, X.; Li, S.; Zhang, M.; Yang, L.; Liu, Y.; Xu, J.; Zhang, S. Regulation of GDSL Lipase Gene Expression by the MPK3/MPK6 Cascade and Its Downstream WRKY Transcription Factors in Arabidopsis Immunity. Mol. Plant-Microbe Interact. 2019, 32, 673–684. [Google Scholar] [CrossRef]
  21. He, F.; Du, D.; Chen, Z.; Ma, L.; Liu, Y.; Li, Z.; Song, L.; Jiang, Z.; Fan, Y.; Sun, Q.; et al. A natural variation in the promoter of TaGDSL-7D contributes to grain weight and yield in wheat. Plant Biotechnol. J. 2025, 23, 3851–3863. [Google Scholar] [CrossRef] [PubMed]
  22. Tang, J.; Li, J.; Tang, C.; Han, X.; Zhang, H.; Yang, B.; Xiao, L.; Li, R.; Liu, H.; Pi, D.; et al. Genome-wide identification and characterization of the GDSL lipase gene family in Dendrobium catenatum and their potential role in drought stress tolerance and stomatal outer cuticular ledge formation. Front. Plant Sci. 2025, 16, 1722133. [Google Scholar] [CrossRef] [PubMed]
  23. Wang, J.; Zhao, H.; Qu, Y.; Yang, P.; Huang, J. The binding pocket properties were fundamental to functional diversification of the GDSL-type esterases/lipases gene family in cotton. Front. Plant Sci. 2023, 13, 1099673. [Google Scholar] [CrossRef] [PubMed]
  24. Cenci, A.; Concepción-Hernández, M.; Guignon, V.; Angenon, G.; Rouard, M. Genome-Wide Classification and Phylogenetic Analyses of the GDSL-Type Esterase/Lipase (GELP) Family in Flowering Plants. Int. J. Mol. Sci. 2022, 23, 12114. [Google Scholar] [CrossRef] [PubMed]
  25. Duan, L.; Wang, F.; Shen, H.; Xie, S.; Chen, X.; Xie, Q.; Li, R.; Cao, A.; Li, H. Identification, evolution, and expression of GDSL-type Esterase/Lipase (GELP) gene family in three cotton species: A bioinformatic analysis. BMC Genom. 2023, 24, 795. [Google Scholar] [CrossRef] [PubMed]
  26. Vujaklija, I.; Bielen, A.; Paradžik, T.; Biđin, S.; Goldstein, P.; Vujaklija, D. An effective approach for annotation of protein families with low sequence similarity and conserved motifs: Identifying GDSL hydrolases across the plant kingdom. BMC Bioinform. 2016, 17, 91. [Google Scholar] [CrossRef] [PubMed]
  27. Wang, Z.; Cai, C.; Yu, Z.; Reyimu, Y.; Han, X.; Lv, M.; Zhang, Z.; Li, W.; Zhu, G.; Guo, W. A GDSL esterase/lipase gene GbGELP identified from a fiber micronaire QTL qMIC-A11 modulates cell elongation and fiber development. Theor. Appl. Genet. 2025, 138, 126. [Google Scholar] [CrossRef] [PubMed]
  28. Sun, A.; Yu, B.; Zhang, Q.; Peng, Y.; Yang, J.; Sun, Y.; Qin, P.; Jia, T.; Smeekens, S.; Teng, S. MYC2-Activated TRICHOME BIREFRINGENCE-LIKE37 Acetylates Cell Walls and Enhances Herbivore Resistance. Plant Physiol. 2020, 184, 1083–1096. [Google Scholar] [CrossRef] [PubMed]
  29. Liu, S.; Geng, S.; Dreisigacker, S. Genomics in wheat improvement: Progress and perspectives. Plant Genome 2026, 19, e70176. [Google Scholar] [CrossRef] [PubMed]
  30. Calfee, E.; Gates, D.; Lorant, A.; Perkins, M.T.; Coop, G.; Ross-Ibarra, J. Selective sorting of ancestral introgression in maize and teosinte along an elevational cline. PLoS Genet. 2021, 17, e1009810. [Google Scholar] [CrossRef] [PubMed]
  31. Paterson, A.H.; Bowers, J.E.; Chapman, B.A. Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc. Natl. Acad. Sci. USA 2004, 101, 9903–9908. [Google Scholar] [CrossRef]
  32. Levy, A.A.; Feldman, M. Evolution and origin of bread wheat. Plant Cell 2022, 34, 2549–2567. [Google Scholar] [CrossRef] [PubMed]
  33. Pont, C.; Salse, J. Wheat paleohistory created asymmetrical genomic evolution. Curr. Opin. Plant Biol. 2017, 36, 29–37. [Google Scholar] [CrossRef] [PubMed]
  34. Schnable, J.C.; Springer, N.M.; Freeling, M. Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. Proc. Natl. Acad. Sci. USA 2011, 108, 4069–4074. [Google Scholar] [CrossRef] [PubMed]
  35. Yan, X.; Wu, X.; Sun, F.; Nie, H.; Du, X.; Li, X.; Fang, Y.; Zhai, Y.; Zhao, Y.; Fan, B.; et al. Cloning and Functional Study of AmGDSL1 in Agropyron mongolicum. Int. J. Mol. Sci. 2024, 25, 9467. [Google Scholar] [CrossRef] [PubMed]
  36. Lubba, K.M.; Yamamori, K.; Kishima, Y. Haplotype shifts in the lipid-related OsGELP gene family underpin rice adaptation to high latitudes. Sci. Rep. 2025, 15, 32293. [Google Scholar] [CrossRef] [PubMed]
  37. Li, J.; Hu, Z.; Hao, Z.; Sun, B.; Ye, Z.; Yang, G. Genetic Analysis, Transcriptome Analysis, and Candidate Major Genes Screening of Peduncle Length Trait in Brewing Sorghum [Sorghum bicolor (L.) Moench]. Genes 2026, 17, 362. [Google Scholar] [CrossRef] [PubMed]
  38. Basunanda, P.; Radoev, M.; Ecke, W.; Friedt, W.; Becker, H.C.; Snowdon, R.J. Comparative mapping of quantitative trait loci involved in heterosis for seedling and yield traits in oilseed rape (Brassica napus L.). Theor. Appl. Genet. 2010, 120, 271–281. [Google Scholar] [CrossRef] [PubMed]
  39. Goodstein, D.M.; Shu, S.; Howson, R.; Neupane, R.; Hayes, R.D.; Fazo, J.; Mitros, T.; Dirks, W.; Hellsten, U.; Putnam, N.; et al. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40, D1178–D1186. [Google Scholar] [CrossRef] [PubMed]
  40. Bolser, D.; Staines, D.M.; Pritchard, E.; Kersey, P. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomics Data. Methods Mol. Biol. 2016, 1374, 115–140. [Google Scholar] [CrossRef] [PubMed]
  41. Lamesch, P.; Berardini, T.Z.; Li, D.; Swarbreck, D.; Wilks, C.; Sasidharan, R.; Muller, R.; Dreher, K.; Alexander, D.L.; Garcia-Hernandez, M.; et al. The Arabidopsis Information Resource (TAIR): Improved gene annotation and new tools. Nucleic Acids Res. 2012, 40, D1202–D1210. [Google Scholar] [CrossRef] [PubMed]
  42. Hunter, S.; Apweiler, R.; Attwood, T.K.; Bairoch, A.; Bateman, A.; Binns, D.; Bork, P.; Das, U.; Daugherty, L.; Duquenne, L.; et al. InterPro: The integrative protein signature database. Nucleic Acids Res. 2009, 37, D211–D215. [Google Scholar] [CrossRef] [PubMed]
  43. Mistry, J.; Finn, R.D.; Eddy, S.R.; Bateman, A.; Punta, M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 2013, 41, e121. [Google Scholar] [CrossRef] [PubMed]
  44. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef] [PubMed]
  45. Mistry, J.; Chuguransky, S.; Williams, L.; Qureshi, M.; Salazar, G.A.; Sonnhammer, E.L.L.; Tosatto, S.C.E.; Paladin, L.; Raj, S.; Richardson, L.J.; et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021, 49, D412–D419. [Google Scholar] [CrossRef] [PubMed]
  46. Edgar, R.C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [PubMed]
  47. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef] [PubMed]
  48. Wilkins, D. gggenes: Draw Gene Arrows with ggplot2; R Package Version 0.5.0; R Foundation for Statistical Computing: Vienna, Austria, 2020; Available online: https://CRAN.R-project.org/package=gggenes (accessed on 20 April 2026).
  49. Lescot, M.; Déhais, P.; Thijs, G.; Marchal, K.; Moreau, Y.; Van de Peer, Y.; Rouzé, P.; Rombauts, S. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002, 30, 325–327. [Google Scholar] [CrossRef] [PubMed]
  50. Lee, T.; Yang, S.; Kim, E.; Ko, Y.; Hwang, S.; Shin, J.; Shim, J.E.; Shim, H.; Kim, H.; Kim, C.; et al. AraNet v2: An improved database of co-functional gene networks for the study of Arabidopsis thaliana and 27 other nonmodel plant species. Nucleic Acids Res. 2015, 43, D996–D1002. [Google Scholar] [CrossRef] [PubMed]
  51. Cantalapiedra, C.P.; Hernández-Plaza, A.; Letunic, I.; Bork, P.; Huerta-Cepas, J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol. Biol. Evol. 2021, 38, 5825–5829. [Google Scholar] [CrossRef] [PubMed]
  52. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An Ultra-Fast All-in-One FASTQ Preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef] [PubMed]
  53. Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef] [PubMed]
  54. Liao, Y.; Smyth, G.K.; Shi, W. featureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 2014, 30, 923–930. [Google Scholar] [CrossRef] [PubMed]
  55. Qiao, X.; Li, Q.; Yin, H.; Qi, K.; Li, L.; Wang, R.; Zhang, S.; Paterson, A.H. Gene duplication and evolution in recurring polyploidization-diploidization cycles in plants. Genome Biol. 2019, 20, 38. [Google Scholar] [CrossRef] [PubMed]
  56. Flagel, L.E.; Wendel, J.F. Gene duplication and evolutionary novelty in plants. New Phytol. 2009, 183, 557–564. [Google Scholar] [CrossRef] [PubMed]
  57. Feliner, G.N.; Casacuberta, J.; Wendel, J.F. Genomics of Evolutionary Novelty in Hybrids and Polyploids. Front. Genet. 2020, 11, 792. [Google Scholar] [CrossRef] [PubMed]
  58. Chiconato, D.A.; Costa, M.G.d.S.; Balbuena, T.S.; Munns, R.; Dos Santos, D.M.M.M. Proteomic analysis of young sugarcane plants with contrasting salt tolerance. Funct. Plant Biol. 2021, 48, 588–596. [Google Scholar] [CrossRef] [PubMed]
  59. Naranjo, M.A.; Forment, J.; Roldán, M.; Serrano, R.; Vicente, O. Overexpression of Arabidopsis thaliana LTL1, a salt-induced gene encoding a GDSL-motif lipase, increases salt tolerance in yeast and transgenic plants. Plant Cell Environ. 2006, 29, 1890–1900. [Google Scholar] [CrossRef] [PubMed]
  60. Song, L.; Yang, T.; Abubakar, Y.S.; Han, Y.; Zhang, R.; Li, Y.; Ye, W.; Lu, G.-D. OsMbl1 Counteracts OsGdsl1-Mediated Rice Blast Susceptibility by Inhibiting Its Lipase Activity. Plant Cell Environ. 2025, 48, 5650–5663. [Google Scholar] [CrossRef]
  61. Zinta, R.; Tiwari, J.K.; Buckseth, T.; Goutam, U.; Singh, R.K.; Thakur, A.K.; Singh, S.; Kumar, V.; Kumar, M. Phenotypic and transcriptomics characterization uncovers genes underlying tuber yield traits and gene expression marker development in potato under aeroponics. Planta 2024, 260, 74. [Google Scholar] [CrossRef] [PubMed]
  62. Strable, J. Keeping the Ethylene Response Fluid: GDSL Lipase MHZ11 Modulates Sterol Levels and Ethylene Signaling in Rice Roots. Plant Cell 2020, 32, 1352–1353. [Google Scholar] [CrossRef] [PubMed]
  63. Zhao, H.; Ma, B.; Duan, K.-X.; Li, X.-K.; Lu, X.; Yin, C.-C.; Tao, J.-J.; Wei, W.; Zhang, W.-K.; Xin, P.-Y.; et al. The GDSL Lipase MHZ11 Modulates Ethylene Signaling in Rice Roots. Plant Cell 2020, 32, 1626–1643. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Identification and physicochemical characterization of GDSL family members in 10 species investigated in this study. Protein length, molecular weight, grand average of hydropathicity (GRAVY), and theoretical isoelectric point (pI) were predicted using the ProtParam tool (https://web.expasy.org/protparam/ (accessed on 25 April 2026)).
Figure 1. Identification and physicochemical characterization of GDSL family members in 10 species investigated in this study. Protein length, molecular weight, grand average of hydropathicity (GRAVY), and theoretical isoelectric point (pI) were predicted using the ProtParam tool (https://web.expasy.org/protparam/ (accessed on 25 April 2026)).
Biology 15 01005 g001
Figure 2. Phylogenetic classification and evolutionary diversification of GDSL proteins in grasses. (A) Maximum-likelihood phylogenetic tree constructed using GDSL protein sequences from nine representative Poaceae species and Arabidopsis thaliana. (B) Comparative distribution of GDSL subgroups among different species.
Figure 2. Phylogenetic classification and evolutionary diversification of GDSL proteins in grasses. (A) Maximum-likelihood phylogenetic tree constructed using GDSL protein sequences from nine representative Poaceae species and Arabidopsis thaliana. (B) Comparative distribution of GDSL subgroups among different species.
Biology 15 01005 g002
Figure 3. Intra-species collinearity and duplication patterns of GDSL genes in grasses. Circos plots showing chromosomal distribution, duplication types, and intra-species collinearity relationships of GDSL genes in representative plant species. (A) Oryza sativa Japonica; (B) Triticum aestivum. (C) Distribution of Ka, Ks, and Ka/Ks values for tandem duplicated and WGD-derived GDSL gene pairs across different species.
Figure 3. Intra-species collinearity and duplication patterns of GDSL genes in grasses. Circos plots showing chromosomal distribution, duplication types, and intra-species collinearity relationships of GDSL genes in representative plant species. (A) Oryza sativa Japonica; (B) Triticum aestivum. (C) Distribution of Ka, Ks, and Ka/Ks values for tandem duplicated and WGD-derived GDSL gene pairs across different species.
Biology 15 01005 g003
Figure 4. Comparative synteny and evolutionary conservation of GDSL genes among representative grass species. (A) Cross-species collinearity relationships of GDSL genes among representative Poaceae species and Arabidopsis thaliana. (B) Distribution of Ka, Ks, and Ka/Ks values of orthologous GDSL gene pairs among different species combinations.
Figure 4. Comparative synteny and evolutionary conservation of GDSL genes among representative grass species. (A) Cross-species collinearity relationships of GDSL genes among representative Poaceae species and Arabidopsis thaliana. (B) Distribution of Ka, Ks, and Ka/Ks values of orthologous GDSL gene pairs among different species combinations.
Biology 15 01005 g004
Figure 5. Comparative analysis of cis-regulatory elements in the 2 kb promoter regions of GDSL genes across representative plant species. Cis-elements identified in promoter sequences were classified into four major functional categories, namely, (A) light-responsive elements, plant growth and development-related elements; (B) plant hormone-responsive elements, and stress-responsive elements.
Figure 5. Comparative analysis of cis-regulatory elements in the 2 kb promoter regions of GDSL genes across representative plant species. Cis-elements identified in promoter sequences were classified into four major functional categories, namely, (A) light-responsive elements, plant growth and development-related elements; (B) plant hormone-responsive elements, and stress-responsive elements.
Biology 15 01005 g005
Figure 6. Predicted protein–protein interaction (PPI) networks of GDSL proteins across representative plant species. (A) Arabidopsis thaliana; (B) Setaria italica; (C) Sorghum bicolor; (D) Zea mays; (E) Oryza sativa Japonica; (F) Brachypodium distachyon; (G) Hordeum vulgare; (H) Aegilops tauschii; (I) Triticum urartu; and (J) Triticum aestivum.
Figure 6. Predicted protein–protein interaction (PPI) networks of GDSL proteins across representative plant species. (A) Arabidopsis thaliana; (B) Setaria italica; (C) Sorghum bicolor; (D) Zea mays; (E) Oryza sativa Japonica; (F) Brachypodium distachyon; (G) Hordeum vulgare; (H) Aegilops tauschii; (I) Triticum urartu; and (J) Triticum aestivum.
Biology 15 01005 g006
Figure 7. Transcriptional responses of GDSL genes under low-temperature stress in rice and wheat. (A) Heatmap showing expression profiles of GDSL genes in Oryza sativa ssp. japonica under low-temperature treatment at different time points, including Control, LT-1 d, LT-2 d, LT-3 d, and LT-4 d. LT indicates low-temperature treatment at 17 °C, and 1 d, 2 d, 3 d, and 4 d represent 1, 2, 3, and 4 days of treatment, respectively. (B) Heatmap showing expression patterns of GDSL genes in Triticum aestivum under a gradient of low-temperature stress conditions (Tn-M25 to Tn-5), corresponding to 5 °C, 0 °C, −5 °C, −10 °C, −15 °C, −20 °C, and −25 °C, respectively. Expression values were derived from publicly available RNA-seq datasets.
Figure 7. Transcriptional responses of GDSL genes under low-temperature stress in rice and wheat. (A) Heatmap showing expression profiles of GDSL genes in Oryza sativa ssp. japonica under low-temperature treatment at different time points, including Control, LT-1 d, LT-2 d, LT-3 d, and LT-4 d. LT indicates low-temperature treatment at 17 °C, and 1 d, 2 d, 3 d, and 4 d represent 1, 2, 3, and 4 days of treatment, respectively. (B) Heatmap showing expression patterns of GDSL genes in Triticum aestivum under a gradient of low-temperature stress conditions (Tn-M25 to Tn-5), corresponding to 5 °C, 0 °C, −5 °C, −10 °C, −15 °C, −20 °C, and −25 °C, respectively. Expression values were derived from publicly available RNA-seq datasets.
Biology 15 01005 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Q.; Wen, X.; Li, H.; Zou, J.; Yang, J.; Cai, X.; Gong, X.; Zhang, Y.; Li, Z.; Chen, H.; et al. Evolutionary Expansion and Diversification of the GDSL Gene Family in Grasses. Biology 2026, 15, 1005. https://doi.org/10.3390/biology15131005

AMA Style

Zhang Q, Wen X, Li H, Zou J, Yang J, Cai X, Gong X, Zhang Y, Li Z, Chen H, et al. Evolutionary Expansion and Diversification of the GDSL Gene Family in Grasses. Biology. 2026; 15(13):1005. https://doi.org/10.3390/biology15131005

Chicago/Turabian Style

Zhang, Qian, Xin Wen, Huan Li, Jingjing Zou, Jie Yang, Xuan Cai, Xusheng Gong, Yingting Zhang, Zeqing Li, Hongxi Chen, and et al. 2026. "Evolutionary Expansion and Diversification of the GDSL Gene Family in Grasses" Biology 15, no. 13: 1005. https://doi.org/10.3390/biology15131005

APA Style

Zhang, Q., Wen, X., Li, H., Zou, J., Yang, J., Cai, X., Gong, X., Zhang, Y., Li, Z., Chen, H., Shi, L., Wu, Y., Gong, L., Ma, H., Chen, H., & Zeng, X. (2026). Evolutionary Expansion and Diversification of the GDSL Gene Family in Grasses. Biology, 15(13), 1005. https://doi.org/10.3390/biology15131005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop