Next Article in Journal
Functional Characterization of a Novel PBX1 De Novo Missense Variant Identified in a Pediatric Patient with CAKUT
Previous Article in Journal
Clinical Utility of Opportunistic Genome-Wide cfDNA Prenatal Screening in Intermediate-Risk Pregnancies
Previous Article in Special Issue
Genome-Wide Identification of Juglans regia GABA Transcription Factors and Expression Pattern Analysis in Response to Abiotic Stress
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Identification and Expression Profiling of the GRAS Gene Family in Taxus cuspidate

1
Department of Seed Science and Engineering, Liaodong University, Dandong 118003, China
2
Beijing Tianhaoyuan Agricultural Technology Development Co., Ltd., Beijing 100081, China
*
Author to whom correspondence should be addressed.
Genes 2025, 16(11), 1345; https://doi.org/10.3390/genes16111345
Submission received: 16 October 2025 / Revised: 2 November 2025 / Accepted: 5 November 2025 / Published: 7 November 2025
(This article belongs to the Special Issue Molecular Mechanisms of Adaptive Evolution in Trees)

Abstract

Background: The GRAS transcription factor family plays pivotal regulatory roles in plant growth and development, hormone signaling, and responses to environmental stresses. Taxus cuspidata, an endangered and nationally protected conifer species endemic to China, is recognized as the sole natural source of paclitaxel—an anti-cancer compound of considerable pharmaceutical importance. While GRAS family genes have been systematically characterized in various plant species, a comprehensive investigation in T. cuspidata has yet to be conducted. Methods: In this study, 26 TcGRAS genes were identified and characterized through genome-wide analysis. Results: Phylogenetic analysis grouped these genes into seven subfamilies, indicating a high degree of evolutionary conservation. Gene structure analysis revealed that most TcGRAS genes lacked introns, while promoter regions were enriched with cis-acting elements related to light responsiveness, hormone signaling (e.g., MeJA, ABA, SA), and abiotic stress responses. Tissue-specific expression profiling and qRT-PCR analyses showed that several TcGRAS genes were highly expressed in paclitaxel-accumulating organs such as roots, stems, and bark—particularly TcGRAS6, TcGRAS13, and TcGRAS24—suggesting their putative involvement in paclitaxel biosynthesis through hormone-mediated regulatory pathways. Co-expression network analysis further identified TcGRAS13 and TcGRAS14 as central nodes within the transcriptional regulatory network. Conclusions: This study provides the first genome-wide identification and comprehensive characterization of the GRAS gene family in T. cuspidata, thereby establishing a theoretical framework and providing valuable candidate gene resources for elucidating their biological functions and regulatory roles in secondary metabolism, particularly in paclitaxel biosynthesis.

1. Introduction

Throughout their life cycle, plants are constantly exposed to and must adapt to a wide range of dynamic environmental fluctuations, including changes in light, water availability, temperature, and biotic stressors. To efficiently perceive and respond to both internal and external cues, plants employ a complex and tightly regulated transcriptional network to coordinate growth, developmental processes, and environmental adaptation [1]. Within this regulatory framework, transcription factors function as central regulators by specifically recognizing and binding to cis-acting elements in the promoters of target genes, thereby facilitating the precise spatiotemporal control of downstream gene expression [2].
The GRAS gene family comprises plant-specific transcription factors, named after its first three identified members: GAI (GA-Insensitive), RGA (Repressor of ga1-3), and SCR (Scarecrow) [3,4]. GRAS proteins are defined by a highly conserved C-terminal GRAS domain, which contains several signature motifs—LHRI, VHIID, LHRII, PFYRE, and SAW—that mediate protein–protein interactions and transcriptional activation or repression [5]. In contrast, the N-terminal region exhibits considerable structural variability, which is implicated in signal perception, protein complex assembly, and the specific recognition of target genes [6], thereby contributing to their functional diversification.
Extensive studies have shown that the GRAS gene family participates in a wide array of fundamental physiological processes across diverse plant species, including gibberellin (GA) signaling [7], root and stem differentiation, light signaling responses [8], hormonal crosstalk, and responses to both biotic and abiotic stresses. These findings underscore the extensive functional diversity and high degree of evolutionary conservation of this gene family. Among its subgroups, members of the DELLA subfamily, such as GAI and RGA, function as negative regulators in the GA signaling pathway, thereby suppressing excessive stem elongation and influencing plant height and branching architecture [9]. In contrast, members such as SCR and SHR are involved in regulating radial patterning in vascular and root tissues and serve as integral components of the root developmental network [10,11]. Within the GRAS family, the PAT1 subfamily has attracted significant interest owing to its critical involvement in light signal transduction. PAT1 proteins were first identified in Arabidopsis thaliana as positive nuclear regulators of the phytochrome A (phyA) signaling pathway, which activate downstream light-responsive genes and ultimately regulate photomorphogenesis [12,13]. Recent studies have further revealed that members of the PAT1 subfamily play pivotal roles in abiotic stress responses, such as those triggered by drought [14] and low temperatures [15], emphasizing their essential contribution to stress-responsive signaling networks. Moreover, emerging evidence indicates that PAT1 genes are functionally significant in plant grafting. Notably, their expression is significantly upregulated at the graft junctions of both Arabidopsis and Picea abies, indicating a pivotal role in wound healing and tissue regeneration during graft union formation [16].
Northeast yew (Taxus cuspidata Sieb. et Zucc.), a rare gymnosperm belonging to the family Taxaceae, is recognized as a valuable medicinal resource due to its production of paclitaxel, a compound extensively utilized in anti-cancer drug development and clinical therapy [17]. The biosynthesis of paclitaxel involves the coordinated expression of multiple genes and the catalytic activities of pathway-specific enzymes. To date, approximately 20 enzymatic steps have been proposed; however, the complete biosynthetic pathway remains to be fully elucidated [18]. The natural abundance of paclitaxel in Taxus species is extremely low, and the intrinsically slow growth rate of T. cuspidata further limits the sustainable production and supply of this compound. The recent completion of whole-genome sequencing for T. cuspidata has yielded valuable genomic resources that facilitate gene cloning, functional characterization, and comprehensive bioinformatic analyses [19].
Although the GRAS gene family has been characterized in numerous plant species, a comprehensive genome-wide characterization of this family in T. cuspidata species has yet to be reported. Whether GRAS genes contribute to paclitaxel biosynthesis or are involved in the unique tissue development and environmental adaptation characteristic of T. cuspidata remains unknown. In this study, members of the GRAS gene family in T. cuspidata were systematically identified on a genome-wide scale. Phylogenetic relationships, conserved domains, cis-regulatory elements, and tissue-specific expression patterns were comprehensively analyzed to infer their potential biological roles. These findings are expected to provide a foundational framework for future functional investigations and genetic improvement, as well as candidate gene resources for regulatory research and genetic engineering of paclitaxel biosynthesis.

2. Materials and Methods

2.1. Identification of TcGRAS Gene Family Members

The genome and annotation files of T. cuspidata were downloaded from the Ensembl Plants database (https://plants.ensembl.org/index.html) (accessed on 11 June 2025), and both protein and coding DNA (CDS) sequences were extracted using TBtools-II. The hidden Markov model (HMM) profile for the GRAS domain (PF03514) was obtained from the Pfam database. Candidate TcGRAS proteins were screened using the Pfam (https://pfam.xfam.org/), SMART (http://smart.embl-heidelberg.de/) (accessed on 14 June 2025), and CDD (https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) (accessed on 16 June 2025) databases to verify the presence of conserved GRAS domains. Genes lacking the characteristic GRAS domain were excluded from further analysis. The physicochemical properties of the identified TcGRAS proteins, including amino acid length, molecular weight, theoretical isoelectric point (pI), and hydrophobicity, were analyzed using the Protein Parameters module in TBtools-II.

2.2. Chromosomal Localization of TcGRAS Genes

Chromosomal positions and corresponding gene density data for TcGRAS family members were obtained from the GFF3 annotation file of T. cuspidata using TBtools-II. The chromosomal distribution of TcGRAS genes was visualized using TBtools-II. In addition, all TcGRAS genes were uniformly renamed according to their chromosomal positions.

2.3. Gene Structure and Conserved Motif Analysis of TcGRAS Genes

The gene structures of TcGRAS genes were analyzed and visualized using the Gene Structure Display module in TBtools-II, based on the GFF3 annotation file of T. cuspidata. Conserved motifs within the TcGRAS proteins were identified using the MEME online suite (http://meme.nbcr.net/meme/intro.html) (accessed on 25 June 2025), with motif widths set between 6 and 15 amino acids and the number of motifs limited to 55.

2.4. Phylogenetic Analysis of TcGRAS Genes

A rooted neighbor-joining (NJ) phylogenetic tree was constructed using MEGA 7.0 to analyze the evolutionary relationships and classification of GRAS proteins from T. cuspidata, A. thaliana, Solanum lycopersicum (tomato), and Populus trichocarpa. Protein sequences for A. thaliana, S. lycopersicum, and P. trichocarpa were retrieved from the UniProt database (https://www.uniprot.org/) (accessed on 2 July 2025). The resulting phylogenetic tree was exported in Newick format and visualized via the Chiplot online platform (https://www.chiplot.online/#).

2.5. Cis-Acting Element Analysis of TcGRAS Genes

The 2000 bp sequences upstream of the start codon (ATG) of all TcGRAS genes were extracted using TBtools-II. Cis-acting regulatory elements in the promoter regions were predicted via the PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (accessed on 4 July 2025), and the results were visualized using the ggplot2 v.4.0 package in R [20].

2.6. Expression Pattern and Functional Annotation Analysis of TcGRAS Genes

Transcriptomic data from five different tissue types—bark, leaves, roots, stems, and twig bark—were obtained from the NCBI database under a publicly available transcriptome project (BioProject: PRJNA661543). Three independent biological replicates were included for each tissue type to ensure data reliability and statistical power. All transcriptomic datasets were subjected to normalization and visualized using TBtools-II. To investigate the potential biological functions of the TcGRAS gene family, GO annotation was conducted using the GOATOOLS toolkit (https://github.com/tanghaibao/GOatools) (accessed on 24 July 2025). GO term information was obtained from the Gene Ontology Consortium database. During GO enrichment analysis, statistical significance for each GO term was evaluated via Fisher’s exact test, followed by multiple testing correction using the Bonferroni method. GO terms with p < 0.05 were considered significantly enriched and indicative of key biological processes associated with TcGRAS genes [21].

2.7. qRT-PCR Analysis

Nine healthy two-year-old T. cuspidata plants were randomly selected from the Jingou Science and Technology Ecological Park in Dandong, Liaoning Province, China, for quantitative real-time PCR (qRT-PCR) analysis. Bark, leaves, roots, stems, and young branch bark tissues were collected separately. For each tissue type, samples from every three plants were combined to form one biological replicate, resulting in a total of three biological replicates (n = 3). All samples were immediately flash-frozen in liquid nitrogen after collection and stored at –80 °C for subsequent analyses.
Total RNA was extracted using the RNAprep Pure Plant Kit (Tiangen, Beijing, China) according to the manufacturer’s instructions. First-strand cDNA was synthesized using the HiScript® II 1st Strand cDNA Synthesis Kit (Vazyme, Nanjing, China). Quantitative real-time PCR was conducted using the Gentier 96E real-time PCR system (Tianlong Technology Co., Ltd., Xi’an, China) with the SYBR Green Premix Pro Taq HS qPCR Kit III (Accurate Biotechnology (Hunan) Co., Ltd., Changsha, China). Each cDNA sample was analyzed in three technical replicates. Relative gene expression levels were calculated according to the 2−ΔΔCt method. All primer sequences are provided in Supplementary Table S1 and were synthesized by Bioengineering Co., Ltd. (Shanghai, China).

2.8. Data Analysis

Data collation and computation were performed using Microsoft Excel 2022. One-way analysis of variance (ANOVA) was conducted in SPSS 22.0 to evaluate statistical significance in gene expression across different groups at p values of <0.05, <0.01, <0.001, and <0.0001. Graphs were constructed and visualized using GraphPad Prism 8, R, and Adobe Illustrator 2023.

3. Results

3.1. Genome-Wide Identification and Physicochemical Analysis of GRAS Genes in T. cuspidata

Based on a combination of domain prediction and BLAST v.2.17 alignment against the NCBI database, a total of 26 GRAS gene members were identified in T. cuspidata after manually removing redundant sequences. To systematically characterize the structural and physicochemical properties of this gene family, a set of basic parameters was calculated for the 26 predicted TcGRAS proteins, including amino acid length, molecular weight (MW), theoretical isoelectric point (pI), instability index, hydropathicity (GRAVY), and aliphatic index (Table 1). The TcGRAS proteins ranged in length from 132 to 853 amino acids and had molecular weights ranging from 14.3 kDa to 92.1 kDa. The theoretical pI values ranged from 4.23 to 7.17, with most proteins showing pI values below 6.0. Additionally, 14 proteins were predicted to have instability indices exceeding 40. Analysis of hydropathicity revealed that most TcGRAS proteins were hydrophilic.

3.2. Chromosomal Localization of TcGRAS Gene Genes

Chromosomal mapping was performed to determine the genomic positions and physical distributions of the 26 identified TcGRAS genes in T. cuspidata (Figure 1). The results revealed that TcGRAS genes were unevenly distributed across 9 of the 11 chromosomes. Chromosome 3 contained the highest number of genes, with six members (TcGRAS3–TcGRAS8) clustered together. Chromosomes 4, 8, and 11 each harbored three TcGRAS genes, while chromosomes 1, 5, 6, 7, and 9 harbored one or two genes. Additionally, three genes (TcGRAS25, TcGRAS26, and TcGRAS23) were assigned to unplaced scaffold sequences (e.g., JAHRHJ020002717.1 and JAHRHJ020003142.1), suggesting that their genomic locations remain unanchored to specific chromosomes.

3.3. Phylogenetic Analysis of TcGRAS Genes

A phylogenetic tree was constructed to systematically examine the evolutionary relationships and functional diversification of the GRAS transcription factor family across diverse plant species, including A. thaliana, Oryza sativa, P. trichocarpa, and T. cuspidata (Figure 2). According to the classification system established for A. thaliana, the 26 TcGRAS genes were assigned to six well-defined subfamilies: LISCL, SHR, SCR, DELLA, HAM, and LS. Each subfamily formed well-supported monophyletic clades, indicating a high degree of phylogenetic conservation. Specifically, the LISCL subfamily included TcGRAS3, TcGRAS22, and TcGRAS25; the SHR subfamily included TcGRAS13 and TcGRAS14; the SCR subfamily included TcGRAS5, TcGRAS10, TcGRAS23, and TcGRAS24; and the HAM subfamily included TcGRAS6, TcGRAS7, TcGRAS8, TcGRAS16, TcGRAS17, TcGRAS18, TcGRAS19, TcGRAS20, and TcGRAS26. The DELLA subfamily comprised TcGRAS1, TcGRAS4, TcGRAS11, TcGRAS12, and TcGRAS15. Finally, the LS subfamily included TcGRAS9 and TcGRAS21. These findings suggest that several functionally defined core subfamilies have been evolutionarily conserved in T. cuspidata.

3.4. Motif, Conserved Domain, and Gene Structure Analysis of TcGRAS Genes

A comprehensive analysis was performed to provide insights into the structural characteristics and conserved functions of the GRAS gene family in T. cuspidata. Six conserved motifs (Motif 1–6) were identified using the MEME suite (Figure 3A). Most TcGRAS proteins contained Motifs 1, 2, 3, and 5, whereas Motif 6 was uniquely detected in a limited subset of proteins, including TcGRAS10, TcGRAS11, and TcGRAS15. Domain architecture analysis revealed that all TcGRAS proteins possessed a canonical GRAS domain (Figure 3B), while several also contained additional domains such as DELLA and PP2C. Gene structure analysis showed that most TcGRAS genes contained either a single intron or were intronless (Figure 3C), as exemplified by TcGRAS1, TcGRAS9, and TcGRAS14. In contrast, a limited number of genes—such as TcGRAS5, TcGRAS23, and TcGRAS25—contained multiple introns.

3.5. Cis-Acting Elements Analysis of TcGRAS Genes

To investigate the potential regulatory mechanisms of GRAS genes in T. cuspidata, cis-acting elements in the 2000 bp upstream promoter regions of 26 TcGRAS genes were systematically predicted and functionally categorized (Figure 4). A total of 89 cis-acting elements were identified across all TcGRAS family members, of which 43 were assigned putative functions. These elements were predominantly associated with abiotic stress, light-responsive regulation, phytohormone-responsive pathways, and plant growth and development. Elements related to abiotic and biotic stress comprised five distinct motifs: ARE, as-1, LTR, MBS, and WUN-motif. Phytohormone-responsive elements consisted of five categories: ABRE, CGTCA-motif, P-box, TCA-element, and TGACG-motif. Elements associated with plant growth and development included five elements: TATA-box, ERE, AAGAA-motif, CAT-box, and O2-site. Light-responsive elements were composed of five motifs: CAAT-box, Box 4, G-box, TCT-motif, and GT1-motif. Collectively, these findings suggest that TcGRAS genes may play crucial roles in the growth and development, hormone regulation, and stress adaptation of T. cuspidata.

3.6. Expression Patterns, GO Enrichment, and Protein–Protein Interaction Network Analysis of TcGRAS Genes

The expression patterns of TcGRAS genes were analyzed using transcriptomic data from five distinct tissues of T. cuspidata. The results revealed distinct tissue-specific expression patterns (Figure 5A). TcGRAS5, TcGRAS4, TcGRAS13, TcGRAS15, TcGRAS1, TcGRAS8, TcGRAS6, TcGRAS24, TcGRAS19, TcGRAS14, TcGRAS21, TcGRAS20, and TcGRAS11 were specifically upregulated in twig bark and were downregulated in root, stem, and leaf tissues. This coordinated expression pattern suggests a conserved functional role among these members. Likewise, TcGRAS12 and TcGRAS9 were exclusively upregulated in the root, whereas TcGRAS26 and TcGRAS23 were specifically upregulated in the stem tissue. These findings indicate that individual TcGRAS genes may play distinct biological roles across various tissues of T. cuspidata.
To further explore the functional properties of the TcGRAS gene family, Gene Ontology (GO) enrichment analysis was performed (Figure 5B), covering three categories: biological process, molecular function, and cellular component. In the biological process category, TcGRAS genes were predominantly enriched in pathways such as salicylic acid-mediated signaling, hyperosmotic salinity response, gibberellic acid-mediated signaling, negative regulation of leaf development, response to ethylene, positive regulation of gene expression, and regulation of the mitotic cell cycle. In the molecular function category, enrichment was observed in sequence-specific DNA binding, DNA-binding transcription factor activity, and transcription coregulator activity. In the cellular component category, genes were predominantly enriched in the nucleus.
To investigate potential co-regulatory interactions among TcGRAS genes, a gene co-expression network was constructed (Figure 5C). Twelve TcGRAS genes were identified as interacting within the network. Notably, TcGRAS24, TcGRAS9, TcGRAS13, and TcGRAS14 were positioned at central nodes, showing high degrees of connectivity and forming robust co-expression relationships with multiple other genes. These patterns suggest that they may serve as key regulators within the TcGRAS transcriptional network.

3.7. qRT-PCR Analysis of Tissue-Specific Expression Patterns of TcGRAS Genes

To validate the accuracy of transcriptome-derived expression profiles, qRT-PCR was performed to quantify the expression profiles of nine representative TcGRAS genes across five tissues of T. cuspidata—root, bark, twig bark, stem, and leaf (Figure 6). The results demonstrated significant variation in gene expression among tissues, indicating pronounced tissue specificity. TcGRAS1, TcGRAS4, TcGRAS6, TcGRAS13, and TcGRAS24 were highly expressed in twig bark, while expression levels in the root, stem, and other tissues were substantially lower. In contrast, TcGRAS9, TcGRAS11, and TcGRAS14 exhibited higher expression levels in the root, whereas TcGRAS23 was primarily expressed in the stem and leaf. These findings support the tissue-specific regulatory roles of TcGRAS genes and corroborate the transcriptomic data.

4. Discussion

As a relict gymnosperm, T. cuspidata has been recognized as the only natural source of paclitaxel. GRAS proteins, a plant-specific family of transcription factors, have been shown to play critical regulatory roles in diverse physiological processes, including growth and development, hormone signaling, and responses to abiotic stresses [22]. To date, members of the GRAS gene family have been identified and functionally characterized in various plant species, such as Avena sativa [23], Raphanus sativus [24], Dendrobium catenatum [6], Zea mays [25], Eucalyptus grandis [3], and Medicago sativa [4]. However, a comprehensive and systematic investigation of the GRAS gene family in T. cuspidata remains lacking.

4.1. Structural and Evolutionary Features of the GRAS Gene Family in T. cuspidata Exhibit Distinct Characteristics

Significant variability in the number of GRAS genes has been reported across plant species, with 34 in A. thaliana [26], 54 in tomato [27], 60 in rice [28], and up to 177 in wheat [29]. In contrast, only 26 GRAS genes were identified in T. cuspidata, a substantially lower count, potentially reflecting long-term functional integration and evolutionary conservation. This reduction is likely associated with the species’ complex genomic architecture and its gymnosperm lineage. Physicochemical analyses demonstrated substantial heterogeneity among TcGRAS proteins in terms of amino acid length, theoretical isoelectric point, instability index, and hydrophobicity, indicating pronounced structural and functional diversification. This finding is consistent with previous results in alfalfa [4], suggesting that structural polymorphisms within the GRAS family may contribute to the regulation of diverse physiological and developmental processes, including stress responses, tissue differentiation, and hormone signaling. Phylogenetic analysis further demonstrated that the 26 TcGRAS proteins clustered into six subfamilies, forming well-supported clades with homologs from A. thaliana (34 AtGRAS), O. sativa (60 OsGRAS), and P. trichocarpa (91 PtGRAS), indicating robust evolutionary conservation. Members within each subfamily shared similar gene structures and conserved motifs, underscoring both the functional stability and evolutionary inheritance of this gene family. Motif and domain analyses indicated that conserved functional regions were primarily located in the C-terminal domain, while significant divergence was observed in the N-terminal region. This structural flexibility may enable distinct TcGRAS members to adopt specialized regulatory functions and signaling responses, consistent with the structural variability previously described by Liu et al. [30]. Gene structure analysis showed that approximately 38.5% of TcGRAS genes lacked introns, and most of the remaining genes contained only one or two introns. This streamlined intron architecture, comparable to that observed in cucumber [31], may facilitate accelerated transcriptional responses under stress conditions [32].

4.2. Cis-Regulatory Element and Go Analysis Reveals the Multifunctionality of TcGRAS Genes

Cis-regulatory element analysis revealed that the promoter regions of TcGRAS genes were found to be enriched in regulatory motifs involved in light responsiveness, hormone signaling, growth and development, and abiotic stress responses. This enrichment suggests that TcGRAS genes are integrated into a complex transcriptional regulatory network and are likely to perform broad biological functions. In particular, the abundance of light-responsive elements in TcGRAS promoters was significantly greater than previously reported in other plant species [32], implying a more prominent role in light perception and photoadaptation. This likely reflects the ecological adaptation of T. cuspidata to understory, low-light environments in which the efficient regulation of photosynthesis is essential. Regarding developmental regulation, TATA-box elements were identified in all TcGRAS promoters, with notably high frequencies observed in TcGRAS1, TcGRAS6, and TcGRAS21 (64, 75, and 100 elements, respectively). This suggests a pivotal role in tissue-specific expression and developmental stage regulation and may contribute to the formation and secondary growth of structural tissues such as stems and twig bark. Moreover, promoter regions harbored multiple cis-elements responsive to abscisic acid (ABA), gibberellins (GA), methyl jasmonate (MeJA), and salicylic acid (SA), indicating that TcGRAS genes may participate in diverse hormone signaling pathways. For example, GRAS genes have been implicated in ABA-mediated drought resistance in poplar [33] and GA-regulated developmental processes in other species [34]. In terms of abiotic stress response, most TcGRAS promoters contained cis-elements related to hypoxia, drought, and salinity, suggesting roles in environmental adaptation and stress defense. Notably, certain genes such as TcGRAS23 were enriched with both hormone- and stress-responsive elements, exhibiting classic characteristics of signal crosstalk and potentially serving as integrators of external and internal signaling pathways.
GO enrichment analysis further indicated that several TcGRAS members were significantly associated with the salicylic acid-mediated signaling pathway. Previous studies have demonstrated that SA signaling promotes the expression of paclitaxel biosynthetic genes such as TASY [35], leading to the hypothesis that these TcGRAS genes may indirectly influence paclitaxel biosynthesis via SA signaling modulation. Although functional validation is still required, this finding offers new insights into the regulatory roles of GRAS transcription factors in hormone signaling and provides a potential entry point for elucidating upstream regulatory mechanisms of paclitaxel metabolism.

4.3. Functional Divergence Potential of TcGRAS Genes in T. cuspidate

Spatiotemporal expression analysis provides crucial insights into the potential biological functions of genes [21]. In this study, the spatiotemporal expression profiling of TcGRAS genes revealed distinct tissue-specific expression profiles across various organs of T. cuspidata, suggesting their multifunctionality and potential functional divergence during plant development and environmental adaptation. Notably, several TcGRAS members (TcGRAS5, TcGRAS4, TcGRAS13, and TcGRAS15) were highly expressed in the twig bark, the primary site of paclitaxel biosynthesis and accumulation [36]. This expression pattern implies that these genes may be involved in the regulation of vascular tissue development, thereby indirectly influencing paclitaxel synthesis or transport. Members of the HAM and SCR subfamilies of the GRAS gene family have been shown to be critically involved in meristem maintenance, cell division, and vascular differentiation [37]. Based on this evidence, the highly expressed TcGRAS genes are hypothesized to be involved in secondary growth, vascular formation, and xylem development, which may, in turn, affect paclitaxel biosynthesis via regulation of vascular development. Additionally, several TcGRAS genes displayed organ-specific expression profiles in other tissues. For example, TcGRAS9 and TcGRAS26 were predominantly expressed in roots, suggesting roles in regulating root development, nutrient uptake, or stress adaptation, thus potentially enhancing the adaptability of T. cuspidata to complex environmental conditions. This expression divergence supports the hypothesis that TcGRAS genes execute tissue-specific functions across different organs.
Further research is warranted to functionally characterize these highly expressed TcGRAS members through approaches such as CRISPR/Cas9-mediated gene editing or virus-induced gene silencing (VIGS), in order to elucidate their specific regulatory roles in the paclitaxel biosynthetic pathway.

5. Conclusions

In this study, 26 members of the GRAS transcription factor family were systematically identified and characterized in T. cuspidata, distributed across 10 chromosomes and classified into six distinct subfamilies. The TcGRAS genes exhibited strong evolutionary conservation, as evidenced by promoter regions enriched in cis-regulatory elements involved in light responsiveness, hormone signaling, and abiotic stress responses. Expression profiling and co-expression network analysis revealed that several genes—such as TcGRAS13, TcGRAS14, and TcGRAS24—were highly expressed in twig bark and occupied central positions within the co-expression network, indicating their putative involvement in paclitaxel biosynthesis via hormone-mediated regulatory pathways. Collectively, these findings provide a theoretical foundation and a valuable set of candidate genes for future functional studies of GRAS transcription factors in T. cuspidata, particularly in the context of secondary metabolism and paclitaxel biosynthesis.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16111345/s1. Table S1: Primer sequences used for qRT-PCR.

Author Contributions

Conceptualization, Y.Z.; data curation, L.G. and H.T.; methodology, L.G. and A.S.; software, L.G. and A.S.; visualization, L.G. and M.W.; investigation, M.W. and H.T.; writing—original draft, L.G. and M.W.; formal analysis, A.S. and M.W.; writing—review and editing, L.G., M.W. and Y.Z.; resources, Y.Z. and H.T.; funding acquisition, Y.Z.; project administration, Y.Z.; supervision, Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financially supported by the Liaoning Province Science and Technology Plan Joint Plan Project—Technology Research Project (2024JH2/102600132).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials. Further inquiries can be directed to the corresponding author.

Acknowledgments

We thank all our colleagues for providing useful discussions and technical assistance.

Conflicts of Interest

Author Aokun Shi and Mian Wang were employed by the company Beijing Tianhaoyuan Agricultural Technology Development Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Liu, F.; Xi, M.; Liu, T.; Wu, X.; Ju, L.; Wang, D. The central role of transcription factors in bridging biotic and abiotic stress responses for plants’ resilience. New Crops 2024, 1, 100005. [Google Scholar] [CrossRef]
  2. Rabeh, K.; Hnini, M.; Oubohssaine, M. A comprehensive review of transcription factor-mediated regulation of secondary metabolites in plants under environmental stress. Stress Biol. 2025, 5, 15. [Google Scholar] [CrossRef]
  3. Lu, H.; Xu, J.; Li, G.; Zhong, T.; Chen, D.; Lv, J. Genome-wide identification and expression analysis of GRAS gene family in Eucalyptus grandis. BMC Plant Biol. 2024, 24, 573. [Google Scholar] [CrossRef]
  4. Fan, L.; Wang, R.; García Caparrós, P.; Meng, L.; Sun, X. Genome-wide identification and functional analysis of the GRAS gene family in Medicago lupulina L. BMC Genom. 2025, 26, 682. [Google Scholar] [CrossRef]
  5. Liu, X.; Widmer, A. Genome-wide Comparative Analysis of the GRAS Gene Family in Populus, Arabidopsis and Rice. Plant Mol. Biol. Rep. 2014, 32, 1129–1145. [Google Scholar] [CrossRef]
  6. Huang, Y.; Zheng, Q.; Zhang, M.; He, X.; Zhao, X.; Wang, L.; Lan, S.; Liu, Z. Genome-Wide Identification and Expression Analysis of the GRAS Gene Family and Their Responses to Heat Stress in Cymbidium goeringii. Int. J. Mol. Sci. 2024, 25, 6363. [Google Scholar] [CrossRef]
  7. Fu, X.; Richards, D.E.; Ait-ali, T.; Hynes, L.W.; Ougham, H.; Peng, J.; Harberd, N.P. Gibberellin-Mediated Proteasome-Dependent Degradation of the Barley DELLA Protein SLN1 Repressor. Plant Cell 2002, 14, 3191–3200. [Google Scholar] [CrossRef]
  8. Torres-Galea, P.; Hirtreiter, B.; Bolle, C. Two GRAS Proteins, SCARECROW-LIKE21 and PHYTOCHROME A SIGNAL TRANSDUCTION1, Function Cooperatively in Phytochrome A Signal Transduction. Plant Physiol. 2013, 161, 291–304. [Google Scholar] [CrossRef]
  9. Zhang, Z.; Ogawa, M.; Fleet, C.M.; Zentella, R.; Hu, J.; Heo, J.O.; Lim, J.; Kamiya, Y.; Yamaguchi, S.; Sun, T.P. SCARECROW-LIKE 3 promotes gibberellin signaling by antagonizing master growth repressor DELLA in Arabidopsis. Proc. Natl. Acad. Sci. USA 2011, 108, 2160–2165. [Google Scholar] [CrossRef] [PubMed]
  10. Helariutta, Y.; Fukaki, H.; Wysocka-Diller, J.; Nakajima, K.; Jung, J.; Sena, G.; Hauser, M.T.; Benfey, P.N. The SHORT-ROOT Gene Controls Radial Patterning of the Arabidopsis Root through Radial Signaling. Cell 2000, 101, 555–567. [Google Scholar] [CrossRef] [PubMed]
  11. Koizumi, K.; Hayashi, T.; Wu, S.; Gallagher, K.L. The SHORT-ROOT protein acts as a mobile, dose-dependent signal in patterning the ground tissue. Proc. Natl. Acad. Sci. USA 2012, 109, 13010–13015. [Google Scholar] [CrossRef]
  12. Bolle, C.; Koncz, C.; Chua, N.H. PAT1, a new member of the GRAS family, is involved in phytochrome A signal transduction. Genes Dev. 2000, 14, 1269–1278. [Google Scholar] [CrossRef]
  13. Muntha, S.T.; Zhang, L.; Zhou, Y.; Zhao, X.; Hu, Z.; Yang, J.; Zhang, M. Phytochrome A signal transduction 1 and CONSTANS-LIKE 13 coordinately orchestrate shoot branching and flowering in leafy Brassica juncea. Plant Biotechnol. J. 2019, 17, 1333–1343. [Google Scholar] [CrossRef]
  14. Li, J.; Zhou, P.; Hu, Z.; Xiong, A.; Li, X.; Chen, X.; Zhuang, J. The transcription factor CsPAT1 from tea plant (Camellia sinensis) is involved in drought tolerance by modulating phenylpropanoid biosynthesis. J. Plant Physiol. 2025, 308, 154474. [Google Scholar] [CrossRef]
  15. Ming, R.; Fang, T.; Ling, W.; Geng, J.; Qu, J.; Zhang, Y.; Chen, J.; Yao, S.; Li, L.; Huang, D.; et al. The GRAS transcription factor PtrPAT1 of Poncirus trifoliata functions in cold tolerance and modulates glycine betaine content by regulating the BADH-like gene. Hortic. Res. 2025, 12, uhae296. [Google Scholar] [CrossRef]
  16. Feng, M.; Zhang, A.; Nguyen, V.; Bisht, A.; Almqvist, C.; De Veylder, L.; Carlsbecker, A.; Melnyk, C.W. A conserved graft formation process in Norway spruce and Arabidopsis identifies the PAT gene family as central regulators of wound healing. Nat. Plants 2024, 10, 53–65. [Google Scholar] [CrossRef]
  17. Jîjie, A.R.; Iliescu, D.; Sbârcea, L.; Boru, C.; Pătrașcu, D.; Iftode, O.A.; Minda, I.D.; Avram, Ș.; Trandafirescu, C.M.; Dehelean, C.A.; et al. A Deep Dive into the Botanical and Medicinal Heritage of Taxus. Plants 2025, 14, 1439. [Google Scholar] [CrossRef] [PubMed]
  18. Jiang, B.; Gao, L.; Wang, H.; Sun, Y.; Zhang, X.; Ke, H.; Liu, S.; Ma, P.; Liao, Q.; Wang, Y.; et al. Characterization and heterologous reconstitution of Taxus biosynthetic enzymes leading to baccatin III. Science 2024, 383, 622–629. [Google Scholar] [CrossRef]
  19. Xiong, X.; Gou, J.; Liao, Q.; Li, Y.; Zhou, Q.; Bi, G.; Li, C.; Du, R.; Wang, X.; Sun, T.; et al. The Taxus genome provides insights into paclitaxel biosynthesis. Nat. Plants 2021, 7, 1026–1036. [Google Scholar] [CrossRef] [PubMed]
  20. Xie, K.; Sun, M.; Wang, X.; Zhou, X.; Di, Q.; Li, Y.; Wu, G.; Li, Y.; He, C.; Li, S. Genome-wide identification, expression analysis, and response to abiotic stress of the phosphate transporter gene family in cucumber (Cucumis sativus L.). BMC Plant Biol. 2025, 25, 677. [Google Scholar] [CrossRef] [PubMed]
  21. Fan, J.; Wang, Y.; Dou, M.; Hu, Q.; Li, Q.; Gan, S.; Jing, M.; Wang, G.; Huang, X. Genome-wide identification and expression pattern analysis of NAC family in Taxus yunnanensis and the TyuNAC30 role in paclitaxel production. BMC Genom. 2025, 26, 705. [Google Scholar] [CrossRef]
  22. Waseem, M.; Nkurikiyimfura, O.; Niyitanga, S.; Jakada, B.H.; Shaheen, I.; Aslam, M.M. GRAS transcription factors emerging regulator in plants growth, development, and multiple stresses. Mol. Biol. Rep. 2022, 49, 9673–9685. [Google Scholar] [CrossRef]
  23. Ling, L.; Li, M.; Chen, N.; Ren, G.; Qu, L.; Yue, H.; Wu, X.; Zhao, J. Genome-Wide Analysis and Expression of the GRAS Transcription Factor Family in Avena sativa. Genes 2023, 14, 164. [Google Scholar] [CrossRef]
  24. Li, C.; Wang, K.; Chen, S.; Zhang, X.; Zhang, X.; Fan, L.; Dong, J.; Xu, L.; Wang, Y.; Li, Y.; et al. Genome-wide identification of RsGRAS gene family reveals positive role of RsSHRc gene in chilling stress response in radish (Raphanus sativus L.). Plant Physiol. Biochem. 2022, 192, 285–297. [Google Scholar] [CrossRef]
  25. Guo, Y.; Wu, H.; Li, X.; Li, Q.; Zhao, X.; Duan, X.; An, Y.; Lv, W.; An, H. Identification and expression of GRAS family genes in maize (Zea mays L.). PLoS ONE 2017, 12, e0185418. [Google Scholar] [CrossRef]
  26. Lee, M.; Kim, B.; Song, S.; Heo, J.; Yu, N.; Lee, S.A.; Kim, M.; Kim, D.G.; Sohn, S.O.; Lim, C.E.; et al. Large-scale analysis of the GRAS gene family in Arabidopsis thaliana. Plant Mol. Biol. 2008, 67, 659–670. [Google Scholar] [CrossRef]
  27. Niu, Y.; Zhao, T.; Xu, X.; Li, J. Genome-wide identification and characterization of GRAS transcription factors in tomato (Solanum lycopersicum). PeerJ 2017, 5, e3955. [Google Scholar] [CrossRef] [PubMed]
  28. Tian, C.; Wan, P.; Sun, S.; Li, J.; Chen, M. Genome-wide analysis of the GRAS gene family in rice and Arabidopsis. Plant Mol. Biol. 2004, 54, 519–532. [Google Scholar] [CrossRef] [PubMed]
  29. Mishra, S.; Chaudhary, R.; Pandey, B.; Singh, G.; Sharma, P. Genome-wide identification and expression analysis of the GRAS gene family under abiotic stresses in wheat (Triticum aestivum L.). Sci. Rep. 2023, 13, 18705. [Google Scholar] [CrossRef] [PubMed]
  30. Liu, Y.; Wang, W. Characterization of the GRAS gene family reveals their contribution to the high adaptability of wheat. PeerJ 2021, 9, e10811. [Google Scholar] [CrossRef]
  31. Lu, X.; Liu, W.; Xiang, C.; Li, X.; Wang, Q.; Wang, T.; Liu, Z.; Zhang, J.; Gao, L.; Zhang, W. Genome-Wide Characterization of GRAS Family and Their Potential Roles in Cold Tolerance of Cucumber (Cucumis sativus L.). Int. J. Mol. Sci. 2020, 21, 3857. [Google Scholar] [CrossRef]
  32. Chang, J.; Fan, D.; Lan, S.; Cheng, S.; Chen, S.; Lin, Y.; Cao, S. Genome-Wide Identification, Expression and Stress Analysis of the GRAS Gene Family in Phoebe bournei. Plants 2023, 12, 2048. [Google Scholar] [CrossRef]
  33. Wang, P.; Yang, X.; Sun, S.; Wang, J.; Wang, J.; Li, X.; Li, D.; Wang, Y. The SCARECROW-LIKE transcription factor from Populus davidiana × P. bolleana simultaneously improved drought tolerance and plant growth through acetylation-dependent mechanisms. Plant Biotechnol. J. 2025, 23, 3650–3666. [Google Scholar] [CrossRef] [PubMed]
  34. Wang, F.; Zhu, D.; Huang, X.; Li, S.; Gong, Y.; Yao, Q.; Fu, X.; Fan, L.; Deng, X.W. Biochemical Insights on Degradation of Arabidopsis DELLA Proteins Gained from a Cell-Free Assay System. Plant Cell 2009, 21, 2378–2390. [Google Scholar] [CrossRef] [PubMed]
  35. Chen, Y.; Zhang, M.; Zhang, W.; Wang, Y.; Zhang, H.; Yang, L.; Yu, L.; Fu, C. miR5298b regulated taxol biosynthesis by acting on TcNPR3, resulting in an alleviation of the strong inhibition of the TcNPR3-TcTGA6 complex in Taxus chinensis. Int. J. Biol. Macromol. 2023, 248, 125909. [Google Scholar] [CrossRef]
  36. Yu, C.; Luo, X.; Zhang, C.; Xu, X.; Huang, J.; Chen, Y.; Feng, S.; Zhan, X.; Zhang, L.; Yuan, H.; et al. Tissue-specific study across the stem of Taxus media identifies a phloem-specific TmMYB3 involved in the transcriptional regulation of paclitaxel biosynthesis. Plant J. 2020, 103, 95–110. [Google Scholar] [CrossRef]
  37. Khan, Y.; Xiong, Z.; Zhang, H.; Liu, S.; Yaseen, T.; Hui, T. Expression and roles of GRAS gene family in plant growth, signal transduction, biotic and abiotic stress resistance and symbiosis formation—A review. Plant Biol. 2021, 24, 404–416. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Chromosomal localization of TcGRAS family members in T. cuspidate. The vertical axis indicates chromosomal length. “chr01” to “chr011” represent the 11 chromosomes of T. cuspidata, with black labels indicating the positions of the 26 GRAS genes on the respective chromosomes.
Figure 1. Chromosomal localization of TcGRAS family members in T. cuspidate. The vertical axis indicates chromosomal length. “chr01” to “chr011” represent the 11 chromosomes of T. cuspidata, with black labels indicating the positions of the 26 GRAS genes on the respective chromosomes.
Genes 16 01345 g001
Figure 2. Phylogenetic relationships of GRAS family genes in A. thaliana, O. sativa, P. trichocarpa, and T. cuspidate. Four symbols and colors on the right under species denote the GRAS family members corresponding to A. thaliana, O. sativa, P. trichocarpa, and T. cuspidata. Branches corresponding to each subfamily are depicted in distinct colors, with those of members within the same subfamily shown in identical colors.
Figure 2. Phylogenetic relationships of GRAS family genes in A. thaliana, O. sativa, P. trichocarpa, and T. cuspidate. Four symbols and colors on the right under species denote the GRAS family members corresponding to A. thaliana, O. sativa, P. trichocarpa, and T. cuspidata. Branches corresponding to each subfamily are depicted in distinct colors, with those of members within the same subfamily shown in identical colors.
Genes 16 01345 g002
Figure 3. Motif, conserved domain, and gene structure analysis of TcGRAS family members. (A) Conserved protein motifs. (B) Structural domains. (C) Gene structures. The horizontal axis indicates the gene and protein sequence lengths.
Figure 3. Motif, conserved domain, and gene structure analysis of TcGRAS family members. (A) Conserved protein motifs. (B) Structural domains. (C) Gene structures. The horizontal axis indicates the gene and protein sequence lengths.
Genes 16 01345 g003
Figure 4. Distribution of cis-regulatory elements in the promoter regions of TcGRAS genes. The left vertical axis lists the 26 TcGRAS gene identifiers. The central stacked bar chart displays the total number of cis-acting elements classified by functional categories. The heatmap on the right shows representative cis-acting elements along the horizontal axis, with the frequency of occurrence indicated numerically within each grid cell.
Figure 4. Distribution of cis-regulatory elements in the promoter regions of TcGRAS genes. The left vertical axis lists the 26 TcGRAS gene identifiers. The central stacked bar chart displays the total number of cis-acting elements classified by functional categories. The heatmap on the right shows representative cis-acting elements along the horizontal axis, with the frequency of occurrence indicated numerically within each grid cell.
Genes 16 01345 g004
Figure 5. Expression patterns, GO enrichment, and protein–protein interaction network of TcGRAS genes. (A) Heatmap showing the expression of TcGRAS genes in roots, stems, leaves, twigs, and bark based on public transcriptome data. Color bars represent scaled relative expression levels, and the data in each heatmap correspond to raw FPKM values from the respective samples. (B) GO enrichment analysis of TcGRAS genes. (C) Protein–protein interaction network analysis of TcGRAS genes.
Figure 5. Expression patterns, GO enrichment, and protein–protein interaction network of TcGRAS genes. (A) Heatmap showing the expression of TcGRAS genes in roots, stems, leaves, twigs, and bark based on public transcriptome data. Color bars represent scaled relative expression levels, and the data in each heatmap correspond to raw FPKM values from the respective samples. (B) GO enrichment analysis of TcGRAS genes. (C) Protein–protein interaction network analysis of TcGRAS genes.
Genes 16 01345 g005
Figure 6. qRT-PCR analysis of TcGRAS gene expression across different tissues of T. cuspidata. Error bars represent the standard deviation of three biological replicates. Asterisks above the bars indicate statistically significant differences in expression levels relative to the Root. * p < 0.05, ** p < 0.01, *** p < 0.001, **** p < 0.0001.
Figure 6. qRT-PCR analysis of TcGRAS gene expression across different tissues of T. cuspidata. Error bars represent the standard deviation of three biological replicates. Asterisks above the bars indicate statistically significant differences in expression levels relative to the Root. * p < 0.05, ** p < 0.01, *** p < 0.001, **** p < 0.0001.
Genes 16 01345 g006
Table 1. Physicochemical properties of TcGRAS genes.
Table 1. Physicochemical properties of TcGRAS genes.
Sequence IDNumber of Amino AcidsMolecular WeightTheoretical pIInstability IndexAliphatic IndexGrand Average of Hydropathicity
TcGRAS164771,235.425.8158.877.68−0.378
TcGRAS217719,725.615.1534.3187.010.081
TcGRAS326128,649.414.7340.5195.98−0.067
TcGRAS461267,175.215.9558.1277.52−0.468
TcGRAS559663,669.065.6249.6468.15−0.552
TcGRAS651557,609.724.6139.3580.1−0.33
TcGRAS756163,439.374.5646.8677.02−0.361
TcGRAS828031,481.67.1751.79111.430.026
TcGRAS974581,673.175.3347.8774.07−0.306
TcGRAS1015617,7035.9651.4886.22−0.212
TcGRAS1160567,384.644.7349.5884.99−0.375
TcGRAS1255762,434.985.4752.9181.83−0.273
TcGRAS1359265,868.945.3437.2676.49−0.507
TcGRAS1438543,992.35.5342.2581.06−0.386
TcGRAS1573282,508.35.8152.9587.68−0.243
TcGRAS1647754,182.125.7837.7289.37−0.129
TcGRAS1725128,369.175.4241.0786.73−0.223
TcGRAS1823026,586.596.6753.5690.26−0.239
TcGRAS1955662,685.076.2541.1690.29−0.036
TcGRAS2049956,327.174.9745.7787.33−0.24
TcGRAS2145350,918.654.7451.8480.55−0.189
TcGRAS2215217,072.555.9348.3295.53−0.122
TcGRAS2385392,140.23645.6275.77−0.446
TcGRAS2462969,349.675.5448.5773.66−0.372
TcGRAS2513214,348.344.2363.13110.760.168
TcGRAS2644950,692.785.2341.0389.53−0.125
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gao, L.; Shi, A.; Wang, M.; Tian, H.; Zhang, Y. Genome-Wide Identification and Expression Profiling of the GRAS Gene Family in Taxus cuspidate. Genes 2025, 16, 1345. https://doi.org/10.3390/genes16111345

AMA Style

Gao L, Shi A, Wang M, Tian H, Zhang Y. Genome-Wide Identification and Expression Profiling of the GRAS Gene Family in Taxus cuspidate. Genes. 2025; 16(11):1345. https://doi.org/10.3390/genes16111345

Chicago/Turabian Style

Gao, Li, Aokun Shi, Mian Wang, Hui Tian, and Yanwen Zhang. 2025. "Genome-Wide Identification and Expression Profiling of the GRAS Gene Family in Taxus cuspidate" Genes 16, no. 11: 1345. https://doi.org/10.3390/genes16111345

APA Style

Gao, L., Shi, A., Wang, M., Tian, H., & Zhang, Y. (2025). Genome-Wide Identification and Expression Profiling of the GRAS Gene Family in Taxus cuspidate. Genes, 16(11), 1345. https://doi.org/10.3390/genes16111345

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop