Abstract
Background/Objectives: The terrestrialization of plants has been a significant driver of plant evolution over hundreds of millions of years, and epidermal development plays a crucial role in adapting to terrestrial environments. It is therefore of great interest that the HD-Zip IV subfamily serves as a key regulator of epidermal and cuticle development in plants. However, research on their expansion trajectory, adaptive evolution, and functional divergence remains scarce. Methods: We conducted a functional divergence and adaptive evolution analysis in the plant HD-Zip IV subfamily and confirmed the functional differentiation caused by positive selection site mutations through binding affinity prediction analysis and EMSA experiments. Results: Our findings revealed that the HD-Zip IV subfamily has diversified into five distinct branches, with progressive expansion throughout plant evolution. These variations in the HD (homeodomain) and START drive the functional differentiation among the evolutionary branch, in particular, a distinct leucine-rich motif in HD and lipid-binding pockets. Furthermore, we identified several amino acid sites within the START domain that have been under selective pressure during plant evolution, as well as convergent evolutionary sites shared between early land plants and seed plants. Conclusions: Our findings show that the HD-Zip IV subfamily experienced a significant expansion in gene number and diversification of evolutionary branch from streptophyte algae to seed plants. During plant evolution, genomic duplication events and variations in the HD and START have contributed to its expansion in gene number and diversification, respectively. We suggest that the balanced coexistence of functional robustness and relaxed constraints in the HD-Zip IV subfamily may have underpinned its successful response to the challenges of land environment.
1. Introduction
As sessile organisms, plants have evolved unique adaptive strategies to cope with various abiotic and biotic stresses, such as drought tolerance, diverse reproductive models, and specialized pollination systems. The formation of these adaptive strategies primarily relies on a complex regulatory network mediated by plant-specific transcription factors [1,2]. The expansion of these transcription factors has been a critical driver of plant adaptive evolution [3]. Among them, the HD-Zip (homeodomain-leucine zipper protein) gene family plays a pivotal role in stress responses, epidermal cell specialization, and the synthesis of essential metabolites, all of which are indispensable for plant adaptation to terrestrial environments. In plant evolution, the HD-Zip family has progressively diverged into four branches: HD-Zip I, HD-Zip II, HD-Zip III, and HD-Zip IV [4]. Differences in gene structure, domain composition, and physiological functions characterize these branches.
Within its four branches, the HD-Zip IV subfamily characterized by the unique combination of a homeodomain (HD) and a START (steroidogenic acute regulatory protein-related lipid transfer) domain, which confers lipid/sterol-binding capacity and is essential for lipid metabolism and epidermal cell specialization [5,6]. Phylogenetic analyses reveal that HD-Zip IV genes originated in charophycean green algae and expanded substantially during plant terrestrialization, with gene duplication events occurring independently in bryophytes, lycophytes, and euphyllophytes [7,8]. This evolutionary expansion represents a primary form of START domain diversification in plants and coincides with major evolutionary transitions [9]. The HD expansion, in particular, has been proposed as a key evolutionary force in major events, such as the Cambrian explosion [10,11]. The expansion of the HD-Zip IV subfamily is essential for plant adaptation to terrestrial environments, including protection against drought, UV radiation, and high temperatures. Its members play critical roles in lipid synthesis, water status regulation, defense, and reproductive strategies [12,13]. In rice, the HD-Zip IV gene ROC1 (ROC1KO) regulates leaf rolling and drought response by forming heterodimers with ROC5 and ROC8, it has been experimentally validated that co-overexpression of ROC1/ROC8 both enhanced drought tolerance to plants [14]. Another HD-Zip IV gene in rice, ROC4, has been identified as a positive regulator of wax biosynthesis and consequently enhances drought tolerance [15]. In Arabidopsis. thaliana, the HD-Zip IV gene EDT1 (Enhanced Drought Tolerance 1) has been demonstrated to play a critical role in regulating root system architecture and leaf stomatal density. Furthermore, overexpression of EDT1 leads to significantly enhanced drought tolerance in transgenic plants [16]. Across diverse crop species, HD-Zip IV genes have evolved specialized functions in regulating epidermal development, stress responses, and metabolic processes [17,18,19]. For example, in maize, the HD-Zip IV subfamily gene OCL4 inhibits trichome development and regulates anther cell wall division, with over-expression (OE) plants exhibiting a macrohair phenotype [20]. In tomatoes, three HD-Zip IV subfamily genes promote interlocking trichome formation, supporting the cleistogamy reproductive mechanism [21]. In Artemisia annua, the HD-ZIP IV subfamily transcription factor AaHD8 regulates cuticle-related enzyme genes, influencing leaf cuticle development [22]. These specialized adaptations demonstrate how HD-Zip IV genes enhance plant tolerance to environmental stresses by orchestrating the development of epidermal structures and their appendage synthesis, including leaf stomata, wax biosynthesis, and the cuticular layer [23]. The functional diversification of HD-Zip IV genes across plant lineages reflects their continuous evolution and adaptation. These transcription factors integrate metabolic signals through their START domains while driving epidermal cell fate determination, ultimately enabling plants to develop specialized epidermal features and produce protective compounds such as waxes, alkaloids, and resins to combat diverse biotic and abiotic challenges [24]. The HD-Zip IV subfamily members synergistically shape epidermal development and drive adaptive evolution in terrestrial plants [25].
Without a doubt, the evolution of the HD-Zip IV subfamily provides critical insights into the roles of lipid/sterol signaling and epidermal cell development in the plant terrestrialization process. Despite its evolutionary significance, research on HD-Zip IV subfamily genes has primarily focused on single gene functions, with limited attention to how they expand and undergo functional differentiation and whether their expansion follows any universal principles. To address these gaps, we selected 35 representative species spanning the major evolutionary lineages of plants for evolutionary analysis. A comparative analysis was conducted to trace the expansion trajectories of the HD-Zip IV subfamily during plant evolution. Adaptive selection and convergent evolutionary sites within the HD-Zip IV subfamily I branch were investigated. Furthermore, EMSA analysis confirmed that the L43R mutation in AtPDF2 significantly enhanced its binding specificity capacity to the tandem repeats of the L1 box core motif. This study aims at elucidating the molecular basis of functional differentiation and adaptive evolution within the HD-Zip IV subfamily, and analyzing the dynamics of its expansion during plant evolution. This study provides new insights into the adaptive evolutionary mechanisms driven by the HD-Zip IV subfamily and future functional research.
2. Materials and Methods
2.1. Preparation of Plant Species Genomic Data
In this study, genomic, protein sequence, and corresponding CDS data for 35 species were obtained from the Phytozome 13 database (https://phytozome-next.jgi.doe.gov/ (accessed on 11 September 2024)). These species were Amborella trichopoda, Ananas comosus, A. thaliana, Aquilegia coerulea, Beta vulgaris, Brachypodium distachyon, Brassica rapa, Chara braunii, Chenopodium quinoa, Chrysanthemum indicum, Cinnamomum kanehirae, Cucumis sativus, Cycas panzhihuaensis, Elaeis guineensis, Ginkgo biloba, Glycine max, Gossypium hirsutum, Klebsormidium nitens, Litchi chinensis, Musa acuminata, Nelumbo nucifera, Nymphaea colorata, Oryza sativa, Physcomitrium patens, Populus trichocarpa, Rhododendron delavayi, Santalum yasi, Selaginella moellendorffii, Solanum lycopersicum, Solanum tuberosum, Spirodela coerulea, Spirogyra pratensis, Theobroma cacao, Vitis vinifera and Zea mays. Additionally, genomic and protein sequences of 11 bryophytes and 10 lycophytes and ferns were retrieved from the National Genomics Data Centre (https://ngdc.cncb.ac.cn/ (accessed on 27 August 2024)). The species were Adiantum capillus, Alsophila spinulosa, Azolla filiculoides, Bryum argenteum, Calohypnum plumiforme, Ceratodon purpureus, Ceratopteris richardii, Entodon seductrix, Fontinalis antipyretica, Hypnum curvifolium, Isoetes taiwanensis, Marchantia paleacea, Marchantia polymorpha, Marsilea vestita, Physcomitrium patens, Pohlia nutans, Salvinia cucullata, Selaginella lepidophylla, Selaginella moellendorffii, Selaginella tamariscina, Syntrichia caninervis.
2.2. Time-Calibrated Phylogeny of 35 Plant Species and Whole-Genome Identification of HD-Zip IV Subfamily Genes
To identify homologous sequences of HD-Zip IV subfamily genes, homologous sequences of the HD-Zip IV subfamily were identified through a comprehensive search strategy. First, protein sequences of 16 known A. thaliana HD-Zip IV genes were used as queries to perform a local BLASTP search against the proteome datasets of the target species using TBtools v2.070 with E-value cutoff of 1 × 10−10. Second, to capture more divergent homologs that might have been missed by the similarity search, all candidate sequences were subjected to a domain-based verification using the HMMER module within TBtools. This step utilized the curated PFAM profiles for the Homeobox domain (PF00046.24) and the HD-Zip C-terminal domain (PF01852.14) [26]. Finally, to ensure a non-redundant and high-quality dataset, we removed fragmentary sequences (length < 300 amino acids) and retained only the longest protein isoform for each gene locus. The time-calibrated phylogeny for 35 selected plant species, and the time-calibrated phylogeny for 21 bryophytes, lycophytes and ferns, were constructed using TIMETREE 5 (https://timetree.org/ (accessed on 13 August 2024)). This resource synthesizes divergence time estimates from numerous published studies to generate a consensus timeline of evolution. The resulting trees, which use median divergence times for branch lengths, were visualized using TreeView V2.2.0 software. Simplified species illustrations for the phylogenetic tree were sourced from PhyloPic (https://www.phylopic.org/ (accessed on 23 September 2024)) [27].
2.3. Sequence Alignment and Phylogenetic Evolutionary Analysis
Identified HD-Zip IV subfamily gene sequences were aligned using the CLUSTALW module of MEGA 12, with the following parameters: Gap Opening Penalty setx sadscc to 10.00, Gap Extension Penalty to 0.2, and Delay Divergent Cutoff to 30% [28]. The phylogenetic tree was constructed using the Maximum Likelihood (ML) method in MEGA 12 with 2000 bootstrap replicates. Branches with bootstrap values ≥ 70% were considered moderately supported, and values ≥ 95% were considered highly supported. The final tree was visualized and annotated using Tree Of Life (iTOL) v5 [29].
2.4. Protein–DNA Docking and Visualization of the HD-Zip IV Subfamily
Protein structure prediction for conserved HD-Zip IV subfamily protein or HD was performed using AlphaFold2 [30]. The model with the highest pLDDT (Predicted Local Distance Difference Test) confidence score was selected for subsequent analysis. Protein–DNA docking analysis, focusing on interactions between HD and the L1-box core sequence (5′-TAAATG(C/T)A-3′), was conducted using the HDOCK server (hdock.phys.hust.edu.cn/ (accessed on 16 August 2024)). The docking search was performed in “Protein-DNA” mode, with the search space and sampling parameters set to default. The top 3 docking poses were generated and evaluated. Visualization and analysis of the 2D interaction diagrams of the DNA–protein complexes were carried out using LigPlot+. The 3D structures of the complexes were rendered and visualized using PyMOL Molecular Graphics System, Version 2.5.0 [31].
2.5. Amino Acid Conservation Analysis of HD in the HD-ZIP IV Subfamily
Amino acid conservation in HD of various HD-ZIP IV subfamily branches was analyzed using the ConSurf web server (https://consurf.tau.ac.il/ (accessed on 8 November 2024)) [32]. For each distinct branch, the member sequence that was closest in genetic distance to the branch’s consensus was selected as the template for that analysis. Multiple sequence alignments (MSA) were generated using MAFFT-L-INS-i against the UniRef90 sequence database. Conservation scores were calculated using the Bayesian method with the JTT evolutionary substitution model. The resulting conservation grades, ranging from 1 (variable) to 9 (highly conserved), were mapped onto the corresponding protein structure.
2.6. DAP-Seq Target Gene Profiling and Comparison Among Arabidopsis HD-Zip IV Transcription Factors
DAP-seq data for Arabidopsis HD-Zip IV subfamily genes (PDF2, HDG1, and ANL2) were retrieved from the Plant Cistrome Database (neomorph.salk.edu/dap_web/pages/index.php (accessed on 11 July 2024)) [33]. Gene functional enrichment analysis was performed using clusterProfiler R package, and data visualization was carried out with Origin 2021 software for data visualization and the KEGG database (kegg.jp/) for pathway analysis [34].
2.7. DNA–Protein Complex Binding Affinity Analysis
PDA-Pred software (https://web.iitm.ac.in/bioinfo2/pdapred/ (accessed on 6 November 2024)) was used to predict binding affinities between DNA and proteins [35]. Predictions were generated using the software’s default Random Forest regression model. The predictive performance of the model achieved a correlation coefficient of 0.86 and a mean absolute error (MAE) of 0.76 kcal/mol, indicating high reliability.
2.8. Functional Divergence and Adaptive Evolution Analysis
Functional divergence among 393 HD-Zip IV subfamily proteins was assessed using DIVERGE version 4, examining type I (Q1) and type II (Q2) divergence [36]. Adaptive evolution of 122 branch I HD-Zip IV subfamily genes was analyzed using EasyCodeML 1.4 under pre-set models, including pairwise comparisons (M0 vs. M3, M1a vs. M2a, M7 vs. M8, and M8a vs. M8) [37].
2.9. Electrophoretic Mobility Shift Assays
The coding sequences of AtPDF2 and AtPDF2-HD(L43R) were inserted into the pET-30a vector. AtPDF2-his and AtPDF2-HD(L43R)-his proteins were expressed in E. coli BL21 (DE3) by inducing with 0.5 mm IPTG for 12 h at 16 °C. The his-tagged proteins were purified using IDA-Ni agarose magnetic beads from Beyotime Biotechnology company (Beijing, China). The biotinylated probe of the TAATTAATTAATTAATTAATTAATTAATTAAT sequence and TCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCT sequence was synthesized by Sangon Biotech (Shanghai, China). Electrophoretic mobility shift assays was performed using a chemiluminescence-based EMSA kit from Beyotime Biotechnology company. The competitor probe was non-biotinylated TAATTAATTAATTAATTAATTAATTAATTAAT sequence.
3. Results
3.1. Genome-Wide Identification and Evolutionary Analysis of Plant HD-Zip IV Subfamily Members
To comprehensively investigate the evolutionary expansion of HD-Zip IV subfamily genes in plants, 35 representative species were selected, encompassing major evolutionary branches from algae to angiosperms. These species include three streptophyte algae, one bryophyte, onelycopodiophyte, two gymnosperms, and twenty-eight angiosperms from various orders, spanning over hundreds of millions of years in plant evolution (Figure 1; Table S1). A total of 393 HD-Zip IV subfamily members were identified across these 35 species and classified into five evolutionary branches, consistent with the phylogenetic analysis of HD-Zip IV subfamily members in Arabidopsis (Figure 2; Figure S1). In the streptophyte algaes, only one HD-Zip IV subfamily members was identified in each of chlorophyta species studied: Klebsormidium nitens, Chara braunii and Spirogyra pratensis. In contrast, four members were detected in the bryophytes specie Physcomitrium patens, and eight were identified in the lycophyte species Selaginella moellendorffii. Among seed plants, the number of HD-Zip IV subfamily members varied significantly, ranging from 5 to 29 (Figure 2; Figure S2). From early land plants to seed plants, the HD-Zip IV subfamilies exhibited expansion of members and branches; however, the degree of expansion varied markedly across species, with no apparent correlation to their evolutionary position or development complexity. Furthermore, an analysis of the distribution of 393 HD-Zip IV subfamily sequences across branches revealed that, in the streptophyte algaes and early land plants, genes were restricted to one branch. In contrast, the HD-Zip IV subfamily of seed plants expanded into five branch (Figure S2). Gymnosperms and certain monocots lacked HD-Zip IV subfamily members in branch IV and branch V. This analysis suggests that the HD-Zip IV subfamily show an evolutionary expansion in geological timescales.
Figure 1.
Time-calibrated phylogeny and HD-Zip IV subfamily gene count of 35 species. The figure includes a geologic timescale representing the chronological system of plant evolution. Time (MYA): indicating millions of years ago.
Figure 2.
Phylogenetic tree of 393 HD-Zip IV family proteins in plants.
3.2. Genomic Identification and Evolutionary Expansion Analysis of the HD-Zip IV Subfamily in Non-Seed Plants
To explore the evolutionary expansion of the HD-Zip IV subfamily in early-diverging land plants, genomic identification of HD-Zip IV subfamily genes was conducted in 11 bryophytes and 10 lycophytes and ferns (Figure 3; Table S2). A total of 23 sequences were identified in bryophytes and 83 in lycophytes and ferns, resulting in 106 sequences overall (Figure S3). Fewer members were generally observed in bryophytes, which exhibited strong evolutionary conservation and limited gene expansion (Figure 3; Figure S3). In lycophytes and ferns, the number of HD-Zip IV subfamily gene exhibited a rapid expansion. Within the four species of lycophytes, the number of HD-Zip IV subfamily members ranged from 2 to 8, progressively increasing to 14 members in Adiantum capillus, and gradually diverged into new evolutionary branch (Figure 3; Figure S3). This trend suggests that the expansion of HD-Zip IV subfamily genes became more pronounced as plants became more terrestrial.
Figure 3.
Time-calibrated phylogeny and HD-Zip IV subfamily gene count of 11 bryophytes and 10 lycophytes and ferns.
3.3. Functional Divergence Analysis of Plant HD-Zip IV Subfamily Proteins
Throughout the evolutionary history of plant terrestrialization, the HD-Zip IV subfamily has served as a critical transcriptional regulator, orchestrating plant hormone signaling, which in turn affects epidermal cell fate and properties. While functional redundancy exists among evolutionary branches, distinct roles, and functional divergence are evident. This is supported by the characterization of the five evolutionary branches of the Arabidopsis HD-Zip IV subfamily (Figure S1). To explore the genetic and evolutionary basis of functional divergence among different evolutionary branches of the HD-Zip IV subfamily, type I and type II divergence analyses were conducted among the five branches using DIVERGE 4 software. The analysis revealed positive θ1 values for all branch pairs except the branch I/branch IV pair (θ1 ± S.E. < 0; LRT < 1). Several amino acid sites showed posterior probabilities of Qk > 0.8 (Table 1). Functional divergence sites were identified across branches as follows: 5 (I/II), 1 (I/III), 5 (I/V), 3 (II/III), 7 (II/IV), 5 (II/V), 13 (III/IV), 1 (III/V), and 9 (IV/V) (Figure 4A,B; Table S3). These sites were concentrated at the bottom and sides of the hydrophobic lipid-binding pocket in the START domain of HD-Zip IV subfamily transcription factors (Figure 4C). Type II analyses showed negative θ2 ± S.E. values among all branches, suggesting that functional divergence is driven by amino acid evolution rate in the START domain rather than changes in physicochemical properties.
Table 1.
Functional divergence analysis of HD-Zip IV subfamily branches.
Figure 4.
Functional divergence analysis of HD-Zip IV subfamily proteins in plants. (A) Heatmap showing divergence sites with QK > 0.8 among the five evolutionary branches of HD-Zip IV subfamily. (B) Conservation analysis of amino acid divergence across evolutionary branches. (C) Protein structure diagram of HD-Zip IV subfamily members and three-dimensional structure representation of HD-Zip IV subfamily proteins. The divergence sites are localized to the lipid-binding pocket, the structure is shown in the light-blue section.
As lipid/steroid binding transcription factors, HD-Zip IV subfamily proteins play a critical role in lipid–ligand interactions through the START domain. Additionally, the HD, responsible for DNA binding and regulation, contributes significantly to functional differentiation. Conservation analysis using conSurf software identified amino acid variations in the HD across branches, in particular, a leucine-rich motif RXXLXXXLXLXXR in HD (Figure 5A). Docking analyses using HDOCK and subsequent affinity evaluations with LigPlot and PDA-Pred revealed significant differences in the hydrophobic interactions and binding stability between HD and the L1 box core sequence (TAATTAAT) across branches (Figure 5B–F). For example, the HD of branch I and the TAATTAAT motif exhibited stronger hydrophobic interaction and higher binding free energy (ΔG), indicating more stable DNA binding (Figure 5B,G). In contrast, the HD of branch II demonstrated weaker interactions, with a ΔG of −7.65 kcal/mol, suggesting reduced binding stability (Figure 5C,G). The hydrophobic interactions of branches III, IV, and V were more concentrated, with intermediate binding stability between branches I and II (Figure 5D–G). These differences in DNA binding affinity and binding specificity likely contribute to variations in the regulation of downstream target genes, driving functional divergence among branches. DAP-seq analysis of Arabidopsis HD-Zip IV subfamily members supports this conclusion (Figure 6). For instance, branch I protein PDF2, with high binding specificity to the L1 box, regulates 1980 target genes. In contrast, branch II proteins ANL2 and HDG1, which exhibit weaker binding specificity to the L1-box, regulate 5146 and 7551 genes, respectively, with significant overlap (4416 co-regulated genes) and consistent functional pathways (Figure 6A–D).
Figure 5.
Amino acid variations and DNA binding affinity in the HD. (A) Conservation analysis of HD across different branches of the HD-Zip IV subfamily. The conservation grades range from 1 (variable) to 9 (highly conserved). (B–F) Docking visualizations of the HD of five branches with L1-box core sequences. (G) Predicted protein-DNA binding affinities the HD of five branches with L1-box core sequences.
Figure 6.
Target gene analysis of HD-Zip IV subfamily transcription factors in A. thaliana. (A) Venn diagram illustrating target gene overlap among HD-Zip IV subfamily members. (B–D) KEGG pathway enrichment analyses for HDG1, PDF2 and ANL2 regulatory target genes. The highlighted components represent the KEGG pathways that are shared among the target gene sets of the three transcription factors.
3.4. Adaptive Evolution and Convergent Selection Analysis of Early Evolutionary Branches
Darwinian positive selection plays a pivotal role in shaping the adaptive evolution of plant gene families. In this study, we examined the selective pressure influencing the evolutionary trajectory of the HD-Zip IV subfamily I branch during plant evolution. Using easycodeML software, we analyzed adaptive evolution in HD-Zip IV subfamily branch I genes in plants. In the site-specific model analysis, the ω (omega) value of the M0 model was 0.70164, indicating that the HD-Zip IV subfamily has been predominantly subjected to purifying selection. Comparisons of models M3 and M2a with models M0 and M1a revealed no positively selected amino acid sites. However, the comparison of model M8 with model M7 identified 47 positively selected sites, mainly concentrated within the START domain (Table 2; Table S3). These findings suggest that certain amino acid sites within the START domain of branch I in the HD-Zip IV subfamily have undergone strong positive selection. To assess whether selective pressures influenced the evolutionary transition of HD-Zip IV subfamily genes from early land plant to seed plant, a branch-site model analysis was conducted. Designating early land plant HD-Zip IV subfamily genes from branch I as the foreground branch, the comparison of branch-site model A with the null model (LRT, p = 0.0998871617) revealed no significant positive selection sites between early land plants and seed plant HD-Zip IV subfamily genes (Table 2). Thus, branch I genes of the HD-Zip IV subfamily are largely constrained by purifying selection across conserved domains such as HD, Zipper–Loop–Zipper’s (ZLZ) START adjacent domain (SAD). In contrast, specific sites within the START itself show signatures of adaptive evolution.
Table 2.
Positive selection sites in HD-Zip IV subfamily branch I genes.
A convergent evolution analysis was also performed for HD-Zip IV subfamily branch I genes. Convergent evolution sites were detected in the HD and SAD between early land plants and seed plants. In the HD, the following amino acid mutations were observed: at position 10, non-polar fatty acid residues (L/P/A/V) mutated to the polar amino acid Q; at position 32, neutral residues M/Q mutated to the basic amino acid K; and at position 43, the basic amino acid R mutated to the non-polar residue L (Figure S4). The R43L mutation decreased the binding free energy (ΔG) between the HD and the L1-box core sequence (TAATTAAT) from −13.6 kcal/mol to −9.17 kcal/mol, reducing binding stability (Figure 7A,B). Moreover, EMSA assays confirmed that the L43R mutation in the HD of PDF2 (a branch I member of Arabidopsis HD-Zip IV subfamily) significantly enhanced its binding capacity to the tandem repeats of the L1 box core motif (Figure 7C) These findings imply that relative to those early land plants, a binding affinity- and specificity-weakened mutation to the L1 box core motif has become widespread in seed plants during evolution. In the SAD, seven convergent evolution sites were identified: Q496E, T499V, N501S, N535D, E538R, A643D, and S707D. These mutations may significantly alter the protein properties (Figure S5). These findings indicate that HD-Zip IV subfamily branch I genes exhibit molecular-level convergent evolution in both the HD and SAD domains, contributing to functional optimization and adaptive modifications across diverse plant lineages.
Figure 7.
Convergent evolution in HD-Zip IV subfamily I branch. (A) Docking visualization of the wild-type HD with L1-box motifs. The red and light-red triangular arrows are used to indicate the forces exerted on the specific residues at positions 42 and 43 in HD domian during the molecular interaction with the L1-box motif. (B) Docking visualization of the R43L mutant HD with L1-box motifs. The red and light-red triangular arrows are used to indicate the forces exerted on the specific residues at positions 42 and 43 in HD domian during the molecular interaction with the L1-box motif. (C) Electrophoretic mobility shift assays were performed to examine the binding of PDF2-his and its mutant PDF2-HD(L43R)-his proteins with the biotin-labeled L1-box core motif. Shift represent gel shift band, free probe refers to the unbound probe that has not formed a complex with the target protein.
4. Discussion
4.1. The Evolutionary History of the HD-Zip IV Subfamily Provides Insights into Plant Terrestrialization
Gene families emerge from ancestral genes through mechanisms such as duplication and transposition. The transition from single to multiple members and the functional diversification of gene families reflect major evolutionary processes, including natural selection, genome duplication, and genetic drift [38]. In this study, we found that the first duplication or expansion of the HD-Zip IV subfamily occurred after the Marchantiales (Figure 3), and the members of HD-Zip IV subfamily gradually expand from early land plants to seed plants, especially after selaginella moellendorffii (Figure 1). At these HD-Zip IV gene family expansion nodes, Guan et al. confirmed that Marchantiales, an early-diverging lineage of land plants, represents a pivotal stage in the evolutionary transition of plants to terrestrial environments [39,40]. During this period, plants began developing central vascular bundles and enhancing cuticle formation, driven by alternating dry and wet conditions and long-term symbiotic interactions with fungi [41,42]. Furthermore, the early vascular plant Selaginella moellendorffii is characterized by the development of foundational terrestrial adaptations, including vascular system, stomata and a cuticle [43,44]. In summary, these findings suggest that the expansion of the HD-Zip IV subfamily is intricately tied to plant terrestrialization, playing a key role in the emergence of terrestrial adaptive traits.
From another perspective, our findings reveal that the evolution of HD-Zip IV subfamily maintained relatively conservation and only minimal gene expansion in bryophytes (Figure S3), and these was a rapid expansion of HD-Zip IV subfamily members and new evolutionary branches emerged majority of lycophyte and fern species (Figure 3). This result may be related to the WGD events or possibly horizontal gene transfer of lycophytes and ferns, which occurred about 30 million years ago [45,46]. Conversely, the HD-Zip IV subfamily did not expand in Selaginella species, which lack WGD events [47]. For instance, Selaginella moellendorffii and Selaginella tamariscina possess only two and four HD-Zip IV subfamily members, respectively (Figure 3). Such variation is also observed in seed plants. Beta vulgaris (sugar beet) has only half the number of HD-Zip IV subfamily members identified in Chenopodium quinoa, despite both being Amaranthaceae species (Figure 1). Beta vulgaris has also experienced fewer genome duplication events than other species in its lineage. Moreover, Chenopodium quinoa shows greater resilience to extreme conditions than Beta vulgaris, and overexpressing the HD-Zip IV subfamily gene EsHdzip1 exhibited increased drought resistance in tobacco [48,49,50]. These observations suggest that the evolution of the HD-Zip IV subfamily is not only synchronized with plant terrestrialization on the evolutionary timescale but also serves as a key player and molecular record of pivotal evolutionary events in plants. This provides invaluable insights into evolutionary processes spanning hundreds of millions of years. These observations suggest that the expansion of the HD-Zip IV subfamily was closely linked to genome duplication events and that it was an important contributor to the evolution of terrestrial adaptation in plants.
4.2. Site Variations in the HD and START Domains Is a Key Driver for Functional Diversification in the HD-Zip IV Subfamily
A comparative analysis of amino acid conservation within the HD-Zip IV subfamily revealed that divergence sites are primarily concentrated in the HD and START domains, specifically, in the leucine-rich region (spanning residues 31–43) of the HD (Figure 5A). Among these sites, variations at positions 31, 38, and 40 have been shown to influence the DNA binding specificity of HD-Zip IV subfamily transcription factor [51]. Meanwhile, our EMSA assays on the divergence site variation at site 43 in the HD domain demonstrated that this mutation in a conserved residue significantly alters the binding affinity of HD-Zip IV transcription factors for the L1-box motif (Figure 7C). Thus, these divergence sites ultimately led to significant differences in the predicted binding affinity for the L1-box motif among different subfamily lineages (Figure 5G). This functional divergence, in turn, resulted in starkly different regulatory target sets between branch I member PDF2 and branch II members HDG1/ANL2 of the Arabidopsis HD-Zip IV subfamily (Figure 6). In all, these findings highlight significant functional differentiation among HD-Zip IV family branches, driven primarily by amino acid variations in the HD domain.
Similarly, variations in the START domain are likely pivotal in the evolution and functional diversification of the HD-Zip IV subfamily. This study identified numerous branch-specific variations within the START domain, primarily concentrated in the hydrophobic lipid-binding pocket (Figure 4C). These variations can influence the START domain’s ability to bind lipid molecules, thereby affecting lipid-binding affinity and responses to environmental stress [52,53]. Furthermore, deletion or mutation of the START can also influence the regulatory activity of HD-Zip IV subfamily transcription factors by altering their dimerization and protein stability [54]. Together, these variations in the HD and START synergistically drive the functional diversification of the HD-Zip IV subfamily.
4.3. Balance Strategy in the Adaptive Evolution of the HD-Zip IV Subfamily
In biological evolution, purifying selection and positive selection are two opposing forces that collectively drive adaptive evolution. This study reveals that both purifying and positive selection simultaneously influence the evolution of the HD-Zip IV subfamily branch I, targeting distinct domains (Table 2). A total of 47 positive selection sites in branch I are concentrated in the START domain, and the abundance of positively selected sites suggests that the START domain has undergone strong positive selection during plant evolution (Table S3). Other gene families also show an abundance of positive selection sites similar to what we observed. In the NBS-LRR disease resistance gene family, Mariana et al. identified widespread positive selection within the LRR domain, reflecting the ongoing co-evolution between host plants and pathogens [55]. In the citrus NBS gene family, 18–24 positively selected sites were identified per branch, with the accumulation of these mutations accelerating functional divergence [56]. Similarly, in the GRAS gene family, evolutionary analysis revealed 11 strongly positively selected sites within α-helical domains in branches I and V, contributing to the evolution of conserved motifs and functional specialization [57]. Although some of the 47 positively selected sites found in the HD-Zip IV subfamily I branch may represent compensatory mutations with limited evolutionary impact, our findings underscore the substantial adaptive pressures acting on the START during evolution. These pressures have driven the evolutionary diversification of HD-Zip IV subfamilies, enabling their adaptation to increasingly complex terrestrial environments. In contrast, other domains of the HD-Zip IV subfamily, the HD, namely, ZLZ, and SAD are predominantly shaped by purifying selection (Figure 4C). This ensures the functional stability of these domains, which play critical roles in maintaining conserved physiological functions. This strategy, balancing differential positive and purifying selection across various domains, is commonly observed in the evolution of numerous animal and plant species. Such functional variation resulting from this balance grants organisms an enhanced adaptive capacity, allowing them to succeed in diverse and changing habitats [58,59]. Therefore, the balance between functional robustness and relaxed constraints may confer greater functional flexibility for the HD-Zip IV subfamily to respond to challenging land environments.
Supplementary Materials
The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/genes16111348/s1, Figure S1. Phylogenetic tree of HD-Zip IV family proteins in A. thaliana. Figure S2. The number of members in the five evolutionary branches of the HD-Zip IV family across different species. Figure S3. Phylogenetic tree of HD-Zip IV family proteins in 21 plant species, namely, 11 bryophytes and 10 pteridophytes. Figure S4. Convergent evolution sites in HD of HD-Zip IV subfamily I branch. Figure S5. Convergent evolutionary sites in the HD-Zip IV SAD across plant species. Table S1. Names and genus information of the 35 species. Table S2. Detailed information on the names and genus of the 21 species. Table S3. Positive selective site identified using the M8 model.
Author Contributions
Y.L. and Z.L.: writing—original. Y.L: writing—review and editing. Z.L., G.Q. and L.L.: investigation. Y.L., H.S. and Y.G.: data curation. Y.L.: validation. Y.L., M.Y., and G.H.: formal analysis. F.L., K.Z. and F.P.: conceptualization and supervision. All authors have read and agreed to the published version of the manuscript.
Funding
This study was supported by the Tianshan Talent Training Program of Xinjiang (Grant 2022TSYCCX0087), the National Natural Science Foundation of China (Grants 32441062 and 32360509), the Xinjiang Science and Technology Major Project of China (Grant 2024A02002), Key Research and Development Program of Xinjiang (Grant 2022B02052), and the National Key Laboratory of Cotton Bio-breeding and Integrated Utilization (CBIU2024004).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
No new data were generated for this study. All data are included in the manuscript.
Conflicts of Interest
The authors declare that they have no competing financial interests or personal relationships that could have influenced the work reported in this paper.
References
- Pereira-Santana, A.; Alcaraz, L.D.; Castano, E.; Sanchez-Calderon, L.; Sanchez-Teyer, F.; Rodriguez-Zapata, L. Comparative genomics of NAC transcriptional factors in angiosperms: Implications for the adaptation and diversification of flowering plants. PLoS ONE 2015, 10, e0141866. [Google Scholar] [CrossRef]
- Waschburger, E.L.; Filgueiras, J.P.C.; Turchetto-Zolet, A.C. DOF gene family expansion and diversification. Genet. Mol. Biol. 2023, 46, e20230109. [Google Scholar] [CrossRef] [PubMed]
- Martin, C.; Ellis, N.; Rook, F. Do transcription factors play special roles in adaptive variation? Plant Physiol. 2010, 154, 506–511. [Google Scholar] [CrossRef] [PubMed]
- Mukherjee, K.; Brocchieri, L.; Bürglin, T.R. A comprehensive classification and evolutionary analysis of plant homeobox genes. Mol. Biol. Evol. 2009, 26, 2775–2794. [Google Scholar] [CrossRef]
- Javaid, M.; Ashfaq, U.; Ijaz, M.; Naz, B.; Saleem, B.; Arshad, A. Computational analysis of homeodomain-leucine zipper (HD-ZIP) family of transcription factors in Arabidopsis thaliana. JAPS J. Anim. Plant Sci. 2021, 31, 1145–1159. [Google Scholar] [CrossRef]
- Ariel, F.D.; Manavella, P.A.; Dezar, C.A.; Chan, R.L. The true story of the HD-Zip family. Trends Plant Sci. 2007, 12, 419–426. [Google Scholar] [CrossRef]
- Zalewski, C.S.; Floyd, S.K.; Furumizu, C.; Sakakibara, K.; Stevenson, D.W.; Bowman, J.L. Evolution of the class IV HD-zip gene family in streptophytes. Mol. Biol. Evol. 2013, 30, 2347–2365. [Google Scholar] [CrossRef]
- Romani, F.; Reinheimer, R.; Florent, S.N.; Bowman, J.L.; Moreno, J.E. Evolutionary history of HOMEODOMAIN LEUCINE ZIPPER transcription factors during plant transition to land. New Phytol. 2018, 219, 408–421. [Google Scholar] [CrossRef]
- Schrick, K.; Nguyen, D.; Karlowski, W.M.; Mayer, K.F. START lipid/sterol-binding domains are amplified in plants and are predominantly associated with homeodomain transcription factors. Genome Biol. 2004, 5, R41. [Google Scholar] [CrossRef] [PubMed]
- Sukumaran, P. Cambrian explosion of life: The Big Bang in metazoan evolution. Resonance 2004, 9, 38–50. [Google Scholar] [CrossRef]
- Vonk, P.J.; Ohm, R.A. The role of homeodomain transcription factors in fungal development. Fungal Biol. Rev. 2018, 32, 219–230. [Google Scholar] [CrossRef]
- Sharif, R.; Raza, A.; Chen, P.; Li, Y.; El-Ballat, E.M.; Rauf, A.; Hano, C.; El-Esawi, M.A. HD-ZIP gene family: Potential roles in improving plant growth and regulating stress-responsive mechanisms in plants. Genes 2021, 12, 1256. [Google Scholar] [CrossRef]
- Ma, G.; Zelman, A.K.; Apicella, P.V.; Berkowitz, G. Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HD-ZIP IV) Gene Family in Cannabis sativa L. Plants 2022, 11, 1307. Plants 2022, 11, 1307. [Google Scholar] [CrossRef] [PubMed]
- Tao, Z.; Miao, X.; Shi, Z. HD-ZIP IV Gene ROC1 Regulates Leaf Rolling and Drought Response Through Formation of Heterodimers with ROC5 and ROC8 in Rice. Rice 2024, 17, 45. [Google Scholar] [CrossRef]
- Wang, Z.; Tian, X.; Zhao, Q.; Liu, Z.; Li, X.; Ren, Y.; Tang, J.; Fang, J.; Xu, Q.; Bu, Q. The E3 ligase DROUGHT HYPERSENSITIVE negatively regulates cuticular wax biosynthesis by promoting the degradation of transcription factor ROC4 in rice. Plant Cell 2018, 30, 228–244. [Google Scholar] [CrossRef]
- Yu, H.; Chen, X.; Hong, Y.-Y.; Wang, Y.; Xu, P.; Ke, S.-D.; Liu, H.-Y.; Zhu, J.-K.; Oliver, D.J.; Xiang, C.-B. Activated expression of an Arabidopsis HD-START protein confers drought tolerance with improved root system and reduced stomatal density. Plant Cell 2008, 20, 1134–1151. [Google Scholar] [CrossRef]
- Chen, E.; Zhang, X.; Yang, Z.; Wang, X.; Yang, Z.; Zhang, C.; Wu, Z.; Kong, D.; Liu, Z.; Zhao, G.; et al. Genome-wide analysis of the HD-ZIP IV transcription factor family in Gossypium arboreum and GaHDG11 involved in osmotic tolerance in transgenic Arabidopsis. Mol. Genet. Genom. 2017, 292, 593–609. [Google Scholar] [CrossRef] [PubMed]
- Wu, M.; Bian, X.; Hu, S.; Huang, B.; Shen, J.; Du, Y.; Wang, Y.; Xu, M.; Xu, H.; Yang, M.; et al. A gradient of the HD-Zip regulator Woolly regulates multicellular trichome morphogenesis in tomato. Plant Cell 2024, 36, 2375–2392. [Google Scholar] [CrossRef] [PubMed]
- Cai, Y.; Bartholomew, E.S.; Dong, M.; Zhai, X.; Yin, S.; Zhang, Y.; Feng, Z.; Wu, L.; Liu, W.; Shan, N.; et al. The HD-ZIP IV transcription factor GL2-LIKE regulates male flowering time and fertility in cucumber. J. Exp. Bot. 2020, 71, 5425–5437. [Google Scholar] [CrossRef]
- Vernoud, V.; Laigle, G.; Rozier, F.; Meeley, R.B.; Perez, P.; Rogowsky, P.M. The HD-ZIP IV transcription factor OCL4 is necessary for trichome patterning and anther development in maize. Plant J. 2009, 59, 883–894. [Google Scholar] [CrossRef]
- Wu, M.; Bian, X.; Huang, B.; Du, Y.; Hu, S.; Wang, Y.; Shen, J.; Wu, S. HD-Zip proteins modify floral structures for self-pollination in tomato. Science 2024, 384, 124–130. [Google Scholar] [CrossRef]
- Yan, T.; Li, L.; Xie, L.; Chen, M.; Shen, Q.; Pan, Q.; Fu, X.; Shi, P.; Tang, Y.; Huang, H.; et al. A novel HD-ZIP IV/MIXTA complex promotes glandular trichome initiation and cuticle development in Artemisia annua. New Phytol. 2018, 218, 567–578. [Google Scholar] [CrossRef]
- Zhang, H.; Ma, X.; Li, W.; Niu, D.; Wang, Z.; Yan, X.; Yang, X.; Yang, Y.; Cui, H. Genome-wide characterization of NtHD-ZIP IV: Different roles in abiotic stress response and glandular trichome induction. BMC Plant Biol. 2019, 19, 444. [Google Scholar] [CrossRef]
- Shabala, S. Learning from halophytes: Physiological basis and strategies to improve abiotic stress tolerance in crops. Ann. Bot. 2013, 112, 1209–1221. [Google Scholar] [CrossRef]
- Schrick, K.; Ahmad, B.; Nguyen, H.V. HD-Zip IV transcription factors: Drivers of epidermal cell fate integrate metabolic signals. Curr. Opin. Plant Biol. 2023, 75, 102417. [Google Scholar] [CrossRef]
- Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
- Kumar, S.; Suleski, M.; Craig, J.M.; Kasprowicz, A.E.; Sanderford, M.; Li, M.; Stecher, G.; Hedges, S.B. TimeTree 5: An expanded resource for species divergence times. Mol. Biol. Evol. 2022, 39, msac174. [Google Scholar] [CrossRef]
- Kumar, S.; Stecher, G.; Suleski, M.; Sanderford, M.; Sharma, S.; Tamura, K. MEGA12: Molecular Evolutionary Genetic Analysis version 12 for adaptive and green computing. Mol. Biol. Evol. 2024, 41, msae263. [Google Scholar] [CrossRef] [PubMed]
- Letunic, I.; Bork, P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–W296. [Google Scholar] [CrossRef] [PubMed]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
- Laskowski, R.A.; Swindells, M.B. LigPlot+: Multiple ligand–protein interaction diagrams for drug discovery. J. Chem. Inf. Model. 2011, 15, 2778–2786. [Google Scholar] [CrossRef]
- Ben Chorin, A.; Masrati, G.; Kessel, A.; Narunsky, A.; Sprinzak, J.; Lahav, S.; Ashkenazy, H.; Ben-Tal, N. ConSurf-DB: An accessible repository for the evolutionary conservation patterns of the majority of PDB proteins. Protein Sci. 2020, 29, 258–267. [Google Scholar] [CrossRef] [PubMed]
- O’Malley, R.C.; Huang, S.-s.C.; Song, L.; Lewsey, M.G.; Bartlett, A.; Nery, J.R.; Galli, M.; Gallavotti, A.; Ecker, J.R. Cistrome and epicistrome features shape the regulatory DNA landscape. Cell 2016, 165, 1280–1292. [Google Scholar] [CrossRef] [PubMed]
- Fu, L.-Y.; Zhu, T.; Zhou, X.; Yu, R.; He, Z.; Zhang, P.; Wu, Z.; Chen, M.; Kaufmann, K.; Chen, D. ChIP-Hub provides an integrative platform for exploring plant regulome. Nat. Commun. 2022, 13, 3413. [Google Scholar] [CrossRef]
- Harini, K.; Kihara, D.; Gromiha, M.M. PDA-Pred: Predicting the binding affinity of protein-DNA complexes using machine learning techniques and structural features. Methods 2023, 213, 10–17. [Google Scholar] [CrossRef]
- Gu, X.; Zou, Y.; Su, Z.; Huang, W.; Zhou, Z.; Arendsee, Z.; Zeng, Y. An update of DIVERGE software for functional divergence analysis of protein family. Mol. Biol. Evol. 2013, 30, 1713–1719. [Google Scholar] [CrossRef]
- Gao, F.; Chen, C.; Arab, D.; Du, Z.; He, Y.; Ho, S.Y.W. EasyCodeML: A visual tool for analysis of selection using CodeML. Ecol. Evol. 2019, 9, 3891–3898. [Google Scholar] [CrossRef]
- Zhang, M.; Wu, Y.-H.; Lee, M.-K.; Liu, Y.-H.; Rong, Y.; Santos, T.S.; Wu, C.; Xie, F.; Nelson, R.L.; Zhang, H.-B. Numbers of genes in the NBS and RLK families vary by more than four-fold within a plant species and are regulated by multiple factors. Nucleic Acids Res. 2010, 38, 6513–6525. [Google Scholar] [CrossRef]
- Guan, Y.; Liu, L.; Wang, Q.; Zhao, J.; Li, P.; Hu, J.; Yang, Z.; Running, M.P.; Sun, H.; Huang, J. Gene refashioning through innovative shifting of reading frames in mosses. Nat. Commun. 2018, 9, 1555. [Google Scholar] [CrossRef]
- Zhang, J.; Fu, X.-X.; Li, R.-Q.; Zhao, X.; Liu, Y.; Li, M.-H.; Zwaenepoel, A.; Ma, H.; Goffinet, B.; Guan, Y.-L.; et al. The hornwort genome and early land plant evolution. Nat. Plants 2020, 6, 107–118. [Google Scholar] [CrossRef] [PubMed]
- Ligrone, R.; Duckett, J.G.; Renzaglia, K. Conducting tissues and phyletic relationships of bryophytes. Philos. Trans. R. Soc. London. Ser. B Biol. Sci. 2000, 355, 795–813. [Google Scholar] [CrossRef]
- Li, Y.F.; Luo, L.; Liu, Y.; He, Q.; Yu, N.N.; Gaowa, N.; Yi, Z.Q.; Wang, J.J.; Han, W.; Peng, T.; et al. The Bryophyte Phylogeny Group: A revised familial classification system based on plastid phylogenomic data. J. Syst. Evol. 2024, 62, 577–588. [Google Scholar] [CrossRef]
- Ferrari, C.; Shivhare, D.; Hansen, B.O.; Pasha, A.; Esteban, E.; Provart, N.J.; Kragler, F.; Fernie, A.; Tohge, T.; Mutwil, M. Expression atlas of Selaginella moellendorffii provides insights into the evolution of vasculature, secondary metabolism, and roots. Plant Cell 2020, 32, 853–870. [Google Scholar] [CrossRef]
- Xiong, H.; Yin, C.; Tang, D.; Xiong, X.; Liu, X.; Wang, Z.; Yu, X.; Mei, Z.; Li, J. The Telomere-to-Telomere Genome of Selaginella moellendorffii Provides Insights into Genome Evolution and Biflavone Biosynthesis. Hortic. Plant J. 2025. [Google Scholar] [CrossRef]
- Baniaga, A.E.; Arrigo, N.; Barker, M.S. The small nuclear genomes of Selaginella are associated with a low rate of genome size evolution. Genome Biol. Evol. 2016, 8, 1516–1525. [Google Scholar] [CrossRef] [PubMed]
- Gao, B.; Chen, M.X.; Li, X.S.; Liang, Y.Q.; Zhang, D.Y.; Wood, A.J.; Oliver, M.J.; Zhang, J.H. Ancestral gene duplications in mosses characterized by integrated phylogenomic analyses. J. Syst. Evol. 2022, 60, 144–159. [Google Scholar] [CrossRef]
- Banks, J.A.; Nishiyama, T.; Hasebe, M.; Bowman, J.L.; Gribskov, M.; DePamphilis, C.; Albert, V.A.; Aono, N.; Aoyama, T.; Ambrose, B.A.; et al. The Selaginella genome identifies genetic changes associated with the evolution of vascular plants. Science 2011, 332, 960–963. [Google Scholar] [CrossRef]
- Dohm, J.C.; Minoche, A.E.; Holtgräwe, D.; Capella-Gutiérrez, S.; Zakrzewski, F.; Tafer, H.; Rupp, O.; Sörensen, T.R.; Stracke, R.; Reinhardt, R.; et al. The genome of the recently domesticated crop plant sugar beet (Beta vulgaris). Nature 2014, 505, 546–549. [Google Scholar] [CrossRef]
- Ain, Q.T.; Siddique, K.; Bawazeer, S.; Ali, I.; Mazhar, M.; Rasool, R.; Mubeen, B.; Ullah, F.; Unar, A.; Jafar, T.H. Adaptive mechanisms in quinoa for coping in stressful environments: An update. PeerJ 2023, 11, e14832. [Google Scholar] [CrossRef]
- Zhou, C.; Zhu, L.; Ma, Z.; Wang, J. A homolog of Class IV HD-Zip transcription factors, EsHdzip1, confers drought resistance in tobacco via enhanced the capacity of water conserving and absorbing. Acta Physiol. Plant. 2015, 37, 124. [Google Scholar] [CrossRef]
- Kock, K.H.; Kimes, P.K.; Gisselbrecht, S.S.; Inukai, S.; Phanor, S.K.; Anderson, J.T.; Ramakrishnan, G.; Lipper, C.H.; Song, D.; Kurland, J.V.; et al. DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues. Nat. Commun. 2024, 15, 3110. [Google Scholar] [CrossRef]
- Romanowski, M.J.; Soccio, R.E.; Breslow, J.L.; Burley, S.K. Crystal structure of the Mus musculus cholesterol-regulated START protein 4 (StarD4) containing a StAR-related lipid transfer domain. Proc. Natl. Acad. Sci. USA 2002, 99, 6949–6954. [Google Scholar] [CrossRef]
- Husbands, A.Y.; Feller, A.; Aggarwal, V.; Dresden, C.E.; Holub, A.S.; Ha, T.; Timmermans, M.C. The START domain potentiates HD-ZIPIII transcriptional activity. Plant Cell 2023, 35, 2332–2348. [Google Scholar] [CrossRef]
- Mukherjee, T.; Subedi, B.; Khosla, A.; Begler, E.M.; Stephens, P.M.; Warner, A.L.; Lerma-Reyes, R.; Thompson, K.A.; Gunewardena, S.; Schrick, K. The START domain mediates Arabidopsis GLABRA2 dimerization and turnover independently of homeodomain DNA binding. Plant Physiol. 2022, 190, 2315–2334. [Google Scholar] [CrossRef]
- Mondragón-Palomino, M.; Meyers, B.C.; Michelmore, R.W.; Gaut, B.S. Patterns of positive selection in the complete NBS-LRR gene family of Arabidopsis thaliana. Genome Res. 2002, 12, 1305–1315. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Zhou, L.; Li, D.; Dai, L.; Lawton-Rauh, A.; Srimani, P.K.; Duan, Y.; Luo, F. Genome-wide comparative analysis reveals similar types of NBS genes in hybrid Citrus sinensis genome and original Citrus clementine genome and provides new insights into non-TIR NBS genes. PLoS ONE 2015, 10, e0121893. [Google Scholar] [CrossRef] [PubMed]
- Wu, N.; Zhu, Y.; Song, W.; Li, Y.; Yan, Y.; Hu, Y. Unusual tandem expansion and positive selection in subgroups of the plant GRAS transcription factor superfamily. BMC Plant Biol. 2014, 14, 373. [Google Scholar] [CrossRef]
- Liu, G.; Zhang, H.; Sun, G.; Zhao, C.; Shang, S.; Gao, X.; Xia, T.; Yang, X. Characterization of the peripheral blood transcriptome and adaptive evolution of the MHC I and TLR gene families in the wolf (Canis lupus). BMC Genom. 2017, 18, 584. [Google Scholar] [CrossRef][Green Version]
- Okuyama, H.; Tokunaga, A.; Hayashi, S.; Nakasako, K.; Tsukamoto, D.; Matsuo, T.; Tamura, K.; Ito, M. The Tolerated functional decline Model: Relaxed constraint in the molecular evolution of mammalian master Sex-Determining gene SRY. Gene 2025, 973, 149844. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).