Next Article in Journal
Substituted Triazole-3,5-Diamine Compounds as Novel Human Topoisomerase III Beta Inhibitors
Previous Article in Journal
Enhancement of 3-MA in Paclitaxel Treatment of MDA-MB-231 Tumor-Bearing Nude Mice and Its Mechanisms
Previous Article in Special Issue
Genome-Wide Identification of GmARF9b/GmARF2a Negatively Regulate Root Growth in Soybean
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Analysis of the DNA-Binding with One Finger Gene Family Reveals Soybean Expression Pattern and Functional Analysis

1
Guangdong Provincial Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510006, China
2
College of Agronomy and Biotechnology, Southwest University, Chongqing 400716, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2025, 26(13), 6192; https://doi.org/10.3390/ijms26136192
Submission received: 12 May 2025 / Revised: 18 June 2025 / Accepted: 25 June 2025 / Published: 27 June 2025
(This article belongs to the Special Issue Molecular Biology of Soybean)

Abstract

The Dof (DNA-binding with one finger) domain protein family is a plant-specific zinc finger transcription factor family that plays a role in various biological processes in plants. However, research on Dof transcription factors in soybean (Glycine max) remains limited. In this study, we identified 79 putative soybean Dof genes, which are distributed across the entire genome. A comparative phylogenetic analysis of the Dof gene family in soybean, Arabidopsis, rice, maize, and Medicago revealed five major groups. The synteny relationship analysis showed a large number of gene duplication events in soybean. Twelve cis-acting elements were detected in the promoter region of the Dof gene, including five hormone response elements and several environmental response elements. Expression pattern analysis indicated that most Gmdof genes exhibited specific expression patterns. Nine genes in group V, which exhibited higher expression in the root, were identified as significantly responsive to salt stress through qRT-PCR. The possible biological functions of several Gmdof genes were discussed, including Gmdof11.2, Gmdof2.1, and Gmdof16.2. In summary, this study integrated phylogenetic analysis with genome-wide expression profiling to provide valuable information for understanding the functional characteristics of Dof genes in soybean.

1. Introduction

Transcription factors, also known as sequence-specific DNA-binding factors, are key regulators of gene expression, which are capable of binding to specific cis-acting elements in the promoter regions of genes and activating or inhibiting the transcription of downstream target genes [1]. Plant TFDB V5.0 (https://planttfdb.gao-lab.org/index.php, accessed on 16 June 2025) reports that 6150 transcription factors are identified and classified into 57 families in soybean. Dof (DNA-binding with one finger) proteins belong to a plant-specific transcription factor (TF) gene family and are characterized by a Cys2/Cys2 (C2/C2)-type zinc finger domain at the N-terminal, and they bind specifically to cis-regulatory elements with a (T/A)AAAG core sequence [2,3,4,5]. With the whole-genome analysis of many species, an increasing number of Dof members have been discovered in different species, including 36 Dof genes in Arabidopsis thaliana [6], 30 in Oryza sativa [7], 18 in Zea mays [8], and 40 in Medicago sativa [9].
Dof transcription factors play a crucial regulatory role in various physiological and biochemical processes and exhibit functional diversity. In particular, Dof proteins participate as transcriptional regulators in seed development [10,11] and vascular development [12,13]. In addition, Dof transcription factors also contribute to the responses to various abiotic stresses in plants. Tomato SlDof22 [14], wheat Dof protein TaZNF [15], Arabidopsis AtDof5.8 [16], watermelon ClDof29 [17], alfalfa MsDof10, MsDof35, and MsDof39 [9] played an important role in the salt signaling pathway. As a Dof transcription factor, CDF has been reported to be widely involved in the responses to various abiotic stresses, such as Arabidopsis cdf3-1 [18], tomato SlCDF1 or SlCDF3 [19], Brassica BnCDF1 [20], cotton GhDof1 [21], and rice OsDof1 [22]. Recent research reported that Dof4.6 and XND1 jointly regulate root hydraulics and drought responses in Arabidopsis [23].
Although a considerable number of Dof transcription factors (TFs) have been reported in the model plant Arabidopsis thaliana and other species, the functions of most Dof family members remain unclear. This is particularly true for soybean (Glycine max), where functional characterization of Dof transcription factors is extremely limited. GmDof4 and GmDof11 had been reported to be involved in the lipid metabolism pathway of soybean. These DOF proteins can directly bind to cis-regulatory elements in the promoters of target genes, activating the expression of downstream genes involved in fatty acid biosynthesis, thereby regulating lipid metabolism at the transcriptional level [24]. Recently, GmDof41, as a Dof gene in soybean, has been identified as responsive to drought, salt stress, and exogenous ABA treatment. The transcription factor DOF41 directly binds to the promoter region of GmDREB2A, a gene encoding a DREB2-type protein implicated in regulating plant tolerance to abiotic stress [25]. Another study reported that Dof bound to the PIF promoter and regulated its expression. The regulatory network was speculated to improve the structure of soybean plants under shaded environments [26].
Since the soybean Dof gene plays a crucial role in many important developmental processes and responses to various environmental stresses, in-depth research on the soybean Dof family is of great significance. This study conducted a detailed analysis of the soybean Dof gene family, focusing on sequence phylogeny, duplication status, and expression profiles. The expression profiles of these genes with high expression in root were further explored for salt stress through RNA-seq data, providing insights into the potential specific functions of the predicted GmDof genes. These findings lay the foundation for further studies on the evolution and functional characteristics of the Dof gene family in soybean.

2. Results

2.1. Identification and Chromosomal Location of Dof Gene Family in Soybean

To identify the Dof gene family in the soybean genome, a BLAST search in TBtools-II v2.210 was conducted using the conserved Dof domain amino acid sequence against the Glycine max v1.2 genome (http://www.phytozome.net, accessed on 4 July 2024). A total of 79 non-redundant Dof transcription factor-encoding genes were identified across the entire genome (Supplementary Table S1). The predicted GmDof proteins contain the conserved Dof domain, confirming their membership in the Dof transcription factor family. These 79 soybean Dof genes were numbered from Gmdof1.1 to GmdofU (the latter not being anchored to a chromosome in the annotation) according to the naming convention established for Arabidopsis, and classified based on their locations on different chromosomes. The peptide chain lengths of the identified Gmdof genes ranged from 147 to 555 amino acids, with an average length of 335 amino acids (Supplementary Table S1). The molecular weights of the proteins ranged from 9.16 (Gmdof8.1) to 54.67 (Gmdof9.2) KDa. For the subcellular localization, only 6 Gmdofs were located outside the nucleus (Supplementary Table S1).
Genomic chromosomal localization analysis revealed that the Gmdof genes are non-randomly distributed across 19 chromosomes (Figure 1). Excluding the unassembled GmdofU gene, nearly all Gmdof genes are located on the chromosome arms, with no genes found in the heterochromatic regions surrounding centromeric repeat sequences. Among these chromosomes, chromosome 13 contains the most Gmdof genes and chromosome 14 contains no Gmdof genes. Several chromosomes display significant clustering of Gmdof genes, particularly those with higher gene density. For example, Gmdof7.4 and Gmdof7.5 are located within an 8.8 kb segment on chromosome 7, while Gmdof15.5 and Gmdof15.6 are found within a 19 kb segment on chromosome 15. Similarly, four genes on chromosome 13 (Gmdof13.2 and Gmdof13.3, as well as Gmdof13.6 and Gmdof13.7) are clustered into two groups, one spanning 10 kb and the other spanning 13 kb (Figure 1).

2.2. Phylogenetic Relationships and Gene Structure of Soybean Dof Genes

To elucidate the functional differences among the various Dof genes, structural characteristics and conserved motifs were analyzed. A phylogenetic tree was constructed based on the full-length amino acid sequences of all Gmdof proteins, using a rooted tree approach (Figure 2A). The sequence similarity and topology of the phylogenetic tree categorized the soybean Dof gene family into nine subgroups (subgroups I–IX). Each subgroup contained 4–19 members, and the high bootstrap values within each subgroup indicated that these Dof genes share a common origin. Additionally, conserved motifs were identified using the MEME program (Figure 2B). The results revealed that these genes exhibit varying numbers, classifications, and positions of motifs among different genes. The number of conserved motifs in the Dof genes of soybean and wild soybean ranged from 1 to 10, with motif 1 being present in all Dof genes. Most of the Dof genes in subgroup IX contained 8–10 motifs. Furthermore, Dof genes within the same evolutionary branch displayed similar conserved motif patterns (Figure 2C). In the domain analysis, all Dof genes contained the zf-Dof domain, except Gmdof8.1, which had the zf-Dof superfamily domain. In addition, Gmdof6.4 and Gmdof17.3 had the CBF superfamily and PHA03247 superfamily domain, respectively.
It is well known that gene structure diversity may represent an evolutionary trend within gene families. To further explore the structural diversity of Dof genes, we compared the exon/intron structures of the Dof genes in soybean (Figure 2D). According to the predicted structures, 29 Gmdof genes lack introns, while 48 genes contain a single intron, typically located upstream of the Dof domain. Within the same subgroup, the most closely related members typically exhibit the same exon/intron pattern, and the position and length of introns are nearly completely conserved across most subgroups.

2.3. Phylogenetic Analysis of the Dof Gene Family in Soybean, Arabidopsis, Rice, Maize, and Medicago

To investigate the molecular evolution and phylogenetic relationships of Dof domain proteins in soybean, Arabidopsis, rice, maize, and alfalfa, a multiple sequence alignment was conducted for the 79 predicted Gmdof proteins, 33 Arabidopsis Dof proteins, 30 rice Dof proteins, 49 maize Dof proteins, and 17 alfalfa Dof proteins. Based on the alignment results of all Dof amino acid sequences, a phylogenetic tree was constructed using the Maximum Likelihood (ML) method (Figure 3). The ML tree revealed that all Dof family proteins from the five plants were classified into five major clades, consistent with previous studies [8,13]. Four of these clades are the largest branches, comprising 75 members, which account for 36.1% of the total Dof genes. The remaining clades consist of 48 members (clade 1), 58 members (clade 2), 21 members (clade 3), and 6 members (clade 5). Overall, the majority of subfamilies exhibit interwoven distribution patterns, suggesting that the expansion of the Dof gene family occurred prior to the divergence of soybean, Arabidopsis, rice, maize, and Medicago.

2.4. Synteny Analysis of Gmdof Genes

To assess the synteny of the Gmdof genes, we analyzed the 79 Gmdof genes in soybean to evaluate the syntenic relationships among the members of the soybean Dof family (Figure 4). Among the 79 Gmdof genes in soybean, a total of 284 gene duplication segments were identified. The 13th chromosome harbors the highest number of gene duplication segments, totaling 36, followed by the 4th, 7th, and 15th chromosomes, with 22 duplication segments. No gene duplication segments were found on the 14th chromosome, and no gene pairs were detected within the same chromosome. Overall, these identified Gmdof gene pairs provide strong evidence for the whole-genome duplication events in soybean.

2.5. Analysis of Cis-Acting Elements in the Promoter Region of Dof Genes

To better understand the function of Dof genes and the precise regulation of stress-responsive gene expression, the cis-acting elements in the promoter region of Dof genes were identified and analyzed (Figure 5). Twelve cis-acting elements were detected in the promoter region of the Dof gene, including five hormone response elements (gerber, jasmonic acid, salicylic acid, abscisic acid, auxin), as well as environmental response elements (such as light, low temperature, anaerobic, defense and stress), and development-related expression or regulatory elements (meristem, endosperm, and zein metabolism). A total of 867 elements were identified in promoter region of Dof gene families, and the most responsive elements were abscisic acid (213), followed by jasmonic acid (152), and anaerobic induction (122). There was no obvious distribution pattern of cis-acting elements of genes in each group. These abundant cis-acting elements also indicate that the Dof gene family is involved in a variety of plant regulatory pathways and has important significance for crop growth and development.

2.6. Expression Pattern of Dof Genes in Various Tissues

To further understand the biological functions of the Gmdof genes in soybean, we analyzed the expression data of Gmdof genes in various tissues, including flowers, leaves, roots, seeds, pods, nodules, root hairs, apical meristems, and stems (Figure 6). The expression profiles of genes within each group exhibited similar patterns across different tissues. For instance, the 18 Gmdof genes in group I showed relatively high expression in the apical meristem and stem, with comparatively lower expression in other tissues. In group II, all 18 Gmdof genes exhibited high expression in the stem, similar to group I, while the expression in other tissues was relatively low or nearly undetectable. In group V, nine Gmdof genes had higher expression in root. Notably, the Gmdof genes in group VI, which were highly similar to the Arabidopsis Dof genes AtDof5.5 (CDF1), AtDof5.2 (CDF2), and AtDof3.3 (CDF3), showed high expression in the leaves, which suggested that these genes may be involved in regulating the photoperiodic flowering pathway in soybean, similar to their counterparts in Arabidopsis. In summary, most Dof family genes exhibit relatively high expression levels in specific tissues, while maintaining low expression in other tissues, showing distinct tissue-specific expression characteristics.

2.7. Expression Pattern of Some Dof Genes in Response to Salt

Based on the expression patterns of the Gmdof family in various tissues, nine Gmdof (Gmdof4.5, Gmdof5.4, Gmdof10.1, Gmdof13.1, Gmdof13.5, Gmdof15.7, Gmdof18.3, Gmdof19.3, Gmdof19.4) in group V were identified with higher expression in root. To investigate their regulatory roles, soybean at V1 stage were treated with 200 μM NaCl to identify the expression patterns of Gmdof genes response to salt. The result showed that nine genes all responded to salt stress significantly (Figure 7). After salt stress, the expression level of Gmdof15.7 significantly decreased and negatively regulated salt tolerance in soybeans, while the expression levels of the other eight genes significantly increased, indicating that these genes positively regulate salt tolerance in soybeans. All these nine genes provide genetic resources for new varieties of salt-tolerant soybeans in molecular breeding.

3. Discussion

With the development of integrated genomics and the ability to acquire large-scale soybean genome and multi-omics datasets, there are now opportunities to effectively predict phenotypes and explore gene functions [27]. The Dof (DNA-binding with one finger) transcription factor, a classic protein within the Cys2-His2 zinc finger (ZF) superfamily [28], plays a crucial role in several vital processes in plants. However, research on this gene in soybean, and even in leguminous plants in general, remains limited. This study utilized publicly available soybean genome data to conduct a comprehensive genome-wide identification and analysis of the soybean Dof gene family.
In this study, through extensive and comprehensive analysis, numerous members of the Dof gene family were identified within the soybean genome. Genome-wide analysis revealed the presence of 79 full-length Dof genes. Furthermore, phylogenetic analysis of soybean Dof proteins with those from Arabidopsis, rice, maize, and alfalfa revealed five major homologous groups. As most Arabidopsis Dof genes with similar functions tend to cluster within the same subgroups, it is likely that the GmDof genes within the same subgroups also share similar functions. AtDof2.4 and AtDof5.8 might function in the early but different processes for vascular development [29]. In this study, phylogenetic analysis revealed that Gmdof11.2 showed significant similarity with Atdof2.4, while Gmdof2.1 and Gmdof16.2 showed significant similarity with AtDof5.8. Importantly, these three Gmdofs genes had the highest expression in the stems, indicating that they may also be involved in the vascular development of soybeans. Meanwhile, AtDOF5.8 protein was the upstream regulator of AtNAC069, which integrated auxin and salt signals to regulate Arabidopsis thaliana seed germination [16]. Gmdof2.1 and Gmdof16.2 was responsive to abiotic stress. In group 4, some GmDof genes exhibited significant similarity to Arabidopsis Dof genes AtDof5.5 (CDF1), AtDof5.2 (CDF2), AtDof3.3 (CDF3), and AtDof1.7 (CDF5), which participated in the Arabidopsis photoperiodic flowering pathway by inhibiting the transcription of the CONSTANS gene [30]. Interestingly, among these Gmdof genes, Gmdof5.2, Gmdof17.3 and Gmdof8.1 had higher expression in flower, indicating their functions in the photoperiodic flowering pathway of soybean. In addition, four soybean Dof genes (Gmdof4.1, Gmdof6.5, Gmdof17.1, Gmdof5.1) clustered with the Arabidopsis Dof gene AtDof5.4, the overexpression of which resulted in dwarfing of Arabidopsis plants by reducing cell size and number [31]. Importantly, Gmdof6.5, Gmdof17.1 and Gmdof5.1 had high expression in stem, indicating that they may also be involved in the plant height of soybeans.
Soybean has undergone two whole-genome duplications during its evolutionary history, which has allowed the plant to acquire new genes and gain genetic diversity [32,33]. Among the 79 Gmdof genes in soybean, collinearity analysis showed that a total of 284 gene duplication segments were identified, indicating Gmdof genes provided strong evidence for the whole-genome duplication events in soybean. One relevant study that explores evolutionary dynamics of duplicated genes is the analysis of the NOX gene family in soybean [34]. This study identified 17 NOX genes, with 8 duplicated gene pairs formed by a whole-genome duplication event approximately 13 million years ago [34]. Similarly, the study of the APETALA2/Ethylene-Responsive transcription factors (AP2/ERFs) in sorghum reveals insights into gene family evolution and redundancy [35]. This redundancy is thought to provide a buffer against environmental stresses, allowing for flexible responses to changing conditions [34,35].
Furthermore, the study of organ and cell type-specific complementary expression patterns in Arabidopsis thaliana provides additional insights into the regulatory neofunctionalization of duplicated genes. The research found that a significant proportion of whole-genome and tandem duplicate pairs show reciprocal expression patterns, where only one copy is expressed in certain organ types, while the other is expressed in different tissues [36]. This pattern is indicative of regulatory subfunctionalization or neofunctionalization, allowing for the retention and divergence of duplicated genes [36]. In this study, most of the Gmdof proteins were located in the nucleus, with only six Gmdofs localized outside the nucleus. Importantly, Gmdof8.1 and Gmdof5.4 were homologous, but located in the cytoplasm and the nucleus, respectively. This pattern is similar to that observed for Gmdof11.2 (plasma membrane), Gmdof12.1 (nucleus), Gmdof6.1 (chloroplast), Gmdof4.5 (nucleus), Gmdof19.3 (nucleus), and Gmdof19.4 (chloroplast). These homologous Gmdof genes located in different organs may have gene subfunctionalization. These studies collectively illustrate the complexity of gene duplication and expression evolution in plants, emphasizing the roles of subfunctionalization and neofunctionalization in shaping the functional landscape of duplicated genes.
In this study, the expression profiles of genes within each group exhibited similar patterns across different tissues. Most Dof family genes exhibit relatively high expression levels in specific tissues, while maintaining low expression in other tissues, showing distinct tissue-specific expression characteristics, which is a common feature observed in various gene families across different species. Nine Gmdof genes with high expression in root were identified to response to salt stress. Interestingly, these genes contained at least one ABA response element by the promoter cis-acting elements analysis of Gmdof gene family. The ABA signaling pathway generally encompasses the initial stress signal perception, cellular signal transduction, and regulation of genes expressions related to ABA biosynthesis and catabolism, which in turn regulate the accumulation of ABA and trigger salt stress responses [37]. These findings indicated these nine Gmdof genes could be involved in salt stress by ABA signaling pathway.

4. Materials and Methods

4.1. Database Search and Sequence Retrieval

We obtained the protein and genome sequence files for soybean, Arabidopsis thaliana, Medicago truncatula, Oryza sativa, and maize from the Plant Genome Database (Phytozome, https://phytozome-next.jgi.doe.gov/, accessed on 4 July 2023) [38]. The Dof genes in soybean reported in previous studies [25] were extracted using TBtools-II v2.210 (https://github.com/CJ-Chen/TBtools/releases, accessed on 6 July 2023) [39].
To identify the Dof genes in soybean, two approaches were employed. First, we retrieved the conserved domain zinc finger, Dof-type (PF02071), from the PFAM database (http://pfam.xfam.org, accessed on 10 July 2023) for the model plant Arabidopsis thaliana and used it as a query sequence. We then performed a search for candidate BPS protein sequences across four species using Hmmer 3.0 (http://hmmer.org, accessed on 10 July 2023) with default parameters. Second, previously reported Dof proteins in soybean were used as query sequences, and a BLASTP search was conducted within the soybean protein sequence file using BLAST 2.5.0 for proteins (default parameters, e-value < 1 × 10−5). The resulting alignments were merged and redundant sequences were removed. To confirm the presence of the zinc finger domain in each sequence, further verification was carried out using SMART (http://smart.embl-heidelberg.de/, accessed on 10 July 2023). Subcellular localization of the Dof genes was predicted using the BUSCA server (http://busca.biocomp.unibo.it/, accessed on 15 July 2023) [40]. Additionally, the physicochemical properties of the Dof proteins (e.g., amino acid count, molecular weight, and isoelectric point) were analyzed using the ProtParam tool (http://web.expasy.org/compute_pi/, accessed on 15 July 2023) [41].

4.2. Chromosome Mapping, Synteny and Phylogenetic Analysis

The chromosomal mapping of all identified Dof genes in soybean were visualized using TBtools-II v2.210 software. TBtools was also utilized for synteny analysis and Ka/Ks analysis. Full-length amino acid sequences from soybean, Arabidopsis thaliana, Medicago truncatula, Oryza sativa, and maize were aligned using MUSCLE (default parameters) in MEGA7 [42]. Subsequently, a phylogenetic tree was constructed using the ML model in MEGA7, and further modifications and optimization were performed using iTOL (https://itol.embl.de, accessed on 10 August 2023).

4.3. Gene Structure and Conversed Motif Analysis

The gene structure of Dof genes was analyzed using the Gene Structure Display Server (GSDS, http://gsds.cbi.pku.edu.cn/, accessed on 10 August 2023) [43]. Conserved motifs were identified using the MEME tool (http://meme-suite.org/tools/meme, accessed on 10 August 2023) [44]. The results of gene structure and motif analyses were then integrated and visualized using TBtools.

4.4. Collinearity and Cis-Regulatory Element Analysis

First, the Fasta Stats tool in TBtools-II v2.210 was used to analyze the soybean genome sequence to calculate the length of each chromosome. Next, the Gene Density Profile tool in TBtools-II was used to process the gene annotation file and obtain gene density data. Subsequently, the Blast Compare Two Seqs tool in TBtools-II v2.210 was employed to align the soybean genome sequence with itself. Gene position information was extracted using the TBtools-II v2.210. Finally, the results were visualized and decorated using the Advanced Circos tool in TBtools-II v2.210. For cis-regulatory element analysis, 2000 bp sequences upstream from the initiation codon of the putative Gmdofs were retrieved. These sequences were subjected to search in the PlantCARE database (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/, accessed on 17 March 2025) [45] to identify cis-regulatory elements.

4.5. Expression Pattern of Gmdofs

The expression values of Gmdofs (FPKMs) were retrieved from the Phytozome website (https://phytozome-next.jgi.doe.gov/, accessed on 24 August 2023). We analyzed the differential expression of Gmdofs across different tissues, including flowers, leaves, roots, seed, pod, nodules, root hair, sam, and stems. The expression values of Gmdofs were normalized to log2 (FPKM + 1), followed by a 0–1 standardization, and HeatMap in TBtools was used for visual drawing.
Williams 82 soybean seedlings at V1 stage were treated with 200 µM NaCl in a short-day incubator (8 h light/16 h dark, 25 °C). A blank control consisted of deionized water. After treatment, the seedling’s root were collected at 9 h after treatment, and they were promptly frozen in liquid nitrogen. For every sample, 3 biological replicates were collected. Total RNA was extracted by using the Plant RNA Extraction Kit (TaKaRa, Tokyo, Japan). The total amount of RNA (approximately 2 µg) was used to produce cDNA by using the PrimeScriptTM II 1st Strand cDNA Synthesis Kit (TakaRa, Japan). The 2× Taq Pro Universal SYBR qPCR Master Mix (Vazyme, Nanjing, China) was utilized for qRT-PCR on the Roche LightCycler® 480 II. The PCR amplification system comprised a total volume of 20 µL, including 2 µL template DNA, 10 µL 2× mix, and 0.8 µL of both forward and reverse primer (1 µM). The remaining volume was adjusted to 20 µL with ddH2O. The PCR amplification conditions were as follows: initial denaturing at 95 °C for 1 min, followed by 45 cycles for 95 °C for 5 s, 60 °C for 20 s, and then melting the curve starting at 95 °C for 15 s, followed by 60 °C for 1 min, and finally at 95 °C for 15 s. The 2−ΔΔCt method was used to calculate the relative expression levels. Each treatment was replicated three biological times and three technical times. Statistical significance was determined using Student’s t-tests in GraphPad Prism 8, with the aim to identify significant differences. The qRT-PCR primers were designed by Primer3 v4.1.0 (Supplementary Table S2).

5. Conclusions

In this study, through extensive and comprehensive analysis, genome-wide analysis revealed the presence of 79 full-length Dof genes, non-randomly distributed across 19 chromosomes. Phylogenetic analysis of soybean Dof proteins with those from Arabidopsis, rice, maize, and alfalfa revealed five major homologous groups. Soybean intraspecific collinearity analysis also provided strong evidence for soybean whole-genome duplication. Global expression profile analysis provided insights into the functional differentiation of soybean Dof gene family members, with most Gmdof genes exhibiting specific spatiotemporal expression patterns, and nine Gmdof genes with higher expression in root were identified response to salt stress significantly by qRT-PCR. Finally, the possible biological function of Gmdof11.2, Gmdof2.1, GmDof16.2, and so on were explored based on phylogenetic tree and expression pattern analysis. These analysis contribute to the subsequent study of the functional properties of the Dof gene, which is beneficial for the various vital processes of soybean.

Supplementary Materials

The supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms26136192/s1.

Author Contributions

Conceptualization and methodology, X.F. and H.L. (Huan Liu); software and validation, C.G., Z.D., C.L. and H.L. (Haiyang Li); data curation, and writing—original draft preparation, C.G., G.Z., H.L. (Huan Liu) and X.F.; project administration and funding acquisition, X.F. and H.L. (Huan Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Guangzhou postdoctoral research project (624021-129).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials. Further inquiries can be directed to the corresponding author(s).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wray, G.A.; Hahn, M.W.; Abouheif, E.; Balhoff, J.P.; Pizer, M.; Rockman, M.V.; Romano, L.A. The evolution of transcriptional regulation in eukaryotes. Mol. Biol. Evol. 2003, 20, 1377–1419. [Google Scholar] [CrossRef] [PubMed]
  2. Waschburger, E.L.; Filgueiras, J.P.C.; Turchetto-Zolet, A.C. DOF gene family expansion and diversification. Genet. Mol. Biol. 2024, 46 (Suppl. 1), e20230109. [Google Scholar] [CrossRef] [PubMed]
  3. Plesch, G.; Ehrhardt, T.; Mueller-Roeber, B. Involvement of TAAAG elements suggests a role for Dof transcription factors in guard cell-specific gene expression. Plant J. 2001, 28, 455–464. [Google Scholar] [CrossRef]
  4. Yanagisawa, S. The Dof family of plant transcription factors. Trends Plant Sci. 2002, 7, 555–560. [Google Scholar] [CrossRef]
  5. Yanagisawa, S. Dof domain proteins: Plant-specific transcription factors associated with diverse phenomena unique to plants. Plant Cell Physiol. 2004, 45, 386–391. [Google Scholar] [CrossRef]
  6. Lijavetzky, D.; Carbonero, P.; Vicente-Carbajosa, J. Genome-wide comparative phylogenetic analysis of the rice and Arabidopsis Dof gene families. BMC Evol. Biol. 2003, 3, 17. [Google Scholar] [CrossRef]
  7. Khan, I.; Khan, S.; Zhang, Y.; Zhou, J. Genome-wide analysis and functional characterization of the Dof transcription factor family in rice (Oryza sativa L.). Planta 2021, 253, 101. [Google Scholar] [CrossRef]
  8. Jiang, Y.; Zeng, B.; Zhao, H.; Zhang, M.; Xie, S.; Lai, J. Genome-wide transcription factor gene prediction and their expressional tissue-specificities in maize. J. Integr. Plant Biol. 2012, 54, 616–630. [Google Scholar] [CrossRef]
  9. Cao, B.; Cui, Y.; Lou, K.; Luo, D.; Liu, Z.; Zhou, Q. Genome-wide identification and expression analysis of the Dof gene family in Medicago sativa L. Under Various Abiotic Stresses. DNA Cell Biol. 2020, 39, 1976–1989. [Google Scholar] [CrossRef]
  10. Bewley, J.D. Seed germination and dormancy. Plant Cell 1997, 9, 1055–1066. [Google Scholar] [CrossRef]
  11. Rueda-Romero, P.; Barrero-Sicilia, C.; Gómez-Cadenas, A.; Carbonero, P.; Oñate-Sánchez, L. Arabidopsis thaliana DOF6 negatively affects germination in non-after-ripened seeds and interacts with TCP14. J. Exp. Bot. 2012, 63, 1937–1949. [Google Scholar] [CrossRef] [PubMed]
  12. Miyashima, S.; Roszak, P.; Sevilem, I.; Toyokura, K.; Blob, B.; Heo, J.O.; Mellor, N.; Help-Rinta-Rahko, H.; Otero, S.; Smet, W.; et al. Mobile PEAR transcription factors integrate positional cues to prime cambial growth. Nature 2019, 565, 490–494. [Google Scholar] [CrossRef] [PubMed]
  13. Smetana, O.; Mäkilä, R.; Lyu, M.; Amiryousefi, A.; Sánchez Rodríguez, F.; Wu, M.F.; Solé-Gil, A.; Leal Gavarrón, M.; Siligato, R.; Miyashima, S.; et al. High levels of auxin signalling define the stem-cell organizer of the vascular cambium. Nature 2019, 565, 485–489. [Google Scholar] [CrossRef]
  14. Cai, X.; Zhang, C.; Shu, W.; Ye, Z.; Li, H.; Zhang, Y. The transcription factor SlDof22 involved in ascorbate accumulation and salinity stress in tomato. Biochem. Biophys. Res. Commun. 2016, 474, 736–741. [Google Scholar] [CrossRef]
  15. Ma, X.; Liang, W.; Gu, P.; Huang, Z. Salt tolerance function of the novel C2H2-type zinc finger protein TaZNF in wheat. Plant Physiol. Biochem. 2016, 106, 129–140. [Google Scholar] [CrossRef]
  16. He, L.; Su, C.; Wang, Y.; Wei, Z. ATDOF5.8 protein is the upstream regulator of ANAC069 and is responsive to abiotic stress. Biochimie 2015, 110, 17–24. [Google Scholar] [CrossRef]
  17. Zhou, Y.; Cheng, Y.; Wan, C.; Li, J.; Yang, Y.; Chen, J. Genome-wide characterization and expression analysis of the Dof gene family related to abiotic stress in watermelon. PeerJ 2020, 8, e8358. [Google Scholar] [CrossRef]
  18. Corrales, A.R.; Carrillo, L.; Lasierra, P.; Nebauer, S.G.; Dominguez-Figueroa, J.; Renau-Morata, B.; Pollmann, S.; Granell, A.; Molina, R.V.; Vicente-Carbajosa, J.; et al. Multifaceted role of cycling DOF factor 3 (CDF3) in the regulation of flowering time and abiotic stress responses in Arabidopsis. Plant Cell Environ. 2017, 40, 748–764. [Google Scholar] [CrossRef]
  19. Corrales, A.R.; Nebauer, S.G.; Carrillo, L.; Fernández-Nohales, P.; Marqués, J.; Renau-Morata, B.; Granell, A.; Pollmann, S.; Vicente-Carbajosa, J.; Molina, R.V.; et al. Characterization of tomato Cycling Dof Factors reveals conserved and new functions in the control of flowering time and abiotic stress responses. J. Exp. Bot. 2014, 65, 995–1012. [Google Scholar] [CrossRef]
  20. Xu, J.; Dai, H. Brassica napus Cycling Dof Factor1 (BnCDF1) is involved in flowering time and freezing tolerance. Plant Growth Regul. 2016, 80, 315–322. [Google Scholar] [CrossRef]
  21. Su, Y.; Liang, W.; Liu, Z.; Wang, Y.; Zhao, Y.; Ijaz, B.; Hua, J. Overexpression of GhDof1 improved salt and cold tolerance and seed oil content in Gossypium hirsutum. J. Plant Physiol. 2017, 218, 222–234. [Google Scholar] [CrossRef]
  22. Liu, J.; Meng, Q.; Xiang, H.; Shi, F.; Ma, L.; Li, Y.; Liu, C.; Liu, Y.; Su, B. Genome-wide analysis of Dof transcription factors and their response to cold stress in rice (Oryza sativa L.). BMC Genomics 2021, 22, 800. [Google Scholar] [CrossRef]
  23. Ding, B.; Liang, M.; Shi, Y.; Zhang, R.; Wang, J.; Huang, Y.; Yan, D.; Hou, X.; Maurel, C.; Tang, N. The transcription factors DOF4.6 and XND1 jointly regulate root hydraulics and drought responses in Arabidopsis. Plant Cell 2025, 37, koaf083. [Google Scholar] [CrossRef]
  24. Wang, H.W.; Zhang, B.; Hao, Y.J.; Huang, J.; Tian, A.G.; Liao, Y.; Zhang, J.S.; Chen, S.Y. The soybean Dof-type transcription factor genes, GmDof4 and GmDof11, enhance lipid content in the seeds of transgenic Arabidopsis plants. Plant J. 2007, 52, 716–729. [Google Scholar] [CrossRef]
  25. Wei, J.T.; Zhao, S.P.; Zhang, H.Y.; Jin, L.G.; Yu, T.F.; Zheng, L.; Ma, J.; Chen, J.; Zhou, Y.B.; Chen, M.; et al. GmDof41 regulated by the DREB1-type protein improves drought and salt tolerance by regulating the DREB2-type protein in soybean. Int. J. Biol. Macromol. 2025, 296, 139782. [Google Scholar] [CrossRef]
  26. Liang, X.; Zhao, C.; Cui, J.; Liu, Z.; Han, D.; Chen, Q.; Yang, M.; Jiang, Z. Genome-wide identification of GmPIF family and regulatory pathway analysis of GmPIF3g in fifferent temperature environments. Int. J. Mol. Sci. 2025, 26, 551. [Google Scholar] [CrossRef]
  27. Liu, Y.; Zhang, Y.; Liu, X.; Shen, Y.; Tian, D.; Yang, X.; Liu, S.; Ni, L.; Zhang, Z.; Song, S.; et al. SoyOmics: A deeply integrated database on soybean multi-omics. Mol. Plant 2023, 16, 794–797. [Google Scholar] [CrossRef]
  28. Yang, Y.; He, Z.; Bing, Q.; Duan, X.; Chen, S.; Zeng, M.; Liu, X. Two Dof transcription factors promote flavonoid synthesis in kumquat fruit by activating C-glucosyltransferase. Plant Sci. 2022, 318, 111234. [Google Scholar] [CrossRef]
  29. Konishi, M.; Yanagisawa, S. Sequential activation of two Dof transcription factor gene promoters during vascular development in Arabidopsis thaliana. Plant Physiol. Biochem. 2007, 45, 623–629. [Google Scholar] [CrossRef]
  30. Fornara, F.; Panigrahi, K.C.; Gissot, L.; Sauerbrunn, N.; Rühl, M.; Jarillo, J.A.; Coupland, G. Arabidopsis DOF transcription factors act redundantly to reduce CONSTANS expression and are essential for a photoperiodic flowering response. Dev. Cell 2009, 17, 75–86. [Google Scholar] [CrossRef]
  31. Xu, P.P.; Chen, H.Y.; Ying, L.; Cai, W.M. AtDOF5.4/OBP4, a DOF transcription factor gene that negatively regulates cell cycle progression and cell expansion in Arabidopsis thaliana. Sci. Rep. 2016, 6, 27705. [Google Scholar] [CrossRef] [PubMed]
  32. Magadum, S.; Banerjee, U.; Murugan, P.; Gangapur, D.; Ravikesavan, R. Gene duplication as a major force in evolution. J. Genet. 2013, 92, 155–161. [Google Scholar] [CrossRef] [PubMed]
  33. Landis, J.B.; Soltis, D.E.; Li, Z.; Marx, H.E.; Barker, M.S.; Tank, D.C.; Soltis, P.S. Impact of whole-genome duplication events on diversification rates in angiosperms. Am. J. Bot. 2018, 105, 348–363. [Google Scholar] [CrossRef] [PubMed]
  34. Zhang, Z.; Zhao, Y.; Feng, X.; Luo, Z.; Kong, S.; Zhang, C.; Gong, A.; Yuan, H.; Cheng, L.; Wang, X. Genomic, molecular evolution, and expression analysis of NOX genes in soybean (Glycine max). Genomics 2019, 111, 619–628. [Google Scholar] [CrossRef]
  35. Mathur, S.; Priyadarshini, S.S.; Singh, V.; Vashisht, I.; Jung, K.H.; Sharma, R.; Sharma, M.K. Comprehensive phylogenomic analysis of ERF genes in sorghum provides clues to the evolution of gene functions and redundancy among gene family members. 3 Biotech 2020, 10, 139. [Google Scholar] [CrossRef]
  36. Liu, S.L.; Baute, G.J.; Adams, K.L. Organ and cell type-specific complementary expression patterns and regulatory neofunctionalization between duplicated genes in Arabidopsis thaliana. Genome Biol. Evol. 2011, 3, 1419–1436. [Google Scholar] [CrossRef]
  37. Tian, H.; Chen, S.; Yang, W.; Wang, T.; Zheng, K.; Wang, Y.; Cheng, Y.; Zhang, N.; Liu, S.; Li, D.; et al. A Novel Family of Transcription Factors Conserved in Angiosperms is Required for ABA Signalling. Plant Cell Environ. 2017, 40, 2958–2971. [Google Scholar] [CrossRef]
  38. Goodstein, D.M.; Shu, S.; Howson, R.; Neupane, R.; Hayes, R.D.; Fazo, J.; Mitros, T.; Dirks, W.; Hellsten, U.; Putnam, N.; et al. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40, D1178–D1186. [Google Scholar] [CrossRef]
  39. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  40. Savojardo, C.; Martelli, P.L.; Fariselli, P.; Profiti, G.; Casadio, R. BUSCA: An integrative web server to predict subcellular localization of proteins. Nucleic Acids Res. 2018, 46, W459–W466. [Google Scholar] [CrossRef]
  41. Garg, V.K.; Avashthi, H.; Tiwari, A.; Jain, P.A.; Ramkete, P.W.; Kayastha, A.M.; Singh, V.K. MFPPI—Multi FASTA ProtParam Interface. Bioinformation 2016, 12, 74–77. [Google Scholar] [CrossRef] [PubMed]
  42. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed]
  43. Hu, B.; Jin, J.; Guo, A.Y.; Zhang, H.; Luo, J.; Gao, G. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 2016, 31, 1296–1297. [Google Scholar] [CrossRef] [PubMed]
  44. Bailey, T.L.; Johnson, J.; Grant, C.E.; Noble, W.S. The MEME Suite. Nucleic Acids Res. 2015, 43, W39–W49. [Google Scholar] [CrossRef]
  45. Lescot, M.; Déhais, P.; Thijs, G.; Marchal, K.; Moreau, Y.; Van de Peer, Y.; Rouzé, P.; Rombauts, S. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002, 30, 325–327. [Google Scholar] [CrossRef]
Figure 1. Chromosomal location of Dof gene family in soybean.
Figure 1. Chromosomal location of Dof gene family in soybean.
Ijms 26 06192 g001
Figure 2. Evolutionary tree, conserved motif, structural domains, and gene structure of Dof gene family. (A) Evolutionary tree; (B) conserved motif; (C) structural domains; (D) gene structure.
Figure 2. Evolutionary tree, conserved motif, structural domains, and gene structure of Dof gene family. (A) Evolutionary tree; (B) conserved motif; (C) structural domains; (D) gene structure.
Ijms 26 06192 g002
Figure 3. Phylogenetic analysis of the Dof gene family in soybean, Arabidopsis, rice, maize, and Medicago. The various colors on the periphery represent the different groups (or subgroups) of GmDof genes.
Figure 3. Phylogenetic analysis of the Dof gene family in soybean, Arabidopsis, rice, maize, and Medicago. The various colors on the periphery represent the different groups (or subgroups) of GmDof genes.
Ijms 26 06192 g003
Figure 4. Schematic of the interchromosomal relationships of the Dof genes in soybean. The collinear blocks are shown in gray lines. The synteny Dof gene pairs are highlighted in red.
Figure 4. Schematic of the interchromosomal relationships of the Dof genes in soybean. The collinear blocks are shown in gray lines. The synteny Dof gene pairs are highlighted in red.
Ijms 26 06192 g004
Figure 5. Analysis of cis-acting elements of the Dof gene family.
Figure 5. Analysis of cis-acting elements of the Dof gene family.
Ijms 26 06192 g005
Figure 6. Expression patterns of 79 Dof genes in nine tissues of soybean.
Figure 6. Expression patterns of 79 Dof genes in nine tissues of soybean.
Ijms 26 06192 g006
Figure 7. Expression pattern of some Gmdof genes in response to salt. * means p ≤ 0.05, ** means p ≤ 0.01, and *** means p ≤ 0.001. Mork represents blank control with deionized water. Salt represents 200 μM NaCl.
Figure 7. Expression pattern of some Gmdof genes in response to salt. * means p ≤ 0.05, ** means p ≤ 0.01, and *** means p ≤ 0.001. Mork represents blank control with deionized water. Salt represents 200 μM NaCl.
Ijms 26 06192 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gou, C.; Zhang, G.; Deng, Z.; Lin, C.; Li, H.; Liu, H.; Fang, X. Genome-Wide Analysis of the DNA-Binding with One Finger Gene Family Reveals Soybean Expression Pattern and Functional Analysis. Int. J. Mol. Sci. 2025, 26, 6192. https://doi.org/10.3390/ijms26136192

AMA Style

Gou C, Zhang G, Deng Z, Lin C, Li H, Liu H, Fang X. Genome-Wide Analysis of the DNA-Binding with One Finger Gene Family Reveals Soybean Expression Pattern and Functional Analysis. International Journal of Molecular Sciences. 2025; 26(13):6192. https://doi.org/10.3390/ijms26136192

Chicago/Turabian Style

Gou, Chuanjie, Guyue Zhang, Ziyuan Deng, Chenyang Lin, Haiyang Li, Huan Liu, and Xiaomei Fang. 2025. "Genome-Wide Analysis of the DNA-Binding with One Finger Gene Family Reveals Soybean Expression Pattern and Functional Analysis" International Journal of Molecular Sciences 26, no. 13: 6192. https://doi.org/10.3390/ijms26136192

APA Style

Gou, C., Zhang, G., Deng, Z., Lin, C., Li, H., Liu, H., & Fang, X. (2025). Genome-Wide Analysis of the DNA-Binding with One Finger Gene Family Reveals Soybean Expression Pattern and Functional Analysis. International Journal of Molecular Sciences, 26(13), 6192. https://doi.org/10.3390/ijms26136192

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop