Next Article in Journal
Genomic Survey of Flavin Monooxygenases in Wild and Cultivated Rice Provides Insight into Evolution and Functional Diversities
Next Article in Special Issue
Small Cellular Particles from European Spruce Needle Homogenate
Previous Article in Journal
Novel Seed Size: A Novel Seed-Developing Gene in Glycine max
Previous Article in Special Issue
Evolutionary Analysis of Respiratory Burst Oxidase Homolog (RBOH) Genes in Plants and Characterization of ZmRBOHs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Unraveling the Diverse Roles of Neglected Genes Containing Domains of Unknown Function (DUFs): Progress and Perspective

1
National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
2
Department of Botany, Jagannath University, Dhaka 1100, Bangladesh
*
Authors to whom correspondence should be addressed.
Current address: State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China.
Int. J. Mol. Sci. 2023, 24(4), 4187; https://doi.org/10.3390/ijms24044187
Submission received: 29 December 2022 / Revised: 6 February 2023 / Accepted: 8 February 2023 / Published: 20 February 2023
(This article belongs to the Collection Feature Papers in Molecular Plant Sciences)

Abstract

:
Domain of unknown function (DUF) is a general term for many uncharacterized domains with two distinct features: relatively conservative amino acid sequence and unknown function of the domain. In the Pfam 35.0 database, 4795 (24%) gene families belong to the DUF type, yet, their functions remain to be explored. This review summarizes the characteristics of the DUF protein families and their functions in regulating plant growth and development, generating responses to biotic and abiotic stress, and other regulatory roles in plant life. Though very limited information is available about these proteins yet, by taking advantage of emerging omics and bioinformatic tools, functional studies of DUF proteins could be utilized in future molecular studies.

1. Introduction

Domains of unknown function, or DUFs, are categorized in the Pfam database and named using the prefix DUF, followed by a number, such as DUF1 and DUF2. The naming scheme of DUF was first introduced by Chris Ponting through the addition of DUF1 and DUF2 to the SMART database in the 1990s, which were renamed based on their featured peptides as GGDEF (PF00990) and EAL (PF00563) domains, respectively [1]. These domains have two distinct characteristics: relatively conservative amino acid sequence and protein domain with a non-characterized function [2]. While collecting information about DUFs, the same classification method was used for other Pfam families; however, no information on DUF functional characteristics was gathered [2].
With the emergence of the post-genomic era, the presence of many annotated domains and proteins with unknown functions is one of the biggest challenges. Remarkably, in the current Pfam database (Pfam 35.0, https://pfam.xfam.org/, accessed on 9 November 2022), 4795 (24%) gene families belonging to the DUF type were found in the 19,632 entries [3]. Domains of unknown function (DUFs) are widely present in animals, fungi, and plants, with key roles mostly unknown considering the number of DUFs. In bacteria, over 2700 DUFs are found and serve as a rich source of novel protein folds and functions [4]. For example, DUF1127 [5], DUF1537 [6], DUF2324 [7], DUF3233 [8], DUF1023 [9], DUF1521 [10], DUF1792 [11], and DUF 4433 [12] were found to perform a specific function. Similarly, in plants, a large group of taxa important to human development, DUFs are widely present since important roles have only been determined for some. Distinct plant species have different DUF genes that have been investigated. On the one hand, the widespread use of forward genetics techniques has driven the research on DUF genes. For example, employing fine-mapping, genome sequencing, transcriptome analysis, and comprehensive comparative sequence analysis, a recent study determined that the LpSDUF247 gene determines the S-locus male component of the gametophytic SI system in perennial ryegrass [13]. In addition, loss-of-function mutations of DUF1668-containing genes represent the genetic events contributing to hybrid incompatibilities [14]. On the other hand, the era of big data has also brought opportunities for the functional characterization of DUF genes. In recent years, our understanding of DUFs has increased through reverse genetic approaches. In Arabidopsis thaliana, DUF6 [15], DUF26 [16], DUF246 [17], DUF538 [18], DUF579 [19], DUF617 [20], DUF642 [21], DUF647 [22], DUF724 [23], DUF784 [24], DUF1117 [25], DUF1218 [26], DUF4005 [27], and DUF4228 [28] have been functionally characterized. Some of them, such as DUF6, DUF246, and DUF579, are involved in cell wall development; the DUF538 protein is involved in trichome development; and DUF26, DUF1117, and DUF4228 are involved in plant stress responses. In summary, these studies revealed the involvement of DUF-domain-containing proteins in diverse biological processes.
However, the lack of genome-wide analysis of genes containing DUF domains has limited the complete understanding of their evolutionary history and biological functions. In the twenty-first century, with the advancement of science and technology, understanding the annotation and characterization of DUFs should be considered a challenge and an opportunity. To better understand DUFs in plants, we, therefore, review their functions in regulating growth and development, biotic and abiotic stress responses, technical approaches and prospects for DUFs research.

2. Role of DUFs in Plant Growth and Development

Domains of unknown function (DUFs) play a key role in the development of the plant. Proteins containing DUFs constitute 20 percent of all proteins [4,29]. Recently, genome sequencing and proteomics breakthroughs have made it easy to identify and sequence multiple superfamilies containing DUFs [30]. Few studies have been reported in Arabidopsis thaliana, Oryza sativa L., and other plants characterizing a few DUF families, viz., DUF1218 [26], DUF231 [31], DUF579 [32], DUF246 [17], DUF642 [33], DUF266 [34], DUF1618 [35], DUF2775 [36], DUF726 [37], DUF247 [13], DUF1668 [14], DUF538 [38], DUF827 [39], DUF724 [23], DUF1005 [40], DUF630 [41], DUF632 [41], and DUF3444 [42]. DUFs are considered to be involved in various functions related to enhancing plant growth and development (Table 1).

2.1. Regulation of Plant Cell Wall Development

The degradation of cellulose, pectin, hemicellulose, and lignin, all components of plant cell walls, is an important source of bioenergy. Plant cell walls play an essential part in the generation of biofuels and the transformation of sugar into ethanol for use in the industry [46,47,48]. Moreover, the plant cell wall is also involved in multiple functions, such as guarding plants against pathogen attack and helping in hormonal signaling and other physiological processes [49]. The cell wall is composed of proteins and polysaccharides. Polysaccharides such as cellulose, hemicellulose, lignin, and pectin are essential components of plant cell walls. More than 2000 genes have been estimated to be required for polysaccharide biosynthesis, assembly, and structural maintenance [50,51]. A recent study showed many DUF genes participate in polysaccharide synthesis, which affects the development of plant cell walls [34]. For instance, DUF266 proteins have been categorized as ‘not classified glycosyltransferases (GTnc)’ due to amino acid similarity with GTs. It was shown that the overexpression of PdDUF266A in Populus can increase cellulose content, reduce recalcitrance and enhance biomass production [34]. In Arabidopsis, three categories of proteins have been identified to alter the acetylation of cell wall polysaccharides. These proteins are named reduced wall acetylation (AtRWA), altered xyloglucan (AtAXY), and trichome birefringence (AtTBR)/TBRLIKE (AtTBL); among them, AtAXY and AtTBR/AtTBL share the conserved TBL domain and DUF231. Moreover, AXY4 acts as a xyloglucan-specific O-acetyltransferase, and the loss of function of this gene terminates xyloglucan acetylation in Arabidopsis [52,53]. Similarly, other research findings indicated that the AtESK1 gene belonging to DUF231 promotes xylan’s interaction with cellulose fibrils, and when AtESK1 is combined with AtTBL29, it boosts the acetylation of xylan in Arabidopsis [54,55]. In addition, overexpression of a DUF231-containing protein increases O-xylan acetylation and cellulose biosynthesis in Populus [31]. Accordingly, these outcomes demonstrate that DUF231 family proteins are important polysaccharide modifiers and affect numerous cell wall polymers in Arabidopsis. Xylan is the principal hemicellulose in the secondary cell walls of eudicots and the primary and secondary cell walls of grasses and cereals. A reduction in cellulose content has been recorded in many Arabidopsis xylan backbone synthesis mutants. For instance, the DUF579 genes (IRX15 and IRX15-L) are redundantly involved in xylan biosynthesis [32]. Later, another study showed that the loss of function of these two genes could reduce cellulose content in the Arabidopsis mutants [56]. Furthermore, a Populus DUF579 gene, PtrDUF579-3, negatively regulates glucuronoxylan biosynthesis and glucuronoxylan structure [57]; thus, these studies show that the DUF579 genes play diverse roles in cell wall biosynthesis. The biosynthesis and function of another structurally complex class of plant cell wall polysaccharides, pectins, are still poorly understood. To date, few enzymes involved in pectin biosynthesis have been described. For instance, one study found that a highly conserved putative glycosyltransferase-producing gene (containing the DUF246 domain) called Pectic Arabino Galactan synthesis-Related (PAGR) affects the biosynthesis of rhamnogalacturonan-I arabinogalactans and is essential for pollen tube growth [17].
In recent years, our understanding of the molecular mechanisms underlying secondary wall formation has been increased through both forward genetic approaches and genomics followed by reverse genetics. Combining Zinnia elegans in vitro tracheary element genomics with reverse genetics in Arabidopsis, a study found that WAT1 (DUF6) is a tonoplast-localized protein that functions downstream of FRA3 and upstream of NST1/SND1 in the signaling cascade, leading to secondary cell wall formation in fibers [15]. Few genes containing DUF1218 are reported to perform an important role in the vascular system and influence other important aspects of cell biology [58,59,60,61,62]. Moreover, previous findings have studied the expression of two genes, viz., At4g27435 (DUF1218-encoding gene) and the CesA gene, and found that the cellulose content of the mutant remained unchanged compared to the control [62,63]. Additionally, another DUF1218-encoding gene (At1g31720) was highly expressed in the Arabidopsis stem and, at the same time, co-expressed in the cell wall cellulose synthase gene along with lignin- and xylan-linked genes. These outcomes showed the involvement of the DUF1218 gene family in xylogenesis and secondary cell wall biosynthesis [26,63].

2.2. Role in Reproductive Development

Incompatibility during reproduction is a general phenomenon that can occur in interspecific and intraspecific hybridization and is termed hybrid incompatibility (HI). HI usually interferes with gene transfer among different species and results in stunted plant growth, hybrid sterility, and cell death deregulation [64,65,66,67,68,69,70]. Thus, for crop improvement, it is very important to understand the molecular and genetic mechanisms of HI. However, scientists are trying to discover the gene families involved in HI regulation [71]. In rice, a gene causing pollen and embryo sterility in hybrids is found at S1 and S5 [35,72], whereas the DUF1618 gene causing sterility in male rice hybrids was mapped on Chr 12. Moreover, it was reported that loss of function in this domain could induce HI [35,73]. Another important specific tissue protein (STs) family is present in the Asteraceae and Fabaceae and consists of the DUF2775 domain [36]. This domain is considered to be linked with early fruit development, seed germination, and cell elongation [74,75,76,77]. Homomorphic SI prevents self-pollination through physiological recognition of self-pollen by the style and is a widespread mechanism occurring in over half of the angiosperms [78]. A study found that a gene encoding a DUF247 domain protein cosegregates with the S self-incompatibility locus in perennial ryegrass [13]. Loss-of-function mutations of DUF1668-containing genes represent the candidate causal genetic events contributing to hybrid incompatibilities. This indicates that DUF1668-containing genes in multiple lineages are responsible for F1 pollen sterility in rice [14].

2.3. Trichome Development

It is worth noting that the functions of DUF538 proteins in plants are complex and diverse. Several studies have reported the importance of plant DUF538 proteins. The DUF538 proteins are emergent plant growth regulators and usually affect phosphoinositide signaling, trichome development, and endoplasmic-reticulum-associated stress response and may function as hydrolase enzymes in plants [79,80,81,82,83]. This review mainly introduces its role in the development of trichomes. Smaller trichomes with variable branches (SVB) belongs to a terrestrial plant-specific DUF538 domain-containing gene family. It was reported that SVB-like (SVBL) and its closest relative SVB moderate trichome development and plant growth in Arabidopsis thaliana [18]. Although no growth defect has been recorded in the case of any single mutants, dwarfed plant growth has been reported in double mutants (svb and svbl). Meanwhile, the translational reporter assay revealed that both at the subcellular and tissue level, SVBL and SVB share highly similar localization patterns, suggesting that SVBL and SVB target a specific set of trichome development regulators and hence play a key role in trichome development and plant growth. Similarly, the transcriptomic analysis of the gl3-sst sim gene reported the function of the DUF538 protein in trichome development [38]. These studies imply that DUF538 proteins play important roles in plant growth and development.

2.4. Other Essential Growth and Development

It has been reported that some other DUF genes are associated with seedling development, root development, plant cell growth, and chloroplast movement. For example, one study identified and characterized a rolling and erect leaf mutant in rice and named it rel2, encoding an unknown function protein that contains DUF630 and DUF632 domains that control leaf rolling in rice [41]. Expression pattern analysis showed that CiDUF1005 genes in Caragana intermedia were differentially regulated under conditions including cold, heat, dehydration, and drought treatments, as well as under the hormone abscisic acid (ABA). In addition, compared to the wild type, transgenic lines with heterologous CiDUF1005 expression in Arabidopsis thaliana had longer primary roots and a greater number of lateral roots [40]. DUF827 proteins could mediate protein–protein interaction and were involved in the chloroplast photorelocation movement response [39]. The functional conservation between the DUF724 proteins and FMRP suggests that DUF724 proteins might play conserved and novel roles in RNA transportation and be involved in the polar growth of plant cells in Arabidopsis thaliana [23]. A mutation of the OsSAC1 gene (containing the DUF4220 and DUF594 domains) causes sugar accumulation in rice leaves [43].
Equally importantly, the functions of some DUF genes are diverse and complex, as mentioned earlier in DUF538 genes. For instance, it is reported that a gene containing DUF266 is involved in leaf senescence in addition to the previously mentioned role related to cell wall development [44]. The study found the DUF266-containing gene OsPLS3 plays an important role in the onset of leaf senescence through perturbing ethylene production in leaves, thereby affecting the onset of leaf senescence in rice. In addition, the DUF1218-containing gene VCC plays a redundant role in early leaf margin patterning and is necessary for bilateral symmetry. Loss-of-function vcc alleles lead to unexpected changes in the size, shape, and spatial structure of the auxin and CUC2 leaf margin domains and the early loss of leaf bilateral symmetry [45].

3. Role of DUFs in Plant Biotic and Abiotic Stress

Numerous environmental stresses, both biotic and abiotic, are experienced by plants. However, these stresses may have a negative impact on a plant’s development, including its survival, growth, and productivity [84,85]. Few studies have shown that certain DUFs are involved in various functions related to conferring resistance to biotic and abiotic stresses (Table 2).

3.1. Biotic Stress

The domain of unknown function 26 (DUF26; Gnk2 or stress-antifungal domain; PF01657) is an extracellular domain in three plant proteins. The first class is CYSTEINE-RICH RECEPTOR-LIKE SECRETED PROTEINs (CRRSPs) and of which Gnk2(DUF26) from Gingko biloba acts as a mannose-binding lectin in vitro with antifungal activity [101,102]. In addition, another study discovered two maize CRRSPs that had also been shown to bind mannose and participate in the defense against a fungal pathogen [103]. The second class, CYSTEINE-RICH RECEPTOR-LIKE PROTEIN KINASES (CRKs), controls stress responses and development in Arabidopsis and rice. For example, the overexpression of the CRK13 (an Arabidopsis cysteine-rich receptor-like kinase) gene results in enhanced resistance to Pseudomonas syringae [104]. The third category of DUF26-domain-containing proteins is the PLASMODESMATALOCALIZED PROTEINS (PDLPs), which contain two DUF26 domains in their extracellular region and a transmembrane helix but lack a kinase domain. Research studies revealed that PDLPs are linked with plasmodesmata, are involved in symplastic intercellular signaling [105], pathogen response [106], systemic signaling [107], and regulation of callose deposition [108], and act as targets for viral movement proteins [109].
Negative regulation of DUF genes has also been reported to control biotic stress responses. For example, in Arabidopsis, AtDUF569 negatively regulates biotic stress responses as the resistant phenotype of the atduf569 KO mutant may be due to the upregulation of SA-dependent PR genes during the initial phase of pathogenicity, which affects the impact of pathogenic effects, which in turn protects the mutant phenotype from late virulence and disease symptoms [86]. Surprisingly, it was reported that AtDUF569 (At1g69890) positively regulates drought stress in Arabidopsis because the loss-of-function mutant atduf569 showed significant sensitivity to drought stress and significantly lower abscisic acid accumulation compared with WT Col-0 plants [90].
The At5g65040 gene (containing a DUF581 domain), named Increased Resistance to Myzus persicae 1 (IRM1), was used in a study that showed that overexpression of the cloned IRM1 gene developed an identical phenotype to the original mutant. Conversely, an IRM1 knockout mutant promoted aphid population development compared to wild-type ones [87].

3.2. Abiotic Stress

Abiotic stresses, such as drought, flooding, salt stress, heat, cold, high radiation, and heavy metal toxicity, have profound effects on plant growth and survival [110]. Research findings have demonstrated that some DUF-domain-containing proteins play a vital role in plant stress responses. Some of these DUF proteins are not only involved in a single stressful environment. For example, there are a large number of really interesting new gene (RING)-domain-containing E3 ubiquitin ligases in Arabidopsis; among them, At2g39720 (AtRHC2A), At3g46620 (AtRDUF1), and At5g59550 (AtRDUF2) are identified as having DUF1117 in their C-terminal regions [25]. It is suggested that the RDUF genes adapt biotic and abiotic plants to their environment. For example, the E3 ligase AtRDUF1 (DUF1117) positively regulates salt-stress responses in Arabidopsis thaliana [93], and the suppression of AtRDUF1 and AtRDUF2 reduces tolerance to abscisic acid (ABA)-mediated drought stress in Arabidopsis [25]. In addition, researchers discovered the role of GhRDUF4D against Verticillium dahliae infection in cotton, which will help to understand the function of the RDUF genes in plant immunity [88].

3.2.1. Drought and Salt Stress

Some DUF genes are involved in the drought- and salt-stress response. The overexpression of GmCBSDUF3 (containing one domain of unknown function (DUF21)) could enhance tolerance to drought and salt stress in Arabidopsis [89]. The AhDGR2 gene in Amaranthus hypochondriacus encodes a DUF642-domain-containing protein, and overexpression of AhDGR2 in transgenic Arabidopsis plants presents increased sensitivity to NaCl treatment [91]. The OsDSR2 gene, which encodes a DUF966-domain-containing protein, also negatively regulates salt and simulated drought stresses and ABA signaling in rice, which provided some useful data for understanding the functional roles of DUF966 genes in abiotic stress responses in plants [111]. Furthermore, analysis of gene expression profiling data showed that some TaDUF966 genes were induced by salt stress in wheat (Triticum aestivum L.) and further confirmed the role of TaDUF966-9B in salt stress using virus-induced gene silencing (VIGS) assay [92]. OsSIDP366, a gene containing DUF1644, may function as a regulator of the PBs/SGs and positively regulates responses to drought and salt stresses in rice [94]. As an important part of landscaping, turf plays a vital role in protecting, improving, and beautifying urban environments. Therefore, it is imperative to choose high-quality salt-tolerant turfgrass suitable for landscaping in areas with saline soils. A study found that ROPGEF7 (a DUF315 protein-coding gene) and UFSP (a DUF1671 protein-coding gene) might play important roles in the salt-tolerance process in Z. japonica and might have contrasting functions [95].

3.2.2. Signaling Pathway

Several DUFs participate in regulating the signal pathway related to plant stress resistance. For example, protein–protein interaction network analysis indicated that AtDUF506s may potentially interact with iron-deficiency response proteins, salt-inducible transcription factors, or calcium sensors (calmodulins), implying that DUF506 genes have distinct biological functions, including responses to environmental stimuli and nutrient deficiencies, and participation in Ca(2+) signaling [96]. In addition, bimolecular fluorescence complementation and calmodulin (CaM)-binding assays showed that AtRXR3(DUF506) interacted with CaM in the presence of Ca2+. Moreover, cytosolic Ca2+ ([Ca2+]cyt) oscillations in the root hairs of rxr3 mutants exhibited high frequencies and dampened amplitudes compared to wild-type ones. Thus, AtRXR3 is a novel calmodulin-interacting protein that represses root hair elongation in Arabidopsis [97]. Furthermore, AtRXR3 can attenuate P-limitation-induced root hair growth through mechanisms that involve RSL4 and interaction with CaM to modulate tip-focused [Ca2+]cyt oscillations [97]. So far, the regulatory mechanism of the BES1 transcription factor has been identified and clarified in the model plants Arabidopsis and rice. The main biological function of BES1 is reflected in that it is an important regulator downstream of brassinosteroid signaling and plays an important role in plant stress response, growth, and development [98,99,100].

3.2.3. UV-B

There are three types of UV rays, UV-A (315–400 nm), UV-B (280–315 nm), and UV-C (200–280 nm), although only UV-A and a small part of UV-B reach the Earth’s surface [112]. UV-B can cause stress or act as a developmental signal depending on its fluence levels. One study reported the involvement of DUF647 in root UV-B sensing in Arabidopsis early seedling development. RUS1 (encoding a protein that contains DUF647) is an Arabidopsis mutant (root UVB sensitive 1 (rus1)), whose primary root is hypersensitive to very low-fluence-rate (VLF) UV-B. Under standard growth-chamber fluorescent white light, rus1 displays stunted root growth and fails to form postembryonic leaves [22].

4. Research Methods for DUF Proteins

The lack of genome-wide analysis of genes containing DUF domains hinders a comprehensive understanding of their evolutionary history and biological functions. Therefore, identifying the function of DUFs is extremely important for characterizing organisms. In general, there are three ways to represent the function of DUF genes. The first way is to characterize DUF genes at the structural genomics level. In recent years, structural genomics projects have propelled technology development and solved the structures of literally hundreds of proteins within uncharacterized families [2], such as DUF194 [113], DUF442 [114], DUF1110 [115], and DUF1470 [116]. The second involves using bioinformatics to identify the function. With the rapid increase in available genome sequences and structural genomics, there is a growing need for reliable computational methods to extract information about gene family architecture and evolution and thus predict the biochemical function of these proteins. This leads to a series of bioinformatics databases and tools. Here, Table 3 provides information regarding some online databases and bioinformatic tools. The third is through experimental verification. The common method to study an unknown protein’s function is to assess the consequences of a loss-of-function mutation or overexpression of the corresponding gene under multiple conditions. Generally, phenotypes can be combined with bioinformatics methods such as gene family analysis, comparative genome analysis, and evolutionary analysis to provide evidence-based annotation for some proteins. For example, a recent study identified 28 TaDUF966 proteins in wheat, and phylogenetic analysis divided these proteins into two groups. Analysis of gene expression profiling data showed that some TaDUF966 genes were induced by salt stress. Then, virus-induced gene silencing (VIGS) assays were used to confirm the role of TaDUF966-9B in salt stress [92]. Additionally, genome-wide investigation and expression profiling under abiotic stresses of an unknown soybean function (DUF21) and Cystathionine-β-Synthase (CBS)-domain-containing protein family revealed tissue-specific and differential expression profiles of the GmCBSDUFs and qPCR analysis revealed that certain groups of soybean CBSDUFs are likely involved in specific stress responses. Furthermore, the overexpression of GmCBSDUF3 could enhance tolerance to drought and salt stress in Arabidopsis [89].
In conclusion, with the progress of science and technology, the research technology for DUFs will become more convenient and faster, and more and more DUFs will be identified and characterized.

5. Conclusions and Future Prospects

To gain a comprehensive understanding of the complex mechanisms of life activity in organisms on an entirely new level, we must understand the functions of all their components. Numerous protein functions remain unknown, even in the organisms subjected to the most experimentation. For instance, 17% of the genome of the yeast Saccharomyces cerevisiae has not yet been characterized [117]. More than 15,537 (56%) of the 27,662 protein-coding genes in Arabidopsis still have uncharacterized functions in the curator summary of functional descriptions (Araport11, www.arabidopsis.org, accessed on 20 January 2023). Unfortunately, DUFs are frequently neglected due to their little relevance and are only discovered in a few genomes. In a nutshell, the present review summarizes the functional research on DUFs in plants and their regulatory functions in plant growth, development, and stress response. It is important to note that DUFs can be considered exploitable treasures given the advancement of science and technology. Although understanding the DUF genes is one of the greatest challenges in plant science, it is anticipated that a growing number of new genes with unknown functions will be uncovered and elucidated in the future. The application of computer science will be one of the benefits of studying DUF proteins; mining combined with various omics data enables researchers to predict the functional direction of proteins. The advancement of genetic and biochemical experimentation technology will speed up the comprehensive understanding of the intricate life mechanisms of the plant body.

Author Contributions

P.L., N.B. and T.Z. conceived and planned this review paper. J.W., C.Z., A.H. and G.M.A.A. finished part of the data collection and content preparation of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key R&D Program of China (2021YFD1201603), the National Natural Science Foundation of China (grant nos. 32171965, 31871646, 31571691), the Jiangsu Seed Industry Revitalization Project [JBGS(2021)014], and the Jiangsu Collaborative Innovation Centre for Modern Crop Production (JCIC-MCP).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Schultz, J.; Milpetz, F.; Bork, P.; Ponting, C.P. SMART, a simple modular architecture research tool: Identification of signaling domains. Proc. Natl. Acad. Sci. USA 1998, 95, 5857–5864. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Bateman, A.; Coggill, P.; Finn, R.D. DUFs: Families in search of function. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 2010, 66, 1148–1152. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Mistry, J.; Chuguransky, S.; Williams, L.; Qureshi, M.; Salazar, G.A.; Sonnhammer, E.L.L.; Tosatto, S.C.E.; Paladin, L.; Raj, S.; Richardson, L.J.; et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021, 49, D412–D419. [Google Scholar] [CrossRef] [PubMed]
  4. Goodacre, N.F.; Gerloff, D.L.; Uetz, P. Protein domains of unknown function are essential in bacteria. mBio 2013, 5, e00744-13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Kraus, A.; Weskamp, M.; Zierles, J.; Balzer, M.; Busch, R.; Eisfeld, J.; Lambertz, J.; Nowaczyk, M.M.; Narberhaus, F. Arginine-rich small proteins with a domain of unknown function, DUF1127, play a role in phosphate and carbon metabolism of Agrobacterium tumefaciens. J. Bacteriol. 2020, 202, e00309-20. [Google Scholar] [CrossRef]
  6. Zhang, X.; Carter, M.S.; Vetting, M.W.; San Francisco, B.; Zhao, S.; Al-Obaidi, N.F.; Solbiati, J.O.; Thiaville, J.J.; de Crécy-Lagard, V.; Jacobson, M.P.; et al. Assignment of function to a domain of unknown function: DUF1537 is a new kinase family in catabolic pathways for acid sugars. Proc. Natl. Acad. Sci. USA 2016, 113, E4161–E4169. [Google Scholar] [CrossRef] [Green Version]
  7. Pei, J.; Mitchell, D.A.; Dixon, J.E.; Grishin, N.V. Expansion of type II CAAX proteases reveals evolutionary origin of γ-secretase subunit APH-1. J. Mol. Biol. 2011, 410, 18–26. [Google Scholar] [CrossRef] [Green Version]
  8. Prakash, A.; Yogeeshwari, S.; Sircar, S.; Agrawal, S. Protein Domain of Unknown Function 3233 is a translocation domain of autotransporter secretory mechanism in Gamma proteobacteria. PLoS ONE 2011, 6, e25570. [Google Scholar] [CrossRef] [Green Version]
  9. Zheng, M.; Ginalski, K.; Rychlewski, L.; Grishin, N.V. Protein domain of unknown function DUF1023 is an α/β hydrolase. Proteins 2005, 59, 1–6. [Google Scholar] [CrossRef]
  10. Schirrmeister, J.; Zocher, S.; Flor, L.; Göttfert, M.; Zehner, S. The domain of unknown function DUF1521 exhibits metal ion-inducible autocleavage activity—A novel example from a putative effector protein of Vibrio coralliilyticus ATCC BAA-450. FEMS Microbiol. Lett. 2013, 343, 177–182. [Google Scholar] [CrossRef] [Green Version]
  11. Zhang, H.; Zhu, F.; Yang, T.; Ding, L.; Zhou, M.; Li, J.; Haslam, S.M.; Dell, A.; Erlandsen, H.; Wu, H. The highly conserved Domain of Unknown Function 1792 has a distinct glycosyltransferase fold. Nat. Commun. 2014, 5, 4339. [Google Scholar] [CrossRef] [Green Version]
  12. Jankevicius, G.; Ariza, A.; Ahel, M.; Ahel, I. The toxin-antitoxin system DarTG catalyzes reversible ADP-ribosylation of DNA. Mol. Cell 2016, 64, 1109–1116. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Manzanares, C.; Barth, S.; Thorogood, D.; Byrne, S.L.; Yates, S.; Czaban, A.; Asp, T.; Yang, B.; Studer, B. A gene encoding a DUF247 domain protein cosegregates with the S self-incompatibility locus in perennial ryegrass. Mol. Biol. Evol. 2016, 33, 870–884. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Sakata, M.; Takano-Kai, N.; Miyazaki, Y.; Kanamori, H.; Wu, J.; Matsumoto, T.; Doi, K.; Yasui, H.; Yoshimura, A.; Yamagata, Y. Domain unknown function DUF1668-containing genes in multiple lineages are responsible for F1 pollen sterility in rice. Front. Plant Sci. 2021, 11, 632420. [Google Scholar] [CrossRef] [PubMed]
  15. Ranocha, P.; Denance, N.; Vanholme, R.; Freydier, A.; Martinez, Y.; Hoffmann, L.; Kohler, L.; Pouzet, C.; Renou, J.P.; Sundberg, B.; et al. Walls are thin 1 (WAT1), an Arabidopsis homolog of Medicago truncatula NODULIN21, is a tonoplast-localized protein required for secondary wall formation in fibers. Plant J. 2010, 63, 469–483. [Google Scholar] [CrossRef]
  16. Vaattovaara, A.; Brandt, B.; Rajaraman, S.; Safronov, O.; Veidenberg, A.; Luklova, M.; Kangasjarvi, J.; Loytynoja, A.; Hothorn, M.; Salojarvi, J.; et al. Mechanistic insights into the evolution of DUF26-containing proteins in land plants. Commun. Biol. 2019, 2, 56. [Google Scholar] [CrossRef]
  17. Stonebloom, S.; Ebert, B.; Xiong, G.; Pattathil, S.; Birdseye, D.; Lao, J.; Pauly, M.; Hahn, M.G.; Heazlewood, J.L.; Scheller, H.V. A DUF-246 family glycosyltransferase-like gene affects male fertility and the biosynthesis of pectic arabinogalactans. BMC Plant Biol. 2016, 16, 90. [Google Scholar] [CrossRef] [Green Version]
  18. Yu, C.Y.; Sharma, O.; Nguyen, P.H.T.; Hartono, C.D.; Kanehara, K. A pair of DUF538 domain-containing proteins modulates plant growth and trichome development through the transcriptional regulation of GLABRA1 in Arabidopsis thaliana. Plant J. 2021, 108, 992–1004. [Google Scholar] [CrossRef]
  19. Urbanowicz, B.R.; Pena, M.J.; Ratnaparkhe, S.; Avci, U.; Backe, J.; Steet, H.F.; Foston, M.; Li, H.J.; O’Neill, M.A.; Ragauskas, A.J.; et al. 4-O-methylation of glucuronic acid in Arabidopsis glucuronoxylan is catalyzed by a domain of unknown function family 579 protein. Proc. Natl. Acad. Sci. USA 2012, 109, 14253–14258. [Google Scholar] [CrossRef] [Green Version]
  20. Moriwaki, T.; Miyazawa, Y.; Kobayashi, A.; Uchida, M.; Watanabe, C.; Fujii, N.; Takahashi, H. Hormonal regulation of lateral root development in Arabidopsis modulated by MIZ1 and requirement of GNOM activity for MIZ1 function. Plant Physiol. 2011, 157, 1209–1220. [Google Scholar] [CrossRef] [Green Version]
  21. Zuniga-Sanchez, E.; Soriano, D.; Martinez-Barajas, E.; Orozco-Segovia, A.; Gamboa-deBuen, A. BIIDXI, the At4g32460 DUF642 gene, is involved in pectin methyl esterase regulation during Arabidopsis thaliana seed germination and plant development. BMC Plant Biol. 2014, 14, 338. [Google Scholar] [CrossRef] [Green Version]
  22. Tong, H.Y.; Leasure, C.D.; Hou, X.W.; Yuen, G.; Briggs, W.; He, Z.H. Role of root UV-B sensing in Arabidopsis early seedling development. Proc. Natl. Acad. Sci. USA 2008, 105, 21039–21044. [Google Scholar] [CrossRef] [Green Version]
  23. Cao, X.; Yang, K.Z.; Xia, C.; Zhang, X.Q.; Chen, L.Q.; Ye, D. Characterization of DUF724 gene family in Arabidopsis thaliana. Plant Mol. Biol. 2010, 72, 61–73. [Google Scholar] [CrossRef]
  24. Jones-Rhoades, M.W.; Borevitz, J.O.; Preuss, D. Genome-wide expression profiling of the Arabidopsis female gametophyte identifies families of small, secreted proteins. PLoS Genet. 2007, 3, 1848–1861. [Google Scholar] [CrossRef]
  25. Kim, S.J.; Ryu, M.Y.; Kim, W.T. Suppression of Arabidopsis RING-DUF1117 E3 ubiquitin ligases, AtRDUF1 and AtRDUF2, reduces tolerance to ABA-mediated drought stress. Biochem. Biophys. Res. Commun. 2012, 420, 141–147. [Google Scholar] [CrossRef]
  26. Mewalal, R.; Mizrachi, E.; Coetzee, B.; Mansfield, S.D.; Myburg, A.A. The Arabidopsis Domain of Unknown Function 1218 (DUF1218) containing proteins, MODIFYING WALL LIGNIN-1 and 2 (At1g31720/MWL-1 and At4g19370/MWL-2) function redundantly to alter secondary cell wall lignin content. PLoS ONE 2016, 11, e0150254. [Google Scholar] [CrossRef]
  27. Li, Y.; Huang, Y.; Wen, Y.; Wang, D.; Liu, H.; Li, Y.; Zhao, J.; An, L.; Yu, F.; Liu, X. The Domain of Unknown Function 4005 (DUF4005) in an Arabidopsis IQD protein functions in microtubule binding. J. Biol. Chem. 2021, 297, 100849. [Google Scholar] [CrossRef] [PubMed]
  28. Yang, Q.; Niu, X.; Tian, X.; Zhang, X.; Cong, J.; Wang, R.; Zhang, G.; Li, G. Comprehensive genomic analysis of the DUF4228 gene family in land plants and expression profiling of ATDUF4228 under abiotic stresses. BMC Genom. 2020, 21, 12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Bateman, A.; Coin, L.; Durbin, R.; Finn, R.D.; Hollich, V.; Griffiths-Jones, S.; Khanna, A.; Marshall, M.; Moxon, S.; Sonnhammer, E.L.; et al. The Pfam protein families database. Nucleic Acids Res. 2004, 32, D138–D141. [Google Scholar] [CrossRef] [PubMed]
  30. Zhong, R.; Cui, D.; Ye, Z.H. Evolutionary origin of O-acetyltransferases responsible for glucomannan acetylation in land plants. New Phytol. 2019, 224, 466–479. [Google Scholar] [CrossRef] [PubMed]
  31. Yang, Y.; Yoo, C.G.; Winkeler, K.A.; Collins, C.M.; Hinchee, M.A.W.; Jawdy, S.S.; Gunter, L.E.; Engle, N.L.; Pu, Y.; Yang, X.; et al. Overexpression of a Domain of Unknown Function 231-containing protein increases O-xylan acetylation and cellulose biosynthesis in Populus. Biotechnol. Biofuels 2017, 10, 311. [Google Scholar] [CrossRef] [PubMed]
  32. Jensen, J.K.; Kim, H.; Cocuron, J.C.; Orler, R.; Ralph, J.; Wilkerson, C.G. The DUF579 domain containing proteins IRX15 and IRX15-L affect xylan synthesis in Arabidopsis. Plant J. 2011, 66, 387–400. [Google Scholar] [CrossRef]
  33. Gao, Y.; Badejo, A.A.; Sawa, Y.; Ishikawa, T. Analysis of two L-Galactono-1,4-lactone-responsive genes with complementary expression during the development of Arabidopsis thaliana. Plant Cell Physiol. 2012, 53, 592–601. [Google Scholar] [CrossRef] [PubMed]
  34. Yang, Y.; Yoo, C.G.; Guo, H.B.; Rottmann, W.; Winkeler, K.A.; Collins, C.M.; Gunter, L.E.; Jawdy, S.S.; Yang, X.; Guo, H.; et al. Overexpression of a Domain of Unknown Function 266-containing protein results in high cellulose content, reduced recalcitrance, and enhanced plant growth in the bioenergy crop Populus. Biotechnol. Biofuels 2017, 10, 74. [Google Scholar] [CrossRef] [Green Version]
  35. Shen, R.; Wang, L.; Liu, X.; Wu, J.; Jin, W.; Zhao, X.; Xie, X.; Zhu, Q.; Tang, H.; Li, Q.; et al. Genomic structural variation-mediated allelic suppression causes hybrid male sterility in rice. Nat. Commun. 2017, 8, 1310. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Albornos, L.; Martin, I.; Iglesias, R.; Jimenez, T.; Labrador, E.; Dopico, B. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae. BMC Plant Biol. 2012, 12, 207. [Google Scholar] [CrossRef] [Green Version]
  37. Yuan, G.; Zou, T.; He, Z.; Xiao, Q.; Li, G.; Liu, S.; Xiong, P.; Chen, H.; Peng, K.; Zhang, X.; et al. SWOLLEN TAPETUM AND STERILITY 1 is required for tapetum degeneration and pollen wall formation in rice. Plant Physiol. 2022, 190, 352–370. [Google Scholar] [CrossRef]
  38. Marks, M.D.; Wenger, J.P.; Gilding, E.; Jilk, R.; Dixon, R.A. Transcriptome analysis of Arabidopsis wild-type and gl3-sst sim trichomes identifies four additional genes required for trichome development. Mol. Plant 2009, 2, 803–822. [Google Scholar] [CrossRef]
  39. Kodama, Y.; Suetsugu, N.; Wada, M. Novel protein-protein interaction family proteins involved in chloroplast movement response. Plant Signal. Behav. 2011, 6, 483–490. [Google Scholar] [CrossRef] [Green Version]
  40. Tian, X.; Niu, X.; Chang, Z.; Zhang, X.; Wang, R.; Yang, Q.; Li, G. DUF1005 family identification, evolution analysis in plants, and primary root elongation regulation of CiDUF1005 from Caragana intermedia. Front. Genet. 2022, 13, 807293. [Google Scholar] [CrossRef]
  41. Yang, S.Q.; Li, W.Q.; Miao, H.; Gan, P.F.; Qiao, L.; Chang, Y.L.; Shi, C.H.; Chen, K.M. REL2, a gene encoding an unknown function protein which contains DUF630 and DUF632 domains controls leaf rolling in rice. Rice 2016, 9, 37. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Malik, G.; Dangwal, M.; Kapoor, S.; Kapoor, M. Role of DNA methylation in growth and differentiation in Physcomitrella patens and characterization of cytosine DNA methyltransferases. FEBS J. 2012, 279, 4081–4094. [Google Scholar] [CrossRef] [PubMed]
  43. Zhu, X.; Shen, W.; Huang, J.; Zhang, T.; Zhang, X.; Cui, Y.; Sang, X.; Ling, Y.; Li, Y.; Wang, N.; et al. Mutation of the OsSAC1 gene, which encodes an endoplasmic reticulum protein with an unknown function, causes sugar accumulation in rice leaves. Plant Cell Physiol. 2018, 59, 487–499. [Google Scholar] [CrossRef] [PubMed]
  44. Li, K.; Chen, Y.; Luo, Y.; Huang, F.; Zhao, C.; Cheng, F.; Xiang, X.; Pan, G. A 22-bp deletion in OsPLS3 gene encoding a DUF266-containing protein is implicated in rice leaf senescence. Plant Mol. Biol. 2018, 98, 19–32. [Google Scholar] [CrossRef] [PubMed]
  45. Wilson-Sanchez, D.; Martinez-Lopez, S.; Navarro-Cartagena, S.; Jover-Gil, S.; Micol, J.L. Members of the DEAL subfamily of the DUF1218 gene family are required for bilateral symmetry but not for dorsoventrality in Arabidopsis leaves. New Phytol. 2018, 217, 1307–1321. [Google Scholar] [CrossRef] [Green Version]
  46. Jensen, J.K.; Schultink, A.; Keegstra, K.; Wilkerson, C.G.; Pauly, M. RNA-Seq analysis of developing nasturtium seeds (Tropaeolum majus): Identification and characterization of an additional galactosyltransferase involved in xyloglucan biosynthesis. Mol. Plant 2012, 5, 984–992. [Google Scholar] [CrossRef] [Green Version]
  47. Klein-Marcuschamer, D.; Oleskowicz-Popiel, P.; Simmons, B.A.; Blanch, H.W. The challenge of enzyme cost in the production of lignocellulosic biofuels. Biotechnol. Bioeng. 2012, 109, 1083–1087. [Google Scholar] [CrossRef]
  48. Pawar, P.M.; Koutaniemi, S.; Tenkanen, M.; Mellerowicz, E.J. Acetylation of woody lignocellulose: Significance and regulation. Front. Plant Sci. 2013, 4, 118. [Google Scholar] [CrossRef] [Green Version]
  49. Wolf, S.; Hematy, K.; Hofte, H. Growth control and cell wall signaling in plants. Annu. Rev. Plant Biol. 2012, 63, 381–407. [Google Scholar] [CrossRef] [Green Version]
  50. Dhugga, K.S. Building the wall: Genes and enzyme complexes for polysaccharide synthases. Curr. Opin. Plant Biol. 2001, 4, 488–493. [Google Scholar] [CrossRef]
  51. Carpita, N.; Tierney, M.; Campbell, M. Molecular biology of the plant cell wall: Searching for the genes that define structure, architecture and dynamics. Plant Mol. Biol. 2001, 47, 1–5. [Google Scholar] [CrossRef]
  52. Gille, S.; de Souza, A.; Xiong, G.; Benz, M.; Cheng, K.; Schultink, A.; Reca, I.-B.; Pauly, M. O-acetylation of Arabidopsis hemicellulose xyloglucan requires AXY4 or AXY4L, proteins with a TBL and DUF231 domain. Plant Cell 2011, 23, 4041–4053. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Gille, S.; Pauly, M. O-acetylation of plant cell wall polysaccharides. Front. Plant Sci. 2012, 3, 12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Grantham, N.J.; Wurman-Rodrich, J.; Terrett, O.M.; Lyczakowski, J.J.; Stott, K.; Iuga, D.; Simmons, T.J.; Durand-Tardif, M.; Brown, S.; Dupree, R. An even pattern of xylan substitution is critical for interaction with cellulose in plant cell walls. Nat. Plants 2017, 3, 859–865. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Yuan, Y.; Teng, Q.; Zhong, R.; Ye, Z.-H. The Arabidopsis DUF231 domain-containing protein ESK1 mediates 2-O-and 3-O-acetylation of xylosyl residues in xylan. Plant Cell Physiol. 2013, 54, 1186–1199. [Google Scholar] [CrossRef]
  56. Brown, D.; Wightman, R.; Zhang, Z.; Gomez, L.D.; Atanassov, I.; Bukowski, J.P.; Tryfona, T.; McQueen-Mason, S.J.; Dupree, P.; Turner, S. Arabidopsis genes IRREGULAR XYLEM (IRX15) and IRX15L encode DUF579-containing proteins that are essential for normal xylan deposition in the secondary cell wall. Plant J. 2011, 66, 401–413. [Google Scholar] [CrossRef]
  57. Song, D.; Gui, J.; Liu, C.; Sun, J.; Li, L. Suppression of PtrDUF579-3 expression causes structural changes of the glucuronoxylan in Populus. Front. Plant Sci. 2016, 7, 493. [Google Scholar] [CrossRef] [Green Version]
  58. Huang, X.; Wang, G.; Shen, Y.; Huang, Z. The wheat gene TaST can increase the salt tolerance of transgenic Arabidopsis. Plant Cell Rep. 2012, 31, 339–347. [Google Scholar] [CrossRef]
  59. Oda, S.; Kaneko, F.; Yano, K.; Fujioka, T.; Masuko, H.; Park, J.-I.; Kikuchi, S.; Hamada, K.; Endo, M.; Nagano, K. Morphological and gene expression analysis under cool temperature conditions in rice anther development. Genes Genet. Syst. 2010, 85, 107–120. [Google Scholar] [CrossRef] [Green Version]
  60. Roschzttardtz, H.; Paez-Valencia, J.; Dittakavi, T.; Jali, S.; Reyes, F.C.; Baisa, G.; Anne, P.; Gissot, L.; Palauqui, J.-C.; Masson, P.H. The VASCULATURE COMPLEXITY AND CONNECTIVITY gene encodes a plant-specific protein required for embryo provasculature development. Plant Physiol. 2014, 166, 889–902. [Google Scholar] [CrossRef] [Green Version]
  61. Ubeda-Tomas, S.; Edvardsson, E.; Eland, C.; Singh, S.K.; Zadik, D.; Aspeborg, H.; Gorzsàs, A.; Teeri, T.T.; Sundberg, B.; Persson, P. Genomic-assisted identification of genes involved in secondary growth in Arabidopsis utilising transcript profiling of poplar wood-forming tissues. Physiol. Plant 2007, 129, 415–428. [Google Scholar] [CrossRef]
  62. Persson, S.; Wei, H.; Milne, J.; Page, G.P.; Somerville, C.R. Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets. Proc. Natl. Acad. Sci. USA 2005, 102, 8633–8638. [Google Scholar] [CrossRef] [Green Version]
  63. Brown, D.M.; Zeef, L.A.H.; Ellis, J.; Goodacre, R.; Turner, S.R. Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 2005, 17, 2281–2295. [Google Scholar] [CrossRef]
  64. Coyne, J.A. Sympatric speciation. Curr. Biol. 2007, 17, R787–R788. [Google Scholar] [CrossRef] [Green Version]
  65. Bomblies, K.; Weigel, D. Hybrid necrosis: Autoimmunity as a common barrier to gene flow in plants. Nat. Rev. Genet. 2007, 8, 382–393. [Google Scholar] [CrossRef]
  66. Alcázar, R.; García, A.V.; Parker, J.E.; Reymond, M. Incremental steps toward incompatibility revealed by Arabidopsis epistatic interactions modulating salicylic acid pathway activation. Proc. Natl. Acad. Sci. USA 2009, 106, 334–339. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Jeuken, M.J.; Zhang, N.W.; McHale, L.K.; Pelgrom, K.; Den Boer, E.; Lindhout, P.; Michelmore, R.W.; Visser, R.G.; Niks, R.E. Rin4 causes hybrid necrosis and race-specific resistance in an interspecific lettuce hybrid. Plant Cell 2009, 21, 3368–3378. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Yamamoto, E.; Schrier, L.; Hadjipanayis, A.; Stiris, T.; Ross-Russell, R.I.; Valiulis, A.; Turner, M.A.; Zhao, W.; De Cock, P.; de Wildt, S.N.; et al. Off-label use of medicines in neonates, infants, children, and adolescents: A joint policy statement by the European Academy of Paediatrics and the European society for Developmental Perinatal and Pediatric Pharmacology. Eur. J. Pediatr. 2020, 179, 839–847. [Google Scholar]
  69. Chen, C.; Chen, H.; Lin, Y.-S.; Shen, J.-B.; Shan, J.-X.; Qi, P.; Shi, M.; Zhu, M.-Z.; Huang, X.-H.; Feng, Q. A two-locus interaction causes interspecific hybrid weakness in rice. Nat. Commun. 2014, 5, 3357. [Google Scholar] [CrossRef] [Green Version]
  70. Atanasov, K.E.; Liu, C.; Erban, A.; Kopka, J.; Parker, J.E.; Alcázar, R. NLR mutations suppressing immune hybrid incompatibility and their effects on disease resistance. Plant Physiol. 2018, 177, 1152–1169. [Google Scholar] [CrossRef] [Green Version]
  71. Rieseberg, L.H.; Blackman, B.K. Speciation genes in plants. Ann. Bot. 2010, 106, 439–455. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. Yang, J.; Zhao, X.; Cheng, K.; Du, H.; Ouyang, Y.; Chen, J.; Qiu, S.; Huang, J.; Jiang, Y.; Jiang, L.; et al. A killer-protector system regulates both hybrid sterility and segregation distortion in rice. Science 2012, 337, 1336–1340. [Google Scholar] [CrossRef] [PubMed]
  73. Kubo, T.; Takashi, T.; Ashikari, M.; Yoshimura, A.; Kurata, N. Two tightly linked genes at the hsa1 locus cause both F1 and F2 hybrid sterility in rice. Mol. Plant 2016, 9, 221–232. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  74. Fernandez, L.; Torregrosa, L.; Terrier, N.; Sreekantan, L.; Grimplet, J.; Davies, C.; Thomas, M.R.; Romieu, C.; Ageorges, A. Identification of genes associated with flesh morphogenesis during grapevine fruit development. Plant Mol. Biol. 2007, 63, 307–323. [Google Scholar] [CrossRef] [PubMed]
  75. Wechter, W.P.; Levi, A.; Harris, K.R.; Davis, A.R.; Fei, Z.; Katzir, N.; Giovannoni, J.J.; Salman-Minkov, A.; Hernandez, A.; Thimmapuram, J. Gene expression in developing watermelon fruit. BMC Genom. 2008, 9, 275. [Google Scholar] [CrossRef] [Green Version]
  76. de Vries, S.C.; de Vos, W.M.; Harmsen, M.C.; Wessels, J.G. A shoot-specific mRNA from pea: Nucleotide sequence and regulation as compared to light-induced mRNAs. Plant Mol. Biol. 1985, 4, 95–102. [Google Scholar] [CrossRef]
  77. Hernández-Nistal, J.; Martín, I.; Esteban, R.; Dopico, B.; Labrador, E. Abscisic acid delays chickpea germination by inhibiting water uptake and down-regulating genes encoding cell wall remodelling proteins. J. Plant Growth Regul. 2010, 61, 175–183. [Google Scholar] [CrossRef]
  78. deNettancourt, D. Incompatibility in angiosperms. Sex. Plant Reprod. 1997, 10, 185–199. [Google Scholar] [CrossRef]
  79. Gholizadeh, A. Heterologous expression of stress-responsive DUF538 domain containing protein and its morpho-biochemical consequences. Protein J. 2011, 30, 351–358. [Google Scholar] [CrossRef]
  80. Gholizadeh, A. Pectin methylesterase activity of plant DUF538 protein superfamily. Physiol. Mol. Biol. Plants 2020, 26, 829–839. [Google Scholar] [CrossRef]
  81. Gholizadeh, A.; Kohnehrouz, S.B. DUF538 protein super family is predicted to be the potential homologue of bactericidal/permeability-increasing protein in plant system. Protein J. 2013, 32, 163–171. [Google Scholar] [CrossRef] [PubMed]
  82. Gholizadeh, A. DUF538 protein superfamily is predicted to be chlorophyll hydrolyzing enzymes in plants. Physiol. Mol. Biol. Plants 2016, 22, 77–85. [Google Scholar] [CrossRef] [Green Version]
  83. Yu, C.-Y.; Kanehara, K. The unfolded protein response modulates a phosphoinositide-binding protein through the IRE1-bZIP60 pathway. Plant Physiol. 2020, 183, 221–235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  84. Wang, Y.; Cai, S.; Yin, L.; Shi, K.; Xia, X.; Zhou, Y.; Yu, J.; Zhou, J. Tomato HsfA1a plays a critical role in plant drought tolerance by activating ATG genes and inducing autophagy. Autophagy 2015, 11, 2033–2047. [Google Scholar] [CrossRef] [Green Version]
  85. Nguyen, H.N.; Lai, N.; Kisiala, A.B.; Emery, R.J.N. Isopentenyltransferases as master regulators of crop performance: Their function, manipulation, and genetic potential for stress adaptation and yield improvement. Plant Biotechnol. J. 2021, 19, 1297–1313. [Google Scholar] [CrossRef] [PubMed]
  86. Nabi, R.B.S.; Tayade, R.; Imran, Q.M.; Hussain, A.; Shahid, M.; Yun, B.W. Functional insight of nitric-oxide induced DUF genes in Arabidopsis thaliana. Front. Plant Sci. 2020, 11, 1041. [Google Scholar] [CrossRef]
  87. Chen, X.; Zhang, Z.; Visser, R.G.; Broekgaarden, C.; Vosman, B. Overexpression of IRM1 enhances resistance to aphids in Arabidopsis thaliana. PLoS ONE 2013, 8, e70914. [Google Scholar] [CrossRef] [Green Version]
  88. Zhao, Y.P.; Shen, J.L.; Li, W.J.; Wu, N.; Chen, C.; Hou, Y.X. Evolutionary and characteristic analysis of RING-DUF1117 E3 ubiquitin ligase genes in gossypium discerning the role of GhRDUF4D in verticillium dahliae resistance. Biomolecules 2021, 11, 1145. [Google Scholar] [CrossRef]
  89. Hao, Q.; Yang, Y.; Shan, Z.; Chen, H.; Zhang, C.; Chen, L.; Yuan, S.; Zhang, X.; Chen, S.; Yang, Z.; et al. Genome-wide investigation and expression profiling under abiotic stresses of a soybean unknown function (DUF21) and Cystathionine-β-Synthase (CBS) domain-containing protein family. Biochem. Genet. 2021, 59, 83–113. [Google Scholar] [CrossRef]
  90. Nabi, R.B.S.; Tayade, R.; Hussain, A.; Adhikari, A.; Lee, I.J.; Loake, G.J.; Yun, B.W. A novel DUF569 gene is a positive regulator of the drought stress response in Arabidopsis. Int. J. Mol. Sci. 2021, 22, 5316. [Google Scholar] [CrossRef]
  91. Palmeros-Suarez, P.A.; Massange-Sanchez, J.A.; Sanchez-Segura, L.; Martinez-Gallardo, N.A.; Rangel, E.E.; Gomez-Leyva, J.F.; Delano-Frier, J.P. AhDGR2, an amaranth abiotic stress-induced DUF642 protein gene, modifies cell wall structure and composition and causes salt and ABA hyper-sensibility in transgenic Arabidopsis. Planta 2017, 245, 623–640. [Google Scholar] [CrossRef]
  92. Zhou, X.; Zhu, X.; Shao, W.; Song, J.; Jiang, W.; He, Y.; Yin, J.; Ma, D.; Qiao, Y. Genome-wide mining of wheat DUF966 gene family provides new insights into salt stress responses. Front. Plant Sci. 2020, 11, 569838. [Google Scholar] [CrossRef] [PubMed]
  93. Li, J.; Han, Y.; Zhao, Q.; Li, C.; Xie, Q.; Chong, K.; Xu, Y. The E3 ligase AtRDUF1 positively regulates salt stress responses in Arabidopsis thaliana. PLoS ONE 2013, 8, e71078. [Google Scholar] [CrossRef] [PubMed]
  94. Guo, C.; Luo, C.; Guo, L.; Li, M.; Guo, X.; Zhang, Y.; Wang, L.; Chen, L. OsSIDP366, a DUF1644 gene, positively regulates responses to drought and salt stresses in rice. J. Integr. Plant Biol. 2016, 58, 492–502. [Google Scholar] [CrossRef] [PubMed]
  95. Wang, J.; An, C.; Guo, H.; Yang, X.; Chen, J.; Zong, J.; Li, J.; Liu, J. Physiological and transcriptomic analyses reveal the mechanisms underlying the salt tolerance of Zoysia japonica Steud. BMC Plant Biol. 2020, 20, 114. [Google Scholar] [CrossRef]
  96. Ying, S. Genome-wide identification and transcriptional analysis of Arabidopsis DUF506 Gene Family. Int. J. Mol. Sci. 2021, 22, 11442. [Google Scholar] [CrossRef]
  97. Ying, S.; Scheible, W.R. A novel calmodulin-interacting Domain of Unknown Function 506 protein represses root hair elongation in Arabidopsis. Plant Cell Environ. 2022, 45, 1796–1812. [Google Scholar] [CrossRef]
  98. Yin, Y.; Vafeados, D.; Tao, Y.; Yoshida, S.; Asami, T.; Chory, J. A new class of transcription factors mediates brassinosteroid-regulated gene expression in Arabidopsis. Cell 2005, 120, 249–259. [Google Scholar] [CrossRef] [Green Version]
  99. Ryu, H.; Kim, K.; Cho, H.; Hwang, I. Predominant actions of cytosolic BSU1 and nuclear BIN2 regulate subcellular localization of BES1 in brassinosteroid signaling. Mol. Cells 2010, 29, 291–296. [Google Scholar] [CrossRef]
  100. Guo, R.; Qian, H.; Shen, W.; Liu, L.; Zhang, M.; Cai, C.; Zhao, Y.; Qiao, J.; Wang, Q. BZR1 and BES1 participate in regulation of glucosinolate biosynthesis by brassinosteroids in Arabidopsis. J. Exp. Bot. 2013, 64, 2401–2412. [Google Scholar] [CrossRef] [Green Version]
  101. Miyakawa, T.; Hatano, K.; Miyauchi, Y.; Suwa, Y.; Sawano, Y.; Tanokura, M. A secreted protein with plant-specific cysteine-rich motif functions as a mannose-binding lectin that exhibits antifungal activity. Plant Physiol. 2014, 166, 766–778. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  102. Miyakawa, T.; Miyazono, K.; Sawano, Y.; Hatano, K.; Tanokura, M. Crystal structure of ginkbilobin-2 with homology to the extracellular domain of plant cysteine-rich receptor-like kinases. Proteins 2009, 77, 247–251. [Google Scholar] [CrossRef]
  103. Ma, L.S.; Wang, L.; Trippel, C.; Mendoza-Mendoza, A.; Ullmann, S.; Moretti, M.; Carsten, A.; Kahnt, J.; Reissmann, S.; Zechmann, B.; et al. The Ustilago maydis repetitive effector Rsp3 blocks the antifungal activity of mannose-binding maize proteins. Nat. Commun. 2018, 9, 1711. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  104. Acharya, B.R.; Raina, S.; Maqbool, S.B.; Jagadeeswaran, G.; Mosher, S.L.; Appel, H.M.; Schultz, J.C.; Klessig, D.F.; Raina, R. Overexpression of CRK13, an Arabidopsis cysteine-rich receptor-like kinase, results in enhanced resistance to Pseudomonas syringae. Plant J. 2007, 50, 488–499. [Google Scholar] [CrossRef]
  105. Brunkard, J.O.; Zambryski, P.C. Plasmodesmata enable multicellularity: New insights into their evolution, biogenesis, and functions in development and immunity. Curr. Opin. Plant Biol. 2017, 35, 76–83. [Google Scholar] [CrossRef]
  106. Caillaud, M.C.; Wirthmueller, L.; Sklenar, J.; Findlay, K.; Piquerez, S.J.; Jones, A.M.; Robatzek, S.; Jones, J.D.; Faulkner, C. The plasmodesmal protein PDLP1 localises to haustoria-associated membranes during downy mildew infection and regulates callose deposition. PLoS Pathog. 2014, 10, e1004496. [Google Scholar] [CrossRef] [PubMed]
  107. Lim, G.H.; Shine, M.B.; de Lorenzo, L.; Yu, K.; Cui, W.; Navarre, D.; Hunt, A.G.; Lee, J.Y.; Kachroo, A.; Kachroo, P. Plasmodesmata localizing proteins regulate transport and signaling during systemic acquired immunity in plants. Cell Host Microbe 2016, 19, 541–549. [Google Scholar] [CrossRef] [Green Version]
  108. Cui, W.; Lee, J.Y. Arabidopsis callose synthases CalS1/8 regulate plasmodesmal permeability during stress. Nat. Plants 2016, 2, 16034. [Google Scholar] [CrossRef] [PubMed]
  109. Amari, K.; Boutant, E.; Hofmann, C.; Schmitt-Keichinger, C.; Fernandez-Calvino, L.; Didier, P.; Lerich, A.; Mutterer, J.; Thomas, C.L.; Heinlein, M.; et al. A family of plasmodesmal proteins with receptor-like properties for plant viral movement proteins. PLoS Pathog. 2010, 6, e1001119. [Google Scholar] [CrossRef] [Green Version]
  110. Waadt, R.; Seller, C.A.; Hsu, P.K.; Takahashi, Y.; Munemasa, S.; Schroeder, J.I. Plant hormone regulation of abiotic stress responses. Nat. Rev. Mol. Cell Biol. 2022, 23, 680–694. [Google Scholar] [CrossRef]
  111. Luo, C.; Guo, C.; Wang, W.; Wang, L.; Chen, L. Overexpression of a new stress-repressive gene OsDSR2 encoding a protein with a DUF966 domain increases salt and simulated drought stress sensitivities and reduces ABA sensitivity in rice. Plant Cell Rep. 2014, 33, 323–336. [Google Scholar] [CrossRef] [PubMed]
  112. Rusaczonek, A.; Czarnocka, W.; Kacprzak, S.; Witon, D.; Slesak, I.; Szechynska-Hebda, M.; Gawronski, P.; Karpinski, S. Role of phytochromes A and B in the regulation of cell death and acclimatory responses to UV stress in Arabidopsis thaliana. J. Exp. Bot. 2015, 66, 6679–6695. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  113. Schulze-Gahmen, U.; Pelaschier, J.; Yokota, H.; Kim, R.; Kim, S.H. Crystal structure of a hypothetical protein, TM841 of Thermotoga maritima, reveals its function as a fatty acid-binding protein. Proteins 2003, 50, 526–530. [Google Scholar] [CrossRef]
  114. Krishna, S.S.; Tautz, L.; Xu, Q.P.; McMullan, D.; Miller, M.D.; Abdubek, P.; Ambing, E.; Astakhova, T.; Axelrod, H.L.; Carlton, D.; et al. Crystal structure of NMA1982 from Neisseria meningitidis at 1.5 angstrom resolution provides a structural scaffold for nonclassical, eukaryotic-like phosphatases. Proteins 2007, 69, 415–421. [Google Scholar] [CrossRef] [PubMed]
  115. Harada, K.; Yamashita, E.; Inoue, K.; Yamaguchi, K.; Fujiwara, T.; Nakagawa, A.; Kawasaki, T.; Kojima, C. Plant-specific DUF1110 protein from Oryza sativa: Expression, purification and crystallization. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 2016, 72, 480–484. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  116. Bakolitsa, C.; Bateman, A.; Jin, K.K.; McMullan, D.; Krishna, S.S.; Miller, M.D.; Abdubek, P.; Acosta, C.; Astakhova, T.; Axelrod, H.L.; et al. The structure of Jann_2411 (DUF1470) from Jannaschia sp. at 1.45 angstrom resolution reveals a new fold (the ABATE domain) and suggests its possible role as a transcription regulator. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 2010, 66, 1198–1204. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  117. Pena-Castillo, L.; Hughes, T.R. Why are there still over 1000 uncharacterized yeast genes? Genetics 2007, 176, 7–14. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Table 1. The role of some DUFs in plant growth and development.
Table 1. The role of some DUFs in plant growth and development.
Plant Growth and DevelopmentProtein (Gene_ID)DUF (Rename)Pfam AccessionBiological FunctionPlantsReference
Plant cell wallsWAT1 (At1g75500)DUF6 (EamA)PF00892The essential role in the control of secondary cell wall formation in fibersArabidopsis[15]
PdDUF231A (Potri.009G072800) DUF231 (PMR5N)PF14416It affects cellulose biosynthesis and plays a role in the acetylation of xylanPopulus trichocarpa[31]
PAGR (At3g26370)DUF246 (O-FucT)PF10250It affects male fertility and the biosynthesis of pectic arabinogalactansArabidopsis[17]
PdDUF266A (Potri.011G009500)DUF266 (Branch)PF02485Overexpression of PdDUF266A can increase cellulose content, reduce recalcitrance, and enhance biomass productionPopulus trichocarpa[34]
IRX15 (At3g50220);
IRX15-L (At5g67210)
DUF579 (Polysacc_synt_4)PF04669IRX15 and IRX15-L affect xylan synthesis Arabidopsis[32]
DGR1 (At1g80240); DGR2 (At5g25460)DUF642PF04862L-Galactono-1,4-Lactone-responsive GenesArabidopsis[33]
MWL-1 (At1g31720); MWL-2 (At4g19370)DUF1218PF06749At1g31720 and At4g19370 function redundantly to alter secondary cell wall lignin contentArabidopsis[26]
Reproductive developmentLpSDUF247DUF247PF03140Cosegregates with the S self-incompatibility locusLolium perenne L.[13]
STS1 (LOC_Os03g02170)DUF726PF05277STS1 is required for tapetum degeneration and pollen wall formationOryza sativa[37]
Sc-j (Os03g0247300)DUF1618PF07762Loss of function in this domain could induce hybrid incompatibilityOryza sativa[35]
S22B_j (LOC_Os02g04420)DUF1668PF07893DUF1668-containing genes in multiple lineages are responsible for F1 pollen sterilityOryza sativa[14]
Shoot-specific (ST) proteinsDUF2775 (Organ_specific)PF10950This domain is considered to be linked with early fruit development, seed germination, and cell elongationMagnoliopsida[36]
Trichome developmentSVB (At1g56580); SVBL (At1g09310) DUF538PF04398DUF538 affects trichome developmentArabidopsis[18,38]
Other essential growth and developmentMIZ1 (At2g41660)DUF617PF04759MIZ1 regulates lateral root development in ArabidopsisArabidopsis[20]
REL2 (NP_001065395.1)DUF630; DUF632PF04783; PF04782Controls leaf rolling in riceOryza sativa[41]
DUF724 proteinsDUF724PF05266The DUF724 gene family is involved in the polar growth of plant cells via the transportation of RNAsArabidopsis[23]
WEB1 (At2g26570); PMI2 (At1g66840)DUF827 (WEMBL)PF05701Regulate chloroplast movement velocityArabidopsis[39]
CiDUF1005DUF1005PF06219Regulates primary root elongationCaragana intermedia[40]
Pp1s52_118
(Pp1s52_118V6.1);
Pp1s1_561
(Pp1s1_561V6.2)
DUF3444PF11926Role of DNA methylation in growth and differentiation in Physcomitrella patens and characterization of cytosine DNA methyltransferasesPhyscomitrella patens[42]
OsSAC1
(LOC_Os07G02520)
DUF4220; DUF594PF13968; PF04578Causes sugar accumulation in rice leavesOryza sativa[43]
OsPLS3
(LOC_Os12g42420)
DUF266
(Branch)
PF02485OsPLS3 gene encoding a DUF266-containing protein is implicated in rice leaf senescenceOryza sativa[44]
VCC
(At2g32280)
DUF1218PF06749VCC is required for bilateral symmetry, but its loss of function does not visibly affect dorsoventralityArabidopsis[45]
Table 2. The role of some DUFs in plant biotic and abiotic stress.
Table 2. The role of some DUFs in plant biotic and abiotic stress.
Biotic/Abiotic StressProtein (Gene_ID)DUF (Rename)Pfam AccessionBiological FunctionPlantsReference
Biotic stressCRRSPs, PDLPs, and CRKsDUF26 (Stress-antifung)PF01657DUF26 is land plant-specific, but structural analyses of PDLP ectodomains revealed substantial similarity to fungal lectins and thus may constitute a group of plant carbohydrate-binding proteinsLand plants[16]
AtDUF569 (At1g69890)DUF569PF04601AtDUF569 negatively regulates biotic stress responses (pathogen inoculation) Arabidopsis[86]
IRM1 (At5g65040)DUF581
(zf-FLZ)
PF04570Overexpression of IRM1 enhances resistance to aphids in Arabidopsis thalianaArabidopsis[87]
GhRDUF4DDUF1117PF06547RING-DUF1117 E3 ubiquitin ligase genes in Gossypium discerning the role of GhRDUF4D in Verticillium dahliae resistanceGossypium[88]
Drought and salt stress GmCBSDUF3DUF21 (CNNM)PF01595The overexpression of GmCBSDUF3 could enhance tolerance to drought and salt stress in ArabidopsisArabidopsis[89]
AtDUF569 (At1g69890)DUF569PF04601Positive regulator of drought-stress response in ArabidopsisArabidopsis[90]
AhDGR2DUF642PF04862Overexpression of AhDGR2 in transgenic Arabidopsis plants presents increased sensitivity to NaCl treatmentAmaranthus hypochondriacus[91]
TaDUF966-9BDUF966 (SOK)PF06136TaDUF966 genes play a vital role in salt-stress tolerance in wheatTriticum aestivum[92]
AtRDUF1DUF1117PF06547The E3 ligase AtRDUF1 (DUF1117) positively regulates salt stress responses in Arabidopsis thalianaArabidopsis[93]
OsSIDP366 (Os06g47860)DUF1644PF07800OsSIDP366, a DUF1644 gene, positively regulates responses to drought and salt stresses in ricerice[94]
ROPGEF7DUF315 (PRONE)PF03759ROPGEF7 (a DUF 315 protein-coding gene) might play important roles in the salt tolerance process in Z. japonica and might have contrasting functionsZoysia grass[95]
UFSPDUF1671PF07761UFSP (a DUF 1671 protein-coding gene) might play important roles in the salt-tolerance process in Z. japonica and might have contrasting functionsZoysia grass[95]
DUF4228 proteinsDUF4228 (PADRE)PF14009Expression profiling of the ATDUF4228 genes under abiotic stresses (mainly osmotic, salt, and cold) and protein–protein interaction prediction suggested that some ATDUF4228 genes may be involved in the pathways of plant resistance to abiotic stressesArabidopsis[28]
Ca(2+) signalingAtDUF506sDUF506 (PDDEXK_6)PF04720DUF506 genes have distinct biological functions, including responses to environmental stimuli and nutrient deficiencies, and participate in Ca(2+) signalingArabidopsis[96]
Phosphorus (P) stressRXR3 (At1g62424)DUF506 (PDDEXK_6)PF04720AtRXR3 is another DUF506 protein that attenuates P-limitation-induced root hair growth through mechanisms that involve RSL4 and interaction with CaM to modulate tip-focused [Ca2+]cyt oscillationsArabidopsis[97]
Brassinosteroid signalingBES1DUF822 (BES1_N)PF05687BES1 is an essential regulator downstream of brassinosteroid signaling and plays an important role in plant stress response, growth, and development.Cucumis sativus[98,99,100]
UV-BRUS1 (At3g45890)DUF647 (UVB_sens_prot)PF04884Role of root UV-B sensing in Arabidopsis early seedling developmentArabidopsis[22]
Table 3. Bioinformatics databases and tools.
Table 3. Bioinformatics databases and tools.
CategoryName AbbreviationDescriptionURL
Integrative databasesNCBIThe National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information.https://www.ncbi.nlm.nih.gov/ (accessed on 6 February 2023)
Database CommonsDatabase Commons integrates relevant information for all collected databases and catalogs each database based on its data type, species, subjects, and location.http://bigd.big.ac.cn/databasecommons/ (accessed on 6 February 2023)
InterProInterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites.https://www.ebi.ac.uk/interpro/ (accessed on 6 February 2023)
ExpasyIt is an extensible and integrative portal that provides access to over 160 databases and software tools developed by SIB Groups and supporting a range of life science and clinical research domains, from genomics, proteomics, and structural biology, to evolution and phylogeny, systems biology, and medical chemistry.https://www.expasy.org/ (accessed on 6 February 2023)
Ensembl PlantsEnsembl Genomes is developed by EMBL-EBI and powered by the Ensembl software system to analyze and visualize genomic data.http://plants.ensembl.org/index.html (accessed on 6 February 2023)
Comparative genomic databasesPhytozomePhytozome, the Plant Comparative Genomics portal of the Department of Energy’s Joint Genome Institute, provides JGI users and the broader plant science community a hub for accessing, visualizing, and analyzing JGI-sequenced plant genomes, as well as selected genomes and datasets that have been sequenced elsewhere.https://phytozome-next.jgi.doe.gov/ (accessed on 6 February 2023)
PLAZAPLAZA is an access point for plant comparative genomics, centralizing genomic data produced by different genome-sequencing initiatives.https://bioinformatics.psb.ugent.be/plaza/ (accessed on 6 February 2023)
Protein databasesUniProtUniProt is the world’s leading high-quality, comprehensive, and freely accessible protein sequence and functional information resource.https://www.uniprot.org/ (accessed on 6 February 2023)
CATH-Gene3DCATH is a classification of protein structures downloaded from the Protein Data Bank.http://www.cathdb.info/ (accessed on 6 February 2023)
SMARTSMART (a Simple Modular Architecture Research Tool) allows the identification and annotation of genetically mobile domains and the analysis of domain architectures.http://smart.embl-heidelberg.de/ (accessed on 6 February 2023)
PROSITEDatabase of protein domains, families, and functional siteshttp://prosite.expasy.org/ (accessed on 6 February 2023)
PANTHERThe PANTHER (Protein ANalysis THrough Evolutionary Relationships) classification system was designed to classify proteins (and their genes) to facilitate high-throughput analysis. The core of PANTHER is a comprehensive, annotated “library” of gene family phylogenetic trees.http://www.pantherdb.org/ (accessed on 6 February 2023)
RCSB PDBThese data can be explored in the context of external annotations providing a structural view of biology.https://www.rcsb.org/ (accessed on 6 February 2023)
AlphaFold Protein Structure DatabaseAlphaFold, a state-of-the-art AI system developed by DeepMind, is able to predict protein structures with remarkable accuracy and speed computationally.https://alphafold.ebi.ac.uk/ (accessed on 6 February 2023)
Gene expression databasesExpression AtlasDatabase of differential and baseline expressionhttps://www.ebi.ac.uk/gxa/home (accessed on 6 February 2023)
BARThe Botany Array Resource: e-Northerns, expression angling, and promoter analyseshttp://bar.utoronto.ca/#GeneExpressionAndProteinTools (accessed on 6 February 2023)
Bioinformatic toolsOrthoVenn2For whole-genome orthologous gene comparisons and annotations across multiple species.https://orthovenn2.bioinfotoolkits.net/home (accessed on 6 February 2023)
eggNOG-mappereggNOG-mapper v2 is a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed eggNOG v5.0 clusters and phylogenies.http://eggnog-mapper.embl.de/ (accessed on 6 February 2023)
STRINGProtein–protein interaction network functional enrichment analysis.https://cn.string-db.org/cgi/input.pl (accessed on 6 February 2023)
HMMERProvides life science researchers with fast and sensitive homology searcheshttps://www.ebi.ac.uk/Tools/hmmer/ (accessed on 6 February 2023)
SignalPThe SignalP server predicts the presence of signal peptides and the location of their cleavage sites in proteins from Archaea, Gram-positive Bacteria, Gram-negative Bacteria, and Eukarya.https://services.healthtech.dtu.dk/service.php?SignalP-5.0 (accessed on 6 February 2023)
PlantCAREPlantCARE is a plant cis-acting regulatory element database.http://bioinformatics.psb.ugent.be/webtools/plantcare/html/ (accessed on 6 February 2023)
Phyre2Phyre2 is designed around the idea that you have a protein sequence/gene and want to predict its three-dimensional (3D) structure.http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index (accessed on 6 February 2023)
iTOLiTOL is an online tool for displaying, annotating, and managing phylogenetic and other trees.https://itol.embl.de/index.shtml (accessed on 6 February 2023)
EvolviewEvolview is an interactive tree visualization tool designed to help researchers visualize phylogenetic trees and annotate them with additional information.http://evolgenius.info/#/ (accessed on 6 February 2023)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lv, P.; Wan, J.; Zhang, C.; Hina, A.; Al Amin, G.M.; Begum, N.; Zhao, T. Unraveling the Diverse Roles of Neglected Genes Containing Domains of Unknown Function (DUFs): Progress and Perspective. Int. J. Mol. Sci. 2023, 24, 4187. https://doi.org/10.3390/ijms24044187

AMA Style

Lv P, Wan J, Zhang C, Hina A, Al Amin GM, Begum N, Zhao T. Unraveling the Diverse Roles of Neglected Genes Containing Domains of Unknown Function (DUFs): Progress and Perspective. International Journal of Molecular Sciences. 2023; 24(4):4187. https://doi.org/10.3390/ijms24044187

Chicago/Turabian Style

Lv, Peiyun, Jinlu Wan, Chunting Zhang, Aiman Hina, G M Al Amin, Naheeda Begum, and Tuanjie Zhao. 2023. "Unraveling the Diverse Roles of Neglected Genes Containing Domains of Unknown Function (DUFs): Progress and Perspective" International Journal of Molecular Sciences 24, no. 4: 4187. https://doi.org/10.3390/ijms24044187

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop