Next Article in Journal
Plasma Short-Chain Fatty Acids and Cytokine Profiles in Chronic Kidney Disease: A Potential Pathophysiological Link
Previous Article in Journal
Phylostratigraphic Analysis Reveals the Evolutionary Origins and Potential Role of New Genes in the Adaptive Evolution of Spodoptera frugiperda
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

From Structure to Function of Promoters and 5′UTRs in Maize

by
Nikita V. Sytov
1,
Vladimir V. Choob
2,
Sileshi Nemomissa
3,
Alexander S. Mishin
1,* and
Maxim M. Perfilov
1
1
Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
2
Botanical Garden of Lomonosov Moscow State University, Vorobievy Gory 1 b.12, Moscow 119234, Russia
3
Department of Plant Biology and Biodiversity Management, Addis Ababa University, Addis Ababa P.O. Box 3434, Ethiopia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2026, 27(1), 548; https://doi.org/10.3390/ijms27010548
Submission received: 26 November 2025 / Revised: 24 December 2025 / Accepted: 3 January 2026 / Published: 5 January 2026
(This article belongs to the Topic Genetic Engineering in Agriculture, 2nd Edition)

Abstract

As a cornerstone of global agriculture, maize (Zea mays) is a crucial component of sustainable food systems and industrial uses. However, global agricultural production faces pressures from climate change, resource scarcity, and rising nutritional demands. To adapt to changes in their environment, plants evolved precise and sophisticated gene expression regulatory mechanisms. A majority of gene expression regulatory elements are located in promoters and untranslated regions of mRNA. This review aims to elucidate how promoters and 5′ untranslated regions function in complex synergy to regulate gene expression in maize. We discuss the structural organization of these regulatory elements, from their basic components to their integrated roles in shaping plant gene expression. Particular emphasis is placed on their significant impact on maize biotechnology, including strategies for controlling, fine-tuning, and enhancing gene expression for crop improvement. With this review we wish to guide future biotechnological innovations and food security.

1. Introduction

Eukaryotic gene expression relies on conserved mechanisms involving cis-regulatory elements (CREs), chromatin architecture, and RNA processing. However, plants have evolved several distinct features that reflect their unique developmental needs and sessile lifestyle. For instance, plants possess specialized nuclear RNA polymerases, Pol IV and Pol V, dedicated to transcribing noncoding RNAs essential for RNA-directed DNA methylation (RdDM). Transcriptional silencing via the RdDM pathway is essential for maintaining genome stability in plants, which commonly contain transposable elements and repetitive sequences. Additionally, RdDM plays a role in pathogen defense, abiotic stress response, and the regulation of several key developmental transitions [1]. The functional diversity of the abundant long non-coding RNAs in plants expands the range of potential regulatory mechanisms by interacting with chromatin-modifying proteins or sequestering regulatory proteins, thereby modulating gene expression [2,3]. Likewise, RNA-binding proteins in plants have undergone extensive expansion and diversification relative to other eukaryotes, providing an additional layer of post-transcriptional regulation essential for adaptation to environmental stresses [4]. These evolutionary adaptations underscore the complexity of regulatory networks in plants.
Along with unique features, plants also possess the fundamental eukaryotic transcription regulators, including promoters and untranslated regions of mRNA. The promoter acts as the primary controller, determining the transcription initiation site and orchestrating the basal transcriptional machinery. Plant promoters consist of a core promoter which includes the transcription start site (TSS), proximal promoter elements, and distal regions. These elements serve as recognition sites for RNA polymerase II and associated transcription factors (TFs) binding to CREs, ensuring that transcription occurs at the correct time, in the proper cellular context, and with appropriate frequency [5]. Plant cis-regulatory landscapes are characterized by a strong promoter-proximal preference. For example, a high-resolution chromatin accessibility study has shown that the bulk of open chromatin regions are located within 3 kb upstream of TSSs. Such localization indicates that in plants gene expression is regulated more by proximal regulatory elements, rather than by distal ones, as in animals [6].
Another key regulator of gene expression in eukaryotes is the 5′ untranslated region of mRNA (5′UTR). Functionally distinct from promoters, 5′UTRs are located immediately downstream of the TSS and regulate gene expression post-transcriptionally. They can harbor various cis-acting elements, such as upstream open reading frames (uORFs), enhancers, introns and may form stable secondary structures including hairpins and G-quadruplexes. Altogether 5′UTR parts affect ribosome binding efficiency, scanning and initiation of translation, impacting protein synthesis rates [7,8].
One of the primary purposes of modern biotechnology is agricultural improvement, including abiotic and biotic stress tolerance increase and nutritional enhancement. Among the wide variety of model organisms, maize (Zea mays) appears to be the most suitable for solving biotechnological problems. High-calorie and protein-rich seeds make maize an important food for humans and an essential component of livestock feed. In addition to its nutritional value, maize also has significant industrial importance, especially for biofuel production [9]. However, maize production is increasingly threatened by abiotic stresses, with annual crop failure commonly attributed to weather extremes like drought and high heat [10]. In response, biotechnological advances have been directed toward enhancing key traits, including drought tolerance, agronomic performance, and grain quality [11]. The comprehension of basic mechanisms of gene expression regulation, governed mainly by promoters and 5′UTRs, underlies the aforementioned innovations. Although promoters and 5′UTRs operate at different stages of gene expression, they both control the spatial, temporal, and quantitative aspects of protein production [12]. This coordinated regulation at multiple levels is crucial for developmental processes, cellular differentiation, and responses to environmental signals [13]. This review focuses on the features of promoters and 5′UTRs, as well as their interactions, specifically for gene expression fine-tuning in maize, a well-studied monocot model. It synthesizes current knowledge spanning sequence-level characteristics, chromatin- and enhancer-level regulation, translational control, and machine-learning-guided approaches, highlighting the potential of these regulatory elements for agricultural improvement in maize.

2. Promoters

As the primary driver of transcription, the promoter retains an essential function across species, though its specific structure and regulation are frequently taxon-specific. In monocot, promoters tend to be GC-rich, as well as their coding sequences. In contrast, promoters of eudicots are usually AT-rich [14,15]. Also, GC-content bias in promoters can be observed between actively and rarely expressing genes, as high GC-content in promoters correlates with nucleosome depletion and facilitated transcription initiation [16]. Low functional compatibility between monocot and eudicot promoters was demonstrated in a cross-species study performed by Jores et al. [17]. In this work, AT-rich promoters from Arabidopsis thaliana activated transcription in tobacco protoplasts significantly more efficiently than in maize cells, as well as GC-rich promoters from maize performed better in maize cells.

2.1. Structural Features of Promoter Organization in Maize

Generally, maize promoters are operationally defined as the 1–2 kb region upstream of the transcription start site, including various regulatory features critical for transcriptional initiation (Figure 1) [18]. In common, maize promoter regions may be positionally and functionally divided into three parts: core promoter, proximal promoter, and distal promoter. Core promoters contain the most crucial elements for gene expression, including the TSS, the Inr (initiator), the Y patch, the TATA box, the CCAAT box, and others. These elements define the transcription start site and facilitate the recruitment of RNA polymerase II and associated transcriptional machinery. The proximal promoter harbors clustered CREs. Sequence-specific TFs bind to these CREs and act as key integrators of environmental signals (e.g., light, stress), enabling transcriptional reprogramming. The distal promoter contains enhancers, silencers, and insulators, which interact with core-bound transcriptional complexes through chromatin looping. They enable fine-tuning of spatiotemporal expression patterns by responding to hormones in developmental processes or tissue-specific signals through epigenetic modifications.
Das and Bansal analyzed promoter architecture features in several plant species, including maize [19]. For example, they suggested broadly expressed genes have less compact and less stable promoters in terms of free energy than tissue-specific promoters. In addition, promoter stability may be species-specific. Specifically, maize and other studied monocot promoters are more stable than those of A. thaliana in terms of their free energy. Also, maize promoters demonstrated higher DNase I sensitivity. DNase I sensitivity reflects variable nucleosome occupancy and chromatin remodeling dynamics, which determine the accessibility of promoters to tissue-specific or other regulatory factors [19].

2.1.1. Core Promoter Region

As the name suggests, the transcription start site is the location of the first transcribed nucleotide. Based on the number and distribution of TSSs, core promoters can be divided into two distinct classes. Sharp promoters exhibit a highly focused TSS distribution with one or a few TSSs concentrated at a narrow range of nucleotide positions [20,21]. This precise architecture is typically supported by well-defined core motifs, such as the TATA box and Initiator sequences, positioned accurately relative to the TSS. Consequently, sharp promoters are often associated with tightly regulated, tissue-specific gene expression where precise initiation control is essential [22]. In contrast, broad promoters lack a single dominant TSS and instead harbor multiple closely spaced initiation sites. This structure commonly underlies the constitutive expression of housekeeping genes across diverse tissues [22]. In maize, the proportion of genes with sharp promoters ranges from 50% to 87%, contrasting with the approximately 36% observed in A. thaliana [23].
Core promoters often contain a TATA box, with a consensus sequence TATA(A/T)A(A/T). The main function of the TATA box is to bind TATA-binding protein (TBP), a subunit of TFIID. This interaction results in RNA polymerase II recruitment. The TATA box is required not only for transcription initiation but also to determine the level of gene expression in plants [24,25].
The location of core promoter elements relative to the TSS varies not only among different species, but also among the genes of the same species. Usually, it can only be described in probabilistic terms [26]. Thus, it is considered that the position of the TATA box in flowering plants is highly conserved – it is located mainly at −60 bp to −20 bp relative to TSS [14]. In maize promoters, the TATA box is located mainly at ~30 bp upstream of TSS with additional localization maxima at −55 and −70 [17]. At the same time, the distance between TATA-box and TSS negatively correlates with the strength of promoters, with maximal enhancement observed at −59 to −23 relative to the TSS [17].
Massive core promoter elements research revealed that TATA boxes are not universally present. Among 12,749 analyzed Arabidopsis thaliana (dicot) genes, merely ~29% contained a TATA box motif [27]. In monocot genomes TATA box was observed in ~19% Oryza sativa [28] and in 19–38% Zea mays promoters [17,23,26]. In addition, genes with broad promoters in maize are significantly less likely to contain a TATA box compared to sharp promoters [23]. Interestingly, maize promoters with TATA box demonstrated up to 4-fold greater transcriptional activity than TATA-less analogs [17].
Another important element of the plant core promoter is the Initiator sequence (Inr), which also initiates transcription by binding to general transcription factors and RNA polymerase II. Inr is a short DNA sequence that typically overlaps or lies very close to the TSS. Although Inr functionality is independent from the TATA box, it can also act synergistically [21]. For instance, light-mediated expression requires both Inr and the TATA box simultaneously [29]. Like the TATA box, Inr sequences are relatively widely distributed in the genes, with patterns of distribution varying among plant groups [26]. Genome-wide computational promoter elements analysis has predicted positional distribution of Inr in eudicots in the range −20 bp to +240 bp, followed by a gradual decrease in frequency with a maximum at −15 bp relative to TSS [14]. In monocots, this study described a more normal distribution of Inr frequencies in the range from approximately −170 bp to +250 bp, with a maximum also at −15 bp relative to TSS [14]. It is important to note that this broad distribution likely reflects a range of Inr-like signals present in the promoter and 5′ untranslated region rather than the true Inr motif used by the transcriptional machinery.
Along with the eukaryotic common core promoter elements, plants have unique ones. One such unique element is the Y patch (or TC motif), named for the pyrimidine abundance [30,31]. The significant proportion of TATA-less core promoters in plants assumes the presence of functional compensation by other promoter elements, such as the Initiator and Y patch motifs. Synthetic promoter screening confirmed that the Y patch motif can boost promoter strength up to 2-fold, while wild-type Y patch-containing promoters also demonstrated 10–15% greater strength compared to Y patch-less [17]. Although the most effective promoters in Jores et al. work contained the TATA box, Inr and Y patch simultaneously, the presence of the TATA box provided the strongest effect on promoter strength [17]. Also, Sanchez-Muñoz et al. demonstrated methylation accumulation for the Y patch, suggesting its potential role in methylation-dependent gene expression regulation [32].
The predicted distribution of Y patch motifs in eudicot and monocot promoters is similar, from −30 bp to +250 bp relative to TSS [14]. Approximately 50% of rice (Oryza sativa) core promoters contain one or more Y-patches, suggesting this element represents a functionally significant cis-regulatory feature [28].
In plant core promoters, the TATA box may be flanked by the TFIIB recognition element located upstream (BREu) or downstream (BREd). In eukaryotic cells, BREs modulate gene expression through interaction with TFIIB [33,34]. Although the details of BRE functioning in plants remain poorly characterized, BREu has been shown to enhance the strength of synthetic promoters in maize protoplasts by approximately 25%, whereas BREd elements reduce strength by ~10% [17]. Since this effect was not observed in tobacco leaves, a species-specific regulation through the maize TFIIB ortholog can be suggested [17].
The CCAAT box, defined by the CCAAT consensus sequence, is another common motif in the core promoter region, mainly positioned from −120 bp to −40 bp in eudicots and −460 bp to −140 bp in monocots, including maize [14]. This motif is the binding site for the nuclear factor Y (NF-Y) complex-a heterotrimeric transcription factor that participates in chromatin remodeling [35]. After binding, NF-Y can provoke both positive and negative histone modifications through specific enzymes [36,37], thus, positively or negatively affecting gene expression [35]. Despite there is a huge knowledge gap about the CCAAT motif and NF-Y functional roles in plants, research is currently being extensively conducted. For example, Xiong et al. demonstrated the abundance of CCAAT motifs in storage protein genes of rice, associating starch formation with NF-Y regulation [38].
Another common eukaryotic core promoter element is the GC box with a consensus sequence GGGCGG. In plant genes, the GC box is often found in multiple copies on both DNA strands within a broad range from −70 to +205 relative to TSS [14]. GC box hexamers are not found in plant promoters as often as in mammalian ones. Approximately 25% of maize promoters and only 7% of Arabidopsis promoters contain at least one copy of this motif [26]. In animal systems, the GC box interacts with several transcription factors, including Sp1 and other proteins of the Sp family, to enhance transcription rate significantly [39,40]. However, to the best of our knowledge, no association between Sp1 and plant promoter was found as well as Sp1 protein homologs in Arabidopsis thaliana and Oryza sativa genomes [41], suggesting the presence of alternative GC box-interacting proteins.
Yet another poorly characterized plant core promoter element is the Downstream Promoter Element (DPE). It was found and well characterized in Drosophila [42] and later described as conserved from fruit fly to humans [43]. In animals, it is often associated with TATA-less promoters and spaced precisely from Inr, which is important for TFIID binding [43]. Despite the predicted distribution of DPEs in plant promoters [14] and occasional mentions in review articles [33,44], no direct evidence of participation of these elements in transcription regulation in the plant cells was found.
Despite the cruciality of core promoter elements for gene expression, transcription could be regulated by coreless promoters. Such relatively short promoters lack traditional core elements such as TATA and Y patch. They are usually located far upstream of the TSS and associated with low-expression genes [45]. In eudicots, coreless promoters are often found in uniformly expressed genes. In contrast, monocot species tend to have a strong association between coreless promoters and conditionally expressed genes [46].

2.1.2. Proximal and Distal Promoter Regions

In contrast to well-characterized and highly conserved core promoters, proximal and distal promoter parts are more variable across different species and tissues. The proximal promoter region is located several hundred bp upstream of the TSS and harbors a diverse set of cis-regulatory elements and motifs that determine promoter strength, tissue specificity, and ability to respond to both developmental and environmental signals [16].
Typically, proximal promoters contain diverse cis-regulatory elements rather than uniform motifs. For example, CRE sequences and corresponding transcription factors involved in abiotic stress responses have been well studied [47]. Another group of CREs that interact with the MYB (myeloblastosis) transcription factor family plays a crucial role in plant development and defense responses. They regulate such important processes for viability as the cell cycle control, cell differentiation, the central circadian oscillator, secondary metabolism, and the regulation of stress signaling [48]. The W box element is a key target for WRKY transcription factors and often serves as a regulator of seed germination, pollen development, and hormonal regulation [49]. MCM1/AGAMOUS/DEFICIENS/SRF (MADS) box-interacting transcription factors play a role in the regulation of the entire plant life cycle through developmental processes control, including seed germination, vegetative growth, transition from vegetative to reproductive growth, floral development, and senescence, and additionally regulating the abiotic and biotic stress tolerance [50]. Also, some CREs are involved in hormone signaling. Key examples include the abscisic acid-responsive element (ABRE), the TGACG motif recognized by methyl jasmonate-responsive factors, the ethylene-responsive GCC box, and the salicylic acid-responsive TCA element [51].
Mapping CRE variants and their interactions with TFs under various stimuli is an active area of research in both eudicots and monocots [50,51,52]. For practical use, special databases have been established describing hundreds of diverse CREs and TFs found in flowering plants [53,54,55]. Several maize proximal promoters have also been thoroughly investigated in search for regulatory elements. For instance, the β-carotene hydroxylase 2 (ZmBCH2) promoter demonstrates the contribution of specific cis-elements, such as the P box and AACA motif, to seed-specific transcriptional activation through interactions with ZmPBF and ZmGAMYB transcription factors [56].
Finally, the most distant from the TSS promoter part, the distal promoter region is located at kilobases upstream or downstream of the core promoter. This region harbors cis-regulatory modules (CRMs), which in contrast to individual transcription factor binding sites containing CREs, represent a higher-order organization. CRMs include such functional units as enhancers, silencers, and insulators in distal promoter regions.
Enhancers are CRMs that promote transcription by recruiting specific TFs and cofactors to precisely increase the initiation rate of target genes in a tissue-, time-, or signal-specific manner [57]. Recent findings indicate that enhancers in plants act depending on their specific position relative to the TSS, contrasting with the position-independent nature seen in animals and yeast, highlighting a key difference in transcriptional regulation [58]. The molecular mechanisms by which enhancers regulate gene expression are complex and beyond the scope of this review. Several other reviews provide more detailed information on current known mechanisms [57,59,60]. Importantly, enhancers often participate in the formation of chromatin loops, establishing direct spatial interactions with core promoters to regulate transcription. Such enhancer-promoter loops have been observed in plants with moderate to large genomes, including rice, maize, and wheat [61]. Thus, genome-wide analysis identified approximately 1500 intergenic enhancer candidates and defined their tissue-specificity [62]. As revealed by Hi-C and HiChIP analyses, these loops in maize frequently exceed 20 kb in length, suggesting the crucial role of long-range CREs in maize gene regulation [63]. For example, several well-characterized maize enhancers located in kilobases from the transcription site they regulate [64,65,66].
As functional antagonists to enhancers, silencers could silence a gene directly, but also indirectly by disrupting enhancer-promoter communication and preventing transcriptional activation. Genome-wide analysis has identified silencing-associated chromatin signatures. In maize, for example, chromatin loops associated with H3K27me3-marked nucleosomes exhibited lower transcript levels than those associated with H3K4me3 [63].
The last large group of regulatory elements in distal promoters is insulators. An insulator is a DNA element that prevents the action of CRMs on a promoter. It is usually located between a target CRM and the core promoter and capable of binding specific proteins, blocking gene activation or silencing. Several studies suggest the presence of enhancer-blocking insulator sequences in plants [67,68,69]. However, the existence of true insulator elements in plants remains highly debated. Previous reviews have concluded that CRMs with definitive insulator function have not been identified, raising doubts about their presence in plant genomes [57,61]. This uncertainty is amplified by the absence of the most studied metazoan insulator protein—CCCTC-binding factor, suggesting convergent evolution of one or more functionally equivalent chromatin architectural regulators in plants [61]. Adding to this discussion, a recent study has identified over 100 different 170 bp fragments active as insulators with different enhancers and promoters across diverse plant tissues [70]. Intriguingly, these elements exhibited bifunctional activity: they behaved as silencers when paired with weak enhancers but functioned as insulators against strong enhancers. This context-dependent behavior suggests regulation that depends on the local environment rather than on canonical insulators, indicating a potentially uncharacterized form of cis-regulatory logic [70]. Thus, while the debate continues, these new insights offer a fresh perspective on this complex topic.
The remarkable morphological and functional diversity of plant tissues is the result of precise spatiotemporal gene regulation managed by the regulatory network of CREs and CRMs. Within these networks, enhancers and silencers work together to precisely balance gene expression and create boundaries between different cell types to express or not a target gene [71]. Lewis et al. suggested that complex expression patterns evolved by integrating repressor sites to silence broadly active enhancers in specific cell types [71]. Moreover, further complexity to understanding arises from the observation that a significant fraction of distal putative CRMs in maize did not interact with their immediately flanking genes, but instead bypassed them by interacting with more distal genes [72]. Therefore, multi-tissue maps of chromatin accessibility are extremely useful for deciphering this cis-regulatory logic in maize [73,74,75,76,77].

2.2. Applications of Promoter Diversity for Maize Genetic Engineering

The identification and functional validation of promoters represent an important objective in plant biotechnology. Although extensive research has characterized and exploited various promoters in model eudicot, their functional equivalence in monocot crops cannot be assumed. Substantial differences in core promoter architecture, transcription factor repertoires, and epigenetic landscapes between these major plant clades often lead to differing transcriptional behaviors. TF binding sites in proximal promoter regions can be associated with altered promoter strength in one species but not in another, reflecting species-specific regulatory network differences [17]. In practice, it means that strong eudicotyledon promoters, such as Cauliflower mosaic virus 35S promoter (CaMV 35S), may show weaker activity in monocots [78]. And vice versa: strong in monocots, maize ubiquitin-1 promoter (ZmUbi1) and the rice actin-1 promoter (OsAct1) are much less effective in eudicot tissues [78,79]. Here, we consider several different classes of promoters that are widely used in plant biotechnology. Also, we will try to identify several principles of choice specific promoters suitable for different tasks.

2.2.1. Constitutive Promoters

The most widely used promoters in the plant biotechnology are constitutive promoters, which drive gene expression continuously in most or all tissues throughout the life of the plant, independent of developmental stage or environmental signals. Constitutive promoters are often associated with genes essential for maintaining basic cellular functions. Absence of specific regulatory elements allows one not to worry that these promoters will behave differently in different tissues or will not work at all. Even a small fraction of possible constitutive promoters with different strengths, such as ubiquitins (ZmUbi1, PGNpr1, rubi3), H2B (maize H2B promoter), and Gos-2 (ZmGOS2) [80,81,82,83,84,85], allows researchers to create a precise toolkit for advanced genetic engineering of maize (Table 1).
Table 1. Functionally validated promoters with demonstrated activity in maize.
Table 1. Functionally validated promoters with demonstrated activity in maize.
ReferencesSpecificationPromoterType
[80,81,82,83,84,85]constitutiveZmUbi1, PGNpr1, rubi3, H2B, Gos2Plant constitutive
[86,87,88]constitutiveCaMV 35S, FMV, CmYLCV, CsVMVViral constitutive
[89]embryo and leavesZm-PLTPTissue-specific
[90]embryoOLE, EAP1, LTP2
[91]rootp8463, p5023, p1534, p8531, p6629
[92]endospermLWM
[93]embryoglobulin-1
[94]silk tissuesp1-R2R3-MYB
[95]pollenZmSTK2_USP
[96]wound-inducibleMPIInducible
[97]senescence-inducibleZm(PSEE1)
[89]auxin-inducibleZm-Axig1
[98]drought and salinityZmGAPP
[99]abscisic acid-inducibleABA-inducible
[100]chimericsuperpromoterSynthetic
[101]chimericA27znGlb1
[102]synthetic abscisic acid-responseZmDRO1
[103]bidirectionalBDP
A huge group of constitutive promoters that are currently in biotechnological and scientific use are promoters derived from plant viruses. Multiple caulimoviridae members, including Cauliflower mosaic virus (CaMV 35S), Figwort mosaic virus (FMV) [86], Cestrum yellow leaf curling virus (CmYLCV) [87], and Cassava vein mosaic virus (CsVMV) [88], have been characterized and drive strong heterologous expression in transgenic maize. Notably, the CmYLCV promoter has been shown to outperform both the CaMV 35S and ZmUbi1 promoters by 2-fold in maize [87].
Cereal engineering presents distinct challenges, as many synthetic biology tools were developed for model eudicot species. Currently, it relies on a limited set of constitutive promoters, with transgene expression further modulated by codon optimization, intron-mediated enhancement, and terminator selection. To address this limitation and expand the repertoire of effective regulatory elements, a wide comparison of diverse promoters was conducted to identify optimal regulatory elements for precise and robust transgene expression in cereal engineering [104]. However, empirical validation remains necessary for orthologous promoters due to their potential for variable expression fidelity [105], even though they are derived from phylogenetically related monocots. As an example, the rice OsAct1 promoter drives strong expression in maize [106], providing cross-species functionality. New work using maize protoplasts addresses the lack of standardized regulatory elements for synthetic biology in cereals. Also, in a recent article, May et al. characterized a diverse set of gene expression elements, including 17 promoters and 18 promoter-5′UTR fusions [107]. Thus, this study provided essential engineering components and critical insights into the cross-species functionality of promoters in maize [107].

2.2.2. Tissue-Specific and Inducible Promoters

Some tasks require gene expression activation in specific tissues or upon external chemical or physical signals. For example, application of tissue-specific promoters in transgenic engineering is particularly valuable when precise targeting of metabolic pathways or developmental processes is required, as it allows for enhanced trait performance without compromising plant growth. The functionality of this approach is illustrated by the maize promoter Zm-PLTP that drives gene expression specifically in embryos and leaves while showing limited expression in the ear, tassel, and roots. The expression of the morphogenic gene Bbm under Zm-PLTP control provided rapid and uniform formation of somatic embryos directly from the scutellum, leading to efficient plant regeneration without the pleiotropic effects associated with constitutive expression [89].
Seed-preferred promoters are the most extensively characterized class of tissue-specific regulatory elements, primarily used for metabolic engineering of grain nutritional profiles and seed oil biosynthesis [105]. Their application extends to recombinant protein accumulation, capitalizing on the seed endogenous biochemical environment, optimized for long-term macromolecular stability and storage. For example, embryo-specific promoters, including an oleosin (OLE) promoter, an early embryo protein (EAP1) promoter, and an aleurone-specific lipid transfer protein (LTP2) promoter, were used to achieve a high-level expression of two key TFs involved in the regulation of oil accumulation in maize. This approach substantially elevated seed oil content, demonstrating how tissue-engineered expression systems can optimize metabolic traits [90]. Five promoters driving root-predominant gene expression in maize were characterized, which could potentially lead to improved root architecture [91]. Additional maize-derived tissue-specific promoters could provide enhanced spatiotemporal precision by leveraging native regulation: LMW (low molecular weight) glutenin promoter in endosperm [92], globulin-1 promoter in embryo [93], p1-R2R3-MYB in silk tissues [94], and ZmSTK2_USP pollen-specific [95].
While tissue-specific promoters offer spatial control, restricting expression to defined anatomical regions or cell types, inducible promoters are designed to provide temporal control over gene expression by responding to specific exogenous stimuli or developmental signals. This conditional expression strategy minimizes detrimental effects associated with continuous expression. In maize, inducible promoters have been developed to respond to wound, senescence, and environmental stimuli such as drought, heat shock, and chemical inducers. Specifically, the maize proteinase inhibitor (MPI) promoter exhibits wound-inducible properties, effectively driving the expression of target genes in response to mechanical damage or pest attack. In nature, this inducibility allows the plant to locally increase the production of defensive proteins upon wounding, thereby enhancing resistance against herbivory or pathogen invasion. Studies have documented that the MPI promoter can yield significant expression of insecticidal proteins like Cry1B, albeit with a slight lag in expression relative to constitutive promoters [96]. Additionally, there are several inducible promoters with different inducing stimuli that have also been characterized in maize: senescence-Inducible promoter ZmP(SEE1) [97], auxin-Inducible promoter Zm-Axig1 [89], drought and salinity inducible promoter ZmGAPP [98], abscisic acid-inducible promoter [99].

2.2.3. Synthetic Promoters

Enhancing promoter strength is a main strategy for achieving higher transgene expression in plants. This can be pursued through multiple ways, including the discovery of novel natural promoters and the design of synthetic ones. Natural promoters offer the advantage of evolutionary conservation and a well-established track record in their native regulatory roles; however they may be limited by moderate expression strength, size, and issues related to homology-based silencing [108]. In some cases, such as the use of pathogen-derived promoters like the CaMV 35S, this can trigger unintended phenotypes or gene silencing due to sequence repetition [109]. Also, synthetic promoters can provide enhanced flexibility and the ability to fine-tune expression patterns, ranging from spatial and temporal specificity to inducibility and bidirectionality. However, their application demands sophisticated design strategies and rigorous validation to ensure predictable performance [110,111]. Moreover, synthetic promoters also have the advantage of reduced sequence length, which facilitates easier vector construction and diminishes the likelihood of homology-dependent gene silencing in complex genetic constructions [112].
Current synthetic promoter design efforts remain focused on model plants with the most studied regulatory architectures, leaving maize relatively underexplored. A significant obstacle in maize synthetic promoter development is the incomplete understanding of functional interplay among diverse CREs [18,109]. Despite these challenges, researchers have created a number of synthetic promoters in maize. For example, a chimeric ‘superpromoter’ was engineered through the fusion of three copies of the octopine synthase transcriptional activator with the mannopine synthase (mas20) activator and minimal promoter [100]. In transgenic maize systems, this hybrid construct exhibited transcriptional activity comparable to or exceeding established viral (CaMV 35S) and endogenous (ZmUbi1) constitutive promoters [100]. The fusion of regulatory sequences from the maize 27zn and Glb1 promoters yielded the synthetic chimeric promoter A27znGlb1. This engineered promoter demonstrated tissue-specific expression, directing transgene activity specifically within the embryo and endosperm of transgenic maize plants [101]. In another study, a synthetically engineered ABA-responsive promoter directed ZmDRO1 expression in maize, conferring steeper root growth angles and enhanced drought tolerance [102].
Careful modification engineering of natural promoters enables the creation of synthetic regulatory variants with fine-tuned functionality. Illustrating this approach, researchers used CRISPR-Cas9 to generate weak allelic variants of key maize CLE gene promoters. These edits achieved quantitative reduction in target gene expression without perturbing overall plant architecture, ultimately conferring enlarged ears, increased kernel rows, and higher grain yield per ear [113].
Many plant traits are polygenic, making phenotype modification more complicated. To mitigate gene silencing in multigene cassettes and reduce the size of transferred DNA, bidirectional promoters and gene stacking strategies can be employed [114]. A synthetic cassette incorporating a bidirectional promoter derived from the ZmUbi1 promoter, combined with a bicistronic gene arrangement, enabled the coordinated expression of four genes with higher efficiency compared to the expression of a single gene driven by the ZmUbi1 promoter alone in maize [103]. Additionally, for metabolic engineering of anthocyanin-rich maize, researchers have successfully utilized a seed-specific bidirectional promoter [115].
However, in case of difficulties in synthetic bidirectional promoter application including legislative restrictions, there are several natural promoters of this kind. For example, the intergenic region between maize Def1 and Def2 defensin-like genes is a naturally evolved bidirectional promoter. Functional analyses in immature embryos revealed that this compact (~635 bp) element drives asymmetric co-expression of both genes, proving that even minimal promoter sequences can orchestrate sophisticated spatiotemporal and directional control programs [116].
Finally, one could combine synthetic promoters with synthetic transcription factors to control gene expression more precisely. For instance, synthetic zinc finger TFs have demonstrated functional activity in maize protoplasts [117].
Despite these advantages, synthetic regulatory elements have several limitations. The design process for these elements often involves alternating extensive bioinformatics analysis and experimental validation to achieve the desired functional characteristics [112]. Incorrect combination of cis-elements or positioning can lead to unwanted transcription factor competition or steric hindrance, resulting in unpredictable expression levels. Moreover, synthetic promoters may not fully reproduce the nuances of adaptive responses of natural regulatory networks, especially under changing physiological conditions [112].

3. 5′ Untranslated Regions

The term “promoter” is commonly used to describe the whole regulatory region upstream of the start codon, including the promoter itself and the 5′ untranslated region (5′UTR). Whilst the promoter orchestrates transcription initiation and affects the number of transcripts, the 5′UTR plays a crucial role in determination of translation efficiency. As an example, several synthetic “promoters” used in maize include the 5′UTR intron of genes encoding Ubiquitin-1/2 or Adh1 for intron-mediated enhancement. In contrast, use of a poorly optimized 5′UTR could lead to lower protein expression due to inefficient translation [118,119]. Thus, to achieve the desired protein production level, one should improve both transcript quantity and quality [120]. Therefore, the development of plant synthetic promoters or optimization of existing ones is typically accompanied by the selection and refining of an appropriate 5′UTR sequence for translation.

3.1. The Functional Roles of the 5′UTR

The 5′UTR of an mRNA is a non-coding leader sequence between the transcription start site and the translation start site (start codon). The 5′UTR, together with the 3′UTR, is recognized as a key regulatory hub for post-transcriptional control [121]. Though not translated, it profoundly influences translation initiation and mRNA stability. Key functional features within the 5′UTR include the 5′ cap structure that recruits initiation factor eIF4E for translation initiation, regulatory elements modulating ribosomal scanning (e.g., stem-loops or upstream ORFs), embedded cis-acting elements such as translation enhancers and intronic sequences, and the start codon context sequence (Figure 2).
The composition and arrangement of 5′UTR regulatory elements and mRNA secondary structures regulate the susceptibility of mRNA to ribonuclease-mediated degradation, determining mRNA half-life and availability for translation [5,122]. By modulating both the initiation of translation and the stability of the mRNA, 5′UTRs provide a fine-tuning mechanism that allows cells to rapidly adjust protein output in response to changing physiological conditions without alterations in transcription rates [123].
In monocots, the functional significance of the 5′UTR extends to the regulation of gene expression under stress conditions such as drought, heat shock, and nutrient deficiency. For example, the splicing efficiency of 5′UTR introns in maize is sensitive to heat shock, resulting in increased expression of ubiquitin proteins [124]. Another functional significance of monocot 5′UTRs in stress-responsive gene expression is highlighted by the maize alcohol dehydrogenase (adh1) 5′UTR, which drives efficient translation under hypoxia and heat shock conditions. Incorporation of this 5′UTR into expression cassettes provided efficient target protein production in plants under stress conditions [125].
The length and content of 5′UTRs in plants correlate with the functional role of the gene [7,126]. Functional enrichment analysis in Arabidopsis and rice revealed functional differences between 5′UTRs of different lengths [127]. Similar analysis in maize suggested that genes related to developmental processes, response to abiotic stimuli, organic substance transport, anatomical structure development, and mRNA transport possess short 5′UTRs (1–500 bp). Medium-length 5′UTRs (501–1000 bp) are associated with genes involved in phosphorus metabolic processes, protein phosphorylation, cellular protein modification processes, cell communication, and the regulation of cellular processes. Genes associated with the regulation of iron ion transport were enriched across both the long (1001–2000 bp) and very long (>2000 bp) 5′UTR length categories [128].
In maize, 5′ and 3′UTR sequences make up approximately 24.7% of the total transcripts length. The average maize 5′UTR length is 430 bp, longer than that observed in rice (259 bp) and Arabidopsis (155 bp) [128]. Sometimes the number of cis-acting elements in 5′UTR is positively correlated with the expression level of the corresponding gene. This was revealed, for example, for the Zea mays LAZ1 gene family. Furthermore, 5′ UTRs of the ZmLAZ1 gene family not only modulate gene expression but also impart functional specificity in a length-dependent manner [128]. These results suggest the important role of 5′UTR architecture in gene regulation and functional diversification of gene expression in maize.

3.2. 5′ Capping

The 5′ cap is a characteristic structure of eukaryotic mRNAs. In maize, as in other eukaryotes, the canonical cap consists of a 7-methylated guanosine (m7G) linked by a 5′,5′-triphosphate chain to the first transcribed nucleotide at the very 5′ end of nascent transcripts, thereby protecting the mRNA from exonucleolytic decay [129,130]. There are several types of caps: Cap-0 (m7Gppp), Cap-1 (m7GpppN1m), and Cap-2 (m7GpppN1mN2m), where Nm is a 2′-O-methylated nucleotide and ppp indicates a triphosphate bridge. Mammalian mRNA typically possesses the Cap-1 and Cap-2 structures, whereas mRNAs with the Cap-0 are recognized as extrinsic RNA, triggering the immune response. However, elevated level of Cap-1-containing transcripts could still induce an innate immune response, albeit weaker [131,132]. Recent research has shown that the mRNA of arabidopsis and maize do not contain the Cap-1 or Cap-2 forms. This suggests a different mechanism and function of mRNA capping in gene regulation in plants [133].
The 5′ cap mediates cap-dependent translation by recruiting the eukaryotic initiation factor complex eIF4F through 5′ cap-eIF4E interaction [134]. This interaction is central to the assembly of the 43S preinitiation complex that scans the mRNA for the start codon. Specifically, in maize, selective cap-binding by isoforms such as eIF4E and eIF(iso)4E is critical during seed germination. Dynamic changes in the composition of cap-binding complexes modulate the selective translation of stored mRNAs and de novo transcripts [135,136].
Under stress conditions, such as heat shock, viral infection, or apoptosis, the efficiency of cap-dependent translation can be drastically reduced [137]. In response to stress, plants may use an alternative mechanism of translation initiation using internal ribosome entry site (IRES) mediated mRNA translation to maintain the synthesis of essential proteins. IRES is a sequence of mRNA with a set of cis-regulatory elements that can directly recruit the ribosomal subunits to the mRNA or interact with IRES trans-acting factors (ITAFs) for ribosome recruitment in eukaryotic cells [138]. While the mechanisms of IRES-dependent initiation have been well-characterized in animals, they are still not well understood in plants, with only a few examples available. For instance, A. thaliana mRNA translation of the WUSCHEL transcription factor, which plays a central role in maintaining and regulating stem cell populations in shoot and floral meristems, is IRES-mediated and is enhanced by environmental stress [139]. Scarce IRES data are available in maize, with one well-characterized example being the 5′UTR of hsp101 mRNA, which contains an IRES that facilitates internal ribosome recruitment and translation initiation even when cap-dependent pathways are blocked. This guarantees the production of a critical cell thermotolerance chaperone protein [129].
In addition to the common m7G 5′ caps, several plant studies have reported a few more cap modifications. Non-canonical caps such as NAD+ have been suggested as important regulators of RNA fate through their influence on mRNA stability and translation. For instance, the Arabidopsis dxo1 mutant, which lacks the NAD+ decapping enzyme DXO1, exhibits pleiotropic growth defects throughout its life cycle. This includes a dwarfing, light green coloration indicating total chlorophyll deficiency, and decreased fertility, giving smaller seed sets [140]. Advanced techniques, including NAD captureSeq, NAD-seq, and NAD tagSeq, have been successfully applied to map these modifications across plant transcriptomes. However, data on NAD+ caps in maize remain absent, pointing to a potentially significant and unexplored regulatory layer.

3.3. uORFs

Many plant 5′UTRs contain upstream open reading frames (uORFs) – start codons upstream of the main ORF. Genome-wide analysis shows that uORFs are widespread in a vast majority of eukaryotic genomes and probably have regulatory functions [141]. uORFs modulate translation by reducing ribosome reinitiation efficiency at the main ORF through ribosomal dissociation or stalling at the uORF and thereby preventing ribosomes from accessing the main open reading frame [142]. Recent research in A. thaliana has demonstrated that uORF length directly dictates translation reinitiation efficiency, with longer uORFs resulting in dramatically lower rates: ~95–100% translation reinitiation efficiency for a 3 aa uORF, declining to ~61% for 25 aa, ~25% for 40 aa, ~4% for 50 aa, and merely ~2% for a 70 aa uORF [143]. A small subset of uORFs, termed Conserved Peptide uORFs (CPuORFs), encode evolutionarily conserved peptides that are important for translational repression. For instance, a CPuORF within the bZIP11 gene in Arabidopsis can prevent the ribosome from producing the main protein when sucrose levels are high, thereby adjusting gene expression accordingly [144]. There is a wealth of evidence from mutagenesis and ribosome profiling that CPuORFs stall the ribosome in a sequence-dependent manner [145]. Notably, many of these CPuORFs are conserved between monocots and eudicots [146].
The distribution of uORFs across genes is functionally biased. Highly expressed mRNAs, like those encoding housekeeping genes, typically possess short 5′UTRs lacking uORFs. Conversely, poorly expressed mRNAs, such as those for TFs and kinases, often feature longer 5′UTRs enriched with uORFs [146]. About 35% of A. thaliana genes are transcribed into uORF-containing mRNAs, and about half of these have multiple uORFs [147].
In maize, uORFs have been identified in genes encoding transcription factors, such as Opaque-2 and other regulatory proteins, where overlapping uORFs exhibit redundant repressive functions [148]. These uORFs collectively restrict translation initiation of the main coding sequence unless specific cellular conditions, such as drought, are met. Additionally, in maize under drought stress, translation of 3063 uORFs associated with 2558 genes was detected, revealing a pervasive layer of translational control. This uORF-driven translational reprogramming functions synergistically with transcriptional regulation in the maize drought response [149]. Additionally, nucleotide variations near uORF start codons modulate translation efficiency by affecting the initiation process. This leads to inheritable allelic differences in protein levels that are associated with phenotypic traits and could serve as an adaptive function [150].
Mutagenesis of the uORFs’ start codons of the native 5′UTR can prevent their translation initiation. This strategy has been demonstrated to enhance translation efficiency of the primary open reading frame. For example, targeted disruption of an inhibitory uORF within the 5′UTR of the maize Lc transcription factor gene resulted in increased translation of the main ORF, correlating with elevated anthocyanin production [151]. Therefore, while designing genetic constructions, removing uORFs or avoiding introducing them inadvertently may be important for maximizing expression.

3.4. RNA Secondary Structure

mRNA molecules can form various secondary structures determined by intramolecular base pairing, yielding stable structures such as hairpins, stem-loops, pseudoknots, and G-quadruplexes. These structures are known to affect the accessibility to functionally significant mRNA regions, including the 5′ and 3′ untranslated regions as well as specific internal sequences in the coding region [152,153]. The tendency to form secondary structures can be estimated by mRNA free energy calculation, which is calculated from nucleotide sequence, taking into account the influence of the environment. Thus, changes in the cell environment (e.g., temperature or osmolarity) could affect mRNA secondary structures composition, making them an embedded transmitter of physical signals to the translation regulatory machinery. In particular, it has been suggested that plants have adopted the RNA G-quadruplex structure as an adaptation to cold conditions during evolution [154]. Temperature-sensitive RNA structural elements, known as RNA thermometers, also represent a regulatory mechanism in plants, where they modulate translational activity in response to thermal changes [155]. The secondary structure within the 5′UTR can affect the initiation of translation at AUG codons, especially in the case of weak start codon context [156,157].
Several studies demonstrated a role of the 5′UTR in translation repression in maize due to inefficient reinitiation of translation caused by upstream open reading frames and mRNA secondary structures [158,159,160]. Another study focused on maize 5′UTRs of hypoxia-induced genes distinguished two groups of untranslated regions based on the presence of stable secondary structures. The first group included relatively long 5′UTRs, which in common form stable secondary structures that may positionally coordinate translation initiation sites. In contrast, short 5′UTRs generally lack stable folding, suggesting distinct regulatory mechanisms of translational control [161].
Direct experimental validation of stable mRNA structures the impact on gene expression in maize remains limited. However, functional similarities across monocots allow one to project data obtained on other plants. In particular, this type of 5′UTR feature is relatively well investigated in Oryza sativa, making rice a model organism in the area of mRNA structure-function relationships in monocots. Thus, a study of the 5′UTR of the rice OsMac1 gene identified several stem-loop structures, whose deletion sharply reduced translation efficiency [162]. Although this study focused primarily on translation, the necessity of a defined secondary structure for efficient protein synthesis implies that these features could also contribute to mRNA stability. Additionally, in vivo structural study showed total transcript unfolding upon heat shock, promoting transcript degradation [163].
High GC content in the 5′UTR can affect translation initiation efficiency by forming stable structures. This obstructs mRNA scanning by the 43S preinitiation complex, thereby modulating start codon accessibility and hindering completion of initiation [150]. The impact of GC content on the structure and functioning of RNA is also well-studied in rice. A study on translation in rice shows that high GC content of transcripts is correlated with high translation efficiency [164]. However, the in vivo study in rice has revealed that GC content has little impact on RNA structure formation in coding sequences, but strongly affects the RNA structure in untranslated regions, especially in 5′UTR [165]. A similar effect was revealed for wheat, where mRNA structure has a strong impact on translation, independent of GC content [166]. Additionally, mRNAs with regulatory functions tend to fold into more thermostable structures in rice [165].

3.5. Introns

5′ untranslated regions can also harbor intronic sequences. A 5′UTR intron, located within the segment of the primary transcript (pre-mRNA), is classified as an intron because it is transcribed as part of the pre-mRNA and can be subsequently excised by the spliceosome, similar to introns within the coding sequence (CDS) [5]. 5′UTR introns can exert significant regulatory effects, influencing mRNA nuclear export, translational efficiency, and mRNA stability [167,168,169]. The positive impact of introns on gene expression is termed intron-mediated enhancement (IME). For IME, introns usually should be located near the transcription start site. So they facilitate enhanced pre-mRNA processing, including efficient splicing and proper nuclear export, ultimately leading to higher levels of mature transcripts available for translation [170]. However, despite thorough research, the exact mechanism of IME remains unclear and has been explained by several distinct hypotheses. A particularly well-analyzed one proposes that introns stimulate transcript initiation by creating a favorable local chromatin structure [171]. Additionally, 5′UTR introns can possibly have an impact on transcription efficiency. An analysis of the rice Rubi3 5′UTR intron revealed its capacity to enhance both transcription (2-fold in nuclear run-on experiment) and mRNA accumulation (20-fold), supporting dual regulatory mechanisms while indicating post-transcriptional enhancement as primary [172].
In monocots, some researchers suggest intron-mediated enhancement often depends on splicing with a few exceptions [170]. 5′UTR introns have been used for protein production increase for quite a long time. The rice 5′UTR intron from the salT gene was used to enhance cat reporter gene expression in maize suspension cells without altering mRNA stability [173]. The first intron of the maize Adh1 gene, positioned in its leader region, has been shown to amplify gene expression up to 100-fold compared to intronless constructs. This dramatic effect is attributed to the intron’s capacity to increase the accumulation and perhaps the quality of mRNA transcripts [170,174]. Similarly, studies on the maize GapA1 gene highlighted the necessity of its native leader intron for any detectable transient expression in maize cells [175]. Additional examples include the leader intron of the maize Shrunken-1 gene, which enhances reporter gene activity [176,177].
Some introns possess intrinsic promoter activity, enabling intron-mediated transcription initiation and expressing genes independently of conventional promoter sequences. This functional autonomy was demonstrated in monocots through the maize ubi1 intron, which actively directed GUS reporter expression in tritordeum inflorescences despite the absence of an exogenous promoter [178]. Interestingly, this effect seems to be cross-species working. Thus, leader introns from monocots (e.g., rice actin1 intron) have been incorporated into maize transgenes and have exhibited significant enhancement of expression, underscoring the conserved nature of this regulatory mechanism in monocots [105].

3.6. Translation Enhancers

Located within UTRs, translation enhancers are sequence-specific cis-acting elements that upregulate gene expression by increasing translational efficiency. The broad ambiguity of the term “translation enhancer” makes it difficult to study them in plants, as there are various mechanisms of translation enhancement. These mechanisms include recruiting specific translation initiation factors, facilitating cap-independent translation, altering RNA secondary structure to enhance ribosomal scanning and access, or mimicking tRNA-like structures. Therefore, a thorough understanding of the mechanisms behind established “translation enhancers”, both from plant and viral origins, is essential for their effective use.
In eudicots, several viral 5′UTRs, like the Tobacco mosaic virus Ω leader, are known to dramatically boost protein yield when placed before a coding sequence [179,180]. A similar translation enhancement mechanism via the 5′UTR engineering with viral translation enhancers proves equally viable in monocots. Thus, the 73-nucleotide 5′UTR derived from the CaMV 35S promoter has been used in maize to boost luciferase activity by 35-fold [181]. Also, researchers can employ tandem viral enhancers (e.g., 2× or 4× CaMV 35S configurations) to amplify gene expression. These multiplexed enhancers remain functional in monocots, demonstrating significantly enhanced expression in rice and maize while conserving spatial expression profiles equivalent to the full CaMV 35S promoter in eudicots [182,183]. While the core promoter derived from CaMV 35S may be less optimal in monocots, the tandem enhancer repeats can still drive significant transcriptional activation, highlighting a critical modular difference between enhancers and core promoters. Another example of tandem enhancement is shown by the active ribosomal RNA complementary (ARC) element. A hexameric tandem repeat (6 × ARC) outperforms translation enhancers such as the Ω sequence, OsADH, and even 3 × ARC [184]. The region of 18S rRNA corresponding to ARC is conserved among rice, tobacco, maize, wheat, and human, suggesting its potential broad utility [185].
5′UTR enhancers are usually not cross-compatible through plant clades. So, 5′UTR sequences that enhance mRNA translation in eudicots may be less effective or completely inactive in monocots [136]. Matsui et al. evaluated in various plant species translational enhancers derived from plants and viruses, including the 5′UTRs from N. tabacum, A. thaliana, and O. sativa alcohol dehydrogenase genes (NtADH, AtADH, and OsADH, respectively), as well as the Ω sequence from Tobacco mosaic virus and the Tobacco etch virus leader (TEVL) [186]. They found that the OsADH 5′UTR functioned across all tested plant species, while the Ω and TEVL sequences were effective only in eudicots. Polysome profiling in rice led to the identification of several native 5′UTRs that act as strong translational enhancers, which can be used to improve transgene expression in monocots [136].
Cap-independent translation enhancers (CITEs) are another group of cis-acting regulatory elements that are known to enhance cap-independent translation initiation. We already mentioned IRES elements, which also provide cap-independent translation initiation. Unlike IRESs, which are exclusively 5′UTR-located, CITEs can be located both within the 5′UTR or 3′UTR. For 5′CITEs, the recognition of the 5′ cap by eIF4E still plays a major role in the mRNA recruitment. However, mRNA recruitment is still possible in the absence of this interaction [187]. The main difference between IRES and CITE is their mechanism of translation initiation. IRESs enable a form of cap-independent translation that employs direct internal ribosome recruitment, effectively bypassing the 5′ end and the need for ribosome scanning. In contrast, CITEs promote translation indirectly by recruiting translation initiation factors and facilitating mRNA circularization, thereby enhancing a scanning-dependent mechanism that still relies on recognition of the mRNA’s 5′ end [187]. The widespread distribution of CITE elements among plant viruses indicates their functional importance [188]. Although reports of cellular CITE elements in maize are currently lacking, a maize-specific virus harbors a 3′CITE element, demonstrating that this mechanism is functional in maize [189].

3.7. Start Codon Context Sequence

The start codon context sequence is a well-established cis-acting element that surrounds the translation initiation site. In eukaryotes, it is known as the Kozak sequence, which plays a pivotal role in ribosome recognition and regulation of translation initiation efficiency [150,190]. Although the Kozak sequence has certain patterns, common for all higher plants, eudicots and monocots have different preferences for nucleotide sequences surrounding the start codon [191]. This divergence likely results from differences in how translation initiation factors recognize and bind the mRNA template in these two major plant groups.
One of the common Kozak features is the preference for adenine residues from positions −21 to −1 relative to the start codon, with a peak of enhancement observed within the critical −5 to −1 window [157]. Particularly important for high translational efficiency is the presence of adenines at positions from −4 to −1 [191,192]. This preference was experimentally confirmed by the introduction of an optimal adenine-rich sequence to the start codon context, resulting in enhanced translational efficiency in both Arabidopsis and wheat cell extracts. In contrast to the positive role of adenine residues, thymine residues in this region consistently reduce translation efficiency [157]. Thorough research of all possible start codon context combinations in eudicot Arabidopsis revealed the best trinucleotides at position −3 in terms of codon context. So, the presence of AAG or GAG resulted in approximately 4-fold higher translation efficiency compared to the worst trinucleotide UGC [193]. In a similar study on monocot rice, the presence of the AGC sequence resulted in a 6-fold increase in translation compared to low-efficiency sequences such as UAU, UGA, or CGA. Also, ACC and UCC sequences demonstrated high translation efficiency [193].
Not only upstream of start codon nucleotides are important for translation efficiency determination, but the whole consensus sequence [194,195]. So, genomic analysis of plant coding sequences revealed the conserved guanine and cytosine residues at positions +4 and +5 relative to the start codon, respectively [196]. Thus, sequences (GC)/(AA)NAUGGC were identified as the consensus sequence for the translation initiation site signal in flowering plants [196].
The recent integration of deep learning and machine learning methods has provided a powerful framework for predictive modeling of translation initiation sites in maize [197]. These computational pipelines analyze diverse cis-regulatory features, including the Kozak sequence, and have aided in the rational design of improved translation initiation contexts. By combining these models with high-throughput translation assays, it becomes possible to predict and validate optimal sequence variants that may further enhance protein output in maize.
Taken together, the 5′UTR contains multiple regulatory elements that function in an interconnected manner. Interactions among these elements generate a multilayered regulatory framework that enables fine-tuned, context-dependent, and dynamic control of plant translational regulation, as summarized in Table 2.

4. Conclusions

The projected climate change is having an impact on crops as well. Plants have intrinsic mechanisms of gene expression adaptation, but the rate and severity of climate change may be too rapid and high for these adaptations to cope with. Thus, along with agricultural needs, adaptation to climate change requires the development of various transgenic plants. This requires a balance between robust trait expression and the preservation of wild-type agronomic performance. This requires precise regulation of recombinant protein production to prevent negative adaptation consequences and metabolic burden [107]. In such conditions, promoters remain the key point for genetic engineering. Therefore, a deeper understanding of monocot-specific promoter features, such as the Y patch motif, and the prevalence of multiple TATA boxes, as well as enhancer organization, will enable the rational design of tailored promoters for advanced synthetic biology applications in crop species.
Despite significant advances, transgenic research and applications in maize remain heavily dependent on a narrow group of promoters characterized over two decades ago [18]. Practical guidance for selecting promoters to maximize the predictability of expression levels in engineered maize lines is often elusive and can be oversimplified into general guidelines. Preference is typically given to promoters derived from the same taxonomic subgroup, provided they meet the researcher’s objectives, such as desired expression level, promoter length, ease of cloning, and other relevant factors. Although viral promoters can achieve high expression levels, their use in multiple-copy gene cassettes may lead to silencing, potentially compromising the intended expression. In cases where pleiotropic effects may occur, such as when expressing morphogenic factors to enhance regeneration, tissue-specific or inducible promoters are generally the preferred choice.
The development of synthetic promoters is a widely used approach for expanding the available promoter repertoire, as they allow for tunable gene expression and support the development of novel traits in plants [109,111,198,199]. Additionally, cis-regulatory engineering contributes to the design of synthetic promoters [200]. These efforts are increasingly powered by machine learning. For instance, the maize transcriptional regulatory network for 104 transcription factors was reconstructed using machine learning, revealing a scale-free, modular, and conserved architecture [201]. Furthermore, the integration of machine learning with massively parallel reporter assays is becoming increasingly common for in silico plant promoter prediction and the design of synthetic promoters with tunable strength [17,202,203,204]. Nevertheless, synthetic promoters require thorough in vivo testing to ensure functional consistency in the intended biological context.
5′UTR optimization is another important factor equivalent to promoter engineering for achieving the desired gene expression level in maize. 5′UTR modifications can achieve precise expression control unattainable through strong constitutive promoters alone [180]. Synthetic biologists can enhance the robustness, predictability, and yield of genetic circuits in maize by systematically incorporating key 5′UTR features, including an optimized start codon context, ribosomal scanning barriers (stem-loops/uORFs), and embedded cis-regulatory modules.
In addition to optimizing 5′UTRs, the design of new ones is actively developing, including computational design approaches [205,206]. The PlantRNA foundation model was designed to identify functional RNA motifs in plants, including both RNA sequences and structural motifs. It was successfully used to determine critical regions inside the 5′UTRs that significantly affect translation [207]. The application of such computational design algorithms to maize holds significant potential for generating customized 5′UTR sequences that maximize expression of target open reading frames. However, currently, empirically validated 5′UTRs remain the most reliable option for predictable transgene expression in maize. As an example, the intron-containing 5′UTR of ZmUbi1 promoter delivers robust constitutive expression, making it a benchmark for high-output transgenesis. In contrast, the relatively short 5′UTR from the CaMV 35S promoter is frequently utilized when a moderate expression level is required.
Finally, different combinations of 5′UTRs and promoters enable fine-tuning of expression levels, with up to a 10-fold expression range observed in maize cells when using the same promoter with various 5′UTRs [107]. Crucially, since expression level depends on the final construct design architecture, including promoter-5′UTR combination, each combination should be evaluated separately to verify its efficacy in a specific regulatory context. The necessity for such validation not only highlights the complex nature of transcriptional and translational regulation but also exposes fundamental gaps in our understanding of their precise regulatory interplay.

Author Contributions

Conceptualization, N.V.S.; investigation, N.V.S.; writing—original draft preparation, N.V.S.; writing—review and editing, M.M.P., V.V.C., A.S.M. and S.N.; supervision, M.M.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the grant of the Ministry of Science and Higher Education of the Russian Federation № 075-15-2024-653 “Systematic characterization of promoters and terminators for gene expression in monocots”.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Erdmann, R.M.; Picard, C.L. RNA-Directed DNA Methylation. PLoS Genet. 2020, 16, e1009034. [Google Scholar] [CrossRef] [PubMed]
  2. Chekanova, J.A. Long Non-Coding RNAs and Their Functions in Plants. Curr. Opin. Plant Biol. 2015, 27, 207–216. [Google Scholar] [CrossRef] [PubMed]
  3. Rai, M.I.; Alam, M.; Lightfoot, D.A.; Gurha, P.; Afzal, A.J. Classification and Experimental Identification of Plant Long Non-Coding RNAs. Genomics 2019, 111, 997–1005. [Google Scholar] [CrossRef] [PubMed]
  4. Prall, W.; Sharma, B.; Gregory, B.D. Transcription Is Just the Beginning of Gene Expression Regulation: The Functional Significance of RNA-Binding Proteins to Post-Transcriptional Processes in Plants. Plant Cell Physiol. 2019, 60, 1939–1952. [Google Scholar] [CrossRef]
  5. Barrett, L.W.; Fletcher, S.; Wilton, S.D. Regulation of Eukaryotic Gene Expression by the Untranslated Gene Regions and Other Non-Coding Elements. Cell. Mol. Life Sci. 2012, 69, 3613–3634. [Google Scholar] [CrossRef]
  6. Maher, K.A.; Bajic, M.; Kajala, K.; Reynoso, M.; Pauluzzi, G.; West, D.A.; Zumstein, K.; Woodhouse, M.; Bubb, K.; Dorrity, M.W.; et al. Profiling of Accessible Chromatin Regions across Multiple Plant Species and Cell Types Reveals Common Gene Regulatory Principles and New Control Modules. Plant Cell 2018, 30, 15–36. [Google Scholar] [CrossRef]
  7. Araujo, P.R.; Yoon, K.; Ko, D.; Smith, A.D.; Qiao, M.; Suresh, U.; Burns, S.C.; Penalva, L.O.F. Before It Gets Started: Regulating Translation at the 5′ UTR. Int. J. Genom. 2012, 2012, 475731. [Google Scholar] [CrossRef]
  8. Chen, F.; Cocaign-Bousquet, M.; Girbal, L.; Nouaille, S. 5’UTR Sequences Influence Protein Levels in Escherichia Coli by Regulating Translation Initiation and mRNA Stability. Front. Microbiol. 2022, 13, 1088941. [Google Scholar] [CrossRef]
  9. Ambaye, T.G.; Vaccari, M.; Bonilla-Petriciolet, A.; Prasad, S.; van Hullebusch, E.D.; Rtimi, S. Emerging Technologies for Biofuel Production: A Critical Review on Recent Progress, Challenges and Perspectives. J. Environ. Manag. 2021, 290, 112627. [Google Scholar] [CrossRef]
  10. Fahad, S.; Bajwa, A.A.; Nazir, U.; Anjum, S.A.; Farooq, A.; Zohaib, A.; Sadia, S.; Nasim, W.; Adkins, S.; Saud, S.; et al. Crop Production under Drought and Heat Stress: Plant Responses and Management Options. Front. Plant Sci. 2017, 8, 1147. [Google Scholar] [CrossRef]
  11. Liu, J.; Fernie, A.R.; Yan, J. The Past, Present, and Future of Maize Improvement: Domestication, Genomics, and Functional Genomic Routes toward Crop Enhancement. Plant Commun. 2020, 1, 100010. [Google Scholar] [CrossRef] [PubMed]
  12. Hernandez-Garcia, C.M.; Finer, J.J. Identification and Validation of Promoters and Cis-Acting Regulatory Elements. Plant Sci. 2014, 217–218, 109–119. [Google Scholar] [CrossRef] [PubMed]
  13. Dao, L.T.M.; Spicuglia, S. Transcriptional Regulation by Promoters with Enhancer Function. Transcription 2018, 9, 307–314. [Google Scholar] [CrossRef] [PubMed]
  14. Kumari, S.; Ware, D. Genome-Wide Computational Prediction and Analysis of Core Promoter Elements across Plant Monocots and Dicots. PLoS ONE 2013, 8, e79011. [Google Scholar] [CrossRef]
  15. Singh, R.; Ming, R.; Yu, Q. Comparative Analysis of GC Content Variations in Plant Genomes. Trop. Plant Biol. 2016, 9, 136–149. [Google Scholar] [CrossRef]
  16. Brooks, E.G.; Elorriaga, E.; Liu, Y.; Duduit, J.R.; Yuan, G.; Tsai, C.-J.; Tuskan, G.A.; Ranney, T.G.; Yang, X.; Liu, W. Plant Promoters and Terminators for High-Precision Bioengineering. BioDesign Res. 2023, 5, 13. [Google Scholar] [CrossRef]
  17. Jores, T.; Tonnies, J.; Wrightsman, T.; Buckler, E.S.; Cuperus, J.T.; Fields, S.; Queitsch, C. Synthetic Promoter Designs Enabled by a Comprehensive Analysis of Plant Core Promoters. Nat. Plants 2021, 7, 842–855. [Google Scholar] [CrossRef]
  18. Nuccio, M.L. A Brief History of Promoter Development for Use in Transgenic Maize Applications. Methods Mol. Biol. 2018, 1676, 61–93. [Google Scholar] [CrossRef]
  19. Das, S.; Bansal, M. Variation of Gene Expression in Plants Is Influenced by Gene Architecture and Structural Properties of Promoters. PLoS ONE 2019, 14, e0212678. [Google Scholar] [CrossRef]
  20. Mora, A.; Sandve, G.K.; Gabrielsen, O.S.; Eskeland, R. In the Loop: Promoter-Enhancer Interactions and Bioinformatics. Brief. Bioinform. 2016, 17, 980–995. [Google Scholar] [CrossRef]
  21. Valen, E.; Sandelin, A. Genomic and Chromatin Signals Underlying Transcription Start-Site Selection. Trends Genet. 2011, 27, 475–485. [Google Scholar] [CrossRef] [PubMed]
  22. Haberle, V.; Stark, A. Eukaryotic Core Promoters and the Functional Basis of Transcription Initiation. Nat. Rev. Mol. Cell Biol. 2018, 19, 621–637. [Google Scholar] [CrossRef] [PubMed]
  23. Mejía-Guerra, M.K.; Li, W.; Galeano, N.F.; Vidal, M.; Gray, J.; Doseff, A.I.; Grotewold, E. Core Promoter Plasticity Between Maize Tissues and Genotypes Contrasts with Predominance of Sharp Transcription Initiation Sites. Plant Cell 2015, 27, 3309–3320. [Google Scholar] [CrossRef] [PubMed]
  24. Loganantharaj, R. Discriminating TATA Box from Putative TATA Boxes in Plant Genome. Int. J. Bioinform. Res. Appl. 2006, 2, 36–51. [Google Scholar] [CrossRef]
  25. Savinkova, L.K.; Sharypova, E.B.; Kolchanov, N.A. On the Role of TATA Boxes and TATA-Binding Protein in Arabidopsis Thaliana. Plants 2023, 12, 1000. [Google Scholar] [CrossRef]
  26. Brázda, V.; Bartas, M.; Bowater, R.P. Evolution of Diverse Strategies for Promoter Regulation. Trends Genet. 2021, 37, 730–744. [Google Scholar] [CrossRef]
  27. Molina, C.; Grotewold, E. Genome Wide Analysis of Arabidopsis Core Promoters. BMC Genom. 2005, 6, 25. [Google Scholar] [CrossRef]
  28. Civán, P.; Svec, M. Genome-Wide Analysis of Rice (Oryza sativa L. Subsp. Japonica) TATA Box and Y Patch Promoter Elements. Genome 2009, 52, 294–297. [Google Scholar] [CrossRef]
  29. Srivastava, R.; Rai, K.M.; Srivastava, M.; Kumar, V.; Pandey, B.; Singh, S.P.; Bag, S.K.; Singh, B.D.; Tuli, R.; Sawant, S.V. Distinct Role of Core Promoter Architecture in Regulation of Light-Mediated Responses in Plant Genes. Mol. Plant 2014, 7, 626–641. [Google Scholar] [CrossRef]
  30. Bernard, V.; Brunaud, V.; Lecharny, A. TC-Motifs at the TATA-Box Expected Position in Plant Genes: A Novel Class of Motifs Involved in the Transcription Regulation. BMC Genom. 2010, 11, 166. [Google Scholar] [CrossRef]
  31. Yamamoto, Y.Y.; Ichida, H.; Matsui, M.; Obokata, J.; Sakurai, T.; Satou, M.; Seki, M.; Shinozaki, K.; Abe, T. Identification of Plant Promoter Constituents by Analysis of Local Distribution of Short Sequences. BMC Genom. 2007, 8, 67. [Google Scholar] [CrossRef] [PubMed]
  32. Sanchez-Muñoz, R.; Bonfill, M.; Cusidó, R.M.; Palazón, J.; Moyano, E. Advances in the Regulation of In Vitro Paclitaxel Production: Methylation of a Y-Patch Promoter Region Alters BAPT Gene Expression in Taxus Cell Cultures. Plant Cell Physiol. 2018, 59, 2255–2267. [Google Scholar] [CrossRef] [PubMed]
  33. Porto, M.S.; Pinheiro, M.P.N.; Batista, V.G.L.; dos Santos, R.C.; de Albuquerque Melo Filho, P.; de Lima, L.M. Plant Promoters: An Approach of Structure and Function. Mol. Biotechnol. 2013, 56, 38–49. [Google Scholar] [CrossRef]
  34. Lagrange, T.; Kapanidis, A.N.; Tang, H.; Reinberg, D.; Ebright, R.H. New Core Promoter Element in RNA Polymerase II-Dependent Transcription: Sequence-Specific DNA Binding by Transcription Factor IIB. Genes Dev. 1998, 12, 34–44. [Google Scholar] [CrossRef] [PubMed]
  35. Laloum, T.; De Mita, S.; Gamas, P.; Baudin, M.; Niebel, A. CCAAT-Box Binding Transcription Factors in Plants: Y so Many? Trends Plant Sci. 2013, 18, 157–166, Erratum in Trends Plant Sci. 2013, 18, 594–595. [Google Scholar] [CrossRef]
  36. Gatta, R.; Mantovani, R. NF-Y Affects Histone Acetylation and H2A.Z Deposition in Cell Cycle Promoters. Epigenetics 2011, 6, 526–534. [Google Scholar] [CrossRef]
  37. Donati, G.; Gatta, R.; Dolfini, D.; Fossati, A.; Ceribelli, M.; Mantovani, R. An NF-Y-Dependent Switch of Positive and Negative Histone Methyl Marks on CCAAT Promoters. PLoS ONE 2008, 3, e2066, Correction in PLoS ONE 2014, 9, e102282. [Google Scholar] [CrossRef]
  38. Xiong, Y.; Ren, Y.; Li, W.; Wu, F.; Yang, W.; Huang, X.; Yao, J. NF-YC12 Is a Key Multi-Functional Regulator of Accumulation of Seed Storage Substances in Rice. J. Exp. Bot. 2019, 70, 3765–3780. [Google Scholar] [CrossRef]
  39. Nielsen, S.J.; Præstegaard, M.; Ørgensen, H.F.J.; Clark, B.F.C. Different Sp1 Family Members Differentially Affect Transcription from the Human Elongation Factor 1 A-1 Gene Promoter. Biochem. J. 1998, 333, 511–517. [Google Scholar] [CrossRef]
  40. Kadonaga, J.T.; Jones, K.A.; Tjian, R. Promoter-Specific Activation of RNA Polymerase II Transcription by Sp1. Trends Biochem. Sci. 1986, 11, 20–23. [Google Scholar] [CrossRef]
  41. Yamamoto, Y.Y.; Ichida, H.; Abe, T.; Suzuki, Y.; Sugano, S.; Obokata, J. Differentiation of Core Promoter Architecture between Plants and Mammals Revealed by LDSS Analysis. Nucleic Acids Res. 2007, 35, 6219–6226. [Google Scholar] [CrossRef] [PubMed]
  42. Burke, T.W.; Kadonaga, J.T. Drosophila TFIID Binds to a Conserved Downstream Basal Promoter Element That Is Present in Many TATA-Box-Deficient Promoters. Genes Dev. 1996, 10, 711–724. [Google Scholar] [CrossRef] [PubMed]
  43. Burke, T.W.; Kadonaga, J.T. The Downstream Core Promoter Element, DPE, Is Conserved from Drosophila to Humans and Is Recognized by TAFII60 of Drosophila. Genes Dev. 1997, 11, 3020–3031. [Google Scholar] [CrossRef] [PubMed]
  44. Biłas, R.; Szafran, K.; Hnatuszko-Konka, K.; Kononowicz, A.K. Cis-Regulatory Elements Used to Control Gene Expression in Plants. Plant Cell Tissue Organ Cult. 2016, 127, 269–287. [Google Scholar] [CrossRef]
  45. Yamamoto, Y.Y.; Yoshioka, Y.; Hyakumachi, M.; Obokata, J. Characteristics of Core Promoter Types with Respect to Gene Structure and Expression in Arabidopsis thaliana. DNA Res. 2011, 18, 333–342. [Google Scholar] [CrossRef]
  46. Yang, E.J.Y.; Maranas, C.J.; Nemhauser, J.L. A Comparative Analysis of Stably Expressed Genes across Diverse Angiosperms Exposes Flexibility in Underlying Promoter Architecture. G3 2023, 13, jkad206. [Google Scholar] [CrossRef]
  47. Hoang, X.L.T.; Nhi, D.N.H.; Thu, N.B.A.; Thao, N.P.; Tran, L.-S.P. Transcription Factors and Their Roles in Signal Transduction in Plants under Abiotic Stresses. Curr. Genom. 2017, 18, 483–497. [Google Scholar] [CrossRef]
  48. Li, C.; Ng, C.K.-Y.; Fan, L.-M. MYB Transcription Factors, Active Players in Abiotic Stress Signaling. Environ. Exp. Bot. 2015, 114, 80–91. [Google Scholar] [CrossRef]
  49. Rushton, P.J.; Somssich, I.E.; Ringler, P.; Shen, Q.J. WRKY Transcription Factors. Trends Plant Sci. 2010, 15, 247–258. [Google Scholar] [CrossRef]
  50. Abdullah-Zawawi, M.-R.; Ahmad-Nizammuddin, N.-F.; Govender, N.; Harun, S.; Mohd-Assaad, N.; Mohamed-Hussein, Z.-A. Comparative Genome-Wide Analysis of WRKY, MADS-Box and MYB Transcription Factor Families in Arabidopsis and Rice. Sci. Rep. 2021, 11, 19678. [Google Scholar] [CrossRef]
  51. Kaur, A.; Pati, P.K.; Pati, A.M.; Nagpal, A.K. In-Silico Analysis of Cis-Acting Regulatory Elements of Pathogenesis-Related Proteins of Arabidopsis thaliana and Oryza sativa. PLoS ONE 2017, 12, e0184523. [Google Scholar] [CrossRef] [PubMed]
  52. Ahmar, S.; Gruszka, D. Study of Brassinosteroid Signaling Genes in Rice Provides Insight Into Mechanisms Which Regulate Their Expression. Front. Genet. 2022, 13, 953458. [Google Scholar] [CrossRef] [PubMed]
  53. Lescot, M.; Déhais, P.; Thijs, G.; Marchal, K.; Moreau, Y.; Van de Peer, Y.; Rouzé, P.; Rombauts, S. PlantCARE, a Database of Plant Cis-Acting Regulatory Elements and a Portal to Tools for in Silico Analysis of Promoter Sequences. Nucleic Acids Res. 2002, 30, 325–327. [Google Scholar] [CrossRef] [PubMed]
  54. Higo, K.; Ugawa, Y.; Iwamoto, M.; Korenaga, T. Plant Cis-Acting Regulatory DNA Elements (PLACE) Database: 1999. Nucleic Acids Res. 1999, 27, 297–300. [Google Scholar] [CrossRef]
  55. Tian, F.; Yang, D.-C.; Meng, Y.-Q.; Jin, J.; Gao, G. PlantRegMap: Charting Functional Regulatory Maps in Plants. Nucleic Acids Res. 2020, 48, D1104–D1113. [Google Scholar] [CrossRef]
  56. Jin, X.; Bai, C.; Bassie, L.; Nogareda, C.; Romagosa, I.; Twyman, R.M.; Christou, P.; Zhu, C. ZmPBF and ZmGAMYB Transcription Factors Independently Transactivate the Promoter of the Maize (Zea mays) β-Carotene Hydroxylase 2 Gene. New Phytol. 2019, 222, 793–804. [Google Scholar] [CrossRef]
  57. Schmitz, R.J.; Grotewold, E.; Stam, M. Cis-Regulatory Sequences in Plants: Their Importance, Discovery, and Future Challenges. Plant Cell 2021, 34, 718–741. [Google Scholar] [CrossRef]
  58. Voichek, Y.; Hristova, G.; Mollá-Morales, A.; Weigel, D.; Nordborg, M. Widespread Position-Dependent Transcriptional Regulatory Sequences in Plants. Nat. Genet. 2024, 56, 2238–2246. [Google Scholar] [CrossRef]
  59. Andersson, R.; Sandelin, A. Determinants of Enhancer and Promoter Activities of Regulatory Elements. Nat. Rev. Genet. 2020, 21, 71–87. [Google Scholar] [CrossRef]
  60. Beernink, B.M.; Vogel, J.P.; Lei, L. Enhancers in Plant Development, Adaptation and Evolution. Plant Cell Physiol. 2025, 66, 461–476. [Google Scholar] [CrossRef]
  61. Marand, A.P.; Eveland, A.L.; Kaufmann, K.; Springer, N.M. Cis-Regulatory Elements in Plant Development, Adaptation, and Evolution. Annu. Rev. Plant Biol. 2023, 74, 111–137. [Google Scholar] [CrossRef] [PubMed]
  62. Oka, R.; Zicola, J.; Weber, B.; Anderson, S.N.; Hodgman, C.; Gent, J.I.; Wesselink, J.-J.; Springer, N.M.; Hoefsloot, H.C.J.; Turck, F.; et al. Genome-Wide Mapping of Transcriptional Enhancer Candidates Using DNA and Chromatin Features in Maize. Genome Biol. 2017, 18, 137. [Google Scholar] [CrossRef] [PubMed]
  63. Ricci, W.A.; Lu, Z.; Ji, L.; Marand, A.P.; Ethridge, C.L.; Murphy, N.G.; Noshay, J.M.; Galli, M.; Mejía-Guerra, M.K.; Colomé-Tatché, M.; et al. Widespread Long-Range Cis-Regulatory Elements in the Maize Genome. Nat. Plants 2019, 5, 1237–1249, Correction in Nat. Plants 2020, 6, 328. [Google Scholar] [CrossRef] [PubMed]
  64. Louwers, M.; Bader, R.; Haring, M.; van Driel, R.; de Laat, W.; Stam, M. Tissue- and Expression Level-Specific Chromatin Looping at Maize b1 Epialleles. Plant Cell 2009, 21, 832–842. [Google Scholar] [CrossRef]
  65. Sidorenko, L.; Li, X.; Tagliani, L.; Bowen, B.; Peterson, T. Characterization of the Regulatory Elements of the Maize P-Rr Gene by Transient Expression Assays. Plant Mol. Biol. 1999, 39, 11–19. [Google Scholar] [CrossRef]
  66. Clark, R.M.; Wagler, T.N.; Quijada, P.; Doebley, J. A Distant Upstream Enhancer at the Maize Domestication Gene tb1 Has Pleiotropic Effects on Plant and Inflorescent Architecture. Nat. Genet. 2006, 38, 594–597. [Google Scholar] [CrossRef]
  67. van der Geest, A.H.; Hall, T.C. The Beta-Phaseolin 5’ Matrix Attachment Region Acts as an Enhancer Facilitator. Plant Mol. Biol. 1997, 33, 553–557. [Google Scholar] [CrossRef]
  68. Hily, J.-M.; Singer, S.D.; Yang, Y.; Liu, Z. A Transformation Booster Sequence (TBS) from Petunia Hybrida Functions as an Enhancer-Blocking Insulator in Arabidopsis thaliana. Plant Cell Rep. 2009, 28, 1095–1104. [Google Scholar] [CrossRef]
  69. Singer, S.D.; Cox, K.D. A Gypsy-like Sequence from Arabidopsis thaliana Exhibits Enhancer-Blocking Activity in Transgenic Plants. J. Plant Biochem. Biotechnol. 2013, 22, 35–42. [Google Scholar] [CrossRef]
  70. Jores, T.; Mueth, N.A.; Tonnies, J.; Char, S.N.; Liu, B.; Grillo-Alvarado, V.; Abbitt, S.; Anand, A.; Deschamps, S.; Diehn, S.; et al. Small DNA Elements Can Act as Both Insulators and Silencers in Plants. Plant Cell 2025, 37, koaf084. [Google Scholar] [CrossRef]
  71. Lewis, J.J.; Geltman, R.C.; Pollak, P.C.; Rondem, K.E.; Van Belleghem, S.M.; Hubisz, M.J.; Munn, P.R.; Zhang, L.; Benson, C.; Mazo-Vargas, A.; et al. Parallel Evolution of Ancient, Pleiotropic Enhancers Underlies Butterfly Wing Pattern Mimicry. Proc. Natl. Acad. Sci. USA 2019, 116, 24174–24183. [Google Scholar] [CrossRef] [PubMed]
  72. Fagny, M.; Kuijjer, M.L.; Stam, M.; Joets, J.; Turc, O.; Rozière, J.; Pateyron, S.; Venon, A.; Vitte, C. Identification of Key Tissue-Specific, Biological Processes by Integrating Enhancer Information in Maize Gene Regulatory Networks. Front. Genet. 2020, 11, 606285. [Google Scholar] [CrossRef] [PubMed]
  73. Marand, A.P.; Chen, Z.; Gallavotti, A.; Schmitz, R.J. A Cis-Regulatory Atlas in Maize at Single-Cell Resolution. Cell 2021, 184, 3041–3055.e21. [Google Scholar] [CrossRef] [PubMed]
  74. Zheng, G.M.; Wu, J.W.; Li, J.; Zhao, Y.J.; Zhou, C.; Ren, R.C.; Wei, Y.M.; Zhang, X.S.; Zhao, X.Y. The Chromatin Accessibility Landscape during Early Maize Seed Development. Plant J. 2025, 121, e70073. [Google Scholar] [CrossRef]
  75. Parvathaneni, R.K.; Bertolini, E.; Shamimuzzaman, M.; Vera, D.L.; Lung, P.-Y.; Rice, B.R.; Zhang, J.; Brown, P.J.; Lipka, A.E.; Bass, H.W.; et al. The Regulatory Landscape of Early Maize Inflorescence Development. Genome Biol. 2020, 21, 165. [Google Scholar] [CrossRef]
  76. Dai, X.; Tu, X.; Du, B.; Dong, P.; Sun, S.; Wang, X.; Sun, J.; Li, G.; Lu, T.; Zhong, S.; et al. Chromatin and Regulatory Differentiation between Bundle Sheath and Mesophyll Cells in Maize. Plant J. 2022, 109, 675–692. [Google Scholar] [CrossRef]
  77. Marand, A.P.; Jiang, L.; Gomez-Cano, F.; Minow, M.A.A.; Zhang, X.; Mendieta, J.P.; Luo, Z.; Bang, S.; Yan, H.; Meyer, C.; et al. The Genetic Architecture of Cell Type-Specific Cis Regulation in Maize. Science 2025, 388, eads6601. [Google Scholar] [CrossRef]
  78. Kocaoglan, E.G.; Radhakrishnan, D.; Nakayama, N. Synthetic Developmental Biology: Molecular Tools to Re-Design Plant Shoots and Roots. J. Exp. Bot. 2023, 74, 3864–3876. [Google Scholar] [CrossRef]
  79. Schledzewski, K.; Mendel, R.R. Quantitative Transient Gene Expression: Comparison of the Promoters for Maize polyubiquitin1, Rice actin1, Maize-derivedEmu andCaMV 35S in Cells of Barley, Maize and Tobacco. Transgenic Res. 1994, 3, 249–255. [Google Scholar] [CrossRef]
  80. Ransom, C.; Balan, V.; Biswas, G.; Dale, B.; Crockett, E.; Sticklen, M. Heterologous Acidothermus Cellulolyticus 1,4-β-Endoglucanase E1 Produced Within the Corn Biomass Converts Corn Stover Into Glucose. Appl. Biochem. Biotecnol. 2007, 137, 207–219. [Google Scholar] [CrossRef]
  81. Rasco-Gaunt, S.; Liu, D.; Li, C.; Doherty, A.; Hagemann, K.; Riley, A.; Thompson, T.; Brunkan, C.; Mitchell, M.; Lowe, K.; et al. Characterisation of the Expression of a Novel Constitutive Maize Promoter in Transgenic Wheat and Maize. Plant Cell Rep. 2003, 21, 569–576. [Google Scholar] [CrossRef]
  82. Hood, E.E.; Bailey, M.R.; Beifuss, K.; Magallanes-Lundback, M.; Horn, M.E.; Callaway, E.; Drees, C.; Delaney, D.E.; Clough, R.; Howard, J.A. Criteria for High-Level Expression of a Fungal Laccase Gene in Transgenic Maize. Plant Biotechnol. J. 2003, 1, 129–140. [Google Scholar] [CrossRef] [PubMed]
  83. Gray, B.N.; Bougri, O.; Carlson, A.R.; Meissner, J.; Pan, S.; Parker, M.H.; Zhang, D.; Samoylov, V.; Ekborg, N.A.; Michael Raab, R. Global and Grain-Specific Accumulation of Glycoside Hydrolase Family 10 Xylanases in Transgenic Maize (Zea mays). Plant Biotechnol. J. 2011, 9, 1100–1108. [Google Scholar] [CrossRef] [PubMed]
  84. Shi, J.; Gao, H.; Wang, H.; Renee Lafitte, H.; Archibald, R.L.; Yang, M.; Hakimi, S.M.; Mo, H.; Habben, J.E. ARGOS8 Variants Generated by CRISPR-Cas9 Improve Maize Grain Yield under Field Drought Stress Conditions. Plant Biotechnol. J. 2017, 15, 207–216. [Google Scholar] [CrossRef] [PubMed]
  85. Wu, J.; Lawit, S.J.; Weers, B.; Sun, J.; Mongar, N.; Van Hemert, J.; Melo, R.; Meng, X.; Rupe, M.; Clapp, J.; et al. Overexpression of zmm28 Increases Maize Grain Yield in the Field. Proc. Natl. Acad. Sci. USA 2019, 116, 23850–23858. [Google Scholar] [CrossRef]
  86. Bradford, M.M. Analysis of Cis-Sequence of Subgenomic Transcript Promoter from the Figwort Mosaic Virus and Comparison of Promoter Activity with the Cauliflower Mosaic Virus Promoters in Monocot and Dicot Cells. Virus Res. 2002, 90, 47–62. [Google Scholar] [CrossRef]
  87. Stavolone, L.; Kononova, M.; Pauli, S.; Ragozzino, A.; de Haan, P.; Milligan, S.; Lawton, K.; Hohn, T. Cestrum Yellow Leaf Curling Virus (CmYLCV) Promoter: A New Strong Constitutive Promoter for Heterologous Gene Expression in a Wide Variety of Crops. Plant Mol. Biol. 2003, 53, 703–713. [Google Scholar] [CrossRef]
  88. Verdaguer, B.; de Kochko, A.; Beachy, R.N.; Fauquet, C. Isolation and Expression in Transgenic Tobacco and Rice Plants, of the Cassava Vein Mosaic Virus (CVMV) Promoter. Plant Mol. Biol. 1996, 31, 1129–1139. [Google Scholar] [CrossRef]
  89. Lowe, K.; La Rota, M.; Hoerster, G.; Hastings, C.; Wang, N.; Chamberlin, M.; Wu, E.; Jones, T.; Gordon-Kamm, W. Rapid Genotype “independent” Zea mays L. (maize) Transformation via Direct Somatic Embryogenesis. Vitr. Cell. Dev. Biol.-Plant 2018, 54, 240–252. [Google Scholar] [CrossRef]
  90. Shen, B.; Allen, W.B.; Zheng, P.; Li, C.; Glassman, K.; Ranch, J.; Nubel, D.; Tarczynski, M.C. Expression of ZmLEC1 and ZmWRI1 Increases Seed Oil Production in Maize. Plant Physiol. 2010, 153, 980–987. [Google Scholar] [CrossRef]
  91. Li, Y.; Liu, X.; Chen, R.; Tian, J.; Fan, Y.; Zhou, X. Genome-Scale Mining of Root-Preferential Genes from Maize and Characterization of Their Promoter Activity. BMC Plant Biol. 2019, 19, 584. [Google Scholar] [CrossRef] [PubMed]
  92. Naqvi, S.; Zhu, C.; Farre, G.; Ramessar, K.; Bassie, L.; Breitenbach, J.; Perez Conesa, D.; Ros, G.; Sandmann, G.; Capell, T.; et al. Transgenic Multivitamin Corn through Biofortification of Endosperm with Three Vitamins Representing Three Distinct Metabolic Pathways. Proc. Natl. Acad. Sci. USA 2009, 106, 7762–7767. [Google Scholar] [CrossRef] [PubMed]
  93. Belanger, F.C.; Kriz, A.L. Molecular Characterization of the Major Maize Embryo Globulin Encoded by the Glb1 Gene 1. Plant Physiol. 1989, 91, 636–643. [Google Scholar] [CrossRef] [PubMed]
  94. Cocciolone, S.M.; Nettleton, D.; Snook, M.E.; Peterson, T. Transformation of Maize with the p1 Transcription Factor Directs Production of Silk Maysin, a Corn Earworm Resistance Factor, in Concordance with a Hierarchy of Floral Organ Pigmentation. Plant Biotechnol. J. 2005, 3, 225–235. [Google Scholar] [CrossRef]
  95. Wang, H.; Fan, M.; Wang, G.; Zhang, C.; Shi, L.; Wei, Z.; Ma, W.; Chang, J.; Huang, S.; Lin, F. Isolation and Characterization of a Novel Pollen-Specific Promoter in Maize (Zea mays L.). Genome 2017, 60, 485–495. [Google Scholar] [CrossRef]
  96. Breitler, J.C.; Cordero, M.J.; Royer, M.; Meynard, D.; San Segundo, B.; Guiderdoni, E. The −689/+197 Region of the Maize Protease Inhibitor Gene Directs High Level, Wound-Inducible Expression of the cry1B Gene Which Protects Transgenic Rice Plants from Stemborer Attack. Mol. Breed. 2001, 7, 259–274. [Google Scholar] [CrossRef]
  97. Robson, P.R.H.; Donnison, I.S.; Wang, K.; Frame, B.; Pegg, S.E.; Thomas, A.; Thomas, H. Leaf Senescence Is Delayed in Maize Expressing the Agrobacterium IPT Gene under the Control of a Novel Maize Senescence-Enhanced Promoter. Plant Biotechnol. J. 2004, 2, 101–112. [Google Scholar] [CrossRef]
  98. Hou, J.; Jiang, P.; Qi, S.; Zhang, K.; He, Q.; Xu, C.; Ding, Z.; Zhang, K.; Li, K. Isolation and Functional Validation of Salinity and Osmotic Stress Inducible Promoter from the Maize Type-II H+-Pyrophosphatase Gene by Deletion Analysis in Transgenic Tobacco Plants. PLoS ONE 2016, 11, e0154041. [Google Scholar] [CrossRef]
  99. Streatfield, S.J.; Bray, J.; Love, R.T.; Horn, M.E.; Lane, J.R.; Drees, C.F.; Egelkrout, E.M.; Howard, J.A. Identification of Maize Embryo-Preferred Promoters Suitable for High-Level Heterologous Protein Production. GM Crops 2010, 1, 162–172. [Google Scholar] [CrossRef]
  100. Lee, L.-Y.; Kononov, M.E.; Bassuner, B.; Frame, B.R.; Wang, K.; Gelvin, S.B. Novel Plant Transformation Vectors Containing the Superpromoter. Plant Physiol. 2007, 145, 1294–1300. [Google Scholar] [CrossRef]
  101. Shepherd, C.T.; Paul Scott, M. Construction and Evaluation of a Maize (Zea mays) Chimaeric Promoter with Activity in Kernel Endosperm and Embryo. Biotechnol. Appl. Biochem. 2009, 52, 233–243. [Google Scholar] [CrossRef] [PubMed][Green Version]
  102. Feng, X.; Jia, L.; Cai, Y.; Guan, H.; Zheng, D.; Zhang, W.; Xiong, H.; Zhou, H.; Wen, Y.; Hu, Y.; et al. ABA-Inducible DEEPER ROOTING 1 Improves Adaptation of Maize to Water Deficiency. Plant Biotechnol. J. 2022, 20, 2077–2088. [Google Scholar] [CrossRef] [PubMed]
  103. Kumar, S.; AlAbed, D.; Whitteck, J.T.; Chen, W.; Bennett, S.; Asberry, A.; Wang, X.; DeSloover, D.; Rangasamy, M.; Wright, T.R.; et al. A Combinatorial Bidirectional and Bicistronic Approach for Coordinated Multi-Gene Expression in Corn. Plant Mol. Biol. 2015, 87, 341–353. [Google Scholar] [CrossRef] [PubMed]
  104. Feike, D.; Korolev, A.V.; Soumpourou, E.; Murakami, E.; Reid, D.; Breakspear, A.; Rogers, C.; Radutoiu, S.; Stougaard, J.; Harwood, W.A.; et al. Characterizing Standard Genetic Parts and Establishing Common Principles for Engineering Legume and Cereal Roots. Plant Biotechnol. J. 2019, 17, 2234–2245. [Google Scholar] [CrossRef]
  105. Lu, A.; Diehn, S.; Cigan, M. Maize Protein Expression. In Recent Advancements in Gene Expression and Enabling Technologies in Crop Plants; Springer: Berlin/Heidelberg, Germany, 2015; pp. 3–40. [Google Scholar] [CrossRef]
  106. McElroy, D.; Blowers, A.D.; Jenes, B.; Wu, R. Construction of Expression Vectors Based on the Rice Actin 1 (Act1) 5′ Region for Use in Monocot Transformation. Mol. Gen. Genet. MGG 1991, 231, 150–160. [Google Scholar] [CrossRef]
  107. May, D.B., 3rd; Pfotenhauer, A.C.; Concha, B.L.; Li, L.; Jones, S.M.; Harbison, S.A.; Martin, L.H.; Clark, L.A.; Occhialini, A.; Stewart, C.N., Jr.; et al. Toward a Monocot SynBio Toolkit: Assessing Regulatory Element Performance and Eudicot Compatibility. Plant Cell Rep. 2025, 44, 196. [Google Scholar] [CrossRef]
  108. Ali, S.; Kim, W.-C. A Fruitful Decade Using Synthetic Promoters in the Improvement of Transgenic Plants. Front. Plant Sci. 2019, 10, 493712. [Google Scholar] [CrossRef]
  109. Yasmeen, E.; Wang, J.; Riaz, M.; Zhang, L.; Zuo, K. Designing Artificial Synthetic Promoters for Accurate, Smart, and Versatile Gene Expression in Plants. Plant Commun. 2023, 4, 100558. [Google Scholar] [CrossRef]
  110. Khan, A.; Nasim, N.; Pudhuvai, B.; Koul, B.; Upadhyay, S.K.; Sethi, L.; Dey, N. Plant Synthetic Promoters: Advancement and Prospective. Agriculture 2023, 13, 298. [Google Scholar] [CrossRef]
  111. Cai, Y.-M.; Kallam, K.; Tidd, H.; Gendarini, G.; Salzman, A.; Patron, N.J. Rational Design of Minimal Synthetic Promoters for Plants. Nucleic Acids Res. 2020, 48, 11845–11856. [Google Scholar] [CrossRef]
  112. Aysha, J.; Noman, M.; Wang, F.; Liu, W.; Zhou, Y.; Li, H.; Li, X. Synthetic Promoters: Designing the Cis Regulatory Modules for Controlled Gene Expression. Mol. Biotechnol. 2018, 60, 608–620. [Google Scholar] [CrossRef]
  113. Liu, L.; Gallagher, J.; Arevalo, E.D.; Chen, R.; Skopelitis, T.; Wu, Q.; Bartlett, M.; Jackson, D. Enhancing Grain-Yield-Related Traits by CRISPR–Cas9 Promoter Editing of Maize CLE Genes. Nat. Plants 2021, 7, 287–294. [Google Scholar] [CrossRef]
  114. Shamloo-Dashtpagerdi, R.; Razi, H.; Aliakbari, M.; Lindlöf, A.; Ebrahimi, M.; Ebrahimie, E. A Novel Pairwise Comparison Method for in Silico Discovery of Statistically Significant Cis-Regulatory Elements in Eukaryotic Promoter Regions: Application to Arabidopsis. J. Theor. Biol. 2015, 364, 364–376. [Google Scholar] [CrossRef] [PubMed]
  115. Liu, X.; Li, S.; Yang, W.; Mu, B.; Jiao, Y.; Zhou, X.; Zhang, C.; Fan, Y.; Chen, R. Synthesis of Seed-Specific Bidirectional Promoters for Metabolic Engineering of Anthocyanin-Rich Maize. Plant Cell Physiol. 2018, 59, 1942–1955. [Google Scholar] [CrossRef] [PubMed]
  116. Liu, X.; Yang, W.; Li, Y.; Li, S.; Zhou, X.; Zhao, Q.; Fan, Y.; Lin, M.; Chen, R. The Intergenic Region of the Maize Defensin-like Protein Genes Def1 and Def2 Functions as an Embryo-Specific Asymmetric Bidirectional Promoter. J. Exp. Bot. 2016, 67, 4403–4413. [Google Scholar] [CrossRef] [PubMed]
  117. Stege, J.T.; Guan, X.; Ho, T.; Beachy, R.N.; Barbas, C.F., 3rd. Controlling Gene Expression in Plants Using Synthetic Zinc Finger Transcription Factors. Plant J. 2002, 32, 1077–1086. [Google Scholar] [CrossRef]
  118. Eisenhut, P.; Mebrahtu, A.; Moradi Barzadd, M.; Thalén, N.; Klanert, G.; Weinguny, M.; Sandegren, A.; Su, C.; Hatton, D.; Borth, N.; et al. Systematic Use of Synthetic 5′-UTR RNA Structures to Tune Protein Translation Improves Yield and Quality of Complex Proteins in Mammalian Cell Factories. Nucleic Acids Res. 2020, 48, e119. [Google Scholar] [CrossRef]
  119. De Nijs, Y.; De Maeseneire, S.L.; Soetaert, W.K. 5′ Untranslated Regions: The next Regulatory Sequence in Yeast Synthetic Biology. Biol. Rev. 2020, 95, 517–529. [Google Scholar] [CrossRef]
  120. Pan, Y.-Y.; Rui, C.; Zhu, L.; Wang, H.; Huang, D.-F.; Lang, Z.-H. Utilizing Modified ubi1 Introns to Enhance Exogenous Gene Expression in Maize (Zea mays L.) and Rice (Oryza sativa L.). J. Integr. Agric. 2016, 15, 1716–1726. [Google Scholar] [CrossRef]
  121. Hardy, E.C.; Balcerowicz, M. Untranslated yet indispensable—UTRs Act as Key Regulators in the Environmental Control of Gene Expression. J. Exp. Bot. 2024, 75, 4314–4331. [Google Scholar] [CrossRef]
  122. Cenik, C.; Derti, A.; Mellor, J.C.; Berriz, G.F.; Roth, F.P. Genome-Wide Functional Analysis of Human 5’ Untranslated Region Introns. Genome Biol. 2010, 11, R29. [Google Scholar] [CrossRef] [PubMed]
  123. Lin, Z.; Li, W.-H. Evolution of 5’ Untranslated Region Length and Gene Expression Reprogramming in Yeasts. Mol. Biol. Evol. 2012, 29, 81–89. [Google Scholar] [CrossRef] [PubMed]
  124. Christensen, A.H.; Sharrock, R.A.; Quail, P.H. Maize Polyubiquitin Genes: Structure, Thermal Perturbation of Expression and Transcript Splicing, and Promoter Activity Following Transfer to Protoplasts by Electroporation. Plant Mol. Biol. 1992, 18, 675–689. [Google Scholar] [CrossRef] [PubMed]
  125. Mardanova, E.S.; Zamchuk, L.A.; Ravin, N.V. The 5′-Untranslated Region of the Maize Alcohol Dehydrogenase Gene Provides Efficient Translation of mRNA in Plants under Stress Conditions. Mol. Biol. 2007, 41, 914–919. [Google Scholar] [CrossRef]
  126. Zheng, H.; Lin, S.; Zhang, Q.; Lei, Y.; Zhang, Z. Functional Analysis of 5’ Untranslated Region of a TIR-NBS-Encoding Gene from Triploid White Poplar. Mol. Genet. Genom. 2009, 282, 381–394. [Google Scholar] [CrossRef]
  127. Srivastava, A.K.; Lu, Y.; Zinta, G.; Lang, Z.; Zhu, J.-K. UTR-Dependent Control of Gene Expression in Plants. Trends Plant Sci. 2018, 23, 248–259. [Google Scholar] [CrossRef]
  128. Liu, B.; Liu, X.; Sun, M.; Sun, Y.; Liu, D.; Hao, L.; Tao, Y. Analysis of the 5′ Untranslated Region Length-Dependent Control of Gene Expression in Maize: A Case Study with the ZmLAZ1 Gene Family. Genes 2024, 15, 994. [Google Scholar] [CrossRef]
  129. Jiménez-González, A.S.; Fernández, N.; Martínez-Salas, E.; Sánchez de Jiménez, E. Functional and Structural Analysis of Maize hsp101 IRES. PLoS ONE 2014, 9, e107459. [Google Scholar] [CrossRef]
  130. Furuichi, Y. Discovery of m(7)G-Cap in Eukaryotic mRNAs. Proc. Jpn. Acad. Ser. B Phys. Biol. Sci. 2015, 91, 394–409. [Google Scholar] [CrossRef]
  131. Daffis, S.; Szretter, K.J.; Schriewer, J.; Li, J.; Youn, S.; Errett, J.; Lin, T.-Y.; Schneller, S.; Zust, R.; Dong, H.; et al. 2′-O Methylation of the Viral mRNA Cap Evades Host Restriction by IFIT Family Members. Nature 2010, 468, 452–456. [Google Scholar] [CrossRef]
  132. Despic, V.; Jaffrey, S.R. mRNA Ageing Shapes the Cap2 Methylome in Mammalian mRNA. Nature 2023, 614, 358–366. [Google Scholar] [CrossRef] [PubMed]
  133. Xiao, C.; Li, Q.; Wu, S.; Zhang, F.; Zhang, H.; Zhang, C.; Cai, Z.; Xia, Y. mRNAs of Plants and Green Algae Lack the m7G Cap-1 Structure. New Phytol. 2025, 246, 396–401. [Google Scholar] [CrossRef] [PubMed]
  134. Borden, K.L.B.; Volpon, L. The Diversity, Plasticity, and Adaptability of Cap-Dependent Translation Initiation and the Associated Machinery. RNA Biol. 2020, 17, 1239–1251. [Google Scholar] [CrossRef] [PubMed]
  135. Dinkova, T.D.; Márquez-Velázquez, N.A.; Aguilar, R.; Lázaro-Mixteco, P.E.; de Jiménez, E.S. Tight Translational Control by the Initiation Factors eIF4E and eIF(iso)4E Is Required for Maize Seed Germination. Seed Sci. Res. 2011, 21, 85–93. [Google Scholar] [CrossRef]
  136. Yamasaki, S.; Suzuki, A.; Yamano, Y.; Kawabe, H.; Ueno, D.; Demura, T.; Kato, K. Identification of 5’-Untranslated Regions That Function as Effective Translational Enhancers in Monocotyledonous Plant Cells Using a Novel Method of Genome-Wide Analysis. Plant Biotechnol. 2018, 35, 365–373. [Google Scholar] [CrossRef]
  137. Echevarría-Zomeño, S.; Yángüez, E.; Fernández-Bautista, N.; Castro-Sanz, A.B.; Ferrando, A.; Castellano, M.M. Regulation of Translation Initiation under Biotic and Abiotic Stresses. Int. J. Mol. Sci. 2013, 14, 4670–4683. [Google Scholar] [CrossRef]
  138. Godet, A.-C.; David, F.; Hantelys, F.; Tatin, F.; Lacazette, E.; Garmy-Susini, B.; Prats, A.-C. IRES Trans-Acting Factors, Key Actors of the Stress Response. Int. J. Mol. Sci. 2019, 20, 924. [Google Scholar] [CrossRef]
  139. Cui, Y.; Rao, S.; Chang, B.; Wang, X.; Zhang, K.; Hou, X.; Zhu, X.; Wu, H.; Tian, Z.; Zhao, Z.; et al. AtLa1 Protein Initiates IRES-Dependent Translation of WUSCHEL mRNA and Regulates the Stem Cell Homeostasis of Arabidopsis in Response to Environmental Hazards. Plant Cell Environ. 2015, 38, 2098–2114. [Google Scholar] [CrossRef]
  140. Yu, X.; Sharma, B.; Gregory, B.D. The Impact of Epitranscriptomic Marks on Post-Transcriptional Regulation in Plants. Brief. Funct. Genom. 2020, 20, 113–124. [Google Scholar] [CrossRef]
  141. Zhang, H.; Wang, Y.; Wu, X.; Tang, X.; Wu, C.; Lu, J. Determinants of Genome-Wide Distribution and Evolution of uORFs in Eukaryotes. Nat. Commun. 2021, 12, 1076, Correction in Nat. Commun. 2021, 12, 1076. [Google Scholar] [CrossRef]
  142. Dever, T.E.; Ivanov, I.P.; Hinnebusch, A.G. Translational Regulation by uORFs and Start Codon Selection Stringency. Genes. Dev. 2023, 37, 474–489. [Google Scholar] [CrossRef]
  143. Cymerman, M.A.; Saul, H.; Farhi, R.; Vexler, K.; Gottlieb, D.; Berezin, I.; Shaul, O. Plant Transcripts with Long or Structured Upstream Open Reading Frames in the NDL2 5’ UTR Can Escape Nonsense-Mediated mRNA Decay in a Reinitiation-Independent Manner. J. Exp. Bot. 2023, 74, 91–103. [Google Scholar] [CrossRef] [PubMed]
  144. Rahmani, F.; Hummel, M.; Schuurmans, J.; Wiese-Klinkenberg, A.; Smeekens, S.; Hanson, J. Sucrose Control of Translation Mediated by an Upstream Open Reading Frame-Encoded Peptide. Plant Physiol. 2009, 150, 1356–1367. [Google Scholar] [CrossRef] [PubMed]
  145. Causier, B.; Hopes, T.; McKay, M.; Paling, Z.; Davies, B. Plants Utilise Ancient Conserved Peptide Upstream Open Reading Frames in Stress-Responsive Translational Regulation. Plant Cell Environ. 2022, 45, 1229–1241. [Google Scholar] [CrossRef] [PubMed]
  146. von Arnim, A.G.; Jia, Q.; Vaughn, J.N.V. Regulation of Plant Translation by Upstream Open Reading Frames. Plant Sci. 2014, 214, 1–12. [Google Scholar] [CrossRef]
  147. Kim, B.-H.; Cai, X.; Vaughn, J.N.; von Arnim, A.G. On the Functions of the H Subunit of Eukaryotic Initiation Factor 3 in Late Stages of Translation Initiation. Genome Biol. 2007, 8, R60. [Google Scholar] [CrossRef]
  148. Lohmer, S.; Maddaloni, M.; Motto, M.; Salamini, F.; Thompson, R.D. Translation of the mRNA of the Maize Transcriptional Activator Opaque-2 Is Inhibited by Upstream Open Reading Frames Present in the Leader Sequence. Plant Cell 1993, 5, 65–73. [Google Scholar] [CrossRef]
  149. Lei, L.; Shi, J.; Chen, J.; Zhang, M.; Sun, S.; Xie, S.; Li, X.; Zeng, B.; Peng, L.; Hauck, A.; et al. Ribosome Profiling Reveals Dynamic Translational Landscape in Maize Seedlings under Drought Stress. Plant J. 2015, 84, 1206–1218. [Google Scholar] [CrossRef]
  150. Gage, J.L.; Mali, S.; McLoughlin, F.; Khaipho-Burch, M.; Monier, B.; Bailey-Serres, J.; Vierstra, R.D.; Buckler, E.S. Variation in Upstream Open Reading Frames Contributes to Allelic Diversity in Maize Protein Abundance. Proc. Natl. Acad. Sci. USA 2022, 119, e2112516119. [Google Scholar] [CrossRef]
  151. Procissi, A.; Piazza, P.; Tonelli, C. A Maize r1 Gene Is Regulated Post-Transcriptionally by Differential Splicing of Its Leader. Plant Mol. Biol. 2002, 49, 239–248. [Google Scholar] [CrossRef]
  152. Leppek, K.; Das, R.; Barna, M. Functional 5’ UTR mRNA Structures in Eukaryotic Translation Regulation and How to Find Them. Nat. Rev. Mol. Cell Biol. 2018, 19, 158–174, Correction in Nat. Rev. Mol. Cell Biol. 2018, 19, 673. [Google Scholar] [CrossRef]
  153. Bose, R.; Saleem, I.; Mustoe, A.M. Causes, Functions, and Therapeutic Possibilities of RNA Secondary Structure Ensembles and Alternative States. Cell Chem. Biol. 2024, 31, 17–35. [Google Scholar] [CrossRef]
  154. Yang, X.; Yu, H.; Duncan, S.; Zhang, Y.; Cheema, J.; Liu, H.; Benjamin Miller, J.; Zhang, J.; Kwok, C.K.; Zhang, H.; et al. RNA G-Quadruplex Structure Contributes to Cold Adaptation in Plants. Nat. Commun. 2022, 13, 6224. [Google Scholar] [CrossRef]
  155. Thomas, S.E.; Balcerowicz, M.; Chung, B.Y.-W. RNA Structure Mediated Thermoregulation: What Can We Learn from Plants? Front. Plant Sci. 2022, 13, 938570. [Google Scholar] [CrossRef]
  156. Kawaguchi, R.; Bailey-Serres, J. mRNA Sequence Features That Contribute to Translational Regulation in Arabidopsis. Nucleic Acids Res. 2005, 33, 955–965. [Google Scholar] [CrossRef]
  157. Kim, Y.; Lee, G.; Jeon, E.; Sohn, E.J.; Lee, Y.; Kang, H.; Lee, D.W.; Kim, D.H.; Hwang, I. The Immediate Upstream Region of the 5′-UTR from the AUG Start Codon Has a Pronounced Effect on the Translational Efficiency in Arabidopsis thaliana. Nucleic Acids Res. 2013, 42, 485–498. [Google Scholar] [CrossRef] [PubMed]
  158. Damiani, R.D.; Wessler, S.R. An Upstream Open Reading Frame Represses Expression of Lc, a Member of the R/B Family of Maize Transcriptional Activators. Proc. Natl. Acad. Sci. USA 1993, 90, 8244–8248. [Google Scholar] [CrossRef] [PubMed]
  159. Wang, L.; Wessler, S.R. Inefficient Reinitiation Is Responsible for Upstream Open Reading Frame–Mediated Translational Repression of the Maize R Gene. Plant Cell 1998, 10, 1733–1745. [Google Scholar] [CrossRef] [PubMed]
  160. Wang, L.; Wessler, S.R. Role of mRNA Secondary Structure in Translational Repression of the Maize Transcriptional ActivatorLc 1,2. Plant Physiol. 2001, 125, 1380–1387. [Google Scholar] [CrossRef]
  161. Titov, I.I.; Kochetov, A.V.; Kolchanov, N.A.; Sarai, A. Computer Analysis of mRNA Untranslated Regions of Hypoxia-Induced Corn Genes. In Proceedings of the Third International Conference on Bioinformatics of Genome Regulation and Structure (BGRS’2002), Novosibirsk, Russia, 14–20 July 2002; Volume 3, pp. 46–48. [Google Scholar]
  162. Mutsuro-Aoki, H.; Teramura, H.; Tamukai, R.; Fukui, M.; Kusano, H.; Schepetilnikov, M.; Ryabova, L.A.; Shimada, H. Dissection of a Rice OsMac1 mRNA 5’ UTR to Uncover Regulatory Elements That Are Responsible for Its Efficient Translation. PLoS ONE 2021, 16, e0253488. [Google Scholar] [CrossRef]
  163. Su, Z.; Tang, Y.; Ritchey, L.E.; Tack, D.C.; Zhu, M.; Bevilacqua, P.C.; Assmann, S.M. Genome-Wide RNA Structurome Reprogramming by Acute Heat Shock Globally Regulates mRNA Abundance. Proc. Natl. Acad. Sci. USA 2018, 115, 12170–12175. [Google Scholar] [CrossRef]
  164. Zhao, D.; Hamilton, J.P.; Hardigan, M.; Yin, D.; He, T.; Vaillancourt, B.; Reynoso, M.; Pauluzzi, G.; Funkhouser, S.; Cui, Y.; et al. Analysis of Ribosome-Associated mRNAs in Rice Reveals the Importance of Transcript Size and GC Content in Translation. G3 Genes|Genomes|Genet 2017, 7, 203–219. [Google Scholar] [CrossRef] [PubMed]
  165. Deng, H.; Cheema, J.; Zhang, H.; Woolfenden, H.; Norris, M.; Liu, Z.; Liu, Q.; Yang, X.; Yang, M.; Deng, X.; et al. Rice In Vivo RNA Structurome Reveals RNA Secondary Structure Conservation and Divergence in Plants. Mol. Plant 2018, 11, 607–622. [Google Scholar] [CrossRef] [PubMed]
  166. Yang, X.; Yu, H.; Sun, W.; Ding, L.; Li, J.; Cheema, J.; Ramirez-Gonzalez, R.; Zhao, X.; Martín, A.C.; Lu, F.; et al. Wheat in Vivo RNA Structure Landscape Reveals a Prevalent Role of RNA Structure in Modulating Translational Subgenome Expression Asymmetry. Genome Biol. 2021, 22, 326. [Google Scholar] [CrossRef] [PubMed]
  167. Chung, B.Y.W.; Simons, C.; Firth, A.E.; Brown, C.M.; Hellens, R.P. Effect of 5’UTR Introns on Gene Expression in Arabidopsis thaliana. BMC Genom. 2006, 7, 120. [Google Scholar] [CrossRef]
  168. Lim, C.S.; Wardell, S.J.T.; Kleffmann, T.; Brown, C.M. The Exon–intron Gene Structure Upstream of the Initiation Codon Predicts Translation Efficiency. Nucleic Acids Res. 2018, 46, 4575–4591. [Google Scholar] [CrossRef]
  169. Morello, L.; Gianì, S.; Troina, F.; Breviario, D. Testing the IMEter on Rice Introns and Other Aspects of Intron-Mediated Enhancement of Gene Expression. J. Exp. Bot. 2010, 62, 533–544. [Google Scholar] [CrossRef]
  170. Laxa, M. Intron-Mediated Enhancement: A Tool for Heterologous Gene Expression in Plants? Front. Plant Sci. 2017, 7, 227098. [Google Scholar] [CrossRef]
  171. Gallegos, J.E.; Rose, A.B. The Enduring Mystery of Intron-Mediated Enhancement. Plant Sci. 2015, 237, 8–15. [Google Scholar] [CrossRef]
  172. Samadder, P.; Sivamani, E.; Lu, J.; Li, X.; Qu, R. Transcriptional and Post-Transcriptional Enhancement of Gene Expression by the 5′ UTR Intron of Rice rubi3 Gene in Transgenic Rice Cells. Mol. Genet. Genom. 2008, 279, 429–439. [Google Scholar] [CrossRef]
  173. Rethmeier, N.; Seurinck, J.; Van Montagu, M.; Cornelissen, M. Intron-Mediated Enhancement of Transgene Expression in Maize Is a Nuclear, Gene-Dependent Process. Plant J. 1997, 12, 895–899. [Google Scholar] [CrossRef] [PubMed]
  174. Callis, J.; Fromm, M.; Walbot, V. Introns Increase Gene Expression in Cultured Maize Cells. Genes Dev. 1987, 1, 1183–1200. [Google Scholar] [CrossRef] [PubMed]
  175. Donath, M.; Mendel, R.; Cerff, R.; Martin, W. Intron-Dependent Transient Expression of the Maize GapA1 Gene. Plant Mol. Biol. 1995, 28, 667–676. [Google Scholar] [CrossRef] [PubMed]
  176. Vasil, V.; Clancy, M.; Ferl, R.J.; Vasil, I.K.; Hannah, L.C. Increased Gene Expression by the First Intron of Maize Shrunken-1 Locus in Grass Species. Plant Physiol. 1989, 91, 1575–1579. [Google Scholar] [CrossRef]
  177. Maas, C.; Laufs, J.; Grant, S.; Korfhage, C.; Werr, W. The Combination of a Novel Stimulatory Element in the First Exon of the Maize Shrunken-1 Gene with the Following Intron 1 Enhances Reporter Gene Expression up to 1000-Fold. Plant Mol. Biol. 1991, 16, 199–207. [Google Scholar] [CrossRef]
  178. Salgueiro, S.; Pignocchi, C.; Parry, M.A.J. Intron-Mediated gusA Expression in Tritordeum and Wheat Resulting from Particle Bombardment. Plant Mol. Biol. 2000, 42, 615–622. [Google Scholar] [CrossRef]
  179. Fan, Q.; Treder, K.; Miller, W.A. Untranslated Regions of Diverse Plant Viral RNAs Vary Greatly in Translation Enhancement Efficiency. BMC Biotechnol. 2012, 12, 22. [Google Scholar] [CrossRef]
  180. Basu, D.; South, P.F. Design and Analysis of Native Photorespiration Gene Motifs of Promoter Untranslated Region Combinations Under Short Term Abiotic Stress Conditions. Front. Plant Sci. 2022, 13, 828729. [Google Scholar] [CrossRef]
  181. Koziel, M.G.; Carozzi, N.B.; Desai, N. Optimizing Expression of Transgenes with an Emphasis on Post-Transcriptional Events. Plant Mol. Biol. 1996, 32, 393–405. [Google Scholar] [CrossRef]
  182. Omirulleh, S.; Abrahám, M.; Golovkin, M.; Stefanov, I.; Karabaev, M.K.; Mustárdy, L.; Mórocz, S.; Dudits, D. Activity of a Chimeric Promoter with the Doubled CaMV 35S Enhancer Element in Protoplast-Derived Cells and Transgenic Plants in Maize. Plant Mol. Biol. 1993, 21, 415–428. [Google Scholar] [CrossRef]
  183. Conner, T.; Flasinski, S.; Pang, S.; You, J. Chimeric Promoters Comprising a Rice Actin 1 Promoter and 35s Enhancer for Use in Plants. Patent US8436161B2, 7 May 2013. [Google Scholar]
  184. Guo, F.-X.; Yang, R.-X.; Yang, X.; Liu, J.; Wang, Y.-Z. Application of an Efficient Enhancer in Gene Function Research. Plants 2024, 13, 3120. [Google Scholar] [CrossRef]
  185. Akbergenov, R.Z.; Zhanybekova, S.S.; Kryldakov, R.V.; Zhigailov, A.; Polimbetova, N.S.; Hohn, T.; Iskakov, B.K. ARC-1, a Sequence Element Complementary to an Internal 18S rRNA Segment, Enhances Translation Efficiency in Plants When Present in the Leader or Intercistronic Region of mRNAs. Nucleic Acids Res. 2004, 32, 239–247. [Google Scholar] [CrossRef] [PubMed]
  186. Matsui, T.; Sawada, K.; Takita, E.; Kato, K. Compatibility of Translational Enhancers with Various Plant Species. Plant Biotechnol. 2015, 32, 309–316. [Google Scholar] [CrossRef][Green Version]
  187. Shatsky, I.N.; Dmitriev, S.E.; Terenin, I.M.; Andreev, D.E. Cap- and IRES-Independent Scanning Mechanism of Translation Initiation as an Alternative to the Concept of Cellular IRESs. Mol. Cells 2010, 30, 285–293. [Google Scholar] [CrossRef] [PubMed]
  188. Miras, M.; Miller, W.A.; Truniger, V.; Aranda, M.A. Non-Canonical Translation in Plant RNA Viruses. Front. Plant Sci. 2017, 8, 494. [Google Scholar] [CrossRef]
  189. Carino, E.J.; Scheets, K.; Allen Miller, W. The RNA of Maize Chlorotic Mottle Virus, an Obligatory Component of Maize Lethal Necrosis Disease, Is Translated via a Variant Panicum Mosaic Virus-Like Cap-Independent Translation Element. J. Virol. 2020, 94, e01005-20. [Google Scholar] [CrossRef]
  190. Kozak, M. An Analysis of 5’-Noncoding Sequences from 699 Vertebrate Messenger RNAs. Nucleic Acids Res. 1987, 15, 8125–8148. [Google Scholar] [CrossRef]
  191. Joshi, C.P.; Zhou, H.; Huang, X.; Chiang, V.L. Context Sequences of Translation Initiation Codon in Plants. Plant Mol. Biol. 1997, 35, 993–1001. [Google Scholar] [CrossRef]
  192. Shi, J.-J.; Cao, Y.; Lang, Q.-H.; Dong, Y.; Huang, L.-Y.; Yang, L.-J.; Li, J.-J.; Zhang, X.-X.; Wang, D.-Y. The Effect of the Nucleotides Immediately Upstream of the AUG Start Codon on the Efficiency of Translation Initiation in Sperm Cells. Plant Reprod. 2022, 35, 221–231. [Google Scholar] [CrossRef]
  193. Sugio, T.; Matsuura, H.; Matsui, T.; Matsunaga, M.; Nosho, T.; Kanaya, S.; Shinmyo, A.; Kato, K. Effect of the Sequence Context of the AUG Initiation Codon on the Rate of Translation in Dicotyledonous and Monocotyledonous Plant Cells. J. Biosci. Bioeng. 2010, 109, 170–173. [Google Scholar] [CrossRef]
  194. Kozak, M. Point Mutations Define a Sequence Flanking the AUG Initiator Codon That Modulates Translation by Eukaryotic Ribosomes. Cell 1986, 44, 283–292. [Google Scholar] [CrossRef] [PubMed]
  195. Kozak, M. Recognition of AUG and Alternative Initiator Codons Is Augmented by G in Position +4 but Is Not Generally Affected by the Nucleotides in Positions +5 and +6. EMBO J. 1997, 16, 2482–2492. [Google Scholar] [CrossRef] [PubMed]
  196. Gupta, P.; Rangan, L.; Venkata Ramesh, T.; Gupta, M. Comparative Analysis of Contextual Bias around the Translation Initiation Sites in Plant Genomes. J. Theor. Biol. 2016, 404, 303–311. [Google Scholar] [CrossRef] [PubMed]
  197. Wu, T.-Y.; Li, Y.-R.; Chang, K.-J.; Fang, J.-C.; Urano, D.; Liu, M.-J. Modeling Alternative Translation Initiation Sites in Plants Reveals Evolutionarily Conserved -Regulatory Codes in Eukaryotes. Genome Res. 2024, 34, 272–285. [Google Scholar] [CrossRef]
  198. Brophy, J.A.N.; Magallon, K.J.; Duan, L.; Zhong, V.; Ramachandran, P.; Kniazev, K.; Dinneny, J.R. Synthetic Genetic Circuits as a Means of Reprogramming Plant Roots. Science 2022, 377, 747–751. [Google Scholar] [CrossRef]
  199. Bhadouriya, S.L.; Karamchandani, A.N.; Nayak, N.; Mehrotra, S.; Mehrotra, R. Artificially Designed Synthetic Promoter for a High Level of Salt Induction Using a Cis-Engineering Approach. Sci. Rep. 2024, 14, 13657. [Google Scholar] [CrossRef]
  200. Nayak, N.; Mehrotra, S.; Karamchandani, A.N.; Santelia, D.; Mehrotra, R. Recent Advances in Designing Synthetic Plant Regulatory Modules. Front. Plant Sci. 2025, 16, 1567659. [Google Scholar] [CrossRef]
  201. Tu, X.; Mejía-Guerra, M.K.; Valdes Franco, J.A.; Tzeng, D.; Chu, P.-Y.; Shen, W.; Wei, Y.; Dai, X.; Li, P.; Buckler, E.S.; et al. Reconstructing the Maize Leaf Regulatory Network Using ChIP-Seq Data of 104 Transcription Factors. Nat. Commun. 2020, 11, 5089, Correction in Nat. Commun. 2023, 14, 1586. [Google Scholar] [CrossRef]
  202. Wang, Y.; Zhang, P.; Guo, W.; Liu, H.; Li, X.; Zhang, Q.; Du, Z.; Hu, G.; Han, X.; Pu, L.; et al. A Deep Learning Approach to Automate Whole-Genome Prediction of Diverse Epigenomic Modifications in Plants. New Phytol. 2021, 232, 880–897. [Google Scholar] [CrossRef]
  203. Deng, K.; Zhang, Q.; Hong, Y.; Yan, J.; Hu, X. iCREPCP: A Deep Learning-Based Web Server for Identifying Base-Resolution Cis-Regulatory Elements within Plant Core Promoters. Plant Commun. 2023, 4, 100455. [Google Scholar] [CrossRef]
  204. Fishman, V.; Kuratov, Y.; Shmelev, A.; Petrov, M.; Penzar, D.; Shepelin, D.; Chekanov, N.; Kardymon, O.; Burtsev, M. GENA-LM: A Family of Open-Source Foundational DNA Language Models for Long Sequences. Nucleic Acids Res. 2025, 53, gkae1310. [Google Scholar] [CrossRef]
  205. Barazandeh, S.; Ozden, F.; Hincer, A.; Seker, U.O.S.; Cicek, A.E. UTRGAN: Learning to Generate 5’ UTR Sequences for Optimized Translation Efficiency and Gene Expression. Bioinform. Adv. 2025, 5, vbaf134. [Google Scholar] [CrossRef]
  206. Zhang, H.; Liu, H.; Xu, Y.; Huang, H.; Liu, Y.; Wang, J.; Qin, Y.; Wang, H.; Ma, L.; Xun, Z.; et al. Deep Generative Models Design mRNA Sequences with Enhanced Translational Capacity and Stability. Science 2025, 390, eadr8470. [Google Scholar] [CrossRef]
  207. Yu, H.; Yang, H.; Sun, W.; Yan, Z.; Yang, X.; Zhang, H.; Ding, Y.; Li, K. An Interpretable RNA Foundation Model for Exploring Functional RNA Motifs in Plants. Nat. Mach. Intell. 2024, 6, 1616–1625. [Google Scholar] [CrossRef]
Figure 1. Hierarchical organization and functional architecture of plant promoters. The colored boxes represent distinct promoter parts. Abbreviations: BREu/BREd, upstream/downstream, B-recognition elements; CREB, cAMP Response Element-Binding protein; DNMT1, DNA (cytosine-5)-methyltransferase 1; Inr, initiator; NF-Y, Nuclear Factor Y; SAGA, Spt-Ada-Gcn5 Acetyltransferase; TFs, transcription factors.
Figure 1. Hierarchical organization and functional architecture of plant promoters. The colored boxes represent distinct promoter parts. Abbreviations: BREu/BREd, upstream/downstream, B-recognition elements; CREB, cAMP Response Element-Binding protein; DNMT1, DNA (cytosine-5)-methyltransferase 1; Inr, initiator; NF-Y, Nuclear Factor Y; SAGA, Spt-Ada-Gcn5 Acetyltransferase; TFs, transcription factors.
Ijms 27 00548 g001
Figure 2. Schematic representation of regulatory elements within the 5′ untranslated region (5′UTR). The composition, quantity, and position in sequence of elements may vary among different 5′UTRs.
Figure 2. Schematic representation of regulatory elements within the 5′ untranslated region (5′UTR). The composition, quantity, and position in sequence of elements may vary among different 5′UTRs.
Ijms 27 00548 g002
Table 2. Plant 5′UTR regulatory elements with relevance to maize-specific gene regulation.
Table 2. Plant 5′UTR regulatory elements with relevance to maize-specific gene regulation.
Biological/Biotechnological ApplicationsRepresentative Maize ExamplesPlant-Specific OccurrenceElement
Supports mRNA stability and translation efficiencyCap-0 formCap-0 form5′ cap
Cap-independent translation under stress conditionZmHsp101 geneFew confirmed examples in plant cellular mRNAsIRES
Fine-tuning gene expressionMaize drought-responsive genesCommonuORFs
Translational control during environmental stimuli; design of synthetic regulatory UTRsStructured 5′UTRs in ZmRab17 geneCommonRNA secondary structure
Intron-mediated enhancement to increase transgene expressionIntrons from Adh1 and Shrunken-1 genesCommonIntrons
Supports high-level gene expressionZmUbi1 5′UTRIdentified in dicot and monocot plantsTranslation enhancers
Optimization of translation initiation in gene constructsExpected to follow the plant consensus sequenceConserved among plantsStart codon context sequence
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sytov, N.V.; Choob, V.V.; Nemomissa, S.; Mishin, A.S.; Perfilov, M.M. From Structure to Function of Promoters and 5′UTRs in Maize. Int. J. Mol. Sci. 2026, 27, 548. https://doi.org/10.3390/ijms27010548

AMA Style

Sytov NV, Choob VV, Nemomissa S, Mishin AS, Perfilov MM. From Structure to Function of Promoters and 5′UTRs in Maize. International Journal of Molecular Sciences. 2026; 27(1):548. https://doi.org/10.3390/ijms27010548

Chicago/Turabian Style

Sytov, Nikita V., Vladimir V. Choob, Sileshi Nemomissa, Alexander S. Mishin, and Maxim M. Perfilov. 2026. "From Structure to Function of Promoters and 5′UTRs in Maize" International Journal of Molecular Sciences 27, no. 1: 548. https://doi.org/10.3390/ijms27010548

APA Style

Sytov, N. V., Choob, V. V., Nemomissa, S., Mishin, A. S., & Perfilov, M. M. (2026). From Structure to Function of Promoters and 5′UTRs in Maize. International Journal of Molecular Sciences, 27(1), 548. https://doi.org/10.3390/ijms27010548

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop