Evolution of Regulated Transcription

Oleg V. Bylino; Airat N. Ibragimov; Yulii V. Shidlovskii

doi:10.3390/cells9071675

,

and

¹

Laboratory of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, 34/5 Vavilov St., 119334 Moscow, Russia

²

Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, 34/5 Vavilov St., 119334 Moscow, Russia

³

I.M. Sechenov First Moscow State Medical University, 8, bldg. 2 Trubetskaya St., 119048 Moscow, Russia

^*

Author to whom correspondence should be addressed.

Cells2020, 9(7), 1675;https://doi.org/10.3390/cells9071675

This article belongs to the Special Issue Evolution of Epigenetic Mechanisms and Signatures

Version Notes

Order Reprints

Abstract

The genomes of all organisms abound with various cis-regulatory elements, which control gene activity. Transcriptional enhancers are a key group of such elements in eukaryotes and are DNA regions that form physical contacts with gene promoters and precisely orchestrate gene expression programs. Here, we follow gradual evolution of this regulatory system and discuss its features in different organisms. In eubacteria, an enhancer-like element is often a single regulatory element, is usually proximal to the core promoter, and is occupied by one or a few activators. Activation of gene expression in archaea is accompanied by the recruitment of an activator to several enhancer-like sites in the upstream promoter region. In eukaryotes, activation of expression is accompanied by the recruitment of activators to multiple enhancers, which may be distant from the core promoter, and the activators act through coactivators. The role of the general DNA architecture in transcription control increases in evolution. As a whole, it can be seen that enhancers of multicellular eukaryotes evolved from the corresponding prototypic enhancer-like regulatory elements with the gradually increasing genome size of organisms.

Keywords:

enhancer; transcription; evolution; promoter; chromatin loop; gene regulation

1. Introduction

Gene expression is a complicated multistep process that requires tight control. Eukaryotes have the corresponding regulatory machinery containing multiple cis- and trans-elements. The cis-regulatory elements are noncoding DNA sequences distributed throughout the genome [1]. Enhancers are one of the most important and best studied groups of regulatory elements. These specific DNA regions act as binding targets for transcription factors (TFs), which form DNA–protein complexes associated with transcription activation. The complexes directly interact with gene promoters and govern their activity in time and space.

The origin of transcription regulation mechanisms of eukaryotes lies in the world of prokaryotes. Therefore, it is not surprising that many features of eukaryotic regulation are common between these two groups of organisms. Thus, it is of great interest to follow evolution of the mechanism whereby transcription is activated step by step. The evolutionary approach obviously allows us to better understand the origin of the transcription regulatory complexity of higher eukaryotes, to clarify the general details of regulation, and to highlight the specific mechanisms and features that are found in the regulation of eukaryotes and are shared with prokaryotes. Here we provide an overview of enhancer-like mechanisms in prokaryotes, describe similarities in these mechanisms shared by both eukaryotes and prokaryotes, and discuss the gradual increase in the complexity of regulation by enhancers in evolution from eubacteria and archaea to unicellular eukaryotes, further to multicellular invertebrates, and finally to vertebrates.

2. The Concept of an Enhancer Sequence for Different Organisms

For multicellular eukaryotes, the broadly accepted concept is that a transcriptional enhancer is an independent DNA sequence that is separated from the promoter by a distance in 1D (but not in 3D) and combines sites for positive DNA-binding transcriptional regulators. Classically, a transcription enhancer is understood as a DNA sequence that is able to activate transcription over long distances regardless of its orientation with respect to the core promoter [2]. In this form, the concept of a transcription enhancer is inapplicable to unicellular eukaryotes and surely to prokaryotes.

In unicellular eukaryotes such as yeasts, an upstream regulatory region serving to activate transcription is carefully termed upstream activating sequence (UAS) or sometimes the enhancer-like sequence, meaning that a UAS possesses most, but not all, of the properties characteristic of enhancers of multicellular eukaryotes. UASs cannot activate transcription when located in a gene intron or downstream of the gene in contrast to enhancers of multicellular eukaryotes and are similar to them in being capable of activating transcription when inverted [3,4].

In prokaryotes, the region harboring both binding site for RNA polymerase (RNAP) and binding sites for transcriptional regulators is usually not subdivided into a promoter and bacterial enhancer(s) and is defined as a promoter. However, the term UAS [5,6,7,8], or UAS-like [9], or US region [10] is sometimes used for bacteria. The term is most often used to denote the bacterial enhancer-like sequences involved in an activation loop formation mechanism (see Section 5.4) [11,12,13]. It should be noted that the term UAS is sometimes applied to the promoters of rRNA- and tRNA-coding genes [14,15].

It is clear from the brief explanations above that a united block of regulatory DNA was obviously subdivided during evolution into two independent sections, which were assumed in the recent past to have different functional properties. However, recent works provided evidence that promoters and enhancers of multicellular eukaryotes have much in common and share several properties and functions [16,17,18]. Sometimes, promoters and enhancers are even considered now as a single class of functional elements, with a unified architecture for transcription initiation [19].

Common architecture and functional features (chromatin structure and transcription machinery recruitment) of eukaryotic enhancers and promoters reflect the conversion of one type of elements into the other during evolution of eukaryotes. Indeed, it was found that several hundreds of ancestral enhancers were converted to promoters in primate and rodent lineages of mammals during evolution [20,21].

An interesting hypothesis on the origin of enhancers in eukaryotes has recently appeared to suggest that developmental enhancers of multicellular eukaryotes evolved from unicellular eukaryotic inducible promoters [22]. Here we support this view by showing that, in general, enhancers of multicellular eukaryotes, including developmental ones, evolved via increasing separation of the upstream binding sites of transcriptional regulators from the core promoter as the genome sizes of organisms gradually increased.

3. The Origin of Transcriptional Enhancers of Eukaryotes and Evolution of the Genome Size

3.1. Principles of Transcription Activation Shared by Both Prokaryotes and Eukaryotes

The architecture of genomes reflects the principles of gene expression regulation. Jacob and Monod’s operon concept [23], coupling signal perception with a genome response in the form of gene activation or repression, proved to be universal for all cellular organisms. In bacteria, the signal perception is directly coupled with an immediate genome response executed by transcriptional regulators, which interface with the general transcription apparatus and couple gene expression with signal transduction.

Two systems were developed by prokaryotes to regulate gene expression in response to stimuli. The first and the simplest one is the so-called one-component system. Transcriptional regulators of these systems are the most abundant and demonstrate remarkable structural similarity, containing both DNA-binding domain and a sensor domain, which can bind small-molecule ligands or metabolites. One-component systems are the most ancient signal-processing systems and are extremely diverse in relation to their function [24].

Two-component systems originate from one-component ones and evolved to respond to environmental stimuli [24]. These systems have more sophisticated structures and process signals from the environment, including metabolites, compounds taken up by the cell, light, temperature, and the redox state. In such systems, histidine kinase is combined in one polypeptide with a membrane-integrated sensor. Once the sensor conformation changes in response to a stimulus, the kinase autophosphorylates and phosphorylates a soluble intracellular transcriptional regulator, which possesses a receiver domain and a DNA-binding domain [25]. After phosphorylation, the transcriptional regulator acquires the property of binding to the target gene to enhance its expression. Many sensory domains are common for one-component regulators and transmembrane histidine kinases [24].

Eukaryotes developed several more systems with complicated cascades of signal transduction from membrane to DNA in addition to the two above signal transduction systems. However, both simple systems of prokaryotes and complex systems of eukaryotes converge on transcriptional regulators or TFs, which can act as activators and repressors. TFs convert incoming internal or environmental information into gene expression output, allowing the cell to adapt to changing conditions.

Promoters act as integrators of different regulative signals provided by transcriptional regulators and are absolutely essential for activation of transcription. Recruitment or driving away RNAP from a promoter is, by and large, a key and unified principle underlying the regulation of transcription from unicellular prokaryotes to vertebrates.

TFs are mostly homodimers in prokaryotes and heterodimers in eukaryotes. Both homo- and heterodimeric TFs occupy distinct and often single sites in a promoter, and each subunit recognizes a DNA sequence of 8–10 base pairs. A couple of sites is known as an operator or a box in prokaryotes. In multicellular eukaryotes, the sites of TFs often occur in many copies over extended DNA regions and are found in enhancers, locus control regions (LCRs), developmental enhancers, and superenhancers and may be tens, hundreds, or even thousands of kilobases away from the transcription start site (TSS), depending on how advanced the organism is evolutionarily.

Another common feature of both simple and complex systems is that binding sites for both activators and repressors may occur together in one regulatory DNA sequence. In prokaryotes, a good example is provided by the system of the lac operon. The lacZ promoter harbors two binding sites for a Lac repressor, which is a product of the lacI gene and acts to prevent lacZ transcription in the absence of lactose. In addition to LacI binding site, the lacZ promoter has one binding site for the transcriptional activator known as CAP or CRP (catabolite activator protein or cAMP receptor protein), which acts to stimulate lacZ transcription at lower glucose concentrations [26]. Complex systems of multicellular eukaryotes, for example, embryonic enhancers (initiators) of the HOX genes, harbor many sites for different repressors and activators [27]. Embryonic enhancers respond to morphogenetic gradients of maternally loaded proteins and other regulators, and the balance of sites for repressors and activators determines whether or not the gene is expressed in a particular body segment [27].

3.2. Separation of Enhancers from the Promoter with an Increase in Genome Size during Evolution

Binding sites for regulatory proteins evolved to be farther away from the TSS as genomes grew in size and complexity. For example, the genome of Escherichia coli consists of 4.6 Mb of DNA, and the binding sites for the PhoB protein, the master regulator of the Pho regulon, which is responsible for phosphorus assimilation in bacteria, are in the immediate vicinity of the −10 and −35 core promoter elements or even overlap them [28]. A direct physical contact can thus be established between the PhoB regulator and the RNAP–σ⁷⁰ complex [29].

The unicellular eukaryote Saccharomyces cerevisiae has a 12.1-Mb genome, and the binding sites of regulators in the enhancer-like UAS [3] are usually several hundreds of base pairs away from the core promoter, at maximum 1 Kb [30,31]. In this case, the interaction of the activator with general TFs and RNAP requires a loop formation between the enhancer and promoter [32] and the Mediator complex, a crucial eukaryotic multi-component transcriptional coactivator [33]. An increase in the distance between the UAS and the core promoter results in loss of function [34]. The observation provided a basis for designing an elegant reporter system, which makes it possible to monitor long-distance interactions between regulatory elements and proteins in the genomes of higher eukaryotes; its efficiency was well demonstrated with transgenes [32,35].

The majority of known enhancers are within 10 kb of the promoter in invertebrates, such as Drosophila, whose genome is 175 Mb and is about 20 times smaller than the human genome. At the same time, in this organism, up to 20% of all enhancers are long-range enhancers, which may jump over several genes to reach their target promoters [36,37]. The mean number of enhancers interacting with one promoter was estimated at five per promoter in Drosophila, like in humans [1,38]. The svb and cut Drosophila genes provide well-characterized examples of the loci where long-distance interactions between enhancers and promoters are involved in the regulation. Each of the genes has at least five proven enhancers, the farthest of which are approximately 80 kb away from the TSS [39,40]. This redundancy of transcriptional enhancers confers phenotypic robustness because the transcriptional output is guaranteed even in the case of a mutation or accidental loss of one enhancer [39]. Another strong long-range (235 kb) interaction in Drosophila was found between the regulatory regions of scyl and chrb [36]. An extreme example of regulatory complexity in terms of spatial interactions is provided by the Drosophila BX-C locus, which is approximately 320 kb in total size and regulates the development of the posterior half of the fly body. Both short- and long-range enhancer–promoter interactions were detected in this locus. These interactions are often mediated by boundary elements and insulators, which are located between BX-C domains [41]. An additional complexity level is added to the BX-C by Polycomb response elements (PREs), which are dispersed throughout BX-C and are associated with epigenetic memory and silencing of genes by forming spatially independent domains [42,43,44]. However, such regulatory complexity is an exception rather than a rule in Drosophila.

Long-range interactions reach a huge scale in vertebrates and, in particular, in humans, whose genome is 3300 Mb. For example, the human ZRS enhancer acts over 850 kb to regulate the Shh promoter [45], and enhancers of the SOX9 gene are 900 and 1300 Kb away from its promoter [46]. In addition to classical enhancer–promoter interactions, promoters acting as enhancers [47,48,49,50] and extensive enhancer–enhancer and promoter–promoter interactions [51] were detected in human, reflecting the complexity that is characteristic of the regulation in higher eukaryotes and is associated with complex 3D chromatin organization of the genome.

Thus, mammals have the largest genomes, the most complex regulation, and the most distant enhancers among all organisms. The estimated number of enhancers is up to ten actual enhancers per one gene in mammals [1].

3.3. Distancing of Enhancers from the Core Promoter in Evolution

While the distances between regulatory regions obviously grew over the course of evolution, it is not quite clear if this change provides an advantage to the organism. Moreover, an increase in distance between cis regulatory elements requires the development of special complex mechanisms that ensure the convergence of these elements in space (discussed in part 9). From this point of view, one could speculate that the increase might be a consequence of Kimura’s neutral evolution rather than adaptive evolution [52]. The appearance of additional DNA between enhancers and promoters and the increase in distance between them might arise for the same reasons as the appearance and increase in size of introns in evolution of eukaryotes and may be associated with a decrease in effective population size as organisms gradually became more complex [53]. The insertion of mobile elements serving as sites for homologous recombination during unequal crossing over might be another factor that increased the distance between cis regulatory elements in evolution, leading to a duplication of the DNA between cis regulatory elements [54].

On the other hand, one could speculate that a bacterial promoter-proximal upstream regulatory region is a tool with a very limited regulating capacity. Gene expression requires much more complicated control in Metazoa [55]. A separate location of enhancers provides evolutionary plasticity to the regulation, for example, allows the situations where one gene is regulated by multiple enhancers or, vice versa, expression of different genes is regulated by one enhancer.

4. Principles of Transcription Activation in Eubacteria

The origin of transcription regulation is in the prokaryotic world. In order to see whether parallels could be drawn between enhancer-like mechanisms found in prokaryotes and in higher eukaryotes, it is required first to consider the general principles of transcription activation in eubacteria.

4.1. The Role of the Promoter Structure

Bacterial promoters have a modular structure the same as that of eukaryotic promoters and appear to have evolved by a pick-and-mix mechanism, which produced the hierarchies of promoter activities that permit a 1000-fold dynamic range in transcript initiation frequencies [56,57,58]. A promoter may be activator dependent because it carries one or more “imperfect” elements so that the initial RNAP binding is reduced. Furthermore, some promoters have so non-consensus sequences that they exhibit no activator-independent activity. The purpose of activators in all these cases is to compensate for the weak or missing core promoter elements by mediating the interaction between DNA and the σ and/or α subunits of RNAP. Conversely, when all core promoter elements required for transcription are present in the promoter sequence (the so-called strong constitutive promoters, i.e., P_tac [59]), the initiation of transcription may occur without activators. Thus, activator-dependent promoters in bacteria are simply the promoters that have weak non-consensus core promoter sequences, in which activator–weak promoter–RNAP interactions replace strong promoter–RNAP interactions.

Bacteria use two distinct sets of mechanisms for transcription activation focusing either on the promoter or on RNAP. In RNAP-centric mechanisms, some factors interact with RNAP to alter its promoter preference. For example, alternative σ factors do this [60,61]. In other cases, the RNAP–σ⁷⁰ interaction is altered by activators that redirect the complex to a certain class of promoters, the process being known as the RNAP appropriation [62].

In promoter-centric mechanisms, activators interact with the promoter to improve the ability of RNAP to initiate transcription, either replacing missing core promoter determinants or blocking the action of a repressor, e.g., by competing for the same binding site. Two common ways are used in bacteria to activate transcription. One is related to the interaction of the C-terminal domain (CTD) of the α subunit of RNAP with the UP element located upstream of the core promoter. The second one involves the interaction of the σ4 domain of σ⁷⁰ with the −35 element. An activator interacts with the σ4 domain and promotes its binding to the “non-perfect” −35 hexamer. Only one of the two subunits of the activator on DNA makes contact with the σ4 domain of σ⁷⁰ upon activation because RNAP carries only one σ factor. To make contact with σ4, activators have to be precisely positioned on DNA and bind to the same face of the DNA helix as the σ4 domain does. The binding of an activator to a site adjacent to or overlapping the upstream end of the −35 hexamer is often triggered by phosphorylation of a receiver domain of the activator or by binding to an activatory ligand, for example, a sugar. This is the case with the AraC regulator at the P_BAD promoter when arabinose binding changes the AraC properties and converts AraC from a repressor to an activator (see Section 5.2 and [63]).

4.2. The Role of the Promoter Conformation

One more fundamental form of regulation is based on changing the DNA topology and is widespread from bacteria to human. In bacteria, activation of expression by changing the promoter conformation is a special case of this regulation. Experiments with synthetic promoters that form an angle of 90° in vitro showed that this sharp DNA bend may facilitate the binding of upstream DNA to the backside of RNAP [64]. In vivo, some activators alter the conformation of the promoter DNA to bring distant DNA regions to each other. For instance, target promoters of MerR and other related activators are defective because of the nonoptimal spacing between the −35 and −10 elements [65]. In the situation where RNAP initially binds to the UP element and −35 element, the −10 element is misplaced and promoter melting, unwinding, and interaction of the −10 element with the σ2 domain of the σ⁷⁰ subunit are hindered. This hindrance is overcome by the activator MerR, which causes a localized distortion to align the −10 element with the −35 element and thus triggers transcription activation [66].

Binding of one activator to DNA and subsequent DNA bending may facilitate a direct interaction of another activator with RNAP. For example, the CAP activator-induced bending was shown to modulate the location of binding of the activator MalT and to trigger transcription activation from the malK promoter [67].

Altering the promoter conformation in bacteria may involve nucleoid-associated proteins (NAPs), which are evolutionary predecessors of eukaryotic histones. Their binding to a distal promoter results in translocation of superhelical energy from upstream binding sites (supercoiling-induced DNA duplex destabilized region) to the promoter-proximal region, which is unwound within the open complex, and may change the promoter activity by several dozens of times [8,15,68,69].

NAPs, H-NS, and FIS may interplay with special shape-sensitive activators containing winged helix-turn-helix (wHTH) motifs. wHTHs use an alpha helix to read the base sequence in the major groove while inserting a beta sheet «wing» into the adjacent minor groove [70,71]. wHTH binding to relaxed DNA appears to generate a locally supercoiled state, which assists promoter activation by relocating supercoiling destabilization of DNA strands.

5. Enhancer-Like Mechanisms in Eubacteria

In this part we provide specific examples related to bacterial UAS-like regulation. These include direct stimulation of the RNAP complex by an activator, activation loops, repression loops, activator tracing/scanning from a landing enhancer-like site to a core promoter, and rings that mediate interactions between distant DNA regions.

5.1. Stimulation of RNAP by an Adjacent Activator

The simplest mode of the activator function is when an activator compensates for the weak core promoter sequence directly, by interacting simultaneously with RNAP and a promoter and thus increasing the frequency of the appearance of the initial RNAP–promoter complex. An example of the direct interaction of an activator with the RNAP–σ⁷⁰ complex is provided by inducible genes of the Pho regulon, which is involved in the cell response to deprivation of phosphorus (Figure 1A). The Pho-regulon genes are regulated by the activator PhoB and usually have weak –35 elements; PhoB is required to stably recruit the RNAP–σ⁷⁰ complex to such promoters. Pho-boxes, on which PhoB is bound, are located upstream of the core promoter or overlap the –35 element. PhoB is phosphorylated upon induction and binds to the Pho-box as a dimer. RNAP is not recruited to the promoter in the absence of the activator [72].

Figure 1. Enhancer-like mechanisms of transcription regulation in prokaryotes. (A) Stimulation of RNAP by an adjacent activator. Pho-regulon genes are responsible for adaptation of bacteria to phosphorus starvation. Expression of the Pho-regulon genes is regulated by a two-component system through direct contact of an activator PhoB with the RNAP–σ⁷⁰ complex. Lack of phosphorus in the medium leads to phosphorylation of PhoB (PhoB-P), which then acts as an intracellular soluble response regulator. PhoB-P as a dimer binds to its binding sites at the promoters of Pho-regulon genes (pho-boxes), thus compensating for the weak −35 element, and recruits the RNAP–σ⁷⁰ complex to the promoters. The physical interaction of PhoB-P with σ⁷⁰ in the region of the −35 element is shown. (B) Multiple activation of RNAP by several activators. Regulation of the araBAD promoter is shown. The upper panel illustrates the repression state in which the promoter is repressed by the AraC regulator. The lower panel shows the transition from repression to activation, which occurs by changing the conformation of the promoter DNA. The repression loop is broken by the CAP protein; the promoter is bent by CAP and the site of AraC binding is switched. Changing the promoter DNA conformation allows the activators to interact with RNAP. (C) Repression loop. Two alternative conformations of the lac operon promoter are shown. The lac-operon is regulated by a one-component regulatory system. The system includes the Lac repressor protein (LacI), which acts as both a lactose sensor and a transcriptional regulator. In the absence of lactose (or in the presence of glucose), the Lac repressor binds to several operator sites at the P_lac and negatively regulates expression of the lacZYA operon through the formation of a repression loop between the operator sites. The binding of lactose or IPTG affects the LacI conformation, causing LacI to disconnect from the operator sites, and the repression loop disappears. (D) Activation loop. An activation loop forms when glnALG operon expression is upregulated by the NtrC regulator. When ammonia is depleted in the medium, NtrC undergoes phosphorylation, assembles into heptamers, and binds to an upstream enhancer-like sequence at the promoter of the glnALG operon. The loop forms between the NtrC binding sites and core promoter. NtrC remodels the RNAP–σ⁵⁴ holoenzyme into the open complex. The remodeling reaction is ATP dependent. Nucleoid-associated proteins (NAPs) may facilitate the looping mechanism by bending the DNA between the enhancer and promoter. (E) Activation ring. The scanning/tracking mechanism of regulation of bacteriophage T4 late genes is shown. A component ring consisting of three gp45 phage polypeptides is put onto DNA at the enhancer region by the loader complex gp44–gp62. A part of the split σ-subunit known as the gp55 protein (σ55) tracks along DNA as a gp45 ligand. At the gene promoter region, the ring activation complex encounters RNAP with gp33 (a second part of the split σ) and mediates the transition of the closed complex to the open one and activation of transcription.

As evident from structural studies, the interaction of PhoB with σ⁷⁰ causes a reorientation of the β-flap tip helix and σ4 domain of σ⁷⁰ in the surroundings of the nascent RNA exit channel, suggesting that the remodeling of RNAP might alter its opening to facilitate the exit of the mRNA, thus reducing the probability of abortive transcription [29,73]. The activation of this type is highly similar to a eukaryotic system known for housekeeping genes, where activator sites are close to the core promoter and where fast RNAP turnover takes place.

Another well-studied example of direct interactions with RNAP is a mechanism whereby the λ cI repressor activates expression of its own gene from the native λ P_RM promoter [74]. The activation by the cI protein also involves the σ4 domain of σ⁷⁰. In this case, not accompanied by any structural change in any protein participant, the cI repressor and σ⁷⁰ are immediately adjacent to the DNA and interaction proceeds through small clusters of amino acid side chains.

5.2. Multiple Activation of RNAP by Several Activators

This mechanism is observed for complex promoters, which combine DNA binding sites for several different regulators. At least two regulators act in the simplest case. The first one is an activator, and the second one is of dual nature, acting as an activator or a repressor depending on the situation.

The araBAD promoter provides an example of such complex regulation. This promoter is regulated by two TFs, the activator CAP and AraC, which is of dual nature. P_BAD has two direct-repeat half-sites, I1 and I2 (8 bp each), for the AraC activator-repressor. The half-sites are centered at position −52.5 bp (five turns of the double helix in a B shape, of 10.5 bp each) and partly overlap the −35 region. One more AraC half-site, O2, is 211 bp upstream (−265 to −294 from TSS). When arabinose is absent, the subunits of the AraC interact with two nonadjacent DNA sites and a repression loop forms between them [75]; i.e., AraC behaves as a repressor in this case. In the presence of arabinose, one AraC subunit disconnects from O2 and binds to the I2 half-site, which is adjacent to the I1 half-site. This is accompanied by the loss of the repression loop, and AraC gains the ability to interact with RNAP; i.e., AraC behaves as an activator in this case (Figure 1B).

Thus, the reorientation of one subunit of AraC to another site regulates the DNA looping in P_BAD and specifies the activator–RNAP contacts. This mechanism reminds the mechanism of loop formation by a cohesion complex in eukaryotes, when the orientation of two CTCF protein binding sites specifies the direction of loop formation. Changing the polarity of one CTCF site compromises enhancer–promoter interactions [76,77,78,79]. Moreover, switching the interaction from one CTCF site to another can specify regulatory effects in eukaryotes, like in the case of P_BAD [80,81].

The site for the other regulator of P_BAD, the activator CAP, is upstream of the AraC site and is centered at position −93.5. Although there is no detectable interaction between AraC and CAP upon their binding to DNA [82], experiments in vitro showed that CAP facilitates the breaking of the DNA loop if situated on the same face of DNA no more than 4 helical turns upstream of the loop-formation site [83]. CAP does not significantly activate P_BAD when acting alone, in the absence of AraC [84]. Conversely, although AraC binding to both I1 and I2 sites in the absence of CRP is sufficient to stimulate the open complex formation, CAP is required for maximal promoter activity [84,85] and causes an eightfold increase in the rate of RNAP open complex formation on wild-type P_BAD in vitro [85]. Thus, CAP functions as a typical eukaryotic coactivator on P_BAD, enhancing the action of AraC, but is capable of binding to DNA unlike eukaryotic coactivators.

What mechanistic consequences arise when two different activators interact at once with RNAP upon activation? One scenario is that each activator induces its own specific rearrangement of RNAP and each of these rearrangements contributes to the final organization of the complex and transcription output. Another scenario is that both activators do the same job, that is, each of them repeats the other. Experiments testify in favor of the latter; i.e., both CAP and AraC stimulate the recruitment of RNAP as well as the transition to the open complex [63].

Upon activation, the CAP–AraC–RNAP complex substantially bends the DNA, and it is very likely that CAP is positioned close to RNAP despite the fact that the AraC binding site lies between the binding sites for CAP and RNAP (Figure 1B). One α subunit of RNAP utilizes its C-terminal domain to contact the AR1 (activation region) of the closer CAP subunit, and the σ4 domain of σ⁷⁰ presumably contacts the AR3 region of CAP [84]. Both AraC and σ⁷⁰ interact with the −35 element, but AraC interacts with σ⁷⁰ very slightly [86]. In contrast, it was shown in vitro that AraC binds RNAP very tightly [87].

Thus, the example above demonstrates that bacteria already display complex interplay of different regulators on the enhancer-promoter region.

5.3. Repression by Looping Prior to Activation

Regulation of gene expression by looping is common in eukaryotes during development [88,89,90], but specific molecular details of these mechanisms have just begun to appear. In this section we give examples of repression loop formation in prokaryotes to illustrate in detail how the repression can be replaced by activation and how this can be related to the DNA topology.

Repression loop formation was studied in bacteria in detail using several examples. The mechanism of how repression loops form in the araBAD promoter was described in Section 5.2 (Figure 1B). The loop formation between the repressor sites was confirmed for P_BAD using microscopic [91] and biochemical methods [75,92]. Repression was impaired when the operator sites are on different sides of the DNA helix, emphasizing a role of the proper DNA topology in repression loop formation [93,94]. It was shown that repression loops of up to 500 bp can form using AraC [94].

Using microscopy, repression loop formation was also demonstrated for the lac promoter [95] (Figure 1C) and the bacteriophage λ cI-repressor-operator system [96,97]. The findings confirmed that this way of regulation is universal for cell and viral genes. In the case of P_lac, when loops between operator sites are stabilized by the Lac repressor, RNAP appears not to be trapped or occluded in these loops [98]. The mechanism of repression probably consists in that RNAP cannot initiate from the tightly bent promoter because the enzyme lacks sufficient binding energy to untwist the constrained DNA within the loop [98]. The formation of a DNA loop was favored by correct phasing of the two lac operators on DNA [95,96,97].

Repression loops are stabilized in the presence of NAPs, which are functional evolutionary ancestors of eukaryotic histones and HMG proteins, and are destabilized by NAP deletions [99,100,101]. All of these examples have a common feature: in the case of repression, the loops are relatively small, no longer than several hundreds of base pairs [102].

Thus, the formation of a repression loop between DNA-binding transcriptional regulators associated with the correct DNA topology is a universal property of both prokaryotes and eukaryotes.

5.4. Activation Loops

An example opposite to the above two ones concerns the functioning of the so-called bacterial enhancer-binding proteins (bEBPs). Activators of this class bind remotely upstream of the TSS and induce the formation of large activation loops. bEBPs are special hexameric ATPases that use ATP hydrolysis to remodel theRNAP–σ⁵⁴ complex [103]. In addition, it was shown that bEBP binding sites can isolate or filter readthrough transcription from upstream genes [13,104]. This property makes these sequences similar to eukaryotic insulators.

There are two major families of σ factors in bacteria, based on sequence homology and the mechanism of action: σ⁷⁰ and σ⁵⁴. Upon binding to the promoter, the RNAP–σ⁷⁰ holoenzyme forms a closed complex, which can spontaneously isomerize to form an open promoter complex [60,61]. Due to this property, σ⁷⁰ is involved in the bulk of cell transcription during active growth, including all essential genes in E. coli. Unlike σ⁷⁰, σ⁵⁴ directs transcription of tightly regulated genes involved in the cell response to stress, including nitrogen and carbon starvation, antibiotics, and loss of membrane integrity [95]. σ⁵⁴ binds to RNAP to form a stable closed complex, which rarely spontaneously converts to the open one [105,106]. bEBPs facilitate the transition (isomerization) of the closed complex to the open one using ATP hydrolysis [107], resulting in further DNA melting and loading of the template strand of DNA into the RNAP active site [108,109]. Energy is transferred to the closed complex through a physical interaction between σ⁵⁴ and the AAA+ activation domain (AD) of the bEBP [110,111]. This is functionally analogous to enhancer-dependent initiation of eukaryotic RNAP, which requires an input of energy from ATP hydrolysis provided by TFIIH [112,113].

The best-studied example of activation loop formation by bEBPs is provided by the system of the glnALG operon (Figure 1D). When nitrogen is in excess, the nitrogen regulator I (NRI) NtrC exists in a non-phosphorylated form as a dimer and only weakly binds to a pair of its tandem sites in the upstream promoter of glnA. When nitrogen is in deficiency, NtrC is phosphorylated by NRII membrane kinase and occupies its binding sites, ~90 and ~140 bp upstream of the glnA TSS. The phosphorylated NtrC dimer forms heptameric rings and uses the energy of ATP hydrolysis to physically remodel the RNAP–σ⁵⁴ (NtrA) complex. Interestingly, the distance from the NtrC binding site to the TSS should be no less than 70 bp to allow activation. The distance can be increased up to 1000 bp or more without impairing the activation appreciably [114]. Recently, activation was found to occur even when the distance is 6000 bp, although being less efficient [115].

Electron microscopy demonstrated the presence of looped DNA in cis, with the length of the loop being consistent with the spacing between the NtrC binding sites and the core promoter. Once the closed complex is converted to the open one, the interaction of a bacterial enhancer with the promoter is terminated, the DNA loop is opened, and σ⁵⁴ dissociates after promoter clearance by RNAP [116,117].

Studies with a system of two plasmids in which one plasmid carried the promoter and the other one had activator binding sites showed that activation is possible not only in cis, but also in trans, and that the in trans interaction was enhanced with the increasing NRI concentration [117]. A similar concentration-dependent effect was detected in experiments with the native locus, where NRI was shown to be capable of activating transcription even in the absence of an upstream DNA binding site when its concentration is four- to fivefold higher than that required in the presence of the upstream site [118]. These experiments indicate that loop formation per se is not essential for NRI-dependent activation and that NRI binding sites near the promoter are necessary for increasing the local concentration of the activator in the vicinity of the promoter.

The most interesting finding is that the closed RNAP–σ⁵⁴ complex occurs on the promoter of the glnALG operon even in the absence of nitrogen deficiency and in the absence of the NRI activator [119]. This feature makes the glnALG operon similar to the eukaryotic genes that are regulated by stalling RNAP and are constitutively associated with the RNAP–TFIID complex even when these genes are inactive [120].

The relationship between activation and helical phasing was demonstrated in the case of the activation loop as well, although not for the glnALG operon. Binding sites of the NRI activator are in the DNA major groove on one side of the DNA helix, while σ⁵⁴ binding sites are in the major groove on the other side of the DNA helix in the glnALG promoter [119]. This is in contrast to other regulation systems described above (lambda repressor, lac repressor, AraC), in which two remote protein binding sites appear to lie on the same helical face when functioning optimally. A proper helical facing might be less relevant in the case of the glnALG promoter because the promoter has three activator binding sites and monomers bound at each site possess flexible activation domains pointing in opposite directions [119]. This makes activation systems based on NtrC very flexible. At the same time, a σ⁵⁴–NRI-regulated system with two NtrC binding sites was indeed face-of-the-helix dependent in another organism [121]. Thus, particular promoter properties may be determined by the number of bEBP binding sites available.

How frequent may such type of eukaryotic-like regulation be in prokaryotes? σ⁵⁴ is present in an estimated 60% of bacterial genomes [122]. In E.coli, σ⁵⁴ was experimentally shown to regulate 135 of 4 288 protein-coding genes [123]. A number of bEBPs have been characterized and many σ⁵⁴-dependent promoters have been predicted to date [103]. These promoters have bEBP-binding sites up to 3 kb upstream and 1.5 kb downstream of the TSS. Some of them were tested experimentally, and loop-dependent activation was verified. It was found that the looping mechanism at these promoters is facilitated by NAPs that bind between the enhancer and promoter and bend DNA [102,124].

Thus, the activation loop mechanism in bacteria is closest to what we expect from eukaryotic enhancer-dependent regulation. At the same time, the formation of an activation loop is not the only mechanism that spatially associates distant DNA regions in prokaryotes.

5.5. Activation Rings

Ring complexes that encompass the DNA molecule and slide along it are widely used in bacteria and eukaryotes (PCNA, cohesin-condensin, etc.). The role of a sliding ring in regulating gene expression by a scanning/tracking mechanism was discovered and studied on late genes of the bacteriophage T4. A sliding clamp comprised of a trimer of gp45 polypeptides is loaded to a distal enhancer-like site by the gp44–gp62 clamp loader complex (Figure 1E). This site can be upstream or downstream of a gene. The enhancer site is not a specific nucleotide sequence in this case, but a nick introduced in the non-transcribed strand and associated with normal DNA replication or recombination. The activation ring slides from the loading site to the promoter of the gene, carrying a part of the split σ-subunit as a ligand (gp55 protein or σ55, a functional analog of the σ2 domain of σ⁷⁰) [125]. On the gene promoter, the activation ring binds RNAP with the gp33 protein (another part of the split σ-subunit) attached to RNAP. gp33 binds to the so-called β-flap, a flexible tip of the β subunit of RNAP [126]. The β-flap normally binds to the σ4 domain of σ⁷⁰, and the σ4 domain recognizes the -35 element. Thus, gp33 mimics the σ4 domain of σ⁷⁰. The promoters of late T4 genes have a −10 element only. Thus, a blocking of the σ⁷⁰-subunit binding site on RNAP by gp33 allows transcription to be activated from late T4 promoters lacking the –35 element. Interactions of the gp45 trimer with RNAP allows the transition of the σ55 (gp55)-RNAP complex from the closed state to the open one and activation of transcription [127].

5.6. Rings Mediating Convergence of Distal DNA Sites

Condensins of prokaryotes and eukaryotes and cohesins of eukaryotes are collectively called SMC proteins and are evolutionarily related conserved multi-subunit complexes that form rings of various structures. Condensins/cohesins are vitally important for cell division and DNA repair in both domains of life [128,129,130]. In addition, cohesins were found to be important for the regulation of gene expression in G1 in eukaryotes, acting through a mechanism of loop extrusion. It became clear recently that a similar mechanism exists in bacteria [131,132,133,134]. Replication in B. subtilis was shown to proceed with the help of the loop extrusion mechanism. Upon replication, two large separate hairpins form and condensin rings move in an ATP-dependent manner along contiguous DNA segments, processively enlarging the DNA loops [133] at rates of >50 kilobases per minute [132].

Although more profound exploration is required to broaden the understanding of the role that SMC complexes play in regulating gene expression in bacteria, some SMC–transcription interactions have been demonstrated [135,136]. For example, it was shown that the klesin subunit of SMC is able to interact with at least one two-component sensor kinase whose interaction was confirmed functionally, as well as with other potential kinases and transcription regulators [135]. Head-on transcription was shown to interfere with SMC translocation over the chromosome [136]. A similar effect of transcription on cohesins was recently observed in eukaryotes [137].

The examples and explanations above make it obvious that the details of molecular mechanisms of regulation of bacterial transcription have analogues in the world of eukaryotes.

6. Mechanisms of Transcription Activation in Archaea

Current models of eukaryogenesis have undergone significant revision recently [138]. Modern research places the origin of eukaryotes in the archaea clade. The pattern formally corresponds to the two-domain tree of life [139,140]. There is no doubt that eukaryogenesis was a more complex event than previously thought, uniting the genomes of not two, but at least three prokaryotes [138]. Archaea combine the features of bacterial and eukaryotic regulation [55]. However, their mechanisms cannot be directly compared with bacterial ones because the principles of activation are fundamentally different, due to the eukaryotic-like structure of their transcriptional apparatus. In addition, their activation occurs in chromatin, which is fundamentally different from bacteria. Altogether, this makes archaea comparable with eukaryotes rather than with bacteria.

6.1. Archean Transcriptional Apparatus: A Combination of Eukaryotes and Bacteria

Archaea are characterized by a simplified version of eukaryote-like general transcription machinery including RNAP and general TFs (GTFs), TBP (TATA-box binding protein), TFB, and TFE (the functional analogs of eukaryotic TFIIB and TFIIE, respectively). Like eubacteria, archaea have one RNAP, while eukaryotes have three. Of these, RNAP most closely resembles the archaeal enzyme. Ten of 13 subunits of RNAP are identical between eukaryotes and archaea, and only 5 are identical to bacterial RNAP [141,142].

Eukaryotes inherited the transcription apparatus and principles of its regulation, including differential transcription regulation by multiple GTFs, TBP, and TFB, and activators/repressors from archaea [55,141,142]. Several archaeal species contain multiple, divergent tbp and tfb genes, and their multiplicity was hypothesized to have a regulatory role at a higher level, reminiscent of alternative σ factors in bacteria [143]. In turn, activators and repressors encoded by archaeal genomes are closely related to bacterial TFs [55,142,144]. Thus, a little space remains in eukaryotes on the transcription regulation mechanisms specific to them, those that have no analogues in bacteria and archaea. Such mechanisms should apparently be associated primarily with remote communication of DNA regulatory elements scattered in the genomes of multicellular eukaryotes over long distances.

Bacterial RNAP initiates transcription without the involvement of additional factors with the exception of the σ factor, whereas archaeal RNAP requires a minimal set of GTFs, TBP, and TFB. The archaeal core promoter has minimal structural differences from the eukaryotic core promoter [145]. The GTFs of archaea and eukaryotes (TFB/TFIIB, TFE/TFIIE, and MBF1) share no similarity between each other and with bacterial σ factors beyond the presence of distinct DNA binding domains (DBDs). The same situation is observed for gene/operon-specific TFs. Thus, the DBDs might have been independently inherited from a common ancestor in the bacterial and archaeal/eukaryotic lineages. During subsequent evolution, the similarity between archaeal and bacterial regulators might have been established and maintained through multiple horizontal gene transfer events [146].

TFs of archaea are much more similar to bacterial ones than to eukaryotic ones. Approximately 53% of all TFs identified in archaeal genomes have at least one homologue in bacteria, as opposed to 2% having a eukaryotic homologue [147]. However, there is an underrepresentation of ligand-binding domains in the archaeal TFs. This may be compensated for by the formation of different combinations of monomers, like what is observed in eukaryotic transcriptional machinery [146,147]. However, sensing domains are often unique for archaea [144].

6.2. Enhancer-Like Mechanisms in Archaea

Most archaeal repressors function as in bacteria by abrogating access of one or more components of general transcription machinery to the core promoter [148]. While activation mechanisms are eukaryotic-like and based on the recruitment and stabilization of TBP or TFB to their respective promoter elements [149,150,151,152,153,154]. Some activators are able to interact with both TBP and TFB [155,156] (Figure 2). Moreover, it was shown that one activator can interact with five different TBPs [156]. In contrast to repressors, which can function in combination with any promoter strength, archaeal activators are generally associated with promoters that have larger deviations from the consensus BRE and/or TATA box sequences, as in the case of bacteria [152,157,158].

Figure 2. An integrated circuit of the initial stages of transcription activation in archaea. When an incoming signal arrives, a transcriptional activator acquires the ability to bind to its DNA sites in the upstream activating sequence (UAS) region in the promoter of a target gene. Activator binding occurs both in the primary site, which is closest to the promoter, and auxiliary sites, which are distal to the primary site. The appearance of nucleation sites near the core promoter allows the activator to guide TBP to the TATA box (or TFB to the BRE element). This is accompanied by DNA bending in the promoter region, like in the case of activation in bacteria, bringing the regulatory elements of the promoter closer together. The activator may leave the complex when the complex proceeds to the stage of transcriptional bubble or stay in the complex until promoter clearance by RNAP (not shown).

Activator binding sites in archaea are preferentially located immediately upstream of or partly overlap the BRE element and are often called UASs, as in yeasts [153,159]. At the same time, some of the sites may occur in the distal promoter far from the TSS [160], like in higher eukaryotes. There may be several binding sites in the proximal promoter in the case of archaeal UAS. The site that is closest to the promoter is called the primary site and is critical for activation, while the other upstream sites serve as auxiliary ones (including those located after the TSS). This is similar to the findings in higher eukaryotes, in which the first enhancer in an enhancer chain was shown to be more important. This enhancer forms more stable contacts with the promoter and is important for initiating a transcription bridging of the promoter with other regulatory elements in the locus [161]. The auxiliary sites in archaea are necessary for nucleation and act to increase the concentration of the activator near the primary site and the core promoter. Both wild-type 21-bp spacing (two DNA helical turns) between the primary site and the TATA box and the proper facing of the activator and TBP on the same side of the DNA helix appear to be critical for promoter activation in archaea [153,159].

Distal sites are often utilized by Ribbon-Helix-Helix (RHH)-family activators [160]. It is unknown currently whether the RHH activators function through large loop formation, as it was described for bacteria in Section 5.4. (Figure 1D), or RHH activators employ a wrapping mechanism and reel up the DNA around themselves. It was shown that activators of the RHH family readily oligomerize into octamers upon crystallization [162,163,164]. The latter variant resembles a linking mechanism observed in eukaryotes (Figure 3B). The necessity of oligomerization for activation was shown for at least some of the RHH proteins [165]. Hence, the following model of wrapping/linking was proposed. The RHH proteins tend to preferentially bind to high-affinity sites located in the distal promoter region. At higher regulator concentrations, RHH proteins begin to oligomerize along DNA towards the core promoter, leading to the occupation of degenerate (low-affinity) binding sites and subsequent activation. In other cases, the process is accompanied by repression when oligomers presumably interfere with preinitiation complex (PIC) assembly [165,166].

Figure 3. Mechanisms of transcription regulation by enhancers in eukaryotes. (A) The regulatory system of yeasts. The UAS works in orientation-independent manner only when placed upstream of the promoter, but not downstream of the gene or in an intron. The yeast UAS is usually close to the promoter (at the distance less than 1 Kb). (B) Linking/Chaining mechanism of enhancer–promoter communication. A linking factor oligomerizes from the enhancer to the promoter using some anchored proteins to bridge the distal regulatory elements. This cascade of recruitments starts at the enhancer and proceeds until reaching the core promoter, where the preinitiation complex (PIC) is recruited. Alternatively, oligomerization factor-containing complexes might bind directly to relatively hyperacetylated chromatin in the absence of any anchor protein, possibly, by recognizing a signature of active loci, such as a specific pattern of histone acetylation. (C) Scanning/Tracking mechanism of enhancer–promoter communication. An activator binds to the enhancer region and utilizes its activating domain to recruit the other components of the initial transcription complex to the region of the enhancer, the set including basal factors, RNAP, and coactivators. RNAP moves from its initial binding site at the enhancer towards the core promoter, producing non-coding RNAs, which may also represent a platform for the binding of regulatory proteins and coactivators. Although the recruitment of the factors relevant for the initiation of transcription occurs at the enhancer region, transcription is not effectively initiated because the enhancer lacks all motifs necessary for the efficient binding of basic factors that ensure the recruitment of RNAP. At the core promoter, the total set of motifs favorable to preinitiation complex assembly is available and, after the formation of a stable transcriptional complex on the core promoter, the DNA loop between the core promoter and the enhancer is stabilized. The process of RNAP movement towards the core promoter may be accompanied by a wave of unidirectional spreading of histone acetylation through the DNA that separates the enhancer and the promoter (not shown), as well as by histone remodeling in the promoter region. Thus, scanning/tracking can be accompanied by the formation of an acetylated domain between cis-regulatory elements, as well as the formation of regions with a reduced density of nucleosomes. (D) Genome architecture of eukaryotes (schematic Hi-C maps are shown). Yeasts are devoid of TADs and have only self-associating domains (micro-TADs), which are associated with gene loops, and ~200 Kb TADs, which are associated with DNA replication. In Drosophila, the architectural proteins can mediate the interactions of cis-regulatory elements inside TADs (indicated with 1). Additionally, flies have long-range looping interactions, which are associated with the function of Polycomb silencing complexes (the long-range looping interactions are indicated with 2). Humans possess all of the mechanisms that mediate enhancer–promoter communication in Drosophila, but, unlike flies, mammals additionally have a cohesin-dependent loop extrusion mechanism. The CTCF protein is found at the bases of TADs of different sizes in mammals and is believed to be a master regulator of the chromosomal architecture, while Drosophila CTCF acts as an ordinary architectural insulator protein. Cohesin-dependent loop extrusion is an evolutionary acquisition of mammals compared with Drosophila (cohesin-CTCF mediated interactions are indicated with 2,3).

One more example of eukaryotic-like regulation involves a transcription activator that has binding sites not only in the proximal promoter, but also in the core promoter. The sequence in the core promoter is located immediately downstream of the TATA element [167], analogous to the BRE^d element in RNAP promoters [168].

Some archaeal regulators appear to be of dual nature and exert opposite regulatory effects at the same promoter in a concentration-dependent manner [169,170,171]. These regulators activate transcription at lower concentrations, when not all binding sites are occupied at the promoter and repress the promoter at higher concentrations. This type of regulation presumably involves the formation of a repression loop [148,172]. The formation of a highly dense region in which DNA appears to be wrapped was confirmed by atomic-force microscopy [173].

Thus, Archaea combine the principles of the regulation observed in the other two domains of life, Bacteria and Eukarya. The regulation by multiple TBP and TFB factors is an intrinsic trait of Archaea that has been fully inherited by Eukarya. The regulation in the chromatin is not considered here; however, it is a second common feature between Eukarya and Archaea. The molecular details of these systems are just emerging, and our understanding lags behind the understanding of bacterial systems. It is possible that more types of sophisticated regulation will be found in archaea in the near future.

7. Mechanisms of Transcription Activation in Eukaryotes

7.1. DNA Topology and Control of Gene Transcription in Eukaryotes

As was mentioned above, the prokaryotic mechanisms resemble or even parallel the mechanisms found in eukaryotes. However, the eukaryotic systems of regulation are generally more sophisticated and include proteins with intricate multidomain structures. Gene regulatory networks are more complex in eukaryotes as a result of several genomic duplications that occurred on the way to vertebrates. The duplications gave origin to paralogs, sometime several. For example, there is a single SMC condensin gene in eubacteria, while multicellular eukaryotes developed several similar complexes with overlapping, but not identical functions. The fixation of a large number of SMC genes in evolution of eukaryotes emphasizes their significance for the function of the eukaryotic genome. Due to huge sizes of higher eukaryotic genomes, some of these complexes, which are known as cohesion complexes, acquired a function in the regulation of gene expression by the loop extrusion mechanism in interphase; their role in enhancer–promoter interactions is now well characterized in eukaryotes [77,80,81].

Like in prokaryotes, the DNA topology has a fundamental control over promoter activity and the ability of TFs to access their target DNA sites in eukaryotes. On the one hand, cis regulatory elements dispersed over long distances should be brought together for proper activation or repression. On the other hand, the abundance of DNA-binding proteins subtly sensing the DNA shape begun to open up in eukaryotes. For example, recognition of the DNA shape by proteins and nucleosomes was recently found to involve the A-tracts as short as three base pairs in the DNA minor groove and received the name of indirect readout [174]. Sensing the electrostatic potential in the minor groove is contrasted to the direct readout mechanism that involves the reading of the base sequence in the DNA major groove. Although examples of such a type of regulation are still not abundant in eukaryotes [175,176] and detailed investigation is necessary, these findings partly explain why the binding sites of eukaryotic regulators are typically 10 nt long and are so degenerate [177].

7.2. Mechanism of Activator Functioning in Eukaryotes

Before discussing the mechanistic ideas about how enhancers work in eukaryotes, it is necessary to consider the mechanisms whereby eukaryotic activators function because the nature of an activator generally determines the way of enhancer action.

A minimal eukaryotic activator consists of an activating domain (AD) and a DNA-binding domain (DBD). When expressed in eukaryotic cells, such a fusion protein can activate any gene bearing the appropriate DNA binding site nearby. This principle works despite the fact that eukaryotic transcriptional machinery is much more complex than that of bacteria, comprising many proteins in addition to RNAP. A typical eukaryotic activator will recruit whatever factors are required to form the activation complex, and transcription will ensue [178].

The binding of an activator consists in concentration-dependent dynamic interactions, and what is an enhancer to do is simply to increase the concentration of the activator nearby the TSS. Recruitment of an activator merely increases the rate at which the binding reaction occurs. In the absence of an activator, the reaction proceeds spontaneously to some extent, generating a basal level of transcripts. A gene does not stay on once activated unless RNAP is continually recruited by the action of an activator. At abnormally high concentrations of TFs, the specificity can be lost: activators will bind and work where they should not, and enzymes will work on sites that they normally leave unmodified. RNA molecules can also act as recruiters. For example, many regulatory proteins were shown to bind to non-coding RNAs that function to activate their neighbor genes using a cis-mediated mechanism [179,180,181].

When contacting enhancers, activators do it by two ways. First, as in archaea, some activators interact directly with basal factors, such as TFIID-TBP or TFIIB and others, recruiting them to the nearest promoter. Second, activators convey their message through coactivators (adapters laying between the activators and basal factors), such as the above mentioned MBF1, the Mediator complex, and histone acetyltransferases (HATs). The spectrum of the recruited factors fully depends on of the AD. For instance, the most powerful AD of the herpes simplex virus VP16 protein recruits almost all basal factors, including several TAFs; coactivators, such as NuA4, p300/CBP, PCAF, and SAGA complexes; the SWI/SNF chromatin remodeling complex; and general cofactors, such as PC4, that are capable of interacting with both ADs and basal factors [182].

Activators make chromatin more permissive for transcription by recruiting HATs either directly, through interaction with a specific HAT subunit, e.g., Tra1p, a shared subunit of SAGA and NuA4 complexes [183], or indirectly, through coactivators other than the Mediator complex, such as, for example, the SRC family of proteins, which recruit HAT (e.g., p300/CBP, PCAF) and histone methyltransferases (HMTs, e.g., MLL) [184]. Chromatin remodeling complexes as well as HATs include special subunits necessary for the interaction with activators [185]. In general, multiple coactivators participate in promoter activation in a gene-specific way [186,187].

The remodeling of the RNAP complex due to direct contact of the activator with RNAP is common in bacteria, but exceptional in eukaryotes. For example, it was initially shown that AD of the HBx protein of hepatitis B virus interacts with the RBP5 subunit, which is present in all three RNAP I, II, III [188]. Later studies demonstrated that a ternary complex of RPB5, HBx, and TFIIB is rather formed [189].

In eukaryotes, a remodeling of RNAP is achieved by introducing a chemical modification in its structure, for example, by phosphorylating its CTD. Activators that can interact with positive transcription elongation factor b (p-TEFb), which consists of CycT and the cyclin-dependent kinase Cdk9), simulate phosphorylation of CTD by p-TEFb at Ser2, which causes the conversion of the RNAP holoenzyme from the initiation complex into an elongation one [190,191].

Phosphorylation of CTD of RNAP by p-TEFb or the CDK7 subunit of THIIH is not the only way to remodel RNAP. Activators can involve external third-party cell kinases for this purpose. For example, it was shown that interactions of VP16 with cell kinases other than THIIH potentiate CTD phosphorylation [192]. Cdc2 is one of these kinases [193]. The mechanism interconnects the cell cycle, signaling, and transcription regulation.

Thus, after binding to an enhancer, an activator presents its activation domain, which either directly or through coactivators interacts with chromatin remodelers and components of basal transcription machinery. The interactions facilitate the assembly of a complete set of GTFs on the promoter and the transition of RNAP to initial RNA synthesizes, which is accompanied by phosphorylation of Ser5 in CTD of RNAP. Subsequent interactions of the activator with p-TEBb and, possibly, CTD transfer transcription into the elongation phase.

7.3. Combinatorial Code of Activators

Unlike in prokaryotes, eukaryotic activators do not recognize long and easily recognizable DNA sites, but rather a family of short closely related sequences of up to 10 bp. The circumstance explains why some redundancy is usually observed. To overcome redundancy and to induce a high level of target gene transcription, activators should act in combination with each other, binding multiple regulatory elements in enhancers. In addition, they should act in a proper chromatin context (chromatin context requirements are reviewed in [194]). This combinatorial code is additionally reflected in the fact that, compared with prokaryotic TFs, eukaryotic TFs are often heterodimeric in structure, each of the monomers recognizing a similar, but slightly different sequence. An important principle is that heterodimerization not only yields factors with different affinities for DNA, but also combines activators from different groups, which carry DBDs of different affinities and have ADs of different properties. For example, the thyroid hormone nuclear receptor (TR) can form a TR/TR pair and a pair with the retinoid X-receptor, TR/RXR, which binds DNA more strongly than TR/TR [195,196]. Dimerization of MyoD (see below) with the ubiquitously expressed protein E12 or E47 is required for its maximal DNA-binding activity [197,198]. The c-Myc/Max heterodimer has a higher transactivation ability than the c-Myc/c-Myc homodimer [199,200,201,202].

Eukaryotic regulators often recognize specific DNA sequences in enhancers in a cooperative manner. Cooperativity helps to ensure the specificity of binding and is used to integrate information. Two proteins might exert no effect when present in separate cells but bind (cooperatively) to DNA when present in the same cell. A special case of this principle is the situation where the binding of one activator to one of a few adjacent sites enhances the binding of the same activator to the other sites. The principle is illustrated by the fact that many glucocorticoid response elements (GREs) present in an enhancer stimulate transcription far greater than one GRE does when present alone. The cooperativity of GREs is mediated by its DBD, but not AD [203]. This is one more feature of eukaryotic regulators in addition to their DNA-binding, trans-activating, and trans-repressing properties.

Sequential cooperativity is another sort of cooperativity and implies interplay of different activators, at least two of them. It is best illustrated by the example where the binding of a first activator (e.g., pioneer TF, see below) to its site enhances/induces the binding of a subsequent activator (of non-pioneer TF nature). Another example is related to the myogenin enhancer, where the MyoD activator initially recognizes its binding site in a proper chromatin context. MyoD utilizes its H/C and helix III domains to interact with an adjacent protein complex containing the homeodomain protein Pbx, which appears to be constitutively bound at this site. Thus, MyoD is necessary for the initiation of chromatin remodeling, but binds second at this site [204].

The principle of cooperativity is additionally expressed at the level of interplay between activators, where different activators recruit different coactivators, enhancing the effect of each other. For example, one activator recruits TFIID, another one recruits TFIIB or HAT, and a third one recruits a chromatin remodeling complex. A cumulative effect is created in this way by the activators, which access their own sites through different mechanisms.

8. Current Models of Enhancer Functioning in Eukaryotes

The specificity of an activator is determined solely by the location of the DNA site recognized by its DBD. For example, the yeast transcriptional activator Gal4 can activate essentially any gene when artificially expressed in another eukaryote, provided that the gene has Gal4 binding sites inserted nearby [205] (Figure 3A). However, as mentioned above, both activators and repressors work in eukaryotes even when positioned at any of a wide array of positions on DNA, often at considerable distances from their target genes. The interaction between proteins bound to well-separated DNA sites is accomplished via DNA looping. This kind of «action at a distance» is a rule, especially in eukaryotes with huge genomes (see part 9).

8.1. Initial Universal Stages of Enhancer Action

Several models have been presented in the literature to explain how enhancers of eukaryotes function. The key point at which all models converge is the presence of the so-called pioneer TFs (pTFs, or reprogramming factors) that can recognize their target sites in the context of compacted («closed») chromatin, which is covered by nucleosomes and is insensitive to DNase I and other nucleases. This is especially characteristic of developmental genes, which are typically embedded in «closed» chromatin. Thus, activators that potentiate transcription in development are inherently capable of initiating chromatin opening events. However, heterochromatin (carrying the H3K9me2/3 mark) appears to be insensitive to pTFs [206], and chromatin opening is therefore attributed to facultative chromatin created by Polycomb proteins, which regulate developmental genes. pTFs bind to the enhancer DNA to facilitate the subsequent recruitment of other (non-pioneer) activators/repressors, DNA- and chromatin-modifying enzymes, and chromatin remodelers.

One of the best-studied examples of pTFs is provided by the Zelda factor, which acts to establish (but not to maintain) active loop-domains before the midblastula transition in Drosophila embryos [44]. Zelda promotes accumulation of the Dorsal activator at the site of an enhancer [207]. Zelda accumulates and forms sub-nuclear dynamic hubs (microenvironments), where its binding events are transient, and facilitates transcriptional activation by accelerating the duration of multiple pre-initiation steps [208]. The exact mechanisms by which such facilitation occurs remain to be determined.

Direct nucleosome- and chromatin-binding studies are useful in order to investigate how exactly the pTFs mechanistically engage chromatin to initiate reprogramming and trans-differentiation of cells [206]. Interesting details were observed for the FoxA proteins, which belong to the large forkhead box (Fox) family (the Drosophila forkhead mutation causes defects in head fold involution). Mammalian FoxA opens compacted chromatin in nucleosome arrays assembled with linker histone H1 and containing the albumin gene enhancer in vitro, acting independently of the SWI/SNF chromatin remodeling ATPases, which are frequently required for chromatin opening [209]. This effect of FoxA is mediated by a high-affinity DNA binding site and the C-terminal domain, which binds core histones H3 and H4. FoxA is thought to open chromatin by replacing the linker histone with the use of their wHTH domain, which is highly similar in structure to that of H1 [210,211]. Direct binding to core histones is consistent with the finding that FoxA binds more stably to nucleosome core particles than to free DNA, is not affected by histone acetylation [212], and is facilitated by H3K4me2 [213]. Thus, pTFs translate epigenetic signatures into changes in chromatin structure, thereby helping to establish the lineage-specific transcriptional enhancers and to execute the respective programs.

Another example of how pTFs work is provided by MyoD, a master regulator of muscle differentiation. A cascade of chromatin events was assumed to occur upon myogenin gene activation by MyoD. MyoD is recruited to its target gene enhancers via interaction with the Pbx-Meis complex, which is constitutively bound to the genes prior to activation. p300/CBP acetylates histones H3 and H4 within the promoter region to relax chromatin and then recruits lysine acetylase PCAF. Once tethered to the promoter, PCAF acetylates MyoD to facilitate the trans-activation process [214]. Simultaneously, trans-activation by the AD of MyoD is suppressed by MEK1 kinase (a component of MAPK signaling pathway), which binds to the AD of MyoD [215]. Subsequently, a BRG1-containing SWI/SNF complex is also recruited to the promoter to remodel nucleosomes and to stabilize the DNA binding of MyoD [216]. The order of events may vary depending on the MyoD target genes.

8.2. Enhanceosome Complex Assembly

The complex of regulatory proteins on the enhancer is called the enhanceosome. The enhanceosome was crystallized for the interferon-β (IFN-β) enhancer. Transcriptional activation of the IFN-β gene requires that a complex containing the ATF-2/c-Jun, IRF-3/IRF-7, and NFκB activators is assembled on its enhancer. The factors bind to the IFN-β enhancer cooperatively and recruit coactivators to the IFN-β promoter. A crystal structure shows the association of eight proteins with the enhancer [217]. The activators are very close to each other on DNA, and contacts with virtually every nucleotide pair account for the evolutionary invariance of the enhancer sequence. Paucity of local protein–protein contacts suggests that cooperative occupancy of the enhancer comes from both binding-induced changes in DNA conformation and interactions with additional components, such as CBP and probably HMG-group proteins binding the A-T rich minor groove.

The assembly of an enhancesome was studied in vivo with another example. It was found that assembly is hierarchically ordered with the Sox2 protein engaging the target DNA first, followed by binding of Oct4. Sox2 employs a search mode, sliding along open DNA to efficiently locate its targets. Assembly takes about 15 s, including prior three-dimensional diffusion and nonspecific collisions of the factors [218]. The example showed the TFs dynamics and the influence of the epigenome on target search parameters.

8.3. Mechanistic Models of Enhancer Functioning

Several mechanical models were proposed to explain the action of enhancers over distance. In a tracking/scanning model [219] (Figure 3C), an enhancer-bound complex, which contains DNA-binding factors (the enhancesome) and coactivators, «tracks» via small steps, or short jumps, or small loops (and possibly scanning) along chromatin until it encounters the cognate promoter, at which a stable looped structure is established [220]. Although small loops are more likely to form than large loops at first glance, this model is doubtful currently, after the discovery of the loop extrusion mechanism functioning in vivo. ncRNAs produced by RNAP are found between the enhancer(s) and promoter (for details, see [194]). The ncRNAs directed towards the promoter may reflect the process whereby RNAP seeks for stable binding to a promoter [221]. Coactivators, such as p300/CBP and ATP-utilizing chromatin remodelers, may modify and remodel the chromatin substrate between the enhancer and promoter and thus may facilitate their communication and the tracking of the enhanceosome complex along the chromatin fiber. Some evidence confirming the mechanism was indeed obtained in vitro. An initially independent assembly of regulatory complexes was observed at the proximal promoter and upstream enhancer regions and was followed by movement of the upstream complex, accompanied by a unidirectional spreading of histone hyperacetylation and remodeling of the nucleosomes situated at the TSS [222].

An alternative model exploits a hypothetical linking/chaining mechanism [223] (Figure 3B), according to which protein–protein oligomerization bridges a distal enhancer and a target promoter. An activator first binds to the enhancer and facilitates the recruitment of a second TF to a site located just downstream of the former. This cascade of recruitments occurs until the core promoter is reached and finally builds a landing platform for general transcription machinery at the TSS [224]. The Chip protein of Drosophila was initially identified as a «facilitator» of long-range interactions, and it was assumed that communication between an enhancer and a promoter is mediated by a chain of Chip-containing complexes that are anchored to the chromatin fiber by interactions with homeodomain proteins (HDPs) [225]. Yeast HDPs bind DNA in vivo only in conjunction with other factors [226]. It was shown biochemically and genetically that Chip interacts with LIM-domain proteins [227]. In turn, LIM domains mediate diverse protein–protein interactions. Many HDPs contain the LIM domains. Moreover, mammalian Chip homologs are able to interact with P-Otx, which is a HDP that lacks the LIM domain and is required for synergistic activation of a reporter gene via the bridging of another LIM-domain protein and another HDP and their recruitment to individual promoter proximal sites [228]. Thus, the model of linking/chaining had some kind of biochemical basis. Alternatively, it was assumed that Chip-containing complexes might bind directly to relatively hyperacetylated chromatin in the absence of HDP, possibly by recognizing a feature of active loci, such as a specific histone acetylation pattern. The initiation of a chain of these Chip-containing complexes also takes place on the enhancer in this case [223]. However, studies with Lbd1, a vertebrate ortholog of Chip, suggest that Lbd1 might actually form targeted loops, rather than chains, through homodimerization when bound at enhancers and promoters [229]. At the b-globin locus, the GATA1 activator mediates promoter loops independently of cohesin by interacting with Lbd1 [230].

The targeting looping model is now more or less dominating in the case of multicellular eukaryotes. Even for yeasts, activation with UAS was demonstrated to proceed through the looping mechanism [32].

9. Global Genome Organization of Eukaryotes and Enhancer Functioning

The principles of metazoan genome organization were clearly identified in recent years. It was found that the genomes of multicellular eukaryotes are organized in hierarchic layers, which represent structural and functional building blocks of the genome (reviewed in [231]). The layers are (top to bottom): compartments A and B [232], formerly known as euchromatin and heterochromatin; giant CTCF/cohesion-dependent loop domains (insulated neighborhoods) [233,234]; and contact domains [235] or compartmental domains [236]. The latter three are collectively called TADs and subTADs. This area of research, called 3D Genomics, is rapidly developing now, and the terminology used to denote different types of these domains has not yet been fully established. Some attempts in this direction have been made [237]. In this part, we focus on the mechanistic models of enhancer functioning inside TADs.

9.1. TAD Level of Enhancer Functioning

TADs and subTADs are the level at which enhancers function. TADs restrict the enhancer action to prevent cross-reactivity between enhancers and promoters from different TADs and, on the other hand, facilitate the interaction between cis-regulatory elements located in one TAD [238]. The process of cohesin-mediated CTCF-dependent loop extrusion provides a basis for TADs and giant loop formation, bringing together distant regions of regulatory DNA in vertebrates [77,239]. To bring distal DNA regions to each other, the N terminus of CTCF interacts with the SA2-SCC1 subunits of the cohesion complex in mammals [78].

To monitor the TAD/loop formation biochemically at high resolution, special Hi-C and micro-C methods were developed based on the principle of cross-ligation of DNA fragments after digesting formaldehyde-fixed nuclei with micrococcal nuclease (MNase) or restriction endonuclease [234,240]. Using MNase treatment, loop extrusion was recently studied in mammals at the nucleosomal level at an unprecedented level of resolution. It was found that many of the newly identified loops that bridge regulatory elements and many newly identified peaks are localized along extrusion stripes [241]. Thus, loop extrusion is accompanied by weak loop extrusion pulses, i.e., cohesin complexes stop its progress from time to time.

Profound explanations were recently obtained for how the polarity of CTCF binding and the CTCF protein structure determine the genomic distribution of chromatin loops, bypass of CTCF sites and probably regulatory elements, and overall changes in chromatin organization [242]. Although the general picture is becoming more and more understandable, some difficulties in understanding still persist and direct mechanistic explanations of how loop extrusion proceeds remain to be determined. For instance, loop extrusion was recently demonstrated in vitro for the yeast condensin complex. The process was ATP dependent and occurred at a speed of 1.5 Kb/s, but was found to proceed only in one direction [243]. The ring size of a SMC complex (∼50 nm) exceeds that of a single nucleosome (∼10 nm), and nucleosomes would constitute challenging obstacles for SMC translocation if maintaining constant contact with DNA upon translocation. Thus, additional studies are needed to figure out how exactly loop-extrusion proceeds and how the CTCF complexes stop cohesin.

Although lack of the CTCF protein is embryonic lethal in mammals, Drosophila CTCF null mutants (completely lacking both maternal and zygotic CTCFs) survive past the larval stage and die as late pupae with visible defects associated with Hox gene misexpression [238]. These genetic data are supported by biochemical observations indicating that a mechanism of CTCF–cohesin-mediated loop extrusion is absent in invertebrates [236]. Collectively, these data suggest that invertebrates can use a more ancient and probably more universal way to mediate the interactions of remote cis-regulatory elements. For example, the role might be played by other molecular machines that depend on ATP hydrolysis and are capable of DNA translocation, for example, chromatin remodeling complexes [244].

While lacking the loop extrusion mechanism, Drosophila is not devoid of ways to bring remote regions of the genome close together. Long-range looping interactions were initially not found in Drosophila in early genome-wide studies using the Hi-C method because of the technical imperfections of the method at early stages of its development [245,246]. Later, a small number of Drosophila domains with apical interaction hotspots were identified in early embryos [247] and Kc embryonic cells [42]. Although CTCF binding sites do overlap cohesin ChIP-seq peaks [248], they do not exhibit a CTCF motif orientation bias at domain borders [244] and do not anchor Hi-C peaks [42,249]. Some of these contacts obviously represent rare selected interactions between distant regulatory elements, and some others are contacts between PREs and relate to interactions between Polycomb silencing complexes. PRC2-dependent loop formation was recently confirmed in vivo in embryos and S2 embryonic cells and in vitro in Drosophila [88], as well as in vitro using atomic force microscopy in liquid for human PRC2 [90]. PRC1-dependent loop formation independent of CTCF was recently shown in mouse embryonic stem cells [89].

Unicellular eukaryotes (yeasts) were shown to possess two types of TADs, ~200-kbp TADs, which regulate replication timing so that origins within a domain fire synchronously [250], and self-associating domains (micro-TADs), which typically encompass one to five genes [240]. None of these TAD types resembles the types habitual for metazoa. Later research identified the components of yeast basic transcriptional machinery—Mediator, Ssu72, and Rtt109 H3K56 acetyltransferase—that mediate chromatin organization into self-associating domains through the mechanism of gene loop formation, when the beginning and end of a gene are brought together. In addition, it was shown that the RSC chromatin remodeling complex, which normally occurs at promoters of highly transcribed genes, participates in organizing the boundaries of self-associating domains [240].

Thus, the examples above provide evidence that the way the genome is organized may change the principles of regulation by enhancers (Figure 3D). Vertebrates possess a long-range looping interaction system based on the cohesin-dependent loop extrusion mechanism and obviously the mechanisms that provide for enhancer–promoter contacts within TADs. Invertebrates do not have a perfect system for gene regulation by the cohesin complex, despite its obvious requirement for cell division. Therefore, gene regulation in invertebrates, like in yeasts, should obviously rely on the mechanisms that work inside TADs.

9.2. Principles of Activation by Enhancers Inside TADs

In multicellular eukaryotes, the principles of activation inside TADs might be exactly the same for both invertebrates and vertebrates because the evolutionary distance does not allow any fundamental molecular difference to develop. Instead of CTCF, which occupies TAD borders in mammals, a lot of architectural proteins were detected on the borders between TADs in Drosophila from the very beginning [245,246]. Some of them were assumed to play a role in establishing and maintaining the enhancer–promoter interactions, by bridging DNA sites in the enhancer and promoter or acting thorough protein–protein interactions. The concept received some confirmation. First, two specific types of enhancers, that is, enhancers of housekeeping and developmental genes, were found to exist in the Drosophila genome. Second, each class of enhancers was demonstrated to have its own preferences regarding the architectural protein composition [249]. A functional significance was demonstrated for certain zinc-finger architectural proteins, which were found to support the long-range interactions between the enhancer and the promoter [35]. Based on their large number, it cannot be ruled out that some of the architectural proteins may act as coactivators of transcription when placed or attached to chromatin.

Thus, in Drosophila, the enhancer–promoter specificity is achieved within a TAD with the use of a compatibility code between enhancers and classes of core promoters [251,252]; special enhancer-bound and promoter-bound architectural proteins that interact with each other [253]; and tethering elements, which are specific parts of proximal promoters or enhancers and provide for enhancer–promoter communication over great distances (reviewed in [254]).

In vertebrates, the regulatory system based on architectural proteins is not as obvious as in Drosophila, possibly because vertebrates possess the cohesin/CTCF-mediated loop extrusion system. However, some examples of architectural proteins were discovered. LBD1 (Chip in flies) and YY1 [255], studied far better, may execute analogous architectural functions in mammals, bringing enhancers and promoters together. As it should be expected from a mammalian regulator, YY1 appears to be multifunctional; acts through homodimerization both as a repressor and as an activator of gene expression [256], depending on the context, as we saw for prokaryotes above; and may act in cooperation with other factors [257]. Widespread loss of H3K27 acetylation was observed on YY1-bound enhancers, underscoring the crucial role of YY1 in enhancer regulation [258]. YY1 binds to both gene regulatory elements and their associated RNAs, indicating more complex interplay than that observed for bacterial activators [181]. YY1 may act as an auxiliary protein of Polycomb repressive complexes [259]. In addition, acting as an activator, YY1 recruits the INO80 remodeling complex to YY1-activated genes, where INO80 functions as a YY1 essential coactivator. Binding of YY1 to its DNA sites in target genes requires INO80, suggesting that YY1 uses the INO80 complex not only to activate transcription, but also to gain access to target promoters [260].

Thus, in the near future, the rapidly developing field of studying loop extrusion in mammals will move from accumulating general ideas of how it works to mechanistic justifications. Some examples of such a transition are already available [243], but still not numerous. At the same time, studies in invertebrates will solve the problem of understanding the role of architectural/insulator proteins and their function in the regulation of transcription. The problem still remains unsolved in spite of more than 30 years of relevant research.

10. Final Remarks

10.1. Parallels of Regulatory Evolution

The origin of the mechanisms of transcriptional regulation lies in the world of prokaryotes, and we will never know what the first mechanisms of regulation were. However, using the existing diversity of living beings, we can reconstruct the evolution of regulatory mechanisms, understand its laws, and highlight the fundamental differences. The first thing worth paying attention to is the principle of initiation of transcription by RNAP. Recent findings indicate a common origin of initiation factors: the bacterial sigma subunit of prokaryotes and TFB and TFIIB of archaea and eukaryotes, respectively [261]. The discovery of initiating TFIIB factors for RNAP I emphasizes that not a single RNAP of cellular organisms is capable of initiating transcription independently, testifying again to the evolutionary conservation of transcription initiation origin [262].

Regulation by multiple initiation factors is another trait used by all cellular organisms. In bacteria, such factors belong to the family of sigma subunits, while multiple TBPs and TFBs are used in archaea. This feature was inherited by eukaryotes from archaea. For example, in Metazoa, transcription is regulated by TBP and its TBP homologs, TBP-related factors [263,264].

One more general principle that has been preserved in evolution of all groups is compensation for weak motifs of the core promoter by activators. Activators recruit sigma-initiating subunit of bacteria and functional analogues of sigma, TBP+TFB, in archaea. In different types of eukaryotes, activators perform the same job and additionally recruit various auxiliary complexes that facilitate operations with nucleosomes.

10.2. Gradual Increase of Regulatory Complexity in Evolution of Eukaryotes

Although the eukaryotic system of gene expression regulation is the most sophisticated, eukaryotes use the same principle of recruitment of initiating basal factors to the promoter that they inherited from Archaea. In archaea, activators bring the TBP and TFB to a promoter and facilitate their interaction with RNAP. Such a functionality of archaeal activators resembles one-component coactivator activity of eukaryotes and appears to be very stable in evolution, being maintained from unicellular eukaryotes to invertebrates and finally to vertebrates. For example, the MBF1 factor bridges TBP and the GCN4 activator in yeasts [157] and nuclear receptor FTZ-F1 and TBP in Drosophila [158], while its repertoire expands in vertebrates to two activators, SF-1, which is a mammalian counterpart of FTZ-F1 and ATF1 [159,160]. Thus, the principle of recruitment of TBP and TFB by transcription activators, which was developed by prokaryotes about 4 billion years ago, was successfully fixed in evolution and transformed into a coactivator system in eukaryotes.

At the same time, gradual complication of organisms certainly affected the transcriptional regulation and eukaryotes have already acquired one-component heterodimeric coactivators. In addition, eukaryotes acquired multi-component coactivator complexes, such as Mediator, to integrate and transmit the signals from multiple TFs to a few factors of basal transcription machinery; certain other complexes, such as chromatin remodeling complexes, to operate with nucleosome-imbedded DNA; and histone-modifying complexes, to facilitate these operations.

Although examples where the DNA topology affects the activity of a promoter were described in bacteria, activation of transcription in this domain was always accompanied by direct interactions of activators with bacterial RNAP and/or the sigma-initiating subunit. Unlike in prokaryotes, eukaryotic activators convey their message by contacting RNAP through the basal factors, coactivators and the Mediator complex, which surround RNAP, but never interact with RNAP directly.

10.3. Perspectives and Problems to be Solved

Despite the advances made in recent decades in studying the regulatory mechanisms in eukaryotes and other organisms, including genome-wide studies, several evolutionary problems of molecular biology are still not resolved and, apparently, will not be resolved in the nearest future. For instance, although NGS technologies made it possible to conduct metagenomics analysis of populations of microorganisms, and representatives of a new archaeal group—direct evolutionary predecessors of eukaryotes—were identified and then cultivated in the past 5 years, accumulating knowledge is still the main feature of the research process and findings of the missing transition links between archaea and eukaryotes do not allow us to obtain comprehensive answers about evolution of specific molecular regulatory mechanisms.

For example, it is still an enigma when and why archaeal activators functionally turned into coactivators of eukaryotes. This can be partially explained by the increase in the number of regulators in Eukarya, where some of the regulators assumed the functions of coactivators by creating an adapter link between the remaining activators and basal factors. Alternatively, it may be explained by multiple gene and whole genome duplications, which occurred in evolution of unicellular eukaryotes and provided greater evolutionary flexibility to regularity mechanisms. This situation is clearly evident in the yeast genome, which has more than several hundreds of duplicated genes and where duplications were accompanied by asynchronous paralog differentiation and regulatory neofunctionalization [265,266,267].

One more enigma is the origin of the Mediator complex. Although several attempts were made to come closer to understanding this problem [268,269], the question of how such a large regulatory complex arose in evolution of eukaryotes remains a mystery. These studies identified a minimal set of core subunits, revealed elongation of the subunits in metazoans and plants, and showed the differences in distribution of intrinsically disordered regions in different kingdoms. It was also shown that none of the subunits is specific to metazoans, supporting a very ancient eukaryotic origin of this large complex.

11. Conclusions

Here we described diversity and the gradual increase in complexity during evolution of the mechanisms underlying transcriptional control by enhancers and enhancer-like regulatory elements (eREs) in different organisms (Table 1). A direct contact of the eRE and promoter is a common feature of activated transcription in all taxa. While the contact is due to the close proximity of the eRE to the promoter in DNA sequence in bacteria, specific mechanisms evolved in higher organisms to bring together the elements that are far apart. In general, bacterial transcription is governed by one or few TFs forming direct contacts with RNAP. In addition to TFs themselves, a local molecular environment (the DNA topology and packaging DNA-binding proteins) has an effect on transcription output. Archaea and unicellular eukaryotes are characterized by having evolved a specific layer of protein factors to mediate the TF–RNAP interaction; the layer consists of multiple classes of GTFs and coactivators in eukaryotes. Moreover, eREs become more separable from promoters in the genome. Epigenetic regulation of transcription developed in eukaryotes [270]. Finally, transcription in Metazoa is characterized by extensive control via multiple eREs, each occupied by multiple TFs. Enhancers become dispersed throughout the genome and diversified in function. The separation of enhancers and promoters by long distance and an increase in the abundance of enhancers are accompanied by the appearance of special mechanisms that ensure the convergence of regulatory elements in space. In general, the DNA architecture became more and more important in transcription regulation: other genomic elements, architectural proteins, and higher levels of DNA packaging became actively involved in this process during evolution [271]. However, although the mechanisms of transcriptional control drastically differ between Metazoa and Bacteria, it is still possible to observe the original prototypes of complex eukaryotic mechanisms in Bacteria. Our current understanding of these mechanisms is far from complete [194]. Further studies are necessary because comprehensive knowledge of the function of these genomic elements is highly important for multiple applications in biotechnological and medical fields [272,273].

Table 1. Features of transcription driven by enhancers and enhancer-like regulatory elements (eREs) in different taxa.

Funding

This work was funded by the Russian Science Foundation, grant no. 20-14-00201 (chapters 1–8), and the Ministry of Science and Higher Education of the Russian Federation, grant no. 075-15-2019-1661 (chapters 9-11).

Conflicts of Interest

The authors declare no conflict of interest.

References

Consortium, E.P. An integrated encyclopedia of DNA elements in the human genome. Nature 2012, 489, 57–74. [Google Scholar] [CrossRef] [PubMed]
Banerji, J.; Rusconi, S.; Schaffner, W. Expression of a beta-globin gene is enhanced by remote SV40 DNA sequences. Cell 1981, 27, 299–308. [Google Scholar] [CrossRef]
Guarente, L. UASs and enhancers: Common mechanism of transcriptional activation in yeast and mammals. Cell 1988, 52, 303–305. [Google Scholar] [CrossRef]
Struhl, K. Genetic properties and chromatin structure of the yeast gal regulatory element: An enhancer-like sequence. Proc. Natl. Acad. Sci. USA 1984, 81, 7865–7869. [Google Scholar] [CrossRef]
Frisby, D.; Zuber, P. Analysis of the upstream activating sequence and site of carbon and nitrogen source repression in the promoter of an early-induced sporulation gene of Bacillus subtilis. J. Bacteriol. 1991, 173, 7557–7564. [Google Scholar] [CrossRef] [PubMed][Green Version]
Hsu, L.M.; Giannini, J.K.; Leung, T.W.; Crosthwaite, J.C. Upstream sequence activation of Escherichia coli argT promoter in vivo and in vitro. Biochemistry 1991, 30, 813–822. [Google Scholar] [CrossRef]
Maeda, S.; Mizuno, T. Evidence for multiple OmpR-binding sites in the upstream activation sequence of the ompC promoter in Escherichia coli: A single OmpR-binding site is capable of activating the promoter. J. Bacteriol. 1990, 172, 501–503. [Google Scholar] [CrossRef][Green Version]
Verbeek, H.; Nilsson, L.; Bosch, L. FIS-induced bending of a region upstream of the promoter activates transcription of the E coli thrU(tufB) operon. Biochimie 1991, 73, 713–718. [Google Scholar] [CrossRef]
Yagi, S.; Yagi-Tanaka, K.; Yoshioka, J.; Suzuki, M. Expression enhancement of the Tn5 neomycin-resistance gene by removal of upstream ATG sequences and its use for probing heterologous upstream activating sequences in yeast. Curr. Genet. 1993, 24, 12–20. [Google Scholar] [CrossRef]
Xia, Y.; Forsman, K.; Jass, J.; Uhlin, B.E. Oligomeric interaction of the PapB transcriptional regulator with the upstream activating region of pili adhesin gene promoters in Escherichia coli. Mol. Microbiol. 1998, 30, 513–523. [Google Scholar] [CrossRef]
Dworkin, J.; Jovanovic, G.; Model, P. Role of upstream activation sequences and integration host factor in transcriptional activation by the constitutively active prokaryotic enhancer-binding protein PspF. J. Mol. Biol. 1997, 273, 377–388. [Google Scholar] [CrossRef] [PubMed]
Jaspers, M.C.M.; Sturme, M.; van der Meer, J.R. Unusual location of two nearby pairs of upstream activating sequences for HbpR, the main regulatory protein for the 2-hydroxybiphenyl degradation pathway of “Pseudomonas azelaica” HBP1. Microbiology 2001, 147, 2183–2194. [Google Scholar] [CrossRef][Green Version]
Velazquez, F.; Fernandez, S.; de Lorenzo, V. The upstream-activating sequences of the sigma54 promoter Pu of Pseudomonas putida filter transcription readthrough from upstream genes. J. Biol. Chem. 2006, 281, 11940–11948. [Google Scholar] [CrossRef] [PubMed]
Nachaliel, N.; Melnick, J.; Gafny, R.; Glaser, G. Ribosome associated protein(s) specifically bind(s) to the upstream activator sequence of the E. coli rrnA P1 promoter. Nucleic Acids Res. 1989, 17, 9811–9822. [Google Scholar] [CrossRef]
Rochman, M.; Blot, N.; Dyachenko, M.; Glaser, G.; Travers, A.; Muskhelishvili, G. Buffering of stable RNA promoter activity against DNA relaxation requires a far upstream sequence. Mol. Microbiol. 2004, 53, 143–152. [Google Scholar] [CrossRef] [PubMed]
Andersson, R.; Sandelin, A. Determinants of enhancer and promoter activities of regulatory elements. Nat. Rev. Genet. 2020, 21, 71–87. [Google Scholar] [CrossRef] [PubMed]
Kim, T.K.; Shiekhattar, R. Architectural and Functional Commonalities between Enhancers and Promoters. Cell 2015, 162, 948–959. [Google Scholar] [CrossRef] [PubMed]
Tippens, N.D.; Vihervaara, A.; Lis, J.T. Enhancer transcription: What, where, when, and why? Genes Dev. 2018, 32, 1–3. [Google Scholar] [CrossRef]
Andersson, R.; Sandelin, A.; Danko, C.G. A unified architecture of transcriptional regulatory elements. Trends Genet. 2015, 31, 426–433. [Google Scholar] [CrossRef]
Carelli, F.N.; Liechti, A.; Halbert, J.; Warnefors, M.; Kaessmann, H. Repurposing of promoters and enhancers during mammalian evolution. Nat. Commun. 2018, 9, 4066. [Google Scholar] [CrossRef] [PubMed]
Majic, P.; Payne, J.L. Enhancers Facilitate the Birth of De Novo Genes and Gene Integration into Regulatory Networks. Mol. Biol. Evol. 2020, 37, 1165–1178. [Google Scholar] [CrossRef] [PubMed]
Arenas-Mena, C. The origins of developmental gene regulation. Evol. Dev. 2017, 19, 96–107. [Google Scholar] [CrossRef] [PubMed]
Jacob, F.; Monod, J. Genetic regulatory mechanisms in the synthesis of proteins. J. Mol. Biol. 1961, 3, 318–356. [Google Scholar] [CrossRef]
Ulrich, L.E.; Koonin, E.V.; Zhulin, I.B. One-component systems dominate signal transduction in prokaryotes. Trends Microbiol. 2005, 13, 52–56. [Google Scholar] [CrossRef]
Casino, P.; Rubio, V.; Marina, A. The mechanism of signal transduction by two-component systems. Curr. Opin Struct Biol. 2010, 20, 763–771. [Google Scholar] [CrossRef] [PubMed]
Tagami, H.; Aiba, H. Role of CRP in transcription activation at Escherichia coli lac promoter: CRP is dispensable after the formation of open complex. Nucleic Acids Res. 1995, 23, 599–605. [Google Scholar] [CrossRef]
Maeda, R.K.; Karch, F. The open for business model of the bithorax complex in Drosophila. Chromosoma 2015, 124, 293–307. [Google Scholar] [CrossRef]
Santos-Beneit, F. The Pho regulon: A huge regulatory network in bacteria. Front. Microbiol. 2015, 6, 402. [Google Scholar] [CrossRef]
Blanco, A.G.; Canals, A.; Bernues, J.; Sola, M.; Coll, M. The structure of a transcription activation subcomplex reveals how sigma(70) is recruited to PhoB promoters. EMBO J. 2011, 30, 3776–3785. [Google Scholar] [CrossRef]
Cosma, M.P.; Tanaka, T.; Nasmyth, K. Ordered Recruitment of Transcription and Chromatin Remodeling Factors to a Cell Cycle- and Developmentally Regulated Promoter. Cell 2016, 166, 781. [Google Scholar] [CrossRef]
Krebs, J.E.; Kuo, M.H.; Allis, C.D.; Peterson, C.L. Cell cycle-regulated histone acetylation required for expression of the yeast HO gene. Genes Dev. 1999, 13, 1412–1421. [Google Scholar] [CrossRef] [PubMed]
Petrascheck, M.; Escher, D.; Mahmoudi, T.; Verrijzer, C.P.; Schaffner, W.; Barberis, A. DNA looping induced by a transcriptional enhancer in vivo. Nucleic Acids Res. 2005, 33, 3743–3750. [Google Scholar] [CrossRef] [PubMed]
Putlyaev, E.V.; Ibragimov, A.N.; Lebedeva, L.A.; Georgiev, P.G.; Shidlovskii, Y.V. Structure and Functions of the Mediator Complex. Biochemistry (Mosc) 2018, 83, 423–436. [Google Scholar] [CrossRef] [PubMed]
Dobi, K.C.; Winston, F. Analysis of transcriptional activation at a distance in Saccharomyces cerevisiae. Mol. Cell Biol. 2007, 27, 5575–5586. [Google Scholar] [CrossRef] [PubMed]
Zolotarev, N.; Fedotova, A.; Kyrchanova, O.; Bonchuk, A.; Penin, A.A.; Lando, A.S.; Eliseeva, I.A.; Kulakovskiy, I.V.; Maksimenko, O.; Georgiev, P. Architectural proteins Pita, Zw5,and ZIPIC contain homodimerization domain and support specific long-range interactions in Drosophila. Nucleic Acids Res. 2016, 44, 7228–7241. [Google Scholar] [CrossRef] [PubMed]
Ghavi-Helm, Y.; Klein, F.A.; Pakozdi, T.; Ciglar, L.; Noordermeer, D.; Huber, W.; Furlong, E.E. Enhancer loops appear stable during development and are associated with paused polymerase. Nature 2014, 512, 96–100. [Google Scholar] [CrossRef]
Kvon, E.Z.; Kazmar, T.; Stampfel, G.; Yanez-Cuna, J.O.; Pagani, M.; Schernhuber, K.; Dickson, B.J.; Stark, A. Genome-scale functional characterization of Drosophila developmental enhancers in vivo. Nature 2014, 512, 91–95. [Google Scholar] [CrossRef]
Arnold, C.D.; Gerlach, D.; Stelzer, C.; Boryń, Ł.; Rath, M.; Stark, A. Genome-wide quantitative enhancer activity maps identified by STARR-seq. Science 2013, 339, 1074–1077. [Google Scholar] [CrossRef]
Frankel, N.; Davis, G.K.; Vargas, D.; Wang, S.; Payre, F.; Stern, D.L. Phenotypic robustness conferred by apparently redundant transcriptional enhancers. Nature 2010, 466, 490–493. [Google Scholar] [CrossRef]
Jack, J.; DeLotto, Y. Structure and regulation of a complex locus: The cut gene of Drosophila. Genetics 1995, 139, 1689–1700. [Google Scholar]
Kyrchanova, O.; Mogila, V.; Wolle, D.; Magbanua, J.P.; White, R.; Georgiev, P.; Schedl, P. The boundary paradox in the Bithorax complex. Mech. Dev. 2015, 138 Pt 2, 122–132. [Google Scholar] [CrossRef]
Eagen, K.P.; Aiden, E.L.; Kornberg, R.D. Polycomb-mediated chromatin loops revealed by a subkilobase-resolution chromatin interaction map. Proc. Natl. Acad. Sci. USA 2017, 114, 8764–8769. [Google Scholar] [CrossRef]
Lanzuolo, C.; Roure, V.; Dekker, J.; Bantignies, F.; Orlando, V. Polycomb response elements mediate the formation of chromosome higher-order structures in the bithorax complex. Nat. Cell Biol. 2007, 9, 1167–1174. [Google Scholar] [CrossRef] [PubMed]
Ogiyama, Y.; Schuettengruber, B.; Papadopoulos, G.L.; Chang, J.M.; Cavalli, G. Polycomb-Dependent Chromatin Looping Contributes to Gene Silencing during Drosophila Development. Mol. Cell 2018, 71, 73–88 e75. [Google Scholar] [CrossRef] [PubMed]
Amano, T.; Sagai, T.; Tanabe, H.; Mizushina, Y.; Nakazawa, H.; Shiroishi, T. Chromosomal dynamics at the Shh locus: Limb bud-specific differential regulation of competence and active transcription. Dev. Cell 2009, 16, 47–57. [Google Scholar] [CrossRef] [PubMed]
Velagaleti, G.V.; Bien-Willner, G.A.; Northup, J.K.; Lockhart, L.H.; Hawkins, J.C.; Jalal, S.M.; Withers, M.; Lupski, J.R.; Stankiewicz, P. Position effects due to chromosome breakpoints that map approximately 900 Kb upstream and approximately 1.3 Mb downstream of SOX9 in two patients with campomelic dysplasia. Am. J. Hum. Genet. 2005, 76, 652–662. [Google Scholar] [CrossRef]
Dao, L.T.M.; Galindo-Albarran, A.O.; Castro-Mondragon, J.A.; Andrieu-Soler, C.; Medina-Rivera, A.; Souaid, C.; Charbonnier, G.; Griffon, A.; Vanhille, L.; Stephen, T.; et al. Genome-wide characterization of mammalian promoters with distal enhancer functions. Nat. Genet. 2017, 49, 1073–1081. [Google Scholar] [CrossRef]
Diao, Y.; Fang, R.; Li, B.; Meng, Z.; Yu, J.; Qiu, Y.; Lin, K.C.; Huang, H.; Liu, T.; Marina, R.J.; et al. A tiling-deletion-based genetic screen for cis-regulatory element identification in mammalian cells. Nat. Methods 2017, 14, 629–635. [Google Scholar] [CrossRef]
Engreitz, J.M.; Haines, J.E.; Perez, E.M.; Munson, G.; Chen, J.; Kane, M.; McDonel, P.E.; Guttman, M.; Lander, E.S. Local regulation of gene expression by lncRNA promoters, transcription and splicing. Nature 2016, 539, 452–455. [Google Scholar] [CrossRef]
Rajagopal, N.; Srinivasan, S.; Kooshesh, K.; Guo, Y.; Edwards, M.D.; Banerjee, B.; Syed, T.; Emons, B.J.; Gifford, D.K.; Sherwood, R.I. High-throughput mapping of regulatory DNA. Nat. Biotechnol. 2016, 34, 167–174. [Google Scholar] [CrossRef]
Li, G.; Ruan, X.; Auerbach, R.K.; Sandhu, K.S.; Zheng, M.; Wang, P.; Poh, H.M.; Goh, Y.; Lim, J.; Zhang, J.; et al. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 2012, 148, 84–98. [Google Scholar] [CrossRef] [PubMed]
Kimura, M. The Neutral Theory of Molecular Evolution; Cambridge University Press: New York, NY, USA, 1983. [Google Scholar]
Koonin, E.V. The origin of introns and their role in eukaryogenesis: A compromise solution to the introns-early versus introns-late debate? Biol. Direct 2006, 1, 22. [Google Scholar] [CrossRef]
Lodish, H.; Berk, A.; Kaiser, C.A.; Krieger, M.; Bretscher, A.; Ploegh, H.; Amon, A.; Martin, K. Molecular Cell Biology, 8th ed.; W. H. Freeman and Company: New York, NY, USA, 2016. [Google Scholar]
Koonin, E.V. Logic. of Chance, The: The Nature and Origin of Biological Evolution; FT Press: Upper Saddle River, NJ, USA, 2011. [Google Scholar]
Hook-Barnard, I.G.; Hinton, D.M. Transcription initiation by mix and match elements: Flexibility for polymerase binding to bacterial promoters. Gene Regul. Syst. Biol. 2007, 1, 275–293. [Google Scholar] [CrossRef]
Miroslavova, N.S.; Busby, S.J. Investigations of the modular structure of bacterial promoters. Biochem. Soc. Symp. 2006, 73, 1–10. [Google Scholar] [CrossRef]
Ross, W.; Aiyar, S.E.; Salomon, J.; Gourse, R.L. Escherichia coli promoters with UP elements of different strengths: Modular structure of bacterial promoters. J. Bacteriol. 1998, 180, 5375–5383. [Google Scholar] [CrossRef] [PubMed]
De Boer, H.A.; Comstock, L.J.; Vasser, M. The tac promoter: A functional hybrid derived from the trp and lac promoters. Proc. Natl. Acad. Sci. USA 1983, 80, 21–25. [Google Scholar] [CrossRef]
Murakami, K.S. Structural biology of bacterial RNA polymerase. Biomolecules 2015, 5, 848–864. [Google Scholar] [CrossRef]
Paget, M.S. Bacterial Sigma Factors and Anti-Sigma Factors: Structure, Function and Distribution. Biomolecules 2015, 5, 1245–1265. [Google Scholar] [CrossRef]
Hinton, D.M.; Pande, S.; Wais, N.; Johnson, X.B.; Vuthoori, M.; Makela, A.; Hook-Barnard, I. Transcriptional takeover by sigma appropriation: Remodelling of the sigma70 subunit of Escherichia coli RNA polymerase by the bacteriophage T4 activator MotA and co-activator AsiA. Microbiology 2005, 151, 1729–1740. [Google Scholar] [CrossRef]
Schleif, R. AraC protein, regulation of the l-arabinose operon in Escherichia coli, and the light switch mechanism of AraC action. FEMS Microbiol. Rev. 2010, 34, 779–796. [Google Scholar] [CrossRef]
Bracco, L.; Kotlarz, D.; Kolb, A.; Diekmann, S.; Buc, H. Synthetic curved DNA sequences can act as transcriptional activators in Escherichia coli. EMBO J. 1989, 8, 4289–4296. [Google Scholar] [CrossRef] [PubMed]
Reyes-Caballero, H.; Campanello, G.C.; Giedroc, D.P. Metalloregulatory proteins: Metal selectivity and allosteric switching. Biophys. Chem. 2011, 156, 103–114. [Google Scholar] [CrossRef] [PubMed]
Newberry, K.J.; Brennan, R.G. The structural mechanism for transcription activation by MerR family member multidrug transporter activation, N terminus. J. Biol. Chem. 2004, 279, 20356–20362. [Google Scholar] [CrossRef]
Richet, E.; Sogaard-Andersen, L. CRP induces the repositioning of MalT at the Escherichia coli malKp promoter primarily through DNA bending. EMBO J. 1994, 13, 4558–4567. [Google Scholar] [CrossRef] [PubMed]
Gaal, T.; Rao, L.; Estrem, S.T.; Yang, J.; Wartell, R.M.; Gourse, R.L. Localization of the intrinsically bent DNA region upstream of the E.coli rrnB P1 promoter. Nucleic Acids Res. 1994, 22, 2344–2350. [Google Scholar] [CrossRef] [PubMed]
Opel, M.L.; Aeling, K.A.; Holmes, W.M.; Johnson, R.C.; Benham, C.J.; Hatfield, G.W. Activation of transcription initiation from a stable RNA promoter by a Fis protein-mediated DNA structural transmission mechanism. Mol. Microbiol. 2004, 53, 665–674. [Google Scholar] [CrossRef] [PubMed]
Cameron, A.D.; Dorman, C.J. A fundamental regulatory mechanism operating through OmpR and DNA topology controls expression of Salmonella pathogenicity islands SPI-1 and SPI-2. PLoS Genet. 2012, 8, e1002615. [Google Scholar] [CrossRef]
Dorman, C.J.; Dorman, M.J. Control of virulence gene transcription by indirect readout in Vibrio cholerae and Salmonella enterica serovar Typhimurium. Environ. Microbiol. 2017, 19, 3834–3845. [Google Scholar] [CrossRef]
Makino, K.; Amemura, M.; Kim, S.K.; Nakata, A.; Shinagawa, H. Role of the sigma 70 subunit of RNA polymerase in transcriptional activation by activator protein PhoB in Escherichia coli. Genes Dev. 1993, 7, 149–160. [Google Scholar] [CrossRef][Green Version]
Canals, A.; Blanco, A.G.; Coll, M. sigma70 and PhoB activator: Getting a better grip. Transcription 2012, 3, 160–164. [Google Scholar] [CrossRef]
Jain, D.; Nickels, B.E.; Sun, L.; Hochschild, A.; Darst, S.A. Structure of a ternary transcription activation complex. Mol. Cell 2004, 13, 45–53. [Google Scholar] [CrossRef]
Lobell, R.B.; Schleif, R.F. DNA looping and unlooping by AraC protein. Science 1990, 250, 528–532. [Google Scholar] [CrossRef] [PubMed]
De Wit, E.; Vos, E.S.; Holwerda, S.J.; Valdes-Quezada, C.; Verstegen, M.J.; Teunissen, H.; Splinter, E.; Wijchers, P.J.; Krijger, P.H.; de Laat, W. CTCF Binding Polarity Determines Chromatin Looping. Mol. Cell 2015, 60, 676–684. [Google Scholar] [CrossRef] [PubMed]
Guo, Y.; Xu, Q.; Canzio, D.; Shou, J.; Li, J.; Gorkin, D.U.; Jung, I.; Wu, H.; Zhai, Y.; Tang, Y.; et al. CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function. Cell 2015, 162, 900–910. [Google Scholar] [CrossRef] [PubMed]
Li, Y.; Haarhuis, J.H.I.; Sedeno Cacciatore, A.; Oldenkamp, R.; van Ruiten, M.S.; Willems, L.; Teunissen, H.; Muir, K.W.; de Wit, E.; Rowland, B.D.; et al. The structural basis for cohesin-CTCF-anchored loops. Nature 2020, 578, 472–476. [Google Scholar] [CrossRef]
Sanborn, A.L.; Rao, S.S.; Huang, S.C.; Durand, N.C.; Huntley, M.H.; Jewett, A.I.; Bochkov, I.D.; Chinnappan, D.; Cutkosky, A.; Li, J.; et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl. Acad. Sci. USA 2015, 112, E6456–E6465. [Google Scholar] [CrossRef]
Hanssen, L.L.P.; Kassouf, M.T.; Oudelaar, A.M.; Biggs, D.; Preece, C.; Downes, D.J.; Gosden, M.; Sharpe, J.A.; Sloane-Stanley, J.A.; Hughes, J.R.; et al. Tissue-specific CTCF-cohesin-mediated chromatin architecture delimits enhancer interactions and function in vivo. Nat. Cell Biol. 2017, 19, 952–961. [Google Scholar] [CrossRef]
Ren, G.; Jin, W.; Cui, K.; Rodrigez, J.; Hu, G.; Zhang, Z.; Larson, D.R.; Zhao, K. CTCF-Mediated Enhancer-Promoter Interaction Is a Critical Regulator of Cell-to-Cell Variation of Gene Expression. Mol. Cell 2017, 67, 1049–1058 e1046. [Google Scholar] [CrossRef]
Hendrickson, W.; Schleif, R.F. Regulation of the Escherichia coli L-arabinose operon studied by gel electrophoresis DNA binding assay. J. Mol. Biol. 1984, 178, 611–628. [Google Scholar] [CrossRef]
Lobell, R.B.; Schleif, R.F. AraC-DNA looping: Orientation and distance-dependent loop breaking by the cyclic AMP receptor protein. J. Mol. Biol. 1991, 218, 45–54. [Google Scholar] [CrossRef]
Zhang, X.; Schleif, R. Catabolite gene activator protein mutations affecting activity of the araBAD promoter. J. Bacteriol. 1998, 180, 195–200. [Google Scholar] [CrossRef] [PubMed]
Hahn, S.; Hendrickson, W.; Schleif, R. Transcription of Escherichia coli ara in vitro. The cyclic AMP receptor protein requirement for PBAD induction that depends on the presence and orientation of the araO2 site. J. Mol. Biol. 1986, 188, 355–367. [Google Scholar] [CrossRef]
Dhiman, A.; Schleif, R. Recognition of overlapping nucleotides by AraC and the sigma subunit of RNA polymerase. J. Bacteriol. 2000, 182, 5076–5081. [Google Scholar] [CrossRef] [PubMed]
Zhang, X.; Reeder, T.; Schleif, R. Transcription activation parameters at ara pBAD. J. Mol. Biol. 1996, 258, 14–24. [Google Scholar] [CrossRef] [PubMed][Green Version]
Alecki, C.; Chiwara, V.; Sanz, L.A.; Grau, D.; Arias Perez, O.; Boulier, E.L.; Armache, K.J.; Chedin, F.; Francis, N.J. RNA-DNA strand exchange by the Drosophila Polycomb complex PRC2. Nat. Commun. 2020, 11, 1781. [Google Scholar] [CrossRef]
Boyle, S.; Flyamer, I.M.; Williamson, I.; Sengupta, D.; Bickmore, W.A.; Illingworth, R.S. A central role for canonical PRC1 in shaping the 3D nuclear landscape. Genes Dev. 2020. [Google Scholar] [CrossRef]
Heenan, P.R.; Wang, X.; Gooding, A.R.; Cech, T.R.; Perkins, T.T. Bending and looping of long DNA by Polycomb repressive complex 2 revealed by AFM imaging in liquid. Nucleic Acids Res. 2020, 48, 2969–2981. [Google Scholar] [CrossRef]
Hirsh, J.; Berg, P. Electron microscopy of gene regulation: The L-arabinose operon. Proc. Natl. Acad. Sci. USA 1976, 73, 1518–1522. [Google Scholar] [CrossRef]
Carra, J.H.; Schleif, R.F. Variation of half-site organization and DNA looping by AraC protein. EMBO J. 1993, 12, 35–44. [Google Scholar] [CrossRef]
Dunn, T.M.; Hahn, S.; Ogden, S.; Schleif, R.F. An operator at -280 base pairs that is required for repression of araBAD operon promoter: Addition of DNA helical turns between the operator and promoter cyclically hinders repression. Proc. Natl. Acad. Sci. USA 1984, 81, 5017–5020. [Google Scholar] [CrossRef]
Lee, D.H.; Schleif, R.F. In vivo DNA loops in araCBAD: Size limits and helical repeat. Proc Natl. Acad. Sci. USA 1989, 86, 476–480. [Google Scholar] [CrossRef] [PubMed]
Kramer, H.; Niemoller, M.; Amouyal, M.; Revet, B.; von Wilcken-Bergmann, B.; Muller-Hill, B. lac repressor forms loops with linear DNA carrying two suitably spaced lac operators. EMBO J. 1987, 6, 1481–1491. [Google Scholar] [CrossRef] [PubMed]
Griffith, J.; Hochschild, A.; Ptashne, M. DNA loops induced by cooperative binding of lambda repressor. Nature 1986, 322, 750–752. [Google Scholar] [CrossRef] [PubMed]
Hochschild, A.; Ptashne, M. Cooperative binding of lambda repressors to sites separated by integral turns of the DNA helix. Cell 1986, 44, 681–687. [Google Scholar] [CrossRef]
Becker, N.A.; Peters, J.P.; Maher, L.J., 3rd; Lionberger, T.A. Mechanism of promoter repression by Lac repressor-DNA loops. Nucleic Acids Res. 2013, 41, 156–166. [Google Scholar] [CrossRef]
Becker, N.A.; Kahn, J.D.; Maher, L.J., 3rd. Bacterial repression loops require enhanced DNA flexibility. J. Mol. Biol. 2005, 349, 716–730. [Google Scholar] [CrossRef]
Becker, N.A.; Kahn, J.D.; Maher, L.J., 3rd. Effects of nucleoid proteins on DNA repression loop formation in Escherichia coli. Nucleic Acids Res. 2007, 35, 3988–4000. [Google Scholar] [CrossRef]
Wang, W.; Li, G.W.; Chen, C.; Xie, X.S.; Zhuang, X. Chromosome organization by a nucleoid-associated protein in live bacteria. Science 2011, 333, 1445–1449. [Google Scholar] [CrossRef]
Cournac, A.; Plumbridge, J. DNA looping in prokaryotes: Experimental and theoretical approaches. J. Bacteriol. 2013, 195, 1109–1119. [Google Scholar] [CrossRef]
Gao, F.; Danson, A.E.; Ye, F.; Jovanovic, M.; Buck, M.; Zhang, X. Bacterial Enhancer Binding Proteins-AAA(+) Proteins in Transcription Activation. Biomolecules 2020, 10, 351. [Google Scholar] [CrossRef]
Levy, L.; Anavy, L.; Solomon, O.; Cohen, R.; Brunwasser-Meirom, M.; Ohayon, S.; Atar, O.; Goldberg, S.; Yakhini, Z.; Amit, R. A Synthetic Oligo Library and Sequencing Approach Reveals an Insulation Mechanism Encoded within Bacterial sigma(54) Promoters. Cell Rep. 2017, 21, 845–858. [Google Scholar] [CrossRef] [PubMed]
Danson, A.E.; Jovanovic, M.; Buck, M.; Zhang, X. Mechanisms of sigma(54)-Dependent Transcription Initiation and Regulation. J. Mol. Biol. 2019, 431, 3960–3974. [Google Scholar] [CrossRef] [PubMed]
Glyde, R.; Ye, F.; Darbari, V.C.; Zhang, N.; Buck, M.; Zhang, X. Structures of RNA Polymerase Closed and Intermediate Complexes Reveal Mechanisms of DNA Opening and Transcription Initiation. Mol. Cell 2017, 67, 106–116 e104. [Google Scholar] [CrossRef] [PubMed]
Bose, D.; Joly, N.; Pape, T.; Rappas, M.; Schumacher, J.; Buck, M.; Zhang, X. Dissecting the ATP hydrolysis pathway of bacterial enhancer-binding proteins. BioChem. Soc. Trans. 2008, 36, 83–88. [Google Scholar] [CrossRef] [PubMed][Green Version]
Buck, M.; Bose, D.; Burrows, P.; Cannon, W.; Joly, N.; Pape, T.; Rappas, M.; Schumacher, J.; Wigneshweraraj, S.; Zhang, X. A second paradigm for gene activation in bacteria. BioChem. Soc. Trans. 2006, 34, 1067–1071. [Google Scholar] [CrossRef]
Cannon, W.; Bordes, P.; Wigneshweraraj, S.R.; Buck, M. Nucleotide-dependent triggering of RNA polymerase-DNA interactions by an AAA regulator of transcription. J. Biol. Chem. 2003, 278, 19815–19825. [Google Scholar] [CrossRef]
Bordes, P.; Wigneshweraraj, S.R.; Chaney, M.; Dago, A.E.; Morett, E.; Buck, M. Communication between Esigma(54), promoter DNA and the conserved threonine residue in the GAFTGA motif of the PspF sigma-dependent activator during transcription activation. Mol. Microbiol. 2004, 54, 489–506. [Google Scholar] [CrossRef]
Rappas, M.; Schumacher, J.; Beuron, F.; Niwa, H.; Bordes, P.; Wigneshweraraj, S.; Keetch, C.A.; Robinson, C.V.; Buck, M.; Zhang, X. Structural insights into the activity of enhancer-binding proteins. Science 2005, 307, 1972–1975. [Google Scholar] [CrossRef]
Fishburn, J.; Tomko, E.; Galburt, E.; Hahn, S. Double-stranded DNA translocase activity of transcription factor TFIIH and the mechanism of RNA polymerase II open complex formation. Proc. Natl. Acad. Sci. USA 2015, 112, 3961–3966. [Google Scholar] [CrossRef]
Lin, Y.C.; Choi, W.S.; Gralla, J.D. TFIIH XPB mutants suggest a unified bacterial-like mechanism for promoter opening but not escape. Nat. Struct. Mol. Biol. 2005, 12, 603–607. [Google Scholar] [CrossRef]
Reitzer, L.J.; Magasanik, B. Transcription of glnA in E. coli is stimulated by activator bound to sites far from the promoter. Cell 1986, 45, 785–792. [Google Scholar] [CrossRef]
Hao, N.; Shearwin, K.E.; Dodd, I.B. Positive and Negative Control of Enhancer-Promoter Interactions by Other DNA Loops Generates Specificity and Tunability. Cell Rep. 2019, 26, 2419–2433 e2413. [Google Scholar] [CrossRef] [PubMed]
Su, W.; Porter, S.; Kustu, S.; Echols, H. DNA-looping and enhancer activity: Association between DNA-bound NtrC activator and RNA polymerase at the bacterial glnA promoter. Proc. Natl. Acad. Sci. USA 1990, 87, 5504–5508. [Google Scholar] [CrossRef] [PubMed]
Wedel, A.; Weiss, D.S.; Popham, D.; Droge, P.; Kustu, S. A bacterial enhancer functions to tether a transcriptional activator near a promoter. Science 1990, 248, 486–490. [Google Scholar] [CrossRef]
Ninfa, A.J.; Reitzer, L.J.; Magasanik, B. Initiation of transcription at the bacterial glnAp2 promoter by purified E. coli components is facilitated by enhancers. Cell 1987, 50, 1039–1046. [Google Scholar] [CrossRef]
Sasse-Dwight, S.; Gralla, J.D. Probing the Escherichia coli glnALG upstream activation mechanism in vivo. Proc. Natl. Acad. Sci. USA 1988, 85, 8934–8938. [Google Scholar] [CrossRef]
Yadav, D.; Ghosh, K.; Basu, S.; Roeder, R.G.; Biswas, D. Multivalent Role of Human TFIID in Recruiting Elongation Components at the Promoter-Proximal Region for Transcriptional Control. Cell Rep. 2019, 26, 1303–1317 e1307. [Google Scholar] [CrossRef]
Minchin, S.D.; Austin, S.; Dixon, R.A. Transcriptional activation of the Klebsiella pneumoniae nifLA promoter by NTRC is face-of-the-helix dependent and the activator stabilizes the interaction of sigma 54-RNA polymerase with the promoter. EMBO J. 1989, 8, 3491–3499. [Google Scholar] [CrossRef]
Wigneshweraraj, S.; Bose, D.; Burrows, P.C.; Joly, N.; Schumacher, J.; Rappas, M.; Pape, T.; Zhang, X.; Stockley, P.; Severinov, K.; et al. Modus operandi of the bacterial RNA polymerase containing the sigma54 promoter-specificity factor. Mol. Microbiol. 2008, 68, 538–546. [Google Scholar] [CrossRef]
Bonocora, R.P.; Smith, C.; Lapierre, P.; Wade, J.T. Genome-Scale Mapping of Escherichia coli sigma54 Reveals Widespread, Conserved Intragenic Binding. PLoS Genet. 2015, 11, e1005552. [Google Scholar] [CrossRef]
Xu, H.; Hoover, T.R. Transcriptional regulation at a distance in bacteria. Curr. Opin. Microbiol. 2001, 4, 138–144. [Google Scholar] [CrossRef]
Tinker-Kulberg, R.L.; Fu, T.J.; Geiduschek, E.P.; Kassavetis, G.A. A direct interaction between a DNA-tracking protein and a promoter recognition protein: Implications for searching DNA sequence. EMBO J. 1996, 15, 5032–5039. [Google Scholar] [CrossRef] [PubMed]
Nechaev, S.; Kamali-Moghaddam, M.; Andre, E.; Leonetti, J.P.; Geiduschek, E.P. The bacteriophage T4 late-transcription coactivator gp33 binds the flap domain of Escherichia coli RNA polymerase. Proc. Natl. Acad. Sci. USA 2004, 101, 17365–17370. [Google Scholar] [CrossRef] [PubMed]
Geiduschek, E.P.; Kassavetis, G.A. Transcription of the T4 late genes. Virol. J. 2010, 7, 288. [Google Scholar] [CrossRef] [PubMed]
Makela, J.; Sherratt, D. SMC complexes organize the bacterial chromosome by lengthwise compaction. Curr. Genet. 2020. [Google Scholar] [CrossRef] [PubMed]
Ravi, M.; Ramanathan, S.; Krishna, K. Factors, mechanisms and implications of chromatin condensation and chromosomal structural maintenance through the cell cycle. J. Cell Physiol. 2020, 235, 758–775. [Google Scholar] [CrossRef]
Skibbens, R.V. Condensins and cohesins—One of these things is not like the other! J. Cell Sci. 2019, 132. [Google Scholar] [CrossRef]
Marbouty, M.; Le Gall, A.; Cattoni, D.I.; Cournac, A.; Koh, A.; Fiche, J.B.; Mozziconacci, J.; Murray, H.; Koszul, R.; Nollmann, M. Condensin- and Replication-Mediated Bacterial Chromosome Folding and Origin Condensation Revealed by Hi-C and Super-resolution Imaging. Mol. Cell 2015, 59, 588–602. [Google Scholar] [CrossRef]
Wang, X.; Brandao, H.B.; Le, T.B.; Laub, M.T.; Rudner, D.Z. Bacillus subtilis SMC complexes juxtapose chromosome arms as they travel from origin to terminus. Science 2017, 355, 524–527. [Google Scholar] [CrossRef]
Wang, X.; Hughes, A.C.; Brandao, H.B.; Walker, B.; Lierz, C.; Cochran, J.C.; Oakley, M.G.; Kruse, A.C.; Rudner, D.Z. In Vivo Evidence for ATPase-Dependent DNA Translocation by the Bacillus subtilis SMC Condensin Complex. Mol. Cell 2018, 71, 841–847 e845. [Google Scholar] [CrossRef]
Wang, X.; Le, T.B.; Lajoie, B.R.; Dekker, J.; Laub, M.T.; Rudner, D.Z. Condensin promotes the juxtaposition of DNA flanking its loading site in Bacillus subtilis. Genes Dev. 2015, 29, 1661–1675. [Google Scholar] [CrossRef] [PubMed]
Dervyn, E.; Noirot-Gros, M.F.; Mervelet, P.; McGovern, S.; Ehrlich, S.D.; Polard, P.; Noirot, P. The bacterial condensin/cohesin-like protein complex acts in DNA repair and regulation of gene expression. Mol. Microbiol. 2004, 51, 1629–1640. [Google Scholar] [CrossRef]
Tran, N.T.; Laub, M.T.; Le, T.B.K. SMC Progressively Aligns Chromosomal Arms in Caulobacter crescentus but Is Antagonized by Convergent Transcription. Cell Rep. 2017, 20, 2057–2071. [Google Scholar] [CrossRef] [PubMed]
Heinz, S.; Texari, L.; Hayes, M.G.B.; Urbanowski, M.; Chang, M.W.; Givarkes, N.; Rialdi, A.; White, K.M.; Albrecht, R.A.; Pache, L.; et al. Transcription Elongation Can Affect Genome 3D Structure. Cell 2018, 174, 1522–1536 e1522. [Google Scholar] [CrossRef] [PubMed]
Lopez-Garcia, P.; Moreira, D. Cultured Asgard Archaea Shed Light on Eukaryogenesis. Cell 2020, 181, 232–235. [Google Scholar] [CrossRef] [PubMed]
Spang, A.; Saw, J.H.; Jorgensen, S.L.; Zaremba-Niedzwiedzka, K.; Martijn, J.; Lind, A.E.; van Eijk, R.; Schleper, C.; Guy, L.; Ettema, T.J.G. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 2015, 521, 173–179. [Google Scholar] [CrossRef]
Zaremba-Niedzwiedzka, K.; Caceres, E.F.; Saw, J.H.; Backstrom, D.; Juzokaite, L.; Vancaester, E.; Seitz, K.W.; Anantharaman, K.; Starnawski, P.; Kjeldsen, K.U.; et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 2017, 541, 353–358. [Google Scholar] [CrossRef]
Gehring, A.M.; Walker, J.E.; Santangelo, T.J. Transcription Regulation in Archaea. J. Bacteriol. 2016, 198, 1906–1917. [Google Scholar] [CrossRef]
Jun, S.H.; Reichlen, M.J.; Tajiri, M.; Murakami, K.S. Archaeal RNA polymerase and transcription regulation. Crit. Rev. BioChem. Mol. Biol. 2011, 46, 27–40. [Google Scholar] [CrossRef]
Facciotti, M.T.; Reiss, D.J.; Pan, M.; Kaur, A.; Vuthoori, M.; Bonneau, R.; Shannon, P.; Srivastava, A.; Donohoe, S.M.; Hood, L.E.; et al. General transcription factor specified global gene regulation in archaea. Proc. Natl. Acad. Sci. USA 2007, 104, 4630–4635. [Google Scholar] [CrossRef] [PubMed]
Karr, E.A. Transcription regulation in the third domain. Adv. Appl. Microbiol. 2014, 89, 101–133. [Google Scholar] [CrossRef] [PubMed]
Blombach, F.; Matelska, D.; Fouqueau, T.; Cackett, G.; Werner, F. Key Concepts and Challenges in Archaeal Transcription. J. Mol. Biol. 2019, 431, 4184–4201. [Google Scholar] [CrossRef] [PubMed]
Aravind, L.; Koonin, E.V. DNA-binding proteins and evolution of transcription regulation in the archaea. Nucleic Acids Res. 1999, 27, 4658–4670. [Google Scholar] [CrossRef] [PubMed]
Perez-Rueda, E.; Janga, S.C. Identification and genomic analysis of transcription factors in archaeal genomes exemplifies their functional architecture and evolutionary origin. Mol. Biol. Evol. 2010, 27, 1449–1459. [Google Scholar] [CrossRef] [PubMed]
Peeters, E.; Peixeiro, N.; Sezonov, G. Cis-regulatory logic in archaeal transcription. BioChem. Soc. Trans. 2013, 41, 326–331. [Google Scholar] [CrossRef]
Brinkman, A.B.; Bell, S.D.; Lebbink, R.J.; de Vos, W.M.; van der Oost, J. The Sulfolobus solfataricus Lrp-like protein LysM regulates lysine biosynthesis in response to lysine availability. J. Biol. Chem. 2002, 277, 29537–29549. [Google Scholar] [CrossRef]
Brinkman, A.B.; Ettema, T.J.; de Vos, W.M.; van der Oost, J. The Lrp family of transcriptional regulators. Mol. Microbiol. 2003, 48, 287–294. [Google Scholar] [CrossRef]
Lassak, K.; Peeters, E.; Wrobel, S.; Albers, S.V. The one-component system ArnR: A membrane-bound activator of the crenarchaeal archaellum. Mol. Microbiol. 2013, 88, 125–139. [Google Scholar] [CrossRef]
Ouhammouch, M.; Dewhurst, R.E.; Hausner, W.; Thomm, M.; Geiduschek, E.P. Activation of archaeal transcription by recruitment of the TATA-binding protein. Proc. Natl. Acad. Sci. USA 2003, 100, 5097–5102. [Google Scholar] [CrossRef]
Ouhammouch, M.; Langham, G.E.; Hausner, W.; Simpson, A.J.; El-Sayed, N.M.; Geiduschek, E.P. Promoter architecture and response to a positive regulator of archaeal transcription. Mol. Microbiol. 2005, 56, 625–637. [Google Scholar] [CrossRef]
Ouhammouch, M.; Werner, F.; Weinzierl, R.O.; Geiduschek, E.P. A fully recombinant system for activator-dependent archaeal transcription. J. Biol. Chem. 2004, 279, 51719–51721. [Google Scholar] [CrossRef] [PubMed]
Bleiholder, A.; Frommherz, R.; Teufel, K.; Pfeifer, F. Expression of multiple tfb genes in different Halobacterium salinarum strains and interaction of TFB with transcriptional activator GvpE. Arch. Microbiol. 2012, 194, 269–279. [Google Scholar] [CrossRef] [PubMed]
Teufel, K.; Pfeifer, F. Interaction of transcription activator GvpE with TATA-box-binding proteins of Halobacterium salinarum. Arch. Microbiol. 2010, 192, 143–149. [Google Scholar] [CrossRef] [PubMed]
Ochs, S.M.; Thumann, S.; Richau, R.; Weirauch, M.T.; Lowe, T.M.; Thomm, M.; Hausner, W. Activation of archaeal transcription mediated by recruitment of transcription factor B. J. Biol. Chem. 2012, 287, 18863–18871. [Google Scholar] [CrossRef]
Peng, N.; Ao, X.; Liang, Y.X.; She, Q. Archaeal promoter architecture and mechanism of gene activation. BioChem. Soc. Trans. 2011, 39, 99–103. [Google Scholar] [CrossRef]
Peng, N.; Xia, Q.; Chen, Z.; Liang, Y.X.; She, Q. An upstream activation element exerting differential transcriptional activation on an archaeal promoter. Mol. Microbiol. 2009, 74, 928–939. [Google Scholar] [CrossRef]
Peeters, E.; Albers, S.V.; Vassart, A.; Driessen, A.J.; Charlier, D. Ss-LrpB, a transcriptional regulator from Sulfolobus solfataricus, regulates a gene cluster with a pyruvate ferredoxin oxidoreductase-encoding operon and permease genes. Mol. Microbiol. 2009, 71, 972–988. [Google Scholar] [CrossRef]
Song, W.; Sharan, R.; Ovcharenko, I. The first enhancer in an enhancer chain safeguards subsequent enhancer-promoter contacts from a distance. Genome Biol. 2019, 20, 197. [Google Scholar] [CrossRef]
De los Rios, S.; Perona, J.J. Structure of the Escherichia coli leucine-responsive regulatory protein Lrp reveals a novel octameric assembly. J. Mol. Biol. 2007, 366, 1589–1602. [Google Scholar] [CrossRef]
Okamura, H.; Yokoyama, K.; Koike, H.; Yamada, M.; Shimowasa, A.; Kabasawa, M.; Kawashima, T.; Suzuki, M. A structural code for discriminating between transcription signals revealed by the feast/famine regulatory protein DM1 in complex with ligands. Structure 2007, 15, 1325–1338. [Google Scholar] [CrossRef]
Yokoyama, K.; Ishijima, S.A.; Clowney, L.; Koike, H.; Aramaki, H.; Tanaka, C.; Makino, K.; Suzuki, M. Feast/famine regulatory proteins (FFRPs): Escherichia coli Lrp, AsnC and related archaeal transcription factors. FEMS Microbiol. Rev. 2006, 30, 89–108. [Google Scholar] [CrossRef] [PubMed]
Peixeiro, N.; Keller, J.; Collinet, B.; Leulliot, N.; Campanacci, V.; Cortez, D.; Cambillau, C.; Nitta, K.R.; Vincentelli, R.; Forterre, P.; et al. Structure and function of AvtR, a novel transcriptional regulator from a hyperthermophilic archaeal lipothrixvirus. J. Virol. 2013, 87, 124–136. [Google Scholar] [CrossRef]
Sheppard, C.; Werner, F. Structure and mechanisms of viral transcription factors in archaea. Extremophiles 2017, 21, 829–838. [Google Scholar] [CrossRef] [PubMed]
Kessler, A.; Sezonov, G.; Guijarro, J.I.; Desnoues, N.; Rose, T.; Delepierre, M.; Bell, S.D.; Prangishvili, D. A novel archaeal regulatory protein, Sta1, activates transcription from viral promoters. Nucleic Acids Res. 2006, 34, 4837–4845. [Google Scholar] [CrossRef] [PubMed]
Deng, W.; Roberts, S.G. A core promoter element downstream of the TATA box that is recognized by TFIIB. Genes Dev. 2005, 19, 2418–2423. [Google Scholar] [CrossRef]
Kanai, T.; Akerboom, J.; Takedomi, S.; van de Werken, H.J.; Blombach, F.; van der Oost, J.; Murakami, T.; Atomi, H.; Imanaka, T. A global transcriptional regulator in Thermococcus kodakaraensis controls the expression levels of both glycolytic and gluconeogenic enzyme-encoding genes. J. Biol. Chem. 2007, 282, 33659–33670. [Google Scholar] [CrossRef]
Lee, S.J.; Surma, M.; Hausner, W.; Thomm, M.; Boos, W. The role of TrmB and TrmB-like transcriptional regulators for sugar transport and metabolism in the hyperthermophilic archaeon Pyrococcus furiosus. Arch. Microbiol. 2008, 190, 247–256. [Google Scholar] [CrossRef] [PubMed]
Lipscomb, G.L.; Keese, A.M.; Cowart, D.M.; Schut, G.J.; Thomm, M.; Adams, M.W.; Scott, R.A. SurR: A transcriptional activator and repressor controlling hydrogen and elemental sulphur metabolism in Pyrococcus furiosus. Mol. Microbiol. 2009, 71, 332–349. [Google Scholar] [CrossRef]
Peeters, E.; Thia-Toong, T.L.; Gigot, D.; Maes, D.; Charlier, D. Ss-LrpB, a novel Lrp-like regulator of Sulfolobus solfataricus P2, binds cooperatively to three conserved targets in its own control region. Mol. Microbiol. 2004, 54, 321–336. [Google Scholar] [CrossRef]
Peeters, E.; Willaert, R.; Maes, D.; Charlier, D. Ss-LrpB from Sulfolobus solfataricus condenses about 100 base pairs of its own operator DNA into globular nucleoprotein complexes. J. Biol. Chem. 2006, 281, 11721–11728. [Google Scholar] [CrossRef]
Rohs, R.; West, S.M.; Sosinsky, A.; Liu, P.; Mann, R.S.; Honig, B. The role of DNA shape in protein-DNA recognition. Nature 2009, 461, 1248–1253. [Google Scholar] [CrossRef] [PubMed]
Abe, N.; Dror, I.; Yang, L.; Slattery, M.; Zhou, T.; Bussemaker, H.J.; Rohs, R.; Mann, R.S. Deconvolving the recognition of DNA shape from sequence. Cell 2015, 161, 307–318. [Google Scholar] [CrossRef] [PubMed]
Joshi, R.; Passner, J.M.; Rohs, R.; Jain, R.; Sosinsky, A.; Crickmore, M.A.; Jacob, V.; Aggarwal, A.K.; Honig, B.; Mann, R.S. Functional specificity of a Hox protein mediated by the recognition of minor groove structure. Cell 2007, 131, 530–543. [Google Scholar] [CrossRef] [PubMed]
Kribelbauer, J.F.; Rastogi, C.; Bussemaker, H.J.; Mann, R.S. Low-Affinity Binding Sites and the Transcription Factor Specificity Paradox in Eukaryotes. Annu. Rev. Cell Dev. Biol. 2019, 35, 357–379. [Google Scholar] [CrossRef] [PubMed]
Ptashne, M. Principles of a switch. Nat. Chem. Biol. 2011, 7, 484–487. [Google Scholar] [CrossRef]
Kung, J.T.; Kesner, B.; An, J.Y.; Ahn, J.Y.; Cifuentes-Rojas, C.; Colognori, D.; Jeon, Y.; Szanto, A.; del Rosario, B.C.; Pinter, S.F.; et al. Locus-specific targeting to the X chromosome revealed by the RNA interactome of CTCF. Mol. Cell 2015, 57, 361–375. [Google Scholar] [CrossRef]
Lai, F.; Orom, U.A.; Cesaroni, M.; Beringer, M.; Taatjes, D.J.; Blobel, G.A.; Shiekhattar, R. Activating RNAs associate with Mediator to enhance chromatin architecture and transcription. Nature 2013, 494, 497–501. [Google Scholar] [CrossRef]
Sigova, A.A.; Abraham, B.J.; Ji, X.; Molinie, B.; Hannett, N.M.; Guo, Y.E.; Jangi, M.; Giallourakis, C.C.; Sharp, P.A.; Young, R.A. Transcription factor trapping by RNA in gene regulatory elements. Science 2015, 350, 978–981. [Google Scholar] [CrossRef]
Hirai, H.; Tani, T.; Kikyo, N. Structure and functions of powerful transactivators: VP16, MyoD and FoxA. Int J. Dev. Biol. 2010, 54, 1589–1596. [Google Scholar] [CrossRef]
Brown, C.E.; Howe, L.; Sousa, K.; Alley, S.C.; Carrozza, M.J.; Tan, S.; Workman, J.L. Recruitment of HAT complexes by direct activator interactions with the ATM-related Tra1 subunit. Science 2001, 292, 2333–2337. [Google Scholar] [CrossRef]
Sharma, D.; Fondell, J.D. Ordered recruitment of histone acetyltransferases and the TRAP/Mediator complex to thyroid hormone-responsive promoters in vivo. Proc. Natl. Acad. Sci. USA 2002, 99, 7934–7939. [Google Scholar] [CrossRef] [PubMed]
Neely, K.E.; Hassan, A.H.; Brown, C.E.; Howe, L.; Workman, J.L. Transcription activator interactions with multiple SWI/SNF subunits. Mol. Cell Biol. 2002, 22, 1615–1625. [Google Scholar] [CrossRef] [PubMed]
Krasnov, A.N.; Mazina, M.Y.; Nikolenko, J.V.; Vorobyeva, N.E. On the way of revealing coactivator complexes cross-talk during transcriptional activation. Cell BioSci. 2016, 6, 15. [Google Scholar] [CrossRef] [PubMed]
Mazina, M.Y.; Kovalenko, E.V.; Derevyanko, P.K.; Nikolenko, J.V.; Krasnov, A.N.; Vorobyeva, N.E. One signal stimulates different transcriptional activation mechanisms. Biochim. Biophys. Acta Gene Regul. Mech. 2018, 1861, 178–189. [Google Scholar] [CrossRef]
Cheong, J.H.; Yi, M.; Lin, Y.; Murakami, S. Human RPB5, a subunit shared by eukaryotic nuclear RNA polymerases, binds human hepatitis B virus X protein and may play a role in X transactivation. EMBO J. 1995, 14, 143–150. [Google Scholar] [CrossRef]
Lin, Y.; Nomura, T.; Cheong, J.; Dorjsuren, D.; Iida, K.; Murakami, S. Hepatitis B virus X protein is a transcriptional modulator that communicates with transcription factor IIB and the RNA polymerase II subunit 5. J. Biol. Chem. 1997, 272, 7132–7139. [Google Scholar] [CrossRef]
Kim, Y.K.; Bourgeois, C.F.; Isel, C.; Churcher, M.J.; Karn, J. Phosphorylation of the RNA polymerase II carboxyl-terminal domain by CDK9 is directly responsible for human immunodeficiency virus type 1 Tat-activated transcriptional elongation. Mol. Cell Biol. 2002, 22, 4622–4637. [Google Scholar] [CrossRef]
Paparidis, N.F.; Durvale, M.C.; Canduri, F. The emerging picture of CDK9/P-TEFb: More than 20 years of advances since PITALRE. Mol. Biosyst. 2017, 13, 246–276. [Google Scholar] [CrossRef]
Herrmann, C.H.; Gold, M.O.; Rice, A.P. Viral transactivators specifically target distinct cellular protein kinases that phosphorylate the RNA polymerase II C-terminal domain. Nucleic Acids Res. 1996, 24, 501–508. [Google Scholar] [CrossRef]
Cisek, L.J.; Corden, J.L. Phosphorylation of RNA polymerase by the murine homologue of the cell-cycle control protein cdc2. Nature 1989, 339, 679–684. [Google Scholar] [CrossRef]
Ibragimov, A.N.; Bylino, O.V.; Shidlovskii, Y.V. Molecular basis of the function of transcriptional enhancers. Cells 2020, 9, 1620. [Google Scholar] [CrossRef] [PubMed]
Fattori, J.; Campos, J.L.; Doratioto, T.R.; Assis, L.M.; Vitorino, M.T.; Polikarpov, I.; Xavier-Neto, J.; Figueira, A.C. RXR agonist modulates TR: Corepressor dissociation upon 9-cis retinoic acid treatment. Mol. Endocrinol. 2015, 29, 258–273. [Google Scholar] [CrossRef] [PubMed]
Ikeda, M.; Rhee, M.; Chin, W.W. Thyroid hormone receptor monomer, homodimer, and heterodimer (with retinoid-X receptor) contact different nucleotide sequences in thyroid hormone response elements. Endocrinology 1994, 135, 1628–1638. [Google Scholar] [CrossRef] [PubMed]
Lassar, A.B.; Davis, R.L.; Wright, W.E.; Kadesch, T.; Murre, C.; Voronova, A.; Baltimore, D.; Weintraub, H. Functional activity of myogenic HLH proteins requires hetero-oligomerization with E12/E47-like proteins in vivo. Cell 1991, 66, 305–315. [Google Scholar] [CrossRef]
Sun, X.H.; Baltimore, D. An inhibitory domain of E12 transcription factor prevents DNA binding in E12 homodimers but not in E12 heterodimers. Cell 1991, 64, 459–470. [Google Scholar] [CrossRef]
Amati, B.; Dalton, S.; Brooks, M.W.; Littlewood, T.D.; Evan, G.I.; Land, H. Transcriptional activation by the human c-Myc oncoprotein in yeast requires interaction with Max. Nature 1992, 359, 423–426. [Google Scholar] [CrossRef]
Blackwood, E.M.; Eisenman, R.N. Max: A helix-loop-helix zipper protein that forms a sequence-specific DNA-binding complex with Myc. Science 1991, 251, 1211–1217. [Google Scholar] [CrossRef]
Kato, G.J.; Lee, W.M.; Chen, L.L.; Dang, C.V. Max: Functional domains and interaction with c-Myc. Genes Dev. 1992, 6, 81–92. [Google Scholar] [CrossRef]
Littlewood, T.D.; Amati, B.; Land, H.; Evan, G.I. Max and c-Myc/Max DNA-binding activities in cell extracts. Oncogene 1992, 7, 1783–1792. [Google Scholar]
Baniahmad, C.; Muller, M.; Altschmied, J.; Renkawitz, R. Co-operative binding of the glucocorticoid receptor DNA binding domain is one of at least two mechanisms for synergism. J. Mol. Biol. 1991, 222, 155–165. [Google Scholar] [CrossRef]
Berkes, C.A.; Bergstrom, D.A.; Penn, B.H.; Seaver, K.J.; Knoepfler, P.S.; Tapscott, S.J. Pbx marks genes for activation by MyoD indicating a role for a homeodomain protein in establishing myogenic potential. Mol. Cell 2004, 14, 465–477. [Google Scholar] [CrossRef]
Ptashne, M.; Gann, A. Genes and Signals; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY, USA, 2002. [Google Scholar]
Iwafuchi-Doi, M.; Zaret, K.S. Pioneer transcription factors in cell reprogramming. Genes Dev. 2014, 28, 2679–2692. [Google Scholar] [CrossRef]
Yamada, S.; Whitney, P.H.; Huang, S.K.; Eck, E.C.; Garcia, H.G.; Rushlow, C.A. The Drosophila Pioneer Factor Zelda Modulates the Nuclear Microenvironment of a Dorsal Target Enhancer to Potentiate Transcriptional Output. Curr. Biol. 2019, 29, 1387–1393 e1385. [Google Scholar] [CrossRef] [PubMed]
Dufourt, J.; Trullo, A.; Hunter, J.; Fernandez, C.; Lazaro, J.; Dejean, M.; Morales, L.; Nait-Amer, S.; Schulz, K.N.; Harrison, M.M.; et al. Temporal control of gene expression by the pioneer factor Zelda through transient interactions in hubs. Nat. Commun. 2018, 9, 5194. [Google Scholar] [CrossRef]
Cirillo, L.A.; Lin, F.R.; Cuesta, I.; Friedman, D.; Jarnik, M.; Zaret, K.S. Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. Mol. Cell 2002, 9, 279–289. [Google Scholar] [CrossRef]
Cirillo, L.A.; McPherson, C.E.; Bossard, P.; Stevens, K.; Cherian, S.; Shim, E.Y.; Clark, K.L.; Burley, S.K.; Zaret, K.S. Binding of the winged-helix transcription factor HNF3 to a linker histone site on the nucleosome. EMBO J. 1998, 17, 244–254. [Google Scholar] [CrossRef] [PubMed]
Clark, K.L.; Halay, E.D.; Lai, E.; Burley, S.K. Co-crystal structure of the HNF-3/fork head DNA-recognition motif resembles histone H5. Nature 1993, 364, 412–420. [Google Scholar] [CrossRef] [PubMed]
Cirillo, L.A.; Zaret, K.S. An early developmental transcription factor complex that is more stable on nucleosome core particles than on free DNA. Mol. Cell 1999, 4, 961–969. [Google Scholar] [CrossRef]
Lupien, M.; Eeckhoute, J.; Meyer, C.A.; Wang, Q.; Zhang, Y.; Li, W.; Carroll, J.S.; Liu, X.S.; Brown, M. FoxA1 translates epigenetic signatures into enhancer-driven lineage-specific transcription. Cell 2008, 132, 958–970. [Google Scholar] [CrossRef]
Dilworth, F.J.; Seaver, K.J.; Fishburn, A.L.; Htet, S.L.; Tapscott, S.J. In vitro transcription system delineates the distinct roles of the coactivators pCAF and p300 during MyoD/E47-dependent transactivation. Proc. Natl. Acad. Sci. USA 2004, 101, 11593–11598. [Google Scholar] [CrossRef]
Perry, R.L.; Parker, M.H.; Rudnicki, M.A. Activated MEK1 binds the nuclear MyoD transcriptional complex to repress transactivation. Mol. Cell 2001, 8, 291–301. [Google Scholar] [CrossRef]
De la Serna, I.L.; Ohkawa, Y.; Berkes, C.A.; Bergstrom, D.A.; Dacwag, C.S.; Tapscott, S.J.; Imbalzano, A.N. MyoD targets chromatin remodeling complexes to the myogenin locus prior to forming a stable DNA-bound complex. Mol. Cell Biol. 2005, 25, 3997–4009. [Google Scholar] [CrossRef] [PubMed]
Panne, D.; Maniatis, T.; Harrison, S.C. An atomic model of the interferon-beta enhanceosome. Cell 2007, 129, 1111–1123. [Google Scholar] [CrossRef] [PubMed]
Chen, J.; Zhang, Z.; Li, L.; Chen, B.C.; Revyakin, A.; Hajj, B.; Legant, W.; Dahan, M.; Lionnet, T.; Betzig, E.; et al. Single-molecule dynamics of enhanceosome assembly in embryonic stem cells. Cell 2014, 156, 1274–1285. [Google Scholar] [CrossRef] [PubMed]
Kong, S.; Bohl, D.; Li, C.; Tuan, D. Transcription of the HS2 enhancer toward a cis-linked gene is independent of the orientation, position, and distance of the enhancer relative to the gene. Mol. Cell Biol. 1997, 17, 3955–3965. [Google Scholar] [CrossRef] [PubMed]
Blackwood, E.M.; Kadonaga, J.T. Going the distance: A current view of enhancer action. Science 1998, 281, 60–63. [Google Scholar] [CrossRef]
Furlong, E.E.M.; Levine, M. Developmental enhancers and chromosome topology. Science 2018, 361, 1341–1345. [Google Scholar] [CrossRef]
Hatzis, P.; Talianidis, I. Dynamics of enhancer-promoter communication during differentiation-induced gene activation. Mol. Cell 2002, 10, 1467–1477. [Google Scholar] [CrossRef]
Bulger, M.; Groudine, M. Looping versus linking: Toward a model for long-distance gene activation. Genes Dev. 1999, 13, 2465–2477. [Google Scholar] [CrossRef]
Vernimmen, D.; Bickmore, W.A. The Hierarchy of Transcriptional Activation: From Enhancer to Promoter. Trends Genet. 2015, 31, 696–708. [Google Scholar] [CrossRef]
Morcillo, P.; Rosen, C.; Baylies, M.K.; Dorsett, D. Chip, a widely expressed chromosomal protein required for segmentation and activity of a remote wing margin enhancer in Drosophila. Genes Dev. 1997, 11, 2729–2740. [Google Scholar] [CrossRef] [PubMed]
Johnson, A. A. A combinatorial regulatory circuit in budding yeast. In Transcriptional Regulation; McKnight, S.L., Yamamoto, K.R., Eds.; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY, USA, 1992; pp. 975–1006. [Google Scholar]
Dorsett, D. Distant liaisons: Long-range enhancer-promoter interactions in Drosophila. Curr. Opin. Genet. Dev. 1999, 9, 505–514. [Google Scholar] [CrossRef]
Bach, I.; Carriere, C.; Ostendorff, H.P.; Andersen, B.; Rosenfeld, M.G. A family of LIM domain-associated cofactors confer transcriptional synergism between LIM and Otx homeodomain proteins. Genes Dev. 1997, 11, 1370–1380. [Google Scholar] [CrossRef] [PubMed]
Deng, W.; Lee, J.; Wang, H.; Miller, J.; Reik, A.; Gregory, P.D.; Dean, A.; Blobel, G.A. Controlling long-range genomic interactions at a native locus by targeted tethering of a looping factor. Cell 2012, 149, 1233–1244. [Google Scholar] [CrossRef] [PubMed]
Krivega, I.; Dean, A. LDB1-mediated enhancer looping can be established independent of mediator and cohesin. Nucleic Acids Res. 2017, 45, 8255–8268. [Google Scholar] [CrossRef]
Schoenfelder, S.; Fraser, P. Long-range enhancer-promoter contacts in gene expression control. Nat. Rev. Genet. 2019, 20, 437–455. [Google Scholar] [CrossRef]
Lieberman-Aiden, E.; van Berkum, N.L.; Williams, L.; Imakaev, M.; Ragoczy, T.; Telling, A.; Amit, I.; Lajoie, B.R.; Sabo, P.J.; Dorschner, M.O.; et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 2009, 326, 289–293. [Google Scholar] [CrossRef]
Dowen, J.M.; Fan, Z.P.; Hnisz, D.; Ren, G.; Abraham, B.J.; Zhang, L.N.; Weintraub, A.S.; Schujiers, J.; Lee, T.I.; Zhao, K.; et al. Control of cell identity genes occurs in insulated neighborhoods in mammalian chromosomes. Cell 2014, 159, 374–387. [Google Scholar] [CrossRef]
Rao, S.S.; Huntley, M.H.; Durand, N.C.; Stamenova, E.K.; Bochkov, I.D.; Robinson, J.T.; Sanborn, A.L.; Machol, I.; Omer, A.D.; Lander, E.S.; et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 2014, 159, 1665–1680. [Google Scholar] [CrossRef]
Rao, S.S.P.; Huang, S.C.; Glenn St Hilaire, B.; Engreitz, J.M.; Perez, E.M.; Kieffer-Kwon, K.R.; Sanborn, A.L.; Johnstone, S.E.; Bascom, G.D.; Bochkov, I.D.; et al. Cohesin Loss Eliminates All Loop Domains. Cell 2017, 171, 305–320 e324. [Google Scholar] [CrossRef]
Rowley, M.J.; Nichols, M.H.; Lyu, X.; Ando-Kuri, M.; Rivera, I.S.M.; Hermetz, K.; Wang, P.; Ruan, Y.; Corces, V.G. Evolutionarily Conserved Principles Predict 3D Chromatin Organization. Mol. Cell 2017, 67, 837–852 e837. [Google Scholar] [CrossRef] [PubMed]
Beagan, J.A.; Phillips-Cremins, J.E. On the existence and functionality of topologically associating domains. Nat. Genet. 2020, 52, 8–16. [Google Scholar] [CrossRef] [PubMed]
Gambetta, M.C.; Furlong, E.E.M. The Insulator Protein CTCF Is Required for Correct Hox Gene Expression, but Not for Embryonic Development in Drosophila. Genetics 2018, 210, 129–136. [Google Scholar] [CrossRef]
Fudenberg, G.; Abdennur, N.; Imakaev, M.; Goloborodko, A.; Mirny, L.A. Emerging Evidence of Chromosome Folding by Loop Extrusion. In Cold Spring Harbor Symposia on Quantitative Biology; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY, USA, 2017; Volume 82, pp. 45–55. [Google Scholar] [CrossRef]
Hsieh, T.H.; Weiner, A.; Lajoie, B.; Dekker, J.; Friedman, N.; Rando, O.J. Mapping Nucleosome Resolution Chromosome Folding in Yeast by Micro-C. Cell 2015, 162, 108–119. [Google Scholar] [CrossRef] [PubMed]
Krietenstein, N.; Abraham, S.; Venev, S.V.; Abdennur, N.; Gibcus, J.; Hsieh, T.S.; Parsi, K.M.; Yang, L.; Maehr, R.; Mirny, L.A.; et al. Ultrastructural Details of Mammalian Chromosome Architecture. Mol. Cell 2020, 78, 554–565 e557. [Google Scholar] [CrossRef] [PubMed]
Nora, E.P.; Caccianini, L.; Fudenberg, G.; Kameswaran, V.; Nagle, A.; Uebersohn, A.; So, K.; Hajj, B.; Le Saux, A.; Coulon, A.; et al. Molecular basis of CTCF binding polarity in genome folding. bioRxiv 2019. [Google Scholar] [CrossRef]
Ganji, M.; Shaltiel, I.A.; Bisht, S.; Kim, E.; Kalichava, A.; Haering, C.H.; Dekker, C. Real-time imaging of DNA loop extrusion by condensin. Science 2018, 360, 102–105. [Google Scholar] [CrossRef]
Matthews, N.E.; White, R. Chromatin Architecture in the Fly: Living without CTCF/Cohesin Loop Extrusion?: Alternating Chromatin States Provide a Basis for Domain Architecture in Drosophila. Bioessays 2019, 41, e1900048. [Google Scholar] [CrossRef]
Hou, C.; Li, L.; Qin, Z.S.; Corces, V.G. Gene density, transcription, and insulators contribute to the partition of the Drosophila genome into physical domains. Mol. Cell 2012, 48, 471–484. [Google Scholar] [CrossRef]
Sexton, T.; Yaffe, E.; Kenigsberg, E.; Bantignies, F.; Leblanc, B.; Hoichman, M.; Parrinello, H.; Tanay, A.; Cavalli, G. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 2012, 148, 458–472. [Google Scholar] [CrossRef] [PubMed]
Stadler, M.R.; Haines, J.E.; Eisen, M.B. Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo. Elife 2017, 6. [Google Scholar] [CrossRef] [PubMed]
Li, L.; Lyu, X.; Hou, C.; Takenaka, N.; Nguyen, H.Q.; Ong, C.T.; Cubenas-Potts, C.; Hu, M.; Lei, E.P.; Bosco, G.; et al. Widespread rearrangement of 3D chromatin organization underlies polycomb-mediated stress-induced silencing. Mol. Cell 2015, 58, 216–231. [Google Scholar] [CrossRef] [PubMed]
Cubenas-Potts, C.; Rowley, M.J.; Lyu, X.; Li, G.; Lei, E.P.; Corces, V.G. Different enhancer classes in Drosophila bind distinct architectural proteins and mediate unique chromatin interactions and 3D architecture. Nucleic Acids Res. 2017, 45, 1714–1730. [Google Scholar] [CrossRef] [PubMed]
Eser, U.; Chandler-Brown, D.; Ay, F.; Straight, A.F.; Duan, Z.; Noble, W.S.; Skotheim, J.M. Form and function of topologically associating genomic domains in budding yeast. Proc. Natl. Acad. Sci. USA 2017, 114, E3061–E3070. [Google Scholar] [CrossRef] [PubMed]
Vo Ngoc, L.; Kassavetis, G.A.; Kadonaga, J.T. The RNA Polymerase II Core Promoter in Drosophila. Genetics 2019, 212, 13–24. [Google Scholar] [CrossRef]
Vo Ngoc, L.; Wang, Y.L.; Kassavetis, G.A.; Kadonaga, J.T. The punctilious RNA polymerase II core promoter. Genes Dev. 2017, 31, 1289–1301. [Google Scholar] [CrossRef]
Zabidi, M.A.; Arnold, C.D.; Schernhuber, K.; Pagani, M.; Rath, M.; Frank, O.; Stark, A. Enhancer-core-promoter specificity separates developmental and housekeeping gene regulation. Nature 2015, 518, 556–559. [Google Scholar] [CrossRef]
Zabidi, M.A.; Stark, A. Regulatory Enhancer-Core-Promoter Communication via Transcription Factors and Cofactors. Trends Genet. 2016, 32, 801–814. [Google Scholar] [CrossRef]
Beagan, J.A.; Duong, M.T.; Titus, K.R.; Zhou, L.; Cao, Z.; Ma, J.; Lachanski, C.V.; Gillis, D.R.; Phillips-Cremins, J.E. YY1 and CTCF orchestrate a 3D chromatin looping switch during early neural lineage commitment. Genome Res. 2017, 27, 1139–1152. [Google Scholar] [CrossRef]
Weintraub, A.S.; Li, C.H.; Zamudio, A.V.; Sigova, A.A.; Hannett, N.M.; Day, D.S.; Abraham, B.J.; Cohen, M.A.; Nabet, B.; Buckley, D.L.; et al. YY1 Is a Structural Regulator of Enhancer-Promoter Loops. Cell 2017, 171, 1573–1588 e1528. [Google Scholar] [CrossRef] [PubMed]
Lee, K.H.; Evans, S.; Ruan, T.Y.; Lassar, A.B. SMAD-mediated modulation of YY1 activity regulates the BMP response and cardiac-specific expression of a GATA4/5/6-dependent chick Nkx2.5 enhancer. Development 2004, 131, 4709–4723. [Google Scholar] [CrossRef]
Gabriele, M.; Vulto-van Silfhout, A.T.; Germain, P.L.; Vitriolo, A.; Kumar, R.; Douglas, E.; Haan, E.; Kosaki, K.; Takenouchi, T.; Rauch, A.; et al. YY1 Haploinsufficiency Causes an Intellectual Disability Syndrome Featuring Transcriptional and Chromatin Dysfunction. Am. J. Hum. Genet. 2017, 100, 907–925. [Google Scholar] [CrossRef] [PubMed]
Bracken, A.P.; Helin, K. Polycomb group proteins: Navigators of lineage pathways led astray in cancer. Nat. Rev. Cancer 2009, 9, 773–784. [Google Scholar] [CrossRef]
Cai, Y.; Jin, J.; Yao, T.; Gottschalk, A.J.; Swanson, S.K.; Wu, S.; Shi, Y.; Washburn, M.P.; Florens, L.; Conaway, R.C.; et al. YY1 functions with INO80 to activate transcription. Nat. Struct. Mol. Biol. 2007, 14, 872–874. [Google Scholar] [CrossRef]
Burton, S.P.; Burton, Z.F. The sigma enigma: Bacterial sigma factors, archaeal TFB and eukaryotic TFIIB are homologs. Transcription 2014, 5, e967599. [Google Scholar] [CrossRef] [PubMed]
Knutson, B.A.; Hahn, S. TFIIB-related factors in RNA polymerase I transcription. Biochim Biophys. Acta 2013, 1829, 265–273. [Google Scholar] [CrossRef]
Akhtar, W.; Veenstra, G.J. TBP-related factors: A paradigm of diversity in transcription initiation. Cell BioSci. 2011, 1, 23. [Google Scholar] [CrossRef]
Kopytova, D.V.; Krasnov, A.N.; Kopantceva, M.R.; Nabirochkina, E.N.; Nikolenko, J.V.; Maksimenko, O.; Kurshakova, M.M.; Lebedeva, L.A.; Yerokhin, M.M.; Simonova, O.B.; et al. Two isoforms of Drosophila TRF2 are involved in embryonic development, premeiotic chromatin condensation, and proper differentiation of germ cells of both sexes. Mol. Cell Biol. 2006, 26, 7492–7505. [Google Scholar] [CrossRef]
Langkjaer, R.B.; Cliften, P.F.; Johnston, M.; Piskur, J. Yeast genome duplication was followed by asynchronous differentiation of duplicated genes. Nature 2003, 421, 848–852. [Google Scholar] [CrossRef]
Tirosh, I.; Barkai, N. Comparative analysis indicates regulatory neofunctionalization of yeast duplicates. Genome Biol. 2007, 8, R50. [Google Scholar] [CrossRef]
Wolfe, K.H. Origin of the Yeast Whole-Genome Duplication. PLoS Biol. 2015, 13, e1002221. [Google Scholar] [CrossRef] [PubMed]
Conaway, R.C.; Conaway, J.W. Origins and activity of the Mediator complex. Semin. Cell Dev. Biol. 2011, 22, 729–734. [Google Scholar] [CrossRef]
Nagulapalli, M.; Maji, S.; Dwivedi, N.; Dahiya, P.; Thakur, J.K. Evolution of disorder in Mediator complex and its functional relevance. Nucleic Acids Res. 2016, 44, 1591–1612. [Google Scholar] [CrossRef] [PubMed]
Aravind, L.; Burroughs, A.M.; Zhang, D.; Iyer, L.M. Protein and DNA modifications: Evolutionary imprints of bacterial biochemical diversification and geochemistry on the provenance of eukaryotic epigenetics. Cold Spring Harb. Perspect. Biol. 2014, 6, a016063. [Google Scholar] [CrossRef] [PubMed][Green Version]
Maksimenko, O.; Georgiev, P. Mechanisms and proteins involved in long-distance interactions. Front. Genet. 2014, 5, 28. [Google Scholar] [CrossRef]
Erokhin, M.; Vassetzky, Y.; Georgiev, P.; Chetverina, D. Eukaryotic enhancers: Common features, regulation, and participation in diseases. Cell Mol. Life Sci. 2015, 72, 2361–2375. [Google Scholar] [CrossRef]
Xia, J.H.; Wei, G.H. Enhancer Dysfunction in 3D Genome and Disease. Cells 2019, 8, 1281. [Google Scholar] [CrossRef]

Figure 1. Enhancer-like mechanisms of transcription regulation in prokaryotes. (A) Stimulation of RNAP by an adjacent activator. Pho-regulon genes are responsible for adaptation of bacteria to phosphorus starvation. Expression of the Pho-regulon genes is regulated by a two-component system through direct contact of an activator PhoB with the RNAP–σ⁷⁰ complex. Lack of phosphorus in the medium leads to phosphorylation of PhoB (PhoB-P), which then acts as an intracellular soluble response regulator. PhoB-P as a dimer binds to its binding sites at the promoters of Pho-regulon genes (pho-boxes), thus compensating for the weak −35 element, and recruits the RNAP–σ⁷⁰ complex to the promoters. The physical interaction of PhoB-P with σ⁷⁰ in the region of the −35 element is shown. (B) Multiple activation of RNAP by several activators. Regulation of the araBAD promoter is shown. The upper panel illustrates the repression state in which the promoter is repressed by the AraC regulator. The lower panel shows the transition from repression to activation, which occurs by changing the conformation of the promoter DNA. The repression loop is broken by the CAP protein; the promoter is bent by CAP and the site of AraC binding is switched. Changing the promoter DNA conformation allows the activators to interact with RNAP. (C) Repression loop. Two alternative conformations of the lac operon promoter are shown. The lac-operon is regulated by a one-component regulatory system. The system includes the Lac repressor protein (LacI), which acts as both a lactose sensor and a transcriptional regulator. In the absence of lactose (or in the presence of glucose), the Lac repressor binds to several operator sites at the P_lac and negatively regulates expression of the lacZYA operon through the formation of a repression loop between the operator sites. The binding of lactose or IPTG affects the LacI conformation, causing LacI to disconnect from the operator sites, and the repression loop disappears. (D) Activation loop. An activation loop forms when glnALG operon expression is upregulated by the NtrC regulator. When ammonia is depleted in the medium, NtrC undergoes phosphorylation, assembles into heptamers, and binds to an upstream enhancer-like sequence at the promoter of the glnALG operon. The loop forms between the NtrC binding sites and core promoter. NtrC remodels the RNAP–σ⁵⁴ holoenzyme into the open complex. The remodeling reaction is ATP dependent. Nucleoid-associated proteins (NAPs) may facilitate the looping mechanism by bending the DNA between the enhancer and promoter. (E) Activation ring. The scanning/tracking mechanism of regulation of bacteriophage T4 late genes is shown. A component ring consisting of three gp45 phage polypeptides is put onto DNA at the enhancer region by the loader complex gp44–gp62. A part of the split σ-subunit known as the gp55 protein (σ55) tracks along DNA as a gp45 ligand. At the gene promoter region, the ring activation complex encounters RNAP with gp33 (a second part of the split σ) and mediates the transition of the closed complex to the open one and activation of transcription.

Figure 2. An integrated circuit of the initial stages of transcription activation in archaea. When an incoming signal arrives, a transcriptional activator acquires the ability to bind to its DNA sites in the upstream activating sequence (UAS) region in the promoter of a target gene. Activator binding occurs both in the primary site, which is closest to the promoter, and auxiliary sites, which are distal to the primary site. The appearance of nucleation sites near the core promoter allows the activator to guide TBP to the TATA box (or TFB to the BRE element). This is accompanied by DNA bending in the promoter region, like in the case of activation in bacteria, bringing the regulatory elements of the promoter closer together. The activator may leave the complex when the complex proceeds to the stage of transcriptional bubble or stay in the complex until promoter clearance by RNAP (not shown).

Figure 3. Mechanisms of transcription regulation by enhancers in eukaryotes. (A) The regulatory system of yeasts. The UAS works in orientation-independent manner only when placed upstream of the promoter, but not downstream of the gene or in an intron. The yeast UAS is usually close to the promoter (at the distance less than 1 Kb). (B) Linking/Chaining mechanism of enhancer–promoter communication. A linking factor oligomerizes from the enhancer to the promoter using some anchored proteins to bridge the distal regulatory elements. This cascade of recruitments starts at the enhancer and proceeds until reaching the core promoter, where the preinitiation complex (PIC) is recruited. Alternatively, oligomerization factor-containing complexes might bind directly to relatively hyperacetylated chromatin in the absence of any anchor protein, possibly, by recognizing a signature of active loci, such as a specific pattern of histone acetylation. (C) Scanning/Tracking mechanism of enhancer–promoter communication. An activator binds to the enhancer region and utilizes its activating domain to recruit the other components of the initial transcription complex to the region of the enhancer, the set including basal factors, RNAP, and coactivators. RNAP moves from its initial binding site at the enhancer towards the core promoter, producing non-coding RNAs, which may also represent a platform for the binding of regulatory proteins and coactivators. Although the recruitment of the factors relevant for the initiation of transcription occurs at the enhancer region, transcription is not effectively initiated because the enhancer lacks all motifs necessary for the efficient binding of basic factors that ensure the recruitment of RNAP. At the core promoter, the total set of motifs favorable to preinitiation complex assembly is available and, after the formation of a stable transcriptional complex on the core promoter, the DNA loop between the core promoter and the enhancer is stabilized. The process of RNAP movement towards the core promoter may be accompanied by a wave of unidirectional spreading of histone acetylation through the DNA that separates the enhancer and the promoter (not shown), as well as by histone remodeling in the promoter region. Thus, scanning/tracking can be accompanied by the formation of an acetylated domain between cis-regulatory elements, as well as the formation of regions with a reduced density of nucleosomes. (D) Genome architecture of eukaryotes (schematic Hi-C maps are shown). Yeasts are devoid of TADs and have only self-associating domains (micro-TADs), which are associated with gene loops, and ~200 Kb TADs, which are associated with DNA replication. In Drosophila, the architectural proteins can mediate the interactions of cis-regulatory elements inside TADs (indicated with 1). Additionally, flies have long-range looping interactions, which are associated with the function of Polycomb silencing complexes (the long-range looping interactions are indicated with 2). Humans possess all of the mechanisms that mediate enhancer–promoter communication in Drosophila, but, unlike flies, mammals additionally have a cohesin-dependent loop extrusion mechanism. The CTCF protein is found at the bases of TADs of different sizes in mammals and is believed to be a master regulator of the chromosomal architecture, while Drosophila CTCF acts as an ordinary architectural insulator protein. Cohesin-dependent loop extrusion is an evolutionary acquisition of mammals compared with Drosophila (cohesin-CTCF mediated interactions are indicated with 2,3).