Composition of Transcription Machinery and Its Crosstalk with Nucleoid-Associated Proteins and Global Transcription Factors

The coordination of bacterial genomic transcription involves an intricate network of interdependent genes encoding nucleoid-associated proteins (NAPs), DNA topoisomerases, RNA polymerase subunits and modulators of transcription machinery. The central element of this homeostatic regulatory system, integrating the information on cellular physiological state and producing a corresponding transcriptional response, is the multi-subunit RNA polymerase (RNAP) holoenzyme. In this review article, we argue that recent observations revealing DNA topoisomerases and metabolic enzymes associated with RNAP supramolecular complex support the notion of structural coupling between transcription machinery, DNA topology and cellular metabolism as a fundamental device coordinating the spatiotemporal genomic transcription. We analyse the impacts of various combinations of RNAP holoenzymes and global transcriptional regulators such as abundant NAPs, on genomic transcription from this viewpoint, monitoring the spatiotemporal patterns of couplons—overlapping subsets of the regulons of NAPs and RNAP sigma factors. We show that the temporal expression of regulons is by and large, correlated with that of cognate regulatory genes, whereas both the spatial organization and temporal expression of couplons is distinctly impacted by the regulons of NAPs and sigma factors. We propose that the coordination of the growth phase-dependent concentration gradients of global regulators with chromosome configurational dynamics determines the spatiotemporal patterns of genomic expression.


Introduction
The bacterial transcriptional regulation system is one of the most fascinating of evolutionary devices. It comprises an intricate network of interdependent genes modulating the chromosome dynamics and thus providing an integrated response to changing environmental conditions [1][2][3][4]. The main hallmark of this system (also a stumbling block for exploring it) is its organizational complexity, featuring control mechanisms involving spatiotemporally coordinated communications between its analog and digital components [5,6]. In this system, the unique sequences of individual genes represent discontinuous (i.e., digital) entities expressed as continuous (analog) variables-different species of indistinguishable protein molecules produced at various concentrations, including the abundant DNA binding proteins. The latter in turn determine the occupation pattern of chromosomal DNA Whatever the implicated type of transcriptional control, the central element of genetic regulation system is the multi-subunit RNAP holoenzyme, which not only transcribes genes adjusting the cellular metabolism to changing environmental conditions, but also receives numerous signals from the environment by interacting with regulatory proteins and other molecules providing various impacts according to the physiological state of the cell. These include factors involved in transcription initiation, elongation and termination, RNAP secondary channel modulators and small molecules, such as the purine nucleotide derivative alarmones pppGpp and ppGpp that play a key role in stringent response as well as their putative opponent pppApp [38]. Ultimately, all these factors are involved in analog control by modifying and adjusting the RNAP composition and function to DNA topology and chromatin architecture [1,[38][39][40].
Importantly, previous studies using various approaches made it increasingly clear, that in vivo the RNAP holoenzyme can be closely associated with DNA topoisomerases, ribosomal proteins and metabolic enzymes [41][42][43][44][45]. While the composition of such a multiprotein RNAP assembly, designated the RNAP supramolecular complex, critically depends on the applied isolation procedure, it was found to vary with growth phase [43,46] and be alterable by mutations of NAP genes [45] (Figure 1; Supplementary Table S1). Despite variable composition, some of the RNAP-associated proteins such as the ribosomal proteins, DNA gyrase subunits and certain metabolic enzymes (e.g., inosine 5 -monophosphate dehydrogensase-IMPDH), have been identified repeatedly by different approaches. Detected ribosomal proteins most likely reflect the association of the transcribing RNAP with ribosomes in the expressome complex functionally coupling transcription with translation in E. coli and Mycoplasma but not in Gram positive model bacterium B. subtilis and other Firmicutes, where most of the operons are translated without a closely trailing ribosome and where this "runaway transcription" creates different rules for cotranscriptional regulation [47,48]. Interestingly, while about a third of the 30S ribosomal subunit proteins were found to be associated with RNAP supramolecular complex isolated by heparin column chromatography, none of the 50S subunit proteins were detected (Figure 1; Supplementary  Table S1), supporting the view that RNAP forms a complex with the 30S ribosomal subunit prior to expressome formation. The enzymes that co-purify with RNA polymerase, including those involved in tRNA processing, nucleotide metabolism, and energy biosynthesis, were proposed to be necessary for optimal transcription rates [43]. Notably, among the RNAP-associated metabolic enzymes, inositol monophosphatase (SuhB) is an integral part of the ribosomal antitermination complex involved in rRNA maturation and ribosome assembly [49]. SuhB both interacts with RNA and competes with C-terminal domain of the RNAP α subunit (αCTD) for binding the antitermination protein NusA [50]. lecular complex isolated by heparin column chromatography, none of the 50S subunit proteins were detected (Figure 1; Supplementary Table S1), supporting the view that RNAP forms a complex with the 30S ribosomal subunit prior to expressome formation. The enzymes that co-purify with RNA polymerase, including those involved in tRNA processing, nucleotide metabolism, and energy biosynthesis, were proposed to be necessary for optimal transcription rates [43]. Notably, among the RNAP-associated metabolic enzymes, inositol monophosphatase (SuhB) is an integral part of the ribosomal antitermination complex involved in rRNA maturation and ribosome assembly [49]. SuhB both interacts with RNA and competes with C-terminal domain of the RNAP α subunit (αCTD) for binding the antitermination protein NusA [50]. The RNAP supramolecular complex. Coomassie-stained, 2nd dimension SDS-PAGE of RNA polymerase supramolecular complexes isolated from E. coli CSH50 wild type cells and NAP deficient mutants (as indicated above the lines). The shown (2nd dimension) SDS-PAGE was preceded by heparin column chromatography and non-denaturing Blue-Native PAGE, from which the RNAP supramolecular complexes were excised and resolved into individual components by denaturing SDS-PAGE. The bands in the gel were further subjected to in-gel trypsin digestion followed by MS or MS-MS analyses [45]. Protein identities are linked to the numbers. Further details of the protein functions and analyses scores are provided in Supplementary Table S1. Another RNAP-associated enzyme, inositol monophosphate dehydrogenase (IMPDH), is encoded by guaB gene conserved from prokaryotes to humans and catalyzes the NAD + -dependent oxidation of IMP to xanthosine monophosphate, thus controlling the guanine nucleotide pool size [51]. Notably, IMPDH binds single-stranded nucleic acids both in vivo and in vitro [52]. In Drosophila, IMPDH was shown to bind single-stranded CT-rich DNA, acting as a transcriptional repressor of an evolutionarily conserved gene regulatory network of nucleotide metabolic enzymes and thus, linking the metabolic state to cell proliferation [53]. Also the E. coli IMPDH binds CT-rich single-stranded DNA via its Bateman domain and independent of its catalytic function, inhibits adenylate nucleotide biosynthesis [54]. IMPDH thus affects the ATP homeostasis, Figure 1. The RNAP supramolecular complex. Coomassie-stained, 2nd dimension SDS-PAGE of RNA polymerase supramolecular complexes isolated from E. coli CSH50 wild type cells and NAP deficient mutants (as indicated above the lines). The shown (2nd dimension) SDS-PAGE was preceded by heparin column chromatography and non-denaturing Blue-Native PAGE, from which the RNAP supramolecular complexes were excised and resolved into individual components by denaturing SDS-PAGE. The bands in the gel were further subjected to in-gel trypsin digestion followed by MS or MS-MS analyses [45]. Protein identities are linked to the numbers. Further details of the protein functions and analyses scores are provided in Supplementary Table S1. Another RNAP-associated enzyme, inositol monophosphate dehydrogenase (IM-PDH), is encoded by guaB gene conserved from prokaryotes to humans and catalyzes the NAD + -dependent oxidation of IMP to xanthosine monophosphate, thus controlling the guanine nucleotide pool size [51]. Notably, IMPDH binds single-stranded nucleic acids both in vivo and in vitro [52]. In Drosophila, IMPDH was shown to bind single-stranded CT-rich DNA, acting as a transcriptional repressor of an evolutionarily conserved gene regulatory network of nucleotide metabolic enzymes and thus, linking the metabolic state to cell proliferation [53]. Also the E. coli IMPDH binds CT-rich single-stranded DNA via its Bateman domain and independent of its catalytic function, inhibits adenylate nucleotide biosynthesis [54]. IMPDH thus affects the ATP homeostasis, which in turn could affect DNA gyrase activity and eventually, modulate DNA supercoiling [18,19,55]. Notably, the E. coli guaB gene, as well as almost all the nucleotide biosynthesis genes, is activated by increased negative superhelicity [15]. It is conceivable that in the context of RNA polymerase supramolecular complex, the DNA single-strand binding activity of IMPDH plays a role in adjusting the transcription machinery and DNA topology at these supercoiling-dependent gene promoters. In addition, IMPDH has been associated with transcription elongation and translation [56][57][58].
Furthermore, in the context of RNAP supramolecular complex functionally meaningful relationships have been proposed for enzymes involved in the metabolism of lipopolysaccharides [46]. For example, the RNAP-associated periplasmic glucan biosynthesis protein MdoG (Supplementary Table S1) is encoded by mdoG (now opgG) gene; a constituent of mdoA operon the mutations of which influence the levels of stationary phase σ S initiation factor in the cell [59]. Additionally, the association of RNAP with isocitrate dehydrogenase (Supplementary Table S1), an enzyme regulated by phosphorylation/dephosphorylation allowing rapid shifts between the catabolic TCA and the anabolic glyoxalate bypass pathways, which are enriched respectively for genes responding to DNA relaxation and high negative superhelicity [15], is suggestive.
The observed close association of RNAP with metabolic enzymes is consistent with proposed interdependence of cellular metabolism and transcription [3], revealing a new layer of analog control of gene expression via direct communication of RNAP with metabolic enzymes and perhaps, also via indirect effects mediated by changing availability of metabolites, as proposed for replication control [60]. Analog control is conceivable here given that the concentration of both the enzymes and metabolites in the cell is sufficiently high to generate concentration gradients. It is noteworthy, however, that while the composition of RNAP supramolecular complex varies with growth phase [43,46] the relationship between this variation and the growth phase-dependent changes in RNAP sigma factor composition remains largely unexplored.

Role of DNA Topology and Homeostatic Regulation of Supercoiling Response
Compositional variation in the transcription machinery during the E. coli growth cycle is paralleled by variation of spatial distribution of RNAP in the nucleoid, likely determined by the changes of NAP composition, DNA topology and overall nucleoid configuration [61,62]. Notably, while the direct communications of metabolic enzymes with RNAP could relate the extant state of particular physiological function(s) to transcription machinery, the topoisomerases relay the cellular energy charge to the global supercoiling level of chromosomal DNA [18,19,55] and also, facilitate the translocation of RNAP along the DNA template by relieving the supercoiling imbalance resultant from the Liu/Wang twin supercoil domain effect, manifest in accumulation of positive and negative supercoils respectively ahead and behind of the translocating RNAP [63,64]. Furthermore and perhaps most importantly, the superhelical density of the DNA modulates the promoter recognition and transcription initiation by RNAP holoenzyme [1]. Binding of RNAP at the strong stable RNA promoters lead to constraint of writhed microloops storing torque, which subsequently can be utilized for promoter opening via topological writhe to twist transition [65,66]. The RNAP holoenzyme and DNA topology thus act conjointly as major analog factors harnessing and channeling the torque for facilitated promoter opening and transcription initiation [38].
Dynamic changes in DNA supercoiling and NAP composition in vivo determine both how DNA is packaged and how it is accessed for transcription initiation and other DNA transactions [65]. At the same time, changes in DNA superhelical density concertedly alter the expression of the networked genes of NAPs, RNAP sigma initiation factors, transcription elongation and termination factors and RNAP secondary channel modulators [3]. Accordingly, gene expression analyses of the wild type E. coli cells and its mutant derivatives lacking the NAPs (either FIS or H-NS, or both) grown under conditions of norfloxacin-induced hypernegative DNA supercoiling (−σ~0.09) and DNA relaxation (−σ~0.03) reveal specific patterns of up-regulated genes (Table 1), consistent with the homeostatic network regulating DNA supercoiling [55,67,68]. DNA hypernegative superhelicity induced by the addition of the TopoIV and gyrase inhibitor norfloxacin to the E. coli LZ54 WT strain (and its derivatives lacking the NAPs) encoding norfloxacin-resistant gyrase (i.e., TopoIV is inhibited, whereas gyrase is not), activates the yrdD and parC/parE genes encoding DNA relaxing topoisomerases. Interestingly, under the same conditions activation of the crl gene, the product of which facilitates the assembly of RNAPσ S holoenzyme, is observed [69][70][71]. In addition, the rpoZ gene, encoding the RNAP ω subunit, which stabilises the σ 70 holoenzyme assembly [72], as well as the rpoE, nusG and rho genes respectively encoding RpoE initiation, NusG elongation and Rho termination factors, are up-regulated. Transcription elongation factor NusG enhances the elongation rate [73] and links transcription to translation [74,75] by forming a bridge with ribosomal protein S10 (aka NusE). NusG thus stabilizes the RNAP-ribosome interaction in the expressome complex, albeit dependent on the length of the mRNA spacer between the RNAP and the P site of the ribosome active-center [76]. However, the RNAP-ribosome contact likely precedes the association of NusG with the complex, suggesting that recruitment of NusG depends on translation [77]. The termination factor Rho can resolve clashes between transcription and replication machineries [78], is crucial for maintaining the transcriptional boundaries in the genome and, being supported by NusG and NusA, exerts a silencing effect on transcription of AT-rich DNA [79] which may undergo undesirable spontaneous unwinding under conditions of high negative superhelicity. Importantly, Rho is associated with RNAP throughout the transcription cycle allosterically trapping the transcription complex [74,80,81] and preventing the formation of deleterious R-loops (a loop formed by single-stranded RNA base-paired with one strand of duplex DNA with the complementary DNA strand being displaced) at supercoilingdependent stable RNA operons [82], as well as precluding antisense transcription [83] and transcriptional read-through [79], potentially increased under conditions of DNA hypernegative supercoiling. Concomitant activation of sigma rpoE and anti-sigma rseAB genes by hyperenegative DNA supercoiling (Table 1) suggests an induction of envelope stress under hypernegative supercoiling regimen [84] and supports the notion that the σ and anti-σ factors are often co-transcribed to ensure the maintenance of stoichiometric levels [85].
Again in keeping with homeostatic control of DNA supercoiling, DNA relaxation induced by addition of norfloxacin to LZ41WT strain (and its derivatives lacking the NAPs), encoding norfloxacin-resistant TopoIV (i.e., gyrase is inhibited, whereas TopoIV is not), activates the gyrA and gyrB genes encoding the DNA gyrase subunits. DNA relax-ation also activates the genes encoding the sigma factors RpoD, RpoH and the secondary channel modulator GreA ( Table 1). Activation of this latter is consistent with its role in ppGpp-DksA regulatory subnetwork [40,86] and might counteract the down-regulation of ribosomal RNA promoters under conditions of DNA relaxation [87,88]. The E. coli rpoH gene encoding the heat shock σ 32 (aka σ H ) factor is sensitive to changes of DNA supercoiling, and DNA relaxation induced by novobiocin treatment of B. subtilis, was shown to decrease transcription from both the ribosomal RNA promoters and the promoters dependent on alternative sigma factors including σ H [89,90]. The RNAP σ 70 subunit encoded by rpoD gene confers preference for highly negatively supercoiled templates [91] and its overexpression can increase the global DNA negative superhelicity [39], suggesting a compensatory activation of rpoD expression counteracting the norfloxacin-induced DNA relaxation especially in LZ41 cells lacking the NAPs and hence, their buffering effects due to constraint of negative supercoils. Among the genes encoding the NAPs, fis, lrp, hupB, ihfA and stpA are activated by hypernegative supercoiling, while ihfB, hupA, and the global TFs crp and fnr are activated by DNA relaxation ( Table 1). The fis promoter depends on high negative superhelicity for optimal transcription [92], whereas binding of FIS at phased sites in upstream activating sequences of stable RNA (ribosomal and transfer RNA) promoters stabilises tightly bent loops, acting as a topological "homeostat" rescuing these supercoiling-dependent promoters from inactivation on deviations from optimal superhelicity [65,87]. The crp gene is transcribed from two promoters showing different dependence on DNA topology, one of which is derepressed on transition to stationary phase [93] associated with DNA relaxation, whereas the fnr gene is negatively autoregulated by Fnr protein which is activated under anaerobic conditions [94] associated with increased levels of DNA superhelicity. Constraint of DNA supercoils by HU is required for balanced transcription of the bacterial genome [61,95] and the observed activation of hupA, as opposed to hupB, by DNA relaxation can also be explained by a homeostatic mechanism, since the HUα homodimer, but not the HUβ homodimer, efficiently constrains negative superhelicity and thus, could counteract the global relaxation of DNA [96]. Overall, the changing activities of genes featured in Table 1, mostly constituting the homeostatic network regulating the chromosomal DNA superhelicity [1,3], likely reflect the integrated adaptive response of this overarching network to norfloxacin-induced changes in global DNA supercoiling and its modulation in cells lacking FIS and H-NS. It is conceivable that this adaptive integration occurs, at least in part, at the level of direct communications in the context of RNAP supramolecular complex.

Interdependence of the Network Elements
Since the genes encoding DNA topoisomerases, NAPs and transcription machinery components are interconnected in a homeostatic network [1,3], the mutations in RNAP genes can affect the expression of genes encoding the NAPs and vice versa. In E. coli strains with mutated NAP genes, the σ 70 and σ S factor levels vary distinctly with the backgroundas evident both in crude cell extracts and in heparin column purified preparations of RNAP ( Figure 2). This suggests that the NAPs modulate not only the expression levels of sigma factors, but also the sigma factor composition of the holoenzyme. In turn, in extracts of cells carrying mutations in the beta subunit of RNAP (the so-called "stringent" RNAP mutants mimicking the effect of the stringent response regulator ppGpp) [97], the NAP levels are correspondingly altered during bacterial growth ( Figure 3). Furthermore, deletion of rpoZ gene encoding the RNAP ω subunit increases both the RNAP σ S /σ 70 ratio in the cells and the RpoS impact in the transcript profile, concomitantly modulating the impacts of the NAPs and global TFs and favoring transcription of Rel genes responding to DNA relaxation [39]. Conversely, a decrease of the RNAP σ S /σ 70 ratio in rpoZ cells induced by overexpression of RpoD, increases the impact of the latter on the transcript profile and again, modulates the impacts of the NAPs and global TFs ( Table 2) favoring transcription of Hyp genes responding to high levels of negative superhelicity [39]. These σ factor-dependent changes in regulatory impacts of the NAPs and global TFs are reflected in up-regulation of corresponding regulons and overlapping subsets thereof-the couplons ( Table 2; the organization of couplons will be enlarged upon below). scription of Rel genes responding to DNA relaxation [39]. Conversely, a decreas RNAP σ S /σ 70 ratio in rpoZ cells induced by overexpression of RpoD, increases the of the latter on the transcript profile and again, modulates the impacts of the NA global TFs ( Table 2) favoring transcription of Hyp genes responding to high l negative superhelicity [39]. These σ factor-dependent changes in regulatory im the NAPs and global TFs are reflected in up-regulation of corresponding regul overlapping subsets thereof-the couplons ( Table 2; the organization of couplons enlarged upon below).

Figure 2.
Variation of the σ 70 and σ S factor levels in E. coli strains with mutated NAP genes Graphical representation of quantitative Western blots of RNA polymerase subunits detec crude and heparin column-purified protein extracts of E. coli CSH50 wild type (wt), ΔihfAB Δhns, Δdps, ΔhupAB and Δlrp mutant cells during exponential growth phase (OD600 = 0.5) u mouse monoclonal antibodies raised against RpoD (σ 70 ) and RpoS (σ S ) and normalized aga RpoC (β'). (A). Comparison of the relative (crude cell extract) to effective (heparin protein amount of RpoD (σ 70 ) (B). Comparison of the relative (crude cell extract) to effective (hepa tein extract) amount of RpoS (σ S ). Note that in crude extracts the normalized levels of Rpo stitute about one third of those of RpoD. Bars indicate the standard errors of two biologica cates. E. coli CSH50 wild type and isogenic NAP mutant strains were grown in 3l of dYT m (1.6% tryptone 1% yeast extract, 0.05% NaCl) in BIOSTAT Bplus fermentor Sartorius AB, G gen, Germany). Conditions were set to 37 °C, pH 7.5, 500 rpm, and pO2 of 100% at the star growth, and carefully controlled throughout the growth. For more details see "Experimen cedure to Figure 2" in the Supplementary data file. Variation of the σ 70 and σ S factor levels in E. coli strains with mutated NAP genes. Graphical representation of quantitative Western blots of RNA polymerase subunits detected in crude and heparin column-purified protein extracts of E. coli CSH50 wild type (wt), ∆ihfAB, ∆fis, ∆hns, ∆dps, ∆hupAB and ∆lrp mutant cells during exponential growth phase (OD 600 = 0.5) using mouse monoclonal antibodies raised against RpoD (σ 70 ) and RpoS (σ S ) and normalized against RpoC (β'). (A). Comparison of the relative (crude cell extract) to effective (heparin protein extract) amount of RpoD (σ 70 ) (B). Comparison of the relative (crude cell extract) to effective (heparin protein extract) amount of RpoS (σ S ). Note that in crude extracts the normalized levels of RpoS constitute about one third of those of RpoD. Bars indicate the standard errors of two biological replicates. E. coli CSH50 wild type and isogenic NAP mutant strains were grown in 3l of dYT medium (1.6% tryptone 1% yeast extract, 0.05% NaCl) in BIOSTAT Bplus fermentor Sartorius AB, Göttingen, Germany). Conditions were set to 37 • C, pH 7.5, 500 rpm, and pO2 of 100% at the start of the growth, and carefully controlled throughout the growth. For more details see "Experimental procedure to   Table 2. Impacts of NAPs/global TFs and the RNAP sigma factors in the transcript profiles of growing cells (data fro ref. [39]). The first two columns compare the relative impacts in the profiles of wild-type and rpoZ mutants, the next tw columns compare those in rpoZ mutant with and without rpoD overexpression.

Strain, Condition
CF1943 Wild Type CF2790 rpoZ CF2790 rpoZ + rpoD * CF2790 rpoZ   [39]). The first two columns compare the relative impacts in the profiles of wild-type and rpoZ mutants, the next two columns compare those in rpoZ mutant with and without rpoD overexpression. Importantly, the increase of the RNAP σ S /σ 70 ratio in rpoZ cells where DNA is relaxed, coordinately changes both the global DNA topology and the average thermodynamic stability (GC-content) of the transcribed sequences, indicating that under conditions of DNA relaxation, sequences with lower GC-content are utilized preferentially ( Figure 4A,B). This finding is in keeping with previous observations [13]. Corresponding decrease in GC-content of transcribed sequences is observed on transition of cells from exponential growth to stationary phase ( Figure 4C,D), characterized by transition from the predominance of RNAPσ 70 holoenzyme to that of RNAPσ S and from high to low levels of global negative superhelicity, respectively [98]. It is noteworthy, that the more and the less GC-rich sequences are respectively enriched around the origin (OriC) and terminus (Ter) of chromosomal replication, such that ultimately, the genomic sequence organization underpins the sequential transcription of chromosomal ends [30].

Strain, Condition
Taken together, all these observations support the notion of structural coupling between DNA topology, NAP-dependent chromatin architecture and the transcription machinery composition as a fundamental device coordinating the spatiotemporal genomic transcription [3,38,39]. Understanding of this coordinating effect can be facilitated by analyses of the joint impacts of the NAPs and transcription machinery components on genomic transcription. the predominance of RNAPσ 70 holoenzyme to that of RNAPσ S and from high to lo els of global negative superhelicity, respectively [98]. It is noteworthy, that the mor the less GC-rich sequences are respectively enriched around the origin (OriC) an minus (Ter) of chromosomal replication, such that ultimately, the genomic sequen ganization underpins the sequential transcription of chromosomal ends [30].

Spatiotemporal Organization of Transcription in Genome
Previous explorations of chromosomal gene organisation revealed the gene copy number effects on gene expression, with higher expression levels characteristic of locations close to the chromosomal replication origin due to multiple rounds of replication initiation [99,100]. These positional effects were interpreted as basic differences in the "expressivity" potential of different chromosomal locations, rather than unique features of translocated genes. Other studies proposed that organization of genes in bacterial genome is explained by selective pressure driving the clustering of essential genes on the leading strand and thus, suggested that gene essentiality, not expressivity, is the determinative factor of chromosomal gene organisation. Still another explanation of spatial gene clustering did not take into account their function but rather their persistence, i.e., their presence either in the majority, or in the minority of the organisms [101,102]. However, pivotal for understanding the relationship between chromosomal gene organization and gene activity was the discovery of coupling between the highly conserved spatial order of major regulatory genes along the chromosomal OriC-Ter axis and the temporal order of their expression during the bacterial growth cycle (i.e., during the period from inoculation of the "overnight" culture in fresh growth medium through the exponential growth to stationary phase) [10,30,103,104]. In other words, it became apparent that the major factor organizing the important regulatory genes spatially along the OriC-Ter axis of the chromosome is the temporal requirement of their products during the bacterial growth cycle. Furthermore, this temporal expression pattern was linked to a putative growth phase-dependent OriC-Ter gradient of negative superhelicity thus coupling the gene expression to the metabolic state of the cell. So far, the existence of this superhelicity gradient could not be experimentally demonstrated. However, the importance of gene order along the chromosomal OriC-Ter axis has been supported by genetic studies demonstrating that chromosomal position shifts of regulatory genes lead to rewiring of the genetic network as well as alterations of both the growth phase-dependent DNA supercoil dynamics and the bacterial phenotype [105][106][107]. Furthermore, the chromosomal gene order appears to underlie the spatial organization of function in genome. In particular, it was observed that anabolic genes are enriched around the OriC-end and catabolic genes around the Ter-end of the E. coli chromosome [108]. Since this spatial separation of function was found associated with respectively high and low thermodynamic stability of the DNA at the chromosomal OriC and Ter ends [30,109], it revealed a general strategy underlying the structural-functional organization of the bacterial genomes [108] meaning that at the global level of the physical chromosome the genic 'typography' is integrated with the genomic 'topography' in the primary sequence organisation of the DNA molecule. Additional relevant features of genomic spatial organization are manifest in the emergence of sub-chromosomal domains eliciting peculiar patterns of gene expression, denoted as coherent domains of transcription (CODOs) [110]. Under different growth conditions various constellations of CODOs were observed both in commensal E. coli [30] and in plant pathogen D. dadantii [16,111,112]. The CODOs harbor functionally linked genes, the coordinated expression of which is related to particular physicochemical and structural-dynamical properties of their encoding DNA sequences [16,30] and is distinctly impacted by DNA topology-modulating effects of the NAPs [16,111,112].

Regulons and Couplons
As already mentioned above, the abundant NAPs bind the chromosomal DNA sites in a quasi-continuous mode with affinities differing by three orders of magnitude, affecting the supercoil dynamics of the chromosome and thus contributing to analog control of gene expression. However, numerous target genes have been identified that are regulated by NAPs directly via specific, high-affinity DNA binding sites and therefore, the NAPs are featuring as hubs in the electronically compiled digital TRN [8]. Together, such directly regulated gene classes constitute the TRN regulons, attributed to a particular NAP or a global transcription factor, such as e.g., Catabolite Repressor Protein (CRP) or Fumarate and Nitrate Reduction Regulator (Fnr). Likewise, the various sigma factors of RNA polymerase, having distinct DNA binding specificities and directing the holoenzyme to disparate gene promoters, also establish their cognate regulons [85,113]. The overlaps between the regulons of a particular NAP and a particular sigma factor make it possible to reveal sets of genes under the control of both regulators and thus, assess the coordinated impacts of the transcription machinery of particular composition (i.e., RNAP associated with a certain sigma factor) and a particular NAP (or a global TF, such as e.g., CRP and Fnr) on genomic transcription. Groups of genes regulated conjointly by a particular couple of RNAP holoenzyme (sigma factor) and NAP (or a global TF), are dubbed couplons [3].
Couplons were found to harbor functionally related genes that could be activated or repressed relatively independent of parent regulons, thus featuring as genuine entities reflecting the modular organization of the genetic regulation system. There are couplons with distinct functional properties as well as couplons sharing functional properties, such that combinations of couplons with different sets of genes can support similar functions. However, the vast majority of couplons inherit distinct local functions [3,39]. Notably, while both the NAPs and sigma factors are produced over a range of concentrations and represent analog components of the genetic regulation system, the conversion of analog information into the digital (punctuation, as it were) is achieved by intersections of their concentration gradients, revealing unique genes under the control of a particular couple of NAP (or TF) and sigma factor.

Spatial Organization of Regulons and Couplons
Given the revealed correspondence between the chromosomal order and temporal expression of genes during the passage of cells from early to exponential to stationary phase, how are the spatial organisation and the temporal expression of genes comprised in regulons and couplons related? The genomic targets of both the NAPs and the RNAP sigma factors demonstrate a peculiar spatial organization [103]. For example, the regulons of RpoD (major σ 70 factor, predominating during active growth and transcribing the bulk of genes) and RpoS (stationary phase or stress σ S factor predominating on cessation of growth) demonstrate a conspicuous opposite bias of spatial organization, being enriched respectively at the Ori and Ter ends of the chromosome (Supplementary Figure S1). Also the RpoH (heat shock sigma factor) and RpoE (extracytoplasmic sigma factor) regulons show a bias towards the OriC end, whereas RpoN (the nitrogen limitation sigma factor) shows an opposite bias on the left and right replichores. Similarly, the regulons of the NAPs and global TFs, CRP and Fnr, show distinct spatial patterns and all, perhaps except those of Fnr and IHF, show a relative enrichment in the OriC half of the chromosome (Supplementary Figure S2).
Interestingly, spatial organization of couplons reveals an apparent σ factor-dependent peculiarity. For example, genomic distributions of all the couplons of NAPs involving the major σ 70 factor (RpoD), positively correlate with the spatial pattern of the regulons of NAPs rather than with that of RpoD ( Figure 5A,C; Supplementary Figures S3 and S4). In contrast, the spatial patterns of couplons of the NAPs involving alternative sigma factors (ASFs, meaning all the sigma factors except the 'house-keeping' sigma RpoD) are positively correlated to the patterns of ASF regulons ( Figure 5B,D; Supplementary Figure S5). The difference between the spatial organization of the major RpoD and ASF couplons is perhaps unsurprising, given that the vegetative RNAPσ 70 holoenzyme binds the vast majority of genomic promoters, whereas the holoenzymes assembled with ASFs which govern more focused functions, bind only subsets thereof [113]. Since the observed biases in the impacts of regulons on spatial organization of couplons are conspicuous, the pertinent question is how these impacts are related to the temporal expression of couplons.
Supplementary Figure S5). The difference between the spatial organization of the major RpoD and ASF couplons is perhaps unsurprising, given that the vegetative RNAPσ 70 holoenzyme binds the vast majority of genomic promoters, whereas the holoenzymes assembled with ASFs which govern more focused functions, bind only subsets thereof [113]. Since the observed biases in the impacts of regulons on spatial organization of couplons are conspicuous, the pertinent question is how these impacts are related to the temporal expression of couplons.

Temporal Expression Patterns of Regulons and Couplons
Analyses of the growth phase-dependent expression patterns of regulons and couplons (computed as average patterns of all the genes in the given regulon/couplon and normalized to 1.0; for details see legend to Supplementary Figure S6) show that the temporal patterns of the regulons of NAPs (FIS, H-NS and Lrp) and the corresponding NAP/RpoD couplons are highly correlated ( Figure 6). Thus, not only the spatial organization, but also the temporal expression patterns of the NAP/RpoD couplons are positively correlated with those of NAP regulons. In contrast, the growth phase-dependent expression patterns of the NAP/ASF couplons correlate with those of the ASF (RpoS and RpoN) regulons (Figure 7). Thus again, both the spatial organization and the temporal expression patterns of the NAP/ASF couplons are positively correlated with those of ASF regulons. However, the temporal patterns of the IHF/ASF, CRP/ASF and Fnr/ASF couplons demonstrate positive correlations with both of the parent regulons (compare Figures 7 and 8), suggesting a joint impact of the latter on cognate couplon patterns.
Thus it appears that the regulons have variable impacts on the temporal patterns of couplons. However, in that case, what determines the temporal patterns of regulons in the first place? Are the expression patterns of regulons determined by cognate regulators? Indeed, analyses showed that with notable exception of the rpoH and rpoE genes, in all cases a positive correlation obtains between the expression pattern of the regulatory gene and its cognate regulon (Figure 9). Since the expression of NAP genes is positively correlated with that of the NAP regulons, while the latter in turn appear determinative for the expression patterns of NAP/RpoD (but not the NAP/ASF) couplons (see Figure 6), this suggests that the temporal patterns of the NAP/RpoD couplons are, by and large, determined by expression patterns of the NAP genes. Conversely, the expression pattern of rpoS gene positively correlates with that of the RpoS regulon, whereas this latter appears determinative for the patterns of corresponding NAP/RpoS couplons ( Figure 7B), suggesting that temporal expression patterns of these couplons are ultimately determined by the expression pattern of rpoS gene. Similar argument applies to rpoN gene, RpoN regulon and RpoN couplons. By the same token, whenever the temporal patterns of couplons correlate positively with those of both parent regulons, as is the case with the IHF and global TF couplons (compare Figures 7 and 8), the assumption would be that the expression patterns of both regulatory genes jointly contribute to that of the couplons.
The dependence of temporal regulon/couplon patterns on the regulating gene expression is corroborated by observed impacts of altered regulator gene expression on the expression of cognate couplons and/or regulons. For example, FIS is a repressor of rpoS gene expression [114] and while inactivation of the fis gene by mutation abolishes the down-regulation of the rpoS gene expression in stationary phase ( Figure 10; compare the grey dashed lines in A and B), the RpoS regulon and couplons react accordingly. Similarly, the fis mutation affects the expression of lrp gene and Lrp couplons and that of hns gene and H-NS couplons in a concerted manner (Figure 11). In all these cases altered regulator gene expression concomitantly alters the expression pattern of cognate couplons and/or regulon.        [29]). The experimental procedure was as described in the legend to Figure 6. Note that mutation of fis gene alters the expression pattern of lrp and hns genes (grey dashed lines) as well as the patterns of Lrp and H-NS couplons (data from [29]). The experimental procedure was as described in the legend to Figure 6.

NAPs versus Global TFs?
Whereas expression of the NAP genes positively correlates with, and most likely defines, the temporal patterns of NAP/RpoD but not that of NAP/RpoS couplons, the expression patterns of the latter appear defined primarily by that of rpoS gene. Among the studied NAPs, only IHF does not conform to this rule-there is no such sigma factordependent difference of IHF impact (compare Figures 7B and 8A) and in that sense, IHF shows similarity to the global TFs, CRP and Fnr.
This difference between IHF and the other NAPs is interesting and merits some elaboration. It is noteworthy that except IHF, all the NAPs examined here affect the global DNA topology by constraining DNA supercoils and stabilizing higher-order nucleoprotein structures, while IHF is apparently incapable of oligomerization. On binding DNA, IHF stabilizes planar bends rather than writhed coils [115] and affects local DNA trajectory, rather than global DNA topology [112,116]. The sequence-specificity of IHF binding is almost entirely determined by structural features of the DNA and not by direct readout of the base sequence [117], yet similar to the global TFs, IHF recognizes a relatively welldefined sequence motif [118] introducing sharp bends (U-turns) in the DNA. Stabilization of such tightly bent structures could explain, why IHF in contrast to other NAPs, is primarily affecting the directionality (leading/lagging strand bias) of genomic transcription with associated transcription-coupled supercoil diffusion [112]. Put another way, IHF exerts its primary effect at that level of genomic organization (mutual orientation of transcription units), for which the promoter sigma factor specificity is, most probably, immaterial.
The global TFs, CRP and Fnr, are structurally closely related sequence-specific DNA binding proteins [119] not known to constrain supercoils or form long-range nucleoprotein structures characteristic of NAPs, although DNA bending by CRP may assist in wrapping of DNA by RNAP-a characteristic associated with σ 70 holoenzyme [66,[120][121][122]. The debated difference between the NAPs and other global TFs [123] may lie by and large, in the prominent capacity of the former to constrain supercoils and stabilize long-range nucleoprotein structures, that is to say, in prevalence of analog over the digital mode of transcriptional control [5]. Notably, specific CRP binding activity strictly depends on cAMP levels [124], whereas Fnr utilizes iron-sulfur clusters as cofactors and is activated strictly under conditions of anaerobiosis [125]. It is conceivable that this strict dependence of specific binding effects of CRP and Fnr on physiological conditions would require tight coupling with ASFs in coordinating functional responses to physiological stress.
Interestingly, among the analyzed regulators, the rpoH and rpoE gene expression show no correlation with the expression patterns of corresponding regulons (Figure 9). In the case of extracytoplasmic sigma factor RpoE, this uncoupling of RpoE regulon pattern from rpoE gene expression might be related to the proteolytic cascade involving several factors and preceding the release of RpoE in the cytoplasm, whereas rpoH expression is in turn, under the control of RpoE [126,127]. Nevertheless, in the case of RpoS regulon we see a positive correlation with rpoS gene expression, despite the complex regulation of cellular RpoS levels [128] and coupling of RpoS effects to changes of DNA topology [129].

Conclusions
Overall, we observe positive correlation between the temporal expression of global regulatory genes and their cognate regulons (see Figure 9) as well as between the regulons and couplons (see Figures 6-8) on the one hand, and positive correlations between the spatial organization of regulons and couplons (in terms of their genomic distribution patterns; see Figure 5) on the other. Taken together, these observations suggest that the expression of global regulators is coordinated with organization of target genes in the chromosome, lending support to the hypothesis that transcriptional regulation shapes the spatial organization of genes [104,130]. The determinative impact of the regulons of NAPs and global TFs on spatial patterns of cognate RpoD couplons is apparently related to the fact that RpoD transcribes vast majority of the genes and so, tends to impose uniformity, whereas the effects of NAPs and global TFs regulating subsets of genes (as defined by the TRN) convey discreteness to the spatial couplon patterns (Figure 5A,C; Supplementary Figures S3 and S4). The alternative sigma factors are associated with particular functions [85] and also in this case, we see that the spatial impacts of the ASF regulons predominate (Figure 5B,D; Supplementary Figure S5). However, this does not explain why the regulon and couplon genes are organized in spatially defined clusters (see Figure 5 and Supplementary Figures S1-S5).
While the relation of regulatory gene expression pattern to the temporal regulon (or couplon) patterns appears trivial, the question is how the expression pattern of a regulator (be it a NAP or sigma factor) is related to spatial organization of cognate regulons and couplons in genome. One relevant observation is the distinct organization of the NAP and TF genes with respect to their targets in genome, indicating that the main parameters defining the position of a TF in the network hierarchy are the number and chromosomal distances of the genes they regulate and their protein concentration gradients [130]. Another relevant observation is that regulatory genes form gradients diffusing from their sites of production [131,132]. It is thus conceivable that the observed spatial patterns of regulons and couplons are selected to optimize the exposure of target genes to the gradients of diffusing regulators. Furthermore, recent observations that alterations of gene expression and phenotype can be induced by chromosomal position shift of a NAP gene, despite the maintenance of its natural expression pattern [105] and, that the effect of positional shift can be aggravated by altering the expression pattern of the NAP gene [107], support the proposal that the physical structure of chromosome is optimized by direct regulatory interactions involving the NAPs and other DNA structuring proteins [133]. In this respect it is relevant, that the NAPs can distinctly modulate the chromosomal regional dynamics by inducing transient domain boundaries in the genome [134,135]. Such adjustment would involve selection for efficient interactions between the growth phase-dependent concentration gradients of regulators and chromosomal domains by restricting the range of chromosome configurations within the available configuration space, delimited by various factors including macromolecular crowding, entropic repulsion, confinement-induced organisation and packing density of the DNA polymer. The spatiotemporal coupling between the E. coli chromosomal gene order and expression, the spatial organization of transcriptional regulation on both chromosomal arms along the OriC-Ter axis [103,104] the successive, growth phase-dependent activation of the OriC and Ter chromosomal ends [30,103] and clustering of co-functional genes [136] are all consistent with this notion. Notably, dynamic 3D colocalization of co-regulated genes optimized by locally increasing concentration of transcription factors has been proposed to occur in yeast [137]. It is obvious, that the evolutionary process of adjusting chromosome configuration dynamics and gene organization would be impacted by chromosomal replication [138]. Understanding the corresponding growth phase-dependent compositional changes of the RNAP supramolecular complex and underpinning cellular hyperstructures [139] both of which could be dependent on energy and resource availability, require further studies.
Supplementary Materials: The following are available online at https://www.mdpi.com/article/10 .3390/biom11070924/s1: List of abbreviations used in the text, Figure S1: Genomic organization of the sigma factor regulons; Figure S2: Genomic organization of the NAP regulons; Figure S3: Comparison of the genomic organization of NAP regulons and NAP/RpoD couplons; Figure S4