Abstract
The role of alternative nucleic acid structures (ANS) in biology is an area of increasing interest. These non-canonical structures include the Z-DNA and Z-RNA duplexes (ZNA), the three-stranded triplex, the four-stranded G-quadruplex (GQ), and i-motifs. Previously, the biological relevance of ANS was dismissed. Their formation in vitro often required non-physiological conditions, and there was no genetic evidence for their function. Further, structural studies confirmed that sequence-specific transcription factors (TFs) bound B-DNA. In contrast, ANS are formed dynamically by a subset of repeat sequences, called flipons. The flip requires energy, but not strand cleavage. Flipons are enriched in promoters where they modulate transcription. Here, computational modeling based on AlphaFold V3 (AF3), under optimized conditions, reveals that known B-DNA-binding TFs also dock to ANS, such as ZNA and GQ. The binding of HLH and bZIP homodimers to Z-DNA is promoted by methylarginine modifications. Heterodimers only bind preformed Z-DNA. The interactions of TFs with ANS likely enhance genome scanning to identify cognate B-DNA-binding sites in active genes. Docking of TF homodimers to Z-DNA potentially facilitates the assembly of heterodimers that dissociate and are stabilized by binding to a cognate B-DNA motif. The process enables rapid discovery of the optimal heterodimer combinations required to regulate a nearby promoter.
1. Introduction
The flip from B-DNA to a noncanonical structure has a physicochemical correlate. In many cases, it is characterized by the rotation of a nucleobase from the anti- to the syn conformation, where the base is directly above the sugar rather than pointing away, as occurs in the anti-conformation [1,2,3]. The energy cost of flipping to a syn orientation is lower for purine bases than for pyrimidines, as steric clashes can occur between pyrimidine and sugar adducts, and is lowest for guanosine. Non-canonical DNA structures exhibit a characteristic combination of syn and anti bases. These motifs are most often found in repeat sequences. Preferred sequences for forming Z-DNA and Z-RNA (collectively called ZNA) are alternating purine-pyrimidine repeats [4], while the tetrads that form G-quadruplexes (GQ) are based on G-repeats [5], triplexes (TPX) on polypyrimidine [6,7,8], with i-motifs formed by polycytosine tracts [9].
The evidence supporting a biological role for flipons is most complete for ZNAs [10]. Z-flipons are an integral component of the innate immune system. ADAR and ZBP1 modulate pathways through the structure-specific Zα domain, which docks with nanomolar affinity to the left-handed conformation of both RNA and DNA [11,12,13]. Loss-of-function (LOF) human variants of the ADAR Zα domain are causal for Aicardi-Goutières Syndrome Type 6. These phenotypes are supported by mouse studies that show the variants impair the negative regulation of type I interferon responses [14,15,16].
In contrast, the binding of ZBP1 to ZNA activates immune responses during viral infection, leading to inflammatory cell death (referred to as necroptosis). In other contexts, ZBP1-dependent caspase activation leads to either the inflammatory outcome called pyroptosis, or the immunologically silent form of cell death called apoptosis. ZBP1 is rapidly induced by interferon. Besides regulating interferon responses, ADAR1 squelches the activation of cell death by ZBP1 by competing for ZNAs [17,18,19,20,21,22,23,24,25,26,27,28,29].
Roles for Z-DNA in transcription regulation have also been defined in both cell biology and computational studies, each showing that Z-flipons are enriched in promoter regions [30,31,32,33,34,35]. The formation of GQ inside cells is also supported by genetic studies in which these structures accumulate in cells harboring loss-of-function helicase variants [36,37]. GQ binders have also been found in biochemical studies. The structure-specific interactions are validated using a non-GQ-forming oligonucleotide with the isoelectric substituent 7-deaza, 8-aza-guanosine bases as controls [38]. This base cannot form the Hoogsteen bases necessary to stabilize a GQ fold. Crystal studies reveal that the yeast B-DNA-specific RAP1 protein can also dock to GQ. Both interactions involve the same helix, but each through a different helical face. The B-DNA interaction occurs through the hydrogen-bonding face, whereas GQ docking depends upon the hydrophobic surface [39,40]. Roles for GQ have been validated for various biological outcomes [36,41,42,43,44].
Most recently, the binding modes of zinc-domain transcription factors (TFs) to DNA have been examined. These proteins constitute the largest and most ancient TF family with over 800 members. They are classically considered to bind only to B-DNA. However, predictive modeling studies with AF3 reveal that these proteins can also recognize ZNA and GQ [45]. Docking to Z-DNA was previously overlooked because the use of B-DNA competitors in binding assays likely masked interactions with Z-DNA. However, other experimental approaches have provided evidence for the interaction of zinc-finger domain (ZFd) transcription factors with both B-DNA and Z-DNA, with studies on zinc finger and BTB domain-containing protein 43 (ZBTB43) and Zinc finger and SCAN domain-containing protein 4 (ZSCAN4) revealing their role in suppressing Z-DNA formation by Z-prone alternating d(AC)n microsatellites [46,47].
The finding helps resolve the long-standing speed-stability paradox posed by ZFd, which asks how proteins with multiple high-affinity B-DNA sites can rapidly scan the genome to find their cognate binding sites [48]. The targeting of Z-DNA reduces the search space to active regions of the genome, where sufficient energy is available to power the structural transition. The energy can be provided during the pioneering round of transcription by the displacement of nucleosomes (Figure 1E) or through the negative supercoiling produced by processive enzymes, such as RNA polymerase, which then powers the reinitiation of transcription (Figure 1F). The formation of Z-DNA flags nucleosome-free regions (NFRs). Once localized to an NFR, ZFd can then scan for a cognate binding site. The rapid off-rate from Z-DNA, resulting from the release of free energy upon the flip back to B-DNA, contrasts with the slow off-rate for ZFd bound to a specific sequence. The interaction between ZFd and the GQ formed by RNAs (rGQ) was also documented, potentially enabling the formation of complexes involved in RNA transcription, splicing, editing, and translation.
Figure 1.
Flipons adopt alternative conformations by rotation of nucleobases around the glycosidic bond from the anti (shown in white) to the syn conformation (shown in yellow). (A) The ZNA motif consists of an alternating purine-pyrimidine repeat: the purine bases in syn and the pyrimidine bases in anti. The pattern produces a left-handed duplex with a zig-zag backbone. The dotted lines represent hydrogen bonds. (B) A triplex formed by Hoogsteen basepairing (indicated by “*”) between the third strand and the right-handed Watson–Crick duplex (basepairing is represented by “:”). (C) G-quadruplexes (GQ) are built by stacking a guanosine tetrad (G4) that is formed by Hoogsteen base pairing of four guanine bases. The central GQ core can accommodate a metal ion, such as potassium or sodium. (D) An i-motif intercalates pairs of cytosine base-pairs to form a four-stranded structure. (E) During the pioneering round of transcription, the eviction of nucleosome releases sufficient negative supercoiling to power the transition of Z-flipons from the B-DNA to the Z-DNA conformation. Recognition of Z-DNA by TF enables the rapid discovery of its cognate binding sites. The energy released as the flipon reverts to B-DNA then powers the assembly of complexes at the locus. (F) Formation of Z-DNA is also powered by the negative supercoiling generated by an elongating polymerase. The energy captured by Z-DNA can drive the reset and reinitiation of promoters. At the same time, the formation of GQ likely plays a role in the pausing and release of an assembled polymerase complex before the next burst of transcription.
This study reveals that other B-DNA-specific transcription factor motifs are capable of docking to both ZNA and GQ. Furthermore, the proposed docking of homodimers onto Z-DNA can efficiently promote the local assembly of heterodimers with sequence-specificity for nearby promoters. The outcome depends on the relative protein levels of the potential partners, their sequence preference, and the regulatory complexes they seed. The models presented exploit the dynamic nature of flipons, where the exchange of energy for information potentiates the transition from one cell state to another.
2. Results
The use of flipons to regulate biological outcomes predates the evolution of sequence-specific binding factors [49]. Over time, more robust protein-centric controls eventually superseded the structure-based modulation of responses. These modern pathways were overlaid over old ones, adding features, rather than abandoning the innovations that were successful in the past. It was therefore of interest to investigate whether factors that act early in development dock to alternative flipon structures in a way that echoes past ontogeny. Consequently, we tested the interaction of Yamanaka factors [50] with rGQs and Z-DNA. These factors enable the reprogramming of differentiated cells to a pluripotential state and consist of the MYC proto-oncogene, Krüppel-like factor 4 (KLF4), OCT4 (encoded by POU5F1), and SOX2 (Figure 2).
Figure 2.
Docking of Yamanaka factors and the master regulator to GQ (A,C,E,G,I) and Z-DNA (B,D,F,H,J). The amino acids and their corresponding UniProt Accession numbers are listed in the title of each panel. The position of N3, N4-Dimethyl-L-arginine (dMeR) substitution in MYC is labeled. In each panel, and in subsequent figures, three renditions are shown: a cartoon representation, a combined cartoon and licorice representation with bonding, and a surface representation for each component. Z-DNA and GQ are colored blue, each protein domain is colored green or fuchsia, and the dMeR is shown in a ball-and-stick representation with white carbons.
The binding of Yamanaka factors to B-DNA is firmly established. MYC recognizes the Z-prone core sequence 5′-CACRTG-3′ (R = G/A) [51,52]. KLF4 interacts with the p300 histone acetyltransferase and regulates gene transcription by modulating histone acetylation, thereby inducing chromatin opening during cell reprogramming [53,54]. KLF4 recognizes the guanine-rich d(GGGTGGG) motif [55]. OCT4 binds through its POU domain to the d(ATTTGCAT) octamer motif [56]. SOX2 forms a complex with OCT4 on DNA to induce the readout of developmental genes, targeting nucleosome-bound sequences, such as d(TGTGGGAC) that differ from the canonical B-DNA binding sequences that are used later in development to regulate gene expression in differentiated tissue [57,58].
The binding of Yamanaka factors to flipon structures also has experimental support. For example, MYC binds GQ in vitro and RNA sequences capable of forming GQ in vivo [59,60]. SOX2 also experimentally binds a rGQ with nanomolar affinity [61]. AF3 was able to molecularly model the docking of both MYC to a telomeric rGQ substrate, consistent with the experimental results. The AF3 interaction involved the DNA-binding domain of MYC and the high-mobility group (HMG) domain. (panels A,C). Whether OCT4 or KLF5 dock to GQ has not yet been experimentally established. AF3 models reveal that OCT4 can dock to GQ through its POU homeobox domain (HD). When both OCT4 and SOX are present in the same AF3 model, the TFs engage rGQ through independent binding sites [55] (panel G). KLF4 also docks to the parallel-strand telomeric GQ through its second ZFd (panel I), as do the ZFd of other zinc-finger TFs [45].
All the Yamanaka proteins bound as homodimers in AF3 to Z-DNA. The docking of MYC to Z-DNA was promoted by the symmetrical dimethylarginine modification (dMeR) of R382. This class of modification is known to affect the oncogenicity of c-MYC, but currently available dMeR datasets do not cover the entire protein [62,63]. The PRmePRed tool (84.10% accuracy, 82.38% sensitivity, 83.77% specificity) predicts that the peptide (373-THNVLERQRRNELKRSFFA-391, score = 0.58) that contains the R382 residue is a site of arginine methylation [64]. The methyl group potentially forms a weak hydrophobic contact with the sugar carbons of guanosine, akin to the 5-methylcytosine modification that promotes Z-DNA formation [65,66]. The dMeR ammonium group strengthens the interaction by forming ionic bonds with the phosphate backbone. OCT4 also bound Z-DNA both as a monomer and as a dimer (panels G and H). Of note, reports indicate that the POU HD of OCT4 can interact with a methylated Z-prone dinucleotide motif, d(ATGCGCAT), which contains a CpG core [67]. Interestingly, SOX2 engaged Z-DNA as a monomer, but docking of two copies flipped the complex back to B-DNA (panels I and J). KLF4 bound to Z-DNA when one copy was present, in line with previous findings on the docking of zinc finger domains to Z-DNA (panel J) [45]. With two copies of KLF4, docking was to separate sites rather than through dimer formation.
TF with other sequence-specific B-DNA binding folds also docked to both GQ and Z-DNA (Figure 3). These include the master regulators of muscle development, Myoblast Determination Protein 1 (MYOD) and Myogenic Factor 5 (MYF5) [68]. Both bind through a bHLH motif to the canonical E-box CANNTG motif (N = any nucleotide) [69]. The MYOD1/MYF5 docks to anti-parallel GQ rather than the parallel telomeric GQ bound by the Hamanaka factors (Figure 3A). Docking of the MYOD1 homodimer to Z-DNA is facilitated by MYOD1 dMeR121 (116-MRERRRL-122) (PRmePRed scores for each arginine in the sequence RERRR = 0.670, 0,597,0.569, 0,623, respectively) (Figure 3B). Similarly, the MYF5 dMeR91 (88-MRERRRL-96) (PRmePRed score for first R = 0.660) binds as a homodimer to Z-DNA. However, the MYOD1/MYF5 heterodimers do not dock to Z-DNA under these conditions, even with dMeR modification (not shown). This disparity between homodimer and heterodimer binding to Z-DNA is found in AF3 models for other TFs, as described below. The TF protein C-ets-1 (ETS1) has a winged-helix-turn helix motif (wHTH) and recognizes a RCMGGAWGCY sequence motif (R = A/G, M = C/A, W = A/T, Y = C/T) [70,71]. The ETS domain is unique to animals and regulates a large number of cellular processes, with 28 family members in humans [72]. The protein binds to both the RNA telomeric GQ (Figure 3C) and Z-DNA without modification (Figure 3D). Cellular tumor antigen p53 (encoded by TP53) regulates an extensive set of target genes that control the cell cycle and apoptosis. The protein engages the p53 B-DNA response element through a highly structured domain, which has a half-site sequence of RRRCWWGYYY (R = A/G, S = G/C, W = A/T, Y = C/T), and frequently contains a CATG core [73]. This domain also binds to the telomeric GQ RNA sequence (Figure 3E), but, in the models tested, exhibited only limited affinity for Z-DNA (data not shown). The C-terminal unstructured domain also bound Z-DNA. Many docking geometries for this peptide were also observed, with the best one shown in Figure 3F. The C-terminal 26 residues have been previously reported to be essential for the expression of certain target genes that mediate TP53-induced growth arrest and apoptosis [74]. The FOS and JUN proto-oncogenes dimerize through a bZIP domain that recognizes a TGASTCA (S = G/C) motif in response to growth factor/receptor tyrosine kinase signaling [75]. The heterodimer engages the telomeric GQ (Figure 3G), with docking to Z-DNA facilitated by FOS dMeR115 (152-KCRNRRREL-161. PRmePRed score = 0.626) and JUN dMeR263 and dMeR270 modifications (256-ERKRMRNRIAASKCRKRKL-274 PRmePRed scores 0.633 and 0.626, respectively). Without modification, the FOS/JUN heterodimer bound only to B-DNA. Modified FOS (dMeR115) and JUN (dMeR263) homodimers also engage Z-DNA (not shown).
Figure 3.
Other TF Motifs. (A) The MYOD1-MYF5 bHLH heterodimer bound to GQ. (B) The dMeR modified MYOD1 homodimer bound to Z-DNA. (C,D) ETS1 docks to Z-DNA and to a parallel-stranded telomere rGQ (E) DNA repair protein TP53 bound to rGQ. (F,G) Leucine zipper FOS-JUN heterodimer bound to Z-DNA and rGQ. (H) The dMeR modified FOS-JUN homodimer bound to Z-DNA.
A different set of bZIP dimers involved in the antioxidant responses bound to Z-DNA as homodimers, but not as heterodimers, even with dMeR modification (Figure 4H). These proteins bind to antioxidant response elements (AREs) with a common core motif of 5′-TGACNNNGC-3′ [76]. The bZIP NRF2 (Nuclear factor erythroid 2-related factor 2, encoded by NFE2L2) dimerizes with small MAF (musculoaponeurotic fibrosarcoma) family members to induce genes that protect against oxidant stress. The response is suppressed by BACH1 (BTB domain and CNC homolog 1), which competes with NRF2 for the same proteins and the promoter ARE [77,78,79]. Homodimers with dMeR modifications all bound to Z-DNA, but preferred B-DNA when unmodified (Figure 4). The dMeR modifications for NRF2 and BACH1 are not at high-confidence sites and produce models that increase chain separation of the ZIP domains (NRF2 Figure 4A, 510-AQNCRKRKL-519, PRmePRed score < 0.5; Figure 4, BBACH, 571-AQRCRKRKLD-580, PRmePRed score < 0.5). In these cases, Z-DNA may promote the disassembly of these homodimers. Other factors, such as oxidative DNA base modifications, may also modulate the interaction. In contrast, the MAFF and MAFG dMeR sites are scored as high-confidence arginine methylation sites. (Figure 4C, MAFF, 66-ASCRVKRVC-74, PRmePRed score = 0.530; Figure 4D, MAFG, 66-ASCRVKRVT-74, PRmePRed scores = 0.557 and 0.508, respectively). Both these homodimers engage Z-DNA. As with MYOD1/MYF5, dMeR modified bZIP heterodimers preferred B-DNA over Z-DNA. Together, these findings are compatible with a model in which homodimers bind to Z-DNA, exchange chains to form heterodimers, and then dissociate due to a reduced affinity for Z-DNA (Figure 4H).
Figure 4.
Mix and match bZIP and bHLH dimers. (A) NRF2 and (B) BACH1, and their binding partners (C) MAFF, (D) MAFG, dock to Z-DNA as homodimers. (E) Docking of homodimer pairs onto Z-DNA may facilitate the formation of heterodimers. The heterodimer AB, generated by exchange of chains between homodimer AA (F) and homodimer BB (G) on ZNAs, could bind a different B-DNA motif than the reciprocal pairing BA (H). The MAX and MYC homodimers bind to Z-DNA, but the heterodimer does not. (I–N) The bHLH motif and the ZNA-binding arms can be exchanged between MAX and MYC, while retaining ZNA binding. The colors indicate the segments that were swapped, and the numbers give the residues exchanged. The numbers are for the construct, not for the position in the protein. The ‘::’ indicates a fusion. The MAX and MAD1 homodimers also bind to Z-DNA (I,L). The bHLH arms of MAD1, MAX, and MYC can also be combined with the MAD1 ZNA-binding arms without affecting binding to ZNA. However, all six possible heterodimer combinations preferentially dock to Z-DNA rather than to B-DNA.
The MYC, MAX, and MAD1 proteins also bound to Z-DNA as homodimers when modified by dMeR, but not as heterodimers, even with more extensive dMeR substitutions (Figure 2B and Figure 4I–N). These proteins dock to the variable E-box CANNTG motif [69]. It was possible to swap the leucine-zipper and the Z-binding arms between these three proteins and still retain Z-DNA binding by dMeR14 modified homodimers. However, all possible heterodimer combinations formed with these domain swaps preferred B-DNA over Z-DNA. Replacement of individual residues in each arm revealed that multiple substitutions were required in both the leucine zipper and Z-DNA arms to achieve Z-DNA binding. Screening failed to identify other arginine, lysine, or serine modifications that favored an interaction with Z-DNA (results not shown).
In the models discussed so far, binding to Z-DNA relies on the ability of a dimer to promote the flip to Z-DNA. Not all Zα family members can induce the structural transition. For example, the vaccinia E3 protein binds to preformed Z-DNA but is unable to induce its formation. A structural rearrangement of the E3 tyrosine is required before Z-DNA engagement is possible [80]. To test whether heterodimers would preferentially bind preformed Z-DNA, AF3 models were run that included the well-characterized Zα domain (Figure 5A) [11]. The Zα domain stabilized Z-DNA in the models, enabling the determination of whether the heterodimers can engage the left-handed DNA conformer. In the presence of Zα, the NRF2 homodimer was able to dock to Z-DNA without dMeR modification (Figure 5B). Docking to Z-DNA of the NRF2-MAFF and BACH1-MAFF heterodimers was also observed (Figure 5C,D), indicating that these protein folds were able to bind preformed Z-DNA, but not promote the flip from B-DNA. Models of MAX-MAD1 and MAD1-MYC heterodimers made by swapping the leucine zipper and Z-DNA arms were also able to engage Z-DNA in the presence of Zα (Figure 5E–H).
Figure 5.
Binding of Heterodimers to Preformed Z-DNA. (A) The Zα domain flips d(CG)8 to the Z-DNA conformation. (B) Including Zα in models removed the requirement of dMeR modification to promote Z-DNA binding by NRF2. This approach provided a way to directly assess whether bZIP and bHLH heterodimers can dock to Z-DNA without the need to induce its formation. Binding of (C) NRF2- and (D) BACH1-MAFF heterodimers to Z-DNA. Engagement of Z-DNA by the (E) MYC-MAX and (F) MAX-MAD1 heterodimers. The Zα is colored yellow, Z-DNA is blue, and each monomer is either green or fuchsia. The binding of the mix-and-match heterodimers to Z-DNA was also tested. (G) The MAX heterodimer, where a Z-DNA binding arm was replaced with one from MYC. The colors indicate the segments that were swapped, and the numbers give the residues exchanged. The ‘::’ indicates a fusion. (H) A pairing of MAX and MYC bHLH motifs, with the Z-DNA binding arms from MAX and MYC swapped between the monomers.
3. Discussion
The approach taken here to investigate such interactions represents a novel strategy to overcome the limitations and time constraints associated with current experimental techniques. The reproducibility of interactions across many models and proteins increases confidence in the robustness of the findings, as does the recent experimental validation of models for the docking of general transcription factor E (TFEα encoded by GTFE1) to Z-DNA and the ADAR Zβ domain to GQ, as described in the methods. The analyses demonstrate that the bHLH and bZIP folds can accommodate the Z-DNA helix, in addition to binding B-DNA. Such interactions have never been previously assessed at the molecular level. The in silico domain swaps show the interchangeability of the dimerization and Z-DNA-binding domains of bHLH proteins (Figure 3 and Figure 4). The models enable the exploration of protein modifications that can dynamically alter the B-Z DNA equilibrium in a cell. Previously, only DNA modifications, such as methylation, oxidation, and base adducts, were known to modulate Z-DNA formation [65,81,82,83].
The methylarginine modifications described here warrant further experimental exploration, given their effectiveness in models of switching the binding preferences of homodimers from B- to Z-DNA, and their overlap with predicted sites of arginine methylation. There are specific inhibitors of arginine methyltransferase under development that could be used to examine how this modification affects Z-DNA formation in vivo. Techniques also exist for the targeted recoding of the arginine residues involved [84,85,86]. Genetic approaches may also help determine the role of dMeR in Z-DNA formation. For example, the human MYF5 variant R95C, which affects the dMeR modeled here, diminishes MYF5 binding and nuclear localization and produces a clinical disorder characterized by congenital ophthalmoplegia, scoliosis, and vertebral and rib anomalies [87]. The modeling results motivate the derivation of additional experimental data to examine the role of Z-DNA formation in such diseases.
The AF3 models also revealed unexpected differences between the binding of homodimers and heterodimers to Z-DNA. Homodimers modified by dMeR bind to Z-DNA. In contrast, heterodimers first require Zα to stabilize this structure, suggesting that the interactions are of lower affinity. However, the dimerization interface of both homo- and heterodimers is weak and permissive of chain exchange. Heterodimers with varying B-DNA sequence specificity can then rapidly form by the dissociation and reassociation of existing dimers. The heterodimer mix then varies in proportion to the cellular concentrations of each homodimer, with subunit exchange facilitated by the localization of homodimers to the Z-DNA surface, as shown here. Notably, the reduced affinity of heterodimers to Z-DNA diminishes the reverse reaction in which the exchange of chains instead reforms homodimers.
The binding of a heterodimer to a cognate B-DNA sequence stops any further exchange by preventing subunits from dissociating, thereby canalizing downstream events [88]. Notably, bZIP heterodimers recognize a large number of different sequences, as reflected in the degenerate motif 5′-TGACNNNGC-3′ bound by this family of TFs [76]. A subset of these heterodimers even recognizes more than one sequence combination [89]. Currently, it is unclear from existing B-DNA-based models how each heterodimer identifies the location of its preferred target. The concentration of homodimers on the ZNA surface (either Z-DNA or Z-RNA) helps pair different bZIP chains by promoting the disassembly of homodimers and the assembly of heterodimers (Figure 4E–H). The output of this assembly line depends on the homodimers produced by a cell. Some dimer combinations will not recognize a local sequence motif and will repeat the disassembly and reassortment cycle. Others will be stabilized by binding to DNA, but not impact promoter function [90,91,92]. Still others will bind B-DNA and either enhance or suppress gene expression depending on the complexes they seed. Those heterodimers that promote transcription will generate negative supercoiling, which can then power ANS formation by nearby flipons. Those sequences that flip to Z-DNA will provide an additional surface for assembling additional heterodimer variants. Those heterodimers with a higher affinity for a local B-DNA sequence will eventually be enriched in the region. This process can regulate tissue development by discovering the heterodimer combinations most adaptive to environmental inputs and cell state, without requiring their precise genetic specification.
The high-resolution models presented here are amenable to experimental evaluation. The findings will extend our understanding of what is possible and improve our knowledge of why previous laboratory work failed to uncover such mechanisms. One prevalent bias is that methods are optimized to show protein binding to B-DNA, and not to other possible flipon conformations. For example, the inclusion of competitor B-DNA into assays may interfere non-specifically with the binding of a domain to GQ and Z-DNA, just as it does with its sequence-specific recognition of B-DNA. Many structural studies optimize for the lowest-energy conformation. In most cases, the crystallization conditions used are based on the assumption that a protein binds only to B-DNA. Designing studies to detect the interaction between TFs and ANS will yield new insights. The design of in vivo, structure-sensitive reporter probes to track ANS in real time will also significantly advance our knowledge of flipon biology.
The binding of TF to both B-DNA and GQ is not unexpected, as some previous studies provide precedent. There is structural evidence that the yeast RAP1 protein docks to both B-DNA and GQ via different faces of the same α-helix [39,40]. Biochemical evidence suggests the same possibility for other proteins [59,93,94,95,96]. The existence of TFs that engage both B-DNA and Z-DNA is also not unexpected, despite previously expressed opinions to the contrary [10,97]. Rather than being a rare event, the current analysis provides evidence that TFs commonly engage multiple nucleic acid conformations. Of the proteins tested, only TP53 engages GQ and Z-DNA through different domains, suggesting that either TP53 can simultaneously engage different flipon conformations, or that TP53 binding to each structure can be independently regulated. An earlier study also indicates that TP53 binds triplexes, further supporting a role for flipons in the cell fate decisions enforced by TP53 [98].
The dynamic nature of transcriptional complex assembly is supported by recent evidence from single-cell immunofluorescent studies. RNA polymerase II transcriptional clusters have an average lifetime of 5.1 (±0.4) seconds and disassemble rapidly [99,100]. The same is true for the engagement of SOX2 and OCT4 near transcription start sites (TSS). Both have residence times of approximately 10 s, with only 20–50% of each bound to chromatin at any given time. In embryonic stem cells, 64 ± 7.8% of SOX2 is engaged with euchromatin and 16 ± 4.5% is present in heterochromatin. The outcomes reflect the docking of SOX2 and OCT4 to a site near a nucleosomal entry/exit site rather than to a site within the nucleosomal dyad [101].
Pioneering TFs such as SOX2 and OCT4 play essential roles in establishing DNA topology during development. They induce partial or complete ejection of nucleosomes, thereby unwrapping the DNA to produce negative supercoiling [102,103]. The energy captured in these underwound duplexes is sufficient to drive the flip from B-DNA to an ANS [104,105]. In the case of OCT4, the POU HD binds a non-canonical sequence that is accessible on the nucleosome surface and different from the canonical sequence recognized in nucleosome-free B-DNA. During docking, the POU HD forms a wedge that unwraps ~25 bp of DNA at the nucleosome entry site, eventually leading to nucleosome ejection and Z-DNA formation [58]. The recognition of Z-DNA localizes cell-specific TFs to the region. The TFs can then scan for a B-DNA-cognate binding site. Honing in on Z-DNA helps reduce the search space, speeding TF engagement. The off-rate from Z-DNA is likely to be as fast as the energy released when Z-DNA reverts to B-DNA will drive dissociation, speeding TF scanning of the genome. In contrast, the off-rate for sequence-specific binding to B-DNA is much slower, favoring more stable interactions that risk trapping TFs at non-functional sites throughout the genome.
Once established, reinitiation of transcription follows different kinetics from the pioneering round. The process requires recruiting RNA polymerase, pausing the polymerase to ensure the transcriptional machinery is fully loaded, and then releasing it to begin transcript elongation. Each cycle happens rapidly. Typically, promoters are ‘active’ for 3.0 ± 1.0 min, with a measured ‘burst’ size (RNA/active period) of 1.5 ± 0.5 transcripts [106]. The burst frequency, rather than the burst size, varies with the residence time of the TFs [107]. The length of inactive periods between bursts is highly variable. Somewhere between 1–10% of paused complexes enter productive elongation [108,109]. The cycle involves several distinct steps, during which flipons can actuate different outcomes (Figure 1F) [105]. Following the initiation of RPOL transcript elongation, the formation of Z-DNA is powered by the negative supercoiling generated. The release of negative supercoils as Z-DNA reverts to B-DNA offsets the positive supercoiling that stabilizes the preinitiation complex [110], leading to its disassembly [105]. Z-DNA also promotes the reengagement of Transcription Factor E, a necessary step for the reinitiation of the flipon cycle [111]. Z-DNA formation also facilitates scanning of promoter DNA by local TFs for a cognate binding site. Following TF docking and assembly of a promoter complex, a short RNA transcript is produced, and the RPOL temporarily enters a paused state [112]. The promoter is then locked and loaded until receiving the signal to fire.
The formation of GQ appears to be part of the transcription cycle and may depend on a brief round of antisense transcription. This process would dislodge TF bound to upstream sequences, enabling GQ formation by either the antisense RNA transcript produced or by the non-template DNA strand displaced during transcription. With repeated rounds of antisense transcription, the antisense RNA GQ becomes more abundant. However, that RNA subset is usually rapidly cleared by the RNA exosome. Non-template DNA can potentially embed many flipons capable of forming GQ. Docking of TF to GQ, either RNA or DNA, rather than to B-DNA motifs, then explains the High Occupancy Sites (HOT) identified by the ENCODE Consortium. These HOT sites engage multiple TFs, even when their binding motifs are absent [113]. TF-associated proteins will also localize to these sites, delivering the helicases needed to resolve GQ once the transcriptional burst is triggered. The HOT complexes then facilitate promoter reset and the reengagement of B-DNA-specific transcription factors. The accumulated non-productive transcripts from genes, enhancers, and promoters can also adopt alternative flipon conformations. They may enable the local retention of TFs and other factors for use in the next transcription cycle. They may also prevent the reassociation of a particular TF with DNA, thereby allowing it to be replaced by another TF or by a different complex [114].
The results presented here reveal that transcription factor motifs can potentially engage multiple alternative flipon conformations. The models motivate further experimental validation. The models provide functional endpoints for these studies, allowing a clear readout of their validity. That said, the current experimental evidence supports an essential role for flipons in regulating the dynamic readout of genetic information from DNA. The repeat motifs that underlie these alternative structures are simple. They set the stage for nature to strut its highly evolved codonware.
4. Methods
AFV3 is not explicitly trained to recognize Z-DNA or other flipon structures, nor their interactions with protein motifs. However, large neural networks often contain subnetworks that, when properly initialized, can perform well on specific tasks [115]. Empirically, it was found that AF3 can be nudged by setting input conditions and specifying the model seed, enabling exploration of the interactions between Z-flipon folds and proteins [49]. Optimal starting conditions were calibrated using known ZNA binders (Zα and TFEα)) and previously described GQ binders [59,111]. It was found that the AF3 default scoring of these models did not always assign high scores to the well-validated interaction between Zα and Z-DNA. Instead, a better indication of their significance was provided by manual inspection of the models produced. Models with many well-oriented binding interactions and no steric clashes were then selected for further evaluation. This approach was judged reasonable as the AF3 algorithm does not explicitly score bonding interactions. To test the novel predictions made with AF3, we have used molecular dynamics simulations (MDS) for preliminary validation. We find that the selected AF3 models are robust. In contrast, other modeling approaches that employ alternative docking algorithms are prone to failure in MDS, reflecting the sensitivity of this deterministic method to errors in the initial model. Using this strategy, we have validated the AF3-modeled interaction between the Zβ domain of ADAR and GQ using MDS. The result has subsequently been confirmed by NMR [116,117]. The MDS also overcomes another limitation of AF3, which captures only a snapshot of the interaction. MDS allows modeling of the dynamic transitions involved. Established physical-chemical principles allow inference of a protein’s relative affinity for B-DNA and Z-DNA based on the bonding scheme present in AF3 models. For example, the slow off-rate of Zα from Z-DNA is due to the face-on edge contact of tyrosine with the C8 of a purine base, not solely on ionic or hydrogen bonds [13,118]. In general, the release of the potential energy from the reversion of Z-DNA to B-DNA will favor a fast off-rate [119].
The following parameters were used here to model the interaction of transcription factors with Z-DNA: a d(CG)8 repeat in the presence of 12 Mg2+, 6 K+, and 4 Zn2+, and seed = 1076865862. A screenshot of the conditions used to initiate the model is given in the supplemental data file. For modelling G-quadruplex, the parallel strand r(GGGUUAGGGUUAGGGUUAGGG) telomere sequence and the anti-parallel C9ORF72 r(GGGGCCGGGGCCGGGGCCGGGG) in the presence of 3xK+, and seed = 528133945. Although proteins docked to both DNA GQ (dGQ) and RNA GQ (rGQ), higher-order stacked GQ structures were formed best with RNA folds. As before, model selection for rGQ and Z-DNA was based on bonding scheme and steric fit. The metal-ion interactions that nudge AF3 to fold the desired nucleic acid structure were ignored in these assessments. This approach is conservative, as metal interactions may help stabilize the complexes under physiological conditions and, consequently, contribute significantly to their stability. AlphaFold V3 was also used to model the effects of various amino acid adducts and substitutions on TF binding. These predictive models generate several outcomes. First, they increase knowledge of what interactions are possible when modeled at atomic resolution. Second, they allow an assessment of how various mutations and domain swaps affect binding. Third, they allow modeling of different amino modifications.
5. Conclusions
This paper questions whether transcription factors (TF) interact exclusively with B-DNA rather than other DNA structures. The analysis presented here is based on predictive modeling and provides evidence that TFs also bind to alternative DNA conformations, such as left-handed Z-DNA and G-quadruplexes. These alternative structures are encoded by repeat sequences called flipons that perform various roles in the cell. Alternative flipon structures flag active genes that generate sufficient energy to power the transition. They enable TF to quickly scan the genome to identify cognate binding sites, thereby facilitating rapid responses. By cycling conformation, flipons also enable promoter reset and reinitiation of transcription. In other cases, flipons may promote the local assembly of heterodimeric transcription factors from homodimers. This process finds the monomer combination that binds a control locus with sufficient affinity to regulate its activity. This reassortment of monomers is facilitated by weak dimerization interfaces that allow homo- and heterodimers to dissociate and then reassociate to form heterodimers with a different B-DNA sequence specificity. The binding of homodimers to alternative DNA structures increases their local concentration and facilitates exchanges that underlie the formation of new heterodimer combinations. This process is likely regulated in several ways, most notably by protein modifications, such as arginine methylation, and DNA modifications, such as methylation and oxidation, that alter the relative affinities of homo- and heterodimers for each DNA structure.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms27010272/s1.
Funding
This work received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
PDB files for all the Figure Panels are supplied as Supplemental Information. The conditions used for AF3 to determine the structures are given by the file titles.
Conflicts of Interest
Alan Herbert was employed by InsideOutBio. The author declares that this research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
References
- Felsenfeld, G.; Davies, D.R.; Rich, A. Formation of a Three-Stranded Polynucleotide Molecule. J. Am. Chem. Soc. 1957, 79, 2023–2024. [Google Scholar] [CrossRef]
- Hoogsteen, K. The structure of crystals containing a hydrogen-bonded complex of 1-methylthymine and 9-methyladenine. Acta Crystallogr. 1959, 12, 822–823. [Google Scholar] [CrossRef]
- Sokoloski, J.E.; Godfrey, S.A.; Dombrowski, S.E.; Bevilacqua, P.C. Prevalence of syn nucleobases in the active sites of functional RNAs. RNA 2011, 17, 1775–1787. [Google Scholar] [CrossRef]
- Ellison, M.J.; Kelleher, R.J., 3rd; Wang, A.H.; Habener, J.F.; Rich, A. Sequence-dependent energetics of the B-Z transition in supercoiled DNA containing nonalternating purine-pyrimidine sequences. Proc. Natl. Acad. Sci. USA 1985, 82, 8320–8324. [Google Scholar] [CrossRef]
- Lightfoot, H.L.; Hagen, T.; Tatum, N.J.; Hall, J. The diverse structural landscape of quadruplexes. FEBS Lett. 2019, 593, 2083–2102. [Google Scholar] [CrossRef]
- Frank-Kamenetskii, M.D.; Mirkin, S.M. Triplex DNA structures. Annu. Rev. Biochem. 1995, 64, 65–95. [Google Scholar] [CrossRef]
- Westin, L.; Blomquist, P.; Milligan, J.F.; Wrange, O. Triple helix DNA alters nucleosomal histone-DNA interactions and acts as a nucleosome barrier. Nucleic Acids Res. 1995, 23, 2184–2191. [Google Scholar] [CrossRef]
- Dagneaux, C.; Gousset, H.; Shchyolkina, A.K.; Ouali, M.; Letellier, R.; Liquier, J.; Florentiev, V.L.; Taillandier, E. Parallel and antiparallel A*A-T intramolecular triple helices. Nucleic Acids Res. 1996, 24, 4506–4512. [Google Scholar] [CrossRef] [PubMed]
- Gehring, K.; Leroy, J.L.; Gueron, M. A tetrameric DNA structure with protonated cytosine.cytosine base pairs. Nature 1993, 363, 561–565. [Google Scholar] [CrossRef] [PubMed]
- Herbert, A. Z-DNA and Z-RNA: Methods-Past and Future. Methods Mol. Biol. 2023, 2651, 295–329. [Google Scholar]
- Herbert, A.; Alfken, J.; Kim, Y.G.; Mian, I.S.; Nishikura, K.; Rich, A. A Z-DNA binding domain present in the human editing enzyme, double-stranded RNA adenosine deaminase. Proc. Natl. Acad. Sci. USA 1997, 94, 8421–8426. [Google Scholar] [CrossRef]
- Schade, M.; Turner, C.J.; Lowenhaupt, K.; Rich, A.; Herbert, A. Structure-function analysis of the Z-DNA-binding domain Zα of dsRNA adenosine deaminase type I reveals similarity to the (alpha + beta) family of helix-turn-helix proteins. EMBO J. 1999, 18, 470–479. [Google Scholar] [CrossRef]
- Schwartz, T.; Rould, M.A.; Lowenhaupt, K.; Herbert, A.; Rich, A. Crystal structure of the Zα domain of the human editing enzyme ADAR1 bound to left-handed Z-DNA. Science 1999, 284, 1841–1845. [Google Scholar] [CrossRef] [PubMed]
- Herbert, A. Mendelian disease caused by variants affecting recognition of Z-DNA and Z-RNA by the Zα domain of the double-stranded RNA editing enzyme ADAR. Eur. J. Hum. Genet. 2020, 28, 114–117. [Google Scholar] [CrossRef] [PubMed]
- Guo, X.; Liu, S.; Sheng, Y.; Zenati, M.; Billiar, T.; Herbert, A.; Wang, Q. ADAR1 Zalpha domain P195A mutation activates the MDA5-dependent RNA-sensing signaling pathway in brain without decreasing overall RNA editing. Cell Rep. 2023, 42, 112733. [Google Scholar] [CrossRef] [PubMed]
- Liang, Z.; Chalk, A.M.; Taylor, S.; Goradia, A.; Heraud-Farlow, J.E.; Walkley, C.R. The phenotype of the most common human ADAR1p150 Zalpha mutation P193A in mice is partially penetrant. EMBO Rep. 2023, 24, e55835. [Google Scholar] [CrossRef]
- Kaiser, W.J.; Upton, J.W.; Mocarski, E.S. Receptor-interacting protein homotypic interaction motif-dependent control of NF-kappa B activation via the DNA-dependent activator of IFN regulatory factors. J. Immunol. 2008, 181, 6427–6434. [Google Scholar] [CrossRef]
- Upton, J.W.; Kaiser, W.J.; Mocarski, E.S. DAI/ZBP1/DLM-1 complexes with RIP3 to mediate virus-induced programmed necrosis that is targeted by murine cytomegalovirus vIRA. Cell Host Microbe 2012, 11, 290–297. [Google Scholar] [CrossRef]
- Lin, J.; Kumari, S.; Kim, C.; Van, T.M.; Wachsmuth, L.; Polykratis, A.; Pasparakis, M. RIPK1 counteracts ZBP1-mediated necroptosis to inhibit inflammation. Nature 2016, 540, 124–128. [Google Scholar] [CrossRef]
- Galluzzi, L.; Vitale, I.; Aaronson, S.A.; Abrams, J.M.; Adam, D.; Agostinis, P.; Alnemri, E.S.; Altucci, L.; Amelio, I.; Andrews, D.W.; et al. Molecular mechanisms of cell death: Recommendations of the Nomenclature Committee on Cell Death 2018. Cell Death Differ. 2018, 25, 486–541. [Google Scholar] [CrossRef]
- Devos, M.; Tanghe, G.; Gilbert, B.; Dierick, E.; Verheirstraeten, M.; Nemegeer, J.; de Reuver, R.; Lefebvre, S.; De Munck, J.; Rehwinkel, J.; et al. Sensing of endogenous nucleic acids by ZBP1 induces keratinocyte necroptosis and skin inflammation. J. Exp. Med. 2020, 217, e20191913. [Google Scholar] [CrossRef]
- Zhang, T.; Yin, C.; Boyd, D.F.; Quarato, G.; Ingram, J.P.; Shubina, M.; Ragan, K.B.; Ishizuka, T.; Crawford, J.C.; Tummers, B.; et al. Influenza Virus Z-RNAs Induce ZBP1-Mediated Necroptosis. Cell 2020, 180, 1115–1129. [Google Scholar] [CrossRef]
- Nakahama, T.; Kato, Y.; Shibuya, T.; Inoue, M.; Kim, J.I.; Vongpipatana, T.; Todo, H.; Xing, Y.; Kawahara, Y. Mutations in the adenosine deaminase ADAR1 that prevent endogenous Z-RNA binding induce Aicardi-Goutieres-syndrome-like encephalopathy. Immunity 2021, 54, 1976–1988.e7. [Google Scholar] [CrossRef]
- de Reuver, R.; Verdonck, S.; Dierick, E.; Nemegeer, J.; Hessmann, E.; Ahmad, S.; Jans, M.; Blancke, G.; Van Nieuwerburgh, F.; Botzki, A.; et al. ADAR1 prevents autoinflammation by suppressing spontaneous ZBP1 activation. Nature 2022, 607, 784–789. [Google Scholar] [CrossRef] [PubMed]
- Herbert, A.; Fedorov, A.; Poptsova, M. Mono a Mano: ZBP1’s Love-Hate Relationship with the Kissing Virus. Int. J. Mol. Sci. 2022, 23, 3079. [Google Scholar] [CrossRef]
- Hubbard, N.W.; Ames, J.M.; Maurano, M.; Chu, L.H.; Somfleth, K.Y.; Gokhale, N.S.; Werner, M.; Snyder, J.M.; Lichauco, K.; Savan, R.; et al. ADAR1 mutation causes ZBP1-dependent immunopathology. Nature 2022, 607, 769–775. [Google Scholar] [CrossRef] [PubMed]
- Jiao, H.; Wachsmuth, L.; Wolf, S.; Lohmann, J.; Nagata, M.; Kaya, G.G.; Oikonomou, N.; Kondylis, V.; Rogg, M.; Diebold, M.; et al. ADAR1 averts fatal type I interferon induction by ZBP1. Nature 2022, 607, 776–783. [Google Scholar] [CrossRef]
- Zhang, T.; Yin, C.; Fedorov, A.; Qiao, L.; Bao, H.; Beknazarov, N.; Wang, S.; Gautam, A.; Williams, R.M.; Crawford, J.C.; et al. ADAR1 masks the cancer immunotherapeutic promise of ZBP1-driven necroptosis. Nature 2022, 606, 594–602. [Google Scholar] [CrossRef]
- Yin, C.; Fedorov, A.; Guo, H.; Crawford, J.C.; Rousseau, C.; Zhong, X.; Williams, R.M.; Gautam, A.; Koehler, H.S.; Whisnant, A.W.; et al. Host cell Z-RNAs activate ZBP1 during virus infections. Nature 2025, 648, 707–716. [Google Scholar] [CrossRef]
- Beknazarov, N.; Jin, S.; Poptsova, M. Deep learning approach for predicting functional Z-DNA regions using omics data. Sci. Rep. 2020, 10, 19134. [Google Scholar] [CrossRef] [PubMed]
- Georgakopoulos-Soares, I.; Victorino, J.; Parada, G.E.; Agarwal, V.; Zhao, J.; Wong, H.Y.; Umar, M.I.; Elor, O.; Muhwezi, A.; An, J.Y.; et al. High-throughput characterization of the role of non-B DNA motifs on promoter function. Cell Genom. 2022, 2, 100111. [Google Scholar] [CrossRef]
- Umerenkov, D.; Herbert, A.; Konovalov, D.; Danilova, A.; Beknazarov, N.; Kokh, V.; Fedorov, A.; Poptsova, M. Z-flipon variants reveal the many roles of Z-DNA and Z-RNA in health and disease. Life Sci. Alliance 2023, 6, e202301962. [Google Scholar] [CrossRef]
- Beknazarov, N.; Konovalov, D.; Herbert, A.; Poptsova, M. Z-DNA formation in promoters conserved between human and mouse are associated with increased transcription reinitiation rates. Sci. Rep. 2024, 14, 17786. [Google Scholar] [CrossRef]
- Smeds, L.; Kamali, K.; Kejnovska, I.; Kejnovsky, E.; Chiaromonte, F.; Makova, K.D. Non-canonical DNA in human and other ape telomere-to-telomere genomes. Nucleic Acids Res. 2025, 53, gkaf298. [Google Scholar] [CrossRef] [PubMed]
- Chantzi, N.; Nayak, A.; Baltoumas, F.A.; Aplakidou, E.; Liew, S.W.; Galuh, J.E.; Patsakis, M.; Montgomery, A.; Moeckel, C.; Mouratidis, I.; et al. Quadrupia provides a comprehensive catalog of G-quadruplexes across genomes from the tree of life. Genome Res. 2025, 35, 2578–2600. [Google Scholar] [CrossRef] [PubMed]
- Maizels, N. G4-associated human diseases. EMBO Rep. 2015, 16, 910–922. [Google Scholar] [CrossRef]
- Sauer, M.; Paeschke, K. G-quadruplex unwinding helicases and their function in vivo. Biochem. Soc. Trans. 2017, 45, 1173–1182. [Google Scholar] [CrossRef]
- Spiegel, J.; Adhikari, S.; Balasubramanian, S. The Structure and Function of DNA G-Quadruplexes. Trends Chem. 2020, 2, 123–136. [Google Scholar] [CrossRef] [PubMed]
- Konig, P.; Giraldo, R.; Chapman, L.; Rhodes, D. The crystal structure of the DNA-binding domain of yeast RAP1 in complex with telomeric DNA. Cell 1996, 85, 125–136. [Google Scholar] [CrossRef]
- Traczyk, A.; Liew, C.W.; Gill, D.J.; Rhodes, D. Structural basis of G-quadruplex DNA recognition by the yeast telomeric protein Rap1. Nucleic Acids Res. 2020, 48, 4562–4571. [Google Scholar] [CrossRef] [PubMed]
- Lane, A.N.; Chaires, J.B.; Gray, R.D.; Trent, J.O. Stability and kinetics of G-quadruplex structures. Nucleic Acids Res. 2008, 36, 5482–5515. [Google Scholar] [CrossRef] [PubMed]
- Varshney, D.; Spiegel, J.; Zyner, K.; Tannahill, D.; Balasubramanian, S. The regulation and functions of DNA and RNA G-quadruplexes. Nat. Rev. Mol. Cell Biol. 2020, 21, 459–474. [Google Scholar] [CrossRef]
- Sato, K.; Knipscheer, P. G-quadruplex resolution: From molecular mechanisms to physiological relevance. DNA Repair. 2023, 130, 103552. [Google Scholar] [CrossRef]
- Herbert, A. A Compendium of G-Flipon Biological Functions That Have Experimental Validation. Int. J. Mol. Sci. 2024, 25, 10299. [Google Scholar] [CrossRef] [PubMed]
- Herbert, A. The Evolutionary Entanglement of Flipons with Zinc Fingers and Retroelements has Engendered a Large Family of Z-DNA and G-Quadruplex Binding Proteins. Open Biol. 2025, 15, 250171. [Google Scholar] [CrossRef]
- Meng, Y.; Wang, G.; He, H.; Lau, K.H.; Hurt, A.; Bixler, B.J.; Parham, A.; Jin, S.G.; Xu, X.; Vasquez, K.M.; et al. Z-DNA is remodelled by ZBTB43 in prospermatogonia to safeguard the germline genome and epigenome. Nat. Cell Biol. 2022, 24, 1141–1153. [Google Scholar] [CrossRef] [PubMed]
- Srinivasan, R.; Nady, N.; Arora, N.; Hsieh, L.J.; Swigut, T.; Narlikar, G.J.; Wossidlo, M.; Wysocka, J. Zscan4 binds nucleosomal microsatellite DNA and protects mouse two-cell embryos from DNA damage. Sci. Adv. 2020, 6, eaaz9115. [Google Scholar] [CrossRef]
- Iwahara, J.; Levy, Y. Speed-stability paradox in DNA-scanning by zinc-finger proteins. Transcription 2013, 4, 58–61. [Google Scholar]
- Herbert, A. Flipons and the origin of the genetic code. Biol. Lett. 2025, 21, 20240635. [Google Scholar] [CrossRef]
- Takahashi, K.; Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 2006, 126, 663–676. [Google Scholar] [CrossRef]
- Kerkhoff, E.; Bister, K.; Klempnauer, K.H. Sequence-specific DNA binding by Myc proteins. Proc. Natl. Acad. Sci. USA 1991, 88, 4323–4327. [Google Scholar] [CrossRef]
- Carroll, P.A.; Freie, B.W.; Mathsyaraja, H.; Eisenman, R.N. The MYC transcription factor network: Balancing metabolism, proliferation and oncogenesis. Front. Med. 2018, 12, 412–425. [Google Scholar] [CrossRef] [PubMed]
- Evans, P.M.; Zhang, W.; Chen, X.; Yang, J.; Bhakat, K.K.; Liu, C. Kruppel-like factor 4 is acetylated by p300 and regulates gene transcription via modulation of histone acetylation. J. Biol. Chem. 2007, 282, 33994–34002. [Google Scholar] [CrossRef] [PubMed]
- Di Giammartino, D.C.; Kloetgen, A.; Polyzos, A.; Liu, Y.; Kim, D.; Murphy, D.; Abuhashem, A.; Cavaliere, P.; Aronson, B.; Shah, V.; et al. KLF4 is involved in the organization and regulation of pluripotency-associated three-dimensional enhancer networks. Nat. Cell Biol. 2019, 21, 1179–1190. [Google Scholar] [CrossRef]
- Soufi, A.; Donahue, G.; Zaret, K.S. Facilitators and impediments of the pluripotency reprogramming factors’ initial engagement with the genome. Cell 2012, 151, 994–1004. [Google Scholar] [CrossRef]
- Petryniak, B.; Staudt, L.M.; Postema, C.E.; McCormack, W.T.; Thompson, C.B. Characterization of chicken octamer-binding proteins demonstrates that POU domain-containing homeobox transcription factors have been highly conserved during vertebrate evolution. Proc. Natl. Acad. Sci. USA 1990, 87, 1099–1103. [Google Scholar] [CrossRef]
- Williams, D.C., Jr.; Cai, M.; Clore, G.M. Molecular basis for synergistic transcriptional activation by Oct1 and Sox2 revealed from the solution structure of the 42-kDa Oct1.Sox2.Hoxb1-DNA ternary transcription factor complex. J. Biol. Chem. 2004, 279, 1449–1457. [Google Scholar] [CrossRef]
- Guan, R.; Lian, T.; Zhou, B.R.; Wheeler, D.; Bai, Y. Structural mechanism of LIN28B nucleosome targeting by OCT4. Mol. Cell 2023, 83, 1970–1982.e6. [Google Scholar] [CrossRef]
- Spiegel, J.; Cuesta, S.M.; Adhikari, S.; Hansel-Hertsch, R.; Tannahill, D.; Balasubramanian, S. G-quadruplexes are transcription factor binding hubs in human chromatin. Genome Biol. 2021, 22, 117. [Google Scholar] [CrossRef]
- Li, S.; Wang, Z.; Wang, X.; Wang, Y.; Pattarayan, D.; Zhang, Y.; Nguyen, P.; Bhuniya, A.; Chen, Y.; Huang, H.; et al. Integrative characterization of MYC RNA-binding function. Cell Genom. 2025, 5, 100878. [Google Scholar] [CrossRef] [PubMed]
- Hamilton, D.J.; Hein, A.E.; Holmes, Z.E.; Wuttke, D.S.; Batey, R.T. The DNA-Binding High-Mobility Group Box Domain of Sox Family Proteins Directly Interacts with RNA In Vitro. Biochemistry 2022, 61, 943–951. [Google Scholar] [CrossRef]
- Tikhanovich, I.; Zhao, J.; Bridges, B.; Kumer, S.; Roberts, B.; Weinman, S.A. Arginine methylation regulates c-Myc-dependent transcription by altering promoter recruitment of the acetyltransferase p300. J. Biol. Chem. 2017, 292, 13333–13344. [Google Scholar] [CrossRef]
- Zhang, L.; Shao, G.; Shao, J.; Zhao, J. PRMT5-activated c-Myc promote bladder cancer proliferation and invasion through up-regulating NF-kappaB pathway. Tissue Cell 2022, 76, 101788, Erratum in Tissue Cell 2023, 85, 102221. [Google Scholar] [CrossRef]
- Kumar, P.; Joy, J.; Pandey, A.; Gupta, D. PRmePRed: A protein arginine methylation prediction tool. PLoS ONE 2017, 12, e0183318. [Google Scholar] [CrossRef] [PubMed]
- Behe, M.; Felsenfeld, G. Effects of methylation on a synthetic polynucleotide: The B--Z transition in poly(dG-m5dC).poly(dG-m5dC). Proc. Natl. Acad. Sci. USA 1981, 78, 1619–1623. [Google Scholar] [CrossRef]
- Fujii, S.; Wang, A.H.; van der Marel, G.; van Boom, J.H.; Rich, A. Molecular structure of (m5 dC-dG)3: The role of the methyl group on 5-methyl cytosine in stabilizing Z-DNA. Nucleic Acids Res. 1982, 10, 7879–7892. [Google Scholar] [CrossRef]
- Tan, D.S.; Cheung, S.L.; Gao, Y.; Weinbuch, M.; Hu, H.; Shi, L.; Ti, S.C.; Hutchins, A.P.; Cojocaru, V.; Jauch, R. The homeodomain of Oct4 is a dimeric binder of methylated CpG elements. Nucleic Acids Res. 2023, 51, 1120–1138. [Google Scholar] [CrossRef]
- Wardle, F.C. Master control: Transcriptional regulation of mammalian Myod. J. Muscle Res. Cell Motil. 2019, 40, 211–226. [Google Scholar] [CrossRef] [PubMed]
- Cao, Y.; Yao, Z.; Sarkar, D.; Lawrence, M.; Sanchez, G.J.; Parker, M.H.; MacQuarrie, K.L.; Davison, J.; Morgan, M.T.; Ruzzo, W.L.; et al. Genome-wide MyoD binding in skeletal muscle cells: A potential for broad cellular reprogramming. Dev. Cell 2010, 18, 662–674. [Google Scholar] [CrossRef] [PubMed]
- Kodandapani, R.; Pio, F.; Ni, C.Z.; Piccialli, G.; Klemsz, M.; McKercher, S.; Maki, R.A.; Ely, K.R. A new pattern for helix-turn-helix recognition revealed by the PU.1 ETS-domain-DNA complex. Nature 1996, 380, 456–460. [Google Scholar] [CrossRef]
- Donaldson, L.W.; Petersen, J.M.; Graves, B.J.; McIntosh, L.P. Solution structure of the ETS domain from murine Ets-1: A winged helix-turn-helix DNA binding motif. EMBO J. 1996, 15, 125–134. [Google Scholar] [CrossRef]
- Gutierrez-Hartmann, A.; Duval, D.L.; Bradford, A.P. ETS transcription factors in endocrine systems. Trends Endocrinol. Metab. 2007, 18, 150–158. [Google Scholar] [CrossRef]
- Riege, K.; Kretzmer, H.; Sahm, A.; McDade, S.S.; Hoffmann, S.; Fischer, M. Dissecting the DNA binding landscape and gene regulatory network of p63 and p53. eLife 2020, 9, e63266. [Google Scholar] [CrossRef] [PubMed]
- Hamard, P.J.; Lukin, D.J.; Manfredi, J.J. p53 basic C terminus regulates p53 functions through DNA binding modulation of subset of target genes. J. Biol. Chem. 2012, 287, 22397–22407. [Google Scholar] [CrossRef]
- Vierstra, J.; Lazar, J.; Sandstrom, R.; Halow, J.; Lee, K.; Bates, D.; Diegel, M.; Dunn, D.; Neri, F.; Haugen, E.; et al. Global reference mapping of human transcription factor footprints. Nature 2020, 583, 729–736. [Google Scholar] [CrossRef] [PubMed]
- Rushmore, T.H.; Morton, M.R.; Pickett, C.B. The antioxidant responsive element. Activation by oxidative stress and identification of the DNA consensus sequence required for functional activity. J. Biol. Chem. 1991, 266, 11632–11639. [Google Scholar] [CrossRef] [PubMed]
- Itoh, K.; Chiba, T.; Takahashi, S.; Ishii, T.; Igarashi, K.; Katoh, Y.; Oyake, T.; Hayashi, N.; Satoh, K.; Hatayama, I.; et al. An Nrf2/small Maf heterodimer mediates the induction of phase II detoxifying enzyme genes through antioxidant response elements. Biochem. Biophys. Res. Commun. 1997, 236, 313–322. [Google Scholar] [CrossRef]
- Katsuoka, F.; Motohashi, H.; Ishii, T.; Aburatani, H.; Engel, J.D.; Yamamoto, M. Genetic evidence that small maf proteins are essential for the activation of antioxidant response element-dependent genes. Mol. Cell Biol. 2005, 25, 8044–8051. [Google Scholar] [CrossRef]
- Dhakshinamoorthy, S.; Jain, A.K.; Bloom, D.A.; Jaiswal, A.K. Bach1 competes with Nrf2 leading to negative regulation of the antioxidant response element (ARE)-mediated NAD(P)H:quinone oxidoreductase 1 gene expression and induction in response to antioxidants. J. Biol. Chem. 2005, 280, 16891–16900. [Google Scholar] [CrossRef]
- Kahmann, J.D.; Wecking, D.A.; Putter, V.; Lowenhaupt, K.; Kim, Y.G.; Schmieder, P.; Oschkinat, H.; Rich, A.; Schade, M. The solution structure of the N-terminal domain of E3L shows a tyrosine conformation that may explain its reduced affinity to Z-DNA in vitro. Proc. Natl. Acad. Sci. USA 2004, 101, 2712–2717. [Google Scholar] [CrossRef]
- Santella, R.M.; Grunberger, D.; Broyde, S.; Hingerty, B.E. Z-DNA conformation of N-2-acetylaminofluorene modified poly(dG-dC).poly(dG-dC) determined by reactivity with anti cytidine antibodies and minimized potential energy calculations. Nucleic Acids Res. 1981, 9, 5459–5467. [Google Scholar] [CrossRef]
- Moller, A.; Nordheim, A.; Kozlowski, S.A.; Patel, D.J.; Rich, A. Bromination stabilizes poly(dG-dC) in the Z-DNA form under low-salt conditions. Biochemistry 1984, 23, 54–62. [Google Scholar] [CrossRef]
- Wang, J.; Wang, S.; Zhong, C.; Tian, T.; Zhou, X. Novel insights into a major DNA oxidative lesion: Its effects on Z-DNA stabilization. Org. Biomol. Chem. 2015, 13, 8996–8999. [Google Scholar] [CrossRef] [PubMed]
- Radzisheuskaya, A.; Shliaha, P.V.; Grinev, V.; Lorenzini, E.; Kovalchuk, S.; Shlyueva, D.; Gorshkov, V.; Hendrickson, R.C.; Jensen, O.N.; Helin, K. PRMT5 methylome profiling uncovers a direct link to splicing regulation in acute myeloid leukemia. Nat. Struct. Mol. Biol. 2019, 26, 999–1012. [Google Scholar] [CrossRef] [PubMed]
- Cao, M.; Nguyen, T.; Song, J.; Zheng, Y.G. Biomedical effects of protein arginine methyltransferase inhibitors. J. Biol. Chem. 2025, 301, 108201. [Google Scholar] [CrossRef] [PubMed]
- Wang, X.; Zhu, B.; Winn, R.; Lu, S.; Wang, H. Functional Dynamics of Arginine Mono- and Di-Methylation. Cells 2025, 14, 796. [Google Scholar] [CrossRef]
- Di Gioia, S.A.; Shaaban, S.; Tuysuz, B.; Elcioglu, N.H.; Chan, W.M.; Robson, C.D.; Ecklund, K.; Gilette, N.M.; Hamzaoglu, A.; Tayfun, G.A.; et al. Recessive MYF5 Mutations Cause External Ophthalmoplegia, Rib, and Vertebral Anomalies. Am. J. Hum. Genet. 2018, 103, 115–124. [Google Scholar] [CrossRef]
- Waddington, C.H. Canalization of Development and the Inheritance of Acquired Characters. Nature 1942, 150, 563–565. [Google Scholar] [CrossRef]
- Rodriguez-Martinez, J.A.; Reinke, A.W.; Bhimsaria, D.; Keating, A.E.; Ansari, A.Z. Combinatorial bZIP dimers display complex DNA-binding specificity landscapes. eLife 2017, 6, e19272. [Google Scholar] [CrossRef]
- Hai, T.; Curran, T. Cross-family dimerization of transcription factors Fos/Jun and ATF/CREB alters DNA binding specificity. Proc. Natl. Acad. Sci. USA 1991, 88, 3720–3724. [Google Scholar] [CrossRef]
- van Dam, H.; Castellazzi, M. Distinct roles of Jun: Fos and Jun: ATF dimers in oncogenesis. Oncogene 2001, 20, 2453–2464. [Google Scholar]
- Bejjani, F.; Tolza, C.; Boulanger, M.; Downes, D.; Romero, R.; Maqbool, M.A.; Zine El Aabidine, A.; Andrau, J.C.; Lebre, S.; Brehelin, L.; et al. Fra-1 regulates its target genes via binding to remote enhancers without exerting major control on chromatin architecture in triple negative breast cancers. Nucleic Acids Res. 2021, 49, 2488–2508. [Google Scholar] [CrossRef]
- Walsh, K.; Gualberto, A. MyoD binds to the guanine tetrad nucleic acid structure. J. Biol. Chem. 1992, 267, 13714–13718. [Google Scholar] [CrossRef]
- Yafe, A.; Shklover, J.; Weisman-Shomer, P.; Bengal, E.; Fry, M. Differential binding of quadruplex structures of muscle-specific genes regulatory sequences by MyoD, MRF4 and myogenin. Nucleic Acids Res. 2008, 36, 3916–3925. [Google Scholar] [CrossRef]
- Pipier, A.; Devaux, A.; Lavergne, T.; Adrait, A.; Coute, Y.; Britton, S.; Calsou, P.; Riou, J.F.; Defrancq, E.; Gomez, D. Constrained G4 structures unveil topology specificity of known and new G4 binding proteins. Sci. Rep. 2021, 11, 13469. [Google Scholar] [CrossRef]
- Meier-Stephenson, V. G4-quadruplex-binding proteins: Review and insights into selectivity. Biophys. Rev. 2022, 14, 635–654. [Google Scholar] [PubMed]
- Morange, M. What history tells us IX. Z-DNA: When nature is not opportunistic. J. Biosci. 2007, 32, 657–661. [Google Scholar] [CrossRef]
- Brazdova, M.; Tichy, V.; Helma, R.; Bazantova, P.; Polaskova, A.; Krejci, A.; Petr, M.; Navratilova, L.; Ticha, O.; Nejedly, K.; et al. p53 Specifically Binds Triplex DNA In Vitro and in Cells. PLoS ONE 2016, 11, e0167439. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Deng, W. Dynamic transcription regulation at the single-molecule level. Dev. Biol. 2022, 482, 67–81. [Google Scholar] [CrossRef] [PubMed]
- Choi, I.; Baek, I. Single-molecule imaging for investigating the transcriptional control. Mol. Cells 2025, 48, 100179. [Google Scholar] [CrossRef]
- Li, S.; Zheng, E.B.; Zhao, L.; Liu, S. Nonreciprocal and Conditional Cooperativity Directs the Pioneer Activity of Pluripotency Transcription Factors. Cell Rep. 2019, 28, 2689–2703.e4. [Google Scholar] [CrossRef]
- Chen, Y.; Tokuda, J.M.; Topping, T.; Sutton, J.L.; Meisburger, S.P.; Pabit, S.A.; Gloss, L.M.; Pollack, L. Revealing transient structures of nucleosomes as DNA unwinds. Nucleic Acids Res. 2014, 42, 8767–8776. [Google Scholar] [CrossRef] [PubMed]
- Chen, Y.; Tokuda, J.M.; Topping, T.; Meisburger, S.P.; Pabit, S.A.; Gloss, L.M.; Pollack, L. Asymmetric unwrapping of nucleosomal DNA propagates asymmetric opening and dissociation of the histone core. Proc. Natl. Acad. Sci. USA 2017, 114, 334–339. [Google Scholar] [CrossRef] [PubMed]
- Herbert, A. Nucleosomes and flipons exchange energy to alter chromatin conformation, the readout of genomic information, and cell fate. Bioessays 2022, 44, e2200166. [Google Scholar] [CrossRef]
- Herbert, A. Flipons and small RNAs accentuate the asymmetries of pervasive transcription by the reset and sequence-specific microcoding of promoter conformation. J. Biol. Chem. 2023, 299, 105140. [Google Scholar] [CrossRef]
- Rodriguez, J.; Ren, G.; Day, C.R.; Zhao, K.; Chow, C.C.; Larson, D.R. Intrinsic Dynamics of a Human Gene Reveal the Basis of Expression Heterogeneity. Cell 2019, 176, 213–226.e18. [Google Scholar] [CrossRef]
- Popp, A.P.; Hettich, J.; Gebhardt, J.C.M. Altering transcription factor binding reveals comprehensive transcriptional kinetics of a basic gene. Nucleic Acids Res. 2021, 49, 6249–6266. [Google Scholar] [CrossRef]
- Darzacq, X.; Shav-Tal, Y.; de Turris, V.; Brody, Y.; Shenoy, S.M.; Phair, R.D.; Singer, R.H. In vivo dynamics of RNA polymerase II transcription. Nat. Struct. Mol. Biol. 2007, 14, 796–806. [Google Scholar] [CrossRef] [PubMed]
- Rodriguez, J.; Larson, D.R. Transcription in Living Cells: Molecular Mechanisms of Bursting. Annu. Rev. Biochem. 2020, 89, 189–212. [Google Scholar] [CrossRef]
- Le, S.N.; Brown, C.R.; Harvey, S.; Boeger, H.; Elmlund, H.; Elmlund, D. The TAFs of TFIID Bind and Rearrange the Topology of the TATA-Less RPS5 Promoter. Int. J. Mol. Sci. 2019, 20, 3290. [Google Scholar] [CrossRef]
- Herbert, A. The ancient Z-DNA and Z-RNA specific Zalpha fold has evolved modern roles in immunity and transcription through the natural selection of flipons. R. Soc. Open Sci. 2024, 11, 240080. [Google Scholar] [CrossRef]
- Mayer, A.; di Iulio, J.; Maleri, S.; Eser, U.; Vierstra, J.; Reynolds, A.; Sandstrom, R.; Stamatoyannopoulos, J.A.; Churchman, L.S. Native elongating transcript sequencing reveals human transcriptional activity at nucleotide resolution. Cell 2015, 161, 541–554. [Google Scholar] [CrossRef] [PubMed]
- Ramaker, R.C.; Hardigan, A.A.; Goh, S.T.; Partridge, E.C.; Wold, B.; Cooper, S.J.; Myers, R.M. Dissecting the regulatory activity and sequence content of loci with exceptional numbers of transcription factor associations. Genome Res. 2020, 30, 939–950. [Google Scholar] [CrossRef]
- Hemphill, W.O.; Fenske, R.; Gooding, A.R.; Cech, T.R. PRC2 direct transfer from G-quadruplex RNA to dsDNA has implications for RNA-binding chromatin modifiers. Proc. Natl. Acad. Sci. USA 2023, 120, e2220528120. [Google Scholar] [CrossRef]
- Frankle, J.; Carbin, M. The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks. arXiv 2019, arXiv:1803.03635. [Google Scholar] [CrossRef]
- Herbert, A.; Cherednichenko, O.; Lybrand, T.P.; Egli, M.; Poptsova, M. Zalpha and Zbeta Localize ADAR1 to Flipons That Modulate Innate Immunity, Alternative Splicing, and Nonsynonymous RNA Editing. Int. J. Mol. Sci. 2025, 26, 2422. [Google Scholar] [CrossRef] [PubMed]
- Kroft, C.W.; Krall, J.B.; Warchol, M.; Welty, R.; Herbert, A.; Henen, M.A.; Vogeli, B. Zα and Zβ domains of ADAR1 and ZBP1 bind to G-quadruplexes with low micromolar affinity. bioRxiv 2025. [Google Scholar] [CrossRef]
- Schade, M.; Turner, C.J.; Kuhne, R.; Schmieder, P.; Lowenhaupt, K.; Herbert, A.; Rich, A.; Oschkinat, H. The solution structure of the Zα domain of the human RNA editing enzyme ADAR1 reveals a prepositioned binding surface for Z-DNA. Proc. Natl. Acad. Sci. USA 1999, 96, 12465–12470. [Google Scholar] [CrossRef] [PubMed]
- Herbert, A. Flipons enable genomes to learn by intermediating the exchange of energy for information. J. R. Soc. Interface 2025, 22, 20250049. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.