The N-Terminal Part of Drosophila CP190 Is a Platform for Interaction with Multiple Architectural Proteins

CP190 is a co-factor in many Drosophila architectural proteins, being involved in the formation of active promoters and insulators. CP190 contains the N-terminal BTB/POZ (Broad-Complex, Tramtrack and Bric a brac/POxvirus and Zinc finger) domain and adjacent conserved regions involved in protein interactions. Here, we examined the functional roles of these domains of CP190 in vivo. The best-characterized architectural proteins with insulator functions, Pita, Su(Hw), and dCTCF, interacted predominantly with the BTB domain of CP190. Due to the difficulty of mutating the BTB domain, we obtained a transgenic line expressing a chimeric CP190 with the BTB domain of the human protein Kaiso. Another group of architectural proteins, M1BP, Opbp, and ZIPIC, interacted with one or both of the highly conserved regions in the N-terminal part of CP190. Transgenic lines of D. melanogaster expressing CP190 mutants with a deletion of each of these domains were obtained. The results showed that these mutant proteins only partially compensated for the functions of CP190, weakly binding to selective chromatin sites. Further analysis confirmed the essential role of these domains in recruitment to regulatory regions associated with architectural proteins. We also found that the N-terminal of CP190 was sufficient for recruiting Z4 and Chromator proteins and successfully achieving chromatin opening. Taken together, our results and the results of previous studies showed that the N-terminal region of CP190 is a platform for simultaneous interaction with various DNA-binding architectural proteins and transcription complexes.


Introduction
In higher eukaryotes, the regulation of gene expression is a complex process which ultimately leads to cell differentiation during embryonic development.Cell specialization is determined by differences in transcription factor (TF) repertoires, and the genes responsible for cell differentiation and development of organisms are typically regulated by multiple independent enhancers, each of which stimulates a promoter in a limited population of cells during a specific time interval [1][2][3][4][5][6][7][8][9].In contrast, housekeeping genes usually have enhancers located in close proximity to regulated promoters [10,11].Special elements called chromatin boundaries or insulators are thought to be responsible for restricting the interactions between enhancers and promoters [12][13][14][15][16][17][18].
A number of studies have shown that the promoter can act as an insulator in some cases [13,[19][20][21], suggesting the presence of common proteins with different functions involved in these regulatory elements.CP190, the 190 kDa centrosome-associated protein, is one such protein that binds predominantly to the promoters of housekeeping genes and insulator/boundary elements [22,23].CP190 was initially identified within a cytoplasmic scaffolding complex that includes the centrosomal proteins [24].Further study showed that during mitosis, CP190 is localized at the centrosomes and the mitotic spindle, but during interphase, CP190 binds to chromatin in the nucleus [25][26][27].CP190 (1096 aa) contains an N-terminal Broad-complex, Tramtrack, and Bric-à-brac (BTB) domain and a cluster consisting of four C2H2-type zinc-finger domains (C2H2 domains) located in the central part of the protein [28,29].According to several crystallization studies, the BTB domain of CP190 forms stable homodimers, and it has a high level of similarity to the BTB domains of mammalian proteins such as Kaiso, Bcl6, and PLZF that also contain clusters of C2H2 domains involved in DNA binding [25,[30][31][32][33][34][35][36].
The Drosophila melanogaster genome contains only four genes encoding C2H2 proteins that have BTB-forming stable homodimers [33].In contrast to mammalian BTB-C2H2 proteins, the four C2H2 domains of CP190 appear to be involved in protein-protein interactions rather than in DNA binding [28].
CP190 predominantly associates with gene promoters and is required to support the opening of the chromatin [23,[49][50][51].ZIPIC, M1BP, and Opbp bind to their motifs in the promoters of housekeeping genes and are involved in recruiting CP190 [39,48,50].Several studies have suggested that CP190 is responsible for the association of the NURF, dREAM, SAGA, and other complexes with regulatory elements [52][53][54][55].In the Bithorax complex, the CP190 protein binds to all boundaries to ensure independent activity of transcriptional domains [23,[56][57][58].Each of the domains activates one of three homeotic genes, Ubx, abd-A, or Abd-B, in the appropriate parasegment/segment during Drosophila development [59][60][61][62].Many boundaries have different combinations of binding sites for Pita, Su(Hw), and dCTCF that function in concert [63].It has been demonstrated that CP190 is required for boundary and chromatin-opening functions [45,50,64,65].Since Pita or dCTCF mutants lacking CP190-interacting domains are functional [45,46], it seems likely that CP190 is recruited to regulatory elements through interaction with several DNA-binding proteins.
The aim of this study is to finely map the CP190 domains that are responsible for interaction with various C2H2 proteins and to examine the functional role of these CP190 domains in vivo.We mapped two CP190 regions that interact with the ZIPIC, M1BP, and Opbp proteins.ZIPIC and Opbp interact with both the 210-245 aa and 309-390 aa regions, while M1BP interacts only with the 210-245 aa region.The deletion of these regions strongly affected CP190 binding to the chromatin.The BTBs of CP190 and human Kaiso showed structural similarity but low sequence homology.Kaiso BTB was able to partially substitute for the functions of the BTB domain in CP190, but it could not effectively recruit the CP190 hk chimeric protein to regulatory elements associated with dCTCF and Su(Hw) proteins.We also found that the N-terminal region of CP190 is sufficient to open chromatin and recruit the partner proteins Chromator (Chro/Chriz) and Z4 (Putzig/Pzg).

Mapping the Domains of CP190 Required for Interaction with Architectural Proteins
The CP190 protein (Figure 1A) contains the BTB domain (1-126 aa), a nuclear localization signal NLS (207-263), a D-rich domain (262-286 aa), four zinc-finger domains of the C2H2 type (471-594 aa), a Q-rich domain (647-717 aa), and a D/E-rich domain  Variants of CP190 were fused to the GAL4 DNA-binding domain and tested for interaction with the C2H2 proteins fused to the GAL4 activating domain.The results are summarized in columns with "+" and "−" signs referring to the presence and absence of interaction, respectively.The growth assay yeast plates are shown in Supplementary Materials Figure S3.
It was previously shown that CP190 interacts with a large group of architectural C2H2 proteins.Five C2H2 proteins, including the well-known insulator proteins Su(Hw), dCTCF, and Pita, primarily interact with the BTB domain [31,47].Three other C2H2 proteins, M1BP, Opbp, and ZIPIC, that bind to the promoters of housekeeping genes, interact with other regions of CP190 but not with the BTB domain.In a previous study [39], it was shown that ZIPIC interacts with the 245-468 aa region of CP190.Opbp was shown to interact with two overlapping regions in CP190: 1-309 aa and 245-468 aa [48].The domain in CP190 responsible for interaction with M1BP has not yet been mapped [40].
We used a yeast two-hybrid assay to precisely map the domains in CP190 involved in the interaction with M1BP, Opbp, and ZIPIC (Figure 1B).A series of the N-terminal CP190 fragments fused with the GAL4 DNA-binding domain were tested for interaction with the C2H2 proteins fused with the GAL4 activation domain.The results showed that Opbp and ZIPIC interacted with the two non-overlapping CP190 regions located between the 1-245 aa and 309-470 aa regions.M1BP only interacted with the 1-245 region.Next, we mapped the interacting regions of CP190 more precisely to the 210-245 aa and 309-390 aa regions.Opbp and ZIPIC interacted with both CP190 regions simultaneously, while M1BP only interacted with the 210-245 aa region.Interestingly, the identified regions are preserved not only in Drosophilidae (Supplementary Figure S1) but also in Diptera (Supplementary Figure S2).The 210-245 aa and 309-390 aa regions are located in close proximity but do not overlap the CP190 regions that are involved in microtubule binding (Figure 1A).

Testing the Functional Role of CP190 Domains Required for Interaction with the Architectural C2H2 Proteins
Our previous results [31,39,45,46] showed that the BTB domain is required for the interaction of CP190 with a large group of C2H2 proteins.The BTB domain is critical for the stability and functions of CP190 [28], and thus most mutations in this domain destabilize its structure.The CP190 BTB domain [25,30,31] displays high sequence homology to the BTB domains of human Kaiso mammalian transcription factor clusters of the C2H2 domains (Supplementary Figure S4).The BTB domain of the human Kaiso protein had 16% amino acids identity, and 30% and 21% strong and weak amino acids similarity with the CP190 BTB domain [66,67], respectively.At the same time, the structural homology among Drosophila BTB CP190 and the BTB of the human Kaiso protein appeared incomplete (Supplementary Figure S4).In the Y2H assay, the Kaiso BTB domain did not interact with dCTCF, Su(Hw), or Pita, in contrast to the CP190 BTB domain (Supplementary Figure S5).To examine whether the interaction of BTB with C2H2 proteins was important for CP190 recruitment to chromatin, we obtained a transgenic line that expressed the chimeric CP190 hK protein in which the BTB domain was taken from the human Kaiso protein.To express the chimeric CP190 hK targeted by the FLAG epitope at the C-terminus of the fly protein, we inserted the transgene encoding CP190 hK -F under the control of the strong ubiquitous ubiquitin (Ubi) promoter into the 38D region using the ϕC31-based integration system [68] (Figure 2A).
We also examined the functional role of the 210-245 aa and 309-390 aa regions of CP190 responsible for the interaction with the M1BP, Opbp, and ZIPIC proteins.We used the same expression vector and obtained CP190 ∆210-245 -F, CP190 ∆308-373 -F, and a control CP190 WT -F driven by the Ubi promoter inserted in the same 38D genomic region (Figure 2A).It was shown previously that the 210-245 aa region coincides with the nuclear localization signal [25].For this reason, we substituted the 210-245 aa with the NLS from SV40 (CP190 ∆210-245 -F).In the 309-390 region, we deleted the conserved 308-373 aa in CP190 ∆308-373 -F.Western blots showed that CP190 ∆210-245 -F, CP190 ∆309-390 -F, and CP190 hK -F were expressed at similar levels as the control CP190 WT -F, suggesting that all CP190 variants displayed normal stability (Supplementary Figure S6).
To examine the functional activity of the CP190 variants, we crossed the transgenes into the Cp190 − background.We used previously characterized null mutations in the Cp190 gene: Cp190 2 and Cp190 3 [26].The Cp190 2 /Cp190 3 trans heterozygotes died during the late embryonic stages, and less than 10% developed into pharate adults.Thus, the maternal effect of wild-type CP190 protein, inherited from wild-type parents, allows a small proportion of the Cp190 null mutants to survive to pharate adults.The Ubi:CP190 WT transgene completely restored the viability of Cp190 2 /Cp190 3 , even in the heterozygous state (Supplementary Figure S6).The Ubi:CP190 hK /CyO partially restored the viability of Cp190 2 /Cp190 3 : approximately 3% (2.6%) survived to three-day-old adult flies that showed a normal phenotype and remained fertile.The homozygotes Ubi:CP190 hK /Ubi:CP190 hK ; Cp190 2 /Cp190 3 survived longer: about 15% of the offspring reached the adult stage.The flies were viable and fertile, but their progeny died at the embryonic and first instar larval stages.These results suggest that CP190 hK protein expression cannot compensate for the functions of the wild-type protein.Moreover, the Ubi:CP190 hK /Ubi:CP190 hK ; Cp190 2 /TM6,Tb, and Ubi:CP190 hK /Ubi:CP190 hK ; Cp190 3 /TM6,Tb lines could not be maintained for several generations due to low viability and fertility.Thus, a high concentration of CP190 hK negatively affects the activity of the wild type CP190 protein, the amount of which is lower in a heterozygote for the Cp190 null mutation.Similar to Ubi:CP190 hK , both Ubi:CP190 ∆210-245 and Ubi:CP190 ∆308-373 in the heterozygous state were insufficient to complement the Cp190 2 /Cp190 3 mutations; the adult flies died within days of hatching (Supplementary Figure S7).The Cp190 − flies homozygous for the Ubi:CP190 ∆308-373 or Ubi:CP190 ∆210-245 transgenes that developed to the adult stage were less viable than Ubi:CP190 hK (less than 7%), and they died soon after hatching.Also, strong expression of Ubi:CP190 ∆210-245 or Ubi:CP190 ∆308-373 negatively affected the viability of flies heterozygous for the Cp190 2 or Cp190 3 mutations.Thus, each of the mutant proteins Ubi:CP190 hK , Ubi:CP190 ∆210-245 , and Ubi:CP190 ∆308-373 displayed similar effects on Drosophila development.

The Ability of CP190 Variants to Bind to Regulatory Elements
Recent studies have shown that the interbands of polytene chromosomes typically correspond to the promoter regions of broadly expressed housekeeping genes and display an "open" chromatin conformation [69].CP190 binds to many interband regions on the polytene chromosomes of third-instar larvae and extensively co-localizes with dCTCF, Su(Hw), PITA, ZIPIC, and M1BP proteins [23,39,50,70].
To determine whether the mutant variants of CP190 would bind to identical genomic sites and co-localize with Su(Hw), dCTCF, and M1BP the same as the wild-type proteins, we analyzed the binding patterns of the CP190 variants expressed from one copy of the transgene (Ubi:CP190*/CyO) in Cp190 2 /Cp190 3 larvae.The degree of co-localization of CP190 variants with Su(Hw) or dCTCF (insulator and promoter sites) and M1BP (mainly housekeeping promoters) on the polytene chromosomes of third instar larvae was studied (Figure 2B,C).
Overall, our results suggest that the binding of CP190 hK to dCTCF/Su(Hw) sites was reduced, and that this was correlated with the main role of the BTB domain of CP190 in interaction with dCTCF and Su(Hw).At the same time, CP190 ∆308-373 and CP190 ∆210-245 bound more efficiently to the dCTCF and Su(Hw) sites and co-localized less efficiently with M1BP.
To further analyze the specificity of recruiting by CP190 variants, we compared the binding of CP190 variants by qChIP to the main well-described binding sites for the Su(Hw) protein (Figure 3A).The 50A, 62D, and 87E sites are intergenic regions containing 2-3 Su(Hw) binding sites [71].The 1A2 region is located downstream from the 3 side of the yellow gene and contains two Su(Hw) binding sites [72].The gypsy insulator consists of the 12 binding sites for Su(Hw) [73,74].For the qChIP analysis, we used chromatin obtained from Cp190 2 /Cp190 3 larvae heterozygous for either of the CP190 transgenes.For all tested sites, CP190 WT bound more efficiently than the mutant versions of CP190 (Figure 3A).In accordance with the results obtained for the polytene chromosomes, CP190 ∆210-245 bound to the Su(Hw) sites with similar or better efficiency than CP190 ∆308-373 .Finally, CP190 hK bound with the same or weaker efficiency than CP190 ∆308-373 .Thus, the results were consistent with the hypothesis that the BTB domain of CP190 is critical for binding to Su(Hw) sites [31,47].
tained from Cp190 2 /Cp190 3 larvae heterozygous for either of the CP190 transgenes.For all tested sites, CP190 WT bound more efficiently than the mutant versions of CP190 (Figure 3A).In accordance with the results obtained for the polytene chromosomes, CP190 Δ210-245 bound to the Su(Hw) sites with similar or better efficiency than CP190 Δ308-373 .Finally, CP190 hK bound with the same or weaker efficiency than CP190 Δ308-373 .Thus, the results were consistent with the hypothesis that the BTB domain of CP190 is critical for binding to Su(Hw) sites [31,47].We also tested the binding of CP190 variants to the well-described boundaries in the Bithorax complex (BX-C) (Figure 3B).The selected study boundaries contain different combinations of dCTCF and Pita sites: Mcp (Pita + dCTCF), Fab-6 (2 dCTCF), Fab-7 (2 Pita), and Fab-3 (2 dCTCF) [75].The qChIP analysis showed that the patterns of binding efficiency of CP190 variants are preserved for the BX-C boundaries: the most efficient binding is observed for CP190 ∆210-245 , and less efficient for CP190 hK (Figure 3B).

Identification of the CP190 Domain Required for the Formation of the Interband Region in Larval Polytene Chromosomes
The CP190 protein binds to promoters of housekeeping genes and is involved in the organization of the opened chromatin [23,50].Interbands have been reported to be preferentially associated with the CP190, Chromator (Chro/Chriz), and Z4 (Putzig/Pzg) proteins [76].To map the CP190 domains involved in chromatin opening, a model system was used [77] in which 14 GAL4 binding sites were inserted into the 10A1-2 region, which in the polytene chromosomes of larvae is an inactive chromatin band.The CP190 protein and its truncated derivatives were fused in-frame with the DNA-binding domain of the yeast GAL4 (DBD) under the control of the hsp70 promoter.The transgenes expressing the chimeric proteins were inserted into the 51C region on the second chromosome using the ϕC31-based integration system [68].A series of transgenic lines were obtained, each of which had the 10A1-2 insert combined with one of the DBD:CP190 variants (Figure 4).To express the chimeric protein, the flies were maintained at 29 • C from the embryonic to the pupal stages.As a negative control, a transgenic line expressing only the GAL4 binding region (DBD) was used [77].In this transgenic line, the DBD is recruited to the 10A1-2 region but does not change the polytene organization and fails to recruit other proteins (Figure 4A).

Larval Polytene Chromosomes
The CP190 protein binds to promoters of housekeeping genes and is involved in th organization of the opened chromatin [23,50].Interbands have been reported to be prefer entially associated with the CP190, Chromator (Chro/Chriz), and Z4 (Putzig/Pzg) protein [76].To map the CP190 domains involved in chromatin opening, a model system was used [77] in which 14 GAL4 binding sites were inserted into the 10A1-2 region, which in the pol ytene chromosomes of larvae is an inactive chromatin band.The CP190 protein and its trun cated derivatives were fused in-frame with the DNA-binding domain of the yeast GAL (DBD) under the control of the hsp70 promoter.The transgenes expressing the chimeric pro teins were inserted into the 51C region on the second chromosome using the φC31-based integration system [68].A series of transgenic lines were obtained, each of which had th 10A1-2 insert combined with one of the DBD:CP190 variants (Figure 4).To express the chi meric protein, the flies were maintained at 29 °C from the embryonic to the pupal stages.A a negative control, a transgenic line expressing only the GAL4 binding region (DBD) wa used [77].In this transgenic line, the DBD is recruited to the 10A1-2 region but does no change the polytene organization and fails to recruit other proteins (Figure 4A).represents the presence ("yes") or absence ("no") of the interband in the 10A1-2 band model system.The results of CP190, Z4, and Chriz protein recruitment are summarized in columns with "++", "+" and "−" signs referring to the full, partial presence, or absence, respectively, of tested proteins in the generated interband in the 10A1-2 band.Other terms are as in Figure 1.
The expression of the complete CP190 (DBD:CP190 WT ) gave rise to a prominently decondensed zone and formation of the interband that splits the 10A1-2 band (Figure 4A,B).On the polytene chromosomes, CP190 actively bound to the decondensed region of the interband, which also contains Chromator and Z4 proteins.Thus, the complete CP190 protein could recruit Z4 and Chromator proteins, suggesting their tight cooperation.

Discussion
The architectural C2H2 proteins employed different strategies for interaction with CP190.The dCTCF, Su(Hw), and Pita proteins interact predominantly with the BTB domain of CP190.These proteins bind to promoters and are responsible for the organization of the most well-characterized Drosophila insulators.The second group of architectural proteins, including M1BP, Opbp, and ZIPIC, interacted with one or two highly conserved regions located between 210-245 aa and 309-390 aa.The existence of several domains in CP190 involved in interactions with DNA-binding architectural proteins makes it possible to create an effective platform for in vivo recruitment of CP190 to regulatory elements.The regions involved in the interactions with architectural proteins are located close to the regions that are responsible for the association of CP190 with centrosomes.It is likely that such an organization of domains makes it possible to efficiently carry out the relocalization of CP190 from chromatin in the interphase to the centrosomes during mitosis.
The BTB domain of Kaiso has 67% sequence homology to the CP190 BTB domain [78,79].Both domains form dimers, but the BTB domain of Kaiso cannot interact with the Drosophila architectural proteins.Our results showed that in the chimeric CP190 hK , the Kaiso BTB could partially functionally replace the CP190 BTB domain.This suggests that the BTB domains may not only have a common ability for homodimerization but may also interact with unknown protein complexes that are conserved among higher eukaryotes.Thus, the CP190 hk chimeric protein can be used to identify conserved transcriptional complexes among higher eukaryotes that are recruited to the Kaiso BTB domain.Mammalian BTB-C2H2 proteins, including Kaiso, are important transcriptional regulators involved in organismal development and cell differentiation [32,33,80,81].Impaired expression of BTB-C2H2 proteins is closely correlated with tumorigenesis [82][83][84][85][86][87][88][89].Currently, anticancer inhibitors are being actively developed that bind to individual regions of the BTB domain and block interactions with partner proteins [90][91][92][93][94][95][96][97].It seems likely that chimeric Drosophila proteins carrying the BTB domains of key mammalian transcription factors can be successfully used to identify and test new anticancer drugs.
Flies expressing chimeric CP190 hK , CP190 ∆210-245 , or CP190 ∆308-373 displayed similar phenotypes: strong reduction in viability and fertility of adults, and high levels of larval and pupal mortality.Thus, all three regions of CP190 may have similar essential functional roles as well as being highly conserved in different Dipteran species.In contrast, the 126-209 aa microtubule binding domain, the C2H2 domains, and the C-terminal D/E rich region are not essential for CP190 functions in vivo [25,28].Thus, the CP190 protein consists of several structural modules that perform different functions.
On the polytene chromosomes, it was possible to study the efficiency of CP190 binding to the promoters of active genes and insulators located in the interbands.Only the most efficient CP190 binding sites are visible on polytene chromosomes, making it possible to assess the efficiency of CP190 variants binding to different architectural proteins.We obtained the clear result that CP190 hK binds less effectively to the regions that are associated with the dCTCF and Su(Hw) proteins.This result supported the hypothesis of a key role for the BTB domain in the interaction of CP190 with this group of architectural C2H2 proteins.
M1BP along with ZIPIC and Opbp form another group of architectural C2H2 proteins that predominantly bind to housekeeping promoters and interact with the CP190-Z4-Chromator complex.The 309-390 aa region is the most essential element for recruiting CP190 to polytene chromosomes.It is likely that the same or adjacent (located in 390-430 aa) region is required for interaction with Z4 and Chromator proteins.The Z4, Chromator, and CP190 proteins can form a platform on the housekeeping promoters for efficient recruitment of transcription and remodeling complexes such as NURF and dREAM [52,53,55] through multiple protein-protein interactions.
On the whole, the obtained results are consistent with the hypothesis that the CP190 protein constitutes a platform for interaction with various proteins and transcription complexes.The BTB domain and several conserved N-terminal regions are responsible for specific interactions of CP190 with architectural proteins and Chro/Z4 partner proteins.Different combinations of the architectural proteins determine the ability of CP190 and its partners to bind to the regulatory regions-predominantly promoters.The binding of CP190/Z4/Chro to regulatory elements results in the attraction of transcriptional com-plexes and, as a result, in switching on of regulatory elements.There are also a number of experimental data indicating that CP190/Z4/Chro can also participate in the organization of TADs and distance interactions between regulatory elements [51,[98][99][100].The CP190, Z4, and Chro proteins, by interacting with each other and homodimerizing, can support long-distance interactions between housekeeping gene clusters, which often form TAD boundaries [4].Further studies are needed to understand the mechanisms of interaction between the CP190, Z4, and Chro proteins, as well as the degree of their interdependence in the recruitment of transcriptional complexes to regulatory elements and the maintenance of long-range interactions.
To generate M1BP plasmid, 5 -tcg aattcATGTCGAAATCGGCGC-3 (introduced site for EcoRIenzyme) and 5 -aggtcgacGATGTGCAGGCTGTC-3 (introduced site for SalI enzyme) primers were used to amplify cDNA, followed by digesting with EcoRI and SalI enzymes (recognition sites underlined) and cloning in pGAD424 vector digested by the same enzymes.
To generate Opbp plasmid, 5 -ttggatcc aaATGTCATCGTCCAACCAGG-3 (introduced site for BamHI enzyme) and 5 -ttgtcgacGTTATTACAGATTGGCTTAAG-3 (introduced site for SalI enzyme) primers were used to amplify cDNA, followed by digesting with BamHI and SalI enzymes (recognition sites underlined) and cloning in pGAD424 vector digested by the same enzymes.
To generate ZIPIC plasmid, 5 -ttg aattcATGAATTGCTGCATTTGTC-3 (introduced site for EcoRI enzyme) and 5 -ttgtcgacAACTGGCGTGCAGTCGTC-3 (introduced site for SalI enzyme) primers were used to amplify cDNA, followed by digesting with EcoRI and SalI enzymes (recognition sites underlined) and cloning in pGAD424 vector digested by the same enzymes.
To generate Su(Hw) plasmid, 5 -aggtcgacTCAAGCTTTCTCTTGTTC-3 (introduced site for SalI enzyme (underlined)) and 5 -GTTAAACACATCAGCGGGCTC-3 primers were used to amplify C part of Su(Hw) cDNA.Resulting PCR fragments were digested with EcoRI and SalI enzymes and cloned in pGAD424 vector digested by the same enzymes.Next, EcoRI-EcoRI fragment from Su(Hw) cDNA was cloned in obtained plasmid with orientation checking.

Plasmids for Transgenic Lines Generation
Transgene line expressing full length CP190 protein was described previously [102].To generate hKaiso-BTB CP190 derivative, hKaiso-BTB was amplified with primer 5 ttactagtATGGAGAGTAGAAAACTGATTTCTG 3 , including recognition site for SpeI enzyme (underlined) and 5 GCTGTACCTGAGATGCTTTTAAC 3 primer.Resulting PCR product was digestated with SpeI and cloned in pUbi-CP190-attB vector digested by NcoI followed by fill in with Klenow fragment and SpeI.
To generate CP190 derivative with 308-373 aa deletion, we digested PCR product generated by 5 AAAATCTGCTCCAGCGACAGT 3 and 440 rev 5 tggatccttaTGCTGGTG-GATTCGCCC 3 primers by BstEII, and cloned it in pUbi-CP190-attB vector digested by BamHI followed by fill in with Klenow fragment and BstEII.
4.2.Germ-Line Transformation, Genetic Crosses, and Phenotypic Analysis Drosophila strains were grown at 25 • C under standard culture conditions.The transgenic constructs were injected into preblastoderm embryos using the ϕC31-mediated site-specific integration system [68].The emerging adults were crossed with the y 1 w 1118 flies, and the progeny carrying the transgene were identified by y + pigmented cuticle.Details of the crosses are available upon request.

Yeast Two-Hybrid Assay
The Y2H assay was performed with plasmids and protocols from Clontech (Palo Alto, CA, USA).Yeast strain PJ69-4a (MATa trp1-901 leu2-3,112 ura3-52 his3-200 gal4(deleted) gal80(deleted) LYS2::GAL1-HIS3 GAL2-ADE2 met2::GAL7-lacZ) was used.Yeast clones from fresh YPD (2% Difco peptone, 1% yeast extract, 2% glucose, 1.5% agar) agar plate were inoculated into 0.5 mL YPD medium, incubated overnight with shaking at 30 • C and 250 r/min.Then, overnight culture was diluted ten times in new flask and grow in YPD medium until A600 0.4-0.6.Next, cells were precipitated 1000 g 5 min RT follow by supernatant discarding.Then, cells were resuspended in 25 mL of sterilized water, washed, and collected by centrifugation in the same conditions.Cells were resuspended in 1 mL fresh sterile 100 mM LiAC.These cells' suspension was gently resuspended with 0.1 µg mix of BD/X and AD/Y plasmid previously prepared into each tube.Appropriate volume of 50% PEG 4000/100 mM LiAc was added following by gentle mix and incubation 30 • C, 200 r/min, with shaking for 0.5 h.Heat shock was performed for 15 min at 42C followed by 5 min on ice incubation.Then, cells were precipitated 2 min at 4000 g at room temperature.Resuspended in sterilized water and plated on first selective media (Yeast Nitrogen Base without Amino Acids (Sigma cat#Y0626, St. Louis, MO, USA), 2% glucose, 1.5 agar) with addition of all essential amino acids except tryptophan and leucine.After 3 days of growth at 30 • C, the cells were plated on the same selective media without tryptophan, leucine, histidine, and adenine, and their growth was compared after 2-3 days.Each assay was repeated three times.

Chromatin Preparation and ChIP Analysis
Chromatin was prepared from the third instar larvae stage, and the resulting chromatin preparation was used for ChIP experiments, as described previously [103].Immunoprecipitated DNA was quantified using qPCR, with SYBR green (Bio-Rad Cat# 170-8882, Hercules, CA, USA).Primers were positioned in the middle of the binding region, as identified in ModEncode by ChIP-seq.The primer sequences used in PCR for ChIP analyses are shown in Supplementary Table S1.Analyses were performed on three biological replicates.

Antibodies
Antibodies to Su(Hw), CP190, dCTCF, Z4, and Chriz were generated and described previously [77,103,104].Antibodies against M1BP (90 aa-240 aa) (provided by Anna Fedotova) were produced in rabbits and purified by affinity purification on Aminolink resin (ThermoFisherScientific, Waltham, MA, USA) according to standard protocols.Details are described in the Supplementary Materials.

Figure 1 .
Figure 1.Precise mapping of two conserved N-terminal regions of CP190 that interact with architectural proteins.(A) Schematic presentation of the CP190 domain structure.The scale rule (in aa) is shown at the top of the figure.The highly conserved regions for the CP190 protein among Drosophila species are marked with orange lines.(B) Mapping of CP190 domains interacting with Opbp, M1BP, and ZIPIC proteins in the Y2H assay.Variants of CP190 were fused to the GAL4 DNA-binding domain and tested for interaction with the C2H2 proteins fused to the GAL4 activating domain.The results are summarized in columns with "+" and "−" signs referring to the presence and absence of interaction, respectively.The growth assay yeast plates are shown in Supplementary Materials FigureS3.

Figure 1 .
Figure 1.Precise mapping of two conserved N-terminal regions of CP190 that interact with architectural proteins.(A) Schematic presentation of the CP190 domain structure.The scale rule (in aa) is shown at the top of the figure.The highly conserved regions for the CP190 protein among Drosophila species are marked with orange lines.(B) Mapping of CP190 domains interacting with Opbp, M1BP, and ZIPIC proteins in the Y2H assay.Variants of CP190 were fused to the GAL4 DNA-binding domain and tested for interaction with the C2H2 proteins fused to the GAL4 activating domain.The results are summarized in columns with "+" and "−" signs referring to the presence and absence of interaction, respectively.The growth assay yeast plates are shown in Supplementary Materials FigureS3.

Figure 3 .Figure 3 .
Figure 3.The results of experiments designed to evaluate the ability of the CP190 variants to bind the selected chromatin sites.ChIP-qPCR analysis of CP190 binding to chromatin was performed using third-instar larvae of transgenic lines expressing different variants of the CP190 protein: CP190 WT -F (wt), CP190 hK -F (hK), CP190 Δ210-245 -F (Δ210-245), CP190 Δ309-373 -F (Δ309-373), and control Cp190 2 /Cp190 3 (Cp190 -).The most well-characterized CP190-bound sites associated with different architectural proteins were studied: (A) Su(Hw) binding regions.Magenta squares indicate the number of Su(Hw) binding sites in the tested regions.(B) Boundaries in the Bithorax complex.Blue squares indicate the number of dCTCF binding sites, and yellow squares indicate the number of Pita binding sites in the tested regions.Immunoprecipitation was performed with either FLAG (blue bars) or CP190 (green bars) antibodies.PCR products were amplified from three separate immunoprecipitates of three different chromatin preparations.The ras64B coding region (Ras) was used as a control, being devoid of CP190 binding regions.The percentage recovery of immunoprecipitated Figure 3.The results of experiments designed to evaluate the ability of the CP190 variants to bind the selected chromatin sites.ChIP-qPCR analysis of CP190 binding to chromatin was performed using third-instar larvae of transgenic lines expressing different variants of the CP190 protein: CP190 WT -F (wt), CP190 hK -F (hK), CP190 ∆210-245 -F (∆210-245), CP190 ∆309-373 -F (∆309-373), and control Cp190 2 /Cp190 3 (Cp190 − ).The most well-characterized CP190-bound sites associated with different architectural proteins were studied: (A) Su(Hw) binding regions.Magenta squares indicate the number of Su(Hw) binding sites in the tested regions.(B) Boundaries in the Bithorax complex.Blue squares indicate the number of dCTCF binding sites, and yellow squares indicate the number of Pita binding sites in the tested regions.Immunoprecipitation was performed with either FLAG (blue bars) or CP190 (green bars) antibodies.PCR products were amplified from three separate immunoprecipitates of three different chromatin preparations.The ras64B coding region (Ras) was used as a control, being devoid of CP190 binding regions.The percentage recovery of immunoprecipitated DNA (Y axis) was calculated relative to the amount of input DNA.Error bars indicate standard deviations of three independent biological replicates.

Figure 4 .Figure 4 .
Figure 4. Mapping of the region in CP190 required to organize interbands in a polytene chromosom model.(A) Targeting the GAL4 DNA-binding region fused with the myc epitope (DBD) and the 1 1096 aa CP190 regions fused with the GAL4 DNA-binding region and the myc epitope (DBD:CP19 1-1096) to the 16 GAL4 binding sites in the 10A1-2 disc.The left panel demonstrates the polyten chromosomes in phase contrast.The right panel is an overlay of phase contrast and immunostainin with antibodies against myc (red), Chriz (green), Z4 (green), and CP190 (red).Black and white squar brackets and white arrows indicate the model 10A1-2 bands in different polytene chromosom preparations.At the top, the recruitment of the GAL4 DNA-binding region alone (DBD) did no induce the formation of the interband in the 10A1-2 band (negative control).At the bottom, the re cruitment of the GAL4 DNA-binding region fused with 1-1096 aa CP190 induces the formation o the interband in the 10A1-2 band (positive control).Scale bars, 10 µm.(B) Summary of the result for attracting truncated versions of CP190 to 16xGAL4 binding sites in the 10A1-2 band.The trun cated versions of CP190 were fused with the GAL4 DNA-binding region: DBD:CP190(1-309 DBD:CP190(1-430), DBD:CP190(1-470), DBD:CP190(1-590), and DBD:CP190(1-755)."Interband represents the presence ("yes") or absence ("no") of the interband in the 10A1-2 band model system The results of CP190, Z4, and Chriz protein recruitment are summarized in columns with "++", "+ Figure 4. Mapping of the region in CP190 required to organize interbands in a polytene chromosome model.(A) Targeting the GAL4 DNA-binding region fused with the myc epitope (DBD) and the 1-1096 aa CP190 regions fused with the GAL4 DNA-binding region and the myc epitope (DBD:CP190 1-1096) to the 16 GAL4 binding sites in the 10A1-2 disc.The left panel demonstrates the polytene chromosomes in phase contrast.The right panel is an overlay of phase contrast and immunostaining with antibodies against myc (red), Chriz (green), Z4 (green), and CP190 (red).Black and white square brackets and white arrows indicate the model 10A1-2 bands in different polytene chromosome preparations.At the top, the recruitment of the GAL4 DNA-binding region alone (DBD) did not induce the formation of the interband in the 10A1-2 band (negative control).At the bottom, the recruitment of the GAL4 DNA-binding region fused with 1-1096 aa CP190 induces the formation of the interband in the 10A1-2 band (positive control).Scale bars, 10 µm.(B) Summary of the results for attracting truncated versions of CP190 to 16xGAL4 binding sites in the 10A1-2 band.The truncated versions of CP190 were fused with the GAL4 DNA-binding region: DBD:CP190(1-309), DBD:CP190(1-430), DBD:CP190(1-470), DBD:CP190(1-590), and DBD:CP190(1-755)."Interband"represents the presence ("yes") or absence ("no") of the interband in the 10A1-2 band model system.The results of CP190, Z4, and Chriz protein recruitment are summarized in columns with "++", "+" and "−" signs referring to the full, partial presence, or absence, respectively, of tested proteins in the generated interband in the 10A1-2 band.Other terms are as in Figure1.