Next Article in Journal / Special Issue
QTL Mapping and Candidate Gene Screening for Enhancing Oil Content in Silage Maize
Previous Article in Journal
Genome-Wide Analysis of NAC Transcription Factor Gene Family in Morus atropurpurea
Previous Article in Special Issue
A Bread Wheat Line with the Substituted Wild Emmer Chromosome 4A Results in Fragment Deletions of Chromosome 4B and Weak Plants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Decoding G-Quadruplexes Sequence in Vitis vinifera: Regulatory Region Enrichment, Drought Stress Adaptation, and Sugar–Acid Metabolism Modulation

1
College of Forestry, Gansu Agriculture University, Lanzhou 730070, China
2
State Key Laboratory of Efficient Production of Forest Resources, Yinchuan 750004, China
3
College of Forestry, Shandong Agricultural University, Tai’an 271018, China
4
Beijing Anling Ecological Construction Co., Ltd., Beijing 102300, China
5
Horticulture Research Institute, Ningxia Academy of Agricultural and Forestry Sciences, Yinchuan 750012, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2025, 14(8), 1180; https://doi.org/10.3390/plants14081180
Submission received: 7 March 2025 / Revised: 4 April 2025 / Accepted: 8 April 2025 / Published: 10 April 2025
(This article belongs to the Special Issue Bioinformatics and Functional Genomics in Modern Plant Science)

Abstract

:
G-quadruplexes play a crucial role in transcription, translation, and DNA replication in plant genomes. Here, we comprehensively examined the prevalence and functions of G-quadruplexes in Vitis vinifera. A total of 467,813 G-quadruplexes were identified in grapevine genome, with enrichment in the promoter (0.54/kbp) and near transcription start sites (TSSs, 1.00/kbp), and showed conservative strand preference. The G-quadruplex density in centromeres exhibited heterogeneity. The differentially expressed genes (DEGs) under two-day drought stress manifested high G-quadruplex density in the promoter and TSS regions. The upregulated DEGs showed template strand-biased G-quadruplex enrichment, while downregulated DEGs displayed coding strand dominance linked to metal ion homeostasis and sugar–acid metabolism pathways, respectively. G-quadruplexes were enriched in key sugar–acid metabolism genes, including pyruvate kinase and sucrose synthase. The number of G-quadruplexes in sucrose transferase VINV genes was higher than that in the CWINV and NINV genes. This study revealed G-quadruplexes as regulatory elements of stress response and berry development, providing abundant genetic targets for precision breeding and the quality improvement of grapevines.

1. Introduction

The G-quadruplexes are novel nucleic acid secondary structures formed by the folding of guanine-rich DNA or RNA sequences [1,2]. The fundamental structural unit is the G-tetrad, where four guanine bases are paired through Hoogsteen hydrogen bonding [3]. Cations (such as K+, Na+, Ca2+, Mg2+) could coordinate with the carbonyl oxygen atoms within the G-tetrad, thereby enhancing the stability of the G-quadruplex [4]. These structures are widely distributed in eukaryotic genomes and are particularly enriched in telomeres, promoters, enhancers, and non-coding regions [5,6]. In the promoter regions of cancer-related genes (such as MYC, KRAS, and VEGF), the formation of G-quadruplexes regulates the suppression or activation of oncogenes [7,8,9,10]. Numerous functional studies have demonstrated that G-quadruplexes play crucial roles in key biological processes such as gene transcription, DNA replication, genomic stability, and epigenetic regulation by modulating chromatin accessibility, impeding replication fork progression, or recruiting transcription factors [11,12]. G-quadruplexes have been extensively studied in humans. However, the characteristics and functions of G-quadruplexes in plant genomes remain largely unexplored.
Studies focusing on G-quadruplexes in plant genomes have been gradually expanded [13,14,15]. In Arabidopsis thaliana, genome-wide prediction discovered 43,000 G-quadruplexes, 80% of which were located in the gene region [16]. The enrichment of G-quadruplexes has also been observed in gene regions, promoters, and UTR regions in rice [17]. In Spirodela polyrhiza, the promoters of the key nitrogen assimilation genes SpNR and SpNiR exhibited strong G-quadruplex peaks [18]. G-quadruplexes have been found to be non-randomly distributed across the genome, particularly associated with regulatory regions of tobacco [19], wheat [20], barley [21], and pea [22]. Additionally, G-quadruplexes showed distinct DNA methylation patterns between transposon genes and non-transposon genes [17]. The specific distribution of G-quadruplexes in plant genomes suggested underlying correlation with gene expression and protein translation [23,24].
The function of G-quadruplex structures in plant development and stress resistance has been demonstrated. In A. thaliana, G-quadruplexes were overrepresented in genes encoding proteins with enzyme activity [16]. The G-quadruplexes in the 5′ UTR of SMXL4/5 are directly bound and induced by the zinc-finger protein JUL, which suppresses the translation of SMXL4/5 and restricts phloem differentiation [25]. Genetic and biochemical analyses have demonstrated that RNA G-quadruplex structures play a role in regulating gene expression and plant development [26]. A study in rice has demonstrated that G-quadruplexes exhibit location-dependent effects on gene expression, with promoter-localized G-quadruplexes positively correlating with expression and gene body-localized ones showing an inverse correlation [27].
G-quadruplex has been proven to promote the adaptation of plants in cold environments by enhancing the stability of mRNA [28]. In the maize genome, G-quadruplexes were enriched in genes associated with energy status, hypoxia, low sugar, and nutrient deprivation [29]. A study on sorghum suggested that the stress-responsive G-rich motif may be recognized by polyadenylation factors or RNA-binding proteins through the formation of G-quadruplex structures [30]. In Nicotiana tabacum, G-quadruplexes were enriched in drought stress-responsive genes and related gene families [19]. The Arabidopsis Pif-like helicase 1 mutant led to the redistribution of G-quadruplexes in the genome, with differentially enriched peaks affecting genes related to salt stress and heat stress [31].
Grapevine is a nutrient-rich fruit and an important economic crop, and it holds significant value in plant studies [32]. However, as a crucial regulatory element, the specific roles of G-quadruplex structures in grapevine stress tolerance, berry development, and quality formation have not yet been fully elucidated. The progressive improvement of genome assembly quality and the rapid advancement of bioinformatics have provided an opportunity for the study of grape G-quadruplexes [33,34]. This study thoroughly analyzed the distribution characteristics and potential functions of grapevine G-quadruplexes using T2T genome and transcriptome data, including genomic landscape, feature region distribution, G-quadruplexes associated with drought-responsive genes, and G-quadruplexes linked to key genes in sugar–acid metabolism. The findings significantly enhanced the understanding of the prevalence and function of G-quadruplexes in grapevine and horticultural plants.

2. Results

2.1. General Landscape of G-Quadruplexes in Vitis vinifera Genome

A total of 467,813 G-quadruplexes were identified in the V. vinifera nuclear genome, and the number and density of G-quadruplexes vary among chromosomes (Table 1). Chromosome PN18 formed the highest number of G-quadruplexes (32,786), while chromosome PN3 developed the lowest number (18,839). The G-quadruplex density on grapevine chromosomes ranged from 0.8 (PN16, PN19) to 1.1 (PN8, PN11) per kilobase. The average density of G-quadruplex in the grapevine genome was 0.95 occurrences per 1000 base pairs. A significant difference in G-quadruplex formation was observed in the centromeres (Table S1). In the centromere of PN19, only one G-quadruplex was detected. In the centromere of PN15, the 2883 G-quadruplexes were determined, with a G-quadruplex density of 1.61 per kbp. The 105 and 1092 G-quadruplexes were formed in the chloroplast and mitochondrial genomes, with densities of 0.72 and 1.41, respectively.
The genome-wide G-quadruplex landscape was presented, allowing the uncovering of mutual influence between G-quadruplexes and genomic features (Figure 1). To clarify the potential relationships between G-quadruplexes and genomic features, the correlation between G-quadruplex density and four genomic features, including GC content, gene density, tandem repeat density, and transposable element density, were investigated at the chromosome level. The results showed that G-quadruplexes were consistently and significantly positively correlated with all four genomic features (Figure 2).

2.2. Distribution Patterns of G-Quadruplexes in the Genomic Feature Regions

A large number of G-quadruplexes were characterized in each feature region (Table S2). The 272,652, 81,450, and 37,562 G-quadruplexes were characterized in gene region, promoter2000 region, and TSS500 region, respectively. G-quadruplexes were also frequently found in other feature regions, including exons, CDS, introns, 5′ UTR, and 3′ UTR.
Different G-quadruplex densities were found across these feature regions, and differences were also revealed between the template strand and the coding strand (Figure 3). The G-quadruplex density in the TSS500 region was 1.00/kbp, the highest among all investigated feature regions. The G-quadruplex density in the promoter2000 region was also relatively high, at 0.54/kbp. For the promoter region, as the truncated promoter sequences became shorter, the G-quadruplex density gradually increased. For the remaining feature regions, the G-quadruplex density was highest in CDS, followed by 5′ UTR, exon, gene, intron, and lowest in 3′ UTR. Almost equal G-quadruplex densities were observed in both the coding and template strands of the TSS500 region. In other feature regions, the G-quadruplex density on the coding strand was consistently higher than that on the template strand. The conserved strand preference pattern of G-quadruplexes in a specific feature region was particularly evident in the CDS region, where the density on the coding strand was 2.45-fold higher than that on the template strand. The highest G-quadruplex density (1.22/kb) of single-stranded regions was exposed on the coding strand in the CDS.

2.3. Functions of Genes with Highly Enriched G-Quadruplexes in Promoters and TSSs

G-quadruplexes were enriched in different amounts in the promoters and TSSs of genomic genes, and there were also differences across three sequence contexts (double, coding strand, and template strand) (Figure 4). G-quadruplexes were detected in the promoter2000 regions of 23,755 genes, accounting for 63.39% of all genes in the genome (Table S3). The promoter2000 regions of 1152 genes, representing 3.07% of all genes, contained more than 10 G-quadruplexes (Figure 4A). The 1152 genes were significantly enriched in many GO terms by Gene Ontology (GO) enrichment analysis, indicating their multiple roles in biological processes, cellular components, and molecular functions (Figure 4B). The biological process category included regulation of post-transcriptional gene silencing (GO:0060149, GO:0060147), nuclear-transcribed mRNA catabolic process (GO:0000184, GO:0070478, GO:0034427), and specification of plant organ identity (GO:0010093, GO:0090701). The cellular component category included the H3 histone acetyltransferase complex (GO:0070775), MOZ/MORF histone acetyltransferase complex (GO:0070776), histone acetyltransferase complex (GO:0000123), and protein acetyltransferase complex (GO:0031248). The molecular function category included RNA helicase activity (GO:0003724), helicase activity (GO:0004386), and DNA-binding transcription activator activity-RNA polymerase II-specific (GO:0001228).
G-quadruplexes were found in the TSS500 regions of 16,435 genes, accounting for 43.85% of all genes in the genome (Table S4). The 936 genes existed with more than 5 G-quadruplexes in their TSS500 regions, accounting for 2.50% of all genes (Figure 4C). The essential functions of these 936 genes were revealed through GO enrichment (Figure 4D). The biological process category included histone H3 acetylation (GO:0043966), histone acetylation (GO:0016573), internal peptidyl-lysine acetylation (GO:0018393), peptidyl-lysine acetylation (GO:0018394), internal protein amino acid acetylation (GO:0006475), and protein acetylation (GO:0006473). The cellular component category included H3 histone acetyltransferase complex (GO:0070775), MOZ/MORF histone acetyltransferase complex (GO:0070776), histone acetyltransferase complex (GO:0000123), and protein acetyltransferase complex (GO:0031248). The molecular function category included DNA-binding transcription activator activity-RNA polymerase II-specific (GO:0001228), transcription cis-regulatory region binding (GO:0000976), transcription regulatory region nucleic acid binding (GO:0001067), protein domain-specific binding (GO:0019904), and RNA polymerase II transcription regulatory region sequence-specific DNA binding (GO:0000977).

2.4. Prevalence of G-Quadruplexes in DEGs Under Drought Stress

A large number of differentially expressed genes were found under drought stress of two days, four days, and eight days (Tables S5–S7). The distinct G-quadruplex distribution was discovered in DEGs under drought stress for two days (Figure 3 and Figure 5). For the G-quadruplex density in the TSS500, promoter2000, promoter1500, promoter1000, and promoter500 regions, DEGs exhibited higher density compared to all genes in the genome. Similarly, for the coding strand G-quadruplex density in these five feature regions, DEGs showed higher densities than the genome-wide genes. However, for the coding strand G-quadruplex density in the remaining feature regions, including gene, exon, CDS, intron, 5′ UTR, and 3′ UTR, DEGs exhibited lower densities than the genome-wide genes (Figure 5).
Notably, opposite G-quadruplex patterns were found in transcriptional regulatory regions of the upregulated DEGs and downregulated DEGs following two days of drought stress, including TSS500, promoter2000, promoter1500, promoter1000, and promoter500 regions (Figure 5). For the G-quadruplex density in these five specific regions, the template strand of upregulated DEGs exhibited higher values than the coding strand, with a fold change ranging from 2.10 to 3.02. In contrast, for downregulated DEGs, the coding strand showed greater values than the template strand, with a fold change ranging from 3.03 to 4.67 (Figure 5 and Figure 6). Extensive significance was also found in these obvious differences (Figure S1).
In the upregulated/downregulated DEGs under drought stress at 4 and 8 days, the fold change in G-quadruplex density between the two complementary strands in the TSS500 and promoter regions (promoter2000, promoter1500, promoter1000, promoter500) gradually decreased compared to the difference of DEGs under drought stress at 2 days (Figure 6). In the downregulated DEGs at 4 days of drought stress, the evidence that the coding strand G-quadruplex density was higher than that of the template strand G-quadruplex density could still be observed, whereas this state was disrupted in the downregulated DEGs at 8 days of drought stress (Figure 6).
To further clarify the potential significance of the specific G-quadruplex patterns in DEGs after two days of drought stress, upregulated DEGs and downregulated DEGs with a fold change greater than three in G-quadruplex density between the coding strand and template strand in the TSS500 region were selected as representatives for GO enrichment analysis (Figure 7). The GO enrichment of upregulated DEGs revealed that these genes were broadly associated with multiple metal ion homeostasis in both the biological process and molecular function categories, including cellular transition metal ion homeostasis, cellular cation homeostasis, copper ion homeostasis, manganese ion homeostasis, iron ion transmembrane transporter activity, and the sequestering of metal ions. The biological process of positive regulation of DNA-binding transcription factor activity was also enriched (Figure 7). The GO analysis of downregulated DEGs revealed that these genes were enriched in several biological processes, including circadian rhythm, photosynthesis, regulation of carbohydrate biosynthetic processes, and regulation of cellular carbohydrate metabolic processes (Figure 7).

2.5. G-Quadruplexes in the Grape Berry Sugar–Acid Metabolism Pathway

The 84 key protease genes, classified into 15 distinct types, were annotated in sugar–acid metabolic pathways, participating in sucrose metabolism, glycolysis, gluconeogenesis, and the tricarboxylic acid cycle (Table S5). The vast majority of these genes contain G-quadruplexes in the gene region, TSS500 region, and promoter region (Figure 8 and Table S5).
The highest number of G-quadruplexes was found in pyruvate kinase (PK) genes, with 148 in the gene region, 19 in the TSS500 region, and 51 in the promoter2000 region (Figure 8). Many G-quadruplexes were also detected in feature regions of other important protease gene categories. For instance, 60, 52, and 51 G-quadruplexes were found in the gene regions of citrate synthase (CS) genes, isocitrate dehydrogenase (IDH) genes, and phosphofructokinase (PFK) genes, respectively. Moreover, 12, 12, and 10 G-quadruplexes were identified in the TSS500 regions of sucrose synthase (SS) genes, hexokinase (HK) genes, and phosphoenolpyruvate carboxylase (PEPC) genes. Additionally, in the promoter2000 regions of SS genes, HK genes, and PFK genes, 36, 15, and 10 G-quadruplexes were detected, respectively (Figure 8). G-quadruplex expansion was particularly prevalent in several genes. A total of 53 G-quadruplexes were identified in the gene region of Vitvi021099 (CS). In the TSS500 region of Vitvi010819 (HK), seven G-quadruplexes were found. Additionally, 14 G-quadruplexes were observed in the promoter region of Vitvi015209 (PK) (Figure 8). The CWINV, NINV, and VINV genes contained G-quadruplexes in different quantities: 16, 9, and 31 in their gene body regions; 1, 5, and 8 within TSS500 regions; and 1, 4, and 3 in promoter regions. Additionally, significant differences in the number and distinct motif patterns of G-quadruplexes between NINV and VINV genes were revealed.
These key protease genes engaged in sugar acid metabolism, which possess G-quadruplexes, also respond to drought stress. The expression of the Vitvi030819 (PK) was significantly upregulated after two days of drought stress, with two G-quadruplexes in its gene region. The Vitvi011578 (SS) was significantly upregulated after eight days of drought stress, with three G-quadruplexes present in its gene region and two in the TSS500 region. Furthermore, Vitvi005213 (PK) and Vitvi035569 (SPS) showed significant downregulation after eight days of drought stress, and G-quadruplexes were also found in their gene regions, TSS500 regions, and promoter2000 regions. These results suggest that G-quadruplexes might play an extensive regulatory role as gene expression regulators in drought responses and quality-related metabolism.

3. Discussion

3.1. High-Density G-Quadruplexes May Facilitate Grapevine Adaptive Evolution

G-quadruplexes were widely and diversely distributed in the grapevine genome. The G-quadruplex density in the grapevine nuclear genome was 0.95 per kbp, which was higher than that of herbaceous plants identified by the same means, such as A. thaliana (0.34 per kbp) [35], pea (0.42 per kbp) [22], and tobacco (0.5 per kbp) [19]. In grapevine, the G-quadruplex density in the mitochondrial genome was higher than that in the chloroplast genome, suggesting an important regulatory role of G-quadruplexes in mitochondria [36]. G-quadruplex density was confirmed to be significantly positively correlated with TE density, suggesting that TE may play a crucial role in G-quadruplex expansion. The high-density G-quadruplexes as a form of epigenetic regulation may have facilitated environmental adaptation in woody plants such as grapevine [37] and potentially enhanced the cost–benefit ratio of the genome [38]. The impact of G-quadruplex sequence variations mediated by nucleotide polymorphisms on gene expression and local adaptation will be further elucidated with the assembly of the Vitis pan-genome [39,40].
Plant centromeres are composed of highly repetitive satellite DNA and retrotransposon elements [41,42], ensuring accurate chromosome segregation and maintaining genomic stability [43,44]. Several studies have uncovered the prevalence of non-B DNA at centromeres [45,46,47,48]. G-quadruplexes were distributed within the long terminal repeat retrotransposons of Citrus sinensis centromeres [49]. In the oat centromere, G-quadruplexes formed within the CENH3-binding regions [50]. Benefiting from the complete genome of grapevine, we found that G-quadruplexes were unevenly distributed among all centromeres. The centromere of chromosome PN15 exhibited the highest G-quadruplex density (1.61/kbp), which may have contributed to enhancing the structural stability of this region, thereby more effectively ensuring the accurate transmission of genetic information. Additionally, the presence of high-density G-quadruplexes in PN15 might influence chromatin accessibility, thereby regulating gene transcription activity. However, the lower G-quadruplex density in other centromeres of grapevine suggested that genomic regulation in these regions might rely on alternative mechanisms, such as histone modifications or DNA methylation [51]. In the future, the causes and consequences of the significant differences in G-quadruplex formation among different centromeres would be an interesting question. The genomic structural heterogeneity might have represented different adaptive evolutionary patterns of grape centromeres among different chromosomes.

3.2. G-Quadruplexes Enriched in Transcriptional Regulatory Regions with Strand Bias

In the grapevine promoter and TSS500 regions, 81,450 and 37,562 G-quadruplexes were identified, respectively. As the transcription start site was approached, the density of G-quadruplexes increased. In these two regulatory regions, the G-quadruplex densities were 0.54/kbp and 1.00/kbp, respectively, showing higher G-quadruplex density compared to tobacco [19]. Notably, in these transcriptional control regions of the grapevine genome, the G-quadruplex density on the coding strand was greater than that on the template strand.
Genes with G-quadruplexes enriched in genetic regulatory regions were defined as genes with more than ten G-quadruplexes in the promoter and genes with more than five G-quadruplexes in TSS500. GO enrichment analysis indicated that these genes were involved in important biological processes in grapevine, including post-transcriptional gene silencing (GO:0060149, GO:0060147), histone acetylation (GO:0016573, GO:0043966), and specific binding of the transcription regulatory region (GO:0000976, GO:0001067). In rice, K+-specific G-quadruplexes were correlated with active histone marks and low DNA methylation levels [52]. It is worth noting that genes with the same GO function (e.g., GO0060149, GO0008186, GO0002576, GO0019904) almost all exhibited the same strand preference pattern for their G-quadruplexes. In the human genome, G-quadruplexes were enriched in the promoter regions as cis-acting elements, and their functions and therapeutic potential have garnered widespread attention [5,53,54]. The significant enrichment of G-quadruplexes in the transcriptional regulatory regions, along with conserved strand preference distribution, underscores their indispensable function in the regulation of gene expression in grapevine.

3.3. G-Quadruplex Might Participate in the Rapid Response to Drought Stress in Grapevine

To uncover the potential role of G-quadruplexes in abiotic stress, G-quadruplexes of DEGs under drought stress were specifically investigated. Using time-series transcriptomic data, the unique G-quadruplex pattern in DEGs under two days of drought stress was illustrated. In the promoter and TSS500 regions, the G-quadruplex density of DEGs was higher than that of all genes in the genome. The high G-quadruplex density phenotype in transcriptional regulatory regions was also observed in the DEGs under drought stress in A. thaliana and tobacco [19,35]. It was noteworthy that the opposite strand preference in the G-quadruplex distribution of different categories of DEGs was uncovered. For upregulated DEGs, the G-quadruplex density on the template strand was higher than that on the coding strand in the promoter and TSS500 regions. For downregulated DEGs, the density on the coding strand was greater than that on the template strand. Several studies have confirmed the impact of strand positioning on the function of G-quadruplexes. For example, the insertion of G-quadruplex sequences in the upstream regulatory region of the Escherichia coli expression system significantly enhanced transcription and translation efficiency when G-quadruplex structures formed in the coding strand. However, both transcription efficiency and fluorescence intensity were reduced when G-quadruplex sequences were inserted in the template strand [55]. The fold difference of G-quadruplex density in grapevine DEGs under drought stress between the two strands was around three, representing a conserved and undeniable distinction.
The upregulated genes with G-quadruplex strand preference were associated with the transport and homeostasis of various metal ions. The regulation of metal ions involved important processes such as osmotic regulation, the activation of antioxidant defense systems, and the protection of the photosynthetic system under drought stress [56,57,58]. Based on the ability of metal ions to stabilize G-quadruplex structures, we hypothesized that G-quadruplexes may function as receptor elements, allowing plants to rapidly respond to environmental changes without the need to synthesize new proteins; thus, this might help plants conserve valuable energy during drought. A similar but weaker high G-quadruplex density pattern to that observed in the second-day DEGs was also found in the DEGs on the fourth and eighth days of drought stress. These results emphasized that G-quadruplexes enriched in transcriptional regulatory regions may widely regulate drought stress tolerance and play a key role in rapid stress response. The molecular mechanisms by which G-quadruplexes participate in drought stress responses in grapevine can be further investigated through the integration of biophysical and molecular biological approaches [59,60].
Extensive and significant G-quadruplex differences were also identified among DEGs under 2-day, 4-day, and 8-day drought stress conditions. While no statistical significance was detected in several gene sets, we observed approximately threefold differences between the template and coding strands, representing a pronounced differential trend. In fact, the chain localization of G-quadruplex has been found to play a key role in biological processes. For example, recent scientific research shows that the DNA unwinding activity mediated by the Cdc45-MCM-GINS helicase complex often encounters structural barriers within chromosomal DNA, such as G-quadruplex structures, that may induce replication stress and compromise genomic integrity [61].

3.4. G-Quadruplexes as Targets of Sugar–Acid Regulation of Grape Berry

Sugar and acid metabolism was a core physiological process in grape berry development and stress response, and its dynamic balance directly affected fruit quality, flavor formation, and plant stress resistance [60,62,63]. G-quadruplexes were abundant in the key genes associated with sucrose metabolism, glycolysis, gluconeogenesis, and the tricarboxylic acid cycle in grapevine. In the TSS500 regions of SS, HK, and PK genes, there were 12, 12, and 19 G-quadruplexes, respectively, while in the promoter regions, there were 36, 15, and 51 G-quadruplexes, respectively. Multiple G-quadruplexes were also distributed in CS, IDH, PFK, and PEPC genes. In maize, G-quadruplexes were found to be present in genes associated with sugar metabolism [29]. In the grapevine genome, the number of G-quadruplexes in the VINV gene was higher than that in CWINV and NINV, particularly in the gene body region. The differences in G-quadruplex distribution among the three types of sucrose transferases may be attributed to variations in the GC content of their nucleotide sequences and could be related to their unique subcellular localization and functional specificity [64].
GO enrichment analysis revealed that genes involved in the regulation of carbohydrate biosynthesis and metabolic processes participated in drought stress resistance. The PK gene Vitvi030819 and the SS gene Vitvi011578 were significantly upregulated under drought stress, with two and three G-quadruplexes identified in their gene body regions, respectively. G-quadruplexes are expected to serve as important epigenetic targets for the precise regulation of sugar and acid metabolism based on the CRISPR/Cas system in the future, aiming to synergistically optimize berry flavor and stress adaptability [65,66]. Notably, biological large language models have recently demonstrated unique advantages in deciphering the relationship between nucleic acid sequences and gene expression regulation [67,68], including the deciphering of G-quadruplexes by PlantRNA-FM [69]. These advancements will provide new strategies and tools for G-quadruplex-based fruit quality improvement and stress-resistant molecular breeding while simultaneously accelerating the establishment of a molecular breeding paradigm based on nucleic acid secondary structure regulation.

4. Materials and Methods

4.1. Identification of G-Quadruplexes in the Vitis vinifera Genome

The Vitis vinifera T2T genome assembly (nucleus, mitochondria, chloroplast) and annotation (gene, tandem repeats, and TE) were retrieved from the Zenodo database (https://zenodo.org/records/10969867 (accessed on 10 September 2024)) [70]. The centromeric sequences were extracted based on the confirmed chromosome location using the SeqKit2 [71]. The G-quadruplexes in chromosomes and centromeres were identified using G4Hunter, with window 25 and threshold 1.2 [72]. The G4Hunter analysis outcomes were further cleaned. Finally, the genomic coordinates of each G-quadruplex were obtained. The G-quadruplex number, G-quadruplex density, sequence length, and GC content in each chromosome and centromere, along with detailed information on all G-quadruplexes, were summarized. The grapevine genome landscape was calculated and displayed using Advanced Circos [73], including G-quadruplex density, GC density, GC skew, gene density, tandem repeat density, and TE density. The correlation analyses were performed using the Pearson test.

4.2. G-Quadruplex Analysis of Genomic Feature Regions in the Grapevine Genome

The genome coordinates of feature regions, including exon, CDS, intron, 5′ UTR, 3′ UTR, gene, promoter2000, promoter1500, promoter1000, promoter500, and TSS500, were extracted and calculated based on the genome annotation file. The promoter2000, promoter1500, promoter1000, and promoter500 regions were defined as the sequences located 2000 bp, 1500 bp, 1000 bp, and 500 bp upstream of the gene, respectively. TSS500 was delimited as the sequence composed of 250 bp on each side of the transcription start site. Derived from the genomic coordinates of G-quadruplexes and feature regions, the numbers of G-quadruplexes in these feature regions were quantified, and G-quadruplex densities were further calculated.

4.3. Functional Enrichment Analysis of Grapevine G-Quadruplex-Rich Genes

In accordance with the number of G-quadruplexes of the promoter region and TSS500 region in the DNA double strand, template strand, and coding strand, all genes in the grapevine genome were classified, and their percentages were calculated. GO enrichment analysis was performed on two gene clusters: (1) genes with more than 10 G-quadruplexes in the promoter double strand and (2) genes with more than 5 G-quadruplexes in the TSS500 double strand. The functional annotation of all genomic genes was conducted using eggNOG-mapper (http://eggnog-mapper.embl.de/ (accessed on 6 November 2024)) [74]. GO enrichment analysis and visualization were performed by deploying the clusterProfiler 4.6.0 R package [75] based on the Metware Cloud platform (https://cloud.metware.cn (accessed on 8 November 2024)).

4.4. G-Quadruplex Analysis of DEGs Under Drought Stress at Different Stages

To probe the G-quadruplex patterns of DEGs under drought stress at various stages, RNA-seq data (SRP132681) from leaf samples collected after 2, 4, and 8 days of drought stress were retrieved from the Sequence Read Archive (SRA) database [76]. The raw sequencing data were assessed for quality through FastQC v0.12.1. Adapter sequences were removed, and the first 12 bases of reads were trimmed to obtain clean reads using Trimmomatic v0.39 [77]. The grapevine genome index was constructed, and clean reads were mapped to the reference genome by HISAT2 v2.2.1 [78]. SAM files were then converted to reordered BAM files via SAMtools v1.7 [79]. Transcript abundances were estimated, and FPKM values were calculated using StringTie v2.1.7 [80]. Gene count values were obtained by the prepDE.py3 script provided by StringTie. DEGs were identified with DESeq2 [81], adopting a |log2FoldChange| ≥ 1 and padj ≤ 0.05 as the threshold.
All DEGs were classified into upregulated DEGs and downregulated DEGs, and the G-quadruplex density in feature regions of these DEGs was calculated. G-quadruplex density was displayed using ChiPlot (https://www.chiplot.online/ (accessed on 2 February 2025)). The upregulated and downregulated DEGs after 2 days of drought stress, with fold change greater than three for G-quadruplex density in template and coding strands of the TSS500 region, were selected for GO enrichment analysis using the clusterProfiler 4.6.0 R package.

4.5. G-Quadruplex Analysis of Key Genes in the Sugar–Acid Metabolism Pathway of Grape Berry

The key protease genes involved in the important sugar–acid metabolic pathways of grape berries were gathered from previously published studies [82,83]. The KO Entry of each protease gene was retrieved through the KEGG database [84]. Based on KO Entry and grapevine protein annotations from eggNOG, the key proteins involved in grape sugar–acid metabolism were extracted. The subcellular localization of the proteins was predicted using DeepLoc-2.0 [85]. The number of G-quadruplexes in the gene body, promoter2000, and TSS500 regions of genes encoding these proteins was calculated based on their genomic coordinates. G-quadruplex sequences were aligned using MAFFT v7.525 [86] and then trimmed using trimAl v1.5.0 [87]. The SeqLogo was shown using TBtools v2.210.

5. Conclusions

This study revealed that G-quadruplexes are widespread and functionally significant in the grapevine genome. Hundreds of thousands of G-quadruplexes were identified, exhibiting significant enrichment in transcriptional regulatory regions and demonstrating positive correlations with GC content, gene density, and transposon density. The G-quadruplex density within centromeres displays heterogeneity. Under drought stress, differentially expressed genes (DEGs) exhibited distinct strand-biased G-quadruplex patterns. The upregulated DEGs showed template strand dominance in promoters and TSS500 linked to metal ion homeostasis, while downregulated DEGs displayed coding strand preference associated with sugar–acid metabolism. Additionally, G-quadruplexes were highly enriched in key sugar–acid metabolism genes. Sucrose transferases with three distinct subcellular localizations exhibited varying abundances of G-quadruplexes. These findings highlight G-quadruplexes as versatile regulators of grapevine stress adaptation and berry development, offering novel targets for epigenetic engineering to enhance plant resilience and fruit quality.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants14081180/s1, Figure S1: The difference of G-quadruplex density (/kbp) between coding strand and template strand of differentially expressed genes under drought stress. (A) The density difference of G-quadruplex in promoter 2000, promoter 1500, promoter 1000, promoter 500 and TSS 500 regions of differentially expressed genes was up-regulated after two days of drought stress. (B) The density difference of G-quadruplex in promoter 2000, promoter 1500, promoter 1000, promoter 500 and TSS 500 regions of differentially expressed genes was down-regulated after two days of drought stress. (C) The density difference of G-quadruplex in promoter 2000, promoter 1500, promoter 1000, promoter 500 and TSS 500 regions of differentially expressed genes was up-regulated after four days of drought stress. (D) The density difference of G-quadruplex in promoter 2000, promoter 1500, promoter 1000, promoter 500 and TSS 500 regions of differentially expressed genes was down-regulated after four days of drought stress. (E) The density difference of G-quadruplex in promoter 2000, promoter 1500, promoter 1000, promoter 500 and TSS 500 regions of differentially expressed genes was up-regulated after eight days of drought stress. (F) The density difference of G-quadruplex in promoter 2000, promoter 1500, promoter 1000, promoter 500 and TSS 500 regions of differentially expressed genes was down-regulated after eight days of drought stress. Figure S2: Differences in the number of G-quadruplexes and specific motif patterns in the gene body regions of sucrose transferases. (A) Differences in the number of G-quadruplexes among three sucrose transferases. (B) Specific motif patterns of G-quadruplexes in the NINV genes. (C) Specific motif patterns of G-quadruplexes in the VINV genes. Table S1: G-quadruplex profile of Vitis vinifera centromeres; Table S2: The number of G-quadruplexes in feature regions of grapevine genome; Table S3: The number and percentage of genes with different numbers of G-quadruplexes in promoter2000; Table S4: The number and percentage of genes with different numbers of G-quadruplexes in TSS500; Table S5: Differentially expressed genes in two days under drought stress; Table S6: Differentially expressed genes in four days under drought stress; Table S7: Differentially expressed genes in eight days under drought stress; Table S8: The key genes in the sugar–acid metabolic pathway and their G-quadruplex enrichment.

Author Contributions

Conceptualization, Y.L. (Yi Li), M.X., J.X. and K.S.; methodology, Y.L. (Yi Li), M.X., J.X. and K.S.; software, J.X. and K.S.; validation, J.X. and K.S.; formal analysis, J.X. and K.S.; investigation, J.X., G.Q. and R.W.; resources, Y.L. (Yi Li) and M.X.; data curation, J.X., G.Q. and R.W.; writing—original draft preparation, J.X. and K.S.; writing—review and editing, Y.L. (Yi Li), M.X., J.X. and K.S.; visualization, J.X., K.S. and G.Q.; supervision, Y.L. (Yi Li), H.W., Y.L. (Yujuan Liu) and Q.J.; project administration, Y.L. (Yi Li), M.X., H.W. and Q.J.; funding acquisition, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by The Key Research and Development Project of Ningxia Hui Autonomous Region (2024BBF01002) and Research and Demonstration of Grassland Ecological Restoration Technology Model along the Silk Road (KY202002011).

Data Availability Statement

Data are contained within the article or Supplementary Materials.

Conflicts of Interest

Author Hongyuan Wu was employed by the company Beijing Anling Ecological Construction Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Crespi, M.; Ariel, F. Non-B DNA structures emerging from plant genomes. Trends Plant Sci. 2022, 27, 624–626. [Google Scholar] [CrossRef] [PubMed]
  2. Kwok, C.K.; Merrick, C.J. G-Quadruplexes: Prediction, Characterization, and Biological Application. Trends Biotechnol. 2017, 35, 997–1013. [Google Scholar] [CrossRef] [PubMed]
  3. Carvalho, J.; Mergny, J.L.; Salgado, G.F.; Queiroz, J.A.; Cruz, C. G-quadruplex, Friend or Foe: The Role of the G-quartet in Anticancer Strategies. Trends Mol. Med. 2020, 26, 848–861. [Google Scholar] [CrossRef]
  4. Bhattacharyya, D.; Arachchilage, G.M.; Basu, S. Metal Cations in G-Quadruplex Folding and Stability. Front. Chem. 2016, 4, 38. [Google Scholar] [CrossRef] [PubMed]
  5. Hänsel-Hertsch, R.; Di Antonio, M.; Balasubramanian, S. DNA G-quadruplexes in the human genome: Detection, functions and therapeutic potential. Nat. Rev. Mol. Cell Biol. 2017, 18, 279–284. [Google Scholar] [CrossRef]
  6. Guo, J.U.; Bartel, D.P. RNA G-quadruplexes are globally unfolded in eukaryotic cells and depleted in bacteria. Science 2016, 353, aaf5371. [Google Scholar] [CrossRef]
  7. Garcia, I.E.; Kirchner, A.; Melidis, L.; Tavares, R.D.A.; Dhir, S.; Simeone, A.; Yu, Z.T.; Madden, S.K.; Hermann, R.; Tannahill, D.; et al. G-quadruplex DNA structure is a positive regulator of MYC transcription. Proc. Natl. Acad. Sci. USA 2024, 121, e2320240121. [Google Scholar] [CrossRef]
  8. Pramanik, S.; Chen, Y.L.; Song, H.Y.; Khutsishvili, I.; Marky, L.A.; Ray, S.; Natarajan, A.; Singh, P.K.; Bhakat, K.K. The human AP-endonuclease 1 (APE1) is a DNA G-quadruplex structure binding protein and regulates KRAS expression in pancreatic ductal adenocarcinoma cells. Nucleic Acids Res. 2022, 50, 3394–3412. [Google Scholar] [CrossRef]
  9. Agrawal, P.; Hatzakis, E.; Guo, K.X.; Carver, M.; Yang, D.Z. Solution structure of the major G-quadruplex formed in the human VEGF promoter in K+: Insights into loop interactions of the parallel G-quadruplexes. Nucleic Acids Res. 2013, 41, 10584–10592. [Google Scholar] [CrossRef]
  10. Chen, L.Y.; Dickerhoff, J.; Sakai, S.; Yang, D.Z. DNA G-Quadruplex in Human Telomeres and Oncogene Promoters: Structures, Functions, and Small Molecule Targeting. Acc. Chem. Res. 2022, 55, 2628–2646. [Google Scholar] [CrossRef]
  11. Spiegel, J.; Adhikari, S.; Balasubramanian, S. The Structure and Function of DNA G-Quadruplexes. Trends Chem. 2020, 2, 123–136. [Google Scholar] [CrossRef]
  12. Varshney, D.; Spiegel, J.; Zyner, K.; Tannahill, D.; Balasubramanian, S. The regulation and functions of DNA and RNA G-quadruplexes. Nat. Rev. Mol. Cell Biol. 2020, 21, 459–474. [Google Scholar] [CrossRef]
  13. Sharma, H.; Garg, R. Knot-knot chronicles: Unveiling the G-quadruplexes. Crit. Rev. Biotechnol. 2025, 1–16. [Google Scholar] [CrossRef] [PubMed]
  14. Li, Z.; Tavakoli, M.; Jiang, R.; Zong, L.; Zhang, W. Identification and functional characterization of ssDNA in plants. Trends Plant Sci. 2025, 25, S1360–S1385. [Google Scholar] [CrossRef] [PubMed]
  15. Garg, R.; Aggarwal, J.; Thakkar, B. Genome-wide discovery of G-quadruplex forming sequences and their functional relevance in plants. Sci. Rep. 2016, 6, 28211. [Google Scholar] [CrossRef] [PubMed]
  16. Mullen, M.A.; Olson, K.J.; Dallaire, P.; Major, F.; Assmann, S.M.; Bevilacqua, P.C. RNA G-Quadruplexes in the model plant species Arabidopsis thaliana: Prevalence and possible functional roles. Nucleic Acids Res. 2010, 38, 8149–8163. [Google Scholar] [CrossRef]
  17. Feng, Y.L.; Tao, S.T.; Zhang, P.Y.; Sperti, F.R.; Liu, G.Q.; Cheng, X.J.; Zhang, T.; Yu, H.X.; Wang, X.E.; Chen, C.Y.; et al. Epigenomic features of DNA G-quadruplexes and their roles in regulating rice gene transcription. Plant Physiol. 2022, 188, 1632–1648. [Google Scholar] [CrossRef]
  18. Zhou, Y.Z.; Kishchenko, O.; Stepanenko, A.; Chen, G.M.; Wang, W.; Zhou, J.; Pan, C.Z.; Borisjuk, N. The Dynamics of NO3− and NH4+ Uptake in Duckweed Are Coordinated with the Expression of Major Nitrogen Assimilation Genes. Plants 2022, 11, 11. [Google Scholar] [CrossRef]
  19. Song, K.K.; Li, B.; Li, H.Z.; Zhang, R.; Zhang, X.H.; Luan, R.W.; Liu, Y.; Yang, L. The Characterization of G-Quadruplexes in Tobacco Genome and Their Function under Abiotic Stress. Int. J. Mol. Sci. 2024, 25, 4331. [Google Scholar] [CrossRef]
  20. Cagirici, H.B.; Sen, T.Z. Genome-Wide Discovery of G-Quadruplexes in Wheat: Distribution and Putative Functional Roles. G3-Genes Genomes Genet. 2020, 10, 2021–2032. [Google Scholar] [CrossRef]
  21. Cagirici, H.B.; Budak, H.; Sen, T.Z. Genome-wide discovery of G-quadruplexes in barley. Sci. Rep. 2021, 11, 7876. [Google Scholar] [CrossRef] [PubMed]
  22. Dobrovolná, M.; Bohálová, N.; Peska, V.; Wang, J.W.; Luo, Y.; Bartas, M.; Volná, A.; Mergny, J.L.; Brázda, V. The Newly Sequenced Genome of Pisum sativum Is Replete with Potential G-Quadruplex-Forming Sequences-Implications for Evolution and Biological Regulation. Int. J. Mol. Sci. 2022, 23, 8482. [Google Scholar] [CrossRef] [PubMed]
  23. Griffin, B.D.; Bass, H.W. Plant G-quadruplex (G4) motifs in DNA and RNA; abundant, intriguing sequences of unknown function. Plant Sci. 2018, 269, 143–147. [Google Scholar] [CrossRef]
  24. Yang, X.; Yang, M.; Deng, H.; Ding, Y. New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants. Front. Plant Sci. 2018, 9, 671. [Google Scholar] [CrossRef] [PubMed]
  25. Cho, H.; Cho, H.S.; Nam, H.; Jo, H.; Yoon, J.; Park, C.; Dang, T.V.T.; Kim, E.; Jeong, J.; Park, S.; et al. Translational control of phloem development by RNA G-quadruplex-JULGI determines plant sink strength. Nat. Plants 2018, 4, 376–390. [Google Scholar] [CrossRef]
  26. Yang, X.F.; Cheema, J.; Zhang, Y.Y.; Deng, H.J.; Duncan, S.; Umar, M.I.; Zhao, J.Y.; Liu, Q.; Cao, X.F.; Kwok, C.K.; et al. RNA G-quadruplex structures exist and function in vivo in plants. Genome Biol. 2020, 21, 226. [Google Scholar] [CrossRef]
  27. Foroozani, M. Unraveling the role of DNA G-quadruplexes in transcription in the rice. Plant Physiol. 2022, 188, 1412–1414. [Google Scholar] [CrossRef]
  28. Yang, X.F.; Yu, H.P.; Duncan, S.; Zhang, Y.Y.; Cheema, J.; Liu, H.F.; Miller, J.B.; Zhang, J.; Kwok, C.K.; Zhang, H.K.; et al. RNA G-quadruplex structure contributes to cold adaptation in plants. Nat. Commun. 2022, 13, 6224. [Google Scholar] [CrossRef]
  29. Andorf, C.M.; Kopylov, M.; Dobbs, D.; Koch, K.E.; Stroupe, M.E.; Lawrence, C.J.; Bass, H.W. G-Quadruplex (G4) Motifs in the Maize (Zea mays L.) Genome Are Enriched at Specific Locations in Thousands of Genes Coupled to Energy Status, Hypoxia, Low Sugar, and Nutrient Deprivation. J. Genet. Genom. 2014, 41, 627–647. [Google Scholar] [CrossRef]
  30. Chakrabarti, M.; de Lorenzo, L.; Abdel-Ghany, S.E.; Reddy, A.S.N.; Hunt, A.G. Wide-ranging transcriptome remodelling mediated by alternative polyadenylation in response to abiotic stresses in Sorghum. Plant J. 2020, 102, 916–930. [Google Scholar] [CrossRef]
  31. Singh, S.; Singh, S.; Gupta, K.; Sharma, H.; Bady, S.; Sondhi, M.; Garg, R. Genome-Wide Identification of G-Quadruplex forming regions in Arabidopsis: Unraveling the Role of Pif like Helicase (AtPLH1) in Gene Regulation and Stress Response. bioRxiv 2024. [Google Scholar] [CrossRef]
  32. Dong, Y.; Duan, S.C.; Xia, Q.J.; Liang, Z.C.; Dong, X.; Margaryan, K.; Musayev, M.; Goryslavets, S.; Zdunic, G.; Bert, P.F.; et al. Dual domestications and origin of traits in grapevine evolution. Science 2023, 379, 892–901. [Google Scholar] [CrossRef]
  33. Wang, Y.; Ding, K.Y.; Li, H.Y.; Kuang, Y.F.; Liang, Z.C. Biography of Vitis genomics: Recent advances and prospective. Hortic. Res. 2024, 11, 4. [Google Scholar] [CrossRef] [PubMed]
  34. Cantu, D.; Massonnet, M.; Cochetel, N. The wild side of grape genomics. Trends Genet. 2024, 40, 601–612. [Google Scholar] [CrossRef] [PubMed]
  35. Pecinka, P.; Bohálová, N.; Volná, A.; Kundrátová, K.; Brázda, V.; Bartas, M. Analysis of G-Quadruplex-Forming Sequences in Drought Stress-Responsive Genes, and Synthesis Genes of Phenolic Compounds in Arabidopsis thaliana. Life 2023, 13, 199. [Google Scholar] [CrossRef] [PubMed]
  36. Sahayasheela, V.J.; Yu, Z.; Hidaka, T.; Pandian, G.N.; Sugiyama, H. Mitochondria and G-quadruplex evolution: An intertwined relationship. Trends Genet. 2023, 39, 15–30. [Google Scholar] [CrossRef]
  37. Tuskan, G.A.; Groover, A.T.; Schmutz, J.; DiFazio, S.P.; Myburg, A.; Grattapaglia, D.; Smart, L.B.; Yin, T.M.; Aury, J.M.; Kremer, A.; et al. Hardwood Tree Genomics: Unlocking Woody Plant Biology. Front. Plant Sci. 2018, 9, 1799. [Google Scholar] [CrossRef]
  38. Faizullah, L.; Morton, J.A.; Hersch-Green, E.I.; Walczyk, A.M.; Leitch, A.R.; Leitch, I.J. Exploring environmental selection on genome size in angiosperms. Trends Plant Sci. 2021, 26, 1039–1049. [Google Scholar] [CrossRef]
  39. Guo, L.; Wang, X.; Ayhan, D.H.; Rhaman, M.S.; Yan, M.; Jiang, J.; Wang, D.; Zheng, W.; Mei, J.; Ji, W.; et al. Super pangenome of Vitis empowers identification of downy mildew resistance genes for grapevine improvement. Nat. Genet. 2025, 57, 741–753. [Google Scholar] [CrossRef]
  40. Liu, Z.J.; Wang, N.; Su, Y.; Long, Q.M.; Peng, Y.L.; Shangguan, L.F.; Zhang, F.; Cao, S.; Wang, X.; Ge, M.Q.; et al. Grapevine pangenome facilitates trait genetics and genomic breeding. Nat. Genet. 2024, 56, 2804–2814. [Google Scholar] [CrossRef]
  41. Tsukahara, S.; Bousios, A.; Perez-Roman, E.; Yamaguchi, S.; Leduque, B.; Nakano, A.; Naish, M.; Osakabe, A.; Toyoda, A.; Ito, H.; et al. Centrophilic retrotransposon integration via CENH3 chromatin in Arabidopsis. Nature 2025, 637, 744–748. [Google Scholar] [CrossRef] [PubMed]
  42. Wlodzimierz, P.; Rabanal, F.A.; Burns, R.; Naish, M.; Primetis, E.; Scott, A.; Mandáková, T.; Gorringe, N.; Tock, A.J.; Holland, D.; et al. Cycles of satellite and transposon evolution in Arabidopsis centromeres. Nature 2023, 618, 557–565. [Google Scholar] [CrossRef]
  43. Naish, M. Bridging the gap: Unravelling plant centromeres in the telomere-to-telomere era. New Phytol. 2024, 244, 2143–2149. [Google Scholar] [CrossRef] [PubMed]
  44. Naish, M.; Henderson, I.R. The structure, function, and evolution of plant centromeres. Genome Res. 2024, 34, 161–178. [Google Scholar] [CrossRef]
  45. Kasinathan, S.; Henikoff, S. Non-B-Form DNA Is Enriched at Centromeres. Mol. Biol. Evol. 2018, 35, 949–962. [Google Scholar] [CrossRef] [PubMed]
  46. Liu, Y.; Liu, Q.; Su, H.D.; Liu, K.P.; Xiao, X.; Li, W.; Sun, Q.W.; Birchler, J.A.; Han, F.P. Genome-wide mapping reveals R-loops associated with centromeric repeats in maize. Genome Res. 2021, 31, 1409–1418. [Google Scholar] [CrossRef]
  47. Patchigolla, V.S.P.; Mellone, B.G. Enrichment of Non-B-Form DNA at D. melanogaster Centromeres. Genome Biol. Evol. 2022, 14, 5. [Google Scholar] [CrossRef]
  48. Liu, C.; Huang, Y.H.; Guo, X.R.; Yi, C.Y.; Liu, Q.; Zhang, K.B.; Zhu, C.L.; Liu, Y.; Han, F.P. Young retrotransposons and non-B DNA structures promote the establishment of dominant rye centromere in the 1RS.1BL fused centromere. New Phytol. 2024, 241, 607–622. [Google Scholar] [CrossRef]
  49. Song, S.; Liu, H.; Miao, L.; Lan, H.; Chen, C. Centromeric repeats in Citrus sinensis provide new insights into centromeric evolution and the distribution of G-quadruplex structures. Hortic. Adv. 2023, 1, 7. [Google Scholar] [CrossRef]
  50. Liu, Q.; Yi, C.Y.; Zhang, Z.Y.; Su, H.D.; Liu, C.; Huang, Y.H.; Li, W.; Hu, X.J.; Liu, C.; Birchler, J.A.; et al. Non-B-form DNA tends to form in centromeric regions and has undergone changes in polyploid oat subgenomes. Proc. Natl. Acad. Sci. USA 2022, 120, e2211683120. [Google Scholar] [CrossRef]
  51. Altemose, N.; Logsdon, G.A.; Bzikadze, A.; Sidhwani, P.; Langley, S.A.; Caldas, G.; Hoyt, S.J.; Uralsky, L.; Ryabov, F.D.; Shew, C.J.; et al. Complete genomic and epigenetic maps of human centromeres. Science 2022, 376, eabl4178. [Google Scholar] [CrossRef]
  52. Feng, Y.L.; Luo, Z.Y.; Huang, R.R.; Yang, X.M.; Cheng, X.J.; Zhang, W.L. Epigenomic Features and Potential Functions of K+ and Na+ Favorable DNA G-Quadruplexes in Rice. Int. J. Mol. Sci. 2022, 23, 8404. [Google Scholar] [CrossRef] [PubMed]
  53. Huppert, J.L.; Balasubramanian, S. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res. 2006, 35, 406–413. [Google Scholar] [CrossRef]
  54. Zhang, R.X.; Wang, Y.Q.; Wang, C.; Sun, X.; Mergny, J.L. G-quadruplexes as pivotal components of cis-regulatory elements in the human genome. BMC Biol. 2024, 22, 177. [Google Scholar] [CrossRef] [PubMed]
  55. Lee, C.Y.; Joshi, M.; Wang, A.; Myong, S. 5′UTR G-quadruplex structure enhances translation in size dependent manner. Nat. Commun. 2024, 15, 3963. [Google Scholar] [CrossRef]
  56. Farooq, M.; Wahid, A.; Kobayashi, N.; Fujita, D.; Basra, S.M.A. Plant drought stress: Effects, mechanisms and management. Agron. Sustain. Dev. 2009, 29, 185–212. [Google Scholar] [CrossRef]
  57. Dong, Q.Y.; Wallrad, L.; Almutairi, B.O.; Kudla, J. Ca2+ signaling in plant responses to abiotic stresses. J. Integr. Plant Biol. 2022, 64, 287–300. [Google Scholar] [CrossRef]
  58. Zhang, H.M.; Zhu, J.H.; Gong, Z.Z.; Zhu, J.K. Abiotic stress responses in plants. Nat. Rev. Genet. 2022, 23, 104–119. [Google Scholar] [CrossRef]
  59. Mendoza, O.; Gueddouda, N.M.; Boulé, J.B.; Bourdoncle, A.; Mergny, J.L. A fluorescence-based helicase assay: Application to the screening of G-quadruplex ligands. Nucleic Acids Res. 2015, 43, e71. [Google Scholar] [CrossRef]
  60. Zhang, X.Y.; Spiegel, J.; Cuesta, S.M.; Adhikari, S.; Balasubramanian, S. Chemical profiling of DNA G-quadruplex-interacting proteins in live cells. Nat. Chem. 2021, 13, 626–633. [Google Scholar] [CrossRef]
  61. Batra, S.; Allwein, B.; Kumar, C.; Devbhandari, S.; Brüning, J.-G.; Bahng, S.; Lee, C.M.; Marians, K.J.; Hite, R.K.; Remus, D. G-quadruplex–stalled eukaryotic replisome structure reveals helical inchworm DNA translocation. Science 2025, 387, eadt1978. [Google Scholar] [CrossRef] [PubMed]
  62. Li, G.S.; Zhao, Y. The critical roles of three sugar-related proteins (HXK, SnRK1, TOR) in regulating plant growth and stress responses. Hortic. Res. 2024, 11, uhae099. [Google Scholar] [CrossRef]
  63. Chen, T.; Zhang, Z.; Li, B.; Qin, G.; Tian, S. Molecular basis for optimizing sugar metabolism and transport during fruit development. Abiotech 2021, 2, 330–340. [Google Scholar] [CrossRef] [PubMed]
  64. Wan, H.J.; Wu, L.M.; Yang, Y.J.; Zhou, G.Z.; Ruan, Y.L. Evolution of Sucrose Metabolism: The Dichotomy of Invertases and Beyond. Trends Plant Sci. 2018, 23, 163–177. [Google Scholar] [CrossRef] [PubMed]
  65. Qin, G.; Liu, Z.Q.; Yang, J.; Liao, X.F.; Zhao, C.Q.; Ren, J.S.; Qu, X.G. Targeting specific DNA G-quadruplexes with CRISPR-guided G-quadruplex-binding proteins and ligands. Nat. Cell Biol. 2024, 26, 1212–1224. [Google Scholar] [CrossRef]
  66. Ren, C.; Lin, Y.; Liang, Z. CRISPR/Cas genome editing in grapevine: Recent advances, challenges and future prospects. Fruit Res. 2022, 2, 7. [Google Scholar] [CrossRef]
  67. Lam, H.Y.I.; Ong, X.E.; Mutwil, M. Large language models in plant biology. Trends Plant Sci. 2024, 29, 1145–1155. [Google Scholar] [CrossRef]
  68. Nguyen, E.; Poli, M.; Durrant, M.G.; Kang, B.; Katrekar, D.; Li, D.B.; Bartie, L.J.; Thomas, A.W.; King, S.H.; Brixi, G.; et al. Sequence modeling and design from molecular to genome scale with Evo. Science 2024, 386, eado9336. [Google Scholar] [CrossRef]
  69. Yu, H.; Yang, H.; Sun, W.; Yan, Z.; Yang, X.; Zhang, H.; Ding, Y.; Li, K. An interpretable RNA foundation model for exploring functional RNA motifs in plants. Nat. Mach. Intell. 2024, 6, 1616–1625. [Google Scholar] [CrossRef]
  70. Shi, X.Y.; Cao, S.; Wang, X.; Huang, S.Y.; Wang, Y.; Liu, Z.J.; Liu, W.W.; Leng, X.P.; Peng, Y.L.; Wang, N.; et al. The complete reference genome for grapevine (Vitis vinifera L.) genetics and breeding. Hortic. Res. 2023, 10, uhad061. [Google Scholar] [CrossRef]
  71. Shen, W.; Sipos, B.; Zhao, L.Y. SeqKit2: A Swiss army knife for sequence and alignment processing. Imeta 2024, 3, e191. [Google Scholar] [CrossRef] [PubMed]
  72. Bedrat, A.; Lacroix, L.; Mergny, J.L. Re-evaluation of G-quadruplex propensity with G4Hunter. Nucleic Acids Res. 2016, 44, 1746–1759. [Google Scholar] [CrossRef] [PubMed]
  73. Chen, C.J.; Wu, Y.; Xia, R. A painless way to customize Circos plot: From data preparation to visualization using TBtools. Imeta 2022, 1, 3. [Google Scholar] [CrossRef] [PubMed]
  74. Cantalapiedra, C.P.; Hernández-Plaza, A.; Letunic, I.; Bork, P.; Huerta-Cepas, J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol. Biol. Evol. 2021, 38, 5825–5829. [Google Scholar] [CrossRef]
  75. Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L.; et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef]
  76. Katz, K.; Shutov, O.; Lapoint, R.; Kimelman, M.; Brister, J.R.; O’Sullivan, C. The Sequence Read Archive: A decade more of explosive growth. Nucleic Acids Res. 2022, 50, D387–D390. [Google Scholar] [CrossRef]
  77. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  78. Kim, D.; Paggi, J.M.; Park, C.; Bennett, C.; Salzberg, S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef]
  79. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; Genome Project Data, P. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
  80. Pertea, M.; Pertea, G.M.; Antonescu, C.M.; Chang, T.C.; Mendell, J.T.; Salzberg, S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015, 33, 290–295. [Google Scholar] [CrossRef]
  81. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef] [PubMed]
  82. Lu, L.; Delrot, S.; Liang, Z. From acidity to sweetness: A comprehensive review of carbon accumulation in grape berries. Mol. Hortic. 2024, 4, 22. [Google Scholar] [CrossRef]
  83. Hong, P.; Sadeghnezhad, E.; Wang, J.; Yu, W.; Zheng, J.; Zhong, R.; Xu, Y.; Zhang, Y.; Dong, T.; Fang, J.; et al. VvSnRK1-VvSS3 regulates sugar accumulation during grape berry ripening in response to abscisic acid. Sci. Hortic. 2023, 320, 112208. [Google Scholar] [CrossRef]
  84. Kanehisa, M.; Furumichi, M.; Sato, Y.; Matsuura, Y.; Ishiguro-Watanabe, M. KEGG: Biological systems database as a model of the real world. Nucleic Acids Res. 2024, 53, D672–D677. [Google Scholar] [CrossRef] [PubMed]
  85. Thumuluri, V.; Armenteros, J.J.A.; Johansen, A.R.; Nielsen, H.; Winther, O. DeepLoc 2.0: Multi-label subcellular localization prediction using protein language models. Nucleic Acids Res. 2022, 50, W228–W234. [Google Scholar] [CrossRef] [PubMed]
  86. Nakamura, T.; Yamada, K.D.; Tomii, K.; Katoh, K. Parallelization of MAFFT for large-scale multiple sequence alignments. Bioinformatics 2018, 34, 2490–2492. [Google Scholar] [CrossRef]
  87. Capella-Gutiérrez, S.; Silla-Martínez, J.M.; Gabaldón, T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 2009, 25, 1972–1973. [Google Scholar] [CrossRef]
Figure 1. G-quadruplex landscape in grapevine genome. (A) GC skew. (B) GC content. (C) Gene density. (D) Tandem repeats density. (E) TE density. (F) G-quadruplex density.
Figure 1. G-quadruplex landscape in grapevine genome. (A) GC skew. (B) GC content. (C) Gene density. (D) Tandem repeats density. (E) TE density. (F) G-quadruplex density.
Plants 14 01180 g001
Figure 2. The correlation between G-quadruplexes and genomic characteristics of (A) GC content, (B) gene density, (C) tandem repeat density, and (D) TE density at the chromosome level. The gene density, tandem repeat density, and TE density represented the number per 1000 base pairs on the chromosome.
Figure 2. The correlation between G-quadruplexes and genomic characteristics of (A) GC content, (B) gene density, (C) tandem repeat density, and (D) TE density at the chromosome level. The gene density, tandem repeat density, and TE density represented the number per 1000 base pairs on the chromosome.
Plants 14 01180 g002
Figure 3. The G-quadruplex density in different feature regions of grapevine genome. The promoter2000, promoter1500, promoter1000, and promoter500 represent the sequences located 2000 bp, 1500 bp, 1000 bp, and 500 bp upstream of the gene, respectively. TSS500 represents the sequence composed of 250 bp on each side of the transcription start site.
Figure 3. The G-quadruplex density in different feature regions of grapevine genome. The promoter2000, promoter1500, promoter1000, and promoter500 represent the sequences located 2000 bp, 1500 bp, 1000 bp, and 500 bp upstream of the gene, respectively. TSS500 represents the sequence composed of 250 bp on each side of the transcription start site.
Plants 14 01180 g003
Figure 4. The proportion and GO enrichment of genes involved in G-quadruplexes. (A) The proportion of genes with different G-quadruplex numbers in the promoter. (B) GO enrichment of genes rich in promoter G-quadruplex (≥11) in double strand. (C) The proportion of genes with different G-quadruplex numbers in the TSS500 region. (D) GO enrichment of genes rich in TSS500 G-quadruplex (≥6) in double strand. In the GO circle plot, the outermost circle represents the IDs of GO terms. The second circle indicates the total number of genes for each term, and the color of the heatmap represents −log10(Pvalue). The third circle shows the number of enriched genes, with pink/blue boxes representing the number of genes where the G-quadruplex density in the coding strand (CSGQ) was greater than/less than that in the template strand (TSGQ). The fourth circle represents the Rich factor.
Figure 4. The proportion and GO enrichment of genes involved in G-quadruplexes. (A) The proportion of genes with different G-quadruplex numbers in the promoter. (B) GO enrichment of genes rich in promoter G-quadruplex (≥11) in double strand. (C) The proportion of genes with different G-quadruplex numbers in the TSS500 region. (D) GO enrichment of genes rich in TSS500 G-quadruplex (≥6) in double strand. In the GO circle plot, the outermost circle represents the IDs of GO terms. The second circle indicates the total number of genes for each term, and the color of the heatmap represents −log10(Pvalue). The third circle shows the number of enriched genes, with pink/blue boxes representing the number of genes where the G-quadruplex density in the coding strand (CSGQ) was greater than/less than that in the template strand (TSGQ). The fourth circle represents the Rich factor.
Plants 14 01180 g004
Figure 5. G-quadruplex density of DEGs under drought stress. D2, drought stress for two days; D4, drought stress for four days; D8, drought stress for eight days. UpDEGs, upregulated DEGs; DownDEGs, downregulated DEGs.
Figure 5. G-quadruplex density of DEGs under drought stress. D2, drought stress for two days; D4, drought stress for four days; D8, drought stress for eight days. UpDEGs, upregulated DEGs; DownDEGs, downregulated DEGs.
Plants 14 01180 g005
Figure 6. The fold difference in G-quadruplex density between the coding strand and template strand of DEGs under drought stress. The right/left half of the figure represents the fold difference where the coding strand density was greater/less than the template strand density. G4SDF: the fold difference in G-quadruplex density between the two strands.
Figure 6. The fold difference in G-quadruplex density between the coding strand and template strand of DEGs under drought stress. The right/left half of the figure represents the fold difference where the coding strand density was greater/less than the template strand density. G4SDF: the fold difference in G-quadruplex density between the two strands.
Plants 14 01180 g006
Figure 7. GO function enrichment of representative DEGs under drought stress for two days. (A) GO enrichment of upregulated DEGs with a fold change greater than 3 in G-quadruplex density between the coding strand and template strand in the TSS500 region. (B) GO enrichment of downregulated DEGs with a fold change greater than 3 in G-quadruplex density between the coding strand and template strand in the TSS500 region.
Figure 7. GO function enrichment of representative DEGs under drought stress for two days. (A) GO enrichment of upregulated DEGs with a fold change greater than 3 in G-quadruplex density between the coding strand and template strand in the TSS500 region. (B) GO enrichment of downregulated DEGs with a fold change greater than 3 in G-quadruplex density between the coding strand and template strand in the TSS500 region.
Plants 14 01180 g007
Figure 8. G-quadruplex prevalence of sugar–acid metabolism pathway in the grape berry. (A) The sugar–acid metabolic regulation during the pre-véraison and post-véraison stages. (B) The G-quadruplex number and differential expression of key genes in the sugar–acid metabolism pathway. The red star and blue star represent significantly upregulated and downregulated gene expression under drought stress, respectively.
Figure 8. G-quadruplex prevalence of sugar–acid metabolism pathway in the grape berry. (A) The sugar–acid metabolic regulation during the pre-véraison and post-véraison stages. (B) The G-quadruplex number and differential expression of key genes in the sugar–acid metabolism pathway. The red star and blue star represent significantly upregulated and downregulated gene expression under drought stress, respectively.
Plants 14 01180 g008
Table 1. G-quadruplex profile of Vitis vinifera genome. The G-quadruplexes were identified by G4Hunter, with a window size of 25 and a threshold value of 1.2.
Table 1. G-quadruplex profile of Vitis vinifera genome. The G-quadruplexes were identified by G4Hunter, with a window size of 25 and a threshold value of 1.2.
ChromosomeLength (bp)GC Content
(%)
G-Quadruplex
Number
G-Quadruplex Density
(per kbp)
PN127,822,16236.026,0410.94
PN220,941,26335.221,5551.03
PN321,317,29034.818,8390.88
PN425,934,92834.725,2320.97
PN526,899,77135.327,7611.03
PN624,571,96935.523,6390.96
PN731,654,36234.828,5520.90
PN823,763,02335.026,6791.12
PN924,372,19934.122,2620.91
PN1027,504,06135.625,0680.91
PN1120,048,50835.022,2531.11
PN1224,706,00834.824,2710.98
PN1329,842,24234.528,7490.96
PN1430,475,31534.529,4510.97
PN1523,565,45635.422,3210.95
PN1627,608,94636.021,2070.77
PN1719,942,83635.420,3371.02
PN1836,684,27134.732,7860.89
PN1927,218,60035.120,8100.76
mitochondrion774,66344.110921.41
chloroplast145,13436.71050.72
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xie, J.; Song, K.; Qiao, G.; Wang, R.; Wu, H.; Jia, Q.; Liu, Y.; Li, Y.; Xu, M. Decoding G-Quadruplexes Sequence in Vitis vinifera: Regulatory Region Enrichment, Drought Stress Adaptation, and Sugar–Acid Metabolism Modulation. Plants 2025, 14, 1180. https://doi.org/10.3390/plants14081180

AMA Style

Xie J, Song K, Qiao G, Wang R, Wu H, Jia Q, Liu Y, Li Y, Xu M. Decoding G-Quadruplexes Sequence in Vitis vinifera: Regulatory Region Enrichment, Drought Stress Adaptation, and Sugar–Acid Metabolism Modulation. Plants. 2025; 14(8):1180. https://doi.org/10.3390/plants14081180

Chicago/Turabian Style

Xie, Jun, Kangkang Song, Gaixia Qiao, Rong Wang, Hongyuan Wu, Qiaoxia Jia, Yujuan Liu, Yi Li, and Meilong Xu. 2025. "Decoding G-Quadruplexes Sequence in Vitis vinifera: Regulatory Region Enrichment, Drought Stress Adaptation, and Sugar–Acid Metabolism Modulation" Plants 14, no. 8: 1180. https://doi.org/10.3390/plants14081180

APA Style

Xie, J., Song, K., Qiao, G., Wang, R., Wu, H., Jia, Q., Liu, Y., Li, Y., & Xu, M. (2025). Decoding G-Quadruplexes Sequence in Vitis vinifera: Regulatory Region Enrichment, Drought Stress Adaptation, and Sugar–Acid Metabolism Modulation. Plants, 14(8), 1180. https://doi.org/10.3390/plants14081180

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop