1. Introduction
Glioblastoma (GBM) is an aggressive brain tumor with inferior outcomes, as median survival remains roughly 15 months despite intensive therapy [
1]. Although genetic changes play a role [
2], significant epigenetic abnormalities also contribute to GBM’s aggressive behavior [
3,
4]. One of the most notable epigenetic features is abnormal DNA methylation. Promoter CpG island hypermethylation can transcriptionally silence key tumor-suppressor genes (e.g.,
RB1,
TP53,
PTEN) and pathways, whereas global hypomethylation of intergenic regions may activate oncogenes and promote genomic instability [
5,
6]. Such DNA methylation changes have direct clinical relevance. For example, epigenetic silencing of the O6-methylguanine DNA methyltransferase (
MGMT) gene via promoter hypermethylation impairs DNA repair and sensitizes GBMs to temozolomide, correlating with significantly improved patient survival [
7,
8,
9].
In this context, a distinct hypermethylation pattern, the CpG island methylator phenotype (CIMP), has attracted considerable interest. CIMP refers to coordinated hypermethylation of many gene promoters, first noted in colorectal cancer [
10] and later identified in many other malignancies [
11], including gliomas [
12]. In diffuse gliomas, glioma-CIMP (G-CIMP) is strongly associated with IDH1/2 mutations and defines a distinct epigenetic subclass [
12,
13]. Nearly all IDH-mutant lower-grade gliomas exhibit a G-CIMP profile, and these G-CIMP-positive tumors are associated with substantially better outcomes than their CIMP-negative counterparts [
12,
13,
14]. IDH mutations establish the G-CIMP phenotype through production of 2-hydroxyglutarate, which inhibits TET-mediated DNA demethylation and leads to widespread hypermethylation [
15]. By contrast, CIMP is comparatively rare in IDH-wild-type GBM [
16], which typically displays only limited or focal promoter methylation changes.
Recent studies indicate that certain IDH–wild-type GBMs can exhibit a CIMP-like hypermethylation pattern. Notably, methylation profiling has revealed an IDH-independent CIMP-like subset within
RTK2-subtype GBMs [
17]. These findings suggest that epigenetic subtypes in GBM are more complex than a simple binary model and that a subset of CIMP-negative GBMs with widespread promoter hypermethylation may represent a biologically distinct group with unique molecular regulators and clinical behavior [
17,
18,
19]. Moreover, DNA methylation landscapes in gliomas are shaped by multiple factors, including tumor differentiation state [
20] and dynamic epigenetic reprogramming during clonal evolution [
21,
22].
Although G-CIMP gliomas typically have better outcomes, recent studies indicate considerable heterogeneity within this group [
14,
23,
24]. Methylation profiling has identified G-CIMP–low tumors with reduced methylation and poor survival, and G-CIMP–high tumors with extensive methylation and more favorable outcomes [
25]. Some IDH-mutant astrocytomas also progress aggressively despite being G-CIMP–positive [
26], indicating that IDH status alone cannot fully predict behavior. Current glioma classifications do not integrate genome-wide methylation patterns beyond
IDH and
MGMT [
27], and potential prognostic methylation events remain underexplored. To address this, we analyzed gene-specific DNA methylation and expression patterns and evaluated their associations with patient survival. Our integrative approach aimed to uncover epigenetic markers that improve prognostic accuracy beyond IDH-based systems.
2. Materials and Methods
2.1. Data Acquisition and Sample Classification
2.1.1. Data Acquisition and Preprocessing
DNA methylation and gene expression data, along with associated clinical information, were obtained from The Cancer Genome Atlas (TCGA). The discovery cohort comprised the TCGA GBM dataset (NFirehose Legacy = 285), and the validation cohort used the TCGA GBMLGG dataset (N = 657, including GBM and lower-grade gliomas). Data preprocessing was performed in R (v4.4.3). Methylation levels were represented as β-values (0–1), and RNA-seq data were log2-transformed normalized counts. Only samples with complete methylation, expression, and clinical data were retained for analysis.
2.1.2. Classification of Methylation Phenotypes
Tumors were classified using established glioma molecular and epigenetic frameworks as described by Noushmehr et al., Ceccarelli et al., and the TCGA Consortium. Canonical G-CIMP status was defined by IDH mutation status: IDH-mutant diffuse gliomas were classified as G-CIMP, and IDH-wildtype tumors as non-G-CIMP. Because the TCGA-GBM discovery cohort consists almost entirely of IDH-wildtype glioblastomas, no canonical G-CIMP tumors were present in this cohort, and all samples were classified as non-G-CIMP. Canonical G-CIMP stratification was applied only to the independent TCGA-GBMLGG validation cohort, where IDH mutation status and tumor grade information were available. Global DNA methylation levels were evaluated only as descriptive features and were not used for subtype assignment. This IDH-based definition was used to avoid reliance on arbitrary global methylation thresholds and to ensure consistency with established G-CIMP classifications (
Supplementary Table S1). This approach captures the canonical, IDH-associated G-CIMP phenotype but does not represent a genome-wide methylation classifier (e.g., the Heidelberg classifier).
2.2. Validation of Canonical G-CIMP Methylation Patterns
To confirm that our stratification captured canonical G-CIMP biology, we examined promoter methylation of 10 established G-CIMP marker genes (PDGFRA, EGFR, CDKN2A, MGMT, RBP1, GATA6, SOX10, DCC, and two HOXA family genes) in the TCGA-GBMLGG validation cohort. Promoter CpG β-values were averaged per sample, and group comparisons used Wilcoxon rank-sum tests with Benjamini–Hochberg (BH) correction (adj. p < 0.05).
2.3. Genome-Wide Methylation Analysis
In the discovery cohort, genome-wide differential methylation between G-CIMP and non-CIMP tumors was assessed using Mann–Whitney U tests at each CpG site, with BH correction across all loci (adj. p < 0.05). Mean β-value differences between groups indicated the direction and magnitude of methylation changes. The top 50 differentially methylated CpGs (ranked by adjusted p-value) were used for unsupervised hierarchical clustering to visualize subtype separation. In the validation cohort, we focused on validating key differentially methylated genes identified in the discovery cohort, comparing promoter methylation between groups using Wilcoxon tests with BH correction (q < 0.05).
2.4. Methylation-Expression Coupling Analysis
2.4.1. Differential Gene Expression Analysis
Differential gene expression analyses were performed to investigate transcriptional differences associated with DNA methylation patterns and to identify candidate epigenetically regulated genes relevant to glioma biology and patient outcome. Analyses focused on two primary categories of genes: (i) core DNA methylation–associated regulators (DNMT1, DNMT3A, DNMT3B, TET1, TET2, TET3, MECP2, MBD1, MBD2, MBD3, and MBD4), and (ii) genes identified as differentially methylated between tumor subgroups in genome-wide methylation analyses.
In the TCGA-GBM discovery cohort, log2-normalized expression values were compared between tumors using unpaired t-tests. Because this cohort consists predominantly of IDH-wildtype glioblastomas and does not include canonical G-CIMP tumors, expression analyses were conducted without G-CIMP stratification and used to identify gene-level expression patterns associated with differential methylation status. Nominal statistical significance was defined as p < 0.05, and multiple-testing correction was applied using the Benjamini–Hochberg method, with adjusted p < 0.05 considered significant.
In the TCGA-GBMLGG validation cohort, differential expression analyses were restricted to candidate genes identified in the discovery phase, including TUBB2A, TM4SF1, SCD5, and selected DNA methylation regulators. Expression levels were compared between canonical G-CIMP (IDH-mutant) and non-G-CIMP (IDH-wildtype) tumors using the same statistical framework. These analyses assessed the reproducibility and generalizability of gene-specific expression differences across a molecularly heterogeneous glioma population encompassing multiple tumor grades. Together, these expression analyses provided a foundation for subsequent integrative assessment of methylation–expression coupling and for prioritizing candidate genes for downstream survival analyses.
2.4.2. Integration of Methylation and Expression Data
To assess the functional relevance of differential DNA methylation, integrative analyses were performed to evaluate the relationship between promoter methylation and gene expression. For genes exhibiting significant promoter methylation differences, corresponding mRNA expression levels were examined across samples to determine whether methylation status was associated with transcriptional regulation. Promoter methylation levels were summarized as mean β-values across promoter-associated CpG probes for each gene and sample. Promoter-associated CpG probes were defined based on Illumina array annotation of gene promoter regions, and all probes mapping to a given promoter were included in the calculation. Spearman rank correlation analyses were then conducted to evaluate the relationship between promoter methylation and log2-normalized gene expression levels. Correlations were computed across samples within each cohort, and the strength and direction of association were reported. Genes were classified as exhibiting canonical methylation–expression coupling when increased promoter methylation was associated with reduced gene expression, consistent with transcriptional repression. Genes that did not show a clear inverse relationship were classified as exhibiting non-canonical or context-dependent coupling, reflecting more complex regulatory behavior. Representative scatter plots were generated to visualize these relationships for selected candidate genes, including TUBB2A and TM4SF1.
These integrative analyses were performed separately in the discovery and validation cohorts to assess the consistency of methylation–expression relationships across datasets with differing molecular composition. Results from these analyses informed the prioritization of candidate genes for downstream survival analyses and interpretation of context-dependent regulatory patterns.
2.5. Analysis of DNA Methylation Regulators
Expression and promoter methylation of key methylation regulators (MECP2, MBD1, MBD3, DNMT3A) were compared between groups in both cohorts using t-tests for expression differences and Wilcoxon tests for methylation differences. These analyses evaluated whether G-CIMP tumors display alterations in epigenetic machinery that establish or maintain their hypermethylated state.
2.6. Survival and Prognostic Analysis
2.6.1. Discovery Cohort (TCGA-GBM)
Kaplan–Meier survival analyses evaluated prognostic associations of molecular features in the discovery cohort. Samples were dichotomized by median expression for candidate genes (TUBB2A, TM4SF1, CCDC91, TGFB2, WHSC1), and survival distributions were compared using log-rank tests.
2.6.2. Validation Cohort (TCGA-GBMLGG)
In the validation cohort, patients were stratified by median expression or methylation of key discovery genes (particularly TUBB2A, TM4SF1, SCD5). Kaplan–Meier curves were generated, and log-rank tests and hazard ratios from univariate Cox models were used to assess the robustness and generalizability of prognostic associations across glioma grades.
2.6.3. Multivariable Survival Analysis
To assess the independent prognostic value of candidate epigenetically regulated genes, multivariable Cox proportional hazards regression models were fitted in the TCGA-GBMLGG validation cohort. Models included age at diagnosis, tumor grade, IDH mutation status, 1p/19q codeletion status, and MGMT promoter methylation status (when available), together with gene-specific promoter methylation levels for TUBB2A and TM4SF1. Promoter methylation variables were entered into the Cox proportional hazards models as continuous β-values (range 0–1), and hazard ratios were interpreted per unit increase in promoter methylation. Hazard ratios (HRs), 95% confidence intervals (CIs), and Wald p-values were reported. Proportional hazards assumptions were evaluated using Schoenfeld residuals.
2.7. Statistical Analysis
All statistical tests were two-sided, with p < 0.05 considered significant. False discovery rate (FDR) correction using the Benjamini–Hochberg method was applied to multiple comparisons, as indicated. Data visualization (volcano plots, heatmaps, scatter plots, Kaplan–Meier curves) was performed using the R packages ggplot2 and survminer.
3. Results
3.1. Cohort Characteristics and Canonical G-CIMP Stratification
3.1.1. Discovery Cohort (TCGA-GBM)
The TCGA-GBM discovery cohort consisted predominantly of IDH-wildtype glioblastomas. Because canonical G-CIMP is tightly linked to IDH mutation status, no G-CIMP tumors were present in this cohort. Accordingly, the discovery dataset was used exclusively to identify gene-level DNA methylation–expression relationships and candidate prognostic loci, without applying G-CIMP stratification.
3.1.2. Validation Cohort (TCGA-GBMLGG)
To evaluate the robustness and clinical relevance of canonical G-CIMP classification beyond GBM, we analyzed the TCGA-GBMLGG cohort (n = 657), which comprises WHO grades II–IV diffuse gliomas and includes a substantial proportion of IDH-mutant tumors. Using canonical IDH-defined G-CIMP classification, the validation cohort demonstrated a significant separation in overall survival between G-CIMP and non-G-CIMP tumors (log-rank
p < 0.0001;
Figure 1). G-CIMP tumors showed significantly longer overall survival compared with non-G-CIMP tumors. This pattern is consistent with the established association between IDH mutation–defined G-CIMP status and favorable clinical outcome.
3.2. Validation of Canonical G-CIMP Markers
To confirm that tumors classified using canonical G-CIMP criteria exhibit expected methylation features, we examined promoter methylation of established G-CIMP marker genes. The distribution of promoter β-values for each marker (
Figure 2A–J) showed a clear and consistent shift toward higher methylation in G-CIMP tumors.
Across the TCGA-GBMLGG validation cohort, eight of the ten genes showed higher promoter methylation in G-CIMP tumors, with several reaching nominal significance (
Table 1). These results confirm strong subtype-specific methylation enrichment.
Markers showing the most pronounced difference, RBP1, SOX10, GATA6, HOXA1, and HOXA9, are key regulators of developmental and lineage-specific programs. Their coordinated hypermethylation is a defining feature of the G-CIMP phenotype. Additional canonical markers, including CDKN2A, DCC, and PDGFRA, also showed significantly higher methylation in G-CIMP tumors, though with smaller effect sizes. In contrast, MGMT and EGFR showed no significant methylation differences between subtypes (adjusted p > 0.2). MGMT promoter methylation occurs independently of global CIMP status, whereas EGFR activation in GBM is typically driven by gene amplification rather than promoter hypomethylation. Their lack of differential methylation thus reflects expected biology rather than classifier limitations. Collectively, these results are consistent with prior reports describing promoter hypermethylation of established G-CIMP marker genes in IDH-mutant gliomas and support the use of canonical G-CIMP classification for downstream analyses.
3.3. Genome-Wide Methylation Differences Between Subtypes
To characterize broader methylation alterations distinguishing G-CIMP from non-CIMP GBMs, we performed an unbiased genome-wide differential methylation analysis. After stringent false discovery rate correction (FDR < 0.05), 11 CpG loci, each mapping to a distinct gene, remained statistically significant (
Table 2 and
Table S2).
Although moderate in number relative to the >400,000 CpGs assessed, these loci represent the most reproducible subtype-specific methylation differences in the discovery cohort. Among these sites,
MTCH2,
TUBB2A,
TGFB2, and
DACT3 were significantly hypermethylated in G-CIMP tumors, whereas
WHSC1,
CCDC91,
STOML3,
HIPK3,
NCRNA00095,
IL27RA, and
TM4SF1 were hypomethylated. Promoter-level summaries further confirmed that
TM4SF1 exhibited substantially higher methylation in non-CIMP tumors (Δβ ≈ +0.10), despite modest variability among individual probes within the locus (
Supplementary Table S2). These findings illustrate that differential methylation in GBM involves both promoter and locus-specific contributions.
The genome-wide distribution of differential methylation is illustrated in
Figure 3A. The volcano plot shows a strong skew toward hypermethylated loci in CIMP-high tumors (red points shifted to the right), along with a smaller but distinct cluster of hypomethylated loci in non-CIMP tumors (blue points moved to the left). Although many CpGs showed nominal group differences, only loci with highly consistent methylation shifts across samples achieved genome-wide significance, reflecting both the biological coherence of the CIMP phenotype and the stringency of multiple-testing correction. Unsupervised clustering of the top 50 differentially methylated CpGs (ranked by adjusted
p-value) provided further confirmation of subtype separation. As shown in
Figure 3B, G-CIMP tumors displayed uniformly elevated β-values across these CpGs, whereas non-CIMP tumors were consistently hypomethylated. The clear segregation of samples indicates coordinated methylation differences across these loci rather than isolated gene-level events.
Notably, well-established tumor suppressor genes such as RB1, CDKN2A, and PTEN did not appear among the CpGs that were significantly discriminative. This is expected: these loci are often methylated in both G-CIMP and non-CIMP tumors, diminishing their utility as subtype-defining markers. Their absence reinforces prior observations that GBM harbors widespread promoter methylation independent of CIMP status. Overall, these genome-wide findings demonstrate that although only a few CpGs meet stringent statistical thresholds, the directionality, consistency, and clustering behavior of these loci support a robust epigenetic distinction between tumors stratified by differential methylation patterns in the TCGA-GBM discovery cohort. These signatures form a strong quantitative foundation for downstream integrative analyses of transcriptional regulation, pathway involvement, and clinical implications.
3.4. Methylation–Expression Coupling of Key Genes
To determine whether promoter methylation differences translated into functional transcriptional changes, we examined mRNA expression for the genes identified as significantly differentially methylated in the discovery cohort. Five representative loci,
TUBB2A,
CCDC91,
TGFB2,
WHSC1, and
TM4SF1, were selected to capture both hypermethylated and hypomethylated patterns. In the TCGA-GBM discovery cohort, four genes (
TUBB2A,
CCDC91,
TGFB2, and
WHSC1) displayed expression profiles consistent with canonical epigenetic repression: each showed significantly lower mRNA levels in G-CIMP tumors (
p < 0.01), mirroring their higher promoter methylation. These relationships indicate that promoter hypermethylation is associated with transcriptional downregulation in the hypermethylated subtype. Conversely,
TM4SF1 exhibited higher expression in G-CIMP tumors, reflecting the relative hypomethylation observed at this locus (
Figure 4A–D).
TM4SF1 nonetheless demonstrated an inverse methylation–expression correlation, indicating a functional relationship between methylation and transcription. Still, its regulatory behavior differed from that of other genes, suggesting that additional context-dependent mechanisms beyond promoter methylation alone may govern
TM4SF1.
To further evaluate whether these associations were influenced by extreme values, sensitivity analyses were performed using Kendall’s rank correlation and 1% winsorized data. For
TUBB2A, the inverse association remained stable (initial Spearman ρ = −0.34; trimmed ρ = −0.32; Kendall τ = −0.23). For
TM4SF1, the association was similarly robust (initial ρ = −0.44; trimmed ρ = −0.45; Kendall τ = −0.31). These findings confirm that the reported correlations are not driven by a small subset of extreme observations (
Supplementary Table S3).
We next evaluated whether these relationships persisted in the TCGA-GBMLGG validation cohort.
TUBB2A again demonstrated strong inverse coupling, with G-CIMP tumors showing pronounced promoter hypermethylation accompanied by significantly reduced expression (adj.
p = 3.1 × 10
−25), supporting
TUBB2A as a robust gene whose expression is consistently associated with promoter methylation across cohorts. In contrast, TM4SF1 showed lower expression in G-CIMP tumors in the validation cohort, consistent with the higher promoter methylation observed in this dataset. These cohort-dependent differences indicate that TM4SF1 methylation–expression associations are influenced by the underlying molecular composition, including IDH mutation status and tumor grade, rather than reflecting an actual biological reversal of regulatory direction (
Figure 5A–D).
Other differentially methylated genes, including
WHSC1 and
CCDC91, showed directionally consistent repression in G-CIMP tumors in the validation cohort. However, effect sizes were modest, reflecting the greater biological heterogeneity of the mixed-grade GBMLGG dataset. Finally, cross-cohort analysis revealed that
TUBB2A and
TM4SF1 both exhibited clear inverse methylation–expression relationships, confirming functional coupling at these loci. By contrast,
MECP2, despite being significantly upregulated in G-CIMP tumors, showed neither promoter methylation differences nor methylation–expression correlation, illustrating that not all transcriptional alterations in G-CIMP tumors arise from promoter methylation mechanisms (
Supplementary Table S4).
Overall, these findings indicate that G-CIMP tumors exhibit a coordinated epigenetic–transcriptional program in which promoter hypermethylation contributes to gene silencing at a key subset of loci, most notably
TUBB2A. In contrast, other genes, such as
TM4SF1, exhibit more complex, context-dependent regulatory behavior across glioma subtypes. To assess whether cohort-dependent differences reflected underlying molecular heterogeneity, we examined methylation–expression relationships within homogeneous molecular strata. When analyses were restricted to IDH-mutant tumors,
TM4SF1 promoter methylation retained an inverse association with gene expression, although effect sizes were attenuated (
Supplementary Table S5). In IDH-wildtype tumors, the association was weaker and more variable. Similar patterns were observed when analyses were stratified by tumor grade. These findings indicate that apparent cohort-dependent differences are primarily due to differences in IDH status and grade distribution rather than to an actual biological inversion.
3.5. Differential Expression of DNA Methylation Regulators
To determine whether G-CIMP tumors harbor alterations in epigenetic machinery that help establish or maintain their hypermethylated state, we evaluated the expression of key DNA methylation regulators in the discovery cohort. Several regulators showed significantly higher expression in G-CIMP compared with non-CIMP tumors, including
MECP2,
MBD3,
DNMT3A, and
MBD1 (each adj.
p < 0.01;
Table 3). These genes encode essential components of DNA methylation control:
DNMT3A mediates de novo methylation, while
MECP2,
MBD1, and
MBD3 bind methylated CpG sites and help recruit chromatin-remodeling complexes such as the NuRD complex.
The coordinated upregulation of these regulators suggests that G-CIMP tumors may reinforce their hypermethylated state by increasing their methylation “writing” and “reading” capacity. This pattern indicates that the G-CIMP phenotype is not solely the downstream consequence of IDH mutation but may involve active stabilization of methylation-dependent transcriptional repression. Other regulators, including DNMT1, DNMT3B, MBD2, and TET1–3, did not differ between groups, indicating that only specific components of the DNA methylation apparatus participate in G-CIMP–associated regulatory remodeling.
In the validation cohort, methylation regulator expression patterns were more heterogeneous, consistent with the broader spectrum of glioma grades and IDH statuses represented. Among the regulators elevated in the discovery cohort, only MECP2 remained significantly upregulated in G-CIMP tumors (adj. p = 0.0038). MBD1 showed a nonsignificant trend toward increased expression, while MBD3 and DNMT3A did not display subtype-specific differences. These discrepancies suggest that some regulatory changes observed in GBM may not generalize uniformly across lower-grade IDH-mutant gliomas and may instead reflect grade-dependent or lineage-specific epigenetic requirements.
Notably, MECP2 overexpression occurred despite no differences in promoter methylation, indicating that its regulation is independent of the promoter-level CpG state. This finding suggests that MECP2 may be selectively upregulated in hypermethylated tumors to help maintain transcriptional repression across a broad set of methylated loci. Together, these data show that G-CIMP tumors exhibit a selective, partially conserved epigenetic regulatory profile, characterized by the enhancement of specific methylation-writing and methylation-interpreting genes. This regulatory environment likely contributes to the stability and functional impact of the G-CIMP epigenetic program.
3.6. Prognostic Significance of Epigenetically Regulated Genes
To determine whether epigenetically regulated genes contribute to clinical heterogeneity within the G-CIMP subtype, we assessed the prognostic relevance of expression and promoter methylation for the top differentially methylated candidates identified in earlier analyses.
3.6.1. Discovery Cohort (TCGA-GBM)
In the TCGA-GBM discovery cohort,
TUBB2A expression was significantly associated with overall survival. Patients with high
TUBB2A expression experienced notably shorter survival than those with low expression (log-rank
p = 0.02;
Figure 6B). This finding suggests that incomplete silencing of
TUBB2A may contribute to more aggressive tumor behavior or reduced therapeutic responsiveness.
By contrast,
TM4SF1 expression exhibited only a borderline association with survival (log-rank
p = 0.033;
Figure 6A). Although higher expression tended to correlate with poorer outcomes, the association did not reach statistical significance, and other differentially methylated genes (e.g.,
TGFB2,
CCDC91,
WHSC1) showed no survival differences. These results indicate that not all epigenetically altered genes have prognostic impact within the TCGA-GBM cohort.
3.6.2. Validation Cohort (TCGA-GBMLGG)
We next evaluated whether prognostic associations observed in GBM extended to the broader, more clinically heterogeneous TCGA-GBMLGG cohort. For TUBB2A, the validation dataset confirmed a strong predictive effect. Still, at the level of methylation rather than expression, tumors with high promoter methylation corresponding to transcriptional repression had significantly improved overall survival (log-rank p < 2 × 10−16; HR ≈ 0.15). This is entirely consistent with findings from the discovery cohort, which showed that high TUBB2A activity was associated with adverse outcomes, reinforcing its role as a potential epigenetically regulated driver of tumor aggressiveness.
For TM4SF1, both gene expression and promoter methylation were significantly associated with survival in the validation cohort (p < 0.0001 for each). Tumors with high TM4SF1 expression or low promoter methylation exhibited worse prognosis, whereas tumors with TM4SF1 promoter hypermethylation and corresponding transcriptional suppression showed improved survival. These results suggest that TM4SF1 may function as a context-dependent oncogenic factor, promoting a more aggressive or treatment-resistant disease phenotype.
Together, survival findings across both cohorts highlight TUBB2A and TM4SF1 as reproducible prognostic markers modulated by epigenetic state. TUBB2A appears detrimental when expressed, supporting the beneficial impact of promoter hypermethylation and transcriptional silencing. TM4SF1 demonstrates cohort-specific regulatory behavior yet consistently associates with poorer outcomes when active. These results underscore that, even within the clinically favorable G-CIMP subgroup, epigenetic heterogeneity shapes patient survival, revealing biologically meaningful substructure within the G-CIMP classification.
3.6.3. Multivariable Survival Analysis
To assess whether TUBB2A and TM4SF1 provide prognostic information beyond established clinical and molecular factors, multivariable Cox proportional hazards regression analyses were performed in the TCGA-GBMLGG validation cohort. Models included age at diagnosis, tumor grade, IDH mutation status, 1p/19q co-deletion status, and MGMT promoter methylation status, together with gene-specific promoter methylation levels for TUBB2A and TM4SF1. Promoter methylation variables were entered into the Cox proportional hazards models as continuous β-values (range 0–1), and hazard ratios were interpreted per unit increase in promoter methylation.
As expected, increasing age at diagnosis and higher tumor grade were independently associated with poorer overall survival. After multivariable adjustment, promoter methylation of TUBB2A demonstrated a strong protective effect (HR = 0.017, 95% CI: 0.0003–1.06) with borderline statistical significance (p = 0.053), indicating that its prognostic association is partly independent of established clinical and molecular determinants. In contrast, the association between TM4SF1 promoter methylation and survival was attenuated after adjustment and did not remain independently significant (p = 0.59).
These findings indicate that, while both genes show prognostic associations in univariate analyses,
TUBB2A exhibits a partially independent relationship with outcome, whereas
TM4SF1 appears more context-dependent and influenced by underlying tumor characteristics. As shown in
Table 4.
4. Discussion
Our study shows that IDH-defined G-CIMP gliomas are not a single molecular state but exhibit heterogeneity in regulatory and prognostic features. Using two large TCGA cohorts [
28,
29], we identify
TUBB2A and
TM4SF1 as consistent, methylation-regulated prognostic markers across IDH backgrounds and tumor grades. G-CIMP tumors are associated with increased expression of methylation-associated regulators, such as
MECP2 and
DNMT3A, suggesting a potential role for these factors in maintaining a hypermethylated state. Because G-CIMP status was assigned based on IDH mutation status, this framework reflects canonical G-CIMP biology but does not constitute a genome-wide methylation-based tumor classification.
In addition to canonical IDH-driven G-CIMP stratification, our results indicate substantial heterogeneity within G-CIMP tumors that is captured by gene-specific methylation and transcriptional regulation [
19,
20]. Gene-specific methylation effects vary by context, as seen in differences in TM4SF1 between IDH-wild-type and IDH-mutant gliomas and in
MECP2 expression uncoupled from promoter methylation. Together, these findings indicate that G-CIMP classification obscures biologically meaningful subgroups identified by gene-specific methylation and transcriptional regulation.
The foundational analyses by Noushmehr et al. and Ceccarelli et al. defined G-CIMP as an IDH-driven hypermethylation state linked to better prognosis [
12,
14]. Our study extends this by identifying functional modules within G-CIMP, including variation in methylation-maintenance genes (
MECP2,
DNMT3A) and new methylation-regulated prognostic genes (
TUBB2A,
TM4SF1). Our analyses extend prior work by highlighting heterogeneity within G-CIMP tumors driven by differential regulation of methylation-maintenance machinery and gene-specific methylation events [
14]. Thus, while earlier studies treated G-CIMP as relatively uniform, our data show clear subclass structure.
Earlier classifications based on global methylation levels identified survival differences but provided limited mechanistic resolution [
25,
30]. Our work advances this by identifying specific CpG sites and genes that distinguish subgroups and linking them to gene expression and survival. We also observe that methylation–expression relationships differ across cohorts, particularly for
TM4SF1 and
SCD5, consistent with context-dependent regulation. Gene-specific methylation patterns reveal heterogeneity within G-CIMP tumors that is not captured by broad methylation-based classifications alone. While prior studies have emphasized global methylation, we identify the key regulatory genes that drive variation within G-CIMP.
Previous studies have demonstrated that DNA methylation patterns are strongly influenced by IDH mutation status and tumor differentiation state in diffuse gliomas [
12,
13,
14,
15,
16,
20]. Our findings suggest that context-dependent methylation at specific loci, such as
SCD5 and
TM4SF1, may reflect differences in epigenetic state associated with IDH mutation status and tumor grade. These context-dependent effects may help explain previously reported inconsistencies in prognostic associations across glioma subtypes [
14,
16,
21,
22,
23,
24,
25]. This work improves risk stratification within IDH-defined G-CIMP gliomas by identifying gene-specific methylation events associated with patient outcome [
12,
13,
14,
25,
29,
30,
31,
32,
33].
TUBB2A and
TM4SF1 provide additional prognostic information beyond IDH status alone, although their biological roles remain incompletely elucidated.
TUBB2A encodes a β-tubulin isoform that contributes to α/β-tubulin heterodimer formation and microtubule assembly. Microtubules are essential for mitotic spindle organization, intracellular trafficking, and cytoskeletal remodeling, processes directly linked to cellular proliferation and tumor aggressiveness [
34,
35]. Alterations in tubulin dynamics have been implicated in multiple malignancies and are known to influence tumor cell division and migration [
35,
36]. The association between
TUBB2A promoter methylation, transcriptional repression, and improved survival, therefore, suggests a biologically plausible link between epigenetic regulation and proliferative capacity.
TM4SF1 (Transmembrane 4 L Six Family Member 1) is a tetraspanin-like membrane protein involved in membrane microdomain organization and regulation of cell adhesion and motility.
TM4SF1 has been implicated in tumor invasion, angiogenic signaling, and tumor–microenvironment interactions across several solid tumors [
37,
38,
39]. In glioma, membrane-associated signaling pathways and cytoskeletal remodeling contribute to invasive growth patterns [
32,
39]. The observed relationship between TM4SF1 methylation, expression, and clinical outcome is therefore consistent with a potential role in modulating tumor aggressiveness.
Altered expression of methylation-associated regulators, such as MECP2 and DNMT3A, is consistent with prior reports of epigenetic remodeling in G-CIMP tumors; however, functional studies will be required to elucidate causal mechanisms and therapeutic relevance.
Several limitations warrant consideration. In addition, G-CIMP status was assigned using an IDH-based definition and therefore reflects canonical G-CIMP biology rather than the full spectrum of methylation-defined glioma subtypes identified by genome-wide classifiers [
31,
32]. Second, although the associations were strong, we did not test whether
TUBB2A,
TM4SF1,
SCD5, or
MECP2 directly drives these effects, and experimental validation is required [
33]. Third, the survival findings are based on retrospective TCGA data with heterogeneous treatment regimens, which may influence the observed prognostic associations. Fourth, TCGA primarily represents Western populations, and validation in more diverse cohorts is essential.