Next Article in Journal
The Role of Gene Therapy and RNA-Based Therapeutic Strategies in Diabetes
Previous Article in Journal
PF4 Autoantibody Complexes Cause Activation of Integrins αIIbβ3 and αvβ3 and Possible Subsequent Thrombosis and Autoimmune Diseases
Previous Article in Special Issue
Nidogen-1, a Player in KMT2A-Rearranged Pediatric Acute Myeloid Leukemia
 
 
ijms-logo
Article Menu

Article Menu

Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comprehensive Epigenome-Wide Profiling Reveals Distinctive DNA Methylation Signatures and Potential Prognostic Biomarkers in Mexican Pediatric B-ALL

by
Alan Alberto Fong-López
1,2,3,4,
Juan Carlos Núñez-Enríquez
5,
Vilma Carolina Bekker-Méndez
6,
Janet Flores-Lujano
7,
Minerva Mata-Rocha
8,
Elva Jiménez-Hernández
9,
Mónica Patricia Ortíz-Maganda
6,
Francisco Xavier Guerra-Castillo
6,
Aurora Medina-Sanson
10,
Jorge Alfonso Martín-Trejo
5,
José Gabriel Peñaloza-González
11,
Martha Margarita Velázquez-Aviña
11,
José Refugio Torres-Nava
9,
Rosa Martha Espinosa-Elizondo
12,
María Luisa Pérez-Saldívar
7,
Luz Victoria Flores-Villegas
13,
Laura Elisa Merino-Pasaye
13,
David Aldebaran Duarte-Rodríguez
14,
Omar Alejandro Sepúlveda-Robles
8,
Georgina Jiménez-Morales
1,
Haydeé Rosas-Vargas
8,
Jorge Meléndez-Zajgla
15,
Eva Ramón-Gallegos
3,
Juan Manuel Mejía-Aranguré
15,16,* and
Silvia Jiménez-Morales
1,*
add Show full author list remove Hide full author list
1
Laboratorio de Innovación y Medicina de Precisión, Núcleo “A”, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City 14610, Mexico
2
Programa de Doctorado en Biomedicina y Biotecnología Molecular, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Mexico City 07738, Mexico
3
Laboratorio de Citopatología Ambiental, Departamento de Morfología, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Campus Zacatenco, Mexico City 07738, Mexico
4
Facultad de Ciencias Químico-Biológicas, Universidad Autónoma de Sinaloa, Culiacán Rosales 80030, Mexico
5
División de Investigación en Salud, Hospital de Pediatría “Dr. Silvestre Frenk Freund”, Centro Médico Nacional “Siglo XXI”, Instituto Mexicano del Seguro Social (IMSS), Mexico City 06720, Mexico
6
Unidad de Investigación Biomédica en Inmunología e Infectología, Hospital de Infectología “Dr. Daniel Méndez Hernández”, Centro Médico Nacional “La Raza”, Instituto Mexicano del Seguro Social (IMSS), Mexico City 02990, Mexico
7
Unidad de Investigación Médica en Epidemiología Clínica, Hospital de Pediatría “Dr. Silvestre Frenk Freund”, Centro Médico Nacional “Siglo XXI”, Instituto Mexicano del Seguro Social (IMSS), Mexico City 06720, Mexico
8
Unidad de Investigación Médica en Genética Humana, Hospital de Pediatría “Dr. Silvestre Frenk Freund”, Centro Médico Nacional “Siglo XXI”, Instituto Mexicano del Seguro Social (IMSS), Mexico City 06720, Mexico
9
Servicio de Hematología Pediátrica, Secretaría de Salud de la Ciudad de México (SSCDMX), Hospital Pediátrico de Moctezuma, Mexico City 15530, Mexico
10
Departamento de Oncología, Hospital Infantil de México Federico Gómez, Mexico City 06720, Mexico
11
Servicio de Onco-Pediatría, Hospital Juárez de México, Mexico City 07760, Mexico
12
Servicio de Hematología Pediátrica, Secretaría de Salud (SS), Hospital General de México, Mexico City 06720, Mexico
13
Servicio de Hematología Pediátrica, Centro Médico Nacional “20 de Noviembre”, Instituto de Seguridad Social al Servicio de los Trabajadores del Estado (ISSSTE), Mexico City 03104, Mexico
14
División de Desarrollo de la Investigación en Salud, Coordinación de Investigación en Salud, Centro Médico Nacional “Siglo XXI”, Instituto Mexicano del Seguro Social (IMSS), Mexico City 06720, Mexico
15
Laboratorio de Genómica Funcional del Cáncer, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City 14610, Mexico
16
Facultad de Medicina, Universidad Nacional Autónoma de México, Mexico City 04360, Mexico
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(21), 10261; https://doi.org/10.3390/ijms262110261
Submission received: 5 September 2025 / Revised: 14 October 2025 / Accepted: 17 October 2025 / Published: 22 October 2025

Abstract

Acute lymphoblastic leukemia (ALL) is the most common childhood cancer. In Mexico, its higher incidence and lower survival suggest a role for epigenetic factors like DNA methylation (DNAme). We conducted an epigenome-wide association study (EWAS) to define the methylation landscape and identify the profiles associated with ALL and relapse. Bone marrow or peripheral blood samples from pediatric ALL patients at diagnosis and controls without ALL were analyzed using an Infinium MethylationEPIC v2.0 array. Differential methylation was assessed using the ChAMP package. We identified a significant hypermethylated profile in ALL patients compared to controls. Probes in MAD1L1 and RPTOR contained the most differentially methylated CpG sites. Key affected pathways included proliferation, neurotransmission, and neuronal signaling. Survival analysis revealed that hypomethylation of four specific CpGs—cg01052776 (RNH1), cg20747787, cg05001671, and cg01767116 (FBXL22)—was significantly associated with an increased risk of relapse, highlighting their potential as prognostic biomarkers. This study underscores the importance of epigenetic mechanisms in pediatric ALL.

Graphical Abstract

1. Introduction

Acute lymphoblastic leukemia (ALL) is the most common pediatric malignancy, accounting for approximately 25% of all childhood cancer diagnoses [1,2]. It is characterized by the uncontrolled proliferation of lymphoid blasts and exhibits significant clinical and molecular heterogeneity, which complicates its therapeutic management. Although ALL displays a consistent age pattern across ethnic groups, variable incidences have been reported among them. In Mexico, similarly to other Latin American nations (Mexico: 5.3; Manaus: 5.7; Costa Rica: 5.2 cases per 100,000 children), the incidence of this malignancy is higher than in countries such as United States (4.1 cases per 100,000 children) or other Caucasian populations [3,4,5], and, according to several reports, its incidence is increasing [2,4,6,7,8]. Moreover, the five-year survival rate is considerably lower in Mexico (66%), even worse than other Latin American populations and developed countries, which is currently around 85–90% [5,6,8,9,10,11,12]. Actually, more than 95% five-year survival rates have been reported by groups such as the Nordic Society of Pediatric Hematology and Oncology (NOPHO) and the Dutch Childhood Oncology Group (DCOG) [13,14,15] and countries such as Finland and Qatar [16,17], and even 100% in China for patients with TCF3::PBX1 rearrangement [17].
In addition to disparities in ALL incidence and overall survival, differences in clinical presentation and molecular features have also been reported among various populations. For instance, Mexican patients display higher relapse rates and a lower prevalence of favorable prognostic fusion genes, such as ETV6::RUNX1, compared to Caucasian populations (26.2% vs. 10–15%, respectively) [3,18,19,20,21]. Relapse in ALL cases was found in 26.2% of ALL children from Mexico, while it was found in 10–15% in Caucasian from the United States [10] and 2–4% in Nordic Caucasian populations [15,21]. Regarding good prognosis biomarkers, studies have shown that ETV6::RUNX1 is present in 7–13% of Mexican cases, but around 15–25% in Caucasian, Black, and Asian populations [3,5,13,14,22,23]. Significant differences have also been reported in other molecular features such as TCF3::PBX1, which displays frequencies between 15 and 25% in Asian-, African-, and Caucasian-descendant populations but 7.1–11.5% in Mexicans [3,17,20,22,23,24].
These differences have been partially attributed to the influence of specific genetic ancestry backgrounds and epigenetic factors, such as DNA methylation (DNAme), which is recognized as an early and fundamental event in cell transformation and cancer development [24,25,26]. Genetic causes of ALL have been addressed by multiples studies, discovering racial/ethnic disparities in the frequencies of specific genetic variants linked to ALL and treatment outcome [5,26,27] and uncovering new molecular subtypes and subtype-specific alterations with differential prognostic impacts [13,28,29,30]. Nevertheless, epigenomic studies are still sparse, and most of them have focused on European-descent populations [30,31,32,33,34,35]; ethnic groups with different genetic ancestries and environmental exposures, such as Latin American, remain poorly explored [36,37].
DNAme is a chemical modification process in which a methyl group is added to cytosine residues at CpG dinucleotide sites, regulating gene expression and maintaining genome stability, as well as influencing cell fate. Aberrant DNAme, mainly in CpG islands located within gene regulatory elements (e.g., promoter regions), often leads to the silencing of cancer-controlling genes such as tumor suppressor genes. Conversely, hypomethylation may activate the expression of proto-oncogenes, thereby promoting uncontrolled cell proliferation [38]. As in other types of cancers, oncogenes, tumor suppressor genes, and DNA repair genes are frequently affected by abnormal methylation in ALL [21,39]. In addition, genomic and epigenomic studies have shown that, in contrast to adult cancer, most of the altered genes in ALL are epigenetic regulators, which can conduct alterations in DNAme patterns [5,26,34,40,41]. Genes such as IKZF1, CDKN2B, TET2, CYP1B1, PTPN6, HOPX, SMIM24, and PPP1R10, which play key roles in the regulation of hematopoietic processes, commonly exhibit altered methylation patterns in ALL [5,26,39,41]. Other genes revealed in epigenomic studies to be relevant players in ALL, mainly in Down syndrome ALL, include TRIM13, which was identified in newborn dried blood spots obtained from biobanks [39].
Beyond the growing body of evidence supporting the role of DNAme in the pathogenesis of leukemia [42], methylation profiles are also being investigated for their potential use in molecular classification, outcome and chemotherapy resistance prediction, and as novel therapeutic targets [35,43,44]. For example, a wide epigenomic association study conducted by the NOPHO AALL2008 and DCOG ALL10/11 proposed an epigenetics-based classifier for pediatric T-ALL that distinguishes two risk groups of patients based on a CpG island methylator phenotype [45].
In addition, data from paired diagnosis-relapsed samples showed a lower global promoter methylation at diagnosis in contrast to relapse [42], and therapy with decitabine, a DNA methyltransferase inhibitor, restores chemosensitivity in animal models [44]. To identify aberrantly methylated genes in pediatric B-cell ALL (B-ALL) and to evaluate their association with relapse, we conducted an epigenome-wide association study (EWAS) in a prospective cohort of Mexican children.

2. Results

2.1. Clinical Characteristics of the Analyzed Population

We included 53 pediatric ALL cases, of which 45% (n = 24) were male. The median age at diagnosis was 37 months (range: 4–205 months old). The most common fusion gene was ETV6::RUNX1 (n = 11, 20%), followed by TCF3::PBX1 (n =6, 12%). According to the National Cancer Institute (NCI) criteria, 53% (n= 28) of cases were classified as standard risk, while 47% (n = 25) were classified as high risk. Based on the most recent WHO classification tumors of hematopoietic and lymphoid tissues (WHO-HEM5) and by incorporating molecular data from tested fusion genes, the distribution of molecular genetic subtypes was as follows: 20% of cases were classified as ETV6::RUNX1-rearranged, 12% as TCF3::PBX1-rearranged, 4% as BCR::ABL1-rearranged, and 2% as KMT2A-rearranged.
Non-ALL controls consisted of four individuals aged 0–17 years, 50% female and 50% males, and median age 37 months (range: 4–205 months old).
The initial cohort included 53 patients; however, 7 of them abandoned treatment before reaching 18 months of follow-up (median follow-up: 1 month; range: 0–15 months), and the remaining (n = 46) completed at least 18 months of follow-up after initial treatment. Further analyses were only performed on patients with complete clinical records. Control samples were included only in case–control comparisons. Patients without appropriate follow-up were excluded from relapse and early relapse analyses. Among the remaining group, relapse was observed in 39% (18/46); of these, 61% (n = 11) experienced very early relapse (VER) and 39% (n = 7) experienced early relapse (ER) (Table 1).

2.2. Identification of Differential Methylated CpGs in Acute Lymphoblastic Leukemia

Due to the fact that we obtained PB (n = 16) or BM (n = 37) from our patients, and because PB cells came from the BM, we first compared their methylation profiles to determine whether PB and BM samples could be combined for subsequent analyses. It is worth mentioning that both BM and PB have >75% of lymphoblasts, ensuring that the captured methylation data was predominantly derived from the leukemic cell population, regardless of tissue source. This comparative analysis revealed 99.06% (6711/719,243 probes) similarities in the CpGs between both tissues, and only 3 of them were detected with Δβ |>20|%. Even when p-values were significant, none of these resulted in a differentially methylated site (DMC) (FDR >0.05). Additionally, multidimensional scaling analysis confirmed the absence of systematic clustering by tissue origin (Supplementary Figure S1). Then, a subsequent analysis included all ALL patients.
To further identify a methylation signature specific to ALL, a comparative analysis was performed. A total of 37,541 DMCs (hypomethylation 19,101 vs. hypermethylation: 18,440 CpGs) located in 12862 genes were identified at an FDR < 0.05 (Figure 1a, Supplementary Table S1). As expected, several previously reported genes with abnormal methylation in leukemias such as CPEB1, BLK, FLT3, ZCCHC7, EBF1, HDACs, IKZF1, RB1, CDK6, DOCK5, DPP10, MUC4, JAKs, KIT, KRAS, LINC01013, MAD1L1, NOTCH1, and STATs, among others, were also detected (Supplementary Table S1). The median absolute difference in methylation levels among these DMCs was 0.40 (range: from 0.20 to 0.73) for hypomethylated and −0.35 (range: from −0.69 to −0.20) for hypermethylation.
Unsupervised-clustering analysis detected the 500 DMCs that define an ALL profile (Figure 1b). The DMCs located in SLC2A9, ADGRD1, TK1, SLC25A2, SBF2, PTPN14, CHST1, MBD1, OPRD1, LOC100302652, OR5A2, APP, LOC100302652 showed the highest statistical significance (FDR < 4 × 10−8).
Based on the knowledge that DMCs can affect the expression of relevant genes involved in the pathogenesis of ALL, we explored CpGs located into the TSS200, TSS1500, 1stExon, 5′UTR regions, and body genes. Among all DMCs identified, 11,226 were positioned in promoter regions, of which 3264 were in 5′ untranslated regions (5′UTRs), 1,413 in the first exons, 2657 in the region 200bp before the transcription start site (TSS200), and 3,893 in the region 1500bp before the transcription start site (TSS1500). Around 13,571 DMCs were placed in the gene body. The genomic distribution of CpGs on the CpG island and neighborhood context, as well as the functional context (Supplementary Figure S2a and S2b, respectively), highlighted higher methylation in cases compared to controls, mainly within the first exon, 5′UTR, and the TSS200 and TSS1500 regions (Figure 2a). After stratifying by biotype, significant differences between the ALL and control groups were observed in CpGs within lncRNAs (p = 2.6 × 10−8, d = −0.30) and protein-coding genes (p = 3.6 × 10−8, d = −0.01). In contrast, miRNAs, snRNAs, snoRNAs, and pseudogene biotypes showed no statistically significant differences (Figure 2b). The non-coding genes included 277 long non-coding RNA (CCDC26, PVT1, CASC15, LINC00426, MEG3, etc.), 133 microRNAs (MIR124-2, MIR3134, MIR5095, MIR548F3, MIR548H4, etc.), and 96 pseudogenes (SEC1P, RNU6-79P, FMO6P, HLA-DPB2, PLEKHA8P1, etc.) (Supplementary Table S2).
Sixteen probes of the top twenty DMCs that displayed significant differences between ALL cases and controls were located in the bodies of and SLC2A9, SLC25A2, TK1, SBF2, PTPN14, CHST1, MBD1, SLC25A2, APP, OR5A2, SALL3, SOX30, AOAH, GALNT2, AURKC, and SLC26A9 genes (Table 2). Based on the number of DMCs, MAD1L1 (n = 72), RPTOR (n = 55), PCDHGA4 (n = 45), TBCD (n = 40), PTPRN2 (n = 38), and ZIC4 (n = 32) showed the highest numbers of DMCs per gene, in contrast to controls (Supplementary Table S2). Notably, MAD1L1 were less methylated in patients than the ALL-free group (Supplementary Figure S3). Additionally, we found abnormal methylation patterns of commonly reported genes such as BLK, BCR, and ARID5B, and, notably, MACROD2 and LEPR displayed highly methylated promoter regions in ALL cases (Supplementary Figure S4a–e).

2.2.1. Distribution of Differentially Methylated Regions in the Functional Context

By considering that differentially methylated regions (DMRs) can affect gene expression, particularly that located in promoter regions, a primary comparison between ALL patients as cases and non-ALL subjects as controls was performed. DMRs were defined as contiguous segments containing at least seven DMCs of the genome to ensure robust regional changes [46]. This analysis revealed a total of 1056 DMR between cases and controls (mean width = 812 bp, range = 50 to 2590 pb, p = 0 to 0.02). The majority of them (1026: 97.2%) were hypermethylated (mean Δβ = −2.9 p = 0.001), while only 31 (2.8%) were hypomethylated (mean Δβ = 2.75 and p = 0.002) regions (Supplementary Table S3). Notably, chromosomal distribution analysis revealed that chromosome 6 harbored the highest number of DMRs (n = 48), while no DMRs were identified on chromosome 9.

2.2.2. Pathway Enrichment and Protein–Protein Interaction Network Analysis

Pathway enrichment analysis using Gene Ontology (GO) showed that differentially methylated genes are involved in the biological processes associated with cancer, such as cell cycle regulation, apoptosis, cell proliferation, and cytokine or growth factor signaling.
In addition to this, the ingenuity pathway analysis (IPA) included cancer-associated canonical pathways, cellular stress and injury, cellular immune response, and humoral immunity (Figure 3). Notably, neurotransmission and neuronal signaling emerged as the top canonical pathways, with specific enrichment in GABAergic (z-score = −0.831), serotonergic (z-score = 2.303), and glutamatergic pathways (z-score = 2.168) (65 metabolism-related genes). Additionally, IPA predicts the activation of cancer drug resistance by drug efflux (z-score = +2.353).
Additionally, IPA network analysis highlighted the involvement of inflammatory cytokines (e.g., TNF, IL15) and transcription factors (e.g., SP1) as potential modulators of cytoskeletal organization, cell chemotaxis, and activation of blood cells’ biological processes (Supplementary Figure S5).
To obtain further insight into the interaction among differential methylated genes, protein–protein interaction (PPI) network construction and identification of hub genes were performed using the STRING database. The PPI network consisted of 1287 nodes interacting at 374 edges (evidence, high confidence 0.9). The top four hub nodes with higher degrees were GNAS (n = 25), EGFR (n = 14), SST (n = 11), and ADCYAP1 (n = 10), which were considered the hub proteins (Supplementary Figure S6).

2.3. Methylation Analysis Based on Clinical Variables

Stratification analysis based on clinical variables showed no statistically significant differences after gender, white blood cell (WBC), NCI risk, body mass index (BMI), and outcome (relapse + death) (Supplementary Table S4). Nevertheless, a comparative analysis between ETV6::RUNX1+ vs. no-ETV6::RUNX1−, as well as TCF3::PBX1+ vs. no-TCF3::PBX1, revealed 4616 and 1954 DMCs, respectively (Supplementary Tables S5 and S6).
PER2, STK24, ERI3, FAM19A5, NARFL, CUX1, TBC1D10A, MRPS27, and CBFA3T3, were among the most hypomethylated genes in ETV6::RUNX1, while IL10RA and PTGER4 were significantly hypermethylated (Supplementary Figure S7). Based on β values, the top 100 DMCs clearly distinguish ETV6::RUNX1-positive from -negative cases (Supplementary Figure S8). The GO top enriched terms identified were modulation of chemical synaptic transmission, regulation of trans-synaptic signaling, cell fate commitment, and cell junction assembly, among others (Supplementary Figure S9).
Regarding TCF3::PBX1 molecular subtype, 100 DMCs were used to build the heatmap (Supplementary Figure S10). DMCs characterizing the TCF3::PBX1-positive group were located in the hypomethylated genes DTX4, SLCO4C1, TMEM131, STM1, LINC00939, DHTKD1, and UBAC2, and the hypermethylated genes XXYLT1 and PDE3B (Supplementary Figure S11). Enrichment analysis revealed that genes with altered methylation patterns in TCF3::PBX1 play a role in the regulation of neuron projection development, regulation of GTPase activity, and small GTPase-mediated signal transduction, among other processes (Supplementary Figure S12).

2.4. Differentially Methylated CpGs Associated with Relapse

To identify relapse-associated methylation patterns or CpGs at diagnosis, only cases who were followed-up at least 18 months after the initial treatment and had complete medical records were included, accounting for 28 non-relapse patients and 18 relapsed cases, who were grouped into control and case groups, respectively. Comparative analysis showed 360 CpGs (p < 0.05); nevertheless, none of them remain statistically significant after a multiple-correction test (FDR > 0.05). Because the CpGs cg01052776 (p = 0.3), cg20747787 (p = 0.3), cg05001671 (p = 0.3), and cg01767116 (p = 0.5) showed a trend towards differential methylation between both groups, an association analysis between their methylation levels and relapse-free survival (RFS) using univariate Cox proportional hazards regression was conducted. This approach let us use methylation as a continuous variable, and the relationship between CpGs methylation with the time-to-event outcome was directly tested. Based on this analysis, an association between all these CpGs and the risk of relapse was observed: Hazard ratio (HR) = 0.53 (95% CI: 0.37–0.75, p =0.0004), 0.46 (95% CI: 0.31–0.67, p = 0.00005), 0.50 (95% CI: 0.34–0.72, p = 0.0001), and 0.53 (95% CI: 0.36–0.77, p = 0.0008) for cg01052776, cg20747787, cg05001671, and cg01767116, respectively. Further, we examined whether CpG sites remained associated with relapse risk after adjustment for clinical variables (age, sex, white blood cell count, and NCI risk group). The adjusted hazard ratios (HRs) ranged from 0.37 to 0.45 (p < 0.01), indicating that each standard deviation increase in methylation was associated with a 55–63% reduction in relapse risk, independent of clinical factors (Supplementary Table S7). While these analyses share the statistical power limitations of our cohort size, the consistent and robust associations strengthen the evidence for these CpGs as candidates for future validation. Additionally, we dichotomized the methylation levels of the four CpGs, using the median value of each one to classify patients into hypomethylated and hypermethylated groups. Hypomethylation of these CpG sites was associated with worse prognosis (Figure 4a–d).

2.4.1. Methylated CpGs Profile in Early Relapse

The analysis between ER and non-relapsed patients led us to identify 2629 DMCs (38 hypermethylated and 2591 hypomethylated) at FDR < 0.05. The top hypermethylated genes included ASB18, SLC38A11 FAM19A2, TAL1, LOC389458, FAM135B, BCL11B, NR4A2, SLC15A1, and MOSC2, while the top hypomethylated genes were NMNAT3, MORN5, LOC284837, SEPT9, MEIS1, WNT5B, SLC19A3, AK7, and TMC3 (Figure 5a, Supplementary Table S8). The genes with more DMCs were CORO1B (n = 7), DOCK1 (n = 7), ARID5B (n = 7), MAD1L1 (n = 6), and SEPT9 (n = 5). In addition to the genes involved in methylation processes, UHRF1 and DNMT3A were found to be differentially methylated genes between the relapse and non-relapse groups. Both genes showed two DMCs located into the gene body and 5′UTR hypomethylated in the early-relapse group.
Pathway enrichment analysis showed the RhoGDI-signaling as the main abnormally methylated pathway (Supplementary Figure S13). Among the most significant canonical pathways were hepatic stellate cell activation, epithelial adherens junction signaling, Rho GTPase cycle, Sertoli cell–Sertoli cell junction signaling, and regulation of actin based motility by Rho. The most enriched canonical pathways were associated with organismal growth and development, intracellular and second messenger signaling, extracellular matrix organization, signal transduction, nuclear receptor signaling, and cancer. Notably, pathways related to neurotransmitter signaling and nervous system function ranked among the top hits (Figure 5b).

2.4.2. Random Forest Model on Risk of Relapse

The 20 most significant DMCs were combined with clinically relevant variables and used to construct a Random Forest model in order to predict relapse. The highest contribution observed included four CpGs (cg01052776, cg20747787, cg05001671, and cg01767116), which displayed MeanDecreaseAccuracy values from 12 to 14. The model achieved a training accuracy of 97.5% (95% CI: 94.1–99.9) and an out-of-bag (OOB) error rate of 5% (95% CI: 2.1–9.8). Given the limited sample size (18/46 relapse events), we relied solely on OOB error estimation for internal validation, as holdout validation would have further reduced the already limited training data. The model achieved an OOB accuracy of 78.3%, with a sensitivity of 90.0% for detecting relapse cases and a specificity of 75% for non-relapse cases. The confusion matrix showed that nine out of ten relapse cases were correctly identified, but nine out of thirty-six non-relapse cases were misclassified as relapse (false positives). Variable importance analysis confirmed that the four CpG sites (cg01052776, cg20747787, cg05001671, cg01767116) were among the top predictors. A near-perfect separation of outcome groups and reduced performance in the holdout validation set (75% accuracy) were signs of overfitting; therefore, the model requires further validation in larger cohorts for biomarker discovery and relapse prediction.

3. Discussion

Hematological malignancies, including ALL, are multifactorial entities in which epigenetic factors such as DNAme play a significant role [24,47,48]. Despite the enormous advances in the success of ALL treatments, this disease remains the leading cause of cancer-related death in children and adolescents, with the Mexican pediatric population being among the most affected worldwide [1,3,4,6,18]. In fact, Mexico reports one of the highest incidence and mortality rates of ALL [3,6,18]. A more developed knowledge of the genetic and epigenetic processes undergoing leukemogenesis and therapy responsiveness is a cornerstone of the new treatment identification of this disease. To obtain a better understanding of ALL Mexican patients, we performed a case–control study involving pediatric cases under 17 years old. Based on the fact that PB samples could provide similar information to BM in leukemia [49] and also based on the low number of DMCs (<0.01%) detected between both tissues, we included samples from PB or BM for further analysis. Our study identifies more hypermethylated than hypomethylated DMRs in children with ALL, in contrast to controls (Supplementary Table S3). Genome-wide abnormalities in DNAme are a hallmark of cancer [24,47,50], and increased DNAme in normally unmethylated genes has been already reported in ALL [50,51,52,53,54,55]. It has been described that global hypomethylation, observed in most types of cancer, could be associated with the loss of methylation during each cell division as a defense mechanism of the cell in the course of tumor progression [51,56]. The high hypermethylation rate in ALL could be explained by the rapid evolution progress of this disease, which occurs within weeks, and as a result of a failure of leukemic cells to de-activate tumor-suppressor mechanisms [51,57,58].
Genes that have been reported as abnormally methylated in leukemia, including CPEB1, BLK, FLT3, PAX5, EBF1, HDACs, IKZF1, RB, etc., were also detected in this study [41,48,50,59,60,61,62,63]. These genes are relevant for hematopoietic cell commitment, and abnormal expression of them have been associated with ALL outcome [64,65,66].
As far as we known, few EWAS using the MethylationEPIC Beadchip genome-wide DNAme arrays (850K) in ALL have been conducted [39], and several of them have used the 4500K beadchip. In addition to the consistency of well-known abnormally methylated genes in leukemia cells, these studies uncovered new likely essential genes for leukemogenesis. As an example, VTRNA2-1 was found to be hypermethylated at birth and at diagnosis in ALL cases, and, interestingly, normal levels were found at remission, but increased levels were found at relapse [34].
Another EWAS performed in drops of blood collected at birth in children with Down Syndrome (DS) showed that HOPX, SMIM24, and PPP1R10 were abnormally methylated in children with DS who developed ALL [39].
In the present study, we identify MAD1L1, RPTOR, PCDHGA4, TBCD, and PTPRN2 as the genes with the highest count of DMCs between cases and controls, while PTPN14 and SLC2A1 were identified as having the most significant DMCs.
The encoded protein PTPRN2 is a tumor-suppressor gene involved in malignant cell transformation [67,68,69]. To our knowledge, there is no available data on the abnormal methylation of PTPN2 in ALL, nor on other hematological diseases, even though there is no doubt as to its role in T-ALL [70]. Moreover, PTPRN2 plays a role in vesicle-mediated secretory processes, and has been associated with progression [71] and as a methylation biomarker of poor survival in colorectal cancer [72]. Meanwhile, RPTOR, a crucial component of the mTOR pathway, regulates cell growth in response to nutrient and insulin levels and has been reported as a key driver gene in the brain metastasis of lung and colorectal cancer progression [72,73,74].
Another gene was the mitotic-arrest-deficient 1-like 1 protein gene, a key component of the spindle assembly checkpoint that is essential for correct chromosomal segregation during mitosis. MAD1L1 was strongly associated with cancer progression and the development of colorectal cancer, and its aberrant expression has been reported in solid tumors [75]. In murine models, overexpression of MAD1L1 decreases p53 levels and promotes inflammation-driven colon tumorigenesis [76], while altered expression of this gene has been associated with poor prognosis and recurrence in hepatocellular carcinoma [77]. However, methylation data in ALL or any hematological disease have been scarcely explored. Notably, abnormal methylation at MAD1L1 (loci cg08972190) has been reported in children surviving ALL, particularly in association with obesity and exposure to cranial radiotherapy [78]. In addition, DMRs of MAD1L1 have also been published in adults with ALL [79]. In addition to these findings, in our data, MAD1L1 emerges as a relevant candidate gene in leukemogenesis. Based on our results and data from the analysis of PTPN14 in diverse types of cancer, we suggest that epigenetic mechanisms could modulate gene expression [43,48]; we also propose that aberrant methylation of this gene could contribute to leukemogenesis.
Similarly to most cancer types, aberrant methylation patterns in the promoter regions were observed in ALL (Figure 2a) [80]. Among the genes that were most differentially hypomethylated in their promoter region are PPP1R18, GNG7, ABR, PDE9A, and NT5C, whose deregulation could favor the uncontrolled proliferation of blasts [81,82]. Notably, abnormal expression of NT5C, which is related to pyrimidine metabolism, has been associated with resistance to chemotherapies (e.g., cytarabine) in pediatric leukemia [83].
Otherwise, MACROD2 and LEPR were two genes displaying highly methylated promoter regions in ALL cases. MACROD2 is an epigenetic regulator that encodes an enzyme that removes ADP-ribose groups from mono-ADP-ribosylated proteins and deacetylates metabolites derived from histone deacetylation, such as O-acetyl-ADP ribose, influencing chromatin regulation and gene expression [84]. Meanwhile, the LEPR gene is associated with the JAK-STAT pathway. Interestingly, human xenograft models revealed that LEPR expression is essential for the development and maintenance of ALL and different types of cancer [85].
The pathway enrichment analysis confirmed the expected involvement of cancer processes such as cell cycle regulation, apoptosis, cell proliferation, and cytokine- and growth-factor-mediated signaling were the most affected in ALL [80]. Interestingly, we also detected enrichment for pathways related to neurotransmission and neuronal signaling. The biological significance of neuronal signaling in ALL is currently unknown. It is hypothesized that cancer cells, including leukemia, hijack neural pathways and can exploit neuronal networks to invade the central nervous system (CNS) [86]. Previous studies have reported methylation changes in neuropeptides in ALL [87] and in a significant proportion of patients presenting with CNS infiltration [87,88]. Additionally, the evidence of overexpression of the GABAergic pathway [89] and the usage of the serotonin and glutamatergic pathway for proliferation and survival in cell lines [90,91] support the hypothesis that the neurotransmission and neuronal signaling pathways are involved in ALL. It is noteworthy that many genes within these pathways (e.g., AKT2, AKT3, CREBBP, MAPK1, GNAS, RPTOR) are already known to play relevant roles in cancer cell growth and proliferation [74,92,93,94,95], which could provide a plausible biological basis for such a link. Notwithstanding, functional studies are required to decipher any potential causal relationship.
In addition, we investigated whether methylation patterns might be associated with ETV6::RUNX1 and TCF3::PBX1 molecular subtypes or relapse risk. Notably, 3 out of the 42 CpGs previously proposed by Nordlund to define ETV6::RUNX1 and 16 out of the 21 CpGs proposed for TCF3::PBX1 were also found in our analysis, highlighting the existence of specific patterns of DNAme based on fusion genes, potentially affecting prognosis [40].
The analysis of outcomes revealed four CpGs associated with RFS: two of them were found in the coding genes RNH1 and FBXL22. Although associations between DNAme and relapse have been proposed in pediatric ALL, few studies have directly assessed DMCs between patients with and without relapse. One of the most relevant contributions [40] employed a shrunken centroid classification followed by RFS analysis, identifying gene regions with methylation levels associated with RFS. RNH1 encodes a ribonuclease inhibitor that has been reported as being aberrantly expressed in bladder cancer [96]. Interestingly, this has also been suggested as a therapeutic target due to its role in mTOR signaling [97]. FBXL22 encodes an F-box protein that interacts with S-phase kinase-associated protein 1A and cullin to form SCF (Skp1-Cullin-F-box) complexes functioning as ubiquitin ligases. This gene has been linked to breast cancer [98].
The consistent performance of the four candidate CpG sites in multivariate analyses represents a particularly noteworthy finding. Despite the limited sample size for multivariable modeling, all four CpGs maintained significant associations with RFS after adjustment for established clinical risk factors including age, white blood cell count, and NCI risk category. To the best of our knowledge, the present study is among the first describing the DNAme landscape of relapsed cases at diagnosis. Nordlund [99] included 764 ALL cases to explore the association between DNAme with RFS based on the most common molecular subtypes; six, eight, and one regions at FDR < 0.1 were found in ETV6::RUNX1, KMT2A-r and BCR::ABL1, respectively [99]. Other studies have compared paired diagnosis-first relapsed samples, finding significant differences between them and suggesting that DNAme can propel leukemia evolution [100,101]. Otherwise, we were unable to develop a multivariate approach for relapse prediction, as Mosquera-Orgeira did (relapse risk prediction index composed by16 CpGs, 20-month AUC with 81.4% and 82.5%) [102]. In our Random Forest analysis, the near-perfect separation observed in the training set and the drop in performance in the test set clearly indicate model overfitting [103,104]. This is likely attributable to (i) the limited sample size, (ii) the high dimensionality of the methylation dataset, and (iii) class imbalance [103,104]. Nonetheless, the consistent identification of four relapse-associated CpGs within this Cox univariated and multivariate model warrants further evaluation on their prognostic relevance. These findings require validation in larger independent cohorts before any predictive or clinical application can be considered.
After the analysis stratification by relapse type, DNMT3A and UHRF1, which are a DNAme writer and modulator, respectively, were identified in ER. UHRF1 encodes a member of a subfamily of RING-finger type E3 ubiquitin ligases that acts as a key epigenetic regulator by bridging DNAme and chromatin modification. In the present study, we observed significant hypermethylation at 14 CpGs within the gene body of UHRF1 in ALL patients with ER compared to non-relapsing patients. This finding suggests a potential disruption in the epigenetic landscape during ER, positioning UHRF1 as a candidate biomarker for risk to ER [105]. To note, UHRF1 has been identified as a hub gene in chronic myeloid leukemia (upregulated in murine models) [106], and has been reported as a promoter of oncogenesis in T-cell ALL. Further, UHRF1 drives proliferation in T-cell ALL, and has been proposed as a therapeutic target for thymoquinone, a natural anticancer agent [107]. While UHRF1 methylation changes could lead to increased gene expression, altered mRNA splicing, or aberrant protein isoforms, functional studies are needed for unveiling the mechanism underlying ER in ALL [107].
In summary, our study provides a comprehensive epigenomic overview of pediatric ALL in the Mexican population, revealing aberrant methylation patterns associated with leukemogenesis, molecular subtypes, and ER risk. The identification of potential biomarkers with relapse and novel candidate genes, including UHRF1 and MAD1L1’s involvement in ER, underscores the contribution of epigenetic dysregulation to disease progression and relapse dynamics. Although relapse prediction using DMCs alone proved to be limited in our cohort, stratified analyzes suggest that subtype-specific methylation signatures may hold prognostic value. We are aware that the small sample size included in the present study is a critical limitation, which could be amplified due to tissue heterogeneity or after stratification by molecular subtype analyses (e.g., relapse, fusion subtypes). Therefore, to confirm our results and the clinical utility of the proposed potential epigenetic biomarkers, validation in larger and independent cohorts, as well as validation matched by variables (e.g., type tissue, age, gender) that could potentially bias the results, is needed. Additionally, these findings highlight the need for integrative approaches combining epigenetic, genetic, and clinical data to improve risk stratification and therapeutic targeting in childhood ALL.

4. Materials and Methods

4.1. Patient and Sample Collection

Newly diagnosed patients with B-ALL and individuals without ALL (used as controls), aged 0–17 years, and without Down syndrome were included in this study. Bone marrow (BM) or PB samples were collected at the time of diagnosis and prior to treatment initiation. Samples under 75% of lymphoblast count in BM or PB were excluded. Controls consisted of individuals aged 0–17 years who underwent BM aspiration as part of a differential diagnostic process due to clinical suspicions of acute leukemia, but were confirmed not to have ALL. No specific age or sex matching was performed.
Public hospitals affiliated with the Mexican Inter-Institutional Group for the Identification of the Causes of Childhood Leukemia (MIGICCL) contributed patients to this study. The diagnosis of B-ALL was validated by pediatric hematologists/oncologists based on clinical presentation, and was confirmed using BM aspirate analysis, which included morphological assessment, immunophenotyping, and, when available molecular studies, in accordance with the 2022 World Health Organization (WHO) classification of lymphoid neoplasms. Cohort follow-up began on the date of B-ALL diagnosis confirmation (Day 0) and continued until the date of the last hospital visit or death.
Clinical and demographic data were obtained from the patients’ medical records. The variables analyzed included the following: patient’s sex, age at diagnosis, National Cancer Institute (NCI) risk classification (standard risk, high risk), CNS and/or testicular infiltration at diagnosis, gene rearrangements (when available), death during induction (yes/no), early mortality (yes/no), and causes of death. Gene rearrangement data were limited to four fusion genes (ETV6::RUNX1, TCF3::PBX1, BCR::ABL1, and KMT2A::AFF1), which are the only cytogenetic and molecular biomarkers tested in Mexican public hospitals.
NCI risk classification: The risk was assigned based on age and peripheral white blood cell (NCI) count at diagnosis. Patients aged between 1.00 and 9.99 years with a WBC count < 50 × 109/L were classified as standard risk. In contrast, those aged ≤1 year or ≥10 years and/or with a WBC count ≥ 50 × 109/L were classified as high risk, in accordance with established criteria [108].
Relapse was defined as disease recurrence in patients who had previously achieved morphologic remission, characterized by <5% blasts in BM and clearance of extramedullary disease [9]. Bone marrow relapse was defined as ≥25% morphologic blasts or ≥5% blasts accompanied by extramedullary involvement. Central nervous system relapse was identified based on CNS3 status (≥5 white blood cells/μL in cerebrospinal fluid with blasts on cytospin) or the presence of clinical signs of CNS leukemia. Isolated extramedullary recurrence required histological confirmation by biopsy. Based on the time to relapse following treatment initiation, patients were categorized into two groups: very early relapse (VER: <18 months) and early relapse (ER: 18–36 months) [9,64].

4.2. Nucleic Acid Extraction and Fusion Gene Analysis

Nucleic acids were extracted from PB or BM samples. DNA extraction was performed using the Puregene Blood Core Kit (QIAGEN, Hilden, Germany), according to the manufacturer’s instructions. Nucleic acid integrity was assessed by 1% agarose gel electrophoresis. The four most common fusion genes (ETV6::RUNX1, TCF3::PBX1, BCR::ABL1, and KMT2A::AFF1) were detected using methods previously described [21].

4.3. Genome-Wide DNA Methylation Analysis

Genome-wide DNAme profiling was performed using the Infinium MethylationEPIC v2.0 BeadChip array (Illumina, San Diego, CA, USA), which interrogates over 850,000 CpG sites across gene promoters, gene-coding and -enhancer regions, miRNA-promoter regions, and intergenic CpGs. A total of 500 ng of genomic DNA per sample was subjected to bisulfite conversion using the EZ DNA Methylation Kit (Zymo Research, Irvine, CA, USA) according to the manufacturer’s instructions. The bisulfite-converted DNA was then hybridized to the EPIC arrays. Array scanning and image acquisition were carried out using the Illumina HiScan SQ system (Illumina Inc., San Diego, CA, USA), which generated raw intensity data in the form of IDAT files. Initial quality control of the IDAT files was conducted using BeadArray Controls Reporter (Illumina, San Diego, CA, USA). Data preprocessing and normalization were performed in R (v4.3.0) using the ChAMP pipeline (Chip Analysis Methylation Pipeline, v2.23.0) for EPIC arrays, available from Bioconductor v3.18 (https://bioconductor.org/packages/release/bioc/html/ChAMP.html accessed on 19 October, 2025) [46,109,110]. Probes with detection p-values ≥ 0.01 in more than 5% of samples, indicating poor signal reliability, were discarded. Additionally, we removed non-CpG probes, probes containing single-nucleotide polymorphisms (SNPs) at the CpG site or within the single-base extension, multi-hit probes aligning to multiple genomic locations, and all probes located on sex chromosomes (X and Y) to avoid confounding by sex differences. Samples with more than 10% of probes failing the detection p-value threshold (<0.01) were excluded. Intra-array normalization was performed using the Beta MIxture Quantile dilation (BMIQ) method to correct for the well-documented type I/II probe design bias [46]. To account for potential technical variation, we applied the ComBat algorithm from the sva R package, using the pd$Sample_Plate variable as the known batch covariate. Model adjustment for known biological covariates (e.g., age, sex) was not performed, as our primary aim was to identify disease-associated signals. Methylation levels were quantified as β values, calculated as the ratio of the methylated signal intensity to the sum of methylated and unmethylated signal intensities for each CpG site. These β values ranged from 0 (completely unmethylated) to 1 (fully methylated), reflecting the proportion of DNA methylation at individual CpG loci [46]. Preprocessing steps included filtering out probes with detection p-values < 0.01, non-CpG probes, probes overlapping known SNPs, multi-hit probes, and probes located on sex chromosomes (X and Y). Normalization of β values was performed using the Beta MIxture Quantile dilation (BMIQ) method [46], and batch effects were corrected using the ComBat algorithm [46]. The GRCh37/hg19 human genome assembly was used as the reference for probe mapping and gene annotation.

4.4. Assessment of Differentially Methylated Regions

Since BM or PB samples were available, we first evaluated global DNAme differences between these two tissue types to identify potential biases and ensure that the analyzed DNAme signal was representative of the leukemic clone, minimizing confounding signals from the normal cellular background and potential differences between tissue microenvironments. A multidimensional scaling plot based on the 1000 most variable CpG sites was generated using the champ.QC function from the ChAMP package.
To identify DMCs, the champ.DMP function was applied to compare B-ALL cases vs. controls, as well as to perform stratified analyzes based on clinical features. P-values were adjusted for multiple testing using the Benjamini–Hochberg method. CpG probes with an absolute Δβ (|Δβ|) ≥0.2 and an adjusted p-value <0.05 were considered statistically significant. Heatmap was constructed using ComplexHeatmap (V 2.16.0), and Volcano plots were constructed using ggplot2 (V 3.5.2) in R.
DMRs were identified using the champ.DMR function within the ChAMP pipeline, which applies the bumphunter algorithm to detect the genomic regions with consistent methylation differences between groups. DMRs were defined as contiguous genomic segments meeting the following criteria: each DMR contained at least 7 DMCs, ensuring robust regional methylation changes. Adjacent DMCs were grouped into the same DMR if they were located within ≤50 bp of each other, based on the assumption that regulatory elements typically span short genomic intervals. DMRs were identified after 250 bootstrap interactions to estimate the null distribution of methylation differences [46,111].

4.5. Epigenome-Wide Association Study (EWAS)

Global DNAme differences were assessed between cases and controls, as well as by stratifying patients according to gender, hyperleukocytosis (WBC >100,000 vs. <100,000), NCI risk classification, BMI z-score category (normal weight, underweight, obesity), gene fusion (ETV6::RUNX1 and TFC3::PBX1), and clinical outcome (relapse or death).
DNA methylation differences between relapse and non-relapse ALL patients was also assessed. Adjusted p-value threshold of 0.05 was considered statistically significant. Selected CpG sites were further analyzed using a univariate Cox proportional hazards regression mode to evaluate RFS. This analysis incorporated both time to relapse and relapse status (coded as follows: 1 = event occurred; 0 = censored at last follow-up). p-values were adjusted for multiple comparisons using the Benjamini–Hochberg false discovery rate (FDR) method. Further, these CpG sites were analyzed using Kaplan–Meier survival analysis and the Mantel–Cox test. Methylation β values were dichotomized based on the overall mean to classify samples as hypomethylated or hypermethylated.
To assess whether the association of the four candidate CpG sites with relapse was independent of the established clinical risk factors, we performed exploratory multivariate Cox proportional hazards analyses. Each CpG site was evaluated in a separate model adjusted for age at diagnosis (continuous variable), sex, white blood cell count (categorized as <50,000/µL vs. ≥50,000/µL and <100,000/µL vs. ≥100,000/µL), and NCI risk category (standard risk and high risk). Methylation levels for each CpG were included as continuous variables scaled to standard-deviation units to facilitate the interpretation of HR.

4.6. Relapse Prediction Model in ALL Patients, Grounded in a Random Forest-Based Machine Learning Approach

To evaluate the predictive potential of the top 20 DMCs, a Random Forest classifier was implemented following these steps: (1) Data partitioning: The dataset was split into a training set (75%) and a holdout test set (25%), with stratification by relapse status to preserve class distribution. (2) Model training: The Random Forest model was trained using 1,000 decision trees. The number of variables randomly sampled at each split (mtry) was optimized through 5-fold cross-validation. Variable importance was assessed using the Mean Decrease in Accuracy and Mean Decrease in Gini metrics. (3) Performance evaluation: Model performance was evaluated on the independent test set using area under the ROC curve (AUC-ROC), sensitivity, and specificity. To ensure the validity of the Cox proportional hazards analysis, proportionality assumptions were verified using Schoenfeld residuals. Additionally, potential model overfitting in the Random Forest classifier was mitigated using out-of-bag (OOB) error estimation and holdout test validation [112].

4.7. Pathway Enrichment and Protein–Protein Interaction Analyses

Pathway enrichment analysis was performed using the GO database (http://www.geneontology.org accessed on 25 July 2025) and IPA software (QIAGEN inc., https://digitalinsights.qiagen.com/IPA accessed on 25 July 2025) to identify the most significantly altered biological pathways in ALL [113,114]. GO provides comprehensive gene annotations, including molecular functions, biological processes, and cellular components, and IPA calculates enrichment scores (p-values) based on the probability of gene set overrepresentation. For each probe, the associated gene symbol, Δβ-value, and adjusted p-value were used as input. Canonical pathways were identified using enrichment p-values calculated by Fisher’s exact test. Genes were considered biologically relevant if their associated CpG sites showed a methylation change (Δβ) of ≥±0.20 and were located in functionally relevant genomic regions (e.g., promoter regions, CpG islands).
To evaluate protein–protein interaction (PPI) networks among genes harboring DMCs in ALL versus controls, we used the STRING database (https://string-db.org, accessed on 29 June 2025). Hub genes were identified, and high-confidence interaction networks were constructed using a confidence score threshold of 0.900 [115].

4.8. Statistical Analysis

All statistical analyzes were performed using R software version 4.3.0. Qualitative variables were expressed as counts and percentages. Comparisons between groups were made using Student’s t-test for continuous variables and Chi-square or Fisher’s exact test for categorical variables, as appropriate. Spearman’s rank correlation coefficient was used to assess associations between continuous or ordinal variables. To estimate effect size, Cohen’s d was calculated, which represents the standardized difference between the two group means, divided by their pooled standard deviation, and indicates the magnitude of the observed difference [116,117]. To evaluate the prognostic impact of DMCs on relapse, a univariate Cox regression analysis was conducted. For this analysis, based on the median methylation values, samples were grouped into hypomethylated or hypermethylated categories. Survival differences between these both groups were evaluated using the Mantel–Cox log rank test. Two-sided p-values < 0.05 were considered statistically significant.

5. Conclusions

This study is one of the few case–control analyses providing a comprehensive examination of DNAme patterns in pediatric patients with ALL, revealing a distinctive epigenetic signature associated with ALL. It is characterized by methylation abnormalities in genes involved in the cell cycle and cell signaling, as well as unexpected pathways such as neurotransmission. The genes involved in the latter pathway could hypothetically influence leukemic cell trafficking into the CNS and could be associated with chemotherapy drugs evasion. Even the limited statistical power of this study, due to its small sample size and chemotherapy drugs evasion, the identification of DNAme profiles associated with ETV6::RUNX1 and TCF3::PBX1, two of the most common molecular subtypes of ALL, provide valuable information that could contribute to the classification of molecular subtypes in pediatric ALL. We also identified four specific CpG sites (cg01052776, cg20747787, cg05001671, and cg01767116) whose methylation levels were significantly associated with RFS, suggesting their potential use as prognostic biomarkers; however, further validation in independent cohorts is needed. Finally, our findings provide knowledge about the relevance of epigenetic mechanisms in pediatric ALL and open new opportunities for the identification of prognostic biomarkers as well as for the development of targeted therapeutic strategies.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms262110261/s1.

Author Contributions

Conceptualization, A.A.F.-L. and S.J.-M.; Methodology, A.A.F.-L., S.J.-M. and J.M.M.-A.; Software, A.A.F.-L.; Validation, A.A.F.-L. and G.J.-M.; Formal Analysis, A.A.F.-L.; Investigation, A.A.F.-L., J.C.N.-E., V.C.B.-M., J.F.-L., M.M.-R., E.J.-H., M.P.O.-M., F.X.G.-C., A.M.-S., J.A.M.-T., J.G.P.-G., M.M.V.-A., J.R.T.-N., R.M.E.-E., M.L.P.-S., L.V.F.-V., L.E.M.-P., D.A.D.-R., O.A.S.-R., G.J.-M., H.R.-V., J.M.-Z. and E.R.-G.; Resources, S.J.-M. and J.M.M.-A.; Data Curation, A.A.F.-L. and J.C.N.-E.; Writing—Original Draft Preparation, A.A.F.-L. and S.J.-M.; Writing—Review and Editing, All authors; Visualization, A.A.F.-L.; Supervision, S.J.-M. and J.M.M.-A.; Project Administration, S.J.-M.; Funding Acquisition, S.J.-M. and J.M.M.-A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Instituto Nacional de Medicina Genómica (INMEGEN), 01/2018I; National Institutes of Health 3R01CA262263-01A1S1 sub-award number SCON-00004408 and 2U24ES028524 sub-award number 00011320; Consejo Nacional de Humanidades, Ciencias y Tecnologías (CONAHCYT, CF-2023-G 1399 and Dirección General de Políticas de Investigación en Salud (DGPIS), Financiamiento de Proyectos de Investigación para la Salud (FPIS 2024) FPIS2024-INMEGEN-7593. A.A.F.-L was supported by a doctoral fellowship from Consejo Nacional de Humanidades, Ciencias y Tecnologías (CONAHCYT) [CVU 860574] and by the “Programa de Doctores Jóvenes” of the Universidad Autónoma de Sinaloa (UAS). The funding body did not have a role in the design of the study, collection, analysis, and interpretation of the data or manuscript preparation.

Institutional Review Board Statement

This study was conducted in accordance with the ethical standards of the 1964 Helsinki Declaration and its later amendments. It was approved by the following Institutional Review Boards: the Mexican Social Security Institute (IMSS) (Ref. 09-B5-6I-2800), approved on 24 September 2013; the Bioethics and Research Committees of the Secretary of Health (Comités de Investigación Ética y Bioseguridad de la Secretaría de Salud) (HIM-2014-089), approved on 27 October 2014; the National Institute of Genomic Medicine (INMEGEN) in 2017 (03/2017/I, approved on 3 April 2017) and 2018 (01/2018/I, approved on 8 March 2018).

Informed Consent Statement

Additional written informed consent was obtained from all participants’ legal guardians for publication of anonymized clinical and molecular data. All data have been de-identified to protect patient privacy.

Data Availability Statement

The raw sequencing data from this study have been submitted to the NCBI Gene Expression Omnibus (GEO) database and are currently under accession number GSE306095. The dataset will be publicly accessible upon publication of this manuscript.

Acknowledgments

The Consejo Nacional de Humanidades, Ciencias y Tecnologías (CONAHCYT, formerly CONACYT) for the Ph.D. fellowship awarded to A.A.F.-L. (CVU: 860574). The Universidad Autónoma de Sinaloa (UAS) for additional academic and financial support to A.A.F.-L. during this research. The patients and their families for their participation in this study. The Microarray Unit at the National Institute of Genomic Medicine (INMEGEN) for technical support in processing the MethylationEPIC arrays. Clinical collaborators from participating hospitals for patient recruitment and sample collection.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ALLAcute lymphoblastic leukemia
DNAmeDNA methylation
EWASepigenome-wide association study
B-ALLB-cell ALL
Z-BMIBody Mass Index Z score
NCINational Cancer Institute
VERVery early relapse
EREarly relapse
DMCDifferentially methylated CpG sites
5’UTR5′ untranslated regions
TSS200Region 200bp previous of transcription start site
TSS1500Region 1500bp previous of transcription start site
DMRDifferentially methylated regions
GOGene Ontology
IPAIngenuity pathway analysis
PPIProtein–protein interaction
OOBOut-of-bag
RFSRelapse-free survival
MIGICCLMexican Inter-Institutional Group for the Identification of the Causes of Childhood Leukemia
BMBone marrow
PBPeripheral blood
CNSCentral nervous system
BMIQBeta Mixture Quantile dilation
HRHazard ratios

References

  1. Jiménez-Morales, S.; Hidalgo-Miranda, A.; Ramírez-Bello, J. Leucemia linfoblástica aguda infantil: Una aproximación genómica. Bol. Med. Hosp. Infant. Mex. 2017, 74, 13–26. [Google Scholar] [CrossRef]
  2. Siegel, R.L.; Miller, K.D.; Jemal, A. Cancer statistics. CA Cancer J. Clin. 2017, 67, 7–30. [Google Scholar] [CrossRef]
  3. Flores-Lujano, J.; Duarte-Rodríguez, D.A.; Jiménez-Hernández, E.; Martín-Trejo, J.A.; Allende-López, A.; Peñaloza-González, J.G.; Pérez-Saldivar, M.L.; Medina-Sanson, A.; Torres-Nava, J.R.; Solís-Labastida, K.A.; et al. Persistently high incidence rates of childhood acute leukemias from 2010 to 2017 in Mexico City: A population study from the MIGICCL. Front. Public Health 2022, 10, 918921. [Google Scholar] [CrossRef]
  4. National Cancer Institute. Childhood Leukemia—Cancer Stat Facts. Surveillance, Epidemiology, and End Results Program. 2020. Available online: https://seer.cancer.gov/statfacts/html/childleuk.html (accessed on 26 September 2025).
  5. de Smith, A.J.; Jiménez-Morales, S.; Mejía-Aranguré, J.M. The genetic risk of acute lymphoblastic leukemia and its implications for children of Latin American origin. Front. Oncol. 2023, 13, 1299355. [Google Scholar] [CrossRef] [PubMed]
  6. Moreira, D.C.; González-Ramella, O.; Valenzuela, M.E.; Carrillo, A.K.; Faughnan, L.; Job, G.; Chen, Y.; Villegas, C.; Irigoyen, A.E.; Urbays, R.B.; et al. Evaluation of factors leading to poor outcomes for pediatric acute lymphoblastic leukemia in Mexico: A multi-institutional report of 2,116 patients. Front. Oncol. 2023, 13, 1255555. [Google Scholar] [CrossRef]
  7. Quiroz, E.; Aldoss, I.; Pullarkat, V.; Rego, E.; Marcucci, G.; Douer, D. The emerging story of acute lymphoblastic leukemia among the Latin American population—Biological and clinical implications. Blood Rev. 2019, 33, 98–105. [Google Scholar] [CrossRef]
  8. Montes-Rodríguez, I.M.; Soto-Salgado, M.; Torres-Cintrón, C.R.; Tomassini-Fernandini, J.C.; Suárez, E.; Clavell, L.A.; Cadilla, C.L. Incidence and Mortality Rates for Childhood Acute Lymphoblastic Leukemia in Puerto Rican Hispanics, 2012–2016. Cancer Epidemiol. Biomark. Prev. 2023, 32, 1030–1037. [Google Scholar] [CrossRef] [PubMed]
  9. Hunger, S.P.; Raetz, E.A. How I treat relapsed acute lymphoblastic leukemia in the pediatric population. Blood 2020, 136, 1803–1812. [Google Scholar] [CrossRef] [PubMed]
  10. Rheingold, S.R.; Bhojwani, D.; Ji, L.; Xu, X.; Devidas, M.; Kairalla, J.A.; Shago, M.; Heerema, N.A.; Carroll, A.J.; Breidenbach, H.; et al. Determinants of survival after first relapse of acute lymphoblastic leukemia: A Children’s Oncology Group study. Leukemia 2024, 38, 2382–2394. [Google Scholar] [CrossRef]
  11. Burns, M.A.; Place, A.E.; Stevenson, K.E.; Gutiérrez, A.; Forrest, S.; Pikman, Y.; Vrooman, L.M.; Harris, M.H.; Weinberg, O.K.; Hunt, S.K.; et al. Identification of prognostic factors in childhood T-cell acute lymphoblastic leukemia: Results from DFCI ALL Consortium Protocols 05-001 and 11-001. Pediatr. Blood Cancer 2020, 68, e28719. [Google Scholar] [CrossRef]
  12. Toft, N.; Birgens, H.; Abrahamsson, J.; Griškevičius, L.; Hallböök, H.; Heyman, M.; Klausen, T.W.; Jónsson, Ó.; Palk, K.; Pruunsild, K.; et al. Results of NOPHO ALL2008 treatment for patients aged 1–45 years with acute lymphoblastic leukemia. Leukemia 2017, 32, 606–615. [Google Scholar] [CrossRef]
  13. Roberts, K.G.; Li, Y.; Payne-Turner, D.; Harvey, R.C.; Yang, Y.-L.; Pei, D.; McCastlain, K.; Ding, L.; Lu, C.; Song, G.; et al. Targetable Kinase-Activating Lesions in Ph-like Acute Lymphoblastic Leukemia. N. Engl. J. Med. 2014, 371, 1005–1015. [Google Scholar] [CrossRef]
  14. Østergaard, A.; Fiocco, M.; de Groot-Kruseman, H.; Moorman, A.V.; Vora, A.; Zimmermann, M.; Schrappe, M.; Biondi, A.; Escherich, G.; Stary, J.; et al. ETV6::RUNX1 Acute Lymphoblastic Leukemia: How much therapy is needed for cure? Leukemia 2024, 38, 1477–1487. [Google Scholar] [CrossRef]
  15. Schmiegelow, K.; Forestier, E.; Hellebostad, M.; Heyman, M.; Kristinsson, J.; Söderhäll, S.; Taskinen, M.; on behalf of the Nordic Society of Paediatric Haematology and Oncology (NOPHO). Long-term results of NOPHO ALL-92 and ALL-2000 studies of childhood acute lymphoblastic leukemia. Leukemia 2009, 24, 345–354, Erratum in Leukemia 2010, 24, 670.. [Google Scholar] [CrossRef]
  16. Allemani, C.; Matsuda, T.; Di Carlo, V.; Harewood, R.; Matz, M.; Niksic, M. Global surveillance of trends in cancer survival: Analysis of individual records for 37,513,025 patients diagnosed with one of 18 cancers during 2000–2014 from 322 population-based registries in 71 countries (CONCORD-3). Lancet 2018, 391, 1023–1075. [Google Scholar] [CrossRef]
  17. Lin, A.; Cheng, F.W.; Chiang, A.K.; Luk, C.; Li, R.C.; Ling, A.S.; Cheuk, D.K.; Chang, K.; Ku, D.; Lee, V.; et al. Excellent outcome of acute lymphoblastic leukaemia with TCF3-PBX1 rearrangement in Hong Kong. Pediatr. Blood Cancer 2018, 65, e27346. [Google Scholar] [CrossRef]
  18. Jiménez-Hernández, E.; Jaimes-Reyes, E.Z.; Arellano-Galindo, J.; García-Jiménez, X.; Tiznado-García, H.M.; Dueñas-González, M.T.; Villegas, O.M.; Sánchez-Jara, B.; Bekker-Méndez, V.C.; Ortíz-Torres, M.G.; et al. Survival of Mexican children with acute lymphoblastic leukaemia under treatment with the protocol from the Dana-Farber cancer institute 00-01. BioMed Res. Int. 2015, 2015, 576950. [Google Scholar] [CrossRef]
  19. Chang, T.-C.; Chen, W.; Qu, C.; Cheng, Z.; Hedges, D.; Elsayed, A.; Pounds, S.B.; Shago, M.; Rabin, K.R.; Raetz, E.A.; et al. Genomic Determinants of Outcome in Acute Lymphoblastic Leukemia. J. Clin. Oncol. 2024, 42, 3491–3503. [Google Scholar] [CrossRef]
  20. Jiménez-Morales, S.; Miranda-Peralta, E.; Saldaña-Alvarez, Y.; Perez-Vera, P.; Paredes-Aguilera, R.; Rivera-Luna, R.; Velázquez-Cruz, R.; Ramírez-Bello, J.; Carnevale, A.; Orozco, L. BCR-ABL, ETV6-RUNX1 and E2A-PBX1: Prevalence of the most common acute lymphoblastic leukemia fusion genes in Mexican patients. Leuk. Res. 2008, 32, 1518–1522. [Google Scholar] [CrossRef]
  21. Mata-Rocha, M.; Rangel-López, A.; Jimenez-Hernandez, E.; Nuñez-Enríquez, J.C.; Morales-Castillo, B.A.; Sánchez-Escobar, N.; Sepúlveda-Robles, O.A.; Bravata-Alcántara, J.C.; Nájera-Cortés, A.S.; Pérez-Saldivar, M.L.; et al. Low Prevalence of ETV6::RUNX1 Fusion Gene in a Hispanic Population. Front. Pediatr. 2022, 10, 837656. [Google Scholar] [CrossRef]
  22. Malhotra, P.; Jain, S.; Agarwal, A.; Sharma, A.; Agarwal, N.; Kapoor, G. Incidence and Prognostic Impact of TCF3-PBX1 Fusion in Childhood Acute Lymphoblastic Leukemia: A Single Centre Experience. Indian J. Hematol. Blood Transfus. 2021, 38, 164–168. [Google Scholar] [CrossRef]
  23. Lee, S.H.R.; Antillon-Klussmann, F.; Pei, D.; Yang, W.; Roberts, K.G.; Li, Z.; Devidas, M.; Yang, W.; Najera, C.; Lin, H.P.; et al. Association of Genetic Ancestry with the Molecular Subtypes and Prognosis of Childhood Acute Lymphoblastic Leukemia. JAMA Oncol. 2022, 8, 354–363, Erratum in JAMA Oncol. 2022, 8, 484. [Google Scholar] [CrossRef] [PubMed]
  24. Hanahan, D. Hallmarks of Cancer: New Dimensions. Cancer Discov. 2022, 12, 31–46. [Google Scholar] [CrossRef]
  25. Lim, J.Y.; Bhatia, S.; Robison, L.L.; Yang, J.J. Genomics of Racial and Ethnic Disparities in Childhood Acute Lymphoblastic Leukemia. Cancer 2013, 120, 955–962. [Google Scholar] [CrossRef] [PubMed]
  26. de Smith, A.J.; Wahlster, L.; Jeon, S.; Kachuri, L.; Black, S.; Langie, J.; Cato, L.D.; Nakatsuka, N.; Chan, T.-F.; Xia, G.; et al. A noncoding regulatory variant in IKZF1 increases acute lymphoblastic leukemia risk in Hispanic/Latino children. Cell Genom. 2024, 4, 100526. [Google Scholar] [CrossRef] [PubMed]
  27. Xu, H.; Cheng, C.; Devidas, M.; Pei, D.; Fan, Y.; Yang, W.; Neale, G.; Scheet, P.; Burchard, E.G.; Torgerson, D.G.; et al. ARID5B genetic polymorphisms contribute to racial disparities in the incidence and treatment outcome of childhood acute lymphoblastic leukemia. J. Clin. Oncol. 2012, 30, 751–757. [Google Scholar] [CrossRef]
  28. Gu, Z.; Churchman, M.L.; Roberts, K.G.; Moore, I.; Zhou, X.; Nakitandwe, J.; Hagiwara, K.; Pelletier, S.; Gingras, S.; Berns, H.; et al. PAX5-driven subtypes of B-progenitor acute lymphoblastic leukemia. Nat. Genet. 2019, 51, 296–307. [Google Scholar] [CrossRef]
  29. Shah, B.; Mattison, R.J.; Abboud, R.; Abdelmessieh, P.; Aldoss, I.; Burke, P.W.; DeAngelo, D.J.; Dinner, S.; Fathi, A.T.; Gauthier, J.; et al. Acute Lymphoblastic Leukemia, Version 2.2024, NCCN Clinical Practice Guidelines in Oncology. J. Natl. Compr. Cancer Netw. 2024, 22, 563–576. [Google Scholar] [CrossRef]
  30. Pölönen, P.; Di Giacomo, D.; Seffernick, A.E.; Elsayed, A.; Kimura, S.; Benini, F.; Montefiori, L.E.; Wood, B.L.; Xu, J.; Chen, C.; et al. The genomic basis of childhood T-lineage acute lymphoblastic leukaemia. Nature 2024, 632, 1082–1091. [Google Scholar] [CrossRef]
  31. Almamun, M.; Levinson, B.T.; van Swaay, A.C.; Johnson, N.T.; McKay, S.D.; Arthur, G.L.; Davis, J.W.; Taylor, K.H. Integrated methylome and transcriptome analysis reveals novel regulatory elements in pediatric acute lymphoblastic leukemia. Epigenetics 2015, 10, 882–890. [Google Scholar] [CrossRef]
  32. Bergeron, B.P.; Diedrich, J.D.; Zhang, Y.; Barnett, K.R.; Dong, Q.; Ferguson, D.C.; Autry, R.J.; Yang, W.; Hansen, B.S.; Smith, C.; et al. Epigenomic profiling of glucocorticoid responses identifies cis-regulatory disruptions impacting steroid resistance in childhood acute lymphoblastic leukemia. Leukemia 2022, 36, 2374–2383. [Google Scholar] [CrossRef]
  33. Tejedor, J.R.; Bueno, C.; Vinyoles, M.; Petazzi, P.; Agraz-Doblas, A.; Cobo, I.; Torres-Ruiz, R.; Bayón, G.F.; Pérez, R.F.; López-Tamargo, S.; et al. Integrative methylome-transcriptome analysis unravels cancer cell vulnerabilities in infant MLL-rearranged B cell acute lymphoblastic leukemia. J. Clin. Investig. 2021, 131, e138833. [Google Scholar] [CrossRef]
  34. Ghantous, A.; Nusslé, S.G.; Nassar, F.J.; Spitz, N.; Novoloaca, A.; Krali, O.; Nickels, E.; Cahais, V.; Cuenin, C.; Roy, R.; et al. Epigenome-wide analysis across the development span of pediatric acute lymphoblastic leukemia: Backtracking to birth. Mol. Cancer 2024, 23, 238. [Google Scholar] [CrossRef] [PubMed]
  35. Borssén, M.; Nordlund, J.; Haider, Z.; Landfors, M.; Larsson, P.; Kanerva, J.; Schmiegelow, K.; Flaegstad, T.; Jónsson, Ó.G.; Frost, B.-M.; et al. DNA methylation holds prognostic information in relapsed precursor B-cell acute lymphoblastic leukemia. Clin. Epigenet. 2018, 10, 1–7. [Google Scholar] [CrossRef]
  36. Jain, N.; Roberts, K.G.; Jabbour, E.; Patel, K.; Eterovic, A.K.; Chen, K.; Zweidler-McKay, P.; Lu, X.; Fawcett, G.; Wang, S.A.; et al. Ph-like acute lymphoblastic leukemia: A high-risk subtype in adults. Blood 2017, 129, 572–581. [Google Scholar] [CrossRef]
  37. Shoag, J.M.; Barredo, J.C.; Lossos, I.S.; Pinheiro, P.S. Acute lymphoblastic leukemia mortality in Hispanic Americans. Leuk. Lymphoma 2020, 61, 2674–2681. [Google Scholar] [CrossRef]
  38. Jones, P.A. Functions of DNA methylation: Islands, start sites, gene bodies and beyond. Nat. Rev. Genet. 2012, 13, 484–492. [Google Scholar] [CrossRef]
  39. Li, S.; Sok, P.; Xu, K.; Muskens, I.S.; Elliott, N.; Myint, S.S.; Pandey, P.; Hansen, H.M.; Morimoto, L.M.; Kang, A.Y.; et al. Epigenome-wide association study of acute lymphoblastic leukemia in children with down syndrome. Blood Adv. 2022, 6, 4132–4136. [Google Scholar] [CrossRef] [PubMed]
  40. Nordlund, J.; Bäcklin, C.L.; Wahlberg, P.; Busche, S.; Berglund, E.C.; Eloranta, M.-L.; Flaegstad, T.; Forestier, E.; Frost, B.-M.; Harila-Saari, A.; et al. Genome-wide signatures of differential DNA methylation in pediatric acute lymphoblastic leukemia. Genome Biol. 2013, 14, r105. [Google Scholar] [CrossRef]
  41. Rahmani, M.; Talebi, M.; Hagh, M.F.; Feizi, A.A.H.; Solali, S. Aberrant DNA methylation of key genes and Acute Lymphoblastic Leukemia. Biomed. Pharmacother. 2018, 97, 1493–1500. [Google Scholar] [CrossRef] [PubMed]
  42. Hogan, L.E.; Meyer, J.A.; Yang, J.; Wang, J.; Wong, N.; Yang, W.; Condos, G.; Hunger, S.P.; Raetz, E.; Saffery, R.; et al. Integrated genomic analysis of relapsed childhood acute lymphoblastic leukemia reveals therapeutic strategies. Blood 2011, 118, 5218–5226. [Google Scholar] [CrossRef]
  43. Milani, L.; Lundmark, A.; Kiialainen, A.; Nordlund, J.; Flaegstad, T.; Forestier, E.; Heyman, M.; Jonmundsson, G.; Kanerva, J.; Schmiegelow, K.; et al. DNA methylation for subtype classification and prediction of treatment outcome in patients with childhood acute lymphoblastic leukemia. Blood 2010, 115, 1214–1225. [Google Scholar] [CrossRef] [PubMed]
  44. Bhatla, T.; Wang, J.; Morrison, D.J.; Raetz, E.A.; Burke, M.J.; Brown, P.; Carroll, W.L. Epigenetic reprogramming reverses the relapse-specific gene expression signature and restores chemosensitivity in childhood B-lymphoblastic leukemia. Blood 2012, 119, 5201–5210. [Google Scholar] [CrossRef]
  45. Simonin, M.; Touzart, A. Epigenetics-based stratification in pediatric T-ALL. Blood 2025, 145, 2108–2110. [Google Scholar] [CrossRef]
  46. Tian, Y.; Morris, T.J.; Webster, A.P.; Yang, Z.; Beck, S.; Feber, A.; Teschendorff, A.E. ChAMP: Updated methylation analysis pipeline for Illumina BeadChips. Bioinformatics 2017, 33, 3982–3984. [Google Scholar] [CrossRef] [PubMed]
  47. Figueroa, M.E.; Chen, S.-C.; Andersson, A.K.; Phillips, L.A.; Li, Y.; Sotzen, J.; Kundu, M.; Downing, J.R.; Melnick, A.; Mullighan, C.G. Integrated genetic and epigenetic analysis of childhood acute lymphoblastic leukemia. J. Clin. Investig. 2013, 123, 3099–3111. [Google Scholar] [CrossRef]
  48. Chatterton, Z.; Morenos, L.; Mechinaud, F.; Ashley, D.M.; Craig, J.M.; Sexton-Oates, A.; Halemba, M.S.; Parkinson-Bates, M.; Ng, J.; Morrison, D.; et al. Epigenetic deregulation in pediatric acute lymphoblastic leukemia. Epigenetics 2014, 9, 459–467. [Google Scholar] [CrossRef]
  49. Caliskan, G.; Pawitan, Y.; Vu, T.N. Similarities and differences of bone marrow and peripheral blood samples from acute myeloid leukemia patients in terms of cellular heterogeneity and ex-vivo drug sensitivity. eJHaem 2024, 5, 721–727. [Google Scholar] [CrossRef]
  50. Takeuchi, S.; Matsushita, M.; Zimmermann, M.; Ikezoe, T.; Komatsu, N.; Seriu, T.; Schrappe, M.; Bartram, C.R.; Koeffler, H.P. Clinical significance of aberrant DNA methylation in childhood acute lymphoblastic leukemia. Leuk. Res. 2011, 35, 1345–1349. [Google Scholar] [CrossRef]
  51. Hetzel, S.; Mattei, A.L.; Kretzmer, H.; Qu, C.; Chen, X.; Fan, Y.; Wu, G.; Roberts, K.G.; Luger, S.; Litzow, M.; et al. Acute lymphoblastic leukemia displays a distinct highly methylated genome. Nat. Cancer 2022, 3, 768–782. [Google Scholar] [CrossRef]
  52. Schäfer, V.; Ernst, J.; Rinke, J.; Winkelmann, N.; Beck, J.F.; Hochhaus, A.; Gruhn, B.; Ernst, T. EZH2 mutations and promoter hypermethylation in childhood acute lymphoblastic leukemia. J. Cancer Res. Clin. Oncol. 2016, 142, 1641–1650. [Google Scholar] [CrossRef]
  53. Roman-Gomez, J.; Jimenez-Velasco, A.; Barrios, M.; Prosper, F.; Heiniger, A.; Torres, A.; Agirre, X. Poor prognosis in acute lymphoblastic leukemia may relate to promoter hypermethylation of cancer-related genes. Leuk. Lymphoma 2007, 48, 1269–1282. [Google Scholar] [CrossRef]
  54. Ng, Ö.H.; Fırtına, S.; Can, I.; Karakaş, Z.; Ağaoğlu, L.; Doğru, Ö.; Celkan, T.; Akçay, A.; Yıldırmak, Y.; Timur, Ç.; et al. WNT5A hipermetilasyonun Çocukluk Çağı akut lenfoblastik lösemideki olası rolü. Turk. J. Hematol. 2015, 32, 127–135. [Google Scholar] [CrossRef]
  55. Roman-Gomez, J.; Castillejo, J.A.; Jimenez, A.; Barrios, M.; Heiniger, A.; Torres, A. The role of DNA hypermethylation in the pathogenesis and prognosis of acute lymphoblastic leukemia. Leuk. Lymphoma 2003, 44, 1855–1864. [Google Scholar] [CrossRef] [PubMed]
  56. Johnstone, S.E.; Reyes, A.; Qi, Y.; Adriaens, C.; Hegazi, E.; Pelka, K.; Chen, J.H.; Zou, L.S.; Drier, Y.; Hecht, V.; et al. Large-Scale Topological Changes Restrain Malignant Progression in Colorectal Cancer. Cell 2020, 182, 1474–1489.e23. [Google Scholar] [CrossRef]
  57. May-Hau, D.I.; Bárcenas-López, D.A.; Núñez-Enríquez, J.C.; Bekker-Méndez, V.C.; Beltrán-Anaya, F.O.; Jiménez-Hernández, E.; Ortíz-Maganda, M.P.; Guerra-Castillo, F.X.; Medina-Sanson, A.; Flores-Lujano, J.; et al. Underexpression of LINC00173 in TCF3/PBX1-Positive Cases Is Associated With Poor Prognosis in Children With B-Cell Precursor Acute Lymphoblastic Leukemia. Front. Oncol. 2022, 12, 887766. [Google Scholar] [CrossRef]
  58. Bárcenas-López, D.A.; Núñez-Enríquez, J.C.; Hidalgo-Miranda, A.; Beltrán-Anaya, F.O.; May-Hau, D.I.; Jiménez-Hernández, E.; Bekker-Méndez, V.C.; Flores-Lujano, J.; Medina-Sansón, A.; Tamez-Gómez, E.L.; et al. Transcriptome Analysis Identifies LINC00152 as a Biomarker of Early Relapse and Mortality in Acute Lymphoblastic Leukemia. Genes 2020, 11, 302. [Google Scholar] [CrossRef]
  59. Georgiadis, P.; Liampa, I.; Hebels, D.G.; Krauskopf, J.; Chatziioannou, A.; Valavanis, I.; de Kok, T.M.; Kleinjans, J.C.; Bergdahl, I.A.; Melin, B.; et al. Evolving DNA methylation and gene expression markers of B-cell chronic lymphocytic leukemia are present in pre-diagnostic blood samples more than 10 years prior to diagnosis. BMC Genom. 2017, 18, 728. [Google Scholar] [CrossRef]
  60. Yang, L.; Rodriguez, B.; Mayle, A.; Park, H.J.; Lin, X.; Luo, M.; Jeong, M.; Curry, C.V.; Kim, S.-B.; Ruau, D.; et al. DNMT3A Loss Drives Enhancer Hypomethylation in FLT3-ITD-Associated Leukemias. Cancer Cell 2016, 29, 922–934, Erratum in Cancer Cell 2016, 30, 363–365.. [Google Scholar] [CrossRef]
  61. Bastian, L.; Schroeder, M.P.; Eckert, C.; Schlee, C.; Tanchez, J.O.; Kämpf, S.; Wagner, D.L.; Schulze, V.; Isaakidis, K.; Lázaro-Navarro, J.; et al. PAX5 biallelic genomic alterations define a novel subgroup of B-cell precursor acute lymphoblastic leukemia. Leukemia 2019, 33, 1895–1909. [Google Scholar] [CrossRef]
  62. Guilhamon, P.; Eskandarpour, M.; Halai, D.; Wilson, G.A.; Feber, A.; Teschendorff, A.E.; Gomez, V.; Hergovich, A.; Tirabosco, R.; Amary, M.F.; et al. Meta-analysis of IDH-mutant cancers identifies EBF1 as an interaction partner for TET2. Nat. Commun. 2013, 4, 2166. [Google Scholar] [CrossRef]
  63. Navarrete-Meneses, M.d.P.; Pérez-Vera, P. Epigenetic alterations in acute lymphoblastic leukemia. Bol. Med. Hosp. Infant. Mex. 2017, 74, 243–264. [Google Scholar] [CrossRef]
  64. Núñez-Enríquez, J.C.; Bárcenas-López, D.A.; Hidalgo-Miranda, A.; Jiménez-Hernández, E.; Bekker-Méndez, V.C.; Flores-Lujano, J.; Solis-Labastida, K.A.; Martínez-Morales, G.B.; Sánchez-Muñoz, F.; Espinoza-Hernández, L.E.; et al. Gene Expression Profiling of Acute Lymphoblastic Leukemia in Children with Very Early Relapse. Arch. Med. Res. 2016, 47, 644–655. [Google Scholar] [CrossRef]
  65. Brown, P.; Levis, M.; Shurtleff, S.; Campana, D.; Downing, J.; Small, D. FLT3 inhibition selectively kills childhood acute lymphoblastic leukemia cells with high levels of FLT3 expression. Blood 2005, 105, 812–820. [Google Scholar] [CrossRef] [PubMed]
  66. Hughes, K.; Evans, K.; Earley, E.J.; Smith, C.M.; Erickson, S.W.; Stearns, T.; Philip, V.M.; Neuhauser, S.B.; Chuang, J.H.; Jocoy, E.L.; et al. In vivo activity of the dual SYK/FLT3 inhibitor TAK-659 against pediatric acute lymphoblastic leukemia xenografts. Pediatr. Blood Cancer 2023, 70, e30503. [Google Scholar] [CrossRef]
  67. Au, A.C.; Hernandez, P.A.; Lieber, E.; Nadroo, A.M.; Shen, Y.-M.; Kelley, K.A.; Gelb, B.D.; Diaz, G.A. Protein tyrosine phosphatase PTPN14 Is a regulator of lymphatic function and choanal development in humans. Am. J. Hum. Genet. 2010, 87, 436–444. [Google Scholar] [CrossRef]
  68. Olafsdottir, T.; Stacey, S.N.; Sveinbjornsson, G.; Thorleifsson, G.; Norland, K.; Sigurgeirsson, B.; Thorisdottir, K.; Kristjansson, A.K.; Tryggvadottir, L.; Sarin, K.Y.; et al. Loss-of-function variants in the tumor-suppressor gene PTPN14 confer increased cancer risk. Cancer Res. 2021, 81, 1954–1964. [Google Scholar] [CrossRef]
  69. Liu, Y.; Zhang, J.; Du, Z.; Huang, J.; Cheng, Y.; Yi, W.; Li, T.; Yang, J.; Chen, C. Comprehensive analysis of PTPN family expression and prognosis in acute myeloid leukemia. Front. Genet. 2023, 13, 1087938. [Google Scholar] [CrossRef]
  70. Kleppe, M.; Lahortiga, I.; El Chaar, T.; De Keersmaecker, K.; Mentens, N.; Graux, C.; Van Roosbroeck, K.; Ferrando, A.A.; Langerak, A.W.; Meijerink, J.P.P.; et al. Deletion of the protein tyrosine phosphatase gene PTPN2 in T-cell acute lymphoblastic leukemia. Nat. Genet. 2010, 42, 530–535. [Google Scholar] [CrossRef]
  71. Laczmanska, I.; Karpinski, P.; Bebenek, M.; Sedziak, T.; Ramsey, D.; Szmida, E.; Sasiadek, M.M. Protein tyrosine phosphatase receptor-like genes are frequently hypermethylated in sporadic colorectal cancer. J. Hum. Genet. 2012, 58, 11–15. [Google Scholar] [CrossRef]
  72. Tsai, Y.-H.; Mitra, P.; Taniar, D.; Pai, T.-W. DNA methylation biomarker analysis from low-survival-rate cancers based on genetic functional approaches. Front. Bioinform. 2025, 5, 1523524. [Google Scholar] [CrossRef]
  73. Izquierdo, A.G.; Boughanem, H.; Diaz-Lagares, A.; Arranz-Salas, I.; Esteller, M.; Tinahones, F.J.; Casanueva, F.F.; Macias-Gonzalez, M.; Crujeiras, A.B. DNA methylome in visceral adipose tissue can discriminate patients with and without colorectal cancer. Epigenetics 2021, 17, 665–676. [Google Scholar] [CrossRef]
  74. Lin, Y.; Wu, Y.; Zhang, Q.; Tu, X.; Chen, S.; Pan, J.; Xu, N.; Lin, M.; She, P.; Niu, G.; et al. RPTOR blockade suppresses brain metastases of NSCLC by interfering the ceramide metabolism via hijacking YY1 binding. J. Exp. Clin. Cancer Res. 2024, 43, 1, Erratum in J. Exp. Clin. Cancer Res. 2024, 43, 37.. [Google Scholar] [CrossRef]
  75. Wang, H.; Wang, W.; Wang, Z.; Li, X. Transcriptomic correlates of cell cycle checkpoints with distinct prognosis, molecular characteristics, immunological regulation, and therapeutic response in colorectal adenocarcinoma. Front. Immunol. 2023, 14, 1291859. [Google Scholar] [CrossRef]
  76. Copeland, S.E.; Snow, S.M.; Wan, J.; Matkowskyj, K.A.; Halberg, R.B.; Weaver, B.A. MAD1 upregulation sensitizes to inflammation-mediated tumor formation. PLoS Genet. 2024, 20, e1011437. [Google Scholar] [CrossRef]
  77. Liping, X.; Jia, L.; Qi, C.; Liang, Y.; Dongen, L.; Jianshuai, J. Cell Cycle Genes Are Potential Diagnostic and Prognostic Biomarkers in Hepatocellular Carcinoma. BioMed. Res. Int. 2020, 2020, 6206157, Erratum in Biomed. Res. Int. 2020, 2020, 8679532. [Google Scholar] [CrossRef]
  78. Lupo, P.J.; Brown, A.L.; Arroyo, V.M.; Kamdar, K.Y.; Belmont, J.W.; Scheurer, M.E.; Leisenring, W.M.; Gramatges, M.M.; Okcu, M.F.; Yasui, Y.; et al. DNA methylation and obesity in survivors of pediatric acute lymphoblastic leukemia: A report from the Childhood Cancer Survivor Study. Genes Chromosom. Cancer 2018, 58, 52–59. [Google Scholar] [CrossRef]
  79. Zhu, Z.; Zhou, Q.; Sun, Y.; Lai, F.; Wang, Z.; Hao, Z.; Li, G. MethMarkerDB: A comprehensive cancer DNA methylation biomarker database. Nucleic Acids Res. 2023, 52, D1380–D1392. [Google Scholar] [CrossRef]
  80. Schraw, J.M.; Yiu, T.T.; Lupo, P.J.; Tsavachidis, S.; Rau, R.; Bondy, M.L.; Rabin, K.R.; Shen, L.; Scheurer, M.E. Maternal folate genes and aberrant DNA hypermethylation in pediatric acute lymphoblastic leukemia. PLoS ONE 2018, 13, e0197408. [Google Scholar] [CrossRef]
  81. Zhou, M.; Ge, X.; Xu, X.; Sheng, B.; Wang, H.; Shi, H.; Liu, S.; Tan, B.; Xu, K.; Wang, J. A hot and cold tumor-related prognostic signature for stage II colorectal cancer. Oncol. Lett. 2024, 28, 419. [Google Scholar] [CrossRef]
  82. Susmi, T.F.; Rahman, A.; Khan, M.R.; Yasmin, F.; Islam, S.; Nasif, O.; Alharbi, S.A.; Batiha, G.E.-S.; Hossain, M.U. Prognostic and clinicopathological insights of phosphodiesterase 9A gene as novel biomarker in human colorectal cancer. BMC Cancer 2021, 21, 577. [Google Scholar] [CrossRef]
  83. Ma, X.; Edmonson, M.; Yergeau, D.; Muzny, D.M.; Hampton, O.A.; Rusch, M.; Song, G.; Easton, J.; Harvey, R.C.; Wheeler, D.A.; et al. Rise and fall of subclones from diagnosis to relapse in pediatric B-acute lymphoblastic leukaemia. Nat. Commun. 2015, 6, 6604. [Google Scholar] [CrossRef]
  84. Lombardo, B.; Esposito, D.; Iossa, S.; Vitale, A.; Verdesca, F.; Perrotta, C.; Di Leo, L.; Costa, V.; Pastore, L.; Franzé, A. Intragenic deletion in macrod2: A family with complex phenotypes including microcephaly, intellectual disability, polydactyly, renal and pancreatic malformations. Cytogenet. Genome Res. 2019, 158, 25–31. [Google Scholar] [CrossRef]
  85. Lu, Z.; Xie, J.; Wu, G.; Shen, J.; Collins, R.; Chen, W.; Kang, X.; Luo, M.; Zou, Y.; Huang, L.J.-S.; et al. Fasting selectively blocks development of acute lymphoblastic leukemia via leptin-receptor upregulation. Nat. Med. 2016, 23, 79–90. [Google Scholar] [CrossRef] [PubMed]
  86. Zhang, Y.; Liao, Q.; Wen, X.; Fan, J.; Yuan, T.; Tong, X.; Jia, R.; Chai, P.; Fan, X. Hijacking of the nervous system in cancer: Mechanism and therapeutic targets. Mol. Cancer 2025, 24, 44. [Google Scholar] [CrossRef] [PubMed]
  87. Gossai, N.P.; Gordon, P.M. The role of the central nervous system microenvironment in pediatric acute lymphoblastic leukemia. Front. Pediatr. 2017, 5, 90. [Google Scholar] [CrossRef] [PubMed]
  88. Ranta, S.; Nilsson, F.; Harila-Saari, A.; Saft, L.; Tani, E.; Söderhäll, S.; Heyman, M. Detection of Central Nervous System Involvement in Childhood Acute Lymphoblastic Leukemia by Cytomorphology and Flow Cytometry of the Cerebrospinal Fluid. Pediatric Blood Cancer 2015, 62, 951–956. [Google Scholar] [CrossRef]
  89. Wang, H.-M.; Feng, M.; Liu, Y.-L.; Wei, C.-F.; Qian, M.-J.; Li, T.; Yang, X.-P.; Cui, S.; Liu, C.-Y.; Yi, F.; et al. Up-Regulation of GABAergic Signal Events in Bone Marrow Lymphocytes in Childhood Acute Lymphoblastic Leukemia. Chin. J. Physiol. 2016, 59, 119–125. [Google Scholar] [CrossRef]
  90. Liu, X.; Guo, H.; Kang, M.; Fu, W.; Li, H.; Ji, H.; Zhao, J.; Fang, Y.; Du, M.; Xue, Y. Multi-step gene set analysis identified HTR3 family genes involving childhood acute lymphoblastic leukemia susceptibility. Arch. Toxicol. 2024, 99, 299–307. [Google Scholar] [CrossRef]
  91. Stanulović, V.S.; Al Omair, S.; Reed, M.A.; Roberts, J.; Potluri, S.; Fulton-Ward, T.; Gudgeon, N.; Bishop, E.L.; Roels, J.; Perry, T.A.; et al. The glutamate/aspartate transporter EAAT1 is crucial for T-cell acute lymphoblastic leukemia proliferation and survival. Haematologica 2024, 109, 3505–3519. [Google Scholar] [CrossRef]
  92. Tang, J.; Peng, W.; Ji, J.; Peng, C.; Wang, T.; Yang, P.; Gu, J.; Feng, Y.; Jin, K.; Wang, X.; et al. GPR176 Promotes Cancer Progression by Interacting with G Protein GNAS to Restrain Cell Mitophagy in Colorectal Cancer. Adv. Sci. 2023, 10, 2205627. [Google Scholar] [CrossRef] [PubMed]
  93. Garcia-Carpizo, V.; Ruiz-Llorente, S.; Sarmentero, J.; González-Corpas, A.; Barrero, M.J. CREBBP/EP300 bromodomain inhibition affects the proliferation of AR-positive breast cancer cell lines. Mol. Cancer Res. 2019, 17, 720–730. [Google Scholar] [CrossRef] [PubMed]
  94. Wang, Y.; Guo, Z.; Tian, Y.; Cong, L.; Zheng, Y.; Wu, Z.; Shan, G.; Xia, Y.; Zhu, Y.; Li, X.; et al. MAPK1 promotes the metastasis and invasion of gastric cancer as a bidirectional transcription factor. BMC Cancer 2023, 23, 959. [Google Scholar] [CrossRef]
  95. Lin, H.-P.; Lin, C.-Y.; Huo, C.; Jan, Y.-J.; Tseng, J.-C.; Jiang, S.S.; Kuo, Y.-Y.; Chen, S.-C.; Wang, C.-T.; Chan, T.-M.; et al. AKT3 promotes prostate cancer proliferation cells through regulation of Akt, B-Raf & TSC1/TSC2. Oncotarget 2015, 6, 27097–27112. [Google Scholar] [CrossRef]
  96. Chen, S.; Ran, J.; Fan, Z.; Liu, M.; Wu, L.; Li, Q.; Peng, J.; Hu, Z. Functional status analysis of RNH1 in bladder cancer for predicting immunotherapy response. Sci. Rep. 2023, 13, 12625. [Google Scholar] [CrossRef]
  97. Lin, X.; Dinglin, X.; Cao, S.; Zheng, S.; Wu, C.; Chen, W.; Li, Q.; Hu, Q.; Zheng, F.; Wu, Z.; et al. Enhancer-Driven lncRNA BDNF-AS Induces Endocrine Resistance and Malignant Progression of Breast Cancer through the RNH1/TRIM21/mTOR Cascade. Cell Rep. 2020, 31, 107753. [Google Scholar] [CrossRef]
  98. Haddad, S.A.; Ruiz-Narváez, E.A.; Haiman, C.A.; Sucheston-Campbell, L.E.; Bensen, J.T.; Zhu, Q.; Liu, S.; Yao, S.; Bandera, E.V.; Rosenberg, L.; et al. An exome-wide analysis of low frequency and rare variants in relation to risk of breast cancer in African American Women: The AMBER Consortium. Carcinogenesis 2016, 37, 870–877. [Google Scholar] [CrossRef] [PubMed]
  99. Nordlund, J.; Bäcklin, C.L.; Zachariadis, V.; Cavelier, L.; Dahlberg, J.; Öfverholm, I.; Barbany, G.; Nordgren, A.; Övernäs, E.; Abrahamsson, J.; et al. DNA methylation-based subtype prediction for pediatric acute lymphoblastic leukemia. Clin. Epigenet. 2015, 7, 11. [Google Scholar] [CrossRef]
  100. Kunz, J.B.; Rausch, T.; Bandapalli, O.R.; Eilers, J.; Pechanska, P.; Schuessele, S.; Assenov, Y.; Stütz, A.M.; Kirschner-Schwabe, R.; Hof, J.; et al. Pediatric T-cell lymphoblastic leukemia evolves into relapse by clonal selection, acquisition of mutations and promoter hypomethylation. Haematologica 2015, 100, 1442–1450. [Google Scholar] [CrossRef]
  101. Liapodimitri, A.; Camacho, O.; Lunsford, K.; Tetens, A.R.; Craig-Schwartz, J.; Skalitzky, K.; Rabin, K.R.; Feinberg, A.; Koldobskiy, M. Mapping the Dynamic DNA Methylation Landscape in Pre-B ALL from Diagnosis to Relapse Reveals a Role for Aberrant PRC2 Regulation. Blood 2024, 144, 4127. [Google Scholar] [CrossRef]
  102. Orgueira, A.M.; Krali, O.; Míguez, C.P.; Raíndo, A.P.; Arias, J.Á.D.; Pérez, M.S.G.; Encinas, M.M.P.; Sanmartín, M.F.; Sinnet, D.; Heyman, M.; et al. Refining risk prediction in pediatric acute lymphoblastic leukemia through DNA methylation profiling. Clin. Epigenet. 2024, 16, 49. [Google Scholar] [CrossRef]
  103. Han, S.; Williamson, B.D.; Fong, Y. Improving random forest predictions in small datasets from two-phase sampling designs. BMC Med. Inform. Decis. Mak. 2021, 21, 322. [Google Scholar] [CrossRef]
  104. Barreñada, L.; Dhiman, P.; Timmerman, D.; Boulesteix, A.-L.; Van Calster, B. Understanding overfitting in random forest for probability estimation: A visualization and simulation study. Diagn. Progn. Res. 2024, 8, 14, Erratum in Diagn. Progn. Res. 2025, 9, 9.. [Google Scholar] [CrossRef]
  105. Kim, A.; Benavente, C.A. Oncogenic Roles of UHRF1 in Cancer. Epigenomes 2024, 8, 26. [Google Scholar] [CrossRef]
  106. Zhou, Q.; Li, Z.; Meng, L.; Wang, Y.; Ashaq, M.S.; Li, Y.; Zhao, B. Identification of ubiquitination-related hub genes in chronic myeloid leukemia cell by bioinformatics analysis. J. Cancer 2024, 15, 3750–3759. [Google Scholar] [CrossRef] [PubMed]
  107. Yao, J.; Luo, Y.; Zeng, C.; He, H.; Zhang, X. UHRF1 regulates the transcriptional repressor HBP1 through MIF in T acute lymphoblastic leukemia. Oncol. Rep. 2021, 46, 131. [Google Scholar] [CrossRef] [PubMed]
  108. PDQ Pediatric Treatment Editorial Board. Childhood Acute Lymphoblastic Leukemia Treatment (PDQ®): Health Professional Version. In PDQ Cancer Information Summaries; National Cancer Institute: Bethesda, MD, USA, 2002. [Google Scholar]
  109. Nordlund, J.; Christofer, B.; Raine, A. Chapter 10 Computational and Statistical Analysis of Array-Based DNA Methylation Data. Methods Mol. Biol. 2019, 1878, 173–191. [Google Scholar]
  110. Wu, M.C.; Kuan, P. Chapter 16 A Guide to Illumina BeadChip Data Analysis. Methods Mol. Biol. 2018, 1708, 303–330. [Google Scholar] [PubMed]
  111. Jaffe, A.E.; Murakami, P.; Lee, H.; Leek, J.T.; Fallin, M.D.; Feinberg, A.P.; Irizarry, R.A. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Leuk. Res. 2012, 41, 200–209. [Google Scholar] [CrossRef]
  112. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  113. The Gene Ontology Consortium. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 2019, 47, D330–D338. [Google Scholar] [CrossRef] [PubMed]
  114. Ingenuity Pathway Analysis|QIAGEN Digital Insights. Available online: https://digitalinsights.qiagen.com/products-overview/discovery-insights-portfolio/analysis-and-visualization/qiagen-ipa/ (accessed on 26 September 2025).
  115. Szklarczyk, D.; Franceschini, A.; Wyder, S.; Forslund, K.; Heller, D.; Huerta-Cepas, J.; Simonovic, M.; Roth, A.; Santos, A.; Tsafou, K.P.; et al. STRING v10: Protein–protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015, 43, D447–D452. [Google Scholar] [CrossRef] [PubMed]
  116. Mansell, G.; Gorrie-Stone, T.J.; Bao, Y.; Kumari, M.; Schalkwyk, L.S.; Mill, J.; Hannon, E. Guidance for DNA methylation studies: Statistical insights from the Illumina EPIC array. BMC Genom. 2019, 20, 27. [Google Scholar] [CrossRef] [PubMed]
  117. Cohen, J. Statistical Powe Analysis for the Behavioral Sciences, 2nd ed.; Lawrence Erlbaum Associates: Mahwah, NJ, USA, 1988; Volume 3. [Google Scholar]
Figure 1. Case–control analysis showing differences in methylation levels between pediatric patients with ALL (yellow) and controls (green). (a) Unsupervised hierarchical clustering of differentially methylated CpGs (DMC) based on methylation differences (Δβ) between ALL cases and controls. The heatmap shows a cluster diagram of the top 500 DMCs in ALL patients versus controls. The scale on the right represents the methylation status (red: hypermethylated, blue. hypomethylated). (b) The volcano plot displays the DMCs between cases and controls, with FDR-adjusted <0.05 and absolute Δβ (|Δβ|) >0.2. The x-axis shows the methylation difference (Δβ), and the y-axis represents –log10 of the adjusted p-value. Hypermethylated (red) and hypomethylated (blue) CpGs are highlighted.
Figure 1. Case–control analysis showing differences in methylation levels between pediatric patients with ALL (yellow) and controls (green). (a) Unsupervised hierarchical clustering of differentially methylated CpGs (DMC) based on methylation differences (Δβ) between ALL cases and controls. The heatmap shows a cluster diagram of the top 500 DMCs in ALL patients versus controls. The scale on the right represents the methylation status (red: hypermethylated, blue. hypomethylated). (b) The volcano plot displays the DMCs between cases and controls, with FDR-adjusted <0.05 and absolute Δβ (|Δβ|) >0.2. The x-axis shows the methylation difference (Δβ), and the y-axis represents –log10 of the adjusted p-value. Hypermethylated (red) and hypomethylated (blue) CpGs are highlighted.
Ijms 26 10261 g001
Figure 2. Box plot results of the DNA methylation levels derived from ALL cases are shown on CpG islands of promoter regions (a) and on gene biotypes (b) in ALL cases (yellow) and controls (green). The line inside each box represents the median of the methylation β values of the samples. The upper and lower edges of the boxes are the whiskers (75th percentiles and 25th, respectively), while upper and lower lines outside the boxes are the error bars. p-values < 0.05 (Mann–Whitney test) mean that the differences in DNA methylation between ALL patients and controls are statistically significant. 1stExon: first exon; 5′UTR: 5′ untranslated region; TSS: transcriptional start site; PC: protein coding; lncRNA: long non-coding RNA; miRNA: micro-RNA; TUPsdg: transcribed unitary pseudogene; TPPsdg: transcribed processed pseudogene; TUnPPsdg: transcribed unprocessed pseudogene; UPPsdg: unprocessed pseudogene; PPsdg: processed pseudogene.
Figure 2. Box plot results of the DNA methylation levels derived from ALL cases are shown on CpG islands of promoter regions (a) and on gene biotypes (b) in ALL cases (yellow) and controls (green). The line inside each box represents the median of the methylation β values of the samples. The upper and lower edges of the boxes are the whiskers (75th percentiles and 25th, respectively), while upper and lower lines outside the boxes are the error bars. p-values < 0.05 (Mann–Whitney test) mean that the differences in DNA methylation between ALL patients and controls are statistically significant. 1stExon: first exon; 5′UTR: 5′ untranslated region; TSS: transcriptional start site; PC: protein coding; lncRNA: long non-coding RNA; miRNA: micro-RNA; TUPsdg: transcribed unitary pseudogene; TPPsdg: transcribed processed pseudogene; TUnPPsdg: transcribed unprocessed pseudogene; UPPsdg: unprocessed pseudogene; PPsdg: processed pseudogene.
Ijms 26 10261 g002
Figure 3. Gene set enrichment analysis on the genes showing differentially methylated CpGs between ALL patients and controls using ingenuity pathway analysis (IPA). Plot shows the most significant terms for cellular components, biological processes, and molecular functions from the IPAs of DMC-annotated genes (p-values  <  0.05). X-axis: Statistical significance (−log10(p)), where higher values indicate greater relevance. Bubble size represents the number of genes involved in each pathway. Color indicates the IPA z-score, which predicts the direction of regulation. Orange: significant activation (z-score > 2). Blue: significant inhibition (z-score < −2). Gray: statistically significant pathways (p < 0.05) with no predicted direction of change.
Figure 3. Gene set enrichment analysis on the genes showing differentially methylated CpGs between ALL patients and controls using ingenuity pathway analysis (IPA). Plot shows the most significant terms for cellular components, biological processes, and molecular functions from the IPAs of DMC-annotated genes (p-values  <  0.05). X-axis: Statistical significance (−log10(p)), where higher values indicate greater relevance. Bubble size represents the number of genes involved in each pathway. Color indicates the IPA z-score, which predicts the direction of regulation. Orange: significant activation (z-score > 2). Blue: significant inhibition (z-score < −2). Gray: statistically significant pathways (p < 0.05) with no predicted direction of change.
Ijms 26 10261 g003
Figure 4. Kaplan–Meier survival curves of the most differentially methylated CpGs between ALL detected in the comparative analysis between patients and controls. Patients were categorized into low- and high-methylation groups using the median values of (a) cg01052776, (b) cg01767116, (c) 271 cg05001671, and (d) cg20747787. p-value was calculated using the Cox regression model, and HR indicates hazard ratio. Patients with hypermethylated levels (red lines) in any of these CpGs showed a significantly reduced risk of relapse compared to those with hypomethylated (blue lines).
Figure 4. Kaplan–Meier survival curves of the most differentially methylated CpGs between ALL detected in the comparative analysis between patients and controls. Patients were categorized into low- and high-methylation groups using the median values of (a) cg01052776, (b) cg01767116, (c) 271 cg05001671, and (d) cg20747787. p-value was calculated using the Cox regression model, and HR indicates hazard ratio. Patients with hypermethylated levels (red lines) in any of these CpGs showed a significantly reduced risk of relapse compared to those with hypomethylated (blue lines).
Ijms 26 10261 g004aIjms 26 10261 g004b
Figure 5. Methylation comparative analysis between ALL cases with early relapse (ER) (n = 7) and without relapse (n = 28) showing the hypomethylation and activation of Rho GTPase signaling in the ER group. (a) Volcano plot displaying differentially methylated CpGs (N = 2629) between ER and no-relapse showing statistical significance (p < 0.05). The x-axis shows the methylation difference (Δβ), and the y-axis represents –log10 of the adjusted p-value. Hypermethylated (red) and hypomethylated (blue) CpGs are highlighted. Gray points represent the non-significant DMP. (b) Summary graph generated by IPA comparing ER and non-relapse ALL cases. Orange color predicts activation, and blue color indicates inhibition. Octagons: biological functions; ovals: transcription regulators; squares: cytokines or growth factors; diamonds: G-protein-coupled receptors; triangles: transcription regulators with unknown or indirect roles. Solid lines: direct relationships; dashed lines: indirect relationships; dotted lines: predicted or less certain associations.
Figure 5. Methylation comparative analysis between ALL cases with early relapse (ER) (n = 7) and without relapse (n = 28) showing the hypomethylation and activation of Rho GTPase signaling in the ER group. (a) Volcano plot displaying differentially methylated CpGs (N = 2629) between ER and no-relapse showing statistical significance (p < 0.05). The x-axis shows the methylation difference (Δβ), and the y-axis represents –log10 of the adjusted p-value. Hypermethylated (red) and hypomethylated (blue) CpGs are highlighted. Gray points represent the non-significant DMP. (b) Summary graph generated by IPA comparing ER and non-relapse ALL cases. Orange color predicts activation, and blue color indicates inhibition. Octagons: biological functions; ovals: transcription regulators; squares: cytokines or growth factors; diamonds: G-protein-coupled receptors; triangles: transcription regulators with unknown or indirect roles. Solid lines: direct relationships; dashed lines: indirect relationships; dotted lines: predicted or less certain associations.
Ijms 26 10261 g005
Table 1. Clinical characteristics of pediatric ALL patients.
Table 1. Clinical characteristics of pediatric ALL patients.
Variablen (%)
Sex
Male24 (45)
Female29 (55)
Age
1–9 years46 (87)
>10 years7 (13)
WBC count at diagnosis (1 × 103/µL)
<5032 (60)
>5021 (40)
WHO ALL classification
ETV6::RUNX111 (20)
TCF3::PBX16 (12)
BCR::ABL12 (4)
KMT2A::AFF11 (2)
Undetermined *33 (62)
NCI risk classification at diagnosis
Standard risk28 (53)
High risk25 (47)
WHO Z-BMI
Underweight7 (13)
Normal weight25 (48)
Overweight2 (4)
Obesity11 (20)
No data8 (15)
Death
Yes8 (18)
No38 (82)
Relapse status
Relapse **18(34)
VER11 (61%)
ER7 (39%)
Non-relapse28 (53)
Unknown7 (13)
WBC: white blood cell; WHO: World Health Organization; * Undetermined: 26 (49%) cases were negative to ETV6::RUNX1, TCF3::PBX1, BCR::ABL1, and KMT2A::AFF1, and the remaining 7 (13%) were not tested for the most common fusion genes; Z-BMI: Body Mass Index Z score; NCI: National Cancer Institute; ** Relapse data was available for 46/53 patients; VER: very early relapse; ER: early relapse.
Table 2. Top 20 DMCs defining acute lymphoblastic leukemia.
Table 2. Top 20 DMCs defining acute lymphoblastic leukemia.
CpG IDChromosomeGeneGenomic RegionCpG Island RegionΔβFDR
cg100306586 IGRisland−0.217.30 × 10−14
cg263424544SLC2A95′UTRopensea0.344.59 × 10−13
cg000304205SLC25A2TSS200island−0.262.81 × 10−11
cg0609827617TK1Bodyopensea−0.252.81 × 10−11
cg070073826 IGRopensea0.264.24 × 10−11
cg0399026111SBF2Bodyopensea−0.391.20 × 10−9
cg099744321PTPN14Bodyopensea0.241.52 × 10−9
cg0903975111CHST1Bodyisland−0.411.88 × 10−9
cg1109398018MBD1Bodyshore−0.231.88 × 10−9
cg216366835SLC25A2TSS200island−0.202.09 × 10−9
cg230828837 IGRopensea−0.313.70 × 10−9
cg1942317021APPBodyopensea−0.323.70 × 10−9
cg0070699411OR5A21stExonopensea0.433.70 × 10−9
cg1628443718SALL3Bodyshore−0.234.42 × 10−9
cg1291987321 IGRopensea0.454.98 × 10−9
cg029188725SOX30TSS200island−0.216.97 × 10−9
cg267679167AOAHBodyopensea−0.306.97 × 10−9
cg097644351GALNT2Bodyopensea0.346.97 × 10−9
cg2543223219AURKC5′UTRisland−0.317.01 × 10−9
cg167326631SLC26A9Bodyopensea0.507.36 × 10−9
IGR: Intergenomic region; 5′UTR: 5‘untranslated region; TSS1500/200: 1500/200 base pairs upstream of the transcription start site; 1stExon: first exon.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fong-López, A.A.; Núñez-Enríquez, J.C.; Bekker-Méndez, V.C.; Flores-Lujano, J.; Mata-Rocha, M.; Jiménez-Hernández, E.; Ortíz-Maganda, M.P.; Guerra-Castillo, F.X.; Medina-Sanson, A.; Martín-Trejo, J.A.; et al. Comprehensive Epigenome-Wide Profiling Reveals Distinctive DNA Methylation Signatures and Potential Prognostic Biomarkers in Mexican Pediatric B-ALL. Int. J. Mol. Sci. 2025, 26, 10261. https://doi.org/10.3390/ijms262110261

AMA Style

Fong-López AA, Núñez-Enríquez JC, Bekker-Méndez VC, Flores-Lujano J, Mata-Rocha M, Jiménez-Hernández E, Ortíz-Maganda MP, Guerra-Castillo FX, Medina-Sanson A, Martín-Trejo JA, et al. Comprehensive Epigenome-Wide Profiling Reveals Distinctive DNA Methylation Signatures and Potential Prognostic Biomarkers in Mexican Pediatric B-ALL. International Journal of Molecular Sciences. 2025; 26(21):10261. https://doi.org/10.3390/ijms262110261

Chicago/Turabian Style

Fong-López, Alan Alberto, Juan Carlos Núñez-Enríquez, Vilma Carolina Bekker-Méndez, Janet Flores-Lujano, Minerva Mata-Rocha, Elva Jiménez-Hernández, Mónica Patricia Ortíz-Maganda, Francisco Xavier Guerra-Castillo, Aurora Medina-Sanson, Jorge Alfonso Martín-Trejo, and et al. 2025. "Comprehensive Epigenome-Wide Profiling Reveals Distinctive DNA Methylation Signatures and Potential Prognostic Biomarkers in Mexican Pediatric B-ALL" International Journal of Molecular Sciences 26, no. 21: 10261. https://doi.org/10.3390/ijms262110261

APA Style

Fong-López, A. A., Núñez-Enríquez, J. C., Bekker-Méndez, V. C., Flores-Lujano, J., Mata-Rocha, M., Jiménez-Hernández, E., Ortíz-Maganda, M. P., Guerra-Castillo, F. X., Medina-Sanson, A., Martín-Trejo, J. A., Peñaloza-González, J. G., Velázquez-Aviña, M. M., Torres-Nava, J. R., Espinosa-Elizondo, R. M., Pérez-Saldívar, M. L., Flores-Villegas, L. V., Merino-Pasaye, L. E., Duarte-Rodríguez, D. A., Sepúlveda-Robles, O. A., ... Jiménez-Morales, S. (2025). Comprehensive Epigenome-Wide Profiling Reveals Distinctive DNA Methylation Signatures and Potential Prognostic Biomarkers in Mexican Pediatric B-ALL. International Journal of Molecular Sciences, 26(21), 10261. https://doi.org/10.3390/ijms262110261

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop