2. Material and Methods
Primary tumor samples (n = 63), corresponding nonmalignant lung tissues (n = 63), and matching blood samples (n = 51) were obtained from patients with NSCLC who had been treated in 2009 with curative surgical resection at the Clinic for Pulmonology, University Clinical Center of Serbia. The samples were collected during surgery and immediately snap-frozen for research purposes. Patients had no neoadjuvant chemotherapy or radiotherapy and patients with large cell-carcinoma were not included in the study.
The study was approved by the institution’s ethics committee, and informed consent was obtained from all study participants.
Demographic data were obtained through patient interview, and clinical information was derived from chart review.
Blood samples (5 mL) were collected using EDTA vacutainers and stored at −20 °C. DNA extraction from whole blood samples was performed using the method previously described [
8]. Briefly, blood cells and platelets were lysed by adding equal volumes of lysis buffer (0.32 M sucrose, 10 mM Tris-HCl pH 7.5, 5 mM MgCl
2 and 1% Triton X-100). The lysate underwent centrifugation (3000 rpm for 10 min). Following the removal of supernatant, the pellet was suspended in 3 mL buffer (10 mM Tris-HCl, 0.4 M NaCl, 2 mM EDTA, and 25 μL proteinase K), followed by the addition of SDS (0.7% final concentration). After overnight incubation at 37 °C, 1 mL of 6 M NaCl was added, and the proteins were pelleted by centrifugation. Supernatant containing DNA was transferred into fresh tubes and centrifuged (4000 rpm for 10 min). The supernatant was transferred into new microcentrifuge tubes and an equal volume of isopropanol was added. DNA that became visible was transferred and washed in 1 mL 70% ethanol, air-dried, resuspended in distilled water, analyzed for quality by electrophoresis on agarose gel and quantified by using a Nanodrop spectrophotometer (ThermoScientific Inc., Wilmington, DE, USA). The isolation of DNA from fresh frozen tumors was performed as previously described [
5]. The subsequent laboratory research was carried out at the Masonic Cancer Center, University of Minnesota, USA. The primer sequences and PCR conditions have been described previously [
9,
13,
14,
15]. Amplicons were confirmed via agarose electrophoresis, and methylation percentages were calculated using PyroMark software (Qiagen, Hilden, Germany) version 2.0.8.
The detection of DNA methylation was undertaken based on a treatment of genomic DNA with sodium bisulfite, which converts unmethylated cytosines to uracil, while methylated cytosines stay unaltered. Bisulfite-modified DNA as per manufacturer’s protocol (Zymo Research, Irvine, CA, USA) was then subjected to strand-specific polymerase chain reaction to generate templates for pyrosequencing. In our study, the sequencing depth and coverage were carefully monitored to meet the standards required for methylation analysis. Specifically, we achieved an average sequencing depth of approximately 500–1000 reads per CpG site, with a minimum depth of 200 reads for inclusion in the analysis. This ensured sufficient coverage to accurately detect methylation patterns.
Statistical Analysis
The statistical analyses included calculations for each CpG site and the average across multiple CpG sites for each gene. Since methylation values were not normally distributed, the Wilcoxon signed-rank test was applied to the average methylation across all CpG sites within a gene to assess differences between tumor and normal tissues. When the analysis was repeated using individual CpG sites instead of gene-level averages, the results remained consistent. Tumors were classified as hypermethylated if the average methylation across different CpG sites for a specific gene exceeded the corresponding average methylation in normal tissues by more than three standard deviations, as previously described [
9]. Comparisons of blood methylation levels between patients with hypermethylated tumors and those without were performed using the Wilcoxon rank-sum test.
All analyses were performed using the R package version 0.5.5.
3. Results
The correlations of clinicopathologic characteristics with the hypermethylation of tumor profiles of the NSCLC patients are summarized in
Table 1.
Stage-wise analysis showed that HOXA9 hypermethylation was more prevalent in stage II and III tumors compared to stage I (p = 0.03). No significant stage-wise difference was observed for SOX1 methylation (p = 0.24). In squamous cell lung cancer, hypermethylation in genes SOX1 (p = 0.05) and HOXA9 (p = 0.01) is more frequent than in other histological types of NSCLC.
Patients with stage II and III showed more frequent hypermethylation in gene HOXA9.
There were no significant associations between the methylation status of
SOX1 and HOXA9 and patient gender or smoking status (
Table 1).
The numbers of CpG islands withing the promoters of genes
SOX1 and
HOXA9 were 4 and 11, respectively (
Table 2).
The methylation levels of
SOX1 and
HOXA9 were significantly higher in NSCLC tissue samples compared to normal lung tissues and blood (
Table 3 and
Table 4).
Intraclass correlation coefficients (ICCs) were calculated for both genes using ANOVA, for all tissue types. High values mean great methylation variability between patients.
Higher values were detected in tumors compared to normal lung tissue and blood samples (
Table 5). This is the result of great differences in methylation between tumors, as some tumors were hypermethylated and others were not.
The following tables provide a comprehensive overview of DNA methylation patterns for
SOX1 and
HOXA9—
Table 6 presents the mean methylation values across all CpG islands in promoter regions,
Table 7 summarizes the proportion of hypermethylation across different tissue types, and
Table 8 details the total number of patients exhibiting hypermethylation in tumor DNA.
Figure 1 and
Figure 2 illustrate the methylation levels at individual CpG islands within the
HOXA9 and
SOX1 promoters, respectively, across different tissue types (tumor, normal lung tissue, and blood), highlighting the tissue-specific differences in methylation patterns.
Figure 3 illustrate the difference in the methylation profile of
SOX1 gene between normal lung and tumor tissue in the same patient.
4. Discussion
The elevated methylation levels of
SOX1 and
HOXA9 in tumor samples compared to normal lung tissue and blood samples indicate that these epigenetic alterations are specific to NSCLC. However, the lack of significant methylation differences in blood samples compared to tumor tissue suggests that blood-based assays might not be an indicator of hypermethylation in tumor tissue, and cannot serve as a tool for the early diagnosis of lung cancer. This observation is in accordance with previous studies comparing the hypermethylation of genes in tumor tissues and matched blood samples using bisulfite pyrosequencing in patients with resectable NSCLC [
11,
16,
17]. On the other hand, Wen et al. reported a sensitivity of 75.0% and a specificity of 98.0% when detecting methylated
HOXA9 in blood samples of NSCLC patients with methylated
HOXA9 tumors [
18]. The advanced stage and histology-specific nature of their cohort, as well as its size, may explain the discrepancy with our findings. Methylation differences in tumor and normal lung tissue were also evident [
18].
Although we evaluated the significance of SOX1 and HOXA9 hypermethylation individually, we did not assess the combined effects of the simultaneous hypermethylation of both genes due to the limited sample size. Future studies with larger cohorts are warranted to explore the potential synergistic impacts of co-methylation on patient outcomes.
The HOX gene family, which includes
HOXA9, and the
SRY box gene family, which includes the
SOX1 gene, are crucial for normal embryogenesis [
11,
19]. The hypermethylation of both
HOXA9 and
SOX1 genes has been reported in various malignancies including NSCLC, with the
SOX1 gene being one of the most frequently hypermethylated genes [
11]. The exact roles and mechanisms by which these genes impact carcinogenesis are not yet fully understood. Some data derived from preclinical and experimental studies suggest that the reduced expression of
SOX1 and HOXA9 leads to the disruption of normal processes of cell migration in NSCLC [
20,
21].
SOX1 suppresses Rac1 activity, a key member of the Rho family of GTPases that regulates actin cytoskeletal remodeling and membrane protrusion formation—critical steps in cancer cell motility. The loss of SOX1, as a result of promoter hypermethylation, enhances Rac1-mediated actin dynamics, leading to increased membrane protrusions and elevated cell migration. Since increased cell motility and invasion are key drivers of metastasis, the methylation-induced silencing of
SOX1 likely facilitates tumor dissemination, and is thus associated with poorer clinical outcomes in NSCLC patients [
22].
The role of
SOX1 and
HOXA9 methylation in prognosis remains a subject of ongoing investigation. Hwang et al. found that the hypermethylation of
HOXA9 was associated with disease recurrence in early-stage NSCLC patients [
20]. According to Ben et al., the hypermethylation of
HOXA9 was associated with worse progression free and overall survival among advanced-stage NSCLC patients undergoing PD-1/PD-L1 inhibitor therapy [
22]. In a meta-analysis, Cai et al. found that the hypermethylation of HOXA9 was associated with poor overall survival among patients with solid malignancies including NSCLC [
19]. This contrasts with the more recent findings of Vicente at al., that the hypermethylation of
HOXA9 is not an independent prognostic biomarker of cancer-specific survival among patients with lung adenocarcinoma [
23]. These discrepancies may stem from differences in cohort size, with characteristics including disease stage, histological types, and potential publication bias favoring positive findings [
19,
23,
24]. Vicente et al. also found the that hypermethylation of
HOXA9 was associated with earlier stages of the disease, as was the case with our cohort, which was smaller in comparison to theirs [
23]. This finding, along with that of Hwang et al., that the hypermethylation of
HOXA9 was associated with the disease recurrence of early-stage NSCLC, may serve to guide further trials of implementing hypermethylation in
HOXA9 detection in blood-based assays for minimally invasive screening and monitoring programs. One study found that a methylation panel of six genes including HOXA9 has shown potential for use as a biomarker for early NSCLC detection [
17].
In our previous work, we found that
SOX1 promoter hypermethylation in NSCLC tumors was significantly associated with inferior survival [
11].
This highlights the potential of SOX1 and HOXA9 methylation as prognostic biomarkers, emphasizing the need for further studies to validate their clinical relevance and inform risk stratification.
Ever since DNA methylation changes have been associated with carcinogenesis, efforts have been made to investigate their prognostic capabilities as well as clinical and pathological characteristics in order to inform optimal patient selection. Data regarding the clinical and pathological characteristics of NSCLC patients with hypermethylated SOX1 and HOXA9 remain scarce.
Our analysis has revealed that histology, specifically squamous cell carcinoma, is significantly associated with the hypermethylation of both SOX1 and HOXA9.
While investigating the methylation status of another group of commonly methylated genes in early NSCLC that have shown prognostic capabilities (
RASSF1A,
CDH13,
MGMT,
ESR1 and
DAPK), several groups of authors have reported conflicting results. While some reported that the methylation of
CDH13 was associated with lung adenocarcinoma, others found no difference among major histological types of NSCLC [
9,
25,
26]. Our findings may be influenced by the size of our cohort, and reflect the high prevalence of squamous cell carcinoma in our study.
Additionally,
HOXA9 hypermethylation was more prevalent in patients with stage II and III tumors when compared to stage I, suggesting its potential role in tumor progression and aggressiveness. Considering the mentioned association of the hypermethylation of genes in the tumor tissue but not the normal lung tissue, the prevalence of hypermediated tumor suppressor genes could be expected to rise in later stages of NSCLC [
6]. This is in contrast with the recent finding of Du et al., who found that
HOXA9 hypermethylation was not significantly associated with advanced NSCLC stage when comparing patients in stage IA to the rest of the cohort [
26]. In their study, there was a much higher prevalence of patients with stage I disease and adenocarcinoma in comparison to ours, which may explain this difference in results.
Gao et al. found that the number methylation-positive
SHOX2 and
RASSF1A patients increased with clinicopathological stage of the disease, and the age of the patients affected by lung adenocarcinoma [
6]. While the methylation of certain oncogenes, namely,
RASSF1A was associated with gender in NSCLC patients, in our previous work among the patient cohort from our center, we did not find such an association [
9]. These discrepancies warrant further investigation into the influence of demographic and lifestyle factors on gene methylation patterns in NSCLC, given their potential as prognostic biomarkers.
Although the current study did not directly assess downstream molecular signaling, prior evidence suggests that the hypermethylation-induced silencing of
SOX1 and
HOXA9 may impact key oncogenic pathways in NSCLC.
SOX1 has been shown to act as a tumor suppressor by antagonizing the Wnt/β-catenin signaling pathway. Specifically,
SOX1 can interact with β-catenin and prevent its transcriptional activity, thereby inhibiting Wnt target gene expression and limiting tumor progression [
20]. The epigenetic silencing of
SOX1 through promoter hypermethylation, as observed in our cohort, may thus lead to the aberrant activation of the Wnt pathway, contributing to the increased proliferation and survival of cancer cells. On the other hand,
HOXA9 has been implicated in regulating Notch signaling, particularly in hematologic malignancies and some solid tumors. Aberrant
HOXA9 expression can modulate components of the Notch pathway, including HES1 and DLL4, and influence stemness, differentiation, and cell fate decisions [
27,
28]. While direct evidence in NSCLC is limited, the potential for epigenetically silenced
HOXA9 to disrupt these regulatory networks highlights a plausible mechanism underlying tumor progression. These interactions between epigenetic regulation and developmental signaling pathways represent important areas for future functional studies in lung cancer.
Several studies have shown that
PAX1 and
SOX1 are frequently hypermethylated in cervical neoplasia. Combined methylation analysis of these genes has been effectively used as a screening tool for cervical intraepithelial neoplasia (CIN) and cervical cancer, demonstrating good sensitivity and specificity [
27,
28]. In esophageal squamous cell carcinoma (ESCC),
PAX1 has also been identified as a hypermethylated gene. Methylation markers such as
PAX1, ZNF582, and
SOX1 have been investigated as potential diagnostic biomarkers for ESCC [
29]. Although
HOXA9, a member of the
HOX gene family, is implicated in various malignancies, its specific inclusion in methylation panels for cervical or esophageal cancer is less well documented. Nonetheless, other
HOX family members, such as
HOXC10, have been reported as hypermethylated in ESCC, supporting the broader relevance of
HOX genes in cancer epigenetics [
29]. While both
PAX1 and
HOXA9 (or related
HOX genes) have been studied in the context of cancer-related methylation, their presence in the same diagnostic panels does not imply direct functional cooperation between them. Their co-involvement likely reflects parallel roles in tumor suppressor pathways affected by epigenetic dysregulation, rather than a shared mechanistic interaction.
The potential use of SOX1 and HOXA9 methylation as biomarkers for early detection, prognosis, and risk stratification in NSCLC is promising, but requires further exploration.
Recent advances in blood-based DNA methylation assays offer promising improvements in early lung cancer detection. A study found that a methylation-based risk score performed similarly to the PLCOm2012 model, and combining both enhanced predictive accuracy—highlighting the potential of methylation markers to strengthen existing screening strategies. Additionally, an assay in development, Lung EpiCheck
®, which includes markers such as
HOXA9, has demonstrated strong performance in detecting lung cancer, including early-stage disease, across European and Chinese high-risk populations, supporting its utility as a non-invasive screening tool [
30,
31,
32,
33]. Efforts should be made to investigate the influence of demographic and lifestyle factors on methylation patterns to better understand their prognostic value. Future research should also focus on developing reliable, non-invasive methods for detecting methylation changes, which could facilitate the use of these biomarkers in routine clinical practice. Integrating methylation data with other molecular and clinical parameters could lead to more comprehensive and personalized approaches to NSCLC management. By addressing and bridging these gaps, we could aspire to realize the full potential of methylation biomarkers in improving outcomes for NSCLC patients.
Limitations
While our study provides valuable insights, it is limited by its retrospective nature and the relatively small sample size. Larger, prospective studies are needed to validate our findings and explore the mechanisms driving gene hypermethylation in NSCLC. Current research predominantly involves studies from a single country, which might limit the generalizability of findings to other populations. External validation using larger, multicenter datasets and cohorts from diverse geographic regions is essential to confirm the applicability of the identified methylation patterns in broader NSCLC populations.
Furthermore, in our study, detailed data on comorbidities were not systematically recorded for the patient cohort, which limits our ability to analyze their potential impact on survival outcomes.
Understanding the relationship between SOX1 and HOXA9 methylation and their expression levels would provide valuable insights. Unfortunately, gene expression data for the analyzed tissue and blood samples were not available for this study. We acknowledge this as a limitation, and we plan to incorporate gene expression analyses in future studies to better elucidate the functional impacts of SOX1 and HOXA9 methylation. This will allow us to draw more comprehensive conclusions about the implications of our findings.
Gene expression data for the analyzed tissue and blood samples were not available in this study. Future research integrating methylation and mRNA expression analyses is necessary to validate the functional significance of promoter hypermethylation in SOX1 and HOXA9.
The use of bisulfite pyrosequencing, while highly sensitive and specific, is limited to analyzing preselected CpG sites, and does not provide a genome-wide perspective on methylation changes. Advanced methods such as whole-genome bisulfite sequencing or targeted methylation panels could offer broader insights into the epigenetic landscape of NSCLC and identify additional clinically relevant methylation targets. Additionally, experimental research is needed to elucidate the detailed mechanisms by which HOXA9 and SOX1 methylation influence cancer progression, and to explore its potential as a therapeutic target. There is still a need for more consistent criteria for assessing HOXA9 and SOX1 methylation levels across studies.
Our study did not include molecular testing for EGFR, ALK, or ROS1 mutations, as these analyses were not routinely performed at the time of sample collection. This limits our ability to assess the potential interactions between driver mutations and promoter hypermethylation. Future studies should incorporate comprehensive genomic and epigenomic profiling to provide a more integrated view of tumor biology and prognosis.