Next Article in Journal
The Efficacy of Music Intervention in Patients with Cancer Receiving Radiation Therapy: A Systematic Review and Meta-Analysis
Next Article in Special Issue
Targeting Immune Checkpoint Inhibitors for Non-Small-Cell Lung Cancer: Beyond PD-1/PD-L1 Monoclonal Antibodies
Previous Article in Journal
Fertility Preservation in More than 7000 Male Patients: A Single-Center, Tertiary Care Registry Study over 30 Years
Previous Article in Special Issue
Soluble PD-L1 and Serum Vascular Endothelial Growth Factor-B May Independently Predict Prognosis in Patients with Advanced Non-Small Cell Lung Cancer Treated with Pembrolizumab
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Methylation Biomarkers of Lung Cancer Risk: A Systematic Review and Meta-Analysis

by
Jacopo Dolcini
1,
Manuela Chiavarini
2,*,
Giorgio Firmani
1,*,
Kasey J. M. Brennan
3,
Andres Cardenas
4,
Andrea A. Baccarelli
3 and
Pamela Barbadoro
1
1
Department of Biomedical Sciences and Public Health, Section of Hygiene, Preventive Medicine and Public Health, Polytechnic University of the Marche Region, 60126 Ancona, Italy
2
Department of Health Sciences, University of Florence, 50134 Florence, Italy
3
Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
4
Department of Epidemiology and Population Health, Stanford University, Palo Alto, CA 94305, USA
*
Authors to whom correspondence should be addressed.
Cancers 2025, 17(4), 690; https://doi.org/10.3390/cancers17040690
Submission received: 18 December 2024 / Revised: 4 February 2025 / Accepted: 10 February 2025 / Published: 18 February 2025

Simple Summary

Lung cancer (LC) is among the leading causes of cancer deaths worldwide, and understanding factors that increase risk could improve prevention and early detection efforts. Several indicators, such as epigenetic age and others that analyze methylation levels, have been shown to predict aging and disease risk in relation to LC. Still, there is no clear consensus on how to implement these measures for clinical use or prevention strategies. This review and meta-analysis examines the link between DNAm in blood and the risk of developing LC. By combining data from existing studies, we found that higher levels of DNAm are associated with a greater risk of LC. These results suggest that DNAm and related indicators could be used as biomarkers to identify individuals at higher risk of LC. Further research is needed to clarify how DNAm patterns could be used in LC screening and to understand other factors that might influence this relationship.

Abstract

Background: Lung cancer (LC) is the leading cause of cancer deaths worldwide among both men and women, and represents a major public health challenge. DNA methylation (DNAm) has shown potential in identifying individuals at higher risk of LC, but the overall evidence has not been systematically evaluated. This review and meta-analysis aims to evaluate and summarize existing research on the association between blood DNAm levels and LC risk. Methods: Searches were conducted in PubMed, Web of Science, and Scopus for studies published until February 2024, following PRISMA and MOOSE guidelines. Eleven studies met the eligibility criteria. Results: Using a random effects model, our pooled analysis showed a significant association between increased DNAm levels and LC risk (OR 1.24, 95% CI 1.10–1.39; I2 = 93.90%, p = 0.0001). Stratifying the results by study design showed a stronger association in two prospective cohort studies (OR 1.61; 95% CI 1.36–1.90; I2 = 14.42%, p = 0.32), while case–control studies showed a weaker association (OR 1.05; 95% CI 0.99–1.11; I2 = 70.57%, p = 0.0001). Sensitivity analyses indicated that omitting individual studies did not significantly alter the LC risk estimates. Conclusions: These findings suggest that higher blood DNAm levels are associated with an increased risk of LC, especially in long-term cohort studies. Further research is recommended to explore the potential of DNAm as a screening biomarker for LC and to clarify the role of other influencing factors.

1. Introduction

According to the International Agency for Research on Cancer, with an estimated 2.48 million new cases and 1.81 million deaths per year in 2022 [1], lung cancer (LC) is one of the most frequently diagnosed cancers and the leading cause of cancer-related deaths worldwide. Not only is LC highly prevalent, but it has also shown an increased trend over time [2,3]. Public health campaigns to reduce smoking have contributed to a partial but not sufficient decrease in incidence and mortality in several countries [4,5]. Significative improvements have been made in terms of the general understanding of disease biology, risk factors, the application of predictive biomarkers, and refinements in treatment that have led to remarkable progress for the health outcomes of many patients [6,7]. However, prevention strategies and screening still remain limited, as LC survival has mostly improved due to new treatments rather than screening [3]. Preventive chest X-rays with or without sputum cytology did not show significant results in terms of patient outcomes [8]. Some benefits have been shown through screening focusing on high-risk groups, identified based on exposure to high levels of cigarette smoking [9,10], and using more advanced approaches like low-dose CT [10]. However, these technologies have higher costs and lower availability, especially in low- and middle-income countries. It is well known that LC can also occur in subjects without a history of high-risk exposure, such as smoking [11]. However, these populations are often not effectively reached by conventional prevention campaigns. Therefore, new minimally invasive biomarkers involving blood or sputum sampling have gained interest [12]. From these specimens, biological information can be obtained through epigenetic analyses involving DNA methylation (DNAm).
Over the past decade, there has been growing interest in using epigenetic changes as markers for early cancer detection [13]. Epigenetic dysregulation, including in DNAm, is associated with many tumor types, including LC, and chemotherapy resistance [14]. Based on the hypothesis that DNA from apoptotic cancer cells can be detected in serum and plasma, DNAm changes in several target genes have been evaluated in blood samples of patients with LC. Additionally, some evidence suggests that certain aberrant DNAm in the blood might reflect pathological changes in target tissues that cannot otherwise be identified easily or safely [15]. Other studies that focused on a specific set of genes, such as RASSF1A, CDKN2A, RARb, CDH13, FHIT, and BLU, showed an accordance of methylation levels between blood samples and tumor samples [16]. Following increasing interest in DNAm and other epigenetic mechanisms, in recent years many studies have focused on the development of biomarkers that look at methylation and quantitatively combine DNAm levels at tens to hundreds of genomic locations into composite methylation-based age predictors. These measures often exhibit extremely high correlations with chronological age levels but can assess inter-individual and/or inter-tissue variability in the rate of aging, possibly capturing differences between epigenetic or biological aging and chronological aging [17]. These differences can be related to differential susceptibility to death [18,19] or diseases [20,21,22]. Looking at global DNAm modifications has led to a broader understanding of LC tumorigenesis and has shown promising candidates for early LC detection or for the selection of high-risk subjects with further specific screening methods [23,24]. This is why also epigenetics clocks that are usually built on global methylation analyses have been found to be associated with increased cancer risk [21,25,26], including LC [27]. It must be noted that, in contrast, some other studies encountered a weak association or a lack of one [28,29,30], showing a lack of consensus probably due to several factors such as reverse causation and residual confounding.

2. Materials and Methods

The present meta-analysis was conducted following the MOOSE (Meta-analysis Of Observational Studies in Epidemiology) guidelines [31] and PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) statement [32].
The study protocol was registered in the International Prospective Register of Systematic Reviews (www.crd.york.ac.uk/PROSPERO/, registration No: CRD42024516565) (accessed on 22 February 2024).

2.1. Search Strategy and Data Sources

We carried out a systematic literature search up to 13 February 2024 through the PubMed http://www.ncbi.nlm.nih.gov/pubmed/ (accessed on 14 February 2024), Web of Science http://wokinfo.com/ (accessed on 14 February 2024), and Scopus https://www.scopus.com/ (accessed on 14 February 2024) databases to identify original articles on the association between changes in DNAm and LC. The following key words were used: (DNA methylation) AND (lung cancer risk). The different associations of keywords combined with Boolean operators used for each database are shown in Table S1. No publication date limitation was applied, but due to translation restrictions, only English-language studies were eligible. In addition, to identify additional relevant publications, we manually examined the reference lists of the included articles and recent relevant reviews.

2.2. Eligibility Criteria

Of the selected articles, only those that met the following criteria were included: (i) evaluated the relationship between changes in DNAm and LC; (ii) used a case–control, prospective, or cross-sectional study design; (iii) reported odds ratio (OR), relative risk (RR), or hazard ratio (HR), estimated with 95% confidence intervals (CIs). Studies that aggregated LC cases with other types of cancer were excluded. For each potentially included study, two investigators independently carried out the selection, data extraction, and quality assessment. Disagreements were resolved by discussion or in consultation with a third author. Although useful to have background information, reviews and meta-analyses were excluded. No studies were excluded for weakness of design or data quality.

2.3. Data Extraction and Quality Assessment

For each selected study, we extracted the following information: first author’s last name, year of publication, country, study design, sample size, population characteristics (sex, age, race, BMI, smoking status), duration of follow-up for cohort studies, risk estimates with 95% CIs, type of LC, tissue type, DNAm measure, and confounding factor adjustment. The effect estimates represent a relative change in DNAm rather than an absolute increase measured in specific units, which may be associated with aging as measured by epigenetic clocks, smoking, or the methylation risk score. Our results are derived from the analysis of epigenetic clocks, which are regression models constructed in studies comparing DNA methylation levels between populations diagnosed with lung cancer and control groups without lung cancer. While these studies used diagnosis as a reference for model construction, the resulting regression model is capable of assessing risk based on given methylation levels. When multiple estimates were reported in the article, we extracted those adjusted for the most confounding factors. The Newcastle–Ottawa Scale (NOS) was used to assess the quality of the literature for cohort and case–control studies using a 9-star system, as shown in Tables S2 and S3 [33]. The full score was 9 and a total score ≥ 7 was used to indicate a high-quality study [32].

2.4. Statistical Analysis

The overall effect-size statistic estimate was carried out considering the risk of LC associated with the highest versus the lowest level of changes in DNAm. The risk values of the multivariable models were selected by considering those that took into account the greatest number of potential confounding variables. The relative risk and hazard ratio were taken as an approximation to the OR, and the meta-analysis was performed as if all types of ratios were ORs. Due to the high heterogeneity, a random effects model and inverse variance weighting methods were used to calculate the sum of the OR and the 95% confidence intervals. An effect was considered statistically significant when a two-tailed p < 0.05 was obtained. We further aimed to quantify the effects of DNAm on risk of LC using a dose–response meta-analysis with random effects models [34,35]. To evaluate the dose–response relationship between exposure and outcome, data were extracted from studies that reported results by exposure quartiles (Q1, Q2, Q3, Q4). The outcome variable considered was the odds ratio (OR) for each quartile, with corresponding lower (LL) and upper (UL) 95% confidence intervals. For the reference quartile (Q1), the odds ratio was set to 1 (log(OR) = 0). For the subsequent quartiles (Q2, Q3, Q4), the logarithm of the OR was calculated to provide a linear representation of the dose–response relationship. Variance was calculated using the confidence intervals, following a well-established procedure to estimate uncertainty in meta-analyses [36,37]. Exposure was standardized on a continuous scale, assigning values of 0, 1, 2, and 3 to quartiles Q1, Q2, Q3, and Q4, respectively. This approach is commonly used to explore dose–response relationships in epidemiological data and allows the effect of increasing exposure to be modeled as a continuous variable [35]. A random effects meta-regression was conducted using the metafor package in R [38], employing the restricted maximum likelihood method (REML). The independent variable in the regression was the standardized exposure, with the goal of estimating the incremental effect of exposure in the higher quartiles compared to Q1. The random effects model was chosen to account for heterogeneity among the included studies [39]. Predicted ORs for each quartile were obtained from the meta-regression results. The OR estimates for the subsequent quartiles were compared to Q1 (OR = 1) to assess the dose–response relationship between exposure and outcome. Additionally, confidence intervals were constructed for each prediction. A dose–response plot was created to visualize the relationship between standardized exposure (quartiles) and odds ratios. Quartile Q1 was used as the reference (OR = 1), and the subsequent quartiles were plotted relative to Q1, clearly representing how the odds ratio changes with increasing exposure. This procedure is commonly used to explore and present the dose–risk relationship in epidemiological studies [34].
The chi-square-based Cochran’s Q statistic and the I2 statistic were used to evaluate the heterogeneity in the results across the studies [40]. The I2 statistic yields results ranging from 0% to 100% (I2 = 0–25%, no heterogeneity; I2 = 25–50%, moderate heterogeneity; I2 = 50–75%, large heterogeneity; and I2 = 75–100%, extreme heterogeneity) [41]. The results of the meta-analysis may be biased if the probability of publication is dependent on the study results.
We used the methods of Begg and Mazumdar [42] and Egger et al. [43] to detect publication bias. Both methods tested for funnel plot asymmetry, with the former being based on the rank correlation between the effect estimates and their sampling variances and the latter on the linear regression of a standard normal deviate and its precision. If a potential bias was detected, we conducted a further sensitivity analysis to assess the strength of the combined effect estimates and the possible influence of the bias, and to have the bias corrected. We also conducted a sensitivity analysis to investigate the influence of a single study on the overall risk estimate by omitting one study in each turn. We considered the funnel plot to be asymmetrical if the intercept of Egger’s regression line deviated from zero, with a p-value < 0.05. The analyses were performed using the ProMeta 3 statistical program and the calculations on the data extracted from the original papers were performed using STATA 13.3.

3. Results

3.1. Study Selection

The literature search revealed 582 studies from the PubMed database, 955 from Web of Science, and 829 from Scopus. After removing duplicates (n = 794), we identified 1572 records screened for title and abstract revision. Among these, 1546 articles were excluded (reviews, pooled or meta-analyses, commentaries, and case studies) (Figure 1).
Therefore, 26 studies were subjected to full-text revision. We identified one additional study through the reference lists of both the selected articles and recent relevant reviews. Subsequently, 16 articles were excluded because they did not meet the inclusion criteria, since they were studies on single-gene methylation, on animal models, or focused on the promoter methylation of specific genes (Table S4). We excluded articles that addressed these points from this systematic review and meta-analysis for the following reasons: methylation of a single gene was not the primary focus of methylation regression models; this systematic review and meta-analysis focused on human studies and not animal models; methylation as an epigenetic mechanism extends beyond the promoter region; and the NOS is not applicable to randomized studies such as Mendelian randomization. At the end of the selection process, 11 studies were eligible for inclusion in the systematic review and meta-analysis [27,28,44,45,46,47,48,49,50,51,52].

3.2. Study Characteristics and Quality Assessment

The general characteristics of the 11 studies evaluating the association between DNAm and LC risk, as well as the relative populations included in the meta-analysis, are shown, respectively, in Table 1.
Studies were conducted in the United States of America [45,48,52]; Australia [27,44,46,49,51]; Germany [47,50]; Italy [44]; Scotland [28]; and Norway [44]. There was one case–control study [45], eight nested case–control studies [27,44,46,47,48,51,52], and two cohort studies [28,50]. Studies were published from 2015 to 2023, and all of them were conducted in adults aged ≥50 years old.
The health outcome investigated was the risk incidence of LC. The study-specific quality scores of selected studies are shown in the last column on the right of Table 1. The quality scores range from 7 to 9 (median: 8; mean: 8) in Table 1.

3.3. Meta-Analysis of DNAm

Regarding DNAm (Table 2 and Figure 2), we found an overall increased risk associated with greater values of DNAm (OR = 1.24 95% C.I. 1.10–1.39) that showed a stronger association and statistical significance when focusing only on cohort studies (OR = 1.61, 95% C.I. 1.36–1.90). Looking at stratification according to smoking status, we found a statically significant risk increase both for former smokers (OR = 1.58; 95% C.I. 1.41–1.77) and current smokers (OR = 1.60; 95% C.I. 1.43–1.79). A stratification based on follow-up time period revealed a statistically significant increased likelihood of LC risk with greater DNAm with a follow-up of ≤ 5 years (OR = 1.46; 95% C.I. 1.08–1.98) and 5–10 years (OR = 1.20; 95% C.I. 1.06–1.36). We then stratified the several different types of epigenic clocks and indicators according to similar characteristics (if they measured accelerated age (AA) or intrinsic epigenetic age acceleration (IEAA)), but we did not observe any associations for grouped indicators. All DNAm measure definitions, which are taken in consideration, are described in Table S5. Regarding the single epigenetic clock (Table 2), we observed a statistical association only for the epigenetic age acceleration of GrimAge (OR = 1.97; 95% C.I. 1.57–2.47). Because of the small amount of data, no further stratification according to other variables such as body mass index (BMI), age, or race was possible. Concerning heterogeneity, the analysis revealed significant heterogeneity across the studies. In the combined analysis (ALL), the heterogeneity was high (Q = 163.95, I2 = 93.90%, p < 0.001), indicating substantial variability between studies. This high level of heterogeneity was consistent across various subgroup analyses, including case–control studies (Q = 350, I2 = 70.57%, p < 0.001) and cohort studies (Q = 7.01, I2 = 14.42%, p = 0.32), though the latter showed lower heterogeneity. For follow-up periods, studies with ≤5 years showed moderate heterogeneity (Q = 4.92, I2 = 59.36%, p = 0.09), studies with a 5–10-year follow-up showed no heterogeneity (Q = 1.27, I2 = 0.00%, p = 0.53), and studies with a follow-up of >10 years showed high heterogeneity (Q = 12.97, I2 = 76.87%, p = 0.005).

3.4. Sensitivity Analysis of DNAm

Sensitivity analyses investigating the influence of a single study on the LC risk estimates suggested that these were not substantially driven by any single study. Indeed, the LC risk estimates ranged from 1.07 (95% CI 1.01–1.13, p = 0.025), omitting the study of Yu et al. [47], to 1.17 (95% CI 1.10–1.24, p < 0.001), omitting the study of Michaud et al. [52].

3.5. Publication Bias of DNAm

Publication bias was detected for ALL and cohort studies with Egger’s method and for current smoking status with Begg’s method (Table 2 and Figure 3).

3.6. Dose–Response

Three articles were identified for dose–response analysis [47,50,52]. The other studies were excluded because they did not include division by quartiles [27,28,44,45,46,48,49,51].
The dose–response relationship between methylation levels, expressed as standardized quartiles, and LC risk is presented in Figure 4. The x-axis represents the standardized exposure levels corresponding to methylation quartiles Q1 (reference), Q2, Q3, and Q4, while the y-axis represents the odds ratio (OR) for LC risk on a linear scale. The red dashed horizontal line at OR = 1 marks the reference category (Q1).
The black solid line in the figure illustrates a slight increase in ORs across the quartiles, indicating a potential rise in LC risk with increasing methylation levels. The blue dashed lines represent the 95% confidence intervals, which widen at higher quartiles, suggesting increased uncertainty in the estimates for the upper methylation quartiles.
The mixed effects meta-regression model indicated that individuals in the lowest methylation quartile (Q1) had reduced odds of LC, with an OR of 0.77 (95% CI: 0.63 to 0.94). This suggests that individuals in Q1 have about 23% lower odds of developing LC compared to the higher quartiles.
For each one-unit increase in standardized exposure (i.e., moving from one quartile to the next), the odds ratio for LC was 1.03 (95% CI: 0.95 to 1.13). This indicates a slight, but non-significant, increase in LC risk with higher methylation quartiles.
The wide confidence intervals in the higher quartiles indicate some uncertainty in the risk estimates, and no conclusive dose–response pattern was observed.

4. Discussion

Our results are consistent with the growing body of evidence in the literature that links changes in DNAm patterns or biomarkers with LC risk [53]. These methylation changes have been deeply investigated, with a particular focus on specific regions, such as the promoter region of the CDKN2A gene (Cyclin-Dependent Kinase Inhibitor 2A), which has been identified as one of the first potential new biomarkers for the early detection of LC [52,54]. In addition, a more comprehensive approach has been taken through modern measures, such as epigenetic clocks, which can also be related to LC [27,45], even if some studies did not find a strong association [46,52]. We found a strong significant association for the overall effect and for several types of stratification, including smoking status (Table 2). This is in agreement with previous studies that found a stable epigenetic signature of cigarette smoking, both in current and past smokers [55,56,57]. Interestingly, the follow-up period (Table 2) seems to represent an important discrimination in terms of DNAm biomarkers and LC risk: for time periods ≤ 5 years and from 5 to 10 years of observation, the risk increases in a statistically significant manner with a higher risk in the ≤5 years class. Then, the risk decreases and is no longer statistically significant after ten years. It can be hypothesized that epigenetic modifications have a much higher impact on LC risk within the first five years of their occurrence, and their effect gradually decreases over time. This is in accordance with some studies that observed that some methylation changes induced by smoking can reverse within 5 years of quitting. Anyway, a large portion of CpG sites remain persistently altered for decades. These findings highlight both the substantial reversibility of smoking-related epigenetic modifications and the potential for long-lasting effects at specific loci [55]. We then stratified our results by grouping several epigenetic clocks that, while built on different premises, share some common characteristics. For example, the Horvath clock and Hannum clock are built by regressing DNAm on chronologic age, but the first is a pan-tissue clock, originally constructed utilizing CpG sites across 51 human cell types and tissues, while the Hannum clock was developed by regressing only peripheral blood samples, and includes 71 CpG sites [58,59]. The PhenoAge clock was created using blood samples, similar to Hannum, but it regresses DNAm states on clinical biomarkers rather than on chronologic age and incorporates 513 CpG sites [21]. Moreover, we created groups considering the papers analyzed in our investigation: the authors adjusted these clocks by regressing DNAm age on chronological age and calculating the difference between the observed chronological age and the fitted DNAm age, obtaining DNAm age acceleration (AA) [51]. Intrinsic epigenetic age acceleration (IEAA) was then derived using the residuals from the linear regression of DNAm age on chronological age, further adjusted for estimated blood cell composition [51]. These adjustments allowed for more comparable indicators. Despite these attempts, only the GrimAge epigenetic age acceleration, calculated on the residuals from regression models of GrimAge estimates on chronological age [27], showed a statistically significant increased risk of LC incidence. These results may be due to the nature of the epigenetic clocks built in a two-stage process, where the first stage involved the inclusion of DNAm-based surrogate biomarkers such as smoking pack-years [60]. The lack of significance for the stratification and grouping of the several different epigenetic measures may be due to both the low number of studies that were eligible for our analyses and differences in the biological elements and aspects that are captured by each of these clocks, even if they share common characteristics, as previously mentioned. Interestingly, the dose–response analysis showed a reduced likelihood of incidence for LC risk for subjects in the first quartile that was statistically significant. While not statistically significant, increasing quartiles of methylation levels showed a trend of increased risk. This increased risk is in agreement with the overall effect we encountered, and the loss of significance may be due to the fact that the dose–response analyses was performed on a reduced number of studies, i.e., those that reported quartiles of methylation levels. From a biological point of view, smoking is known to promote DNA methylation at certain loci, leading to the silencing of tumor suppressor genes and the activation of oncogenes. Over time, these epigenetic alterations can accumulate, potentially increasing the risk of lung cancer even in the absence of a direct dose–response pattern in the available data. This is consistent with epidemiological findings showing that the longer a person smokes, the higher their lung cancer risk.
Furthermore, the complexity of epigenetic regulation means that methylation changes may have non-linear effects on gene expression, making it challenging to detect a straightforward dose–response trend in case–control studies. Other confounding factors, such as inter-individual variability in epigenetic plasticity and the interplay with additional genetic and environmental influences, may further obscure a simple dose–response relationship [55]. Moreover, the heterogeneity of biomarkers may have influenced this result since some are aging clocks while others are biomarkers of smoking and/or methylation risk scores. It is possible that the association is driven primarily by those biomarkers, such as GrimAge, that have been built on smoking habits but end up more strongly associated with smoking effects than smoking itself.

Strengths and Limitations

To the best of our knowledge, this is the first study to conduct a systematic review and metanalysis of several DNAm measures in relation to the risk of LC incidence, representing a comprehensive, if incomplete, approach to better understanding the relationship of this possible promising biomarker with LC. Moreover, all studies included were of high quality, scoring above 7 on the NOS, ensuring reliability, even if we are aware of their possible limitations, especially in terms of the inter-rater agreement between observers of the NOS [61,62]. All articles included in this analysis were published within the last decade, reflecting the most current research in this field. In addition, all studies included had a considerable time range of follow-up of up to 10 years that allowed the proper stratification of our results. However, several limitations should be acknowledged. The considerable heterogeneity observed across studies can be attributed to differences in several analytic factors affecting the several different types of epigenetic clocks and indicators that were used in the selected studies. Another limitation is the potential impact of intra-tumoral epigenetic heterogeneity, which was not directly assessed in the included studies. The studies included in our analysis did not directly assess this variability at the tumor level, as they relied on blood-derived DNAm measurements rather than intra-tumoral sampling. The reliance on blood-derived DNAm measurements means that variability in tumor subclones could indirectly influence the findings. The release of DNA from apoptotic tumor cells into the bloodstream may reflect diverse tumor subclones, potentially introducing variability in the DNAm patterns detected. To mitigate this, we applied random effects models to account for between-study variability and performed subgroup analyses. Anyway, future research integrating data from both tumor tissues and matched blood samples, as well as single-cell epigenomic technologies, is needed to better understand the role of intra-tumoral heterogeneity in systemic DNAm signatures and their potential as biomarkers.
Additionally, due to limited data availability, further stratification according to BMI, age, socioeconomic status, or other demographic characteristics was not possible.

5. Conclusions

This systematic review and meta-analysis suggest that DNAm is a potentially promising biomarker for LC risk prediction. However, more studies are needed to better identify stratification algorithms and strategies to develop specific epigenetic clocks and measures, which must be validated in different populations. The literature shows an increasing number of epigenetic measures, and systematic organization and classification are needed both to reduce heterogeneity among different studies and to enable these tools to be used in clinical practice and prevention campaigns. Public health systems and policymakers should increase efforts to apply evidence-based medicine in the evaluation context of epigenetic tools for LC risk prevention, ensuring that all citizens can access these potentially powerful instruments as reliable and responsible resources for their health.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers17040690/s1. Table S1: Search strategy; Table S2: Newcastle–Ottawa Scale of the studied selected cohort studies; Table S3: Newcastle–Ottawa Scale of the studied selected case–control studies; Table S4: Reasons for the exclusion of studies; Table S5: Definition of DNA methylation measure. References [63,64] were cited in Supplementary Materials.

Author Contributions

Conceptualization, J.D. and M.C.; methodology, J.D., M.C. and G.F.; software, J.D. and G.F.; validation, J.D., M.C. and G.F.; formal analysis, J.D., M.C. and G.F.; investigation, J.D., M.C. and G.F.; data curation, J.D., M.C. and G.F.; writing—original draft preparation, J.D., M.C., G.F., A.C., K.J.M.B., A.A.B. and P.B.; writing—review and editing, J.D., M.C., G.F., A.C., K.J.M.B., A.A.B. and P.B.; supervision, P.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Acknowledgments

All work was completed at the Polytechnic University of the Marche Region, Ancona, Italy. We want to thank our home institution for their financial support.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. International Agency for Research on Cancer. Cancer Today. Available online: https://gco.iarc.fr/today/en/dataviz/pie?mode=cancer&group_populations=1&types=0 (accessed on 3 June 2024).
  2. Chiavarini, M.; Rosignoli, P.; Sorbara, B.; Giacchetta, I.; Fabiani, R. Benzene Exposure and Lung Cancer Risk: A Systematic Review and Meta-Analysis of Human Studies. Int. J. Environ. Res. Public Health 2024, 21, 205. [Google Scholar] [CrossRef] [PubMed]
  3. Thai, A.A.; Solomon, B.J.; Sequist, L.V.; Gainor, J.F.; Heist, R.S. Lung cancer. Lancet 2021, 398, 535–554. [Google Scholar] [CrossRef]
  4. Siegel, R.L.; Miller, K.D.; Jemal, A. Cancer statistics, 2020. CA Cancer J. Clin. 2020, 70, 7–30. [Google Scholar] [CrossRef] [PubMed]
  5. Jemal, A.; Center, M.M.; DeSantis, C.; Ward, E.M. Global Patterns of Cancer Incidence and Mortality Rates and Trends. Cancer Epidemiol. Biomark. Prev. 2010, 19, 1893–1907. [Google Scholar] [CrossRef]
  6. Howlader, N.; Forjaz, G.; Mooradian, M.J.; Meza, R.; Kong, C.Y.; Cronin, K.A.; Mariotto, A.B.; Lowy, D.R.; Feuer, E.J. The Effect of Advances in Lung-Cancer Treatment on Population Mortality. N. Engl. J. Med. 2020, 383, 640–649. [Google Scholar] [CrossRef] [PubMed]
  7. Fabiani, R.; La Porta, G.; Cavoli, L.L.; Rosignoli, P.; Chiavarini, M. Adherence to Data-Driven Dietary Patterns and Lung Cancer Risk: A Systematic Review and Dose–Response Meta-Analysis. Nutrients 2023, 15, 4406. [Google Scholar] [CrossRef] [PubMed]
  8. Humphrey, L.L.; Teutsch, S.; Johnson, M. Lung Cancer Screening with Sputum Cytologic Examination, Chest Radiography, and Computed Tomography: An Update for the U.S. Preventive Services Task Force. Ann. Intern. Med. 2004, 140, 740–753. [Google Scholar] [CrossRef]
  9. National Lung Screening Trial Research Team; Aberle, D.R.; Adams, A.M.; Berg, C.D.; Black, W.C.; Clapp, J.D.; Fagerstrom, R.M.; Gareen, I.F.; Gatsonis, C.; Marcus, P.M.; et al. Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. N. Engl. J. Med. 2011, 365, 395–409. [Google Scholar] [CrossRef] [PubMed]
  10. De Koning, H.J.; Van Der Aalst, C.M.; De Jong, P.A.; Scholten, E.T.; Nackaerts, K.; Heuvelmans, M.A.; Lammers, J.-W.J.; Weenink, C.; Yousaf-Khan, U.; Horeweg, N.; et al. Reduced Lung-Cancer Mortality with Volume CT Screening in a Randomized Trial. N. Engl. J. Med. 2020, 382, 503–513. [Google Scholar] [CrossRef] [PubMed]
  11. Pinheiro, P.S.; Callahan, K.E.; Medina, H.N.; Koru-Sengul, T.; Kobetz, E.N.; Gomez, S.L.; Lopes, G.d.L. Lung cancer in never smokers: Distinct population-based patterns by age, sex, and race/ethnicity. Lung Cancer 2022, 174, 50–56. [Google Scholar] [CrossRef]
  12. Fabiani, R.; Chiavarini, M.; Rosignoli, P.; Giacchetta, I. Leucocyte Telomere Length and Lung Cancer Risk: A Systematic Review and Meta-Analysis of Prospective Studies. Cancers 2024, 16, 3218. [Google Scholar] [CrossRef] [PubMed]
  13. Ramazi, S.; Dadzadi, M.; Sahafnejad, Z.; Allahverdi, A. Epigenetic regulation in lung cancer. Medcomm 2023, 4, e401. [Google Scholar] [CrossRef] [PubMed]
  14. Jones, P.A.; Baylin, S.B. The Epigenomics of Cancer. Cell 2007, 128, 683–692. [Google Scholar] [CrossRef]
  15. Levenson, V.V. DNA methylation as a universal biomarker. Expert Rev. Mol. Diagn. 2010, 10, 481–488. [Google Scholar] [CrossRef] [PubMed]
  16. Hsu, H.; Chen, T.; Hung, C.; Wen, C.; Lin, R.; Lee, H.; Wang, Y. Characterization of a multiple epigenetic marker panel for lung cancer detection and risk assessment in plasma. Cancer 2007, 110, 2019–2026. [Google Scholar] [CrossRef]
  17. Liu, Z.; Leung, D.; Thrush, K.; Zhao, W.; Ratliff, S.; Tanaka, T.; Schmitz, L.L.; Smith, J.A.; Ferrucci, L.; Levine, M.E. Underlying features of epigenetic aging clocks in vivo and in vitro. Aging Cell 2020, 19, 13229. [Google Scholar] [CrossRef]
  18. Field, A.E.; Robertson, N.A.; Wang, T.; Havas, A.; Ideker, T.; Adams, P.D. DNA Methylation Clocks in Aging: Categories, Causes, and Consequences. Mol. Cell 2018, 71, 882–895. [Google Scholar] [CrossRef] [PubMed]
  19. Chen, B.H.; Marioni, R.E.; Colicino, E.; Peters, M.J.; Ward-Caviness, C.K.; Tsai, P.C.; Roetker, N.S.; Just, A.C.; Demerath, E.W.; Guan, W.; et al. DNA methylation-based measures of biological age: Meta-analysis predicting time to death. Aging 2016, 8, 1844–1865. [Google Scholar] [CrossRef] [PubMed]
  20. Ambatipudi, S.; Horvath, S.; Perrier, F.; Cuenin, C.; Hernandez-Vargas, H.; Le Calvez-Kelm, F.; Durand, G.; Byrnes, G.; Ferrari, P.; Bouaoun, L.; et al. DNA methylome analysis identifies accelerated epigenetic ageing associated with postmenopausal breast cancer susceptibility. Eur. J. Cancer 2017, 75, 299–307. [Google Scholar] [CrossRef]
  21. Levine, M.E.; Lu, A.T.; Quach, A.; Chen, B.H.; Assimes, T.L.; Bandinelli, S.; Hou, L.; Baccarelli, A.A.; Stewart, J.D.; Li, Y.; et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging 2018, 10, 573–591. [Google Scholar] [CrossRef]
  22. Yang, Z.; Wong, A.; Kuh, D.; Paul, D.S.; Rakyan, V.K.; Leslie, R.D.; Zheng, S.C.; Widschwendter, M.; Beck, S.; Teschendorff, A.E. Correlation of an epigenetic mitotic clock with cancer risk. Genome Biol. 2016, 17, 205. [Google Scholar] [CrossRef] [PubMed]
  23. Mehta, A.; Dobersch, S.; Romero-Olmedo, A.J.; Barreto, G. Epigenetics in lung cancer diagnosis and therapy. Cancer Metastasis Rev. 2015, 34, 229–241. [Google Scholar] [CrossRef]
  24. Balgkouranidou, I.; Liloglou, T.; Lianidou, E.S. Lung Cancer Epigenetics: Emerging Biomarkers. Biomark. Med. 2013, 7, 49–58. [Google Scholar] [CrossRef] [PubMed]
  25. Zheng, Y.; Joyce, B.T.; Colicino, E.; Liu, L.; Zhang, W.; Dai, Q.; Shrubsole, M.J.; Kibbe, W.A.; Gao, T.; Zhang, Z.; et al. Blood Epigenetic Age may Predict Cancer Incidence and Mortality. eBioMedicine 2016, 5, 68–73. [Google Scholar] [CrossRef]
  26. Kresovich, J.K.; Xu, Z.; O’brien, K.M.; Weinberg, C.R.; Sandler, D.P.; Taylor, J.A. Epigenetic mortality predictors and incidence of breast cancer. Aging 2019, 11, 11975–11987. [Google Scholar] [CrossRef] [PubMed]
  27. Dugué, P.-A.; Bassett, J.K.; Wong, E.M.; Joo, J.E.; Li, S.; Yu, C.; Schmidt, D.F.; Makalic, E.; Doo, N.W.; Buchanan, D.D.; et al. Biological Aging Measures Based on Blood DNA Methylation and Risk of Cancer: A Prospective Study. JNCI Cancer Spectr. 2020, 5, pkaa109. [Google Scholar] [CrossRef]
  28. Hillary, R.F.; Stevenson, A.J.; McCartney, D.L.; Campbell, A.; Walker, R.M.; Howard, D.M.; Ritchie, C.W.; Horvath, S.; Hayward, C.; McIntosh, A.M.; et al. Epigenetic measures of ageing predict the prevalence and incidence of leading causes of death and disease burden. Clin. Epigenetics 2020, 12, 115. [Google Scholar] [CrossRef]
  29. Wang, C.; Ni, W.; Yao, Y.; Just, A.; Heiss, J.; Wei, Y.; Gao, X.; Coull, B.A.; Kosheleva, A.; Baccarelli, A.A.; et al. DNA methylation-based biomarkers of age acceleration and all-cause death, myocardial infarction, stroke, and cancer in two cohorts: The NAS, and KORA F4. eBioMedicine 2021, 63, 103151. [Google Scholar] [CrossRef]
  30. Durso, D.F.; Bacalini, M.G.; Sala, C.; Pirazzini, C.; Marasco, E.; Bonafé, M.; Valle, Í.F.D.; Gentilini, D.; Castellani, G.; Faria, A.M.C.; et al. Acceleration of leukocytes’ epigenetic age as an early tumor and sex-specific marker of breast and colorectal cancer. Oncotarget 2017, 8, 23237–23245. [Google Scholar] [CrossRef] [PubMed]
  31. Stroup, D.F.; Berlin, J.A.; Morton, S.C.; Olkin, I.; Williamson, G.D.; Rennie, D.; Moher, D.; Becker, B.J.; Sipe, T.A.; Thacker, S.B. Meta-analysis of observational studies in epidemiology: A proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group. JAMA 2000, 283, 2008–2012. [Google Scholar] [CrossRef] [PubMed]
  32. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. J. Clin. Epidemiol. 2021, 134, 178–189. [Google Scholar] [CrossRef] [PubMed]
  33. Ottawa Hospital Research Institute. (n.d.). Available online: http://www.ohri.ca/programs/clinical_epidemiology/oxford.asp (accessed on 28 January 2025).
  34. Orsini, N.; Bellocco, R.; Greenland, S. Generalized Least Squares for Trend Estimation of Summarized Dose–response Data. Stata J. Promot. Commun. Stat. Stata 2006, 6, 40–57. [Google Scholar] [CrossRef]
  35. Greenland, S. Dose-response and trend analysis in epidemiology: Alternatives to categorical analysis. Epidemiology 1995, 6, 356–365. [Google Scholar] [CrossRef]
  36. Borenstein, M.; Hedges, L.V.; Higgins, J.P.; Rothstein, H.R. Introduction to Meta-Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2021; pp. 1–421. [Google Scholar] [CrossRef]
  37. The Cochrane Collaboration ®. Cochrane Handbook for Systematic Reviews of Interventions; The Cochrane Collaboration ®: London, UK, 2019. [Google Scholar]
  38. Viechtbauer, W. Conducting Meta-Analyses in R with the metafor Package. J. Stat. Softw. 2010, 36, 1–48. [Google Scholar] [CrossRef]
  39. DerSimonian, R.; Laird, N. Meta-analysis in clinical trials. Control Clin. Trials 1986, 7, 177–188. [Google Scholar] [CrossRef]
  40. Higgins, J.P.T.; Thompson, S.G. Quantifying heterogeneity in a meta-analysis. Stat. Med. 2002, 21, 1539–1558. [Google Scholar] [CrossRef] [PubMed]
  41. Higgins, J.P.T.; Thompson, S.G.; Deeks, J.J.; Altman, D.G. Measuring inconsistency in meta-analyses. BMJ 2003, 327, 557–560. [Google Scholar] [CrossRef]
  42. Begg, C.B.; Mazumdar, M. Operating Characteristics of a Rank Correlation Test for Publication Bias. Biometrics 1994, 50, 1088–1101. [Google Scholar] [CrossRef]
  43. Egger, M.; Smith, G.D.; Schneider, M.; Minder, C. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997, 315, 629–634. [Google Scholar] [CrossRef] [PubMed]
  44. Gagliardi, A.; Dugué, P.-A.; Nøst, T.H.; Southey, M.C.; Buchanan, D.D.; Schmidt, D.F.; Makalic, E.; Hodge, A.M.; English, D.R.; Doo, N.W.; et al. Stochastic Epigenetic Mutations Are Associated with Risk of Breast Cancer, Lung Cancer, and Mature B-cell Neoplasms. Cancer Epidemiol. Biomark. Prev. 2020, 29, 2026–2037. [Google Scholar] [CrossRef]
  45. Levine, M.E.; Hosgood, H.D.; Chen, B.; Absher, D.; Assimes, T.; Horvath, S. DNA methylation age of blood predicts future onset of lung cancer in the women’s health initiative. Aging 2015, 7, 690–700. [Google Scholar] [CrossRef] [PubMed]
  46. Dugué, P.; Bassett, J.K.; Joo, J.E.; Jung, C.; Wong, E.M.; Moreno-Betancur, M.; Schmidt, D.; Makalic, E.; Li, S.; Severi, G.; et al. DNA methylation-based biological aging and cancer risk and survival: Pooled analysis of seven prospective studies. Int. J. Cancer 2018, 142, 1611–1619. [Google Scholar] [CrossRef]
  47. Yu, H.; Raut, J.R.; Schöttker, B.; Holleczek, B.; Zhang, Y.; Brenner, H. Individual and joint contributions of genetic and methylation risk scores for enhancing lung cancer risk strati-fication: Data from a population-based cohort in Germany. Clin. Epigenetics 2020, 12, 89. [Google Scholar] [CrossRef] [PubMed]
  48. Zhao, N.; Ruan, M.; Koestler, D.C.; Lu, J.; Salas, L.A.; Kelsey, K.T.; Platz, E.A.; Michaud, D.S. Methylation-derived inflammatory measures and lung cancer risk and survival. Clin. Epigenetics 2021, 13, 222. [Google Scholar] [CrossRef] [PubMed]
  49. Dugué, P.-A.; Hodge, A.M.; Wong, E.M.; Joo, J.E.; Jung, C.-H.; Hopper, J.L.; English, D.R.; Giles, G.G.; Milne, R.L.; Southey, M.C. Methylation marks of prenatal exposure to maternal smoking and risk of cancer in adulthood. Int. J. Epidemiol. 2021, 50, 105–115. [Google Scholar] [CrossRef] [PubMed]
  50. Li, X.; Schöttker, B.; Holleczek, B.; Brenner, H. Associations of DNA methylation algorithms of aging and cancer risk: Results from a prospective cohort study. eBioMedicine 2022, 81, 104083. [Google Scholar] [CrossRef]
  51. Dugué, P.A.; Yu, C.; Hodge, A.M.; Wong, E.M.; Joo, J.E.; Jung, C.H.; Southey, M.C. Methylation scores for smoking, alcohol consumption, and body mass index and risk of seven types of cancer. Int. J. Cancer 2023, 153, 489–498. [Google Scholar] [CrossRef] [PubMed]
  52. Michaud, D.S.; Chung, M.; Zhao, N.; Koestler, D.C.; Lu, J.; Platz, E.A.; Kelsey, K.T. Epigenetic age and lung cancer risk in the CLUE II prospective cohort study. Aging 2023, 15, 617–629. [Google Scholar] [CrossRef]
  53. Hong, Y.; Kim, W.J. DNA Methylation Markers in Lung Cancer. Curr. Genom. 2021, 22, 79–87. [Google Scholar] [CrossRef]
  54. Belinsky, S.A.; Nikula, K.J.; Palmisano, W.A.; Michels, R.; Saccomanno, G.; Gabrielson, E.; Baylin, S.B.; Herman, J.G. Aberrant methylation of p16 INK4a is an early event in lung cancer and a potential biomarker for early diagnosis. Proc. Natl. Acad. Sci. USA 1998, 95, 11891–11896. [Google Scholar] [CrossRef]
  55. Joehanes, R.; Just, A.C.; Marioni, R.E.; Pilling, L.C.; Reynolds, L.M.; Mandaviya, P.R.; Guan, W.; Xu, T.; Elks, C.E.; Aslibekyan, S.; et al. Epigenetic Signatures of Cigarette Smoking. Circ. Cardiovasc. Genet. 2016, 9, 436–447. [Google Scholar] [CrossRef]
  56. McCartney, D.L.; Stevenson, A.J.; Hillary, R.F.; Walker, R.M.; Bermingham, M.L.; Morris, S.W.; Clarke, T.-K.; Campbell, A.; Murray, A.D.; Whalley, H.C.; et al. Epigenetic signatures of starting and stopping smoking. eBioMedicine 2018, 37, 214–220. [Google Scholar] [CrossRef]
  57. Keshawarz, A.; Joehanes, R.; Guan, W.; Huan, T.; DeMeo, D.L.; Grove, M.L.; Fornage, M.; Levy, D.; O’connor, G. Longitudinal change in blood DNA epigenetic signature after smoking cessation. Epigenetics 2022, 17, 1098–1109. [Google Scholar] [CrossRef]
  58. Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol. 2013, 14, R115, Erratum in Genome Biol. 2015, 16, 96. [Google Scholar] [CrossRef] [PubMed]
  59. Hannum, G.; Guinney, J.; Zhao, L.; Zhang, L.; Hughes, G.; Sadda, S.; Klotzle, B.; Bibikova, M.; Fan, J.-B.; Gao, Y.; et al. Genome-wide Methylation Profiles Reveal Quantitative Views of Human Aging Rates. Mol. Cell 2013, 49, 359–367. [Google Scholar] [CrossRef]
  60. Lu, A.T.; Quach, A.; Wilson, J.G.; Reiner, A.P.; Aviv, A.; Raj, K.; Hou, L.; Baccarelli, A.A.; Li, Y.; Stewart, J.D.; et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging 2019, 11, 303–327. [Google Scholar] [CrossRef] [PubMed]
  61. Hartling, L.; Milne, A.; Hamm, M.P.; Vandermeer, B.; Ansari, M.; Tsertsvadze, A.; Dryden, D.M. Testing the Newcastle Ottawa Scale showed low reliability between individual reviewers. J. Clin. Epidemiol. 2013, 66, 982–993. [Google Scholar] [CrossRef] [PubMed]
  62. Oremus, M.; Oremus, C.; Hall, G.B.C.; McKinnon, M.C.; ECT & Cognition Systematic Review Team. Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales. BMJ Open 2012, 2, e001368. [Google Scholar] [CrossRef] [PubMed]
  63. Horvath, S.; Erhart, W.; Brosch, M.; Ammerpohl, O.; Von Schönfels, W.; Ahrens, M.; Heits, N.; Bell, J.T.; Tsai, P.C.; Spector, T.D.; et al. Obesity accelerates epigenetic aging of human liver. Proc. Natl. Acad. Sci. USA 2014, 111, 15538–15543. [Google Scholar] [CrossRef]
  64. Smith, J.A.; Raisky, J.; Ratliff, S.M.; Liu, J.; Kardia, S.L.; Turner, S.T.; Mosley, T.H.; Zhao, W. Intrinsic and extrinsic epigenetic age acceleration are associated with hypertensive target organ damage in older African Americans. BMC Med. Genom. 2019, 12, 141. [Google Scholar] [CrossRef]
Figure 1. Flow diagram of the systematic literature search on DNAm and LC risk. a Studies included in review are [27,28,44,45,46,47,48,49,50,51,52].
Figure 1. Flow diagram of the systematic literature search on DNAm and LC risk. a Studies included in review are [27,28,44,45,46,47,48,49,50,51,52].
Cancers 17 00690 g001
Figure 2. Forest plot of DNAm and risk of LC. References of studies in the figure from top to bottom are following: [27,28,44,45,46,47,48,49,50,51,52].
Figure 2. Forest plot of DNAm and risk of LC. References of studies in the figure from top to bottom are following: [27,28,44,45,46,47,48,49,50,51,52].
Cancers 17 00690 g002
Figure 3. Funnel plot of publication bias of DNAm and risk of LC.
Figure 3. Funnel plot of publication bias of DNAm and risk of LC.
Cancers 17 00690 g003
Figure 4. Dose-dependent plots displaying the relation between DNAm and LC risk. The x-axis represents the standardized exposure levels corresponding to methylation quartiles Q1 (reference), Q2, Q3, and Q4, while the y-axis represents the odds ratio (OR) for LC risk on a linear scale. The red dashed horizontal line at OR = 1 marks the reference category (Q1). The blue lines represent the confidence interval.
Figure 4. Dose-dependent plots displaying the relation between DNAm and LC risk. The x-axis represents the standardized exposure levels corresponding to methylation quartiles Q1 (reference), Q2, Q3, and Q4, while the y-axis represents the odds ratio (OR) for LC risk on a linear scale. The red dashed horizontal line at OR = 1 marks the reference category (Q1). The blue lines represent the confidence interval.
Cancers 17 00690 g004
Table 1. Characteristics of the studies included in the systematic review and meta-analysis of the association between DNAm and LC risk. All studies collected samples from peripheral blood. NA: Not Available.
Table 1. Characteristics of the studies included in the systematic review and meta-analysis of the association between DNAm and LC risk. All studies collected samples from peripheral blood. NA: Not Available.
Author, Year, ReferenceCohort 1,
Location
Study
Design
DNAm Measure 2NAge (M: Mean, Mdn: Median)Sex (% Male)Race
(% White)
BMI (%, M: Mean, Mdn: Median [kg/m2] < 25)Type of LC 3Smoking Status (% Never)Matched or Adjusted
Variables
NOS 4
Michaud DS, 2023 [52]CLUE II, USANested case–controlEpigenetic Age
AA_Hannum,
AA_Horvath,
AA_Pheno, IEAA_Hannum, IEAA_Horvath, IEAA_Pheno
Cases: 208
Controls: 208
Cases: 58.3 (M)
Controls: 55.9 (M)
Cases: 45.7
Controls: 45.7
NACases: 26.0 (M)
Controls: 26.2 (M)
LC (NSCLC: 74%)Cases: 10.6
Controls: 10.6
Batch effects, BMI, smoking predicted years8
Dugue PA, 2023 [51]MCCS, AustraliaNested case–controlSMOKING
Smk-233, Smk-1061.
BMI
BMI-1109, BMI-85.
ALCOHOL CONSUMPTION
Alc-450, Alc-459
Cases: 327
Controls: 327
Cases: 61 (Mdn)
Controls: 61 (Mdn)
Cases: 61
Controls: 61
NACases: 27 (M)
Controls: 27 (M)
LCCases: 46
Controls: 48
Smoking details, physical activity, diet quality, education, SES, alcohol consumption, BMI9
Li X, 2022 [50]ESTHER, GermanyCohort study;
FU: 17 years
Epigenetic Age
AgeAccelPheno,
AgeAccelPheno
Methylation Score
MRscore-8CpGs
Cases: 207
Controls: 205
Cases: 63.2 (M) Controls: 62.5 (M)Cases: 63.8
Controls: 42.9
NACases: 23.7 (%) Controls: 31.2 (%)LCCases: 30.4
Controls: 49.5
Age, sex, leukocyte composition, batch, educational level, smoking status, alcohol consumption, BMI, diabetes status8
Dugue PA, 2020 [27]MCCS, AustraliaNested case–controlEpigenetic Age
PhenoAge, GrimAge
Cases: 327
Controls: 327
Cases: 61 (Mdn)
Controls: 61 (Mdn)
Cases: 69.3
Controls: 69.3
NACases: 27 (Mdn)
Controls: 27 (Mdn)
LCCases: 45.8
Controls: 45.8
Age, sex, country of birth, sample type, smoking information, BMI, height, alcohol consumption, physical activity, dietary quality, socioeconomic status, education8
Dugue PA, 2021 [49]MCCS, AustraliaNested case–controlMATERNAL SMOKING
MS-568,
MS-19,
MS-15,
MS-28,
MS-17
ADULT SMOKING
AS-233,
AS-56,
AS-1061
Cases: 327
Controls: 327
Cases: 61 (Mdn)
Controls: 61 (Mdn)
Cases: 61
Controls: 61
NANALCCases: 48
Controls: 46
Smoking information, alcohol consumption, BMI, physical activity, dietary quality, education, socioeconomic status, height8
Zhao N, 2021 [48]CLUE I-II, USANested case–controlmdNLR;
CRP Score 1, 2, 3.
Cases: 208
Controls: 208
Cases: 59 (Mdn)
Controls: 56 (Mdn)
Cases: 45.7
Controls: 45.7
Cases: 98.6
Controls: 100
NALC (all LC and NSCLC)Cases: 10.6
Controls: 10.6
Age, sex, smoking status, BMI, batch effects,
predicted pack-years smoked, cell proportions
9
Hillary RF, 2020 [28]GS, ScotlandCohortDunedinPoAmCases: 4450
Controls: 2578
Cases: 51.4 (M)
Controls: 50 (M)
Cases: 43.7
Controls: 38.6
NACases: 26.8 (M)
Controls: 27.2 (M)
LCNAAge, sex, alcohol consumption, BMI, deprivation, education, smoking8
Yu H,
2020 [47]
ESTHER, GermanyNested case–controlMethylation Risk Score
MRS
Cases: 143
Controls: 1460
Cases: 63.7 (M)
Controls: 61.8 (M)
Cases: 62.9
Controls: 44.0
NANALCCases: 12.1
Controls: 49.7
Batch, leukocyte composition, age, sex, smoking status, pack-years9
Gagliardi A, 2020 [44]EPIC, ItalyNested case–controlLogSEM model 4; LogSEM model EPIC, LogSEMmodel(TTD) ≤ 5 y, 5–10 y, >10 yCases: 556
Controls: 556
Cases: 53.7 (M)
Controls: 53.5 (M)
Cases and controls: 31NANALCCases: 43
Controls: 49
Age, sex smoking, BMI, dietary quality, alcohol intake, physical activity, education, Horvath DNAmAge epigenetic AA, DNAmGrimAge, epigenetic AA9
MCCS, AustraliaNested case–controlLogSEM model 4; LogSEM model MCCS, LogSEMmodel(TTD) ≤ 5 y, 5–10 y, > 10 yCases: 3482
Controls: 3482
Cases: 59.1 (M)
Controls: 58.9 (M)
Cases and controls: 61NANALCCases: 46
Controls: 48
Age, sex smoking, BMI, dietary quality, alcohol intake, physical activity, education, Horvath DNAmAge epigenetic AA, DNAmGrimAge, epigenetic AA9
NOWAC, NorwayNested case–controlLogSEM model 4; LogSEM model NOWAC, LogSEMmodel(TTD) ≤ 5 y, 5–10 y, > 10 yCases: 316
Controls: 316
Cases and Controls: 55.9 (M)Cases and Controls: 0NANALCCases: 26
Controls: 38
Age, sex smoking, BMI, dietary quality, alcohol intake, physical activity, education, Horvath DNAmAge epigenetic AA, DNAmGrimAge, epigenetic AA9
Dugue AP,
2018 [46]
MCCS, AustraliaNested case–controlEpigenetic Age
AA_Hannum,
AA_Horvath, IEAA_Hannum, IEAA_Horvath,
EEAA.
Cases: 332
Controls: 332
Cases: 59.5 (M)
Controls: 59.4 (M)
Cases and Controls: 64NACases: 37.0 (%)
Controls: 28.0 (%)
LCCases and Controls: 12BMI, smoking, alcohol intake, diet quality,
physical activity, socioeconomic status education, age, sex, ethnicity
8
Levine ME,
2015 [45]
WHI, USACase–controlEpigenetic Age
IEAA by age:
All ages,
50–59,
60–69,
70+.
IEAA by smoking status:
Current,
Former,
Never.
Cases: 43
Controls: 1986
65.34 (M:
Cases +
Controls)
Cases: 0
Controls: 0
NANALC54.4 (M: Cases + Controls)Age, race/ethnicity, CHD
status, pack-years, smoking status
7
1 Cohort acronyms: CLUE (I/II), Campaign Against Cancer and Stroke; ESTHER, Epidemiological Study on the Chances of Cure, Early Detection and Optimized Therapy of Chronic Diseases in the Elderly Population; MCCS, Melbourn Collaborative Cohort Study; EPIC, European Prospective Investigation into Cancer and Nutrition; NOWAC, Norwegian Women and Cancer Study; WHI, Women Health Initiative; GS, Generation Scotland. 2 DNAm measure acronyms: AA, age acceleration; IEAA, intrinsic epigenetic age acceleration; EEAA, extrinsic epigenetic age acceleration; MRS, methylation risk score; LogSEM, logistic structural equation modeling; MS, maternal smoking; AS, adult smoking; MRscore, methylation risk score, 8CpGs, eight specific CpG sites; DunedinPoAm, Dunedin pace of aging methylation. 3 LC acronyms: LC, lung cancer; SCLC, small-cell lung cancer; NSCLC, non-small-cell lung cancer; LAUD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma. 4 Newcastle–Ottawa Scale.
Table 2. Results of stratified analysis of the LC risk estimates associated with DNAm.
Table 2. Results of stratified analysis of the LC risk estimates associated with DNAm.
Combined Risk Estimate a Test of Heterogeneity Publication Bias
N bValue (95% CI)  Q I2 % pp (Egger Test)p (Begg Test)
ALL (11 articles) 111 1.24 (1.10–1.39)  163.95 93.90 0.00 0.032 0.07
ALL (Case–control study) 104 1.05 (0.99–1.11)  350 70.57 0.00 0.09 0.14
ALL (Cohort study) 7 1.61 (1.36–1.90)  7.01 14.42 0.32 0.012 0.18
  
Smoking status   
           CURRENT 7 1.60 (1.43–1.79)  2.65 0.00 0.45 0.06 0.042
           PAST 7 1.58 (1.42–1.77)  4.11 0.00 0.66 0.09 0.19
Follow-up   
           ≤5 3 1.46 (1.08–1.98)  4.92 59.36 0.09 0.12 0.12
           5–10 3 1.20 (1.06–1.36)  1.27 0.00 0.53 0.60 0.60
           ≤10  20 1.06 (0.93–1.21)  16.92 82.27 0.001 0.46 0.50
           >10 17 0.99 (0.85–1.16)  12.97 76.87 0.005 0.94 0.50
Group of indicators   
     7 indicators 1 23 1.04 (0.95–1.14)  64.89 66.1 0.00 0.75 0.96
           4 indicators 214 1.12 (0.96–1.30)  55.52 76.58 0.00 0.39 0.70
               AA_Hannum 3 0.93 (0.74–1.16)  4.19 52.28 0.12 0.39 0.60
               AA_Horvath 3 0.95 (0.86–1.05)  1.97 0.00 0.37 0.29 0.60
               AA_Pheno 5 1.18 (0.95–1.48)  12.10 66.94 0.017 0.59 0.33
               AA_Grim 3 1.97 (1.57–2.47)  1.14 0.00 0.57 0.62 0.60
           3 indicators 3 8 0.96 (0.89–1.04)  4.05 0.00 0.77 0.026 0.14
              IEAA_Hannum 3 0.95 (0.79–1.14)  2.83 29.28 0.24 0.19 0.12
              IEAA_Horvath 3 0.97 (0.87–1.08)  0.58 0.00 0.75 0.046 0.12
     Type of indicators   
           Hannum (AA + IEAA) 6 0.95 (0.85–1.08)  7.02 28.77 0.22 0.023 0.19
           Horvath (AA + IEAA) 6 0.96 (0.89–1.03)  2.64 0.00 0.76 0.023 0.13
           Pheno (AA + IEAA) 7 1.09 (0.91–1.31)  16.73 64.13 0.010 0.68 0.88
           Grim (AA) 3 1.97 (1.57–2.47)  1.14 0.00 0.57 0.62 0.60
1 AA_Hannum, AA_Horvath, AA_Grim, AA_Pheno, IEAA_Hannum, IEAA_Horvath, IEAA_Pheno. 2 AA_Hannum, AA_Horvath, AA_Grim, AA_Pheno. 3 IEAA_Hannum, IEAA_Horvath, IEAA_Pheno. a Risk estimates were calculated using the random effects model. b Number of data used to calculate the risk.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dolcini, J.; Chiavarini, M.; Firmani, G.; Brennan, K.J.M.; Cardenas, A.; Baccarelli, A.A.; Barbadoro, P. Methylation Biomarkers of Lung Cancer Risk: A Systematic Review and Meta-Analysis. Cancers 2025, 17, 690. https://doi.org/10.3390/cancers17040690

AMA Style

Dolcini J, Chiavarini M, Firmani G, Brennan KJM, Cardenas A, Baccarelli AA, Barbadoro P. Methylation Biomarkers of Lung Cancer Risk: A Systematic Review and Meta-Analysis. Cancers. 2025; 17(4):690. https://doi.org/10.3390/cancers17040690

Chicago/Turabian Style

Dolcini, Jacopo, Manuela Chiavarini, Giorgio Firmani, Kasey J. M. Brennan, Andres Cardenas, Andrea A. Baccarelli, and Pamela Barbadoro. 2025. "Methylation Biomarkers of Lung Cancer Risk: A Systematic Review and Meta-Analysis" Cancers 17, no. 4: 690. https://doi.org/10.3390/cancers17040690

APA Style

Dolcini, J., Chiavarini, M., Firmani, G., Brennan, K. J. M., Cardenas, A., Baccarelli, A. A., & Barbadoro, P. (2025). Methylation Biomarkers of Lung Cancer Risk: A Systematic Review and Meta-Analysis. Cancers, 17(4), 690. https://doi.org/10.3390/cancers17040690

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop