1. Introduction
Alzheimer’s disease (AD) is the major cause of dementia and is projected to affect more than 13 million people in the United States by 2050, thus imposing huge health and economic burdens [
1,
2]. Late onset AD is believed to be a multifactorial disease caused by complex interactions between various genetic and non-genetic factors [
3]. Many genetic variants mapped to several chromosomal regions and genes have thus far been associated with AD by genome-wide association studies (GWAS) [
4,
5]; although, the vast majority of AD cases cannot be etiologically attributed to these variants [
2,
6]. Also, none of non-genetic AD-associated factors (e.g., age, cardiovascular risk factors, head trauma, depression, and educational attainment) has been proven to have a strong causal relationship with AD [
7,
8].
Epigenetic modifications of gene expression in interaction with non-genetic factors are hypothesized to contribute to AD development [
6,
9], particularly in light of the heterogeneous clinical manifestations of AD observed among patients with similar or identical genetic backgrounds [
10]. The potential role of epigenetic mechanisms in AD pathogenesis has been widely investigated in cell lines, mouse models, post-mortem brain tissue, and blood cells [
6,
10,
11,
12,
13]. Several studies have explored the global DNA methylation in AD cases compared with controls, although their findings have been inconclusive, with some reporting global hypomethylation in AD, some suggesting global hypermethylation in AD, and the others reporting no significant differences between cases and controls [
12]. Previous studies have also provided many lines of evidence of associations between AD and gene-specific epigenetic modifications. They mainly investigated the DNA methylation and histone modification differences between AD cases and unaffected controls using candidate gene or genome-wide analysis approaches (e.g., pyrosequencing and array hybridization) which revealed AD-associated epigenetic modifications in some well-known AD genes, such as amyloid-β precursor protein (
APP), Microtubule Associated Protein Tau (
MAPT) [
14], and Apolipoprotein E (
APOE) [
15], as well as in other genes [
12]. For instance, Iwata et al. discovered CpG hypermethylation in
APP and
MAPT in post-mortem brain samples from AD patients, which were suggested to contribute to neural dysfunction and AD development [
14]. Foraker et al. found that AD patients had a lower mean methylation level in 76 CpG sites across
APOE gene compared with age-matched controls when hippocampus and frontal lobe samples were analyzed. However,
APOE methylation was not statistically different between cases and controls in samples obtained from their cerebellum [
15].
In most cases, epigenetically dysregulated genes were uniquely found in a single study [
6,
10,
12,
13], although AD-associated epigenetic modifications of some genes have been replicated in independent studies. For instance, several studies have reported CpG hypermethylation in the
ANK1 gene in different brain regions, such as entorhinal and prefrontal cortices, superior temporal gyrus, and/or hippocampus in AD patients [
16,
17,
18]. Hypermethylated regions overlapping
DUSP22 gene were previously detected in entorhinal and dorsolateral prefrontal cortices and/or hippocampus of AD affected individuals [
18,
19], and CpG hypermethylation of
SORBS3 was detected in the cerebral cortex of AD patients and transgenic AD mouse models [
11,
20]. Moreover, differentially methylated regions overlapping
CDH23,
RHBDF2, and
RPL13 genes were reported in previous studies [
16,
17,
21]. The mRNA expressions of these genes were also found to be altered in AD patients [
16]. In addition, several genes whose associations with AD were replicated by independent GWAS [
2], such as
ABCA7,
BIN1,
CLU,
HLA-DRB5,
SLC24A4, and
SORL1, are epigenetically implicated in AD as well [
16,
22,
23]. The case-control studies and cell/animal models may not, however, reflect genetic contributions to AD-associated epigenetic modifications as they are more likely to identify the environmentally induced epigenetic alterations [
6,
9]. In addition to the studies using individual-level data, several epigenetically AD-associated genes, such as
BIN1,
APOC1,
HLA-DRB1,
HLA-DRB5, and
TOMM40, have been reported by summary data-based analyses [
24,
25] which reflect genetically driven (i.e., through cis acting variants) epigenetic alterations [
26].
In this study, we performed methylome-wide association (MWA) analyses of AD using the summary data-based Mendelian randomization (SMR) method [
26] to investigate genetically driven epigenetic contributors to AD pathogenesis. Instead of analyzing individual-level data, the SMR method integrates the summary results from previous GWAS [
27,
28] and methylation quantitative trait loci (mQTLs) studies using blood samples [
29] and brain tissue [
30] in order to identify associations between AD and methylation alterations that may mediate the genetic associations examined by GWAS. Central to our study was to investigate potential genetically driven epigenetic heterogeneity of AD. Therefore, summary results from our previous GWAS which aimed to analyze genetic heterogeneity of AD in contrasting groups of subjects stratified based on their sex and history of hypertension (HTN) were used for our MWA analyses. Sex has been identified as a risk factor for AD and there are many reports highlighting sex disparities in epidemiological and clinical features of AD [
31,
32,
33,
34,
35,
36,
37]. HTN is also a major cardiovascular risk factor for AD that may be involved in initiation and progression of the disease by causing structural and functional damages to cerebral microvasculature and promoting amyloid plaques formation [
8,
38,
39]. By detecting several group-specific AD-associated single-nucleotide polymorphisms (SNPs) at P < 5E-06, our GWAS suggested that differences in the genetic architecture of AD between these contrasting groups may differentially contribute to AD pathogenesis [
27,
28]. Thus, the current study using summary results from these two GWAS may provide novel insights into potential genetically driven epigenetic heterogeneity of AD. To further validate significant findings, we compared our MWA results with those from our previous transcriptome-wide association (TWA) analyses of AD [
27,
28] that implemented the SMR method using the same GWAS summary results along with data from blood-based [
40] and brain-specific [
30,
41] expression quantitative trait loci (eQTLs) studies.
4. Discussion
Despite the detection of many genetic variants and identification of several non-genetic factors that may play roles in AD susceptibility, the definitive underlying mechanisms in most AD cases is unclear. Thus, epigenetic mechanisms may be key contributors to the heterogeneous nature of AD [
9,
10,
13,
23]. The epigenetic architecture of AD has been widely investigated in case-control studies and cell/animal models [
12]. The AD-associated epigenetic modifications found in these studies can be environmentally induced or genetically driven (i.e., through cis acting variants).
We combined the results from our previous GWAS [
27,
28] with data from two publicly available mQTLs studies of brain tissue [
30] and blood samples [
29] to identify genes that might be epigenetically associated with AD. In contrast to studies using individual-level data, epigenetic associations detected by summary data-based analyses are all genetically driven [
26]. A major focus of our study was to explore potential genetically driven epigenetic heterogeneity of AD based on its two main risk factors (i.e., sex [
31,
32,
33,
34,
35,
36,
37] and HTN [
8,
38,
39]). Therefore, in order to investigate sex-specific and HTN-specific epigenetic changes, our MWA analyses were performed under five alternative plans in which summary results from GWAS on either all subjects, only males, only females [
27], only subjects with a history of HTN, or only subjects with no history of HTN [
28] were included in analyses.
Our analyses demonstrated that 152 probes corresponding to 113 genes were epigenetically associated with AD. The top mQTLs corresponding to these probes were mostly nominally significant in our genome-wide meta-analyses. This might be in part due to suboptimal statistical power of our analyses which can be improved by analyzing larger datasets or more importantly due to the genetic heterogeneity of AD within and between the analyzed cohorts (i.e., LOADFS, CHS, FHS, and HRS). The ±1 Mb flanking regions of ~18% and ~34% of detected probes had attained P
GWAS < 5E-08 and 5E-08 ≤ P
GWAS < 5E-06, respectively, in our genome-wide meta-analyses or other studies reported by GWAS databases [
4,
5]. Comparing our findings with those detected in other SMR-based analyses of AD [
24,
25] revealed that
TOMM40, which had significant probes in brain-specific analyses under all five plans of our study, was epigenetically associated with AD in a previous study [
24].
Investigating group-specific epigenetic alterations, we found that probes corresponding to
APOE and
TOMM40 genes (i.e., inside the chromosome 19q13.32 region) were significant in blood-based and brain-specific analyses, respectively, of both males and females (i.e., plans 2 and 3) and both hypertensive and non-hypertensive groups (i.e., plans 4 and 5). However, several probes (all outside the chromosome 19q13.32 region, except cg05206559 corresponding to
NANOS2 gene in males) were group-specifically associated with AD, indicating potential genetically driven epigenetic heterogeneity of AD based on the two studied risk factors. For instance, we found that among 38 and eight probes that were detected in blood-based and brain-specific analyses, respectively, in either males or females, 22 probes had sex-specific effects when their b
SMR were compared between the two sexes using a Wald chi-square test (
Additional File 1: Tables S3 and S4). Comparing results from hypertensive and non-hypertensive groups, we found that there were 88 (blood-based analyses) and 29 (brain-specific analyses) significant probes outside the
APOE region which were not in common between these two groups. Of these, 79 probes had group-specific effects when their b
SMR were compared between hypertensive and non-hypertensive groups (
Additional File 1: Tables S5 and S6). Addressing genetic and epigenetic heterogeneities of AD is essential for understanding its pathogenesis and developing more efficient and personalized medical interventions tailored to the genetic and epigenetic profiles of individuals.
Our MWA analyses were performed using both brain-specific and blood-based mQTLs data which provided the opportunity to assess the consistency of potential AD-associated epigenetic changes detected in these analyses. Although the pattern of DNA methylation can be tissue- or cell-specific [
6,
60], previous studies have demonstrated the utility of blood samples for investigating AD-associated epigenetic modifications by reporting global or gene-specific methylation changes in AD subjects compared with matched healthy controls [
61,
62,
63,
64,
65]. This might be due to the systemic sequelae of AD, as AD may extensively impact cellular and molecular processes in peripheral tissues and nonneural cells including red blood cells, leukocytes, and platelets [
66,
67,
68,
69,
70,
71]. In addition, blood-based analyses may provide more statistical power than brain-specific studies, which generally have smaller sample sizes due to difficulties in obtaining brain samples from living subjects. Consistent with previous reports, our findings supported the feasibility of using data from blood samples to investigate epigenetic changes involved in AD. The direction of blood-based and brain-specific effects were the same for ~77% of probes and the effects of less than 1% of probes were significantly different between the two analyses across the five analysis plans of interest. We also found that probes corresponding to 10 genes were associated with AD in both blood-based and brain-speficic MWA analyses (
Table 1 and
Table 2). Most of these genes were previously implicated in AD at genome-wide or suggestive significance levels by GWAS [
4,
5], except
SLC6A7,
PSTK, and
KRTAP5-11. AD-associated SNPs at P
GWAS < 5E-08 were found within ±1 Mb of probes mapped to
NANOS2,
HLA-DQB2, and
LECT1 in our meta-analyses and/or previous GWAS. SNPs with 5E-08 ≤ P
GWAS < 5E-06 were found within ±1 Mb flanking regions of probes corresponding to
FAM193B,
BPGM,
ZNF598, and
C16orf80. Moreover, empirical evidence links some of these genes to AD in humans and animal models (e.g.,
SLC6A7 [
72] and
BPGM [
71]).
It should be stressed that the identified AD-associated genes in summary-based analyses do not prove any definitive causal relationships. Instead, they suggest a list of prioritized genes whose potential roles in AD pathogenesis need to be validated by further functional studies [
26]. In a recent study, Hannon et al. detected overlapping mQTL and eQTL signals with functional implications for several complex diseases/traits, such as Crohn’s disease, ulcerative colitis, blood lipids, height, and schizophrenia by comparing their SMR-based analyses [
73]. Therefore, to further pinpoint potential targets, we compared the list of epigenetically AD-associated genes identified from MWA analyses with transcriptionally AD-associated genes identified from our previous TWA analyses [
27,
28].
Our comparisons identified a short list of four potentially AD-associated genes that had significant probes in both MWA and TWA analyses (i.e.,
AIM2,
DGUOK,
ST14, and
C16orf80 in non-hypertensive subjects with P
SMR between 4.62E-07 and 1.35E-10 in MWA analyses and between 2.18E-05 and 7.78E-07 in TWA analyses [
28]). Probes corresponding to all genes but
AIM2 had group-specific effects when their b
SMR were compared between hypertensive and non-hypertensive groups using a Wald chi-square test (
Additional File 1: Tables S5 and S6). AD-associated SNPs with P
GWAS < 5E-08 were not found within ±1 Mb flanking regions of these probes in our meta-analyses or other studies in GWAS databases [
4,
5], although several SNPs with 5E-08 ≤ P
GWAS < 5E-06 were previously reported within ±1 Mb of probes corresponding to
AIM2 [
74] and
C16orf80 [
75,
76]. In addition, chromosomal regions corresponding to
ST14 [
77] (i.e., 11q24.3 region) contained previously reported AD-associated SNPs at P < 5E-08.
A review of the literature provided additional insights, strengthening the potential roles of these four genes in AD. For instance,
AIM2 encodes a protein involved in regulating cell proliferation and innate immunity [
78]. SNPs mapped to this gene were previously associated with white blood cells count at P
GWAS < 5E-08 [
79].
AIM2, along with several other proteins, were suggested to initiate inflammasome formation in response to stimuli such as viruses, bacteria, and damaged cells. Inflammasomes mediate the release of pro-inflammatory cytokines, such as
IL-1β and
IL-18, that are believed to be involved in AD development [
80,
81,
82].
IL-1β may increase in the blood, cerebrospinal fluid, and brain of AD patients and blood level of
IL-18 may increase in early stages of AD.
IL-1β can activate astrocytes and microglia cells and stimulate the release of
APP and amyloid-β (
Aβ) from neurons. Also,
IL-18, which is overexpressed in astrocytes, microglia, and neurons around
Aβ plaques, may promote
Aβ formation and mediate
tau protein hyper-phosphorylation [
82]. It was reported that methylene blue (MB), an inhibitor of inflammasome proteins such as
AIM2,
NLRP3, and
NLRC4 [
80], can decelerate the production of
Aβ plaques and neurofibrillary tangles. Thus, MB-based medications were suggested as potential treatments for AD [
83]. Moreover, Wu et al. reported that
AIM2 knock-out mice exhibited behavioral changes and impaired auditory fear memory [
84].
DGUOK encodes a mitochondrial enzyme involved in the purine metabolism pathway [
78]. Mutations in this gene were linked to some mitochondrial disorders with Mendelian inheritance, such as mitochondrial depletion syndrome [
85]. Mitochondrial dysfunction has also been reported as an important finding in neurons of AD patients [
86,
87]. Ansoleaga et al. showed that
DGUOK was downregulated in the precuneus and entorhinal cortex of patients in AD stages III-IV and V-VI (Braak and Braak staging system [
88]), respectively, compared with matched healthy controls [
89]. In addition, SNPs mapped to
DGUOK were associated with systemic lupus erythematosus at P
GWAS < 5E-08 [
90]. The risks of developing AD and vascular dementia slightly increases among patients with autoimmune disorders, such as lupus erythematosus [
91].
ST14 encodes a membrane serine protease with tumor suppressor activity [
78] that was not associated with AD or its risk factors at P
GWAS < 5E-06 by previous GWAS [
4,
5]. However, Wirz et al. found that the ortholog of
ST14 is overexpressed (i.e., 5.39-fold change with
p < 0.008) in the frontal cortex of
APPswe/PS1dE9 transgenic mice harboring mutant forms of
APP and
PSEN1 in response to
Aβ plaque development [
92]. Yin et al. reported that the mouse ortholog of
ST14 was upregulated in
Aβ plaque-associated microglia cells in
5XFAD transgenic mice harboring mutant forms of
APP and
PSEN1 genes compared with aged-matched control mice [
93].
C16orf80 (also known as
BUG22 and
CFAP20) encodes a highly conserved protein involved in the post-translational modification of
Tubulin subunits of microtubules. Such modifications might be essential for microtubule function and stability in ciliated cells, such as sperm, and in neurons [
94]. Microtubules are major component of neuronal transport machinery, in which defects can lead to neurodegenerative diseases (e.g., the role of microtubule-associated proteins, such as
tau protein, in AD) [
95,
96]. In a previous study, Mendes Maia et al. reported that
Drosophila melanogaster carrying mutant copies of the ortholog of
C16orf80 had a short lifespan and defects in body morphology, climbing activity, and locomotion, which were mostly reversed when gene expression was restored in the nervous system [
94]. However,
C16orf80 was not previously associated with AD or its risk factors at P
GWAS < 5E-06 [
4,
5].
Our pathway enrichment analyses of the brain-specific and blood-based MWA results revealed that nine and 16 pathways were associated with AD, respectively. These pathways were mostly involved in biological processes such as immune system responses (e.g., MHC class II antigen presentation), mitochondrial function (e.g., TCA cycle and respiratory electron transport), neurogenesis, synaptic function, and neurotransmitter signaling (e.g.,
L1CAM interactions,
GABA receptor activation, neurotransmitter receptors and postsynaptic signal transmission, and transmission across chemical synapses pathways) that have been implicated in AD pathogenesis [
87,
97,
98,
99,
100,
101,
102,
103]. Two enriched pathways (i.e., MHC class II antigen presentation and type II diabetes mellitus) were common between the brain-speficic and blood-based MWA analyses, highlighting potential links between AD and immune system responses [
102,
103] and type II diabetes mellitus as an important vascular risk factor for AD [
104].
Despite its rigor, we acknowledge that this study has limitations that could be addressed by future research using different methodologies and data. Using summary results from GWAS with larger sample sizes is likely to increase the statistical power of analyses. However, it should be noted that increasing sample sizes may not necessarily result in considerably increased power of GWAS due to the genetic heterogeneity underlying complex diseases. As mentioned above, the summary-based methylome-/transcriptome-wide approaches cannot draw definitive causal relationships between the disease of interest and detected genes [
26]. Such analyses can only help generate hypotheses regarding the possible involvement of a short list of genes in the pathogenesis of the studied disorder, which need to be validated empirically. Analyzing individual-level data which provide gene expressions and epigenetic profiles for the same case and control subjects would help obtaining a more definitive view of the underlying biological processes of AD and, in addition, may allow investigating the roles of non-genetic factors (e.g., smoking, medications that interfere with DNA methylation, exposure to metals, nutritional ingredients) in the observed transcriptome and epigenome changes. This is particularly important because epigenetic alterations can be environmentally induced [
6,
9]. It would also be interesting to investigate whether detected epigenome changes are associated with AD progression. This requires data from different AD stages [
88] with sufficient sample sizes. The CHS, FHS, HRS, and LOADFS datasets analyzed in our study do not provide disease staging information for AD subjects. Finally, investigating cell-specific (i.e., neurons and different glial cells) epigenetic alterations may provide valuable additional insights into the epigenetic architecture of AD, although small sample sizes and insufficient statistical power can be a major problem for such studies.