Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation

Muñoa-Hoyos, Iraia; Araolaza, Manu; Calzado, Irune; Albizuri, Mikel; Subirán, Nerea

doi:10.3390/ijms27041964

Open AccessArticle

Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation

by

Iraia Muñoa-Hoyos

^1,2,3,*

,

Manu Araolaza

^2,3

,

Irune Calzado

^2,3

,

Mikel Albizuri

²

and

Nerea Subirán

^2,3

¹

Department of Nursing I, Faculty of Medicine and Nursing, University of the Basque Country, 48940 Leioa, Bizkaia, Spain

²

Department of Physiology, Faculty of Medicine and Nursing, University of the Basque Country, 48940 Leioa, Bizkaia, Spain

³

Bizkaia Health Research Institute, 48903 Barakaldo, Bizkaia, Spain

^*

Author to whom correspondence should be addressed.

Int. J. Mol. Sci. 2026, 27(4), 1964; https://doi.org/10.3390/ijms27041964

Submission received: 19 December 2025 / Revised: 4 February 2026 / Accepted: 6 February 2026 / Published: 18 February 2026

(This article belongs to the Special Issue Benchmarking of Modeling and Informatic Methods in Molecular Sciences)

Download

Browse Figures

Versions Notes

Abstract

Despite the improvements in tool development for DNA methylation analysis, there is a lack of a consensus on computational and statistical models used for differentially methylated cytosine (DMC) identification. This variability complicates the interpretation of findings and raises concerns about the reproducibility and biological significance of the detected results. In this regard, here we conducted a comparative evaluation of edgeR and methylKit tools to assess their performance, concordance, and biological relevance in detecting DMCs following a morphine exposure model in mouse embryonic stem cells (mESCs). Both pipelines were applied to the same WGBS dataset (GEO accession number: GSE292082), and concordance was calculated at both single-base and gene levels. Although the total number of DMCs identified differed between tools, both pipelines detected a global hypomethylation pattern. Genomic distribution analysis revealed that DMCs predominantly localized to intergenic and intronic regions, as well as to open sea regions. Despite differences in sensitivity, both pipelines demonstrated moderate concordance at the DMC level (~56%) and high concordance at the gene level (~90%), identifying largely overlapping sets of differentially methylated genes (DMGs). Comparative assessments further showed that the choice of statistical metric can influence the perceived magnitude of biological effects. Sensitivity analyses indicated that threshold selection and normalization methods influence DMC detection, whereas aggregation at gene level reduces discrepancies. Overall, our findings underscore the complementary strengths of methylKit and edgeR and highlight the importance of careful tool selection for epigenetic studies. As a conclusion, we recommend integrating both pipelines to ensure a balanced interpretation of effect sizes, particularly in studies with complex experimental designs.

Keywords:

DNA methylation; differential methylation cytosines; differential methylation genes; edgeR; methylKit; bioinformatics pipelines

Graphical Abstract

1. Introduction

Epigenetics is the study of heritable changes in gene expression that do not involve alterations to the underlying DNA sequence. The primary mechanisms of epigenetic regulation include DNA methylation, histone modifications, chromatin remodelling, RNA-based regulation (involving both small and long non-coding RNA) and the emerging role of non-canonical DNA structures [1,2,3]. These modifications are essential for cellular differentiation and function, as well as for normal development, aging and a number of diseases [1,2].

Among previously mentioned mechanisms, DNA methylation is the most stable type of epigenetic modification that plays a significant role in regulating gene expression, maintaining genomic integrity and influencing developmental processes and disease in a cell-type specific manner [4,5,6]. Specifically, this modification primarily consists of the addition of a methyl group to the fifth carbon position of cytosine (5mC) [7,8]. While 5mC is the most prevalent form in vertebrates, it is increasingly recognized that methylation can also occur on other bases, such as N6-methyladenine (6mA), which plays important regulatory roles despite being less frequent and less studied in eukaryotes [9]. It is worth noting that DNA methylation exerts diverse regulatory effects depending on its genomic context since it is not uniformly distributed across the genome. While hypermethylation of CpG-rich promoters is linked to gene silencing in vertebrates (implicated in processes such as genomic imprinting and X-chromosome inactivation), global hypomethylation is associated with genomic instability and aberrant gene activation, contributing to disease development, such as tumorigenesis, neurodegenerative diseases and neurological disorders [10,11,12].

With the fast development of next-generation sequencing technologies, currently the most reliable and widely used method for measuring DNA methylation is bisulfite sequencing (BS-Seq), as it enables a detailed exploration of the epigenome at single-nucleotide resolution [13,14]. In particular, whole-genome bisulfite sequencing (WGBS) has become the gold standard for profiling cytosine methylation as it provides a high-resolution map of DNA methylation across the entire genome, allowing for the identification of different methylation patterns genome-wide [15]. In this technique, DNA is treated with sodium bisulfite, converting unmethylated cytosine residues to uracil (and ultimately to thymine when PCR amplification occurs), whilst maintaining methylated cytosines unchanged and finally providing single-base, quantitative cytosine methylation levels [16]. However, the analysis of methylation sequencing data presents several computational challenges, including the increasing diversity of methodologies and library preparation protocols for WGBS, the complexity of bisulfite-converted data and the need for precise identification of differential methylated cytosines (DMCs) and differential methylated regions (DMRs) [17,18,19].

The first step of analyzing bisulfite-sequencing data is to align reads to a reference genome and to count the number of C-to-T conversions, using read mapping and methylation-calling aligner tools, such as Bismark [14], Methylcoder [20], BRAT [21], RMAP [22], BSSeeker [23,24] and BSMAP [25], among others. Additionally, within the past few years, multiple computational approaches have been developed for downstream DNA methylation studies, which often involve detecting DMCs and DMRs [26]. Some of them implement linear regression designs, including methylKit [27], RnBeads [28] and BSmooth [29]. Other commonly used software are based in binomial model, for example, edgeR [30], DSS [31], BiSeq [32], MOABS [19], RADMeth [33], Bisulfighter [34], methylSig [35], DMRfinder [36], and HMM-DM [37].

Despite the improvements in tool development, a major challenge in DNA methylation analysis arises from the lack of a consensus on statistical and computational tools used for DNA methylation analysis. Indeed, there is no standard protocol to carry out this process [38,39,40]. Specifically, given that these methods rely on distinct statistical models and normalization and dispersion estimation strategies, they lead to variable DMC or DMR outputs with no large percentage of agreement, even when applied to the same dataset. This variability complicates the interpretation of findings and raises concerns about the reproducibility and biological significance of the detected regions.

To address this gap and in order to have a deeper understanding of DMCs identification, in this article, we performed a methodological comparison between two widely used tools to test predefined genomic regions for differential methylation in BS-seq data: methylKit [27] and edgeR [30,41]. Both are available within the statistical packages provided by the Bioconductor software on the R platform and have shown high sensitivity and/or low false-positive rates. However, while edgeR was initially designed for RNA-seq and later adapted for NGS-based methylation data, methylKit was developed specifically for bisulfite sequencing applications. Both tools support differential methylation analysis, even if using different statistical and conceptual frameworks. methylKit [27] employs Fisher’s exact test or logistic regression with overdispersion correction to calculate p-values adjusted to q-values for multiple test correction using a SLIM approach [42]. In comparison, edgeR [30,41] is based on the linear model framework (GLM), quasi-likelihood F tests (QLF) or an exact test that is conceptually similar to Fisher’s exact test, but tailored for overdispersion [43], and models the variation between biological replicates through the negative binomial dispersion supporting both competitive and self-contained tests. This difference in modelling also extends to the reporting metrics. methylKit provides the percentage of methylation change—a highly intuitive metric for biological interpretation—whereas edgeR reports log-fold changes, requiring an extra step of contextualization to relate it to absolute methylation levels. It is worth mentioning that a key divergence was identified in their regional analysis capabilities. methylKit inherently supports de novo DMR calling through genome tiling windows, whereas edgeR’s capacity for regional analysis is generally dependent on predefined genomic intervals. To avoid systematic and base-calling errors, both tools implement different quality control and preprocessing strategies. Specifically, edgeR relies on coverage normalization and removes low-coverage regions to ensure accurate modelling of count data, whereas methylKit applies library-size normalization and uses similar criteria for filtering and removing potential duplication bias, enhancing the reliability of downstream statistical testing.

In this study, a publicly available dataset (GEO accession number: GSE292082) was used as a benchmark to quantitatively evaluate and compare the performance of edgeR and methylKit under controlled and reproducible conditions to share a new perspective on the use of an appropriate tool or tool combination for the identification of DMCs and differentially methylated genes (DMGs). While the dataset used was originally generated in a previous study from our group [44], where the genome-wide effect of chronic morphine exposure on DNA methylation in mESCs was described, the present work focuses specifically on methodological aspects. Here, we aim to rigorously assess and compare the analytical capabilities of these pipelines for detecting DMCs and DMGs, contributing to the reproducibility and transparency of DNA methylation research. Our findings highlight the importance of analytical choices in methylome interpretation and provide a reproducible benchmark framework that can guide researchers in selecting appropriate tools for differential methylation analysis. In fact, it is expected that the combination of different advanced analytical tools will expand the frontiers of methylome analysis, with increasingly relevant applications in environmental epigenetic, biomedical research and precision medicine [45].

2. Results

2.1. Comparative Assessment of DMC Detection Tools

First, to evaluate their suitability for our dataset, a systematic comparative analysis of their methodological frameworks was performed on both R packages, edgeR and methylKit. Supplementary Table S1 summarizes the main features of each tool used for analysing DMCs, including platform, source and implementation language, tool origin, alignment ability, single- or paired-end capacity, major function, DMR calling capabilities, concept, model or test, statistical method, reporting metrics, quality control and preprocessing step, smoothing and further analysis options. Although both packages are compatible with major sequencing protocols and platforms, they diverge in their statistical modelling and reporting metrics (absolute percentages in methylKit vs. log-fold changes in edgeR). After confirming that both tools implement robust but distinct quality control and normalization strategies to mitigate technical biases, their performance in detecting DMCs was compared.

2.2. Similarity Analysis of DMC Detection Tools

To evaluate the concordance between edgeR and methylKit in the detection of DMCs, both tools were applied to DNA methylation data derived from control and morphine-treated mESC samples. edgeR identified a total of 203,337 DMCs, while methylKit detected 223,280 DMCs. To further characterize these findings in depth, changing trends in DMCs were identified. Of these, edgeR classified 60,704 DMCs as hypermethylated (29.85%) and 142,633 as hypomethylated (70.15%). Similarly, methylKit identified 67,054 hypermethylated DMCs (30.03%) and 156,226 hypomethylated DMCs (69.97%) (Table 1 and Figure 1A). Then the overlap between DMCs detected by both tools was identified, amounting to 153,394 DMCs, corresponding to 56.14% of the total unique DMCs identified by either method (n = 273,223), calculated as the union of both sets to avoid double-counting overlapping sites. Within this overlapping set, 44,641 common DMCs (29.10%) were classified as hypermethylated, while 108,753 common DMCs (70.90%) were identified as hypomethylated. This was equivalent to a 53.71% and a 57.20% fraction of the total unique hypermethylated (n = 83,117) and hypomethylated (n = 190,106) DMCs, respectively (Table 1 and Figure 1A). To further evaluate the technical concordance and ensure the detection was not skewed towards a specific direction, a cross-platform comparison and a Spearman correlation analysis was performed between the methylation metrics of both tools (Supplementary Figure S2). For the common DMCs, a moderate-to-strong positive correlation was observed (ρ = 0.62, p < 0.0001), confirming high directional consistency. Furthermore, non-common DMCs (sites detected by only one tool) exhibited a weaker correlation (ρ = 0.49, p < 0.0001), indicating that even for tool-specific sites, the underlying biological signal is preserved. Additionally, the mean and the standard deviation were calculated for both tools to further estimate the coefficient of variation. This value was less than 5% (0.05) in all cases, indicating low relative variability. In order to focus on the commonly identified sites, the distribution of overlapping DMCs was analysed for each chromosome. To explore the chromosomal distribution of commonly identified DMCs, their frequency across chromosomes was analysed. The majority of the DMCs were significantly located on chromosomes 8 and 11. In contrast, chromosomes 2, 3, 4, 7, 12, 13, 18, 19 and sex chromosomes (X and Y) exhibited a notably lower number of DMCs (Supplementary Figure S3). However, it is worth mentioning that a consistent pattern was observed across all chromosomes, and the number of hypomethylated cytosines was always higher than hypermethylated ones.

Given the functional effect of DNA methylation on gene expression, the next step was to examine each tool’s capacity to detect DMCs across specific gene-associated features. Particular attention was given to DMCs located in promoter regions due to their central role in transcriptional regulation. The following gene features were included in the analysis: promoters (≤1 kb, 1–2 kb and 2–3 kb upstream of transcription start sites), 5′UTR, 3′UTR, exons, introns, downstream regions and distal intergenic regions (Figure 1B). In the promoter region edgeR identified a total of 34,817 DMCs (14,015 DMCs within ≤1 kb, 11,216 DMCs in 1–2 kb and 9586 DMCs in 2–3 kb), while methylKit detected a total of 37,699 DMCs (14,974 DMCs within ≤1 kb, 12,140 DMCs in 1–2 kb and 10,585 DMCs in 2–3 kb). The identified number of 5′UTR and 3′UTR DMCs were very similar in both tools, specifically 129 and 4460 DMCs in edgeR and 149 and 4920 in methylKit. Furthermore, for exonic and intronic regions, edgeR reported 9146 and 83,043 DMCs, whereas methylKit identified 10,024 and 92,460 DMCs, respectively. Similarly, for downstream and distal intergenic regions, edgeR identified 2284 and 69,458 DMCs, while methylKit reported 2540 and 75,488 DMCs respectively. As shown in Figure 1B, most DMCs were located in introns and distal intergenic regions, corresponding to 40.84% and 34.16% of gene features in edgeR, and conforming 41.41% and 33.81% in methylKit. Promoter regions and adjacent areas collectively accounted for approximately 17% of DMCs in both tools. In any case, the distribution of DMCs across genomic features was consistent between tools, with only minor variations (Table 2 and Figure 1B). Then, to assess overlap between both tools, common DMCs within specific gene features were identified. For the promoter region a total of 26,031 DMCs were identified (10,324 DMCs within ≤1 kb, 8412 DMCs in 1–2 kb and 7295 DMCs in 2–3 kb); 100 DMCs in 5′UTR; 3403 DMCs in 3′UTR; 6909 DMCs in exons; 63,657DMCs in introns; 1699 DMCs in downstream regions; and 51,595 DMCs in distal intergenic regions. In all cases, the fraction of overlapped DMCs remained close to 56%, in the same way as observed in the general dataset. Additionally, the coefficient of variation remained below 5% (0.05) in the majority of cases, reinforcing the reproducibility of the findings (Table 2).

DNA methylation is not uniformly distributed across the genome. Instead, CpG sites tend to cluster in distinct genomic contexts, including CpG islands (CGIs), shores, shelves and open sea regions (Figure 1C), each with potentially different genetic regulatory roles. Consequently, the way forward was to evaluate the ability of each tool to identify DMCs within these CpG-related features. Both tools predominantly identified DMCs in the area known as open sea (over 90% of all DMCs); that is, edgeR detected 184,129 DMCs, and methylKit identified 203,048 DMCs outside the CpG islands area. Likewise, edgeR identified 985 DMCs in CGI, 9638 DMCs in shores and 8585 DMCs in shelves (summing up to 9.44%), while methylKit detected 849 DMCs in CGI, 9888 DMCs in shores and 9495 DMCs in shelves (amounting to 9.06%). In fact, the 1% of DMCs located in CGIs that simultaneously coincided with promoter regions ranged from 25.79% in edgeR to 22.73% in methylKit (Table 3 and Figure 1C). Next, overlap analysis revealed 566 common DMCs in CGIs, 6839 DMCs in shores, 6604 DMCs in shelves and 139,385 in open sea regions. As with previous comparisons, the proportion of overlapping DMCs remained close to 56% across all categories, except for CGIs, which exhibited a lower overlap of 44.64%. A coefficient of variation under 5% (0.05) was observed in the vast majority of cases, suggesting again low variability between replicates (Table 3).

2.3. Similarity Analysis of DMG Detection Tools

When identifying DMCs, it is crucial that a tool also facilitates the identification of DMGs, as the latter provide a more biologically meaningful and functionally relevant perspective. While DMCs offer site-specific information on methylation changes, DMGs enable the contextualization of these modifications at the gene level, linking methylation patterns to biological processes and transcriptional effect, thereby facilitating interpretation within a physiological and pathological context. Therefore, a comparative evaluation of both tools in DMG detection was conducted. edgeR identified a total of 17,657 DMGs, whereas methylKit reported 17,772 DMGs. Within these, edgeR classified 13,128 DMGs as hypermethylated, 16,313 DMGs as hypomethylated and 11,783 DMGs with both alterations (66.7%). Similarly, methylKit identified 13,426 hypermethylated DMGs, 16,429 hypomethylated DMGs and 12,081 with both marks (68%) (Table 4 and Figure 2A). Analysis of the overlap revealed 16,357 DMGs detected by both tools, corresponding to 87.8% of the total unique DMGs identified by either method (n = 19,072), calculated as the union of both sets. Of these, 11,594 were commonly classified as hypermethylated DMGs and 14,954 as hypomethylated DMGs, while 10,191 common DMGs displayed both methylation alterations (62.30%). This was equivalent to 80.00%, 86.20% and 74.53% of the total unique DMGs in each respective category (Table 4 and Figure 2A). In all comparisons, the coefficient of variation remained below 5% (0.05), reinforcing the reproducibility of the observations (Table 4).

Significantly, while the general overlap in DMCs between edgeR and methylKit was 56%, this value increased substantially to 87.8% when focusing on the gene level (DMGs). To enhance the biological relevance of the results and minimize false positives, equivalent stringency thresholds were applied, that is, a minimum fold-change difference of 2 in edgeR and minimum methylation difference of 25% in methylKit (with significance set at p ≤ 0.05 in both cases) (Figure 2B,C). These parameters reduced the total number of DMCs to 121,122 in edgeR and 156,050 for methylKit. The resulting intersection of both tools identified 64,138 common DMCs, corresponding to 30.11% of the total unique DMCs (n = 213,034). Despite this reduction, the distribution of methylation trends remained consistent with previously described results since common hypermethylated DMCs were 28.31% and common hypomethylated DMCs were 71.69% (Figure 2B). Focusing on the genetic impact, these thresholds decreased the number of DMGs to 16,362 for edgeR and 16,538 for methylKit, with an overlap of 14,896 DMGs. It is worth mentioning that this overlap was 82.74% of the total unique DMGs (n = 18,004). To measure the biological relevance of the identified DMGs, gene ontology enrichment analyses were performed separately on the DMGs identified by each tool, as well as on the set of DMGs commonly detected by both tools, revealing substantial biological overlap. Both edgeR and methylKit identified processes related to cellular development, metabolic activity, chemical stimulus response, cellular localization and regulation of cell adhesion, reflecting core biological functions across datasets. The common list reinforced these findings by highlighting biological processes such as cellular localization, regulation of metabolic and cellular processes, regulation of response to stimulus, and multicellular organismal processes, thereby emphasizing the robustness and the biological relevance of the shared results. While some differences in the ranking of top GO terms were observed between the individual and common sets (Figure 2C), these results offered complementary insights suitable for integrative interpretation.

3. Discussion

This study provides a comprehensive evaluation and comparison analysis between methylKit and edgeR, two widely used computational tools for detecting DMCs in WGBS data. In brief, both methods proved effective in identifying DMCs across various genomic contexts, showing that both tools can be complementary, although notable differences were observed in the total number detected and the way the results were presented. These findings highlight the importance of considering methodological integration in methylation analyses and underscore the need for standardized protocols to ensure reproducibility and consistency in findings with biological relevance.

Although methylKit and edgeR share similar objectives, they differ considerably in their statistical frameworks and normalization strategies. edgeR employs a generalized linear model based on a negative binomial distribution with empirical Bayes dispersion estimation and TMM (trimmed mean of M-values) normalization (which may reduce false positives) [30], while methylKit implements Fisher’s exact test or logistic regression with overdispersion correction, applying multiple testing correction via the SLIM method [27]. These methodological differences were reflected in how each tool reported DMCs, as methylKit provided percentage change values, whereas edgeR presented fold-change values. However, this did not present any impediment to the inter-tool comparison performed in this study, nor in the establishment of an equivalent threshold between both tools to identify the most relevant methylation changes [27,30]. While the metrics differ in nature, they allowed for a synchronized filtering of the data for downstream analysis. Interestingly, methylKit identified more total DMCs than edgeR (223,280 vs. 203,337 DMCs). On the one hand, this may be potentially related to the correction for overdispersion in normalization, which can lead to a more conservative DMC detection in edgeR. On the other hand, that identification difference may reflect the methylKit design as a methylation-specific tool [27], unlike edgeR, which was initially designed to analyse RNA-seq [41] and later modified to identify methylation changes [30]. These differences support previous findings indicating that the choice of analytical pipeline can significantly affect both the sensitivity and specificity of methylation calls [39,46]. Importantly, the cross-platform comparison revealed a moderate level of concordance (56.14%) in DMC detection, with a coefficient of variation below 5%, indicating a strong technical robustness and reproducibility of results, even with a limited number of replicates. This result aligns with earlier benchmarking studies comparing methylation tools, which have noted similarly limited overlap among DMC detection methods using distinct statistical models [39,46].

This interpretation is supported by the correlation analysis, which revealed that both common and non-common DMCs maintain a consistent directional trend. The fact that non-common DMCs still display a correlation of ρ = 0.49 suggests that many tool-specific calls are not biological contradictions but rather marginal cases where one algorithm is more conservative than the other. Specifically, the observed instances where edgeR reports high effect sizes (logFC) relative to moderate methylation differences in methylKit can be attributed to their mathematical frameworks. While methylKit measures absolute shifts in methylation, edgeR is highly sensitive to relative changes; thus, small absolute increments can manifest as disproportionately large fold changes. This divergence is particularly evident in loci where edgeR reports logFC values exceeding 4 despite methylKit showing only moderate percentage shifts (0–40%). Such a pattern highlights a decoupling between statistical sensitivity and biological magnitude: while edgeR is highly effective at identifying statistically significant relative changes in read distribution, these variations do not necessarily translate into large-scale absolute changes in the methylome. That is why tool integration remains essential to contextualize these mathematical artefacts and distinguish high-sensitivity relative signals from substantial physical changes in DNA methylation [47]. Consequently, while the overlap is moderate, the high directional agreement reinforces the reliability of the detected information. Despite these differences, both tools exhibited a shared methylation directionality trend, with a predominant hypomethylation pattern (approximately 70% of DMCs) not only globally but also across individual chromosomes. The relative absence of differential methylation in these regions is in line with the absence of repressive methylation pattern in undifferentiated mESCs, reflecting the maintenance of the pluripotent state [48]. Altogether, these results further support the notion that both tools provide complementary perspectives and that their combined use is critical for robust biological interpretation in methylome studies.

The distribution of DMCs was largely consistent between both platforms, particularly within intronic and distal intergenic regions, which together represented over 70% of total DMCs. This distribution agrees with previous epigenomic landscape analyses showing that DNA methylation variability is widely distributed throughout the genome (not confined to promoter regions) and may regulate enhancer activity and long-range chromatin interactions [6,49,50,51]. When looking to the rest of the gene features, both tools demonstrated similar distribution, further underscoring their robustness despite differences in statistical methodology. Interestingly, although promoter regions accounted for a smaller proportion of total DMCs in both tools (~17%), this result has mechanistic relevance since promoter methylation is linked to gene silencing [52,53]. This trend was also observed by DMCs distributed across different CpG contexts. Specifically, a minor fraction mapped to CGIs, shores and shelves in both tools, and the vast majority of identified DMCs resided in open sea regions (over 90%). These results reinforce the evidence that open sea methylation may act as a dynamic regulatory element sensitive to epigenetic modulation [54]. Although DMCs within CGIs were less frequent, they displayed a high co-occurrence with promoter regions (approximately 25% of CGI DMCs overlapped with promoters), highlighting their potential regulatory relevance. This is in line with the view that methylation outside of canonical CGIs, particularly in enhancer regions, may provide more sensitive markers of cell state and environmental perturbation [55]. These results suggest that while both tools can detect DMCs in high-density CpG contexts, their primary sensitivity lies in identifying methylation variation in lower-density regions. These results align with studies suggesting that integrative approaches, combining multiple algorithms, can provide the most reliable view of methylation landscapes [38,39,40].

In terms of biological relevance, the most striking evidence for complementarity between the tools was identified from the gene -level analysis. Despite the moderate overlap in DMCs, both edgeR and methylKit identified a high concordance (87.8%) in DMG identification. This indicates that the biological impact of methylation changes is preserved at the gene level, even if the precise cytosine positions differ between tools. Furthermore, gene ontology enrichment analyses of the DMGs underlined processes related to stimulus response, cell adhesion, metabolism and development pathways in each tool separately but also after overlapping results, functions that are known to be affected by opioid exposure in other studies [56,57,58]. Notably, while the core biological signal remained consistent, minor shifts in GO rankings between individual and common sets were observed, reflecting the sensitivity of enrichment statistics to list size [59,60]. However, the consensus set acts as a stringency filter that highlights the most robust biological signals by reducing background noise. Importantly, the observed increase in overlap when analysing DMGs (compared to DMCs) suggests that gene-level aggregation may reduce false positives and increase biological interpretability, which supports its use as a complementary tool in methylation studies [61].

While this study provides valuable insights into the comparative performance of two key tools, several limitations should be acknowledged. First, as noted in our comparative assessment, the choice of tools carries inherent trade-offs. While edgeR is robust for handling biological variance, its origin as an RNA-seq tool means it lacks certain native methylation features found in methylKit, such as the ability to perform de novo DMR calling via tiling windows. Second, although both tools allow for effective data filtering, the mathematical nature of logFC in edgeR can lead to a potential overestimation of the biological effect size, particularly in regions with low baseline methylation. In these cases, a high log-fold change may represent a significant relative shift in read ratios without corresponding to a large-scale absolute change in DNA methylation. Therefore, we emphasize that while edgeR excels in detecting statistical significance, its effect size should be contextualized alongside methylKit’s percentage-based metrics to ensure a realistic biological interpretation. Third, the analysis was restricted to one experimental model (mESCs, single cell type and one treatment condition, morphine treatment), which is why results may differ in other biological contexts or with different exposure durations and doses. Fourth, we focused on DMCs and DMGs rather than DMRs; thereby, future work should consider DMRs, which may provide additional site-level information worth taking into account. Fifth, the inclusion of only two replicates per condition may limit statistical power and increase sensitivity to outliers. Future studies should incorporate more replicates and explore integrative frameworks that combine WGBS with transcriptomic or proteomic data for functional validation.

Taken together, these findings underscore the benefits of integrative and complementary strategies to provide a more comprehensive picture of methylation dynamics. While no single tool can provide a complete and accurate representation of the methylome, integrating results from multiple statistical approaches can strengthen analytical confidence and enable more reliable identification of functionally relevant regions, thus reinforcing biologically relevant interpretation. Future directions should involve benchmarking these and other tools across a broader range of biological systems and experimental designs to establish more standardized protocols and promote reproducibility in methylomic research.

4. Materials and Methods

4.1. DNA Processing and Sequencing Workflow

The used DNA samples and sequencing workflow was previously published by our research group and was generated under the same experimental procedure (GEO accession number: GSE292082) [44]. Briefly, genomic DNA from control and morphine-treated (10 μM, Alcaliber; 24-h treatment period) mouse embryonic stem cells (mESCs; Oct4-GFP cell line/PCEMM08, PrimCells, San Diego, CA, USA) was isolated and extracted, using a classic phenol–chloroform/isoamyl methodology with phenol (P4557, Sigma, Saint Louise, MO, USA), chloroform (CL01981000, Scharlau, Sentmenat, Spain) and isoamyl alcohol (BP1150, Fisher BioReagents, Waltham, MA, USA) and following the manufacturer’s instructions. DNA concentration and purity were measured using the Nanodrop Spectrophotometer ND-1000 (Thermo Fisher Scientific, Wilmingyon, DE, USA) by quantifying the 260/280 absorbance ratio. The amount of extracted DNA from two samples was preserved at −80° prior to library preparation.

Genomic DNA was sonicated using a Soniprep 150 (MSE Ldt, London, UK) to produce fragments of approximately 300 base pairs (bps). These fragments were subsequently denatured and prepared for bisulfite conversion, using the EZ DNA Methylation-lightning Kit (Zymo Research, Irvine, CA, USA) to facilitate downstream methylation analysis. DNA fragments were then subjected to library preparation, employing the KAPA Library Preparation Kit (Roche, Pleasanton, CA, USA) along with xGenTM Methyl UDI-UMI Adapters (IDT, Coralville, IA, USA). The quality and integrity of the libraries were assessed using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) with the DNA 7500 assay. Following library preparation, sequencing was performed on an Illumina NovaSeq 6000 S4 platform (paired-end) (Illumina, San Diego, CA, USA), with 4× multiplexing to ensure robust coverage, achieving a minimum of 50,000 reads per replicate for each sample. All cytosine residues across both duplicates for each sample were included in subsequent genome-wide methylation analyses.

4.2. Pre-Processing

The library fragment adapters were trimmed using Trim Galore! (v.0.6.2) [62], and subsequently, a FastQC High-Throughput Sequence Quality Control report (v.0.11.6) [63] was generated to evaluate the quality of the WGBS FASTQ files, indicating a quality score exceeding 30 and foreseeing a suitable mapping efficiency. The resultant reads were concatenated using Cat (v.8.22) [64]. Spearman correlation confirmed the reproducibility of the two biological replicates in methylation samples, demonstrating high concordance (Supplementary Figure S1A).

4.3. Cross-Platform Analysis of m5C Sequencing Data

The WGBS data was mapped to a bisulfite-converted reference genome file for mouse (UCSC GRm38/mm10) with Bowtie2 [65,66] on Bismarck (v.0.22.1) [67] with default parameters, and the resulting binary alignment files were sorted and indexed with SAMtools (v.14.0) [68] to enable quicker access. To be unable to use the post-processing programs that follow the WGBS bioinformatics analysis, a MethylDackel (v.0.5.1) [69] tool was used to estimate the percentage coverage and percentage methylation for all CpG sites on a genome-wide scale. Then, to evaluate the effect of morphine on differentially methylated cytosines (DMC), two different statistical tools were chosen (under R v4.0.3): (1) edgeR (v.3.32.1) [30,41], which uses the negative binomial distribution to assess statistical significance, with TMM normalization accounting for library size and (2) methylKit (v.1.16.1) [27], which normalizes read coverage distribution between samples. In both cases regions with a p-value < 0.05 were considered differentially methylated. The PCA (principal component analysis) in each of the tools confirmed the similarity between replicates and distinguishable differences between samples (control and morphine-treated duplicate samples). After applying TMM normalization (correcting for library size), a read count distribution plot showed a correct data normalization and Volcano plot analysis confirmed the differences in these DMCs analysed with two tools (Supplementary Figure S1B). It is worth mentioning that both tools present the DMCs in different ways: edgeR uses foldchange values, and methylKit applies percentage values. The methylation landscape is viewable via the UCSC genome browser [70]. Integrative analyses between both tools were performed using Venny tools (v.2.1.0) [71] and the Gene Ontology Resource from the GO Consortium (https://geneontology.org/, v.16.1.0) [72,73] was used to identify the biological functions. The computational workflow summarizing all the steps is included in Figure 3.

5. Conclusions

In conclusion, this comparative study demonstrates that while edgeR and methylKit each have unique advantages, their complementary use enhances the robustness and biological interpretability of differential DNA methylation analysis. The convergence in general trends, specifically the predominance of hypomethylation in response to morphine, the regional consistency of DMCs across gene features and the high concordance of overlapped DMGs identification, supports the validity of findings derived from both statistical frameworks. However, we found that tool-specific metrics used for results quantification significantly impact biological interpretation. Recognizing these mathematical differences is essential to avoid overestimating effect sizes. Therefore, the proposed tool integration is necessary to ensure that methodological consistency reinforces the biological relevance of the shared results.

Given the lack of a standardized pipeline in the field, cross-validation across multiple platforms remains essential to improve accuracy in DMC detection and robust and reproducible epigenetic profiling. As DNA methylation continues to gain visibility as a key epigenetic biomarker, computational strategies that combine sensitivity, specificity, and flexibility will be essential. Future studies should continue exploring the impact of pipeline design, normalization strategy, and statistical modelling in large-scale WGBS datasets to enable more comprehensive and biologically meaningful methylome interpretations.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms27041964/s1.

Author Contributions

Conceptualization, I.M.-H. and N.S.; methodology, I.M.-H. and N.S.; software, I.M.-H.; formal analysis, I.M.-H.; investigation, I.M.-H., M.A. (Manu Araolaza), I.C. and M.A. (Mikel Albizuri); resources, N.S.; data curation, I.M.-H.; writing—original draft preparation, I.M.-H.; writing—review and editing, I.M.-H., M.A. (Manu Araolaza), I.C., M.A. (Mikel Albizuri) and N.S.; visualization, I.M.-H., M.A. (Manu Araolaza), I.C. and M.A. (Mikel Albizuri); supervision, N.S.; project administration, N.S.; funding acquisition, N.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Science and Innovation, Spain, by Instituto de Salud Carlos III and funded by European Union (ERDF/ESF, “Investing in your future”, grant number PI24/01600) to N.S., Basque Government, Department of Education (IT1547-22, IT1966-26) to N.S., I.M.-H. and Manu Araolaza. The work was also supported by Researcher Fellowships from Basque Government to M.A and from UPV/EHU to I.C.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Authors consent to the availability of data and materials. The raw data has been deposited in the NCBI Sequence Read Archive (SRA) through the Gene Expression Omnibus under the accession number GEO storage: GSE292082.

Acknowledgments

The authors particularly acknowledge SGIKer resources of UPV/EHU for technical support with the computational calculations, which were carried out in the Arina informatics cluster. During the preparation of this manuscript/study, the authors used ChatGPT (OpenAI, GPT-4o) for the purposes of improving the English writing. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DMC	Differentially methylated cytosine
DMG	Differentially methylated gene
DMR	Differentially methylated region
mESC	Mouse embryonic stem cell

References

Moosavi, A.; Motevalizadeh Ardekani, A. Role of Epigenetics in Biology and Human Diseases. Iran. Biomed. J. 2016, 20, 246–258. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Evangelina, R.; Ganesan, S.; George, M. The Epigenetic Landscape: From Molecular Mechanisms to Biological Aging. Rejuvenation Res. 2025, 28, 93–112. [Google Scholar] [CrossRef] [PubMed]
Agustinus, A.S.; David, Y. Thinking outside the chromosome: Epigenetic mechanisms in non-canonical chromatin species. Nat. Struct. Mol. Biol. 2024, 31, 8–10. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Armstrong, M.J.; Jin, Y.; Allen, E.G.; Jin, P. Diverse and dynamic DNA modifications in brain and diseases. Hum. Mol. Genet. 2019, 28, R241–R253. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Schübeler, D. Function and information content of DNA methylation. Nature 2015, 517, 321–326. [Google Scholar] [CrossRef] [PubMed]
Smith, Z.D.; Meissner, A. DNA methylation: Roles in mammalian development. Nat. Rev. Genet. 2013, 14, 204–220. [Google Scholar] [CrossRef] [PubMed]
Kumar, S.; Chinnusamy, V.; Mohapatra, T. Epigenetics of Modified DNA Bases: 5-Methylcytosine and Beyond. Front. Genet. 2018, 9, 640. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Suzuki, M.; Liao, W.; Wos, F.; Johnston, A.D.; DeGrazia, J.; Ishii, J.; Bloom, T.; Zody, M.C.; Germer, S.; Greally, J.M. Whole-genome bisulfite sequencing with improved accuracy and cost. Genome Res. 2018, 28, 1364–1371. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Wu, K.J. The epigenetic roles of DNA N6-Methyladenine (6mA) modification in eukaryotes. Cancer Lett. 2020, 494, 40–46. [Google Scholar] [CrossRef] [PubMed]
Hao, X.; Luo, H.; Krawczyk, M.; Wei, W.; Wang, W.; Wang, J.; Flagg, K.; Hou, J.; Zhang, H.; Yi, S.; et al. DNA methylation markers for diagnosis and prognosis of common cancers. Proc. Natl. Acad. Sci. USA 2017, 114, 7414–7419. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Qu, Y.; Siggens, L.; Cordeddu, L.; Gaidzik, V.I.; Karlsson, K.; Bullinger, L.; Döhner, K.; Ekwall, K.; Lehmann, S.; Lennartsson, A. Cancer-specific changes in DNA methylation reveal aberrant silencing and activation of enhancers in leukemia. Blood 2017, 129, e13–e25. [Google Scholar] [CrossRef] [PubMed]
Younesian, S.; Mohammadi, M.H.; Younesian, O.; Momeny, M.; Ghaffari, S.H.; Bashash, D. DNA methylation in human diseases. Heliyon 2024, 10, e32366. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Lister, R.; Pelizzola, M.; Dowen, R.H.; Hawkins, R.D.; Hon, G.; Tonti-Filippini, J.; Nery, J.R.; Lee, L.; Ye, Z.; Ngo, Q.M.; et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 2009, 462, 315–322. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Krueger, F.; Andrews, S.R. Bismark: A flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 2011, 27, 1571–1572. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Wulfridge, P.; Langmead, B.; Feinberg, A.P.; Hansen, K.D. Analyzing whole genome bisulfite sequencing data from highly divergent genotypes. Nucleic Acids Res. 2019, 47, e117. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Stirzaker, C.; Taberlay, P.C.; Statham, A.L.; Clark, S.J. Mining cancer methylomes: Prospects and challenges. Trends Genet. 2014, 30, 75–84. [Google Scholar] [CrossRef] [PubMed]
Bock, C.; Tomazou, E.M.; Brinkman, A.B.; Müller, F.; Simmer, F.; Gu, H.; Jäger, N.; Gnirke, A.; Stunnenberg, H.G.; Meissner, A. Quantitative comparison of genome-wide DNA methylation mapping technologies. Nat. Biotechnol. 2010, 28, 1106–1114. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Smallwood, S.A.; Tomizawa, S.; Krueger, F.; Ruf, N.; Carli, N.; Segonds-Pichon, A.; Sato, S.; Hata, K.; Andrews, S.R.; Kelsey, G. Dynamic CpG island methylation landscape in oocytes and preimplantation embryos. Nat. Genet. 2011, 43, 811–814. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Sun, D.; Xi, Y.; Rodriguez, B.; Park, H.J.; Tong, P.; Meong, M.; Goodell, M.A.; Li, W. MOABS: Model based analysis of bisulfite sequencing data. Genome Biol. 2014, 15, R38. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Pedersen, B.; Hsieh, T.F.; Ibarra, C.; Fischer, R.L. MethylCoder: Software pipeline for bisulfite-treated sequences. Bioinformatics 2011, 27, 2435–2436. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Harris, E.Y.; Ponts, N.; Levchuk, A.; Roch, K.L.; Lonardi, S. BRAT: Bisulfite-treated reads analysis tool. Bioinformatics 2010, 26, 572–573, Erratum in Bioinformatics 2010, 26, 2499. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Smith, A.D.; Chung, W.Y.; Hodges, E.; Kendall, J.; Hannon, G.; Hicks, J.; Xuan, Z.; Zhang, M.Q. Updates to the RMAP short-read mapping software. Bioinformatics 2009, 25, 2841–2842. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Chen, P.Y.; Cokus, S.J.; Pellegrini, M. BS Seeker: Precise mapping for bisulfite sequencing. BMC Bioinform. 2010, 11, 203. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Huang, K.Y.Y.; Huang, Y.J.; Chen, P.Y. BS-Seeker3: Ultrafast pipeline for bisulfite sequencing. BMC Bioinform. 2018, 19, 111. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Xi, Y.; Li, W. BSMAP: Whole genome bisulfite sequence MAPping program. BMC Bioinform. 2009, 10, 232. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Wreczycka, K.; Gosdschan, A.; Yusuf, D.; Grüning, B.; Assenov, Y.; Akalin, A. Strategies for analyzing bisulfite sequencing data. J. Biotechnol. 2017, 261, 105–115. [Google Scholar] [CrossRef] [PubMed]
Akalin, A.; Kormaksson, M.; Li, S.; Garrett-Bakelman, F.E.; Figueroa, M.E.; Melnick, A.; Mason, C.E. methylKit: A comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol. 2012, 13, R87. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Assenov, Y.; Müller, F.; Lutsik, P.; Walter, J.; Lengauer, T.; Bock, C. Comprehensive analysis of DNA methylation data with RnBeads. Nat. Methods 2014, 11, 1138–1140. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Hansen, K.D.; Langmead, B.; Irizarry, R.A. BSmooth: From whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 2012, 13, R83. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Chen, Y.; Pal, B.; Visvader, J.E.; Smyth, G.K. Differential methylation analysis of reduced representation bisulfite sequencing experiments using edgeR. F1000Research 2017, 6, 2055. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Feng, H.; Conneely, K.N.; Wu, H. A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data. Nucleic Acids Res. 2014, 42, e69. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Hebestreit, K.; Dugas, M.; Klein, H.U. Detection of significantly differentially methylated regions in targeted bisulfite sequencing data. Bioinformatics 2013, 29, 1647–1653. [Google Scholar] [CrossRef] [PubMed]
Dolzhenko, E.; Smith, A.D. Using beta-binomial regression for high-precision differential methylation analysis in multifactor whole-genome bisulfite sequencing experiments. BMC Bioinform. 2014, 15, 215. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Saito, Y.; Tsuji, J.; Mituyama, T. Bisulfighter: Accurate detection of methylated cytosines and differentially methylated regions. Nucleic Acids Res. 2014, 42, e45. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Park, Y.; Figueroa, M.E.; Rozek, L.S.; Sartor, M.A. MethylSig: A whole genome DNA methylation analysis pipeline. Bioinformatics 2014, 30, 2414–2422. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Gaspar, J.M.; Hart, R.P. DMRfinder: Efficiently identifying differentially methylated regions from MethylC-seq data. BMC Bioinform. 2017, 18, 528. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Yu, X.; Sun, S. HMM-DM: Identifying differentially methylated regions using a hidden Markov model. Stat. Appl. Genet. Mol. Biol. 2016, 15, 69–81. [Google Scholar] [CrossRef] [PubMed]
Klein, H.U.; Hebestreit, K. An evaluation of methods to test predefined genomic regions for differential methylation in bisulfite sequencing data. Brief Bioinform. 2016, 17, 796–807. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; Han, Y.; Zhou, L.; Pan, X.; Sun, X.; Liu, Y.; Liang, M.; Qin, J.; Lu, Y.; Liu, P. A comprehensive evaluation of computational tools to identify differential methylation regions using RRBS data. Genomics 2020, 112, 4567–4576. [Google Scholar] [CrossRef] [PubMed]
Yu, X.; Sun, S. Comparing five statistical methods of differential methylation identification using bisulfite sequencing data. Stat. Appl. Genet. Mol. Biol. 2016, 15, 173–191. [Google Scholar] [CrossRef]
Robinson, M.D.; McCarthy, D.J.; Smyth, G.K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26, 139–140. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Wang, H.Q.; Tuominen, L.K.; Tsai, C.J. SLIM: A sliding linear model for estimating the proportion of true null hypotheses in datasets with dependence structures. Bioinformatics 2011, 27, 225–231. [Google Scholar] [CrossRef] [PubMed]
McCarthy, D.J.; Chen, Y.; Smyth, G.K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012, 40, 4288–4297. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Araolaza, M.; Muñoa-Hoyos, I.; Urizar-Arenaza, I.; Calzado, I.; Subirán, N. Chronic Morphine Treatment Leads to a Global DNA Hypomethylation via Active and Passive Demethylation Mechanisms in mESCs. Int. J. Mol. Sci. 2025, 26, 7056. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Sun, R.; Zhu, P. Advances in measuring DNA methylation. Blood Sci. 2021, 4, 8–15. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Ziller, M.J.; Gu, H.; Müller, F.; Donaghey, J.; Tsai, L.T.; Kohlbacher, O.; De Jager, P.L.; Rosen, E.D.; Bennett, D.A.; Bernstein, B.E.; et al. Charting a dynamic DNA methylation landscape of the human genome. Nature 2013, 500, 477–481. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Piao, Y.; Xu, W.; Park, K.H.; Ryu, K.H.; Xiang, R. Comprehensive Evaluation of Differential Methylation Analysis Methods for Bisulfite Sequencing Data. Int. J. Environ. Res. Public Health 2021, 18, 7975. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Pasque, V.; Karnik, R.; Chronis, C.; Petrella, P.; Langerman, J.; Bonora, G.; Song, J.; Vanheer, L.; Sadhu Dimashkie, A.; Meissner, A.; et al. X Chromosome Dosage Influences DNA Methylation Dynamics during Reprogramming to Mouse iPSCs. Stem Cell Rep. 2018, 10, 1537–1550. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Roadmap Epigenomics Consortium; Kundaje, A.; Meuleman, W.; Ernst, J.; Bilenky, M.; Yen, A.; Heravi-Moussavi, A.; Kheradpour, P.; Zhang, Z.; Wang, J.; et al. Integrative analysis of 111 reference human epigenomes. Nature 2015, 518, 317–330. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Kulis, M.; Esteller, M. DNA methylation and cancer. Adv. Genet. 2010, 70, 27–56. [Google Scholar] [CrossRef] [PubMed]
Maunakea, A.K.; Chepelev, I.; Cui, K.; Zhao, K. Intragenic DNA methylation modulates alternative splicing by recruiting MeCP2 to promote exon recognition. Cell Res. 2013, 23, 1256–1269. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Bird, A. DNA methylation patterns and epigenetic memory. Genes Dev. 2002, 16, 6–21. [Google Scholar] [CrossRef] [PubMed]
Deaton, A.M.; Bird, A. CpG islands and the regulation of transcription. Genes Dev. 2011, 25, 1010–1022. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Hansen, K.D.; Timp, W.; Bravo, H.C.; Sabunciyan, S.; Langmead, B.; McDonald, O.G.; Wen, B.; Wu, H.; Liu, Y.; Diep, D.; et al. Increased methylation variation in epigenetic domains across cancer types. Nat. Genet. 2011, 43, 768–775. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Greenberg, M.V.C.; Bourc’his, D. The diverse roles of DNA methylation in mammalian development and disease. Nat. Rev. Mol. Cell Biol. 2019, 20, 590–607. [Google Scholar] [CrossRef] [PubMed]
Hasin, Y.; Seldin, M.; Lusis, A. Multi-omics approaches to disease. Genome Biol. 2017, 18, 83. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Browne, C.J.; Godino, A.; Salery, M.; Nestler, E.J. Epigenetic Mechanisms of Opioid Addiction. Biol. Psychiatry 2020, 87, 22–33. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Walker, D.M.; Cates, H.M.; Loh, Y.E.; Purushothaman, I.; Ramakrishnan, A.; Cahill, K.M.; Lardner, C.K.; Godino, A.; Kronman, H.G.; Rabkin, J.; et al. Cocaine Self-administration Alters Transcriptome-wide Responses in the Brain’s Reward Circuitry. Biol. Psychiatry 2018, 84, 867–880. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Ballouz, S.; Pavlidis, P.; Gillis, J. Using predictive specificity to determine when gene set analysis is biologically meaningful. Nucleic Acids Res. 2017, 45, e20. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Maleki, F.; Ovens, K.; Hogan, D.J.; Kusalik, A.J. Gene Set Analysis: Challenges, Opportunities, and Future Research. Front. Genet. 2020, 11, 654. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Jaffe, A.E.; Murakami, P.; Lee, H.; Leek, J.T.; Fallin, M.D.; Feinberg, A.P.; Irizarry, R.A. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int. J. Epidemiol. 2012, 41, 200–209. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Krueger, F.; James, F.; Ewels, P.; Afyounian, E.; Schuster-Boeckler, B. FelixKrueger/TrimGalore: v0.6.2. 2019. Available online: https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/ (accessed on 26 January 2020).
Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 26 January 2020).
Granlund, T.; Stallman, R.M. Cat. 2018. Available online: https://github.com/Batch-Man/cat (accessed on 26 January 2020).
Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Langmead, B.; Wilks, C.; Antonescu, V.; Charles, R. Scaling read aligners to hundreds of threads on general-purpose processors. Bioinformatics 2019, 35, 421–432. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Krueger, F.; Kreck, B.; Franke, A.; Andrews, S.R. DNA methylome analysis using short bisulfite sequencing data. Nat. Methods 2012, 9, 145–151. [Google Scholar] [CrossRef] [PubMed]
Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
MethylDackel (v.0.5.1): A Universal Methylation Extractor for BS-Seq Experiments. 2020. Available online: https://github.com/dpryan79/MethylDackel (accessed on 26 January 2020).
Kent, W.J.; Sugnet, C.W.; Furey, T.S.; Roskin, K.M.; Pringle, T.H.; Zahler, A.M.; Haussler, D. The human genome browser at UCSC. Genome Res. 2002, 12, 996–1006. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Oliveros, J.C. Venny. An Interactive Tool for Comparing Lists with Venn’s Diagrams. 2007–2015. Available online: https://bioinfogp.cnb.csic.es/tools/venny/index.html (accessed on 26 January 2020).
Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
The Gene Ontology Consortium. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 2019, 47, D330–D338. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]

Figure 1. Distribution of DMCs across different genomic regions. (A) Column chart displaying the percentage of identified hypermethylated and hypomethylated DMCs for each tool and commonly identified overlapped sites. (B) Bar plot showing DMCs distribution around gene features in promoter (divided in ≤1 kb, 1–2 kb and 2–3 kb), 5′UTR, 3′UTR, exon (1st and others), intron (1st and others), downstream of the gene end and intergenic regions, for edgeR (left bar) and methylKit (right bar). (C) Bar plot showing DMCs distribution around different CpG-related features, in CpG island (belonging to promoter or non-promoter region + 1 kb from TSS), shore (<2 kb), shelf (<4 kb) and open sea (rest of the genome) for edgeR (left bar) and methylKit (right bar).

Figure 2. Identification of biologically relevant results. (A) Column chart displaying the percentage of only hypermethylated, only hypomethylated and hyper-/hypomethylated genes for each tool and commonly identified overlapped sites. (B) Total number of DMCs identified before and after the applied threshold for each tool (x < −2 and x > +2 fold change for edgeR and x < −25 and x > +25 percentage for methylKit), Venn diagram showing the overlap between common DMCs sites and a percentage column chart displaying hyper- and hypomethylated DMCs proportions. (C) Total number of DMGs identified before and after the applied threshold for each tool (x < −2 and x > +2 fold change for edgeR and x < −25 and x > +25 percentage for methylKit), Venn diagram showing the overlap between common genes and a functional enrichment analysis showing the most indicative biological functions of the specific genes annotated from each tool. Statistical analyses Bonferroni corrected for p < 0.05.

Figure 3. DNA methylation analysis workflow using bisulfite sequencing data. A summary of the protocol followed and file types is shown in a flow chart.

Table 1. Comparison of identified DMCs between edgeR and methylKit. Total hypermethylated and hypomethylated DMCs are specified for edgeR and methylKit, together with common sites identified by both tools.

	edgeR	methylKit	$\bar{x}$	σ	C_V (%)	Common DMCs
Identified total DMCs	203,337	223,280	213,308.5	9971.5	4.67	153,394 (56.14%)
Hypermethylated DMCs	60,704	67,054	63,879	3175	4.97	44,641 (53.71%)
Hypomethylated DMCs	142,633	156,226	149,429.5	6796.5	4.55	108,753 (57.21%)

Table 2. Comparison of identified gene features between edgeR and methylKit. DMCs belonging to promoter (divided in ≤1 kb, 1–2 kb and 2–3 kb), 5′UTR, 3′UTR, exon (1st and others), intron (1st and others), downstream of the gene end and intergenic regions are specified for edgeR and methylKit, together with common sites identified by both tools.

	edgeR	methylKit	$\bar{x}$	σ	C_V (%)	Common DMCs
Promoter (≤1 kb)	14,015	14,974	14,494.5	479.5	3.31	10,324 (55.31%)
Promoter (1–2 kb)	11,216	12,140	11,678	462	3.96	8412 (56.29%)
Promoter (2–3 kb)	9586	10,585	10,085.5	499.5	4.95	7295 (56.66%)
5′UTR	129	149	139	10	7.19	100 (56.18%)
3′UTR	4460	4920	4690	230	4.9	3403 (56.93%)
Exon	9146	10,024	9585	439	4.58	6909 (56.35%)
Intron	83,043	92,460	87,751.5	4708.5	5.37	63,657 (56.91%)
Downstream region	2284	2540	2412	128	5.31	1699 (54.37%)
Distal intergenic region	69,458	75,488	72,473	3015	4.16	51,595 (55.27%)

Table 3. Comparison of identified CpG-related features between edgeR and methylKit. DMCs belonging to CGI, shore (<2 kb), shelf (<4 kb) and open sea (rest of the genome) regions are specified for edgeR and methylKit, together with common sites identified by both tools.

	edgeR	methylKit	$\bar{x}$	σ	C_V (%)	Common DMCs
CGI	985	849	917	68	7.42	566 (44.64%)
Shore	9638	9888	9763	125	1.28	6839 (53.91%)
Shelf	8585	9495	9040	455	5.03	6604 (57.55%)
Open Sea	184,129	203,048	193,588.5	9459.5	4.89	139,385 (56.25%)

Table 4. Comparison of identified differentially methylated genes between edgeR and methylKit. Total, hypermethylated, hypomethylated and hyper-/hypomethylated genes are specified for edgeR and methylKit, together with common sites identified by both tools.

	edgeR	methylKit	$\bar{x}$	σ	C_V (%)	Common Genes
Identified total Genes	17,657	17,772	17,714.5	57.5	0.32	16,357 (87.8%)
Hypermethylated Genes	13,128	13,426	13,277	149	1.12	11,594 (80%)
Hypomethylated Genes	16,313	16,429	16,371	58	0.35	14,954 (86.2%)
Hyper- and Hypomethylated Genes	11,783	12,081	11,932	149	1.25	10,191 (74.53%)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Muñoa-Hoyos, I.; Araolaza, M.; Calzado, I.; Albizuri, M.; Subirán, N. Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation. Int. J. Mol. Sci. 2026, 27, 1964. https://doi.org/10.3390/ijms27041964

AMA Style

Muñoa-Hoyos I, Araolaza M, Calzado I, Albizuri M, Subirán N. Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation. International Journal of Molecular Sciences. 2026; 27(4):1964. https://doi.org/10.3390/ijms27041964

Chicago/Turabian Style

Muñoa-Hoyos, Iraia, Manu Araolaza, Irune Calzado, Mikel Albizuri, and Nerea Subirán. 2026. "Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation" International Journal of Molecular Sciences 27, no. 4: 1964. https://doi.org/10.3390/ijms27041964

APA Style

Muñoa-Hoyos, I., Araolaza, M., Calzado, I., Albizuri, M., & Subirán, N. (2026). Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation. International Journal of Molecular Sciences, 27(4), 1964. https://doi.org/10.3390/ijms27041964

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation

Abstract

1. Introduction

2. Results

2.1. Comparative Assessment of DMC Detection Tools

2.2. Similarity Analysis of DMC Detection Tools

2.3. Similarity Analysis of DMG Detection Tools

3. Discussion

4. Materials and Methods

4.1. DNA Processing and Sequencing Workflow

4.2. Pre-Processing

4.3. Cross-Platform Analysis of m5C Sequencing Data

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI