Experimental and Bioinformatic Approaches to Studying DNA Methylation in Cancer
Abstract
Simple Summary
Abstract
1. Introduction
2. DNA Methylation Assays
2.1. Single-Cell and Single-Cell Multi-Omics Approaches
2.2. Cell-Free Circulating Tumour DNA (ct) from Liquid Biopsies
3. Processing of DNA Methylation Data
4. Analysis of DNA Methylation
4.1. Exploratory Data Analysis and Sparse Data
4.2. Deconvolution of Cellular Heterogeneity and Estimating Tumour Purity
5. DNA Methylation Signatures
5.1. Differential Methylation
5.2. Methylome Segmentation and the DNA Methylation Landscape
6. Downstream Analysis: Interpretation and Application of DNA Methylation Signatures for Research and Clinics
7. Conclusion and Remaining Challenges
Author Contributions
Funding
Conflicts of Interest
References
- Berdasco, M.; Esteller, M. Aberrant Epigenetic Landscape in Cancer: How Cellular Identity Goes Awry. Dev. Cell 2010, 19, 698–711. [Google Scholar] [CrossRef] [PubMed]
- Esteller, M. Cancer epigenomics: DNA methylomes and histone-modification maps. Nat. Rev. Genet. 2007, 8, 286–298. [Google Scholar] [CrossRef] [PubMed]
- Berdasco, M.; Esteller, M. Clinical epigenetics: Seizing opportunities for translation. Nat. Rev. Genet. 2019, 20, 109–127. [Google Scholar] [CrossRef] [PubMed]
- Esteller, M. CpG island hypermethylation and tumor suppressor genes: A booming present, a brighter future. Oncogene 2002, 21, 5427–5440. [Google Scholar] [CrossRef] [PubMed]
- Deaton, A.M.; Bird, A. CpG islands and the regulation of transcription. Genes Dev. 2011, 25, 1010–1022. [Google Scholar] [CrossRef]
- Lister, R.; Pelizzola, M.; Kida, Y.S.; Hawkins, R.D.; Nery, J.R.; Hon, G.; Antosiewicz-Bourget, J.; O’Malley, R.; Castanon, R.; Klugman, S.; et al. Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells. Nature 2011, 471, 68–73. [Google Scholar] [CrossRef]
- Stadler, M.B.; Murr, R.; Burger, L.; Ivanek, R.; Lienert, F.; Schöler, A.; van Nimwegen, E.; Wirbelauer, C.; Oakeley, E.J.; Gaidatzis, D.; et al. DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature 2011, 480, 490–495. [Google Scholar] [CrossRef]
- Kulis, M.; Merkel, A.; Heath, S.; Queirós, A.C.; Schuyler, R.P.; Castellano, G.; Beekman, R.; Raineri, E.; Esteve, A.; Clot, G.; et al. Whole-genome fingerprint of the DNA methylome during human B cell differentiation. Nat. Genet. 2015, 47, 746–756. [Google Scholar] [CrossRef]
- Gaidatzis, D.; Burger, L.; Murr, R.; Lerch, A.; Dessus-Babus, S.; Schübeler, D.; Stadler, M.B. DNA Sequence Explains Seemingly Disordered Methylation Levels in Partially Methylated Domains of Mammalian Genomes. PLoS Genet. 2014, 10, e1004143. [Google Scholar] [CrossRef][Green Version]
- Sandoval, J.; Heyn, H.A.; Moran, S.; Serra-Musach, J.; Pujana, M.A.; Bibikova, M.; Esteller, M. Validation of a DNA methylation microarray for 450,000 CpG sites in the human genome. Epigenetics 2011, 6, 692–702. [Google Scholar] [CrossRef]
- Moran, S.; Arribas, C.; Esteller, M. Validation of a DNA methylation microarray for 850,000 CpG sites of the human genome enriched in enhancer sequences. Epigenomics 2016, 8, 389–399. [Google Scholar] [CrossRef] [PubMed]
- Moran, S.; Esteller, M. Infinium DNA Methylation Microarrays on Formalin-Fixed, Paraffin-Embedded Samples. In CpG Islands: Methods and Protocols; Vavouri, T., Peinado, M.A., Eds.; Springer: New York, NY, USA, 2018; pp. 83–107. [Google Scholar]
- Laird, P.W. Principles and challenges of genome—Wide DNA methylation analysis. Nat. Rev. Genet. 2010, 11, 191–203. [Google Scholar] [CrossRef] [PubMed]
- Bock, C. Analysing and interpreting DNA methylation data. Nat. Rev. Genet. 2012, 13, 705–719. [Google Scholar] [CrossRef] [PubMed]
- Krueger, F.; Andrews, S.R. Bismark: A flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 2011, 27, 1571–1572. [Google Scholar] [CrossRef]
- Amarasinghe, S.L.; Su, S.; Dong, X.; Zappia, L.; Ritchie, M.E.; Gouil, Q. Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 2020, 21, 1–16. [Google Scholar] [CrossRef]
- Lister, R.; Mukamel, E.A.; Nery, J.R.; Urich, M.; Puddifoot, C.A.; Johnson, N.D.; Lucero, J.; Huang, Y.; Dwork, A.J.; Schultz, M.D.; et al. Global epigenomic reconfiguration during mammalian brain development. Science 2013, 341, 1237905. [Google Scholar] [CrossRef]
- Ficz, G.; Branco, M.R.; Seisenberger, S.; Santos, F.; Krueger, F.; Hore, T.A.; Marques, C.J.; Andrews, S.; Reik, W. Dynamic regulation of 5-hydroxymethylcytosine in mouse ES cells and during differentiation. Nature 2011, 473, 398–402. [Google Scholar] [CrossRef]
- Lian, C.G.; Xu, Y.; Ceol, C.; Wu, F.; Larson, A.; Dresser, K.; Xu, W.; Tan, L.; Hu, Y.; Zhan, Q.; et al. Loss of 5-hydroxymethylcytosine is an epigenetic hallmark of Melanoma. Cell 2012, 150, 1135–1146. [Google Scholar] [CrossRef]
- Ko, M.; An, J.; Pastor, W.A.; Koralov, S.B.; Rajewsky, K.; Rao, A. TET proteins and 5-methylcytosine oxidation in hematological cancers. Immunol. Rev. 2015, 263, 6–21. [Google Scholar] [CrossRef]
- Booth, M.J.; Ost, T.W.B.; Beraldi, D.; Bell, N.M.; Branco, M.R.; Reik, W.; Balasubramanian, S. Oxidative bisulfite sequencing of 5-methylcytosine and 5-hydroxymethylcytosine. Nat. Protoc. 2013, 8, 1841–1851. [Google Scholar] [CrossRef]
- Yu, M.; Hon, G.C.; Szulwach, K.E.; Song, C.X.; Jin, P.; Ren, B.; He, C. Tet-assisted bisulfite sequencing of 5-hydroxymethylcytosine. Nat. Protoc. 2012, 7, 2159–2170. [Google Scholar] [CrossRef]
- Jain, M.; Olsen, H.E.; Paten, B.; Akeson, M. The Oxford Nanopore MinION: Delivery of nanopore sequencing to the genomics community. Genome Biol. 2016, 17. [Google Scholar] [CrossRef]
- Li, W.; Ye, Z.; Wan, S.; Liu, H.; Zhang, J.; Xie, S.; Xu, J. Cancer biomarkers discovery of methylation modification with direct high-throughput nanopore qequencing. Front. Genet. 2021, 12, 672804. [Google Scholar] [CrossRef]
- Yuen, Z.W.S.; Srivastava, A.; Daniel, R.; McNevin, D.; Jack, C.; Eyras, E. Systematic benchmarking of tools for CpG methylation detection from nanopore sequencing. Nat. Commun. 2021, 12, 3438. [Google Scholar] [CrossRef]
- Sakamoto, Y.; Zaha, S.; Nagasawa, S.; Miyake, S.; Kojima, Y.; Suzuki, A.; Suzuki, Y. Long-read whole-genome methylation patterning using enzymatic base conversion and nanopore sequencing. Nucleic Acids Res. 2021, 49, e81. [Google Scholar] [CrossRef]
- Guo, H.; Zhu, P.; Wu, X.; Li, X.; Wen, L.; Tang, F. Single-Cell methylome landscapes of mouse embryonic stem cells and early embryos analyzed using reduced representation bisulfite sequencing. Genome Res. 2013, 23, 2126–2135. [Google Scholar] [CrossRef]
- Wang, K.; Li, X.; Dong, S.; Liang, J.; Mao, F.; Zeng, C.; Wu, H.; Wu, J.; Cai, W.; Sun, Z.S. Q-RRBS: A quantitative reduced representation bisulfite sequencing method for single-cell methylome analyses. Epigenetics 2015, 10, 775–783. [Google Scholar] [CrossRef]
- Smallwood, S.A.; Lee, H.J.; Angermueller, C.; Krueger, F.; Saadeh, H.; Peat, J.; Andrews, S.R.; Stegle, O.; Reik, W.; Kelsey, G. Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity. Nat. Methods 2014, 27. [Google Scholar] [CrossRef]
- Farlik, M.; Sheffield, N.C.; Nuzzo, A.; Datlinger, P.; Schönegger, A.; Klughammer, J.; Bock, C. Single-Cell DNA Methylome Sequencing and Bioinformatic Inference of Epigenomic Cell-State Dynamics. Cell Rep. 2015, 10, 1386–1397. [Google Scholar] [CrossRef]
- Luo, C.; Keown, C.L.; Kurihara, L.; Zhou, J.; He, Y.; Li, J.; Castanon, R.; Lucero, J.; Nery, J.R.; Sandoval, J.P.; et al. Single-cell methylomes identify neuronal subtypes and regulatory elements in mammalian cortex. Sci. 2017, 357, 600–604. [Google Scholar] [CrossRef]
- Mulqueen, R.M.; Pokholok, D.; Norberg, S.J.; Torkenczy, K.A.; Fields, A.J.; Sun, D.; Sinnamon, J.R.; Shendure, J.; Trapnell, C.; O’Roak, B.J.; et al. Highly scalable generation of DNA methylation profiles in single cells. Nat. Biotechnol. 2018, 36, 428–431. [Google Scholar] [CrossRef]
- Hu, Y.; Huang, K.; An, Q.; Du, G.; Hu, G.; Xue, J.; Zhu, X.; Wang, C.Y.; Xue, Z.; Fan, G. Simultaneous profiling of transcriptome and DNA methylome from a single cell. Genome Biol. 2016, 17, 1–11. [Google Scholar] [CrossRef]
- Angermueller, C.; Clark, S.J.; Lee, H.J.; Macaulay, I.C.; Teng, M.J.; Hu, T.X.; Krueger, F.; Smallwood, S.A.; Ponting, C.P.; Voet, T.; et al. Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity. Nat. Methods 2016, 13, 229–232. [Google Scholar] [CrossRef]
- Hou, Y.; Guo, H.; Cao, C.; Li, X.; Hu, B.; Zhu, P.; Wu, X.; Wen, L.; Tang, F.; Huang, Y.; et al. Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas. Cell Res. 2016, 26, 304–319. [Google Scholar] [CrossRef]
- Pott, S. Simultaneous measurement of chromatin accessibility, DNA methylation, and nucleosome phasing in single cells. elife 2017, 6, 1–19. [Google Scholar] [CrossRef]
- Clark, S.J.; Argelaguet, R.; Kapourani, C.A.; Stubbs, T.M.; Lee, H.J.; Alda-Catalinas, C.; Krueger, F.; Sanguinetti, G.; Kelsey, G.; Marioni, J.C.; et al. ScNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells e. Nat. Commun. 2018, 9, 1–9. [Google Scholar] [CrossRef]
- Karemaker, I.D.; Vermeulen, M. Single-Cell DNA Methylation Profiling: Technologies and Biological Applications. Trends Biotechnol. 2018, 36, 952–965. [Google Scholar] [CrossRef]
- Angeles, A.K.; Janke, F.; Bauer, S.; Christopoulos, P.; Riediger, A.L.; Sültmann, H. Liquid biopsies beyond mutation calling: Genomic and epigenomic features of cell-free dna in cancer. Cancers 2021, 13, 5615. [Google Scholar] [CrossRef]
- Fettke, H.; Kwan, E.M.; Azad, A.A. Cell-free DNA in cancer: Current insights. Cell Oncol. 2019, 42, 13–28. [Google Scholar] [CrossRef]
- Moss, J.; Magenheim, J.; Neiman, D.; Zemmour, H.; Loyfer, N.; Korach, A.; Samet, Y.; Maoz, M.; Druid, H.; Arner, P.; et al. Comprehensive human cell-type methylation atlas reveals origins of circulating cell-free DNA in health and disease. Nat. Commun. 2018, 9, 5068. [Google Scholar] [CrossRef]
- Shen, S.Y.; Singhania, R.; Fehringer, G.; Chakravarthy, A.; Roehrl, M.H.A.; Chadwick, D.; Zuzarte, P.C.; Borgida, A.; Wang, T.T.; Li, T.; et al. Sensitive tumour detection and classification using plasma cell-free DNA methylomes. Nature 2018, 563, 579–583. [Google Scholar] [CrossRef] [PubMed]
- Xi, Y.; Li, W. BSMAP: Whole genome bisulfite sequence MAPping program. BMC Bioinform. 2009, 10, 1–9. [Google Scholar] [CrossRef] [PubMed]
- Chen, P.-Y.; Cokus, S.J.; Pellegrini, M. Open Access SOFTWARE Software BS Seeker: Precise mapping for bisulfite sequencing. BMC Bioinform. 2010, 11, 2–7. [Google Scholar] [CrossRef] [PubMed]
- Merkel, A.; Fernández-Callejo, M.; Casals, E.; Marco-Sola, S.; Schuyler, R.; Gut, I.G.; Heath, S.C. GemBS: High throughput processing for DNA methylation data from bisulfite sequencing. Bioinformatics 2019. [Google Scholar] [CrossRef]
- Nunn, A.; Otto, C.; Stadler, P.F.; Langenberger, D. Erratum to: Comprehensive benchmarking of software for mapping whole genome bisulfite data: From read alignment to DNA methylation analysis. Brief. Bioinform. 2021, 22, 1–9. [Google Scholar] [CrossRef]
- Simons, A. FastQC: A Quality Control Tool for High Throughput Sequencing Data. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 25 November 2021).
- Krueger, F. Trim Galore. Available online: https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/ (accessed on 25 November 2021).
- GitHub. Picard Tools. Available online: https://broadinstitute.github.io/picard/ (accessed on 25 November 2021).
- Liu, Y.; Siegmund, K.D.; Laird, P.W.; Berman, B.P. Bis-SNP: Combined DNA methylation and SNP calling for Bisulfite-seq data. Genome Biol. 2012, 13, R61. [Google Scholar] [CrossRef]
- Van der Auwera, G.; O’Connor, B. Genomics in the Cloud, 1st ed.; O’Reilly Medi, Inc.: Newton, MS, USA, 2020. [Google Scholar]
- Barturen, G.; Rueda, A.; Oliver, J.L.; Hackenberg, M. MethylExtract: High-Quality methylation maps and SNV calling from whole genome bisulfite sequencing data. F1000Research 2013, 2, 1–23. [Google Scholar] [CrossRef]
- Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef]
- Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
- Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef]
- Lienhard, M.; Grimm, C.; Morkel, M.; Herwig, R.; Chavez, L. MEDIPS: Genome-wide differential coverage analysis of sequencing data derived from DNA enrichment experiments. Bioinformatics 2014, 30, 284–286. [Google Scholar] [CrossRef]
- Stark, R.; Brown, R. DiffBind: Differential Binding Analysis of ChIP-Seq Peak Data. 2011. Available online: http://bioconductor.org/packages/release/bioc/vignettes/DiffBind/inst/doc/DiffBind.pdf (accessed on 25 November 2021).
- Lienhard, M.; Grasse, S.; Rolff, J.; Frese, S.; Schirmer, U.; Becker, M.; Börno, S.; Timmermann, B.; Chavez, L.; Sültmann, H.; et al. QSEA-modelling of genome-wide DNA methylation from sequencing enrichment experiments. Nucleic Acids Res. 2017, 45, e44. [Google Scholar] [CrossRef]
- Shabalin, A.A.; Hattab, M.W.; Clark, S.L.; Chan, R.F.; Kumar, G.; Aberg, K.A.; van den Oord, E.J.C.G. RaMWAS: Fast methylome-wide association study pipeline for enrichment platforms. Bioinformatics 2018, 34, 2283–2285. [Google Scholar] [CrossRef]
- Aryee, M.J.; Jaffe, A.E.; Corrada-Bravo, H.; Ladd-Acosta, C.; Feinberg, A.P.; Hansen, K.D.; Irizarry, R.A. Minfi: A flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 2014, 30, 1363–1369. [Google Scholar] [CrossRef]
- Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
- Pidsley, R.; Wong, C.C.Y.; Volta, M.; Lunnon, K.; Mill, J.; Schalkwyk, L.C. A data-driven approach to preprocessing Illumina 450 K methylation array data. BMC Genom. 2013, 14, 293. [Google Scholar] [CrossRef]
- Merkel, A.; Heath, S.C. DNA Methylation Assays Using Bisulphite Sequencing and Next-Generation Sequencing. In Data Analysis for Omics Science: Methos and Applications; Jaumot, J., Bedia, C., Taula, R., Eds.; Elsvier: Oxford, UK, 2018; pp. 108–137. [Google Scholar]
- Rodriguez, B.A.T.; Frankhouser, D.; Murphy, M.; Trimarchi, M.; Tam, H.H.; Curfman, J.; Huang, R.; Chan, M.W.Y.; Lai, H.C.; Parikh, D.; et al. Methods for high-throughput MethylCap-Seq data analysis. BMC Genom. 2012, 13, 1–11. [Google Scholar] [CrossRef]
- Wilhelm-Benartzi, C.S.; Koestler, D.C.; Karagas, M.R.; Flanagan, J.M.; Christensen, B.C.; Kelsey, K.T.; Marsit, C.J.; Houseman, E.A.; Brown, R. Review of processing and analysis methods for DNA methylation array data. Br. J. Cancer 2013, 109, 1394–1402. [Google Scholar] [CrossRef]
- Rappoport, N.; Shamir, R. Multi-omic and multi-view clustering algorithms: Review and cancer benchmark. Nucleic Acids Res. 2018, 46, 10546–10562. [Google Scholar] [CrossRef]
- Chauvel, C.; Novoloaca, A.; Veyre, P.; Reynier, F.; Becker, J. Evaluation of integrative clustering methods for the analysis of multi-omics data. Brief. Bioinform. 2020, 21, 541–552. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing; Vienna, Austria. 2013. Available online: https://www.R-project.org/ (accessed on 25 November 2021).
- Venables, W.N.; Ripley, B.D. Modern Applied Statistics with S, Fourth; Springer: New York, NY, USA, 2002. [Google Scholar]
- Krijthe, J.H. Rtsne: T-Distributed Stochastic Neighbor Embedding Using a Barnes-Hut Implementation. 2015. Available online: https://github.com/jkrijthe/Rtsne (accessed on 25 November 2021).
- Gaujoux, R.; Seoighe, C. A flexible R package for nonnegative matrix factorization. BMC Bioinform. 2010, 11, 367. [Google Scholar] [CrossRef]
- Derrien, T.; Johnson, R.; Bussotti, G.; Tanzer, A.; Djebali, S.; Tilgner, H.; Guernec, G.; Martin, D.; Merkel, A.; Knowles, D.G.; et al. The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression. Genome Res. 2012, 22, 1775–1789. [Google Scholar] [CrossRef]
- Maechler, M.; Rousseeuw, P.; Struyf, A.; Hubert, M.; Hornik, K. Cluster: Cluster Analysis Basics and Extensions. R Package Version 2.1.2. Available online: https://cran.r-project.org/web/packages/cluster/cluster.pdf (accessed on 25 November 2021).
- Hansen, K.D.; Langmead, B.; Irizarry, R.A.; Hansen, K.; Timp, W.; Bravo, H.C.; Sabunciyan, S.; Langmead, B.; McDonald, O.; Wen, B.; et al. BSmooth: From whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 2012, 13, R83. [Google Scholar] [CrossRef]
- Kapourani, C.A.; Sanguinetti, G. Melissa: Bayesian clustering and imputation of single cell methylomes. bioRxiv 2018, 8, 1–15. [Google Scholar] [CrossRef]
- de Souza, C.P.E.; Andronescu, M.; Masud, T.; Kabeer, F.; Biele, J.; Laks, E.; Lai, D.; Ye, P.; Brimhall, J.; Wang, B.; et al. Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data. PLoS Comput. Biol. 2020, 16, e1008270. [Google Scholar] [CrossRef]
- Angermueller, C.; Lee, H.J.; Reik, W.; Stegle, O. DeepCpG: Accurate prediction of single-cell DNA methylation states using deep learning. Genome Biol. 2017, 18, 1–13. [Google Scholar] [CrossRef]
- Teschendorff, A.E.; Relton, C.L. Statistical and integrative system-level analysis of DNA methylation data. Nat. Rev. Genet. 2018, 19, 129–147. [Google Scholar] [CrossRef]
- Teschendorff, A.E.; Zheng, S.C. Cell-type deconvolution in epigenome-wide association studies: A review and recommendations. Epigenomics 2017, 9, 757–768. [Google Scholar] [CrossRef]
- Teschendorff, A.E.; Zhu, T.; Breeze, C.E.; Beck, S. EPISCORE: Cell type deconvolution of bulk tissue DNA methylomes from single-cell RNA-Seq data. Genome Biol. 2020, 21, 1. [Google Scholar] [CrossRef]
- Leek, J.; Johnson, W.; Parker, H.; Fertig, E.; Jaffe, A.; Storey, J.; Zhang, Y.; Torres, L. SVA: Surrogate Variable Analysis. Available online: https://bioconductor.org/packages/release/bioc/html/sva.html (accessed on 25 November 2021).
- Gagnon-Bartsch, J.A. Ruv: Detect and Remove Unwanted Variation Using Negative Controls. Available online: http://www-personal.umich.edu/~johanngb/ruv/ (accessed on 25 November 2021).
- Phipson, B.; Maksimovic, J.; Oshlack, A. missMethyl: An R package for analyzing data from Illumina’s HumanMethylation450 platform. Bioinformatics 2016, 32, 286–288. [Google Scholar] [CrossRef]
- Teschendorff, A.E.; Breeze, C.E.; Zheng, S.C.; Beck, S. A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies. BMC Bioinform. 2017, 18. [Google Scholar] [CrossRef] [PubMed]
- Chakravarthy, A.; Furness, A.; Joshi, K.; Ghorani, E.; Ford, K.; Ward, M.J.; King, E.V.; Lechner, M.; Marafioti, T.; Quezada, S.A.; et al. Pan-cancer deconvolution of tumour composition using DNA methylation. Nat. Commun. 2018, 9. [Google Scholar] [CrossRef] [PubMed]
- Arneson, D.; Yang, X.; Wang, K. MethylResolver—A method for deconvoluting bulk DNA methylation profiles into known and unknown cell contents. Commun. Biol. 2020, 3, 1–13. [Google Scholar] [CrossRef] [PubMed]
- Li, W.; Li, Q.; Kang, S.; Same, M.; Zhou, Y.; Sun, C.; Liu, C.C.; Matsuoka, L.; Sher, L.; Wong, W.H.; et al. CancerDetector: Ultrasensitive and non-invasive cancer detection at the resolution of individual reads using cell-free DNA methylation sequencing data. Nucleic Acids Res. 2018, 46, e89. [Google Scholar] [CrossRef]
- Akalin, A.; Kormaksson, M.; Li, S.; Garrett-bakelman, F.E.; Figueroa, M.E.; Melnick, A.; Mason, C.E. MethylKit: A comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol. 2012, 13, R87. [Google Scholar] [CrossRef]
- Scherer, M.; Nebel, A.; Franke, A.; Walter, J.; Lengauer, T.; Bock, C.; Müller, F.; List, M. Quantitative comparison of within-sample heterogeneity scores for DNA methylation data. Nucleic Acids Res. 2021, 48. [Google Scholar] [CrossRef]
- Park, Y.; Figueroa, M.E.; Rozek, L.S.; Sartor, M.A. MethylSig: A whole genome DNA methylation analysis pipeline. Bioinformatics 2014, 30, 2414–2422. [Google Scholar] [CrossRef]
- Park, Y.; Wu, H. Differential methylation analysis for BS-seq data under general experimental design. Bioinformatics 2016, 32, 1446–1453. [Google Scholar] [CrossRef]
- Assenov, Y.; Müller, F.; Lutsik, P.; Walter, J.; Lengauer, T.; Bock, C. Comprehensive analysis of DNA methylation data with RnBeads. Nat. Methods 2014, 11, 1138–1140. [Google Scholar] [CrossRef]
- Robinson, M.D.; McCarthy, D.J.; Smyth, G.K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26, 139–140. [Google Scholar] [CrossRef]
- Peters, T.J.; Buckley, M.J.; Statham, A.L.; Pidsley, R.; Samaras, K.; V Lord, R.; Clark, S.J.; Molloy, P.L. De novo identification of differentially methylated regions in the human genome. Epigenetics and Chromatin 2015, 8, 1–16. [Google Scholar] [CrossRef]
- Suderman, M.; Staley, J.R.; French, R.; Arathimos, R.; Simpkin, A.; Tilling, K. Dmrff: Identifying differentially methylated regions efficiently with power and control. bioRxiv 2018, 508556. [Google Scholar] [CrossRef]
- Chen, Y.; Pal, B.; Visvader, J.E.; Smyth, G.K. Differential methylation analysis of reduced representation bisulfite sequencing experiments using edgeR. F1000Research 2018, 6, 2055. [Google Scholar] [CrossRef]
- Feng, H.; Conneely, K.N.; Wu, H. A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data. Nucleic Acids Res. 2014, 42. [Google Scholar] [CrossRef]
- Jaffe, A.E.; Murakami, P.; Lee, H.; Leek, J.T.; Fallin, M.D.; Feinberg, A.P.; Irizarry, R.A. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int. J. Epidemiol. 2012, 41, 200–209. [Google Scholar] [CrossRef]
- Song, Q.; Decato, B.; Hong, E.E.; Zhou, M.; Fang, F.; Qu, J.; Garvin, T.; Kessler, M.; Zhou, J.; Smith, A.D. A reference methylome database and analysis pipeline to facilitate integrative and comparative epigenomics. PLoS ONE 2013, 8, e81148. [Google Scholar] [CrossRef]
- Xie, W.; Schultz, M.; Lister, R.; Hou, Z.; Rajagopal, N.; Ray, P.; Whitaker, J.; Tian, S.; Hawkins, R.D.; Leung, D.; et al. Epigenomic Analysis of Multilineage Differentiation of Human Embryonic Stem Cells. Cell 2013, 153, 1134–1148. [Google Scholar] [CrossRef]
- Jeong, M.; Sun, D.; Luo, M.; Huang, Y.; Challen, G.A.; Rodriguez, B.; Zhang, X.; Chavez, L.; Wang, H.; Hannah, R.; et al. Large conserved domains of low DNA methylation maintained by Dnmt3a. Nat. Genet. 2013, 46, 17–23. [Google Scholar] [CrossRef]
- Zhao, S.G.; Chen, W.S.; Li, H.; Foye, A.; Zhang, M.; Sjöström, M.; Aggarwal, R.; Playdle, D.; Liao, A.; Alumkal, J.J.; et al. The DNA methylation landscape of advanced prostate cancer. Nat. Genet. 2020, 52, 778–789. [Google Scholar] [CrossRef]
- Burger, L.; Gaidatzis, D.; Schübeler, D.; Stadler, M.B. Identification of active regulatory regions from DNA methylation data. Nucleic Acids Res. 2013, 41, e155. [Google Scholar] [CrossRef]
- Timp, W.; Bravo, H.C.; McDonald, O.G.; Goggins, M.; Umbricht, C.; Zeiger, M.; Feinberg, A.P.; Irizarry, R. Large hypomethylated blocks as a universal defining epigenetic alteration in human solid tumors. Genome Med. 2014, 6, 61. [Google Scholar] [CrossRef]
- Subramanian, I.; Verma, S.; Kumar, S.; Jere, A.; Anamika, K. Multi-omics Data Integration, Interpretation, and Its Application. Bioinform. Biol. Insights 2020, 14, 1177932219899051. [Google Scholar] [CrossRef]
- Capper, D.; Jones, D.T.W.; Sill, M.; Hovestadt, V.; Schrimpf, D.; Sturm, D.; Koelsche, C.; Sahm, F.; Chavez, L.; Reuss, D.E.; et al. DNA methylation-based classification of central nervous system tumours. Nature 2018, 555, 469–474. [Google Scholar] [CrossRef]
- Koelsche, C.; Schrimpf, D.; Stichel, D.; Sill, M.; Sahm, F.; Reuss, D.E.; Blattner, M.; Worst, B.; Heilig, C.E.; Beck, K.; et al. Sarcoma classification by DNA methylation profiling. Nat. Commun. 2021, 12. [Google Scholar] [CrossRef]
- Moran, S.; Martínez-Cardús, A.; Sayols, S.; Musulén, E.; Balañá, C.; Estival-Gonzalez, A.; Moutinho, C.; Heyn, H.; Diaz-Lagares, A.; de Moura, M.C.; et al. Epigenetic profiling to classify cancer of unknown primary: A multicentre, retrospective analysis. Lancet Oncol. 2016, 17, 1386–1395. [Google Scholar] [CrossRef]
- Duruisseaux, M.; Martínez-Cardús, A.; Calleja-Cervantes, M.E.; Moran, S.; Castro de Moura, M.; Davalos, V.; Piñeyro, D.; Sanchez-Cespedes, M.; Girard, N.; Brevet, M.; et al. Epigenetic prediction of response to anti-PD-1 treatment in non-small-cell lung cancer: A multicentre, retrospective analysis. Lancet Respir. Med. 2018, 6, 771–781. [Google Scholar] [CrossRef]
- Garcia-Prieto, C.A.; Villanueva, L.; Bueno-Costa, A.; Davalos, V.; González-Navarro, E.A.; Juan, M.; Urbano-Ispizua, Á.; Delgado, J.; Ortiz-Maldonado, V.; del Bufalo, F.; et al. Epigenetic Profiling and Response to CD19 Chimeric Antigen Receptor T-Cell Therapy in B-Cell Malignancies. JNCI J. Natl. Cancer Inst. 2021, 1–10. [Google Scholar] [CrossRef]
- Pajtler, K.W.; Witt, H.; Sill, M.; Jones, D.T.W.; Hovestadt, V.; Kratochwil, F.; Wani, K.; Tatevossian, R.; Punchihewa, C.; Johann, P.; et al. Molecular Classification of Ependymal Tumors across All CNS Compartments, Histopathological Grades, and Age Groups. Cancer Cell 2015, 27, 728–743. [Google Scholar] [CrossRef]

| Description | Software | Bulk BS-Seq | scBS-Seq | AE-Seq | BS- Arrays | Ref | 
|---|---|---|---|---|---|---|
| Quality control | FastQC | yes | yes | yes | [47] | |
| Adapter/end-base trimming | TrimGalore | yes | yes | [48] | ||
| BS-aware read alignment | BISMARK, BS Seeker2, gemBS, BSMAP | yes | yes | [15,43,44,45] | ||
| Remove PCR duplicates | PicardTools | yes | yes | yes | [49] | |
| Variant calling | gemBS, Bis-SNP, GATK | yes | [45,50,51] | |||
| Methylation calling | BISMARK, Bis-SNP, gemBS, MethylExtract | yes | yes | [15,45,50,52] | ||
| standard read alignment | bowtie2, BWA | yes | [53,54] | |||
| Normalization | DESeq2, MEDIPS, Diffbind | yes | [55,56,57] | |||
| Enrichment analysis | QSEA, RaMWAS, Diffbind | yes | [57,58,59] | |||
| Quality control | minfi, limma, wateRmelon | yes | [60,61,62] | |||
| Normalization | minfi, limma, wateRmelon | yes | [60,61,62] | |||
| Methylation calling (bvalues, mvalues) | minfi, wateRmelon | yes | [60,62] | 
| Process | Description | Method | Software | BulkBS-Seq | scBS-Seq | AE-Seq | BS-Arrays | Ref | 
|---|---|---|---|---|---|---|---|---|
| Visualization | Variance decomposition | PCA | R | yes | yes | yes | yes | [68] | 
| Dimensionality reduction | MDS, t-SNE, NMF | MASS, stats, Rtsne, NMF | yes | yes | (yes) | (yes) | [69,70,71] | |
| Clustering | Clustering (nearest neighbour) | k-means | ||||||
| Hierarchical clustering (un-/supervised) | hclust() | stats, cluster, | yes | yes | yes | yes | [72,73] | |
| Imputation of missing data | Based on local spatial methylation correlation | Local likelihood smoothing | BSmooth | yes | (yes) | [74] | ||
| Based on local spatial methylation correlations within and across cells and different genomic regions | glm, Bayesian clustering | Melissa | yes | [75] | ||||
| Based on local spatial methylation correlations within and across cells and different genomic regions | Bayesian clustering, hierarchical mixture model | Epiclonal | yes | yes | [76] | |||
| Based on neighbouring CpG correlation and sequence composition | Deep neural network | DeepCpG | yes | [77] | 
| Task | Class | Method | Software | Bulk BS-Seq | scBS-Seq | BS-Arrays | Ref | 
|---|---|---|---|---|---|---|---|
| Remove unwanted variation (including batch effects) | Reference-free | Surrogate and independent surrogate variable analysis | SVA | yes | yes | [81] | |
| Remove unwanted variation | RUV, missMethyl | yes | [82,83] | ||||
| Intra-sample cell type deconvolution | Reference-free, semi-reference-free | NMF using recursive QP | RefFreeEWAS | yes | yes | ||
| Reference based | Robust partial correlations, CIBERSORT, Houseman CP, COMBAT | HEpiDISH/EpiDISH | yes | [84] | |||
| CIBERSORT | METHYLCIBERSORT | yes | yes | [85] | |||
| Reference based using scRNAseq | EPISCORE | yes | |||||
| Estimate immune cell fraction in tumours | Reference based | MethylResolveR | [86] | ||||
| Inference of tumour burden and tissue of origin from plasma cfDNA | CancerDetector | yes | |||||
| Estimate tumour purity from plasma cf-DNA | Reference-free | Concordance of neighbouring CpGs | CancerDetector | [87] | |||
| Estimate epipolymorphism, methylation entropy, clonal heterogeneity | Reference-free | Epiallele frequency | WSH | yes | (yes) | [88] | 
| Type | Method | Distribution | Software | Bulk BS-Seq | scBS-Seq | AE-Seq | BS- Arrays | Ref | 
|---|---|---|---|---|---|---|---|---|
| DMC, DMR (predefined) | Fisher’s Exact test, logistic regression | Binomial (dispersion) | MethylKit | yes | [88] | |||
| DMC, DMR (predefined) | Likelihood ratio | Beta-binomial | MethylSig | yes | [90] | |||
| DMC, DMR (defines) | Wald test, linear regression | Beta-binomial (dispersion) | DSS | yes | [91] | |||
| DMC, DMR (defines) | local linear regression, smoothing, t-test similar | Binomial | BSseq (BSmooth), | yes | [74] | |||
| DMC, DMR (predefind) | Linear regression, t-test | Linear | RnBeads | yes | yes | yes | [92] | |
| DMC, DMR (predefind) | glm, likelihood ratio | Negative-binomial (dispersion) | EdgeR | yes | yes | [93] | ||
| DMC, DMR (predefind) | glm, Wald test | Negative-binomial (dispersion) | DEseq2 (Diffbind) | yes | yes | [54] | ||
| DMC | non-parametric test, beta-regression | Gauss | limma | yes | [60] | |||
| DMC, DMR (defines) | local linear models, smoothing | Gauss | minfi (bump hunter, DMPfinder) | yes | [59] | |||
| DMC, DMR (defines) | local linear models, smoothing | DMRcate | yes | [94] | ||||
| DMC, DMR (defines) | Linear models, combining subregions | Gauss | dmrff | yes | [95] | 
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. | 
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Merkel, A.; Esteller, M. Experimental and Bioinformatic Approaches to Studying DNA Methylation in Cancer. Cancers 2022, 14, 349. https://doi.org/10.3390/cancers14020349
Merkel A, Esteller M. Experimental and Bioinformatic Approaches to Studying DNA Methylation in Cancer. Cancers. 2022; 14(2):349. https://doi.org/10.3390/cancers14020349
Chicago/Turabian StyleMerkel, Angelika, and Manel Esteller. 2022. "Experimental and Bioinformatic Approaches to Studying DNA Methylation in Cancer" Cancers 14, no. 2: 349. https://doi.org/10.3390/cancers14020349
APA StyleMerkel, A., & Esteller, M. (2022). Experimental and Bioinformatic Approaches to Studying DNA Methylation in Cancer. Cancers, 14(2), 349. https://doi.org/10.3390/cancers14020349
 
         
                                                


 
       