Urine-Based cfDNA Ensemble Modeling for Early Detection of Bladder Cancer Using Whole-Genome Methylation Sequencing
Simple Summary
Abstract
1. Introduction
2. Materials and Methods
2.1. Subjects
2.2. EM-Seq Library Preparation
2.3. Sequencing Data Processing
2.4. Methylation Call and Marker Selection
2.5. Assessment of Tissue–Urine Concordance in Methylation Markers
2.6. CNV Call
2.7. Model Training for Methylation and CNV
2.8. Ensemble Model Construction
2.9. Annotation and Gene Ontology Enrichment Analysis
2.10. Statistical Analysis
3. Results
3.1. Design and Rationale for Urine-Based Cancer Detection Model
3.2. Tissue-Derived Signals Present in Urine CNV
3.3. Tissue-Derived Signals Present in Urine Methylation
3.4. Evaluation of Ensemble Model Performance
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Jiang, L.; Guo, S.; Zhou, Z.; Li, Z.; Zhou, F.; Yu, C.; Li, M.; Huang, W.; Liu, Z.; Tian, X. Snai2-mediated Upregulation of NADSYN1 Promotes Bladder Cancer Progression by Interacting with PHB. Clin. Transl. Med. 2024, 14, e1555. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Y.; Wang, R.; Zeng, M.; Liu, S. Circulating Tumor DNA: A Revolutionary Approach for Early Detection and Personalized Treatment of Bladder Cancer. Front. Pharmacol. 2025, 16, 1551219. [Google Scholar] [CrossRef] [PubMed]
- Beijert, I.J.; van den Burgt, Y.; Hentschel, A.E.; Bosschieter, J.; Kauer, P.C.; Lissenberg-Witte, B.I.; van Moorselaar, R.J.A.; Nieuwenhuijzen, J.A.; Steenbergen, R.D.M. Bladder Cancer Detection by Urinary Methylation Markers GHSR/MAL: A Validation Study. World J. Urol. 2024, 42, 578. [Google Scholar] [CrossRef] [PubMed]
- Christensen, E.; Nordentoft, I.; Birkenkamp-Demtröder, K.; Elbæk, S.K.; Lindskrog, S.V.; Taber, A.; Andreasen, T.G.; Strandgaard, T.; Knudsen, M.; Lamy, P.; et al. Cell-Free Urine and Plasma DNA Mutational Analysis Predicts Neoadjuvant Chemotherapy Response and Outcome in Patients with Muscle-Invasive Bladder Cancer. Clin. Cancer Res. 2023, 29, 1582–1591. [Google Scholar] [CrossRef]
- Dudley, J.C.; Schroers-Martin, J.; Lazzareschi, D.V.; Shi, W.Y.; Chen, S.B.; Esfahani, M.S.; Trivedi, D.; Chabon, J.J.; Chaudhuri, A.A.; Stehr, H.; et al. Detection and Surveillance of Bladder Cancer Using Urine Tumor DNA. Cancer Discov. 2019, 9, 500–509. [Google Scholar] [CrossRef]
- Cai, Y.; Yang, X.; Lin, S.; Xu, Y.; Zhu, S.; Fan, D.; Zhao, M.; Zhang, Y.; Yang, X.; Li, X. Low-Coverage Sequencing of Urine Sediment DNA for Detection of Copy Number Aberrations in Bladder Cancer. Cancer Manag. Res. 2021, 13, 1943–1953. [Google Scholar] [CrossRef]
- Meng, X.-Y.; Zhou, X.-H.; Li, S.; Shi, M.-J.; Li, X.-H.; Yang, B.-Y.; Liu, M.; Yi, K.-Z.; Wang, Y.-Z.; Zhang, H.-Y.; et al. Machine Learning-Based Detection of Bladder Cancer by Urine cfDNA Fragmentation Hotspots That Capture Cancer-Associated Molecular Features. Clin. Chem. 2024, 70, 1463–1473. [Google Scholar] [CrossRef]
- Cheng, T.H.T.; Jiang, P.; Teoh, J.Y.C.; Heung, M.M.S.; Tam, J.C.W.; Sun, X.; Lee, W.-S.; Ni, M.; Chan, R.C.K.; Ng, C.-F.; et al. Noninvasive Detection of Bladder Cancer by Shallow-Depth Genome-Wide Bisulfite Sequencing of Urinary Cell-Free DNA for Methylation and Copy Number Profiling. Clin. Chem. 2019, 65, 927–936. [Google Scholar] [CrossRef] [PubMed]
- Wang, P.; Shi, Y.; Zhang, J.; Shou, J.; Zhang, M.; Zou, D.; Liang, Y.; Li, J.; Tan, Y.; Zhang, M.; et al. UCseek: Ultrasensitive Early Detection and Recurrence Monitoring of Urothelial Carcinoma by Shallow-Depth Genome-Wide Bisulfite Sequencing of Urinary Sediment DNA. eBioMedicine 2023, 89, 104437. [Google Scholar] [CrossRef]
- Alberca-Del Arco, F.; Prieto-Cuadra, D.; Santos-Perez de la Blanca, R.; Sáez-Barranquero, F.; Matas-Rico, E.; Herrera-Imbroda, B. New Perspectives on the Role of Liquid Biopsy in Bladder Cancer: Applicability to Precision Medicine. Cancers 2024, 16, 803. [Google Scholar] [CrossRef]
- Lu, T.; Li, J. Clinical Applications of Urinary Cell-Free DNA in Cancer: Current Insights and Promising Future. Am. J. Cancer Res. 2017, 7, 2318–2332. [Google Scholar]
- Mattox, A.K.; Douville, C.; Wang, Y.; Popoli, M.; Ptak, J.; Silliman, N.; Dobbyn, L.; Schaefer, J.; Lu, S.; Pearlman, A.H.; et al. The Origin of Highly Elevated Cell-Free DNA in Healthy Individuals and Patients with Pancreatic, Colorectal, Lung, or Ovarian Cancer. Cancer Discov. 2023, 13, 2166–2179. [Google Scholar] [CrossRef]
- Lee, D.; Lee, W.; Kim, H.-P.; Kim, M.; Ahn, H.K.; Bang, D.; Kim, K.H. Accurate Detection of Urothelial Bladder Cancer Using Targeted Deep Sequencing of Urine DNA. Cancers 2023, 15, 2868. [Google Scholar] [CrossRef] [PubMed]
- Erger, F.; Nörling, D.; Borchert, D.; Leenen, E.; Habbig, S.; Wiesener, M.S.; Bartram, M.P.; Wenzel, A.; Becker, C.; Toliat, M.R.; et al. cfNOMe—A Single Assay for Comprehensive Epigenetic Analyses of Cell-Free DNA. Genome Med. 2020, 12, 54. [Google Scholar] [CrossRef]
- Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An Ultra-Fast All-in-One FASTQ Preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
- Cheng, H.; Xu, Y. BitMapperBS: A Fast and Accurate Read Aligner for Whole-Genome Bisulfite Sequencing. bioRxiv 2018. [Google Scholar] [CrossRef]
- Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map Format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed]
- McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce Framework for Analyzing next-Generation DNA Sequencing Data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed]
- Jun, G.; Wing, M.K.; Abecasis, G.R.; Kang, H.M. An Efficient and Scalable Analysis Framework for Variant Extraction and Refinement from Population-Scale DNA Sequence Data. Genome Res. 2015, 25, 918–925. [Google Scholar] [CrossRef]
- Ryan, D. MethylDackel. Available online: https://github.com/dpryan79/MethylDackel (accessed on 1 October 2025).
- Amemiya, H.M.; Kundaje, A.; Boyle, A.P. The ENCODE Blacklist: Identification of Problematic Regions of the Genome. Sci. Rep. 2019, 9, 9354. [Google Scholar] [CrossRef]
- Adalsteinsson, V.A.; Ha, G.; Freeman, S.S.; Choudhury, A.D.; Stover, D.G.; Parsons, H.A.; Gydush, G.; Reed, S.C.; Rotem, D.; Rhoades, J.; et al. Scalable Whole-Exome Sequencing of Cell-Free DNA Reveals High Concordance with Metastatic Tumors. Nat. Commun. 2017, 8, 1324. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
- Cavalcante, R.G.; Sartor, M.A. Annotatr: Genomic Regions in Context. Bioinformatics 2017, 33, 2381–2383. [Google Scholar] [CrossRef] [PubMed]
- Lawrence, M.; Huber, W.; Pagès, H.; Aboyoun, P.; Carlson, M.; Gentleman, R.; Morgan, M.T.; Carey, V.J. Software for Computing and Annotating Genomic Ranges. PLoS Comput. Biol. 2013, 9, e1003118. [Google Scholar] [CrossRef] [PubMed]
- Kuleshov, M.V.; Jones, M.R.; Rouillard, A.D.; Fernandez, N.F.; Duan, Q.; Wang, Z.; Koplev, S.; Jenkins, S.L.; Jagodnik, K.M.; Lachmann, A.; et al. Enrichr: A Comprehensive Gene Set Enrichment Analysis Web Server 2016 Update. Nucleic Acids Res. 2016, 44, W90–W97. [Google Scholar] [CrossRef]
- Abbosh, P.H.; Plimack, E.R. Molecular and Clinical Insights into the Role and Significance of Mutated DNA Repair Genes in Bladder Cancer. Bladder Cancer 2018, 4, 9–18. [Google Scholar] [CrossRef] [PubMed]
- Mun, J.-Y.; Baek, S.-W.; Park, W.Y.; Kim, W.-T.; Kim, S.-K.; Roh, Y.-G.; Jeong, M.-S.; Yang, G.-E.; Lee, J.-H.; Chung, J.W.; et al. E2F1 Promotes Progression of Bladder Cancer by Modulating RAD54L Involved in Homologous Recombination Repair. Int. J. Mol. Sci. 2020, 21, 9025. [Google Scholar] [CrossRef]
- Freedman, N.D.; Silverman, D.T.; Hollenbeck, A.R.; Schatzkin, A.; Abnet, C.C. Association between Smoking and Risk of Bladder Cancer among Men and Women. JAMA 2011, 306, 737–745. [Google Scholar] [CrossRef]
- Kantor, A.F.; Hartge, P.; Hoover, R.N.; Narayana, A.S.; Sullivan, J.W.; Fraumeni, J.F. Urinary Tract Infection and Risk of Bladder Cancer. Am. J. Epidemiol. 1984, 119, 510–515. [Google Scholar] [CrossRef]
- Tang, H.; Shi, W.; Fu, S.; Wang, T.; Zhai, S.; Song, Y.; Han, J. Pioglitazone and Bladder Cancer Risk: A Systematic Review and Meta-analysis. Cancer Med. 2018, 7, 1070–1080. [Google Scholar] [CrossRef]
- Tong, T.; Li, Z. Predicting Learning Achievement Using Ensemble Learning with Result Explanation. PLoS ONE 2025, 20, e0312124. [Google Scholar] [CrossRef]
- Khan, A.A.; Chaudhari, O.; Chandra, R. A Review of Ensemble Learning and Data Augmentation Models for Class Imbalanced Problems: Combination, Implementation and Evaluation. Expert Syst. Appl. 2024, 244, 122778. [Google Scholar] [CrossRef]
- Lee, S.H.; Hu, W.; Matulay, J.T.; Silva, M.V.; Owczarek, T.B.; Kim, K.; Chua, C.W.; Barlow, L.J.; Kandoth, C.; Williams, A.B.; et al. Tumor Evolution and Drug Response in Patient-Derived Organoid Models of Bladder Cancer. Cell 2018, 173, 515–528.e17. [Google Scholar] [CrossRef] [PubMed]
- Zhao, H.; Lin, N.; Ho, V.W.S.; Liu, K.; Chen, X.; Wu, H.; Chiu, P.K.-F.; Huang, L.; Dantes, Z.; Wong, K.-L.; et al. Patient-Derived Bladder Cancer Organoids as a Valuable Tool for Understanding Tumor Biology and Developing Personalized Treatment. Adv. Sci. 2025, 12, e2414558. [Google Scholar] [CrossRef] [PubMed]



Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Kim, T.; Shin, D.; Ahn, H.K.; Moon, Y.J.; Bang, D.; Kim, K.H. Urine-Based cfDNA Ensemble Modeling for Early Detection of Bladder Cancer Using Whole-Genome Methylation Sequencing. Cancers 2026, 18, 767. https://doi.org/10.3390/cancers18050767
Kim T, Shin D, Ahn HK, Moon YJ, Bang D, Kim KH. Urine-Based cfDNA Ensemble Modeling for Early Detection of Bladder Cancer Using Whole-Genome Methylation Sequencing. Cancers. 2026; 18(5):767. https://doi.org/10.3390/cancers18050767
Chicago/Turabian StyleKim, Taehoon, Dongju Shin, Hyun Kyu Ahn, Young Joon Moon, Duhee Bang, and Kwang Hyun Kim. 2026. "Urine-Based cfDNA Ensemble Modeling for Early Detection of Bladder Cancer Using Whole-Genome Methylation Sequencing" Cancers 18, no. 5: 767. https://doi.org/10.3390/cancers18050767
APA StyleKim, T., Shin, D., Ahn, H. K., Moon, Y. J., Bang, D., & Kim, K. H. (2026). Urine-Based cfDNA Ensemble Modeling for Early Detection of Bladder Cancer Using Whole-Genome Methylation Sequencing. Cancers, 18(5), 767. https://doi.org/10.3390/cancers18050767

