Previous Article in Journal
Chemotherapy (Etoposide)-Induced Intermingling of Heterochromatin and Euchromatin Compartments in Senescent PA-1 Embryonal Carcinoma Cells
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Review

Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows

1
Ancestry and Health Genomics Laboratory, Charles Perkins Centre, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
2
Sydney Informatics Hub, The University of Sydney, Camperdown, NSW 2050, Australia
3
Manchester Cancer Research Centre, The University of Manchester, Manchester M20 4GJ, UK
4
School of Health Systems and Public Health, University of Pretoria, Pretoria 0002, South Africa
5
Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
6
Computational Genomics Group, Charles Perkins Centre, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
*
Authors to whom correspondence should be addressed.
Cancers 2025, 17(15), 2481; https://doi.org/10.3390/cancers17152481 (registering DOI)
Submission received: 28 June 2025 / Revised: 21 July 2025 / Accepted: 23 July 2025 / Published: 26 July 2025

Simple Summary

Africa faces the highest mortality rates across eight cancer types. However, cancer studies are biased toward European populations, leading to major concerns that cancer treatments may be ineffective for African patients. Providing a systematic review of African-inclusive whole cancer genome studies, African-derived tumours reveal distinct clinically relevant drivers, molecular taxonomies, and overall increased genomic instability, highlighting challenges associated with non-African-derived computational workflows. We provide a rationale for parallelism strategies to accelerate the processing steps of those distinctly intensive data, allowing for required scalability. Advocating for further resources that capture the rich African ancestral diversity, a concerted global effort will be required to improve and ultimately standardise bioinformatic workflows, thereby enhancing health outcomes for African cancer patients.

Abstract

Sub-Saharan Africa is experiencing the highest mortality rates for several cancer types. While cancer research globally has entered the genomic era and advanced the deployment of precision oncology, Africa has largely been excluded and has received few benefits from tumour profiling. Through a thorough literature review, we identified only five whole cancer genome databases that include patients from Sub-Saharan Africa, covering four cancer types (breast, esophageal, prostate, and Burkitt lymphoma). Irrespective of cancer type, these studies report higher tumour genome instability, including African-specific cancer drivers and mutational signatures, suggesting unique contributory mechanisms at play. Reviewing bioinformatic tools applied to African databases, we carefully select a workflow suitable for large-scale African resources, which incorporates cohort-level data and a scalable design for time and computational efficiency. Using African genomic data, we demonstrate the scalability achieved by high-level parallelism through physical data or genomic interval chunking strategies. Furthermore, we provide a rationale for improving current workflows for African data, including the adoption of more genomic techniques and the prioritisation of African-derived datasets for diverse applications. Together, these enhancements and genomic scaling strategies serve as practical computational guidance, lowering technical barriers for future large-scale African-inclusive research and ultimately helping to reduce the disparity gap in cancer mortality rates across Sub-Saharan Africa.
Keywords: Africa; computational workflow; parallelism; cancer genomics; whole-genome sequencing Africa; computational workflow; parallelism; cancer genomics; whole-genome sequencing

Share and Cite

MDPI and ACS Style

Jiang, J.; Samaha, G.; Willet, C.E.; Chew, T.; Hayes, V.M.; Jaratlerdsiri, W. Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows. Cancers 2025, 17, 2481. https://doi.org/10.3390/cancers17152481

AMA Style

Jiang J, Samaha G, Willet CE, Chew T, Hayes VM, Jaratlerdsiri W. Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows. Cancers. 2025; 17(15):2481. https://doi.org/10.3390/cancers17152481

Chicago/Turabian Style

Jiang, Jue, Georgina Samaha, Cali E. Willet, Tracy Chew, Vanessa M. Hayes, and Weerachai Jaratlerdsiri. 2025. "Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows" Cancers 17, no. 15: 2481. https://doi.org/10.3390/cancers17152481

APA Style

Jiang, J., Samaha, G., Willet, C. E., Chew, T., Hayes, V. M., & Jaratlerdsiri, W. (2025). Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows. Cancers, 17(15), 2481. https://doi.org/10.3390/cancers17152481

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop