This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessReview
Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows
by
Jue Jiang
Jue Jiang 1,
Georgina Samaha
Georgina Samaha 2,
Cali E. Willet
Cali E. Willet 2
,
Tracy Chew
Tracy Chew 2
,
Vanessa M. Hayes
Vanessa M. Hayes 1,3,4,5,*
and
Weerachai Jaratlerdsiri
Weerachai Jaratlerdsiri 1,6,*
1
Ancestry and Health Genomics Laboratory, Charles Perkins Centre, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
2
Sydney Informatics Hub, The University of Sydney, Camperdown, NSW 2050, Australia
3
Manchester Cancer Research Centre, The University of Manchester, Manchester M20 4GJ, UK
4
School of Health Systems and Public Health, University of Pretoria, Pretoria 0002, South Africa
5
Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
6
Computational Genomics Group, Charles Perkins Centre, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
*
Authors to whom correspondence should be addressed.
Cancers 2025, 17(15), 2481; https://doi.org/10.3390/cancers17152481 (registering DOI)
Submission received: 28 June 2025
/
Revised: 21 July 2025
/
Accepted: 23 July 2025
/
Published: 26 July 2025
Simple Summary
Africa faces the highest mortality rates across eight cancer types. However, cancer studies are biased toward European populations, leading to major concerns that cancer treatments may be ineffective for African patients. Providing a systematic review of African-inclusive whole cancer genome studies, African-derived tumours reveal distinct clinically relevant drivers, molecular taxonomies, and overall increased genomic instability, highlighting challenges associated with non-African-derived computational workflows. We provide a rationale for parallelism strategies to accelerate the processing steps of those distinctly intensive data, allowing for required scalability. Advocating for further resources that capture the rich African ancestral diversity, a concerted global effort will be required to improve and ultimately standardise bioinformatic workflows, thereby enhancing health outcomes for African cancer patients.
Abstract
Sub-Saharan Africa is experiencing the highest mortality rates for several cancer types. While cancer research globally has entered the genomic era and advanced the deployment of precision oncology, Africa has largely been excluded and has received few benefits from tumour profiling. Through a thorough literature review, we identified only five whole cancer genome databases that include patients from Sub-Saharan Africa, covering four cancer types (breast, esophageal, prostate, and Burkitt lymphoma). Irrespective of cancer type, these studies report higher tumour genome instability, including African-specific cancer drivers and mutational signatures, suggesting unique contributory mechanisms at play. Reviewing bioinformatic tools applied to African databases, we carefully select a workflow suitable for large-scale African resources, which incorporates cohort-level data and a scalable design for time and computational efficiency. Using African genomic data, we demonstrate the scalability achieved by high-level parallelism through physical data or genomic interval chunking strategies. Furthermore, we provide a rationale for improving current workflows for African data, including the adoption of more genomic techniques and the prioritisation of African-derived datasets for diverse applications. Together, these enhancements and genomic scaling strategies serve as practical computational guidance, lowering technical barriers for future large-scale African-inclusive research and ultimately helping to reduce the disparity gap in cancer mortality rates across Sub-Saharan Africa.
Share and Cite
MDPI and ACS Style
Jiang, J.; Samaha, G.; Willet, C.E.; Chew, T.; Hayes, V.M.; Jaratlerdsiri, W.
Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows. Cancers 2025, 17, 2481.
https://doi.org/10.3390/cancers17152481
AMA Style
Jiang J, Samaha G, Willet CE, Chew T, Hayes VM, Jaratlerdsiri W.
Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows. Cancers. 2025; 17(15):2481.
https://doi.org/10.3390/cancers17152481
Chicago/Turabian Style
Jiang, Jue, Georgina Samaha, Cali E. Willet, Tracy Chew, Vanessa M. Hayes, and Weerachai Jaratlerdsiri.
2025. "Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows" Cancers 17, no. 15: 2481.
https://doi.org/10.3390/cancers17152481
APA Style
Jiang, J., Samaha, G., Willet, C. E., Chew, T., Hayes, V. M., & Jaratlerdsiri, W.
(2025). Scaling for African Inclusion in High-Throughput Whole Cancer Genome Bioinformatic Workflows. Cancers, 17(15), 2481.
https://doi.org/10.3390/cancers17152481
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.