Previous Article in Journal
Correction: Rodríguez-Bautista et al. Immune Milieu and Genomic Alterations Set the Triple-Negative Breast Cancer Immunomodulatory Subtype Tumor Behavior. Cancers 2021, 13, 6256
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories

by
Olga Tsave
*,
Alexandra Kosvyra
,
Dimitrios T. Filos
,
Dimitris Th. Fotopoulos
and
Ioanna Chouvarda
Laboratory of Computing, Medical Informatics and Biomedical Imaging Technologies, School of Medicine, Aristotle University of Thessaloniki, 541 24 Thessaloniki, Greece
*
Author to whom correspondence should be addressed.
Cancers 2025, 17(19), 3213; https://doi.org/10.3390/cancers17193213
Submission received: 6 July 2025 / Revised: 25 August 2025 / Accepted: 26 September 2025 / Published: 1 October 2025
(This article belongs to the Section Methods and Technologies Development)

Simple Summary

In cancer imaging research, data collection, integration, and utilization to generate multicentric data repositories pose a series of significant challenges such as data harmonization, data quality, utility, and overall suitability for reuse. These challenges directly affect the reliability and robustness of research outcomes, making systematic approaches essential. This work presents the INCISIVE project approach for assessing the quality of cancer imaging and clinical (meta)data in a structured and transparent way. The proposed methodology serves as a guiding map to ensure the creation and maintenance of a high-quality data repository, which is a crucial factor for generalizable and trustworthy AI-services development and their safe adoption in healthcare practice.

Abstract

Background/Objectives: Cancer remains a leading global cause of death, with breast, lung, colorectal, and prostate cancers being among the most prevalent. The integration of Artificial Intelligence (AI) into cancer imaging research offers opportunities for earlier diagnosis and personalized treatment. However, the effectiveness of AI models depends critically on the quality, standardization, and fairness of the input data. The EU-funded INCISIVE project aimed to create a federated, pan-European repository of imaging and clinical data for cancer cases, with a key objective to develop a robust framework for pre-validating data prior to its use in AI development. Methods: We propose a data validation framework to assess clinical (meta)data and imaging data across five dimensions: completeness, validity, consistency, integrity, and fairness. The framework includes procedures for deduplication, annotation verification, DICOM metadata analysis, and anonymization compliance. Results: The pre-validation process identified key data quality issues, such as missing clinical information, inconsistent formatting, and subgroup imbalances, while also demonstrating the added value of structured data entry and standardized protocols. Conclusions: This structured framework addresses common challenges in curating large-scale, multimodal medical data. By applying this approach, the INCISIVE project ensures data quality, interoperability, and equity, providing a transferable model for future health data repositories supporting AI research in oncology.
Keywords: data validation; cancer imaging; clinical metadata; harmonization; imaging data repository; multi-site derived data; data quality data validation; cancer imaging; clinical metadata; harmonization; imaging data repository; multi-site derived data; data quality

Share and Cite

MDPI and ACS Style

Tsave, O.; Kosvyra, A.; Filos, D.T.; Fotopoulos, D.T.; Chouvarda, I. A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories. Cancers 2025, 17, 3213. https://doi.org/10.3390/cancers17193213

AMA Style

Tsave O, Kosvyra A, Filos DT, Fotopoulos DT, Chouvarda I. A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories. Cancers. 2025; 17(19):3213. https://doi.org/10.3390/cancers17193213

Chicago/Turabian Style

Tsave, Olga, Alexandra Kosvyra, Dimitrios T. Filos, Dimitris Th. Fotopoulos, and Ioanna Chouvarda. 2025. "A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories" Cancers 17, no. 19: 3213. https://doi.org/10.3390/cancers17193213

APA Style

Tsave, O., Kosvyra, A., Filos, D. T., Fotopoulos, D. T., & Chouvarda, I. (2025). A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories. Cancers, 17(19), 3213. https://doi.org/10.3390/cancers17193213

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop