This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories
by
Olga Tsave
Olga Tsave
Dr. Olga Tsave works at the Laboratory of Computing, Medical Informatics and Biomedical Imaging of a [...]
Dr. Olga Tsave works at the Laboratory of Computing, Medical Informatics and Biomedical Imaging Technologies, School of Medicine, Aristotle University of Thessaloniki. She received B.S. and M.S. degrees from the Department of Biology, Faculty of Sciences, Aristotle University of Thessaloniki (AUTH), in 2012 and 2015, respectively, and a Ph.D. degree from the Department of Chemical Engineering, Faculty of Engineering, AUTH, in 2016. Her research interests involve bioinorganic materials with physiological and biomimetic substrates as potential pharmaceuticals of insulin mimetic activity, molecular and cellular studies on adipose tissue exposed to novel well-characterized hybrid inorganic–organic substrates, systems biology and adverse outcomes in metabolic pathologies, and bioinformatics targeting biotechnological advancements with theragnostic applications.
*,
Alexandra Kosvyra
Alexandra Kosvyra
,
Dimitrios T. Filos
Dimitrios T. Filos
Dr. Dimitrios T Filos is a researcher in the Laboratory of Computing, Medical Informatics and School [...]
Dr. Dimitrios T Filos is a researcher in the Laboratory of Computing, Medical Informatics and Biomedical Imaging Technologies, School of Medicine, Aristotle University of Thessaloniki, Greece. He has been involved in the fields of Digital Health and Medical Informatics for more than 10 years. His main scientific areas of interest include biomedical signal and image processing, physiological systems, behavioral modeling, machine learning, data analytics, and decision-making.
,
Dimitris Th. Fotopoulos
Dimitris Th. Fotopoulos
and
Ioanna Chouvarda
Ioanna Chouvarda
Laboratory of Computing, Medical Informatics and Biomedical Imaging Technologies, School of Medicine, Aristotle University of Thessaloniki, 541 24 Thessaloniki, Greece
*
Author to whom correspondence should be addressed.
Submission received: 6 July 2025
/
Revised: 25 August 2025
/
Accepted: 26 September 2025
/
Published: 1 October 2025
Simple Summary
In cancer imaging research, data collection, integration, and utilization to generate multicentric data repositories pose a series of significant challenges such as data harmonization, data quality, utility, and overall suitability for reuse. These challenges directly affect the reliability and robustness of research outcomes, making systematic approaches essential. This work presents the INCISIVE project approach for assessing the quality of cancer imaging and clinical (meta)data in a structured and transparent way. The proposed methodology serves as a guiding map to ensure the creation and maintenance of a high-quality data repository, which is a crucial factor for generalizable and trustworthy AI-services development and their safe adoption in healthcare practice.
Abstract
Background/Objectives: Cancer remains a leading global cause of death, with breast, lung, colorectal, and prostate cancers being among the most prevalent. The integration of Artificial Intelligence (AI) into cancer imaging research offers opportunities for earlier diagnosis and personalized treatment. However, the effectiveness of AI models depends critically on the quality, standardization, and fairness of the input data. The EU-funded INCISIVE project aimed to create a federated, pan-European repository of imaging and clinical data for cancer cases, with a key objective to develop a robust framework for pre-validating data prior to its use in AI development. Methods: We propose a data validation framework to assess clinical (meta)data and imaging data across five dimensions: completeness, validity, consistency, integrity, and fairness. The framework includes procedures for deduplication, annotation verification, DICOM metadata analysis, and anonymization compliance. Results: The pre-validation process identified key data quality issues, such as missing clinical information, inconsistent formatting, and subgroup imbalances, while also demonstrating the added value of structured data entry and standardized protocols. Conclusions: This structured framework addresses common challenges in curating large-scale, multimodal medical data. By applying this approach, the INCISIVE project ensures data quality, interoperability, and equity, providing a transferable model for future health data repositories supporting AI research in oncology.
Share and Cite
MDPI and ACS Style
Tsave, O.; Kosvyra, A.; Filos, D.T.; Fotopoulos, D.T.; Chouvarda, I.
A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories. Cancers 2025, 17, 3213.
https://doi.org/10.3390/cancers17193213
AMA Style
Tsave O, Kosvyra A, Filos DT, Fotopoulos DT, Chouvarda I.
A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories. Cancers. 2025; 17(19):3213.
https://doi.org/10.3390/cancers17193213
Chicago/Turabian Style
Tsave, Olga, Alexandra Kosvyra, Dimitrios T. Filos, Dimitris Th. Fotopoulos, and Ioanna Chouvarda.
2025. "A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories" Cancers 17, no. 19: 3213.
https://doi.org/10.3390/cancers17193213
APA Style
Tsave, O., Kosvyra, A., Filos, D. T., Fotopoulos, D. T., & Chouvarda, I.
(2025). A Multi-Dimensional Framework for Data Quality Assurance in Cancer Imaging Repositories. Cancers, 17(19), 3213.
https://doi.org/10.3390/cancers17193213
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.