Next Article in Journal
Generation of Synthetic Dataset for Part Segmentation Problems
Previous Article in Journal
Looking Ahead When It Is Safe: An Uncertainty-Aware Paradigm for Blood Glucose Prediction with Dynamic Horizon Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Ensemble Variability as a Signal of Confounding in Medical Imaging Models

1
Research Institute of Computer Vision and Robotics, University of Girona, 17003 Girona, Spain
2
McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
3
Center for Precision Health, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
*
Author to whom correspondence should be addressed.
Mach. Learn. Knowl. Extr. 2026, 8(6), 146; https://doi.org/10.3390/make8060146
Submission received: 13 April 2026 / Revised: 20 May 2026 / Accepted: 25 May 2026 / Published: 27 May 2026
(This article belongs to the Section Data)

Abstract

Machine learning models for medical image analysis are vulnerable to hidden confounders, which can compromise generalization and clinical reliability. Existing detection strategies typically require explicit knowledge or labels of the confounder, which are often unavailable. In this work, we propose an ensemble-based framework to detect potential confounder-driven learning without explicitly defining the confounders, but only which samples might be affected. Our approach leverages the variability of model performance across ensembles to identify signatures of shortcut learning. Shortcut learning occurs when a model uses non-robust features or correlations rather than learning the true underlying task, and it is often observed when confounders are present. We generate controlled dataset variants with increasing confounding levels and analyze distributions of AUC (area under the ROC curve) scores across training, validation, and test splits, revealing converging performance and reduced variance as confounding intensifies. We validate our method on two clinically relevant tasks, diabetic retinopathy detection from retinal fundus images and tumor detection from brain MRI slices. Then, we further demonstrate its practical utility on another dataset and image modality with a stroke reperfusion prediction task with suspected hidden confounders. This work provides a practical, data-driven diagnostic tool to flag potential confounding and support the reliability assessment of machine learning models in medical imaging.
Keywords: confounding; bias; medical imaging confounding; bias; medical imaging
Graphical Abstract

Share and Cite

MDPI and ACS Style

Lal-Trehan Estrada, U.M.; Sheth, S.A.; Oliver, A.; Lladó, X.; Giancardo, L. Ensemble Variability as a Signal of Confounding in Medical Imaging Models. Mach. Learn. Knowl. Extr. 2026, 8, 146. https://doi.org/10.3390/make8060146

AMA Style

Lal-Trehan Estrada UM, Sheth SA, Oliver A, Lladó X, Giancardo L. Ensemble Variability as a Signal of Confounding in Medical Imaging Models. Machine Learning and Knowledge Extraction. 2026; 8(6):146. https://doi.org/10.3390/make8060146

Chicago/Turabian Style

Lal-Trehan Estrada, Uma M., Sunil A. Sheth, Arnau Oliver, Xavier Lladó, and Luca Giancardo. 2026. "Ensemble Variability as a Signal of Confounding in Medical Imaging Models" Machine Learning and Knowledge Extraction 8, no. 6: 146. https://doi.org/10.3390/make8060146

APA Style

Lal-Trehan Estrada, U. M., Sheth, S. A., Oliver, A., Lladó, X., & Giancardo, L. (2026). Ensemble Variability as a Signal of Confounding in Medical Imaging Models. Machine Learning and Knowledge Extraction, 8(6), 146. https://doi.org/10.3390/make8060146

Article Metrics

Back to TopTop