Next Article in Journal
Microsatellite Instability Status and Mismatch Repair Defects Testing in Endometrial Cancer—Insights from the Multicenter E-PEC Trial
Previous Article in Journal
Quantifying Explainability in OCT Segmentation of Macular Holes and Cysts: A SHAP-Based Coverage and Factor Contribution Analysis
Previous Article in Special Issue
Genetic Artificial Intelligence in Gastrointestinal Disease: A Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

IDF-Net: Interpretable Dynamic Fusion Network for Colorectal Cancer Diagnosis Using Cross-Modal Imaging

Department of Biomedical Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
*
Author to whom correspondence should be addressed.
Diagnostics 2026, 16(1), 99; https://doi.org/10.3390/diagnostics16010099 (registering DOI)
Submission received: 3 December 2025 / Revised: 17 December 2025 / Accepted: 25 December 2025 / Published: 27 December 2025

Abstract

Background/Objectives: Colorectal cancer (CRC) is a leading cause of cancer deaths worldwide, underscoring the need for diagnostic tools that early, accurate, and clinically interpretable. Current artificial intelligence (AI) models are predominantly unimodal and lack sufficient interpretability, which restricts their clinical adoption. Methods: We propose IDF-Net, an interpretable dynamic fusion framework that integrates endoscopy, computed tomography (CT), and histopathology using modality-specific encoders, a dual-stage adaptive gating mechanism, and cross-modal attention. We conducted stratified 5-fold cross-validation and assessed interpretability using spatial heatmaps and modality attribution. We also quantified the results using the intersection-over-union metric for saliency alignment. Results: IDF-Net achieved a state-of-the-art accuracy of 0.920 (0.907–0.936) and area under the curve (AUC) of 0.991 (95% CI: 0.965–0.997), significantly outperforming unimodal and static-fusion baselines (p < 0.05). Interpretability analysis of IDF-Net demonstrated a strong alignment between Gradient-weighted Class Activation Mapping++ heatmaps and expert-annotated lesions, as well as case-specific modality contributions via SHapley Additive exPlanations values. Ablation studies confirmed the contribution of each component, with dynamic routing and cross-attention fusion improving AUC by 0.038 and 0.046, respectively. Conclusions: IDF-Net introduces a dynamically fused, multimodal diagnostic framework with integrated quantitative interpretability, demonstrating superior accuracy and strong potential for clinical translation in CRC diagnosis. The model’s adaptive design allows it to function robustly even when CT data is unavailable, aligning with common clinical pathways while leveraging additional imaging when present for comprehensive staging.
Keywords: colorectal cancer diagnosis; gastrointestinal disease; dynamic fusion network; deep learning; machine learning; cross-modal imaging; interpretable AI colorectal cancer diagnosis; gastrointestinal disease; dynamic fusion network; deep learning; machine learning; cross-modal imaging; interpretable AI

Share and Cite

MDPI and ACS Style

Hayeso, H.H.; Shi, P.; Lian, J.; Lonseko, Z.M.; Rao, N. IDF-Net: Interpretable Dynamic Fusion Network for Colorectal Cancer Diagnosis Using Cross-Modal Imaging. Diagnostics 2026, 16, 99. https://doi.org/10.3390/diagnostics16010099

AMA Style

Hayeso HH, Shi P, Lian J, Lonseko ZM, Rao N. IDF-Net: Interpretable Dynamic Fusion Network for Colorectal Cancer Diagnosis Using Cross-Modal Imaging. Diagnostics. 2026; 16(1):99. https://doi.org/10.3390/diagnostics16010099

Chicago/Turabian Style

Hayeso, Helen Haile, Peifeng Shi, Jingwen Lian, Zenebe Markos Lonseko, and Nini Rao. 2026. "IDF-Net: Interpretable Dynamic Fusion Network for Colorectal Cancer Diagnosis Using Cross-Modal Imaging" Diagnostics 16, no. 1: 99. https://doi.org/10.3390/diagnostics16010099

APA Style

Hayeso, H. H., Shi, P., Lian, J., Lonseko, Z. M., & Rao, N. (2026). IDF-Net: Interpretable Dynamic Fusion Network for Colorectal Cancer Diagnosis Using Cross-Modal Imaging. Diagnostics, 16(1), 99. https://doi.org/10.3390/diagnostics16010099

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop