Confidence-Guided Adaptive Diffusion Network for Medical Image Classification
Abstract
1. Introduction
- 1.
- We identify a critical limitation in existing diffusion-based medical image classification methods, namely the lack of explicit modeling of hierarchical semantic consistency during the denoising process, and propose two structural modules—Hierarchical Pyramid Context Modeling (HPCM) and Intra-Scale Dilated Convolution Refinement (IDCR)—to stabilize diffusion feature representations.
- 2.
- We propose a Confidence-Guided Adaptive Noise Injection (CG-ANI) strategy that effectively alleviates the mismatch between fixed noise scheduling and discriminative learning objectives, thereby improving training stability in diffusion-based classification.
- 3.
- By integrating structural prior modeling with an optimization stabilization mechanism, we construct a unified diffusion-based medical image classification framework, termed CGAD-Net, which enables stable feature learning across different denoising stages.
- 4.
- Extensive experiments conducted on multiple medical image classification benchmarks suggest that CGAD-Net achieves competitive performance relative to existing state-of-the-art methods in terms of classification accuracy, robustness, and cross-dataset generalization.
2. Methods
| Algorithm 1 End-to-End Unified Training Procedure of CGAD-Net |
Require: Training dataset
|
2.1. Prior Module Design
2.2. Confidence-Guided Adaptive Noise Injection Strategy
2.3. Loss Function
2.3.1. Adaptive Denoising Regression Loss
2.3.2. Multi-View Maximum Mean Discrepancy Regularization
2.3.3. Periodic Auxiliary Supervision
2.3.4. Overall Training Objective
3. Results
3.1. Datasets and Evaluation
3.2. Implementation Details
3.3. Performance Comparison with State-of-the-Art Methods
3.4. Ablation Study
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Kumar, S.; Ivanova, O.; Melyokhin, A.; Tiwari, P. Deep-learning-enabled multimodal data fusion for lung disease classification. Inform. Med. Unlocked 2023, 42, 101367. [Google Scholar] [CrossRef]
- Ahmad, I.; Amin, J.; Lali, M.I.U.; Abbas, F.; Sharif, M.I. A novel Deeplabv3+ and vision-based transformer model for segmentation and classification of skin lesions. Biomed. Signal Process. Control 2024, 92, 106084. [Google Scholar] [CrossRef]
- Dehbozorgi, P.; Ryabchykov, O.; Bocklitz, T. A systematic investigation of image pre-processing on image classification. IEEE Access 2024, 12, 64913–64926. [Google Scholar] [CrossRef]
- de Vente, C.; van Ginneken, B.; Hoyng, C.B.; Klaver, C.C.W.; Sánchez, C.I. Uncertainty-aware multiple-instance learning for reliable classification: Application to optical coherence tomography. Med. Image Anal. 2024, 97, 103259. [Google Scholar] [CrossRef] [PubMed]
- LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105. [Google Scholar] [CrossRef]
- Fu’adah, Y.N.; Wijayanto, I.; Pratiwi, N.K.C.; Taliningsih, F.F.; Rizal, S.; Pramudito, M.A. Automated classification of Alzheimer’s disease based on MRI image processing using CNN with AlexNet architecture. J. Phys. Conf. Ser. 2021, 1844, 012020. [Google Scholar] [CrossRef]
- Maqsood, M.; Nazir, F.; Khan, U.; Aadil, F.; Jamal, H.; Mehmood, I.; Song, O.-Y. Transfer learning assisted classification and detection of Alzheimer’s disease stages using 3D MRI scans. Sensors 2019, 19, 2645. [Google Scholar] [CrossRef]
- Tajbakhsh, N.; Shin, J.Y.; Gurudu, S.R.; Hurst, R.T.; Kendall, C.B.; Gotway, M.B.; Liang, J. Convolutional neural networks for medical image analysis: Full training or fine tuning? IEEE Trans. Med. Imaging 2016, 35, 1299–1312. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 1–11. [Google Scholar]
- Aboutalebi, H.; Pavlova, M.; Gunraj, H.; Shafiee, M.J.; Sabri, A.; Alaref, A.; Wong, A. MEDUSA: Multi-scale encoder–decoder self-attention network. Front. Med. 2022, 8, 821120. [Google Scholar] [CrossRef]
- Zhou, Y.; Sun, L.; Xiong, X.; Ti, G.; Yang, S. GCNet-Mamba: Leveraging state space models and CNN for medical image classification. Expert Syst. Appl. 2025, 303, 130733. [Google Scholar] [CrossRef]
- Liao, H.; Luo, J. A deep multi-task learning approach to skin lesion classification. In Proceedings of the AAAI-17 Joint Workshop on Health Intelligence, San Francisco, CA, USA, 4–9 February 2017. WS-17-09. [Google Scholar]
- Cao, K.; Wei, C.; Gaidon, A.; Arechiga, N.; Ma, T. Learning imbalanced datasets with label-distribution-aware margin loss. Adv. Neural Inf. Process. Syst. 2019, 32, 1565–1576. [Google Scholar]
- Marrakchi, Y.; Makansi, O.; Brox, T. Fighting class imbalance with contrastive learning. In Proceedings of the 24th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2021), Strasbourg, France, 27 September–1 October 2021; pp. 466–476. [Google Scholar]
- Yang, Z.; Pan, J.; Yang, Y.; Shi, X.; Zhou, H.-Y.; Zhang, Z.; Bian, C. ProCo: Prototype-aware contrastive learning for long-tailed medical image classification. In Proceedings of the 25th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2022), Singapore, 18–22 September 2022; pp. 173–182. [Google Scholar]
- Han, B.; Yao, Q.; Yu, X.; Niu, G.; Xu, M.; Hu, W.; Tsang, I.; Sugiyama, M. Co-teaching: Robust training of deep neural networks with extremely noisy labels. Adv. Neural Inf. Process. Syst. 2018, 31, 8527–8537. [Google Scholar]
- Chen, P.; Liao, B.B.; Chen, G.; Zhang, S. Understanding and utilizing deep neural networks trained with noisy labels. In Proceedings of the 36th International Conference on Machine Learning (ICML 2019), Long Beach, CA, USA, 9–15 June 2019; pp. 1062–1070. [Google Scholar]
- Xue, C.; Dou, Q.; Shi, X.; Chen, H.; Heng, P.-A. Robust learning at noisy labeled medical images. In Proceedings of the IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy, 8–11 April 2019; pp. 1280–1283. [Google Scholar]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27, 2672–2680. [Google Scholar]
- Shin, H.-C.; Roth, H.R.; Gao, M.; Lu, L.; Xu, Z.; Nogues, I.; Yao, J.; Mollura, D.; Summers, R.M. Deep convolutional neural networks for computer-aided detection. IEEE Trans. Med. Imaging 2016, 35, 1285–1298. [Google Scholar] [CrossRef]
- Frid-Adar, M.; Diamant, I.; Klang, E.; Amitai, M.; Goldberger, J.; Greenspan, H. GAN-based synthetic medical image augmentation. Neurocomputing 2018, 321, 321–331. [Google Scholar] [CrossRef]
- Firildak, K.; Talu, M.F. A hybrid capsule network for pneumonia detection. Trait. Signal 2021, 38, 3. [Google Scholar]
- Ho, J.; Jain, A.; Abbeel, P. Denoising diffusion probabilistic models. Adv. Neural Inf. Process. Syst. 2020, 33, 6840–6851. [Google Scholar]
- Kazerouni, A.; Aghdam, E.K.; Heidari, M.; Azad, R.; Fayyaz, M.; Hacihaliloglu, I.; Merhof, D. Diffusion models in medical imaging: A survey. Med. Image Anal. 2023, 88, 102846. [Google Scholar] [CrossRef]
- Kazerouni, A.; Khodapanah Aghdam, E.; Heidari, M.; Azad, R.; Fayyaz, M.; Hacihaliloglu, I.; Merhof, D. Diffusion models for medical image analysis: A comprehensive survey. arXiv 2022, arXiv:2211.07804. [Google Scholar]
- Chen, X.; Liu, Z.; Xie, S.; He, K. Deconstructing denoising diffusion models for self-supervised learning. arXiv 2024, arXiv:2401.14404. [Google Scholar] [CrossRef]
- Yang, Y.; Fu, H.; Aviles-Rivero, A.I.; Schoenlieb, C.B.; Zhu, L. DiffMIC: Dual-guidance diffusion network for medical image classification. In Proceedings of the 26th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2023); Springer: Cham, Switzerland, 2023; pp. 95–105. [Google Scholar]
- Zhu, C.; Chen, W.; Peng, T.; Wang, Y.; Jin, M. Hard sample aware noise robust learning. IEEE Trans. Med. Imaging 2021, 41, 881–894. [Google Scholar] [CrossRef] [PubMed]
- Chen, A.; Li, Y.; Qian, W.; Morse, K.; Miao, C.; Huai, M. Modeling and understanding uncertainty in medical image classification. In Proceedings of the 27th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2024), Marrakesh, Morocco, 6–10 October 2024. [Google Scholar]
- Pfaff, L.; Wagner, F.; Vysotskaya, N.; Thies, M.; Maul, N.; Mei, S.; Wuerfl, T.; Maier, A. No-New-Denoiser: A critical analysis of diffusion models for medical image denoising. In Proceedings of the 27th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2024), Marrakesh, Morocco, 6–10 October 2024. [Google Scholar]







| Dataset | Metric | MTL | LDAM | CL | ProCo | DiffMIC | Ours |
|---|---|---|---|---|---|---|---|
| HAM10000 | Accuracy | 0.811 | 0.857 | 0.865 | 0.887 | 0.905 | 0.914 |
| F1-score | 0.660 | 0.734 | 0.739 | 0.763 | 0.815 | 0.839 | |
| APTOS2019 | Accuracy | 0.813 | 0.813 | 0.835 | 0.837 | 0.838 | 0.855 |
| F1-score | 0.632 | 0.620 | 0.652 | 0.674 | 0.649 | 0.675 |
| Dataset | Metric | Co-Tea | INCV | OUSM | HSA-NRL | DiffMIC | Ours |
|---|---|---|---|---|---|---|---|
| Chaoyang | Accuracy | 0.794 | 0.803 | 0.805 | 0.835 | 0.868 | 0.878 |
| F1-score | 0.720 | 0.741 | 0.737 | 0.766 | 0.815 | 0.830 |
| CG-ANI | HPCM | IDCR | Accuracy | F1-Score | Kappa | |
|---|---|---|---|---|---|---|
| B0 | – | – | – | 0.905 | 0.815 | 0.805 |
| B1 | √ | – | – | 0.910 | 0.844 | 0.815 |
| B2 | √ | √ | – | 0.912 | 0.845 | 0.813 |
| Ours | √ | √ | √ | 0.914 | 0.839 | 0.825 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yan, Y.; Xie, Z.; Huang, W. Confidence-Guided Adaptive Diffusion Network for Medical Image Classification. J. Imaging 2026, 12, 80. https://doi.org/10.3390/jimaging12020080
Yan Y, Xie Z, Huang W. Confidence-Guided Adaptive Diffusion Network for Medical Image Classification. Journal of Imaging. 2026; 12(2):80. https://doi.org/10.3390/jimaging12020080
Chicago/Turabian StyleYan, Yang, Zhuo Xie, and Wenbo Huang. 2026. "Confidence-Guided Adaptive Diffusion Network for Medical Image Classification" Journal of Imaging 12, no. 2: 80. https://doi.org/10.3390/jimaging12020080
APA StyleYan, Y., Xie, Z., & Huang, W. (2026). Confidence-Guided Adaptive Diffusion Network for Medical Image Classification. Journal of Imaging, 12(2), 80. https://doi.org/10.3390/jimaging12020080
