CMT-BUSNet: Adaptive Fusion-Based Triple-Branch Hybrid Architecture for Explainable Breast Ultrasound Tumor Segmentation
Abstract
1. Introduction
2. Related Works
2.1. Deep Learning-Based Segmentation in Breast Ultrasound
2.2. Hybrid and Transformer-Based Architectures
2.3. Explainable Artificial Intelligence in Medical Imaging
3. Materials and Methods
3.1. Dataset
3.2. CMT-BUSNet Architecture
3.3. Training Details
3.4. Evaluation Metrics
3.5. XAI Framework
4. Results
4.1. Comparative Performance Analysis
4.2. Statistical Analysis
4.3. Training-Based Ablation Study
| Configuration | DSC | B-IoU | HD95 ↓ | Params (M) |
|---|---|---|---|---|
| Full model (AFFM) | 0.9037 ± 0.0047 | 0.6108 | 10.07 | 33.32 |
| CNN + Mamba (w/o Trans.) | 0.9025 ± 0.006 | 0.6092 | 10.01 | 30.18 |
| CNN + Trans. (w/o Mamba) | 0.8996 ± 0.006 | 0.5986 | 10.15 | 28.06 |
| CNN Only | 0.9011 ± 0.005 | 0.6062 | 10.25 | 21.43 |
| Concat + 1 × 1Conv | 0.9032 ± 0.004 | 0.6058 | 9.87 | 33.32 |
| SE Attention | 0.9010 ± 0.004 | 0.6051 | 10.43 | 33.50 |
| Simple Average | 0.9010 ± 0.007 | 0.6022 | 10.15 | 32.28 |
| w/o DBR loss | 0.9028 ± 0.006 | 0.6054 | 10.01 | 30.71 |
| w/o Dense Nested Dec. | 0.9015 ± 0.005 | 0.5989 | 10.07 | 32.40 |

4.4. AFFM Fusion Weight Analysis

4.5. XAI Visualizations
4.5.1. Quantitative XAI Validation
4.5.2. Failure Case Analysis
4.6. Computational Efficiency
4.7. Comparison with the Literature
4.8. nnU-Net Benchmark Comparison
5. Discussion
5.1. Difficulty Based Analysis
5.2. Learning Curve Analysis
5.3. External Validation
6. Conclusions and Limitations
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Al-Karawi, D.; Al-Zaidi, S.; Helael, K.A.; Obeidat, N.; Mouhsen, A.M.; Ajam, T.; Alshalabi, B.A.; Salman, M.; Ahmed, M.H. A Review of Artificial Intelligence in Breast Imaging. Tomography 2024, 10, 705–726. [Google Scholar] [CrossRef]
- Zhang, J.; Wu, J.; Zhou, X.S.; Shi, F.; Shen, D. Recent Advancements in Artificial Intelligence for Breast Cancer: Image Augmentation, Segmentation, Diagnosis, and Prognosis Approaches. Semin. Cancer Biol. 2023, 96, 11–25. [Google Scholar] [CrossRef]
- Ferreira, M.R.; Torres, H.R.; Oliveira, B.; Gomes-Fonseca, J.; Morais, P.; Novais, P.; Vilaça, J.L. Comparative Analysis of Current Deep Learning Networks for Breast Lesion Segmentation in Ultrasound Images. In Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Glasgow, UK, 11–15 July 2022; pp. 3878–3881. [Google Scholar]
- Zhang, S.; Liao, M.; Wang, J.; Zhu, Y.; Zhang, Y.; Zhang, J.; Zheng, R.; Lv, L.; Zhu, D.; Chen, H.; et al. Fully Automatic Tumor Segmentation of Breast Ultrasound Images with Deep Learning. J. Appl. Clin. Med. Phys. 2023, 24, e13863. [Google Scholar] [CrossRef] [PubMed]
- Li, W.; Ye, X.; Chen, X.; Jiang, X.; Yang, Y. A Deep Learning-Based Method for the Detection and Segmentation of Breast Masses in Ultrasound Images. Phys. Med. Biol. 2024, 69, 155027. [Google Scholar] [CrossRef] [PubMed]
- Yang, L.; Zhang, B.; Ren, F.; Gu, J.; Gao, J.; Wu, J.; Li, D.; Jia, H.; Li, G.; Zong, J.; et al. Rapid Segmentation and Diagnosis of Breast Tumor Ultrasound Images at the Sonographer Level Using Deep Learning. Bioengineering 2023, 10, 1220. [Google Scholar] [CrossRef] [PubMed]
- Borys, K.; Schmitt, Y.A.; Nauta, M.; Seifert, C.; Krämer, N.; Friedrich, C.M.; Nensa, F. Explainable AI in Medical Imaging: An Overview for Clinical Practitioners—Beyond Saliency-Based XAI Approaches. Eur. J. Radiol. 2023, 162, 110786. [Google Scholar] [CrossRef] [PubMed]
- Ghnemat, R.; Alodibat, S.; Abu Al-Haija, Q. Explainable Artificial Intelligence (XAI) for Deep Learning Based Medical Imaging Classification. J. Imaging 2023, 9, 177. [Google Scholar] [CrossRef]
- Li, Y.; Liu, Y.; Huang, L.; Wang, Z.; Luo, J. Deep Weakly-Supervised Breast Tumor Segmentation in Ultrasound Images with Explicit Anatomical Constraints. Med. Image Anal. 2022, 76, 102315. [Google Scholar] [CrossRef]
- Misra, S.; Yoon, C.; Kim, K.-J.; Managuli, R.; Barr, R.G.; Baek, J.; Kim, C. Deep Learning-Based Multimodal Fusion Network for Segmentation and Classification of Breast Cancers Using B-Mode and Elastography Ultrasound Images. Bioeng. Transl. Med. 2023, 8, e10480. [Google Scholar] [CrossRef]
- Chen, J.; Mei, J.; Li, X.; Lu, Y.; Yu, Q.; Wei, Q.; Luo, X.; Xie, Y.; Adeli, E.; Wang, Y.; et al. TransUNet: Rethinking the U-Net Architecture Design for Medical Image Segmentation through the Lens of Transformers. Med. Image Anal. 2024, 97, 103280. [Google Scholar] [CrossRef]
- Zhu, H.; Huang, Y.; Yao, K.; Shang, J.; Hu, K.; Li, Z.; He, G. AttmNet: A Hybrid Transformer Integrating Self-Attention, Mamba, and Multi-Layer Convolution for Enhanced Lesion Segmentation. Quant. Imaging Med. Surg. 2025, 15, 4296–4310. [Google Scholar] [CrossRef]
- Xiong, Y.; Shu, X.; Liu, Q.; Yuan, D. HCMNet: A Hybrid CNN-Mamba Network for Breast Ultrasound Segmentation for Consumer Assisted Diagnosis. IEEE Trans. Consum. Electron. 2025, 71, 8045–8054. [Google Scholar] [CrossRef]
- Uppal, D.; Prakash, S. CLT-MambaSeg: An Integrated Model of Convolution, Linear Transformer and Multiscale Mamba for Medical Image Segmentation. Comput. Biol. Med. 2025, 196, 110736. [Google Scholar] [CrossRef]
- Teng, Z.; Li, L.; Xin, Z.; Xiang, D.; Huang, J.; Zhou, H.; Shi, F.; Zhu, W.; Cai, J.; Peng, T.; et al. A Literature Review of Artificial Intelligence (AI) for Medical Image Segmentation: From AI and Explainable AI to Trustworthy AI. Quant. Imaging Med. Surg. 2024, 14, 9620–9652. [Google Scholar] [CrossRef] [PubMed]
- Nikulins, A.; Edelmers, E.; Sudars, K.; Polaka, I. Adapting Classification Neural Network Architectures for Medical Image Segmentation Using Explainable AI. J. Imaging 2025, 11, 55. [Google Scholar] [CrossRef] [PubMed]
- Khaledyan, D.; Marini, T.J.; O’Connell, A.; Meng, S.; Kan, J.; Brennan, G.; Zhao, Y.; Baran, T.M.; Parker, K.J. WATUNet: A Deep Neural Network for Segmentation of Volumetric Sweep Imaging Ultrasound. Mach. Learn. Sci. Technol. 2024, 5, 15042. [Google Scholar] [CrossRef]
- Boro, L.O.; Nandi, G. CBAM-RIUnet: Breast Tumor Segmentation with Enhanced Breast Ultrasound and Test-Time Augmentation. Ultrason. Imaging 2025, 47, 24–36. [Google Scholar] [CrossRef]
- Pramanik, P.; Roy, A.; Cuevas, E.; Perez-Cisneros, M.; Sarkar, R. DAU-Net: Dual Attention-Aided U-Net for Segmenting Tumor in Breast Ultrasound Images. PLoS ONE 2024, 19, e0303670. [Google Scholar] [CrossRef]
- Gómez-Flores, W.; Gregorio-Calas, M.J.; Coelho de Albuquerque Pereira, W. BUS-BRA: A Breast Ultrasound Dataset for Assessing Computer-Aided Diagnosis Systems. Med. Phys. 2024, 51, 3110–3123. [Google Scholar] [CrossRef]
- Al-Dhabyani, W.; Gomaa, M.; Khaled, H.; Fahmy, A. Dataset of Breast Ultrasound Images. Data Br. 2020, 28, 104863. [Google Scholar] [CrossRef]
- Guan, H.; Liu, M. Domain Adaptation for Medical Image Analysis: A Survey. IEEE Trans. Biomed. Eng. 2022, 69, 1173–1185. [Google Scholar] [CrossRef]
- Zhang, Y.; Liu, H.; Hu, Q. TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation. In Proceedings of the Medical Image Computing and Computer Assisted Intervention—MICCAI 2021, Strasbourg, France, 27 September–1 October 2021; Lecture Notes in Computer Science; de Bruijne, M., Cattin, P.C., Cotin, S., Padoy, N., Speidel, S., Zheng, Y., Essert, C., Eds.; Springer: Cham, Switzerland, 2021; Volume 12901, pp. 14–24. [Google Scholar] [CrossRef]
- He, A.; Wang, K.; Li, T.; Du, C.; Xia, S.; Fu, H. H2Former: An Efficient Hierarchical Hybrid Transformer for Medical Image Segmentation. IEEE Trans. Med. Imaging 2023, 42, 2763–2775. [Google Scholar] [CrossRef]
- Isensee, F.; Jaeger, P.F.; Kohl, S.A.A.; Petersen, J.; Maier-Hein, K.H. NnU-Net: A Self-Configuring Method for Deep Learning-Based Biomedical Image Segmentation. Nat. Methods 2021, 18, 203–211. [Google Scholar] [CrossRef]
- Gipiškis, R.; Tsai, C.-W.; Kurasova, O. Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and beyond: A Survey. ICT Express 2024, 10, 1331–1354. [Google Scholar] [CrossRef]
- Wei, W.; Wu, J.; Shao, G. Research on Breast Tumor Segmentation Based on the Mamba Architecture. Front. Oncol. 2025, 15, 1672274. [Google Scholar] [CrossRef]
- Jiao, J.; Jiao, J.; Li, Z. BS-Mamba: A Robust Network for Breast Ultrasound Image Segmentation Using Mamba Architecture. Int. J. Pattern Recognit. Artif. Intell. 2025, 39, 2540002. [Google Scholar] [CrossRef]
- Shi, J.; Liu, H.; Li, Z. A Hybrid Medical Image Semantic Segmentation Network Based on Novel Mamba and Transformer. IET Image Process. 2025, 19, e70205. [Google Scholar] [CrossRef]
- Varsavsky, T.; Orbes-Arteaga, M.; Sudre, C.H.; Graham, M.S.; Nachev, P.; Cardoso, M.J. Test-Time Unsupervised Domain Adaptation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2020, Lima, Peru, 4–8 October 2020; Martel, A.L., Abolmaesumi, P., Stoyanov, D., Mateus, D., Zuluaga, M.A., Zhou, S.K., Racoceanu, D., Joskowicz, L., Eds.; Springer: Cham, Switzerland, 2020; Volume 12261, pp. 428–436. [Google Scholar]
- Kozegar, E.; Soryani, M.; Behnam, H.; Salamati, M.; Tan, T. Mass Segmentation in Automated 3-D Breast Ultrasound Using Adaptive Region Growing and Supervised Edge-Based Deformable Model. IEEE Trans. Med. Imaging 2018, 37, 918–928. [Google Scholar] [CrossRef] [PubMed]








| Study | Year | Method | Dataset | N | DSC | IoU | XAI | Real-Time |
|---|---|---|---|---|---|---|---|---|
| Zhang et al. [4] | 2023 | Cls + Seg CNN | Custom | 1600 | 0.898 | — | No | No |
| Li et al. [9] | 2022 | CAM-DLS | Custom | 1422 | 0.773 | 0.660 | No | No |
| Ferreira et al. [3] | 2022 | GG-Net | Multi | 2000 | 0.826 | — | No | No |
| Yang et al. [6] | 2023 | Att-Seg CNN | Custom | 2057 | 0.890 | — | No | No |
| Khaledyan et al. [17] | 2024 | WATUNet | BUSI + VSI | 4598 | 0.930 | — | No | No |
| Li et al. [5] | 2024 | GOLO-CMSS | USTC | 28,477 | 0.932 | — | No | No |
| Boro & Nandi [18] | 2024 | CBAM-RIUnet | BUSI | 780 | 0.894 | 0.887 | No | No |
| Pramanik et al. [19] | 2024 | DAU-Net | BUSI | 780 | 0.742 | — | No | No |
| Zhu et al. [12] | 2025 | Attune | BUSI + BUS | 1062 | 0.880 | 0.810 | No | No |
| Present study | 2026 | CMT-BUSNet | BUS-BRA | 1875 | 0.9036 | 0.8348 | 11 modules | 73.1 FPS |
| Characteristic | BUS-BRA (Internal) | BUSI (External) |
|---|---|---|
| Country | Brazil | Egypt |
| Source | Multi-institutional | Cairo University Hospital |
| N (total images) | 1875 | 647 |
| Benign cases, n (%) | 1125 (60.0%) | 437 (67.5%) |
| Malignant cases, n (%) | 750 (40.0%) | 210 (32.5%) |
| Imaging device | Multiple devices | Mindray DC-30 |
| Approximate image size (px) | ~500 × 500 | ~300 × 230 |
| Number of annotators | Multiple expert radiologists | Single radiologist |
| Annotation type | Pixel-level binary mask | Pixel-level binary mask |
| Image format | PNG | PNG |
| Role in study | Training/Internal validation | External validation (test only) |
| Model | DSC | IoU | HD95 | B-IoU | Recall | Precision | Specificity |
|---|---|---|---|---|---|---|---|
| CMT-BUSNet (proposed) | 0.9037 ± 0.0047 | 0.8348 | 10.07 | 0.6108 | 0.9243 | 0.8996 | 0.9905 |
| nnU-Net v2 † | 0.9108 ± 0.0031 | 0.8462 | 13.54 | 0.5571 | 0.9226 | 0.9142 | 0.9901 |
| U-Net++ | 0.9030 ± 0.0056 | 0.8335 | 9.64 | 0.5474 | 0.9158 | 0.9054 | 0.9911 |
| U-Net | 0.9007 ± 0.0060 | 0.8310 | 10.0 | 0.5433 | 0.9156 | 0.9033 | 0.9909 |
| DeepLabV3+ | 0.8990 ± 0.0076 | 0.8267 | 9.49 | 0.5285 | 0.9098 | 0.9034 | 0.9913 |
| Swin-UNet | 0.8983 ± 0.0060 | 0.8271 | 10.04 | 0.5361 | 0.9151 | 0.8982 | 0.9905 |
| SegNet | 0.8808 ± 0.0054 | 0.8041 | 12.91 | 0.5201 | 0.9008 | 0.8852 | 0.9900 |
| TransUNet | 0.8778 ± 0.0061 | 0.8013 | 12.92 | 0.5177 | 0.8947 | 0.8850 | 0.9893 |
| VM-UNet | 0.8463 ± 0.0068 | 0.7579 | 16.84 | 0.4756 | 0.8739 | 0.8520 | 0.9877 |
| Attention U-Net | 0.8149 ± 0.0103 | 0.7120 | 36.41 | 0.4587 | 0.8438 | 0.8192 | 0.9838 |
| Model | Parameters (M) | Inference (ms) | FPS | DSC |
|---|---|---|---|---|
| CMT-BUSNet | 33.32 | 13.7 | 73.1 | 0.9036 |
| U-Net++ | 26.07 | 1.9 | 523.7 | 0.9030 |
| U-Net | 24.43 | 1.2 | 859.0 | 0.9007 |
| Swin-UNet | 34.40 | 4.8 | 209.4 | 0.8983 |
| Model | DSC ↑ | IoU ↑ | Precision ↑ | Recall ↑ | HD95 ↓ | B-IoU ↑ | p | Sig. |
|---|---|---|---|---|---|---|---|---|
| CMT-BUSNet | 0.6709 ± 0.0227 | 0.5733 ± 0.0236 | 0.7238 ± 0.0374 | 0.7268 ± 0.0348 | 37.22 ± 3.00 | 0.1703 ± 0.0123 | — | — |
| nnU-Net v2 | 0.5579 ±0.0274 | 0.4533 ±0.0262 | 0.5419 ± 0.0274 | 0.7428 ± 0.0262 | 208.4743 ±12.5400 | 0.0582 ± 0.057 | 0.001 | ✓ |
| U-Net | 0.5172 ± 0.1166 | 0.4379 ± 0.1020 | 0.7373 ± 0.0579 | 0.5495 ± 0.1499 | 40.58 ± 3.50 | 0.1229 ± 0.0305 | 0.039 | ✓ |
| U-Net++ | 0.4818 ± 0.0863 | 0.4040 ± 0.0777 | 0.7117 ± 0.1244 | 0.5422 ± 0.1476 | 45.21 ± 11.72 | 0.1049 ± 0.0279 | 0.014 | ✓ |
| Attention U-Net | 0.1065 ± 0.0533 | 0.0611 ± 0.0325 | 0.0845 ± 0.0122 | 0.4741 ± 0.3991 | 151.53 ± 7.61 | 0.0100 ± 0.0019 | <0.001 | ✓ |
| DeepLabV3+ | 0.0978 ± 0.0798 | 0.0582 ± 0.0475 | 0.2064 ± 0.2399 | 0.5955 ± 0.4862 | 141.37 ± 22.80 | 0.0025 ± 0.0036 | <0.001 | ✓ |
| Swin-UNet | 0.0813 ± 0.0654 | 0.0471 ± 0.0391 | 0.0624 ± 0.0385 | 0.3758 ± 0.4205 | 157.72 ± 7.96 | 0.0061 ± 0.0051 | <0.001 | ✓ |
| TransUNet | 0.0625 ± 0.0763 | 0.0369 ± 0.0451 | 0.4575 ± 0.4030 | 0.3930 ± 0.4813 | 140.32 ± 31.79 | 0.0016 ± 0.0024 | <0.001 | ✓ |
| VM-UNet | 0.0564 ± 0.0653 | 0.0325 ± 0.0383 | 0.0415 ± 0.0349 | 0.2888 ± 0.3866 | 159.54 ± 8.83 | 0.0043 ± 0.0037 | <0.001 | ✓ |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Kutlu, H.; Çolak, C. CMT-BUSNet: Adaptive Fusion-Based Triple-Branch Hybrid Architecture for Explainable Breast Ultrasound Tumor Segmentation. Diagnostics 2026, 16, 1203. https://doi.org/10.3390/diagnostics16081203
Kutlu H, Çolak C. CMT-BUSNet: Adaptive Fusion-Based Triple-Branch Hybrid Architecture for Explainable Breast Ultrasound Tumor Segmentation. Diagnostics. 2026; 16(8):1203. https://doi.org/10.3390/diagnostics16081203
Chicago/Turabian StyleKutlu, Hüseyin, and Cemil Çolak. 2026. "CMT-BUSNet: Adaptive Fusion-Based Triple-Branch Hybrid Architecture for Explainable Breast Ultrasound Tumor Segmentation" Diagnostics 16, no. 8: 1203. https://doi.org/10.3390/diagnostics16081203
APA StyleKutlu, H., & Çolak, C. (2026). CMT-BUSNet: Adaptive Fusion-Based Triple-Branch Hybrid Architecture for Explainable Breast Ultrasound Tumor Segmentation. Diagnostics, 16(8), 1203. https://doi.org/10.3390/diagnostics16081203

