CMFA-Net: A CNN–Mamba Collaborative Feature Alignment Network for Robust Medical Image Segmentation

Yang, Liu; Wang, Hui; Fu, Xiaolin; Wang, Yang; Wu, Duohai

doi:10.3390/electronics15112343

This is an early access version; the complete PDF, HTML, and XML versions will be available soon.

Open Access Article

by Liu Yang ¹, Hui Wang ¹,², Xiaolin Fu ¹,*, Yang Wang ¹ and Duohai Wu ¹

¹ College of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China

² Jilin Provincial Science and Technology Innovation Center for Multimodal Cognitive Computing and Analysis of Medical Biometrics, Changchun 130102, China

* Author to whom correspondence should be addressed.

Electronics 2026, 15(11), 2343; https://doi.org/10.3390/electronics15112343

Submission received: 8 April 2026 / Revised: 9 May 2026 / Accepted: 26 May 2026 / Published: 28 May 2026

(This article belongs to the Special Issue AI-Driven Medical Image/Video Processing)


Abstract

Medical image segmentation still faces three critical challenges: insufficient joint modeling of local details and long-range dependencies, the high computational burden of transformer-based architectures for high-resolution inputs, and performance degradation caused by domain shift across imaging centers and acquisition devices. To address these issues, this paper proposes CMFA-Net, a CNN–Mamba collaborative feature alignment network for robust medical image segmentation. The proposed framework adopts Vision Mamba (VSSM) as the encoder backbone to capture long-range contextual dependencies with linear computational complexity. A CNN–Mamba fusion attention (CMFA) module is designed to integrate the local representation capability of convolution with the long-range modeling capability of Mamba, improving the segmentation of complex boundaries and multi-scale targets. In addition, an enhanced multi-scale context aggregation decoder (EMCAD) is introduced to reduce the semantic gap between encoder and decoder features and strengthen hierarchical feature fusion. To enhance cross-dataset robustness, a contrastive domain alignment learning (cDAL) strategy is applied in the intermediate feature space to learn domain-invariant discriminative representations via an InfoNCE-based objective. Experiments on the CirrMRI600+ pathological liver MRI dataset and several public polyp segmentation benchmarks show that the proposed method achieves competitive segmentation performance. Ablation studies provide empirical evidence for the contributions of the CMFA module, EMCAD decoder, and cDAL mechanism under the same experimental protocol. These results suggest that CMFA-Net is a promising framework for medical image segmentation across heterogeneous datasets.

Keywords: medical image segmentation; Vision Mamba (VSSM); CNN–Mamba collaboration; feature alignment; hierarchical feature fusion; computational efficiency
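
The abstract states that cDAL uses an InfoNCE-based objective in the intermediate feature space. As a rough illustration of what such an objective computes, the following minimal sketch implements a generic InfoNCE loss in plain Python. It is not the authors' implementation: the function names `cosine` and `info_nce`, the temperature value, the use of cosine similarity, and the choice to pair `anchors[i]` with `positives[i]` (for example, features of the same structure from two domains) while treating the other entries in the batch as negatives are all assumptions made here for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)  # small epsilon avoids division by zero


def info_nce(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss over a batch (generic form, assumed pairing).

    positives[i] is the positive for anchors[i]; every other positives[j]
    serves as a negative for anchors[i].
    """
    n = len(anchors)
    total = 0.0
    for i in range(n):
        # Temperature-scaled similarities of anchor i to every candidate.
        logits = [cosine(anchors[i], positives[j]) / temperature for j in range(n)]
        # Numerically stable log-sum-exp for the denominator.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(x - m) for x in logits))
        # -log( exp(logit_ii) / sum_j exp(logit_ij) )
        total += log_denom - logits[i]
    return total / n


# Hypothetical 2-D features: well-aligned pairs versus swapped (misaligned) pairs.
aligned = info_nce([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
misaligned = info_nce([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(aligned < misaligned)
```

The loss is small when each anchor is most similar to its own positive and grows when an anchor is closer to a negative, which is the property a contrastive alignment objective relies on to pull matching features together across domains.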

