Next Article in Journal
A Vertebra-Aware Framework for Structured Analysis of Post-Fracture Lumbar CT
Previous Article in Journal
Parameters Identification of Sub-Synchronous Oscillation in D-PMSG Based on Improved VMD and TLS-MP
Previous Article in Special Issue
Causal Representation-Based Personalized Federated Learning with Causal Graph Consensus for Medical Imaging
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

CMFA-Net: A CNN–Mamba Collaborative Feature Alignment Network for Robust Medical Image Segmentation

1
College of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
2
Jilin Provincial Science and Technology Innovation Center for Multimodal Cognitive Computing and Analysis of Medical Biometrics, Changchun 130102, China
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(11), 2343; https://doi.org/10.3390/electronics15112343
Submission received: 8 April 2026 / Revised: 9 May 2026 / Accepted: 26 May 2026 / Published: 28 May 2026
(This article belongs to the Special Issue AI-Driven Medical Image/Video Processing)

Abstract

Medical image segmentation still faces three critical challenges: insufficient joint modeling of local details and long-range dependencies, the high computational burden of transformer-based architectures for high-resolution inputs, and performance degradation caused by domain shift across imaging centers and acquisition devices. To address these issues, this paper proposes CMFA-Net, a CNN–Mamba collaborative feature alignment network for robust medical image segmentation. The proposed framework adopts Vision Mamba (VSSM) as the encoder backbone to capture long-range contextual dependencies with linear computational complexity. A CNN–Mamba fusion attention (CMFA) module is designed to integrate the local representation capability of convolution with the long-range modeling capability of Mamba, improving the segmentation of complex boundaries and multi-scale targets. In addition, an enhanced multi-scale context aggregation decoder (EMCAD) is introduced to reduce the semantic gap between encoder and decoder features and strengthen hierarchical feature fusion. To enhance cross-dataset robustness, a contrastive domain alignment learning (cDAL) strategy is applied in the intermediate feature space to learn domain-invariant discriminative representations via an InfoNCE-based objective. Experiments on the CirrMRI600+ pathological liver MRI dataset and several public polyp segmentation benchmarks show that the proposed method achieves competitive segmentation performance. Ablation studies provide empirical evidence for the contributions of the CMFA module, EMCAD decoder, and cDAL mechanism under the same experimental protocol. These results suggest that CMFA-Net is a promising framework for medical image segmentation across heterogeneous datasets.
Keywords: medical image segmentation; Vision Mamba (VSSM); CNN–Mamba collaboration; feature alignment; hierarchical feature fusion; computational efficiency medical image segmentation; Vision Mamba (VSSM); CNN–Mamba collaboration; feature alignment; hierarchical feature fusion; computational efficiency

Share and Cite

MDPI and ACS Style

Yang, L.; Wang, H.; Fu, X.; Wang, Y.; Wu, D. CMFA-Net: A CNN–Mamba Collaborative Feature Alignment Network for Robust Medical Image Segmentation. Electronics 2026, 15, 2343. https://doi.org/10.3390/electronics15112343

AMA Style

Yang L, Wang H, Fu X, Wang Y, Wu D. CMFA-Net: A CNN–Mamba Collaborative Feature Alignment Network for Robust Medical Image Segmentation. Electronics. 2026; 15(11):2343. https://doi.org/10.3390/electronics15112343

Chicago/Turabian Style

Yang, Liu, Hui Wang, Xiaolin Fu, Yang Wang, and Duohai Wu. 2026. "CMFA-Net: A CNN–Mamba Collaborative Feature Alignment Network for Robust Medical Image Segmentation" Electronics 15, no. 11: 2343. https://doi.org/10.3390/electronics15112343

APA Style

Yang, L., Wang, H., Fu, X., Wang, Y., & Wu, D. (2026). CMFA-Net: A CNN–Mamba Collaborative Feature Alignment Network for Robust Medical Image Segmentation. Electronics, 15(11), 2343. https://doi.org/10.3390/electronics15112343

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop