Next Article in Journal
Leaf Angle Distribution Effects on Modelling Accuracy of Sensible and Latent Heat Fluxes in Sunflower and Wheat Crops
Previous Article in Journal
Multi-Context Validation of Global Fractional Vegetation Cover Products in Croplands Using Multi-Source Crop FVC References
Previous Article in Special Issue
Weakly Supervised Oriented Object Detection in Remote Sensing via Geometry-Aware Enhancement Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

StarRoute-DBNet: A Novel Multi-Modal Framework for Advanced Target Detection in Dynamic Environments Using SAR and Optical Image Fusion with FocusGraph and PhaseRoute

by
Lanfang Lei
1,
Sheng Chang
2,
Zhongzhen Sun
3,
Jianxin Zou
1,
Huazheng Yang
1,
Xinli Zheng
1,
Changyu Liao
1,
Wenjun Wei
1,
Long Ma
1 and
Ping Zhong
1,*
1
College of Physics, Donghua University, Shanghai 200051, China
2
Department of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Beijing 100190, China
3
College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(11), 1731; https://doi.org/10.3390/rs18111731
Submission received: 26 March 2026 / Revised: 22 May 2026 / Accepted: 23 May 2026 / Published: 27 May 2026

Abstract

Multimodal object detection based on synthetic aperture radar (SAR) and optical imagery is of great significance in remote sensing, particularly under adverse weather conditions, nighttime environments, and complex background scenarios. Although SAR imagery has unique advantages under all-weather conditions, its object detection performance still faces challenges in low-texture regions and cluttered scenes. Optical imagery provides rich spatial and texture information, but its applicability is limited in harsh environments. To overcome the limitations of unimodal SAR object detection, this paper proposes a novel multimodal object detection framework, termed StarRoute-DBNet, to improve detection accuracy and robustness through multimodal data fusion and efficient feature interaction. Specifically, a FocusGraph (Graph Convolution-Based Feature Relationship Modeling) module is first designed to adaptively model the spatial relationships between optical and SAR features via graph convolutional networks (GCNs), thereby capturing complex cross-modal spatial dependencies. This module enhances feature interaction across modalities, improves the localization accuracy of oriented targets, and shows clear advantages for small-object detection in complex backgrounds. Second, to alleviate the loss of critical information during downsampling, a PhaseRoute (Sparse Routing Polyphase Downsampling Module) is introduced, which combines multi-phase decomposition with a Top-2 sparse routing strategy to preserve informative spatial cues. By incorporating Gumbel noise into the routing process, the proposed module further improves routing flexibility, detection accuracy, and model robustness. In addition, a Multi-Scale Shuffle-Gated Fusion (MSSGF) module is proposed to address the multi-scale issue in multimodal feature fusion. This module integrates multi-scale convolutional branches, channel shuffles, and dual-attention mechanisms to enhance feature interaction across scales, while an adaptive weighted fusion strategy is employed to dynamically adjust the fusion weights of multimodal features. As a result, the proposed method significantly improves detection accuracy and robustness, especially in complex scenes. Extensive experiments conducted on the MVSDA dataset and the M4-SAR dataset demonstrate that the proposed StarRoute-DBNet consistently outperforms existing state-of-the-art methods under complex backgrounds and adverse conditions. In particular, it achieves clear advantages in oriented object detection and small-object detection, verifying its effectiveness and robustness for cross-modal remote sensing object detection.
Keywords: synthetic aperture radar (SAR); optical imagery; multimodal object detection; oriented object detection; feature fusion synthetic aperture radar (SAR); optical imagery; multimodal object detection; oriented object detection; feature fusion

Share and Cite

MDPI and ACS Style

Lei, L.; Chang, S.; Sun, Z.; Zou, J.; Yang, H.; Zheng, X.; Liao, C.; Wei, W.; Ma, L.; Zhong, P. StarRoute-DBNet: A Novel Multi-Modal Framework for Advanced Target Detection in Dynamic Environments Using SAR and Optical Image Fusion with FocusGraph and PhaseRoute. Remote Sens. 2026, 18, 1731. https://doi.org/10.3390/rs18111731

AMA Style

Lei L, Chang S, Sun Z, Zou J, Yang H, Zheng X, Liao C, Wei W, Ma L, Zhong P. StarRoute-DBNet: A Novel Multi-Modal Framework for Advanced Target Detection in Dynamic Environments Using SAR and Optical Image Fusion with FocusGraph and PhaseRoute. Remote Sensing. 2026; 18(11):1731. https://doi.org/10.3390/rs18111731

Chicago/Turabian Style

Lei, Lanfang, Sheng Chang, Zhongzhen Sun, Jianxin Zou, Huazheng Yang, Xinli Zheng, Changyu Liao, Wenjun Wei, Long Ma, and Ping Zhong. 2026. "StarRoute-DBNet: A Novel Multi-Modal Framework for Advanced Target Detection in Dynamic Environments Using SAR and Optical Image Fusion with FocusGraph and PhaseRoute" Remote Sensing 18, no. 11: 1731. https://doi.org/10.3390/rs18111731

APA Style

Lei, L., Chang, S., Sun, Z., Zou, J., Yang, H., Zheng, X., Liao, C., Wei, W., Ma, L., & Zhong, P. (2026). StarRoute-DBNet: A Novel Multi-Modal Framework for Advanced Target Detection in Dynamic Environments Using SAR and Optical Image Fusion with FocusGraph and PhaseRoute. Remote Sensing, 18(11), 1731. https://doi.org/10.3390/rs18111731

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop