SAR and Visible Image Fusion via Retinex-Guided SAR Reconstruction

Yuman Yuan; Tianyu Deng; Yi Le; Hongyang Bai; Shuai Guo; Shangjing Sun; Yuanbo Chen

doi:10.3390/rs18010111

,

and

¹

Key Laboratory of Maritime Intelligent Cyberspace Technology, School of Energy and Power Engineering, Ministry of Education, Nanjing University of Science and Technology, Nanjing 210094, China

²

Nanjing Institute of Electronic Engineering, Nanjing 210046, China

³

National Key Laboratory of Land and Air Based Information Perception and Control, Xi’an Modern Control Technology Research Institute, Xi’an 710065, China

^*

Author to whom correspondence should be addressed.

Remote Sens.2026, 18(1), 111;https://doi.org/10.3390/rs18010111
(registering DOI)

Version Notes

Order Reprints

Abstract

The fusion of synthetic aperture radar (SAR) and visible images offers complementary spatial and spectral information, enabling more reliable and comprehensive scene interpretation. However, SAR speckle noise and the intrinsic modality gap pose significant challenges for existing methods in extracting consistent and complementary features. To address these issues, we propose VGSRF-Net, a Retinex-guided SAR reconstruction-driven fusion network that leverages visible-image priors to refine SAR features. This approach effectively reduces modality discrepancies before fusion, enabling improved multi-modal representation. The cross-modality reconstruction module (CMRM) reconstructs SAR features guided by visible priors, effectively reducing modality discrepancies before fusion and enabling improved multi-modal representation. The multi-modal feature joint representation module (MFJRM) enhances cross-modal complementarity by integrating global contextual interactions and local dynamic convolution, thereby achieving further feature alignment. Finally, the feature enhancement module (FEM) refines multi-scale spatial features and selectively enhances high-frequency details in the frequency domain, improving structural clarity and texture fidelity. Extensive experiments on diverse real-world remote sensing datasets demonstrate that VGSRF-Net surpasses state-of-the-art methods in denoising, structural preservation, and generalization under varying noise and illumination conditions.

Keywords:

image fusion; SAR reconstruction; feature enhancement; multi-modal representation

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Article metric data becomes available approximately 24 hours after publication online.