Next Article in Journal
Projection for Ecological Carrying Capacity Based on the Interpretable CAXO Model: The Case of China
Previous Article in Journal
Coupling Coordination Between Urban Development and Eco-Environment in Chinese Coastal Cities: A Multisource Remote Sensing-Based Assessment
Previous Article in Special Issue
LMamba: Local-Guided Mamba with Multi-Scale Filtering for Hyperspectral Image Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Cross-Modality Spectral Expansion Combined with Physical–Semantic Dual Priors for Cloud Detection in GF-1 Imagery

State Key Laboratory of Integrated Services Networks, School of Telecommunication Engineering, Xidian University, Xi’an 710071, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(11), 1689; https://doi.org/10.3390/rs18111689 (registering DOI)
Submission received: 20 April 2026 / Revised: 14 May 2026 / Accepted: 21 May 2026 / Published: 23 May 2026

Abstract

Cloud detection in high-resolution Gaofen-1 (GF-1) imagery is challenging due to the absence of short-wave infrared (SWIR) bands, which prevents the use of physically interpretable indices such as the Normalized Difference Snow Index (NDSI) and often leads to severe cloud–snow confusion. To address this limitation, we propose a unified framework, termed the Cross-Modality Spectral Expansion and Dual-Prior Network (CMSE-DPNet), that integrates cross-modality spectral expansion with physical–semantic dual priors. First, an improved CycleGAN reconstructs 13-band pseudo-Sentinel-2 spectra from four-band GF-1 imagery, enabling the computation of snow-sensitive physical indices. Second, a Snow-Aware Feature Attention Guidance Module (SAFAGM) introduces pixel-level physical priors derived from NDSI, while a Label-Guided Channel Attention Module (LG-CAM) injects scene-level semantic priors inferred from geographic metadata using a large language model. These complementary priors guide the network to better distinguish clouds from spectrally similar backgrounds. Experiments on the GF-1 dataset show that the proposed method achieves an F1-score of 94.41% and an Intersection over Union (IoU) of 89.40%, outperforming several state-of-the-art cloud detection methods. The results indicate that cross-modality spectral expansion combined with physical–semantic prior guidance effectively improves cloud detection performance in complex cloud–snow coexistence scenarios.
Keywords: cloud detection; spectral expansion; attention mechanism; LLM-generated labels; channel attention; semantic-guided modulation; remote sensing cloud detection; spectral expansion; attention mechanism; LLM-generated labels; channel attention; semantic-guided modulation; remote sensing

Share and Cite

MDPI and ACS Style

Zhang, J.; Shen, K.; Song, L.; Pan, S.; Li, Y. Cross-Modality Spectral Expansion Combined with Physical–Semantic Dual Priors for Cloud Detection in GF-1 Imagery. Remote Sens. 2026, 18, 1689. https://doi.org/10.3390/rs18111689

AMA Style

Zhang J, Shen K, Song L, Pan S, Li Y. Cross-Modality Spectral Expansion Combined with Physical–Semantic Dual Priors for Cloud Detection in GF-1 Imagery. Remote Sensing. 2026; 18(11):1689. https://doi.org/10.3390/rs18111689

Chicago/Turabian Style

Zhang, Jing, Kexiao Shen, Liangnong Song, Shiyi Pan, and Yunsong Li. 2026. "Cross-Modality Spectral Expansion Combined with Physical–Semantic Dual Priors for Cloud Detection in GF-1 Imagery" Remote Sensing 18, no. 11: 1689. https://doi.org/10.3390/rs18111689

APA Style

Zhang, J., Shen, K., Song, L., Pan, S., & Li, Y. (2026). Cross-Modality Spectral Expansion Combined with Physical–Semantic Dual Priors for Cloud Detection in GF-1 Imagery. Remote Sensing, 18(11), 1689. https://doi.org/10.3390/rs18111689

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop