Frequency-Aware Refinement Network with Multi-Scale Fusion for Remote Sensing Change Detection
Abstract
1. Introduction
- 1.
- We propose a novel framework termed FARNet, which effectively integrates frequency-domain analysis with RGB-domain refinement to improve change detection performance.
- 2.
- We design a frequency-aware module (FAM) to perform coarse localization of changed objects by exploiting frequency-domain representations, providing spatial priors for subsequent refinement.
- 3.
- We design a refinement fusion module (RFM) that integrates coarse localization maps with multi-scale contextual features, enhancing edge and structural information of changed objects through RGB-domain detail refinement.
2. Related Works
2.1. CNN-Based Methods
2.2. Transformer-Based Methods
2.3. Frequency-Domain and Frequency–Spatial Hybrid Networks
2.4. Contextual Refinement Techniques
3. Methodology
3.1. Overall Structure
3.2. Frequency-Aware Module
3.3. Refinement Fusion Module
3.4. Loss Functions
4. Experiments
4.1. Datasets
4.2. Implementation Details
4.3. Evaluation Metrics
4.4. Compared Methods
- 1.
- FC-EF [13] concatenates two input images from different timestamps into a multi-channel image, which is processed by the FCN for feature extraction and change detection.
- 2.
- FC-Siam-Diff [13] is built upon FC-EF and extracts multi-level features from bi-temporal images. It applies difference operations to generate a difference map, emphasizing regions of change. Finally, it uses a twin network with shared weights to fuse these features for more accurate change detection.
- 3.
- FC-Siam-Conc [13] is based on FC-EF and uses a twin network with shared weights for extracting multi-level features. This network combines bi-temporal features through concatenation.
- 4.
- BIT [19] uses ResNet-18 [57] to extract high-level features from bi-temporal images, mapping them into a semantic space. It then leverages the encoder–decoder structure of Transformers for global context modeling. Finally, the enriched semantic representations are mapped back to the pixel space, and difference operations are applied to highlight change regions.
- 5.
- ChangFormer [53] is a Transformer-based twin network architecture. Through multi-scale feature modeling, it captures remote sensing details more effectively, improving change detection performance.
- 6.
- DMINet [21] extracts multi-level features using ResNet-18 and concatenates them channel-wise. It uses a joint attention module to learn spatiotemporal relationships across different feature levels. Finally, it provides multi-level supervision through a feature decoder to enhance the detection of changed objects.
- 7.
- SEIFNet [24] employs a twin hierarchical backbone network to capture multi-level image information through feature maps. It uses attention mechanisms to extract global and local information from bi-temporal features, highlighting changed objects. Finally, a fusion module integrates cross-level features to enhance change detection accuracy.
- 8.
- STADE-CDNet [25] uses a Transformer encoder to extract low-to-high-level semantic information from bi-temporal images, capturing temporal dependencies. A memory module stores temporal information, and a difference enhancement module emphasizes change information, improving change detection performance.
- 9.
- SDA-Encoding [58] employs a wavelet transform to decompose features into low-frequency and high-frequency components. In the high-frequency branch, local details and boundary information are enhanced through spatial pyramid pooling and cross-scale alignment fusion. In the low-frequency branch, coordinate attention and selective fusion attention are used to restore positional information and model region-level contextual dependencies. Finally, the two frequency branches are complementarily fused in the spatial domain to achieve unified modeling of both fine-grained and structural changes.
4.5. Results and Discussion
4.5.1. Results on WHU-CD Dataset
4.5.2. Results on LEVIR-CD Dataset
4.5.3. Results on LEVIR-CD+ Dataset
4.5.4. Ablation Studies
- 1.
- Ablation Study of Frequency-Aware Module (FAM): To address the detection ambiguity caused by the similarity of features between changed objects and the background in the RGB domain in remote sensing scenarios, the FAM module is designed as follows: (1) Employ octave convolution to explicitly separate high-frequency details from low-frequency contour features, thereby enhancing boundary discrimination for similar targets; (2) generate a prior for the changed objects through coarse localization, guiding the subsequent refinement module to focus on key areas. As shown in Table 4, on the LEVIR-CD dataset, FAM improved IoU by 0.69% and F1 by 0.40%; on the WHU-CD dataset, IoU increased by 0.51% and F1 by 0.28%, verifying its effectiveness. As shown in Figure 8b, after the application of octave convolution, these features more effectively represent the positional information of the changed areas.
- 2.
- Ablation Study of Refinement Fusion Module (RFM): To make the segmentation boundaries clearer, the RFM is designed as follows: (1) Introduce high-resolution features from the RGB domain and fuse differential features from adjacent layers using cross-layer channel attention mechanisms to enhance edge detail modeling; (2) use the coarse localization map for spatial constraints to generate refined segmentation results. As shown in Table 4, on the LEVIR-CD dataset, RFM improved IoU by 0.47% and F1 by 0.27%, and on the WHU-CD dataset, IoU increased by 0.46% and F1 increased by 0.26%, verifying its effectiveness. As shown in Figure 8c, the coarse localization map has located the changed areas, but the segmentation boundaries are still blurred. After RFM processing, the feature map mainly focuses on the changed areas, significantly reducing noise interference on the segmentation boundaries.
- 3.
- Ablation Study of Loss Function Weights: We aim to assess the impact of the loss function coefficients on the performance of FARNet using the LEVIR-CD dataset. The goal of these experiments was to understand how adjusting the coefficients affects overall performance. The results in Table 5 show that the best performance was achieved with cross-entropy and edge loss coefficients set to 0.9 and 0.1, respectively. This suggests that edge loss enhances high-frequency details, improving the ability of the network to detect changes more accurately. Thus, subsequent experiments used a cross-entropy coefficient of 0.9. Although results for in the range of 0.1 to 0.4 were not included, preliminary experiments indicate that performance in this lower range is significantly worse, and therefore has limited practical relevance. As shown in Figure 9c, without edge loss, the boundaries of the changed areas are blurrier. In contrast, as shown in Figure 9d, with the addition of edge loss, the boundaries become clearer, enabling the model to more accurately distinguish the changed objects.
4.5.5. Comparison of Complexity
4.5.6. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Shafique, A.; Cao, G.; Khan, Z.; Asad, M.; Aslam, M. Deep learning-Based change detection in remote sensing images: A review. Remote Sens. 2022, 14, 871. [Google Scholar] [CrossRef]
- Jiang, H.; Peng, M.; Zhong, Y.; Xie, H.; Hao, Z.; Lin, J.; Ma, X.; Hu, X. A survey on deep learning-based change detection from high-resolution remote sensing images. Remote Sens. 2022, 14, 1552. [Google Scholar] [CrossRef]
- Qin, H.; Wang, J.; Mao, X.; Zhao, Z.; Gao, X.; Lu, W. An Improved Faster R-CNN Method for Landslide Detection in Remote Sensing Images. J. Geovis. Spat. Anal. 2024, 8, 2. [Google Scholar] [CrossRef]
- Stow, D. Reducing the effects of misregistration on pixel-level change detection. Int. J. Remote Sens. 1999, 20, 2477–2483. [Google Scholar] [CrossRef]
- Zhuang, H.; Hao, M.; Deng, K.; Zhang, K.; Wang, X.; Yao, G. Change detection in SAR images via ratio-based gaussian kernel and nonlocal theory. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5210215. [Google Scholar] [CrossRef]
- Su, L.; Gong, M.; Sun, B. Change detection in synthetic aperture radar images based on non-local means with ratio similarity measurement. Int. J. Remote Sens. 2014, 35, 7673–7690. [Google Scholar] [CrossRef]
- Lambin, E.F.; Strahlers, A.H. Change-vector analysis in multitemporal space: A tool to detect and categorize land-cover change processes using high temporal-resolution satellite data. Remote Sens. Environ. 1994, 48, 231–244. [Google Scholar] [CrossRef]
- Du, P.; Wang, X.; Chen, D.; Liu, S.; Lin, C.; Meng, Y. An improved change detection approach using tri-temporal logic-verified change vector analysis. ISPRS J. Photogramm. Remote Sens. 2020, 161, 278–293. [Google Scholar] [CrossRef]
- Celik, T. Unsupervised change detection in satellite images using principal component analysis and k-means clustering. IEEE Geosci. Remote Sens. Lett. 2009, 6, 772–776. [Google Scholar] [CrossRef]
- Habib, T.; Inglada, J.; Mercier, G.; Chanussot, J. Support vector reduction in SVM algorithm for abrupt change detection in remote sensing. IEEE Geosci. Remote Sens. Lett. 2009, 6, 606–610. [Google Scholar] [CrossRef]
- Seo, D.; Kim, Y.; Eo, Y.; Park, W.; Park, H. Generation of radiometric, phenological normalized image based on random forest regression for change detection. Remote Sens. 2017, 9, 1163. [Google Scholar] [CrossRef]
- Sun, Y.; Lei, L.; Guan, D.; Kuang, G. Iterative robust graph for unsupervised change detection of heterogeneous remote sensing images. IEEE Trans. Image Process. 2021, 30, 6277–6291. [Google Scholar] [CrossRef] [PubMed]
- Daudt, R.C.; Le Saux, B.; Boulch, A. Fully convolutional siamese networks for change detection. In Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece, 7–10 October 2018; pp. 4063–4067. [Google Scholar] [CrossRef]
- Liu, R.; Jiang, D.; Zhang, L.; Zhang, Z. Deep depthwise separable convolutional network for change detection in optical aerial images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 1109–1118. [Google Scholar] [CrossRef]
- Chollet, F. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1800–1807. [Google Scholar]
- Wang, M.; Tan, K.; Jia, X.; Wang, X.; Chen, Y. A deep siamese network with hybrid convolutional feature extraction module for change detection based on multi-sensor remote sensing images. Remote Sens. 2020, 12, 205. [Google Scholar] [CrossRef]
- Yu, F.; Koltun, V.; Funkhouser, T. Dilated Residual Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 636–644. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Proceedings of the 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 5998–6008. [Google Scholar]
- Chen, H.; Qi, Z.; Shi, Z. Remote sensing image change detection with transformers. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5607514. [Google Scholar] [CrossRef]
- Li, Q.; Zhong, R.; Du, X.; Du, Y. TransUNetCD: A hybrid transformer network for change detection in optical remote-sensing images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5622519. [Google Scholar] [CrossRef]
- Feng, Y.; Jiang, J.; Xu, H.; Zheng, J. Change detection on remote sensing images using dual-branch multilevel intertemporal network. IEEE Trans. Geosci. Remote Sens. 2023, 61, 4401015. [Google Scholar] [CrossRef]
- Zhang, X.; Cheng, S.; Wang, L.; Li, H. Asymmetric cross-attention hierarchical network based on cnn and transformer for bitemporal remote sensing images change detection. IEEE Trans. Geosci. Remote Sens. 2023, 61, 2000415. [Google Scholar] [CrossRef]
- Cong, R.; Sun, M.; Zhang, S.; Zhou, X.; Zhang, W.; Zhao, Y. Frequency perception network for camouflaged object detection. In Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada, 29 October–3 November 2023; pp. 1179–1189. [Google Scholar]
- Huang, Y.; Li, X.; Du, Z.; Shen, H. Spatiotemporal enhancement and interlevel fusion network for remote sensing images change detection. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5609414. [Google Scholar] [CrossRef]
- Li, Z.; Cao, S.; Deng, J.; Wu, F.; Wang, R.; Luo, J.; Peng, Z. STADE-CDNet: Spatial-temporal attention with difference enhancement-based network for remote sensing image change detection. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5611617. [Google Scholar] [CrossRef]
- Xie, E.; Wang, W.; Yu, Z.; Anandkumar, A.; Alvarez, J.M.; Luo, P. SegFormer: Simple and efficient design for semantic segmentation with transformers. Adv. Neural Inf. Process Syst. 2021, 34, 12077–12090. [Google Scholar]
- Shao, H.; Zeng, Q.; Hou, Q.; Yang, J. MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention. arXiv 2023, arXiv:2312.08866. [Google Scholar]
- Peng, D.; Zhang, Y.; Guan, H. End-to-End Change Detection for High Resolution Satellite Images Using Improved UNet++. Remote Sens. 2019, 11, 1382. [Google Scholar] [CrossRef]
- Fang, S.; Li, K.; Shao, J.; Li, Z. SNUNet-CD: A Densely Connected Siamese Network for Change Detection of VHR Images. IEEE Geosci. Remote Sens. Lett. 2022, 19, 8007805. [Google Scholar] [CrossRef]
- Ji, S.; Wei, S.; Lu, M. Fully Convolutional Networks for Multisource Building Extraction From an Open Aerial and Satellite Imagery Data Set. IEEE Trans. Geosci. Remote Sens. 2019, 57, 574–586. [Google Scholar] [CrossRef]
- Zheng, J.; Feng, Y.; Bai, C.; Zhang, J. Hyperspectral image classification using mixed convolutions and covariance pooling. IEEE Trans. Geosci. Remote Sens. 2020, 59, 522–534. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, M.; Hao, Z.; Wang, Q.; Wang, Q.; Ye, Y. MSGFNet: Multi-Scale Gated Fusion Network for Remote Sensing Image Change Detection. Remote Sens. 2024, 16, 572. [Google Scholar] [CrossRef]
- Li, S.; Yan, F.; Liao, C.; Hu, Q.; Ma, K.; Wang, W.; Zhang, H. Object-Level Contrastive-Learning-Based Multi-Branch Network for Building Change Detection from Bi-Temporal Remote Sensing Images. Remote Sens. 2025, 17, 217. [Google Scholar] [CrossRef]
- Liu, Y.; Pang, C.; Zhan, Z.; Zhang, X.; Yang, X. Building change detection for remote sensing images using a dual-task constrained deep siamese convolutional network model. IEEE Geosci. Remote Sens. Lett. 2020, 18, 811–815. [Google Scholar] [CrossRef]
- Xiong, F.; Li, T.; Chen, J.; Zhou, J.; Qian, Y. Mask-Guided Local–Global Attentive Network for Change Detection in Remote Sensing Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 3366–3378. [Google Scholar] [CrossRef]
- Zhang, C.; Yue, P.; Tapete, D.; Jiang, L.; Shangguan, B.; Huang, L.; Liu, G. A deeply supervised image fusion network for change detection in high resolution bi-temporal remote sensing images. ISPRS J. Photogramm. Remote Sens. 2020, 166, 183–200. [Google Scholar] [CrossRef]
- Shi, Q.; Liu, M.; Li, S.; Liu, X.; Wang, F.; Zhang, L. A Deeply Supervised Attention Metric-Based Network and an Open Aerial Image Dataset for Remote Sensing Change Detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5604816. [Google Scholar] [CrossRef]
- Wu, J.; Xie, C.; Zhang, Z.; Zhu, Y. A deeply supervised attentive high-resolution network for change detection in remote sensing images. Remote Sens. 2022, 15, 45. [Google Scholar] [CrossRef]
- Ren, H.; Xia, M.; Weng, L.; Hu, K.; Lin, H. Dual-Attention-Guided Multiscale Feature Aggregation Network for Remote Sensing Image Change Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 4899–4916. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An image is worth 16× 16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Mohammadian, A.; Ghaderi, F. SiamixFormer: A fully-transformer Siamese network with temporal Fusion for accurate building detection and change detection in bi-temporal remote sensing images. Int. J. Remote Sens. 2023, 44, 3660–3678. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 10012–10022. [Google Scholar]
- Zhang, C.; Wang, L.; Cheng, S.; Li, Y. SwinSUNet: Pure Transformer Network for Remote Sensing Image Change Detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5224713. [Google Scholar] [CrossRef]
- Yan, T.; Wan, Z.; Zhang, P. Fully transformer network for change detection of remote sensing images. In Proceedings of the Asian Conference on Computer Vision, Macao, China, 4 December 2022; pp. 1691–1708. [Google Scholar]
- Wang, G.; Li, B.; Zhang, T.; Zhang, S. A network combining a transformer and a convolutional neural network for remote sensing image change detection. Remote Sens. 2022, 14, 2228. [Google Scholar] [CrossRef]
- Xu, X.; Li, J.; Chen, Z. TCIANet: Transformer-Based Context Information Aggregation Network for Remote Sensing Image Change Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023, 16, 1951–1971. [Google Scholar] [CrossRef]
- Ding, L.; Xia, M.; Lin, H.; Hu, K. Multi-Level Attention Interactive Network for Cloud and Snow Detection Segmentation. Remote Sens. 2024, 16, 112. [Google Scholar] [CrossRef]
- Liu, M.; Chai, Z.; Deng, H.; Liu, R. A CNN-transformer network with multiscale context aggregation for fine-grained cropland change detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 4297–4306. [Google Scholar] [CrossRef]
- Feng, Y.; Xu, H.; Jiang, J.; Liu, H.; Zheng, J. ICIF-Net: Intra-Scale Cross-Interaction and Inter-Scale Feature Fusion Network for Bitemporal Remote Sensing Images Change Detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 4410213. [Google Scholar] [CrossRef]
- Gao, F.; Fu, M.; Cao, J.; Dong, J.; Du, Q. Adaptive frequency enhancement network for remote sensing image semantic segmentation. IEEE Trans. Geosci. Remote. Sens. 2025, 63, 5619415. [Google Scholar] [CrossRef]
- Fu, J.; Yu, Y.; Wang, L. FSDENet: A Frequency and Spatial Domains based Detail Enhancement Network for Remote Sensing Semantic Segmentation. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2025, 18, 19378–19392. [Google Scholar] [CrossRef]
- Xie, Z.; Miao, S.; Jiang, Y.; Zhang, Z.; Yao, J.; Li, X.; Ghamisi, P. FSG-Net: Frequency-spatial synergistic gated network for high-resolution remote sensing change detection. IEEE Trans. Geosci. Remote Sens. 2026, 64, 4702115. [Google Scholar] [CrossRef]
- Bandara, W.G.C.; Patel, V.M. A transformer-based siamese network for change detection. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; pp. 207–210. [Google Scholar] [CrossRef]
- Chen, Y.; Fan, H.; Xu, B.; Yan, Z.; Kalantidis, Y.; Rohrbach, M.; Shuicheng, Y.; Feng, J. Drop an octave: Reducing spatial redundancy in convolutional neural networks with octave convolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 3435–3444. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Chen, H.; Wu, C.; Du, B.; Zhang, L. Change detection in multi-temporal vhr images based on deep siamese multi-scale convolutional networks. arXiv 2019, arXiv:1906.11479. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Zhang, X.; Du, Y.; Zhou, W.; Zhang, K. Spatial-Frequency Decoupling Alignment Encoding for Remote Sensing Change Detection. Sensors 2026, 26, 1979. [Google Scholar] [CrossRef]









| Methods | OA (%) | IoU (%) | F1 (%) | Pre (%) | Rec (%) |
|---|---|---|---|---|---|
| FC-EF | 98.19 | 58.30 | 73.66 | 87.25 | 63.73 |
| FC-Siam-Diff | 98.25 | 67.34 | 80.48 | 72.27 | 90.80 |
| FC-Siam-Conc | 95.91 | 45.37 | 62.42 | 49.08 | 85.72 |
| BIT | 98.75 | 72.39 | 83.98 | 86.64 | 81.48 |
| ChangeFormer | 99.11 | 79.16 | 88.37 | 92.21 | 84.83 |
| DMINet | 98.97 | 79.68 | 88.69 | 93.84 | 86.25 |
| SEIFNet | 98.90 | 76.04 | 86.39 | 87.01 | 85.77 |
| STADE-CDNet | 99.20 | 80.77 | 89.36 | 94.06 | 85.11 |
| SDA-Encoding | 99.59 | 89.87 | 94.67 | 96.69 | 92.73 |
| FARNet(ours) | 99.59 | 90.00 | 94.74 | 97.50 | 92.13 |
| Methods | OA (%) | IoU (%) | F1 (%) | Pre (%) | Rec (%) |
|---|---|---|---|---|---|
| FC-EF | 98.38 | 72.19 | 83.85 | 85.10 | 82.63 |
| FC-Siam-Diff | 98.98 | 81.38 | 89.74 | 92.09 | 87.50 |
| FC-Siam-Conc | 98.62 | 75.72 | 86.18 | 87.83 | 84.59 |
| BIT | 98.92 | 80.68 | 89.31 | 89.24 | 89.37 |
| ChangeFormer | 99.04 | 82.48 | 90.40 | 92.05 | 88.80 |
| DMINet | 99.07 | 82.99 | 90.71 | 92.52 | 89.95 |
| SEIFNet | 99.09 | 83.40 | 90.95 | 92.49 | 89.46 |
| STADE-CDNet | 98.91 | 80.44 | 89.16 | 90.65 | 87.72 |
| SDA-Encoding | 99.20 | 85.29 | 92.06 | 93.34 | 90.82 |
| FARNet (ours) | 99.21 | 85.52 | 92.19 | 92.32 | 92.07 |
| Methods | OA (%) | IoU (%) | F1 (%) | Pre (%) | Rec (%) |
|---|---|---|---|---|---|
| FC-EF | 97.63 | 56.05 | 71.83 | 69.75 | 74.05 |
| FC-Siam-Diff | 98.61 | 70.83 | 82.93 | 83.18 | 82.68 |
| FC-Siam-Conc | 98.17 | 63.89 | 77.97 | 76.38 | 79.63 |
| BIT | 98.77 | 72.99 | 84.38 | 87.57 | 81.42 |
| ChangeFormer | 98.61 | 70.74 | 82.86 | 83.30 | 82.44 |
| DMINet | 98.68 | 71.19 | 83.17 | 86.21 | 80.33 |
| SEIFNet | 98.54 | 69.22 | 81.81 | 82.86 | 80.79 |
| STADE-CDNet | 98.53 | 71.02 | 83.06 | 78.07 | 88.73 |
| SDA-Encoding | 98.84 | 74.97 | 85.70 | 85.81 | 85.58 |
| FARNet(ours) | 98.87 | 75.65 | 86.13 | 86.12 | 86.15 |
| FAM | RFM | LEVIR-CD | WHU-CD | ||||
|---|---|---|---|---|---|---|---|
| OA (%) | IoU (%) | F1 (%) | OA (%) | IoU (%) | F1 (%) | ||
| × | ✓ | 99.18 | 84.83 | 91.79 | 99.57 | 89.49 | 94.46 |
| ✓ | × | 99.18 | 85.05 | 91.92 | 99.57 | 89.54 | 94.48 |
| ✓ | ✓ | 99.21 | 85.52 | 92.19 | 99.59 | 90.00 | 94.74 |
| LEVIR-CD | |||||
|---|---|---|---|---|---|
| OA (%) | IoU (%) | F1 (%) | Pre (%) | Rec (%) | |
| 1.0 | 99.18 | 84.98 | 91.88 | 92.77 | 91.01 |
| 0.9 | 99.21 | 85.52 | 92.19 | 92.32 | 92.07 |
| 0.8 | 99.18 | 85.01 | 91.90 | 92.95 | 90.86 |
| 0.7 | 99.18 | 84.98 | 91.88 | 92.56 | 91.21 |
| 0.6 | 99.15 | 84.57 | 91.64 | 91.89 | 91.39 |
| 0.5 | 99.17 | 84.84 | 91.80 | 92.66 | 90.95 |
| Operation | LEVIR-CD | ||||
|---|---|---|---|---|---|
| OA (%) | IoU (%) | F1 (%) | Pre (%) | Rec (%) | |
| Noise corruption | 99.15 | 84.48 | 91.58 | 92.63 | 90.56 |
| Cloud occlusion | 99.12 | 83.93 | 91.27 | 91.82 | 90.72 |
| - | 99.21 | 85.52 | 92.19 | 92.32 | 92.07 |
| Methods | Para. (M) | FLOPs (G) |
|---|---|---|
| FC-EF | 1.35 | 3.59 |
| FC-Siam-diff | 1.35 | 4.74 |
| FC-Siam-conc | 1.55 | 5.34 |
| BIT | 12.40 | 10.87 |
| DMINet | 6.24 | 14.55 |
| STADE-CDNet | 11.9 | 14.3 |
| SEIFNet | 27.91 | 8.37 |
| SChanger | 2.37 | 17.91 |
| CASP | 14.55 | 9.19 |
| SDA-Encoding | 45.57 | 44.80 |
| FARNet (Ours) | 55.92 | 41.84 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhang, X.; Du, Y.; Zhang, Z.; Zhang, K. Frequency-Aware Refinement Network with Multi-Scale Fusion for Remote Sensing Change Detection. Sensors 2026, 26, 3538. https://doi.org/10.3390/s26113538
Zhang X, Du Y, Zhang Z, Zhang K. Frequency-Aware Refinement Network with Multi-Scale Fusion for Remote Sensing Change Detection. Sensors. 2026; 26(11):3538. https://doi.org/10.3390/s26113538
Chicago/Turabian StyleZhang, Xu, Yue Du, Zeyu Zhang, and Kaihua Zhang. 2026. "Frequency-Aware Refinement Network with Multi-Scale Fusion for Remote Sensing Change Detection" Sensors 26, no. 11: 3538. https://doi.org/10.3390/s26113538
APA StyleZhang, X., Du, Y., Zhang, Z., & Zhang, K. (2026). Frequency-Aware Refinement Network with Multi-Scale Fusion for Remote Sensing Change Detection. Sensors, 26(11), 3538. https://doi.org/10.3390/s26113538

