A Differential-Based Siamese Network Integrating the CSWin Transformer for Rural Land Cover Semantic Change Detection
Highlights
- A Siamese network framework integrating CNNs and Transformers is proposed. Residual learning modules for differential structures and integrating the CSWin Transformer enhance the model’s ability to extract local features and global dependencies, respectively.
- We propose a rural land cover semantic change detection dataset comprising 2000 pairs of pixelwise annotated samples, which includes 6 main rural land cover types.
- The proposed method increases the accuracy of rural land cover change detection, which adds an effective monitoring method for achieving accurate surveys of land cover change in rural areas.
- The proposed method can provide more accurate land cover change information in rural areas of developed regions, which is of great significance for rural land use planning and ecological environment monitoring.
Abstract
1. Introduction
- (1)
- We propose a CNN–Transformer fusion framework, designed to learn semantic information of change categories. The Siamese network with differential structures to learn local features and the Transformers to capture spatial contextual features.
- (2)
- We enhance the detail extraction capability by incorporating residual learning modules to augment the bitemporal and differential features. Integrating the CSWin Transformer into the differential Siamese network enhances the model’s ability to capture global dependency.
- (3)
- This study introduces an RLCD dataset comprising 2000 pairs of pixel-labeled samples. This dataset includes 6 main land cover types with a spatial resolution of 1 m, providing a benchmark dataset for semantic change detection models in rural areas.
2. Related Work
2.1. CNN and Transformer Integrated Methods
2.2. Land Cover Semantic Change Detection Datasets
3. Rural Land Cover Semantic Change Detection Dataset
3.1. Research Area and Images
3.2. Dataset Description
3.2.1. Overview
3.2.2. Categories Distribution
4. Methods
4.1. The Siamese Network Framework with Differential Structures
4.2. The CSWin Transformer Embedded Within the Siamese Network
4.3. Residual Learning
4.4. Loss Function
5. Experiments and Results
5.1. Evaluation Metrics and Experimental Settings
5.2. Comparison Methods
- (1)
- FC-Siam-diff [25]: This network initially derives difference features by subtracting the encoder features of the bitemporal images, then concatenates difference features to the Siamese decoders by skip connections.
- (2)
- FC-Siam-conc [25]: This network concatenates the encoder features of the bitemporal images to the Siamese decoders by skip connections.
- (3)
- SCDNet [48]: This method utilizes a Siamese network with differential features. It incorporates encoder features and differential features into two Siamese decoders by skip connections, achieving improvements through attention mechanisms and a deep supervision strategy.
- (4)
- SSD-l [54]: This method first extracts bitemporal features based on the Siamese encoders, then feeds them into three decoder branches that generate a single change map and two temporal semantic change maps.
- (5)
- Bi-SRNet [54]: This method enhances semantic information extraction by incorporating two Siamese semantic reasoning (Siam-SR) blocks and a cross-temporal semantic reasoning (Cot-SR) block on top of the SSD-l. It coordinates semantic representations with change representations through a semantic consistency loss (SCLoss).
- (6)
- MTSCD [29]: This method first extracts multi-scale features based on the Siamese semantic-aware encoder originating from Swin Transformer. Then deeply fuses the two-level differential features through an information exchange module. Finally, it fully leverages the correlation between the two subtasks with a spatial feature enhancement module.
- (7)
- SCanNet [55]: This method develops a semantic change Transformer (SCanFormer) to explicitly model the spatial dependency in semantic transitions between bitemporal images. It then leverages temporal consistency as a prior constraint to extract semantic information from bitemporal images.
- (8)
- SSCLNet [56]: This method extracts contextual information using HRNet and change information through an absolute difference to form the baseline model. It then incorporates a semi-supervised contrastive learning module for semantic segmentation to enhance class discriminability, employing a self-training (ST) method to achieve semi-supervised semantic segmentation.
- (9)
- STS-FINet [57]: This method extracts multilevel features through a Multi-Scale Feature Extraction Encoder (MS-FEE) equipped with Mixed Spatial Reasoning Convolution blocks (MixSrc). It then leverages a Transformer-based Multilevel Feature Interaction module (TML-FI) to capture long-range dependencies and spatial information within multi-level features. Finally, a Multilevel Feature Fusion Decoder (MLFFD) integrates multilevel features to generates semantic change maps.
5.3. Ablation Experiment
5.4. Comparison Results with Different Methods
6. Discussion
6.1. Advantages of Our Proposed Model
6.2. Implications
6.3. Limitations
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Singh, A. Review Article Digital change detection techniques using remotely-sensed data. Int. J. Remote Sens. 1989, 10, 989–1003. [Google Scholar] [CrossRef]
- Zhang, C.X.; Yue, P.; Tapete, D.; Jiang, L.C.; Shangguan, B.Y.; Huang, L.; Liu, G.C. A deeply supervised image fusion network for change detection in high resolution bi-temporal remote sensing images. ISPRS J. Photogramm. Remote Sens. 2020, 166, 183–200. [Google Scholar] [CrossRef]
- Woodcock, C.E.; Loveland, T.R.; Herold, M.; Bauer, M.E. Transitioning from change detection to monitoring with remote sensing: A paradigm shift. Remote Sens. Environ. 2020, 238, 111558. [Google Scholar] [CrossRef]
- Zhang, J.D.; Shao, Z.F.; Ding, Q.; Huang, X.; Wang, Y.; Zhou, X.C.; Li, D.R. AERNet: An Attention-Guided Edge Refinement Network and a Dataset for Remote Sensing Building Change Detection. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5617116. [Google Scholar] [CrossRef]
- Liu, Z.H.; Li, J.H.; Syam, M.S.; Ashraf, M.; Asif, M.; Awwad, E.M.; Al-Razgan, M.; Bhatti, U.A. Remote sensing-enhanced transfer learning approach for agricultural damage and change detection: A deep learning perspective. Big Data Res. 2024, 36, 100449. [Google Scholar] [CrossRef]
- Dai, A.J.; Yang, J.Y.; Zhang, Y.X.; Zhang, T.T.; Tang, K.X.; Xiao, X.Y.; Zhang, S.J. A difference enhancement and class-aware rebalancing semi-supervised network for cropland semantic change detection. Int. J. Appl. Earth Obs. Geoinf. 2025, 137, 104415. [Google Scholar] [CrossRef]
- Tang, X.; Zhang, T.X.; Ma, J.J.; Zhang, X.R.; Liu, F.; Jiao, L.C. WNet: W-Shaped Hierarchical Network for Remote-Sensing Image Change Detection. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5615814. [Google Scholar] [CrossRef]
- Wang, Y.H.; Gao, L.R.; Hong, D.F.; Sha, J.J.; Liu, L.; Zhang, B.; Rong, X.H.; Zhang, Y.G. Mask DeepLab: End-to-end image segmentation for change detection in high-resolution remote sensing images. Int. J. Appl. Earth Obs. Geoinf. 2021, 104, 102582. [Google Scholar] [CrossRef]
- Zheng, Z.; Zhong, Y.F.; Zhao, J.; Ma, A.L.; Zhang, L.P. Unifying remote sensing change detection via deep probabilistic change models: From principles, models to applications. ISPRS J. Photogramm. Remote Sens. 2024, 215, 239–255. [Google Scholar] [CrossRef]
- Peng, D.F.; Liu, X.L.; Zhang, Y.J.; Guan, H.Y.; Li, Y.S.; Bruzzone, L. Deep learning change detection techniques for optical remote sensing imagery: Status, perspectives and challenges. Int. J. Appl. Earth Obs. Geoinf. 2024, 136, 104282. [Google Scholar] [CrossRef]
- Daudt, R.C.; Le Saux, B.; Boulch, A.; Gousseau, Y. Multitask learning for large-scale semantic change detection. Comput. Vis. Image Underst. 2019, 187, 102783. [Google Scholar] [CrossRef]
- Yang, K.P.; Xia, G.S.; Liu, Z.C.; Du, B.; Yang, W.; Pelillo, M.; Zhang, L.P. Asymmetric Siamese Networks for Semantic Change Detection in Aerial Images. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5609818. [Google Scholar] [CrossRef]
- Yuan, P.L.; Zhao, Q.Z.; Zhao, X.B.; Wang, X.W.; Long, X.F.; Zheng, Y.C. A transformer-based Siamese network and an open optical dataset for semantic change detection of remote sensing images. Int. J. Digit. Earth 2022, 15, 1506–1525. [Google Scholar] [CrossRef]
- Liu, F.; An, J.Q.; Liu, J.; Yang, J.X.; Tang, X.; Xiao, L. Conjoint Cross-Attention Modeling and Joint Feature Calibrating for Remote Sensing Image Change Detection via a Triple-Double Network. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5622616. [Google Scholar] [CrossRef]
- El Amin, A.M.; Liu, Q.J.; Wang, Y.H. Zoom Out CNNs Features for Optical Remote Sensing Change Detection. In Proceedings of the 2nd International Conference on Image, Vision and Computing (ICIVC), Chengdu, China, 2–4 June 2017. [Google Scholar]
- Zhang, M.; Shi, W.Z. A Feature Difference Convolutional Neural Network-Based Change Detection Method. IEEE Trans. Geosci. Remote Sens. 2020, 58, 7232–7246. [Google Scholar] [CrossRef]
- Yuan, Y.; Chen, X.; Tang, K.; Chen, J. A “Difference-in-Differences”-Based Method for Unsupervised Change Detection in Season-Varying Images. IEEE Geosci. Remote Sens. Lett. 2025, 22, 1–5. [Google Scholar] [CrossRef]
- Lei, T.; Wang, J.; Ning, H.L.; Wang, X.W.; Xue, D.H.; Wang, Q.; Nandi, A.K. Difference Enhancement and Spatial-Spectral Nonlocal Network for Change Detection in VHR Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2021, 60, 4507013. [Google Scholar] [CrossRef]
- Liu, M.X.; Chai, Z.Q.; Deng, H.J.; Liu, R. A CNN-Transformer Network with Multiscale Context Aggregation for Fine-Grained Cropland Change Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 4297–4306. [Google Scholar] [CrossRef]
- Tang, W.J.; Wu, K.; Zhang, Y.X.; Zhan, Y.T. A Siamese Network Based on Multiple Attention and Multilayer Transformers for Change Detection. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5219015. [Google Scholar] [CrossRef]
- He, F.C.; Chen, H.; Yang, S.T.; Guo, Z.X. A Hierarchical Local-Sparse Model for Semantic Change Detection in Remote Sensing Imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 3144–3159. [Google Scholar] [CrossRef]
- Ding, Q.; Wang, F.Y.; Wang, M.C.; Zhang, Y.; Cheng, G. GLAI-Net: Global-Local Awareness Integrated Network for Semantic Change Detection in Remote Sensing Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 14291–14307. [Google Scholar] [CrossRef]
- Yu, W.T.; Zhuo, L.; Li, J.F. GCFormer: Global Context-Aware Transformer for Remote Sensing Image Change Detection. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4703212. [Google Scholar] [CrossRef]
- Song, F.; Zhang, S.X.; Lei, T.; Song, Y.X.; Peng, Z.M. MSTDSNet-CD: Multiscale Swin Transformer and Deeply Supervised Network for Change Detection of the Fast-Growing Urban Regions. IEEE Geosci. Remote Sens. Lett. 2022, 19, 6508505. [Google Scholar] [CrossRef]
- Daudt, R.C.; Le Saux, B.; Boulch, A. Fully convolutional Siamese networks for change detection. In Proceedings of the 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece, 7–10 October 2018; pp. 4063–4067. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Chen, H.; Qi, Z.P.; Shi, Z.W. Remote Sensing Image Change Detection with Transformers. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5607514. [Google Scholar] [CrossRef]
- Feng, Y.C.; Xu, H.H.; Jiang, J.W.; Liu, H.; Zheng, J.W. ICIF-Net: Intra-Scale Cross-Interaction and Inter-Scale Feature Fusion Network for Bitemporal Remote Sensing Images Change Detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 4410213. [Google Scholar] [CrossRef]
- Cui, F.Z.; Jiang, J. MTSCD-Net: A network based on multi-task learning for semantic change detection of bitemporal remote sensing images. Int. J. Appl. Earth Obs. Geoinf. 2023, 118, 103294. [Google Scholar] [CrossRef]
- Xu, C.; Ye, Z.Y.; Mei, L.Y.; Shen, S.; Zhang, Q.; Sui, H.G.; Yang, W.; Sun, S.H. SCAD: A Siamese Cross-Attention Discrimination Network for Bitemporal Building Change Detection. Remote Sens. 2022, 14, 6213. [Google Scholar] [CrossRef]
- Cai, C.; Wang, Y.; Yap, K.H. Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning. Remote Sens. 2023, 15, 5611. [Google Scholar] [CrossRef]
- Li, H.; Liu, X.Y.; Li, H.H.; Dong, Z.Y.; Xiao, X.L. MDFENet: A Multiscale Difference Feature Enhancement Network for Remote Sensing Change Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023, 16, 3104–3115. [Google Scholar] [CrossRef]
- Zhang, K.; Zhao, X.; Zhang, F.; Ding, L.; Sun, J.D.; Bruzzone, L. Relation Changes Matter: Cross-Temporal Difference Transformer for Change Detection in Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5611615. [Google Scholar] [CrossRef]
- Liu, W.; Kang, Z.W.; Liu, J.W.; Lin, Y.Y.; Yu, Y.T.; Li, J.A.T. A Multitask CNN-Transformer Network for Semantic Change Detection from Bitemporal Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5647215. [Google Scholar] [CrossRef]
- Mei, L.Y.; Ye, Z.Y.; Xu, C.; Wang, H.Z.; Wang, Y.; Lei, C.; Yang, W.; Li, Y.S. SCD-SAM: Adapting Segment Anything Model for Semantic Change Detection in Remote Sensing Imagery. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5626713. [Google Scholar] [CrossRef]
- Jiang, Z.H.; Wang, B.; Zhang, P.; Wu, Y.L.; Ye, Z.Y.; Yang, H. Semantic enhancement and change consistency network for semantic change detection in remote sensing images. Int. J. Digit. Earth 2025, 18, 2496790. [Google Scholar] [CrossRef]
- Zhang, D.; Wang, F.Y.; Ning, L.C.; Zhao, Z.Y.; Gao, J.Y.; Li, X.L. Integrating SAM with Feature Interaction for Remote Sensing Change Detection. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4513011. [Google Scholar] [CrossRef]
- Zhang, J.; Ding, L.; Zhou, T.Y.; Wang, J.; Atkinson, P.M.; Bruzzone, L. Recurrent Semantic Change Detection in VHR Remote Sensing Images Using Visual Foundation Models. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5402314. [Google Scholar] [CrossRef]
- Schmitt, M.; Ahmadi, S.A.; Xu, Y.H.; Taskin, G.; Verma, U.; Sica, F.; Haensch, R. There Are No Data Like More Data: Datasets for Deep Learning in Earth Observation. IEEE Geosci. Remote Sens. Mag. 2023, 11, 63–97. [Google Scholar] [CrossRef]
- Zan, Y.J.; Ji, S.P.; Chao, S.T.; Luo, M.Y. Open-vocabulary generative vision-language models for creating a large-scale remote sensing change detection dataset. ISPRS J. Photogramm. Remote Sens. 2025, 225, 275–290. [Google Scholar] [CrossRef]
- Xiong, Z.T.; Zhang, F.H.; Wang, Y.; Shi, Y.L.; Zhu, X.X. EarthNets: Empowering artificial intelligence for Earth observation. IEEE Geosci. Remote Sens. Mag. 2024, 13, 45–78. [Google Scholar] [CrossRef]
- Tian, S.Q.; Ma, A.L.; Zheng, Z.; Zhong, Y.F. Hi-UCD: A Large-scale Dataset for Urban Semantic Change Detection in Remote Sensing Imagery. arXiv 2020, arXiv:2011.03247. [Google Scholar]
- Zhou, Y.P.; Wang, J.J.; Ding, J.L.; Liu, B.H.; Weng, N.; Xiao, H.Z. SIGNet: A Siamese Graph Convolutional Network for Multi-Class Urban Change Detection. Remote Sens. 2023, 15, 2464. [Google Scholar] [CrossRef]
- Shi, S.A.; Zhong, Y.F.; Liu, Y.H.; Wang, J.; Wan, Y.T.; Zhao, J.; Lv, P.Y.; Zhang, L.P.; Li, D.R. Multi-temporal urban semantic understanding based on GF-2 remote sensing imagery: From tri-temporal datasets to multi-task mapping. Int. J. Digit. Earth 2023, 16, 3321–3347. [Google Scholar] [CrossRef]
- Ji, D.; Gao, S.; Tao, M.Y.; Lu, H.T.; Zhao, F. Changenet: Multi-temporal asymmetric change detection dataset. In Proceedings of the 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, Republic of Korea, 14–19 April 2024; pp. 2725–2729. [Google Scholar]
- Liu, X.G.; Dai, C.G.; Zhang, Z.C.; Li, M.M.; Wang, H.Y.; Ji, H.L.; Li, Y.J. TBSCD-Net: A Siamese Multitask Network Integrating Transformers and Boundary Regularization for Semantic Change Detection From VHR Satellite Images. IEEE Geosci. Remote Sens. Lett. 2024, 21, 6008305. [Google Scholar] [CrossRef]
- Toker, A.; Kondmann, L.; Weber, M.; Eisenberger, M.; Camero, A.; Hu, J.L.; Hoderlein, A.P.; Senaras, C.; Davis, T.; Cremers, D.; et al. DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; pp. 21126–21135. [Google Scholar]
- Peng, D.F.; Bruzzone, L.; Zhang, Y.J.; Guan, H.Y.; He, P.F. SCDNET: A novel convolutional network for semantic change detection in high resolution optical remote sensing imagery. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102465. [Google Scholar] [CrossRef]
- Zhang, X.R.; He, L.; Qin, K.; Dang, Q.; Si, H.J.; Tang, X.; Jiao, L.C. SMD-Net: Siamese Multi-Scale Difference-Enhancement Network for Change Detection in Remote Sensing. Remote Sens. 2022, 14, 1580. [Google Scholar] [CrossRef]
- Li, W.M.; Xue, L.H.; Wang, X.Q.; Li, G. ConvTransNet: A CNN-Transformer Network for Change Detection with Multiscale Global-Local Representations. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5610315. [Google Scholar] [CrossRef]
- Dong, X.Y.; Bao, J.M.; Chen, D.D.; Zhang, W.M.; Yu, N.H.; Yuan, L.; Chen, D.; Guo, B.N. CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; pp. 12114–12124. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Ding, L.; Tang, H.; Liu, Y.H.; Shi, Y.L.; Zhu, X.X.; Bruzzone, L. Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images. IEEE Trans. Image Process. 2022, 31, 678–690. [Google Scholar] [CrossRef]
- Ding, L.; Guo, H.T.; Liu, S.C.; Mou, L.C.; Zhang, J.; Bruzzone, L. Bi-Temporal Semantic Reasoning for the Semantic Change Detection in HR Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5620014. [Google Scholar] [CrossRef]
- Ding, L.; Zhang, J.; Guo, H.T.; Zhang, K.; Liu, B.; Bruzzone, L. Joint Spatio-Temporal Modeling for Semantic Change Detection in Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5610814. [Google Scholar] [CrossRef]
- Zhang, X.W.; Yang, Y.Z.; Ran, L.Y.; Chen, L.; Wang, K.W.; Yu, L.; Wang, P.; Zhang, Y.N. Remote Sensing Image Semantic Change Detection Boosted by Semi-Supervised Contrastive Learning of Semantic Segmentation. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5624113. [Google Scholar] [CrossRef]
- Zhang, Y.H.; Zhang, W.X.; Ding, S.T.; Wu, S.Y.; Lu, X.Q. Spatial-Temporal Semantic Feature Interaction Net-Work for Semantic Change Detection in Remote Sensing Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 12090–12102. [Google Scholar] [CrossRef]
- Zhang, Z.X.; Du, S.J.; Qian, L.; Qian, G.Y.; Shi, Z.W.; Yan, C. Analysis of spatial and temporal characteristics and influence mechanisms of blue-green spaces in China’s, 2000–2020. Ecol. Indic. 2025, 178, 113903. [Google Scholar] [CrossRef]
- Ming, X.Y.; Tian, Y.C.; Zhang, Q.; Zhang, Y.L.; Tao, J.; Lin, J.L. Coupling ICESat-2 and Sentinel-2 data for inversion of mangrove tidal flat to predict future distribution pattern of mangroves. Int. J. Appl. Earth Obs. Geoinf. 2025, 136, 104398. [Google Scholar] [CrossRef]
- Chen, M.; Zhang, Q.J.; Ge, X.M.; Xu, B.; Hu, H.; Zhu, Q.; Zhang, X. A Full-Scale Connected CNN-Transformer Network for Remote Sensing Image Change Detection. Remote Sens. 2023, 15, 5383. [Google Scholar] [CrossRef]












| Dataset | Year | Resolution | Image Pairs | Image Size | Classes |
|---|---|---|---|---|---|
| HRSCD | 2019 | 0.5 m | 291 | 10,000 × 10,000 | Artificial surfaces, Agricultural areas, Forests, Wetlands, Water |
| Hi-UCD | 2020 | 0.1 m | 1293 | 1024 × 1024 | Water, Grassland, Woodland, Bare land, Building, Greenhouse, Road, Bridge, Others |
| SECOND | 2020 | - | 4662 | 512 × 512 | Low vegetation, N.v.g surface, Tree, Water, Building, Playground |
| Landsat-SCD | 2022 | 30 m | 8468 | 416 × 416 | Farmland, Desert, Building, Water |
| DynamicEarthNet | 2022 | 3 m | 600 | 1024 × 1024 | Impervious surfaces, Agriculture, Forest and Other vegetation, Wetlands, Soil, Water, Snow and Ice |
| CNAM-CD | 2023 | 0.5 m | 2503 | 512 × 512 | Impervious surfaces, Bare land, Vegetation, Water, Others |
| WUSU dataset | 2023 | 1 m | 3 | 6358 × 6382/ 7025 × 5500 | Road, Low building, High building, Arable land, Woodland, Grassland, River, Lake, Structure, Excavation, Bare surface |
| ChangeNet | 2024 | 0.3 m | 31,000 | 1900 × 1200 | Building, Farmland, Bare land, Water, Road |
| FZ-SCD | 2024 | 0.8 m | 4480 | 512 × 512 | Bare ground, Building, Vegetable, Water, Road |
| Semantic Classes | Description |
|---|---|
| Cropland | Paddy field and dryland. |
| Forest | Forest, shrubs, and landscaping seedlings. |
| Facility Agricultural Land | Greenhouse and livestock, poultry, and aquaculture facilities. |
| Construction land | Rural dwellings, public buildings, production facilities, and other structures. |
| Road | Asphalt road and rural road. |
| Water | River, lake, pond, aquaculture pond, and artificial lake, etc. |
| Models | mIoU | Sek | Fscd | OA |
|---|---|---|---|---|
| Baseline | 71.19 | 20.51 | 60.50 | 85.11 |
| Baseline-RL | 72.23 | 22.03 | 61.86 | 86.67 |
| Baseline-Transformer | 73.81 | 26.30 | 65.08 | 87.90 |
| Baseline-RL-Transformer | 75.10 | 28.72 | 67.87 | 88.71 |
| Models | mIoU | Sek | Fscd | OA |
|---|---|---|---|---|
| Baseline | 67.02 | 15.77 | 58.78 | 85.67 |
| Baseline-RL | 67.23 | 16.18 | 59.36 | 85.82 |
| Baseline-Transformer | 67.95 | 17.05 | 60.47 | 86.05 |
| Baseline-RL-Transformer | 68.79 | 17.72 | 61.35 | 86.59 |
| Methods | mIoU | Sek | Fscd | OA |
|---|---|---|---|---|
| FC-Siam-diff | 67.78 | 15.05 | 56.83 | 84.21 |
| FC-Siam-conc | 68.06 | 15.19 | 56.89 | 84.44 |
| SCDNet | 71.29 | 20.70 | 60.71 | 85.69 |
| SSD-l | 72.41 | 22.47 | 61.93 | 86.87 |
| Bi-SRNet | 72.50 | 22.63 | 62.21 | 87.09 |
| MTSCD | 72.61 | 22.72 | 62.51 | 87.14 |
| SCanNet | 73.35 | 23.67 | 63.53 | 87.80 |
| SSCLNet | 74.06 | 26.57 | 65.59 | 87.99 |
| STS-FINet | 73.02 | 22.79 | 63.08 | 87.26 |
| DSTNet | 75.10 | 28.72 | 67.87 | 88.71 |
| Methods | mIoU | Sek | Fscd | OA |
|---|---|---|---|---|
| FC-Siam-diff | 63.82 | 12.29 | 53.52 | 84.17 |
| FC-Siam-conc | 63.90 | 12.45 | 53.69 | 84.21 |
| SCDNet | 66.87 | 15.57 | 58.51 | 85.60 |
| SSD-l | 66.95 | 15.68 | 58.63 | 85.65 |
| Bi-SRNet | 67.57 | 16.65 | 59.85 | 85.92 |
| MTSCD | 67.99 | 17.20 | 60.53 | 86.08 |
| SCanNet | 68.22 | 17.51 | 60.88 | 86.15 |
| SSCLNet | 68.32 | 17.57 | 60.91 | 86.20 |
| STS-FINet | 68.08 | 17.39 | 60.70 | 86.12 |
| DSTNet | 68.79 | 17.72 | 61.35 | 86.59 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Si, B.; Dong, B.; Wang, K. A Differential-Based Siamese Network Integrating the CSWin Transformer for Rural Land Cover Semantic Change Detection. Remote Sens. 2026, 18, 557. https://doi.org/10.3390/rs18040557
Si B, Dong B, Wang K. A Differential-Based Siamese Network Integrating the CSWin Transformer for Rural Land Cover Semantic Change Detection. Remote Sensing. 2026; 18(4):557. https://doi.org/10.3390/rs18040557
Chicago/Turabian StyleSi, Bo, Baiyu Dong, and Ke Wang. 2026. "A Differential-Based Siamese Network Integrating the CSWin Transformer for Rural Land Cover Semantic Change Detection" Remote Sensing 18, no. 4: 557. https://doi.org/10.3390/rs18040557
APA StyleSi, B., Dong, B., & Wang, K. (2026). A Differential-Based Siamese Network Integrating the CSWin Transformer for Rural Land Cover Semantic Change Detection. Remote Sensing, 18(4), 557. https://doi.org/10.3390/rs18040557

