Next Article in Journal
Spatiotemporal Evolution of Urban Blue-Green Spaces and Evaluation of Their Thermal Environmental Benefits in Beijing
Previous Article in Journal
Surface-Subsurface Thermal Correspondence over Coal Fire Areas with UAV Thermal Infrared Remote Sensing and Subsurface Temperature Field Reconstruction
Previous Article in Special Issue
SG-YOLO: A Multispectral Small-Object Detector for UAV Imagery Based on YOLO
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Spatial-Temporal Attention-Based U-Net for Crop Mapping from Time-Series Sentinel-2 Imagery: A Case in Sanjiang Plain

1
The Center for Hyperspectral Imaging in Remote Sensing (CHIRS), Information Science and Technology College, Dalian Maritime University, Dalian 116026, China
2
China Centre for Resources Satellite Data and Application, Beijing 100094, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(11), 1677; https://doi.org/10.3390/rs18111677
Submission received: 4 March 2026 / Revised: 12 May 2026 / Accepted: 21 May 2026 / Published: 22 May 2026

Abstract

Food security is a growing global concern, and accurate crop mapping in major grain-producing regions like China’s Sanjiang Plain—which contributes approximately 7% of national grain output—is essential for agricultural resource management. However, crop classification in this area is hindered by frequent cloud cover, complex phenological rhythms, and spatial heterogeneity. To address these challenges, this study proposes Spatial-Temporal Attention U-Net (STA-UNet), a crop classification model based on time-series Sentinel-2 imagery, incorporating four key modules: Convolutional Block Attention for enhanced sensitivity to parcel boundaries, Temporal Attention Encoder for adaptive capture of temporal dependencies under cloud interference, Dynamic Upsampling for improved boundary recovery of small parcels, and Adaptive Feature Fusion for bridging semantic gaps between heterogeneous features. Extensive experiments on rice, maize, and soybean classification demonstrate that STA-UNet achieves an overall accuracy of 93.61% and an F1-score of 0.925, outperforming state-of-the-art methods. In spatial generalization tests, STA-UNet maintains overall accuracy above 85.02% in the left-subregion transfer setting and achieves the best three-year average OA of 81.34% in the rice-dominated right-subregion stress test, while temporal generalization tests confirm limited inter-annual performance degradation. These results indicate that STA-UNet provides a robust and effective framework for crop mapping in cloud-prone, phenologically complex agricultural regions.
Keywords: crop mapping; multi-temporal remote sensing; attention mechanism; U-Net; Sentinel-2 crop mapping; multi-temporal remote sensing; attention mechanism; U-Net; Sentinel-2

Share and Cite

MDPI and ACS Style

Zhao, E.; Zhang, W.; Wang, Y.; Zhang, H.; Zhao, H. A Spatial-Temporal Attention-Based U-Net for Crop Mapping from Time-Series Sentinel-2 Imagery: A Case in Sanjiang Plain. Remote Sens. 2026, 18, 1677. https://doi.org/10.3390/rs18111677

AMA Style

Zhao E, Zhang W, Wang Y, Zhang H, Zhao H. A Spatial-Temporal Attention-Based U-Net for Crop Mapping from Time-Series Sentinel-2 Imagery: A Case in Sanjiang Plain. Remote Sensing. 2026; 18(11):1677. https://doi.org/10.3390/rs18111677

Chicago/Turabian Style

Zhao, Enyu, Wei Zhang, Yulei Wang, Hao Zhang, and Hang Zhao. 2026. "A Spatial-Temporal Attention-Based U-Net for Crop Mapping from Time-Series Sentinel-2 Imagery: A Case in Sanjiang Plain" Remote Sensing 18, no. 11: 1677. https://doi.org/10.3390/rs18111677

APA Style

Zhao, E., Zhang, W., Wang, Y., Zhang, H., & Zhao, H. (2026). A Spatial-Temporal Attention-Based U-Net for Crop Mapping from Time-Series Sentinel-2 Imagery: A Case in Sanjiang Plain. Remote Sensing, 18(11), 1677. https://doi.org/10.3390/rs18111677

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop