4.1. The Necessity of Agricultural Land Use Classification in Hilly and Mountainous Areas
Cropland, as a fundamental resource for global food security, has been a primary focus of remote sensing monitoring. Over the years, considerable progress has been made in cropland extraction, resulting in the availability of various datasets. Notable examples include the 30 m resolution global map of cropland extent and change [
57], the 30 m annual cropland dataset of China from 1986 to 2021 [
58], the 10 m resolution dataset depicting annual changes in maize cropland in China from 2017 to 2021 [
59], and the 10 m resolution national scale map of cropland use intensity in China during 2018–2023 [
60]. However, these existing classification systems predominantly focus on crop types (e.g., soybean, maize) [
61] or broad cropping patterns [
62]. Research on classifying specific agricultural land use types characterized by distinct landscape morphologies remains limited. This gap is particularly significant in hilly and mountainous regions, where complex topography necessitates a distinction between these land use types to address their critical ecological and economic implications.
From an ecological perspective, different agricultural land use types impose distinct environmental footprints and conservation needs. Terraces serve as effective conservation measures against soil erosion by transforming steep slopes into level platforms to enhance stability and cultivation conditions [
63]. However, socio-economic factors like urbanization and labor migration have caused terrace abandonment in certain regions, which reverses the ecological benefits and may exacerbate soil erosion and slope instability. In contrast, greenhouses have expanded rapidly, particularly in China; the area of agricultural plastic greenhouses grew by 42.4% during 2000–2020 [
64]. While greenhouses optimize microclimates to protect crops from extreme weather and pests, they also introduce environmental challenges: extensive plastic coverings can alter pollination cycles and reduce biodiversity, and the accumulation of plastic residues often degrades soil structure. Therefore, accurate mapping of these agricultural land use types is essential for understanding specific ecological impacts and guiding conservation efforts.
From an economic perspective, agricultural land use types vary substantially in production efficiency, input costs, and income stability. Plain open-field cropland, featuring flat terrain and high mechanization, supports large-scale cultivation with high efficiency and low per-unit costs. However, this type depends heavily on natural rainfall and climatic conditions, leading to limited growing seasons and substantial seasonal income variability [
65]. In contrast, greenhouse cultivation offers high economic potential by extending growing seasons, enabling year-round production, and reducing dependency on seasonal and climatic variations. Greenhouses are particularly suitable for high-value crops such as fruits, vegetables, and flowers. However, greenhouse systems require substantial financial investment for construction and maintenance and are heavily reliant on energy for heating, lighting, and ventilation, making them vulnerable to energy price fluctuations [
66]. Slope cropland, constrained by steep terrain and limited mechanization, is typically used for small-scale agriculture. This type relies on natural rainfall, making it highly susceptible to soil erosion and climate variability, resulting in low production stability [
67]. Therefore, understanding the spatial distribution of different agricultural land use types is crucial for conducting comprehensive economic analyses, optimizing resource allocation, and improving land use efficiency.
In conclusion, developing a specialized classification for diverse agricultural land use types is essential. Such an approach bridges the gap between general monitoring and precision agriculture, enhances ecological conservation efforts, and facilitates comprehensive economic analysis. To fulfill these objectives, the classification results must achieve an accuracy level that can reliably support practical applications and scientific research.
With the support of geographic zoning, this study achieved an overall classification accuracy of 90.81% and a Kappa coefficient of 0.85, indicating its potential to meet these requirements from the following perspectives. First, reliable crop type mapping serves as a fundamental prerequisite for monitoring planting area, crop growth, and yield estimation [
68]. The accuracy level attained in this study is expected to meet the requirements for regional agricultural resource surveys and land use inventories. Second, accurate classification of diverse agricultural land use types contributes to crop phenology analysis and growth monitoring in heterogeneous environments, where data quality and temporal resolution directly affect classification reliability [
69]. Third, accurate delineation of agricultural field boundaries from high-resolution imagery has been recognized as crucial for parcel-level cropland monitoring and precision agriculture management [
70]. The high boundary precision achieved through our zoning-optimized segmentation strategy suggests that the proposed method can provide reliable parcel-level spatial references, potentially supporting area estimation, agricultural economic analysis, and resource allocation optimization at the regional scale.
4.2. Pathways of Geographical Zoning in Improving Classification Accuracy: Feature Heterogeneity and Boundary Precision
The concept of geographical zoning has a long history, with its roots tracing back to the early 19th century when Alexander von Humboldt combined climate and vegetation distribution to explore the laws of geographic zonation. This pioneering work led to the creation of the first global isothermal map and marked a significant milestone in geographical zoning research [
71]. Over time, the application of geographical zoning has expanded across various fields, such as climatic zoning [
72], ecological zoning [
73], and agricultural zoning [
74]. Despite this extensive theoretical foundation, the potential of geographical zoning in remote sensing classification remains underexplored [
75].
In remote sensing, the most common strategy for handling large-scale or HR remote sensing imagery is image tiling. This method divides the study area into equal-sized rectangular tiles or segments based on administrative boundaries to enhance computational efficiency [
76,
77]. For example, Zhang et al. [
78] divided their study area into 159 km grid cells to classify land cover in North America, demonstrating that the average classification confidence achieved by training a separate classifier for each grid cell is higher than that of the global classifier. Campos et al. [
79] divided their study area into 5 km grid cells to classify small farms in Spain, demonstrating that tiling could address spatial variability and improve classification accuracy by approximately 12%. However, the image tiling method overlooks the spatiotemporal heterogeneity of geographical environmental factors. This mechanical division poses challenges for classification in areas with complex environments or diverse land cover types.
With the growing recognition of spatial heterogeneity and multi-scale variability in remote sensing, geographical zoning has gained attention as a more optimized alternative to image tiling [
80]. By leveraging the attributes and characteristics of geographical, ecological, and human activity features, geographical zoning provides a comprehensive representation of regional environmental differences. This approach reduces the complexity of the environment and land cover within subzones, thereby creating the potential for improved classification accuracy [
81]. For instance, Jin et al. [
82] applied eco-geographical zoning using environmental data and clustering methods for 30 m national-scale land cover classification. Their findings indicated that this approach effectively reduced the regional complexity of land cover, although no classification validation experiments within the zones were reported. Similarly, Jiang et al. [
83] classified forests in the Pacific Northwest of the United States using WWF-provided ecological zoning. Their results demonstrated that zoning effectively reduced spectral heterogeneity and led to a 15% improvement in classification accuracy.
However, reliance on pre-existing zoning datasets introduces significant limitations in the context of HR remote sensing applications. First, there is a scale mismatch because most existing zoning datasets are designed for large-scale mapping scenarios, such as global or national analyses. These datasets typically define zones that encompass extensive areas, which often do not align with the finer spatial resolution or specific objectives of HR studies. Second, the loss of spatial detail is a critical issue. The original data used to create large-scale zoning datasets often has a resolution at the kilometer scale, which is insufficient for capturing detailed geographic and spectral variations required for precision agriculture. Third, traditional zoning methods typically overlook temporal dynamics. These methods produce results that only reflect spatial differences at a single time point and fail to account for dynamic phenological changes over time, which are essential for distinguishing agricultural land use types [
84].
In this study, we addressed these challenges by integrating multi-source data to develop a spatiotemporal feature-driven geographical zoning method. This approach improves classification accuracy through two primary pathways: reducing feature heterogeneity and enhancing boundary precision.
Feature heterogeneity: Geographical zoning enhances classification by creating subzones with more concentrated distributions of land use types. As shown in
Figure 13, each subzone after zoning represents distinct conditions of topography, human activity, and vegetation phenology, resulting in spectral and other classification feature values (e.g., texture, topography) that are more distinctive. The increased inter-regional feature differences reduce confusion among similar types, such as the spectral overlap between “same objects with different spectra” or “different objects with similar spectra” [
85]. Moreover, zoning allows the classification models to be locally optimized within each subzone, enabling them to focus on regional feature patterns, thereby reducing classification errors associated with the global model.
Boundary precision: By optimizing segmentation scales within each subzone, the segmentation results more accurately reflect the actual distribution of cropland types within the region. As shown in
Figure 14, the segmentation results of the zoning model delineate cropland boundaries more precisely than those of the global model. Segmentation boundaries define the spatial extent for calculating classification characteristics of the target patches. When segmentation scales are too large, as shown in
Figure 14b,c,e, patches may include multiple land-cover types, leading to mixed feature calculations and subsequent misclassification. Conversely, overly small segmentation scales divide a single type of distribution into an excessive number of patches, as illustrated in
Figure 14a,d, increasing computational complexity while reducing efficiency.
4.3. Limitations and Future Work
The proposed method has several limitations that warrant further investigation.
First, the geographical zoning strategy relies on optical time-series data to capture vegetation phenology. While this approach proved highly effective in the relatively dry and clear-sky conditions of northern mountainous regions like Beijing, its transferability to southern regions requires caution. Southern hilly areas often experience persistent cloud cover and rain during critical crop growth stages, causing severe data gaps in optical imagery. To ensure all-weather phenological monitoring and enhance model transferability across climatically diverse regions, future research should integrate optical data with Synthetic Aperture Radar (SAR) time-series imagery [
86].
Second, as a hierarchical approach, errors in the first-level zoning may propagate to the subsequent classification stage, particularly in transitional areas along zoning boundaries. Although the dual-condition intersection strategy and the statistically derived thresholds adopted in this study have effectively minimized such risks, future research should quantitatively assess the sensitivity of the final classification accuracy to potential zoning errors, for example, by systematically introducing perturbations to the zoning boundaries and evaluating the resulting changes in classification performance.
Third, this study adopted the RF classifier based on its proven robustness with limited training samples in agricultural classification tasks. Future studies will explore the integration of Deep Learning (DL) architectures with the geographical zoning framework through two potential strategies [
87]. The first involves embedding the geographical zoning information as prior knowledge into DL networks, for example, by encoding the zoning results as additional input feature channels or incorporating them into spatial attention modules to guide the network to learn zone-specific feature patterns. The second involves replacing the current RF classifier within each subzone with DL-based semantic segmentation models (e.g., U-Net or DeepLab), which could directly learn spatial features from high-resolution imagery under zone-specific constraints. A key challenge in both strategies is that DL models typically require large volumes of labeled training data, and the sample availability within individual subzones after geographical zoning may be insufficient for effective model training. Addressing this issue through strategies such as transfer learning or data augmentation represents an important future direction. Additionally, systematically comparing the performance of different classification algorithms within the geographical zoning framework will help assess the generalizability of the proposed zoning strategy across different classifiers.