Zonal Estimation of the Earliest Winter Wheat Identification Time in Shandong Province Considering Phenological and Environmental Factors

Chen, Jiaqi; Du, Xin; Wang, Chen; Cai, Cheng; Fang, Guanru; Wang, Ziming; Liu, Mengyu; Zhang, Huanxue

doi:10.3390/agronomy15061463

by Jiaqi Chen ¹, Xin Du ²,³, Chen Wang ¹, Cheng Cai ¹, Guanru Fang ¹, Ziming Wang ¹, Mengyu Liu ¹ and Huanxue Zhang ¹,*

¹ College of Geography and Environment, Shandong Normal University, Jinan 250300, China

² Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100101, China

³ University of Chinese Academy of Sciences, Beijing 100049, China

* Author to whom correspondence should be addressed.

Agronomy 2025, 15(6), 1463; https://doi.org/10.3390/agronomy15061463

Submission received: 14 May 2025 / Revised: 9 June 2025 / Accepted: 13 June 2025 / Published: 16 June 2025

(This article belongs to the Section Precision and Digital Agriculture)

Abstract

Early-season crop mapping plays a critical role in yield estimation, agricultural management, and policy-making. However, most existing methods assign a uniform earliest identification time across provincial or broader extents, overlooking spatial heterogeneity in crop phenology and environmental conditions. This often results in delayed detection or reduced mapping accuracy. To address this issue, we proposed a zonal-based early-season mapping framework for winter wheat by integrating phenological and environmental factors. Aggregation zones across Shandong Province were delineated using Principal Component Analysis (PCA) based on factors such as start of season, end of season, temperature, slope, and others. On this basis, early-season winter wheat identification was conducted for each zone individually. Training samples were generated using the Time-Weighted Dynamic Time Warping (TWDTW) method. Time-series datasets derived from Sentinel-1/2 imagery (2021–2022) were processed on the Google Earth Engine (GEE) platform, followed by feature selection and classification using the Random Forest (RF) algorithm. Results indicated that Shandong Province was divided into four zones (A–D), with Zone D (southwestern Shandong) achieving the earliest mapping by early December with an overall accuracy (OA) of 97.0%. Other zones reached optimal timing between late December and late January, all with OA above 95%. The zonal strategy improved OA by 3.6% compared to the non-zonal approach, demonstrated a high correlation with official municipal-level statistics (R² = 0.97), and surpassed the ChinaWheat10 and ChinaWheatMap10 datasets in terms of crop differentiation and boundary delineation. Historical validation using 2017–2018 data from Liaocheng City, a prefecture-level city in Shandong Province, achieved an OA of 0.98 and an F₁ score of 0.96, further confirming the temporal robustness of the proposed approach. This zonal strategy significantly enhances the accuracy and timeliness of early-season winter wheat mapping at a large scale.

Keywords: winter wheat; earliest identification time; phenological feature; environmental factor; remote sensing

1. Introduction

Food security remains a critical global concern, increasingly challenged by rapid urbanization, climate change, soil salinization, and arable land degradation, all of which exert mounting pressure on agricultural production [1]. As a major contributor to and beneficiary of global agricultural production and consumption [2], China relies heavily on stable crop yields to safeguard national food security. Winter wheat is one of the most important staple crops, contributing more than 20% to the national grain-sowing area and total yield. The North China Plain functions as the primary zone for its cultivation. Rapid and accurate identification of winter wheat distribution is thus essential for agricultural monitoring, crop management, and structural optimization [3].

Remote sensing has proven to be an effective and economical approach for large-scale crop mapping, owing to its extensive spatial coverage, frequent temporal observations, and inherent objectivity [4]. However, traditional classification approaches typically rely on imagery from the full growing period or from specific phenological stages [5], leading to substantial delays in the availability of crop distribution information, often until harvest or later. For instance, the USDA’s Cropland Data Layer (CDL) is generally released five months after harvest, limiting its application for real-time decision-making [6]. To overcome this limitation, early-season crop mapping has been proposed to enable the timely identification of crop types during the growing period [7]. This approach supports numerous applications, including disaster response, agricultural insurance, precision farming, yield estimation, and environmental monitoring [8].

Unlike traditional post-season classification, early-season mapping presents several unique challenges. One of the primary issues is the lack of timely training samples, as early-season survey sample collection is labor-intensive, time-consuming, and often delayed beyond the optimal mapping window. To address this, researchers developed a variety of automated sample generation methods. Among them, sample transfer became widely used, whereby early-season samples were generated from historical labeled remote sensing data and applied to current classification tasks [8]. To improve this method’s adaptability to sowing date variations, the Time-Weighted Dynamic Time Warping (TWDTW) technique was incorporated [9]. This method automatically matches historical samples to current crop growth conditions by comparing the similarity of temporal curves and has proven effective in early-season sample generation.

Another challenge stems from the heavy dependence on early-season satellite observations, which are essential for capturing key phenological and spectral traits. However, data quality during this period is often compromised by frequent cloud cover. To mitigate this problem, time-series interpolation techniques can be employed to reconstruct complete temporal datasets across the crop growth cycle, improving data continuity and usability [10]. Once the time-series feature dataset has been reconstructed, feature selection becomes a crucial step for improving model performance, because it enhances model simplicity, computational efficiency, and classification accuracy. Researchers have typically constructed time-series features spanning the entire growth period to represent spectral responses throughout crop development and to compensate for missing key phenological stages [11]. However, due to the high dimensionality and redundancy of raw features, it is necessary to perform dimensionality reduction before feature ranking [12]. For example, a study in the Huaihe Basin extracted time-series features such as the Normalized Difference Vegetation Index (NDVI), Enhanced Vegetation Index (EVI), Normalized Difference Red Edge Index (NDRDI), and Land Surface Water Index (LSWI), and used the Random Forest (RF) algorithm to evaluate their importance, achieving an early-season accuracy of 91% [13]. Another study in Henan Province selected eight representative spectral indices, including the LSWI, Green Chlorophyll Vegetation Index (GCVI), Red Edge 2, Inverted Red Edge Chlorophyll Index (IRECI), and others, further improving classification performance [14].

Despite considerable progress in early-season crop mapping, large-scale applications remain challenging due to spatial heterogeneity and environmental variability. In Shandong Province, several studies have reported an underestimation of winter wheat distribution during the overwintering period [5,15], primarily attributed to inconsistent phenological development driven by topographic and climatic differences. In warmer and wetter zones, winter wheat resumes growth earlier, while in colder areas with limited sunlight, its growth remains dormant for extended periods. Consequently, early detection requires zone-specific adaptation to phenological and environmental differences. To mitigate spatial inconsistency, some studies incorporated phenological information into mapping frameworks. For example, rice distribution in Northeast Asia was effectively mapped using flood signal phenology combined with vegetation indices [16]. However, environmental factors such as rainfall, temperature, and elevation exhibited substantial spatial variability, making it challenging to fully capture zonal differences based solely on phenological features. Thus, zone-specific threshold adjustments were still necessary to maintain classification stability across diverse zones [17].

Subdividing the study area into finer zones was recognized as an effective strategy to enhance zonal adaptability. Systems such as Agroecological Zones (AEZs) and Agricultural Climatic Zones (ACZs) were widely adopted to support zonal classification and improve large-scale mapping accuracy. For instance, the classification accuracy of winter wheat significantly improved after Jiangsu Province was divided into four AEZs and mapped separately [18]. Nevertheless, predefined zones often failed to reflect real-time planting conditions within a given year, limiting their effectiveness for early-season mapping. Therefore, future zoning strategies require dynamic integration of phenological and environmental information to improve robustness and adaptability.

To tackle these challenges, this study proposes a zonal-based early-season mapping approach that integrates phenological and environmental factors. Zone-specific optimal identification times in Shandong Province are determined using Sentinel-1/2 time-series data processed on the Google Earth Engine (GEE) platform. This study aims to (1) construct and assess an integrated approach to enhance both the timeliness and accuracy of large-area winter wheat mapping at early growth periods; (2) identify the earliest feasible identification time for winter wheat within each defined zone; and (3) develop zone-specific classification strategies that account for environmental heterogeneity.

2. Materials

2.1. Study Area

Shandong Province, located in the lower reaches of the Yellow River, extends from 34°23′ N to 38°17′ N and 114°48′ E to 122°42′ E. As a prominent agricultural zone in eastern China, it lies within the North China Plain and is characterized by a warm temperate monsoon climate, featuring hot and rainy summers as well as cold and dry winters (Figure 1a). These climatic conditions, coupled with fertile soil, provide an optimal environment for agricultural production. The central part of Shandong Province features mountainous landscapes, whereas the southwest and northwest are mainly flat and low in elevation. In contrast, the eastern area is dominated by gently rolling hills (Figure 1b).

In 2021, the arable land of Shandong Province reached 64,000 km², representing 40.5% of the provincial territory, highlighting its importance as a major grain-producing zone. Winter wheat had a sown area of 2.357 million ha in 2021, comprising 16.95% of the national winter wheat cultivation [19]. As China’s second-largest winter wheat-producing province, Shandong is crucial to maintaining national food security.

In Shandong, winter wheat typically grows over a period of 220 to 270 days. Sowing occurs from September to October, followed by dormancy beginning in December. As temperatures rise, growth resumes between February and March. The jointing stage usually starts in early April, transitions into the heading phase from mid-April to May, and the crop is generally harvested by June.

2.2. Data and Preprocessing

2.2.1. Remote Sensing Imagery

This study utilized 1428 Sentinel-2 MultiSpectral Instrument (MSI) Top-of-Atmosphere (TOA) reflectance images acquired from the GEE platform, spanning the winter wheat-growing period in Shandong Province from 10 October 2021 to 10 June 2022. Additionally, 464 Sentinel-2 images were acquired for Liaocheng City during the 2017–2018 winter wheat-growing period [20]. Sentinel-2 provides 13 spectral bands with a maximum spatial resolution of 10 m and a temporal resolution of 5 days. For the 2021–2022 winter wheat-growing period, images with cloud cover greater than 30%, as indicated by the QA60 band, were excluded, so that only cloud-free or low-cloud images (with cloud cover below 30%) were retained for further analysis. These valid images provided sufficient spatial and temporal coverage for the study area.

To mitigate temporal discontinuities caused by orbital differences, the Sentinel-2 data were aggregated into 10-day intervals using maximum value compositing. A Savitzky–Golay (SG) filter with a 70-day moving window and a third-order polynomial function was subsequently employed to smooth the time series, generating a dataset with 10-day temporal intervals.
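The compositing and smoothing steps can be illustrated with a minimal, standard-library sketch. It is not the authors' GEE code. It assumes that the 70-day window is read as seven consecutive 10-day composites, uses the well-known 7-point cubic Savitzky–Golay smoothing weights, and leaves the first and last three points unsmoothed. The function names and the edge handling are illustrative assumptions.

```python
from datetime import date

# Savitzky-Golay smoothing weights for a 7-point window and a cubic
# (or quadratic) polynomial; they sum to 1.
SG7_CUBIC = [w / 21 for w in (-2, 3, 6, 7, 6, 3, -2)]


def max_value_composite(observations, start, n_periods, step_days=10):
    """Keep the maximum value observed in each consecutive step_days window.

    observations: iterable of (date, value); windows with no data yield None.
    """
    composites = [None] * n_periods
    for day, value in observations:
        idx = (day - start).days // step_days
        if 0 <= idx < n_periods:
            if composites[idx] is None or value > composites[idx]:
                composites[idx] = value
    return composites


def sg_smooth(series):
    """Smooth a gap-free, evenly spaced series with the 7-point cubic filter.

    Assumption: the first and last three points are returned unchanged.
    """
    half = len(SG7_CUBIC) // 2
    smoothed = list(series)
    for i in range(half, len(series) - half):
        window = series[i - half:i + half + 1]
        smoothed[i] = sum(w * v for w, v in zip(SG7_CUBIC, window))
    return smoothed
```

In practice, empty composites (`None`) would need to be filled, for example by interpolation, before smoothing. The source does not state how such gaps were handled.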

We also acquired Sentinel-1 Ground Range Detected (GRD) images during the winter wheat-growing period, including 392 scenes across Shandong Province from 2021 to 2022 and 98 scenes within Liaocheng City from 2017 to 2018. The dataset comprised vertical transmit/vertical receive (VV) and vertical transmit/horizontal receive (VH) polarization modes acquired in Interferometric Wide (IW) swath mode. Sentinel-1 imagery featured a spatial resolution of 10 m and a temporal resolution of 6 days. The preprocessing workflow included the elimination of thermal noise, application of radiometric calibration, and execution of terrain correction. Speckle suppression was achieved using a refined Lee filter with a 7 × 7 kernel. The processed SAR images were then resampled to a 10-day interval to align with the Sentinel-2 time series, ensuring temporal consistency in the integration of multi-source remote sensing datasets.

2.2.2. Auxiliary Data

This study employed phenological features and environmental factors jointly to generate zones. Phenological features were obtained by fitting time-series curves using the TIMESAT (version 3.3) software based on the MOD13A1.061 Terra 16-Day 500 m Global Vegetation Indices dataset provided by the GEE platform [21]. Environmental factors, including average temperature, evapotranspiration, and precipitation during the winter wheat-growing period (October to the following June), as well as terrain slope, were extracted from the following datasets. Specifically, the temperature, evapotranspiration, and precipitation data were obtained from the National Tibetan Plateau Data Center (https://data.tpdc.ac.cn/, accessed on 1 October 2024), while the slope was derived from the digital elevation model (DEM) provided by the United States Geological Survey (https://earthexplorer.usgs.gov/, accessed on 1 October 2024). All datasets were reprojected and resampled to 1 km resolution to ensure spatial alignment and positional consistency.

Cropland data from the global 10 m land cover product dataset “ESA/WorldCover/v200” released by the European Space Agency (ESA) in 2021 and winter wheat planting area official statistics from the Shandong Provincial Bureau of Statistics (http://tjj.shandong.gov.cn/tjnj/nj2021/indexch.htm, accessed on 1 October 2024) for Shandong Province were collected for masking the cropland and comparison analysis, respectively.

To enable the automatic acquisition of winter wheat training samples, we collected historical winter wheat datasets, including ChinaWheatMap10 [22], ChinaWheat10 [23], and the 30 m winter wheat distribution map of 11 provinces in China [24], from which samples were extracted to serve as reference data for current-season sample selection.

2.2.3. Sample Dataset

Over the course of the 2021–2022 winter wheat-growing period, the research team systematically collected survey samples using handheld GPS devices, encompassing winter wheat and other crop types. To improve the reliability and representativeness of the sample dataset, all field survey samples were carefully validated using Google Earth high-resolution imagery. In addition, supplementary winter wheat and non-winter wheat samples were manually labeled through visual interpretation based on high-resolution images available from Google Earth (Figure 1c). The integrated set of survey samples and manually labeled samples was subsequently used to construct standard VH time-series curves, which supported the automatic sample generation process and served as validation samples for evaluating the accuracy of winter wheat classification. The collected validation samples are presented in Table 1.

3. Methods

The overall workflow is illustrated in Figure 2. The primary objective of the study was to accurately determine the earliest identification time of winter wheat across different zones in Shandong Province, following three main steps:

(1) Zoning Shandong Province based on a comprehensive analysis of phenological features and environmental factors.

(2) Automatically generating current-season training samples for winter wheat and non-winter wheat using the TWDTW method.

(3) Selecting optimal features for each zone and employing the Random Forest classifier to determine the earliest identification time of winter wheat.

3.1. Clustering Zone Generation Based on Phenological and Environmental Factors

3.1.1. Phenological and Environmental Factors Preparation

In this study, zonal delineation across the study area was based on phenological features and environmental factors, with detailed descriptions provided in Table 2. To avoid multicollinearity among these factors, the Spearman correlation coefficients were first calculated, and variables with a correlation greater than 0.9 were excluded [25]. Specifically, amplitude and maximum vegetation value (MVV) exhibited a high degree of correlation, and thus, MVV was removed.
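The collinearity screening can be sketched as follows. This is an illustrative, standard-library version rather than the authors' procedure. Spearman's coefficient is computed as the Pearson correlation of average ranks, and a greedy pass keeps a factor only if its absolute correlation with every factor already kept does not exceed 0.9. The greedy ordering, the function names, and any input values are assumptions.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def drop_collinear(factors, threshold=0.9):
    """Greedily keep factors whose |rho| with every kept factor is <= threshold.

    factors: dict of factor name -> list of values. Insertion order decides
    which member of a highly correlated pair survives (an assumption here).
    """
    kept = []
    for name, values in factors.items():
        if all(abs(spearman(values, factors[k])) <= threshold for k in kept):
            kept.append(name)
    return kept
```

With amplitude listed before MVV, a pair of this kind would lose MVV, which mirrors the outcome reported above.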

Secondly, Principal Component Analysis (PCA) was employed to condense the dimensionality of the selected phenological and environmental factors. Prior to analysis, all variables were standardized using Z-score normalization to eliminate the influence of scale differences among indicators, as defined by the following formula:

$$x' = \frac{x - \bar{x}}{\sigma} \tag{1}$$

where $x'$ refers to the standardized form of the variable, computed as the deviation from the mean $\bar{x}$, normalized by the standard deviation $\sigma$.

Subsequently, principal components were extracted, and an orthogonal rotation was applied to optimize the loading matrix structure, thereby enhancing the interpretability of each principal component. In accordance with the commonly used eigenvalue criterion, only components with eigenvalues exceeding 1 were preserved [26], as these accounted for the majority of the variance within the dataset. Each principal component was expressed as a linear combination of standardized variables, formulated as

$$F_{i} = a_{1} x_{1} + a_{2} x_{2} + \dots + a_{n} x_{n} \tag{2}$$

where $F_{i}$ represents the $i$-th principal component, $a_{1}, a_{2}, \dots, a_{n}$ are the corresponding coefficients, and $x_{1}, x_{2}, \dots, x_{n}$ represent the standardized indicator values.

Through this approach, complex phenological and environmental factors were transformed into a limited number of comprehensive principal components. This not only reduced data dimensionality, but also more clearly revealed the dominant variations among indicators and their internal relationships.

3.1.2. Zone Delineation Using K-Means Clustering Method

In this study, the K-means clustering algorithm was applied to group and analyze the composite indicators obtained through PCA. As a widely adopted unsupervised learning method, K-means delineates spatially homogeneous zones by assigning each sample to the nearest cluster centroid, thereby minimizing the total within-cluster sum of squares.

This clustering approach effectively delineates zones with similar feature characteristics, enabling a reduction in internal heterogeneity across large-scale research areas. K-means was selected not only for its computational efficiency and scalability but also due to its unsupervised nature, which avoids reliance on manually selected training samples and minimizes human subjectivity. Instead, it clusters samples based solely on their intrinsic feature separability. Similar strategies have been successfully employed in previous studies involving time-series remote sensing data for crop phenology and zonal mapping, further demonstrating the method’s robustness and suitability for large-scale agricultural applications [25,27].

During the clustering process, the similarity between samples was quantified using the Euclidean distance, as defined by the following formula:

$$d(x, y) = \sqrt{\sum_{i = 1}^{n} (x_{i} - y_{i})^{2}} \tag{3}$$

where $d(x, y)$ quantifies the dissimilarity between two samples $x$ and $y$, $x_{i}$ and $y_{i}$ represent the values of the $i$-th feature for each sample, and $n$ is the total number of features. Based on this distance metric, each sample was iteratively assigned to the closest cluster center.
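The assignment and update loop described above can be sketched with a small, standard-library implementation of Lloyd's algorithm. It is a sketch under stated assumptions, not the authors' code. The initial centroids are passed in explicitly because the source does not describe the initialization, and the function names are hypothetical.

```python
import math


def euclidean(x, y):
    """Euclidean distance between two equal-length feature vectors (Eq. 3)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def kmeans(points, centroids, max_iter=100):
    """Lloyd's algorithm: alternate nearest-centroid assignment and update.

    points, centroids: lists of tuples. Returns (labels, centroids).
    """
    centroids = [tuple(c) for c in centroids]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centroids)), key=lambda j: euclidean(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return labels, centroids
```

Here each point would be the vector of retained principal component scores for one 1 km grid cell, and the returned labels would correspond to the zones.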

To determine the optimal number of clusters, both the Elbow Method and the Silhouette Coefficient were adopted. The Elbow Method involved plotting the ratio of the between-cluster sum of squares (BSS) to the total sum of squares (TSS) across varying values of $k$ and identifying the point at which the rate of increase in this ratio began to level off. The ratio was computed using the following formula:

$$\frac{BSS}{TSS} = \frac{\sum_{j = 1}^{k} n_{j} \| \mu_{j} - \bar{x} \|^{2}}{\sum_{i = 1}^{N} \| x_{i} - \bar{x} \|^{2}} \tag{4}$$

where $N$ represents the total sample size, $k$ represents the number of clusters, $x_{i}$ is the feature vector of the $i$-th sample, $\bar{x}$ denotes the overall mean, $\mu_{j}$ is the centroid of cluster $j$, and $n_{j}$ indicates the number of samples in that cluster [28].
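As a hedged illustration of Eq. (4), the ratio can be computed directly from a set of points and their cluster labels. The function name is hypothetical and the sketch uses only the standard library.

```python
def bss_tss_ratio(points, labels):
    """Share of the total sum of squares explained by the clustering (Eq. 4)."""
    n_dims = len(points[0])
    grand_mean = [sum(p[d] for p in points) / len(points) for d in range(n_dims)]

    def sq_dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))

    tss = sum(sq_dist(p, grand_mean) for p in points)
    bss = 0.0
    for cluster in set(labels):
        members = [p for p, lab in zip(points, labels) if lab == cluster]
        centroid = [sum(p[d] for p in members) / len(members) for d in range(n_dims)]
        bss += len(members) * sq_dist(centroid, grand_mean)
    return bss / tss
```

Evaluating this ratio for a range of candidate $k$ values and looking for the point where it stops rising quickly reproduces the elbow diagnostic described above. The ratio is 0 when all samples share one cluster and 1 when every sample forms its own cluster.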

Moreover, the Silhouette Coefficient was employed to assess the clustering quality by simultaneously considering the cohesion within clusters and the separation between different clusters. It provides an effective metric for evaluating the overall clustering structure, and has been widely applied in ecological and remote sensing studies due to its balance between intra-cluster compactness and inter-cluster distinctiveness [25]. It was calculated as

$$S(k) = \frac{b(k) - a(k)}{\max(a(k), b(k))} \tag{5}$$

where $a(k)$ denotes the mean distance between sample $k$ and all other members of the same cluster, whereas $b(k)$ refers to the average distance between sample $k$ and the samples in the closest adjacent cluster (here $k$ indexes a sample rather than the number of clusters). The coefficient varied between −1 and 1, with values approaching 1 indicating stronger and more distinct clustering.
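A minimal sketch of Eq. (5) is given below, assuming Euclidean distances and at least two clusters. Assigning a score of 0 to a sample that is alone in its cluster is a common convention and is not stated in the source. The function name is hypothetical.

```python
import math


def silhouette_scores(points, labels):
    """Per-sample silhouette values following Eq. (5); needs >= 2 clusters."""
    def mean_dist(p, others):
        return sum(math.dist(p, q) for q in others) / len(others)

    scores = []
    for i, (p, lab) in enumerate(zip(points, labels)):
        same = [q for j, (q, l) in enumerate(zip(points, labels)) if l == lab and j != i]
        if not same:
            scores.append(0.0)  # assumed convention for single-member clusters
            continue
        a = mean_dist(p, same)  # cohesion: mean distance within own cluster
        b = min(                # separation: nearest other cluster
            mean_dist(p, [q for q, l in zip(points, labels) if l == other])
            for other in set(labels) if other != lab
        )
        scores.append((b - a) / max(a, b))
    return scores
```

Averaging the per-sample values gives a single score per candidate number of clusters, which can then be compared with the elbow diagnostic.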

Finally, the results of both evaluation metrics were jointly considered to determine the optimal number of clusters. This ensured that the final zonal delineation achieved both internal homogeneity and external distinctiveness across the delineated zones.

3.2. Automated Sample Generation Using the TWDTW Method

To obtain representative winter wheat training samples with strong adaptability and robustness, winter wheat pixels were first extracted from historical datasets corresponding to 2018, 2019 and 2020, and then their spatial intersection was calculated. To further refine the sample quality, an area-based filtering step was subsequently applied. Specifically, winter wheat patches were ranked by area, and the smallest 50% were excluded. This allowed the retention of only relatively large and spatially consistent zones, from which training samples were selected.

Subsequently, a standard winter wheat time-series curve was constructed based on both survey samples and manually labeled samples. The TWDTW algorithm was then applied to calculate the TWDTW distance between candidate pixels and the standard curve. Pixels with TWDTW distances below a predefined threshold were selected as training samples.

The selection of an appropriate input variable is critical to the effectiveness of TWDTW in sample generation. Common vegetation indices like NDVI and EVI exhibited constrained effectiveness in differentiating winter wheat from other co-occurring winter crops. In contrast, a significant discrepancy in VH backscatter coefficients between winter wheat and other winter crops in Shandong Province was reported, indicating the potential of the VH band to enhance class separability [20]. Therefore, VH time-series data were employed as inputs to the TWDTW algorithm to facilitate the automated extraction of training samples.
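The matching step can be illustrated with a simplified TWDTW sketch. It is not the implementation used in the study. The local cost adds a logistic time weight to the absolute difference in VH backscatter, and the accumulated cost follows the classic DTW recursion with full end-to-end alignment. The steepness `alpha`, the midpoint `beta`, and all function names are illustrative assumptions, since the source does not report these settings.

```python
import math


def logistic_time_weight(day_gap, alpha=0.1, beta=50.0):
    """Logistic penalty that grows with the time gap in days (assumed values)."""
    return 1.0 / (1.0 + math.exp(-alpha * (day_gap - beta)))


def twdtw_distance(ref, cand, alpha=0.1, beta=50.0):
    """Accumulated TWDTW cost between two series of (day, value) pairs.

    Local cost = |value difference| + logistic time weight. Simplification:
    both series are aligned end to end, as in classic DTW.
    """
    n, m = len(ref), len(cand)
    inf = float("inf")
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        t_r, v_r = ref[i - 1]
        for j in range(1, m + 1):
            t_c, v_c = cand[j - 1]
            cost = abs(v_r - v_c) + logistic_time_weight(abs(t_r - t_c), alpha, beta)
            acc[i][j] = cost + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]
```

A candidate pixel whose distance to the standard winter wheat VH curve falls below the chosen threshold would be retained as a winter wheat training sample.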

3.3. Winter Wheat Early-Season Mapping

3.3.1. Feature Selection

This study integrated both spectral and radar features to enhance the discrimination among crop types and mitigate the adverse effects caused by spectral similarity on classification performance. Spectral features captured the physiological and biochemical characteristics of vegetation, whereas radar features provided complementary information related to surface structure and backscattering properties. The fusion of these heterogeneous data sources significantly improved the robustness and accuracy of crop classification.

Specifically, spectral features were derived from ten multispectral bands of Sentinel-2 imagery, alongside vegetation indices that reflect crop growth conditions [29]. Radar features included the VV and VH polarization bands extracted from Sentinel-1 data [30], which are sensitive to surface roughness and structural variations. The detailed list of spectral bands and vegetation indices used in this study is provided in Table 3.

To identify the most discriminative features for classification within each zone, the RF algorithm was employed to calculate the Variable Importance Measure (VIM) scores (Section 3.3.2). An ensemble of decision trees was constructed, and the averaged feature importance scores were used to identify the most relevant variables, thereby optimizing the overall classification performance.

3.3.2. Winter Wheat Early-Season Mapping and Accuracy Assessment

The RF algorithm was utilized in this study to facilitate the early-season identification of winter wheat. RF has been widely applied to land cover monitoring tasks on the GEE platform [38], including dynamic land cover mapping, detection of croplands and irrigated areas, and crop type classification [39]. Within the framework of ensemble learning, RF enhanced model diversity and reduced generalization error by incorporating bootstrap aggregation (bagging) and random feature subset selection. Moreover, it allowed for the evaluation of variable importance when determining class labels. During the classification process, final outputs were determined by majority voting across all decision trees. Previous studies have demonstrated that RF outperformed several traditional classifiers, including maximum likelihood classification and shallow neural networks, in terms of robustness, accuracy, and computational efficiency [12,40]. Given its strong capability in handling high-dimensional data and its robustness to collinearity and redundancy, RF was well suited to the needs of this study.

The number of trees in the RF was set to 100, and the minimum number of samples per leaf node was set to 10, while all remaining hyperparameters were kept at their default values.

Early-season winter wheat mapping was conducted for each zone based on the optimal strategy identified for that specific zone. Classification performance was assessed based on confusion matrices, from which evaluation metrics including overall accuracy (OA), user’s accuracy (UA), producer’s accuracy (PA), and the F₁ score were derived.

$$F_{1} = 2 \times \frac{PA \times UA}{PA + UA} \tag{6}$$
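As a worked illustration of these metrics, the sketch below derives OA, UA, PA, and the F₁ score for the winter wheat class from a two-class confusion matrix. The function name and the counts in any example call are hypothetical.

```python
def binary_accuracy_metrics(tp, fp, fn, tn):
    """OA, UA, PA and F1 for the wheat class from a 2x2 confusion matrix.

    tp: wheat mapped as wheat; fp: non-wheat mapped as wheat;
    fn: wheat mapped as non-wheat; tn: non-wheat mapped as non-wheat.
    """
    oa = (tp + tn) / (tp + fp + fn + tn)
    ua = tp / (tp + fp)  # user's accuracy: share of mapped wheat that is wheat
    pa = tp / (tp + fn)  # producer's accuracy: share of true wheat that is mapped
    f1 = 2 * pa * ua / (pa + ua)  # Eq. (6)
    return {"OA": oa, "UA": ua, "PA": pa, "F1": f1}
```

For example, with 90 correctly mapped wheat samples, 10 false detections, 5 omissions, and 95 correctly mapped non-wheat samples, OA is 185/200 = 0.925 and F₁ is 180/195 ≈ 0.923.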

4. Results

4.1. Clustering Zone Results and Analysis

In this study, a total of eleven principal components were extracted through PCA. The eigenvalues of the first three principal components were 2.80, 2.50, and 1.50, respectively, cumulatively explaining 86.9% of the total variance (PC1: 40.5%; PC2: 27.7%; PC3: 18.7%). The eigenvalues of these components were all greater than 1, whereas those of subsequent components declined rapidly and then leveled off, indicating their relatively minor contribution to overall variability (Figure 3b).

PC1 exhibited a strong correlation with the start of season (SOS). The spatial distribution of the SOS ranged from 23 to 331 days across the study area, with earlier onset observed in southern Shandong and later dates in northern and mountainous zones (Figure 4a). PC2 was closely associated with temperature during the growing period. Temperature values ranged from 0.07 °C to 16.36 °C, with higher temperatures mainly concentrated in the southwestern part of the province (Figure 4c). PC3 was strongly correlated with the length of season (LOS), which revealed substantial zonal differences in crop development periods. In particular, southern areas experienced longer growing seasons due to warmer climatic conditions (Figure 4b).

The optimal number of clusters was determined as four using the Elbow Method, which involved evaluating the total within-cluster sum of squares alongside the explanatory power of principal components under varying cluster configurations (Figure 3). The final clustering results delineated the study area into four distinct zones (Zone A, Zone B, Zone C, and Zone D), each exhibiting unique phenological and environmental characteristics (Figure 3a).

Zone A (mainly encompassing parts of Weifang and Qingdao) was characterized by early greening and moderate thermal accumulation, creating favorable conditions for winter wheat cultivation. Zone B (including Yantai, Dongying, and portions of Weifang and Zibo) featured higher elevation, resulting in a noticeable delay in phenological development. Zone C (primarily including Liaocheng and parts of Dezhou and Jining) displayed intermediate phenological and meteorological conditions. Zone D (covering Heze and parts of Jining and Zaozhuang) had SOS values predominantly between 45 and 100 days, with an average of 64 days, indicating a significantly earlier growing season onset compared to other zones. This zone’s distinctiveness lay in its early SOS combined with suitable thermal conditions, making its phenological profile notably different from the rest.

4.2. Training Sample Generation

To train zone-specific RF classification models for early-season winter wheat mapping, high-confidence training samples were generated using the TWDTW algorithm with historical data. In this study, the distribution of training samples was relatively balanced, with sufficient quantity and category representation to support reliable model training in each zone.

Using the TWDTW algorithm, the similarity between the standard VH time-series curve of winter wheat for each zone and the corresponding historical sample curves was computed. To determine the optimal threshold, the Otsu automatic thresholding method was applied, resulting in an average threshold value of 1.73 for sample generation across the four zones. The winter wheat and non-winter wheat samples selected under this threshold were then used for zone-specific crop classification training.
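The thresholding step can be sketched with a histogram-based Otsu search over TWDTW distances. This is a minimal illustration, not the authors' code. The number of bins and the function name are assumptions, and the input must contain at least two distinct values.

```python
def otsu_threshold(values, n_bins=64):
    """Return the cut that maximizes the between-class variance of a histogram."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        counts[min(int((v - lo) / width), n_bins - 1)] += 1
    centers = [lo + (b + 0.5) * width for b in range(n_bins)]

    total = len(values)
    total_sum = sum(c * x for c, x in zip(counts, centers))
    best_var, best_cut = -1.0, lo
    w0 = sum0 = 0.0
    for b in range(n_bins - 1):
        w0 += counts[b]
        sum0 += counts[b] * centers[b]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        mean0, mean1 = sum0 / w0, (total_sum - sum0) / w1
        between = w0 * w1 * (mean0 - mean1) ** 2  # proportional to the variance
        if between > best_var:
            best_var, best_cut = between, lo + (b + 1) * width
    return best_cut
```

Pixels whose distance lies below the returned cut would be treated as winter wheat candidates and the remainder as non-winter wheat, in line with the threshold-based selection described above.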

A total of 2423 winter wheat samples and 1022 non-winter wheat samples were collected, resulting in a combined dataset of 3445 samples. Among all zones, Zone C contributed the largest number of training samples, with 1415 in total—comprising 956 winter wheat and 459 non-winter wheat samples. This was primarily due to the extensive and representative winter wheat cultivation areas in this region, which enabled efficient and effective sample extraction. Zone D followed with 1009 samples, while Zones B and A contributed 719 and 302 samples, respectively (Table 4).

4.3. Winter Wheat Early-Season Mapping Results

4.3.1. Feature Selection Results

As illustrated in Figure 5, several spectral bands and vegetation indices—particularly SWIR1, EVI, NDVI, VH, and GCVI—consistently exhibited high VIMs across all four delineated zones, indicating their strong and stable relevance for early-season winter wheat identification.

Despite this overall consistency, notable differences were observed in the importance rankings of features among the individual zones, reflecting the spatial heterogeneity in remote sensing responses. Specifically, the top six features in Zone A were RED, EVI, RE2, SWIR1, SAVI, and VH, while Zone B emphasized EVI, NDVI, VH, IRECI, SWIR1, and GCVI. In Zone C, the most important features included SWIR1, GCVI, RED, VH, NDVI, and GNDVI. For Zone D, SWIR1, VH, NDVI, GCVI, EVI, and RE3 were most relevant.

The observed discrepancies in feature importance can be explained by zonal agroecological variability, such as sowing dates, topographic conditions, and accumulated thermal time across zones. Therefore, the development of zone-specific feature selection strategies is essential to accommodate zonal variability in crop growth patterns and spectral characteristics, ultimately enhancing classification accuracy and generalization. Based on these insights, the top six features in each zone were selected to support subsequent classification tasks.
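Selecting the top six features per zone amounts to ranking features by importance and truncating the list. The sketch below assumes importance scores are available as a name-to-score mapping; the scores are invented, and only the resulting order of the top six mirrors the Zone D ranking reported above.

```python
def top_k_features(importances, k=6):
    """Return the k feature names with the highest importance scores."""
    ranked = sorted(importances.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:k]]

# Invented importance scores (not values from this study).
zone_d_importances = {
    "RED": 0.07, "SWIR1": 0.16, "SAVI": 0.05, "VH": 0.14,
    "NDVI": 0.12, "RE3": 0.08, "GCVI": 0.10, "EVI": 0.09,
}
zone_d_top6 = top_k_features(zone_d_importances, k=6)
```

Running the same function on each zone's scores yields a separate feature subset per zone, which is what allows the classifiers to differ between zones.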

4.3.2. Zonal Early-Season Mapping Results and Analysis

Based on the optimized feature combinations and zone-specific RF, winter wheat classification was conducted across the four delineated zones. The classification model was trained using samples generated through TWDTW and Otsu thresholding (Table 4), while the classification accuracy was assessed using validation samples derived from a combination of survey samples and manually labeled samples based on high-resolution Google Earth imagery (Table 1). The validation samples were evenly distributed across different zones and categories (winter wheat and non-winter wheat) to ensure both reliability and spatial representativeness.

The classification accuracy including UA, PA, and OA for each zone is presented in Table 5, which was derived from a confusion matrix by comparing classification results with the validation samples. Specifically, the confusion matrix was constructed by counting the number of correctly and incorrectly classified validation samples in each category (i.e., winter wheat and non-winter wheat). The diagonal elements of the matrix represent the number of correctly classified samples for each class, while the off-diagonal elements correspond to misclassifications.
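For a two-class problem, these metrics follow directly from the four cells of the confusion matrix. The sketch below uses invented counts and treats winter wheat as the positive class; it also computes the F₁ score, the harmonic mean of UA and PA for that class.

```python
def accuracy_metrics(tp, fp, fn, tn):
    """Compute UA, PA, OA, and F1 for the positive (winter wheat) class."""
    ua = tp / (tp + fp)                   # user's accuracy (precision)
    pa = tp / (tp + fn)                   # producer's accuracy (recall)
    oa = (tp + tn) / (tp + fp + fn + tn)  # overall accuracy
    f1 = 2 * ua * pa / (ua + pa)          # harmonic mean of UA and PA
    return {"UA": ua, "PA": pa, "OA": oa, "F1": f1}

# Invented validation counts, for illustration only.
metrics = accuracy_metrics(tp=90, fp=5, fn=10, tn=95)
```

Here `tp` and `tn` are the diagonal elements of the matrix, while `fp` and `fn` are the off-diagonal misclassifications.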

The earliest identification time was defined as the first time point at which the F₁ score for winter wheat classification exceeded 0.9. This threshold was chosen with reference to the preceding experiments, in which the maximum F₁ score was 0.95 in Zones A and C, while the remaining two zones (B and D) reached 0.94 and 0.96, respectively; a threshold of 0.9 therefore lies close to, but below, the best performance achieved in every zone.

The earliest identification dates were derived from the temporal trends shown in Figure 6. The earliest identification time for winter wheat was 1 December in Zone D, 31 December in Zone C, 20 January in Zone A, and 30 January in Zone B, each corresponding to the first occurrence of an F₁ score exceeding 0.9. These findings highlight the spatial variability in phenological development across zones and validate the effectiveness of the proposed zone-specific classification strategy for achieving timely and accurate winter wheat identification.
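The timing criterion can be expressed as a scan over a dated F₁ series. In the sketch below the dates, the 10-day step, and the F₁ values are all invented; they are chosen only so that the threshold is first exceeded on 1 December, as reported for Zone D.

```python
from datetime import date

def earliest_identification(dates, f1_scores, threshold=0.9):
    """Return the first date whose F1 score exceeds the threshold, or None."""
    for day, f1 in zip(dates, f1_scores):
        if f1 > threshold:
            return day
    return None

# Invented F1 trajectory for one zone (not values from this study).
dates = [date(2021, 11, 1), date(2021, 11, 11), date(2021, 11, 21),
         date(2021, 12, 1), date(2021, 12, 11)]
f1_scores = [0.62, 0.78, 0.88, 0.93, 0.95]
earliest = earliest_identification(dates, f1_scores)
```

The function returns `None` when the threshold is never exceeded, which would indicate that the available time series is still too short for that zone.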

4.3.3. Comparison Between Zonal and Non-Zonal Mapping Approaches

In this study, early-season winter wheat mapping results derived from the proposed zonal strategy were systematically compared with those obtained using a non-zonal approach in Shandong Province (Table 6 and Figure 7). The results indicated that the zonal classification strategy significantly increased OA and improved the spatial consistency of the classification outcomes.

In terms of classification accuracy, the non-zonal method achieved an OA of 94.03%. In contrast, the zonal method yielded a markedly improved OA of 97.63%, with higher accuracies observed across all zones. Specifically, Zone C, where the planting area of winter wheat was extensive and winter crop types were relatively homogeneous, attained the highest OA of 98.94%. In Zone A, although the OA was comparatively lower at 95.76%, it still outperformed the non-zonal approach. The detailed accuracy assessments conducted for each zone provided additional evidence supporting the robustness and effectiveness of the proposed method: in Zone A (20 January), the UA and PA for winter wheat were 93.44% and 97.44%, respectively; in Zone B (30 January), these values reached 95.73% and 99.09%; in Zone C (31 December), they were 97.65% and 99.49%; and in Zone D (1 December), the PA reached 99.32%, with an OA of 97.02%.
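The gain in OA reported here is an absolute difference in percentage points rather than a relative change. Using the two provincial OA values given above, and with the subscripts introduced only for this illustration:

```latex
\Delta\mathrm{OA}
  = \mathrm{OA}_{\text{zonal}} - \mathrm{OA}_{\text{non-zonal}}
  = 97.63\% - 94.03\%
  = 3.60\ \text{percentage points},
\qquad
\frac{\Delta\mathrm{OA}}{\mathrm{OA}_{\text{non-zonal}}}
  = \frac{3.60}{94.03} \approx 3.8\%.
```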

Regarding identification timeliness, the non-zonal method typically recognized winter wheat around mid-January, whereas the zonal method allowed earlier, zone-specific identification that better aligned with local phenological development. Although Shandong Province lies entirely within the warm temperate zone, intra-provincial climatic variability, driven by topography and maritime influence, led to significant differences in the greening periods of winter wheat. For instance, in Zone D, located in the inland southwest (e.g., Heze), winter wheat was sown earlier, and the greening stage was reached as early as 1 December. In contrast, eastern coastal areas such as Zone A (e.g., Qingdao) experienced slower warming due to maritime regulation, and early identification was only feasible by 20 January. These findings demonstrated that the zonal strategy effectively accommodated zonal phenological variability, thereby improving both the accuracy and timeliness of early-season winter wheat identification.

Figure 8 presents a detailed comparison of identification results across different zones. In Zone B(a), characterized by complex mountainous terrain, the non-zonal method exhibited substantial sparsity and failed to accurately delineate the spatial distribution of winter wheat. By contrast, the zonal approach successfully incorporated spatial heterogeneity across zones, thereby substantially enhancing classification performance.

In the flat terrain of Zone B(b), the zonal method corrected the misclassification issues observed in the non-zonal results. It not only enhanced the detection accuracy of winter wheat but also significantly suppressed salt-and-pepper noise, thereby improving the overall classification performance. In Zones A and C, the zonal method also demonstrated notable improvements in both classification accuracy and consistency compared with the non-zonal approach. By contrast, the performance gap between the two methods was less pronounced in Zone D, likely due to the similar early identification times under both strategies in this zone.

4.3.4. Comparison with Official Statistics

To further evaluate the reliability of the classification results, the early-season winter wheat area in Shandong Province was quantitatively estimated and compared with official planting area statistics from the Shandong Provincial Bureau of Statistics (2021). The relative error was −3.6%, indicating that the identified area was slightly smaller than the reported figure, thereby demonstrating a high level of classification accuracy.

Moreover, to assess the spatial consistency of winter wheat classification, official statistics on planting areas from 16 prefecture-level cities in Shandong Province were gathered, and the mapped results were evaluated at the municipal level. The validation results (Figure 9) revealed a strong agreement between the zonal early-season identification results and the official statistics, with an R² value of 0.97. This further confirmed the reliability and robustness of the zonal classification approach proposed in this study.
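Both checks are simple to reproduce. The sketch below assumes that the relative error is (mapped − official) / official and that R² is the squared Pearson correlation between mapped and official areas; the source does not state the exact formulas, and the area values are invented.

```python
def relative_error(mapped_area, official_area):
    """Signed relative error; negative means the mapped area is smaller."""
    return (mapped_area - official_area) / official_area

def r_squared(x, y):
    """Squared Pearson correlation between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)

# Invented city-level areas in arbitrary units (not values from this study).
official = [100.0, 250.0, 400.0, 320.0]
mapped = [96.0, 245.0, 380.0, 315.0]
province_error = relative_error(sum(mapped), sum(official))
city_r2 = r_squared(official, mapped)
```

A negative `province_error` corresponds to the situation described above, in which the mapped area is slightly smaller than the reported figure.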

4.4. Confusion Analysis Between Winter Wheat and Garlic

In Shandong Province, garlic is one of the primary confounding overwintering crops besides winter wheat, due to its high similarity in both phenological cycles and spectral characteristics. This issue was particularly prominent in key garlic cultivation zones, such as Jinxiang County in Jining City, where adjacent and interspersed fields posed challenges for accurate classification.

To evaluate the feasibility of distinguishing garlic from winter wheat in remote sensing, this study selected Jinxiang County as a representative area. Multi-temporal remote sensing images from the 2021–2022 growing period were used to construct time-series features, and a total of 291 manually labeled garlic samples were obtained. Feature selection and classification experiments were subsequently conducted. The feature selection process identified that the top six features were VH, NDVI, IRECI, GCVI, RE2, and RED, three of which were also identified as key features in winter wheat classification within the same zone. This finding indicated that the proposed method exhibited a degree of feature universality when distinguishing crops with similar phenology and spectral responses.
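The overlap between the two feature sets can be checked with a set intersection. The sketch below compares the garlic features with the Zone D winter wheat features listed in Section 4.3.1; treating Zone D as the zone containing Jinxiang County is an assumption, since the text does not name the zone.

```python
# Top six features as reported in the text.
garlic_top6 = {"VH", "NDVI", "IRECI", "GCVI", "RE2", "RED"}
zone_d_wheat_top6 = {"SWIR1", "VH", "NDVI", "GCVI", "EVI", "RE3"}

# Features that rank in the top six for both crops.
shared_features = garlic_top6 & zone_d_wheat_top6
```

Under this assumption the shared features are VH, NDVI, and GCVI, which agrees with the count of three stated above.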

Using the optimized time-series features for classification, an OA of 0.97 and an F₁ score of 0.94 were achieved, demonstrating that garlic and winter wheat remained distinguishable despite the close timing and similarity in feature expression. Recent studies have investigated the spatial mapping of winter wheat and garlic [5,13]. In this study, the proposed method achieved high classification accuracy, demonstrating its effectiveness in distinguishing between these spectrally and phenologically similar crops.

Additionally, distribution maps for winter wheat and garlic were generated to illustrate their spatial distribution in Jinxiang County (Figure 10a), along with a detailed view of the classification results (Figure 10b) and the corresponding optical image (Figure 10c). The results indicated that the garlic planting area was significantly larger than that of winter wheat. Moreover, the classification method using optimized features yielded high accuracy even where winter wheat and garlic were planted in adjacent fields.

5. Discussion

5.1. Cross-Year Experimental Result Validation

To further examine the effectiveness of the adopted strategy, an early-season winter wheat identification experiment was carried out using historical data from Liaocheng City, a representative area located within Zone C. Liaocheng was selected for this interannual transfer experiment due to its stable winter wheat cultivation regime and typical agroecological conditions, making it an ideal case for evaluating the generalizability and robustness of the proposed method across different years. For the 2017–2018 growing period, when the time-series remote sensing data were extended to 10 January 2018, the classification achieved a stable F₁ score of 0.96 and an overall accuracy of 0.98, indicating high classification performance even under historical conditions. Figure 11 illustrates the early-season winter wheat mapping results for Liaocheng, along with comparative visualizations of the imagery and classification outcomes for selected areas.

In addition, the phenological stages of winter wheat in Liaocheng during the 2017–2018 and 2021–2022 growing seasons were compared using the NDVI peak detection method [41]. Results indicated that the sowing date in 2017 was 22 October, with the earliest identification time on 10 January 2018; for 2021, the sowing date was 16 October, with the earliest identification achieved by 31 December. In both cases, winter wheat was successfully identified within three months of sowing (80 and 76 days, respectively). These findings were consistent with existing studies in terms of both timing and spatial distribution patterns [17]. The result indicated that the identification timing of the proposed zonal method tracks the phenological development of the crop, further supporting its temporal robustness and adaptability.
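The interval between sowing and earliest identification follows directly from the dates given above. A short check using only the standard library (variable names are illustrative):

```python
from datetime import date

# Sowing and earliest identification dates reported for Liaocheng.
seasons = {
    "2017-2018": (date(2017, 10, 22), date(2018, 1, 10)),
    "2021-2022": (date(2021, 10, 16), date(2021, 12, 31)),
}

# Days elapsed between sowing and earliest identification in each season.
days_to_identification = {
    season: (identified - sown).days
    for season, (sown, identified) in seasons.items()
}
```

The two intervals differ by only four days, which is consistent with the identification time shifting together with the sowing date.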

5.2. Comparison with Publicly Available Winter Wheat Datasets

This study employed two publicly available winter wheat datasets for comparative analysis: ChinaWheat10, the 2021 winter wheat distribution map derived from the Automatic Training Data Generation (ATDG) framework, and ChinaWheatMap10, the 2021 winter wheat planting distribution map covering eight major producing provinces. Both datasets were developed at large spatial scales using uniform classification models, a strategy commonly adopted in previous studies [20,42]. However, such an approach often fails to account for detailed differences across zones.

As illustrated in Figure 12, the proposed zonal classification strategy exhibited clear advantages in enhancing the separability between winter wheat and other land cover types. It also achieved superior delineation of field boundaries, with higher spatial clarity and classification accuracy. For instance, in Qingzhou City within Zone B—an area characterized by dense distributions of vegetable greenhouses and fragmented, irregular-shaped plots—the zonal strategy improved winter wheat identification performance by mitigating classification confusion.

In Zoucheng City within Zone C, which features hilly terrain and dispersed farmland with small field sizes, winter wheat mapping remained challenging. However, the zonal classification approach effectively improved classification outcomes under such complex zonal conditions, thereby enhancing the spatial mapping accuracy of winter wheat in mountainous zones.

5.3. Significance, Limitations, and Future Prospects

This study proposed a zonal early-season winter wheat identification framework that integrates phenological and environmental factors. Results demonstrated that by dividing Shandong Province into four zones and optimizing zone-specific classification strategies, high early-season classification accuracy for winter wheat could be achieved. In Zone D in southwestern Shandong, characterized by flat terrain and higher average temperatures than other areas, winter wheat could be mapped as early as early December, with an OA of 97.02%. In contrast, in Zone B, which is characterized by predominantly mountainous and hilly terrain with lower average temperatures, winter wheat could not be identified until late January, at which point the OA reached 97.52%. These findings validated the effectiveness of incorporating phenological and environmental heterogeneity into classification strategies and demonstrated the value of the proposed framework for enhancing the timeliness and precision of early-season crop mapping. The proposed approach not only improves early-season crop mapping accuracy but also provides technical support for precision agriculture, food security monitoring, and disaster response efforts.

Despite these advancements, several limitations remain. With respect to remote sensing imagery, this study primarily relied on Sentinel-2 data with a spatial resolution of 10 m. Other early-season winter wheat mapping studies [43] have shown that 10 m imagery has limitations in capturing field boundaries accurately in heterogeneous or mountainous regions, and existing research has investigated the impact of various remote sensing images on early-season crop classification [44,45]. Future work may benefit from integrating multi-source satellite imagery with higher spatial resolution, such as GF-2 and Planet imagery, to extract finer spatial and temporal features and support more accurate early-season crop identification.

Regarding the zonal strategy, administrative counties were used as basic analytical units in the study due to the accessibility of statistical data and relevance to policy applications. Nevertheless, administrative divisions may not align with ecological patterns and often fail to capture spatial heterogeneity in key environmental variables such as climate, elevation, and soil properties, thereby constraining model generalizability. Recent studies have proposed regular grid-based or agroecological zoning schemes as alternatives to better reflect underlying environmental variability, offering promising directions for refining the current framework [46].

In this study, to generate different zones, we adopted a combination of the Silhouette Coefficient and the Elbow Method, along with the K-means clustering algorithm, which are widely used in the existing literature [25] and have been proven effective in delineating spatially homogeneous zones in our study. However, the choice of clustering metrics and algorithms can significantly affect the outcomes of zonal division. Therefore, future research will systematically compare various clustering strategies, such as hierarchical clustering and density-based clustering, and explore alternative clustering evaluation indices. These efforts aim to enhance the robustness and adaptability of the zoning framework under diverse topographic and phenological conditions.
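How the Elbow Method and the Silhouette Coefficient work together with K-means can be sketched in plain Python. The points below are invented two-dimensional scores (for example, two principal components per analytical unit), the evenly spaced initialization is a simplification chosen for reproducibility, and none of the function names come from this study.

```python
import math

def kmeans(points, k, max_iter=100):
    """Lloyd's algorithm with a deterministic, evenly spaced initialization."""
    n = len(points)
    centroids = [points[i * n // k] for i in range(k)]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return labels, centroids

def inertia(points, labels, centroids):
    """Within-cluster sum of squared distances (the Elbow Method quantity)."""
    return sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))

def silhouette(points, labels):
    """Mean silhouette coefficient; singleton clusters score 0."""
    clusters = sorted(set(labels))
    scores = []
    for i, p in enumerate(points):
        own = [math.dist(p, q) for j, q in enumerate(points)
               if labels[j] == labels[i] and j != i]
        if not own:
            scores.append(0.0)
            continue
        a = sum(own) / len(own)  # mean distance within the own cluster
        b = min(                 # mean distance to the nearest other cluster
            sum(math.dist(p, q) for q, lab in zip(points, labels) if lab == c)
            / labels.count(c)
            for c in clusters if c != labels[i]
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

# Invented scores forming two well-separated groups.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
inertias, silhouettes = {}, {}
for k in range(1, 4):
    labels, centroids = kmeans(points, k)
    inertias[k] = inertia(points, labels, centroids)
    if k > 1:
        silhouettes[k] = silhouette(points, labels)
best_k = max(silhouettes, key=silhouettes.get)
```

In this toy case the inertia drops sharply from k = 1 to k = 2 and only slightly afterwards, while the silhouette coefficient peaks at k = 2, so both criteria point to the same number of clusters.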

Regarding the classification algorithm, this study exclusively employed the RF classifier, which has been widely validated in crop mapping for its robustness and interpretability [12,40]. While the method has demonstrated strong performance, we acknowledge the potential of alternative approaches. In future work, we plan to incorporate and compare additional algorithms, such as support vector machines, gradient boosting, and deep learning models, to further enhance classification accuracy and improve adaptability across diverse agricultural scenarios.

6. Conclusions

This study proposed a zonal early-season identification strategy to address the spatial heterogeneity in large-scale winter wheat mapping. By integrating phenological and environmental factors, Shandong Province was divided into four representative zones, each with tailored classification schemes. The results demonstrated significant improvements in accuracy, timeliness, and spatial consistency.

The main contributions are as follows:

(1) Zonal strategy effectiveness: Zoning based on phenological and environmental factors reduced large-scale heterogeneity and supported high classification accuracy.

(2) Optimized identification timing: Zone-specific identification windows were established. The earliest mapping occurred in Zone D in early December, with an OA of 97.02%. Zones A–C reached their optimal identification periods between late December and late January, each attaining an OA above 95%.

(3) Reliability and comparison: Validation against official statistics showed a relative area error of −3.6% at the provincial level and an R² of 0.97 across 16 municipalities. The zonal strategy improved overall accuracy by 3.6 percentage points over a non-zonal approach and outperformed public datasets such as ChinaWheat10 in spatial detail and classification precision.

(4) Feature adaptability: Zone-specific features enhanced classification. The method also successfully distinguished winter wheat from garlic, a spectrally similar crop, achieving an F₁ score of 0.94, indicating strong generalization capability.

(5) Temporal robustness: Historical validation using 2017–2018 data in Liaocheng produced consistent results, with an OA of 0.98 and an F₁ score of 0.96. The identification timing was consistent with the historical phenology.

In summary, the proposed zonal strategy proved accurate, robust, and adaptable for early-season winter wheat mapping at large regional scales.

Author Contributions

J.C.: writing—original draft, methodology, visualization. C.W.: resources, investigation, formal analysis. X.D.: supervision. C.C.: methodology, visualization. G.F.: supervision, writing—review and editing. Z.W. and M.L.: resources, investigation. H.Z.: conceptualization, supervision, writing—review and editing, funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 42471364), the Natural Science Foundation of Shandong Province (Grant No. ZR2024MD004), and the Jinan City Municipal-School Integration Development Strategic Project (Grant No. JNSX2023036).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to privacy concerns. Requests for data should include a brief description of the intended use or research purpose for evaluation of sharing eligibility.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Kussul, N.; Lavreniuk, M.; Skakun, S.; Shelestov, A. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data. IEEE Geosci. Remote Sens. Lett. 2017, 14, 778–782.
2. Liu, X. International perspectives on food safety and regulations—A need for harmonized regulations: Perspectives in China. J. Sci. Food Agric. 2014, 94, 1928–1931.
3. Howard, D.M.; Wylie, B.K. Annual Crop Type Classification of the US Great Plains for 2000 to 2011. Photogramm. Eng. Remote Sens. 2014, 80, 537–549.
4. Long, J.A.; Lawrence, R.L.; Greenwood, M.C.; Marshall, L.; Miller, P.R. Object-oriented crop classification using multitemporal ETM+ SLC-off imagery and random forest. GISci. Remote Sens. 2013, 50, 418–436.
5. Liu, X.Y.; Li, X.H.; Gao, L.X.; Zhang, J.S.; Qin, D.P.; Wang, K.; Li, Z.H. Early-season and refined mapping of winter wheat based on phenology algorithms—A case of Shandong, China. Front. Plant Sci. 2023, 14, 1016890.
6. Kussul, N.; Mykola, L.; Shelestov, A.; Skakun, S. Crop inventory at regional scale in Ukraine: Developing in season and end of season crop maps with multi-temporal optical and SAR satellite imagery. Eur. J. Remote Sens. 2018, 51, 627–636.
7. Mao, M.X.; Zhao, H.W.; Tang, G.L.; Ren, J.Q. In-Season Crop Type Detection by Combing Sentinel-1A and Sentinel-2 Imagery Based on the CNN Model. Agronomy 2023, 13, 1723.
8. Lin, C.; Zhong, L.; Song, X.-P.; Dong, J.; Lobell, D.B.; Jin, Z. Early- and in-season crop type mapping without current-year ground truth: Generating labels from historical information via a topology-based approach. Remote Sens. Environ. 2022, 274, 112994.
9. Maus, V.; Câmara, G.; Appel, M.; Pebesma, E. dtwSat: Time-Weighted Dynamic Time Warping for Satellite Image Time Series Analysis in R. J. Stat. Softw. 2019, 88, 1–31.
10. You, N.S.; Dong, J.W. Examining earliest identifiable timing of crops using all available Sentinel 1/2 imagery and Google Earth Engine. ISPRS J. Photogramm. Remote Sens. 2020, 161, 109–123.
11. Chen, R.Q.; Sun, L.; Chen, Z.X.; Wuyun, D.J.; Sun, Z. Early Identification of Corn and Soybean Using Crop Growth Curve Matching Method. Agronomy 2024, 14, 146.
12. Luo, K.; Lu, L.; Xie, Y.; Chen, F.; Yin, F.; Li, Q. Crop type mapping in the central part of the North China Plain using Sentinel-2 time series and machine learning. Comput. Electron. Agric. 2023, 205, 107577.
13. Guo, Y.; Xia, H.M.; Zhao, X.Y.; Qiao, L.X.; Du, Q.; Qin, Y.C. Early-Season Mapping of Winter Wheat and Garlic in Huaihe Basin Using Sentinel-1/2 and Landsat-7/8 Imagery. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2023, 16, 8809–8817.
14. Huang, X.D.; Huang, J.X.; Li, X.C.; Shen, Q.R.; Chen, Z.C. Early mapping of winter wheat in Henan province of China using time series of Sentinel-2 data. GISci. Remote Sens. 2022, 59, 1534–1549.
15. Huang, H.J.; Ma, J.H.; Yang, Y.F. Spatial heterogeneity of driving factors for urban heat health risk in Chongqing, China: A new identification method and proposal of planning response framework. Ecol. Indic. 2023, 153, 110449.
16. Dong, J.; Xiao, X.; Menarguez, M.A.; Zhang, G.; Qin, Y.; Thau, D.; Biradar, C.; Moore, B. Mapping paddy rice planting area in northeastern Asia with Landsat 8 images, phenology-based algorithm and Google Earth Engine. Remote Sens. Environ. 2016, 185, 142–154.
17. Zhang, H.Y.; Du, H.Y.; Zhang, C.K.; Zhang, L.P. An automated early-season method to map winter wheat using time-series Sentinel-2 data: A case study of Shandong, China. Comput. Electron. Agric. 2021, 182, 105962.
18. Yang, G.X.; Yu, W.G.; Yao, X.; Zheng, H.B.A.; Cao, Q.; Zhu, Y.; Cao, W.X.; Cheng, T. AGTOC: A novel approach to winter wheat mapping by automatic generation of training samples and one-class classification on Google Earth Engine. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102446.
19. Feng, Y.Y.; Chen, B.Y.; Liu, W.; Xue, X.R.; Liu, T.Q.; Zhu, L.Y.; Xing, H.Q. Winter Wheat Mapping in Shandong Province of China with Multi-Temporal Sentinel-2 Images. Appl. Sci. 2024, 14, 3940.
20. Fu, Y.Y.; Shen, R.Q.; Song, C.Q.; Dong, J.; Han, W.; Ye, T.; Yuan, W.P. Exploring the effects of training samples on the accuracy of crop mapping with machine learning algorithm. Sci. Remote Sens. 2023, 7, 100081.
21. Jönsson, P.; Eklundh, L. TIMESAT—A program for analyzing time-series of satellite sensor data. Comput. Geosci. 2004, 30, 833–845.
22. Hu, J.K.; Zhang, B.; Peng, D.L.; Huang, J.X.; Zhang, W.J.; Zhao, B.; Li, Y.; Cheng, E.H.; Lou, Z.H.; Liu, S.W.; et al. Mapping 10-m harvested area in the major winter wheat-producing regions of China from 2018 to 2022. Sci. Data 2024, 11, 1038.
23. Yang, G.X.; Li, X.R.; Liu, P.Z.; Yao, X.; Zhu, Y.; Cao, W.X.; Cheng, T. Automated in-season mapping of winter wheat in China with training data generation and model transfer. ISPRS J. Photogramm. Remote Sens. 2023, 202, 422–438.
24. Dong, J.; Fu, Y.Y.; Wang, J.J.; Tian, H.F.; Fu, S.; Niu, Z.; Han, W.; Zheng, Y.; Huang, J.X.; Yuan, W.P. Early-season mapping of winter wheat in China based on Landsat and Sentinel images. Earth Syst. Sci. Data 2020, 12, 3081–3095.
25. Fang, G.R.; Wang, C.; Dong, T.F.; Wang, Z.M.; Cai, C.; Chen, J.Q.; Liu, M.Y.; Zhang, H.X. A Landscape-Clustering Zoning Strategy to Map Multi-Crops in Fragmented Cropland Regions Using Sentinel-2 and Sentinel-1 Imagery with Feature Selection. Agriculture 2025, 15, 186.
26. Cumming, S.; Vernier, P. Statistical models of landscape pattern metrics, with applications to regional scale dynamic forest simulations. Landsc. Ecol. 2002, 17, 433–444.
27. Fatchurrachman; Rudiyanto; Soh, N.C.; Shah, R.M.; Giap, S.G.E.; Setiawan, B.I.; Minasny, B. High-Resolution Mapping of Paddy Rice Extent and Growth Stages across Peninsular Malaysia Using a Fusion of Sentinel-1 and 2 Time Series Data in Google Earth Engine. Remote Sens. 2022, 14, 1875.
28. Amoroso, N.; Cilli, R.; Nitti, D.O.; Nutricato, R.; Iban, M.C.; Maggipinto, T.; Tangaro, S.; Monaco, A.; Bellotti, R. PSI Spatially Constrained Clustering: The Sibari and Metaponto Coastal Plains. Remote Sens. 2023, 15, 2560.
29. Segarra, J.; Buchaillot, M.L.; Araus, J.L.; Kefauver, S.C. Remote Sensing for Precision Agriculture: Sentinel-2 Improved Features and Applications. Agronomy 2020, 10, 641.
30. Tian, G.X.; Li, H.P.; Feng, X.; Jiang, Q.; Li, N.; Guo, Z.W.; Zhao, J.H.; Yang, H.J. An automatic method for rice mapping in Taishan, China using Sentinel-1A Time-series images. Remote Sens. Lett. 2024, 15, 99–109.
31. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002, 83, 195–213.
32. Chandrasekar, K.; Sai, M.; Roy, P.S.; Dwevedi, R.S. Land Surface Water Index (LSWI) response to rainfall and NDVI using the MODIS Vegetation Index product. Int. J. Remote Sens. 2010, 31, 3987–4005.
33. Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979, 8, 127–150.
34. Gitelson, A.A.; Viña, A.; Ciganda, V.; Rundquist, D.C.; Arkebauer, T.J. Remote estimation of canopy chlorophyll content in crops. Geophys. Res. Lett. 2005, 32.
35. Korhonen, L.; Hadi; Packalen, P.; Rautiainen, M. Comparison of Sentinel-2 and Landsat 8 in the estimation of boreal forest canopy cover and leaf area index. Remote Sens. Environ. 2017, 195, 259–274.
36. Gitelson, A.A.; Kaufman, Y.J.; Merzlyak, M.N. Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sens. Environ. 1996, 58, 289–298.
37. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309.
38. Huang, H.; Chen, Y.; Clinton, N.; Wang, J.; Wang, X.; Liu, C.; Gong, P.; Yang, J.; Bai, Y.; Zheng, Y.; et al. Mapping major land cover dynamics in Beijing using all Landsat images in Google Earth Engine. Remote Sens. Environ. 2017, 202, 166–176.
39. Wei, P.; Ye, H.C.; Qiao, S.T.; Liu, R.H.; Nie, C.J.; Zhang, B.R.; Song, L.J.; Huang, S.Y. Early Crop Mapping Based on Sentinel-2 Time-Series Data and the Random Forest Algorithm. Remote Sens. 2023, 15, 3212.
40. Belgiu, M.; Dragut, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31.
41. Zhang, X.C.; Kuang, M.K.; Shi, L.S. Extracting the phenological periods of winter wheat at field scale based on the characteristics of NDVI time series curves from multisource remote sensing images. Trans. CSAE 2025, 41, 181–191.
42. Chong, L.; Liu, H.J.; Lu, L.P.; Liu, Z.R.; Kong, F.C.; Zhang, X.L. Monthly composites from Sentinel-1 and Sentinel-2 images for regional major crop mapping with Google Earth Engine. J. Integr. Agric. 2021, 20, 1944–1957.
43. Cai, Z.W.; Hu, Q.; Zhang, X.Y.; Yang, J.Y.; Wei, H.D.; Wang, J.Y.; Zeng, Y.L.; Yin, G.F.; Li, W.J.; You, L.Z.; et al. Improving agricultural field parcel delineation with a dual branch spatiotemporal fusion network by integrating multimodal satellite data. ISPRS J. Photogramm. Remote Sens. 2023, 205, 34–49.
44. Wang, C.; Zhang, X.Y.; Wang, W.J.; Wei, H.D.; Wang, J.Y.; Li, Z.X.; Li, X.N.; Wu, H.; Hu, Q. Understanding the potentials of early-season crop type mapping by using Landsat-8, Sentinel-1/2, and GF-1/6 data. Comput. Electron. Agric. 2024, 224, 109239.
45. Zhang, C.; Di, L.P.; Lin, L.; Li, H.; Guo, L.Y.; Yang, Z.W.; Yu, E.G.; Di, Y.H.; Yang, A.N. Towards automation of in-season crop type mapping using spatiotemporal crop information and remote sensing data. Agric. Syst. 2022, 201, 103462.
46. Xin, J.X.; Peng, Y.; Peng, N.Y.; Yang, L.Y.; Huang, J.J.; Yuan, J.X.; Wei, B.S.; Ren, Y.M. Both class- and landscape-level patterns influence crop yield. Eur. J. Agron. 2024, 153, 127057.

Figure 1. Description of the study area. (a) The location of Shandong Province in China. (b) Digital elevation model (DEM) of the study area. (c) Survey samples along with the manually labeled samples.

Figure 2. The overall flowchart of the study.

Figure 3. Zoning process and results. (a) Zonal results of Shandong Province; (b) the eigenvalues of the principal components, with eigenvalues greater than 1 indicated above the yellow line; (c) Elbow Method-based optimal cluster number determination.

Figure 4. The features strongly correlated with the first three principal components. (a) SOS refers to the start of season, (b) LOS refers to the length of season, and (c) temperature.

Figure 5. Presentation of the feature importance ranking for each zone.

Figure 6. The F₁ score along with the earliest identification time for each zone. The purple dashed line indicates an F₁ score of 0.9, while the four red vertical dashed lines represent the early identification times for the four respective zones.

Figure 7. Comparison of zonal and non-zonal early-season mapping results. (a) Non-zonal early-season mapping results; (b) zonal early-season mapping results; (c) spatial distribution of prefecture-level cities in Shandong.

Figure 8. Comparative details of zonal and non-zonal early-season mapping results. Specifically, Zone B illustrates (a) the results for mountainous areas and (b) the results for plain areas.

Figure 9. Comparison of mapped area and official statistics.

Figure 10. Early-season mapping results of winter wheat and garlic. (a) Early mapping results of winter wheat and garlic in Jinxiang County; (b) detailed view of the classification results for winter wheat and garlic; (c) optical image of the detailed area, acquired in mid-April.

Figure 11. Early-season mapping of winter wheat in different years and detailed analysis. (a) Early identification results of winter wheat in Liaocheng City; (b) location of Liaocheng City within Shandong Province; (c1) optical image of the detailed area on the eastern side of Liaocheng City; (c2) winter wheat extraction results for the detailed area on the eastern side of Liaocheng City; (d1) optical image of the detailed area on the western side of Liaocheng City; (d2) winter wheat extraction results for the detailed area on the western side of Liaocheng City.

Figure 12. Comparison of results with dataset details. The Sentinel-2 imagery, displayed in standard false color and captured in mid-March, shows distinct winter wheat characteristics. The second and third columns show the winter wheat dataset maps, and the fourth column presents the zonal recognition results. Zones A, B, C, and D correspond to four zones. Orange circles highlight misclassification, while black circles show boundary delineation.

Table 1. The number of validation samples in 2021.

Sample Types	Winter Wheat	Non-Winter Wheat	Total
Survey samples	134	109	243
Manually labeled samples	225	459	684
Total	359	568	927

Table 2. Selected phenological features and environmental factors in the study.

	Factors	Implication
Phenological Features	SOS (Start of Season)	The time when plant growth begins, marked by an initial rise in the vegetation index.
	Amplitude	The difference between the peak and baseline vegetation index, indicating growth intensity.
	EOS (End of Season)	The time when the vegetation index starts decreasing, marking dormancy onset.
	LOS (Length of Season)	The duration of the growing season, i.e., the time elapsed between SOS and EOS.
	LI (Large Integral)	The cumulative sum of the vegetation index over the growing season, reflecting total vegetation productivity.
	MVV (Maximum Value of Vegetation)	The highest vegetation index value, representing peak growth.
	Left Derivative	The rate of vegetation increase during early growth, indicating the transition speed.
	Right Derivative	The rate of vegetation decline, indicating the shift to dormancy.
Environmental Factors	Precipitation	A key water source for vegetation, affecting soil moisture and crop growth.
	Temperature	Influences plant growth rate; extreme temperatures can hinder crops.
	Evapotranspiration	Measures water loss from soil and plants, affecting moisture balance.
	Slope	Describes land steepness, impacting water runoff, erosion, and soil moisture.
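
The phenological features in Table 2 can be illustrated with a small example. The sketch below is a minimal illustration, not the extraction procedure used in the study: it assumes a single-season vegetation index curve and a threshold convention in which SOS and EOS are the first and last dates at which the index reaches a fixed fraction of the seasonal amplitude. The function name `phenology_metrics`, the `fraction` parameter, and the sample values are hypothetical.

```python
def phenology_metrics(days, values, fraction=0.2):
    """Derive simple phenological features from one vegetation index curve.

    Assumption: SOS/EOS are the first/last dates at which the index is at or
    above base + fraction * amplitude (a common threshold convention).
    """
    base = min(values)            # baseline vegetation index
    peak = max(values)            # MVV: maximum value of vegetation
    amplitude = peak - base       # difference between peak and baseline
    threshold = base + fraction * amplitude

    # Dates on which the curve is at or above the threshold.
    above = [d for d, v in zip(days, values) if v >= threshold]
    sos, eos = above[0], above[-1]

    return {
        "SOS": sos,
        "EOS": eos,
        "LOS": eos - sos,         # length of season = EOS - SOS
        "Amplitude": amplitude,
        "MVV": peak,
    }


# Hypothetical curve sampled every 10 days (values are illustrative only).
days = [0, 10, 20, 30, 40, 50, 60]
ndvi = [0.20, 0.25, 0.50, 0.80, 0.60, 0.30, 0.20]
result = phenology_metrics(days, ndvi)
print(result)
```

With this convention LOS is a duration (EOS minus SOS), whereas the large integral (LI) is the accumulated index value over the same period.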

Table 3. Summary of the feature types employed in the study.

Feature Types	Features	Description
Spectral features	Sentinel-2 bands	$\rho_{blue}, \rho_{green}, \rho_{red}, \rho_{red\_edge1}, \rho_{red\_edge2}, \rho_{red\_edge3}, \rho_{red\_edge4}, \rho_{nir}, \rho_{swir1}, \rho_{swir2}$
	Enhanced Vegetation Index (EVI) [31]	$EVI = 2.5 \times \frac{\rho_{nir} - \rho_{red}}{\rho_{nir} + 6 \times \rho_{red} - 7.5 \times \rho_{blue} + 1}$
	Land Surface Water Index (LSWI) [32]	$LSWI = \frac{\rho_{nir} - \rho_{swir1}}{\rho_{nir} + \rho_{swir1}}$
	Normalized Difference Vegetation Index (NDVI) [33]	$NDVI = \frac{\rho_{nir} - \rho_{red}}{\rho_{nir} + \rho_{red}}$
	Green Chlorophyll Vegetation Index (GCVI) [34]	$GCVI = \frac{\rho_{nir}}{\rho_{green}} - 1$
	Inverted Red-Edge Chlorophyll Index (IRECI) [35]	$IRECI = \frac{\rho_{nir} - \rho_{red}}{\rho_{red\_edge1} + 1}$
	Green Normalized Difference Vegetation Index (GNDVI) [36]	$GNDVI = \frac{\rho_{nir} - \rho_{green}}{\rho_{nir} + \rho_{green}}$
	Soil-Adjusted Vegetation Index (SAVI) [37]	$SAVI = \frac{\rho_{nir} - \rho_{red}}{\rho_{nir} + \rho_{red} + L} \times (1 + L)$
Radar features	Backscattering coefficient	$VV, VH$
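
As an illustration of how the spectral indices in Table 3 are computed, the following minimal sketch evaluates those that need only the blue, green, red, NIR, and SWIR1 bands for a single pixel. It assumes surface reflectance scaled to the range 0–1 and a soil adjustment factor L = 0.5 for SAVI, a commonly used default that is an assumption here. The function name and the sample reflectance values are hypothetical.

```python
def vegetation_indices(blue, green, red, nir, swir1, L=0.5):
    """Compute per-pixel spectral indices from surface reflectance (0-1).

    L is the SAVI soil adjustment factor; 0.5 is an assumed default.
    """
    return {
        "EVI": 2.5 * (nir - red) / (nir + 6 * red - 7.5 * blue + 1),
        "LSWI": (nir - swir1) / (nir + swir1),
        "NDVI": (nir - red) / (nir + red),
        "GCVI": nir / green - 1,
        "GNDVI": (nir - green) / (nir + green),
        "SAVI": (nir - red) / (nir + red + L) * (1 + L),
    }


# Hypothetical reflectance values for a vegetated pixel.
indices = vegetation_indices(blue=0.05, green=0.10, red=0.10, nir=0.50, swir1=0.20)
for name, value in indices.items():
    print(f"{name}: {value:.4f}")
```

Applied to a time series, the same function would be evaluated once per composite date, so that each index forms its own temporal feature stack.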

Table 4. The number of training samples in each zone in 2021.

	Winter Wheat	Non-Winter Wheat	Total
Zone A	196	106	302
Zone B	546	173	719
Zone C	956	459	1415
Zone D	725	284	1009
Total	2423	1022	3445

Table 5. The results of recognition accuracy for each zone.

	Zone A (20 January 2022)		Zone B (30 January 2022)		Zone C (31 December 2021)		Zone D (1 December 2021)
	Winter Wheat	Non-Winter Wheat	Winter Wheat	Non-Winter Wheat	Winter Wheat	Non-Winter Wheat	Winter Wheat	Non-Winter Wheat
UA (%)	93.44	96.61	95.73	99.20	97.65	98.81	93.18	98.80
PA (%)	97.44	95.00	99.09	96.12	99.49	98.99	99.32	96.05
OA (%)	95.76		97.52		98.94		97.02

Table 6. Comparison of accuracy between non-zonal and zonal early mapping.

	Non-Zonal			Zonal
	Winter Wheat	Non-Winter Wheat	PA (%)	Winter Wheat	Non-Winter Wheat	PA (%)
Winter wheat	330	29	97.06	342	17	95.26
Non-winter wheat	10	538	94.90	5	563	99.12
UA (%)	91.91	98.18		98.56	97.07
OA (%)	94.03			97.63
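
The accuracy measures in Table 6 can be derived directly from the confusion matrix counts. The sketch below is a minimal illustration that assumes rows hold the reference classes and columns hold the mapped classes, an orientation consistent with the zonal half of the table. The function name `accuracy_metrics` is hypothetical.

```python
def accuracy_metrics(matrix):
    """Compute PA, UA, F1, and OA from a square confusion matrix.

    Assumption: matrix[i][j] is the number of samples whose reference class
    is i and whose mapped class is j.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))

    # Producer's accuracy: correct samples divided by the reference (row) total.
    pa = [matrix[i][i] / sum(matrix[i]) for i in range(n)]
    # User's accuracy: correct samples divided by the mapped (column) total.
    ua = [matrix[i][i] / sum(matrix[r][i] for r in range(n)) for i in range(n)]
    # F1 score: harmonic mean of PA and UA for each class.
    f1 = [2 * p * u / (p + u) for p, u in zip(pa, ua)]

    return {"PA": pa, "UA": ua, "F1": f1, "OA": correct / total}


# Zonal counts from Table 6: [winter wheat, non-winter wheat].
zonal = [[342, 17],
         [5, 563]]
metrics = accuracy_metrics(zonal)
print([round(100 * v, 2) for v in metrics["PA"]])  # producer's accuracy (%)
print([round(100 * v, 2) for v in metrics["UA"]])  # user's accuracy (%)
print(round(100 * metrics["OA"], 2))               # overall accuracy (%)
```

For the zonal counts this reproduces the tabulated PA (95.26 and 99.12), UA (98.56 and 97.07), and OA (97.63) values.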

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
