Identification of Seed Maize Fields With High Spatial Resolution and Multiple Spectral Remote Sensing Using Random Forest Classifier

Abstract: Seed maize and common maize plots have different planting patterns and variety types. Identification of seed maize is the basis for seed maize growth monitoring, seed quality and common maize seed supply. In this paper, a random forest (RF) classifier is used to develop an approach for identifying seed maize fields, using time series vegetation indexes (VIs) calculated from multispectral data acquired from Landsat 8 and the Gaofen 1 satellite (GF-1), field sample data, and texture features of Gaofen 2 satellite (GF-2) panchromatic data. Huocheng and Hutubi Counties in the Xinjiang Uygur Autonomous Region of China were chosen as the study area. The results show that RF performs well with the combination of six VIs (normalized difference vegetation index (NDVI), enhanced vegetation index (EVI), triangle vegetation index (TVI), ratio vegetation index (RVI), normalized difference water index (NDWI) and difference vegetation index (DVI)) and texture features based on a grey-level co-occurrence matrix. The classification based on “spectrum + texture” information has higher overall, user and producer accuracies than that based on spectral information alone. Using the “spectrum + texture” method, the overall accuracy of classification in Huocheng County is 95.90%, the Kappa coefficient is 0.92, and the producer accuracy for seed maize fields is 93.91%. The overall accuracy of the classification in Hutubi County is 97.79%, the Kappa coefficient is 0.95, and the producer accuracy for seed maize fields is 97.65%. Therefore, an RF classifier fed with high-resolution remote-sensing image features can distinguish the two kinds of planting patterns (seed and common) and variety types (inbred and hybrid) of maize and can be used to identify and map seed maize fields over a wide area. However, this method requires a large amount of sample data, so how to effectively use and improve it in areas lacking samples needs further research.

test the effect of different time series lengths on the classification accuracy of alfalfa, corn, sorghum, soybean and winter wheat in Kansas, USA, and the results showed that the accuracy reached 88.45% when the time series length was 5 months. Based on Landsat 7 Enhanced Thematic Mapper Plus (ETM+) images, Tatsumi [44] used an RF classifier in Peru to explore ways to extract a variety of crops, such as cotton, grape, maize, soybean, and wheat.
A typical example of different planting patterns of the same crop is the grazing and mowing of pastures, which are often distinguished by change detection algorithms based on the reflectance, height and biomass of the herbage, as well as the spatial geometric pattern of the pastures. Remote sensing identification of different varieties of the same crop is mainly aimed at perennial crops with few main planting varieties, such as grape, sugarcane and citrus, and auxiliary information such as distribution maps, meteorology and geography of the crop is needed to distinguish the varieties [45]. Despite the advances in remote-sensing classification of different crops, there is still a lack of in-depth research on whether satellite remote sensing can further identify different planting patterns or variety types of the same crop, which is crucial for large-scale staple crops such as maize, rice and wheat. First, the acreage of a particular pattern or type of these staple crops may be much larger than that of many smaller crops. Second, there are huge regional differences in agronomic traits, stress resistance and yield per unit area between different planting patterns or cultivar types of a staple crop. If these patterns or types cannot be distinguished accurately, it is difficult to calibrate the quantitative remote sensing parameters of the crop accurately, which reduces the accuracy of the crop growth, risk and yield monitoring that depends on them. Therefore, identifying different planting patterns or varieties of the same crop has become a key scientific problem.
The maize seed-producing area provides us with a similar remote-sensing recognition research scenario. In addition, other crops, even common maize, are grown around the seed maize fields. Therefore, in order to accurately identify the seed maize fields in these kinds of areas, it is necessary to distinguish the seed maize fields from common maize, as well as maize from other crops. The differences between seed maize, common maize and other crops are mainly reflected in the spectral reflectance, texture and other characteristics of the plant population canopy between plots, as well as in their temporal changes across growth periods [46]. The selection of suitable classification features and methods can help identify the subtle differences between different planting patterns and varieties of the same crop. Liu et al. [46] and Zhang et al. [47] took the remote-sensing identification of seed maize and common maize in Linze County, Gansu Province, and Qitai County, Xinjiang Autonomous Region, China, respectively, as examples to explore a decision tree classification method using time-series spectra and high-resolution texture characteristics.
Based on the above analysis, vegetation indexes such as the enhanced vegetation index (EVI) and RVI, texture features such as the gray-level co-occurrence matrix derived from high-resolution satellite remote sensing images, and their temporal variation information have the potential to identify crops with slight differences. The random forest classifier has a good tolerance for outliers and noise and is not prone to overfitting, so it can be used to build a classification model for seed maize field detection.
In order to explore an identification method for the two kinds of planting patterns (seed and common) and variety types (inbred and hybrid) of the same crop (maize), this paper selected two maize seed production bases in Huocheng County and Hutubi County in the Xinjiang Uygur Autonomous Region of China as the study area, and took Landsat 8, the Chinese Gaofen 1 satellite (GF-1) and the Gaofen 2 satellite (GF-2) as the data sources. Using a random forest classifier, we propose a seed maize identification method that combines multi-temporal spectral features and texture features. It provides a reference for the large-scale mapping of seed maize fields and the fine classification of other crops by remote sensing.

Study Area
The study areas are Huocheng County and Hutubi County in the Xinjiang Uygur Autonomous Region, which belong to different ecological zones and are the major seed maize production areas in Xinjiang, China (Figure 1). Huocheng County is a border region of China with a geographical range of longitude 80.18-80.40E and latitude 43.65-44.83N. Agriculture is one of the mainstays of Huocheng County. There are approximately 964,500 acres of arable land in this region, which has a temperate semi-arid climate with full sunlight throughout the year, significant temperature variations between winter and summer, and significant temperature variations between day and night. In this climate, the amount of precipitation tends to be low. The average annual precipitation and average temperature in the area are 140-460 mm and 8.2-9.4 °C, respectively. The annual sunshine hours are 2550-3500 h, and the frost-free period is 167-187 days. This area provides a good planting environment for major crops such as seed maize, common maize, rice, cotton, grapes, wheat, sugar beets and soybeans. It is also the most important planting area for seed maize in Ili Kazak Autonomous Prefecture.
Hutubi County is located in the middle of the northern slope of Tianshan, on the south rim of the Junggar Basin, between 86°05′~87°07′E and 43°16′~45°20′N. The alluvial plain in the central area is the main planting area of crops in Hutubi County, which has a temperate continental climate with high variation between the four seasons. The annual average precipitation is 167 mm, the frost-free period lasts for 180 days, and the average temperatures of the coldest and hottest months are -17.1 °C and 26.4 °C, respectively. From May to August, during the season for strong crop growth in the plains, the average daily sunshine hours reach more than 10 h, and in July, they reach more than 11 h. The following categories were included in this study: cotton, seed maize, common maize, grapes, pumpkins, tomatoes, and watermelons.

Remote-Sensing Data
The data used in this study consist of a time series of GF-1 wide field view (WFV) imagery and Landsat 8 OLI imagery from March to September 2016, and GF-2 panchromatic sensor (PMS) imagery. The specific parameters are shown in Table 1. The GF-1 satellite is the first satellite in China's high-resolution Earth observation system. Since the study area is large and relatively homogeneous, Landsat 8 Operational Land Imager (OLI) data with path 147/row 29 were selected as a supplement. The multi-temporal GF-1 WFV and Landsat 8 OLI image series were then used to describe the changes in the spectral characteristics of the agricultural crop conditions. The GF-2 satellite is the first civil optical remote sensing satellite with a spatial resolution better than 1 m that was independently developed by China, and it has high radiation accuracy, high positioning accuracy, and rapid posture maneuverability. In the seed maize-producing fields, two inbred lines (as female and male parents, respectively) are planted in different rows; usually there is 1 row of the male parent for every 4-8 rows of the female parent, with a row spacing of 0.6-0.8 m. The male parent supplies pollen to the female parent, from which the hybrid seeds are harvested. This heterogeneous population structure can be reflected in the spectral reflectance and canopy texture differences in remote sensing images. Usually, the maternal tassels are removed during tasseling in mid-July, and the paternal line is cut off after pollination at the beginning of August. Thus, the stripe texture unit size in the high spatial resolution remote-sensing images is between 3.6 m and 4.8 m. This texture feature should be visible at a 1 m spatial resolution in the GF-2 panchromatic images. Because of the difference in planting patterns, the texture characteristics of common maize are not noticeable. Figure 2 shows a photo and the GF-2 panchromatic image, acquired on 26 July 2016, of seed maize and common maize.
In Figure 2, we can see texture features clearly in both the photo and GF-2 image of seed maize. Therefore, GF-2 panchromatic images from the middle of July to the beginning of August were selected to analyze the feature contours and texture features.

Field Sample Data
A total of 162 samples were obtained in Huocheng County from the field survey, covering rice, seed maize, cotton, grapes, wheat, sugar beets, common maize, and soybeans. These samples were polygons, with an average of 148 GF-1 WFV 16 m pixels per polygon sample, and they were primarily distributed in the southern half of the county, the primary seed maize-planting area. In Hutubi County, there were 80 samples of cotton, seed maize, grapes, pumpkins, common maize, tomatoes and watermelons. On average, each polygon sample covered 139 GF-1 WFV 16 m pixels, and the samples were evenly distributed in the central crop planting area in the middle of the county. The attribute table of the samples includes the crop type, plot area, geographic coordinates, and crop growth period. Under the national standardized planting mode and management of seed maize, the geographical location of the fields is relatively stable over a short period of time. Therefore, arable land data from 2014, derived from the country's annual land change survey and stored in raster format, were used as the field boundaries of the study area. These reference data were used for crop classification and validation.

Methods
The research workflow is shown in Figure 3. It mainly includes five parts: data preprocessing, spectral feature optimization, texture feature calculation, random forest classification, and accuracy assessment. First, the remote sensing data were preprocessed on the automatic processing platform developed by our team. Second, the correlation coefficient was used to optimize the spectral features. Third, the texture features were calculated. Fourth, in the regions covered by GF-2 images, classification was performed twice, once using the spectral features only and once using the fused spectral and texture features. Finally, the land cover data from 2014 were used for masking, and the accuracy of each result was evaluated separately.
Remote Sens. 2020, 12, 362 7 of 19

Data Preprocessing
Both the GF-1 images and the field samples were stored using the Raster Dataset Clean and Reconstitution Multi-Grid (RDCRMG) grid system developed by China Agricultural University [48]. Using C# combined with the Geospatial Data Abstraction Library (GDAL), procedures such as radiometric calibration, ortho-rectification, and image registration were performed for all the data. Atmospheric correction and radiometric calibration were performed using the Fast Line-of-sight Atmospheric Analysis of Spectral Hypercubes (FLAASH) tools [49]. A suitable atmospheric model was selected according to the mid-latitude location of the study area and the image acquisition time. Since the urban area in the study area is relatively small, a rural aerosol model was selected to correct the effects of the aerosol factors. The GF satellite image products used in this study were provided with rational polynomial coefficient (RPC) files, and the parameters in these files were used to perform ortho-rectification on the high-resolution remote sensing images. As a result, the multi-source and multi-temporal remote sensing images were well geo-referenced.

Spectral Feature Optimization
In this paper, a correlation coefficient calculated in Python was used to select the better VIs. The correlation coefficient is a statistical indicator designed by the statistician Karl Pearson and is a measure of the degree of linear correlation between study variables. All the field survey samples were analyzed in this paper.
A VI quantifies vegetation properties by transforming the reflectance of two or more spectral bands [50]. Considering the differences in phenology, seasonal differences, and the significance and anti-saturation degree of different VIs, the commonly used VIs can be divided into four categories, the fourth of which reflects the canopy moisture content of crops: normalized difference water index (NDWI) [8]. The formulas are given in Table 2. In these formulas, B, G, R and NIR are the reflectance of the blue, green, red and near-infrared bands, respectively. L is the soil adjustment parameter, and its value is 0.5.
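As an illustration of how the eight VIs considered in this study can be computed from band reflectance, the following minimal sketch uses widely cited textbook formulations. It is an assumption that these match Table 2: in particular, TVI and NDWI exist in several variants, and the green/near-infrared form of NDWI is used here because only B, G, R and NIR are named in the text. The function name and the sample reflectance values are hypothetical.

```python
def vegetation_indices(b, g, r, nir, L=0.5):
    """Return eight vegetation indices for one pixel.

    b, g, r, nir: reflectance (0-1) of the blue, green, red and NIR bands.
    L: soil adjustment parameter of SAVI (0.5, as stated in the text).
    The formulas are common textbook forms, assumed rather than taken from Table 2.
    """
    return {
        "NDVI": (nir - r) / (nir + r),
        "EVI": 2.5 * (nir - r) / (nir + 6.0 * r - 7.5 * b + 1.0),
        "TVI": 0.5 * (120.0 * (nir - g) - 200.0 * (r - g)),
        "RVI": nir / r,
        "NDWI": (g - nir) / (g + nir),
        "DVI": nir - r,
        "SAVI": (1.0 + L) * (nir - r) / (nir + r + L),
        "GNDVI": (nir - g) / (nir + g),
    }


# Made-up reflectance values for a dense green canopy.
vi = vegetation_indices(b=0.04, g=0.08, r=0.06, nir=0.45)
for name, value in vi.items():
    print(f"{name}: {value:.4f}")
```

With these formulations NDWI is exactly the negative of GNDVI, so the two indices carry the same information with opposite sign.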

LBP-GLCM
The GLCM, first proposed by Haralick [52] in 1973, is one of the most common and widely used texture statistical analysis methods. The element values in the matrix represent the joint conditional probability density between gray levels: given the spatial distance d and the direction θ, each element is the probability (i.e., frequency) of gray level j occurring when gray level i is the starting point. Fourteen texture features can be calculated from it, such as the angular second-order moment, entropy, contrast, and correlation. In this study, five features calculated in Python were selected for the experiment, namely mean, variance, contrast, entropy, and correlation. Although seed maize fields share the same stripe texture structure, the texture direction varies within the same remote sensing image. To eliminate the influence of the crop planting direction, the image is first transformed into a local binary pattern (LBP) image with rotation invariance before the GLCM is calculated. The LBP is an operator that is used to describe local texture features of images. It has significant advantages such as rotation invariance and gray invariance.
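Specifically, the binary pattern formed by the 8 neighbors around each pixel is rotated circularly and the minimum resulting value is taken, which yields an LBP image with rotation invariance. There are three extensions to the original operator $LBP_{P,R} = \sum_{p=0}^{P-1} s(g_p - g_c) 2^p$ to describe texture features, namely rotation invariant patterns $LBP^{ri}_{P,R}$, uniform patterns $U(LBP_{P,R})$, and rotation invariant uniform patterns $LBP^{riu2}_{P,R}$, which in their standard form can be calculated using the following equations:
$$LBP^{ri}_{P,R} = \min \{ ROR(LBP_{P,R}, i) \mid i = 0, 1, \ldots, P-1 \}$$
$$LBP^{riu2}_{P,R} = \begin{cases} \sum_{p=0}^{P-1} s(g_p - g_c), & U(LBP_{P,R}) \le 2 \\ P + 1, & \text{otherwise} \end{cases}$$
The calculation formula for LBP uniform patterns is
$$U(LBP_{P,R}) = \left| s(g_{P-1} - g_c) - s(g_0 - g_c) \right| + \sum_{p=1}^{P-1} \left| s(g_p - g_c) - s(g_{p-1} - g_c) \right|$$
where R is the neighborhood radius, P is the number of pixels in the circular neighborhood of the LBP algorithm, $g_c$ is the gray value of the neighborhood center cell, $g_p$ (p = 0, ..., P-1) is the gray value of the other cells in the neighborhood, and $ROR(x, i)$ is a circular bitwise right shift of the P-bit pattern x by i positions. The threshold formula is
$$s(x) = \begin{cases} 1, & x \ge 0 \\ 0, & x < 0 \end{cases}$$
where x is the difference between a neighborhood pixel $g_p$ and the central pixel $g_c$. Each of the P gray values in the circular neighborhood is compared with the center gray value $g_c$: neighbors that are at least as large as the center are represented by 1, and the others by 0.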
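The LBP-then-GLCM chain described above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses P = 8 and R = 1 with the 8 grid neighbors (no circular interpolation), a single GLCM offset of one pixel to the right, and a made-up striped test image; the function names are hypothetical.

```python
import math

# The 8 neighbours of a pixel, listed in circular order.
OFFSETS = [(0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1), (-1, 0), (-1, 1)]


def lbp_riu2(img):
    """Rotation-invariant uniform LBP (P=8, R=1) for the interior pixels."""
    h, w = len(img), len(img[0])
    out = [[0] * (w - 2) for _ in range(h - 2)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            bits = [1 if img[y + dy][x + dx] >= img[y][x] else 0
                    for dy, dx in OFFSETS]
            # U: number of 0/1 changes around the circle (bits[-1] closes the loop).
            transitions = sum(bits[i] != bits[i - 1] for i in range(8))
            out[y - 1][x - 1] = sum(bits) if transitions <= 2 else 9  # P + 1
    return out


def glcm(img, levels, dy=0, dx=1):
    """Symmetric, normalised gray-level co-occurrence matrix for one offset."""
    counts = [[0.0] * levels for _ in range(levels)]
    h, w = len(img), len(img[0])
    for y in range(h):
        for x in range(w):
            y2, x2 = y + dy, x + dx
            if 0 <= y2 < h and 0 <= x2 < w:
                i, j = img[y][x], img[y2][x2]
                counts[i][j] += 1
                counts[j][i] += 1
    total = sum(map(sum, counts))
    return [[c / total for c in row] for row in counts]


def glcm_features(p):
    """Mean, variance, contrast, entropy and correlation of a symmetric GLCM."""
    n = len(p)
    cells = [(i, j, p[i][j]) for i in range(n) for j in range(n)]
    mean = sum(i * v for i, _, v in cells)
    variance = sum((i - mean) ** 2 * v for i, _, v in cells)
    contrast = sum((i - j) ** 2 * v for i, j, v in cells)
    entropy = -sum(v * math.log(v) for _, _, v in cells if v > 0)
    cov = sum((i - mean) * (j - mean) * v for i, j, v in cells)
    correlation = cov / variance if variance > 0 else 1.0
    return {"mean": mean, "variance": variance, "contrast": contrast,
            "entropy": entropy, "correlation": correlation}


# Made-up 8 x 8 image with dark vertical stripes every third column.
stripes = [[10 if x % 3 == 0 else 50 for x in range(8)] for _ in range(8)]
codes = lbp_riu2(stripes)                       # LBP codes take values 0..9
features = glcm_features(glcm(codes, levels=10))
print(features)
```

Because each riu2 code depends only on the number of 1 bits and on their uniformity, turning the stripes by 90 degrees leaves the set of LBP codes unchanged, which is the property that makes the subsequent GLCM statistics less sensitive to planting direction.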

Random Forest Classification
Random forest (RF) is an ensemble algorithm of the Bagging type. It combines multiple weak classifiers and obtains the final result by voting, which gives the overall model higher precision and generalization ability [8].
Random forests can process high-dimensional data well, and this method has significant advantages when there are many samples and features. In this study, the multi-temporal single-band VI image series were compiled into a multi-layer data cube for further analysis. In Huocheng County, 8 vegetation indexes were calculated for each of 12 scenes, so the data cube has 96 bands; correspondingly, the data cube for Hutubi County has 80 bands.
In principle, the more decision trees there are in the RF classifier, the better the prediction. However, there is a trade-off between the classification accuracy and the time efficiency. In this paper, we tested different numbers of trees, including 10, 30, 50, 70, 100, 120, 150, 170, and 200 (Figure 4), and we selected 150 trees to classify the seed maize fields when considering both the classification accuracy and time efficiency.
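The bagging-and-voting idea described above can be illustrated with a deliberately simplified sketch. It is not the Weka RF implementation used in this study: each "tree" here is a one-split stump trained on a bootstrap sample and a single randomly chosen feature, and the two-feature samples (a texture contrast value and an NDVI value) are made up. Only the default of 150 trees mirrors the text.

```python
import random
from collections import Counter


def train_stump(rows, labels, feature):
    """Fit a one-split tree on one feature by minimising training errors."""
    best = None
    for t in sorted({r[feature] for r in rows}):
        left = [l for r, l in zip(rows, labels) if r[feature] <= t]
        right = [l for r, l in zip(rows, labels) if r[feature] > t]
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = (sum(l != left_label for l in left)
                  + sum(l != right_label for l in right))
        if best is None or errors < best[0]:
            best = (errors, t, left_label, right_label)
    return (feature,) + best[1:]


def train_forest(rows, labels, n_trees=150, seed=0):
    """Bagging: every stump sees its own bootstrap sample and random feature."""
    rng = random.Random(seed)
    n, n_features = len(rows), len(rows[0])
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]   # sample with replacement
        feature = rng.randrange(n_features)          # random feature choice
        forest.append(train_stump([rows[i] for i in idx],
                                  [labels[i] for i in idx], feature))
    return forest


def predict(forest, row):
    """Majority vote over all stumps."""
    votes = Counter(ll if row[f] <= t else rl for f, t, ll, rl in forest)
    return votes.most_common(1)[0][0]


# Made-up samples: (texture contrast, NDVI) per field.
rows = [(0.80, 0.60), (0.75, 0.62), (0.90, 0.58), (0.85, 0.65),
        (0.20, 0.80), (0.25, 0.78), (0.10, 0.85), (0.15, 0.82)]
labels = ["seed"] * 4 + ["common"] * 4
forest = train_forest(rows, labels)
print(predict(forest, (0.95, 0.55)), predict(forest, (0.05, 0.90)))
```

Individual stumps can be wrong, for example when a bootstrap sample happens to contain a single class, but the majority vote over many of them is stable, which is the reason accuracy tends to level off as the number of trees grows.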
Crop classification based on remote sensing data is essentially based on the similarity of pixels. In this article, we combined C# with the Waikato Environment for Knowledge Analysis (Weka), an open source machine learning and data mining software package based on the Java environment, to build the data sets and classification models. First, the features of the samples were extracted from the remote sensing image data. Then, the features from the multi-temporal data were combined to generate the sample set, which served as the input data for training the RF classifier. Finally, remote sensing data of the same time phases were fed into the classifier to obtain the crop classification results.
In this paper, two experiments were designed: one based solely on spectral data and one based on fused spectral and texture data. First, according to the phenological calendar and the planting system used for the primary crops in the study area, the vegetation index system was constructed, the multi-temporal spectral characteristics were analyzed, and the seed maize was preliminarily identified using the spectral characteristics. Then, in the regions covered by GF-2 images, the classification results were further refined by a texture analysis of the high spatial resolution remote sensing images. In this way, two classification results of seed maize can be obtained.

Accuracy Assessment
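In this paper, the arable land data from 2014 were first used for masking. Then, 114 samples (70%) were randomly selected as training samples, with the remaining samples (30%) used as verification samples, and a confusion matrix computed in Python was used to assess the classification results. From the confusion matrix, four accuracy assessment indexes can be obtained: overall accuracy (OA), producer accuracy (PA), user accuracy (UA), and the kappa coefficient (K). Kappa analysis provides a measure of the magnitude of agreement between the predicted and actual class membership [53]. A kappa value of 0 represents a totally random classification, while a kappa value of 1 corresponds to perfect agreement between the reference and classification data. In their standard form, the indicators are calculated as follows:
$$OA = \frac{\sum_{i=1}^{r} x_{ii}}{N}, \quad PA_i = \frac{x_{ii}}{x_{+i}}, \quad UA_i = \frac{x_{ii}}{x_{i+}}, \quad K = \frac{N \sum_{i=1}^{r} x_{ii} - \sum_{i=1}^{r} x_{i+} x_{+i}}{N^2 - \sum_{i=1}^{r} x_{i+} x_{+i}}$$
where r is the number of classes, N is the total number of verification samples, $x_{ii}$ is the number of samples correctly classified as class i, $x_{i+}$ is the total number of samples classified as class i, and $x_{+i}$ is the total number of reference samples of class i.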
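The four indicators can be derived from a confusion matrix as in the following minimal sketch. The function name and the 2 x 2 matrix are hypothetical (the values are not taken from Tables 8-10), and the layout is an assumption: rows hold the classified class and columns the reference class.

```python
def accuracy_metrics(cm):
    """Compute OA, per-class PA and UA, and kappa from a confusion matrix.

    cm[i][j] = number of samples classified as class i whose reference
    class is j (rows: classification, columns: reference).
    """
    n = len(cm)
    total = sum(map(sum, cm))
    correct = sum(cm[i][i] for i in range(n))
    classified = [sum(cm[i]) for i in range(n)]                      # row totals
    reference = [sum(cm[i][j] for i in range(n)) for j in range(n)]  # column totals

    oa = correct / total
    pa = [cm[i][i] / reference[i] for i in range(n)]   # 1 - omission error
    ua = [cm[i][i] / classified[i] for i in range(n)]  # 1 - commission error
    # Agreement expected by chance, from the row and column marginals.
    pe = sum(classified[i] * reference[i] for i in range(n)) / total ** 2
    kappa = (oa - pe) / (1 - pe)
    return oa, pa, ua, kappa


# Made-up two-class example: class 0 = seed maize, class 1 = other.
cm = [[45, 5],
      [3, 47]]
oa, pa, ua, kappa = accuracy_metrics(cm)
print(f"OA={oa:.4f}  PA(seed)={pa[0]:.4f}  UA(seed)={ua[0]:.4f}  kappa={kappa:.4f}")
```

Writing kappa as (OA - pe) / (1 - pe) is algebraically the same as the count-based expression: multiplying the numerator and denominator by N squared recovers it.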

Feature Optimization Result
Data from Huocheng County were used to evaluate the VI correlation of seed maize, and the same rules were applied to both regions. The correlation coefficients of the VIs during different growing stages are shown in Tables 3-6. There is a high correlation among the various VIs. The lowest value was the correlation coefficient between RVI and DVI on June 29, with a value of 0.727. The correlation coefficients of the vegetation indexes in the other phases were all greater than 0.8. Among them, the largest correlation coefficients are those between NDVI and SAVI and between NDWI and GNDVI, whose absolute values are all 1. Since the NDWI quantifies a different aspect of vegetation (a water-related property) from the other VIs, the correlation coefficients between the NDWI and the other VIs are negative. Table 7 shows the t-test results for the vegetation index pairs with an absolute correlation coefficient of 1. It can be seen from the table that, except for NDVI and SAVI, which show low significance on April 19, the t-test results all indicate extremely significant correlations. Increasing the number of features greatly increases the computational dimension and reduces the calculation efficiency. We therefore discarded SAVI and GNDVI and selected NDVI, EVI, TVI, RVI, NDWI and DVI as the input data for the final classification.
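The selection rule described above, dropping one VI from every pair whose absolute Pearson correlation is effectively 1, can be sketched as follows. The sample series, the 0.999 threshold and the function names are assumptions made for illustration; they are not the values behind Tables 3-7.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def prune_redundant(series, threshold=0.999):
    """Keep a VI only if it is not (almost) perfectly correlated with a kept one.

    The insertion order of the dict acts as the priority order.
    """
    kept = []
    for name in series:
        if all(abs(pearson(series[name], series[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Made-up VI values for five sample fields on one date.
ndvi = [0.21, 0.35, 0.62, 0.80, 0.74]
ndwi = [-0.10, -0.32, -0.45, -0.66, -0.50]
series = {
    "NDVI": ndvi,
    "RVI": [1.5, 2.1, 4.3, 9.0, 6.7],
    "NDWI": ndwi,
    "SAVI": [0.5 * v for v in ndvi],   # assumed exactly proportional to NDVI
    "GNDVI": [-v for v in ndwi],       # sign-flipped NDWI
}
kept = prune_redundant(series)
print(kept)
```

In this toy example SAVI and GNDVI are removed because they duplicate NDVI and NDWI, while RVI is kept even though its correlation with NDVI is high, mirroring the decision to discard only the pairs with an absolute coefficient of 1.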

Local Binary Pattern-Gray-Level Co-Occurrence Matrix (LBP-GLCM) Analysis
The GLCM in this paper is determined by using two parameters, the distance and the direction. According to the field investigation, the tassels in the female parent row are removed from the maize planting field in the study area at a specific time, while the male parent row retains its tassels. In the GF-2 panchromatic image at a 1-meter resolution, the parents of seed-producing maize had obvious gray contrast in the neighborhood. Simultaneously, because the line ratio of the parents is 1:6 and the row spacing is 60 cm, the texture structure of the seed maize field is based on approximately one pixel, so the distance parameter of the GLCM is based on one pixel. In large-scale crop planting, it is difficult to ensure the same crop planting direction.

Accuracy Assessment and Analysis
The accuracy verification results based on the spectrum in Huocheng County and Hutubi County are shown in Tables 8 and 9. The overall accuracies are 86.71% and 88.43%, and the Kappa coefficients are 0.76 and 0.83, respectively, meeting the accuracy requirements of crop classification. The user accuracy and the producer accuracy of Huocheng County's seed maize are 94.82% and 86.78%, respectively. These results mean that other crops are rarely misclassified as seed maize, whereas seed maize is more often misclassified as other crops; the confusion between seed maize and common maize is the most serious, with 5.32% of seed maize fields mistakenly classified as common maize. In Hutubi County, the user accuracy and the producer accuracy are 90.02% and 94.08%, respectively; the producer accuracy is notably higher than in Huocheng County. This is because there are more seed maize fields in Hutubi County and the plots are larger, so the salt-and-pepper effect is weaker. Averaged over the two counties, the overall accuracy and the Kappa coefficient were 87.57% and 0.80, the user accuracy for seed maize was 92.42%, and the producer accuracy was 90.43%.
Table 8. Confusion matrix based on the "spectrum" in Huocheng County.
Table 9. Confusion matrix based on the "spectrum" in Hutubi County.
In the regions covered by GF-2, the accuracy verification results based on the "spectrum + texture" in Huocheng County and Hutubi County are shown in Table 10. For Huocheng County, the overall accuracy and Kappa coefficient based on "spectrum + texture" are 9.19 percentage points and 0.16 higher than those based on "spectrum", respectively, and the omission rate is 7.13 percentage points lower. For Hutubi County, the overall accuracy and Kappa coefficient based on "spectrum + texture" are 9.36 percentage points and 0.12 higher than those based on "spectrum", respectively.
In addition, the user accuracy and producer accuracy of "spectrum + texture" are better, since both the commission rate and the omission rate are lower than those of "spectrum". The average overall accuracy and the Kappa coefficient of the two counties improved to 96.85% and 0.94, respectively. The average user accuracy for seed maize fields improved to 98.03%, and the average producer accuracy to 95.78%. Therefore, the "spectrum + texture" classification method achieves higher accuracy.
A confusion matrix based on field samples alone is not sufficiently comprehensive to evaluate classification accuracy; the classification results also need to be evaluated against the crop maps. Figure 6 shows the seed maize distribution in Huocheng and Hutubi Counties. In this figure, panels a and b are based on "spectrum" and "spectrum + texture", respectively, in Huocheng County, and panels c and d are the corresponding results for Hutubi County. The distribution of seed maize can be obtained with both schemes, but the classification results based on "spectrum + texture" have clearer field boundaries, which is in line with the characteristics of seed maize fields: large plots, good contiguity and concentrated planting. The identification of seed maize is therefore more accurate. Comparing the two counties, the plots in Hutubi County are larger and more regular, which is consistent with the higher accuracy obtained for Hutubi County above. Therefore, fusing spectrum and texture to identify seed maize is feasible.
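The accuracy measures reported in this section (overall accuracy, Kappa coefficient, user accuracy and producer accuracy) all derive from a confusion matrix. The following is a minimal sketch of those calculations, assuming rows hold the reference (field sample) classes and columns hold the mapped classes. The function name `accuracy_metrics` and the toy two-class matrix are illustrative only and are not taken from Tables 8 to 10.

```python
def accuracy_metrics(cm):
    """Accuracy measures from a square confusion matrix.

    Assumed layout: cm[i][j] is the number of validation samples whose
    reference class is i and whose mapped (predicted) class is j.
    Every class is assumed to have at least one reference and one mapped sample.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    row_sums = [sum(row) for row in cm]                              # reference totals
    col_sums = [sum(cm[i][j] for i in range(n)) for j in range(n)]   # mapped totals
    diag = [cm[k][k] for k in range(n)]                              # correctly classified

    overall = sum(diag) / total
    # Chance agreement expected from the row and column marginals.
    expected = sum(r * c for r, c in zip(row_sums, col_sums)) / total ** 2
    kappa = (overall - expected) / (1 - expected)

    producer = [d / r for d, r in zip(diag, row_sums)]  # 1 - omission error
    user = [d / c for d, c in zip(diag, col_sums)]      # 1 - commission error
    return {"overall": overall, "kappa": kappa,
            "producer": producer, "user": user}


# Toy example (hypothetical counts): class 0 = seed maize, class 1 = other.
toy = [[45, 5],
       [10, 40]]
metrics = accuracy_metrics(toy)
print(metrics)
```

In this layout the omission rate of a class is one minus its producer accuracy and the commission rate is one minus its user accuracy, which is why a drop in omission rate appears as an equal rise in producer accuracy.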

Discussion
The purpose of this research was to explore a method for identifying different planting patterns and varieties of the same crop. Based on the differences in planting methods and varieties between seed maize and common maize, six spectral vegetation indexes were extracted from Landsat 8 and GF-1 WFV multispectral time series images, LBP-GLCM texture features were extracted from GF-2 PMS panchromatic images with a spatial resolution of 1 m, and a random forest classification model was constructed to distinguish seed maize from common maize and other crops. The overall accuracies in the two typical seed maize producing counties were 95.90% and 97.79%, respectively. Compared with the "spectrum" method, adding texture features improved the classification accuracy by 9.19 and 9.36 percentage points.
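For reference, the six vegetation indexes used as spectral features (NDVI, EVI, TVI, RVI, NDWI and DVI) are simple band arithmetic on surface reflectance. The sketch below uses commonly cited definitions. It is an assumption that these match the formulations used in this study, in particular for TVI (taken here as the triangular index 0.5[120(NIR - G) - 200(R - G)]) and NDWI (taken here as the green/NIR form). The function names and sample reflectance values are hypothetical.

```python
def ndvi(nir, red):
    """Normalized difference vegetation index."""
    return (nir - red) / (nir + red)


def evi(nir, red, blue):
    """Enhanced vegetation index with the widely used coefficients (2.5, 6, 7.5, 1)."""
    return 2.5 * (nir - red) / (nir + 6.0 * red - 7.5 * blue + 1.0)


def tvi(nir, red, green):
    """Triangular vegetation index (assumed formulation, see lead-in)."""
    return 0.5 * (120.0 * (nir - green) - 200.0 * (red - green))


def rvi(nir, red):
    """Ratio vegetation index."""
    return nir / red


def ndwi(green, nir):
    """Normalized difference water index (assumed green/NIR formulation)."""
    return (green - nir) / (green + nir)


def dvi(nir, red):
    """Difference vegetation index."""
    return nir - red


# Hypothetical reflectances for one vegetated pixel on one acquisition date.
pixel = {"blue": 0.05, "green": 0.08, "red": 0.10, "nir": 0.50}
features = {
    "NDVI": ndvi(pixel["nir"], pixel["red"]),
    "EVI": evi(pixel["nir"], pixel["red"], pixel["blue"]),
    "TVI": tvi(pixel["nir"], pixel["red"], pixel["green"]),
    "RVI": rvi(pixel["nir"], pixel["red"]),
    "NDWI": ndwi(pixel["green"], pixel["nir"]),
    "DVI": dvi(pixel["nir"], pixel["red"]),
}
print(features)
```

Evaluating these functions for each acquisition date yields one value per index and date, which together form a time series feature vector for the classifier.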
Another example of different management patterns of the same crop is grazing and mowing in pastures. Grazing and mowing change biomass and canopy height, and mowing can also expose the soil [45]. Lopes et al. [54] distinguished mowing, grazing and mixed practices using an object-based classification of grasslands from high-resolution satellite image time series (Formosat-2) and Gaussian mean map kernels. In a case study in Brittany, France, Dusseux et al. [55] derived the NDVI and two biophysical variables (leaf area index and fraction of vegetation cover) from a series of three SPOT images and field data to monitor pasture mowing and grazing. Compared with the above methods, the method proposed in this paper integrates texture information and spectral information to identify the two planting patterns of maize, and makes better use of the visible differences between the two patterns. However, this study does not consider the biomass differences between seed maize and common maize. Biomass estimates could therefore be included in a future study to improve the identification accuracy.
Compared with previous exploratory studies on remote-sensing identification of seed maize, the technical scheme adopted in this paper (the combination of data source, spectral features, texture features and classification method) is more generally applicable. For example, the decision tree method based on multi-temporal NDVI and high-resolution texture adopted by Liu et al. [46] in Linze County, Gansu Province, China, has a mapping accuracy of 83% and is better suited to identifying seed maize areas that are covered with plastic film before sowing. The decision tree method based on multi-temporal EVI and GLCM texture adopted by Zhang et al. [47] in Qitai County, Xinjiang, China, has a mapping accuracy of about 90%, but the manually set thresholds of the decision tree method introduce considerable subjectivity. Zhang et al. [51] adopted the random forest method based on 8 vegetation indexes and LBP-GLCM texture in Huocheng County, Xinjiang, China, which requires many input parameters and reaches a mapping accuracy of about 86%. In terms of texture feature extraction, the results of this study show that GF-2 panchromatic images with 1 m spatial resolution also provide texture information sufficient for identifying seed production plots. Compared with the 0.3 m GeoEye-1 [46] and 0.7 m KOMPSAT-3 [47] data used in previous studies, GF-2 offers significantly larger effective coverage as a high-resolution data source.
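To make the notion of GLCM texture concrete, the following is a minimal pure-Python sketch of a grey-level co-occurrence matrix and four commonly used statistics (contrast, homogeneity, angular second moment and entropy). It is an illustration under stated assumptions, not the LBP-GLCM pipeline used in this study: the window size, quantization levels, offsets and selected statistics are not taken from the paper, and the function names `glcm` and `glcm_features` are hypothetical.

```python
import math
from collections import Counter


def glcm(image, dx=1, dy=0, symmetric=True):
    """Normalized co-occurrence probabilities for the pixel offset (dx, dy).

    image is a list of rows of integer grey levels (already quantized).
    Returns a dict mapping (level_i, level_j) to probability.
    """
    counts = Counter()
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                i, j = image[r][c], image[r2][c2]
                counts[(i, j)] += 1
                if symmetric:            # also count the reversed pair
                    counts[(j, i)] += 1
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}


def glcm_features(p):
    """Texture statistics from co-occurrence probabilities p."""
    return {
        "contrast": sum(v * (i - j) ** 2 for (i, j), v in p.items()),
        "homogeneity": sum(v / (1 + (i - j) ** 2) for (i, j), v in p.items()),
        "asm": sum(v * v for v in p.values()),           # angular second moment
        "entropy": -sum(v * math.log(v) for v in p.values()),
    }


# Hypothetical 4 x 4 windows: vertical stripes versus a uniform patch.
striped = [[0, 3, 0, 3]] * 4
uniform = [[1, 1, 1, 1]] * 4
print(glcm_features(glcm(striped)))
print(glcm_features(glcm(uniform)))
```

A window with regular alternating stripes gives high contrast along the direction across the stripes, while a uniform window gives zero contrast, which is the kind of difference a texture feature can pass to the classifier.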
Although the method presented in this paper identifies seed maize with high accuracy, some omission and commission errors remain. In the spectrum-based classification, seed maize in Huocheng County had a relatively high omission error, and the proportions of seed maize misclassified as common maize, cotton and rice were 5.32%, 4.63% and 2.69%, respectively. In Hutubi County, the commission error was higher: cotton, watermelon and tomato were misclassified as seed maize, with errors of 3.92%, 2.61% and 1.92%, respectively. This is mainly because the phenological calendar and biomass of these crops are similar, and because the sample size of seed maize in Huocheng County is relatively large, accounting for 65.33% of the total sample size. Such an unbalanced sample structure tends to increase the training error of the model and biases the classification results toward the category with the largest sample size. Therefore, future research needs to consider the subtle differences between these crops and explore features that are better suited to distinguishing them. At the same time, we will increase the sample sizes of the under-represented classes and optimize the distribution of samples, so that the number of samples and their spatial structure are more balanced.
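The sample imbalance described above can be quantified before training. The sketch below computes each class's share of the sample set and the number of additional samples each class would need to match the largest class. The per-class counts are hypothetical and were chosen only so that seed maize accounts for 65.33% of the total, as reported for Huocheng County; the real counts and class list are not reproduced here, and the function names are illustrative.

```python
def class_shares(counts):
    """Fraction of the total sample set contributed by each class."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def samples_needed_to_balance(counts):
    """Additional samples per class required to match the largest class."""
    largest = max(counts.values())
    return {name: largest - n for name, n in counts.items()}


# Hypothetical sample counts (196 / 300 = 65.33% seed maize).
counts = {"seed_maize": 196, "common_maize": 40, "cotton": 35, "rice": 29}

for name, share in class_shares(counts).items():
    print(f"{name}: {share:.2%}")
print(samples_needed_to_balance(counts))
```

Full balancing by extra field surveys may not be practical, so the shortfall figures are better read as an indication of which classes are most under-represented than as a sampling target.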
The identification accuracy of the method in two seed maize production counties in Xinjiang is relatively high, so the method can be used for the statistics and mapping of annual seed maize acreage in established seed production areas, and for accurate estimation of seed yield on that basis. However, the method requires a large number of ground survey samples and images covering the full growing season. It is therefore necessary to further improve the algorithm and identification scheme for areas lacking samples and for estimating seed production acreage early in the growing season.

Conclusions
The aim of this paper is to explore the feasibility of using high-resolution remote-sensing satellite images to identify different planting patterns and varieties of the same crop, and to extend the scope of precise crop classification by remote sensing. The results show that a random forest classifier built on the temporal spectra and texture information extracted from high-resolution remote-sensing satellite images, combined with ground samples, can distinguish two kinds of planting patterns (seed and common) and variety types (inbred and hybrid) of maize. Moreover, the method achieved high accuracy in two representative maize seed-producing areas in Xinjiang, China, and can be used for remote-sensing mapping of large-scale maize seed-producing fields. The key findings are as follows:
By comparing and screening the vegetation indices involved in the modeling, we found that SAVI and GNDVI are redundant features for the classification of seed maize, common maize and other crops, and that only the six indices NDVI, EVI, TVI, RVI, NDWI and DVI are required. Compared with the method using only EVI as the spectral feature, the recognition accuracy was improved.
Another seed maize identification model was built based on the fusion of spectral and texture features. The texture parameters calculated from the 1 m panchromatic band of GF-2 also accurately express the within-field differences between seed maize and common maize, and accuracy improves significantly compared with using spectral information alone.
In addition, compared with other high-resolution images, GF-2 has a wider swath, which makes this method more scalable.
In this study, remote-sensing identification of seed maize was taken as an example to explore a method for discriminating different planting patterns and varieties of the same crop, which provides a useful reference for the fine identification of similar crops. On this basis, the statistics and mapping of maize area for seed production can be carried out, and the growth, quality and yield of maize seeds can be estimated by combining crop nutrient, disease and yield estimation models. However, the method still has limitations because it requires a large amount of sample data. How to use and improve this method effectively in areas lacking samples therefore needs further research.