UAV, a Farm Map, and Machine Learning Technology Convergence Classiﬁcation Method of a Corn Cultivation Area

: South Korea’s agriculture is characterized by a mixture of various cultivated crops. In such an agricultural environment, convergence technology for ICT (information, communications,


Introduction
Corn, along with rice and wheat, is one of the world's three major grains, and it is a food crop that has high productivity per unit area and is widely used as snacks, forage, starch, and cooking oil [1,2]. The consumption of corn continues to increase worldwide [3]. Production was 1135 million tons in 2019, accounting for the largest proportion of food crops [4]. However, the Korean domestic grain self-sufficiency rate was 21.7% in 2018, and corn, among major food crops, had the lowest rate at 0.8%. In addition, the import volume of corn is about 10,166,000 tons, with an import value of 2126 million U.S. dollars, which is a situation dependent on imports. The South Korean government is preparing a plan to increase the production of crops with high import dependence, such as corn, by cultivating other crops in paddy fields and improving varieties [5]. Global abnormal weather events such as those in 2020 highlight the instability of international grain prices and the importance of food security.
In particular, in South Korea, which is highly reliant on imports, it is very important to understand the current status of the cultivation area and production in order to stabilize consumer prices, make decisions on grain import policies, and efficiently inventory selfsufficiency. The crop cultivation area survey conducted in the past was carried out by selecting a sample and visiting the site in person or by conducting a cultivation intention survey by interview [6]. It is very difficult to obtain objective data because the survey results depend mainly on observations and can be subjective depending on the skill level of the investigator [7]. As an alternative to this, the research team proposed a method to check the cultivated area without directly visiting the site, and it is being applied and utilized in practice [6]. The proposed technology utilizes remote sensing technology that can acquire objective information for a large area and is being applied in various ways [8][9][10]. In particular, in the remote sensing (RS) field, unmanned aerial vehicles (UAVs) have advantages in precision, economy, and periodicity compared to satellite images, so they are attracting attention as a suitable platform for agricultural monitoring in South Korea, where various crops are grown in small areas and precipitation is concentrated in summer [11].
Land use/land cover (LULC) and crop classification are some of the most active research areas in the RS field [12][13][14][15][16][17][18]. The research results obtained through this study are useful as basic data for the calculation of cultivated areas and the monitoring of crop conditions in the agricultural field [19].
On the other hand, from 2010, research on land cover classification and crop classification that applied artificial intelligence technology increased rapidly with the development of software and hardware [20][21][22][23]. In particular, in the early stage, land cover classification was performed using satellite data such as Landsat images and various machine learning classification algorithms (ANN: artificial neural network, SVM: support vector machine, RF: random forest), and their functions were compared [24][25][26]. During this period, studies were conducted to evaluate and compare machine learning classification models for target regions of each country mainly using satellite images [27][28][29][30]. Since then, research on land use and classification methods has been continuously developed, and various application cases of object-based classification methods with GIS (Geographic Information System) have been reported [7,[31][32][33][34]. As for the cases of using UAV images, as various UAVs have been developed and distributed, the number of applications used, not only in surveying but also in agriculture, has increased. In the agricultural field, research on detecting weeds in paddy fields or classifying cultivated crops using UAV images and object-based classification techniques has begun to be actively conducted [35][36][37]. Most of these studies use SVM, RF, neural networks, Bayesian networks, and maximum likelihood classification methods to recognize plants or objects of interest [38][39][40][41][42][43][44][45][46].
In 2020, AI techniques using deep learning methods were proposed, but research focusing on leaves or individual plants has been the main focus [47]. Based on a review of various proposed studies, deep learning techniques are not applied in studies across a wide area, where mainly ANN, SVM, and RF algorithms are used, and it is suggested that the accuracy is higher than that obtained using other machine learning classification techniques [48,49].
Previous domestic studies using UAV and satellite images include studies on autumn cabbage and radish [8], onion, garlic [35], winter crops [36], and potatoes in high-altitude cabbage cultivation regions [49]. In the case of overseas regions, research is mainly conducted on major crops and in areas with social problems with high added value in each country [50][51][52].
In South Korea, there is a lack of research on corn classification, and research on corn is mainly conducted by estimating the cultivated area and production overseas based on satellite images [27,28]. In addition, most of the studies using UAVs are on a field scale of less than 20 to 200 ha, and there are very few studies on a 2000 ha-wide area [53].
In the future, as the scope of the application of UAVs is diversified, they are expected to be used in various ways in the agricultural field, and the scope of application is expected to expand. The expansion of the scope of application in agriculture requires the development of related technologies and research on how to apply them. To this end, the Korean government has developed a Farm Map and is trying to use it in various fields in agriculture [54]. The Farm Map provides important spatial information about agriculture that is updated every two years by applying satellite and aerial imagery. Although the technology using Farm Map is limited, if UAV and machine learning methods are combined, the time and effort required for image processing can be minimized [6].
Therefore, the purpose of this study is to (1) propose a UAV-based multi-spectral image acquisition and processing method for an area of 2000 ha or more, (2) apply a Farm Map and machine learning classification algorithm to identify the fields where corn is grown, and (3) estimate the corn cultivation area in the relevant area using the acquired cultivation land information.

Study Area
As shown in Figure 1, this study was conducted in Gammul-myeon (36 • 50 15 N, 127 • 52 29 E), located in the northeastern part of Goesan-gun, Chungcheongbuk-do, South Korea. The research area covers 4280 ha and consists of 7 legal districts. The western part of the region is a plain that forms agricultural land, and the southeast consists of mountainous areas where Juwolsan and Bakdalsan are located. To the northwest is the Dalcheon River flows, forming fertile agricultural land.
less than 20 to 200 ha, and there are very few studies on a 2000 ha-wide area [53].
In the future, as the scope of the application of UAVs is diversified, they are expected to be used in various ways in the agricultural field, and the scope of application is expected to expand. The expansion of the scope of application in agriculture requires the development of related technologies and research on how to apply them. To this end, the Korean government has developed a Farm Map and is trying to use it in various fields in agriculture [54]. The Farm Map provides important spatial information about agriculture that is updated every two years by applying satellite and aerial imagery. Although the technology using Farm Map is limited, if UAV and machine learning methods are combined, the time and effort required for image processing can be minimized [6].
Therefore, the purpose of this study is to (1) propose a UAV-based multi-spectral image acquisition and processing method for an area of 2000 ha or more, (2) apply a Farm Map and machine learning classification algorithm to identify the fields where corn is grown, and (3) estimate the corn cultivation area in the relevant area using the acquired cultivation land information.

Study Area
As shown in Figure 1, this study was conducted in Gammul-myeon (36°50′15″ N, 127°52′29″ E), located in the northeastern part of Goesan-gun, Chungcheongbuk-do, South Korea. The research area covers 4280 ha and consists of 7 legal districts. The western part of the region is a plain that forms agricultural land, and the southeast consists of mountainous areas where Juwolsan and Bakdalsan are located. To the northwest is the Dalcheon River flows, forming fertile agricultural land.
Goesan-gun is an area where wax corn was developed, and edible corn such as Mibaekchal and Miheukchal are mainly grown. In addition, this area was evaluated for local adaptability by selecting it as an open-field cultivation test bed for Golden Matchal, a newly developed variety developed by the RDA (Rural Development Administration, South Korea). The corn cultivation area of Gammul-myeon in Goesan-gun is 141.4 ha, based on the registration of agricultural business in 2017, and it is the region with the highest ratio of cultivated area [54].  Goesan-gun is an area where wax corn was developed, and edible corn such as Mibaekchal and Miheukchal are mainly grown. In addition, this area was evaluated for local adaptability by selecting it as an open-field cultivation test bed for Golden Matchal, a newly developed variety developed by the RDA (Rural Development Administration, South Korea). The corn cultivation area of Gammul-myeon in Goesan-gun is 141.4 ha, based on the registration of agricultural business in 2017, and it is the region with the highest ratio of cultivated area [54].

Corn Growth Schedule and Weather Conditions in Goesan-Gun
Corn has a slightly different growing period depending on the growing region and method. Corn cultivation in Goesan-gun is mainly carried out by the seedling method. Corn is sown using seedling plugs in March and is planted in April after a seedling period of about 25-30 days. As shown in Figure 2, planted corn is harvested through vegetative growth stages (VE-V9), flowering stages (VT), and reproductive growth stages (R1-R6). The cultivated corn has been shown to grow with the growth characteristics suggested in the previous research results [55,56]. Corn's vegetative stage (VE~V9) is a period in which the number of leaves and plant height growth are remarkable, and it lasts for about 60 to 70 days after planting. Corn, which has sufficiently grown leaves and plant height, begins flowering (VT) from early to mid-June. Corn that has flowered is harvested after going through reproductive stages (R1~R6) for about 30 days. The reproductive stages are the period in which leaf and plant height growth are no longer achieved and the ears mature.

Corn Growth Schedule and Weather Conditions in Goesan-Gun
Corn has a slightly different growing period depending on the growing region and method. Corn cultivation in Goesan-gun is mainly carried out by the seedling method. Corn is sown using seedling plugs in March and is planted in April after a seedling period of about 25-30 days. As shown in Figure 2, planted corn is harvested through vegetative growth stages (VE-V9), flowering stages (VT), and reproductive growth stages (R1-R6). The cultivated corn has been shown to grow with the growth characteristics suggested in the previous research results [55,56]. Corn's vegetative stage (VE~V9) is a period in which the number of leaves and plant height growth are remarkable, and it lasts for about 60 to 70 days after planting. Corn, which has sufficiently grown leaves and plant height, begins flowering (VT) from early to mid-June. Corn that has flowered is harvested after going through reproductive stages (R1~R6) for about 30 days. The reproductive stages are the period in which leaf and plant height growth are no longer achieved and the ears mature. The optimum growing temperature for corn is 20~30 °C, and as shown in Figure 3, the average temperature in Goesan-gun in early April is 5~15 °C [57]. As a result, corn planted in early April often suffers from cold damage. In addition, corn is one of the crops that causes a large decrease in farmers' income due to a decrease in harvest when rainfall is concentrated during the harvest season. The year 2020 was also greatly affected by cold weather in the early stages of cultivation and rain that continued during the harvest period ( Figure 3). The optimum growing temperature for corn is 20~30 • C, and as shown in Figure 3, the average temperature in Goesan-gun in early April is 5~15 • C [57]. As a result, corn planted in early April often suffers from cold damage. In addition, corn is one of the crops that causes a large decrease in farmers' income due to a decrease in harvest when rainfall is concentrated during the harvest season. The year 2020 was also greatly affected by cold weather in the early stages of cultivation and rain that continued during the harvest period ( Figure 3).
The UAV image acquisition was carried out for four days over a total of two corn growth stages: 8-9 May 2020, which corresponds to stage V3 in Figure 2, and 18-19 June, which corresponds to stage VT. The field survey was conducted for a total of four days from 18 to 21 June in accordance with the UAV imaging period.
The Farm Map was promoted with the goal of increasing the efficiency of policies related to agriculture by generating accurate farmland data consistent with the field, and at the same time linking agricultural and rural-related administrative information. Therefore, the Farm Map is an electronic map produced to construct high-quality farmland information that matches the actual farmland due to the discrepancy in the boundary between the South Korean cadastral map and the actual farmland [54]. The Ministry of Agriculture, Food, and Rural Affairs, South Korea, established an electronic map of agricultural land across the country that was acquired from aerial imagery from 2014 to 2016 and has been continuously updated since 2017. The current version of the Farm Map is updated every two years by dividing South Korea into eastern and western regions. Therefore, in the case of general rural areas, the Farm Map accurately reflects the current state of farmland boundaries. The UAV image acquisition was carried out for four days over a total of two corn growth stages: 8-9 May 2020, which corresponds to stage V3 in Figure 2, and 18-19 June, which corresponds to stage VT. The field survey was conducted for a total of four days from 18 to 21 June in accordance with the UAV imaging period.

Farm Map (Electronic Map of Farmland)
The Farm Map was promoted with the goal of increasing the efficiency of policies related to agriculture by generating accurate farmland data consistent with the field, and at the same time linking agricultural and rural-related administrative information. Therefore, the Farm Map is an electronic map produced to construct high-quality farmland information that matches the actual farmland due to the discrepancy in the boundary between the South Korean cadastral map and the actual farmland [54]. The Ministry of Agriculture, Food, and Rural Affairs, South Korea, established an electronic map of agricultural land across the country that was acquired from aerial imagery from 2014 to 2016 and has been continuously updated since 2017. The current version of the Farm Map is updated every two years by dividing South Korea into eastern and western regions. Therefore, in the case of general rural areas, the Farm Map accurately reflects the current state of farmland boundaries.
The initial purpose of developing the Farm Map was to provide accurate data to overcome the limitations of front-line work on the payment status and field verification, as the problem of the difference between the area under agricultural subsidy inspection and the actual area was raised in the inspection system for crops. The accuracy of the Farm Map boundary is very accurate because the farmland boundary was demarcated and divided by experts using high-resolution satellite and aerial images. Currently, it is usefully used to block the illegal receipt of agricultural subsidies. However, there are some problems in the lot number mapping on the Farm Map and the cadastral map, so the utilization rate in the field is somewhat insufficient. In addition, there is still inefficiency in the operation of geospatial data such as those of Farm Maps due to the lack of a connection base for agricultural-related geospatial data and a lack of awareness of their use.
A Farm Map is produced through the following steps: First, satellite (Kompsat, etc.) and aerial imagery data are collected. Second, image processing of the collected data is undertaken. Third, the boundaries for seven agricultural land items are demarcated.

Farm Map (Electronic Map of Farmland)
The initial purpose of developing the Farm Map was to provide accurate data to overcome the limitations of front-line work on the payment status and field verification, as the problem of the difference between the area under agricultural subsidy inspection and the actual area was raised in the inspection system for crops. The accuracy of the Farm Map boundary is very accurate because the farmland boundary was demarcated and divided by experts using high-resolution satellite and aerial images. Currently, it is usefully used to block the illegal receipt of agricultural subsidies. However, there are some problems in the lot number mapping on the Farm Map and the cadastral map, so the utilization rate in the field is somewhat insufficient. In addition, there is still inefficiency in the operation of geospatial data such as those of Farm Maps due to the lack of a connection base for agricultural-related geospatial data and a lack of awareness of their use.
A Farm Map is produced through the following steps: First, satellite (Kompsat, etc.) and aerial imagery data are collected. Second, image processing of the collected data is undertaken. Third, the boundaries for seven agricultural land items are demarcated. Fourth, the processes of classification and segmentation of the image are undertaken. Fifth, a field survey for comparison with field conditions is performed. Sixth, on-site inspection is carried out to ensure that the site conditions are properly reflected. Seventh, metadata are created so that they can be used in various ways in the agricultural field. The Farm Map is an agricultural support map optimized for agriculture by linking aerial and satellite images, which are not easy to obtain and process by ordinary people, with field conditions. As a problem, it has been pointed out that it is difficult to immediately reflect changes in the field conditions, such as agricultural land near town centers, where the update cycle of the source data, such as aerial images, rapidly changes every two years.
On the other hand, as in this study, rural areas are represented in an electronic map of farmland that can be used as farmland does not change much. The land use classification of the Farm Map is divided into seven fields: paddy fields, fields, orchards, facilities, ginseng, fallow land, and bare land. In this study, paddy fields, fields, orchards, ginseng, and fallow land were investigated, i.e., not facilities and bare land, from the land use classifications using the 2019 Farm Map, the most recent data of the Chungcheongbuk-do region ( Figure 4). satellite images, which are not easy to obtain and process by ordinary people, with field conditions. As a problem, it has been pointed out that it is difficult to immediately reflect changes in the field conditions, such as agricultural land near town centers, where the update cycle of the source data, such as aerial images, rapidly changes every two years.
On the other hand, as in this study, rural areas are represented in an electronic map of farmland that can be used as farmland does not change much. The land use classification of the Farm Map is divided into seven fields: paddy fields, fields, orchards, facilities, ginseng, fallow land, and bare land. In this study, paddy fields, fields, orchards, ginseng, and fallow land were investigated, i.e., not facilities and bare land, from the land use classifications using the 2019 Farm Map, the most recent data of the Chungcheongbuk-do region ( Figure 4).

Crop Classification Method and Algorithm
The classification method used in this study is an object-based classification method. For wide-area classification using low-resolution satellite images, pixel-based classification has been used [13,15]. However, in high-resolution images of 5 m or less, there is a limitation in classification accuracy due to the Salt and Pepper phenomenon [22]. For this reason, object-based classification is widely used for UAV or high-resolution satellite images with a spatial resolution of 5 m or less [37,53,58].
Machine learning is a branch of AI that finds rules by inputting data and obtaining expected answers from these data [59,60]. For this, it is important to learn the input data.
In the field of spatial information, machine learning is applied for classifying or recognizing objects using images and point clouds. When machine learning provides a lot of data related to a task, it can find a statistical structure in these data and create rules to automate that task.
If data on crops are accumulated in the field of spatial information and such learning data can be automatically built, the work efficiency for machine learning will be improved. In this study, SVM and RF methods were reviewed to evaluate the efficiency of object classification by applying machine learning to the spatial information field.

Crop Classification Method and Algorithm
The classification method used in this study is an object-based classification method. For wide-area classification using low-resolution satellite images, pixel-based classification has been used [13,15]. However, in high-resolution images of 5 m or less, there is a limitation in classification accuracy due to the Salt and Pepper phenomenon [22]. For this reason, object-based classification is widely used for UAV or high-resolution satellite images with a spatial resolution of 5 m or less [37,53,58].
Machine learning is a branch of AI that finds rules by inputting data and obtaining expected answers from these data [59,60]. For this, it is important to learn the input data. In the field of spatial information, machine learning is applied for classifying or recognizing objects using images and point clouds. When machine learning provides a lot of data related to a task, it can find a statistical structure in these data and create rules to automate that task.
If data on crops are accumulated in the field of spatial information and such learning data can be automatically built, the work efficiency for machine learning will be improved. In this study, SVM and RF methods were reviewed to evaluate the efficiency of object classification by applying machine learning to the spatial information field.
SVM is an algorithm to find the optimal linear decision boundary that classifies classification items using the concepts of support vectors and margins [61,62]. Here, the support vector and margin mean the distance between the support vector and the decision boundary, that is, the data closest to the decision boundary. SVM is an algorithm that finds the optimal decision boundary that maximizes the margin using this principle. However, when the input data are difficult to distinguish with a linear decision boundary, the kernel method can be used in SVM. The kernel trick is a method to find the optimal decision boundary in the projected multidimensional space by mapping the dimension of the input data to the multidimensional space [29]. Here, various nonlinear kernel functions can be used. Among them, a polynomial kernel and a radial basis function (RBF) kernel are representative [30]. In this study, the RBF kernel was selected as the kernel function. The next step was to select hyperparameters. The hyperparameter has a regulatory parameter (C) and gamma, which is the inverse of the kernel width [43]. C serves to limit the importance of each training material. The larger the C value, the more non-linear the decision boundary becomes for classifying the classification items. Gamma defines the range of influence that training data can have. The larger the gamma and the narrower the kernel width, the more complex the model.
RF is an ensemble learning approach developed by Breiman [63] and maximizes algorithm performance by collecting several decision tree algorithm results, which are basic components, compositing them into one result [64,65]. The final class decision is a method of weighted voting for the discriminant classes of the generated decision tree. Since RF iteratively constructs independent decision trees through maximum randomness when selecting samples and variables for each model, it is possible to reduce the prediction error by lowering the variance while maintaining the low bias of the decision tree [47,66]. In addition, RF is stable even for high-dimensional data including multiple explanatory variables because it considers the interaction and nonlinearity between explanatory counts.
The hyperparameters to be considered in the RF algorithm are mtry and n.tree. Here, mtry is the number of randomly sampled variables in each partition, and n.tree is the number of decision trees. If the number of mtry is reduced, the calculation speed increases, but the correlation between the two trees and the strength of all trees in the forest decrease, which has a complicated effect on the classification accuracy [66]. It is known that n.tree has less influence on classification results than mtry [52].

Verification of the Accuracy of the Classification Results
The accuracy evaluation of the SVM and RF classification algorithms was evaluated by creating an error matrix by comparing it with field survey data collected at the time of image acquisition. The error matrix is evaluated with five indicators for LULC classification results [25]. Evaluation indicators were compared and evaluated by expressing the overall accuracy (OA), user's accuracy (UA), producer's accuracy (PA), Kappa coefficient, and the F-measure. The evaluation of these indicators was performed by preparing an error matrix, as shown in Table 1. In Table 1, Type A and B indicate classification items. In this formula, a is the number of rows and columns in the error matrix, n is the number of observations in the error matrix, a ij is a major diagonal element for class i, a i+ is the total number of observations in row i (right margin), and a +j is the total number of observations in column j (bottom margin).

Research Progression and Method
This study was conducted using the following steps and methods.
(1) Image acquisition equipment: The UAV used to acquire wide-area images was a fixed-wing eBee Plus (Sensefly, Cheseaux-sur-Lausanne, Switzerland) ( Figure 5). The UAV-mounted sensor used an RGB (16 MP) camera and a multi-spectral sensor (Sequoia+, Parrot, Paris, France) consisting of four spectral bands (12 MP): Green, Red, Red-Edge (RE), and Near Infrared (NIR). The Sequoia+ sensor collects the amount of light in real time by using the light sensor located on the top, and the light information is inputted into every single image and corrected.
reflectance and NDVI (Normalized Difference Vegetation Index) of 10 variables in the two periods. NDVI was calculated using Equation (1): Here, NIR is the reflectance of the UAV image near infrared band, and RED is the reflectance of the Red band.
(5) The dependent variable was based on whether or not corn was cultivated. The constructed variables were divided into training data and test data. For training data, 30% of the total data were randomly selected. The remaining 70% were used as test data. (6) Crop classification: In this study, an object-based classification method based on the Farm Map was applied. In object-based classification, image segmentation precedes object classification. The segmentation operation creates a region based on the similarity of texture, distance, and spacing of neighboring pixels. In order to apply this method to each site, the optimal division operation is completed through trial and error, and a lot of time and effort are required. Therefore, in this study, the image segmentation task was omitted by setting the boundary of the Farm Map as one object and using it for classification in order to efficiently utilize the high-capacity UAV data. Figure 5. Configuration of fixed-wing UAVs (eBee Plus, Sensefly) and onboard devices used to collect imagery in a wide area. Figure 5. Configuration of fixed-wing UAVs (eBee Plus, Sensefly) and onboard devices used to collect imagery in a wide area.
(2) UAV image acquisition plan: For the UAV imaging plan, an area of 2520 ha was planned using eMotion 3 (Sensefly, Cheseaux-sur-Lausanne, Switzerland) software, as shown in Figure 6, excluding mountainous and non-agricultural land. In consideration of the battery capacity, the flight time was set to not exceed 40 min at a time, and a total of 18 routes were set to acquire the entire area of the farmland. In Figure 6, the path in the orange box is the path taken on Day 1 of image acquisition, and the blue box is the path progression on Day 2. The numbers in the boxes are numbered arbitrarily, and the number in parentheses is the flight time. The time required for acquiring images in a wide area was two days for each acquisition, and it was performed once in May and June, taking a total of 4 days. The image acquisition altitude was 110 m, and a spatial resolution of 10 cm was set based on the multi-spectral band. The collected single images were produced as one reflectance image for each band using the Pix4D Mapper (Sensefly, Cheseaux-sur-Lausanne, Switzerland) program.
The production time was about 26 h at a time in the hardware environment of Table 2. Radiation correction of each image was performed using the real-time light amount collected by the optical sensor in Figure 5 and the calibration panel during the image processing process.
Here, NIR is the reflectance of the UAV image near infrared band, and RED is the reflectance of the Red band. (7) Accuracy evaluation: The UAV multispectral image consists of four bands, and when time series images are analyzed, 4*n high-dimensional data are generated. In this study, the mutual accuracy was compared by applying the SVM and RF algorithms with high interpretability in high-dimensional data. For crop classification, the Farm Map boundary was set as one object, and the SVM and RF algorithms were used to classify corn and other crops. The classified results were evaluated for accuracy using an error matrix. (8) Analysis tools: ArcGIS Pro software version 2.5 was used for explanation and input of dependent variables as well as visualization of image classification results. All procedures of data preprocessing and classification were performed using RStudio, an open-source program. The SVM and RF models were built using the R packages "Caret" and "randomForest". Figure 8 shows the schematic of the whole process of the study. The composition of the research process system was largely divided into data acquisition, image pre-processing, image classification, and cultivation area estimation steps to calculate the total area of corn cultivation in Gammul-myeon, South Korea. (5) The dependent variable was based on whether or not corn was cultivated. The constructed variables were divided into training data and test data. For training data, 30% of the total data were randomly selected. The remaining 70% were used as test data. (6) Crop classification: In this study, an object-based classification method based on the Farm Map was applied. In object-based classification, image segmentation precedes object classification. The segmentation operation creates a region based on the similarity of texture, distance, and spacing of neighboring pixels. In order to apply this method to each site, the optimal division operation is completed through trial and error, and a lot of time and effort are required. Therefore, in this study, the image segmentation task was omitted by setting the boundary of the Farm Map as one object and using it for classification in order to efficiently utilize the high-capacity UAV data. (7) Accuracy evaluation: The UAV multispectral image consists of four bands, and when time series images are analyzed, 4*n high-dimensional data are generated. In this study, the mutual accuracy was compared by applying the SVM and RF algorithms with high interpretability in high-dimensional data. For crop classification, the Farm Map boundary was set as one object, and the SVM and RF algorithms were used to classify corn and other crops. The classified results were evaluated for accuracy using an error matrix. (8) Analysis tools: ArcGIS Pro software version 2.5 was used for explanation and input of dependent variables as well as visualization of image classification results. All procedures of data preprocessing and classification were performed using RStudio, an open-source program. The SVM and RF models were built using the R packages "Caret" and "randomForest". Figure 8 shows the schematic of the whole process of the study. The composition of the research process system was largely divided into data acquisition, image pre-processing, image classification, and cultivation area estimation steps to calculate the total area of corn cultivation in Gammul-myeon, South Korea.

UAV Orthoimage and Field Survey
Image acquisition using UAV was performed twice, in May ( Figure 9a) and in June (Figure 9b), targeting agricultural land, except for mountainous areas in Gammul-myeon, Goesan-gun, Chungcheongbuk-do, South Korea, as shown in Figure 9. The area of the obtained image was 2721 ha in May and 2680 ha in June, which corresponds to about 63% of the total area of Gammul-myeon. The numbers of images were 48,564 and 47,858 in May and June, respectively. As a result of matching based on the image acquisition period, the acquired image capacities were 14.8 and 14.7 GB, respectively. Table 3 summarizes the data acquisition area, number of shots, and orthoimage capacity of UAV images for each image acquisition date.

UAV Orthoimage and Field Survey
Image acquisition using UAV was performed twice, in May ( Figure 9a) and in June (Figure 9b), targeting agricultural land, except for mountainous areas in Gammul-myeon, Goesan-gun, Chungcheongbuk-do, South Korea, as shown in Figure 9. The area of the obtained image was 2721 ha in May and 2680 ha in June, which corresponds to about 63% of the total area of Gammul-myeon. The numbers of images were 48,564 and 47,858 in May and June, respectively. As a result of matching based on the image acquisition period, the acquired image capacities were 14.8 and 14.7 GB, respectively. Table 3 summarizes the data acquisition area, number of shots, and orthoimage capacity of UAV images for each image acquisition date. 52 was used to unify the UAV coordinate system with the Farm Map coordinate system. As shown in Figure 10, the Farm Map showed high accuracy because the study area was mostly composed of agricultural areas.
The total number of lots in Gammul-myeon is 5827, based on the Farm Map. The number of lots included in the UAV image is 5500, which corresponds to 94.4% of the total agricultural land area. The field survey was conducted on 5500 lots included in the UAV image. The number of corn cultivation plots surveyed in the field was 582, and the total of other crops was 4918. The cultivated area was calculated using ArcGIS Pro based on the farmland division of the Farm Map. As a result of the calculation, the total agricultural land area was 855.99 ha, the cultivated area of corn was 106.94 ha, and the cultivated area of other crops was 749.05 ha. The corn cultivation area accounted for 12.5% of the total agricultural land area (Table 4).   South Korea uses cadastral maps to classify land locations, lot numbers, ground points, and boundaries. However, there is a problem in that the cadastral map does not accurately indicate the relationship between agricultural boundaries and land ownership in agricultural areas. Therefore, farmers in many agricultural areas have demanded solutions to these problems. Accordingly, the South Korean government has produced and distributed agricultural land maps for crop-growing areas that are not related to land ownership, apart from the cadastral map of cultivated land [54]. This map is called a Farm Map and is currently being used for agricultural statistics and policy operation [6].
The Farm Map was used because it accurately distinguishes the boundaries of farmland using aerial and satellite images. The Farm Map consists of a PNU code, lot center coordinates, read code, readjustment properties of arable land, and lot area as the major constituent items. First, it is necessary to integrate the coordinate system of the image data acquired by the UAV and the Farm Map. Therefore, in this study, WGS 1984 UTM Zone 52 was used to unify the UAV coordinate system with the Farm Map coordinate system. As shown in Figure 10, the Farm Map showed high accuracy because the study area was mostly composed of agricultural areas.     The total number of lots in Gammul-myeon is 5827, based on the Farm Map. The number of lots included in the UAV image is 5500, which corresponds to 94.4% of the total agricultural land area. The field survey was conducted on 5500 lots included in the UAV image. The number of corn cultivation plots surveyed in the field was 582, and the total of other crops was 4918. The cultivated area was calculated using ArcGIS Pro based on the farmland division of the Farm Map. As a result of the calculation, the total agricultural land area was 855.99 ha, the cultivated area of corn was 106.94 ha, and the cultivated area of other crops was 749.05 ha. The corn cultivation area accounted for 12.5% of the total agricultural land area (Table 4).

Data Preprocessing and Hyperparameter Tuning
For data preprocessing, the reflectance and vegetation index (NDVI) corresponding to each lot were extracted using multispectral images of the two periods and Farm Map. The pre-processed data were displayed in 12 rows and 5501 columns consisting of a lot number for parcel classification, 10 explanatory variables, and independent variables. For the preprocessed data, the training data were set before applying the classification algorithm. Of the total 5500 lots, 1650 lots, which accounted for 30% of the training data, were randomly selected. A total of 30% of the training data were used to tune the optimal hyperparameters. The most representative tuning parameter setting method is a grid search. A grid search is a function that creates a model for each parameter and finds the optimal parameter, and it is a method of searching including all combinations of each parameter. The optimal combination of each parameter was determined through 5-fold cross-validation of training data.
In the SVM algorithm, the optimal hyperparameters were selected using a grid search method, such as C and Gamma. As a result of the analysis, as shown in Figure 11a, C was set to 1 and Gamma was set to 0.25, and the accuracy of the training model was 0.955 at this time. The hyperparameter of RF was searched by using mtry and n.Tree. n.Tree was set to 500, which is the default value of R package "random Forest", and mtry was searched by a grid search method. The applied n.Tree was set to 500, mtry was set to 5, and the accuracy of the training model was about 0.958 (Figure 11b).

Data Preprocessing and Hyperparameter Tuning
For data preprocessing, the reflectance and vegetation index (NDVI) corresponding to each lot were extracted using multispectral images of the two periods and Farm Map. The pre-processed data were displayed in 12 rows and 5501 columns consisting of a lot number for parcel classification, 10 explanatory variables, and independent variables. For the preprocessed data, the training data were set before applying the classification algorithm. Of the total 5500 lots, 1650 lots, which accounted for 30% of the training data, were randomly selected. A total of 30% of the training data were used to tune the optimal hyperparameters. The most representative tuning parameter setting method is a grid search. A grid search is a function that creates a model for each parameter and finds the optimal parameter, and it is a method of searching including all combinations of each parameter. The optimal combination of each parameter was determined through 5-fold cross-validation of training data.
In the SVM algorithm, the optimal hyperparameters were selected using a grid search method, such as C and Gamma. As a result of the analysis, as shown in Figure 11a, C was set to 1 and Gamma was set to 0.25, and the accuracy of the training model was 0.955 at this time. The hyperparameter of RF was searched by using mtry and n.Tree. n.Tree was set to 500, which is the default value of R package "random Forest", and mtry was searched by a grid search method. The applied n.Tree was set to 500, mtry was set to 5, and the accuracy of the training model was about 0.958 (Figure 11b).

Crop Classification Results and Accuracy Verification
The classification of cultivated crops in the study area was performed using Farm Map-based SVM and RF algorithms. In general, classification items are used for cultivated crops grown during the observation period. However, since this study was aimed at estimating the cultivated area of corn, the classification items were set by dividing them into corn and other crops. Figure 12 shows the classification results of corn cultivation areas obtained by each algorithm. The error matrix was used to evaluate the accuracy of the classified results using the Farm Map-based SVM and RF algorithms. Accuracy evaluation was conducted by comparing the image classification results and field survey data. The

Crop Classification Results and Accuracy Verification
The classification of cultivated crops in the study area was performed using Farm Map-based SVM and RF algorithms. In general, classification items are used for cultivated crops grown during the observation period. However, since this study was aimed at estimating the cultivated area of corn, the classification items were set by dividing them into corn and other crops. Figure 12 shows the classification results of corn cultivation areas obtained by each algorithm. The error matrix was used to evaluate the accuracy of the classified results using the Farm Map-based SVM and RF algorithms. Accuracy evaluation was conducted by comparing the image classification results and field survey data. The verification items were set as corn and other crops as the classification items. Accuracy evaluation was performed on 5500 parcels acquired in UAV images. As a result of classification using the Farm Map-based SVM algorithm, corn was classified as 437 lots and other crops as 4820 lots, so a total of 5257 lots were exactly the same. The overall accuracy obtained by the SVM algorithm was 95.88%. The classification accuracy of corn, i.e., the purpose of this study, was 81.68% for PA and 75.09% for UA. The Kappa coefficient was 0.7777 and the F-measure was 0.78 (Table 5).   As a result of classification using the Farm Map-based SVM algorithm, corn was classified as 437 lots and other crops as 4820 lots, so a total of 5257 lots were exactly the same. The overall accuracy obtained by the SVM algorithm was 95.88%. The classification accuracy of corn, i.e., the purpose of this study, was 81.68% for PA and 75.09% for UA. The Kappa coefficient was 0.7777 and the F-measure was 0.78 ( Table 5).
The classification result obtained by applying the Farm Map-based RF algorithm was 537 lots of corn and 4899 lots of other crops, and a total of 5436 lots were exactly the same. The overall accuracy of RF algorithm application was 98.84% higher than that of SVM application. In the classification accuracy of corn, PA was 96.58%, UA was 92.27%, the Kappa coefficient was 0.94, and the F-measure was 0.94, showing a higher classification performance compared to that of the SVM algorithm (Table 6). To calculate the total corn cultivation area of Gammul-myeon, the results of classification using two algorithms, SVM and RF, were used. The result of calculating the cultivated area using the Farm Map-based SVM algorithm was 96.54 ha, which was estimated to be 90.27% of the actual corn cultivation area. The result of calculating the cultivated area by applying the Farm Map-based RF algorithm was 98.77 ha, which was estimated to be 92.36% of the actual corn cultivation area (Table 7). In this way, it can be seen that when estimating the cultivation area for a wide area, some errors are included depending on the applied algorithm. In both algorithms, RF was found to be higher in accuracy than SVM, and, in terms of processing speed, SVM was found to be slightly faster than RF. Considering the recent computer processing power, there is no significant difference in processing speed and time, so it will be necessary to increase the efficiency and to maximize the learning of wide-area images for machine learning to secure accuracy.

Crop Classification Methodology
The overall accuracy of the classification results using the Farm Map-based SVM and RF algorithms used in this study did not differ significantly, but there was a slight difference in the classification accuracy of corn. The reason is the difference between the total number of corn items and the total number of other items. There was an exact seven-fold difference between the number of non-corn and corn items, and even if the classification accuracy of corn was lower than others, it did not significantly affect the overall accuracy. Therefore, it would be appropriate to judge the accuracy of this study based on the classification accuracy of corn. The corn classification results of the two algorithms used differed by about 15% in UA and 17% in PA, confirming that the RF algorithm was more suitable for corn classification. For the computational speed of both algorithms, SVM took about 10 min and RF took about 60 min. As such, it was confirmed that the most important factor in classifying the research target crop in the agricultural field is that it is more accurate to use the classification accuracy for the target crop rather than the overall classification accuracy.
In addition, it can be seen that the time required for classification differs depending on the algorithm, but is short when the investigation time is taken into consideration. Therefore, in consideration of the development of the computer processing speed, etc., the acquisition of learning data that can be applied to AI application algorithms and learning about them are considered to be important to further increase accuracy.

Comparison between the Existing Image Segmentation Method and the Method Using Farm Map Data
The Farm Map-based method was compared with the image segmentation method used in the existing object-based classification method to verify the efficiency. In general, the image segmentation algorithm used in wide-area satellite images utilizes the region growing technique, which is a technique for expanding an area with one pixel. The region growing technique creates one region based on the similarity of spacing, texture, and distance in pixels around one pixel, and when it exceeds the threshold value of a set parameter, it is set as another object. Parameters for object segmentation include segmentation scale, spatial information (shape), spectral information (color), compactness, and smoothness. In general parameter threshold selection, a trial-and-error method is used to find the optimal threshold value while changing parameter values. The optimal parameter settings were determined to be scale 700, shape 0.1, color 0.9, compactness 0.5, and smoothness 0.5. The segmentation result is presented in Figure 13 for comparison with the boundary data of the Farm Map. Image segmentation techniques are generally very complex and time-consuming [42]. The image segmentation of the UAV image in the study area took about 18 h to perform once. The optimal threshold takes a lot of time because of trial and error while changing the parameter values in many cases.
Agronomy 2021, 11, x FOR PEER REVIEW 18 of 22 about 10 min and RF took about 60 min. As such, it was confirmed that the most important factor in classifying the research target crop in the agricultural field is that it is more accurate to use the classification accuracy for the target crop rather than the overall classification accuracy.
In addition, it can be seen that the time required for classification differs depending on the algorithm, but is short when the investigation time is taken into consideration. Therefore, in consideration of the development of the computer processing speed, etc., the acquisition of learning data that can be applied to AI application algorithms and learning about them are considered to be important to further increase accuracy.

Comparison between the Existing Image Segmentation Method and the Method Using Farm Map Data
The Farm Map-based method was compared with the image segmentation method used in the existing object-based classification method to verify the efficiency. In general, the image segmentation algorithm used in wide-area satellite images utilizes the region growing technique, which is a technique for expanding an area with one pixel. The region growing technique creates one region based on the similarity of spacing, texture, and distance in pixels around one pixel, and when it exceeds the threshold value of a set parameter, it is set as another object. Parameters for object segmentation include segmentation scale, spatial information (shape), spectral information (color), compactness, and smoothness. In general parameter threshold selection, a trial-and-error method is used to find the optimal threshold value while changing parameter values. The optimal parameter settings were determined to be scale 700, shape 0.1, color 0.9, compactness 0.5, and smoothness 0.5. The segmentation result is presented in Figure 13 for comparison with the boundary data of the Farm Map. Image segmentation techniques are generally very complex and time-consuming [42]. The image segmentation of the UAV image in the study area took about 18 h to perform once. The optimal threshold takes a lot of time because of trial and error while changing the parameter values in many cases. On the other hand, the Farm Map has the advantage of reducing the processing time because EPIS (Korea Agency of Education, Promotion and Information Service in Food, Agriculture, Forestry and Fishery) provides various types of information based on satellite and aerial images by combining them [54]. This study was able to minimize the time required for image segmentation by using the open source provided by EPIS.
In addition, if the Farm Map is not used, the time and effort required in the process of image segmentation must be put into operation. In the general method, other land cover On the other hand, the Farm Map has the advantage of reducing the processing time because EPIS (Korea Agency of Education, Promotion and Information Service in Food, Agriculture, Forestry and Fishery) provides various types of information based on satellite and aerial images by combining them [54]. This study was able to minimize the time required for image segmentation by using the open source provided by EPIS.
In addition, if the Farm Map is not used, the time and effort required in the process of image segmentation must be put into operation. In the general method, other land cover information such as forests, downtown areas, roads, and water is included in addition to agricultural land, so cover classification for agricultural land and non-agricultural land must be performed separately and removed. However, when using the Farm Map, land cover other than agricultural land is excluded from the classification items in advance, thereby minimizing the possibility of misclassification and increasing the classification accuracy and data processing efficiency.

Conclusions
In this study, the vegetative (V3) and flowering (VT) stages of corn were selected for Gammul-myeon, Goesan-gun, Chungcheongbuk-do, South Korea, and the cultivation area of corn was estimated using UAV imaging and Farm Map-based machine learning techniques.
The conclusions obtained in this study can be summarized as follows: 1.
Existing UAV images were used for a narrow area of a lot unit (2 ha) or a field unit (20 ha), but in this study, a wide area of about 2700 ha was studied. It took about 4 days to acquire the UAV imagery, 36 flights, and about 52 h to produce the reflectance image.

2.
As a result of classification using the Farm Map-based SVM algorithm for wide-area images, we found 81.68% for PA, 75.09% for UA, 0.77 for the Kappa coefficient, and 0.78 for the F-measure. In particular, the results of the classification of the Farm Map-based RF algorithm showed a PA of 96.58%, UA of 92.27%, a Kappa coefficient of 0.94, and the F-measure of 0.94, with higher accuracy than SVM.

3.
As a result of applying the SVM algorithm, the corn cultivation area was estimated to be 96.54 ha, showing an accuracy of 90.27%. The RF algorithm estimated the corn cultivation area to be 98.77 ha and showed higher accuracy than SVM at 92.36%. 4.
In the task of classifying and dividing the cultivated area of crops, as a result of performing object-based classification using the Farm Map, it was found that the work efficiency, accuracy improvement, and processing time reduction were found to be very effective compared to the existing object division method. In addition, the Farm Map-based method was able to increase the efficiency of data processing by minimizing the possibility of misclassification of farmland and cultivated crops. As a result, it was found that the identification of the cultivated area of crops is a method that enables rapid and reliable analysis when using UAV images, Farm Map, and machine learning convergence.
The Farm Map is an agricultural land information map specifically produced for agriculture by the South Korean government. Although the information provided here may not immediately apply changes depending on the agricultural area, most agricultural areas do not show any difference due to the update every two years. In this way, if the boundary and area information of farmland directly digitized through visual inspection and field inspection using aerial and satellite images are effectively utilized in related fields, this will be useful for saving time and for management in agricultural fields.
In particular, since the agricultural field requires a lot of effort and time according to environmental changes, it will be possible to create a more efficient working environment, classify crops, and derive cultivation areas by using public data and machine learning algorithms. In the future, it is necessary to improve the accuracy through acquisition and learning of large amounts of data to proceed and apply AI.