A New Identiﬁcation Method for Surface Cracks from UAV Images Based on Machine Learning in Coal Mining Areas

: Obtaining real-time, objective, and high-precision distribution information of surface cracks in mining areas is the ﬁrst task for studying the development regularity of surface cracks and evaluating the risk. The complex geological environment in the mining area leads to low accuracy and e ﬃ ciency of the existing extracting cracks methods from unmanned air vehicle (UAV) images. Therefore, this manuscript proposes a new identiﬁcation method of surface cracks from UAV images based on machine learning in coal mining areas. First, the acquired UAV image is cut into small sub-images, and divided into four datasets according to the characteristics of background information: Bright Ground, Dark Dround, Withered Vegetation, and Green Vegetation. Then, for each dataset, a training sample is established with cracks and no cracks as labels and the RGB (red, green, and blue) three-band value of the sub-image as feature. Finally, the best machine learning algorithms, dimensionality reduction methods and image processing techniques are obtained through comparative analysis. The results show that using the V-SVM (Support vector machine with V as penalty function) machine learning algorithm, principal component analysis (PCA) to reduce the full features to 95% of the original variance, and image color enhancement by Laplace sharpening, the overall accuracy could reach 88.99%. This proves that the method proposed in this manuscript can achieve high-precision crack extraction from UAV image.


Introduction
In western China, especially in the sandy areas, surface cracks are one of the geological environmental problems caused by coal mining [1]. Surface cracks have caused deformation of buildings, damage to underground pipelines, damage to cultivated land, accelerated soil moisture evaporation, vegetation destruction, and soil erosion [2][3][4]. This creates considerable difficulties for mining area management staff. Therefore, it is necessary to obtain real-time, objective, and high-precision distribution information of surface cracks in mining areas, which can be used to study the development regularity of surface cracks and evaluate the risk to provide guarantees for land reclamation [5].
Traditional surface crack information acquisition methods mainly include field surveys, radar detection technology [6,7], and satellite remote sensing images [8]. Although field survey accuracy is high, the cost is expensive [9]. Airborne radar technology is used in landslide monitoring and surface crack detection [10]. However, due to the complex geological environment in the mining area, the application conditions of using radar to detect ground crack are limited. Mining caused the surface to collapse, making airborne radar unable to cover the entire area. Crack extraction through images can effectively extract cracks. Mohan summarized some crack detection techniques based on the type of the image used, including camera image, IR (infrared) image, Ultrasonic Image, Time of Flight Diffraction image, Laser image, and various other distinctive image types through a review [11]. Satellite remote sensing image is a method of extracting cracks from images, but the resolution of satellite remote sensing is difficult to extract small cracks. Unmanned air vehicles (UAVs) have significant advantages, such as high resolution, flexible maneuverability, high efficiency, and low operating costs [12]. Their resolution can reach the level of centimeters [13], which provides an ideal data source for information extraction of surface cracks in mining areas. At present, the methods for extracting surface cracks from UAV image data are mainly object-oriented [13,14], edge detection [15], threshold segmentation [16], and artificial visual interpretation [9].
The object-oriented method achieved good results in surface crack extraction, but its class and inheritance characteristics led to many more pointer operations in the program for locating the function entry and maintaining additional virtual method tables and other additional work causes the program's processing efficiency to be relatively low. The object-oriented method for surface crack extraction needs to be divided into multiple steps. First, spectral feature extraction must be performed, then whether the extraction results satisfies geometric features, linear features, and fractal dimension features in turn. It is difficult to distinguish because the spectral color characteristics of surface cracks and withered vegetation on the ground are similar. At the same time, stepwise detection also requires a lot of additional work, resulting in low efficiency [14]. Methods such as edge detection and threshold segmentation can cause many error points, and the accuracy of crack extraction is poor, which affects the extraction effect of surface cracks. Edge detection is used to identify the characteristic points with obvious brightness changes in the image. Because the surface of the mining area contains a lot of vegetation, the edge detection method will also extract the contour of the vegetation to appear as error points, which makes the cracks extraction have low efficiency [16]. The threshold segmentation method is to extract the ground crack pixel gray value range to extract it. Because the spectral color characteristics of surface crack and withered vegetation on the ground are similar, many error points will appear in the extraction result [17]. The artificial visual interpretation method requires workers to manually process each image [9]. This method is too complicated and has low efficiency and poor timeliness, so it is not popular or practical. Machine learning has been widely used in the field of graphic recognition. It can not only improve accuracy but also greatly improve efficiency. Deep learning techniques for feature extraction of image have been applied over a wide range of applications through UAV image [18]. Fei Y. uses deep learning to detect cracks in 3D asphalt pavement images [19]. N. Ammour using convolutional neuron network(CNN) and SVM for car detection and counting, which is superior in accuracy and computational time [20]. Zeggada, proposing a novel method based on convolutional neural networks to solve problem of multilabeling UAV image, typically characterized by a high level of information content [21]. However, the complexity of land surface information in mining areas has limited its application. Therefore, finding a method to reasonably apply machine learning to the extraction of surface cracks in mining areas is a key requirement.
To solve the problems mentioned above, this article provides a method for detecting surface cracks in areas with complex geological environments, such as mining areas, using UAV images as data sources; machine learning as technical means; and optimizing the best machines learning algorithms, dimensionality reduction methods, and image processing methods through comparative analysis. The detailed research method idea is introduced into the second part. This method can effectively Remote Sens. 2020, 12, 1571 3 of 20 reduce the interference from complex surface environments such as vegetation on the extraction of cracks.
The rest of the article is organized as follows. Section 2 introduces the materials and methods, Section 3 shows the experimental results, and Section 4 discusses and concludes the study in the final section.

Data Source
The research area is Yulin, Shaanxi Province, China. The research object is the information about the surface cracks in the sandy area. The research data are UAV images. The parameter information of UAV image data is shown in Table 1. Figure 1 shows the geographical location of Yulin, Shaanxi Province, China. Figure 2 shows four UAV image datasets.

Data Source
The research area is Yulin, Shaanxi Province, China. The research object is the information about the surface cracks in the sandy area. The research data are UAV images. The parameter information of UAV image data is shown in Table 1. Figure 1 shows the geographical location of Yulin, Shaanxi Province, China. Figure 2 shows four UAV image datasets.   As SVM, RF, and KNN are all supervised learning algorithms, it is necessary to build UAV image datasets for model construction. Four UAV images were cut into 50 × 50 pixel images with MATLAB, and 795 crack images and some no crack images were obtained. The background information of the UAV image refers to the land surface information mainly included in the image

Data Source
The research area is Yulin, Shaanxi Province, China. The research object is the information about the surface cracks in the sandy area. The research data are UAV images. The parameter information of UAV image data is shown in Table 1. Figure 1 shows the geographical location of Yulin, Shaanxi Province, China. Figure 2 shows four UAV image datasets.   As SVM, RF, and KNN are all supervised learning algorithms, it is necessary to build UAV image datasets for model construction. Four UAV images were cut into 50 × 50 pixel images with MATLAB, and 795 crack images and some no crack images were obtained. The background information of the UAV image refers to the land surface information mainly included in the image except for the surface cracks, which mainly include bare land and vegetation. Because background  and 795 crack images and some no crack images were obtained. The background information of the UAV image refers to the land surface information mainly included in the image except for the surface cracks, which mainly include bare land and vegetation. Because background information can interfere with the classification results of machine learning, to improve classification accuracy, datasets of UAV images must be constructed reasonably.
This article proposes a method based on cluster analysis [22] and the characteristics of the background information of the UAV images. The clustering method is shown in Figure 3. First, obtain the percentage of vegetation area of each image through Normalized Difference Vegetation Index (NDVI). When it exceeds 10%, divide this image into Vegetation dataset; otherwise, it is Bare Ground dataset. Then, for the images in the Vegetation dataset, obtain the percentage of the green vegetation area of each image through the RGB band. When it exceeds 10%, divide this image into the Green Vegetation dataset; otherwise, the Withered Vegetation dataset. Next, for the image of the Bare Ground dataset, convert the image into grayscale image and obtain its grayscale average value. When this value exceeds 168, divide the image into a Bright Ground dataset; otherwise, it is a Dark Ground dataset. Finally, select the same number of crack and no crack images with the same background information to construct four datasets, respectively: Green Vegetation dataset, Withered Vegetation, Bright Ground dataset, and Dark Ground dataset.
Example: Figure 4 is the classification process of one image when constructing dataset, where Figure 4a is a UAV image, Figure 4b is a vegetation image extracted through NDVI, and Figure 4c is a green vegetation image extracted through RGB. First, the percentage of vegetation area obtained through NDVI for this image is 44.2% as shown in Figure 4b, 44.2% > 10%, so this image is divided into Vegetation dataset. Then, the percentage of green vegetation area obtained through the RGB band for this image is 4.3% as shown in Figure 4c, 4.3% < 10%, so this image is divided into Withered Vegetation dataset. All other images are divided by this method, and finally four datasets are obtained: Green Vegetation dataset, Withered Vegetation, Bright Ground dataset, and Dark Ground dataset. Figure 5 is the schematic diagram of UAV images with cracks and no cracks in the four datasets.
It needs to be specifically explained that 10% and 168 both are thresholds, the 10% threshold is vegetation area percentage (vegetation area divided by image area), and the 168 threshold is image grayscale average. The first image classification according to whether its vegetation area percentage exceeds 10% threshold. Further, the second image classification according to whether its green vegetation area percentage exceeds 10% threshold for Vegetation dataset or whether its image grayscale average exceeds the 168 threshold for Bare Ground dataset. These thresholds are empirical parameters and only applicable to this research area. When using UAV images from different regions as research objects, researchers need to further determine the thresholds reasonably.
The training samples are established with cracks and no cracks as labels, and the same number of no crack images with the same background information is combined with crack images. The number of training samples for the four datasets is shown in                 This article proposes a new identification method for surface cracks from UAV images based on machine learning in coal mining areas. As shown in Figure 6, first, the UAV image data are acquired. Second, the images were cut into 50 × 50 pixel images with MATLAB. Third, we build Remote Sens. 2020, 12, 1571 6 of 20 four types of datasets based on characteristics of the UAV image background information: bright ground, dark ground, withered vegetation, and green vegetation. Fourth, for each type of dataset, training sample is established with cracks and no cracks as labels and features values number is 7500. Finally, the best machine learning algorithms, dimensionality reduction methods, and image processing technology are selected through comparative analysis. The validation methods are leave-one-out cross-validation and permutation tests. The classification accuracy and Area Under Curve (AUC) values are evaluation indicators. Result 1 is the classification result of three machine learning algorithms, from which the best machine learning algorithm is selected. Result 2 is the classification result using two dimensionality reduction methods. Result 3 is the classification result using two image enhancement methods to process images. This article proposes a new identification method for surface cracks from UAV images based on machine learning in coal mining areas. As shown in Figure 6, first, the UAV image data are acquired. Second, the images were cut into 50 × 50 pixel images with MATLAB. Third, we build four types of datasets based on characteristics of the UAV image background information: bright ground, dark ground, withered vegetation, and green vegetation. Fourth, for each type of dataset, training sample is established with cracks and no cracks as labels and features values number is 7500. Finally, the best machine learning algorithms, dimensionality reduction methods, and image processing technology are selected through comparative analysis. The validation methods are leave-one-out cross-validation and permutation tests. The classification accuracy and Area Under Curve (AUC) values are evaluation indicators. Result 1 is the classification result of three machine learning algorithms, from which the best machine learning algorithm is selected. Result 2 is the classification result using two dimensionality reduction methods. Result 3 is the classification result using two image enhancement methods to process images. The leave-one-out cross-validation is used to divide a large dataset into k small datasets. Then, k-1 is used as the training set, and the other is used as the test set. Then, select the next one as the test set, and the remaining k-1 as the training set. By analogy, the k classification accuracy is obtained and the average value is taken as the final classification accuracy of the dataset.

Machine Learning Methods
Machine learning is a common research hotspot in the field of artificial intelligence and pattern recognition. Common algorithms include support vector machine (SVM) algorithm, random forest (RF) algorithm, k-nearest neighbor (KNN) algorithm, Naïve Bayes (NB) algorithm, and deep learning (DP). It has been widely used in graphics recognition and other fields. It plays an important role in the rapid and efficient resolution of complex problems.

Support Vector Machine
Hoang, ND found that SVM is superior to RF and ANN machine learning algorithms in the research of asphalt pavement crack classification [23]. Wang found that the HOG + SVM (Histogram of Oriented Gradients + Support Vector Machine) method can efficiently count oil palm trees from The leave-one-out cross-validation is used to divide a large dataset into k small datasets. Then, k-1 is used as the training set, and the other is used as the test set. Then, select the next one as the test set, and the remaining k-1 as the training set. By analogy, the k classification accuracy is obtained and the average value is taken as the final classification accuracy of the dataset.

Machine Learning Methods
Machine learning is a common research hotspot in the field of artificial intelligence and pattern recognition. Common algorithms include support vector machine (SVM) algorithm, random forest (RF) algorithm, k-nearest neighbor (KNN) algorithm, Naïve Bayes (NB) algorithm, and deep learning (DP). It has been widely used in graphics recognition and other fields. It plays an important role in the rapid and efficient resolution of complex problems.

Support Vector Machine
Hoang, ND found that SVM is superior to RF and ANN machine learning algorithms in the research of asphalt pavement crack classification [23]. Wang found that the HOG + SVM (Histogram of Oriented Gradients + Support Vector Machine) method can efficiently count oil palm trees from UAV images [24]. The kernel function [25] changes in the form and parameters implicitly change the Remote Sens. 2020, 12, 1571 7 of 20 mapping from the input space to the feature space and then affect the characteristics of the feature space, ultimately changing the performance of various kernel function methods.
C-SVM is a support vector machine algorithm with parameter C as penalty function. It is a two-category classification model [26]. It is defined as the linear classifier with the largest interval in feature space. The learning strategy is margin maximization. This translates into a solution to a convex quadratic programming problem. For linearly separable cases, the C-SVM problem can be transformed into the following quadratic programming problem, where C is a penalty parameter. The larger C is, the more the SVM punishes the incorrect classification. C is the only adjustable parameter in the C-SVM. ξ i represents a relaxation variable. l represents the amount of variable. w represents the normal vector of the classification hyperplane in the high-dimensional space and b is the constant term. x i represents the training set.
There are two contradictory goals in C-SVM, namely, maximum margin and minimum training error, and C plays a role in regulating these two goals. The selection of parameter C is difficult. Based on C-SVM, V-SVM is proposed. V-SVM is a support vector machine algorithm with parameter V as penalty function to replace C [27]. In the case of linear separability, the V-SVM model is as follows, where l is the number of training sample points. The parameter v can be used to control the number and error of support vectors. In addition, it is also easier to choose. Parameter ρ represents two types of points (classes −1 and +1) that are separated by an interval of 2ρ w .

Random Forest
Random forest (RF) is a classifier containing multiple decision trees in machine learning. Its output category is determined by the mode of the categories output by individual trees [28]. Quanlong F. has achieved good results using the UAV images for Urban Vegetation Mapping through random forest [29]. Su J. has achieved good monitoring results by random forest algorithm to monitor Wheat yellow rust from multispectral UAV aerial imagery [30].
Random forest [31] uses bootstrap resampling technology to randomly extract k samples from the original training sample set N to generate a new training sample set and then generates k based on the self-service sample set. Each classification tree forms a random forest, and the classification results of the new data are determined by how many points the classification tree votes. In essence it is an improvement on the decision tree algorithm. Multiple decision trees are merged together. The establishment of each tree depends on an independent sample. Each tree in the forest has the same distribution. The classification error depends on the classification ability of each tree and the correlation between them.
Random forest has three main hyperparameter adjustments: node size, number of trees, and number of predictor samples. A reasonable selection of the number of trees can effectively improve the accuracy of classification.

K-Nearest Neighbors
The core idea of k-nearest neighbor is that if most of the k-nearest-neighbors of a sample in the feature space belong to a certain category, then the sample also belongs to this category and has the characteristics of this category [32]. In determining the classification decision, the method only determines the category to which the sample to be classified belongs according to the category of the nearest sample or samples. The KNN method is only related to a very small number of adjacent samples when making category decisions. The KNN method mainly depends on the limited neighboring samples, rather than the method of discriminating the class domain. The KNN method is more suitable than other methods for the set of sample domains that have many intersections or overlaps [33]. Liu K. has achieved good results that estimating forest structural attributes using UAV-LiDAR data by K-Nearest Neighbors [34]. The three elements of the KNN algorithm are distance measurement, k-value selection, and classification decision rules.

Dimensionality Reduction Method
Dimensionality is the number of feature vectors in the image. More than three feature vectors perpendicular to each other represents a high-dimensional space that cannot be visualized. When the dimension is higher, the amount of information contained is larger, and the classification difficulty of the machine learning algorithm is also greater. When the dimension exceeds a certain value, the curse of dimensionality occurs [35]. At this time, dimensionality reduction is needed to achieve the best classification effect.

F-Score Feature Selection
Feature selection selects the features that are most effective for classification and recognition from among many features to achieve compression of the feature space dimension. The F-score is a method for measuring the ability of features to be distinguished between two categories [36]. This method can achieve the most effective feature selection, and the detailed description is as follows.
The training sample set xk ∈ Rm, k = 1, 2, . . . , n where n+ and n− are the number of positive and negative samples, respectively. The F-score of the i-th feature of the training sample is defined as ki is the k-th negative sample point.

Principal Component Analysis
Principal component analysis (PCA) is a dimensionality reduction method often used in image processing [37]. The steps are as follows.
First, input the sample set D = {x1, x2, . . . , xm} and map it to the low-dimensional (k-dimensional) space dimension. Second, transform the samples in X to the standard normal distribution N~(0,1). Third, find the covariance matrix X T X∈R m * m and solve the eigenvalues and eigenvectors of the covariance matrix, X T X = V * * V −1 . Fourth, find the maximum k eigenvalues and the corresponding eigenvectors and record them as (ω 1 , ω 2 , ω 3 . . . ω k ) and output them as W = {ω 1 , ω 2 , ω 3 . . . ω k }.

Image Processing Technology
Image enhancement is a common image processing method. It can emphasize the local features of an image [23]. According to the difference in the nature of the enhanced image, it can be divided into two types: one is image gray enhancement, the image is a grayscale image after enhancement, and the other is image color enhancement, the image is a color image after enhancement. This article selects two representative image enhancement methods for research, the min-max gray level discrimination and laplace sharpening. Hoang [38] proposed a method for min-max gray level discrimination for image gray enhancement, hereinafter referred to as M2GLD. Let I 0 (m,n) be the gray value of a pixel at the coordinate (m,n), and I 0 (m,n) is transformed using the following formulas, where I A (m,n) represents the adjusted gray intensity of the pixel at position (m,n). R A denotes the adjusting ratio. I 0max and I 0min represent the maximum and minimum values of the gray intensity of the original image, respectively. τ is a margin parameter. The M2GLD method aims at discriminating the gray intensity of potential crack and noncrack pixels. Hence, after being processed, the crack pixels become darker and the noncrack pixels tend to be lighter. In this article, referring to Nhat-Duc Hoang [38], the two parameters R A and τ are selected as 1.5 and 0.1.

Laplace Sharpening
The Laplace operator [39] is an edge detection operator. The effect of this operator on f (x,y) is From the sharpening formula of the one-dimensional signal, the sharpening formula of the two-dimensional digital image is In digital image processing, ∂y 2 can be expressed as a differential equation as By adding Equations (10) and (11) into g (m, n), the Laplacian sharpening expression is where α is the sharpening intensity coefficient. The larger the α, the stronger the sharpening degree, and the larger the "overshoot" corresponding to the figure. Table 3 shows the classification results of UAV image surface cracks by SVM with two kernel functions: C-SVM and V-SVM. The classification accuracy and AUC values are used as evaluation indicators. Table 3 shows that the classification accuracy and AUC value of V-SVM are higher than those of C-SVM, which has a better classification effect.

Tree Number Selection of RF
The number of trees is an important parameter of the RF. This article uses 100, 200, 300, 400, and 500 as the quantitative parameters of the tree. Table 4 shows the classification results of UAV image surface cracks by RF. The classification accuracy and AUC values are used as evaluation indicators. From Table 4, we find that when the number of RF trees is between 100 and 500, the classification accuracy of the RF has little difference, and it is relatively optimal when the number of RF trees selected is 300 for a comprehensive comparison.

K-Value Selection of KNN
K-value selection is one of the elements of the KNN. Cross-validation is usually used to select a suitable k value. This article uses 3, 6, 9, 12, and 15 as k-value parameters for research. Table 5 shows the classification results of UAV image surface cracks by KNN. The classification accuracy and AUC values are used as evaluation indicators. From Table 5, we find that when the k value of the KNN is between 3 and 15, the classification accuracy of the KNN is not significantly different, and it is relatively optimal when the k value of the KNN is 9 for a comprehensive comparison.

Optimization of SVM & RF & KNN
The methods with the best parameters in SVM, RF, and KNN are selected to compare the machine learning algorithms. SVM selects V-SVM, the number of RF tree selections is 300, and the K value of KNN selection is 9. The comparison of the prediction results of the three machine learning algorithms of SVM, RF, and KNN is shown in Table 6 and Figure 7. V-SVM has the best classification accuracy. The methods with the best parameters in SVM, RF, and KNN are selected to compare the machine learning algorithms. SVM selects V-SVM, the number of RF tree selections is 300, and the K value of KNN selection is 9. The comparison of the prediction results of the three machine learning algorithms of SVM, RF, and KNN is shown in Table 6 and Figure 7. V-SVM has the best classification accuracy.

Comparison of Dimensionality Reduction Methods
Dimensionality is the number of feature vectors in the image. More than three feature vectors are perpendicular to each other. It is a high-dimensional space that cannot be visualized. When the dimension is higher, the amount of information contained is larger, and the classification difficulty of the machine learning algorithm is also greater. When the dimension exceeds a certain value, the phenomenon of the dimension disaster (curse of dimensionality) will occur [35]. At this time, dimensionality reduction is needed to achieve the best classification effect.

F-Score Feature Selection
The parameter selection of the F-score is 0.1:0.1:1, which means that according to the weight/F value, the first 0.1 features are selected first, and then 0.1 is used as the step size, and the features are selected gradually to 1 (100%). V-SVM is used as the machine learning algorithm. Four datasets, D2,

Comparison of Dimensionality Reduction Methods
Dimensionality is the number of feature vectors in the image. More than three feature vectors are perpendicular to each other. It is a high-dimensional space that cannot be visualized. When the dimension is higher, the amount of information contained is larger, and the classification difficulty of the machine learning algorithm is also greater. When the dimension exceeds a certain value, the phenomenon of the dimension disaster (curse of dimensionality) will occur [35]. At this time, dimensionality reduction is needed to achieve the best classification effect.

F-Score Feature Selection
The parameter selection of the F-score is 0.1:0.1:1, which means that according to the weight/F value, the first 0.1 features are selected first, and then 0.1 is used as the step size, and the features are selected gradually to 1 (100%). V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification results are shown in Figure 8, where the left figure is a schematic diagram of the change in accuracy in the case of different feature selections. The figure on the right is the ROC curve and its AUC value. As seen in Figure 8, when full feature selection is performed on the image, the classification accuracy and AUC value are the largest, which has a better classification effect.
Remote Sens. 2020, 12, x FOR PEER REVIEW 12 of 20 D3, D4, and D5, are used as the research objects. The classification results are shown in Figure 8, where the left figure is a schematic diagram of the change in accuracy in the case of different feature selections. The figure on the right is the ROC curve and its AUC value. As seen in Figure 8, when full feature selection is performed on the image, the classification accuracy and AUC value are the largest, which has a better classification effect.

Principal Component Analysis
Principal component analysis (PCA) is a commonly used method of dimensionality reduction. This research compares the classification results with 95% dimensionality reduction and no dimensionality reduction. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 7 shows the results of the surface crack classification in UAV images using PCA to reduce the dimensions to 95% of the original variance and no dimensionality reduction. It can be seen from Table 7 that when PCA is used to reduce the dimensions to 95% of the

Principal Component Analysis
Principal component analysis (PCA) is a commonly used method of dimensionality reduction. This research compares the classification results with 95% dimensionality reduction and no dimensionality reduction. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 7 shows the results of the surface crack classification in UAV images using Remote Sens. 2020, 12, 1571 13 of 20 PCA to reduce the dimensions to 95% of the original variance and no dimensionality reduction. It can be seen from Table 7 that when PCA is used to reduce the dimensions to 95% of the original variance, its classification accuracy and AUC value are greater, which has a better classification effect.

The Min-Max Gray Level Discrimination (M2GLD)
M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect. original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect.

The Min-Max Gray Level Discrimination (M2GLD)
M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect.

The Min-Max Gray Level Discrimination (M2GLD)
M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect.

The Min-Max Gray Level Discrimination (M2GLD)
M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect.

The Min-Max Gray Level Discrimination (M2GLD)
M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect.

The Min-Max Gray Level Discrimination (M2GLD)
M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.  original variance, its classification accuracy and AUC value are greater, which has a better classification effect. M2GLD is an image gray enhancement image processing technology. Table 8 is a schematic diagram of the effect of the image after M2GLD and the original image, where A represents the original image and B represents the image after M2GLD. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 9 shows the results of the surface crack classification in UAV images using M2GLD and No-M2GLD. It can be seen from Table 9 that when M2GLD is used to enhance the grayscale of an image, its classification accuracy and AUC value are smaller, which has a worse classification effect.

Laplace Sharpening
Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects.
The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect. Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The Laplace sharpening is an image color enhancement image processing technology. Table 10 is a schematic diagram of the effect of the image after Laplace sharpening and the original image, where A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The

No crack
A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The A represents the original image and B represents the image after Laplace sharpening. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The classification accuracy and AUC values are used as evaluation indicators. Table 11 shows the results of the surface crack classification in UAV images using Laplace sharpening and no Laplace sharpening. It can be seen from Table 11 that when Laplace sharpening is used to enhance the color of an image, its classification accuracy and AUC value are greater, which has a better classification effect.

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The

Comparison of Cluster Analysis Results
In view of the fact that background information will interfere with the classification results of machine learning algorithms, based on the idea of cluster analysis and characteristics of the background information of the UAV image, this article divides the UAV image data into four types of datasets, namely, bright ground, dark ground, withered vegetation, and green vegetation. V-SVM is used as the machine learning algorithm. Four datasets, D2, D3, D4, and D5, are used as the research objects. The images are processed by Laplace sharpening image color enhancement processing. PCA is used to reduce the dimensions to 95% of the original variance for all features. The classification accuracy and AUC values are used as evaluation indicators. The final classification results are shown in Table 12. Statistically significant p-values are obtained by permutation tests. The permutation test is a method proposed by Fisher that is computationally intensive, uses a random arrangement of sample data to perform statistical inference, and is widely used in the field of machine learning. The specific use is similar to bootstrap methods by sequentially replacing samples, recalculating statistical tests, constructing empirical distributions, and then inferring the p-value based on this. Assume N replacement tests are performed, and the classification accuracy rate obtained by n replacement tests is higher than the true accuracy rate, then p-value = n/N. When the classification accuracy rate obtained without the replacement test is higher than the true accuracy rate, it is usually recorded as p-value <1/N. The smaller the p-value is, the more significant the difference. In this article, N = 1000 permutation tests are performed on the four datasets. As shown in Figure 9, the blue histogram represents the statistical distribution of classification accuracy for 1000 replacement tests, and the red straight line represents the true accuracy.
that the classification effect of machine learning has strong statistical significance. The classification accuracy of bright ground reaches 89.70%, dark ground reaches 88.35%, winter vegetation reaches 88.65%, green vegetation reaches 93.75%, and the overall classification accuracy reaches 88.99%.  The method proposed in this article is used to identify whether the UAV image has cracks. The edge segmentation method is used to extract cracks from the UAV image with cracks, and the results are processed by the opening operation. The processing becomes a white background from the UAV image with no cracks. Finally, each processed image is joined according to the serial number of the As shown in Table 12 and Figure 9, the p-values of the four datasets are <0.001. This indicates that the classification effect of machine learning has strong statistical significance. The classification accuracy of bright ground reaches 89.70%, dark ground reaches 88.35%, winter vegetation reaches 88.65%, green vegetation reaches 93.75%, and the overall classification accuracy reaches 88.99%.
The method proposed in this article is used to identify whether the UAV image has cracks. The edge segmentation method is used to extract cracks from the UAV image with cracks, and the results are processed by the opening operation. The processing becomes a white background from the UAV image with no cracks. Finally, each processed image is joined according to the serial number of the image cutting. The final schematic diagram of the crack extraction effect in the UAV image is shown in Figure 10, where Figure 10a represents the original UAV image, Figure 10b represents the schematic diagram of the crack extraction effect by edge segmentation for the original UAV image, Figure 10c represents the schematic diagram of the crack extraction effect by edge segmentation for the UAV image after the method mentioned in this article, and Figure 10d represents the comparison of the final schematic diagram of the crack extraction effect in the UAV image and original UAV image. Figure 10 shows that the effect in Figure 10c is better than the effect in Figure 10b, and the method proposed in this article can extract crack information from UAV images well. schematic diagram of the crack extraction effect by edge segmentation for the original UAV image, Figure 10c represents the schematic diagram of the crack extraction effect by edge segmentation for the UAV image after the method mentioned in this article, and Figure 10d represents the comparison of the final schematic diagram of the crack extraction effect in the UAV image and original UAV image. Figure 10 shows that the effect in Figure 10c is better than the effect in Figure 10b, and the method proposed in this article can extract crack information from UAV images well.

Machine Learning Methods
In the SVM machine learning algorithm, when the parameter penalty function C of C-SVM tends to infinity, it means that samples with classification errors are not allowed to exist, and it easily causes hard-margin SVM overfitting. When C approaches 0, it means no longer focus on whether the classification is correct; only the interval needs to be maximized, which easily leads to underfitting of the algorithm. V-SVM uses a new parameter V instead of C, which can be used to control the number and error of support vectors and is relatively easy to choose. Therefore, better classification results are achieved.
In the RF machine learning algorithm, the classification effect improves correspondingly with the increase in the tree within a certain range. When it reaches a certain number, it will be balanced. However, when the number of trees is too large, it will cause overfitting. Instead, the classification

Machine Learning Methods
In the SVM machine learning algorithm, when the parameter penalty function C of C-SVM tends to infinity, it means that samples with classification errors are not allowed to exist, and it easily causes hard-margin SVM overfitting. When C approaches 0, it means no longer focus on whether the classification is correct; only the interval needs to be maximized, which easily leads to underfitting of the algorithm. V-SVM uses a new parameter V instead of C, which can be used to control the number and error of support vectors and is relatively easy to choose. Therefore, better classification results are achieved.
In the RF machine learning algorithm, the classification effect improves correspondingly with the increase in the tree within a certain range. When it reaches a certain number, it will be balanced. However, when the number of trees is too large, it will cause overfitting. Instead, the classification accuracy of the random forest is reduced. Therefore, the final selection number of trees is 300 through comparative analysis.
In the KNN machine learning algorithm, the classification effect improves as the k value decreases in a certain range. However, when the k value is too low, it will cause overfitting. Therefore, this article finally chooses the k value of 9 through comparative analysis.
Among the three machine learning algorithms for surface crack extraction in mining areas, SVM is superior to RF and KNN. It has the best classification effect, which is consistent with the conclusions of Hoang. in research on road crack extraction [23].

Dimensionality Reduction Method
In the F-score feature selection, as the UAV images have been cut, the information contained in each image has been sufficiently simplified. Therefore, full feature selection has been adopted to achieve better classification results. When PCA is used to reduce the dimensions to 95% of the original variance, its classification accuracy and AUC value are greater, which has a better classification effect. This is consistent with the conclusions of Chen W. in research on face detection and recognition using PCA for dimensionality reduction has a better detection success rate [40].

Image Processing Technology
Before performing machine learning, images are usually preprocessed first, among which image enhancement methods are widely used. In this article, the min-max gray level discrimination (M2GLD) and Laplace sharpening image enhancement methods are selected for research. M2GLD is an image gray enhancement method that has no effect on the classification accuracy of surface cracks in mining areas. This may be because when the color image is converted into a gray image to enhance the crack information, the background interference information is also enhanced, resulting in worse classification results. Laplace sharpening is an image color enhancement method that can effectively enhance the crack information of the land in the mining area and has achieved a good classification effect. Jijun W. uses the four-neighbor laplace sharpening to enhance image detail information and obtain more effective results [41]. This is consistent with our results using laplace sharpening to enhance the image crack information and obtain higher classification accuracy.

Cluster Analysis Results
UAV remote sensing technology plays an important role in land reclamation in mining areas and has the characteristics of low cost and high efficiency. Machine learning has been widely used in the field of pattern recognition. This article proposes a new identification method for surface cracks from UAV images based on machine learning in coal mining areas. This method first cuts the drone images to simplify the surface information contained in each image and then uses the idea of cluster analysis to differentiate the background information of the images. Clustering is performed twice so that images with similar background information are combined to build datasets: bright ground, dark ground, withered vegetation, and green vegetation. Therefore, this method can effectively reduce the interference of background information on the classification results. The overall accuracy is improved to 88.99%.

Conclusions
This article proposes a new identification method for surface cracks from UAV images based on machine learning in coal mining areas. The cluster analysis is used to construct different datasets based on the background information of the image. Through dimensionality reduction methods and image processing technologies, three types of SVM, RF, and KNN are compared. The following four conclusions are made.

1.
In the surface crack recognition of UAV images, the accuracy of SVM is better than the RF and KNN.

2.
Image color enhancement can improve the accuracy of machine learning, but image gray enhancement cannot. 3.
Reasonable use of dimensionality reduction methods can improve the accuracy of machine learning 4.
By using the V-SVM machine learning algorithm, PCA to reduce the full features to 95% of the original variance, and image color enhancement by Laplace sharpening, the overall accuracy could reach 88.99%.
The method provided in this article could effectively identify and extract ground cracks. It would provide data support for further research on the crack information such as the length, width, direction, and crack rate of the surface cracks and the development regularity.