3D Mineral Prospectivity Mapping of Zaozigou Gold Deposit, West Qinling, China: Machine Learning-Based Mineral Prediction

: This paper focuses on researching the scientiﬁc problem of deep extraction and inference of favorable geological and geochemical information about mineralization at depth, based on which a deep mineral resources prediction model is established and machine learning approaches are used to carry out deep quantitative mineral resources prediction. The main contents include: (i) discussing the method of 3D geochemical anomaly extraction under the multi-fractal content-volume (C-V) models, extracting the 12 element anomalies and constructing a 3D geochemical anomaly data volume model for laying the data foundation for researching geochemical element distribution and association; (ii) extracting the element association characteristics of primary geochemical halos and inferring deep metallogenic factors based on compositional data analysis (CoDA), including quantitatively extracting the geochemical element associations corresponding to ore-bearing structures (Sb-Hg) based on a data-driven CoDA framework, quantitatively identifying the front halo element association (As-Sb-Hg), near-ore halo element association (Au-Ag-Cu-Pb-Zn) and tail halo element association (W-Mo-Co-Bi), which provide quantitative indicators for the primary haloes’ structural analysis at depth; (iii) establishing a deep geological and geochemical mineral resources prediction model, which is constructed by ﬁve quantitative mineralization indicators as input variables: fracture buffer zone, element association (Sb-Hg) of ore-bearing structures, metallogenic element Au anomaly, near-ore halo element association Au-Ag-Cu-Pb-Zn and the ratio of front halo to tail halo (As-Sb-Hg)/(W-Mo-Bi); and (iv) three-dimensional MPM based on the maximum entropy model (MaxEnt) and Gaussian mixture model (GMM), and delineating exploration targets at depth. The results show that the C-V model can identify the geological element distribution and the CoDA method can extract geochemical element associations in 3D space reliably, and the machine learning methods of MaxEnt and GMM have high performance in 3D MPM.


Introduction
The main task of quantitative mineral prediction is to conduct comprehensive analysis of geological, geophysical, geochemical and remote sensing data and drilling engineering data in the study area based on the research of the geological background and metallogenic regularity, and then construct mathematical models to effectively extract and identify favorable information on mineralization, carry out data fusion of quantitative mineralization information, construct mineral prediction models, perform potential mineral resources quantitative assessment and exploration targets delineation [1][2][3].
Quantitative mineral resource prediction is developing toward deep 3D quantitative prediction. The framework of 3D quantitative mineral prediction has been basically completed, among which the representative works include Xiao [4][5][6][7][8][9][10], who studied the characteristics of large-scale three-dimensional mineralization prediction and summarized the combined prediction process of mine concession structure, mineral exploration model, metallogenic series and quantitative analysis methods, which laid the foundation for three-dimensional mineral resource prediction; Chen [11][12][13][14] proposed the "cubic prediction model" for mineralization prediction by 3D-comprehensive 3D modeling of geology, geophysics, geochemistry and remote sensing [15][16][17]; Mao [18][19][20][21][22] proposed a three-dimensional prediction process for deep mineral resources prediction, the threedimensional morphological analysis of geological bodies, geological anomaly extraction and three-dimensional quantitative prediction methods [23]; Yuan [24,25] proposed a "four-step" three-dimensional mineralization prediction method, which summarizes the three-dimensional mineralization prediction work into four key steps: three-dimensional geological database construction, three-dimensional geological modeling and mapping, three-dimensional prediction information deep mining and three-dimensional geosciences data fusion and prediction.
In this study, three-dimensional quantitative mineral resources prediction is executed from a geological and geochemical perspective. It is a common understanding in the last century that the distribution of rock geochemical elements follows normal or lognormal distribution, based on which classical statistical theories and methods have been widely applied in the field of geochemical data analysis [26]. Since the 1990s, fractal and multifractal methods have been developed rapidly, with representative achievement from Cheng's team, who proposed the content-area (C-A) model [26,27], the spectrum-area (S-A) model [28] and Local Singularity Analysis (LSA) [26,[29][30][31][32], and it is believed that the multifractal distribution has the ability to simultaneously portray the normal distribution, lognormal distribution, Zipf's law, Pareto's law, etc., and can be used as the basic law of geochemical element distribution [26,29]. Based on the theoretical research of fractal and multifractal methods, its application in geochemical spatial distribution pattern research is blossoming [10,[33][34][35][36][37]. The multifractal-related methods have achieved many successful cases in regional geochemical anomaly extraction [32,[38][39][40][41][42][43][44][45]. With the improvement of 3D software and hardware performance, as well as the development of 3D modeling technology, fractal models used in 3D data have made some progress [34,46,47]. Three-dimensional primary halo geochemical survey is an effective tool for deep mineral resources prediction. In fact, the spatial zonation pattern of primary haloes is the pattern of element clustering regularity in 3D space, and the anomalous structure of elements is an intuitive indicator of deep mineral prediction. Especially, Cheng [27] proposed that the distribution of geochemical elements in either horizontal or vertical directions is consistent with the fractal distribution, based on which Afzal [34] proposed the C-V model, which is a powerful tool to delineate the nonlinear characteristics of element distribution in a 3D space.
Geochemical data are typically compositional data, which gives rise to the problem of "closure effects" due to the lack of scale consistency in the covariance matrix of the compositional data [48][49][50], and, in practice, classical geostatistical methods that ignore the compositional properties of geochemical data often yield poor results [51][52][53][54][55][56][57][58][59][60]. Since the introduction of Aitchison geometry and the application of additive log-ratio transformation (alr) [61], the theory and methods of log-ratio transformation of compositional data have been gradually improved, making the application of classical statistical methods more reasonable [33,35]. The geochemical elemental association extraction method based on Sequential Binary Partition (SBP) [50] can design elemental associations based on the geological background and mineralization knowledge, which is easy to interpret, and provides solutions for geochemical inference of lithology, tectonics, alteration, mineralization, etc. Therefore, the CoDA method provides new ideas for analyzing the relationship of geochemical data and extracts geochemical associations for evaluating mineral resources rapidly [51,55,[62][63][64][65][66][67].
Geosciences is a data-intensive science, and geological survey and mineral exploration have accumulated a large amount of multi-source geosciences data, and the introduction of big data and its application of machine learning-related methods can effectively support mineral resources exploration [26,[68][69][70][71][72][73]. During the past decade or so, a large number of achievements related to machine learning have been published in geosciences [37,57,[74][75][76][77][78][79][80][81][82][83][84][85]. For example, based on metallogenic regularity, an intelligent geology and intelligent mine model is established on a big data platform combined with high-performance computers [70,86]; big data methods are used to realize the automatic collection of geological prospecting thematic data, machine learning-and deep learningrelated methods development for deep mining of geological data, intelligent prospecting, etc. [37,[87][88][89]; and three-dimensional prediction methods of hidden orebodies can be based on deep learning [17,72,[90][91][92][93][94][95][96][97][98]. Numerous studies have shown that big data and machine learning have good performance in mineralization prediction, and machine learning-related algorithm research has greatly enriched the approaches of geological data processing and analysis. Meanwhile, data cleaning, data screening and deep mining of geological big data can enrich the information sources for mineralization information inference and quantitative mineral prediction at depth.
This study focuses on the scientific problems of deep extraction and inference of favorable geological and geochemical information about mineralization at depth and quantitative mineralization prediction at large depth. Specifically, the C-V model will be used to extract the spatial distribution pattern of primary halo geochemical element anomalies, and the CoDA will be employed to identify and infer the element associations for tracing the extension of deep ore-controlling information. Moreover, the machine learning methods of the MaxEnt model and GMM will be carried out for mineral resources prediction at the large depth of the Zaozigou gold deposit.

Geological Setting
West Qinling is located at the western part of the Qinling orogenic belt, with the Qilian orogenic belt to the north, the Qaidam block to the west and the songpan block to the south ( Figure 1a) [99][100][101][102].
The geotectonic position of the Xiahe-Hezuo area is located at the northwestern part of the West Qinling orogenic belt and at the western extension of the Qinling-Qilian-Kunlun central orogenic belt, whose complex geological structure creates a superior mineralization environment [103].
The Zaozigou gold deposit is located within the West Qinling fold belt and is a typical epithermal-type gold deposit in the Xiahe-Hezuo area ( Figure 1a). The main controlling factors for mineral resources formation within the region are tectonic movement and magmatism [104], with the regional tectonics spreading in a NW direction with developing folds and fractures. Complex geological structure and magmatism are dominated by Yanshan period intermediate-acid intrusive rocks, which are widely distributed in the form of the batholith, stock, apophysis and veins [104] (Figure 1b).
The Triassic strata are the main stratigraphy for gold deposits. The genesis and spatialtemporal evolution of the intermediate-acidic dike is closely related to gold mineralization in the area, and during the mineralization process, the magmatic rocks not only provide the mineralized material, but also their internal environment is very suitable as an orebearing space, which can be regarded as a significant mineralization indicator for gold mineralization [105].
The spatial distribution of mineral deposits is directly controlled by the geotectonic position in the Xiahe-Hezuo area, which plays a major role in the formation of different types of gold deposits and is the boundary of the belt from a spatial perspective. The most important types of mineral-controlling structures are fractures and folds in this area [106,107]. The main orebodies of the Zaozigou deposit are produced in NE, NW  The Triassic strata are the main stratigraphy for gold deposits. The genesis and spatial-temporal evolution of the intermediate-acidic dike is closely related to gold mineralization in the area, and during the mineralization process, the magmatic rocks not only provide the mineralized material, but also their internal environment is very suitable as an ore-bearing space, which can be regarded as a significant mineralization indicator for gold mineralization [105].
The spatial distribution of mineral deposits is directly controlled by the geotectonic position in the Xiahe-Hezuo area, which plays a major role in the formation of different types of gold deposits and is the boundary of the belt from a spatial perspective. The most  The Zaozigou gold deposit is a typical representative of gold deposits associated with intermediate-acid dike rocks in the south of the Xiahe-Hezuo fracture. It is located approximately 9 km southwest of Hezuo city, Gansu Province, with convenient access to the mine site ( Figure 2a). The main ore-bearing position is between Gully 1 and Gully 4, with a total area of approximately 2.6 km 2 (Figure 2b). There are only Triassic and Quaternary strata exposed in the Zaozigou gold deposit ( Figure 2b). The strata of the area are mainly the lower part of the Gulangdi group (T2g 1 ) of the Middle Triassic formation, and the Quaternary (Qh alp ) is developed in the intermountain valley. The lower part of the Gulangdi group (T2g 1 ) is the main ore-bearing rock, which is a set of fine clastic rocks consisting of siliceous slate, clastic feldspathic fine sandstone with interbedded siltstone slate and argillaceous slate. The formation is composed of a large sedimentary cycle from bottom to top consisting of siltstone → argillaceous slate → calcareous slate [108,114].
The fractures are well developed and have a complex morphology, forming an intersecting trend, indicating that the area has experienced multiple phases of geological activity. Fractures strictly control the distribution of orebodies and vein rocks within the deposit. They can be classified into five groups of orientations, namely NW, N-S, NE, E-W and NNE [115,116]. Fracture structures are the structural surfaces of mineralization and these structural surfaces control the spreading characteristics of the orebodies [117].
Intermediate-acid dike, including fine crystalline diorite, diorite porphyrite, biotite dioritic porphyrite and quartz diorite porphyrite densely produced with a porphyritic structure, spreads in NNE direction, turning to a nearly N-S direction in the western part of the deposit and a few NW directions. Influenced by the regional multi-period deep fracture activity, the multi-period magmatism has overlapped on multi-phases mineralization within the deposit [118].
There are 147 gold orebodies that have been found in the Zaozigou gold deposit, of There are only Triassic and Quaternary strata exposed in the Zaozigou gold deposit ( Figure 2b). The strata of the area are mainly the lower part of the Gulangdi group (T 2 g 1 ) of the Middle Triassic formation, and the Quaternary (Q h alp ) is developed in the intermountain valley. The lower part of the Gulangdi group (T 2 g 1 ) is the main ore-bearing rock, which is a set of fine clastic rocks consisting of siliceous slate, clastic feldspathic fine sandstone with interbedded siltstone slate and argillaceous slate. The formation is composed of a large sedimentary cycle from bottom to top consisting of siltstone → argillaceous slate → calcareous slate [108,114].
The fractures are well developed and have a complex morphology, forming an intersecting trend, indicating that the area has experienced multiple phases of geological activity. Fractures strictly control the distribution of orebodies and vein rocks within the deposit. They can be classified into five groups of orientations, namely NW, N-S, NE, E-W and NNE [115,116]. Fracture structures are the structural surfaces of mineralization and these structural surfaces control the spreading characteristics of the orebodies [117].
Intermediate-acid dike, including fine crystalline diorite, diorite porphyrite, biotite dioritic porphyrite and quartz diorite porphyrite densely produced with a porphyritic structure, spreads in NNE direction, turning to a nearly N-S direction in the western part of the deposit and a few NW directions. Influenced by the regional multi-period deep fracture activity, the multi-period magmatism has overlapped on multi-phases mineralization within the deposit [118].
There are 147 gold orebodies that have been found in the Zaozigou gold deposit, of which 17 orebodies are main orebodies with gold reserves greater than 1 tonne and the total gold reserves amount to more than 100 tonnes [113]. According to the spatial distribution and combination of the mineralization, the Zaozigou deposit can be divided into eastern and western ore groups.
The eastern ore group is mainly located between Gully 1 and Gully 3 with the strike of NE orientation, containing Au1 controlled by F24, Au9 controlled by F21 and Au15 controlled by F25. These orebodies extend over 1000m long and 300m wide, with a NW direction tendency and steep dip near the ground surface, locally nearly upright. In the deep, these orebodies have been staggered by a gently dipping fracture, causing the tendency to change to a SE orientation (Figure 2b). In addition, orebodies M4 and M6 are laying underground, with the strike of NWW orientation, the tendency of SW orientation and a dip of 8 •~2 6 • . These orebodies cross the NE-striking orebodies obliquely, staggering them, and their own mineralization behavior occurs simultaneously [119].
The western ore group is mainly distributed between Gully 3 and Gully 4, spreading in a nearly N-S direction, with Au29~Au31 as the main orebodies. The orebodies extend over 1000 m long and are wider than 500 m, with a strike of 350 •~1 0 • , varying tendency and dips greater than 75 • , locally subvertical. These orebodies extend, and the mineralization is weaker in the steeper parts and stronger in the shallower parts.

Datesets Description
Historical geological and geochemical data were completed, including geological reports, geological exploration maps, drills geochemical data, etc., from the Development Research Center of China Geological Survey and No. 3 Geological and Mineral Exploration team, Gansu Provincial Bureau of Geology and Mineral Exploration and Development; the coordinate system used in the mine-scale is Gaussian Kruger projection coordinates.
This study collects data from 72 drillings in the Zaozigou gold deposit and establishes a drilling location database, an assay database, an inclinometry database and a lithology database. The 3D model of drillings is constructed based on the drill hole data database ( Figure 3). The eastern ore group is mainly located between Gully 1 and Gully 3 with the strike of NE orientation, containing Au1 controlled by F24, Au9 controlled by F21 and Au15 controlled by F25. These orebodies extend over 1000m long and 300m wide, with a NW direction tendency and steep dip near the ground surface, locally nearly upright. In the deep, these orebodies have been staggered by a gently dipping fracture, causing the tendency to change to a SE orientation (Figure 2b). In addition, orebodies M4 and M6 are laying underground, with the strike of NWW orientation, the tendency of SW orientation and a dip of 8°~26°. These orebodies cross the NE-striking orebodies obliquely, staggering them, and their own mineralization behavior occurs simultaneously [119].
The western ore group is mainly distributed between Gully 3 and Gully 4, spreading in a nearly N-S direction, with Au29~Au31 as the main orebodies. The orebodies extend over 1000 m long and are wider than 500 m, with a strike of 350°~10°, varying tendency and dips greater than 75°, locally subvertical. These orebodies extend, and the mineralization is weaker in the steeper parts and stronger in the shallower parts.

Datesets Description
Historical geological and geochemical data were completed, including geological reports, geological exploration maps, drills geochemical data, etc., from the Development Research Center of China Geological Survey and No. 3 Geological and Mineral Exploration team, Gansu Provincial Bureau of Geology and Mineral Exploration and Development; the coordinate system used in the mine-scale is Gaussian Kruger projection coordinates.
This study collects data from 72 drillings in the Zaozigou gold deposit and establishes a drilling location database, an assay database, an inclinometry database and a lithology database. The 3D model of drillings is constructed based on the drill hole data database ( Figure 3). The primary geochemical halo data from the drillings are collected from the "Zaozigou gold successive resources exploration project in Hezuo city, Gansu Province", with a total of 72 drillings and 5028 samples with 12 elements of Ag, As, Au, Cu, Hg, Pb, Zn, Sb, W, Bi, Co and Mo (the element detection methods can be seen in the literature [113]). The The primary geochemical halo data from the drillings are collected from the "Zaozigou gold successive resources exploration project in Hezuo city, Gansu Province", with a total of 72 drillings and 5028 samples with 12 elements of Ag, As, Au, Cu, Hg, Pb, Zn, Sb, W, Bi, Co and Mo (the element detection methods can be seen in the literature [113]). The sampling method was the continuous picking method with sample intervals generally within 10m. Some orebodies or strongly altered areas were sampled at decreased intervals.

Concentration-Volume (C-V) Model
Geochemical anomalies are a concept relative to geochemical background, and the multifractal approach provides an effective tool to separate anomalies from the background. For hydrothermal mineralization processes, multi-phase mineralization is common, which will result in multi-phase superposed element spatial distribution [38,120,121]. The C-V model can process the nonlinear primary geochemical halo data with the following equation: where V(ρ < v) and V(ρ ≥ v) are the volumes of the element content less than v and greater than v; D 1 and D 2 are the fractal dimension values, also called the singularity index; k is a constant coefficient, which can be calculated by the least square method; v is the threshold value of element contents, and several element content intervals are divided by v.
The curves V(ρ < v) and V(ρ ≥ v) of the volumes corresponding to different v follow a power-law relationship. Taking the natural logarithm of both sides of the equation, the lg-lg plots have a linear relationship in different v intervals. The fractal dimensions in different v intervals can be calculated by the least square method. The geochemical anomalies and backgrounds can be extracted by different fractal dimensions [39].
In practice, the C-V model is used on the primary geochemical halo data volume model for anomaly identification, where the threshold of fractal dimensions may indicate the boundary between different mineralization.

Compositional Data Analysis
Geochemical data, as typical of compositional data, should be properly transformed before data analysis [122][123][124][125] to eliminate the effects of "closure effects". The "opened (transformed)" data can often be analyzed by classical statistical methods to obtain performance improvements [36,53,61,122].

Central Log-Ratio Transformation (clr)
The calculation of this method is: (i) calculate the geometric mean of all compositional subvectors; (ii) divide each subvector by the geometric mean separately; (iii) take the natural logarithm. Its calculation formula is shown as follows.

Sequential Binary Partition (SBP)
The common isometric log-ratio transformation (ilr) is difficult to interpret. Egozcue [50] proposed the sequential binary partition (SBP) technique based on the ilr transformation, which can provide geochemical interpretation reasonably [66,123,124].
The sequential binary partition technique performs non-overlapping dichotomous classification continuously by the relative information between variables. In practice, positive (+) and negative (−) signs are used to represent two different classifications of compositional variables, and '0' is used to represent the unconcerned variables in one time. By performing continuous non-overlapping dichotomy in a simplex space, a basis vector is formed and then transformed. The results, called compositional balance, can be geologically interpreted according to dichotomy clusters. Especially, the technique provides an important tool to identify element associations [66,123,124,126].
The relevant formula is [50]: where B i denotes the new compositional vector defined by the standard orthogonal basis, ∏ + x j is the product of the variables labeled (+) involved in the ith binary partition and ∏ − x k is the product of the variables labeled (−) involved in the ith binary partition.

Geochemical Compositional Data Analysis Framework Based on CoDA
The log-ratio transformation of geochemical data can solve the problem of the "closure effect" caused by the lack of scale consistency in the covariance matrix of the compositional data. The clr-biplot and compositional balance methods developed based on the clr and SBP have their advantages in the geochemical associations' extraction, especially in inferring lithology, faults, alteration and mineralization, and the corresponding frameworks have been well applied [65,66,124].
This study uses a data-driven and knowledge-driven framework of compositional data analysis to identify the geochemical associations of the primary halo. The data-driven framework infers the data characteristics by measuring elemental statistical correlations to gain the element associations, while the knowledge-driven framework is based on geological and geochemical understanding to design the element associations ( Figure 4).

Geochemical Compositional Data Analysis Framework Based o
The log-ratio transformation of geochemical data can solve the p sure effect" caused by the lack of scale consistency in the covariance m sitional data. The clr-biplot and compositional balance methods deve clr and SBP have their advantages in the geochemical associations' e in inferring lithology, faults, alteration and mineralization, and the co works have been well applied [65,66,124].
This study uses a data-driven and knowledge-driven framewo data analysis to identify the geochemical associations of the primary h framework infers the data characteristics by measuring elemental sta to gain the element associations, while the knowledge-driven framew ological and geochemical understanding to design the element associ In Figure 4, the closure and centering are necessary preprocess s be seen in the literature [51]. The data-driven framework in this stud mation to "open" geochemical data, and the factor analysis and corre used to explore the relationship among elements and extract element while, the knowledge-driven framework designs the element associa research of the geological features, such as the element concentration the primary geochemical halo associations, and so on. Then, the SBP is titatively extract the value of corresponding geochemical associations sults of the CoDA can be employed to infer metallogenic information i

Machine Learning-Based Quantitative Mineral Prediction Methods
Machine learning algorithms could highlight hidden details in d plicit search and have the ability to identify complex spatial pattern scholars have attempted to use machine learning algorithms to ex In Figure 4, the closure and centering are necessary preprocess steps, which can can be seen in the literature [51]. The data-driven framework in this study used clr transformation to "open" geochemical data, and the factor analysis and correlativity methods are used to explore the relationship among elements and extract element associations. Meanwhile, the knowledge-driven framework designs the element associations through deep research of the geological features, such as the element concentrations in different rocks, the primary geochemical halo associations, and so on. Then, the SBP is performed to quantitatively extract the value of corresponding geochemical associations. Eventually, the results of the CoDA can be employed to infer metallogenic information in an unknown area.

Machine Learning-Based Quantitative Mineral Prediction Methods
Machine learning algorithms could highlight hidden details in datasets without explicit search and have the ability to identify complex spatial patterns [127]. Given this, scholars have attempted to use machine learning algorithms to extract mineralization information by integrating multi-source geosciences data for identifying mineralization-related anomalies and their variation that cannot be detected by traditional methods [128][129][130][131][132][133][134][135]. Machine learning algorithms can be roughly categorized as supervised learning algorithms and unsupervised algorithms. For better validating the suitability of 3D MPM, this study discusses the application of the supervised algorithm MaxEnt model and unsupervised algorithm GMM.

MaxEnt Model
The principle of MaxEnt is a criterion of probabilistic model learning for making predictions based on incomplete information. The main idea is that, when predicting the probability distribution of a random event, all known constraints need to be satisfied without subjective assumptions so that the probability distribution is most uniform, the prediction risk is minimal and the entropy value of the probability distribution is maximum [136].
Let T = {(x 1 , y 1 ), (x 2 , y 2 ), . . . (x n , y n } be the training dataset and f i (x, y), i = 1, 2, . . . , n be the eigenfunction, and the learning process of the MaxEnt model is equivalent to the constrained optimization problem: is the expected value of n eigenfunctions f i (x, y) related to the P(Y|X ) with the empirical distribution P(X).
Following the custom of optimization problems, the problem of finding the maximum value is rewritten as the equivalent problem of finding the minimum value: The solution resulting from solving the constrained optimization problem is the solution learned by the MaxEnt model. However, the empirical distribution expectation is usually not equal to the true expectation but will be approximate to the true expectation. If the solution is solved strictly according to the above constraints, it is easy to cause overfitting of the training data during the learning process. Therefore, the constraints can be appropriately relaxed in practice, such as replacing the above equation with E P ( f i ) − E P ( f i ) ≤ β i , β i is the modulation multiplier, which is a constant.

Gaussian Mixture Model (GMM)
GMM is a quantified model which generated through Gaussian probability density function fitting. This model decomposes objective distribution into several Gaussian probability density functions. By using enough Gaussian functions and tuning the parameters, the model can generate a very complex probability density to approximate to almost any continuous probability. Theoretically, any objective distribution can be fitted by a combination of multiple Gaussian density functions, and the higher the number of probability density sub-functions, the more closely it approximates to the actual data distribution. GMM has the advantages of good flexibility, is not limited by the sample size and can accurately describe the data structure.
When the dataset X = (X 1 , X 2 , . . . , X T ) of the training data can be divided into k classes and each class obeys Gaussian distribution, the k order probability distribution of the GMM is where P(X|θ i ) is the probability density of the ith Gaussian model, w i is the weight of the ith model in the whole GMM, µ i is the mean vector and ∑ i is the covariance matrix.
After the parameter initialization, the Expectation-Maximization (EM) algorithm based on the maximum likelihood estimation is often chosen for the parameter estimation of the GMM [137].
In the geological point of view, the GMM is an unsupervised machine learning algorithm; it divides the data into two categories (mineralization and non-mineralization) and then makes category judgments on samples by learning information about unlabeled samples.
As for the dataset D = {x 1 , x 2 , . . . , x m }, which contains k Gaussian mixture components, the algorithm steps are as follows.

1.
Initializing the k multivariate Gaussian distributions and their weights.

2.
Estimating the posterior probability of each sample generated by each component according to Bayes' theorem.

3.
Updating the mean vector, covariance matrix and the weights according to the step 2.

4.
Repeating steps 2-3 until the increase in the likelihood function has been less than the convergence threshold, or the maximum number of iterations is reached.

5.
For each sample point, calculating its posterior probability of belonging to each cluster according to Bayes' theorem and classifying the sample into the cluster with the largest posterior probability.

Three-Dimensional Primary Geochemical Halo Anomaly Data Volume Modeling Based on the C-V Model
This section adopts the multifractal C-V model to analyze the spatial anomalous structure of elements and provides single-element anomaly signatures for the subsequent deep prediction.
Taking the ore-forming element Au as an example: Using the ordinary kriging interpolation method [138][139][140], with the experimental variogram fitted to build a 3D data volume model of Au (Figure 5), the Au content value observably does not obey the normal distribution (Figure 6a). Therefore, the multifractal C-V model was carried out to identify Au anomalies.
Through counting the variation of volume with Au content, the lgAu-lgV scatter diagram was generated and lg-lg lines were fitted by the least square method (Figure 3b). Meanwhile, the fractal dimension can be obtained as follows: The slopes of the lines correspond to fractal dimensions, and the inflection points are threshold values of geochemical abnormal intensity as shown in Table 1 and Figure 6b. Through counting the variation of volume with Au content, the lgAu-lgV scatter diagram was generated and lg-lg lines were fitted by the least square method (Figure 3b). Meanwhile, the fractal dimension can be obtained as follows: The slopes of the lines correspond to fractal dimensions, and the inflection points are threshold values of geochemical abnormal intensity as shown in Table 1 and Figure 6b.  Through counting the variation of volume with Au content, the lgAu-lgV scatter diagram was generated and lg-lg lines were fitted by the least square method (Figure 3b). Meanwhile, the fractal dimension can be obtained as follows:  Table 1 and Figure 6b.   The orebody is compared with the outside anomalies, central anomaly and internal anomaly area by superposition, as shown in Figure 7b,c, and the three show a good spatial correlation with the orebody. The orebody is compared with the outside anomalies, central anomaly and internal anomaly area by superposition, as shown in Figure 7b,c, and the three show a good spatial correlation with the orebody. The middle and inner anomalies are mainly distributed inside and around the orebody, which accurately reflects the spatial spreading of the orebody and its trend.
On this basis, this study analyzed and visualized the 3D anomalous structures of the remaining 11 elements using the above model ( Figure 8; Table 2).  The middle and inner anomalies are mainly distributed inside and around the orebody, which accurately reflects the spatial spreading of the orebody and its trend.
On this basis, this study analyzed and visualized the 3D anomalous structures of the remaining 11 elements using the above model ( Figure 8; Table 2).
The orebody is compared with the outside anomalies, central anomaly and internal anomaly area by superposition, as shown in Figure 7b,c, and the three show a good spatial correlation with the orebody. The middle and inner anomalies are mainly distributed inside and around the orebody, which accurately reflects the spatial spreading of the orebody and its trend.
On this basis, this study analyzed and visualized the 3D anomalous structures of the remaining 11 elements using the above model ( Figure 8; Table 2).    Figure 9 shows that the middle anomalies of As and Sb are mainly distributed near the elevation of 1700~1900 m. The middle anomalies of Ag, Cu, Pb and Zn have close relationship to the orebody. W, Mo, Co and Bi have two concentrations, the first one is located near the surface and the second one is distributed near the elevation of 2500 m. The Zaozigou gold deposit has multi-phase mineralization, forming a complicated spatial distribution of elements, while the C-V model can better identify the anomaly for recognizing the pattern of the primary geochemical halo in the Zaozigou gold deposit. relationship to the orebody. W, Mo, Co and Bi have two concentrations, the first one is located near the surface and the second one is distributed near the elevation of 2500 m. The Zaozigou gold deposit has multi-phase mineralization, forming a complicated spatial distribution of elements, while the C-V model can better identify the anomaly for recognizing the pattern of the primary geochemical halo in the Zaozigou gold deposit.

Correlation Analysis
The primary halos were processed by cluster analysis (Figure 10). The elements can be roughly divided into two groups: Au, As, Sb, Ag, W, Hg and Cu, Zn, Bi, Pb, Co, Mo. Among them, the elements most closely associated with Au are As, Sb and Ag; the Au, As, Sb and Cl are moderate volatile elements [141] often associated in gold mineralization like Hg (indicator of volcanism). Ag is often associated to gold as electrum. Hg should be the front halo indicating element of the gold orebody; the correlation coefficient between Au and As reaches 0.8, and most of the As exists within arsenopyrite, which is an important gold-bearing mineral. Therefore, this Au-As-Sb should be the element association of mineralization reflecting a middle-and low-temperature metallogenic environment.

Element Associations Identification Based on clr-Biplot
Geochemical data are typically compositional data, and if traditional multivariate statistical methods (e.g., principal component analysis, factor analysis, etc.) are applied directly to the raw geochemical data, it may lead to erroneous results. Therefore, the raw data should be properly transformed before data analysis is performed.

Correlation Analysis
The primary halos were processed by cluster analysis (Figure 10). The elements can be roughly divided into two groups: Au, As, Sb, Ag, W, Hg and Cu, Zn, Bi, Pb, Co, Mo. Among them, the elements most closely associated with Au are As, Sb and Ag; the Au, As, Sb and Cl are moderate volatile elements [141] often associated in gold mineralization like Hg (indicator of volcanism). Ag is often associated to gold as electrum. Hg should be the front halo indicating element of the gold orebody; the correlation coefficient between Au and As reaches 0.8, and most of the As exists within arsenopyrite, which is an important gold-bearing mineral. Therefore, this Au-As-Sb should be the element association of mineralization reflecting a middle-and low-temperature metallogenic environment.

Element Associations Identification Based on clr-Biplot
Geochemical data are typically compositional data, and if traditional multivariate statistical methods (e.g., principal component analysis, factor analysis, etc.) are applied directly to the raw geochemical data, it may lead to erroneous results. Therefore, the raw data should be properly transformed before data analysis is performed.
Data from 12 geochemical elements were clr-transformed, and the skewness values of the clr-transformed data were statistically obtained ( Figure 11). Compared with the raw data, clr-transformed data has the lower skewness value around zero, indicating that the data distribution after clr transformation tends to be more normal in character (Figures 11 and 12).
Data from 12 geochemical elements were clr-transformed, and the skewness values of the clr-transformed data were statistically obtained ( Figure 11). Compared with the raw data, clr-transformed data has the lower skewness value around zero, indicating that the data distribution after clr transformation tends to be more normal in character (Figures 11  and 12).   of the clr-transformed data were statistically obtained ( Figure 11). Compared with the raw data, clr-transformed data has the lower skewness value around zero, indicating that the data distribution after clr transformation tends to be more normal in character (Figures 11  and 12).   Data from 12 geochemical elements were clr-transformed, and the skewness values of the clr-transformed data were statistically obtained ( Figure 11). Compared with the raw data, clr-transformed data has the lower skewness value around zero, indicating that the data distribution after clr transformation tends to be more normal in character (Figures 11  and 12).   Factor analysis (FA) is used to extract element associations. Four factors are extracted as element associations to indicate different geological and geochemical meanings. From the view of the loadings of FA, factor F1 (34.95%, five variables) represents Cu, Pb, Zn, Ag and Bi, which are a group of medium-temperature elements; factor F2 (12.83%, three variables) is the element association of Au, As and Sb, representing the Au-polymetallic sulfide phase, which is the most dominant phase of gold mineralization; factor F3 (11.13%, one variable) indicates Mo, which is a high-temperature element and may be related to magmatism; factor F4 (9.26%, two variables) is Sb and Hg association, where Sb mainly exists within the form of stibnite within the quartz-stibnite veins and Hg is closely related to fractures (Figure 13). The distribution of each factor in the three-dimensional space is shown in Figure 14a. fide phase, which is the most dominant phase of gold mineralization; factor F3 (11.13%, one variable) indicates Mo, which is a high-temperature element and may be related to magmatism; factor F4 (9.26%, two variables) is Sb and Hg association, where Sb mainly exists within the form of stibnite within the quartz-stibnite veins and Hg is closely related to fractures (Figure 13). The distribution of each factor in the three-dimensional space is shown in Figure 14a.
To analyze and show the geological meaning of each factor more clearly, the 85# profile was selected for analysis by profile cutting (Figure 14). It can be intuitively understood that the F2 factor is closely related to mineralization, and its spatial distribution is well matched with the known orebodies in the exploration profile. The F4 factor is closely related to ore-bearing fractures, so that the Sb-Hg element association could be used as evidence of deep fracture extension.  To analyze and show the geological meaning of each factor more clearly, the 85# profile was selected for analysis by profile cutting (Figure 14). It can be intuitively understood that the F2 factor is closely related to mineralization, and its spatial distribution is well matched with the known orebodies in the exploration profile. The F4 factor is closely related to ore-bearing fractures, so that the Sb-Hg element association could be used as evidence of deep fracture extension.

Knowledge-Driven CoDA and Its Based Element Association Extraction
From the anomaly data volume model based on the C-V method in this paper, it is clear to recognize the distribution of each element in a three-dimensional space.

1.
In the mid-shallow part and the deeper part of elevation at 2500 m, As, Sb and Hg are concentrated as the front halo of the orebodies. The anomalies of Au, Ag, Cu, Pb and Zn linked to sulphurs (pyrite, arsenopyrite, galena and covelline) and are superposed with the orebodies and can be regarded as the near-ore halo elements. At the shallow position of 2500 m, the anomaly locations of W, Mo and Bi linked to magmatism are at the tail of the orebodies, which can be regarded as the tail halo element association.

2.
In terms of Au, its anomalies are controlled by fractures observably, and observable fractures certainly cross-cut orebodies. Especially, the abnormal intensity is larger along the depth indicating the deep mineralization.
Based on the analysis above, the knowledge of the element associations of the front halo, near-ore halo and tail halo can be summarized. Moreover, corresponding element associations are quantitatively extracted as compositional balances by the knowledgedriven CoDA framework (Figure 4), that is, the front halo association is B1 (As-Sb-Hg vs Au-Ag-Cu-Pb-Zn-W-Bi-Co-Mo), the near-ore halo association is B2 (Au-Ag-Cu-Pb-Zn vs As-Sb-Hg-W-Bi-Co-Mo), and the tail halo association is B3 (W-Bi-Co-Mo vs As-Sb-Hg-Au-Ag-Cu-Pb-Zn) (Figure 15).

Knowledge-Driven CoDA and Its Based Element Association Extraction
From the anomaly data volume model based on the C-V method in this paper, it is clear to recognize the distribution of each element in a three-dimensional space.
1. In the mid-shallow part and the deeper part of elevation at 2500 m, As, Sb and Hg are concentrated as the front halo of the orebodies. The anomalies of Au, Ag, Cu, Pb and Zn linked to sulphurs (pyrite, arsenopyrite, galena and covelline) and are superposed with the orebodies and can be regarded as the near-ore halo elements. At the shallow position of 2500 m, the anomaly locations of W, Mo and Bi linked to magmatism are at the tail of the orebodies, which can be regarded as the tail halo element association. 2. In terms of Au, its anomalies are controlled by fractures observably, and observable fractures certainly cross-cut orebodies. Especially, the abnormal intensity is larger along the depth indicating the deep mineralization.
Based on the analysis above, the knowledge of the element associations of the front halo, near-ore halo and tail halo can be summarized. Moreover, corresponding element associations are quantitatively extracted as compositional balances by the knowledgedriven CoDA framework (Figure 4), that is, the front halo association is B1 (As-Sb-Hg vs Au-Ag-Cu-Pb-Zn-W-Bi-Co-Mo), the near-ore halo association is B2 (Au-Ag-Cu-Pb-Zn vs As-Sb-Hg-W-Bi-Co-Mo), and the tail halo association is B3 (W-Bi-Co-Mo vs As-Sb-Hg-Au-Ag-Cu-Pb-Zn) (Figure 15).

Geological and Geochemical Quantitative Prediction Model at Depth of Zaozigou Gold Deposit
The mineral resources prediction model is usually summarized as text, diagrams, and tables by integrating comprehensive metallogenic information, such as orebodies, ore deposits, ore fields and even metallogenic zones. Establishing a mineral resources prediction model is an effective way to discover potential deposits and has significantly im-

Geological and Geochemical Quantitative Prediction Model at Depth of Zaozigou Gold Deposit
The mineral resources prediction model is usually summarized as text, diagrams, and tables by integrating comprehensive metallogenic information, such as orebodies, ore deposits, ore fields and even metallogenic zones. Establishing a mineral resources prediction model is an effective way to discover potential deposits and has significantly important meaning for guiding mineral exploration [142].
Orebodies are strictly controlled by fractures in the Zaozigou gold deposit. The 30 m buffer zone of the fractures can effectively reflect the influence range of the fracture, which can be used as a mineral prediction indicator. Factor F4 is an element association of Sb-Hg, which has close relationship with fractures, and can be used as a favorable indicator for inferring deep fractures [143][144][145][146] (Figure 14a).
Geochemical element distribution, association and zonation are favorable indicators for mineral resources prediction. The geochemical anomalies are extracted by the multiple fractal C-V model in Section 4.1, among which the middle anomaly of Au can well reflect the spatial distribution of orebodies (Figure 7b) and should be used as an important quantitative indicator. Near-ore halo element association B2 is extracted by the knowledgedriven CoDA and also can express the location of orebodies well; it should be another mineral prediction indicator ( Figure 15). The ratio of front halo to tail halo is an important geochemical parameter for predicting orebodies, and B1/B3 is regarded as a prediction indicator accordingly [143] (Figure 16).   In summary, the geological and geochemical quantitative prediction model at the depth of the Zaozigou gold deposit is constructed as in Table 3.

Three-Dimensional MPM Based on Machine Learning
To address the scientific problem of quantitative mineralization prediction at large depths, the previous section quantitatively extracted the deep geochemical mineralization signatures and constructed a geological and geochemical quantitative mineral resource prediction model at depth. In this section, the MaxEnt model and GMM are applied to carry out the 3D MPM to quantitatively predict deep mineral resources, and the uncertainty evaluation of the two models is performed for improving the accuracy of mineralization prediction.

Training Sample Construction
The MaxEnt model is a supervised machine learning algorithm, which requires learning an optimal model from a given training dataset and using this model to output the corresponding result for classification.
For mineralization prediction, the target variable of supervised learning (i.e., the label of training samples) is either mineralized or nonmineralized (denoted by 1 and 0, respectively). A total of 39,306 positive samples were extracted from known orebodies and 49,686 negative samples were extracted from the nonmineralized position confirmed by drillings, and these were used as the training dataset. In contrast to mineralization, which generates in a concentrated way in a limited space, the non-mineralization is a widespread phenomenon, and negative samples are selected to be distributed as randomly and uniformly as possible in the wall rock without mineralization and alteration throughout the study area [33] (Figure 17). The MaxEnt model is a supervised machine learning algorithm, which requires learning an optimal model from a given training dataset and using this model to output the corresponding result for classification.
For mineralization prediction, the target variable of supervised learning (i.e., the label of training samples) is either mineralized or nonmineralized (denoted by 1 and 0, respectively). A total of 39,306 positive samples were extracted from known orebodies and 49,686 negative samples were extracted from the nonmineralized position confirmed by drillings, and these were used as the training dataset. In contrast to mineralization, which generates in a concentrated way in a limited space, the non-mineralization is a widespread phenomenon, and negative samples are selected to be distributed as randomly and uniformly as possible in the wall rock without mineralization and alteration throughout the study area [33] (Figure 17).  The MaxEnt method estimates the probability of the target variable with the maximum entropy value and is controlled by a set of constraints representing incomplete information about the target distribution. In mineralization prediction, the best interpretation of unknown occurrences by the model is to maximize the entropy value of the probability distribution for estimating the location of orebodies, and many scholars have achieved better results in this regard [65,76,147] When modeling with MaxEnt software, if the model parameters are not set properly, it may lead to overfitting or redundancy [148]. The overfitting can be controlled by the modulation multiplier β [149], and the best performance of the model is obtained by setting the β value 2~4 [148,150]. Therefore, the study tested different values to find the best β value of 2 for the model to reduce the influence of model overfitting.
Five prediction indicators are integrated into the MaxEnt model as input parameters, and data were randomly selected from the dataset during simulation, with 75% of the dataset as the training data and 25% as the test data. To reduce the randomness of the simulation results, the model is repeated 50 times with a maximum convergence threshold of 0.00001. A maximum background points value of 10,000 is selected, and a logical value format output is chosen for a more favorable interpretation of the results.
The final prediction result of the MaxEnt model is evaluated using the average value of 50 iterations of the simulation, with the contribution rate of each mineral indicator shown in Table 4. Table 4. Contribution rate of prediction indicators.

Prediction Indicator
Rate of Contribution (%) Au 77.6 30m buffer zone 13 Au-Ag-Cu-Pb-Zn (B2) 7.7 Hg-Sb (F4) 0.9 As-Sb-Hg(B1)/W-Bi-Co-Mo(B3) 0.8 The AUC value of the test dataset is 0.844 and the AUC value of the training dataset is 0.848, so the MaxEnt model has high accuracy in mineral resources prediction at depth ( Figure 18).
The output logical probability of the MaxEnt model is in the range of 0.000804~0.927941, which is mapped to 0 and 1, and then the 3D MPM is formed (Figure 19).
Although the mineral prospectivity map shows a good relationship between high probabilities and known gold orebodies (Figure 19), it is difficult to determine a certain logistic probabilities value as the prediction threshold value.
We take the ratio of prediction volume to orebodies occupied volume as a parameter. Observably, it must be that reverse variation of this parameter follows the greater logistic probabilities ( Figure 20). The high potential area (logical probability > 0.525) is defined by the logistic probability of 0.525, which covers 80% of the known orebodies; the medium potential area (0.3 < logical probability < 0.525) is defined by the logistic probability of 0.3, which is another inflection point and covers all the known orebodies ( Figure 20). The spatial distribution of mineralization potential areas by the MaxEnt model are shown in Figure 21a, based on which two mineral exploration targets are circled ( Figure 21b).
As-Sb-Hg(B1)/W-Bi-Co-Mo(B3) 0.8 The AUC value of the test dataset is 0.844 and the AUC value of the training is 0.848, so the MaxEnt model has high accuracy in mineral resources prediction ( Figure 18).  Although the mineral prospectivity map shows a good relationship between high probabilities and known gold orebodies (Figure 19), it is difficult to determine a certain logistic probabilities value as the prediction threshold value.
We take the ratio of prediction volume to orebodies occupied volume as a parameter. Observably, it must be that reverse variation of this parameter follows the greater logistic probabilities (Figure 20). The high potential area (logical probability > 0.525) is defined by the logistic probability of 0.525, which covers 80% of the known orebodies; the medium Observably, it must be that reverse variation of this parameter follows the greater logistic probabilities (Figure 20). The high potential area (logical probability > 0.525) is defined by the logistic probability of 0.525, which covers 80% of the known orebodies; the medium potential area (0.3 < logical probability < 0.525) is defined by the logistic probability of 0.3, which is another inflection point and covers all the known orebodies ( Figure 20). The spatial distribution of mineralization potential areas by the MaxEnt model are shown in Figure 21a, based on which two mineral exploration targets are circled (Figure 21b).

Three-dimensional MPM and Uncertainty Evaluation of GMM
When training and testing the model, the labeled data (which is used only in the evaluation) is divided into 75% for the training dataset and 25% for the test dataset, and the GMM is used to learn the information of the training dataset, and then the ROC curve is used for performance evaluation of the training dataset and test dataset, respectively. The AUC value of the test dataset is 0.75 and the AUC value of the training dataset is 0.75 ( Figure 22).

Three-Dimensional MPM and Uncertainty Evaluation of GMM
When training and testing the model, the labeled data (which is used only in the evaluation) is divided into 75% for the training dataset and 25% for the test dataset, and the GMM is used to learn the information of the training dataset, and then the ROC curve is used for performance evaluation of the training dataset and test dataset, respectively. The AUC value of the test dataset is 0.75 and the AUC value of the training dataset is 0.75 ( Figure 22). From the point of the AUC value, 0.75 is not a high value, which may indicate that unsupervised training methods have a shortage in prediction with big data.
However, from the mineral prospectivity map point of view, the prediction area has covered orebodies well, and it still has a certain indication function in the mineralization prediction. Finally, two mineral exploration targets are delineated at depth (Figure 23).  From the point of the AUC value, 0.75 is not a high value, which may indicate that unsupervised training methods have a shortage in prediction with big data.
However, from the mineral prospectivity map point of view, the prediction area has covered orebodies well, and it still has a certain indication function in the mineralization prediction. Finally, two mineral exploration targets are delineated at depth (Figure 23). From the point of the AUC value, 0.75 is not a high value, which may indicate that unsupervised training methods have a shortage in prediction with big data.
However, from the mineral prospectivity map point of view, the prediction area has covered orebodies well, and it still has a certain indication function in the mineralization prediction. Finally, two mineral exploration targets are delineated at depth (Figure 23).

Discussion
This study employed the geostatistical interpolation method to build a 3D geochemical model and geochemical anomaly model. In addition to deterministic modeling of 3D geology and geochemistry [151,152], geostatistical techniques also include uncertainty modeling of spatial distribution of subsurface heterogeneous structures and dynamic processes of fluid migration [153]. In view of the 3D heterogeneous structure, the multipoint geostatistical method can be used to overcome the shortage of traditional geostatistical simulations in delineating the geometric continuity of geological structures [154][155][156][157][158][159]. Meanwhile, traditional geostatistical simulation has the limitations of large computation, complicated parameterization and difficult to characterize multi-scale data. The application of machine learning and deep learning methods to reconstruct geological and geochemical structures can improve the simulation efficiency and can accurately express complex heterogeneous spatial structures [160,161], which deserves further research work.
The machine learning methods of the MaxEnt model and GMM are carried out for 3D MPM in the Zaozigou gold deposit in this study. Compared with the GMM, the MaxEnt model has a higher precision in detection of ore-induced anomalies, which demonstrates a higher reliability of 3D MPM (Figures 18 and 22). The prediction results of the two methods express a high correlation with the known orebodies, based on which two mineral exploration targets are circled (Figures 21 and 23).
Target I of the MaxEnt model is located at an elevation of 1600~2000 m, belonging to the NE-orientation orebody group, which should be the extension of the Au1 orebody. The Au and Sb concentration in this position ( Figure 9) and the ratio of front halo to tail halo has been increasing ( Figure 16). Additionally, it appears that the high logical probability calculated by the MaxEnt model and GMM at this position indicates the Au1 orebody may extend deeper or a concealed orebody exists therein. Meanwhile, the Target I of GMM is located at the elevation of about 1300 m, reflecting the weak anomaly in the deep drill.
Target II of the two methods is similarly located at the NW-orientation orebody group at the elevation of about 2500 m, where the fractures distribute complexly and the anomalies of tail halo elements and front halo elements overlapped (Figure 9).

Conclusions
In this paper, the three-dimensional primary halo anomaly data volume model is built based on the multifractal C-V model, which fully considers the nonlinear characteristics of the primary geochemical data. The C-V model is a three-dimensional extension of the two-dimensional multifractal method, according to which the geochemical concentrations are clearly illustrated at depth. The 3D geochemical anomaly data volume model provides an important element distribution indicator to the 3D MPM.
The data-driven CoDA method was performed in this paper by using clr transformation and factor analysis, among which factor F4 is selected as a prediction indicator. The knowledge-driven CoDA method used the SBP approach to extract the element associations of front halo, near-ore halo and tail halo, and the association of near-ore halo and the ratio of front halo to tail halo are selected as the other two prediction indicators. These selected geochemical association indicators are reliable for their good reflection in metallogenic regularity.
From the results of this paper, the MaxEnt model and the GMM are efficient machine learning methods in 3D MPM. By comparing the spatial distribution of the orebodies and the indication of the metallogenic regularity, the delineated mineral exploration targets can be considered as the mineral potential areas for further investigation. However, it must be mentioned that machine learning algorithms have fast and accurate calculation in the case of small data but lack generalization ability compared with deep learning algorithms in big data. As the amount of data gradually increases, the prediction ability of the MaxEnt model and the GMM usually reach the bottleneck, while deep learning can use more parameters to continuously optimize and improve the detection ability of the models. Deep