Support Vector Machine Algorithm for Automatically Identifying Depositional Microfacies Using Well Logs

Depositional microfacies identification plays a key role in the exploration and development of oil and gas reservoirs. Conventionally, depositional microfacies are manually identified by geologists based on the observation of core samples. This conventional method for identifying depositional microfacies is time-consuming, and only the depositional microfacies in a few wells can be identified due to the limited core samples in these wells. In this study, the support vector machine (SVM) algorithm is proposed to identify depositional microfacies automatically using well logs. The application of SVM includes the following steps: First, the depositional microfacies are determined manually in several wells with core samples. Then, the training sets used in the SVM algorithm are extracted from the well logs. Finally, a quantitative discrimination model based on the SVM algorithm is established to realize the classification of depositional microfacies. Field application shows that this innovative and constructive solution can be effectively used in uncored wells to identify depositional microfacies with a rate of accuracy approaching 84%. It overcomes the limitation of the conventional manual method which greatly contributes to the cost-saving of core analysis and improves the sustainable profitability of oil and gas exploration.


Introduction
The first person proposing the systematic application of logs in geology was Pirson [1].Pirson's key idea is to use log data to perform sedimentology studies in the oil field, and hence to conduct reservoir descriptions of oil and gas reservoirs.Serra systematically introduced the application of logging data in geology [2][3][4].The practice of logging facies, via establishing a forward model based on a comprehensive analysis of depositional facies and logging data as well as the calibration of wells to the core, is the key for sedimentary interpretation with continuous logging data.In different sedimentary environments, the sediment provenance, hydrodynamic force conditions and depth of water are different.Those different conditions lead to different superimposed forms and sequence characteristics of sediments [5][6][7], and hence different curve shapes and combination styles on log curves.Original petrophysical well log values are physical properties (such as density, resistivity, or radioactivity) of the rock measured by logging tools.The determination of depositional microfacies is a geological topic.Although each of the microfacies may have different log values, geological characteristics may not directly link to log values.Fortunately, the variation of log values within a depth window is more suitable for describing the geological characteristics because the variation of petrophysical well log values contains information of geological variation in the time and space domains.So, the derived (log variation) characteristic parameters of the logs within a depth window are useful, and they represent the link between the geological characteristics and log values.In theory, for most of the conventional logs, especially the neutron log, density log and acoustic log, any single log curve or combination of log curves can be used.Currently, due to the development of logging techniques and log interpretation methods, the study of depositional microfacies with logging data is gradually moving towards high precision, automation and intelligence [8][9][10].
So far, the most used machine learning algorithms for the automatic identification of depositional microfacies by well logs include Bayesian, fuzzy clustering and artificial neural network (ANN) algorithms [11][12][13][14].They have provided robust, efficient and effective platforms for solving problems [15].However, these machine learning algorithms have many limitations and may not meet the need for automatic identification of microfacies with a highly successful rate in many cases [16,17].When using the Bayesian algorithm, the assumptions of the independent distribution should be known first [18].However, it is impossible for these distributions to be completely independent.As for the classification decisions, the error rates of these are inevitable.Fuzzy clustering analysis is sensitive to outliers and noise, and the results are unstable [19].The choice of initial value still affects the results of this method.Although ANN has the advantages of strong robustness, memory ability and strong nonlinear mapping ability, it requires a large number of parameters [20].When the parameters are insufficient, it cannot work.Besides, the learning process cannot be observed, and the output is difficult to understand.
Support vector machine (SVM) is a machine learning algorithm proposed by Vapnik, based on the Vapnik-Chervonenkis Dimension theory of statistical learning theory and the principle of structural risk minimization [21].It has many unique advantages in solving nonlinear and high dimensional pattern recognition problems with a small number of samples [22,23].Due to the absence of the curse of dimensionality, SVM exhibits better generalization due to structural risk minimization.Moreover, SVM is less prone to over-fitting, requires a lower number of training features, and has the significant advantage of being without a local minimum [24,25].The solution to an SVM algorithm is global and unique because it is being defined by a convex optimization problem [26], which makes SVM better in these areas.SVM also has the advantage of a simple geometric interpretation and give a sparse solution [27].Compared to ANN, the robustness (among other advantages) of SVM has proven that SVM is a better computational intelligence method to use in this study area.
Overall, the identification of depositional microfacies of oil and gas reservoirs is money-cost and time-consuming due to the expensive core analysis and long-time manualwork.This paper, for the first time, proposes a new solution based on SVM to identify depositional microfacies quantitativly and automatically.Specifically, Section 2 introduces the methodology of SVM in details, Section 3 presents the result and discussion and Section 4 ends with conclusion and future research.

Methodology
Generally, the sedimentary background of the study area should be known and the qualitative identification model of depositional microfacies should be established [28,29].The target reservoir in the study area is the He-8 reservoir of Lower Shihezi in the Su-77 block of the Sulige gas field, and the whole body of He-8 is sandy braided river depositional systems in a region of wet marshes, at a certain distance from the sediment provenance [30,31].The corresponding relationship between the shape and the superimposed pattern of well logs in cored wells and the total of six sedimentary microfacies was established, which is different from those braided river geological models proposed by Miall et al. [32,33].The river was transformed into a more arid climate from bottom to top as the swing of the river increased.The depositional microfacies models in this area are shown in Figure 1.
certain distance from the sediment provenance [30,31].The corresponding relationship between the shape and the superimposed pattern of well logs in cored wells and the total of six sedimentary microfacies was established, which is different from those braided river geological models proposed by Miall et al. [32,33].The river was transformed into a more arid climate from bottom to top as the swing of the river increased.The depositional microfacies models in this area are shown in Figure 1.Based on the observation of outcrops, drilled cores and thin sections in the study area, combining the identified depositional microfacies with their corresponding log curve shape characteristics, the standard microfacies identification models suitable for the study area are established [34][35][36].Table 1 shows six types of main microfacies of the braided river in the study area.Based on the observation of outcrops, drilled cores and thin sections in the study area, combining the identified depositional microfacies with their corresponding log curve shape characteristics, the standard microfacies identification models suitable for the study area are established [34][35][36].Table 1 shows six types of main microfacies of the braided river in the study area.After establishing the qualitative identification models of depositional microfacies in the study case, the quantitative identification of depositional microfacies could then be performed.The key idea of quantitative identification of depositional microfacies by logging data based on the SVM algorithm is that each depositional microfacies can be represented by a set of eigenvectors (log curves characteristic parameters).Sample sets of different microfacies types are mapped to a higher dimension space by nonlinear transformation.This method seeks the optimal classification plane among the samples in the high dimensional feature space, which enables the classification surface to both guarantee the classification accuracy and maximize the interval between two classes of data.Finally, decision functions are constructed to determine the microfacies type of unknown prediction samples as well as the output type labels of the prediction samples, thus achieving depositional microfacies identification [37,38].The Figure 2 presents the main flow chart of the methodology.The following subsections introduce the steps in details.
idea of quantitative identification of depositional microfacies by logging data based on the SVM algorithm is that each depositional microfacies can be represented by a set of eigenvectors (log curves characteristic parameters).Sample sets of different microfacies types are mapped to a higher dimension space by nonlinear transformation.This method seeks the optimal classification plane among the samples in the high dimensional feature space, which enables the classification surface to both guarantee the classification accuracy and maximize the interval between two classes of data.Finally, decision functions are constructed to determine the microfacies type of unknown prediction samples as well as the output type labels of the prediction samples, thus achieving depositional microfacies identification [37,38].The Figure 2 presents the main flow chart of the methodology.The following subsections introduce the steps in details.

Data Preparation
Data mining and mathematical analysis of the log curves were carried out to determine the characteristic parameters corresponding to the depositional microfacies.After the analysis of the available logs of the study area, the natural gamma curve (GR) [39]-which can best reflect the depositional microfacies of the study area-was selected.The parameters used to quantitatively describe the log curve characteristics were extracted from several aspects such as the curve shape, top-bottom contact relationship and smoothness [3,40].The value of shale content Vsh is used as a constraint to improve the microfacies identification in the complex geological conditions of the tight sandstone reservoirs in the study area.It is better to extract more characteristic parameters of log curves that can be used to express geological characteristics from density, neutron and acoustic logging curves.The number of characteristic parameters [41,42] in this study that were selected to characterize sedimentary microfacies extracted from well logs was 11; The data were normalized.
The training instances, that is, the matrix (m by n) needs to be constructed.Each column represents a characteristic parameter with a total of n characteristic parameters.Each row represents an instance, with a total of m instances.The training instances are  ,  ,  1,2, … ,  , where m is the number of instances, among which,  is the ith instance.The output is  ∈ 1, 1 , which represents the classification label.If  belongs to a certain class, the corresponding output is labeled as  1; if it does not belong to a certain class, the output is labeled as  1.The data sets used in this study were extracted from the well logs of six cored wells in the He-8 reservoir of the Lower

Data Preparation
Data mining and mathematical analysis of the log curves were carried out to determine the characteristic parameters corresponding to the depositional microfacies.After the analysis of the available logs of the study area, the natural gamma curve (GR) [39]-which can best reflect the depositional microfacies of the study area-was selected.The parameters used to quantitatively describe the log curve characteristics were extracted from several aspects such as the curve shape, top-bottom contact relationship and smoothness [3,40].The value of shale content V sh is used as a constraint to improve the microfacies identification in the complex geological conditions of the tight sandstone reservoirs in the study area.It is better to extract more characteristic parameters of log curves that can be used to express geological characteristics from density, neutron and acoustic logging curves.The number of characteristic parameters [41,42] in this study that were selected to characterize sedimentary microfacies extracted from well logs was 11; The data were normalized.
The training instances, that is, the matrix (m by n) needs to be constructed.Each column represents a characteristic parameter with a total of n characteristic parameters.Each row represents an instance, with a total of m instances.The training instances are {(x i , y i ), i = 1, 2, . . ., m}, where m is the number of instances, among which, x i is the ith instance.The output is y i ∈ (+1, −1), which represents the classification label.If x i belongs to a certain class, the corresponding output is labeled as y i = +1; if it does not belong to a certain class, the output is labeled as y i = −1.The data sets used in this study were extracted from the well logs of six cored wells in the He-8 reservoir of the Lower Shihezi Formation in Ordos basin, China.The China National Petroleum Corporation (CNPC) provided the petrophysical well logs and corresponding petrographic core analysis reports.The training instances include 11 features parameters as displayed in Table 2. Figure 3 is the schematic diagram describing the shape of a log curve.The log is sampled with a constant sampling interval from top to bottom, and n + 1 log samples were obtained: depths d 0 , d 1 , d 2 , . . ., d n versus corresponding log reading (amplitude) values a 0 , a 1 , a 2 , . . ., a n .The thickness n) is calculated as well.Within the determined depth window, using the determined minimum amplitude of gear tooth ∆a min as the lowest limit, the maximum and minimum values of the amplitude and depth sequence from top to bottom were obtained successively.The extreme value and depth sequence of log curve [A i , D i ](i = 1, 2, ..., n) are obtained.Finally, it is assumed that the sequence started and ended with a valley (see Figure 3).The vertical axis is the depth, and the horizontal axis is the log value.a i is the log reading value, and A i is the extreme value.The wave peak is found to exhibit a sawtooth form.Figure 3 is the schematic diagram describing the shape of a log curve.The log is sampled with a constant sampling interval from top to bottom, and  1 log samples were obtained: depths  ,  ,  , … ,  versus corresponding log reading (amplitude) values  ,  ,  , … ,  .The thickness ℎ    1,2, . . .,  is calculated as well.Within the determined depth window, using the determined minimum amplitude of gear tooth  as the lowest limit, the maximum and minimum values of the amplitude and depth sequence from top to bottom were obtained successively.The extreme value and depth sequence of log curve  ,   1,2, . . .,  are obtained.Finally, it is assumed that the sequence started and ended with a valley (see Figure 3).The vertical axis is the depth, and the horizontal axis is the log value.ai is the log reading value, and Ai is the extreme value.The wave peak is found to exhibit a sawtooth form.The four categories of feature parameters listed in Table 2 represents curve shape (G, S, P, Am), contact relation with top and bottom beds (am-b, am-t, Kp), smoothness (m, R, Kc), and geological The four categories of feature parameters listed in Table 2 represents curve shape (G, S, P, Am), contact relation with top and bottom beds (am-b, am-t, Kp), smoothness (m, R, Kc), and geological constraint parameters (Vsh).The acquisition of these characteristic parameters is an important step in the data preparation stage, and it is also the basis of this method.Previous researchers did not systematically and clearly tell the readers how to acquire these parameters and their meanings.
To enable the readers to understand better and reproduce this method, this paper re-combed how these parameters are calculated in the following paragraphs of Section 2.1 and elaborated their geological significance in detail.
The curve shape of the log curve includes feature parameters of G, S, P, and A m .G reflect the curve shape and also explain the graded sequence of the sedimentary rock grains [41].It can be expressed as: where G is the relative center of gravity, dimensionless; a i is the ith log value; n is the number of attributes features of the samples.When G > 0.5, the center of gravity is lower and the shape of the curve is bell-shaped, which indicates that the energy of water flow decreases gradually, the supply of sediment provenance decreases gradually and the sedimentary rock grain usually exhibits a graded bed sequence.When G < 0.5, the curve is funnel-shaped and the center of gravity is higher, which reflects the gradual increase of the water flow energy, the gradual increase of the sediment provenance supply and the sedimentary rock grain has a reversed grain sequence.When G ≈ 0.5, the curve shape is a box type-symmetrical or flat tooth-shaped-which represents the abundant supply of sediment provenance in the process of sedimentation, strong hydrodynamic force conditions and a stable sedimentary environment.The S reflects the undulatory property of the curve data [41] and can be expressed as: where S is the variance of log amplitude, dimensionless; a represents the mean values of the n data points; a i represents the ith log values.n represents the number of sampling points in the selected curve section.S can reflect the physical property of rock to a certain extent.If the grain-size sorting is good, there would be few changes to the physical property, and the log curve shape would be approximately a straight line (mainly a box shape).The log data mean value would be close to most log values over the middle part of the curve, and only a few log values over the top and bottom parts of the curve would be different from the log data mean value.So, S would be smaller.If the grain-size sorting is poor and the physical property variation is significant, the logging curve would have some fluctuation.The curve may be funnel-shaped, bell-shaped or finger-shaped, and the log data would be relatively dispersed.Some of the data points would deviate from the mean value, so S would be bigger.The mean square deviation S can be used to distinguish box shape and other morphology.The P reflect the rate of lithological change [41] and can be expressed as: where P is the ratio of the amplitude to the thickness of a log curve, dimensionless; ∆a indicates the difference between the extreme value of the log curve and the value of the shale baseline; ∆h indicates the thickness of the formation (m).As for the sand-shale formations, the larger the value of ∆h, the more abundant the sediment source, and the shape of the curve is observed to be box type when the physical property changes little.On the contrary, if the value of ∆h is small, this illustrates only a thin interlayer, the deposition time is short and the curve segment shape is finger-shaped.
The A m reflect the concentration degree of the primary amplitude values of the curve [41] and can be expressed as: where A m is the median of log values, dimensionless; a is the mean value of the curve; Me is the median value of the curve.
The contact relation between top and bottom beds can be described by the combination of two feature parameters-the mutational amplitude difference a m [42] and the average slope K p . a m can be calculated by Equation ( 5), a m = a max − a min (5) where a m is the mutational amplitude difference, dimensionless; a max is the maximum amplitude at the mutation; a min is the minimum amplitude at the mutation.The average slope K p [42], for data sequences within the depth window of the log curve [a i , d i ](i = 1, 2, ..., n), can be expressed as: where K p is the average slope of a log curve, dimensionless; The contact relation between top and bottom beds can reflect the variation rate of the sediment source supply and water flow energy at the initial and final depositional stages of the sand body.There are two types of contact relations are gradual change and abrupt change.The abrupt change position is the position at which the average slope values of two adjacent curve segments change from positive to negative or from negative to positive, and it tends to indicate scouring (abrupt change at the bottom) or sediment provenance interruption (abrupt change at the top).
The smoothness of the log curve can be indicated by the feature parameters of m, R and K c .The concave-convex shape of the log curve's envelope can reflect the change of the water flow energy and depositional rate in the process of sedimentation.The convex, straight-line and concave shapes of the envelope corresponding to the variation of the depositional rate being accelerated, linear or subsided, respectively.The second derivative of a function (curve) can represent the concave and convex characteristics of the curve.So, the concave and convex characteristics of the envelope of the log curve can be obtained by the second derivative m of the envelope [41], which is fitted by the maximum value sequence of the log segment.The envelope is fitted by a parabolic curve using Equation (7), where D is the function of the log curve's envelope, dimensionless; m is obtained by performing the second derivative of Equation (7).If m > 0, the envelope shape is concave-concave downward stands for accelerated deposition; concave upward means decelerated deposition.If m < 0 the envelope shape is convex-convex downward indicates accelerated deposition; convex upward represents decelerated deposition.If m = 0, the envelope shape is linear, which represent uniform deposition.With the combination of m and K p , the progradational, reprogradational and aggradational cycles of the sediments can be further distinguished.If K p < 0, m >0, it represents the progradational decelerated type.If K p < 0, m < 0, it is the progradational accelerated type.If K p > 0, m > 0, it is the reprogradational decelerated type.If K p > 0, m < 0, it is the reprogradational accelerated type.For the decrement curve, the concavity and convexity of the envelope are determined by the minimum value sequence using the above method.The number of sawtooth waves of a log curve segment can reflect the intermittence of the deposition process and the stability of the sedimentary environment.A different sequence of log points is constructed first.If the sign of adjacent difference values reverses and the absolute difference between adjacent difference values is higher than a specified value, then it is considered as a sawtooth.So, the number of sign reversals of adjacent difference values in a log segment indicates the number of gear teeth (L).R is a relative sawtooth number of a log curve segment [41], and can be expressed as: where R is the sawtooth rate, dimensionless; L is the number of sawtooth waves; n is the number of log data points in the log curve segment.The sawtooth center line can be represented by the slope K of the gear tooth centerline or the angle θ between the sawtooth centerline and the horizontal line [42].In the depth window, the sequence of data amplitude and depth of the log curve is [a i , d i ](i = 1, 2, ..., n).
Selecting the crest as the maximum value and the trough as the minimum value, the sequence of maximum value and depth is (A imax , D imax )(i = 1, 2, . . ., n), and the sequence of minimum value and depth is (A imin , D imin )(i = 1, 2, . . ., n).The angle between the tooth center line of the ith crest and the horizontal line can be expressed as: where θ c is the angle between the tooth center line of the trough and the horizontal line ( • ).The angle between the tooth center line of the ith trough and the horizontal line [42] can be expressed as: where θ t is the angle between the tooth center line of the trough and the horizontal line ( • ).The slope of the tooth centerline [42] of the ith crest can be expressed as: where K c is the slope of the tooth centerline of the crest.The slope of the tooth centerline [42] of the ith trough can be expressed as: where K c is the slope of the tooth centerline of the trough.θ c , θ t , K c and K t may be used together.
In practice, usually one of them (for example, K c ), is used.If K c > 0 and its value increases from bottom to top, it represents outward convergence and indicates that the deposition rate increases from bottom to top.If K c < 0 and its value decreases from bottom to top, it represents inward convergence and indicates that the deposition rate decreases from bottom to top.If K c ≈ 0, this indicates that the physical property of the layers is stable, and that the sedimentary environment and sediment provenance are stable.When the K c value swings seriously, it shows that the sedimentary environment frequently changes, hence causing great changes in the physical property.The shale content (V sh ) can reflect the magnitude of the flow energy and the distance from the sediment provenance.The shale content of different depositional facies is different and set apart from the changes in the log curve shape.Therefore, the shale content, as a geological constraint parameter, can be used to optimize the microfacies quantitative identification model.The shale content is calculated by the combination of the gamma-ray log and the difference between density porosity and neutron porosity logs.

SVM Classification Model
The following is the specific process of training the SVM classifier by the training sample set to gain the classification model (optimal decision function), including three steps: selecting the kernel function; constructing the optimal classification surface; solving the optimal decision function [43,44].

Kernel Function
If the low dimensional space is linearly inseparable, it may become linearly separable after being mapped to a higher dimensional feature space through nonlinear mapping.If this technique is used for classification or regression in high dimensional space directly, there will be some challenging issues such as determining the form and parameters of a nonlinear mapping function and the number of dimensions of the feature space.Support vector machine is used to solve the inner product operation in the high dimensional feature space by using the kernel function K(x i , x) in the low dimensional space and avoiding the dimension disaster [45].The kernel function value is the similarity between x i and x.There are three kinds of kernel functions in SVM, and selecting the suitable kernel function is the key to successful nonlinear classification [46].In the case of an unknown probability of classification, this paper selects the widely used radial basis function (RBF), which is expressed by Equation (13), where K is the kernel function; r is the RBF parameter; x i represents the feature vectors of the training samples; x represents the prediction samples.Based on the training samples of depositional microfacies of six cored wells and 11 feature parameters, using the cross-validation method, the obtained optimal kernel function parameter r for identifying positional microfacies in the study case was found to be 0.1.

Optimal Classification Surface
After the input sample is transformed into the high dimensional space using the radial basis kernel function, the optimal classification surface needs to be determined in the high dimensional space (see Figure 4) The separation between different clusters is termed as the hyperplane, and its geometry can be linear, polynomial or radial [25].w is the normal vector of the optimal SVM decision surface in Figure 4.The red and blue dots in Figure 4 are the features of two different sedimentary microfacies samples.Solving the hyperplane is equivalent to solving a Lagrange function L(w, b, a).Nonlinear data set separation is achieved by a separating the hyperplane using a kernel function in the feature space.
If the low dimensional space is linearly inseparable, it may become linearly separable after being mapped to a higher dimensional feature space through nonlinear mapping.If this technique is used for classification or regression in high dimensional space directly, there will be some challenging issues such as determining the form and parameters of a nonlinear mapping function and the number of dimensions of the feature space.Support vector machine is used to solve the inner product operation in the high dimensional feature space by using the kernel function   ,  in the low dimensional space and avoiding the dimension disaster [45].The kernel function value is the similarity between  and .There are three kinds of kernel functions in SVM, and selecting the suitable kernel function is the key to successful nonlinear classification [46].In the case of an unknown probability of classification, this paper selects the widely used radial basis function (RBF), which is expressed by Equation ( 13), where K is the kernel function; r is the RBF parameter;  represents the feature vectors of the training samples;  represents the prediction samples.Based on the training samples of depositional microfacies of six cored wells and 11 feature parameters, using the cross-validation method, the obtained optimal kernel function parameter r for identifying positional microfacies in the study case was found to be 0.1.

Optimal Classification Surface
After the input sample is transformed into the high dimensional space using the radial basis kernel function, the optimal classification surface needs to be determined in the high dimensional space (see Figure 4) The separation between different clusters is termed as the hyperplane, and its geometry can be linear, polynomial or radial [25].w is the normal vector of the optimal SVM decision surface in Figure 4.The red and blue dots in Figure 4 are the features of two different sedimentary microfacies samples.Solving the hyperplane is equivalent to solving a Lagrange function  , ,  .Nonlinear data set separation is achieved by a separating the hyperplane using a kernel function in the feature space.After obtaining the optimal classification hyperplane, it can be used to predict the prediction sample set.The determination of the optimal classification surface can be expressed as a constrained After obtaining the optimal classification hyperplane, it can be used to predict the prediction sample set.The determination of the optimal classification surface can be expressed as a constrained optimization based on the Lagrange relaxation method L(w, b, a).The optimal objective function of the generalized optimal classification surface of SVM [22] can be transformed into Equation (14), The corresponding constraint is where x i , x j are the log curve characteristic parameter vectors; y i , y j are the microfacies type label vectors of the training samples; c is the penalty parameter.When c = 10, the best result of depositional microfacies identification in the study area was achieved.The optimal solution α * = a * 1 , ..., a * n is obtained, then select a * j as a positive component of α * and calculate b * = y j − ∑ n i=1 y i a * i K → x i , → x j , where b * is the threshold value for classification and α * is the Lagrange multiplier.

Optimal Decision Function
After the above step, the optimal decision function [21] (expressed by Equation ( 15)) satisfying the constraint is obtained by solving the optimal classification surface (a quadratic programming optimization problem under the inequality constraint and having a unique solution).The optimal decision function is the categorization model.That is, Model = svmtrain (train_label, train_matrix, [option]).
where x i denotes the feature vectors of the training samples; x represents the prediction samples; y i and y are output depositional microfacies types labels of the training samples and prediction samples, respectively; K(x i •x) is the kernel function; n is the number of samples.

Discrimination of the Prediction Instances
After obtaining the classification model by the training samples, the model was used to discriminate the depositional microfacies types of prediction samples.Finally, the microfacies type label column vector of the prediction samples are the output, and the row number of the column vector is equal to the number of the prediction sample set matrix.That is, [predict_label, accuracy] = svmpredict (predict_label, predict_matrix, model).
Finally, the "one against one" method was used to perform multi-classification [22].Any two sample sequences were combined to construct C 2 N vector machines, the "vote" was adopted to classify and perform the identification of N types of depositional microfacies in the intervals without the core.

Results and Discussion
The method proposed in this paper for the automatic identification of microfacies was applied to the H-8 reservoir in multiple wells of the studied area.To verify the results of the method, two cored wells (S77-23-37 and S77-3-21) are randomly selected as verification wells.These two cored wells are not used in the stage of establishing qualitative identification models of depositional microfacies, so they could be used to verify whether the method works or not.In these two verification wells, 25 microfacies including 6 major river bed lag deposit, 4 crevasse river bed lag deposit, 6 major river channel bar, 4 crevasse river channel bar, 1 flood swamp, and 3 flood plain as shown in Table 3 are selected.These 25 depositional microfacies are identified by geologists through the observation of thin slice photographs of core samples and can be regarded as the standard and true depositional microfacies.After the microfacies identification by geologists, these 25 depositional microfacies are processed by the SVM method.To verify the reliability, the identification results by SVM are compared with that by geologists.The comparison results in Figures 5 and 6 show that only 4 layers identified by the SVM are not inconsistent with the results by geologists.In other words, there are 21 accurate recognition results and four errors, so the accuracy rate is 84%.This paper compares the recognition results between the SVM method and the traditional manual method by a geologist in the two example wells, as shown in Figures 5 and 6.Some core thin section photos are shown to verify the real results as well.In well S77-3-21 (Figure 6), braided river channel lag deposit and diara deposit occur alternately from 2970 to 2980 m, while the lithology is transitional and the sedimentary thickness is thin.This SVM method accurately identified and distinguished the thin layer, and showed that the automatic method for the identification of depositional microfacies works well in a tight sandstone gas reservoir.This paper compares the recognition results between the SVM method and the traditional manual method by a geologist in the two example wells, as shown in Figures 5 and 6.Some core thin section photos are shown to verify the real results as well.In well S77-3-21 (Figure 6), braided river channel lag deposit and diara deposit occur alternately from 2970 to 2980 m, while the lithology is transitional and the sedimentary thickness is thin.This SVM method accurately identified and distinguished the thin layer, and showed that the automatic method for the identification of depositional microfacies works well in a tight sandstone gas reservoir.

Conclusions
In this study, the SVM algorithm is introduced and applied to automatically identify depositional microfacies using well logs.The following conclusions have been obtained.(1) The comprehensive, accurate, and quantitative description of the variation characteristics of log curves is the foundation for the automatic identification of depositional microfacies by SVM algorithm.That is, four categories of feature parameters in terms of curve shape, contact relation with top and bottom beds, smoothness, and geological constraint parameters should be used as the input parameters of SVM.(2) The SVM follows the steps of selecting radial basis function, obtaining the SVM discrimination model of depositional microfacies by training sample sets, and realizing the discrimination of predicted samples.(3) The application of SVM in the He-8 reservoir of Lower Shihezi in the Su-77 block of the Sulige gas field shows that the SVM algorithm can automatically and quantitatively identify depositional microfacies with a rate of accuracy approaching 84%.
In summary, this study proposes an innovative and constructive solution for identifying depositional microfacies.This innovative and constructive solution for identifying depositional microfacies will contribute to the cost-saving of core analysis and improve the sustainable profitability of oil and gas exploration.However, the application of this method needs to take into consideration the regional sedimentary environment and the geological stratification.Future work to

Conclusions
In this study, the SVM algorithm is introduced and applied to automatically identify depositional microfacies using well logs.The following conclusions have been obtained.(1) The comprehensive, accurate, and quantitative description of the variation characteristics of log curves is the foundation for the automatic identification of depositional microfacies by SVM algorithm.That is, four categories of feature parameters in terms of curve shape, contact relation with top and bottom beds, smoothness, and geological constraint parameters should be used as the input parameters of SVM.(2) The SVM follows the steps of selecting radial basis function, obtaining the SVM discrimination model of depositional microfacies by training sample sets, and realizing the discrimination of predicted samples.
(3) The application of SVM in the He-8 reservoir of Lower Shihezi in the Su-77 block of the Sulige gas field shows that the SVM algorithm can automatically and quantitatively identify depositional microfacies with a rate of accuracy approaching 84%.
In summary, this study proposes an innovative and constructive solution for identifying depositional microfacies.This innovative and constructive solution for identifying depositional microfacies will contribute to the cost-saving of core analysis and improve the sustainable profitability of oil and gas exploration.However, the application of this method needs to take into consideration the regional sedimentary environment and the geological stratification.Future work to improve this method will focus on the combination of automatic stratification technology, signal analysis technology and the selection of microfacies-sensitive log curves.This method in our study was only applied in the depositional microfacies identification of tight sandstone gas reservoirs.It has potential applications in carbonate and shale reservoirs, although the universal applicability of the method in these two kinds of reservoirs should be verified.

Figure 1 .
Figure 1.Braided river depositional microfacies qualitative identification models of the He-8 reservoir of the Su-77 block in the Sulige gas field, Ordos basin.

Figure 1 .
Figure 1.Braided river depositional microfacies qualitative identification models of the He-8 reservoir of the Su-77 block in the Sulige gas field, Ordos basin.

Figure 2 .
Figure 2. Flow chart of depositional microfacies classification method by support vector machine (SVM).

Figure 2 .
Figure 2. Flow chart of depositional microfacies classification method by support vector machine (SVM).

Figure 3 .
Figure 3. Schematic diagram of well logging curve shape modified from Ma et al. [42].

Figure 3 .
Figure 3. Schematic diagram of well logging curve shape modified from Ma et al. [42].

Figure 4 .
Figure 4. Schematic diagram of the optimal classification, support vectors, hyperplane, and optimal SVM decision surface.

Figure 4 .
Figure 4. Schematic diagram of the optimal classification, support vectors, hyperplane, and optimal SVM decision surface.

Figure 5 .
Figure 5.Comparison of the microfacies identification results by SVM method and geologists in well S77-23-37.

Figure 5 .
Figure 5.Comparison of the microfacies identification results by SVM method and geologists in well S77-23-37.

Figure 6 .
Figure 6.Comparison of the microfacies identification results by SVM method and geologists in well S77-3-21.

Figure 6 .
Figure 6.Comparison of the microfacies identification results by SVM method and geologists in well S77-3-21.

Table 1 .
Braided river depositional microfacies types of the He-8 reservoir of the Su-77 block in the Sulige gas field.

Table 2 .
Characteristic parameters of log curves and corresponding depositional microfacies in the study area.

Table 2 .
Characteristic parameters of log curves and corresponding depositional microfacies in the study area.

Table 3 .
Comparison of the microfacies identification results by SVM method and geologists.