Study on an AHP-Entropy-ANFIS Model for the Prediction of the Unfrozen Water Content of Sodium-Bicarbonate-Type Salinization Frozen Soil

: The development of agriculture and ecology, and the construction of water conservancy facilities are seriously hindered by the salinization of seasonal frozen soil. Unfrozen water exists in the freezing and thawing of frozen soil. This unfrozen water is the core and foundation for studying the process of seasonal frozen soil salinization. However, it is di ﬃ cult to obtain the unfrozen water content (UW) in routine experiments, and it shows nonlinear characteristics under the action of the main factors contained: salt content, water content, and temperature. In this paper, a new model is proposed to predict the UW of saline soil based on the combined weighting method and the adaptive neuro-fuzzy inference system (ANFIS). Firstly, the distance function was used to combine the analytic hierarchy process (AHP) with the entropy weight method (the combined weighting method) to determine the importance of the inﬂuencing factors (temperature, initial water content, and salt content) on UW. On this basis, the AHP, entropy weight method, and adaptive neuro-fuzzy inference system (AHP-entropy-ANFIS) ensemble model was established. Secondly, the ﬁve-fold cross-validation method and statistical factors (coe ﬃ cient of determination, mean squared error, mean absolute percent error, and mean absolute error) were applied to evaluate and compare the AHP-entropy-ANFIS ensemble model, the ANFIS model, the support vector machine (SVM) model, and the AHP, entropy weight method, and support vector machine (AHP-entropy-SVM) ensemble model. In addition, the prediction values of the four models and the experimental values were also compared. The results show that the AHP-entropy-ANFIS model had the strongest prediction capability and the best stability, and so is more suitable for predicting the UW of saline soil. This study provides useful guidance for preventing and mitigating salinization hazards in seasonally frozen areas.


Introduction
Jilin province is one of the major agricultural provinces in China. It has abundant arable land and pasture resources. However, in the west of Jilin province, there is a large area of carbonate saline soil [1,2]. As a result of the influence of factors such as the climatic environment and soil material composition in western Jilin, the saline soil in the area not only has salinization characteristics, but also shows the typical characteristics of seasonal frozen soil [3,4]; therefore, the saline soil in the western area of Jilin province has complex engineering geological properties, which lead to the expansion of area was selected and applied to the prediction of UW. In addition, for the process of model training and testing, a five-fold cross-validation method was adopted.

Test Soil Materials
According to an investigation and research into the saline soil in the west of Jilin province, compared with other depth areas, salt is more active at a depth of 40 cm below the ground surface, and the depth of 40 cm is the salt concentration area. In order to study the UW of saline soil, it was planned to collect soil samples from Zhenlai county. These soil samples were collected from below the surface at a depth of 0.4 m. The collected soil samples were remolded to the samples needed for the study. Before remolding the soil samples, the particle size composition ( Figure 1) and the clay mineral composition (Figure 2) of the soil were analyzed. As shown in Figure 1, the proportion of clay and silt was the highest. The mass percentages of the clay and silt were 42.5% and 49.5%, respectively. As shown in Figure 2, the clay minerals in the saline soil mainly contained illite and illite/smectite mixed layer. The relative contents of the illite and illite/smectite mixed layer were 23% and 59%, respectively.
Ma thema tics 2020, 8, x FOR PEER REVIEW  3 of 20 suitable for the Zhenlai area was selected and applied to the prediction of UW. In addition, for the process of model training and testing, a five-fold cross-validation method was adopted.

Test Soil Materials
According to an investigation and research into the saline soil in the west of Jilin province, compared with other depth areas, salt is more active at a depth of 40 cm below the ground surface, and the depth of 40 cm is the salt concentration area . In order to study the UW of saline soil, it was planned to collect soil samples from Zhenlai county. These soil samples were collected from below the surface at a depth of 0.4 m. The collected soil samples were remolded to the samples needed for the study. Before remolding the soil samples, the particle size composition ( Figure 1) and the clay mineral composition (Figure 2) of the soil were analyzed. As shown in Figure 1, the proportion of clay and silt was the highest. The mass percentages of the clay and silt were 42.5% and 49.5%, respectively. As shown in Figure 2, the clay minerals in the saline soil mainly contained illite and illite/smectite mixed layer. The relative contents of the illite and illite/smectite mixed layer were 23% and 59%, respectively.   suitable for the Zhenlai area was selected and applied to the prediction of UW. In addition, for the process of model training and testing, a five-fold cross-validation method was adopted.

Test Soil Materials
According to an investigation and research into the saline soil in the west of Jilin province, compared with other depth areas, salt is more active at a depth of 40 cm below the ground surface, and the depth of 40 cm is the salt concentration area . In order to study the UW of saline soil, it was planned to collect soil samples from Zhenlai county. These soil samples were collected from below the surface at a depth of 0.4 m. The collected soil samples were remolded to the samples needed for the study. Before remolding the soil samples, the particle size composition ( Figure 1) and the clay mineral composition (Figure 2) of the soil were analyzed. As shown in Figure 1, the proportion of clay and silt was the highest. The mass percentages of the clay and silt were 42.5% and 49.5%, respectively. As shown in Figure 2, the clay minerals in the saline soil mainly contained illite and illite/smectite mixed layer. The relative contents of the illite and illite/smectite mixed layer were 23% and 59%, respectively.

Design for the Remodeling Samples
The UW of the saline soil is influenced by multiple factors: salt content, initial water content, temperature, and so on. Therefore, it was necessary to decide upon the salt content, moisture content, and temperature before preparing the remolded soil samples of salinized soil. According to the definition of saline soil in the specification of the saline soil to be considered, the soluble salt content cannot be less than 0.3%. In addition, the maximum value of soluble salt content in the soil of western Jilin was once higher than 1.5%. Thus, the salt content of the remodeled samples was artificially designed to be 0%, 0.3% and 1.5%. The optimum water content is the corresponding water content when the soil samples reach the maximum dry density. When the collected soil samples were tested, it was found that the optimum moisture content was about 20%. Therefore, 19%, 21%, and 24% were artificially chosen as the initial water content of the reshaped soil samples. Combined with the actual temperature during winter in western Jilin (the lowest value of monthly average temperature was lower than −20 • C), the test temperatures of the samples were artificially designed to have seven temperature points: −1, −3, −5, −7, −10, −15 and −20 • C. During the test, the UW corresponding to each temperature point was tested. Moreover, western Jilin province is a concentrated area of sodium bicarbonate type saline soil. This soil has a high content of sodium ions, so when remodeling the soil sample, a certain amount of sodium bicarbonate was added. Table 1 shows the design parameters of the samples. The salt content, water content, and test temperatures of the remolded soil samples were used as the input variables of the models.

Test Remodeling Samples
After the design of the remolded soil samples was completed, remolded soil samples with a diameter of 2.5 cm and a height of 5 cm were obtained. In this paper, the nuclear magnetic resonance (NMR) method was selected to study the remolded soil samples, with the expectation of obtaining the UW of the soil samples in the process of freezing. During the test, the soil samples were placed in the NMR experimental equipment. The temperature was controlled and gradually reduced from −1 to −20 • C. At each temperature point, the UW of the soil sample was obtained. A total of 35 groups of UW data were obtained from the NMR experiment, as shown in Table 2. The UW data were then used as the output variables of the models.

Predictive Model Development
This paper intends to establish a hybrid model based on the combination weighting method and machine learning algorithm. In the model, temperature, salt content and initial water content were selected as the input factors, and the output was the UW. The previous showed that the UW can be obtained under the control of three factors: temperature, salt content, and initial water content. However, the degree of influence of the three influencing factors on the UW is not clear.
Therefore, in order to establish a combined model, it was necessary to clarify the degree of influence of the three influencing factors on the UW-that is, to determine reasonable weights for the three influencing factors. To prevent the results from being too subjective or objective, and analytical method combining both subjective and objective methods, was applied to provide more reasonable weights for the influencing factors. This helped to improve the prediction accuracy of the model.

• Subjective Evaluation Method
In the mid-1970s, the American scholar Saaty [23] proposed a method for subjective decision-making: the analytic hierarchy process (AHP). This method can quantitatively analyze and solve problems. The analytic hierarchy process is good at dealing with complex problems featuring multiple levels and factors, and offers good applicability. For solving practical problems, the AHP method can be based on the following process [24,25]: 1. According to the needs of the practical problem, construct a hierarchical structure model to achieve the purpose of decomposing the target layer. The impact factors are then compared and evaluated for selecting suitable candidate objects; 2. According to the method described by Saaty, the values from 1 to 9 and their reciprocals are used as scoring criteria to assess the relative importance of the influencing factors [23][24][25]. The importance of the influencing factors is evaluated for the target factors. The degree of importance of any two influencing factors on the target factors is determined by comparing them. The degree of importance is scored by experts, and a judgment matrix is built using the obtained scores. The judgment matrix is shown in Equation (1): where m is the number of influencing factors. The judgment matrix and its maximum eigenvalue, as well as the eigenvector corresponding to the maximum eigenvalue need to satisfy the following relationship: where λmax is the maximum eigenvalue and w is the eigenvector. 3. When the AHP is used to deal with practical problems, it is necessary to determine whether the judgment matrix meets the consistency principle. The consistency of the judgment matrix can be measured using the consistency ratio. In general, when the consistency ratio is less than 0.1, the matrix passes the consistency test. This proves that the results obtained from the matrix are reliable. The consistency ratio can be obtained by Equation (3): where CR is the consistency ratio, RI is the average random consistency index, and CI is the consistency index. The average random consistency index can be obtained as shown in Table 3 [25]. The consistency index can be determined by the maximum characteristic root corresponding to the discrimination matrix, as shown in Equation (4): • Objective Evaluation Methods The entropy weight method uses the entropy values of the influencing factors to determine the weights of the influencing factors [26,27]. It is an objective evaluation method. This method mainly determines the importance (weight) of each factor by evaluating the discrete degree of each factor in the system. The higher the degree of dispersion, the smaller the entropy and the greater the importance of the factor. The process that the entropy weight method uses to deal with the problem is as follows [26,27]: 1. Assuming that the system contains n samples and m influence factors, and construct them to matrix B, as shown in Equation (5). In addition, the matrix B is normalized according to Equation (6), thus obtaining C ij .
where b ij is the jth influencing factor of the ith sample; min b 1 j , · · · , b nj is the minimum value of the jth influencing factor; and max b 1 j , · · · , b nj is the maximum value of the jth influencing factor. After the data are normalized, the entropy value can be determined using Equation (7): where k is related to the sample size n, as shown in Equation (8); and u ij is the frequency of the jth factor, which can be calculated by Equation (9): where C ij is the value obtained after normalizing matrix B. 2. The difference coefficient of the entropy can be determined by the entropy value, as shown in Equation (10): where R j is the difference coefficient of the entropy.
3. According to the difference coefficient, the weight of the factor can be calculated, as shown in Equation (11): where w j is the weight of the factor.

• Comprehensive Evaluation Methods
When the subjective method (AHP) or the objective method (entropy weight method) are used to analyze the degree of influence of the influencing factors on UW, the weights obtained are too subjective or too objective, respectively. Reasonable weights will help to build a model with better performance. Therefore, in order to get reasonable weights, the subjective method (AHP) and the objective method (entropy weight method) should be used in combination when weighting the influence factors. In this paper, the concept of the distance function is introduced, and the distance function is used to combine the subjective and objective methods, and to construct a combined weighting method. Using the combination weighting method to evaluate the importance of influencing factors and obtain reasonable weights of influencing factors [28], the subjective weight coefficient and the objective weight coefficient can be determined by the Equation (12) related to the distance function, and the combination weight can be determined by Equation (13).
where w 1 j and w 2 j are the weights, they should be calculated by the subjective method (AHP) and the objective method (the entropy weight method), respectively; D w 1 j , w 2 j is the distance function of the subjective weight (w 1 j ) and objective weight (w 2 j ); and a and b are the weight coefficients of the subjective weight and objective weight coefficients, respectively, a and b are real numbers.
By substituting the weighting coefficients (a and b) into Equation (13), the combination weight of the subjective and objective weights can be obtained: where w cj is the combination weight.

The Adaptive Network-Based Fuzzy Inference System
In the early 1990s, a new algorithm, the adaptive neuro-fuzzy inference system (ANFIS), was proposed [29]. It applies the back-gradient method and the least squares algorithm, and uses the "If-Then" rule for management and constraints. In addition, with the help of the membership function, it completes nonlinear mapping of the data from the input to the output [30,31]. The structure of the ANFIS model is shown in Figure 3. ANFIS inherited the advantages of neural networks and fuzzy reasoning theory, and is good at solving nonlinear and other related complex problems. Therefore, ANFIS has been applied to practical engineering, for example, to evaluate concrete performance [32], and to study soil chemical properties [33]. In this paper, the process of establishing ANFIS for the UW prediction model is shown in Figure 4. Figure 3. The structure of the adaptive ne uro-fuzzy infe rence system (ANFIS) mode l: (1) p and q re pre sent input variable s; (2) node s containing fuzzy se ts Xi (i = 1,2)and Yj (j = 1,2) ne e d to de te rmine the ir me mbe rship functions and pe rform fuzzy ope rations on input variable s; (3) the function of the node with the Gre e k le tter ∏ is to multiply the output data of the pre vious layer, and the output data of the node re presents the credibility of the fuzzy rule s ; (4) the ne uron labe led N indicate s that the output data of the pre vious layer is normalize d and the n o utput from this node ; (5) the function of the node with ui (i = 1,2,...,6) is to multiply the output data of the pre vious laye r with the re lated function to calculate the output of fuzzy rule s ; (6) the function of the node with symbol ∑ is to sum all the input data of this node , and ge t the output re sult u.  (2) nodes containing fuzzy sets X i (i = 1,2)and Y j (j = 1,2) need to determine their membership functions and perform fuzzy operations on input variables; (3) the function of the node with the Greek letter is to multiply the output data of the previous layer, and the output data of the node represents the credibility of the fuzzy rules; (4) the neuron labeled N indicates that the output data of the previous layer is normalized and then output from this node; (5) the function of the node with u i (i = 1,2,...,6) is to multiply the output data of the previous layer with the related function to calculate the output of fuzzy rules; (6) the function of the node with symbol is to sum all the input data of this node, and get the output result u. re pre sent input variable s; (2) node s containing fuzzy se ts Xi (i = 1,2)and Yj (j = 1,2) ne e d to de te rmine the ir me mbe rship functions and pe rform fuzzy ope rations on input variable s; (3) the function of the node with the Gre e k le tter ∏ is to multiply the output data of the pre vious layer, and the output data of the node re presents the credibility of the fuzzy rule s ; (4) the ne uron labe led N indicate s that the output data of the pre vious layer is normalize d and the n o utput from this node ; (5) the function of the node with ui (i = 1,2,...,6) is to multiply the output data of the pre vious laye r with the re lated function to calculate the output of fuzzy rule s ; (6) the function of the node with symbol ∑ is to sum all the input data of this node , and ge t the output re sult u.

The Support Vector Machine
The Support Vector Machine (SVM) algorithm was created in the mid-1990s [34]. The most prominent feature of the support vector machine is that it has a strong generalization adaptability [35][36][37], and it can deal with complex problems caused by nonlinearities. When encountering a nonlinear problem, it first processes the complex nonlinear data. After these data are transformed, they are projected into a high-dimensional space, and then a suitable linear function is found to solve the complex nonlinear problem [35,36]. As a result of the excellent performance of the SVM, it has been applied to deal with matters of landslide sensitivity analysis [37], slope stability research [38], and surrounding rock deformations estimation [39]. The principle of the SVM to deal with problems is as follows [35,40,41]: Assuming that the input data is x i (i = 1, 2, ..., k) and the output data is y i (i = 1, 2, ..., k), then the data set consisting of input data and output data is Assuming µ(x) is a nonlinear mapping, it can map the input data from low-dimensional space to high-dimensional space. Turn complex nonlinear problems into simple linear problems of high-dimensional space. It can be expressed by Equation (14): where w is weight vector, as shown in Equation (15); and b is bias value.
where α i and α * i are Lagrange multipliers, which can be calculated by Equation (16): According to Equations (14)- (16), the regression function can be determined as shown in Equation (17): where K(x i , x) represents kernel function that meets Mercer's conditions.

Data Collection and Collation
The UW prediction model not only needs to determine the intelligent algorithm for establishing the model, but also needs to obtain a certain amount of reasonable data as support. Therefore, before the prediction model was established, it was necessary to achieve the collection and arrangement of data. In this paper, the inputs of the models are temperature, salinity and water content, and the output of the models are the unfrozen water content. After the UW test experiment was completed, a total of 35 sets of experimental data were obtained, as shown in Table 2. The data were divided as follows: about 80% of the experimental data were taken out and applied as training data; about 20% of the remaining experimental data were allocated as testing prediction model data [42]. In other words, out of the 35 sets of experimental data, 28 sets of experimental data were randomly taken for training, and the remaining seven sets of experimental data were used to test the prediction model. The models were trained and tested with the same data. Furthermore, the model was trained and tested using the five-fold cross-validation method.

Models Comparison Method
In order to accurately predict the UW of the soil samples, it was necessary to compare the predictive ability of the models. In this paper, four statistical factors were introduced to assess the predictive power of the two models. The four evaluation factors are the coefficient of determination (R 2 ), the mean squared error (MSE), the mean absolute percent error (MAPE) and the mean absolute error (MAE) [31,43]. These factors can be calculated using Equations (18)- (21).
The coefficient of determination is: The mean squared error is: The mean absolute percent error is: The mean absolute error is: where UW predict is the prediction data of the model, UW experiment is the result of the NMR experiment, and UW experiment-a is the average of the experimental results. The MAPE, MSE and MAE reflect the predictive capabilities of the model for the expected values, where the value range is from 0 to positive infinity. The smaller the values are, the closer the prediction result of the model is to the experimental values, which indicates that the predictive ability of the model is better. The value range of R 2 is between 0 and 1. This reflects the degree of closeness between the predicted result of the model and the test value. The larger the value of R 2 is, the better the data fit. At the same time, this also shows that the model has a better predictive ability.

Mechanisms Affecting Unfrozen Water Content
The relatively high clay content in the soil, close to 50% (Figure 1), showed that the specific surface area of the soil was relatively large, and the soil particles had a strong adsorption capacity for the water molecules. The water molecules were gathered around the soil particles through the adsorption capacity of the soil particles, thus forming a water film. The higher the initial water content was, the thicker the water film. At the same temperature, this phenomenon led to higher initial water content, a higher UW, and the same change trend for UW and water content, as shown in Figure 5a. In Figure 5a, the green bar, blue bar, and red bar represent the UW of the soil samples with water content of 19%, 21%, and 24%.  When the outside temperature is less than 0 °C and continues to decrease, unfrozen water appears in the soil, and as a whole, the UW decreases as the temperature decreases [44]. Therefore, it is self-evident that temperature is the main factor affecting changes in UW.

Analysis of the Degree of Importance of the Factors Affecting the Unfrozen Water Content
In this paper, the AHP and entropy weight methods were used to study the influencing factors of UW, and the subjective and objective weights of the influencing factors were thus determined. The distance function was introduced to combine the subjective weights with the objective w eights, and reasonable weights were thereby redetermined. The soil mainly contained sodium bicarbonate, so the content of sodium ions was relatively high. The clay minerals in the soil contained some highly hydrophilic minerals, such as illite and illite/smectite mixed layer minerals (Figure 2). When the highly hydrophilic minerals interacted with the sodium ions in the sodium salt, a large amount of thin film water (weakly bound water) gathered around the soil particles. With an increase in sodium ions, the water film gradually thickened (Figure 6), showing the UW to increase as with the water film thickened. Therefore, at the same temperature, the salt content increased, which indirectly inhibited the freezing of the soil, resulting in an increased UW. The UW and the salt content also maintained the same change trend, as shown in Figure 5b. In the Figure 5b, the green bar and red bar represent the UW of the soil sample with salt content of 0.3% and 1.5%.  When the outside temperature is less than 0 °C and continues to decrease, unfrozen water appears in the soil, and as a whole, the UW decreases as the temperature decreases [44]. Therefore, it is self-evident that temperature is the main factor affecting changes in UW.

Analysis of the Degree of Importance of the Factors Affecting the Unfrozen Water Content
In this paper, the AHP and entropy weight methods were used to study the influencing factors When the outside temperature is less than 0 • C and continues to decrease, unfrozen water appears in the soil, and as a whole, the UW decreases as the temperature decreases [44]. Therefore, it is self-evident that temperature is the main factor affecting changes in UW.

Analysis of the Degree of Importance of the Factors Affecting the Unfrozen Water Content
In this paper, the AHP and entropy weight methods were used to study the influencing factors of UW, and the subjective and objective weights of the influencing factors were thus determined. The distance function was introduced to combine the subjective weights with the objective weights, and reasonable weights were thereby redetermined.

The Analytic Hierarchy Process
The UW is mainly controlled by three influencing factors: temperature, salt content and initial water content. Before applying the AHP to determine the weights of the three influencing factors, the judgment matrix should be constructed first. By comparing the three factors, the importance of each influencing factor relative to the other influencing factors was obtained. In Table 4, the first three columns of data present the results of the comparison. According to Formula (2), the weights of the three influencing factors can thus be calculated. The fourth column in Table 4 shows the weights of the three influencing factors. In Table 4, the weight values of temperature, water content and salt content are 0.6483, 0.2297 and 0.1220, respectively. This shows that the temperature weight was the highest, while the salt content weight was the lowest. The subjective analysis method tends to consider temperature as the main factor affecting the UW. At the same time, the average consistency index (CR) and consistency index (CI) were determined according to Equations (3) and (4), respectively. The CI value was 0.002, the CR value was 0.0036, and the CR value was less than 0.1. These comparison results were consistent with the rule of consistency, and the weights of the influencing factors determined by the AHP method were reasonable.

The Entropy Weight Method
The entropy weight method (the objective method) was used to evaluate the degree of importance of the three influencing factors of temperature, salt content, and initial water content. According to Formula (6), the matrix of the three factors was normalized. The entropy value, difference coefficient, and weight of influencing factors were determined by Equations (7)-(11), as shown in Table 5. In Table 5, the weights of the temperature, salt content and initial water content are 0.1416, 0.2187 and 0.6397 respectively. It can be seen that the weight of the initial water content is the largest, while the weight of the temperature is the smallest. The objective analysis method tends to consider the initial water content as the main factor affecting the UW.

The Combined Weighting Method
In order to obtain a reasonable weight, and prevent the weights determined by the AHP and entropy weight method from being too subjective or objective, thereby ensuring that the obtained prediction model offers better performance, it is necessary to combine the AHP (subjective) with the entropy weight method (objective) evaluation methods. Using Equations (12) and (13), the weight coefficient and combination weight of the combination weighting method can be determined, as shown in Table 6. In Table 6, the weight coefficient of AHP is 0.7330, and the weight coefficient of entropy is 0.2670. The combined weights of temperature, salt content and initial water content are 0.5130, 0.1478 and 0.3392, respectively. Among them, the weight of temperature is the largest, and the weight of salt content is the smallest. This shows that temperature had the strongest influence on UW, salt content had the weakest influence on UW, and the initial water content was between both.

Compare and Select the Model
After determining the weights of the influencing factors, the AHP-entropy-ANFIS model was established in combination with ANFIS. This model was compared with the ANFIS model, the SVM model and the AHP-entropy-SVM model, and the model with the strongest predictive ability was selected to predict the UW. In this paper, temperature, salt content, and initial water content were input into the models, and the UW of saline soil was the output factor of the models. The training and testing of the models were realized on the MATLAB software platform. In the ANFIS prediction model, according to actual needs, a total of 27 different "If-Then" rules were formulated, and the "trimf"-type function, with an improved training effect, was selected as the membership function [31,43]. In the SVM prediction model, the "radial basis" function is an important part of the model. It is the main method for the model to process data. The SVM model applies the grid search method, and combines the penalty parameter and radial basis kernel factor to train the model [36].
In addition, in the process of training and testing, a five-fold cross-validation method was used to avoid the problem of overfitting the model. Through this five-fold cross-validation, the coefficient of determination, the mean squared error, and the p-value of each model was obtained, as shown in Table 7.
The data in Table 7 were then analyzed. In the process of training, the mean R 2 values of the four models (AHP-entropy-ANFIS, ANFIS, SVM, and AHP-entropy-SVM models) were all greater than 0.8, and the mean R 2 of the AHP-entropy-ANFIS model was the highest, reaching 0.9935. The mean MSEs of the four models were 0.14, 0.19, 2.99, and 2.78, respectively. Clearly AHP-entropy-ANFIS (0.14) was the smallest. For the stability of the model, the order from high to low was as follows: the AHP-entropy-ANFIS model, the ANFIS model, the AHP-entropy-SVM model, and the SVM model.
In the process of testing, the four models were compared. The mean R 2 the of the AHP-entropy-ANFIS model was the largest, at 0.9692. The mean MSE of this model was 0.86, making it the smallest. Compared to the training process, the stability of the AHP-entropy-ANFIS model, the ANFIS model, the AHP-entropy-SVM model and the SVM model were slightly reduced, but the AHP-entropy-ANFIS model remained the most stable. The AHP-entropy-ANFIS model thus performed the best. In addition, in the process of training and testing, the p-values of the models were less than 0.05.  Figure 7 describes the mean absolute percent errors (MAPEs) of the different models. Among them, legend A is the AHP-entropy-ANFIS model, legend B is the ANFIS model, legend C is the SVM model, and legend D is the AHP-entropy-SVM model. It can be seen from Figure 7 that in both the training process and the testing process, the MAPE of the AHP-entropy-ANFIS model was less than that of the other three models, and the MAPE of the AHP-entropy-ANFIS model was less than 10. In the training process, the MAPE of the AHP-entropy-ANFIS model was about 1/1.5 that of the ANFIS model, and the MAPE of the AHP-entropy-ANFIS model was about 1/5 that of the AHP-entropy-SVM model. In the testing process, the MAPE of the AHP-entropy-ANFIS model was about 1/1.5 that of the SVM model, and about 1/2 that of the ANFIS model. The data in Table 7 were then analyzed. In the process of training, the mean R 2 values of the four models (AHP-entropy-ANFIS, ANFIS, SVM, and AHP-entropy-SVM models) were all greater than 0.8, and the mean R 2 of the AHP-entropy-ANFIS model was the highest, reaching 0.9935. The mean MSEs of the four models were 0.14, 0.19, 2.99, and 2.78, respectively. Clearly AHP-entropy-ANFIS (0.14) was the smallest. For the stability of the model, the order from high to low was as follows: the AHP-entropy-ANFIS model, the ANFIS model, the AHP-entropy-SVM model, and the SVM model.
In the process of testing, the four models were compared. The mean R 2 the of the AHP-entropy-ANFIS model was the largest, at 0.9692. The mean MSE of this model was 0.86, making it the smallest. Compared to the training process, the stability of the AHP-entropy-ANFIS model, the ANFIS model, the AHP-entropy-SVM model and the SVM model were slightly reduced, but the AHP-entropy-ANFIS model remained the most stable. The AHP-entropy-ANFIS model thus performed the best. In addition, in the process of training and testing, the p-values of the models were less than 0.05. Figure 7 describes the mean absolute percent errors (MAPEs) of the different models. Among them, legend A is the AHP-entropy-ANFIS model, legend B is the ANFIS model, legend C is the SVM model, and legend D is the AHP-entropy-SVM model. It can be seen from Figure 7 that in both the training process and the testing process, the MAPE of the AHP-entropy-ANFIS model was less than that of the other three models, and the MAPE of the AHP-entropy-ANFIS model was less than 10. In the training process, the MAPE of the AHP-entropy-ANFIS model was about 1/1.5 that of the ANFIS model, and the MAPE of the AHP-entropy-ANFIS model was about 1/5 that of the AHP-entropy-SVM model. In the testing process, the MAPE of the AHP-entropy-ANFIS model was about 1/1.5 that of the SVM model, and about 1/2 that of the ANFIS model.     In addition, the predicted values and test values of the different models were compared, and the models' ability to predict the UW of saline soil was judged by the conclusions obtained. Figure 9 describes the relationship between the data output by the different models during the training process and the real (experimental) data. As shown in Figure 9, compared with the ANFIS model, the prediction result of the AHP-entropy-ANFIS model was closest to the real experimental value. The changes in the blue curve and red curve are very similar; the difference between the two curves is very small, and the two curves basically coincide. The performance of t he SVM and AHPentropy-SVM models was slightly worse, as the blue and red curves are not completely coincident. Although the changes in the two curves are similar, they present some differences. This result reveals that, compared with the other three models, the AHP-entropy-ANFIS model provided a stronger fitting capability and better performance during the training process. In addition, the predicted values and test values of the different models were compared, and the models' ability to predict the UW of saline soil was judged by the conclusions obtained. Figure 9 describes the relationship between the data output by the different models during the training process and the real (experimental) data. As shown in Figure 9, compared with the ANFIS model, the prediction result of the AHP-entropy-ANFIS model was closest to the real experimental value. The changes in the blue curve and red curve are very similar; the difference between the two curves is very small, and the two curves basically coincide. The performance of the SVM and AHP-entropy-SVM models was slightly worse, as the blue and red curves are not completely coincident. Although the changes in the two curves are similar, they present some differences. This result reveals that, compared with the other three models, the AHP-entropy-ANFIS model provided a stronger fitting capability and better performance during the training process. In addition, the predicted values and test values of the different models were compared, and the models' ability to predict the UW of saline soil was judged by the conclusions obtained. Figure 9 describes the relationship between the data output by the different models during the training process and the real (experimental) data. As shown in Figure 9, compared with the ANFIS model, the prediction result of the AHP-entropy-ANFIS model was closest to the real experimental value. The changes in the blue curve and red curve are very similar; the difference between the two curves is very small, and the two curves basically coincide. The performance of t he SVM and AHPentropy-SVM models was slightly worse, as the blue and red curves are not completely coincident. Although the changes in the two curves are similar, they present some differences. This result reveals that, compared with the other three models, the AHP-entropy-ANFIS model provided a stronger fitting capability and better performance during the training process.  . The pre dicte d data of the diffe re nt mode ls during the training proce ss and the real (e xpe rimental) data. Figure 10 describes the relationship between the data output by the different models during the testing process and the real data. As described in Figure 10a, the variations in the blue and red curves are similar, and there is only a small difference. In the two curves, the curve corresponding to data points 2 to 6 basically coincides. Thus, the error of the AHP-entropy-ANFIS model is smaller. In Figure 10b-d, the data point corresponding to the red curve becomes larger or smaller, resulting in a large difference between the red and blue curves. This shows that during the test process, compared with the AHP-entropy-ANFIS model, the other three models provided a weaker fitting ability and poorer performance.
(a) (b) Figure 9. The predicted data of the different models during the training process and the real (experimental) data. Figure 10 describes the relationship between the data output by the different models during the testing process and the real data. As described in Figure 10a, the variations in the blue and red curves are similar, and there is only a small difference. In the two curves, the curve corresponding to data points 2 to 6 basically coincides. Thus, the error of the AHP-entropy-ANFIS model is smaller. In Figure 10b-d, the data point corresponding to the red curve becomes larger or smaller, resulting in a large difference between the red and blue curves. This shows that during the test process, compared with the AHP-entropy-ANFIS model, the other three models provided a weaker fitting ability and poorer performance. Figure 9. The pre dicte d data of the diffe re nt mode ls during the training proce ss and the real (e xpe rimental) data. Figure 10 describes the relationship between the data output by the different models during the testing process and the real data. As described in Figure 10a, the variations in the blue and red curves are similar, and there is only a small difference. In the two curves, the curve corresponding to data points 2 to 6 basically coincides. Thus, the error of the AHP-entropy-ANFIS model is smaller. In Figure 10b-d, the data point corresponding to the red curve becomes larger or smaller, resulting in a large difference between the red and blue curves. This shows that during the test process, compared with the AHP-entropy-ANFIS model, the other three models provided a weaker fitting ability and poorer performance. Xu [45], Anderson and Tice [46] studied the unfrozen water content. They built empirical models on the basis of experiments to predict the unfrozen water content. The unfrozen water content is obtained through empirical formulas, but only the unfrozen water content corresponding to the freezing temperature can be obtained, and the complete unfrozen water content change characteristic curve cannot be obtained. So, the models established in this paper can better predict the unfrozen water content.
In short, through the comparative analysis of the statistical factors, the predicted values, and the experimental values of the different models, it was found that, compared with the ANFIS, SVM and AHP-entropy-SVM models, the AHP-entropy-ANFIS model can produce smaller prediction errors and have a stronger fitting ability. Therefore, the AHP-entropy-ANFIS model offers better prediction capability. Consequently, the AHP-entropy-ANFIS model is the best choice for predicting the UW of saline soil in Zhenlai.

Conclusions
This article mainly focused on the UW of sodium-bicarbonate-type salinization frozen soil in Zhenlai county, western Jilin province. The UW of the remodeled samples was obtained using a nuclear magnetic resonance experiment. The combined weighting method was used to study the influence of temperature, salinity and initial water content on the UW. On the basis of these results, combined with the fuzzy neural inference system, the UW prediction model was established. The five-fold cross-validation method was used to train and test the m odel. The following conclusions can be drawn: 1. The soil contains a great deal of clay and highly hydrophilic clay minerals, so a thick water film forms around the soil particles. In addition, as the water content or salt content increase, the water film around the soil particles becomes thicker. Therefore, under the same temperature, the higher the water content or salt content is, the higher the UW; 2. The AHP (subjective) and entropy weight method (objective) were used to evaluate the degree of influence of temperature, salt content, and water content on UW, and thus determine the subjective and objective weights of temperature, salt content, and water content. The distance function was introduced and applied to combine the subjective weight and objectiv e weight, and the combined weight of the three influencing factors was obtained.
3. On the basis of the combined weighting method and the ANFIS method, the AHP-entropy-ANFIS ensemble model was proposed. In addition, this model was compared with the ANFIS model, the SVM model and the AHP-entropy-SVM model. It was found that the p-values of all models were Figure 10. The relationship between the predicted data of the different models during the test process and the real data: (a) the predicted data of the AHP-entropy-ANFIS model during the test process and the real data; (b) the predicted data of the ANFIS model during the test process and the real data; (c) the predicted data of the SVM model during the test process and the real data; (d) the predicted data of the AHP-entropy-SVM model during the test process and the real data.
Xu [45], Anderson and Tice [46] studied the unfrozen water content. They built empirical models on the basis of experiments to predict the unfrozen water content. The unfrozen water content is obtained through empirical formulas, but only the unfrozen water content corresponding to the freezing temperature can be obtained, and the complete unfrozen water content change characteristic curve cannot be obtained. So, the models established in this paper can better predict the unfrozen water content.
In short, through the comparative analysis of the statistical factors, the predicted values, and the experimental values of the different models, it was found that, compared with the ANFIS, SVM and AHP-entropy-SVM models, the AHP-entropy-ANFIS model can produce smaller prediction errors and have a stronger fitting ability. Therefore, the AHP-entropy-ANFIS model offers better prediction capability. Consequently, the AHP-entropy-ANFIS model is the best choice for predicting the UW of saline soil in Zhenlai.

Conclusions
This article mainly focused on the UW of sodium-bicarbonate-type salinization frozen soil in Zhenlai county, western Jilin province. The UW of the remodeled samples was obtained using a nuclear magnetic resonance experiment. The combined weighting method was used to study the influence of temperature, salinity and initial water content on the UW. On the basis of these results, combined with the fuzzy neural inference system, the UW prediction model was established. The five-fold cross-validation method was used to train and test the model. The following conclusions can be drawn: 1. The soil contains a great deal of clay and highly hydrophilic clay minerals, so a thick water film forms around the soil particles. In addition, as the water content or salt content increase, the water film around the soil particles becomes thicker. Therefore, under the same temperature, the higher the water content or salt content is, the higher the UW; 2. The AHP (subjective) and entropy weight method (objective) were used to evaluate the degree of influence of temperature, salt content, and water content on UW, and thus determine the subjective and objective weights of temperature, salt content, and water content. The distance function was introduced and applied to combine the subjective weight and objective weight, and the combined weight of the three influencing factors was obtained.
3. On the basis of the combined weighting method and the ANFIS method, the AHPentropy-ANFIS ensemble model was proposed. In addition, this model was compared with the ANFIS model, the SVM model and the AHP-entropy-SVM model. It was found that the p-values of all models were less than 0.05. Compared with the other three models, the R 2 of the AHP-entropy-ANFIS model was the largest, its MSE, MAPE, and MAE were the smallest; and its stability was the best. Moreover, the difference between the predicted value and the experimental value of the AHP-entropy-ANFIS model was relatively small. Therefore, the AHP-entropy-ANFIS model had the strongest prediction capability, and thus is more suitable for predicting the UW of saline soil in the Zhenlai area than other methods.