A GA-BP Neural Network Regression Model for Predicting Soil Moisture in Slope Ecological Protection

: In this study, based on a highway project in Zhejiang, China, the meteorological factors and soil moisture of high side slopes were monitored in real time by a meteorological data monitoring system, and the correlation between soil moisture and meteorological factors was investigated using the obtained data of soil moisture and total solar radiation, atmospheric temperature, soil temperature, relative humidity, and wind speed. Based on the correlation and the inﬂuence of meteorological factors on soil moisture lag, a back propagation (BP) neural network regression model optimized with genetic algorithm (GA) was proposed for the ﬁrst time and applied to soil moisture prediction of high side slopes. The results showed that the BP neural network regression model and the GA-BP neural network regression model were used for soil moisture prediction in two cases without and with lags, respectively, and both prediction methods showed a more signiﬁcant improvement in prediction accuracy considering their lags compared with those without lags; the GA-BP neural network regression model outperformed the BP neural network regression model in terms of accuracy. V-fold cross-validation eliminated the effect of random errors, indicating that the model can be applied to soil moisture prediction for ecological conservation. Using the soil moisture prediction results as the basis for screening ecological slope protection vegetation is of great signiﬁcance to the safety and reliability of road construction.


Introduction
Soil moisture controls material and energy exchange at the ground-space interface, is the material bearer of heat transfer and energy exchange between the soil and atmospheric environment in the hydrological cycle [1], and is an important factor in maintaining the growth state of plants [2,3]. Soil moisture promotes plant growth and development by improving nutrient uptake by plants, and proper soil moisture is essential for plant growth and development [4,5]. Too little water can affect the growth of plant cells, while too much water can interfere with certain stages of plant life [6,7]. Soil moisture changes are influenced by two main factors: climate change and vegetation growth status, where climate change plays a decisive role in soil moisture trends and vegetation growth status plays a regulatory role under climate change conditions [8]. Therefore, accurate prediction of soil moisture can provide effective information for drought and flood control [9] and precise irrigation decisions [10,11] in plant cultivation, as well as improve the predictability of plant flowering and abiotic stress risk assessment [12]. In this paper, we use machine learning methods to obtain meteorological data and predict soil moisture by meteorological factors, which are important for ecological conservation of slopes.
With the continuous development of technology and research, the prediction methods of soil moisture are becoming more and more diverse. At present, the mainstream soil moisture prediction methods mainly include empirical equations, linear regression, and neural networks [13]. With the deepening of soil moisture research and the diversification PCA-MLMVN has good performance in the prediction of soil moisture Yang et al. [24] VV-polarized Sentinel-1 SAR and Landsat optical data As can be seen from Table 1, many artificial intelligence methods and techniques exist for soil moisture prediction. The material sources are mainly satellite remote sensing data, satellite images, and microwave radar data, and there are fewer studies on meteorological factors and soil moisture. In this paper, we study the corresponding relationship between meteorological factors and soil moisture so as to propose a method for soil moisture with easily measured and obtained meteorological data. However, soil moisture prediction involves complex climate change and ecosystem structure effects, and it is difficult to study the relationship between soil moisture and climatic factors. In complex practical applications, traditional soil moisture prediction methods often perform poorly, while neural network models perform better in soil moisture prediction [25,26]. Although artificial neural networks can solve complex nonlinear mapping problems, they also have some disadvantages, such as overfitting, ease of falling into local optimum, and slow convergence speed. Using genetic algorithms to optimize back propagation (BP) neural networks can effectively overcome the local minima of the original BP neural network and outperform the original BP neural network in terms of convergence speed and computational accuracy. The neural network based on genetic algorithm is optimized in terms of learning efficiency and globalization, which improves the original network structure and increases the computational power. Subsequently, the cross-validation method was chosen to evaluate the soil moisture prediction model, which was introduced by Allen et al. [27] as a method to evaluate the prediction model parameters with the aim of improving the prediction accuracy. In this paper, the V-fold cross-validation method proposed by Geisser [28] is used, which has the characteristics of high computational efficiency and simple operation. It is important to promote ecological conservation in road construction. The objective of this paper is to predict soil moisture on high side slopes, which is one of the key factors of the hydrological cycle in the ecosystem. The model can achieve high prediction accuracy in predicting soil moisture.

Materials and Methods
2.1. Genetic Algorithm and Neural Network 2.1.1. Back-Propagation Neural Network BP neural network can be analogized as multi-layer perceptron, in which the hidden layer function mostly adopts S-type transfer function, while the output layer neuron mostly adopts linear transfer function. Figure 1a shows the three-layer BP network topology of z × d × b × t, where z represents the number of input layers, d and b represent the number of two hidden layers, and t represents the number of output layers. The output calculation process of the BP neural network consists of three steps. First, the input layer is the data of eight meteorological factors obtained through climate monitoring instruments. Second, the obtained output data are compared with the actual data and the error is calculated using the loss function of the evaluation module. Third, the error is propagated backward to find the derivative of the error for each weight in the network and to optimize the model. In a BP artificial neural network, each layer will only transmit data to the next layer. We apply the model to new data to obtain more accurate predictions after repeating the above steps and learning the weights for several times. It can be assumed that the existing sample pairs (x,ŷ) are x = [x 1 , x 2 , . . . , x m ], y = [y 1 , y 2 , . . . , y t ], and the hidden layer neurons are o = [o 1 , o 2 , . . . , o m ]. The network weight matrix between the neurons in the input and hidden layers is w 1 , and the network weights between the neurons in the hidden and output layers are w 3 , respectively. (1) x i represents the ith input value in a neural network, y is the vector of predicted values, a l i represents the ith activation of neurons in the ith layer, and w i lm represents the connection between the mth unit on the ith layer and the first unit on the (i + 1)th layer.
The threshold values of the hidden layer neuron and the output layer neuron are θ 1 and θ 2 , respectively Then the output of hidden layer neurons is The output of neurons in the output layer is The error between the network output and the expected output is

BP Neural Network Optimized by GA
GA is an optimization method used to find exact or approximate solutions to optimization problems and search problems applied in many research fields [29]. The BP neural network optimized by GA can effectively avoid the local minimum defects of the original BP neural network, and has the advantages of fast convergence speed and high accuracy [30]. GA is an adaptive heuristic search algorithm that mimics the biological evolutionary mechanism of "survival of the fittest" in nature, and is widely used to solve optimization problems in combination with artificial neural networks [31]. GA is efficient, practical, and robust. GA encodes data into chromosomes, exchanges information in chromosomes through selection, crossover, and iterative variation, and finally finds chromosomes that meet the optimization goal. By selecting, crossing over, and changing populations consisting of a certain number of problem parameters, GA screens out individuals with higher fitness and excludes those with lower fitness [32], and evolves and iterates to find the most satisfactory individual. By coming to optimize the weights and thresholds of the neural network, GA can avoid falling into local optimum and improve the training speed effectively. The algorithm consists of the following eight steps (see Figure 1b).
Step 1: Determine the neural network topology; in this BP neural network model, the input layer is each set of data of eight weather and meteorological factors, and the output layer is the daily average soil moisture, constituting an input layer of eight nodes and an output layer of one node.
Step 2: The initial population is obtained; the population size is set by preprocessing the input data, where each individual contains a real string of ownership values and thresholds for the BP neural network.
Step 3: Setup of fitness function F; The formula is

BP Neural Network Optimized by GA
GA is an optimization method used to find exact or approximate solutions to optimization problems and search problems applied in many research fields [29]. The BP neural network optimized by GA can effectively avoid the local minimum defects of the original BP neural network, and has the advantages of fast convergence speed and high accuracy [30]. GA is an adaptive heuristic search algorithm that mimics the biological evolutionary mechanism of "survival of the fittest" in nature, and is widely used to solve optimization problems in combination with artificial neural networks [31]. GA is efficient, practical, and robust. GA encodes data into chromosomes, exchanges information in chromosomes through selection, crossover, and iterative variation, and finally finds chromosomes that meet the optimization goal. By selecting, crossing over, and changing populations consisting of a certain number of problem parameters, GA screens out individuals with higher fitness and excludes those with lower fitness [32], and evolves and iterates to find the most satisfactory individual. By coming to optimize the weights and thresholds of the neural network, GA can avoid falling into local optimum and improve the training speed effectively. The algorithm consists of the following eight steps (see Figure 1b).
Step 1: Determine the neural network topology; in this BP neural network model, the input layer is each set of data of eight weather and meteorological factors, and the output layer is the daily average soil moisture, constituting an input layer of eight nodes and an output layer of one node.
Step 2: The initial population is obtained; the population size is set by preprocessing the input data, where each individual contains a real string of ownership values and thresholds for the BP neural network.
Step 3: Setup of fitness function F; The formula is where f i is the expected output value and f(x i ) is the actual output value.
Step 4: Selection operations; selection operations are defined as the selection of superior individuals from the original population by a certain probability and the production of the next generation of individuals by reproduction. The probability of individual selection is related to fitness. The higher the fitness, the higher the probability of individual selection. In this paper, roulette is used. The probability of selection is where F i is the population fitness value of individual I, and N is the population number.
Step 5: Crossover operation; crossover operation is the random selection of two individuals from a population to transfer the good characteristics of their parents to their offspring through the exchange and combination of their chromosomes to produce new individuals with high adaptability. That is, there are two individuals a k and a l which operate at j crossover, as follows where b is a random number between [0, 1].
Step 6: Mutation operations; population diversity can be maintained through population variation. A randomly selected individual from the population and a part of that individual is selected for the mutation operation. in order to produce a better individual. Individual a i mutates at j, forming a new gene a ij as follows.
where a max , a min are the upper and lower bounds of a ij , r 2 is a random number, g is the number of current iterations, and G max is the maximum number of evolutions.
Step 7: Calculate the fitness of individuals in the newly generated population. If the fitness function meets the requirements or the evolution time reaches the maximum, the evolution is completed, otherwise return to step 4.
Step 8: Through GA optimization, the BP network is given initial weights and thresholds, and the prediction results are obtained after the termination conditions are satisfied.

Error Estimation
The evaluation module assesses the validity and accuracy of the GA-optimized neural network prediction model considering MSE (mean square error), RMSE (root mean square error), MAE (mean absolute error) and MAPE (mean absolute percentage error), respectively (see Figure 1c). where N is the total number of training sets or test sets;ŷ(t) and y(t) are the predicted and actual values, respectively.

Meteorological Data Monitoring Scheme
A monitoring device and prediction method for vegetation growth environment on slopes in typhoon areas were adopted to address the problems of complicated operation and high cost of monitoring instruments for long-term monitoring of slope environment. By setting up temperature and humidity sensors, soil temperature and humidity, salt integrated sensors, wind speed and direction sensors, rainfall sensing and ware solar radiation sensors, long-term remote monitoring of the slope growth environment is achieved by means of wireless network and cloud platform data transmission. Finally, based on the collected longterm data and the data provided by China Meteorological Network, the time series method and linear regression model are used to make short-term prediction of slope meteorological parameters, so as to obtain the temporal changes of slope meteorological parameters.
The ecological berm meteorological parameter monitoring system consists of temperature and humidity sensors, integrated soil temperature and humidity and salt sensors, wind speed and direction sensors, rainfall sensors, solar radiation sensors, data collectors, wireless communication modules, and a solar power supply system powered by the whole test system. Part of the structure is shown in Figure 2. where N is the total number of training sets or test sets; y (t) and y(t) are the predicted and actual values, respectively.

Meteorological Data Monitoring Scheme
A monitoring device and prediction method for vegetation growth environment on slopes in typhoon areas were adopted to address the problems of complicated operation and high cost of monitoring instruments for long-term monitoring of slope environment. By setting up temperature and humidity sensors, soil temperature and humidity, salt integrated sensors, wind speed and direction sensors, rainfall sensing and ware solar radiation sensors, long-term remote monitoring of the slope growth environment is achieved by means of wireless network and cloud platform data transmission. Finally, based on the collected long-term data and the data provided by China Meteorological Network, the time series method and linear regression model are used to make short-term prediction of slope meteorological parameters, so as to obtain the temporal changes of slope meteorological parameters.
The ecological berm meteorological parameter monitoring system consists of temperature and humidity sensors, integrated soil temperature and humidity and salt sensors, wind speed and direction sensors, rainfall sensors, solar radiation sensors, data collectors, wireless communication modules, and a solar power supply system powered by the whole test system. Part of the structure is shown in Figure 2. The solar radiation sensor, ambient temperature and humidity sensor, and wind speed and direction sensor in the installation were mounted on the top of the slope without vegetation shading, and erected with a triangular bracket at 2 m from the top surface of the slope. The triangular bracket has a horizontal vacuum indicator at the top to ensure the level of the triangular bracket, and then the three corners of the bracket are fixed with expansion screws. The rain sensor is mounted on the top of the slope, unobstructed, and fixed with expansion screws. The integrated soil temperature, moisture, and salinity sensor was installed at the top of the slope by placing the sensor into a hole with a diameter of 50 mm and a depth of 2 m at the top of the slope, bringing the sensor into close contact with the soil, ensuring that the soil fills the sensor, and filling the upper part with soil to prevent water from entering. The wireless communication module and data collector were placed uniformly in the collection box and fixed on a triangular support. The solar power supply system consists of solar photovoltaic panel, battery, and debugger. The solar photovoltaic panel is tilted to the south and placed on a reserved bracket; the battery The solar radiation sensor, ambient temperature and humidity sensor, and wind speed and direction sensor in the installation were mounted on the top of the slope without vegetation shading, and erected with a triangular bracket at 2 m from the top surface of the slope. The triangular bracket has a horizontal vacuum indicator at the top to ensure the level of the triangular bracket, and then the three corners of the bracket are fixed with expansion screws. The rain sensor is mounted on the top of the slope, unobstructed, and fixed with expansion screws. The integrated soil temperature, moisture, and salinity sensor was installed at the top of the slope by placing the sensor into a hole with a diameter of 50 mm and a depth of 2 m at the top of the slope, bringing the sensor into close contact with the soil, ensuring that the soil fills the sensor, and filling the upper part with soil to prevent water from entering. The wireless communication module and data collector were placed uniformly in the collection box and fixed on a triangular support. The solar power supply system consists of solar photovoltaic panel, battery, and debugger. The solar photovoltaic panel is tilted to the south and placed on a reserved bracket; the battery is placed in a stainless steel box and fixed to the ground with expansion screws. The ambient temperature and humidity sensor, air pressure sensor, solar radiation sensor, rainfall sensor and integrated soil temperature, humidity and salt sensor are connected to the collector through the data line, and the collected data enter the data collector through the data line. The data collector uploads the collected data to the cloud platform server in real time through the wireless communication module, and the client computer can view and download the test data by accessing the cloud, thus realizing the long-term monitoring of the vegetation growth environment on the slope.

Influencing Factors of Soil Moisture
Soil moisture of ecological berm is closely related to local meteorological factors, and soil moisture is influenced not only by its own hydrological factors, but also by surrounding meteorological factors. The movement of the atmospheric system is a highly unified process of determinism and stochasticity. From a macroscopic point of view, the uncertainty of soil moisture variation is mainly determined by the influence of preexisting meteorological conditions. Climate is the main factor that determines the distribution of soil moisture on earth, while the uncertainty of soil moisture mainly comes from the stochastic nature of atmospheric factors. Therefore, meteorological factors can be used to predict soil moisture. In this paper, nine meteorological factors are obtained, namely, daily maximum ambient temperature (X 1 ), daily minimum ambient temperature (X 2 ), daily average temperature (X 3 ), daily average ambient humidity (X 4 ), daily average dew point temperature (X 5 ), daily average wind speed (X 6 ), daily average rainfall (X 7 ), daily total solar radiation (X 8 ), and daily average soil temperature (X 9 ). The Pearson correlation coefficient and Spearman correlation coefficient are shown in Figure 3. is placed in a stainless steel box and fixed to the ground with expansion screws. The ambient temperature and humidity sensor, air pressure sensor, solar radiation sensor, rainfall sensor and integrated soil temperature, humidity and salt sensor are connected to the collector through the data line, and the collected data enter the data collector through the data line. The data collector uploads the collected data to the cloud platform server in real time through the wireless communication module, and the client computer can view and download the test data by accessing the cloud, thus realizing the long-term monitoring of the vegetation growth environment on the slope.

Influencing Factors of Soil Moisture
Soil moisture of ecological berm is closely related to local meteorological factors, and soil moisture is influenced not only by its own hydrological factors, but also by surrounding meteorological factors. The movement of the atmospheric system is a highly unified process of determinism and stochasticity. From a macroscopic point of view, the uncertainty of soil moisture variation is mainly determined by the influence of preexisting meteorological conditions. Climate is the main factor that determines the distribution of soil moisture on earth, while the uncertainty of soil moisture mainly comes from the stochastic nature of atmospheric factors. Therefore, meteorological factors can be used to predict soil moisture. In this paper, nine meteorological factors are obtained, namely, daily maximum ambient temperature (X1), daily minimum ambient temperature (X2), daily average temperature (X3), daily average ambient humidity (X4), daily average dew point temperature (X5), daily average wind speed (X6), daily average rainfall (X7), daily total solar radiation (X8), and daily average soil temperature (X9). The Pearson correlation coefficient and Spearman correlation coefficient are shown in Figure 3.  In general, the correlation strength of the variables can be judged by the following range of values, with correlation coefficients in the range of 0.8-1.0 for very strong correlation, 0.6-0.8 for strong correlation, 0.4-0.6 for moderate correlation, 0.2-0.4 for weak correlation, and 0.0-0.2 for very weak or no correlation. The correlation was based on the Pearson correlation coefficient and Spearman correlation coefficient between meteorological factors and soil moisture, and it was found that (Y) was very strongly correlated with (X2), (X5), and (X9), strongly correlated with (X1) and (X3), and moderately correlated with (X4) and (X8). The Pearson correlation coefficient between (Y) and (X7) shows a strong correlation, and the Spearman correlation coefficient turns out to be weakly correlated. (Y) showed a very weak or no correlation with (X6), so this factor was excluded (see Figure 3).  The red word means that the correlation between wind speed (X 6 ) and soil moisture is extremely weak. Based on this correlation, wind speed is excluded from soil moisture prediction.
In general, the correlation strength of the variables can be judged by the following range of values, with correlation coefficients in the range of 0.8-1.0 for very strong correlation, 0.6-0.8 for strong correlation, 0.4-0.6 for moderate correlation, 0.2-0.4 for weak correlation, and 0.0-0.2 for very weak or no correlation. The correlation was based on the Pearson correlation coefficient and Spearman correlation coefficient between meteorological factors and soil moisture, and it was found that (Y) was very strongly correlated with (X 2 ), (X 5 ), and (X 9 ), strongly correlated with (X 1 ) and (X 3 ), and moderately correlated with (X 4 ) and (X 8 ). The Pearson correlation coefficient between (Y) and (X 7 ) shows a strong correlation, and the Spearman correlation coefficient turns out to be weakly correlated. (Y) showed a very weak or no correlation with (X 6 ), so this factor was excluded (see Figure 3).
Atmospheric temperature and soil temperature exchange heat directly due to the presence of temperature differences, so changes in temperature can affect changes in soil moisture. Wind can contribute to the decrease of soil moisture by promoting the evaporation of water to absorb heat, showing a negative correlation. The relative humidity of air can prevent a certain amount of solar radiation from being absorbed by the ground and can also prevent the escape of long-wave radiation from the ground. Soil moisture has different lag times for each meteorological factor, with temperature and wind speed having the most rapid effect on soil temperature, followed by total solar radiation, and relative humidity the slowest. Considering that the lag time of soil moisture to total solar radiation and ambient humidity is more obvious, the lag is considered for prediction. After considering their lags, the correlation coefficients are significantly improved, as shown in Table 2. Considering the discrete nature of field monitoring results due to small samples and many factors, a neural network algorithm optimized by GA was used. Among 100 days of monitoring data, the meteorological data of the first 90 days were selected as the training samples and the data of the last 10 days as the prediction samples and the GA-BP regression prediction model was programmed by MATLAB language. Experimental data for meteorological factors and soil moisture are shown in Table A1.

Comparison of Model Prediction
A GA-BP regression prediction model was developed using 100 days of data obtained through a meteorological monitoring system as the experimental object. Eight meteorological factors were used as the input layer and soil moisture was used as the output layer. Among them, the first 90 days of monitoring data were used as the training set and 10 days of data were used as the test set, and the soil moisture of the ecological berm was simulated and predicted by neural network. In this prediction model, the crossover probability was 0.8, the variance probability was 0.05, the initial population size was 100, and the maximum evolutionary generation was 100. Prediction simulation experiments were conducted using Matlab software and the prediction results were compared. The results of the training set considering the lag are shown in Figure 4.
After running the program, the MAE, MSE, RMSE, and MAPE can be obtained by the above equations, the BP neural network prediction model is compared with the GAoptimized neural network prediction model, the prediction results of the original data are compared with the prediction results considering the lag, and the re-derived results are shown in Figure 5 and Table 3.
The MSE of the training set with lags is 0.403. By observing (a) and (b) in Figure 5, it is easy to find that the accuracy of the BP prediction model with lags is lower than that without lags, while the accuracy of the GA-BP prediction model with lags is higher than that without lags, and the accuracy of the GA-BP prediction model with lags is better. The GA-BP prediction model is significantly better than the BP prediction model. Through Table 3, comparing the original data and the prediction model considering lags, it can be seen that the GA-BP neural network prediction model has an MSE of 0.505 and a MAPE of 1.595% for the original data. The MSE and MAPE of the GA-BP prediction model considering lags were 0.0516 and 0.412%, respectively. The MSE of the BP prediction model was 39.7918 and the mean absolute error MAPE was 10.592%, which was larger for the prediction of soil moisture. The GA-BP prediction model considering hysteresis has good prediction effect, and the error is controlled around 0.01, which can predict soil moisture more accurately. After running the program, the MAE, MSE, RMSE, and MAPE can be obtained by the above equations, the BP neural network prediction model is compared with the GAoptimized neural network prediction model, the prediction results of the original data are compared with the prediction results considering the lag, and the re-derived results are shown in Figure 5 and Table 3.  The MSE of the training set with lags is 0.403. By observing (a) and (b) in Figure 5, it is easy to find that the accuracy of the BP prediction model with lags is lower than that  After running the program, the MAE, MSE, RMSE, and MAPE can be obtained by the above equations, the BP neural network prediction model is compared with the GAoptimized neural network prediction model, the prediction results of the original data are compared with the prediction results considering the lag, and the re-derived results are shown in Figure 5 and Table 3.  The MSE of the training set with lags is 0.403. By observing (a) and (b) in Figure 5, it is easy to find that the accuracy of the BP prediction model with lags is lower than that

Prediction Model Evaluation
The cross-validation method was selected to evaluate the soil moisture prediction model. The cross-validation method is a method that eliminates the bias caused by random sampling during sample training and is a method that directly estimates the generalization error without any preconditions. Cross-validation refers to dividing the original data into a training set and a test set according to the prediction settings of the neural network. The training set is used to train the model and the test sample is used to validate the trained model to evaluate the prediction accuracy of the model. We use the V-fold cross-validation method to evaluate the prediction model. The V-fold cross-validation method refers to dividing the data into V parts, selecting one of them as the test set and V-1 part as the test set, repeating v times, and finally analyzing the average error of v times as the performance index for model evaluation. In this paper, 100 sets of data are divided into 10 parts in chronological order, and the GA-BP prediction model considering lag is evaluated by the V-fold cross-validation method, as shown in Figure 6. prediction effect, and the error is controlled around 0.01, which can predict soil moisture more accurately.

Prediction Model Evaluation
The cross-validation method was selected to evaluate the soil moisture prediction model. The cross-validation method is a method that eliminates the bias caused by random sampling during sample training and is a method that directly estimates the generalization error without any preconditions. Cross-validation refers to dividing the original data into a training set and a test set according to the prediction settings of the neural network. The training set is used to train the model and the test sample is used to validate the trained model to evaluate the prediction accuracy of the model. We use the V-fold cross-validation method to evaluate the prediction model. The V-fold cross-validation method refers to dividing the data into V parts, selecting one of them as the test set and V-1 part as the test set, repeating v times, and finally analyzing the average error of v times as the performance index for model evaluation. In this paper, 100 sets of data are divided into 10 parts in chronological order, and the GA-BP prediction model considering lag is evaluated by the V-fold cross-validation method, as shown in Figure 6.  Table 4, it can be seen that the average RMSE and MAPE of the BP neural network prediction model are 4.845 and 9.316%, respectively, while the average RMSE and MAPE of the GA-BP neural network prediction model are 0.540 and 1.152%, respectively. After cross-validation, it is obvious that the prediction results of BP neural network fluctuate a lot, while the prediction results of GA-BP neural network are relatively stable. Ten experiments show that the accuracy of its model is greatly improved compared with BP neural network. It is not reliable to evaluate the accuracy of the model by the accuracy of one prediction, and there is an element of random error. After cross-validation, it is more reasonable to use the average of RMSE and MAPE as the evaluation criteria of the model.  Table 4, it can be seen that the average RMSE and MAPE of the BP neural network prediction model are 4.845 and 9.316%, respectively, while the average RMSE and MAPE of the GA-BP neural network prediction model are 0.540 and 1.152%, respectively. After cross-validation, it is obvious that the prediction results of BP neural network fluctuate a lot, while the prediction results of GA-BP neural network are relatively stable. Ten experiments show that the accuracy of its model is greatly improved compared with BP neural network. It is not reliable to evaluate the accuracy of the model by the accuracy of one prediction, and there is an element of random error. After cross-validation, it is more reasonable to use the average of RMSE and MAPE as the evaluation criteria of the model.

Discussion and Outlook
In this study, we investigated the meteorological factor monitoring and soil moisture prediction methods for high and steep slopes, and proposed a machine learning-based soil moisture prediction method. The correlation between meteorological factors and soil moisture was obtained by analyzing meteorological factors and soil moisture using MATLAB software, and a soil moisture prediction method of BP neural network optimized by genetic algorithm considering hysteresis was established. The results show that the prediction results obtained by the method have a small error with the actual results and a high accuracy.
We compared two prediction methods, BP neural network and GA-BP neural network, to determine the accuracy of soil moisture prediction. BP neural network has the problem of falling into local optimum, so the accuracy of this prediction method is low. For the proposed GA-BP neural network prediction method, the advantage of this method is that it can optimize the defects of the BP neural network itself, and the accuracy of the prediction results is significantly improved.
At present, the research on predicting soil moisture through the correlation between meteorological factors and soil moisture is still in its initial stage. The factors affecting soil moisture are many and complex, and the prediction methods still need to be improved continuously. In this paper, we consider the lag of total solar radiation and relative humidity on soil moisture, but meteorological factors such as sunshine duration and atmospheric pressure also have important effects on the variation of soil moisture. Therefore, in order to predict soil moisture more scientifically and accurately, a more comprehensive and in-depth study of the correlation between soil moisture and meteorological factors is needed. Soil moisture varies with soil depth, so further re-examination of soil moisture variation at different soil depths is needed.