Abstract
Accurate estimation of reference evapotranspiration is a key step in irrigation and water resources planning. The Penman Monteith (FAO56-PM) formula recommended by FAO56-PM is the standard for calculating the reference evapotranspiration. However, the FAO56-PM model is limited in the observation of meteorological variables, so it is necessary to choose an alternative ET0 model which requires less meteorological data. Based on the daily climate data of eight meteorological stations in northern Xinjiang from 2000 to 2020, seven empirical models (Hargreaves, Berti, Dorji, Dalton, Meyer, WMO, Albrecht) and four optimization algorithms (RF model, LS-SVR model, Bi-LSTM model and GA-BP model) combined with seven different parameters were evaluated comprehensively. The results show that the accurate of the empirical model based on temperature is obviously better than the empirical model based on air mass transport. The annual and multi-year alternative ET0 models of different input parameter combinations are: LS-SVR1, RF2, LS-SVR3, LS-SVR4, GA-BP5, LS-SVR6, GA-BP7. It can be used as a substitute for the reference evapotranspiration model without relevant meteorological data. Only the LS-SVR6 model and GA-BP7 model are recommended as the best alternative models for northern Xinjiang reference evapotranspiration at daily, monthly and seasonal scales.
1. Introduction
The quantification and accurate estimation of evapotranspiration are of great significance to the formulation of farmland irrigation systems, the study of hydrology and water balance, and the planning of water resources. The Penman Monteith (FAO56-PM) method recommended by the FAO [1] is the standard method for estimating reference evapotranspiration (ET0) thus far [2,3]. This method does not need to be initially calibrated locally and has global applicability [4,5,6]. However, this method has many input parameters (air temperature, humidity, solar radiation and wind speed), so the acquisition of many meteorological parameters becomes the only limitation of its application [7]. In this case, relatively simple empirical models are usually used to estimate ET0, and the selection of the optimal empirical model is of great significance to water resource planning and management [8,9]. Existing empirical evapotranspiration models mainly include temperature-based, radiation-based, air-mass-transport-based and combination models. The results show that the combined models are better, followed by radiation-based, temperature-based and aerodynamic models [10]. However, the combined model is highly dependent on the input of meteorological variables, which requires the sufficient input of meteorological variables, such as net radiation, soil heat flux, air temperature, wind speed and relative humidity. Moreover, some scholars also pointed out that, compared with the climate in humid and semi-humid regions, ET0 in arid and semi-arid regions is mainly affected by aerodynamics and water vapor pressure deficit rather than radiation availability [11,12]. Therefore, the ET0 model based on temperature and air mass transport was selected in this study to verify its applicability in northern Xinjiang.
In recent years, more scholars have applied artificial intelligence (AI) to the prediction and estimation of ET0 and published a number of papers. For example, Vahid Nourani et al. [13] used a feed-forward neural network (FFNN), adaptive neuro-fuzzy reasoning system (ANFIS), support vector machine regression (SVR) and other algorithms to simulate the ET0 several climate zones of Turkey, Cyprus, Iraq, Iran and Libya. Ahmed Elbeltagi et al. [14] estimated long-term ET0 in Egypt through a deep neural network (DNN). Babak Mohammadi and Saeid Mehdizadeh [15] used support vector regression (SVR) and random forest (RF) to simulate daily ET0 from Isfahan, Urmiya and Yazd, Iran. Tongren Xu et al. [16] evaluated the adaptability of machine learning, remote sensing and land surface ET0 products in the United States. Saman Maroufpoor et al. [17] simulated ET0 in Iran by using artificial neural network optimization (ANN-GWO). Francesco Granata [18] used the M5P regression tree, bagging, random forest (RF) and support vector machine regression (SVR) to simulate ET0 in central Florida. Juan Yin et al. [19] simulated ET0 in the central Ningxia region of China by using the mixed bidirectional long- and short-term memory model (BI-LSTM). Jingran Liu et al. [20] predicted the actual evapotranspiration of green pepper by using an extended neural network (MEA-ENN, GA-ENN) optimized by the mind evolutionary algorithm and genetic algorithm. The results show that artificial intelligence can accurately predict daily ET0 and is a powerful tool for modelling ET0 using incomplete meteorological parameters.
In summary, this study selected eight widely used empirical models for performance evaluation, including four temperature-based models: Hargreaves model [21], Berti model [22] and Dorji model [23] and four mass transfer-based models: Dalton Model [24], Meyer Model [25], WMO Model [26] and Albrecht Model [27]. Four optimization algorithms (RF model, LS-SVR model, BI-LSTM model and GA-BP model) were combined with 28 optimization algorithm models to make a comprehensive ranking. On this basis, this study proposes the following hypotheses. First, the empirical model and the ET0 model based on the optimization algorithm are significantly affected by time and space in northern Xinjiang. Second, simple linear regression and the global performance index GPI can effectively verify the performance effect of the ET0 model in northern Xinjiang. The research objectives of this study are: (1) to use linear regression to verify seven empirical models to determine the best substitute for the FAO56-PM model in northern Xinjiang; (2) to use the global performance index GPI to rank the 28 models based on the optimization algorithm and to determine the best algorithm model under seven different parameter input combinations; and (3) to discuss the influence of time scale on the model and to recommend the most appropriate reference evapotranspiration estimation model for northern Xinjiang at different time scales. The innovation of this study lies in the first application of eight empirical models and four models based on an optimization algorithm in northern Xinjiang and the recommendation of corresponding models in the absence of meteorological data to fill the gap in knowledge regarding models suitable for northern Xinjiang under different meteorological parameters.
2. Overview of the Study Area and Data
2.1. Study Area
Xinjiang is located in the hinterland of Eurasia and the northwest border of China. The Tianshan Mountains traverse the whole territory, dividing the whole territory into northern and southern Xinjiang. The part north of the Tianshan Mountains is called northern Xinjiang. In northern Xinjiang lies the Altai Mountains, and between the Altai Mountains and Tianshan Mountains lies the semi-closed Junggar Basin (Figure 1). Northern Xinjiang has a temperate continental arid climate, and the annual average temperature ranges from 1.96 °C to 9.06 °C. The annual precipitation ranges from 137.75 mm to 253.96 mm, and the annual evaporation ranges from 909.92 mm to 1328.76 mm (Table 1). In the eastern and western regions, the wind speed is high, and the number of windy days is high. The mountainous terrain is undulating, and the wind is blocked. The number of gale days in the plains area is greater than that in the middle and low mountainous areas, and the number of gale days in the Alashankou area in the western Junggar Basin is the highest. In short, due to the dry climate, less precipitation, sparse vegetation, loose soil quality and high winds, the annual ET0 in northern Xinjiang is relatively large, and it is vulnerable to drought stress.
Figure 1.
Geographical locations of Gurbantunggut Desert and meteorological stations.
Table 1.
Geographical and long-term average meteorological information for 8 stations in Gurbantunggut Desert, Xingjiang, China. The values in parentheses are the standard deviations for each variable.
2.2. Data Sources
Eight weather stations in northern Xinjiang were selected in this study (Figure 1), and daily meteorological data were collected from 2000 to 2020. The observation data include temperature, humidity, wind speed, rainfall, saturated water vapour pressure deficit and extraterrestrial radiation (Table 1). The climatic data of historical days are obtained from the China Meteorological Data Sharing Service System (http://data.cma.cn (access on 24 November 2021)). Among them, the Wujiaqu, Wusu, Shawan and Qinghe datasets span from 2000 to 2017, and the datasets of Hutubi, Fuyun, Hebukesel and Karamay span from 2000 to 2020.
3. Modelling Structure and Approach
3.1. FAO56 Penman–Monteith Model
The FAO56 P-M approach proposed by Allen et al. [1] served here as a reference method, it is expressed by Equation (1):
where ET0 is the reference evapotranspiration (mm day−1), Rn is net radiation (MJm−2 day−1), G is soil heat flux (MJm−2 day−1), γ is the psychometric constant (kPa °C−1), T is the mean air temperature (°C), U2 is the wind speed at 2 m height (ms−1), es is the saturation vapour pressure (kPa), ea is the actual vapour pressure (kPa), and ∆ is the slope of the vapour pressure curve (kPa °C−1).
3.2. Empirical Models
Temperature-based models performed well in all subregions of the world. Due to the special climatic conditions in desert areas, it is necessary to select an empirical model that is more suitable for ET0 in northern Xinjiang through model selection. Hargreaves and Allen, Berti and Dorji models were selected for comparison with FAO56 P-M models (Table 2). Similarly, four empirical models based on air mass transport, Dalton, Meyer, WMO and Albrecht, were selected to verify the applicability of the four models in northern Xinjiang to find an alternative model with higher applicability under fewer input meteorological parameters (Table 2).
Table 2.
Empirical reference evapotranspiration model and parameter calculation.
3.3. Random Forest-Based Reference Evapotranspiration (ET0) Model
Random forest (RF) is a class of discriminant models that support classification, regression, and multi-classification [28]. It is based on bagging integration on decision trees and introduces random attribute selection of prediction in the training process of decision trees. It establishes a forest in a random way. The forest is composed of many decision trees. There is no correlation between each decision tree, and with the same distribution for all trees, multiple trees are used to train and predict the samples [29].
For a given classifier set, the numerical estimator is h1(X), h2(X), …, hk(X), and a random vector Θ is set; therefore, the tree predictor h (X, Θ) can take on numerical values. As the number of trees in the forest increases, the training set drawn at random from the distribution of the random vector Y, X, and the marginal functions can be given as:
For a large number of decision trees, the following two basic theorems are valid:
Theorem 1.
We define the upper bound for generalization error of RF as follows:
Theorem 2.
As the number of decision trees increase, all sequences will finally converge to:
3.4. Least Square Support Vector Regression
A support vector machine (SVM) was introduced in the 1990s [30] and then extended for support vector regression (SVR), which is a new machine learning algorithm based on the structural risk minimization criterion. The biggest problem of traditional neural networks is that they can minimize the training error. However, it cannot minimize the generalization error in the learning process. SVM is successfully applied for the classification and prediction based on VC dimensions and structural risk minimization. Then, the least square support vector machine (LS-SVR) was proposed by Suykens and Vandewalle [31] of the United States on this basis. The difference between LS-SVR and SVM is that the relaxation variables in the optimization objective are changed from a penalty term to a quadratic term, and the constraint conditions are also changed into only equality constraints, which allows the solution to be realized by the least square method.
The linear regression process is as follows: the n-dimensional inputted vector and 1-dimensional outputted vector: (xk, yk), xk ∈ Rn, yk ∈ R, k = 1, …, N, are mapped from the original space to the high-dimensional space F, and the optimal linear regression function is constructed in F. That is:
According to the principle of structural risk minimization, there are:
where, ‖ω‖2 is the complexity of control model, γ is the determining penalty factor, Remp is the loss function, LS-SVM selects the two-norm of e as the loss function. At this time, the optimization problem becomes:
The constraint condition is:
The model is changed into dual space and Lagrange function is introduced:
The partial derivation of the above formula is obtained:
For k = 1, …, N, the system of linear equations is obtained by eliminating ω and e:
Obtained:
where, I is the identity matrix.
a and b are calculated by the least square method:
Let K (x, xk) = φT(x)φ(xk), the prediction model of LS-SVM is as follows
where K(x, xk) is the kernel function and the kernel function realizes the mapping from low-dimensional space to high-dimensional space and transforms the non-linear problem of low-dimensional space into a linear problem of high-dimensional space.
3.5. Bidirectional Long-Term and Short-Term Memory Network
The bidirectional long-term and short-term memory network (LSTM) cell, first proposed by Hochreiter and Schmidhuber [32], is an upgraded version of a recursive neural network that can overcome the problem of vanishing recurrent neural network (RNN) gradients. Standard LSTM networks deal with sequences chronologically, and they ignore future contexts. To overcome this shortcoming, the Bi-LSTM algorithm is introduced to improve the accuracy. The algorithm is a deformed structure of LSTM, and the forward and backward LSTM layers contained in the memory block are able to utilize past and future information. The hidden layer with opposite time series is obtained, and the two hidden layers are connected to obtain the same output result. The hidden layer of Bi-LSTM at time t includes forward and backward bars:
where T denotes the length of the time.
3.6. Back Propagation Neural Network Optimized by Genetic Algorithm
The genetic algorithm (GA) is a heuristic algorithm simulating biological evolution that has three genetic operators: selection, crossover and mutation [33]. BPNN consists of a three-layer network structure of an input layer, hidden layer and output layer [12]. With neurons as the basic unit, the BPNN solves the non-linear fitting problem mainly by setting activation functions in the hidden layer and output layer. In this study, a genetic algorithm was used to optimize the structural parameters of the BP neural network. To realize the dynamic updating of the value interval of the number of gene loci in the hidden layer, the crossover operator and mutation operator of the structural parameter chromosome in GA-BP were improved. First, the neuron gene calculation was carried out, and then the value interval of gene loci was derived according to the number of neurons to generate the value of gene loci. The BP structural parameter population was optimized by a genetic operator to obtain the optimal structural parameters (Figure 2).
Figure 2.
Optimization of BP neural network structure parameters by GA.
Through the correlation analysis of meteorological factors (Figure 3), it can be seen that under the condition of p < 0.05, all parameters are significantly correlated with ET0.
Figure 3.
Correlation analysis of meteorological factors.
Therefore, the following parameters (Table 3) are selected for ET0 prediction. We adopted four optimization algorithms, including RF, LS-SVR, BI-LSTM and GA-BP, and combined with different parameter combinations; a total of 28 reference evapotranspiration models were established.
Table 3.
Summary of the inputs used for the implementation of each RF, LS-SVR, Bi-LSTM and GA-BP models.
Figure 4 shows the flow chart of reference evapotranspiration model research. The data from 2000 to 2014 were taken as the training dataset, and the data from 2015 to 2020 were taken as the test dataset. Four different algorithms were combined to simulate ET0 values in northern Xinjiang, and the simulation results were evaluated.
Figure 4.
General representation of the ET0 model study flowchart.
3.7. Performance Evaluation of Models
The assessment of the models was carried out using 5 statistical performance evaluations: the root mean square error (RMSE), the mean absolute error (MAE), the mean bias error (MBE), correlation of determination (R2) and global performance indicator (GPI) expressions, which are as follows:
where ETP-M, i is the FAO56 P-M model-observed reference evapotranspiration at the ith time step (mm day−1), ETE,i is the empirical or algorithm model (RF, LS-SV, Bi-LSTM and ANN models)-observed reference evapotranspiration at the ith time step (mm day−1), and n is the total observations. Ai,new is the normalized value of the above four indicators, corresponds to the median of the indicators, and αi is equal to 1 for R2, equal to −1 for the RMSE and MAE and equal to ±1 for the MBE indicators (the sign is the same as MBE).
4. Results and Analysis
4.1. Performance Appraisal of Seven Empirical Models (Temperature-Based and Mass Transfer-Based) for Estimating ET0
Figure 5 shows that the three temperature-based empirical models have good applicability in northern Xinjiang, China, with R2 values above 0.9. The Dorji model has the highest coefficient of determination, with R2 up to 0.9275. However, the empirical model based on air mass transport is not ideal in northern Xinjiang, with an overall R2 lower than 0.8, and the R2 of the Albrecht model is only 0.7477. Compared with the 1:1 line, the Hargreaves model overestimate ET0. The Berti model, Dorji model, Dalton model, Meyer model, WMO model and Albrecht model underestimated in northern Xinjiang. The average daily ET0 of the Meyer model, WMO model and Albrecht model are all less than 2 mm, which is only 1/4 of that calculated by the P-M model, with a large error. Under the condition of wind speed U2 and saturated vapour pressure difference (es − ea) as the only inputs, it is impossible to accurately estimate ET0 in this region. Among the seven empirical models, the Berti model and Hargreaves model have the best calculation effect in the northern region of Xinjiang. However, the WMO model and Meyer model have poor simulation effects. This finding indicates that the model based on air mass transport has poor applicability in northern Xinjiang, which further proves that temperature is the decisive factor affecting ET0 in northern Xinjiang.
Figure 5.
Comparisons of the daily reference evapotranspiration (ET0) values calculated by the FAO-56 PM method vs. modelled data via the seven empirical models, including the Hargreaves model (a), Berti model (b), Dorji model (c), Dalton model (d), Meyer model (e), WMO model (f) and Albrecht model (g).
4.2. Components Comparison of the Four Algorithm Models for Estimating ET0
In this study, seven different parameter combination input modes (Table 3) and four optimization algorithms, including the RF, LS-SVR, Bi LSTM and GA-BP models, were used to rank and evaluate the models in the training period (Table 4) and test period (Table 5) by introducing the global performance index (GPI). In the training period, the top 10% of the models were focused on the RF model, while the advantage of the RF model disappeared in the test period, which indicates that the RF model has overfitting in the training period, resulting in a significant reduction in the accuracy of the model in the test period. Therefore, the model evaluation in this paper is mainly based on the test period to reduce the impact of the overfitting algorithm on the simulation results.
Table 4.
Performance of RF, LS-SV, Bi-LSTM and GA-BP models during the training period.
Table 5.
Performance of RF, LS-SV, Bi-LSTM and GA-BP models during the testing period.
Table 5 shows that the GA-BP5 model has the best overall simulation effect during the test period, with an RMSE of 0.2542 mm·d−1, MAE of 0.1706 mm·d−1, MBE of −0.0039 and R2 of 0.9918. The second best simulation is the LS-SVR6 model, wherein the RMSE is 0.2380 mm·d−1, MAE is 0.1604 mm·d−1, MBE is −0.0425 and R2 is 0.9928.
The worst model is the Bi-LSTM1 model, wherein the RMSE is 0.7811 mm·d−1, MAE is 0.5374 mm·d−1, MBE is −0.3776 and R2 is 0.9227. The worst Bi-LSTM1 model has the least number of input parameters, and the best, the GA-BP5 model and LS-SVR6 model, have the most input parameters, which shows that the number of input parameters has a great impact on the model simulation accuracy. In the same algorithm with different parameter combinations, the best results of RF model and LS-SVR model appear in the sixth group of parameter inputs (T, Tmax, Tmin, Ra, Rs and U2), and the best results of Bi LSTM model and GA-BP model appear in the fifth group of parameter inputs (T, Tmax, Tmin, Ra, Rs and U2); this finding shows that the accuracy of the model cannot be improved by 100% only by increasing the number of input parameters.
Because the influence of meteorological factors on reference evapotranspiration is different, considering RH and U2, RMSE decreases by 0.5 mm·d−1, MAE decreases by 0.32 mm·d−1, MBE amplitude decreases by 0.3 and R2 increases by 0.077. Considering the radiation terms Rs and U2 based on the temperature model, RMSE decreases by 0.52 mm·d−1, MAE decreases by 0.33 mm·d−1, MBE amplitude decreases by 0.28 and R2 increases by 0.076. The results show that the radiation term Rs and aerodynamic term U2 have a significant effect on reference evapotranspiration, and the reference evapotranspiration model is mainly composed of three parts: the temperature term, radiation term and aerodynamic term.
We obtained the optimal model results under different input combinations (Table 6). Figure 6 shows a comparison between the optimal model and the P-M model under the conditions of various input parameters, which can also support the view that the accuracy of simulation results is improved after considering the radiation term Rs and aerodynamic term U2 parameters.
Table 6.
The optimal model of different parameter combination input.

Figure 6.
Comparisons of the daily reference evapotranspiration (ET0) values calculated by the FAO-56 PM method vs. modelled data via the optimal reference evapotranspiration model under different input combinations during the testing phase. LS-SVR1 model correlation diagram (a), LS-SVR1 model measured value and simulated value (b); RF2 model correlation diagram (c), RF2 model measured value and simulated value (d); LS-SVR3 model correlation diagram (e), LS-SVR3 model measured value and simulated value (f); LS-SVR4 model correlation diagram (g), LS-SVR4 model measured value and simulated value (h); GA-BP5 model correlation diagram (i), GA-BP5 model measured value and simulated value (j); LS-SVR6 model correlation diagram (k), LS-SVR6 model measured value and simulated value (l); GA-BP7 model correlation diagram (m), GA-BP7 model measured value and simulated value (n).
4.3. Evaluation of Optimal Reference Evapotranspiration Model under Different Time-Scale Conditions
In the verification of the empirical model and optimization algorithm model, this paper takes the data from 2000 to 2020 as samples. A large amount of data and a long time span increase the model verification accuracy. Generally, the estimation of crop water requirements is only based on the reference crop evapotranspiration of 5–7 days. Therefore, the influence of different time scales on the simulation accuracy of the model is also included in the scope of this paper. We divided our data into four time scales, which were 7 days (1 July 2015 to 7 July 2015), one month (May 2019), one season (summer 2015) and one year (2017), and evaluated them with the Berti model, Hargreaves model and optimal model under seven different parameter input combinations. Figure 7 shows a comparison of Taylor diagrams of models under different time scales. It can be seen from Figure 7, when the time scale is 7 days, the range of standard deviation is 0.5~1.25, while, when the time scale is year, the range of standard deviation is reduced to 0.625~1, and the correlation is increased from 0.75 to 0.95. This result shows that the time scale has a significant effect on the accuracy of the model, and the accuracy of the model increases with increasing time scale. At the annual scale, the seven models based on the optimization algorithm are better. When meteorological data are missing, seven models can be used to estimate ET0 in northern Xinjiang. Only the LS-SVR6 model and GA-BP7 model are recommended to estimate the local ET0 at the daily, monthly and seasonal scales.
Figure 7.
Evaluation of Taylor diagram for reference evapotranspiration model in different time scales; 7 days (a), May 2019 (b), quarter 2015 (c), 2016 (d).
5. Discussion
The correlation between reference evapotranspiration model and meteorological factors is the key to determining the input parameters of the optimization model, and many studies have shown that air temperature is the primary factor affecting reference evapotranspiration [4,5,7]. This study shows that ET0 has a very significant positive correlation with the daily average temperature T, the daily minimum air temperature Tmin, the daily maximum air temperature Tmax, the net radiation Ra and the saturated water vapor pressure difference VPD at the level of p < 0.001 (Figure 3), and the saturated water air pressure difference VPD is a function of temperature and humidity. The relative humidity RH was negatively correlated with ET0 at the level of p < 0.001. The whole evapotranspiration process is dominated by temperature and net radiation Ra. This result is consistent with the conclusions of some scholars [22,23,24].
This study found that among the empirical models, the temperature-based model is more suitable for northern Xinjiang, this is consistent with the conclusion of Rachid et al. [34]. That is, the temperature-based model is more suitable for arid and semi-arid areas. The reason is that heat is the main driving factor of water transmission and the key step of the water cycle, especially in arid and semi-arid areas with limited water resources. It is feasible and reliable to apply the optimization algorithm to ET0 prediction. However, under the same parameter input conditions, the estimation accuracy of ET0 at different scales is different, this conclusion is consistent with the views of some scholars [35,36,37]. The main reason is that the time scale itself is the factor restricting the accuracy of the algorithm. It can be seen from Figure 7 that the model based on the optimization algorithm is significantly better than the empirical, and this conclusion has also been verified in semi-arid areas of Spain [38]. Chen et al. [12] also pointed out that the model based on optimization algorithm is better than the empirical model, and the empirical model based on temperature factor is better than that based on radiation or humidity in the training and research area. It shows that the application of an optimization algorithm can greatly improve the estimation accuracy of the model and make up for the shortcomings of existing empirical formulas.
6. Conclusions
Based on the daily datasets of eight meteorological stations in northern Xinjiang, China, from 2000 to 2020, seven empirical models (Hargreaves, Berti, Dorji, Dalton, Meyer, WMO, and Albrecht) and 28 models based on optimization algorithms (RF, LS-SVR, Bi-LSTM and GA-BP) were compared. Through the evaluation of eight empirical models, the Berti model and Hargreaves model are better alternative models, while the WMO model and Meyer model are worse alternative models. The global performance index (GPI) is used to rank 28 models based on the optimization algorithm, and the optimal recommendation models of the annual scale and multi-year scale under different input parameter combinations are obtained as follows: LS-SVR1 (input: T, Tmax, Tmin, Ra), RF2 (input: T, Tmax, Tmin, Ra, Rs), LS-SVR3 (input: T, Tmax, Tmin, Ra, RH), LS-SVR4 (input: T, Tmax, Tmin, Ra, Rs, RH), GA-BP5 (input: T, Tmax, Tmin, Ra, RH, U2), LS-SVR6 (input: T, Tmax, Tmin, Ra, Rs, U2) and GA-BP7 (input: T, Tmax, Tmin, Ra, U2). Only the LS-SVR6 model and GA-BP7 model are recommended as the best alternative models for ET0 when the local climate dataset is incomplete at the daily, monthly and seasonal scales, which has high applicability in northern Xinjiang. This study can provide theoretical guidance and technical support for the determination of farmland irrigation systems and water resource planning and management in northern Xinjiang. This method is also applicable to other arid and semi-arid areas.
Author Contributions
Conceptualization, P.J. and S.-J.H.; methodology, P.J.; software, P.J.; validation, P.J.; formal analysis, P.J.; investigation, P.J. and S.-J.H.; resources, S.-J.H.; data curation, P.J. and S.-J.H.; writing—original draft preparation, P.J.; writing—review and editing, P.J. and S.-J.H.; supervision, S.-J.H. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the special fund project of Xinjiang Water Conservancy Science and technology (No. YF2020-08 and XSK-2021-05), the National Natural Science Foundation of China (No. 41671032), the National Key R&D Program of China (No. 2013CB429,902).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data is contained within article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Allen, R.G. Assessing integrity of weather data for reference evapotranspiration estimation. J. Irrig. Drain. Eng. 1996, 122, 97–106. [Google Scholar] [CrossRef]
- Ladlani, I.; Houichi, L.; Djemili, L.; Heddam, S.; Belouz, K. Modeling daily reference evapotranspiration (ET0) in the north of Algeria using generalized regression neural networks (GRNN) and radial basis function neural networks (RBFNN): A comparative study. Meteorol. Atmos. Phys. 2012, 118, 163–178. [Google Scholar] [CrossRef]
- Falamarzi, Y.; Palizdan, N.; Huang, Y.F.; Lee, T.S. Estimating evapotranspiration from temperature and wind speed data using artificial and wavelet neural networks (WNNs). Agric. Water Manag. 2014, 140, 26–36. [Google Scholar] [CrossRef]
- Kim, S.; Kim, H.S. Neural networks and genetic algorithm approach for nonlinear evaporation and evapotranspiration modeling. J. Hydrol. 2008, 351, 299–317. [Google Scholar] [CrossRef]
- Kumar, M.; Raghuwanshi, N.S.; Singh, R. Development and Validation of GANN Model for Evapotranspiration Estimation. J. Hydrol. Eng. 2009, 14, 131–140. [Google Scholar] [CrossRef]
- Landeras, G.; Ortiz-Barredo, A.; Lopez, J.J. Comparison of artificial neural network models and empirical and semi-empirical equations for daily reference evapotranspiration estimation in the Basque Country (Northern Spain). Agric. Water Manag. 2008, 95, 553–565. [Google Scholar] [CrossRef]
- Ferreira, L.B.; da Cunha, F.F.; de Oliveira, R.A.; Fernandes, E.I. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM—A new approach. J. Hydrol. 2019, 572, 556–570. [Google Scholar] [CrossRef]
- Salam, R.; Islam, A.M.T.; Pham, Q.B.; Dehghani, M.; Al-Ansari, N.; Linh, N.T.T. The optimal alternative for quantifying reference evapotranspiration in climatic sub-regions of Bangladesh. Sci. Rep. 2020, 10, 20171. [Google Scholar] [CrossRef]
- Farias, V.D.D.; Costa, D.L.P.; Pinto, J.V.D.; de Souza, P.J.D.P.; de Souza, E.B.; Ortega-Farias, S. Calibration of reference evapotranspiration models in Para. Acta Sci. Agron. 2020, 42, e42475. [Google Scholar] [CrossRef] [Green Version]
- Valle, L.C.G.; Ventura, T.M.; Gomes, R.S.R.; Nogueira, J.D.; Lobo, F.D.; Vourlitis, G.L.; Rodrigues, T.R. Comparative assessment of modelled and empirical reference evapotranspiration methods for a brazilian savanna. Agric. Water Manag. 2020, 232, 106040. [Google Scholar] [CrossRef]
- Marino, M.A.; Tracy, J.C.; Taghavi, S.A. Forecasting of Reference Crop Evapotranspiration. Agric. Water Manag. 1993, 24, 163–187. [Google Scholar] [CrossRef]
- Chen, Z.J.; Zhu, Z.C.; Jiang, H.; Sun, S.J. Estimating daily reference evapotranspiration based on limited meteorological data using deep learning and classical machine learning methods. J. Hydrol. 2020, 591, 125286. [Google Scholar] [CrossRef]
- Nourani, V.; Elkiran, G.; Abdullahi, J. Multi-station artificial intelligence-based ensemble modeling of reference evapotranspiration using pan evaporation measurements. J. Hydrol. 2019, 577, 123958. [Google Scholar] [CrossRef]
- Elbeltagi, A.; Deng, J.S.; Wang, K.; Malik, A.; Maroufpoor, S. Modeling long-term dynamics of crop evapotranspiration using deep learning in a semi-arid environment. Agric. Water Manag. 2020, 241, 106334. [Google Scholar] [CrossRef]
- Mohammadi, B.; Mehdizadeh, S. Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric. Water Manag. 2020, 237, 106145. [Google Scholar] [CrossRef]
- Xu, T.R.; Guo, Z.X.; Xia, Y.L.; Ferreira, V.G.; Liu, S.M.; Wang, K.C.; Yao, Y.J.; Zhang, X.J.; Zhao, C.S. Evaluation of twelve evapotranspiration products from machine learning, remote sensing and land surface models over conterminous United States. J. Hydrol. 2019, 578, 124105. [Google Scholar] [CrossRef]
- Maroufpoor, S.; Bozorg-Haddad, O.; Maroufpoor, E. Reference evapotranspiration estimating based on optimal input combination and hybrid artificial intelligent model: Hybridization of artificial neural network with grey wolf optimizer algorithm. J. Hydrol. 2020, 588, 125060. [Google Scholar] [CrossRef]
- Granata, F. Evapotranspiration evaluation models based on machine learning algorithms-A comparative study. Agric. Water Manag. 2019, 217, 303–315. [Google Scholar] [CrossRef]
- Yin, J.; Deng, Z.; Ines, A.V.M.; Wu, J.B.; Rasu, E. Forecast of short-term daily reference evapotranspiration under limited meteorological variables using a hybrid bi-directional long short-term memory model (Bi-LSTM). Agric. Water Manag. 2020, 242, 106386. [Google Scholar] [CrossRef]
- Liu, J.R.; Meng, X.K.; Ma, Y.J.; Liu, X. Introduce canopy temperature to evaluate actual evapotranspiration of green peppers using optimized ENN models. J. Hydrol. 2020, 590, 125437. [Google Scholar] [CrossRef]
- Hargreaves, G.H.; Samani, Z.A. Estimating Potential Evapo-Transpiration. J. Irrig. Drain. Div. 1982, 108, 225–230. [Google Scholar] [CrossRef]
- Berti, A.; Tardivo, G.; Chiaudani, A.; Rech, F.; Borin, M. Assessing reference evapotranspiration by the Hargreaves method in north-eastern Italy. Agric. Water Manag. 2014, 140, 20–25. [Google Scholar] [CrossRef]
- Dorji, U.; Olesen, J.E.; Seidenkrantz, M.S. Water balance in the complex mountainous terrain of Bhutan and linkages to land use. J. Hydrol. Reg. Stud. 2016, 7, 55–68. [Google Scholar] [CrossRef] [Green Version]
- Dalton, J. Performance of Twelve Mass Transfer Based Reference Evapotranspiration Models under Humid Climate. Mem. Proc. Manch. Lit. Philos. Soc. 1802, 5, 535–602. [Google Scholar]
- Meyer, A. Über einige Zusammenhänge zwischen Klima und Boden in Europa. Chem. Erde 1926, 2, 209–347. [Google Scholar]
- Geneva (World Meteorological Organization). Measurement and estimation of evaporation and evapotranspiration. Tech. Pap. 1966, 83, 121. [Google Scholar]
- Albrecht, F. Die Methoden zur Bestimmung der Verdunstung der natürlichen Erdoberfläche. Arch. Meteorol. Geophys. Bioklimatol. Ser. B 1950, 2, 1–38. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
- Strobl, C.; Boulesteix, A.L.; Augustin, T. Unbiased split selection for classification trees based on the Gini Index. Comput. Stat. Data Anal. 2007, 52, 483–501. [Google Scholar] [CrossRef] [Green Version]
- Anguita, D.; Ridella, S.; Rovetta, S. Circuital implementation of support vector machines. Electron. Lett. 1998, 34, 1596–1597. [Google Scholar] [CrossRef]
- Suykens, J.A.K.; Vandewalle, J. Least squares support vector machine classifiers. Neural Process Lett. 1999, 9, 293–300. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
- Yin, C.G.; Luo, Z.Y.; Ni, M.J.; Cen, K.F. Predicting coal ash fusion temperature with a back-propagation neural network model. Fuel 1998, 77, 1777–1782. [Google Scholar] [CrossRef]
- Hadria, R.; Benabdelouhab, T.; Lionbouia, H.; Salhib, A. Comparative assessment of different reference evapotranspiration models towards a fit calibration for arid and semi-arid areas. J. Arid Environ. 2021, 184, 104318. [Google Scholar] [CrossRef]
- Feng, Y.; Cui, N.B.; Gong, D.Z.; Zhang, Q.W.; Zhao, L. Evaluation of random forests and generalized regression neural networks for daily reference evapotranspiration modelling. Agric. Water Manag. 2017, 193, 163–173. [Google Scholar] [CrossRef]
- Ferreira, L.B.; da Cunha, F.F. Multi-step ahead forecasting of daily reference evapotranspiration using deep learning. Comput. Electron. Agric. 2020, 178, 105728. [Google Scholar] [CrossRef]
- Ferreira, L.B.; da Cunha, F.F. New approach to estimate daily reference evapotranspiration based on hourly temperature and relative humidity using machine learning and deep learning. Agric. Water Manag. 2020, 234, 106113. [Google Scholar] [CrossRef]
- Jiménez, J.A.B.; Estévez, J.; Marín, A. New machine learning approaches to improve reference evapotranspiration estimates using intra-daily temperature-based variables in a semi-arid region of Spain. Agric. Water Manag. 2021, 245, 106558. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).








