Machine Learning-Based Estimation of Daily Cropland Evapotranspiration in Diverse Climate Zones

: The accurate prediction of cropland evapotranspiration (ET) is of utmost importance for effective irrigation and optimal water resource management. To evaluate the feasibility and accuracy of ET estimation in various climatic conditions using machine learning models, three-, six, and nine-factor combinations (V3, V6, and V9) were examined based on the data obtained from global cropland eddy flux sites and Moderate Resolution Imaging Spectroradiometer (MODIS) remote sensing data. Four machine learning models, random forest (RF), support vector machine (SVM), extreme gradient boosting (XGB), and backpropagation neural network (BP), were used for this purpose. The input factors included daily mean air temperature (T a ), net radiation (R n ), soil heat flux (G), evaporative fraction (EF), leaf area index (LAI), photosynthetic photon flux density (PPFD), vapor pressure deficit (VPD), wind speed (U), and atmospheric pressure (P). The four machine learning models exhibited significant simulation accuracy across various climate zones, reflected by their global performance indicator (GPI) values ranging from − 3.504 to 0.670 for RF, − 3.522 to 1.616 for SVM, − 3.704 to 0.972 for XGB, and − 3.654 to 1.831 for BP. The choice of suitable models and the different input factors varied across different climatic regions. Specifically, in the temperate–continental zone (TCCZ), subtropical–Mediterranean zone (SMCZ), and temperate zone (TCZ), the models of BP C -V9, SVM S - V6, and SVM T -V6 demonstrated the highest simulation accuracy, with average RMSE values of 0.259, 0.373, and 0.333 mm d − 1 , average MAE values of 0.177, 0.263, and 0.248 mm d − 1 , average R 2 values of 0.949, 0.819, and 0.917, and average NSE values of 0.926, 0.778, and 0.899, respectively. In climate zones with a lower average LAI (TCCZ), there was a strong correlation between LAI and ET, making LAI more crucial for ET predictions. Conversely, in climate zones with a higher average LAI (TCZ, SMCZ), the significance of the LAI for ET prediction was reduced. This study recognizes the impact of climate zones on ET simulations and highlights the necessity for region-specific considerations when selecting machine learning models and input factor combinations.


Introduction
Evapotranspiration (ET) plays a critical role in the terrestrial water cycle, accounting for approximately 60% of global precipitation consumption.ET encompasses the process of water vapor transitioning from the Earth's surface to the atmosphere, which includes evaporation from soil or water bodies, transpiration from vegetation, and evaporation of rainfall intercepted by vegetated surfaces [1].Crop transpiration is intricately related to physiological activities such as crop growth and the formation of photosynthetic products.Simultaneously, evaporation assists in dissipating the heat generated by the increase in near-surface temperature caused by radiation, thereby maintaining an optimal growth environment within the crop system [2].Generally, more than 90% of the agricultural water is consumed through ET globally [3].Accurate estimation of ET is beneficial for real-time monitoring of crop water use status, offering a basis for determining irrigation schedules, enhancing water use efficiency, and even predicting yields within agricultural fields [4,5].
Numerous methodologies have emerged to estimate terrestrial ET across various spatial scales, including hydrological modeling [6], empirical approaches [7], remote sensing inversion [8], and data-driven models.The hydrological method, which relies on the water balance principle for basin or sub-basin ET calculations, encounters challenges owing to uncertainties in the input and output data, model structure, initial conditions, and parameter settings, impacting simulation precision [9].Empirical, semi-empirical, and physical-mathematical formulations based on meteorological data offer alternatives, with selection contingent on data availability, resulting in varied simulation accuracies across geographical locations [10].Remote sensing for estimating ET presents distinct advantages in terms of accuracy and spatial resolution [11]; however, its limitation lies in the inability to provide continuous temporal values, which might not meet the temporal demands for irrigation and water resource management.Recently, data-driven models have been widely used for estimating ET owing to their remarkable capability of identifying intricate relationships.Machine learning (ML) techniques, which are characterized by their ability to handle complex relationships without prior knowledge or assumptions, have proven to be highly effective.Among them, the random forest (RF) algorithm has gained significant popularity in agricultural applications, such as land cover classification [12], water resources management [13], and crop yield prediction [14].Its extensive applicability can be attributed to its exceptional accuracy in both classification and regression tasks, with minimal parameter dependencies, efficient processing capabilities, and the ability to handle overfitting problems [15].In contrast, the support vector machine (SVM) model possesses a globally optimal solution and exhibits remarkable training efficiency.These qualities endow the SVM with enhanced robustness, efficiency, and reliability [16].The SVM focuses on establishing functional relationships between ET and explanatory variables without explicitly considering the underlying biophysical mechanisms [17], making it particularly suitable for short-term ET prediction.For example, Liu, et al. [18] achieved a remarkable explanatory power of 71-85% for global ET changes by utilizing only five indicators (average daily temperature, relative humidity, wind speed, solar radiation, and NDVI) as input variables for SVM.XGB is an emerging machine learning algorithm that offers versatility and scalability for modeling small-to medium-sized datasets, making it a popular choice for crop yield predictions because of its flexibility and adaptability [19].BP is centered around its application in training and testing using input variables, such as temperature, sunshine hours, and wind speed [20].Its extensive applicability surpasses that of traditional neural networks, as demonstrated in a study by Kumar,et al. [21], in which the BP model outperformed the traditional method in accurately predicting crop ET.Given the widespread application and promising prospects of these four machine learning algorithms for predicting cropland ET, a comprehensive comparative analysis is essential to assess their strengths, weaknesses, and universality across different input factor combinations and climatic conditions.
Obtaining parameters for the underlying surface of general croplands is challenging, and the availability of reliable meteorological data is often limited.Consequently, researchers have consistently aimed to increase the accuracy of ET predictions using a reduced number of variables.Previous studies have revealed that the dominant factors driving ET vary across different climatic regions, leading to differences in the ET simulation accuracy under various combinations of input factors.Pagano, et al. [22] compared the performance of multi-layer perceptron (MLP) and random forest (RF) in predicting daily ET in a citrus orchard typical of the Mediterranean ecosystem, highlighting the substantial influence of soil water content (SWC) and solar radiation (Rs) on ET prediction.Remarkably, even with a reduction in the number of input features to just four and a judicious selection of feature combinations, the machine learning models still achieved high accuracy in pre-dicting ET.Chen, et al. [23] utilized the fuzzy rough set algorithm (BSFL-FRSA) to discern both individual and multifactorial determinants of ET in evergreen needleleaf forests across three distinct climate zones in North America: the Mediterranean, warm summer continental, and subarctic regions.The study revealed the predominant factors driving ET and the most crucial combinations of multiple factors.Agricultural ecosystems are predominantly found in the boreal, temperate, subtropical-Mediterranean, and temperate-continental climatic regions [24].Hence, investigating the primary factors contributing to ET prediction within these climate zones and selecting optimal combinations of input factors can offer valuable insights and facilitate accurate ET assessments.
The objectives of this study were to (1) identify the important input factors deriving daily crop ET in different climatic regions; (2) explore the applicability of four machine learning models, RF, SVM, XGB, and BP, in predicting daily crop ET; and (3) evaluate the accuracy of these models using specific combinations of three, six, and nine input factors and recommend an optimal model for each climatic region.This study provides a convenient method to accurately simulate ET in farmlands across diverse climatic zones.

Description of the Flux Sites
In this study, 15 eddy covariance (EC) cropland flux towers located in three different climate zones were carefully selected.These sites included representative stations from the temperate-continental climate zone (TCCZ), featuring US-ARM, US-CRT, US-Ne1, US-Ne2, and US-Ne3.The subtropical-Mediterranean climate zone (SMCZ) included representative stations IT-BCi, IT-CA2, US-TW2, US-TW3, and US-TW, whereas the temperate climate zone (TCZ) comprised representative stations BE-Lon, CH-Oe2, DE-Geb, DE-Kli, and FR-Gri.Detailed information for each site is presented in Table 1.

Flux and Auxiliary
This study was based on the analysis of daily EC flux and meteorological information extracted from the FLUXNET Tier 2 dataset (http://fluxnet.fluxdata.org,accessed on 10 November 2023), which includes variables such as R n (W m −2 ), T a ( • C), soil temperature (T s , • C), VPD (hPa), sensible heat flux (H, W m −2 ), and latent heat flux (LE, W m −2 ).According to Allen, et al. [25], daily ET was derived from LE using the latent heat of vaporization as a function of T a by ET = LE 2.501−(2.361×10−3 )×T a .The soil water content (SWC) was measured at various depths at diverse sites, potentially failing to adequately represent the wetness or dryness of the soil, as soil properties vary across different sites.Here, we utilized the evaporative fraction (EF) to represent the degree of ground dryness and wetness [26,27] because an increase in energy allocation to evaporating water implies a greater potential water supply from the soil.The evaporative fraction (EF) is calculated as follows: where LE is the latent heat flux and H is the sensible heat flux.The primary source of data for capturing crop phenology information is the leaf area index (LAI).In our study, LAI data were acquired from the MODIS remote sensing product MODIS 15A2H (https://lpdaac.usgs.gov/products/mod15a2hv006/,accessed on 15 October 2023) with an 8-day interval and 500 × 500 m spatial resolution.The original LAI data were preprocessed using the TIMESAT software (version 3.3) to attenuate peak values and eliminate transient, unrealistic fluctuations due to factors such as cloud interference or the presence of snow and ice on the ground [28].Any gaps in the LAI values were filled using linear interpolation based on available nearby data points over time.Subsequently, cubic spline interpolation was applied to interpolate the 8-day LAI data, generating daily data that aligned with the requirements of our modeling inputs.This approach minimized data redundancy and ensured a consistent and high-quality dataset for our research.The statistical parameters of the environmental variables from the flux tower and LAI data across the different climatic regions are presented in Table 2. Observations illustrate that the mean T a varied from 11.43  Note: Environmental variables included daily mean air temperature (Ta), net radiation (Rn), soil heat flux (G), evaporative fraction (EF), leaf area index (LAI), photosynthetic photon flux density (PPFD), vapor pressure deficit (VPD), wind speed (U), and atmospheric pressure (P).The statistical information covers X mean (mean), X max (maximum), X min (minimum), X sd (standard deviation), X ku (kurtosis), and X sk (skewness) for each environmental variable.

Random Forest (RF)
A flowchart of the implementation of the applied machine learning models is shown in Figure 1. Figure 2 shows a flowchart of the four machine learning algorithms.RF is a supervised ensemble learning algorithm that was initially proposed by [29].Its primary objective is to generate accurate predictions without overfitting the data.The RF operates as a combination of tree predictors, with each tree depending on the values of the random vectors sampled independently and from the same distribution for all trees within the forest [13].Previous studies have shown that RF outperforms conventional approaches in estimating Eto, achieving a significant reduction in the obtained error by approximately half [30].After training, predictions for the unseen samples x can be made by averaging the predictions from all individual regression trees on x, as follows: where B is the number of trees, f b is the function obtained from training the b-th tree, and f is the final prediction value.

Support Vector Machine (SVM)
The SVM is recognized as a classical data-driven technique known for its robust ability to handle complex non-linear relationships between input and output variables [31].Owing to its strong capacity to solve intricate nonlinear problems, the SVM has been widely applied to simulate both ET0 [32,33] and ET [34,35].Additionally, research suggests that using the radial basis function (RBF) to transform the feature space yields highly accurate estimation results [36].Consequently, in this study, an SVM model based on the RBF function was used to predict the ET.The approximated function is expressed as follows: where  is the dimension of ,  is the weight vector, and  is a bias term.To determine the optimum  and , the target of the optimization problem can be expressed as follows:

Support Vector Machine (SVM)
The SVM is recognized as a classical data-driven technique known for its robust ability to handle complex non-linear relationships between input and output variables [31].Owing to its strong capacity to solve intricate nonlinear problems, the SVM has been widely applied to simulate both ET 0 [32,33] and ET [34,35].Additionally, research suggests that using the radial basis function (RBF) to transform the feature space yields highly accurate estimation results [36].Consequently, in this study, an SVM model based on the RBF function was used to predict the ET.The approximated function is expressed as follows: where M is the dimension of x, w is the weight vector, and b is a bias term.To determine the optimum w and b, the target of the optimization problem can be expressed as follows: where w denotes the normal vector of the hyperplane.

Extreme Gradient Boosting (XGB)
The XGB model is an enhanced version of the gradient boosting machines (GBMs) proposed by [37].Originating from the concept of "boosting", the XGB model combines predictions from a series of "weak" learners to create a "strong" learner using an additive training process [38].Recent studies have indicated that the XGB model is a promising alternative method for estimating the daily ET 0 [39].However, the specific performance of the direct application of XGB for simulating ET remains unclear.The general function for prediction at step t is as follows: where x i is the input variable, and f t (x i ) and f t i are the learner and predictions at step t, respectively.

Backpropagation Neural Network (BP)
An artificial neural network (ANN) is a well-established supervised learning method known for its exceptional capability to extract nonlinear features from gathered data, making it a widely used modeling tool [40].As a result, ANNs have found extensive applications in estimating ET 0 [41,42].Backpropagation, a gradient descent method, is commonly used among various algorithms proposed for training the ANN method.Backpropagation involves calculating the gradient of the error with respect to the weights for a given input by propagating the error backward from the output layer to the hidden layer and then to the input layer [43].The backpropagation algorithm was adopted in this study, and a flowchart of the backpropagation algorithm is illustrated in Figure 2.During the error backpropagation process, the weights and biases are modified at each iteration by minimizing an error metric that quantifies the disparity between the produced output and the desired output, which can be expressed as follows: where W n+1 and W n represent the weight matrix during iterations n and n + 1, respectively, in the iterative training process.∆W denotes the adjusting weight matrix responsible for controlling the convergence rate and the computational complexity.The error minimization process was repeated until a satisfactory convergence criterion was obtained: where y i is the final output of the ANN model, and t i is the measured output.

Model Development
This study develops four machine learning models (RF, SVM, XGB, and BP) to simulate and predict daily cropland ET using daily EC flux and meteorological information.By inputting different variable combinations, the models' predictive potentials are explored and variable combinations are compared.The dataset was split into training and testing sets.At each flux tower site, 80% of the time series data was used for model training, and the remaining 20% for testing, allowing for an investigation into the predictive performance of machine learning models at specific sites and, further, within specific climatic regions.All machine learning model development and statistical computations were conducted using R version 4.0.5 [44].Four machine learning models utilized in this study, along with their corresponding hyperparameters, can be found in the Supplementary Materials.

Evaluating Indicators
The root mean square error (RMSE) and mean absolute error (MAE) were used to check the accuracy of the models, whereas the determination coefficient (R 2 ) and the Nash-Sutcliffe efficiency coefficient (NSE) are measurements of generalizability when comparing the performances of the four machine learning models under different input combinations [45][46][47].
where x i is the predicted value of ET, y i is the measured value of ET, x i and y i are the corresponding average values of x i and y i , subscript i refers to the number of datasets, n is the length of the dataset.Larger R 2 and NSE values, along with smaller RMSE and MAE values, signify a better fit in the context of modeling.In addition, a global performance indicator (GPI), which normalizes the four different metrics as one, was used to comprehensively evaluate the model performance [48,49].
where a j is a coefficient (1 for RMSE and MAE and −1 for R 2 and NSE), g j represents the median of the scaled values of statistical indicator j, and y ij represents the scaled value of statistical indicator j for model i.A higher GPI value implies superior model performance.

The Overall Performance of Four Machine Learning Models in Simulating ET
In certain regions, simultaneously obtaining a complete set of input data for ET simulations is a challenge.To reduce the number of input variables while ensuring the accuracy of the ET simulation, we selected variables that demonstrated stronger correlations with ET for both three-factor and six-factor input combinations.The initial correlation coefficients were computed independently using the input factor data from various climate stations and subsequently merging the data from all stations to produce the overall correlation coefficients, as shown in Table 3.The outcomes of this comprehensive data integration revealed R n , PPFD, and EF to be strongly correlated with ET, thus forming a three-factor input combination.Additionally, T a , LAI, and VPD were strongly correlated with ET, extending this input combination to six factors.Using all available input data enabled the creation of a nine-factor input combination, as detailed in Table 4. Table 4. Three-factor combinations were obtained from the correlation analysis of ET using data from all sites, along with their combination with the RF, SVM, XGB, and BP models.
The simulation performances of the four models at different represented sites across various climate zones are illustrated in Figure 3.The sites were chosen based on the availability of the highest volume of valid data within the respective climate zone.The consistent results between the simulated and actual ET demonstrated their capacity to simulate ET with three different input factor combinations. Figure 4 presents the simulation results of ET using the RF, SVM, XGB, and BP models with three-factor input combinations (V3, V6, and V9).For the RF-V3, SVM-V3, XGB-V3, and BP-V3 models, the average RMSE values were 0.738, 0.774, 0.780, and 0.892 mm d −1 , respectively.The average MAE values were 0.547, 0.573, 0.576, and 0.627 mm d −1 , respectively.Correspondingly, the average R 2 values were 0.618, 0.622, 0.597, and 0.570, whereas the average NSE values were 0.378, 0.256, 0.272, and −0.161, respectively.According to the results presented in Table 5, RF-V3 exhibited a higher prediction accuracy for ET than the other models, with a GPI value of −2.376.The same pattern of results emerged with the six-and nine-factor input combinations, in which the GPI values for SVM-V6 and SVM-V9 were 0.548 and 0.561, respectively, demonstrating their superior accuracy in simulating ET among the six-and nine-factor input combinations.Table 5.The GPI values of machine learning models for ET simulations at different stations using different input combinations.

Performance of Four Machine Learning Models in Simulating ET in Different Climatic Regions
Table 3 presents the results of the correlation analysis of ET using data collected from stations in three distinct climate zones.Using the same variable grouping described in Section 3.1, three input factor combinations for these climate zones were derived, and four machine learning models were used to simulate ET in the TCCZ, SMCZ, and TCZ climate zones, as shown in Table 7. Across specific input factor combinations in each climate zone, the SVM model consistently demonstrated superior accuracy in the ET simulation, with average RMSE values of 0.312, 0.387, and 0.460 mm d −1 for TCCZ, SMCZ, and TCZ, respectively.The average MAE values were 0.218, 0.275, and 0.332 mm d −1 , respectively.The   The overall simulation results for ET across various sites under three different input combinations are presented in Table 6.Considering all input factor combinations, the average RMSE values for the RF, SVM, XGB, and BP models were 0.527, 0.488, 0.545, and 0.551 mm d −1 , respectively.The average MAE values were 0.388, 0.356, 0.396, and 0.390 mm/d, respectively.Concurrently, the average R 2 values were 0.771, 0.804, 0.749, and 0.768, whereas the average NSE values were 0.633, 0.659, 0.588, and 0.491, respectively.Consequently, the SVM model outperformed the other models in terms of the overall ET simulation across all climate zones.Table 6.Three combinations were obtained from the correlation analysis of ET using data from all sites, along with their combination with the model for simulating ET.

Performance of Four Machine Learning Models in Simulating ET in Different Climatic Regions
Table 3 presents the results of the correlation analysis of ET using data collected from stations in three distinct climate zones.Using the same variable grouping described in Section 3.1, three input factor combinations for these climate zones were derived, and four machine learning models were used to simulate ET in the TCCZ, SMCZ, and TCZ climate zones, as shown in Table 7. Across specific input factor combinations in each climate zone, the SVM model consistently demonstrated superior accuracy in the ET simulation, with average RMSE values of 0.312, 0.387, and 0.460 mm d −1 for TCCZ, SMCZ, and TCZ, respectively.The average MAE values were 0.218, 0.275, and 0.332 mm d −1 , respectively.

Runtime Analysis of Four Machine Learning Models in ET Simulation
In addition to seeking higher accuracy, the computational runtime of machine learning models is also a crucial consideration in ET simulations.As shown in Figure 6, the runtime for the RF, SVM, XGB, and BP models ranged from 6.77 to 12.83 s, 1.24 to 1.85 s, 0.37 to 0.46 s, and 4.33 to 7.07 s using data merged from all stations, respectively.Using

Runtime Analysis of Four Machine Learning Models in ET Simulation
In addition to seeking higher accuracy, the computational runtime of machine learning models is also a crucial consideration in ET simulations.As shown in Figure 6, the runtime for the RF, SVM, XGB, and BP models ranged from 6.77 to 12.83 s, 1.24 to 1.85 s, 0.37 to 0.46 s, and 4.33 to 7.07 s using data merged from all stations, respectively.Using data exclusively from stations within distinct climate zones, the runtime for the RF, SVM, XGB, and BP models spans from 7.45 to 12.96 s, 1.72 to 2.46 s, 1.60 to 1.87 s, and 3.92 to 6.70 s, respectively.The differentiation of climate zones for the input factor combinations had a negligible impact on the runtime performance of the models.It is noteworthy that, among the four models, both the RF and XGB models showed an increasing trend in runtime with an increase in the number of input factors.This trend was particularly pronounced in the RF model.Conversely, the SVM and BP models exhibited a fluctuating trend in the runtime as the number of input factors increased.In the comprehensive assessment of all models and their respective input factor combinations, RF*-V9 boasts the longest runtime at 12.96 s, whereas XGB-V3 displayed the shortest runtime at 0.37 s.For clarity, RF* is defined as the sum of RFC, RFS, and RFT, with this naming convention uniformly applicable to the other models.

Importance of Input Factors for Simulating Evapotranspiration
Based on a correlation analysis of various input variables with ET, Rn was identified as the most crucial input factor for simulating ET.Subsequently, photosynthetic photon flux density (PPFD) has emerged as the next significant factor, both of which are intricately related to energy absorption.The energy required for water vapor evaporation is derived from the radiation, thus establishing Rn as a pivotal driver of ET, especially in non-moisture-restricted areas [50].In addition, numerous studies have proved that variations in light intensity can affect plant photosynthesis, leaf area morphology, and For clarity, RF* is defined as the sum of RF C , RF S , and RF T , with this naming convention uniformly applicable to the other models.

Importance of Input Factors for Simulating Evapotranspiration
Based on a correlation analysis of various input variables with ET, R n was identified as the most crucial input factor for simulating ET.Subsequently, photosynthetic photon flux density (PPFD) has emerged as the next significant factor, both of which are intricately related to energy absorption.The energy required for water vapor evaporation is derived from the radiation, thus establishing R n as a pivotal driver of ET, especially in non-moisturerestricted areas [50].In addition, numerous studies have proved that variations in light intensity can affect plant photosynthesis, leaf area morphology, and radiation absorption.Light also directly affects the stomatal opening, which is an important channel for water vapor diffusion [51].
In addition to the energy term, our study found that LAI played an important role in ET simulation, and there was a stronger correlation between LAI and ET in the TCCZ than that in the TCZ or SMCZ, ranking at the top (Table 4).The LAI influences the ground energy reception by affecting the sensible heat flux and radiation.This impact is particularly pronounced when the LAI is low and diminishes as the LAI surpasses a certain threshold.Current studies have found that the relationships between environmental factors and ET were mediated by leaf area [52], and the regulatory effect on ET was significantly different before and after an LAI close to 1 (1.2~1.5 m 2 m −2 ).When the canopy cover is full, the increase in canopy cover is not apparently equal to the increase in LAI; therefore, the intercepted energy does not increase significantly.Therefore, beyond this threshold, changes in LAI have a diminishing effect on ET [53].However, in TCZ and TCCZ, the correlation was relatively weaker.By comparing the average LAI values for different climatic zones, as displayed in Table 2, it was found that the LAI in the TCZ was greater than that in the SMCZ and TCCZ, further confirming the mechanism of the impact of LAI on ET.As an important component of the water cycle, ET involves numerous complex energy exchanges.In particular, evaporation is the movement of water into the air and can readily lead to changes in the air temperature.Conversely, variations in the daily average temperature of agricultural fields can reflect the intensity and rate of energy exchange in the atmosphere.Thus, the daily mean air temperature (T a ) also demonstrates a strong correlation with ET [54].
Although EF was considered a crucial factor for quantifying surface water deficits and the water cycle, and the estimation of ET based on a specific daytime EF is considered a favorable method [27,55], EF did not demonstrate the same universally strong correlation across all climatic regions as R n and PPFD in the present study.Further investigation of previous research has revealed that EF demonstrates higher sensitivity to land surface moisture conditions in arid regions, whereas it is less sensitive in relatively humid areas [56].On one hand, this was probably due to the generally humid climate in agricultural areas; on the other hand, it may be influenced by irrigation practices in agricultural areas, which can affect the land surface moisture conditions at flux measurement sites.Therefore, when considering the number of required input factors, these relatively more important factors should be prioritized.

Optimal Input Factor Combinations and Machine Learning Models for Simulating Evapotranspiration
The four machine learning models used in this study demonstrated relatively good simulation accuracy.After obtaining the input factor combinations from the correlation analysis of ET using data from sites within each respective climate zone, the simulation accuracy of the three-and six-factor combinations of each model was significantly improved, indicating that it is meaningful to consider the differences and impacts of climate zones when simulating cropland daily ET.Given the high simulation accuracy across all the models, each model had suitable scenarios and characteristics.RF models often achieve better simulation accuracy than other models when trained with fewer input variables.However, they also had the longest execution times among the four models [57].When conducting ET simulations with fewer input variables and when the importance of the time cost is low or not considered, the RF model can be prioritized.The SVM consistently demonstrated better modeling performance for ET than the other three models across most combinations of input variables and various climate zones.In the comparison without distinguishing climate zones and input factor combinations, SVM-V6 and SVM-V9 were the optimal six-factor and nine-factor input combination models.In the comparison where input factor combinations were divided by climate zones, overall, the combined performance of SVM models with three different input factor combinations (SVM*-V3 and SVM*-V6) was the best.Additionally, SVM models exhibited faster single-model runtimes (significantly lower than those of the RF and BP models).The XGB model has a notable advantage in terms of computational time and efficiency over other models [37], which makes it more suitable for addressing real-time prediction problems, even though the simulation accuracy of the XGB model for ET is not particularly outstanding among these machine learning models.However, the simulation accuracy of the BP model is unstable across different climatic regions, and its generalization ability is relatively average [58], particularly when dealing with very limited or extremely large datasets, and the performance of the BP model is not as strong as that of other models.In some cases, the backpropagation algorithm may become stuck at a local optimum [59].Additionally, the BP model had a relatively long single-model runtime (only slightly faster than that of the RF model).However, when the dataset size is moderate, the BP model may achieve high accuracy.Therefore, to attain excellent simulation accuracy with the BP model, a substantial amount of data is required to pre-determine the optimal local input variables for each site, which may require a significant amount of time and effort.Considering the simulation accuracy of the model and the runtime under various conditions, this study suggests that the SVM model is the preferred choice for simulating daily cropland ET.
Considering three different combinations of input factors, we observed differences in the adaptability of input factor combinations across climatic zones.By comparing the performance of machine learning models using the three-factor combination (V3) in different climatic zones, three out of four of the models exhibited better performance in the SMCZ.Similar results were also observed with the nine-factor combination (V9), which showed better performance in the TCCZ (Table 9).Additionally, the four machine learning models generally performed well in both V6 and V9 input factor combinations.This indicates that the predictive accuracy of the models improved with the increase in input factors.However, the simulation accuracy of the V9 input factor combination did not show significant superiority over that of the V6 input factor combination, and at some sites, it performed worse for certain site data.Therefore, considering that the V9 input factor combination requires 50% more input factors than the V6 input factor combination while achieving similar simulation accuracy, this study suggests that V6 is the most economical input factor combination for situations with limited meteorological data.

Uncertainties
Estimating uncertainty for EC data is challenging, and the presence of missing data in the dataset can lead to discontinuities in the data time series, potentially increasing the difficulty of accurately predicting ET using machine learning models.Considering the complexity of relationships between variables, future research may explore the application of some deep learning models for agricultural evapotranspiration prediction.Additionally, interpolating MODIS LAI from 8-day periods to daily using cubic spline interpolation may introduce uncertainty into the models.However, the daily scale LAI interpolated from 8-day data may be beneficial to improve the simulation accuracy of the machine learning used in this study.The daily scale LAI is consistent with the input meteorological data on the time scale, and it can better quantify the impact of vegetation change on ET, especially in the period when the vegetation changed dramatically (generally the rapid development stage of crops).Directly conducting correlation analysis on the original time series dataset can result in non-independent impacts of input variables on ET (ideally, analyzing the impact of a certain variable should ensure that other variables remain constant, which is not achievable in reality).This can lead to larger underestimations of variables (such as U and P) ranked lower in correlation analysis, and the uncertainty in flux tower measurement data may exacerbate this underestimation.However, its impact on our selection of three-factor and six-factor input variables (V3 and V6) is minimal.As far as the representativeness of the EC sites selected in this study, we have to admit that not all EC flux sites in different climate zones could be included due to limitations of accessibility and openness of data.In the future, extending field ET predictions to flux tower sites in other climatic zones beyond those mentioned in this study (e.g., the Boreal climatic zone) could be considered.Due to the limited representation of site numbers in this study, not all relevant climatic zones were investigated, although most flux tower station data located in agricultural areas were included.

Conclusions
This study used meteorological and remote sensing data from diverse agricultural sites in distinct climatic regions to simulate ET using four machine learning models with three different input combinations.The key findings are summarized as follows: (1) R n , PPFD, LAI, EF, and T a emerged as pivotal factors influencing daily ET in agricultural areas, and they all exhibited a relatively strong correlation with ET across various climate zones; (2) all four machine learning models yielded satisfactory simulation performance, with the SVM model demonstrating the best simulation performance, particularly when considering the influence of climate zones on ET simulation; (3) the predictive ET accuracy of three-factor combinations (V3) was improved with the inclusion of more input factors.However, considering both predictive accuracy and input factor efficiency, the V6 input factor combination is recommended as the preferred choice; (4) in climate zones with a lower average LAI, LAI was more crucial for ET predictions than in climate zones with a higher average LAI (TCZ, SMCZ), highlighting the need for region-specific considerations when selecting machine learning models and input factor combinations.

22 Figure 1 .
Figure 1.Flowchart of machine learning models applied to evapotranspiration simulation.Figure 1. Flowchart of machine learning models applied to evapotranspiration simulation.

Figure 1 .
Figure 1.Flowchart of machine learning models applied to evapotranspiration simulation.Figure 1. Flowchart of machine learning models applied to evapotranspiration simulation.

Figure 1 .
Figure 1.Flowchart of machine learning models applied to evapotranspiration simulation.

Figure 2 .
Figure 2. Flowchart of the four machine learning algorithms.

Figure 2 .
Figure 2. Flowchart of the four machine learning algorithms.

Figure 3 .
Figure 3. Relationship between simulated evapotranspiration using four machine learning models with three input combinations and measurements at different represented sites located in the temperate-continental (TCCZ), subtropical-Mediterranean (SMCZ), and temperate (TCZ) climate zones.

Figure 3 .
Figure 3. Relationship between simulated evapotranspiration using four machine learning models with three input combinations and measurements at different represented sites located in the temperate-continental (TCCZ), subtropical-Mediterranean (SMCZ), and temperate (TCZ) climate zones.

Figure 4 .
Figure 4. Overall ET simulation accuracy across the 15 EC cropland sites.In the box plot, symbol x represents the mean, the circles represent the 25 th and 75 th percentile values, and the horizontal line in the center represents the median value.

Figure 4 .
Figure 4. Overall ET simulation accuracy across the 15 EC cropland sites.In the box plot, symbol x represents the mean, the circles represent the 25th and 75th percentile values, and the horizontal line in the center represents the median value.

Figure 5 .
Figure 5. Accuracy of ET simulation using three different combinations of input factors with the SVM model in three distinct climate zones.

Figure 5 .
Figure 5. Accuracy of ET simulation using three different combinations of input factors with the SVM model in three distinct climate zones.

Figure 6 .
Figure 6.Single-run time for training and simulation with all site data under various input factor combinations.The divisions of input factor combinations were conducted for correlation analysis on evapotranspiration utilizing two distinct datasets: all site data (represented by RF, SVM, XGB, and BP) and site data located in different climatic zones (represented by RF*, SVM*, XGB*, and BP*).For clarity, RF* is defined as the sum of RFC, RFS, and RFT, with this naming convention uniformly applicable to the other models.

Figure 6 .
Figure 6.Single-run time for training and simulation with all site data under various input factor combinations.The divisions of input factor combinations were conducted for correlation analysis on evapotranspiration utilizing two distinct datasets: all site data (represented by RF, SVM, XGB, and BP) and site data located in different climatic zones (represented by RF*, SVM*, XGB*, and BP*).For clarity, RF* is defined as the sum of RF C , RF S , and RF T , with this naming convention uniformly applicable to the other models.

Table 1 .
Site characteristics used in this study.
• C in TCCZ to 16.19 • C in SMCZ; the mean R n ranged from 86.37 W m −2 in TCZ to 113.87 W m −2 in SMCZ; the mean ET fluctuated between 1.62 mm d −1 in TCZ and 2.32 mm d −1 in SMCZ; the mean LAI changed from 0.54 m 2 /m 2 in TCCZ to 1.91 m 2 /m 2 in TCZ.Additional statistical characteristics of each variable are comprehensively provided.

Table 2 .
Statistical parameters of flux tower measured environmental variables and remote sensing data during the entire study period at the four sites.

Table 3 .
Pearson correlation coefficients between input factors and evapotranspiration (ET).

Table 5 .
The GPI values of machine learning models for ET simulations at different stations using different input combinations.

Table 9 .
The GPI values of the four machine learning models for evapotranspiration simulation using different input combinations across different climate zones.
Note: The best model is in bold.Remote Sens. 2024, 16, x FOR PEER REVIEW 15 of 22