Article

Prediction of Reference Crop Evapotranspiration in China’s Climatic Regions Using Optimized Machine Learning Models

1 College of Water Resources and Hydropower, Sichuan Agricultural University, Ya'an 625014, China
2 State Key Laboratory of Hydraulics and Mountain River Engineering & College of Water Resource and Hydropower, Sichuan University, Chengdu 610065, China
3 Jianyang Xintianfu Agricultural Technology Co., Ltd., Jianyang 641400, China
* Author to whom correspondence should be addressed.
Water 2024, 16(23), 3349; https://doi.org/10.3390/w16233349
Submission received: 2 October 2024 / Revised: 31 October 2024 / Accepted: 15 November 2024 / Published: 21 November 2024

Abstract:
The accurate estimation of reference crop evapotranspiration (ET0) is essential for crop water consumption modeling and agricultural water resource management. In the present study, three bionic algorithms (aquila optimizer (AO), tuna swarm optimization (TSO), and sparrow search algorithm (SSA)) were combined with an extreme learning machine (ELM) model to form three hybrid models (AO-ELM, TSO-ELM, and SSA-ELM). The accuracy of the ET0 estimates for five climate regions in China from 1970 to 2019 was evaluated against the FAO-56 Penman–Monteith (P-M) equation. The results showed that the predicted values of the three hybrid models and the ELM model fitted the P-M calculated values well. R2 and RMSE were 0.7654–0.9864 and 0.1271–0.7842 mm·d−1, respectively, and the AO-ELM model achieved the highest prediction accuracy. The performance of AO-ELM with combination 5 (maximum temperature (Tmax), minimum temperature (Tmin), total solar radiation (Rs), and sunshine duration (n)) showed the greatest improvement over the ELM model. The prediction accuracy for the stations in the plateau mountain climate (PMC) region was the best, while the prediction accuracy for the stations in the tropical monsoon climate (TPMC) region was the worst. Except in the temperate continental climate (TCC) region, where wind speed (U2) was the variable with the greatest influence on ET0, n, Ra, and Rs were more important than relative humidity (RH) and U2 in predicting ET0. Therefore, when determining the best model for each climate region with limited meteorological data, AO-ELM4 (with Tmax, Tmin, Rs, and U2 as inputs) was selected for the TCC region and AO-ELM5 (with Tmax, Tmin, Rs, and n as inputs) was selected for the TMC, PMC, SMC, and TPMC regions.

1. Introduction

Reference crop evapotranspiration (ET0) is the sum of crop transpiration and soil evaporation under conditions of uniform, vigorous growth, complete ground coverage, and adequate water supply. It represents the exchange of gas, energy, and water between the soil, crops, and the atmosphere [1]. The accurate measurement and estimation of regional ET0 are crucial for simulating crop water consumption, scheduling irrigation, and managing agricultural water resources [2].
There are several methods for calculating ET0, including the Hargreaves, FAO-56 Penman–Monteith (P-M), and Priestley–Taylor models. The most widely accepted method is the FAO-56 P-M model, recommended by the Food and Agriculture Organization of the United Nations (FAO). This model combines radiation and aerodynamic terms of meteorological factors [3] and has been extensively validated globally [4,5,6]. However, the P-M model requires comprehensive meteorological data, which are often unavailable due to the limited development and insufficient investment in many regions. As a result, empirical models that can accurately estimate ET0 with minimal input data, such as temperature, radiation, and mass transfer models, have been developed and applied [7].
With the advancement in computing power and the integration of machine learning algorithms in agriculture, researchers have increasingly used various machine learning models to predict ET0. Examples include the artificial neural network model (ANN) [8,9], support vector machine model (SVM) [10,11,12,13], generalized regression neural network model (GRNN) [14,15], and random forest model (RF) [16,17]. These models have demonstrated high accuracy in predicting ET0. For instance, Wen et al. [13] used an SVM model to predict ET0 in arid regions with limited data and found it to outperform three empirical models. Studies have consistently shown that machine learning models provide more accurate ET0 predictions compared to empirical models [18,19,20,21,22].
Recently, the extreme learning machine model (ELM) has gained popularity due to its fast computation speed and high performance [23,24,25,26,27]. Abdullah et al. [28] found the ELM model to be more efficient and faster than the feedforward backpropagation (FFBP) model and the P-M equation in predicting ET0 in Iraq. Feng et al. [15,24] and Tejada et al. [12] also reported that the ELM model outperformed other machine learning models.
As research has advanced, scholars have observed that while the extreme learning machine (ELM) model can achieve high accuracy in predicting ET0, it suffers from issues such as local optimal solutions and slow convergence rates during simulations. These problems reduce the model’s portability across different regions. To address these limitations, researchers have employed various bio-inspired algorithms to optimize the initial weights and thresholds of the ELM model. This optimization aims to mitigate existing model issues, further enhance prediction accuracy, and improve overall predictive performance [8,29]. For instance, Liu et al. [30] employed a hybrid model combining ELM with a genetic algorithm (GA-ELM) to forecast ET0 in southwestern China. Their findings indicated that the GA-ELM model outperformed both the standalone ELM and empirical models during both training and testing phases. Similarly, Zhu et al. [31] compared the PSO-ELM model with the ELM, ANN, and RF models and six empirical models based on three input modes of radiation, temperature, and mass transfer and found that the machine learning model had more accurate ET0 predictions than the empirical models, with the PSO-ELM model surpassing other machine learning approaches in performance.
Although many studies have demonstrated that combining the ELM model with bio-inspired algorithms can yield excellent predictions of ET0, these applications have largely been limited to traditional optimization algorithms such as genetic algorithm (GA), particle swarm optimization (PSO), and whale optimization algorithm (WOA). Over time, several novel bio-inspired algorithms have emerged, including tuna swarm optimization (TSO) [32], aquila optimizer (AO) [33], and sparrow search algorithm (SSA) [34]. However, due to the scarcity of research on the application of these new algorithms in various climate regions across China, their performance remains largely unexplored. Consequently, there are insufficient data to assess the advantages and limitations of these algorithms. To address this gap, the current study proposes the integration of three novel bio-inspired algorithms—TSO, AO, and SSA—with the ELM model. The resulting hybrid algorithms, TSO-ELM, AO-ELM, and SSA-ELM, are applied to simulate and predict the ET0 at 20 meteorological stations across five distinct climate regions in China, covering the period from 1970 to 2019.
This study aims to combine three new bionic algorithms (TSO, AO, and SSA) with the ELM model to simulate and predict ET0 across 20 meteorological stations in five Chinese climate regions from 1970 to 2019. The primary objectives are as follows: (1) to analyze the performance of these models under different meteorological factors; (2) to compare the prediction accuracy of the TSO-ELM, AO-ELM, and SSA-ELM models with the standard ELM model; and (3) to recommend reliable forecasting models and meteorological factor input combinations for different Chinese climate regions.

2. Materials and Methods

2.1. Experimental Site

China’s diverse climate is categorized into five distinct climatic regions based on variations in temperature, altitude, and precipitation (Figure 1). These regions include the temperate continental climate region (TCC), temperate monsoon climate region (TMC), plateau mountain climate region (PMC), subtropical monsoon climate region (SMC), and tropical monsoon climate region (TPMC) [35]. The TCC and PMC regions are characterized as arid and semi-arid, with average annual precipitation levels of 285 mm and 382 mm, respectively. The average annual evaporation in these regions is 2148 mm for the TCC region and 1883 mm for the PMC region. In contrast, the TMC region is classified as sub-humid, with an average annual precipitation of 648 mm and an average annual evaporation of 1475 mm. The SMC and TPMC regions are considered humid, with annual precipitation levels of 1538 mm and 1964 mm, respectively. The average annual evaporation in these humid regions is 1545 mm for the SMC region and 1175 mm for the TPMC region.

2.2. Data Collection and Analysis

This study selected 20 representative weather stations with comprehensive and easily accessible meteorological data to capture the climatic characteristics of each region. The selected data spanned from 1970 to 2019 and included the following meteorological variables as input factors for analysis: daily maximum/minimum temperature (Tmax/Tmin), atmospheric relative humidity (RH), wind speed at a height of 2 m (U2), sunshine hours (n), total solar radiation (Rs), and atmospheric radiation (Ra). The performance of the hybrid machine learning models, optimized through various bionic algorithms, was evaluated using these input factors.
Table 1 presents the geographical coordinates (longitude and latitude), altitude, and multi-year average meteorological data for the selected weather stations. The wind speed at a height of 10 m (U10) was converted to a 2 m height equivalent (U2) using established wind profile relationships. The average temperature (Tmean) was calculated as the average of Tmax and Tmin [1]. The meteorological data utilized in this study were sourced from the National Meteorological Information Center–China Meteorological Data Network “https://data.cma.cn/ (accessed on 15 July 2023)”, ensuring data authenticity and validity.
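As a minimal illustration of this preprocessing step, the sketch below converts archived 10 m wind speeds to the 2 m equivalent using the standard FAO-56 logarithmic wind profile relationship and computes Tmean as the average of Tmax and Tmin; the function names are illustrative and not taken from the authors' code.

```python
import numpy as np

def wind_speed_2m(u_z: np.ndarray, z: float = 10.0) -> np.ndarray:
    """Convert wind speed measured at height z (m) to the 2 m equivalent U2
    using the FAO-56 logarithmic wind profile relationship (Allen et al. [1])."""
    return u_z * 4.87 / np.log(67.8 * z - 5.42)

def mean_temperature(t_max: np.ndarray, t_min: np.ndarray) -> np.ndarray:
    """Daily mean temperature as the average of the daily maximum and minimum."""
    return (t_max + t_min) / 2.0

# Example: a 10 m wind speed of 2.0 m/s corresponds to roughly 1.5 m/s at 2 m.
print(wind_speed_2m(np.array([2.0])), mean_temperature(np.array([18.3]), np.array([6.1])))
```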
In cases where meteorological data were missing, an interpolation method was employed to estimate and fill in the missing records, thereby maintaining data integrity. Given the extensive time span and volume of the data, the meteorological records from 1970 to 2019 were divided into training and test datasets. The training dataset comprised 40 years of meteorological data, whereas the test dataset included 10 years of data. This division allowed for a robust assessment of the model’s performance across different temporal scales.

2.3. Extreme Learning Machine Model (ELM)

To address the challenges of slow convergence and lengthy iteration times associated with traditional machine learning models, the extreme learning machine (ELM) was proposed as a learning algorithm based on a single-hidden-layer feedforward neural network [36]. The user only needs to specify the number of hidden-layer nodes; all the hidden-layer parameters are generated randomly, and the weights of the output layer are determined by the least-squares method [37]. The ELM algorithm is favored by many researchers due to its short computation times, strong nonlinear approximation capability, and robust learning performance.
Figure 2 illustrates the topology of the ELM, which consists of an input layer, a hidden layer, and an output layer. In this network model, the input sample is represented by x, and the output of the hidden layer is denoted as H (x). The calculation formula for the output H (x) of the hidden layer is expressed as follows:
H(x) = [h_1(x), \ldots, h_L(x)]
The input value is multiplied by the corresponding weight, adjusted by a bias term, and subsequently processed through a nonlinear function node to obtain the hidden layer’s output. Here, hi (x) denotes the output of the i-th hidden node, which can be formulated as follows:
h_i(x) = g(w_i, b_i, x) = g(w_i \cdot x + b_i), \quad w_i \in \mathbb{R}^D, \; b_i \in \mathbb{R}
In this expression, g(w_i, b_i, x) serves as the activation function. For this study, the Sigmoid function is employed, defined as:
g(x) = \dfrac{1}{1 + e^{-x}} = \dfrac{e^x}{e^x + 1}
Finally, the output from the output layer is given by:
f_L(x) = \sum_{i=1}^{L} \beta_i h_i(x) = H(x)\beta
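The following minimal Python sketch illustrates the ELM procedure described above (random hidden-layer parameters, Sigmoid activation, least-squares output weights). It is illustrative only; the class and parameter names are ours and not taken from the authors' implementation.

```python
import numpy as np

class ELM:
    """Minimal extreme learning machine sketch: random hidden layer, least-squares output weights."""

    def __init__(self, n_hidden: int = 20, seed: int = 0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X: np.ndarray) -> np.ndarray:
        # h_i(x) = g(w_i . x + b_i) with the Sigmoid activation g
        return 1.0 / (1.0 + np.exp(-(X @ self.W.T + self.b)))

    def fit(self, X: np.ndarray, y: np.ndarray) -> "ELM":
        # Randomly generate all hidden-layer parameters (w_i, b_i) ...
        self.W = self.rng.normal(size=(self.n_hidden, X.shape[1]))
        self.b = self.rng.normal(size=self.n_hidden)
        # ... then solve for the output weights beta by least squares: beta = H^+ y
        self.beta = np.linalg.pinv(self._hidden(X)) @ y
        return self

    def predict(self, X: np.ndarray) -> np.ndarray:
        return self._hidden(X) @ self.beta
```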

2.4. Novel Bionic Algorithm

In this study, the sparrow search algorithm, the tuna swarm optimization algorithm, and the aquila optimizer were employed to optimize the initial weights and thresholds of the ELM model. The primary aim of this optimization is to enhance the overall performance of the ELM model.
Figure 3 illustrates the entire process of the ELM model, which includes the input of meteorological factors, the optimization stage utilizing the three bionic algorithms, and the final model output. The schematic principles of the three bionic algorithms are further detailed in Figure 4.
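As a hedged sketch of how such a coupling can work, the function below defines the objective that a bionic algorithm would minimize: it decodes candidate hidden-layer weights and biases (the "initial weights and thresholds") from a flat vector, solves the ELM output weights by least squares, and returns the validation RMSE. The encoding and variable names are illustrative assumptions, since the paper does not specify them.

```python
import numpy as np

def elm_fitness(theta, X_tr, y_tr, X_val, y_val, n_hidden: int, n_feat: int) -> float:
    """Fitness minimised by the bionic algorithm: validation RMSE of an ELM whose
    hidden-layer weights and biases are decoded from the candidate vector theta.
    (Hypothetical encoding used for illustration only.)"""
    W = theta[: n_hidden * n_feat].reshape(n_hidden, n_feat)
    b = theta[n_hidden * n_feat:]
    H_tr = 1.0 / (1.0 + np.exp(-(X_tr @ W.T + b)))       # hidden outputs, Sigmoid activation
    beta = np.linalg.pinv(H_tr) @ y_tr                    # least-squares output weights
    H_val = 1.0 / (1.0 + np.exp(-(X_val @ W.T + b)))
    pred = H_val @ beta
    return float(np.sqrt(np.mean((pred - y_val) ** 2)))   # RMSE to be minimised
```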

2.4.1. Sparrow Search Algorithm (SSA)

The sparrow search algorithm was first proposed by Xue Jiankai in 2020 [34]. Inspired by the foraging and anti-predation behaviors of sparrows, the individuals within the population are categorized into two roles: discoverers and followers. This categorization mimics the real foraging scenario to search for the optimal value. The search process is conducted through the following steps:
1. Initialization of sparrow population:
Suppose there are n sparrows in the population; the fitness values of the initial population X are represented by the matrix F_X:
F_X = f(X) = \begin{bmatrix} f([x_{1,1}, x_{1,2}, \ldots, x_{1,d}]) \\ f([x_{2,1}, x_{2,2}, \ldots, x_{2,d}]) \\ \vdots \\ f([x_{n,1}, x_{n,2}, \ldots, x_{n,d}]) \end{bmatrix}
The fitness values of the sparrow population are then sorted from largest to smallest to obtain the optimal fitness value of the sparrow. Subsequently, the initially generated sparrow population is updated according to the position of the discoverer, follower, and danger warning. The formulas for these updates are as follows:
2. Discoverer Update:
X_{i,j}^{t+1} = \begin{cases} X_{i,j}^{t} \cdot \exp\!\left(\dfrac{-i}{\alpha \cdot iter_{\max}}\right), & R_2 < ST \\ X_{i,j}^{t} + Q \cdot L, & R_2 \ge ST \end{cases}
3. Follower Update:
X_{i,j}^{t+1} = \begin{cases} Q \cdot \exp\!\left(\dfrac{X_{worst}^{t} - X_{i,j}^{t}}{i^{2}}\right), & i > n/2 \\ X_{best}^{t} + \left| X_{i,j}^{t} - X_{best}^{t} \right| \cdot \mathrm{rand}(\{-1,1\}) \cdot L, & \text{otherwise} \end{cases}
4. Danger warning:
X_{i,j}^{t+1} = \begin{cases} X_{best}^{t} + \beta \cdot \left| X_{i,j}^{t} - X_{best}^{t} \right|, & f_i > f_g \\ X_{i,j}^{t} + K \cdot \dfrac{\left| X_{i,j}^{t} - X_{worst}^{t} \right|}{(f_i - f_w) + \varepsilon}, & f_i = f_g \end{cases}
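A simplified, illustrative Python sketch of one SSA iteration based on the three update rules above is given below. The discoverer fraction and safety threshold are assumed values, bound handling is omitted for brevity, and this is not intended as the reference implementation.

```python
import numpy as np

def ssa_step(X, fitness, max_iter, pd_frac=0.2, st=0.8, rng=np.random.default_rng(0)):
    """One simplified SSA iteration for a minimisation problem: discoverers explore,
    followers track the best discoverer, and a random subset reacts to danger."""
    n, d = X.shape
    f = np.array([fitness(x) for x in X])
    order = np.argsort(f)                              # best (lowest fitness) first
    X, f = X[order].copy(), f[order]
    best, worst = X[0].copy(), X[-1].copy()
    n_disc = max(1, int(pd_frac * n))

    # Discoverer update: wide search when safe, random relocation otherwise
    if rng.random() < st:
        X[:n_disc] *= np.exp(-np.arange(1, n_disc + 1)[:, None] / (rng.random() * max_iter + 1e-9))
    else:
        X[:n_disc] += rng.normal(size=(n_disc, d))

    # Follower update: hungry followers fly elsewhere, the rest forage near the best position
    for i in range(n_disc, n):
        if i > n / 2:
            X[i] = rng.normal() * np.exp((worst - X[i]) / (i ** 2))
        else:
            X[i] = best + np.abs(X[i] - best) * rng.choice([-1.0, 1.0], size=d)

    # Danger-awareness update for a random 10% of the population
    for i in rng.choice(n, size=max(1, n // 10), replace=False):
        if f[i] > f[0]:
            X[i] = best + rng.normal() * np.abs(X[i] - best)
        else:
            X[i] = X[i] + rng.uniform(-1, 1) * np.abs(X[i] - worst) / (f[i] - f[-1] + 1e-9)
    return X
```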

2.4.2. Tuna Swarm Optimization (TSO)

The tuna swarm optimization algorithm is a novel optimization algorithm proposed in 2021 based on the predation strategy of tuna [32]. Its primary features include simple operation, few adjustable parameters, and a strong ability to escape local optima.
1. Population initialization:
X_i^{init} = \mathrm{rand} \cdot (ub - lb) + lb, \quad i = 1, 2, \ldots, NP
2. Spiral predation:
The tuna group forms a spiral state to control the prey within a certain range, thereby achieving efficient predation. According to the spiral predation strategy, its mathematical model is formulated as follows:
X_i^{t+1} = \begin{cases} \alpha_1 \left( X_{best}^{t} + \beta \left| X_{best}^{t} - X_i^{t} \right| \right) + \alpha_2 X_i^{t}, & i = 1 \\ \alpha_1 \left( X_{best}^{t} + \beta \left| X_{best}^{t} - X_i^{t} \right| \right) + \alpha_2 X_{i-1}^{t}, & i = 2, 3, \ldots, NP \end{cases}
A random coordinate may also be generated within the search space to serve as the reference point for the spiral search. The mathematical model is expressed as:
X_i^{t+1} = \begin{cases} \alpha_1 \left( X_{rand}^{t} + \beta \left| X_{rand}^{t} - X_i^{t} \right| \right) + \alpha_2 X_i^{t}, & i = 1 \\ \alpha_1 \left( X_{rand}^{t} + \beta \left| X_{rand}^{t} - X_i^{t} \right| \right) + \alpha_2 X_{i-1}^{t}, & i = 2, 3, \ldots, NP \end{cases}
Heuristic algorithms typically begin with a global search and gradually narrow the search scope to a local search. Consequently, as the number of iterations increases, the search of the tuna population shifts from random reference points to precise optimal individuals.
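The spiral update can be sketched in a few lines, as below; the schedules of the weighting coefficients α1 and α2 and the simplified spiral coefficient β are illustrative assumptions rather than the authors' settings.

```python
import numpy as np

def tso_spiral_step(X, X_best, t, max_iter, a=0.7, rng=np.random.default_rng(0)):
    """Simplified spiral-foraging update sketched from the equations above: each tuna
    moves toward the current best (or a random reference) while also following the
    individual in front of it."""
    n, _ = X.shape
    alpha1 = a + (1 - a) * t / max_iter          # weight on the spiral target (assumed schedule)
    alpha2 = (1 - a) * (1 - t / max_iter)        # weight on the preceding individual
    b = rng.random()
    beta = np.exp(b) * np.cos(2 * np.pi * b)     # simplified spiral coefficient
    X_new = np.empty_like(X)
    X_new[0] = alpha1 * (X_best + beta * np.abs(X_best - X[0])) + alpha2 * X[0]
    for i in range(1, n):
        X_new[i] = alpha1 * (X_best + beta * np.abs(X_best - X[i])) + alpha2 * X[i - 1]
    return X_new
```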

2.4.3. Aquila Optimizer (AO)

The aquila optimizer is a new biological optimization algorithm developed in 2021 based on the hunting behavior of the aquila in nature [33]. It possesses strong optimization capabilities and rapid convergence speed. The algorithm explores the optimal solution through a four-stage optimization process, updating the location in real time, and the search mechanism automatically stops when the desired result is achieved.
First, the position of the population within the search range is initialized:
X_{ij} = \mathrm{rand} \times (UB_j - LB_j) + LB_j, \quad i = 1, 2, \ldots, N; \; j = 1, 2, \ldots, Dim
1. Extended Search (X1):
The aquila soars high in search of prey areas and selects the best hunting area with a vertical stoop. Its mathematical model is as follows:
X_1(t+1) = X_{best}(t) \times \left( 1 - \dfrac{t}{T} \right) + \left( X_M(t) - X_{best}(t) \right) \cdot \mathrm{rand}
2. Narrowing Down the Search (X2):
Once the first stage has identified the prey area, the AO narrows the search within the selected hunting area and prepares to attack the target prey. Its mathematical model is expressed as:
X_2(t+1) = X_{best}(t) \times \mathrm{Levy}(D) + X_R(t) + (y - x) \cdot \mathrm{rand}
3. Expansion Development (X3):
When the prey is located, the aquila descends vertically for a preliminary attack, using the chosen target area to approach the prey. Its mathematical model is formulated as:
X_3(t+1) = \left( X_{best}(t) - X_M(t) \right) \times \alpha - \mathrm{rand} + \left( (UB - LB) \times \mathrm{rand} + LB \right) \times \delta
4. Scaling Down Development (X4):
When the aquila finally attacks its prey, it strikes at random locations following the prey's movement. Its mathematical model is expressed as:
X_4(t+1) = QF(t) \times X_{best}(t) - \left( G_1 \times X(t) \times \mathrm{rand} \right) - G_2 \times \mathrm{Levy}(D) + \mathrm{rand} \times G_1
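As an illustration, the first phase (expanded exploration, X1) can be written directly from its update rule; the remaining three phases follow the same pattern with their respective equations. This is a sketch under the population-mean interpretation of X_M, not the authors' implementation.

```python
import numpy as np

def ao_expanded_search(X, X_best, t, T, rng=np.random.default_rng(0)):
    """First AO phase (expanded exploration, X1): each candidate moves toward the best
    solution found so far, tempered by the population mean position X_M."""
    X_M = X.mean(axis=0)                                   # population mean position
    return X_best * (1 - t / T) + (X_M - X_best) * rng.random(size=X.shape)
```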

2.5. Reference Crop Evapotranspiration Calculation Model

2.5.1. FAO-56 Penman–Monteith Equation

In this study, the FAO-recommended Penman–Monteith formula was employed to calculate the evapotranspiration of 20 sites across various climatic regions in China, providing a benchmark for assessing the model’s accuracy [1].
ET_0 = \dfrac{0.408 \Delta (R_n - G) + \gamma \dfrac{900}{T + 273} U_2 (e_s - e_a)}{\Delta + \gamma (1 + 0.34 U_2)}
where ET0 is the reference crop evapotranspiration (mm·d−1); Rn is the net radiation (MJ m−2·d−1); G is the soil heat flux (MJ m−2·d−1); T is the average temperature (℃); es is the saturation vapour pressure (kPa); ea is the actual vapour pressure (kPa); Δ is the slope of the saturation vapour pressure–temperature curve (kPa·℃−1); γ is the psychrometric constant (kPa·℃−1); and U2 is the wind speed at 2 m above the ground (m·s−1). The detailed calculation of these parameters is described in FAO-56 [1].
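A direct Python transcription of the equation is shown below; the example values are illustrative only, and the intermediate variables (Δ, γ, es, ea, Rn) are assumed to have been computed beforehand following FAO-56.

```python
def et0_penman_monteith(Rn, G, T, u2, es, ea, delta, gamma):
    """FAO-56 Penman-Monteith reference evapotranspiration (mm/day), using the
    variable definitions given above."""
    num = 0.408 * delta * (Rn - G) + gamma * (900.0 / (T + 273.0)) * u2 * (es - ea)
    den = delta + gamma * (1.0 + 0.34 * u2)
    return num / den

# Example with plausible mid-latitude values (illustrative numbers only): about 4.8 mm/day.
print(et0_penman_monteith(Rn=13.3, G=0.0, T=21.5, u2=2.1, es=2.56, ea=1.53,
                          delta=0.157, gamma=0.067))
```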

2.5.2. Model Accuracy Verification

To more accurately evaluate and quantify the performance and accuracy of the ET0 model, four evaluation indices—the coefficient of determination (R2), root mean square error (RMSE), normalized root mean square error (NRMSE), and mean absolute error (MAE)—were employed in this study. Their mathematical expressions are as follows:
R^2 = \dfrac{\left[ \sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y}) \right]^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2 \sum_{i=1}^{n} (Y_i - \bar{Y})^2}
RMSE = \sqrt{\dfrac{1}{n} \sum_{i=1}^{n} (Y_i - X_i)^2}
NRMSE = \dfrac{\sqrt{\dfrac{1}{n} \sum_{i=1}^{n} (Y_i - X_i)^2}}{\bar{X}} \times 100\%
MAE = \dfrac{1}{n} \sum_{i=1}^{n} \left| X_i - Y_i \right|
where Xi and Yi are the measured (P-M) and predicted values, respectively, and X̄ and Ȳ are the means of Xi and Yi. When extreme ET0 values are present during model assessment, the following strategies help to ensure accurate and reliable results. First, priority should be given to metrics that are less sensitive to outliers, such as MAE. Second, to obtain a more comprehensive picture of model performance, no single metric should be relied upon; instead, a combination of metrics such as R2, MAE, RMSE, and NSE should be used. Finally, appropriate preprocessing and outlier handling before the assessment help to minimize the influence of outliers on the results.
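The four indices can be computed directly from the definitions above, for example (NRMSE is returned as a fraction, matching the values reported later):

```python
import numpy as np

def evaluation_metrics(x_obs: np.ndarray, y_pred: np.ndarray) -> dict:
    """R2, RMSE, NRMSE and MAE as defined above, with x_obs the P-M values
    and y_pred the model predictions."""
    x_bar, y_bar = x_obs.mean(), y_pred.mean()
    r2 = (np.sum((x_obs - x_bar) * (y_pred - y_bar)) ** 2 /
          (np.sum((x_obs - x_bar) ** 2) * np.sum((y_pred - y_bar) ** 2)))
    rmse = np.sqrt(np.mean((y_pred - x_obs) ** 2))
    return {"R2": r2, "RMSE": rmse, "NRMSE": rmse / x_bar, "MAE": np.mean(np.abs(x_obs - y_pred))}
```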

2.6. The Importance of Meteorological Factors for ET0 as Determined by the Path Analysis Method

Figure 5 illustrates the importance of meteorological factors on ET0 based on the path coefficient method. The study assessed the impact of seven meteorological factors on ET0 at 20 stations using the path coefficient method. The results indicate that total solar radiation (Rs) and maximum temperature (Tmax) are the most influential factors on ET0, with importance ranges of 0.832–0.947 and 0.766–0.906, respectively, which are crucial across most meteorological stations. The U2 factor shows significant importance only at the Kuerle and Kashi stations, while it is less influential at other stations. Additionally, the relative humidity (RH) factor exhibits an inverse effect on ET0 at most stations, meaning that an increase in humidity leads to a decrease in ET0. Therefore, the importance of each factor in influencing ET0 can be ranked as follows: Rs > Tmax > Ra > Tmin > n > U2 > RH.
In the present study, seven meteorological factors were combined based on their importance and ease of acquisition. The basic combination included Tmax, Tmin, and Rs. Additional factors such as Ra, RH, U2, and n were then added individually to form four combinations. According to the principle of mass transfer, RH and U2 were further added to the temperature factors to form a total of six input combinations. The detailed meteorological input combinations are presented in Table 2.
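For reference, the six combinations of Table 2 (shared by the ELM model and each hybrid model) can be encoded as a simple mapping:

```python
# The six meteorological input combinations of Table 2.
INPUT_COMBINATIONS = {
    1: ["Tmax", "Tmin", "Rs"],
    2: ["Tmax", "Tmin", "Rs", "Ra"],
    3: ["Tmax", "Tmin", "Rs", "RH"],
    4: ["Tmax", "Tmin", "Rs", "U2"],
    5: ["Tmax", "Tmin", "Rs", "n"],
    6: ["Tmax", "Tmin", "RH", "U2"],
}
```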

3. Results and Analysis

3.1. Comparison of Performance Differences of Machine Learning Models in Different Climate Regions

In this study, four evaluation indicators were used to assess the ET0 prediction capability of the ELM model and the three hybrid machine learning models (TSO-ELM, SSA-ELM, and AO-ELM) across five climate regions in China under various combinations of inputs. The evaluation indicators are shown in Figure 6. It was observed that all four models achieved satisfactory prediction accuracy, with the hybrid models demonstrating higher accuracy than the ELM model under the same meteorological input combinations. Notably, combination 5 yielded better predictions than the other combinations, while combination 1 exhibited the worst performance in terms of R2, indicating significant differences in model accuracy across different meteorological input scenarios.
To further evaluate the performance of each model, the statistical indicators from each climate region station during the training stage and the test stage were averaged, with the results presented in Table 3, Table 4, Table 5, Table 6 and Table 7. The R2, RMSE, NRMSE, and MAE values ranged from 0.7654 to 0.9864, 0.1271 to 0.7842 mm∙d−1, 0.0409 to 0.2647, and 0.0886 to 0.5757 mm∙d−1, respectively. Based on the test stage data, the AO-ELM model exhibited the best performance, with R2, RMSE, NRMSE, and MAE values of 0.9139, 0.4333 mm∙d−1, 0.1532, and 0.3114 mm∙d−1, respectively. The TSO-ELM and SSA-ELM models followed, while the ELM model showed the poorest performance, with R2, RMSE, NRMSE, and MAE values of 0.9089, 0.4429 mm∙d−1, 0.1574, and 0.3227 mm∙d−1, respectively.
The AO, TSO, and SSA algorithms all possess strong global search capabilities, rapid convergence rates, and good robustness, characterized by stable performance across diverse optimization problems and datasets, along with concise algorithmic structures and straightforward implementation. The findings indicate that the three bionic algorithms enhance the ELM model to varying degrees, suggesting that optimizing the ELM model with bionic algorithms is a reasonable approach.
Despite the superior performance of all three bionic algorithms, there are notable differences. This is attributed to the AO algorithm’s ability to incorporate global information during the optimization process, allowing the AO-ELM model to more effectively avoid overfitting and thereby exhibit stronger generalization abilities. Consequently, the AO-ELM model maintains higher prediction accuracy when faced with new, unseen data. Although the SSA algorithm is also an effective global optimization method, it may become trapped in local optima when dealing with complex problems, limiting the model’s prediction accuracy. Additionally, the convergence speed of the SSA algorithm may be influenced by the initial population and the number of iterations, impacting training efficiency and prediction performance. In practice, the TSO algorithm may be constrained by problem size and complexity. These advantages enable the AO-ELM model to demonstrate higher prediction accuracy and stability in addressing complex prediction tasks.
In summary, the machine learning models in this study achieved commendable accuracy in simulating and predicting ET0 across the five climate regions, indicating a strong correlation between the predicted and measured values. Furthermore, the performance of the hybrid models varied, suggesting that the simulation performance of the ELM model was improved to varying extents after optimization by the bionic algorithms.
In the five distinct climate regions, the PMC stations exhibited a strong performance during the testing phase (mean R2 = 0.9281, RMSE = 0.3653 mm∙d−1, NRMSE = 0.1424, MAE = 0.2668 mm∙d−1), while the TPMC stations demonstrated the weakest performance (mean R2 = 0.8815, RMSE = 0.4461 mm∙d−1, NRMSE = 0.1246, MAE = 0.3427 mm∙d−1). PMC regions are typically characterized by intense solar radiation and significant diurnal temperature variations, whereas TPMC regions are generally marked by high temperatures, elevated humidity levels, and concentrated, heavy rainfall events. Given that the primary meteorological inputs are temperature and solar radiation, factors such as solar radiation (Rs), temperature (T), and sunshine duration (n) have a more pronounced effect on ET0 prediction in PMC regions. In TPMC regions, on the other hand, rainfall and humidity play a more critical role, and their substantial fluctuations can lead to overfitting or underfitting of the model during training, thereby affecting prediction accuracy.
For instance, Dong et al. [38] reported that four hybrid models based on KNEA (K-Nearest Neighbor Evolutionary Algorithm) performed worst at PMC sites when predicting ET0 across various climate regions in China. Similarly, Wu et al. [23] found that the prediction accuracy of their optimized model was significantly higher in TMC and SMC regions compared to TCC and PMC regions. The discrepancy between our results and those of previous studies may be attributed to differences in the overall performance of the various hybrid machine learning models, the specific combinations of input factors, and the selection of sites within the same climate regions. It is evident that the prediction accuracy of the same hybrid model can vary depending on the climate region of the station, indicating that local climate and environmental conditions influence model performance.
During the testing phase, scatter plots were generated to compare the predicted values and the P-M (Penman–Monteith) calculated values for the four machine learning models under different input combinations, using Linxia Station as a representative example (Figure 7). Most data points are closely aligned with the 1:1 line, suggesting that the models' predicted values are in close agreement with the P-M values and indicating a strong correlation. However, it is apparent that varying input combinations affect the accuracy of the ET0 predictions. The R2 values for combinations 2 and 5 are consistently higher than those of the other combinations, and the scatter plots exhibit less dispersion, indicating a better fit between the predicted and calculated values and higher prediction accuracy. In contrast, the scatter plots for combination 6 show a high degree of dispersion and less ideal prediction outcomes, suggesting that while introducing solar radiation (Ra) and sunshine duration (n) as input parameters yields reliable prediction performance, incorporating relative humidity (RH) and wind speed (U2) tends to degrade model performance.
These findings are consistent with earlier research, confirming that Ra and n are critical parameters for accurate ET0 prediction [8,39]. The variation in model performance across different climate regions is closely linked to local climate conditions. Sunshine duration indirectly reflects solar radiation, which provides the necessary energy for evapotranspiration, converting liquid water into water vapor [31]. Additionally, some studies have highlighted solar radiation as the primary factor influencing ET0 variations in China [40]. This further supports the notion that Ra and n are more critical than RH and U2 for ET0 prediction.

3.2. Comparison of the Stability of Each Machine Learning Model

In general, the four models demonstrated good accuracy across the five climate regions, as shown in Table 3, Table 4, Table 5, Table 6 and Table 7. Figure 8 presents the percentage increase in the average RMSE values of the ELM model and the mixed models from the training stage to the testing stage. The results indicate that the average RMSE values in the testing stage are higher than those in the training stage. Fan et al. [25] and Wu et al. [41] also reported similar findings when predicting ET0. In terms of growth rate, the RMSE growth rates of the three mixed models are higher than that of the ELM model; among them, the AO-ELM model exhibits the largest growth rate (1.5–7.2%), while the ELM model shows the lowest growth rate (0.3–5.9%). However, the mixed models all exhibit lower RMSE values than the ELM model in the testing phase. Although the ELM model has a smaller RMSE increase and thus more stable performance, the RMSE value of the AO-ELM model is more satisfactory. Therefore, the AO-ELM model is recommended for simulating and predicting ET0. Compared with the training stage, the model performance at the stations in each climate region declined during the testing stage. The primary reason for this difference is the distribution discrepancy between the climatic environment and meteorological data samples of the training and testing stages. McVicar et al. [42] previously noted that the environment and climate of stations in each region change over time, resulting in differences in ET0 prediction accuracy between the two stages.
Figure 9 illustrates the changes in the statistical indicators of the mixed models relative to the ELM model. As shown in the figure, across all the climate regions, R2 increases by 0.0022–0.5188%, RMSE changes by −0.0079% to 3.9794% (a decrease in nearly all cases), NRMSE decreases by 1.3139–4.0016%, and MAE decreases by 1.3139–4.0016%. This suggests that the performance of the mixed models is improved compared to that of the ELM model, consistent with the conclusion that mixed models can achieve better performance in predicting ET0 than the original model, as noted by Muhammad et al. [43], Zhu et al. [31], and Reham et al. [44]. In particular, the performance improvement is most obvious in the PMC region (R2 increased by 0.3229–0.5188%, RMSE decreased by 3.9605–3.9794%, NRMSE decreased by 2.9798–4.0016%, and MAE decreased by 2.9798–4.0016%). The TSO-ELM model exhibits the best improvement effect (R2 increased by 0.5188%, RMSE decreased by 3.9605%, NRMSE decreased by 4.0016%, and MAE decreased by 4.5507%). However, the performance improvement at the TCC sites is the poorest, and the RMSE of the SSA-ELM model even increases (by 0.0079%), indicating that the optimization effect of the SSA algorithm is not ideal in this area.

3.3. Comparison of Computational Costs of Various Machine Learning Models

Figure 10 describes the average computational runtime for the four models across combinations of six weather factors. From the figure, it is evident that the ELM model boasts the shortest runtime, which ranges from 0.91 to 1.32 s. The other three hybrid algorithms, however, exhibit longer runtimes, approximately 5.41 to 14.86 times those of the ELM model. This discrepancy is attributed to the larger number of parameters optimized by the hybrid models, which are run using bionic algorithms that necessitate more time to identify the optimal solution. The computational times of the models for various combinations of meteorological factors do not exhibit significant variations. Notably, the TSO-ELM model’s runtime shows slight fluctuations under different combinations. Among the three hybrid algorithms, the AO-ELM model records the shortest runtime. Despite the AO-ELM model’s runtime being increased by 5.41 to 7.34 times compared to the ELM model, its prediction accuracy surpasses that of the ELM model. Given the primary importance of prediction accuracy in estimating ET0 by a machine learning model, the computational time of the AO-ELM model is deemed acceptable within this context.

3.4. Selection of the Best Model for Each Climate Region

Table 8 presents the optimal statistical indicator values for the best-performing models derived from meteorological stations across the five climate regions in China, using different meteorological factor inputs (using the testing phase as an example). As indicated in the table, the statistical indices R2, RMSE, NRMSE, and MAE range between 0.9291 and 0.9864, 0.1271 and 0.4550 mm·d−1, 0.0886 and 0.3373, and 0.0409 and 0.1825 mm·d−1, respectively, across the climate regions. The AO-ELM model excelled in predictive performance at the most sites (13 sites) among the five climate regions, suggesting that the AO-ELM model can achieve superior prediction performance across a broad range of applications. Our analysis revealed that the hybrid model in the TCC region exhibited the highest prediction accuracy under combination 4. Wind speed directly influences the velocity and direction of water molecule movement on the ground. In conditions of high wind speed, water molecules are more likely to diffuse from the evaporating surface into the atmosphere, thereby accelerating the evaporation rate. Additionally, when constructing the evapotranspiration model, wind speed serves as one of the critical input parameters, and the accuracy and reliability of the wind speed data significantly affect the model's prediction outcomes. Furthermore, wind speed indirectly impacts the model's evapotranspiration prediction accuracy by adjusting the model parameters. Given that the meteorological stations in this climatic region are situated inland, near deserts and grasslands, and are characterized by year-round dry, less rainy, and windy conditions, the impact of wind speed on evapotranspiration becomes more pronounced. This indicates that in this climatic region, ET0 is more influenced by the U2 factor, and thus combination 4, which includes the U2 factor, is more suitable for the TCC region. The other four climate regions exhibited the best performance when using combination 2 and combination 5 as input factors, with combination 5 notably achieving better prediction accuracy, underscoring the pivotal role of the meteorological factor n in predicting ET0. This finding aligns with Yan et al.'s [39] conclusion regarding the prediction of ET0 in arid and humid areas of China, where they observed that wind speed had a greater impact in arid areas, while sunshine duration was more critical in humid areas. In summary, when assessing the predictive performance of different models across various climate regions, the best results are achieved by selecting the most appropriate meteorological input parameters for the local stations.

3.5. Improving ET0 Predictions More Effectively

Although the hybrid models in this study achieved good accuracy, the performance gains over the ELM model were modest; the prediction of ET0 can therefore be further improved in the following ways. First, before introducing advanced algorithms, the data should be preprocessed, including the handling of missing values and the detection of outliers, which helps to improve the stability and accuracy of the algorithm. Second, an appropriate algorithm should be selected according to the characteristics of the data and regional variability, and its parameters should be tuned. This can be achieved by introducing optimization algorithms to tune the model parameters; by combining multiple machine learning algorithms and exploiting the advantages of each model to improve the accuracy of the prediction model; or by constructing the ET0 prediction model with deep learning, ensemble learning, and related methods. After the model is constructed, it needs to be evaluated and validated to ensure its performance in practical applications.

4. Conclusions

In this study, the performance of the ELM model was optimized using three algorithms: the aquila optimizer (AO), tuna swarm optimization (TSO), and the sparrow search algorithm (SSA). The optimized hybrid models, namely AO-ELM, TSO-ELM, and SSA-ELM, along with the standalone ELM model, were used to simulate and predict the reference crop evapotranspiration (ET0) at stations located in various climatic regions across China. The models were trained and tested using daily meteorological data (Tmax, Tmin, Rs, Ra, RH, U2, and n) collected from 20 meteorological stations spanning five distinct climatic regions. The models were evaluated with six different combinations of meteorological input factors, and the predicted ET0 values were compared against those calculated using the FAO-56 Penman–Monteith (P-M) equation to assess the degree of fit. The results indicate the following:
(1)
During the testing phase, the three hybrid models demonstrated satisfactory prediction accuracy across different climatic regions. Among them, the AO-ELM model exhibited superior predictive performance compared to the SSA-ELM and TSO-ELM models.
(2)
In scenarios where complete meteorological data are unavailable, combining the Tmax, Tmin, and Rs parameters with U2 as an input parameter yields better ET0 predictions in the temperate continental climate (TCC) region. Conversely, using n as the additional input parameter provided satisfactory ET0 predictions in the other climate regions.
(3)
Stations located in the plateau mountain climate (PMC) region exhibited excellent simulation performance, while those in the tropical monsoon climate (TPMC) region showed the poorest performance. This suggests that local climate conditions significantly influence the overall model performance.
(4)
For model selection, the AO-ELM model demonstrated superior predictive performance when applied on a large scale. Regarding the optimal combination of input parameters, apart from the superior prediction accuracy of combination 4 in the temperate continental climate (TCC) region, combination 5 performed better in the remaining four climatic regions. Therefore, when determining the most suitable model for each climatic region with limited meteorological data, AO-ELM4 (utilizing Tmax, Tmin, Rs, and U2 as inputs) was chosen for the TCC region, and AO-ELM5 (utilizing Tmax, Tmin, Rs, and n as inputs) was chosen for the temperate monsoon climate (TMC), plateau mountain climate (PMC), subtropical monsoon climate (SMC), and tropical monsoon climate (TPMC) regions.
The findings of this study provide valuable insights for predicting ET0 in diverse climatic regions. However, due to the limited number of bio-inspired algorithms employed in this research, the study has certain limitations. For more in-depth and rigorous research, it is recommended to explore a broader range of meteorological factors, input combinations, or advanced bio-inspired algorithms to develop more reliable methods for ET0 prediction.

Author Contributions

Conceptualization, R.M. and Y.L.; methodology, R.M.; software, R.M. and S.J.; validation, R.M., S.J., Y.L. and J.H.; formal analysis, R.M. and S.J.; investigation, H.M.; resources, J.H. and S.J.; data curation, R.M. and S.J.; writing—original draft preparation, J.H. and R.M.; writing—review and editing, J.H. and R.M.; visualization, H.M.; supervision, S.J.; project administration, J.H.; funding acquisition, J.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Sichuan Agricultural University Professional Development Support Program (2221998094), the Sichuan Science and Technology Program (2023YFN0024), and the Chengdu Eastern New Area Technological Innovation Research Program (2024-DBXQ-KJYF008).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are openly available in National Meteorological Information Center-China Meteorological Data Network “https://data.cma.cn/ (Accessed on 15 July 2023)”.

Acknowledgments

We would like to express our gratitude to everyone who provided support and advice throughout the research process. We also acknowledge the financial support from the Sichuan Agricultural University Professional Development Support Program (2221998094), the Sichuan Science and Technology Program (2023YFN0024), and the Chengdu Eastern New Area Technological Innovation Research Program (2024-DBXQ-KJYF008).

Conflicts of Interest

The authors declare no conflicts of interest. The funding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

References

  1. Allen, R.G. Crop Evapotranspiration-Guidelines for computing crop water requirements. FAO Irrig. Drain. Pap. 1998, 56, 147–151. [Google Scholar]
  2. Shiri, J.; Kisi, Ö.; Landeras, G.; López, J.J.; Nazemi, A.H.; Stuyt, L. Daily reference evapotranspiration modeling by using genetic programming approach in the Basque Country (Northern Spain). J. Hydrol. 2012, 414, 302–316. [Google Scholar] [CrossRef]
  3. Zhang, Q.; Cui, N.; Feng, Y.; Gong, D.; Hu, X. Improvement of Makkink model for reference evapotranspiration estimation using temperature data in Northwest China. J. Hydrol. 2018, 566, 264–273. [Google Scholar] [CrossRef]
  4. Feng, Y.; Jia, Y.; Cui, N.; Zhao, L.; Li, C.; Gong, D. Calibration of Hargreaves model for reference evapotranspiration estimation in Sichuan basin of southwest China. Agric. Water Manag. 2017, 181, 1–9. [Google Scholar] [CrossRef]
  5. Yang, Y.; Chen, R.S.; Han, C.T.; Liu, Z.W. Evaluation of 18 models for calculating potential evapotranspiration in different climatic zones of China. Agric. Water Manag. 2021, 244, 106545. [Google Scholar] [CrossRef]
  6. Guo, X.H.; Sun, X.H.; Ma, J.J. Prediction of daily crop reference evapotranspiration (ET0) values through a least-squares support vector machine model. Hydrol. Res. 2011, 42, 268–274. [Google Scholar] [CrossRef]
  7. Feng, Y.; Jia, Y.; Zhang, Q.; Gong, D.; Cui, N. National-scale assessment of pan evaporation models across different climatic zones of China. J. Hydrol. 2018, 564, 314–328. [Google Scholar] [CrossRef]
  8. Gao, L.L.; Gong, D.Z.; Cui, N.B.; Lv, M.; Feng, Y. Evaluation of bio-inspired optimization algorithms hybrid with artificial neural network for reference crop evapotranspiration estimation. Comput. Electron. Agric. 2021, 190, 106466. [Google Scholar] [CrossRef]
  9. Ferreira, L.B.; Da Cunha, F.F.; de Oliveira, R.A.; Fernandes, E.I. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM—A new approach. J. Hydrol. 2019, 572, 556–570. [Google Scholar] [CrossRef]
  10. Fan, J.L.; Zheng, J.; Wu, L.F.; Zhang, F.C. Estimation of daily maize transpiration using support vector machines, extreme gradient boosting, artificial and deep neural networks models. Agric. Water Manag. 2021, 245, 106547. [Google Scholar] [CrossRef]
  11. Mohammadi, B.; Mehdizadeh, S. Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric. Water Manag. 2020, 237, 106145. [Google Scholar] [CrossRef]
  12. Tejada, A.T.; Ella, V.B.; Lampayan, R.M.; Reaño, C.E. Modeling Reference Crop Evapotranspiration Using Support Vector Machine (SVM) and Extreme Learning Machine (ELM) in Region IV-A, Philippines. Water 2022, 14, 754. [Google Scholar] [CrossRef]
  13. Wen, X.H.; Si, J.H.; He, Z.B.; Wu, J.; Shao, H.B.; Yu, H.J. Support-Vector-Machine-Based Models for Modeling Daily Reference Evapotranspiration With Limited Climatic Data in Extreme Arid Regions. Water Resour. Manag. 2015, 29, 3195–3209. [Google Scholar] [CrossRef]
  14. Ladlani, I.; Houichi, L.; Djemili, L.; Heddam, S.; Belouz, K. Modeling daily reference evapotranspiration (ET0) in the north of Algeria using generalized regression neural networks (GRNN) and radial basis function neural networks (RBFNN): A comparative study. Meteorol. Atmos. Phys. 2012, 118, 163–178. [Google Scholar] [CrossRef]
  15. Feng, Y.; Peng, Y.; Cui, N.B.; Gong, D.Z.; Zhang, K.D. Modeling reference evapotranspiration using extreme learning machine and generalized regression neural network only with temperature data. Comput. Electron. Agric. 2017, 136, 71–78. [Google Scholar] [CrossRef]
  16. Feng, Y.; Cui, N.B.; Gong, D.Z.; Zhang, Q.W.; Zhao, L. Evaluation of random forests and generalized regression neural networks for daily reference evapotranspiration modelling. Agric. Water Manag. 2017, 193, 163–173. [Google Scholar] [CrossRef]
  17. Wang, S.; Lian, J.J.; Peng, Y.Z.; Hu, B.Q.; Chen, H.S. Generalized reference evapotranspiration models with limited climatic data based on random forest and gene expression programming in Guangxi, China. Agric. Water Manag. 2019, 221, 220–230. [Google Scholar] [CrossRef]
  18. Yang, Y.; Sun, H.W.; Xue, J.; Liu, Y.; Liu, L.G.; Yan, D.; Gui, D.W. Estimating evapotranspiration by coupling Bayesian model averaging methods with machine learning algorithms. Environ. Monit. Assess. 2021, 193, 156. [Google Scholar] [CrossRef]
  19. Gul, S.; Ren, J.; Wang, K.; Guo, X. Estimation of reference evapotranspiration via machine learning algorithms in humid and semiarid environments in Khyber Pakhtunkhwa, Pakistan. Int. J. Environ. Sci. Technol. 2023, 20, 5091–5108. [Google Scholar] [CrossRef]
  20. Spontoni, T.A.; Ventura, T.M.; Palacios, R.S.; Curado, L.; Fernandes, W.A.; Capistrano, V.B.; Fritzen, C.L.; Pavao, H.G.; Rodrigues, T.R. Evaluation and Modelling of Reference Evapotranspiration Using Different Machine Learning Techniques for a Brazilian Tropical Savanna. Agronomy 2023, 13, 2056. [Google Scholar] [CrossRef]
  21. Agrawal, Y.; Kumar, M.; Ananthakrishnan, S.; Kumarapuram, G. Evapotranspiration Modeling Using Different Tree Based Ensembled Machine Learning Algorithm. Water Resour. Manag. 2022, 36, 1025–1042. [Google Scholar] [CrossRef]
  22. Nagappan, M.; Gopalakrishnan, V.; Alagappan, M. Prediction of reference evapotranspiration for irrigation scheduling using machine learning. Hydrol. Sci. J. 2020, 65, 2669–2677. [Google Scholar] [CrossRef]
  23. Wu, L.F.; Peng, Y.W.; Fan, J.L.; Wang, Y.C.; Huang, G.M. A novel kernel extreme learning machine model coupled with K-means clustering and firefly algorithm for estimating monthly reference evapotranspiration in parallel computation. Agric. Water Manag. 2021, 245, 106624. [Google Scholar] [CrossRef]
  24. Feng, Y.; Cui, N.B.; Zhao, L.; Hu, X.T.; Gong, D.Z. Comparison of ELM, GANN, WNN and empirical models for estimating reference evapotranspiration in humid region of Southwest China. J. Hydrol. 2016, 536, 376–383. [Google Scholar] [CrossRef]
  25. Fan, J.L.; Yue, W.J.; Wu, L.F.; Zhang, F.C.; Cai, H.J.; Wang, X.K.; Lu, X.H.; Xiang, Y.Z. Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric. For. Meteorol. 2018, 263, 225–241. [Google Scholar] [CrossRef]
  26. Feng, Y.; Gong, D.Z.; Mei, X.R.; Cui, N.B. Estimation of maize evapotranspiration using extreme learning machine and generalized regression neural network on the China Loess Plateau. Hydrol. Res. 2017, 48, 1156–1168. [Google Scholar] [CrossRef]
  27. Yin, Z.L.; Feng, Q.; Yang, L.S.; Deo, R.C.; Wen, X.H.; Si, J.H.; Xiao, S.C. Future Projection with an Extreme-Learning Machine and Support Vector Regression of Reference Evapotranspiration in a Mountainous Inland Watershed in North-West China. Water 2017, 9, 880. [Google Scholar] [CrossRef]
  28. Abdullah, S.S.; Malek, M.A.; Abdullah, N.S.; Kisi, O.; Yap, K.S. Extreme Learning Machines: A new approach for prediction of reference evapotranspiration. J. Hydrol. 2015, 527, 184–195. [Google Scholar] [CrossRef]
  29. Chia, M.Y.; Huang, Y.F.; Koo, C.H. Swarm-based optimization as stochastic training strategy for estimation of reference evapotranspiration using extreme learning machine. Agric. Water Manag. 2021, 243, 106447. [Google Scholar] [CrossRef]
  30. Liu, Q.S.; Wu, Z.J.; Cui, N.B.; Zhang, W.J.; Wang, Y.S.; Hu, X.T.; Gong, D.Z.; Zheng, S.S. Genetic Algorithm-Optimized Extreme Learning Machine Model for Estimating Daily Reference Evapotranspiration in Southwest China. Atmosphere 2022, 13, 971. [Google Scholar] [CrossRef]
  31. Zhu, B.; Feng, Y.; Gong, D.Z.; Jiang, S.Z.; Zhao, L.; Cui, N.B. Hybrid particle swarm optimization with extreme learning machine for daily reference evapotranspiration prediction from limited climatic data. Comput. Electron. Agric. 2020, 173, 105430. [Google Scholar] [CrossRef]
  32. Xie, L.; Han, T.; Zhou, H.; Zhang, Z.R.; Han, B.; Tang, A.D. Tuna Swarm Optimization: A Novel Swarm-Based Metaheuristic Algorithm for Global Optimization. Comput. Intell. Neurosci. 2021, 2021, 9210050. [Google Scholar] [CrossRef] [PubMed]
  33. Abualigah, L.; Yousri, D.; Abd Elaziz, M.; Ewees, A.A.; Al-qaness, M.; Gandomi, A.H. Aquila Optimizer: A novel meta-heuristic optimization algorithm. Comput. Ind. Eng. 2021, 157, 107250. [Google Scholar] [CrossRef]
  34. Xue, J.; Shen, B. A novel swarm intelligence optimization approach: Sparrow search algorithm. Syst. Sci. Control Eng. 2020, 8, 22–34. [Google Scholar] [CrossRef]
  35. Fan, J.L.; Wu, L.F.; Zhang, F.C.; Xiang, Y.Z.; Zheng, J. Climate change effects on reference crop evapotranspiration across different climatic zones of China during 1956–2015. J. Hydrol. 2016, 542, 923–937. [Google Scholar] [CrossRef]
  36. Huang, G.; Zhu, Q.; Siew, C. Extreme learning machine: A new learning scheme of feedforward neural networks. In Proceedings of the 2004 IEEE International Joint Conference on Neural Networks, Budapest, Hungary, 25–29 July 2004. [Google Scholar]
  37. Ding, S.F.; Zhao, H.; Zhang, Y.N.; Xu, X.Z.; Nie, R. Extreme learning machine: Algorithm, theory and applications. Artif. Intell. Rev. 2015, 44, 103–115. [Google Scholar] [CrossRef]
  38. Dong, J.H.; Liu, X.G.; Huang, G.M.; Fan, J.L.; Wu, L.F.; Wu, J. Comparison of four bio-inspired algorithms to optimize KNEA for predicting monthly reference evapotranspiration in different climate zones of China. Comput. Electron. Agric. 2021, 186, 106211. [Google Scholar] [CrossRef]
  39. Yan, S.C.; Wu, L.F.; Fan, J.L.; Zhang, F.C.; Zou, Y.F.; Wu, Y. A novel hybrid WOA-XGB model for estimating daily reference evapotranspiration using local and external meteorological data: Applications in arid and humid regions of China. Agric. Water Manag. 2021, 244, 106594. [Google Scholar] [CrossRef]
  40. Jiang, S.Z.; Liang, C.; Cui, N.B.; Zhao, L.; Du, T.S.; Hu, X.T.; Feng, Y.; Guan, J.; Feng, Y. Impacts of climatic variables on reference evapotranspiration during growing season in Southwest China. Agric. Water Manag. 2019, 216, 365–378. [Google Scholar] [CrossRef]
  41. Wu, L.F.; Fan, J.L. Comparison of neuron-based, kernel-based, tree-based and curve-based machine learning models for predicting daily reference evapotranspiration. PLoS ONE 2019, 14, e0217520. [Google Scholar] [CrossRef]
  42. McVicar, T.R.; Roderick, M.L.; Donohue, R.J.; Li, L.T.; Van Niel, T.G.; Thomas, A.; Grieser, J.; Jhajharia, D.; Himri, Y.; Mahowald, N.M.; et al. Global review and synthesis of trends in observed terrestrial near-surface wind speeds: Implications for evaporation. J. Hydrol. 2012, 416, 182–205. [Google Scholar] [CrossRef]
  43. Adnan, R.M.; Dai, H.L.; Mostafa, R.R.; Islam, A.; Kisi, O.; Elbeltagi, A.; Zounemat-Kermani, M. Application of novel binary optimized machine learning models for monthly streamflow prediction. Appl. Water Sci. 2023, 13, 110. [Google Scholar] [CrossRef]
  44. Mostafa, R.R.; Kisi, O.; Adnan, R.M.; Sadeghifar, T.; Kuriqi, A. Modeling Potential Evapotranspiration by Improved Machine Learning Methods Using Limited Climatic Data. Water 2023, 15, 486. [Google Scholar] [CrossRef]
Figure 1. Geographical distribution of meteorological stations in different climatic regions of China.
Figure 2. Topology structure of ELM.
Figure 3. Input, optimization, and output flow of optimized ELM model.
Figure 4. Flow chart of bionic optimization algorithm.
Figure 5. Importance of meteorological factors to ET0 based on the path coefficient method.
Figure 6. Statistical indicators of each model under different input combinations.
Figure 7. Scatter plot of ET0 prediction and corresponding FAO-56 P-M values of four machine learning models in Linxia Station under six different input combinations (Note: thin lines represent 1:1 lines).
Figure 8. Percentage increase in RMSE values in the test phase of four machine learning models compared to the RMSE values in the training phase (average of the sites in the five climate regions).
Figure 9. Changes in statistical index values (average values of four weather stations in each climate region) of the mixed models compared with those of the ELM model in different climate regions.
Figure 10. Computational cost (model runtime) of four machine learning models with different input combinations (Combination 1: Tmax, Tmin, Rs; Combination 2: Tmax, Tmin, Rs, Ra; Combination 3: Tmax, Tmin, Rs, RH; Combination 4: Tmax, Tmin, Rs, U2; Combination 5: Tmax, Tmin, Rs, n; Combination 6: Tmax, Tmin, RH, U2).
Table 1. Geographical locations of selected weather stations and daily mean values of meteorological data from 1970 to 2019.
Climate RegionsStationLatitude
(N)
Longitude
(E)
Elevation
(m)
Tmax
(℃)
Tmin
(℃)
N
(h)
RH
(%)
U2
(m·s−1)
Rs
(MJ m−2·d−1)
Ra
(MJ m−2·d−1)
TCCKuerle41.7586.1793718.316.097.8945.211.3015.9727.61
Kashi39.4775.99128118.496.027.7149.931.0116.3228.41
Jiuquan39.7598.51147615.071.248.4247.101.2516.9128.31
Huhehaote40.81111.62107413.460.937.7252.161.0615.9627.93
TMCChangchun43.83125.2921511.340.837.1462.762.0714.5626.78
Zhengzhou34.72113.6410720.409.895.8064.291.3914.7730.02
Linxia35.60103.21188214.491.696.6666.330.7115.6029.75
Luochuan35.76109.43116615.634.836.8961.761.1915.8529.69
PMCXining36.65101.77224914.050.087.2756.240.8216.1429.42
Linzhi29.6494.36310016.294.055.4863.190.9414.9031.57
Naqu31.4892.0545007.13−7.737.5851.761.4817.3831.04
Changdu31.1497.18324416.820.936.5650.310.6416.1531.14
SMCWuhan30.60114.034821.4613.225.2677.021.0814.7631.30
Guangzhou23.16113.272126.5618.994.5776.981.0214.5733.23
Guiyang26.68106.62110019.6312.123.1577.531.2612.4732.41
Dujiangyan30.99103.65101919.2812.652.5279.680.6611.1531.18
TPMCHaikou20.03110.331528.1121.555.6183.181.5316.5033.93
Dongfang19.10108.657328.7122.267.0778.782.4218.6134.10
Lancang22.5699.93105427.4514.695.9177.410.4816.4533.38
Zhanjiang21.27110.372326.8420.765.2481.891.6215.8033.69
Table 2. Input combinations of meteorological variables for various machine learning models.

| ELM | TSO-ELM | SSA-ELM | AO-ELM | Input Combination |
|---|---|---|---|---|
| ELM1 | TSO-ELM1 | SSA-ELM1 | AO-ELM1 | Tmax, Tmin, Rs |
| ELM2 | TSO-ELM2 | SSA-ELM2 | AO-ELM2 | Tmax, Tmin, Rs, Ra |
| ELM3 | TSO-ELM3 | SSA-ELM3 | AO-ELM3 | Tmax, Tmin, Rs, RH |
| ELM4 | TSO-ELM4 | SSA-ELM4 | AO-ELM4 | Tmax, Tmin, Rs, U2 |
| ELM5 | TSO-ELM5 | SSA-ELM5 | AO-ELM5 | Tmax, Tmin, Rs, n |
| ELM6 | TSO-ELM6 | SSA-ELM6 | AO-ELM6 | Tmax, Tmin, RH, U2 |
Table 3. Average statistical values of the mixed machine learning models with six different input combinations in the training and testing phases for the TCC region of China.

| Model | Training R2 | Training RMSE | Training NRMSE | Training MAE | Testing R2 | Testing RMSE | Testing NRMSE | Testing MAE |
|---|---|---|---|---|---|---|---|---|
| ELM1 | 0.9022 | 0.6259 | 0.2178 | 0.4219 | 0.9046 | 0.6251 | 0.2060 | 0.4265 |
| TSO-ELM1 | 0.9041 | 0.6197 | 0.2155 | 0.4155 | 0.9016 | 0.6402 | 0.2118 | 0.4151 |
| SSA-ELM1 | 0.9038 | 0.6208 | 0.2160 | 0.4163 | 0.9010 | 0.6721 | 0.2123 | 0.4162 |
| AO-ELM1 | 0.9043 | 0.6191 | 0.2153 | 0.4144 | 0.9015 | 0.6402 | 0.2117 | 0.4150 |
| ELM2 | 0.9304 | 0.5233 | 0.1730 | 0.3592 | 0.9282 | 0.5475 | 0.1807 | 0.3659 |
| TSO-ELM2 | 0.9337 | 0.5136 | 0.1771 | 0.3519 | 0.9280 | 0.5481 | 0.1807 | 0.3628 |
| SSA-ELM2 | 0.9332 | 0.5157 | 0.1778 | 0.3546 | 0.9281 | 0.5478 | 0.1806 | 0.3637 |
| AO-ELM2 | 0.9340 | 0.5127 | 0.1768 | 0.3522 | 0.9278 | 0.5489 | 0.1810 | 0.3644 |
| ELM3 | 0.9349 | 0.5107 | 0.1773 | 0.3540 | 0.9238 | 0.5632 | 0.1924 | 0.3957 |
| TSO-ELM3 | 0.9370 | 0.5021 | 0.1743 | 0.3466 | 0.9235 | 0.5642 | 0.1851 | 0.3929 |
| SSA-ELM3 | 0.9372 | 0.5016 | 0.1742 | 0.3450 | 0.9150 | 0.5675 | 0.1862 | 0.3952 |
| AO-ELM3 | 0.9375 | 0.5002 | 0.1736 | 0.3433 | 0.9230 | 0.5657 | 0.1856 | 0.3916 |
| ELM4 | 0.9612 | 0.3794 | 0.1353 | 0.2527 | 0.9607 | 0.3948 | 0.1321 | 0.2720 |
| TSO-ELM4 | 0.9667 | 0.3542 | 0.1259 | 0.2225 | 0.9658 | 0.3724 | 0.1242 | 0.2450 |
| SSA-ELM4 | 0.9669 | 0.3537 | 0.1257 | 0.2214 | 0.9656 | 0.3734 | 0.1245 | 0.2466 |
| AO-ELM4 | 0.9671 | 0.3524 | 0.1241 | 0.2205 | 0.9659 | 0.3717 | 0.1240 | 0.2445 |
| ELM5 | 0.9317 | 0.5214 | 0.1797 | 0.3596 | 0.9277 | 0.5494 | 0.1812 | 0.3671 |
| TSO-ELM5 | 0.9332 | 0.5154 | 0.1776 | 0.3531 | 0.9284 | 0.5467 | 0.1803 | 0.3620 |
| SSA-ELM5 | 0.9333 | 0.5146 | 0.1777 | 0.3532 | 0.9279 | 0.5504 | 0.1815 | 0.3631 |
| AO-ELM5 | 0.9336 | 0.5143 | 0.1773 | 0.3533 | 0.9280 | 0.5481 | 0.1807 | 0.3639 |
| ELM6 | 0.9425 | 0.4715 | 0.1667 | 0.3535 | 0.9440 | 0.4818 | 0.1597 | 0.3635 |
| TSO-ELM6 | 0.9463 | 0.4560 | 0.1612 | 0.3430 | 0.9467 | 0.4701 | 0.1560 | 0.3567 |
| SSA-ELM6 | 0.9459 | 0.4578 | 0.1618 | 0.3448 | 0.9461 | 0.4728 | 0.1568 | 0.3583 |
| AO-ELM6 | 0.9468 | 0.4538 | 0.1604 | 0.3408 | 0.9472 | 0.4677 | 0.1552 | 0.3545 |

Note: The best indicator values under each input combination are highlighted in bold.
Table 4. Average statistical values of the mixed machine learning models with six different input combinations in the training and testing phases for the TMC region of China.

| Model | Training R2 | Training RMSE | Training NRMSE | Training MAE | Testing R2 | Testing RMSE | Testing NRMSE | Testing MAE |
|---|---|---|---|---|---|---|---|---|
| ELM1 | 0.8833 | 0.5658 | 0.2173 | 0.4026 | 0.8866 | 0.5426 | 0.2107 | 0.3924 |
| TSO-ELM1 | 0.8869 | 0.5570 | 0.2140 | 0.3940 | 0.8901 | 0.5345 | 0.2076 | 0.3833 |
| SSA-ELM1 | 0.8868 | 0.5573 | 0.2142 | 0.3945 | 0.8905 | 0.5338 | 0.2075 | 0.3838 |
| AO-ELM1 | 0.8876 | 0.5555 | 0.2135 | 0.3930 | 0.8903 | 0.5340 | 0.2075 | 0.3844 |
| ELM2 | 0.9289 | 0.4387 | 0.1676 | 0.3032 | 0.9458 | 0.3749 | 0.1462 | 0.2671 |
| TSO-ELM2 | 0.9312 | 0.4312 | 0.1647 | 0.2988 | 0.9481 | 0.3666 | 0.1429 | 0.2625 |
| SSA-ELM2 | 0.9311 | 0.4317 | 0.1648 | 0.2988 | 0.9479 | 0.3673 | 0.1432 | 0.2616 |
| AO-ELM2 | 0.9319 | 0.4293 | 0.1639 | 0.2966 | 0.9482 | 0.3661 | 0.1426 | 0.2605 |
| ELM3 | 0.9254 | 0.4561 | 0.1739 | 0.3259 | 0.9097 | 0.4769 | 0.1836 | 0.3509 |
| TSO-ELM3 | 0.9299 | 0.4370 | 0.1684 | 0.3109 | 0.9092 | 0.4700 | 0.1786 | 0.3423 |
| SSA-ELM3 | 0.9301 | 0.4366 | 0.1682 | 0.3105 | 0.9094 | 0.4799 | 0.1774 | 0.3517 |
| AO-ELM3 | 0.9306 | 0.4351 | 0.1677 | 0.3105 | 0.9097 | 0.4743 | 0.1721 | 0.3412 |
| ELM4 | 0.9324 | 0.4310 | 0.1690 | 0.3012 | 0.9195 | 0.4589 | 0.1802 | 0.3223 |
| TSO-ELM4 | 0.9341 | 0.4104 | 0.1610 | 0.2763 | 0.9243 | 0.4447 | 0.1746 | 0.3001 |
| SSA-ELM4 | 0.9378 | 0.4139 | 0.1623 | 0.2789 | 0.9241 | 0.4455 | 0.1749 | 0.3026 |
| AO-ELM4 | 0.9394 | 0.4083 | 0.1601 | 0.2735 | 0.9249 | 0.4431 | 0.1738 | 0.2987 |
| ELM5 | 0.9288 | 0.4391 | 0.1678 | 0.3042 | 0.9455 | 0.3762 | 0.1467 | 0.2676 |
| TSO-ELM5 | 0.9313 | 0.4315 | 0.1649 | 0.2985 | 0.9473 | 0.3695 | 0.1440 | 0.2629 |
| SSA-ELM5 | 0.9310 | 0.4323 | 0.1871 | 0.2992 | 0.9476 | 0.3682 | 0.1436 | 0.2628 |
| AO-ELM5 | 0.9322 | 0.4286 | 0.1637 | 0.2962 | 0.9486 | 0.3649 | 0.1423 | 0.2590 |
| ELM6 | 0.9215 | 0.4566 | 0.1865 | 0.3522 | 0.9089 | 0.4874 | 0.1919 | 0.3731 |
| TSO-ELM6 | 0.9290 | 0.4352 | 0.1714 | 0.3305 | 0.9181 | 0.4631 | 0.1825 | 0.3505 |
| SSA-ELM6 | 0.9284 | 0.4378 | 0.1720 | 0.3325 | 0.9176 | 0.4644 | 0.1830 | 0.3509 |
| AO-ELM6 | 0.9294 | 0.4338 | 0.1708 | 0.3292 | 0.9194 | 0.4595 | 0.1810 | 0.3468 |

Note: The best indicator values under each input combination are highlighted in bold.
Table 5. Average statistical values of the mixed machine learning models with six different input combinations in the training and testing phases for the PMC region of China.

| Model | Training R2 | Training RMSE | Training NRMSE | Training MAE | Testing R2 | Testing RMSE | Testing NRMSE | Testing MAE |
|---|---|---|---|---|---|---|---|---|
| ELM1 | 0.9018 | 0.3669 | 0.1505 | 0.2734 | 0.8935 | 0.4895 | 0.1658 | 0.3187 |
| TSO-ELM1 | 0.9032 | 0.3638 | 0.1492 | 0.2699 | 0.9017 | 0.4483 | 0.1522 | 0.2950 |
| SSA-ELM1 | 0.9034 | 0.3635 | 0.1491 | 0.2698 | 0.8990 | 0.4503 | 0.1564 | 0.2979 |
| AO-ELM1 | 0.9036 | 0.3630 | 0.1489 | 0.2693 | 0.9020 | 0.4440 | 0.1516 | 0.2943 |
| ELM2 | 0.9570 | 0.2368 | 0.0973 | 0.1744 | 0.9608 | 0.2594 | 0.1122 | 0.2216 |
| TSO-ELM2 | 0.9619 | 0.2128 | 0.0880 | 0.1717 | 0.9614 | 0.2580 | 0.1117 | 0.2158 |
| SSA-ELM2 | 0.9582 | 0.2336 | 0.0961 | 0.1724 | 0.9613 | 0.2580 | 0.1115 | 0.2155 |
| AO-ELM2 | 0.9587 | 0.2321 | 0.0954 | 0.1710 | 0.9618 | 0.2575 | 0.1111 | 0.2149 |
| ELM3 | 0.9265 | 0.3182 | 0.1304 | 0.2399 | 0.9219 | 0.3672 | 0.1689 | 0.2721 |
| TSO-ELM3 | 0.9311 | 0.3083 | 0.1260 | 0.2300 | 0.9284 | 0.3507 | 0.1598 | 0.2517 |
| SSA-ELM3 | 0.9306 | 0.3085 | 0.1264 | 0.2309 | 0.9268 | 0.3512 | 0.1608 | 0.2520 |
| AO-ELM3 | 0.9315 | 0.3065 | 0.1256 | 0.2287 | 0.9245 | 0.3522 | 0.1614 | 0.2541 |
| ELM4 | 0.9344 | 0.3018 | 0.1237 | 0.2127 | 0.9317 | 0.3649 | 0.1589 | 0.3018 |
| TSO-ELM4 | 0.9387 | 0.2920 | 0.1196 | 0.2015 | 0.9358 | 0.3592 | 0.1561 | 0.2954 |
| SSA-ELM4 | 0.9383 | 0.2932 | 0.1201 | 0.2031 | 0.9338 | 0.3619 | 0.1571 | 0.3004 |
| AO-ELM4 | 0.9390 | 0.2912 | 0.1193 | 0.2003 | 0.9346 | 0.3619 | 0.1562 | 0.2955 |
| ELM5 | 0.9559 | 0.2396 | 0.0985 | 0.1774 | 0.9615 | 0.3313 | 0.1070 | 0.2149 |
| TSO-ELM5 | 0.9580 | 0.2338 | 0.0961 | 0.1722 | 0.9652 | 0.3155 | 0.1017 | 0.2032 |
| SSA-ELM5 | 0.9580 | 0.2342 | 0.0963 | 0.1725 | 0.9638 | 0.3180 | 0.1029 | 0.2049 |
| AO-ELM5 | 0.9582 | 0.2125 | 0.0959 | 0.1724 | 0.9650 | 0.3148 | 0.1014 | 0.2028 |
| ELM6 | 0.8839 | 0.4036 | 0.1649 | 0.3172 | 0.8825 | 0.4436 | 0.1658 | 0.3248 |
| TSO-ELM6 | 0.8917 | 0.3887 | 0.1588 | 0.3020 | 0.8867 | 0.4354 | 0.1621 | 0.3180 |
| SSA-ELM6 | 0.8903 | 0.3914 | 0.1600 | 0.3037 | 0.8852 | 0.4387 | 0.1635 | 0.3201 |
| AO-ELM6 | 0.8921 | 0.3882 | 0.1587 | 0.3009 | 0.8868 | 0.4352 | 0.1617 | 0.3171 |

Note: The best indicator values under each input combination are highlighted in bold.
Table 6. Average statistical values of the mixed machine learning models with six different input combinations in the training and testing phases for the SMC region of China.

| Model | Training R2 | Training RMSE | Training NRMSE | Training MAE | Testing R2 | Testing RMSE | Testing NRMSE | Testing MAE |
|---|---|---|---|---|---|---|---|---|
| ELM1 | 0.8486 | 0.5191 | 0.2049 | 0.3722 | 0.8326 | 0.5707 | 0.2181 | 0.4119 |
| TSO-ELM1 | 0.8529 | 0.5116 | 0.2019 | 0.3622 | 0.8344 | 0.5659 | 0.2168 | 0.4009 |
| SSA-ELM1 | 0.8533 | 0.5111 | 0.2017 | 0.3628 | 0.8363 | 0.5627 | 0.2152 | 0.4005 |
| AO-ELM1 | 0.8534 | 0.5109 | 0.2016 | 0.3619 | 0.8361 | 0.5629 | 0.2159 | 0.3882 |
| ELM2 | 0.9678 | 0.2386 | 0.0953 | 0.1639 | 0.9671 | 0.2360 | 0.0891 | 0.1605 |
| TSO-ELM2 | 0.9690 | 0.2346 | 0.0938 | 0.1606 | 0.9678 | 0.2335 | 0.0881 | 0.1608 |
| SSA-ELM2 | 0.9686 | 0.2349 | 0.0939 | 0.1609 | 0.9676 | 0.2344 | 0.0884 | 0.1611 |
| AO-ELM2 | 0.9692 | 0.2340 | 0.0936 | 0.1601 | 0.9678 | 0.2341 | 0.0882 | 0.1606 |
| ELM3 | 0.8853 | 0.4521 | 0.1778 | 0.3254 | 0.8770 | 0.4450 | 0.1830 | 0.3335 |
| TSO-ELM3 | 0.8941 | 0.4361 | 0.1713 | 0.2818 | 0.8858 | 0.4318 | 0.1744 | 0.3146 |
| SSA-ELM3 | 0.8941 | 0.4359 | 0.1717 | 0.3081 | 0.8866 | 0.4298 | 0.1745 | 0.3082 |
| AO-ELM3 | 0.8939 | 0.4365 | 0.1717 | 0.3072 | 0.8851 | 0.4329 | 0.1753 | 0.3076 |
| ELM4 | 0.8753 | 0.4754 | 0.1868 | 0.3476 | 0.8632 | 0.5013 | 0.1982 | 0.3546 |
| TSO-ELM4 | 0.8851 | 0.4548 | 0.1788 | 0.3238 | 0.8701 | 0.4853 | 0.1945 | 0.3421 |
| SSA-ELM4 | 0.8845 | 0.4564 | 0.1794 | 0.3254 | 0.8684 | 0.4884 | 0.1960 | 0.3456 |
| AO-ELM4 | 0.8859 | 0.4536 | 0.1784 | 0.3222 | 0.8693 | 0.4872 | 0.1956 | 0.3421 |
| ELM5 | 0.9679 | 0.2392 | 0.0957 | 0.1649 | 0.9675 | 0.2281 | 0.0937 | 0.1586 |
| TSO-ELM5 | 0.9689 | 0.2347 | 0.0938 | 0.1609 | 0.9705 | 0.2329 | 0.0896 | 0.1539 |
| SSA-ELM5 | 0.9689 | 0.2352 | 0.0948 | 0.1609 | 0.9705 | 0.2320 | 0.0918 | 0.1565 |
| AO-ELM5 | 0.9691 | 0.2343 | 0.0937 | 0.1605 | 0.9707 | 0.2304 | 0.0882 | 0.1533 |
| ELM6 | 0.8758 | 0.4789 | 0.1889 | 0.3621 | 0.8705 | 0.4425 | 0.1870 | 0.3585 |
| TSO-ELM6 | 0.8850 | 0.4595 | 0.1813 | 0.3477 | 0.8754 | 0.4328 | 0.1775 | 0.3477 |
| SSA-ELM6 | 0.8849 | 0.4596 | 0.1813 | 0.3431 | 0.8719 | 0.4354 | 0.1817 | 0.3460 |
| AO-ELM6 | 0.8857 | 0.4583 | 0.1807 | 0.3426 | 0.8764 | 0.4289 | 0.1762 | 0.3435 |

Note: The best indicator values under each input combination are highlighted in bold.
Table 7. Average statistical values of the mixed machine learning models with six different input combinations in the training and testing phases for the TPMC region of China.

| Model | Training R2 | Training RMSE | Training NRMSE | Training MAE | Testing R2 | Testing RMSE | Testing NRMSE | Testing MAE |
|---|---|---|---|---|---|---|---|---|
| ELM1 | 0.8047 | 0.5948 | 0.1697 | 0.4685 | 0.8097 | 0.5892 | 0.1647 | 0.4656 |
| TSO-ELM1 | 0.8056 | 0.5881 | 0.1679 | 0.4607 | 0.8140 | 0.5825 | 0.1629 | 0.4578 |
| SSA-ELM1 | 0.8050 | 0.5889 | 0.1681 | 0.4620 | 0.8129 | 0.5843 | 0.1634 | 0.4601 |
| AO-ELM1 | 0.8058 | 0.5877 | 0.1678 | 0.4599 | 0.8137 | 0.5830 | 0.1631 | 0.4577 |
| ELM2 | 0.9626 | 0.2524 | 0.0712 | 0.1807 | 0.9579 | 0.2765 | 0.0762 | 0.2006 |
| TSO-ELM2 | 0.9653 | 0.2438 | 0.0689 | 0.1726 | 0.9599 | 0.2700 | 0.0745 | 0.1952 |
| SSA-ELM2 | 0.9651 | 0.2444 | 0.0691 | 0.1731 | 0.9598 | 0.2703 | 0.0745 | 0.1954 |
| AO-ELM2 | 0.9656 | 0.2427 | 0.0684 | 0.1715 | 0.9601 | 0.2691 | 0.0743 | 0.1942 |
| ELM3 | 0.8573 | 0.5025 | 0.1446 | 0.3897 | 0.8637 | 0.4969 | 0.1392 | 0.3823 |
| TSO-ELM3 | 0.8632 | 0.4918 | 0.1416 | 0.3859 | 0.8664 | 0.4920 | 0.1380 | 0.3749 |
| SSA-ELM3 | 0.8625 | 0.4930 | 0.1419 | 0.3799 | 0.8643 | 0.4960 | 0.1391 | 0.3796 |
| AO-ELM3 | 0.8639 | 0.4905 | 0.1412 | 0.3767 | 0.8653 | 0.4938 | 0.1386 | 0.3778 |
| ELM4 | 0.8148 | 0.5742 | 0.1638 | 0.4511 | 0.8218 | 0.5708 | 0.1594 | 0.4532 |
| TSO-ELM4 | 0.8244 | 0.5593 | 0.1596 | 0.4380 | 0.8311 | 0.5555 | 0.1552 | 0.4397 |
| SSA-ELM4 | 0.8243 | 0.5593 | 0.1423 | 0.4379 | 0.8308 | 0.5561 | 0.1554 | 0.4362 |
| AO-ELM4 | 0.8257 | 0.5572 | 0.1591 | 0.4358 | 0.8295 | 0.5584 | 0.1560 | 0.4353 |
| ELM5 | 0.9638 | 0.2491 | 0.0704 | 0.1779 | 0.9592 | 0.2725 | 0.0752 | 0.1973 |
| TSO-ELM5 | 0.9649 | 0.2475 | 0.0692 | 0.1735 | 0.9596 | 0.2707 | 0.0747 | 0.1961 |
| SSA-ELM5 | 0.9650 | 0.2449 | 0.0692 | 0.1733 | 0.9600 | 0.2697 | 0.0744 | 0.1946 |
| AO-ELM5 | 0.9653 | 0.2427 | 0.0688 | 0.1717 | 0.9601 | 0.2693 | 0.0743 | 0.1948 |
| ELM6 | 0.8531 | 0.5098 | 0.1467 | 0.3957 | 0.8609 | 0.5003 | 0.1408 | 0.3905 |
| TSO-ELM6 | 0.8573 | 0.5023 | 0.1446 | 0.3883 | 0.8647 | 0.4936 | 0.1388 | 0.3830 |
| SSA-ELM6 | 0.8578 | 0.5014 | 0.1444 | 0.3876 | 0.8653 | 0.4925 | 0.1386 | 0.3820 |
| AO-ELM6 | 0.8587 | 0.4999 | 0.1439 | 0.3859 | 0.8645 | 0.4936 | 0.1389 | 0.3819 |

Note: The best indicator values under each input combination are highlighted in bold.
Table 8. Error statistics of the best-performing machine learning model and input combination for each weather station in the different climate regions of China.

| Region | Station | Model ID | R2 | RMSE | MAE | NRMSE |
|---|---|---|---|---|---|---|
| TCC | Kuerle | TSO-ELM4 | 0.9752 | 0.3453 | 0.2202 | 0.1055 |
| TCC | Kashi | AO-ELM4 | 0.9725 | 0.3717 | 0.2516 | 0.1163 |
| TCC | Jiuquan | AO-ELM4 | 0.9628 | 0.3651 | 0.2351 | 0.1267 |
| TCC | Huhehaote | TSO-ELM4 | 0.9537 | 0.4003 | 0.2648 | 0.1460 |
| TMC | Changchun | TSO-ELM3 | 0.9307 | 0.4550 | 0.3199 | 0.1852 |
| TMC | Zhengzhou | AO-ELM5 | 0.9291 | 0.4485 | 0.3373 | 0.1488 |
| TMC | Linxia | AO-ELM5 | 0.9809 | 0.1942 | 0.1290 | 0.0869 |
| TMC | Luochuan | AO-ELM5 | 0.9569 | 0.3311 | 0.2311 | 0.1285 |
| PMC | Xining | AO-ELM5 | 0.9864 | 0.1655 | 0.1306 | 0.0704 |
| PMC | Linzhi | AO-ELM5 | 0.9471 | 0.4417 | 0.2909 | 0.1231 |
| PMC | Naqu | AO-ELM2 | 0.9736 | 0.2219 | 0.2391 | 0.1022 |
| PMC | Changdu | AO-ELM5 | 0.9765 | 0.2098 | 0.0929 | 0.1201 |
| SMC | Wuhan | TSO-ELM2 | 0.9803 | 0.2265 | 0.1542 | 0.0862 |
| SMC | Guangzhou | TSO-ELM5 | 0.9567 | 0.2617 | 0.1777 | 0.0865 |
| SMC | Guiyang | AO-ELM5 | 0.9733 | 0.2077 | 0.1365 | 0.0918 |
| SMC | Dujiangyan | SSA-ELM2 | 0.9739 | 0.1921 | 0.1286 | 0.0927 |
| TPMC | Haikou | AO-ELM5 | 0.9532 | 0.3192 | 0.2167 | 0.0898 |
| TPMC | Dongfang | SSA-ELM5 | 0.9387 | 0.3731 | 0.2883 | 0.0886 |
| TPMC | Lancang | AO-ELM2 | 0.9834 | 0.1271 | 0.0886 | 0.0409 |
| TPMC | Zhanjiang | AO-ELM5 | 0.9662 | 0.2536 | 0.1825 | 0.0768 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
