Daily Reference Evapotranspiration Derived from Hourly Timestep Using Different Forms of Penman–Monteith Model in Arid Climates

Alazba, A A; Mattar, Mohamed A.; El-Shafei, Ahmed; Radwan, Farid; Ezzeldin, Mahmoud; Alrdyan, Nasser

doi:10.3390/w17152272

Open AccessArticle

Daily Reference Evapotranspiration Derived from Hourly Timestep Using Different Forms of Penman–Monteith Model in Arid Climates

by

A A Alazba

^1,2

,

Mohamed A. Mattar

²

,

Ahmed El-Shafei

²

,

Farid Radwan

^1,*

,

Mahmoud Ezzeldin

^1,2,*

and

Nasser Alrdyan

¹

Alamoudi Water Research, King Saud University, P.O. Box 2460, Riyadh 11451, Saudi Arabia

²

Department of Agricultural Engineering, College of Food & Agriculture Sciences, King Saud University, P.O. Box 2460, Riyadh 11451, Saudi Arabia

^*

Authors to whom correspondence should be addressed.

Water 2025, 17(15), 2272; https://doi.org/10.3390/w17152272

Submission received: 27 June 2025 / Revised: 23 July 2025 / Accepted: 28 July 2025 / Published: 30 July 2025

(This article belongs to the Section Hydrology)

Download

Browse Figures

Versions Notes

Abstract

In arid and semi-arid climates, where water scarcity is a persistent challenge, accurately estimating reference evapotranspiration (ET) becomes essential for sustainable water management and agricultural planning. The objectives of this study are to compare hourly ET among P–M ASCE, P–M FAO, and P–M KSA mathematical models. In addition to the accuracy assessment of daily ET derived from hourly timestep calculations for the P–M ASCE, P–M FAO, and P–M KSA. To achieve these goals, a total of 525,600-min data points from the Riyadh region, KSA, were used to compute the reference ET at multiple temporal resolutions: hourly, daily, hourly averaged over 24 h, and daily as the sum of 24 h values, across all selected Penman–Monteith (P–M) models. For hourly investigation, the comparison between reference ET computed as average hourly values and as daily/24 h values revealed statistically and practically significant differences. The Wilcoxon test confirmed a statistically significant difference (p < 0.0001) with R² of 94.75% for ASCE, 94.87% for KSA at h_plt = 50 cm, 92.41% for FAO, and 92.44% for KSA at h_plt = 12 cm. For daily investigation, comparing the sum of 24 h ET computations to daily ET measurements revealed an underestimation of daily ET values. The Wilcoxon test confirmed a statistically significant difference (p < 0.0001), with R² exceeding 90% for all studied reference ET models. This comprehensive approach enabled a rigorous evaluation of reference ET dynamics under hyper-arid climatic conditions, which are characteristic of central Saudi Arabia. The findings contribute to the growing body of literature emphasizing the importance of high-frequency meteorological data for improving ET estimation accuracy in arid and semi-arid regions.

Keywords:

reference evapotranspiration; Penman–Monteith; ASCE; FAO; KSA; various plant heights; arid climate

1. Introduction

Water scarcity in Saudi Arabia is further exacerbated by the absence of permanent rivers or lakes [1], forcing the country to rely heavily on non-renewable groundwater resources and energy-intensive desalination [2]. According to Chowdhury and Al-Zahrani [3], Saudi Arabia’s annual water demand has been increasing at an alarming rate, with agriculture consuming approximately 85% of the total water usage. This situation necessitates the development and implementation of efficient water management strategies, particularly in the agricultural sector, to ensure long-term water security and sustainability [4]. In arid and semi-arid regions where water scarcity is a persistent challenge [5], accurate estimation of reference evapotranspiration (ET) becomes essential for sustainable water management and agricultural planning [6]. ET represents the combined process of water loss from the Earth’s surface through evaporation from soil and water bodies and transpiration from vegetation [7]. As a critical component of the hydrological cycle, evapotranspiration significantly influences water resource availability [8], agricultural water productivity [9], and ecosystem sustainability [10].

By definition, reference ET is the ET rate from a hypothetical reference plant with specific characteristics under given climatic conditions and serves as a standard measure for estimating plant water requirements and designing irrigation systems [11]. Aly, Darwish [12] stated that reference ET is a complex system, influenced by non-linear factors such as temperature (T), solar radiation (RS), relative humidity (RH), and wind speed (U₂) [13]. The concept of reference ET and methods for its estimation have evolved significantly over the past century [14]. Several conventional methods have been developed and applied for estimating reference ET [15], each with its strengths and limitations [16]. These methods can be broadly categorized into temperature-based, radiation-based, and combination approaches [17]. Temperature-based methods, such as the Hargreaves–Samani equation [18], rely primarily on air temperature data and have gained popularity due to their simplicity and minimal data requirements. According to Raziei and Pereira [19], these methods perform reasonably well in arid and semi-arid regions where temperature is a dominant factor influencing ET. However, their accuracy may be compromised in humid regions or areas with significant advection effects [20]. Radiation-based methods, including the Priestley–Taylor equation [21], incorporate solar radiation data along with temperature to estimate reference ET [20]. These methods are based on the principle that energy availability is a primary driver of ET processes [22,23]. As highlighted by Vishwakarma, Pandey [24] radiation-based methods often provide more accurate estimates than temperature-based approaches, particularly in regions with high RS levels such as Saudi Arabia.

Combination methods, exemplified by the P–M ASCE [25], P–M FAO [26], and P–M KSA [27] models, integrate both energy balance and mass transfer principles to provide comprehensive reference ET estimates [16]. These methods account for various meteorological parameters and have demonstrated superior performance across diverse climatic conditions [7]. However, as noted by Kisi, Sanikhani [28], the extensive data requirements of these methods present significant challenges in data-scarce regions. According to Subedi and Chávez [16], the P–M FAO considered very accurate for calculating grass reference ET on a daily basis but may not be applicable for applying to hourly timestep. Moreover, the P–M ASCE can calculate both grass and alfalfa plant reference ET on both hourly and daily timesteps, but the plant coefficient (K_plt) needs to be developed also for alfalfa reference surfaces to avoid errors in estimating reference ET due to using fixed canopy surface resistance (r_c) for the entire day.

Zhang, Chen [29] examined five different ways to scale up hourly latent heat data to get a daily estimate of reference ET under different energy and crop-growing conditions. The study indicated that the evaporative fraction (EF) and crop coefficient (Kc) approaches usually worked better and were more consistent, especially between 11:00 and 15:00, when the average R² was 0.96. Using machine learning and deep learning models like CNN, ANN, RF, and XGBoost, Ferreira and da Cunha [30] looked at how to use limited hourly meteorological data, specifically temperature and relative humidity, to estimate daily reference ET. Results show that convolutional neural networks (CNNs) trained with 24 h hourly data and hourly extraterrestrial radiation did much better than traditional models that used daily data. In regional scenarios, RMSE was reduced by up to 28.2%, and NSE and R² were increased by up to 21.7% and 11.4%, respectively. Using long-term weather data, Djaman, Irmak [31] compared five methods for estimating reference ET—FAO-56 Penman–Monteith, Hargreaves–Samani, Blaney–Criddle, Priestley–Taylor, and radiation-based models—at three dry sites in Saudi Arabia. The results show that the FAO-56 technique is still the best, but simpler empirical models like Hargreaves–Samani can give good estimates when there is not a lot of data available. Calibrating them for a specific site makes them work better. Ji, Chen [32] studied how well eleven reference ET equations worked by comparing them to the FAO-56 Penman–Monteith standard in Riyadh’s hot, dry climate. The results show that many empirical models were very different from FAO-56 values. However, the Hargreaves–Samani and Makkink equations did fairly well, suggesting that they could be good for estimating reference ET in dry areas with little data.

Despite the widespread use of the Penman–Monteith (P–M) model for estimating reference ET, significant inconsistencies arise when applying its different formulations—namely, those prescribed by ASCE, FAO, and country-specific modifications such as the P–M KSA model. While previous research has evaluated these models under temperate or semi-arid environments, comparative studies under hyper-arid conditions, such as those prevailing in Saudi Arabia, remain scarce. Additionally, little attention has been given to the influence of plant height (h_plt) on reference ET calculations, especially when derived from high-frequency (hourly) meteorological inputs. Most existing validations of P–M formulations rely on daily or coarser temporal scales, which may overlook the intra-day variability crucial for precise water budgeting in extreme climates.

The present study introduces several contributions to the field of reference ET modeling. Firstly, it provides a comparative evaluation of P–M ASCE, P–M FAO, and P–M KSA formulations using hourly timestep meteorological data under hyper-arid conditions, capturing short-term atmospheric fluctuations often missed in daily-scale studies. Secondly, it includes a systematic assessment of model sensitivity to h_plt, a parameter often fixed in conventional studies, enabling more realistic simulations across diverse agricultural applications. Thirdly, it analyzes the accuracy of daily reference ET derived from hourly reference ET, offering insight into cumulative error propagation and temporal aggregation effects across the three models. Collectively, these contributions address a critical knowledge gap by extending the validation of reference ET models into hyper-arid climates using a high-resolution temporal framework and contextualized agronomic parameters, ultimately supporting improved irrigation scheduling and water resource planning in regions such as the Riyadh region of Saudi Arabia.

2. Methodology

2.1. Site Selection

Riyadh Province, geographically located inside the Kingdom of Saudi Arabia, encompasses a vast area of approximately 404,240 km² (Figure 1), making it the nation’s second-largest administrative province by land area after the Eastern Region and the most populous, with over 8.5 million individuals as of the 2022 census [33]. The province is situated on the Najd Plateau, a notable ecological feature marked by its steep topography and arid weather [34]. The region’s geography is dominated by the Tuwaiq Mountain range, also known as Jabal Tuwaiq, which extends roughly 1000 km from the northern boundary near Al-Zulfi to the southern edge adjoining the Rub’ al Khali (Empty Quarter) desert [35]. Various wadis dot this escarpment, including Wadi Hanifah, which historically supported agricultural activity and today helps to control urban water supplies. Based on the Köppen climatic classification system, Riyadh Province generally falls within the hot desert climate (BWh) [36]. While winter temperatures can drop to almost freezing, the area suffers extreme heat; summer highs often top 45 °C [37]. Mostly occurring in the winter, annual precipitation is moderate and varies between 85 and 138 mm [38].

From a minimum of 15% in the parched summer months to about 42% in the winter, relative humidity swings greatly [6]. The area is also exposed to recurrent sandstorms, being surrounded by desert dunes. The climatic conditions provide high potential evapotranspiration rates, approximated at 9.5 mm daily [39], which emphasizes the need for exact evapotranspiration modeling for efficient water resource management in the area. The dry climate of the province presents major difficulties for agriculture that call for the use of advanced irrigation techniques and water-saving strategies.

2.2. P–M Model Formulation

Reference ET can be quantified using both direct and indirect methods. Direct measurement techniques such as weighing lysimeters, pan evaporimeters, and water balance approaches are commonly employed in field applications. These methods are known for their high accuracy; however, their widespread use is often constrained by the significant costs associated with installation, operation, and maintenance [40]. Consequently, indirect methods are generally preferred, particularly in large-scale or long-term studies. Mathematical models, which estimate reference ET based on observed meteorological parameters, are widely used due to their cost-effectiveness and operational simplicity. These models typically incorporate statistically derived coefficients that are calibrated for specific climatic or regional conditions, ensuring reasonable estimation accuracy [41,42]. In this study, three widely recognized Penman–Monteith-based models were employed to estimate reference ET across diverse geographical regions: the ASCE standardized model (ET_r) [25], the FAO model (ET_o) [26], and the KSA model (ET_ref) [27]. These models are grounded in physical principles and incorporate both physiological and aerodynamic parameters, making them suitable for a broad range of climatic conditions. They are known as a combination method that integrates key meteorological variables, including air temperature (T), solar radiation (RS), relative humidity (RH), wind speed (U₂), and saturation vapor pressure, to provide robust estimates of reference ET. The mathematical forms of the utilized models for the daily time step are represented as follows.

E T_{r} = \frac{0.408 ∆ (R n - G) + γ \frac{1600}{T a + 273} u_{2} (e_{s} - e_{a})}{∆ + γ (1 + 0.38 u_{2})}

(1)

E T_{o} = \frac{0.408 ∆ (R n - G) + γ \frac{900}{T a + 273} u_{2} (e_{s} - e_{a})}{∆ + γ (1 + 0.34 u_{2})}

(2)

E T_{r e f} = λ^{- 1} [\frac{Δ}{Δ + γ^{*}} (R_{n} - G) + \frac{γ \frac{1.854 \times 10^{5} λ / r_{a}}{T_{a} + 273}}{Δ + γ (1 + r_{s} / r_{a})} (e_{s} - e_{a})]

(3)

The mathematical forms of the utilized models for the hourly time step are represented as follows.

E T_{r} = \frac{0.408 ∆ (R n - G) + γ \frac{66}{T a + 273} u_{2} (e_{s} - e_{a})}{∆ + γ (1 + 0.38 u_{2})}

(4)

E T_{o} = \frac{0.408 ∆ (R n - G) + γ \frac{37}{T a + 273} u_{2} (e_{s} - e_{a})}{∆ + γ (1 + 0.34 u_{2})}

(5)

E T_{r e f} = λ^{- 1} [\frac{Δ}{Δ + γ^{*}} (R_{n} - G) + \frac{γ \frac{7.725 \times 10^3 λ / r_{a}}{T_{a} + 273}}{Δ + γ (1 + r_{s} / r_{a})} (e_{s} - e_{a})]

(6)

where ET_r, ET_o, and ET_ref refer to reference evapotranspiration (mm·d⁻¹ or h⁻¹) for ASCE, FAO, and KSA P–M models, respectively; λ represents the latent heat of vaporization (MJ·kg⁻¹); λ is commonly approximated at 2.45 (MJ·kg⁻¹), or it can be precisely determined using the formula

[λ = 2.501 - 0.002361 T a]

, where T_a represents the average air temperature in (°C),

∆

denotes the slope of the saturation vapor pressure–temperature curve (kPa·°C⁻¹), γ* signifies the modified psychometric constant (kPa·°C⁻¹), R_n indicates the calculated net radiation at the plant surface (MJ·m⁻²·d⁻¹ or h⁻¹), G is the soil heat flux density at the soil surface (MJ·m⁻²·d⁻¹ or h⁻¹), γ stands for the psychrometric constant (kPa·°C⁻¹), Additionally, e_s represents the saturation vapor pressure (kPa), e_a is the mean actual vapor pressure (kPa), T indicates the air temperature (°C), and r_a symbolizes the aerodynamic resistance (s·m⁻¹).

The common suitable single form for daily and hourly timesteps representing the above forms is proposed as follows:

E T_{r e f} = λ^{- 1} [\frac{Δ}{Δ + γ^{*}} (R_{n} - G) + \frac{γ}{Δ + γ^{*}} k (e_{s} - e_{a})]

(7)

The coefficients K and

γ^{*}

could be obtained from Table 1.

r_a and r_s could be calculated as follows:

r_{a} = \frac{1 - l n (h_{p l t})}{0.0151 u_{2}}

(8)

r_{s} = \frac{1}{0.0275 + 0.0075 l n (h_{p l t})}

(9)

According to Jensin M. E. [25] and Allen, Pereira [26], the surface resistance and albedo were 70 s·m⁻¹ and 0.23, respectively. Utilization of fixed values of surface resistance and albedo for ASCE and FAO formulations enables direct comparison between tall (50 cm) and short (12 cm) reference ET rates under uniform atmospheric conditions [16].

2.3. Data Collection

The investigation of climatic parameters in the study area was based on a reliable dataset obtained from the weather station at the educational farm of King Saud University in Riyadh, Saudi Arabia (Figure 1). This dataset included key meteorological variables: solar radiation (RS), relative humidity (RH), wind speed at 2 m (U₂), and air temperature (T), recorded at one-minute intervals over five years (2020–2024). These data were carefully collected, quality-checked, and consolidated into one representative year, comprising 525,600 one-minute records used to compute reference ET at multiple temporal scales: hourly, daily, 24 h averages, and daily totals from hourly values, across all selected Penman–Monteith (P–M) models.

To ensure data integrity, several imputation and correction steps were applied. Short gaps (less than 3 consecutive hours) were filled using linear interpolation, while longer gaps led to exclusion of the corresponding days. Outliers were identified using physical plausibility thresholds (e.g., RH > 100%, extreme RS or U₂ values) and removed if confirmed as erroneous. All variables were subjected to unit consistency checks and time alignment to ensure accurate parameter matching for reference ET calculations. This rigorous approach enabled reliable assessment of reference ET estimation under hyper-arid conditions.

2.4. Statistical Evaluation

The performance of empirical models for estimating evapotranspiration (ET) is typically assessed using a range of statistical error metrics. However, no single metric can comprehensively capture all aspects of model performance. Therefore, it is essential to consider multiple evaluation criteria to gain a more holistic understanding of model accuracy and reliability. During the current work, the performance of hourly and daily reference ET values obtained from the models studied was statistically evaluated. Firstly, to compare groups, the data were examined for their normality using the Anderson–Darling, D’Agostino and Pearson, Shapiro–Wilk, and Kolmogorov–Smirnov tests to determine whether to use a parametric or non-parametric test [43]. According to the following hypothesis,

H₀.

The data follow a normal distribution.

H₁.

The data does not follow a normal distribution.

The hypothesis of normal distribution was tested because many statistical analyses assume that the data (or residuals) are normally distributed to ensure the validity and interpretability of the results [44,45]. A low p-value (typically below 0.05) leads to rejecting the null hypothesis, indicating the data likely deviates from normality. Then the nonparametric Wilcoxon test will be used according to the following hypothesis: “Wilcoxon test [46] According to the following hypothesis:

H₂.

The median of the differences between paired observations is zero.

H₃.

The median of the differences between paired observations is not zero.

Otherwise, the parametric test “t-Test [47]” will be used according to the following hypothesis:

H₄.

No difference between the means of the paired samples.

H₅.

There is a difference between the means.

In addition, the two-parameter statistical test was performed to evaluate the performance of each model depending on multiple temporal resolutions: hourly vs. hourly averaged over 24 h and daily vs. daily as the sum of 24 h values. Commonly used metrics include correlation coefficient (r) [48], Nash–Sutcliffe efficiency (NSE) [49], root mean square error (RMSE) [50], mean absolute error (MAE) [51], average bias (b) [52], index of agreement (d) [53], and confidence index (c) [54], each offering distinct insights into model behavior [55,56]. For instance, while RMSE emphasizes larger errors, MAE provides a more balanced view of average deviations. Moreover, the NSE is used to see how well the distribution of data (scatterplot) from observations and models fits the 1:1 line, where NSE = 1 indicates a perfect value from the comparison, NSE = 0 indicates the prediction model has the same accuracy as the average value of the observations, whereas negative NSE states that the model is unacceptable [49]. Using multiple metrics allows for a more nuanced evaluation, particularly when comparing models across different climatic or geographical contexts [55]. The mathematical formula of the evaluation criteria is obtained in Table 2.

The concordance index (Table 3) is a statistical measure used to evaluate the agreement between observed and estimated reference ET values, particularly in terms of their distribution relative to the 1:1 line. To further assess the reliability of each estimation method, the confidence index (c) proposed by de Camargo and Sentelhas [57] is employed. This index combines the Pearson correlation coefficient (r) and Willmott’s index of agreement (d). This composite metric provides a more comprehensive evaluation by accounting for both the strength of the linear relationship and the degree of agreement between observed and predicted values.

3. Results and Discussion

3.1. Climatic Parameters

The annual climatology of the Riyadh province, when viewed through the four principal drivers of reference evapotranspiration (ET) (Figure 2), reveals a picture both instructive and exacting for any Penman–Monteith implementation in hyper-arid regions. Firstly, air temperature in Riyadh exhibits one of the steepest seasonal amplitudes on record: daily means rise from approximately 12 °C in mid-winter to peaks approaching 40 °C in mid-summer. Such extremes inexorably elevate saturation vapor pressure deficit (SVPD), imposing the highest ET demand precisely when water availability is most precious. Secondly, relative humidity, which oscillates between modest winter maxima of 50–60% and precipitous summer minima below 15%, increases the plant’s need for water. The convergence of high temperature and low humidity, therefore, underpins the very apex of the highest ETₒ values during June, July, and August.

Thirdly, wind speed, though only moderate (2–5 m/s), plays a non-negligible role in sustaining the turbulent transfer of moisture from the canopy layer into the free atmosphere. The spring months, marked by slightly more variable and gusty breezes, engender transient surges in ET that would be entirely overlooked by coarse daily averaging. Finally, incoming solar radiation, which attains daily totals of more than 25 MJ /m² in the clear-sky summer months, supplies the latent heat necessary for phase change. Its annual profile closely parallels that of temperature, reinforcing the correlation between radiative forcing and ET output. Taken together, these four drivers produce an ET regime that is both temporally punctuated and climatically extreme. The hourly timestep approach thus emerges as indispensable: only by resolving diurnal peaks and troughs can one accurately capture the true evapotranspiration load—and, by extension, the water requirements—of any reference surface in this setting.

3.2. Hourly Reference ET Investigation

Figure 3 presents a comprehensive analysis of hourly reference evapotranspiration (ET) comparisons based on average monthly data. The findings juxtapose ET estimates derived from three widely adopted methodologies: the ASCE-standardized Penman–Monteith (P–M ASCE), the FAO Penman–Monteith (P–M FAO), and the regionally calibrated Penman–Monteith KSA formulation. The comparison is stratified by two plant height conditions—50 cm and 12 cm—to reflect different aerodynamic and surface resistance regimes. The first set of comparisons examines ASCE versus KSA reference ET estimates at a plant height of 50 cm, while the second set contrasts FAO against KSA values at 12 cm.

These plant heights were chosen to reflect typical reference vegetation for tall and short plant surfaces, respectively, and the analysis provides insight into both methodological and biophysical influences on evapotranspiration dynamics. The temporal distribution of reference ET across the diurnal cycle reveals clear and systematic trends. For the 50 cm canopy height, the KSA method consistently produces slightly higher hourly reference ET values than the ASCE formulation, particularly during midday hours, when solar radiation and temperature reach their maxima. The observed divergence is most prominent during the summer months (April to August), a period marked by elevated temperatures, intense solar radiation, and low humidity—conditions that amplify the aerodynamic and radiative components of the P–M equation. Conversely, the FAO estimates at 12 cm yield reference ET values that are marginally lower than their KSA counterparts, although the magnitude of divergence is somewhat smaller than in the ASCE comparison. This difference reflects the enhanced aerodynamic resistance associated with shorter vegetation, which modulates the transfer of water vapor from the canopy surface to the atmosphere.

Quantitative metrics derived from the twelve-month hourly data substantiate these observations. The bias between ASCE and KSA (h = 50 cm) remains negative across all months, typically between –0.003 and –0.007 mm h⁻¹, indicating that the KSA model slightly overestimates reference ET relative to ASCE under identical input conditions. The corresponding RMSE values range from 0.003 to 0.007 mm h⁻¹, with maximum error occurring during the summer peak, further reinforcing the sensitivity of model divergence to climatic extremity. For the FAO and KSA comparison at 12 cm, the mean monthly bias is even smaller (–0.002 mm h⁻¹), and RMSE rarely exceeds 0.003 mm h⁻¹. These values represent a small fraction—typically under 5%—of the absolute hourly reference ET magnitude, indicating a high degree of concordance across models. However, it is worth emphasizing that while these absolute differences may appear numerically minor, their diurnal concentration during periods of peak evaporative demand can lead to non-trivial cumulative errors in daily and seasonal water budgeting. In practical terms, for irrigation scheduling and plant water requirement assessments, even small hourly inaccuracies can propagate into meaningful discrepancies when extrapolated over time. The seasonal modulation of model agreement is also notable. During cooler months (November to February), the three formulations converge closely, with minimal model-to-model deviation. This convergence is expected given the reduced vapor pressure deficits, lower aerodynamic activity, and more stable radiative regimes. In contrast, from late spring through early autumn, the intensification of atmospheric demand reveals the distinct assumptions embedded in each model, most notably in the KSA formulation, which has been explicitly tuned to account for the arid conditions of Saudi Arabia.

From a methodological standpoint, this study provides compelling empirical validation of the KSA variant of the Penman–Monteith model under high-resolution (hourly) meteorological forcing. That the KSA method maintains strong concordance with both ASCE and FAO standards across all months—and especially under severe arid stress—speaks to its robustness and adaptability for regional application.

3.2.1. Hourly ET_r of ASCE Model (h_plt = 50 cm)

The comparative analysis of average hourly reference evapotranspiration (ET) versus daily reference ET divided by 24 h reveals model-specific biases and distributional characteristics across four standardized methodologies: ASCE, KSA at h_plt = 50 cm, FAO, and KSA at h_plt = 12 cm.

Figure 4 demonstrates a strong linear correlation (R² > 90%) between average hourly and daily/24 ET_r. However, systematic deviations occur during high-evaporative periods (June, July, and August), where daily/24 values underestimate observed hourly ET_r by 4–12%. Table 4 confirms non-normal distributions for both hourly and daily/24 ET_r, with all normality tests rejecting the null hypothesis. This non-normality invalidates parametric statistical assumptions. Hence, the Wilcoxon test was applied.

As depicted in Figure 5a, the Wilcoxon signed-rank test indicated a statistically significant difference (p < 0.0001), confirming that the two methods are not identically distributed. However, the high Pearson correlation coefficient (r = 0.97) and coefficient of determination (R² = 94.75%) demonstrate a strong linear relationship between the two datasets (Figure 5b). Moreover, the descriptive statistics show that the average hourly ET_r (mean = 0.457 mm/h) is slightly higher than the daily/24 estimate (mean = 0.426 mm/h), with both methods exhibiting similar variability. The negative bias (b = −0.0312 mm/h) confirms that daily/24 values tend to underestimate ET_r relative to the hourly computation. Despite this, the Nash–Sutcliffe Efficiency (NSE = 0.92) confirms strong predictive performance. In the same context, the error metrics, including RMSE (0.06 mm/h) and MAE (0.05 mm/h), are low, indicating high accuracy. The Index of Agreement (d = 0.98) and the Confidence Index (c = 0.95) further support the reliability and consistency of the daily/24 method, where c is classified as excellent according to Table 3.

3.2.2. Hourly ET_ref of KSA Model (h_plt = 50 cm)

The scatterplot (Figure 6) indicates weaker agreement (R² = 81%) compared with ASCE, with daily/24 underestimating hourly ET_ref by 7–15% during spring midday peaks. Greater variability in hourly ET_ref suggests that aerodynamic components for taller plants amplify diurnal fluctuations, which daily approximations fail to capture. Normality tests parallel ASCE, rejecting normality for both datasets (all p < 0.0001), Table 5. Hence, the non-parametric Wilcoxon test was applied.

At h_plt = 50 cm, the Wilcoxon signed-rank test confirmed a statistically significant difference (p < 0.0001) (Figure 7a). This non-parametric test confirms that the two paired data sets are not identically distributed, with the hourly method consistently producing slightly higher ET_ref values (mean = 0.468 mm/h) compared with the daily/24 method (mean = 0.433 mm/h). This systematic difference is further supported by the observed negative bias (b = −0.04), suggesting that the daily/24 approximation tends to underestimate ET_ref relative to the hourly method. Despite this bias, the high Pearson correlation coefficient (r = 0.97) and coefficient of determination (R² = 94.87%) demonstrate a strong linear relationship between the two datasets (Figure 7b), indicating that they track ET_ref dynamics similarly over time.

The model performance statistics reinforce this observation: the Nash–Sutcliffe Efficiency (NSE = 0.92) confirms strong predictive performance, while the RMSE (0.06 mm/h) and MAE (0.05 mm/h) indicate high accuracy. Furthermore, the Index of Agreement (d = 0.98) and Confidence Index (c = 0.95, excellent class) reflect strong agreement and reliability between the two estimation approaches.

3.2.3. Hourly ET_o of FAO Model (h_plt = 12 cm)

Figure 8 shows the strongest correlation (R² = 92%) among models. Despite this, because of Table 6, normality tests uniformly reject normality (i.e., Anderson–Darling p < 0.0001). Low-evaporation months (December, January, and February) exhibit 8–10% overestimation by daily/24, attributed to reduced nighttime atmospheric demand. This model demonstrates the least bias, though temporal mismatches persist.

The comparison between ET_o computed as average hourly values and as daily/24 h revealed statistically and practically significant differences. The Wilcoxon signed-rank test confirmed a statistically significant difference (p < 0.0001) as depicted in Figure 9a, indicating that the two methods are not identically distributed. Despite this, the Pearson correlation coefficient (r = 0.96) and coefficient of determination (R² = 92.41%) demonstrate a strong linear relationship between the two datasets, as supported by a 1:1 plot (Figure 9b).

Descriptive statistics show that the hourly ET_ref (mean = 0.307 mm/h) is slightly higher than the daily/24 estimate (mean = 0.282 mm/h), with both methods exhibiting similar variability. In addition, the negative bias (b = −0.03 mm/h) indicates systematic underestimation by the daily/24 method. Model performance statistics further support the consistency: the Nash–Sutcliffe efficiency (NSE = 0.87) and index of agreement (d = 0.96) suggest that the hourly method captures the variability of ET_o much more reliably. Error magnitudes remain moderate, while the RMSE (0.05 mm/h) and MAE (0.04 mm/h) indicate high accuracy. Furthermore, the Confidence Index (c = 0.93) reflects strong agreement and reliability between the two estimation approaches (excellent class).

3.2.4. Hourly ET_ref of KSA Model (h_plt = 12 cm)

For shorter canopies, daily/24 consistently underestimates hourly ET_ref by 9–18% during summer (Figure 10). Normality tests confirm non-normal distributions (all p < 0.05), Table 7. The pronounced bias at reduced height highlights canopy-structure influences on diurnal ET partitioning, which daily downscaling inadequately represents. For how sensitive the model is, factors like aerodynamic and plant-height parameters (e.g., KSA₅₀ vs. KSA₁₂) significantly affect downscaling accuracy, with taller canopies exhibiting greater hourly variability.

The Wilcoxon signed-rank test (Figure 11a) confirmed a statistically significant difference (p < 0.0001), indicating a consistent difference across paired observations. Descriptive statistics show that the hourly ET_ref values (mean = 0.312 mm/h) are systematically higher than those derived from daily/24 approximations (mean = 0.283 mm/h), supported by a negative average bias (b = −0.03 mm/h). This underlines the daily/24 method’s tendency to underestimate actual ET_ref values due to temporal aggregation. Nonetheless, the Pearson correlation coefficient (r = 0.96) and coefficient of determination (R² = 92.44%) demonstrate a strong linear relationship between the two datasets as plotted in Figure 11b. This strong linearity is further supported by the high index of agreement (d = 0.96) and a confidence index (c = 0.92), both of which suggest high consistency between the two approaches. The Nash–Sutcliffe Efficiency (NSE = 0.86) confirms good predictive performance, while the RMSE (0.05 mm/h) and MAE (0.04 mm/h) indicate high accuracy.

Figure 12 depicts a visual comparative bar chart illustrating the statistical performance metrics for the four reference ET mathematical models—ASCE, KSA at 50 cm, FAO, and KSA at 12 cm—based on paired hourly ET data. These results highlight that although both estimation approaches are closely aligned in temporal trends. While daily/24 estimates are computationally convenient, they may mask diurnal ET variability and lead to underestimation, especially under fluctuating meteorological conditions. The significant Wilcoxon result also suggests that averaging across the diurnal cycle introduces aggregation bias not captured in the daily/24 approximation. The small but consistent underestimation associated with the daily/24 method may be attributed to the loss of intra-daily variability, particularly under high temporal resolution meteorological conditions. Therefore, for applications requiring precise reference ET—such as irrigation scheduling or hydrological modeling—the hourly method is preferable due to its higher fidelity. On the other hand, the daily/24 method provides a reliable approximation of average hourly ET and is suitable for use in operational and research contexts where direct hourly data are unavailable.

3.3. Daily Reference ET Investigation

3.3.1. Daily ET_r of ASCE Model (h_plt = 50 cm)

This study evaluates the agreement between conventional daily reference evapotranspiration (ET) and the sum of 24-hourly ET across four standardized models: ASCE, KSA at plant height = 50 cm, FAO, and KSA at plant height = 12 cm. The analysis leverages full-year datasets (365 days) for each model, with statistical validation via normality tests and graphical comparisons.

Figure 13 illustrates a strong linear correlation (R² > 90%) between daily ET_r and 24 h cumulative ET_r. However, systematic deviations emerge during high-evaporative periods (e.g., June, July, and August), where daily values underestimate the 24 h sum by 4–12%. Table 8 confirms non-normality for both datasets (all p < 0.0001). This indicates that parametric statistical methods are invalid for temporal scaling.

The Wilcoxon signed-rank test revealed a statistically significant difference between the sum-24 h ET_r and daily ET_r (p < 0.0001), indicating systematic and non-random deviations between paired observations (Figure 14a). The sum-24 h ET_r consistently exceeded the daily-calculated values, with respective means of 10.976 mm/d and 10.227 mm/d, with both methods exhibiting comparable variability (standard deviations of 5.07 and 4.58 mm/d, respectively). The minimum and median values also suggest that the daily method tends to slightly underestimate ET_r, which is further supported by the negative average bias (b = −0.75 mm/d). Despite this difference, the two approaches show a strong linear relationship, as evidenced by the Pearson correlation coefficient (r = 0.97) and the coefficient of determination (R² = 94.75%), which indicates a strong linear relationship between the two datasets (Figure 14b).

Performance statistics further validate this consistency: The Nash–Sutcliffe Efficiency (NSE = 0.92) confirms that the sum-24 h method provides a reliable approximation of the daily ET_r. Error metrics such as RMSE (1.43 mm/d) and MAE (1.13 mm/d) are within acceptable limits for hydrological modeling, reinforcing the practical interchangeability of the two approaches under most conditions. Furthermore, the Index of Agreement (d = 0.98) and Confidence Index (c = 0.95) reflect a high degree of concordance and reliability between the methods. These findings suggest that while statistically distinguishable, the daily ET_r estimates are sufficiently accurate for operational and research applications, particularly when hourly data are unavailable.

3.3.2. Daily ET_ref of KSA Model (h_plt = 50 cm)

The scatterplot (Figure 15) reveals moderate agreement (R² = 81%), with daily ET_ref underestimating the 24 h sum by 7–15% during spring midday peaks. Normality tests uniformly reject normality (all p < 0.0001), mirroring ASCE’s distribution characteristics, Table 9. The heightened variability suggests aerodynamic influences at taller plant heights amplify diurnal flux discrepancies.

A statistical comparison was conducted between sum-24 h ET_ref values and those computed directly as daily ET_ref values. The Wilcoxon signed-rank test (Figure 16a) indicated a statistically significant difference (p < 0.0001), suggesting that the two methods are not identically distributed. However, the strength of their relationship is underscored by a Pearson correlation coefficient of r = 0.97 and a coefficient of determination (R²) of 94.87%, indicating a strong linear association (Figure 16b).

Descriptive statistics reveal that the sum-24 h ET_ref has a slightly higher mean (11.237 mm/d) compared with the daily ET_ref (10.402 mm/d), with both methods exhibiting similar variability (standard deviations of 5.23 and 4.68 mm/d, respectively). The average bias b = −0.84 mm/d confirms a consistent underestimation by the daily method relative to the sum-24 h values. In addition, model performance metrics further support the reliability of the daily ET_ref estimates. The Nash–Sutcliffe Efficiency (NSE = 0.92) indicates strong predictive accuracy. However, the RMSE (1.5 mm/d) and MAE (1.19 mm/d) indicate that the underestimation by the daily method is non-negligible, especially in water-sensitive applications such as precision irrigation or plant modeling. The Index of Agreement (d = 0.98) and the Confidence Index (c = 0.95) reflect high concordance and model reliability, where c is classified as excellent.

3.3.3. Daily ET_o of FAO Model (h_plt = 12 cm)

Figure 17 demonstrates the strongest correlation (R² = 92%) among models. Despite this, normality tests reject both distributions (Anderson–Darling p < 0.0001), Table 10. Low-evaporation months (December, January, and February) exhibit 8–10% overestimation by daily ET_o, attributed to reduced nighttime atmospheric demand. This model shows the least absolute bias but retains seasonally dependent errors.

The comparison between daily ET_o values calculated using either the sum-24 h estimates or as a daily computation at a reference plant height of 12 cm demonstrates a statistically significant discrepancy. The Wilcoxon signed-rank test yielded a p-value < 0.0001 (Figure 18a), indicating a systematic and non-random difference between the two methods.

The sum-24 h ET_o (mean = 7.368 mm/d) consistently exceeded the daily method (mean = 6.761 mm/d), with both methods exhibiting similar variability (standard deviations of 3.16 and 2.68 mm/d, respectively). The negative average bias of −0.61 mm/d confirms a tendency for underestimation by the daily approach. Despite this deviation, the two methods exhibited strong agreement in their temporal patterns, as shown by a high Pearson correlation coefficient (r = 0.96) and R² = 92.41% (Figure 18b). Additional performance metrics—Nash–Sutcliffe efficiency (NSE = 0.87), index of agreement (d = 0.96), and confidence index (c = 0.93)—further support the reliability and coherence of the sum-24 h method in replicating the daily ET_o dynamics. While the RMSE (1.12 mm/d) and MAE (0.85 mm/d) are moderate, they are relevant in applications requiring high precision, such as deficit irrigation or water balance assessments.

3.3.4. Daily ET_ref of KSA Model (h_plt = 12 cm)

For shorter canopies, daily ET_ref consistently underestimates the sum of 24 h by 9–18% during summer (Figure 19). Table 11 mentions the normality tests that confirm non-normality (all p < 0.0001). The pronounced bias at reduced plant height underscores the role of canopy structure in diurnal ET partitioning, which daily approximations fail to capture. For how sensitive the model is, taller canopies (KSA@ 50 cm) exhibit greater variability than shorter ones (KSA@ 12 cm). FAO’s robustness stems from its global calibration, while KSA variants show height-dependent discrepancies.

A comparative analysis was conducted between daily ET_ref values derived from the sum of 24 h estimates and those computed directly as daily values at a h_plt of 12 cm. The Wilcoxon signed-rank test (Figure 20a) confirmed a statistically significant difference (p < 0.0001), indicating that the two methods are not identically distributed. However, the high Pearson correlation coefficient (r = 0.96) and coefficient of determination (R² = 92.44%) demonstrate a strong linear relationship between the two datasets (Figure 20b). Moreover, descriptive statistics reveal that the sum-24 h ET_ref has a slightly higher mean (7.48 mm/d) compared with the daily ET_ref (6.78 mm/d), with both methods showing similar variability (standard deviations of 3.22 and 2.70 mm/d, respectively).

The average bias of −0.67 mm/d indicates a consistent underestimation by the daily method. In the same context, model performance metrics further support the reliability of the daily ET_ref estimates. The Nash–Sutcliffe Efficiency (NSE = 0.86) suggests good predictive accuracy, while the Root Mean Square Error (RMSE = 1.19 mm/d) and Mean Absolute Error (MAE = 0.91 mm/d) are within acceptable limits. The Index of Agreement (d = 0.96) and Confidence Index (c = 0.92) confirm strong agreement and model reliability, where c could be classified as excellent.

Figure 21 depicts a visual comparative bar chart illustrating the statistical performance metrics for the four reference ET mathematical models—ASCE, KSA at 50 cm, FAO, and KSA at 12 cm—based on paired daily ET estimates. These results confirm that the direct daily method, although computationally simpler, tends to smooth over intra-daily climatic variability, leading to underestimation, especially under arid or variable weather conditions common in the KSA region. These results confirm that while the two methods are statistically distinguishable, the daily reference ET estimates are sufficiently accurate and consistent for practical applications, especially when hourly data is unavailable. Consequently, the use of a sum-24 h reference ET is recommended for improved accuracy in hydrological modeling and irrigation planning.

Generally, using conventional daily reference ET as a proxy for sum-24 h ET introduces quantifiable, model-specific biases exacerbated by non-normal distributions. The ASCE and FAO models show the strongest correlations, but errors remain significant during seasonal extremes. For precise water resource management, direct hourly measurement is recommended over daily approximations. Future work should develop plant-height-specific correction factors to minimize scaling errors.

Practically, the underestimation of actual ET can lead to several important consequences, particularly in precision irrigation and water resource management. One major impact is under-irrigation, where underestimated plant water requirements may result in water stress, reduced yield, and diminished plant quality. Additionally, it can lead to inefficient water budgeting, causing supply deficits, especially in arid and semi-arid regions. The bias introduced in irrigation scheduling models may reduce soil moisture availability and negatively affect plant growth, particularly during peak ET periods. Furthermore, it may introduce a distortion in climate impact assessments, hindering accurate evaluation of changing water demands under climate variability. Lastly, it can create overconfidence in apparent water savings, falsely suggesting improvements in system efficiency or conservation outcomes.

3.4. Ratio of P–M Mathematical Models

Figure 22 depicts the hourly dataset that provided a granular view of the comparative performance of the evapotranspiration models, revealing distinct patterns in the ratios of ASCE/FAO and KSA₅₀/ KSA₁₂. The overall mean ASCE/FAO ratio for hourly data across the entire year was approximately 1.472. This indicates that, on average, the ASCE-PM model estimated hourly ET rates that were approximately 47.2% higher than those estimated by the FAO PM model. A detailed examination of the hourly data revealed that the ASCE/FAO ratios exhibited a considerable range, from a minimum of approximately 1.248 (observed on Day 353, 19 December 2024) to a maximum of 1.674 (recorded on Day 70, 11 March 2024). This wide range indicates significant temporal variability in the relative performance of these two models throughout the year. The consistent ratio significantly greater than 1.0 throughout the year indicates a systematic bias, where the ASCE-PM model consistently estimated higher hourly evapotranspiration rates compared with the FAO PM model.

The overall mean KSA₅₀/ KSA₁₂ ratio for hourly data was approximately 1.483. This implies that the KSA model for a 50 cm plant height estimated hourly ET rates were, on average, 48.3% higher than those for a 12 cm plant height. The hourly ratios for KSA₅₀/ KSA₁₂ also demonstrated substantial variability, ranging from a minimum of approximately 1.257 (observed on Day 353, 19 December 2024) to a maximum of 1.687 (recorded on Day 70, 11 March 2024). This range is notably like that observed for the ASCE/FAO ratio.

The persistent ratio exceeding 1.0 reinforces the finding that the KSA model’s output is highly sensitive to the specified plant height, with taller canopies yielding higher estimated ET. As depicted in Figure 22, the hourly ratios for both pairs of models exhibited clear temporal variability, suggesting that the discrepancies are not static but influenced by changing environmental conditions or seasonal patterns. The study demonstrated that, under arid climatic conditions, the hourly ratio of ASCE to FAO and KSA₅₀ to KSA₁₂ reached 1.48—exceeding the commonly cited ASCE reference ratio of 1.33 reported by Allen, Pereira [26].

Figure 23 depicts the daily dataset that provided a broader temporal perspective on the comparative model performance, generally mirroring the trends observed in the hourly data but with potentially different magnitudes of variability due to aggregation effects. The overall mean ASCE/FAO ratio for daily data was approximately 1.483. This average is remarkably close to the hourly mean, suggesting a consistent systemic difference between ASCE-PM and FAO PM regardless of temporal aggregation. The daily data showed the ASCE/FAO ratios ranging from a minimum of approximately 1.165 (observed on Day 353, 19 December 2024) to a maximum of 1.678 (recorded on Day 151, 31 May 2024). This range is also substantial, indicating daily fluctuations in relative model performance. The consistent daily ratio significantly above 1.0 confirms the systematic tendency of the ASCE-PM model to estimate higher daily evapotranspiration rates compared with the FAO PM model.

The overall mean KSA₅₀/ KSA₁₂ ratio for daily data was approximately 1.503. This average is slightly higher than the hourly mean for the same ratio, indicating that the KSA model for a 50 cm plant height consistently estimated daily ET rates that were approximately 50.3% higher than those for a 12 cm plant height. The daily ratios for KSA₅₀/ KSA₁₂ ranged from a minimum of approximately 1.175 (observed on Day 353, 19 December 2024) to a maximum of 1.703 (recorded on Day 151, 31 May 2024). This range is consistent with the hourly observations, reaffirming the strong influence of plant height within the KSA framework. The sustained ratio greater than 1.0 reinforces the direct relationship between plant height and estimated ET within the KSA models. As illustrated in Figure 23, the daily ratios also exhibited temporal fluctuations. The study revealed that, under arid climatic conditions, the daily ratio of ASCE to FAO and KSA₅₀ to KSA₁₂ reached 1.49, surpassing the commonly referenced ASCE ratio of 1.33, as reported by Fitriani, Bowo [56].

3.5. Study Limitations

This study is subject to several limitations. The study excludes field reference ET data obtained directly through methods such as lysimeters or soil water balance. This comparison is restricted to three mathematical formulations of the Penman–Monteith (P–M) equation: the ASCE, FAO, and KSA forms. The results illustrate the distinctions among these models rather than their accuracy in representing actual ET rates. All three models examined in this study are variations of the indirect Penman–Monteith method, indicating the presence of inherent uncertainties in the estimates. The data collected over five years was recorded every minute; however, the analysis focused solely on weather data from a single weather station in Riyadh, Saudi Arabia, characterized by its extremely dry climate. The results may not be generalizable to other locations or climates. The spatial limitation reduces the applicability of the study’s conclusions in other regions, particularly where hydrometeorological conditions differ significantly. It is essential to acknowledge that mathematical models such as Penman–Monteith possess specific data requirements. These models require extensive and precise weather data, which may be difficult to obtain in certain regions. Further studies may focus on expanding to additional arid and/or humid regions, integrating remote sensing data, or developing model-specific correction algorithms to mitigate biases, particularly for various plant heights.

4. Conclusions

The analysis of climatic parameters in this study was based on a high-resolution meteorological dataset obtained from the weather station that provided minutely records of key atmospheric variables, including solar radiation (RS), relative humidity (RH), wind speed at 2 m (U₂), air temperature (T), and rainfall, spanning a full calendar year. Reference ET was computed using three universal mathematical models—ASCE, FAO at 50 and 12 cm reference plant, and KSA. Calculations were performed at multiple temporal resolutions—hourly, hourly averaged over 24 h, daily, and daily as the sum of 24-hourly values—to assess the sensitivity and consistency of reference ET estimates across different time scales. For hourly ET investigation, the findings emphasize that although daily/24 calculations offer operational simplicity, they may suppress diurnal variability and lead to consistent underestimation, particularly under fluctuating climatic conditions. Consequently, for precision applications such as real-time irrigation scheduling or high-resolution hydrological modeling, the use of hourly-based ET is recommended for improved accuracy. For daily ET investigation, the results confirm that the direct daily method, although computationally simpler, tends to smooth over intra-daily climatic variability, leading to underestimation, especially under arid or variable weather conditions common in the KSA region. Consequently, the use of sum-24 h reference ET is recommended for improved accuracy in hydrological modeling and irrigation planning. For practical applications, the daily reference ET estimates are sufficiently accurate and consistent, especially when hourly data are unavailable.

This study offers several contributions to the field of reference ET modeling. The study initially compares the P–M ASCE, P–M FAO, and P–M KSA formulations utilizing hourly meteorological data in hyper-arid conditions. This identifies transient atmospheric variations often overlooked in analyses focused on daily data. Secondly, it encompasses a systematic assessment of the model’s sensitivity to h_plt, a parameter frequently established in conventional studies. This enables the execution of more realistic simulations across a broader range of diverse agricultural applications. This study examines the accuracy of daily reference ET derived from hourly reference ET data. This provides insights into the accumulation of errors over time and their impact on the three models. These contributions address a significant gap in understanding by validating reference ET models in arid climates through a high-resolution temporal framework and contextualized agronomic parameters. This will enhance irrigation scheduling and water resource planning in arid and hyper-arid regions such as Saudi Arabia. Moreover, it contributes to the growing body of literature emphasizing the importance of high-frequency meteorological data for improving ET estimation accuracy in arid and semi-arid regions.

Author Contributions

Conceptualization, A.A.A., F.R., M.E. and N.A.; Data curation, A.A.A., F.R., M.E. and N.A.; Formal analysis, F.R., M.E. and N.A.; Investigation, F.R. and M.E.; Methodology, A.A.A., F.R., M.E. and N.A.; Project administration, A.A.A.; Software, F.R., M.E. and N.A.; Supervision, A.A.A.; Validation, A.A.A., F.R., M.E. and N.A.; Visualization, F.R., M.E. and N.A.; Writing—original draft, F.R. and M.E.; Writing—review and editing, A.A.A., M.A.M., A.E.-S., F.R., M.E. and N.A. All authors have read and agreed to the published version of the manuscript.

Funding

This project was funded by the National Plan for Science, Technology and Innovation (MAARIFAH), King Abdulaziz City for Science and Technology, Kingdom of Saudi Arabia, award number (WAT1152) and the APC was funded by (MAARIFAH).

Data Availability Statement

The data can be provided by the appropriate author upon reasonable request.

Acknowledgments

This project was funded by the National Plan for Science, Technology and Innovation (MAARIFAH), King Abdulaziz City for Science and Technology, Kingdom of Saudi Arabia, Award Number (WAT1152).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

ET	Evapotranspiration
P–M	Penman–Monteith
ASCE	American Association of Civil Engineering
FAO	Food and Agriculture Organization
KSA	Kingdom of Saudi Arabia
ET_r	Reference ET for ASCE model
ET_o	Reference ET for FAO model
ET_ref	Reference ET for KSA model
h_plt	Reference plant height
T	Temperature
RS	Solar radiation
RH	Relative humidity
U₂	Wind speed
r	Correlation coefficient
R²	Coefficient of determination
NSE	Nash–Sutcliffe efficiency
RMSE	Root mean square error
MAE	Mean absolute error
b	Average bias
d	Index of agreement
c	Confidence index
p	p-Value

References

Abdella, F.I.; El-Sofany, W.I.; Mansour, D. Water scarcity in the Kingdom of Saudi Arabia. Environ. Sci. Pollut. Res. 2024, 31, 27554–27565. [Google Scholar] [CrossRef]
Al-Zahrani, K.H.; Baig, M.B. Water in the Kingdom of Saudi Arabia: Sustainable management options. J. Anim. Plant Sci. 2011, 21, 601–604. [Google Scholar]
Chowdhury, S.; Al-Zahrani, M. Implications of climate change on water resources in Saudi Arabia. Arab. J. Sci. Eng. 2013, 38, 1959–1971. [Google Scholar] [CrossRef]
Mishra, B.K.; Kumar, P.; Saraswat, C.; Chakraborty, S.; Gautam, A. Water security in a changing environment: Concept, challenges and solutions. Water 2021, 13, 490. [Google Scholar] [CrossRef]
El Kenawy, A.M. Hydroclimatic extremes in arid and semi-arid regions: Status, challenges, and future outlook. In Hydroclimatic Extremes in the Middle East and North Africa; Elsevier: Amsterdam, The Netherlands, 2024; pp. 1–22. [Google Scholar]
Ezzeldin, M.; Alazba, A.A.; Alrdyan, N.; Radwan, F. Rationalizing Irrigation Water Consumption in Arid Climates Based on Multicomponent Landscape Coefficient Approach. Earth Syst. Environ. 2025, 9, 277–298. [Google Scholar] [CrossRef]
Allen, R.G. Crop evapotranspiration. FAO Irrig. Drain. Pap. 1998, 56, 60–64. [Google Scholar]
Jin, X.; Schaepman, M.E.; Clevers, J.G.P.W.; Bob Su, Z. Impact and consequences of evapotranspiration changes on water resources availability in the arid Zhangye Basin, China. Int. J. Remote Sens. 2009, 30, 3223–3238. [Google Scholar] [CrossRef]
Wanniarachchi, S.; Sarukkalige, R. A review on evapotranspiration estimation in agricultural water management: Past, present, and future. Hydrology 2022, 9, 123. [Google Scholar] [CrossRef]
Lu, Z.; Zhao, Y.; Wei, Y.; Feng, Q.; Xie, J. Differences among evapotranspiration products affect water resources and ecosystem management in an Australian catchment. Remote Sens. 2019, 11, 958. [Google Scholar] [CrossRef]
Pereira, L.S.; Allen, R.G.; Smith, M.; Raes, D. Crop evapotranspiration estimation with FAO56: Past and future. Agric. Water Manag. 2015, 147, 4–20. [Google Scholar] [CrossRef]
Aly, M.S.; Darwish, S.M.; Aly, A.A. High performance machine learning approach for reference evapotranspiration estimation. Stoch. Environ. Res. Risk Assess. 2024, 38, 689–713. [Google Scholar] [CrossRef]
Madugundu, R.; Al-Gaadi, K.A.; Tola, E.; El-Hendawy, S.; Marey, S.A. Mapping of Evapotranspiration and Determination of the Water Footprint of a Potato Crop Grown in Hyper-Arid Regions in Saudi Arabia. Sustainability 2023, 15, 12201. [Google Scholar] [CrossRef]
Talebi, H.; Samadianfard, S.; Valizadeh Kamran, K. Estimation of daily reference evapotranspiration implementing satellite image data and strategy of ensemble optimization algorithm of stochastic gradient descent with multilayer perceptron. Environ. Dev. Sustain. 2023, 27, 3707–3729. [Google Scholar] [CrossRef]
Acharki, S.; Raza, A.; Vishwakarma, D.K.; Amharref, M.; Bernoussi, A.S.; Singh, S.K.; Al-Ansari, N.; Dewidar, A.Z.; Al-Othman, A.A.; Mattar, M.A. Comparative assessment of empirical and hybrid machine learning models for estimating daily reference evapotranspiration in sub-humid and semi-arid climates. Sci. Rep. 2025, 15, 2542. [Google Scholar] [CrossRef] [PubMed]
Subedi, A.; Chávez, J.L. Crop evapotranspiration (ET) estimation models: A review and discussion of the applicability and limitations of ET methods. Agric. Sci. 2015, 7, 50. [Google Scholar] [CrossRef]
Djaman, K.; Balde, A.B.; Sow, A.; Muller, B.; Irmak, S.; N’Diaye, M.K.; Manneh, B.; Moukoumbi, Y.D.; Futakuchi, K.; Saito, K. Evaluation of sixteen reference evapotranspiration methods under sahelian conditions in the Senegal River Valley. J. Hydrol. Reg. Stud. 2015, 3, 139–159. [Google Scholar] [CrossRef]
Hargreaves, G.H.; Samani, Z.A. Reference crop evapotranspiration from temperature. Appl. Eng. Agric. 1985, 1, 96–99. [Google Scholar] [CrossRef]
Raziei, T.; Pereira, L.S. Estimation of ETo with Hargreaves–Samani and FAO-PM temperature methods for a wide range of climates in Iran. Agric. Water Manag. 2013, 121, 1–18. [Google Scholar] [CrossRef]
Tabari, H.; Grismer, M.E.; Trajkovic, S. Comparative analysis of 31 reference evapotranspiration methods under humid conditions. Irrig. Sci. 2013, 31, 107–117. [Google Scholar] [CrossRef]
Priestley, C.H.B.; Taylor, R.J. On the assessment of surface heat flux and evaporation using large-scale parameters. Mon. Weather Rev. 1972, 100, 81–92. [Google Scholar] [CrossRef]
McAneney, K.; Itier, B. Operational limits to the Priestley-Taylor formula. Irrig. Sci. 1996, 17, 37–43. [Google Scholar] [CrossRef]
Satpathi, A.; Danodia, A.; Abed, S.A.; Nain, A.S.; Al-Ansari, N.; Ranjan, R.; Vishwakarma, D.K.; Gacem, A.; Mansour, L.; Yadav, K.K. Estimation of the crop evapotranspiration for Udham Singh Nagar district using modified Priestley-Taylor model and Landsat imagery. Sci. Rep. 2024, 14, 21463. [Google Scholar] [CrossRef] [PubMed]
Vishwakarma, D.K.; Pandey, K.; Kaur, A.; Kushwaha, N.L.; Kumar, R.; Ali, R.; Elbeltagi, A.; Kuriqi, A. Methods to estimate evapotranspiration in humid and subtropical climate conditions. Agric. Water Manag. 2022, 261, 107378. [Google Scholar] [CrossRef]
Jensin, M.E.; Allen, R.G. (Eds.) Evaporation, Evapotranspiration, and Irrigation Water Requirements; American Society of Civil Engineers: Reston, VA, USA, 2016. [Google Scholar]
Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. FAO Irrigation and drainage paper No. 56. Food Agric. Organ. U. N. 1998, 56, e156. [Google Scholar]
Alazba, A.A. Estimating palm water requirements using Penman-Monteith mathematical model. J. King Saud. Univ. 2004, 16, 137–152. [Google Scholar]
Kisi, O.; Sanikhani, H.; Zounemat-Kermani, M.; Niazi, F. Long-term monthly evapotranspiration modeling by several data-driven methods without climatic data. Comput. Electron. Agric. 2015, 115, 66–77. [Google Scholar] [CrossRef]
Zhang, B.; Chen, H.; Xu, D.; Li, F. Methods to estimate daily evapotranspiration from hourly evapotranspiration. Biosyst. Eng. 2017, 153, 129–139. [Google Scholar] [CrossRef]
Ferreira, L.B.; da Cunha, F.F. New approach to estimate daily reference evapotranspiration based on hourly temperature and relative humidity using machine learning and deep learning. Agric. Water Manag. 2020, 234, 106113. [Google Scholar] [CrossRef]
Djaman, K.; Irmak, S.; Sall, M.; Sow, A.; Kabenge, I. Comparison of sum-of-hourly and daily time step standardized ASCE Penman-Monteith reference evapotranspiration. Theor. Appl. Climatol. 2018, 134, 533–543. [Google Scholar] [CrossRef]
Ji, X.B.; Chen, J.M.; Zhao, W.Z.; Kang, E.S.; Jin, B.W.; Xu, S.Q. Comparison of hourly and daily Penman-Monteith grass- and alfalfa-reference evapotranspiration equations and crop coefficients for maize under arid climatic conditions. Agric. Water Manag. 2017, 192, 1–11. [Google Scholar] [CrossRef]
Radwan, F.; Alazba, A.; Mossad, A. Flood risk assessment and mapping using AHP in arid and semiarid regions. Acta Geophys. 2019, 67, 215–229. [Google Scholar] [CrossRef]
Radwan, F.; Alazba, A.A. Suitable sites identification for potential rainwater harvesting (PRWH) using a multi-criteria decision support system (MCDSS). Acta Geophys. 2023, 71, 449–468. [Google Scholar] [CrossRef]
Al Shaye, N.A.; Masrahi, Y.S.; Thomas, J. Ecological significance of floristic composition and life forms of Riyadh region, Central Saudi Arabia. Saudi J. Biol. Sci. 2020, 27, 35–40. [Google Scholar] [CrossRef]
Alazba, A.; Mosad, A.; Geli, H.M.; El-Shafei, A.; Ezzeldin, M.; Alrdyan, N.; Radwan, F. Transboundary Urban Basin Analysis Using GIS and RST for Water Sustainability in Arid Regions. Water 2025, 17, 1463. [Google Scholar] [CrossRef]
Radwan, F.; Alazba, A.; Mossad, A. Analyzing urban watersheds morphometric in arid and semiarid regions using the complementarity of RST and GIS. Arab. J. Geosci. 2020, 13, 1–21. [Google Scholar] [CrossRef]
Radwan, F.; Alazba, A.; Mossad, A. Analyzing the geomorphometric characteristics of semiarid urban watersheds based on an integrated GIS-based approach. Model. Earth Syst. Environ. 2020, 6, 1913–1932. [Google Scholar] [CrossRef]
Alazba, A.; Mattar, M.A.; El-Shafei, A.; Ezzeldin, M.; Radwan, F.; Alrdyan, N. Water Demand Determination for Landscape Using WUCOLS and LIMP Mathematical Models. Water 2025, 17, 1429. [Google Scholar] [CrossRef]
Abtew, W.; Melesse, A. Evaporation and Evapotranspiration Measurement. In Evaporation and Evapotranspiration: Measurements and Estimations; Abtew, W., Melesse, A., Eds.; Springer: Dordrecht, The Netherlands, 2013; pp. 29–42. [Google Scholar]
Dai, L.; Fu, R.; Zhao, Z.; Guo, X.; Du, Y.; Hu, Z.; Cao, G. Comparison of fourteen reference evapotranspiration models with lysimeter measurements at a site in the humid Alpine Meadow, northeastern Qinghai-Tibetan Plateau. Front. Plant Sci. 2022, 13, 854196. [Google Scholar] [CrossRef] [PubMed]
Gebler, S.; Hendricks Franssen, H.-J.; Pütz, T.; Post, H.; Schmidt, M.; Vereecken, H. Actual evapotranspiration and precipitation measured by lysimeters: A comparison with eddy covariance and tipping bucket. Hydrology 2015, 19, 2145–2161. [Google Scholar] [CrossRef]
González-Estrada, E.; Cosmes, W. Shapiro–Wilk test for skew normal distributions based on data transformations. J. Stat. Comput. Simul. 2019, 89, 3258–3272. [Google Scholar] [CrossRef]
Razali, N.M. Power Comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling Tests. J. Stat. Model. Anal. 2011, 2, 21–33. [Google Scholar]
Ghasemi, A.; Zahediasl, S. Normality tests for statistical analysis: A guide for non-statisticians. Int. J. Endocrinol. Metab. 2012, 10, 486–489. [Google Scholar] [CrossRef] [PubMed]
Wilcoxon, F. Individual comparisons by ranking methods. In Breakthroughs in Statistics: Methodology and Distribution; Springer: Berlin/Heidelberg, Germany, 1992; pp. 196–202. [Google Scholar]
Field, A. Discovering Statistics Using IBM SPSS Statistics; Sage Publications Limited: London, UK, 2024. [Google Scholar]
Shaw, P.A.; Johnson, L.L.; Proschan, M.A. Intermediate topics in biostatistics. In Principles and Practice of Clinical Research; Elsevier: Amsterdam, The Netherlands, 2018; pp. 383–409. [Google Scholar]
Dlouhá, D.; Dubovský, V.; Pospíšil, L. Optimal calibration of evaporation models against Penman–Monteith equation. Water 2021, 13, 1484. [Google Scholar] [CrossRef]
Harwell, M. A strategy for using bias and RMSE as outcomes in Monte Carlo studies in statistics. J. Mod. Appl. Stat. Methods 2019, 17, 5. [Google Scholar] [CrossRef]
Karunasingha, D.S.K. Root mean square error or mean absolute error? Use their ratio as well. Inf. Sci. 2022, 585, 609–629. [Google Scholar] [CrossRef]
James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: Berlin/Heidelberg, Germany, 2013; Volume 112. [Google Scholar]
Pereira, H.R.; Meschiatti, M.C.; Pires, R.C.d.M.; Blain, G.C. On the performance of three indices of agreement: An easy-to-use r-code for calculating the Willmott indices. Bragantia 2018, 77, 394–403. [Google Scholar] [CrossRef]
Melo, G.L.d.; Fernandes, A.L. Avaliação de métodos empíricos na estimativa devapotranspiração de referência para Uberaba-MG. Eng. Agrícola 2012, 32, 875–888. [Google Scholar] [CrossRef]
Muhammad, M.K.; Nashwan, M.S.; Shahid, S.; Ismail, T.B.; Song, Y.H.; Chung, E.-S. Evaluation of Empirical Reference Evapotranspiration Models Using Compromise Programming: A Case Study of Peninsular Malaysia. Sustainability 2019, 11, 4267. [Google Scholar] [CrossRef]
Fitriani, V.; Bowo, C.; Mandala, M.; Gandri, L. Comparison of Empirical Methods to Estimated Reference Evapotranspiration. J. Ilm. Rekayasa Pertan. Dan Biosist. 2024, 12, 177–192. [Google Scholar]
de Camargo, A.d.; Sentelhas, P.C. Avaliação do desempenho de diferentes métodos de estimativa da evapotranspiração potencial no Estado de São Paulo, Brasil. Rev. Bras. De Agrometeorol. 1997, 5, 89–97. [Google Scholar]
Steidle Neto, A.J.; Borges Júnior, J.C.; Andrade, C.L.; Lopes, D.C.; Nascimento, P.T. Reference evapotranspiration estimates based on minimum meteorological variable requirements of historical weather data. Chil. J. Agric. Res. 2015, 75, 366–374. [Google Scholar] [CrossRef]

Figure 1. Weather station location map.

Figure 2. Climatic parameters: (a) temperature, (b) relative humidity, (c) wind speed, and (d) solar radiation.

Figure 3. Hourly reference ET comparison between (ASCE and KSA at h_plt = 50 cm) and (FAO and KSA at h_plt = 12 cm).

Figure 4. Average hourly ET_r vs. daily/24 ET_r for ASCE model.

Figure 5. (a) Wilcoxon test and (b) correlation analysis for hourly ET_r of the ASCE model.

Figure 6. Average hourly ET_ref vs. daily/24 ET_ref for KSA model at h_plt = 50 cm.

Figure 7. (a) Wilcoxon test and (b) correlation analysis for hourly ET_ref of KSA₅₀ model.

Figure 8. Average hourly ET_o vs. daily/24 ET_o for the FAO model.

Figure 9. (a) Wilcoxon test and (b) correlation analysis for hourly ET_o of the FAO model.

Figure 10. Average hourly ET_ref vs. daily/24 ET_ref for the KSA model at h_plt = 12 cm.

Figure 11. (a) Wilcoxon test and (b) correlation analysis for hourly ET_ref of KSA₁₂ model.

Figure 12. Performance statistics of the P–M mathematical models studied for hourly reference ET.

Figure 13. Daily ET_r vs. daily derived from hourly timestep ET_r for the ASCE model.

Figure 14. (a) Wilcoxon test and (b) correlation analysis for daily ET_r of the ASCE model.

Figure 15. Daily ET_ref vs. daily derived from hourly timestep ET_ref for the KSA model at h_plt = 50 cm.

Figure 16. (a) Wilcoxon test and (b) correlation analysis for daily ET_ref of the KSA₅₀ model.

Figure 17. Daily ET_o vs. daily ET_o derived from hourly timesteps for the FAO model.

Figure 18. (a) Wilcoxon test and (b) correlation analysis for daily ET_o of the FAO model.

Figure 19. Daily ET_ref vs. daily ET_ref derived from hourly timesteps for the KSA model at h_plt = 12 cm.

Figure 20. (a) Wilcoxon test and (b) correlation analysis for daily ET_ref of the KSA₁₂ model.

Figure 21. Performance statistics of the studied P–M mathematical models for daily reference ET.

Figure 22. Hourly ratio of ASCE to FAO and KSA₅₀ to KSA₁₂.

Figure 23. Daily ratio of ASCE to FAO and KSA₅₀ to KSA₁₂.

Table 1. Coefficients K and γ* formulas according to various P–M models.

Penman–Monteith Form	K		$γ^{*}$ (kPa·°C⁻¹)	h_plt (cm)
Penman–Monteith Form	Daily (MJ·m⁻² day⁻¹)	Hourly (MJ·m⁻² h⁻¹)	$γ^{*}$ (kPa·°C⁻¹)	h_plt (cm)
ASCE	$\frac{1600}{T_{a} + 273} u_{2}$	$\frac{66}{T_{a} + 273} u_{2}$	$γ (1 + 0.38 u_{2})$	Specified @ 50
FAO	$\frac{900}{T_{a} + 273} u_{2}$	$\frac{37}{T_{a} + 273} u_{2}$	$γ (1 + 0.34 u_{2})$	Specified @ 12
KSA	$\frac{1.854 \times 1 0^{5} λ / r_{a}}{T_{a} + 273}$	$\frac{7.725 \times 10^{3} λ / r_{a}}{T_{a} + 273}$	$γ (1 + r_{s} / r_{a})$	Ranging from 5 to 105

Note: h_plt: reference plant height.

Table 2. Statistical evaluation criteria for hourly and daily reference ET P–M mathematical models.

Criterion	Mathematical Formulation
Correlation coefficient (r)	$r = \frac{\sum_{i}^{n} (P_{i} - \bar{P}) (O_{i} - \bar{O})}{\sqrt{\sum_{i}^{n} {(P_{i} - P)}^{2}} \sqrt{{(O_{i} - O)}^{2}}}$
Nash–Sutcliffe efficiency (NSE)	$N S E = 1 - [\frac{\sum_{i = 1}^{n} {(O_{i} - P_{i})}^{2}}{\sum_{i = 1}^{n} {(O_{i} - \bar{O})}^{2}}]$
Root Mean Square Error (RMSE)	$R M S E = \sqrt{\frac{\sum_{i = 1}^{n} {(P_{i} - O_{i})}^{2}}{n}}$
Mean Absolute Error (MAE)	$M A E = \frac{1}{n} \sum_{i}^{n} \|P_{i} - O_{i}\|$
Average Bias (b)	$b = n^{- 1} \sum_{i}^{n} (P_{i} - O_{i})$
Index of Agreement (d)	$d = 1 - [\frac{\sum_{i = 1}^{n} {(P_{i} - O_{i})}^{2}}{\sum_{i = 1}^{n} {(\|P_{i} - O\| + \|O_{i} - \bar{O}\|)}^{2}}]$
Confidence Index (c)	$c = r . d$

Note: where: O: ET_ref values from P–M ASCE and P–M FAO. P: ET_ref values from P–M KSA. n: Number of samples. i: Integer number from 1,2,3…n.

Table 3. Confidence index classes [58].

Value of “c”	Class
>0.85	Excellent
0.76 to 0.85	Very Good
0.66 to 0.75	Good
0.61 to 0.65	Medium
0.51 to 0.60	Tolerable
0.41 to 0.50	Bad
<0.41	Terrible

Table 4. Normality test for hourly ET_r data sets.

Test	Statistic		p-Value
Test	AV Hourly	Daily/24	AV Hourly	Daily/24
Anderson–Darling	A2 = 3.058	A2 = 3.563	<0.0001	<0.0001
D’Agostino and Pearson	K2 = 30.06	K2 = 53.16	<0.0001	<0.0001
Shapiro–Wilk	W = 0.9633	W = 0.9502	<0.0001	<0.0001
Kolmogorov–Smirnov	KS = 0.07345	KS = 0.07477	<0.0001	<0.0001