1. Introduction
Evapotranspiration (ET), a term used in environmental science, hydrology, and agriculture to describe the process by which water is transferred from the Earth’s surface to the atmosphere, has become one of the most complex components of the hydrological cycle as it depends on multi-climatological parameters and their interactions [
1,
2,
3]. This important phenomenon is a component of the Earth’s water cycle, as it influences the movement and distribution of water in the atmosphere, the availability of water resources, and the climate. It is necessary to understand the connections between evapotranspiration and ecosystem type in response to climate change [
4,
5]. In the long term, the water that is directly available for human consumption and control is what separates evapotranspiration from continental precipitation. Therefore, quantitative ET knowledge is necessary for quantitative assessments of water resources and the impacts of climate and land-use change on these resources [
6,
7,
8].
To better comprehend this complex formation and determine the amount of water lost through ET, definitions have been made based on a variety of assumptions [
9,
10,
11,
12,
13]. ET approaches can be divided into three categories: hydrological and water balance methods, analytical methods based on climate variables, and empirical methods [
14]. The water balance technique and basin hydrology for measuring or indirectly determining the ET value consists of sampling soil water change and lysimeter testing. Since this is primarily a physically based method, its use in climate change assessment is limited to the laboratory [
15,
16]. The second approach, known as the micrometeorological method, uses a scientific understanding of the physics of evapotranspiration. Mathematical relationships have been developed to describe these processes through two fundamental climatological components: energy balance and mass transport [
10,
17,
18,
19]. The third method centers on developing empirical relationships that are often site-specific and based on regional climatological conditions and often used in regression analysis. These techniques are often calibrated by correlating experimental predictions with observed data [
20,
21,
22].
ET can be computed by the aerodynamic approach when energy is unlimited and by the energy balance method when vapor movement (mass transfer) is limitless. Normally, however, both of these factors are limiting; therefore, a combination of the two methods is required [
17]. The Penman–Monteith (PM) technique, recommended worldwide by the Food and Agriculture Organization (FAO), is one of the most universally used energy balance and mass transfer-based methods to compute evaporation on terrestrial surfaces [
14]. The equation developed by Penman [
23] to calculate evaporation from an open water surface was modified by Monteith [
24] by including canopy conductance to represent the ET rate from a vegetative surface. In the 1980s, the method was developed and presented in detail by combining the canopy and soil evaporation [
25]. Allen et al. [
1] updated the PM approach by adapting the ET values to the reference grass plant with constant albedo and surface resistance and proposed it as the FAO-56 Penman–Monteith equation. The PM method, recommended by the FAO for applications worldwide, has been analyzed in various regions for decades and has yielded sufficiently accurate results [
26,
27,
28,
29,
30].
The fact that the FAO-56 PM equation requires a large number of meteorological data (i.e., temperature, humidity, radiation, and wind speed) makes the solution of the equation difficult, and obtaining such data is not always possible. Although Allen et al. [
1] expressed the solvability of the equation using auxiliary formulae based on temperature data for the PM approach, many alternative empirical methods (i.e., temperature-based, radiation-based, and a combination of them) have been investigated for use in cases where sufficient data are not available [
31,
32,
33,
34,
35]. For instance, Tabari et al. [
34] evaluated the performance of thirty-one alternative empirical methods using meteorological data obtained under humid conditions in Northern Iran and the PM method as a reference. Analysis of the data revealed that, in general, the best results were attained with the Blaney–Criddle and Hargreaves methods compared to the PM equation; mass transfer-based approaches underestimated ET, whereas overestimations were more dominant in temperature-based and radiation-based methods. Sarlak and Bagcaci [
35] evaluated the performances of six empirical ET approaches, namely Blaney–Criddle, Jensen–Haise, Makking, Turc, Priestley–Taylor, and Hargreaves–Samani, compared to the PM method using daily meteorological data from five stations in Konya Closed Basin. They concluded that in the absence of daily observation data, the Turc, Hargreaves–Samani, and Priestley–Taylor techniques, which require less data, can be used as an alternative to PM. Similarly, Song et al. [
36] applied twelve different ET estimation methods relative to the PM method recommended by the FAO in northeast China, which they divided into eight sub-regions according to the climate and land types. In the study, temperature-based (Blaney–Criddle, Thornthwaite, Romanenko-1, and Romanenko-2), radiation-based (Hargreaves–Samani, Turc, Makking, and H-Makking), and combination methods (Linacre, simplified Penman–Monteith, Valiantzas-1, and Valiantzas-2) were employed using data over 126 stations for more than half a century. H-Makking and Valiantzas-2 approaches can be considerable as alternative methods in agricultural areas in northern regions of China, while Valiantzas-2, Romanenko-2, and H-Makking methods are more suitable during crop growth periods. Although the results varied regionally and seasonally, the temperature was the most sensitive parameter for estimating ET values. Furthermore, it is claimed that the Turc approach constantly tends to deviate negatively whereas the Hargreaves–Samani method produces notable biases.
Researchers have performed local calibrations as well as modifications of these empirical methods for use in various regions under differing conditions, even though many of the methods developed for a particular location are widely utilized worldwide. For instance, Cobaner et al. [
37] compared the reference evapotranspiration values determined using the PM technique in the Mediterranean region with the values derived using calibrated Hargreaves–Samani equations, which require less data. According to this study, the ET values produced by the Hargreaves–Samani equations calibrated with minimum and maximum temperatures, as well as minimum and maximum humidity data, were close to the values calculated using the PM formula. They concluded that the Hargreaves–Samani equation calibrated with the maximum temperature is better than those calibrated with other meteorological data. Similarly, monthly ET values were computed using the modified Blaney–Criddle and Hargreaves–Samani equations, obtaining data from three stations located in the semi-arid climate regions of Pakistan and compared with PM-driven ET values [
38]. Overall, it was stated in the study that both equations overestimated PM-driven ET values, although the findings of the Hargreaves–Samani method were superior to those of the modified Blaney–Criddle method. In another example, the effect of modified approaches on ET was investigated in Eastern Turkiye [
14]. Study results showed that the modified Hargreaves–Samani approach formed by the constant values in the Hargreaves–Samani equation revealed better results than the Hargreaves–Samani equation over the region, while altitude-based modified Hargreaves–Samani technique has the lowest correlation results among the other methods in the study. Additionally, they concluded that the modification of the Blaney–Criddle formula increased the performance relative to the Blaney–Criddle equation.
It is necessary to conduct studies to ascertain the suitability and accuracy of empirical techniques for different regions that have emerged from investigations conducted in local areas with soil structure as well as certain climatic and environmental characteristics. Several empirical approaches can be used with varying degrees of success, owing to the unique characteristics of each place and the availability of a limited number of measurable climatic and environmental factors. Hydrological studies are significant for the effective management of water resources in Kahramanmaras, given their considerable water potential. Unfortunately, ET measurements are not available for Turkiye, and these values vary spatiotemporally. However, no comprehensive ET study has been conducted in the city, and a district-based ET study was implemented for the first time in the region. Additionally, the city has complex climate zones, varying vegetation cover, and uneven distribution of elevation gradient.
In this study, the quantitative determination of ET, one of the most significant water losses of the hydrological cycle, can contribute to studies in this field and will also play an important role in various hydrological planning studies, such as agricultural irrigation projects and basin management, to be carried out in the study area. Data-scarce ET time series were assessed utilizing ten climatological-driven ET approaches relative to the PM method over eleven districts of Kahramanmaras region, Turkiye. In this study, the performances of ten empirical evapotranspiration approaches, namely Blaney–Criddle (BC), modified Blaney–Criddle (BCM), Hamon (HM), modified Hamon (HMM), Hargreaves–Samani (HS), Kharrufa (KH), Romanenko (RM), Schendel (SC), Thornthwaite (TH), and Penman–Monteith at 0.5 m (PM0.5), were evaluated at daily and monthly temporal resolution in Kahramanmaras province and its eleven districts and compared to the reference PM method. In addition, the effect of modified techniques on evapotranspiration estimates was investigated over the region.
3. Results
The daily ET values for each station were estimated using the PM, PM
0.5, BC, BC
M, HM, HM
M, HS, KH, and SC techniques, resulting in the box plot shown in
Figure 3. A box plot visualizes the five-number summary of the dataset: the minimum, first quartile, median, third quartile, and maximum. The box plot was produced using the fourth-generation programming language MATLAB. Alternative methods are shown on the horizontal axis of the box plot in
Figure 3, whereas evapotranspiration values in mm d
−1 predicted using these approaches are displayed logarithmically on the vertical axis. As shown in the figure, days with missing data and negative ET values were not evaluated, while the extreme ET values and estimates between these values produced by each approach were presented with values from 0.001 to 100 mm d
−1. As can be seen from the figure, PM-driven simulations were overestimated by the PM
0.5-based approach over all districts, but the HM method, which displays a symmetrical box plot distribution, consistently underestimates the references in the study area. The positive results of the modifications to the BC method are shown in the graph. For example, the BC
M approach yielded more effective outcomes than the BC method, which gave ET values within a narrow range when compared to the reference method over the region. Additionally, at numerous stations, the results acquired by the HM
M technique were comparable to those obtained by the reference ET
PM values, with higher performance than the HM approach, revealing the importance of modifications made to the HM equation.
The ET values obtained were clustered in the range of 1 to 6 mm d−1 (between the lower and upper quartiles) at Kahramanmaras station (S1), where 22 years of comprehensive data are available, which means that half of all ET values are in this range, considering the PM technique used as reference. The PM0.5 approach results reveal that it slightly overestimates the ET values with respect to the reference PM method. The BC-driven ET values, shown in black, were aggregated in a narrower range at the S1 station relative to the reference method. The BC method simulated the minimum (maximum) ET values with overestimation (underestimation). It can be seen that the methods that give results close to the reference method at the S1 station are the HMM and HS approaches, although there are slight differences in the values of the whiskers. It is clear from the box plot of the HM approach with the best performance, shown in blue, that its quarters are distributed uniformly, and there is no skewness in the data to the PM technique. The upper outlier and interquartile range of the ET simulations produced by the BCM and KH methods—which are represented in gray and orange, respectively—produced similar findings to those of the reference method, but they had a longer minimum outlier owing to the underestimation of the small ET values. Overestimation is dominant in the maxima of SC-based ET predictions, whereas the opposite is true for smaller values less than one.
Additionally, the reference ET values for the S2 station, which had the least amount of accessible climatic data, were concentrated between 2 and 9.5 mm d−1. The BCM and SC techniques yield ET values within a similar range; nevertheless, the values obtained by both methods simulate the minimal ET values with strong underestimation. It can be seen that while the PM0.5 approach produces ET values with a comparable distribution to the reference method, as in the S1 station, it continually produces a slight overestimation. The findings can be made more accurate by multiplying the values by the calibration coefficients to minimize this bias. The BC-based ET values generated for the Dulkadiroglu district were clustered in a narrower range, similar to the results at the S1 station. Although different distributions are attained in the HM, HMM, HS, and KH approaches, where the maximum ET values are estimated to be close to each other, it has been observed that the underestimations are predominant compared with the PM reference method.
Moreover, as can be seen in the box plot of the S3 station, it can be seen that the HS method, shown in green, produces results substantially closer to the reference approach, although it slightly underestimates the minimum outlier. Although the ET values generated by the SC approach remained in the interquartile range compared to the reference method, the absolute values of the extreme ETs were higher. When the PM0.5 technique was examined in terms of distribution, it was observed that it was quite similar to the reference method, although it tended to yield slightly higher ET values compared to the reference method. The convergence of the BC-driven ET time series in the first and third quartiles clearly shows a concentration in a narrower range, similar to those from other stations. It has been noted that the BC, BCM, and KH approaches underestimated the minimum values for quartiles smaller than 0.05 at the S3 station. However, the BCM and KH approaches performed well in the interquartile range and higher whisker values. Although the HM and HMM methods capture ET values with very small variations in a negative way from the reference values, their distributions are similar to those in the PM.
In addition, when looking at the box plot of stations S4 and S5, the extreme ETs were close to one another, and the ET values computed using the reference technique were concentrated in a similar range. It was observed that the ET values acquired by the HS and KH techniques at both stations were concentrated in a similar range and captured the majority of the reference values, although the ET values smaller than the lower quartile were predicted with underestimation. Although the extreme ET values in whiskers simulated with the HM and HMM techniques were close to the reference values, it was observed that both stations estimated ET values with a slight constant underestimation in the interquartile range compared to the reference values. The BCM and SC methods produce overestimation (underestimation) ET values in the upper (lower) whisker compared to the reference method at both stations, while PM0.5-driven ET simulations overestimated ET values in all quartiles.
Additionally, ETPM values, varying between 1.8 and 4 mm d−1 for the interquartile range, were clustered in a narrower range relative to other stations, and they were captured using the HMM, HS, and SC approaches at the S6 station. While the PM0.5 and BC methods produce higher ET values for values between the 25th and 75th percentiles with respect to the PM technique, ETBC values are concentrated in a narrower range. Although it was discovered that the ETHM results were close to the reference for extreme values, these estimations clustered with underestimation until the median. Although BCM-based ET estimates produce values close to the PM, it has been monitored that this method underestimates ET values smaller than the 25th percentile. As can be seen from the figure, the box plot produced by the KH approach has a wider spread with a higher standard deviation, and ETKH values are simulated with a significant degree of bias in comparison to the reference method.
In the box plot of stations S7 and S8, where ET
PM values are close to each other, it is seen that the ET
BC simulations are similar to the other stations with the clustered in a narrower range. The ET values estimated by these two approaches are close to one another, and their maximum (minimum) values are simulated less (more) than those of the reference method. The PM
0.5 model overestimated the reference ET values with a slight difference, similar to the previous six stations. In the Pazarcik and Caglayancerit districts, the formula that yielded the closest results to the reference method was the HS approach. The BC
M, KH, and SC approaches, which simulated the minimum ET values up to the first quartiles with a high deviation compared to the reference method, captured the ET
PM values at both stations for the values interquartile range. It can also be seen from
Figure 3 that the evapotranspiration values obtained via the HM and HM
M techniques had a more symmetrical distribution, even though the ET values at both stations underestimated the reference values.
At the S9 station, the interquartile range of reference ET values varies between 1.8 and 6 mm d−1, and the best performance was acquired with the ETHS formula. In the Ekinozu district, although the BCM, KH, and SC approaches underestimated the minimum ET values in the lower whisker with high deviation compared with the reference method, ETPM values were captured between the 25th and 75th percentiles. Underestimation is dominant in HM and HMM-driven predictions, and BC-driven simulations are concentrated in a narrower range with a small standard deviation, whereas KH-based values are spread over a wider range with a high standard deviation. In addition, while the ETBC values are in a narrower range with a small standard deviation in Nurhak and Turkoglu, the PM0.5 technique tends to overestimate evapotranspiration time series compared to the reference method, as in the other stations in general.
When the box plot of the S10 station was examined, while ETKH and ETSC values had high variance compared to ETPM, both methods underestimated evapotranspiration smaller than the median value, and strong overestimation was dominant in ETSC values greater than the median. The BCM approach revealed the most accurate results relative to the reference method at the S10 station (except for the lower whisker), while the best performance was obtained with HS at the S11 station, with insignificant underestimations in the upper whisker. Another result obtained from the figure is that HS, HM, and HMM-based estimates in the Nurhak district have a symmetric distribution and simulate ET values slightly less than the reference values. In the Turkoglu district, ET values in the interquartile range were captured by the BCM, HM, HMM, and KH approaches in addition to the ETHS formula. The graph also shows that the values in the lower than 25th percentile are underestimated by the BCM, KH, and SC techniques.
To investigate the impact of slope on evapotranspiration, its map was prepared using the DEM model via the Arc-GIS program, as seen in
Figure 4a. When the slopes of Nurhak and Turkoglu were compared, it was observed that the slope of the district where the S10 station was located was steeper than S11. On the other hand, while low ET values were detected in Nurhak (1368 m) at high altitudes, a positive effect of the slope on ET was observed. Similarly, while the S8 station in Caglayancerit (1001 m) is expected to have lower ET values than the S7 station in Pazarcik (787 m) because of the significant altitude differences, it is believed that the higher slope of Caglayancerit contributes to an increase in ET. In addition, the altitude of the S5 station in Elbistan is 1137 m, and Nurhak ET values are higher than those of S5 and S7 despite the higher altitude. Although the district has a high altitude, its steep slope is predicted to have a directly proportional effect on ET.
While examining the effect of the slope on ET, an aspect geography map over the study area given in
Figure 4b was prepared to take into account the direction in which the slope was formed and the angle of receiving sunlight to make more accurate evaluations. To further support and elucidate the aspect map assessments and identify any subtle variations on a station basis, the solar radiation map shown in
Figure 4c was generated. The data connected to the prepared map via Arc-GIS were obtained from the internet portal developed by Solargis and financed by the Energy Sector Management Assistance Program (ESMAP) [
74]. Larger ET
PM values are found in S4, located in Afsin, even though the slopes of the S4 and S5 stations are close to one another when
Figure 3 and
Figure 4a are examined together. However, it becomes clear from the aspect map in
Figure 4b that Afsin has more southern slopes and that these slopes have a positive impact on ET.
Additionally, examining the solar radiation maps of the two stations in
Figure 4c, it can be concluded that Elbistan has significantly stronger solar radiation than Afsin; on the other hand, a slight discrepancy occurred in ET values of these stations due to the positive effect of solar radiation on ET. Upon analyzing the aspect (
Figure 4b) and solar radiation (
Figure 4c) maps of the districts containing the S2 (525 m) and S11 (535 m) stations with similar altitudes (
Figure 2) and slopes (
Figure 4a), it was observed that Dulkadiroglu had larger ET
PM values (
Figure 3) than Turkoglu because of the dominance of the southern slopes and high solar radiation. Moreover, when the slopes in
Figure 4a of S1 (572 m) and S2 stations with similar heights were examined, it was revealed that the S1 station had a greater slope but exhibited lower ET values. This is because Dulkadiroglu has stronger solar radiation and more southern slopes, as shown in
Figure 4b,c, and these two factors have a linear effect on ET.
When the ET values of the S8 (1001 m) and S9 (1246 m) stations, which have close slopes, were compared (
Figure 2 and
Figure 4a), higher ETs were reached in Caglayancerit because of their lower altitudes (
Figure 3). It is also estimated that this may have been triggered because Caglayancerit has more southern slopes (
Figure 4b) and higher solar radiation (
Figure 4c) than Ekinozu. As another example, as can be seen in
Figure 3, ET values in a similar range for S10 (1368 m) and S11 (535 m) stations (
Figure 2), which have a high-altitude difference, reveal the importance of the slope (
Figure 4a) and the abundance of southern slopes (
Figure 4b) in evapotranspiration estimations. On the other hand, even though S5 (1137 m) and S6 (1108 m), whose station altitudes are close to one another, show no discernible differences in their aspect maps (
Figure 4b), it is obvious that Andirin has a steeper slope (
Figure 4a) and less solar radiation than Elbistan (
Figure 4c). Thus, although Elbistan has a flatter land structure, high solar radiation resulted in larger evapotranspiration values in S5.
For the monthly variation analysis of alternative ET methods across Kahramanmaras in a general manner, Thornthwaite and Romanenko methods were added, and the combined ET values of all stations are presented with the scatter plot in
Figure 5. The X- and Y-axes of the graph represent the monthly evapotranspiration values in mm d
−1 for the reference PM and alternative approaches, respectively. In addition, the calculated monthly correlation value and linear trend line with its formula between the reference PM and other methods are displayed in the plots.
As can be seen from the scatter plot shown in
Figure 5, the PM
0.5 method has a strong positive correlation with the reference PM method with a PCC value of 0.99, while it estimates high ET values on a monthly basis more so than the references. While the BC-driven simulation revealed a correlation of 0.95, it overestimated (underestimated) low (high) ET
PM values. This discrepancy became wider for values greater than 6 mm d
−1. On the other hand, a significant improvement is observed in the BC
M technique compared to the original method, and it estimates the reference values slightly higher with a strong correlation (0.98). Additionally, the scatter plots of the HM and HM
M methods reveal similar results, with a correlation of 0.94 in both methods. Unlike other methods, the two equations, which include water vapor density, tend to underestimate ET
PM values.
The HS approach has a PCC value of 0.96, indicating a strong positive relationship with the PM method, whereas it underestimates the maximum extremes in ET estimations relative to the reference values. As can be seen in the box plot in
Figure 3, although KH estimated the minimum ET values lower than the references, it is observed that the trend line in the scatter plot in
Figure 5, obtained with monthly values with high standard deviation, produces similar results compared to the reference method with 0.94 correlation and yields an approximate slope of 1:1. As can be seen from
Figure 5, the RM and SC methods show the highest variance and have correlation values of 0.89 and 0.88, respectively. In general, overestimation is dominant in the RM and SC formulae, and the SC method performs the poorest due to the significantly higher inconsistency in the maximum ET values. The results show that the monthly average temperature-driven ET
TH simulation underestimates the reference ET
PM values, while it has a lower standard deviation in its distribution compared to the other monthly methods, RM, and a strong PCC value of 0.93.
Figure 6a was obtained to evaluate the CRMSE index of eight different ET estimation methods, which vary between 10.63 and 91.69%, at 11 stations in Kahramanmaras. The BC
M-based simulations at the S2 station produced the lowest CRMSE of 10.63% among all values, whereas the HS technique at stations S1, S3, S4, S5, and S9 reached the lowest value. The best results in terms of CRMSE performance are seen to be the KH method, with values of 19.73% and 19.16% at stations S7 and S8, respectively. Moreover, the PM
0.5, BC
M, and HM methods produced lower CRMSE values than the other methods at stations S6, S10, and S11, respectively. It was also noted that the SC technique, which ranged from 34.94 to 91.69%, showed the worst performance compared with other approaches at all stations.
Examining
Figure 6b, which demonstrates the variation of the determination coefficient, reveals that the PM
0.5 approach is the most consistent with the reference method, having the highest DET values (0.94–1) at all stations. On the other hand, the obtained results detect that the SC technique has the lowest DET values ranging from 0.61 to 0.82. The highest DET values, after PM
0.5 simulations, were produced with the BC method at the S1 station; the HS method at stations S3, S4, and S5; and the BC
M method at the other seven stations.
Additionally, the distribution of the MAE index values, which had the same unit (mm d
−1) as evapotranspiration, is shown in
Figure 6c. Even though the BC
M has the lowest MAE values at S2, S8, and S10, the HS technique is the most successful approach since it shows the least absolute difference at the other eight stations. Although the methods with the highest MAE differ depending on the station, the SC technique has the worst MAE performance at the six stations.
When the results were evaluated according to the MRE index, one of the indices frequently used in performance assessments, BC
M showed the best overall result (
Figure 6d). The HM
M method has the highest accuracy, with an MRE value of 0.0 at station S1, whereas negative MRE values indicate an underestimation tendency at other stations. In general, the PM
0.5 and SC (HM and KH) techniques with positive (negative) MRE values overestimated (underestimated) ET compared to the reference method.
The MSE error metric results for the eleven stations are presented in
Figure 6e. While the lowest performance is achieved with SC, surprisingly, the PM
0.5 values are the highest MSEs after SC. Upon examining the results per station, it is seen that the BC
M approach at stations S2, S8, and S10; the KH method at station S7; and the HS technique at the other seven stations have the least error with MSE values closest to zero.
In
Figure 6f, the NSCE statistical index is used to evaluate the predictive accuracy of evapotranspiration algorithms. As can be seen from the graph, NSCE values indicated performance higher than 0.5 across stations and methods except for the SC technique, while negative NSCE values were captured for SC methods (except S2 and S3). While the BC
M method exhibited the best performance at stations S2, S7, S8, and S10, this method reached the highest NSCE value at station S2 with a value of 0.98. The HS method produced the closest result to the reference values for the other seven stations.
NNSCE is an extension of NSCE and is used as a benchmark metric, allowing for easier comparison and interpretation (
Figure 6g). NNSCE performed furthest from the reference method were HM in S3, PM
0.5 in S2, and the SC method in the other nine stations. At stations S2, S7, S8, and S10, the BC
M method yields accurate results by giving the NNSCE value closest to one, and the KH method converges to 0.9 at station S7. As can be observed from the NNSCE index, the HS technique worked well for the remaining seven stations.
It is observed from
Figure 6h that while the BC
M, PM
0.5, and SC methods, overall, take positive Bias values, indicating a tendency to overestimate, underestimation is more dominant in KH, HM, and HM
M approaches with general negative Bias values. Another noteworthy point when the results are examined in terms of the Bias error metric is that the BC
M and KH methods have performances closest to zero at the majority of the stations.
The PCC results are given in
Figure 6i, and it can be seen that the strongest positive correlation is the PM
0.5 method, ranging between 0.97 and 1.00, while the lowest is the SC technique, varying between 0.78 and 0.91.
Figure 6j displays the variation in the RMSD metric at 11 stations, which has the same unit as evapotranspiration as the MAE performance assessment index. As can be seen from the figure, while the BC
M method performed better than the other empirical formulae with RMSD values closest to zero at stations S2, S8, and S10, the KH approach showed a successful performance at station S7. The figure indicates that for the other seven stations, the HS method produced RMSD values with a discrepancy of only a maximum of 1 mm d
−1. The HS (SC) method achieved the best (worst) performance among all the methods at station S3 (S10) with an RMSD value of 0.49 (4.80) mm d
−1.
4. Discussion
As can be seen from the box plot in four equal quartiles, information was obtained regarding the extremes of ET values, the range in which they were clustered, and their distribution. The BC
M, KH, and SC-based ET simulations underestimated minimum values smaller than the 25th percentile relative to the ET
PM. Saud et al. [
75] analyzed the spatiotemporal variation of several methods over Al-Anbar province, western Iraq. They found that the Kharrufa equation tended to underestimate ET values, similar to the above-mentioned finding. On the other hand, the SC method tends to predict maximum extreme ET values larger than the third quartile than the reference method. The performance of the PM
0.5 approach was higher for the minimum ET values, which were especially smaller than the lower quartile. However, it produces more ET values, albeit with a slight difference, at larger ET values compared with the reference technique. Additionally, the ET
BC values were clustered in a narrower range than the reference ET values with smaller standard deviations. It was concluded that the BC
M method produced more successful results than the BC method, showing values close to those of the PM method for values greater than the lower quartile. As a matter of fact, in this sense, the positive effect of the modifications on the BC-based ET equation was observed, as in the previous studies [
14,
76]. It is also understood that the ET values obtained with the HM and HM
M approaches have a more symmetrical distribution; however, the HM method underestimates ET
PM at all stations, whereas the modified HM method achieves more successful results than the original HM method. The results of these two modified equations support the importance of adjustment in the original formulae [
77,
78,
79]. For example, Proutsos et al. [
19] evaluated 127 ET approaches in Mediterranean urban green sites and concluded that the adjusted models performed more accurate ET simulations overall compared to the original equations. Compared to other techniques, the HS method produces results that are closest to the reference ET
PM when examining the box plot of the method at all stations.
The graphs and maps were analyzed to see how altitude, slope, aspect geography, and solar radiation affected the evapotranspiration values. The results showed that evapotranspiration varies proportionally with the impact of altitude on Kahramanmaras. The results support the findings of Lin et al. [
80] regarding ET correlation with topography in the Xiliao River Plain, while the outcomes are different from those reported by Ablikim et al. [
81], in which ET values rise with the increment in altitude in the Urumqi River Basin. For instance, the reason for the high ET values in S2, which has the lowest altitude of 525 m, is thought to be the adverse effect of altitude on ET. The observation of lower ET values at the S3 station, which has a higher elevation (1344 m) than the S2 station, shows that the same effect occurs. The fact that the S9 station, at an altitude of 1246 m, had lower ET values relative to the S8 station at an altitude of 1001 m supports this effect. As S7 (787 m) exhibited smaller ET values than S2 and S11, which have altitudes of 535 m, it is another example that elevation has an adverse effect on ET. While station S1 (572 m) had smaller ET values relative to the S2 station, indicating the impact of altitude, a larger discrepancy in ET values was observed in response to the minor elevation differences. On the other hand, despite the high-altitude variation between stations S1 and S3 (572 m/1344 m), the difference between the ET values of both stations was lower. Similar results were observed at stations between S10 (1368 m) and S11 (535 m) and between S7 (787 m) and S8 (1001 m). These results reveal that the inverse proportion of altitude on ET may vary depending on altitude and that factors other than elevation may also be effective for ET. For this aim, considering previous studies showing that distributions of slope and aspect may have an effect on evapotranspiration, slope, aspect geography, and solar radiation maps, these were also investigated by correlating with ET [
82,
83,
84]. A strong correlation with ET has been observed on the southern slopes, and solar radiation is another factor controlling ET. Among all the stations, for example, S6, which has a narrow ET distribution with a minor standard deviation and stands out with its low ET values compared to other stations, appears to have the lowest solar radiation. As another example, it is recognized that the S2 station at the lowest altitude exhibits the highest ET values as a result of all these factors, such as the fact that it has a more southern slope compared to other stations and intense solar radiation, in addition to its steeper slope.
Additionally, within the scope of this study, the combined performances of all methods across the city were examined on a monthly basis using a scatter plot. For instance, the Schendel approach produced ET values at some stations that were similar to the reference method, as can be seen from the box plot; on the other hand, the method yielded the lowest performance of all the alternative methods over the entire city when the monthly based scatter plot was examined. Both the BC and TH techniques underestimated ET
PM reference values, while an overestimation tendency was observed for the PM
0.5 method, although the highest correlation was achieved with it. As stated in the methodology section, the modified Blaney–Criddle approach applies parameters computed using the minimum relative humidity, ratio of actual sunshine to the maximum possible sunshine duration, and daytime wind speed instead of the seasonal crop coefficient used in the original method. This resulted in a significant improvement in the evapotranspiration estimations, as seen from the scatter plot. In the Hamon methods, which tend to underestimate ET, the modified version produced better ET values than the original equation. This graph reveals the importance of modification analysis more clearly and supports the results of previous studies [
14,
76,
77,
78]. Another important result obtained from the scatter plot is that the KH method, which tends to predict lower daily ET minimum extremes, showed a high performance in monthly simulations across the city. Additionally, the HS method produced the best result by providing a high correlation, although it underestimated the maximum extreme values.
The performances of the ET approaches on a daily scale were computed and graphed using ten various statistical criteria to enhance the evaluation of the methods to be more detailed and sensitive for each station. As can be seen from the NSCE results, a value higher than 0.5 demonstrates that alternative approaches (except SC) can sufficiently replicate the variability of the ETPM. Conversely, negative NSCE values indicate that for SC-driven simulations at the stations, the mean of the ETPM values is a better predictor than the SC empirical method. The main difference for NNCSE lies in the normalization of the NSCE value, which helps NNSCE to be less sensitive to the variability in the reference data and allows for better comparison across other empirical approaches, regardless of its variance. The highest NSCE and NNSCE values were obtained with BCM-driven simulations in S2, S8, and S10, whereas the best results were yielded with ETHS estimations in S7, and it is the HS method at the other seven stations. The obtained findings were observed more clearly in the NNSCE graph. It is essential to note that while the NSCE is a crucial error metric for measuring predictive accuracy, it has some limitations. For instance, it gives equal weight to both overestimation and underestimation errors, which might not always reflect the true importance of such errors in the direction of discrepancy. In comparison to ETPM, in general, the underestimation is more dominant in Hargreaves–Samani, Kharrufa, and both Hamon equations with negative MRE values, whereas the BCM, PM0.5, and SC techniques overestimated the reference values with positive MRE indices. These results are seen more clearly in the Bias metric. Additionally, overall, HS, BCM, and KH-driven simulations yielded the best MSE, CRMSE, and RMSD results, while the PM0.5 (SC) method had the highest (lowest) value at all stations according to the DET metric. All approaches, excluding SC, have strong positive correlations greater than the PCC value of 0.9 at all stations except for S6. The BC (HS) technique exhibits the highest PCC values in S1 (S3, S4, and S5) after the PM0.5 approach, while the BCM method, in S2 and the six stations between S6 and S11, is secondary in terms of PCC performance.
However, this study has some limitations; for instance, in addition to the aforementioned factors, ET may also be affected by elements such as vegetation cover, soil map, land use, and land cover belonging to the relevant study area, and they were not evaluated within the scope of this study. Despite these limitations, the results of this study can help future studies mitigate the effects of drought and the prejudice of hydrological modeling results over the study area. Furthermore, the findings motivate future studies to analyze how well alternative empirical approaches are performed in other areas with features comparable to the analyzed region.
5. Conclusions
Empirical approaches for calculating ET values hold a significant place in the literature because of their advantages, such as simplicity of use when utilizing meteorological data and obtaining results in a short time. Evapotranspiration may be quantitatively assessed using a variety of techniques. Numerous studies have been conducted to find the most straightforward and accurate method that can be applied in various study regions, and these have been updated in response to evolving circumstances. In this study, evapotranspiration simulations carried out various techniques such as Penman–Monteith, Penman–Monteith at 0.5 m, Blaney–Criddle, modified Blaney–Criddle, Hamon, modified Hamon, Hargreaves–Samani, Kharrufa, Schendel, Romanenko, and Thornthwaite using daily meteorological data from stations in 11 districts of Kahramanmaras province. In evaluating the alternative methods at daily and monthly temporal resolutions, the Penman–Monteith method, recommended by the Food and Agriculture Organization for use worldwide, was considered as a reference.
A box plot was generated using ET values derived from daily scale estimations utilizing the PM, PM0.5, BC, BCM, HM, HMM, HS, and KH, along with SC methods, and assessments were conducted on a station basis. The HM method, which shows a symmetrical box plot distribution, underestimated the ETPM over the region, while the PM0.5 method overestimated the reference ETPM values at all stations. In contrast to the BC method, which produced ET values in a narrow range compared with the reference method in the 11 districts, the BCM method produced more successful results. The HMM method, which has a symmetrical box plot distribution, produced results close to those of the reference method at some of the stations, indicating that the modification of the HM method was positively reflected. Additionally, underestimation is dominant in minimum whisker ET values obtained from BCM, KH, and SC-driven simulations. In the Onikisubat district, the HM, HMM, and HS methods yielded the highest performances among the other methods. Although the BCM, KH, and SC techniques underestimated the minimum extremes, they generally overestimated ET values compared to the reference method. In Dulkadiroglu, where the highest ET values were produced among all stations, the approaches that gave the closest results to the reference method for the interquartile range were the BCM, KH, and SC. In the Goksun district, BCM, HS, KH, and SC-based ET simulations captured ETPM variations greater than the 25th percentile, whereas they predicted the minimum ET values to be lower. It was concluded that the HS and KH approaches, which underestimated the minimum outliers, provided the closest results to the reference method in the districts of Afsin and Elbistan, where similar ET values were achieved. For Andirin, where ET fluctuations were in the lowest range among all stations, the HS and SC methods produced the most accurate results for ETPM values in the interquartile range. The HS method, which slightly underestimated the reference ET values, exhibited the highest accuracy in Pazarcik, Caglayancerit, and Ekinozu districts. In Nurhak, the BCM method was the most successful, slightly underestimating the minimum ETPM outliers, while the second-highest performance belonged to the HS simulations with underestimation relative to the reference method. Finally, in the Turkoglu district, the HMM and HS methods produced results similar to those of the reference method.
To evaluate the statistical performance of the methods on a daily scale, the central square mean error, determination coefficient, mean absolute error, mean relative error, mean squared error, Nash–Sutcliffe efficiency coefficient, normalized NSCE, percentage error, Pearson’s correlation coefficient, and root mean square error metrics were applied. According to the CRMSE index, the HS method had the lowest values in Onikisubat, Goksun, Afsin, Elbistan, and Ekinozu, whereas the BCM approach achieved the highest performance in Dulkadiroglu and Nurhak. Additionally, KH resulted in the smallest CRMSE in Pazarcik and Caglayancerit, whereas PM0.5 and HM were the best in Andirin and Turkoglu, respectively. The PM0.5 approach performed well at all stations based on the DET and PCC metrics because of its similarity with the reference method, although SC-based simulations produced the lowest values. After the PM0.5-driven performance, the methods showing the highest correlations are the HS method at Onikisubat, Goksun, Afsin, and Elbistan, similar to the CRMSE metric, whereas the BCM approach has the highest at other stations. Moreover, the most successful results were obtained via the BCM approach in Dulkadiroglu, Caglayancerit, as well as Nurhak, and the HS method yielded MAE values less than 0.5 mm d−1 at other stations, while the SC and PM0.5 formulae produced strong discrepancy in terms of MAE. An underestimation tendency is observed in the HM, HMM, and KH methods with negative MRE and Bias indices, while the PM0.5 and SC methods overestimated the ETPM values. Additionally, the BCM and HS techniques are generally the ones that are closest to zero, while the methods with the least error vary based on the stations in terms of MRE and Bias metrics. MSE and MRE produced comparable outcomes, and SC-based ET simulations performed the poorest in terms of both statistical indices. The RMSD values more clearly displayed inconsistencies and corroborated those derived from the MSE index. Additionally, the lowest performances were obtained with SC and PM0.5 formulae, while other methods generally received NSCE values greater than 0.7 and BCM, HS as well as KH-driven ET predictions exhibited the best NSCE values. The negative NSCE values in the SC-driven simulations indicated that the model did not capture the variability and patterns present in the reference value. This finding typically means that SC predictions perform poorly and might be less accurate than simply using the mean of the ETPM data as a prediction. NNSCE performances support these results and reveal the variation in accuracy/discrepancy on a station basis more clearly.
In this study, the monthly averages of ET values generated by the TH and RM techniques—which can compute ET on a monthly scale—as well as the daily ET values produced by other methods, were utilized to build a scatter plot. An overestimation tendency is observed for the PM0.5 approach, with the strongest correlation value of 0.99 for the reference among the alternative methodologies. The approach that generates the ET value at a height of 50 cm is likely to have been overestimated as a result of the standardized as a result of coefficient modifications in the original PM equation. The ETBC values were clustered, ranging from 1.82 to 6.15 mm d−1, and the BC model overestimated low ET values, whereas it underestimated high ET values relative to ETPM values. On the other hand, the modified BC version used the “a” and “b” coefficients computed depending on various climatic parameters (i.e., relative humidity, wind speed, and sunshine duration) instead of the “k” seasonal crop coefficient in the formula and yielded significant improvement in the results. After this adjustment, it was concluded that the BCM approach, which has the second highest correlation with a PCC value of 0.98, can be used as an alternative to the PM method over many districts in the region. Although the HM and HMM techniques involving water vapor density underestimated the ETPM values with an identical PCC (0.94), unlike other alternative methods, the modified version produced better results than the original Hamon formula. Furthermore, even though the 1.2 local calibration coefficient improved the results in the equations of both techniques, it is anticipated that regional and seasonal modifications to the included coefficient will improve the accuracy of the ET estimations. In addition, the HS approach produced ET values that were similar to the reference method throughout Kahramanmaraş, demonstrating a successful performance with a low bias between ETPM and ETHS and a high correlation of 0.96 PCC. The KH technique, in which the linear trend line is close to that of the PM method with a PCC value of 0.94, produced accurate ETPM at some stations, although it had higher noise in overall ET estimates. Moreover, the TH method, which underestimates ET values compared to the reference method, showed a high correlation with a PCC value of 0.93. Among all alternative empirical approaches, SC and RM methods generated the highest deviation in the simulations relative to the ETPM values and smallest PCC values of 0.88 and 0.89, respectively. Additionally, both methods tended to overestimate the evapotranspiration time series compared with the reference method. Examining the equations for both methods revealed that ET values were derived only from the average temperature and relative humidity data. However, the majority of other alternative formulae are functions of sunshine duration in addition to the aforementioned parameters, and coefficients derived depending on sunshine duration are also enhanced in the correlation.
Within the scope of this study, where the impact of terrain characteristics and altitude on ET was also assessed, a slope, aspect geography, and solar radiation map of the study area were prepared. It was concluded that altitude has an adverse effect on ET, although the evaluation of altitude alone might not be comprehensive except in rare circumstances. Upon evaluating the aspect, slope, and solar radiation maps in this manner on a station basis, it was observed that the slope positively impacted ET. While the southern slopes of the slope are another factor that increases ET, it has been detected that interconnected solar radiation raises ET. Along with these elements in terrain characteristics, examining the effects of land use and vegetation on ET can motivate future studies.
In light of all examinations and evaluations, it can be concluded that the BCM and HS approaches can be utilized as alternatives to the PM method in estimating evapotranspiration values over Kahramanmaras province. Additionally, the KH technique, which only employs temperature data, can be listed as an alternative for accurately capturing ETs. While the worst results in the region were obtained with SC-driven ET simulations, the PM0.5 method consistently overestimated the ETPM values despite having a high correlation. Investigating the effectiveness of the alternative empirical methodologies assessed in this study in other locations with features comparable to the region is another matter that can attract attention. The obtained ET results will play a significant role in the planning of areas for agriculture and forestry, in determining the usable water potential of dams, in the accurate estimation of water losses in rainfall-runoff simulations, and in hydrometeorological applications such as forecasting drought or flood predictions.