1. Introduction
One of the main features of the hydrological cycle is evapotranspiration [
1]. Therefore, the precise determination of reference evapotranspiration (ET
0) is crucial for water resource management and optimal irrigation scheduling [
2]. Water circulation patterns have undergone considerable changes as a result of global climate change [
3,
4], marked by low rainfalls and high temperatures [
5]. In the same vein, ET
0 has also been severely impacted [
6]. Using meteorological data, an analysis of the steaming effect and ET
0 distribution in the Senegal River Basin revealed that, between 1984 and 2017, 32% of the basin’s ET
0 area expanded significantly [
7]. The Loess Plateau region, upstream of the Yellow River, underwent a progressive increase in temperature between 2001 and 2020, and this coincided with a steady increase in ET
0 [
8]. Climate change is the primary cause of changes in the diverse processes of the Yellow River source area, and research revealed that the climate will become more humid and warm in this area and that ET
0 will increase by 10.4% in the Maqu secondary basin [
9]. Four ecosystems (forest, shrub, farmland, and grassland) were simulated using CMIP6, and the results showed that ET
0 gradually increased as the CO
2 concentration increased [
10].
The Penman–Monteith (PM) [
11,
12,
13] and Hargreaves–Samani (HS) [
14,
15] formulas, recommended by the Food and Agriculture Organization of the United Nations, are the primary methods used to determine ET
0. In a previous study using the HS formula, ET
0 analysis in Hami, China, showed increasing trends from 1991 to 2020, except for winter [
16]. When calculating ET
0 in the arid regions of central–north Mexico, the HS model yielded more accurate results than the PM model [
17]. Compared to the PM formulas, HS formulas only require solar radiation and maximum and minimum temperature. However, the empirical coefficient’s value determines how accurate the ET
0 estimate is. In China, the accuracy of the modified HS formula is higher than the original HS formula [
18], and in different climate zones, the results of the modified HS formula are fairly close to those of the PM formula.
Many weather characteristics have been included in the original computing formulas; however, data collection is frequently challenging, and these formulas are not particularly suitable for future ET
0 prediction. In this regard, another method has emerged with the advent of deep learning that does not require many challenging physical parameters [
19]. In recent years, deep learning has been extensively employed in hydrological models used for ET
0 estimation [
20,
21]. Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM), and Bidirectional LSTM were used on Prince Edward Island, Canada, to estimate ET
0. The results revealed that LSTM and Bidirectional LSTM were the most appropriate methods to accurately estimate ET
0, achieving results at the majority of sites. The accuracy of LSTM and Bidirectional LSTM were not significantly different [
22]. The effects of Random Forest (RF), Gradient-Boosting Machine (GBM), Generalized Linear Model (GLM), and Deep Learning–Multilayer Perceptron (DL) in Punjab Province, India, were investigated using the H
2O model framework. The DL model demonstrated superior performance [
23].
Convolutional Neural Networks (CNNs) are also frequently utilized for hydrological forecasts. CNNs have a great ability to capture the local features of the data [
24], and time sequence challenges can be effectively addressed using CNNs [
25]. Compared to the original machine learning model, the forecast’s accuracy can be significantly increased by incorporating the CNN model [
26]. The benefit of the Bidirectional Gated Recurrent Unit (BiGRU) is that it can solve gradient explosions, since the GRU forecasts successful time sequence performance [
27]. The BiGRU can effectively improve the accuracy of the prediction of evapotranspiration [
28]. The attention mechanism allows for the selection of prominent characteristics from data based on the relationships between the data [
29]. In recent years, CNN-BiGRU-Attention has been widely used in time series prediction, and its performance is superior to that of traditional models [
30]; however, there has been relatively little research conducted utilizing the CNN-BiGRU-Attention model for forecasting ET
0 in the future.
The meticulous selection of hyperparameters significantly influences the accuracy of predictive models [
31]. In this research, the snow ablation optimizer algorithm was used for hyperparameter optimization. The primary goal of the snow ablation optimizer algorithm is to mimic the behavior of snow melting and sublimation to balance space exploration and development and avoid premature convergence. Here, 22 CEC2020 real-world constrained optimization problems and 29 typical CEC2017 unconstrained benchmarks were used, including 7 synthesis and design problems and 15 mechanical engineering problems. The SAO algorithm was applied to extract the key parameters in photovoltaic systems, and the simulation results demonstrate that the SAO algorithm is a promising technology, as it outperforms other complex algorithms [
32].
China’s Shandong Province is a major winter wheat region [
33,
34]. In 2021, the largest winter crop in the area was winter wheat, with a planting area of 3,949,353.08 hm
2 [
35]. However, there was a shortage of fresh water for irrigation [
36]. In 2018, the province’s total water consumption was 21,266 billion cubic meters, with agriculture accounting for 62.8% (13,346 billion cubic meters). Irrigation used 11,468 billion cubic meters, accounting for 85.9% of the water consumption in agriculture. The share of the region in terms of per capita water resources was only one-sixth of the national rate, accounting for 4.0% of the world’s share [
37]. Therefore, the precise estimation of ET
0 is crucial for freshwater conservation.
The change in Shandong Province’s ET
0 from 1980 to 2019 was 1070.5 mm, and the ET
0 percentages in spring, summer, autumn, and winter were 29%, 40%, 21%, and 10% of the total ET
0, respectively [
34]. Previous studies generally concentrated on analyzing changes in the spatial–temporal distribution and the sensitivity of controlling variables, [
38]; however, there has been less research on the trajectory of ET
0 changes in Shandong Province. Previous studies have mostly focused on space–time distribution and the factors affecting ET
0 estimation. As a result of factors such as human-caused global climate change [
39,
40], agricultural water scarcity will likely be an increasing challenge in the future [
37]. Therefore, it is crucial to research the future changing trends of ET
0 to preserve irrigation water. This article provides a reference for the preservation of freshwater resources and advancing agriculture using future climate data, determined using CMIP6 and the deep learning model SAO-CNN-BiGRU-Attention to estimate ET
0 trends in Shandong Province, China. A Mann–Kendall test was then used to verify the trends.
3. Results
3.1. Hierarchical Clustering
Based on the Davies–Bouldin index minimum, the province can be classified into three homogeneous zones, denoted as H1, H2, and H3.
Table 3 provides the Davies–Bouldin index of different k values.
Figure 4 displays the partition diagram generated using the inverse distance weighting method (IDW) [
57] as well as the hierarchical clustering dendrogram diagram. The analysis report includes high and low representative points using Origin 2021 for clustering.
Figure 5 displays the land cover inside each cluster, and
Figure 6 presents the DEM analysis results within each cluster.
The majority of the H1 cluster includes the plain southwest of Mountain Tai and Yimeng Mountains; however, a few points located south of Mountain Tai fall under H2. The total area constitutes 40.50% of Shandong Province, and the majority of the landscape is lowland plains, rising to a height of less than 200 m. H1’s ET0mean and ET0max are the highest among the three clusters, with ET0 reaching 2.4–3.0 mm/day in 165.73 months. The low representative point P119 is close to the Bohai Sea, while the high representative point P168 is near Shandong Province’s center and extends toward H2. The amount of cropland in H1 is comparatively considerable, comprising 21.73% irrigated and 43.64% rainfed cropland.
H2 is mostly situated in Mountain Tai and Yimeng Mountains, as well as the plain to the east and south of Mountain Tai and Yimeng Mountains; a few points are included in H1, and certain points of H1 and H3 are also included in H2. This cluster extends north to the plain near the Bohai Sea, with the highest mean altitude. The ET0mean and ET0max of H2 are lower than those of H1, but greater than those of H3; in particular, ET0 reached 2.4–3.0 mm/day in only 78.16 months, significantly less than H1. H1 contains the high representative point P47, while the low representative point P79 is situated south of the Yimeng Mountains. H2 comprises 22.11% rainfed and 13.79% irrigated cropland.
H3 mostly encompasses coastal areas in the east and southeast and accounts for 25.26% of Shandong Province, with a few locations overlapping with H2. The terrain is mostly hilly and mountainous. Among the three clusters, H3 has the minimum values of ET0mean, ET0max, ET0min, and ET0std, with ET0 reaching 2.4–3.0 mm/day in only 3.85 months, which is significantly less than H1 and H2. However, ET0 reached 0–0.6 mm/day in 442.63 months, which is the highest value. The high representative point P208 is close to the Bohai Sea and Laizhou Bay, and the low representative point P238 is to the east. The small agricultural area in H3 comprises 19.37% rainfed and 4.17% irrigated cropland.
3.2. Analysis of Clusters
Figure 7,
Figure 8 and
Figure 9 show the ET
0 trends in H1, H2, and H3, respectively.
Figure 10 shows the ET
0 distribution map of Shandong Province in different periods.
Figure 11 shows the box plot of residual distribution.
Table 4 shows the evaluation criteria of different clusters, and
Table 5 shows the Mann–Kendall results of the annual average ET
0. Deep learning and DL were used to obtain the SAO-CNN-BiGRU-Attention, which are discussed below.
The three clustering regions of H1, H2, and H3 showed monotonic declining trends from 1901 to 2020 (p < 0.001, Slope < 0). While the H1 and H2 SSP2-4.5 forecast trends were not significant (p > 0.05), the H1, H2, and H3 regions obtained using DL and SSP5-8.5 exhibited monotonically increasing trends (p < 0.001, Slope > 0). The ten-year average ET0 of the three clusters in 2021–2030 and 2031–2040 exhibited a tendency in the order of SSP2-4.5 > SSP5-8.5 > DL; from the period 2061–2070 until the conclusion of the projected period, the three cluster regions showed a trend of SSP5-8.5 > SSP2-4.5 > DL. For the ten-year average ET0, according to the DL results, H1, H2, and H3 will reach their peaks in 2091–2100, with 1.37 mm/day, 1.32 mm/day, and 1.18 mm/day, respectively. The SSP2-4.5 results indicate that H1, H2, and H3 will reach their peaks in 2031–2040, with 1.46 mm/day, 1.39 mm/day, and 1.24 mm/day, respectively. The SSP5-8.5 results indicate that during 2091–2100, H1, H2, and H3 will reach peaks of 1.54 mm/day, 1.47 mm/day, and 1.32 mm/day, respectively. The ten-year average ET0 of 2091–2100 demonstrated an increasing tendency when compared to that of 2021–2030. DL’s prediction for H1, H2, and H3 increased by 1.31%, 1.56%, and 1.80%, respectively, whereas the SSP2-4.5 forecasts for H1, H2, and H3 increased by 0.31%, 0.95%, and 1.57%, respectively, and the SSP5-8.5 forecasts for H1, H2, and H3 increased by 10.88%, 10.76%, and 10.69%, respectively.
In general, the DL prediction scores of the three cluster areas are closer to those of SSP2-4.5 and lower than those of SSP2-4.5 and SSP5-8.5; the R
2 is higher, and the RMSE, MAE, and MAPE are lower when compared to SSP5-8.5. Additionally,
Figure 10 shows that the box is lower and that the average values and medium number of residuals from the SSP2-4.5 prediction findings are lower. The DL prediction results contain some points and time periods greater than SSP2-4.5 and SSP5-8.5 within the forecast period, and in a few spots, the average residuals are less than zero, as seen in
Figure 11. The DL analysis indicates that ET
0 trend continues to follow the pattern of H1 > H2 > H3, meaning that inland regions will continue to have higher ET
0 values than coastal areas. H1 has the highest ET
0, followed by H2 and H3. A similar trend is observed in the ten-year average ET
0 pillar chart using SSP2-4.5 and SSP5-8.5, which is shown in
Figure 10; darker colors indicate inland regions.
3.3. Analysis of Representative Points
The ten-year average change in ET
0 values of the high and low representative points are displayed in
Figure 12 and
Figure 13, respectively, with their evaluation criteria displayed in
Table 6 and
Table 7, respectively. The Mann–Kendall test results for the annual average ET
0 of the high and low representative points are displayed in
Table 8 and
Table 9, respectively.
Considering the high representative points, H1, H2, and H3 steadily declined throughout the historical period (p < 0.001, Slope < 0). In H1 and H3, the future prediction values of DL and SSP5-8.5 exhibited a monotonically increasing trend (p < 0.001, Slope > 0). By contrast, in H2, SSP5-8.5 exhibited a monotonic upward trend (p < 0.001, Slope = 0.00224), whereas DL exhibited a monotonically decreasing trend (p < 0.001, Slope = −0.00046). The DL prediction results in the three clusters are closer to those of SSP2-4.5, with a greater R2 and lower RMSE and MAE than SSP5-8.5. All three cluster areas in the ten-year average ET0 of 2021–2030 and 2030–2040 exhibited a trend of SSP2-4.5 > SSP5-8.5 > DL; H1 and H3 revealed a pattern of SSP5-8.5 > SSP2-4.5 > DL for the 2041–2050 period, whereas H2 exhibited this trend for the 2061–2070 period. According to the DL predictions, H2 will peak in 2081–2090 at 1.31 mm/day, whereas H1 and H3 will peak in 2091–2100 at 1.36 mm/day and 1.20 mm/day, respectively. Based on SSP2-4.5 projections, in 2031–2040, the peaks of H1, H2, and H3 will be 1.48 mm/day, 1.39 mm/day, and 1.26 mm/day, respectively. According to the SSP5-8.5 results, H1, H2, and H3 are expected to peak in 2031–2040 at 1.53 mm/day, 1.47 mm/day, and 1.34 mm/day, respectively. In comparison to the results for 2021–2020, considering the ten-year average ET0 in 2091–2100, DL predicted an increase in H1 and H3 by 1.59% and 1.79%, respectively, and a decrease in H2 by 0.43%. SSP2-4.5 estimated an increase in H1 and H3 by 0.85% and 1.15%, respectively, and a decrease in H2 by 0.07%. SSP5-8.5 predicted an increase in H1, H2, and H3 by 10.82%, 11.09%, and 10.97%, respectively.
Regarding the low representative points, throughout the historical period, H1, H2, and H3 all experienced monotonic declines (p < 0.001, Slope < 0). The forecast trend of SSP2-4.5 was not significant (p > 0.05) in H1 and H2; however, all of the future predictions of DL and SSP5-8.5 exhibited an increasing trend (p < 0.001, Slope > 0) in the three clusters. The DL prediction results in the three clusters are closer to SSP2-4.5, with greater R2 and lower RMSE and MAE values than SSP5-8.5. H2 exhibited a trend of SSP5-8.5 > SSP2-4.5 > DL in 2021–2030, while H1 and H3 exhibited a SSP2-4.5 > SSP5-8.5 > DL pattern. For H1 and H3, the pattern was SSP5-8.5 > SSP2-4.5 > DL after 2061–2070. The predicted H1, H2, and H3 clusters peaked in 2091–2100 with respective velocities of 1.38 mm/day, 1.43 mm/day, and 0.95 mm/day. According to SSP2-4.5 results for 2031–2040, H1, H2, and H3 will reach their peaks at 1.38 mm/day, 1.43 mm/day, and 0.95 mm/day. According to SSP5-8.5, H1, H2, and H3 will peak at 1.48 mm/day, 1.52 mm/day, and 1.02 mm/day, respectively, in 2031–2040. Compared with the ten-year average ET0 of 2091–2100, for the years 2021–2020, the DL predictions in H1, H2, and H3 increased by 3.80%, 5.17%, and 5.58%, respectively; in SSP2-4.5, the predictive values increased by 0.85%, 1.18%, and 2.35%, respectively; in SSP5-8.5, the forecast increased by 10.53%, 10.18%, and 11.05%, respectively.
4. Discussion
From a spatial perspective, the eastern coastal region’s ET
0 is lower than that of the western inland region, indicating a pattern of increasing westward; similarly, the southern region’s ET
0 in Mountain Tai is lower than that of the northern region at the same longitude. From a clustering standpoint, the geographical and coastal characteristics are included in ET
0; for example, low temperatures and ET
0 may be found in mountainous and coastal regions [
58]. The ET
0 of land-based water bodies is likewise low, particularly for P54 in the southwest area of Shandong Province and P194 on the western edge of H3, suggesting that topography and land cover influence ET
0. Previous research has demonstrated a comparatively high level of freshwater coverage [
59]. The primary reason for this is that the HS equation alone takes into account the impact of temperature, while other meteorological elements may differ. The primary agricultural production regions are H1 and H2, which are located close to the inland region. Both rainfed and irrigated croplands are sizable and pose a significant need for agricultural water, particularly for winter wheat, which frequently needs three irrigation cycles with a 240 mm irrigation quota [
60]. Although the amount of water used for agriculture will decrease, climate change will increase the amount of water needed for irrigation [
61], and inland areas with high steam temperatures will require more intensive irrigation. These regions can adjust their planting model and employ different methods to save water for agricultural purposes.
A non-parametric Mann–Kendall test was used to determine the monotonic trend in sequence data, in which the number of continuous data points exhibiting an increasing or decreasing trend is determined to assess the trend’s intensity and direction [
62]. The “evaporation paradox” reveals that evapotranspiration has been declining despite a notable rise in global temperatures [
63]. Following the Mann–Kendall test, we determined the annual average ET
0 for the historical period, and we found a monotonically declining trend, suggesting that Shandong Province is also experiencing a noteworthy “evaporation paradox” phenomenon, which may be associated with the effects of irrigation on agriculture and other activities [
64]. Certain regions exhibit a monotonically declining trend during the forecast period, likely due to variations in surface resistance [
65]. This demonstrates that different regions will require water supplies to varying degrees.
Deep learning methods are widely used to predict ET
0 [
21]. Compared with the original Penman–Monteith formula and Hargreaves–Samani formula, it does not need to include meteorological forcing data such as temperature and dew point, which are usually difficult to obtain [
19]. However, the hyperparameters of deep learning models often need to rely on experience to obtain [
66], and selecting inappropriate hyperparameters can increase the risk of overfitting and decreased accuracy [
31]. In this research, the snow ablation optimizer algorithm was used to obtain the hyperparameters of the model’s learning rate and L2 regularization parameters, effectively improving the prediction capabilities of deep learning models for time series. The SAO-CNN-BiGRU-Attention deep learning model’s prediction accuracy was considerably greater than that of linear regression and other models. This is because, in Shandong Province, monthly changes were characterized by single-peak curves, with significantly higher values in summer than in winter [
59]. The linear regression is not suitable for predicting future ET
0.
While the SAO-CNN-BiGRU-Attention prediction findings in this article are generally lower than those of SSP2-4.5 and SSP5-8.5 and are more in line with SSP2-4.5, some of the points exhibit greater values than those of the two future scenarios, indicating that the predictions made using the SAO-CNN-BiGRU-Attention model are accurate. According to the Mann–Kendall test, ET
0 will rise in the future, but each point’s peak value will have a distinct height and duration. The inland ET
0 was still greater than the coastal ET
0 in the forecast period, indicating a growing tendency from east to west. The SSP2-4.5 and SSP5-8.5 findings revealed a similar pattern. The annual fluctuations in ET
0 exhibited single-peak characteristics throughout the historical and anticipated period [
59], with winter presenting comparatively low values and summer exhibiting the highest, which are attributed to temperature characteristics. Furthermore, in future research, many GCMs, including BCC-CSM2-MR [
67] and ACCESS-CM2 [
68], may be considered, thus yielding more intriguing outcomes using different modes.
5. Conclusions
After hierarchical clustering, Shandong Province was found to encompass three homogeneous regions. The inland region had a greater ET0 than the coastal region since they are the primary agricultural production areas. Historically, the average yearly ET0 for the three regions decreased monotonically. The SAO-CNN-BiGRU-Attention and SSP5-8.5 domains showed a monotonic increasing trend in the three clusters, while the outcomes of a few points of the SAO-CNN-BiGRU-Attention predictions showed a monotonic decreasing trend.
According to the SAO-CNN-BiGRU-Attention results, during 2091–2100, H1, H2, and H3 will reach their peaks of 1.37 mm/day, 1.32 mm/day, and 1.18 mm/day, respectively; the SSP2-4.5 results show that H1, H2, and H3 will peak in 2031–2040, with 1.46 mm/day, 1.39 mm/day, and 1.24 mm/day, respectively; and the SSP5-8.5 results show that H1, H2, and H3 will reach their peaks in 2091–2100, with 1.54 mm/day, 1.47 mm/day, and 1.32 mm/day, respectively.
The SAO-CNN-BiGRU-Attention prediction outcomes were closer to those of SSP2-4.5. In comparison to 2021–2030, considering the ten-year average ET0 of 2091–2100 for H1, H2, and H3, the SAO-CNN-BiGRU-Attention forecasts increased by 1.31, 1.56%, and 1.80%, respectively; the SSP2-4.5 forecasts increased by 0.31%, 0.95%, and 1.57%, respectively; and the SSP5-8.5 forecasts increased by 10.88%, 10.76%, and 10.69%, respectively.
The SAO-CNN-BiGRU-Attention deep learning model’s prediction values were similar to those of SSP2-4.5 (R2 > 0.96), making it a promising method for future ET0 prediction.