Estimation of Daily Potential Evapotranspiration in Real ‐ Time from GK2A/AMI Data Using Artificial Neural Network for the Korean Peninsula

: Evapotranspiration (ET) is a fundamental factor in energy and hydrologic cycles. Although highly precise in ‐ situ ET monitoring is possible, such data are not always available due to the high spatiotemporal variability in ET. This study estimates daily potential ET (PET) in real ‐ time for the Korean Peninsula, via an artificial neural network (ANN), using data from the GEO ‐ KOMPSAT 2A satellite, which is equipped with an Advanced Meteorological Imager (GK2A/AMI). We also used passive microwave data, numerical weather prediction (NWP) model data, and static data. The ANN ‐ based PET model was trained using data for the period 25 July 2019 to 24 July 2020, and was tested by comparing with in ‐ situ PET for the period 25 July 2020 to 31 July 2021. In terms of accuracy, the PET model performed well, with root ‐ mean ‐ square error (RMSE), bias, and Pearson’s correlation coefficient ( R ) of 0.649 mm day − 1 , − 0.134 mm day − 1 , and 0.954, respectively. To examine the efficiency of the GK2A/AMI ‐ derived PET data, we compared it with in ‐ situ ET measured at flux towers and with MODIS PET data. The accuracy of the GK2A/AMI ‐ derived PET, in comparison with the flux tower ‐ measured ET, showed RMSE, bias, and Pearson’s R of 1.730 mm day − 1 , 1.212 mm day − 1 , and 0.809, respectively. In comparison with the in ‐ situ PET, the ANN model produced more accurate estimates than the MODIS data, indicating that it is more locally optimized for the Korean Peninsula than MODIS. This study advances the field by applying an ANN approach using GK2A/AMI data and could play an important role in examining hydrologic energy for air ‐ land interactions.


Introduction
Evapotranspiration (ET) reflects fundamental components of hydrologic and energy cycles of the Earth and is a key element in hydrological resource management [1]. As climate change has progressed, trends in drought and flood have shown different spatial variability, and the importance of hydrological system monitoring has been emphasized [2]. Accordingly, it is fundamental to quantify and monitor ET. However, since water resources are directly affected by regional hydrologic systems and meteorology, ET shows high spatial and temporal variability [3].
A major application of ET is drought monitoring. Climate change has altered drought trends, increasing the intensity, frequency, and extent of droughts [4]. Thereafter, numerous indices for drought monitoring have been developed, with several, such as the standardized precipitation evapotranspiration index [5], precipitation evapotranspiration difference condition index [6], reconnaissance drought index [7], and combined terrestrial evapotranspiration index [8], directly associated with ET. Based on these drought indices, many studies were conducted to investigate the long-term variability of water budget under specific climate change conditions [9], effects of climate elasticity of ET on water In order to manage hydrological resources over the Korean Peninsula, Korea Meteorological Administration (KMA) monitors the ET in real-time using in-situ measurements and numerical model data. In-situ measurements exhibit good performance with high temporal resolution every hour; however, its availability is limited due to the point observation. For complementing the limitation of in-situ measurements, KMA calculates the spatial distribution of ET using numerical model data based on geophysical models. Numerical model data-derived ET is suitable for analyzing droughts with a large time scale. In contrast, the accuracy of the ET changes depending on the numerical model data, and it is difficult to calculate the ET that reflects various topographical characteristics of the Korean Peninsula due to the sparse spatial resolution of the numerical model data.
Although physical-based models show good performance, due to numerous associated meteorological parameters, it is difficult to estimate accurate ET, especially in remote sensing applications. Then, over the last few decades, many researchers have identified that machine learning (ML) approaches were an effective method to overcome the complexity of ET estimation [29]. Because ML techniques solve the non-linear relationship between input and output variables, a lot of ML techniques have been proposed to estimate ET for hydrological applications [36], such as k-nearest neighbors [37], support vector machine [38], random forest [37], and artificial neural network (ANN) [39]. Previously, most studies applied ML approaches to in-situ measurements; however, many recent studies have also applied ML approaches to remote sensing data [40][41][42].
In this study, considering the spatiotemporal variability in ET, we developed a model that estimates daily PET based on ANN using the GEOstationary Korea Multi-Purpose SATellite 2A (GEO-KOMPSAT 2A, GK2A). The objective was to retrieve real-time daily ET with a spatial resolution of 1 km for hydrological resource monitoring on the Korean Peninsula. To reflect the complex relationships and nonlinearity between the GK2Aderived data and ET, we used precipitation data and the digital elevation data as input data for the ANN. Daily PET from KMA was used as reference data for ANN model training. The accuracy of the model was verified by comparing modeled data with ET from in-situ measurements of the KMA and National Institute of Forest Science (NIFoS) for the period excluding the period of training data.

Remote Sensing Data
GK2A, launched on 4 December 2018 and operated by the KMA National Meteorological Satellite Center (NMSC), is equipped with the Advanced Meteorological Imager (AMI). AMI is the optical-infrared sensor with 16 channels and its spatial resolution ranges from 0.5 to 2.0 km depending on wavelength (Table 1). Since GK2A/AMI observes the Earth with a high spatiotemporal resolution, it is more capable of monitoring the Earth's hydrological system than previous GEO satellite (Communication, Ocean and Meteorological Satellite, COMS) operated by KMA NMSC and other LEO satellites [43]. We used seven GK2A/AMI operational products: Reflected Shortwave Radiation (RSR), Downward Shortwave Radiation (DSR), Absorbed Shortwave Radiation (ASR), Outgoing Longwave Radiation (OLR), Downward Longwave Radiation (DLR), Upward Longwave Radiation (ULR), and Normalized Difference Vegetation Index (NDVI).

. Precipitation Data
Integrated Multi-satellitE Retrievals for Global Precipitation Measurement (IMERG) version 6 were used to calculate ET, even for areas for which precipitation data were not available. The IMERG precipitation products are derived from the global precipitation measurement constellation comprising the various passive microwave sensors including the meteorological operational satellite series, polar operational environmental satellite series, and global change observation mission 1st-water satellite [44]. The data from various passive microwave satellites are merged into 0.1° × 0.1° resolution every 30 min. We used the standardized precipitation index for six months (SPI6), derived from the precipitation product of IMERG, rather than daily precipitation data.

Numerical Model and Elevation Data
Since 2010, the KMA has used numerical weather prediction (NWP) systems from the Unified Model (UM). NWP model data from UM systems, operated by KMA in realtime, could be classified depending on spatial coverage and boundary conditions, and we used Local Data Assimilation and Prediction System (LDAPS) over the Korean Peninsula in this study. LDAPS is based on boundary conditions derived by three-dimensional variational data assimilation and its spatial resolution of 1.5 km [45]. LDAPS has 70 vertical layers and provides 36- h predictions (at every 00, 06, 12, and 18 UTC), and  additional 3-h predictions (at every 03, 09, 15, and 21 UTC). We used four meteorological parameters-air temperature (Ta), surface temperature (Ts), relative humidity (RH), and wind speed (WS)-from LDAPS version 10.1. Furthermore, Shuttle Radar Topography Mission (SRTM) digital elevation model (DEM) data were used to reflect the effect of elevation on ET, and its spatial resolution was an arc-second, approximately 30 m [46].

In-Situ Measurements
PM equation calculates the PET using micrometeorological data, and the eddy covariance (EC) systems estimate the AET based on energy flux observations [47]. PET derived from the PM method was used for model training and validation. On the other hand, since the AET derived from EC systems was different from PET, we only used the AET data for testing the availability of the PET model.
Since the Korean Peninsula has specific geographic characteristics, each region shows different weather conditions and climate properties. KMA operates 81 Automated Surface Observing System (ASOS) stations in real-time. In this study, we used 42 of these that monitor based on the PM equation (we hereafter refer to ET obtained using the PM equation as PM-ET) ( Figure 2a). ASOS stations observe the following meteorological parameters every hour: Ta, Ts, RH, WS, soil temperature, precipitation, surface pressure, and net solar radiation (https://data.kma.go.kr/cmmn/main.do, accessed on 26 August 2021). To evaluate the ANN model-derived ET, we used ET calculated using the EC method. The NIFoS operates six flux towers to monitor ET on the Korean Peninsula ( Figure 2b). These flux towers observe meteorological parameters every 30 min (http://know.nifos.go.kr/know/service/flux/fluxIntro.do, accessed on 26 August 2021). Using these direct observations of vertical flux and meteorological data, it is possible to calculate ET via the EC method. From the ASOS stations and flux towers, we selected only those variables observed for full 24-h periods. Figure 3 illustrates the process used here to estimate and evaluate daily ET using GK2A/AMI data. We preprocessed the input data; the preprocessed data were then subsampled (at 1 km resolution) around the Korean Peninsula. We constructed matchups between the subsampled data and PM-ET, and classified them into two datasets (training and testing) depending on the acquisition date. For the ANN model training, we used five-fold cross-validation; 80% of the data were used to optimize the weights and biases of the model, and 20% were used to verify the accuracy and monitor the loss function of the model, to minimize overfitting. To enable the ANN model to reflect seasonal variation, we set the training period for the training data to 1 year (25 July 2019 to 24 July 2020). ANN model performance was assessed using PM-ET and EC-ET data for the period 25 July 2019 to 31 July 2021. To estimate daily ET via GK2A/AMI data, we used 22 parameters as input variables of the ANN model ( Table 2). The GK2A/AMI operational products include the preprocessed daily means of six radiation variables (RSR, DSR, ASR, OLR, DLR, and ULR) and the 16 days maximum NDVI. The GPM IMERG precipitation product was preprocessed to generate SPI6. We used four UM LDAPS variables (Ta, Ts, RH, and WS) affecting ET. To take into account diurnal variation in ET, we preprocessed NWP variables to daily mean, daily minimum, and daily maximum. As static data, we used extraterrestrial solar radiation (ESR) and a DEM to account for seasonal variation and the terrain effect, respectively. ESR indicates solar radiation incident outside the Earth from the Sun. ESR is a key parameter for estimating ET, and can be calculated using the latitude and the day of the year as follows [21,48]:

Processing
where Ra refers to ESR; GSC denotes the solar constant; represents the inverse of the relative distance between the Earth and the Sun; indicates the Sun and sunset hour angle; and refer to latitude and solar declination, respectively.

Penman-Monteith Evapotranspiration (PM-ET)
We calculated hourly PM-ET from in-situ KMA ASOS station measurements. To account for diurnal variability in ET, we also derived daily PM-ET from hourly PM-ET. It is possible to estimate hourly PM-ET as follows [21]: where indicates hourly ET; and represents hourly mean air temperature and hourly mean wind speed, respectively; ∆ denotes the saturation slope vapor pressure at ; and denote the psychrometric constant and the net radiation at the surface, respectively; and refer to the soil heat flux density and the saturation vapor pressure at , respectively; indicates hourly mean actual vapor pressure. KMA ASOS station calculated hourly PM-ET every hour, and the cumulative PM-ET over 24 h was used as the daily PM-ET.

Standardization of Input Variables
In an ANN model, when the input variables are linearly related, it is not necessary to standardize or normalize them. However, when the input variables show a non-linear relationship in the ANN model, before using input variables, it is important to standardize or normalize them [49]. When using the variables without standardization or normalization, large values of the input variables would cause very small weighting factors, and small values of the input variables would result in very large weighting factors, which could cause some problems during training and optimizing process [50]; Using extremely small weights would cause the uncertainties of floating-point calculations on computer; not using extremely small initial weights would make the improvement of the model by the backpropagation algorithm insignificantly small [51]. There are no fixed methods of standardization that should be used in specific applications; in this study, standardization was applied to input variables as follows: , where and indicate the standardized input variable and unstandardized input variable, respectively; represents the mean of input variable; denotes the standard deviation of the input variable.

ANN Model
We used a multilayer perceptron (MLP), ANN, to estimate daily ET. MLP involves feedforward backpropagation networks with a simple structure and high performance; they have therefore been used for diverse applications using satellite data [52,53]. These neurons are interconnected, with weights and biases that enable repetitive learning. Each hidden layer has an activation function computing the neuronal weights and biases. An optimizer algorithm trains the network and minimizes the error, by correcting the weights and biases via a backpropagation process [54]. We developed a five-layer MLP model with hidden layers of 200 neurons. In MLP model training, input values of neurons in the previous layer transfer to a neuron in the current layer, and a neuron combines the input values with weights and biases as follows [51]: , where represent the net of the weighted input for the th neuron; indicate the input transferred from the th neuron; refers to the weight connected from the th neuron to the th neuron; means the bias of the th neuron. In , for being a final output for passing to the next layer, it should be activated by the activation function [49]. The activation function can be a diverse discrete or continuous function; we used the exponential linear unit (ELU), showing fine performance with a fast learning rate and significantly better generalization as follows [55]: where represents the hyperparmeter controlling the value where an ELU saturates for negative ; denotes the input value and indicates the . For improving and accelerating the convergence, we used the batch normalization (BN) layer between each hidden layer [56]. The normalization is calculated based on the dimension of the batch and BN ensures that the input of each hidden layer is distributed in the same way. Their performance dramatically depends on the batch size, and setting a larger batch size generally yields better performance [57]. We used a method for stochastic optimization (ADAM) as the optimizer algorithm [58]. The parameters and hyperparameters of the MLP model are summarized in Table 3. To train and run the MLP model, we used Keras with the TensorFlow back-end in Python. In a black-box model such as an ANN model, it is difficult to analyze the information and structure of the model in detail. However, it is possible to rank the importance that each input variable occupies in the model. In this study, in order to analyze the trained MLP model, we conducted a permutation test of each input variable. This test randomly permutes the list of a variable and measures the decrease of model accuracy; this process was conducted repeatedly with each variable; finally, the Mean Decrease Accuracy (MDA; also known as the permutation importance) was calculated with each variable [59]. A variable with a larger MDA is interpreted as an important variable in the model because the accuracy of the variable greatly affects the accuracy of the model. We used the MDA in terms of the increase in RMSE when each variable was randomly permutated.

Statistical Analysis
Daily ET, estimated via MLP, was compared with PM-ET and EC-ET. To quantitatively evaluate the MLP-derived daily ET, we used the bias [60], root-meansquare error (RMSE) [36], mean absolute error (MAE) [36], standard deviation (STD) [60], normalized RMSE (nRMSE) [61], Pearson's correlation coefficient (R) [36], and the Index of Agreement (IOA) [62]. The detailed equations are as follows: where and represent the estimated ET and observed ET, respectively; the subscript denotes the th data point; refers to the number of data; and represent the mean of the estimated ET and observed ET, respectively. As the radiation incident on the surface and the temperature increases, evaporation increases, because sufficient energy to convert water into water vapor is provided, and transpiration increases because vegetation activity accelerates [63]. Seven variables (i.e., RSR, SPI6, RHmean, RHmin, RHmax, WSmin, and DEM) were negatively correlated with daily ET from the KMA ASOS stations. As higher RH is associated with less water vapor transported from the water surface, RH was negatively correlated with ET. Since precipitation increases surface water content and inhibits evaporation, SPI6 was negatively correlated to ET. As RSR increases, the radiation incident on the surface decreases, reducing both evaporation and transpiration. The mean, maximum, and minimum WS showed different correlations with ET; this could be because the complex topography of the Korean Peninsula, in terms of spatiotemporal variability in WS, causes uncertainty of the LDAPS model WS estimates. Overall, the positive correlations were stronger than the negative correlations. Relative to ET, DSR had the strongest positive correlation (0.86), and RHmean had the largest negative correlation (−0.45).     Ta, Ts, and RH are directly related to ET estimation, via the PM equation. However, since they are derived from numerical model data with uncertainty, they showed relatively low MDAs, from 0.99 to 0.19 mm day −1 . The variables describing WS, which are used directly in the PM-ET estimation, showed lower MDAs (<0.45 mm day −1 ) than the other meteorological variables. This reflects the fact that it is difficult to simulate transient changes in wind caused by sudden gusts or topography using numerical model-based wind data. PET reflects the rate of ET when sufficient soil moisture is available; hence it does not account for vegetation and terrain characteristics. As a result, NDVI and DEM showed lower MDA values (<0.18 mm day −1 ).

Evaluation against KMA Stations
We compared GK2A-derived daily ET for the Korean Peninsula with PM-ET derived from KMA ASOS stations for the period 25 July 2020 to 31 July 2021 (Figure 7). KMA ASOS-derived PM-ET (mm day −1 ) ranged from 0.28 to 14.41, and GK2A/AMI-derived PET (mm day −1 ) ranged from 0.00 to 11.10. In comparison with PM-ET derived from KMA ASOS stations, the total number of matchup data was 15,414, and GK2A/AMI-derived PET showed accuracy (mm day −1 ) of 0.649 (RMSE), 0.488 (MAE), 0.636 (STD), and −0.134 (bias) with nRMSE of 0.168, indicating the MLP model tended to underestimate relative to the in-situ PM-ET overall. In particular, at PET values less than 2.0 mm day −1 , the tendency of underestimation of the MLP model was remarkable. Although the MLP model shows the tendency to underestimate, its underestimation was slight overall and it shows good performance estimating PM-ET from the KMA ASOS stations; Pearson's R was 0.954, and IOA was 0.975.  We examined the seasonal characteristics of GK2A/AMI-derived PET. We simply classified the seasons into two classes; we hereafter referred to the period when monthly mean value of observed PET was less than 3 mm day −1 as cold seasons (November to February), and the period when monthly mean value of observed PET was more than 3 mm day −1 as warm seasons (March to October). In the cold seasons, KMA ASOS-derived PM-ET and GK2A/AMI-estimated PET both had lower values than in the warm seasons (Table 4). In cold seasons, RMSE (mm day −1 ) ranged from 0.399 to 0.671, Pearson's R ranged from 0.881 to 0.908, and nRMSE ranged from 0.193 to 0.244 (Table 5). On the other hand, in warm seasons, RMSE (mm day −1 ) ranged from 0.585 to 0.804, Pearson's R ranged from 0.901 to 0.960, and nRMSE ranged from 0.116 to 0.207. Regardless of seasons, the model was found to show low RMSE less than 0.81 mm day −1 and high Pearson's R more than 0.88, indicating that the model simulates the in-situ PET with high accuracy. When compared to the warm seasons, the cold seasons show good performance in terms of RMSE, MAE, and STD, but poor performance in terms of nRMSE, Pearson's R, and IOA. These seasonal differences are caused by the seasonal variation of PET. As shown in Table 4, the lower the temperature, the lower the water vapor evaporated from soil and transpired by vegetation; the variation of PET in the warm seasons is higher than in the cold seasons [64,65]. Therefore, the low variation of PET in the cold seasons causes low RMSE, MAE, and STD; however, due to the small magnitude of PET in cold seasons, even a small error substantially affects the ratio-dependent accuracy score such as nRMSE, Pearson's R, and IOA.

NIFoS Flux Towers
Because the ANN-based daily ET model was trained using the PM-ET data from the KMA ASOS stations, we examined the availability of the GK2A/AMI-derived PET by comparing it with EC-ET data. We compared daily PET derived from GK2A/AMI for the Korean Peninsula with EC-ET derived from NIFoS flux tower, for the period 25 July 2020 to 31 July 2021 (Figure 9). NIFoS flux tower-derived EC-ET (mm day −1 ) ranged from 0.02 to 9.82, and GK2A/AMI-derived PET (mm day −1 ) ranged from 0.00 to 10.06. In comparison with EC-ET derived from NIFoS flux tower, the total number of matchup data was 654, and GK2A/AMI-derived PET showed the accuracy (mm day −1 ) of 1.730 (RMSE), 1.409 (MAE), 1.235 (STD), and 1.212 (bias) with nRMSE of 0.525, indicating the PET derived from GK2A/AMI using the MLP model tended to overestimate relative to the EC-ET derived from NIFoS flux tower overall. The model performed in following the trend in the EC-ET data; Pearson's R was 0.809, and IOA was 0.822. In theoretical conditions, the PET derived from the PM method was not expected to match with the AET derived from the EC method. Although the differences depend on the environmental conditions and PET retrieval methods, the PM method generally overestimated ET compared with EC-ET in both hourly and daily time scales [47]. However, the comparison result shows a high correlation with both variables and between the input parameters for both variables, which indicates that PM-ET and EC-ET are affected by the same factors [66,67]. Because the PM method quantifies water vapor loss in sufficient soil moisture conditions, it overestimates ET relative to EC-ET under the dry conditions [67]. However, in sufficient soil moisture conditions on rainy days, the PM method nonetheless overestimated ET relative to EC-ET [49,68]. Furthermore, the differences between PM-ET and EC-ET depend on the environmental conditions, the tendency to overestimate ET was strong with intense net radiation and water vapor deficit [67,69]. Another possible reason for the overestimation is that PM-ET does not consider the complicated structure of the forest. The comparison result between PM-ET and EC-ET depended on the reference level, and the accuracy of PM-ET increased with the reference level of measurement [47]. The PM method assumes that the vegetation is a single big leaf, and ET occurs on a surface with zero plane displacement. However, vegetation conditions vary depending on the spatiotemporal environment, and ET occurs in the forest floor to the top of vegetation. On the other hand, during the vegetation growing season with low leaf area index, surface and underground ET take a substantial part of the water vapor cycle. Because of that, PM-ET could underestimate ET at a small leaf area index, compared with EC-ET [47]. Another possible reason for overestimation is that the PM method cannot accurately include the resistance due to the surface canopy or soil conditions [69]. Since PM-ET data depend highly on surface conductance; its overestimation could cause the overestimation of ET [47,70]. Although the PM model overestimated ET, it showed a high correlation with the EC-ET data. Since the model accounts for radiative and aerodynamic conditions, it might produce more reliable estimates of AET than other PET models [71].

Comparison with MODIS
To validate the GK2A/AMI-derived daily PET data, we compared it with the Terra/MODIS PET product. Because Terra/MODIS produces an 8-days PET composite, we produced 8-days aggregates of daily PET data from the GK2A/AMI satellite and from the KMA ASOS stations. In the KMA ASOS stations, when the number of daily PET data for 8-days was less than 8, it was excluded from the validation data. We then compared the Terra/MODIS PET data with the KMA ASOS station and GK2A/AMI satellite PET data, for the period 27 July 2020 to 27 July 2021( Figure 10).  [72,73]. Although the previous studies and this study used the verification with a daily and 8-day product, respectively, the high Pearson's R means that MODIS-based PET product is useful for ET monitoring on the Korean Peninsula. In comparison with the GK2A/AMI-derived PET data, the Terra/MODIS PET data showed accuracy (mm 8 day −1 ) of 6.094 (RMSE), 4.705 (MAE), 6.076 (STD), and −0.471 (bias) with an nRMSE of 0.236; Pearson's R was 0.887 and IOA was 0.939, indicating the Terra/MODIS PET data tended to underestimate PET relative to GK2A/AMI (Figure 10b). The underestimation of the Terra/MODIS PET data was remarkably shown in the PET of less than 20 mm 8 day −1 , indicating the comparing result of GK2A was consistent with that of KMA ASOS.
For the assessment of the spatial distribution of GK2A/AMI-derived PET, we verified the accuracy of Terra/MODIS PET relative to the PET data for each KMA ASOS station and GK2A/AMI coordinate for the period 27 July 2020 to 27 July 2021 ( Figure 11). In comparison with the KMA ASOS station data, RMSE (mm 8 day −1 ) ranged from 3.056 (at station 119) to 10.061 (at station 105); bias (mm 8 day −1 ) ranged from −5.692 (at station 105) to 1.075 (at station 177); and Pearson's R ranged from 0.748 (at station 185) to 0.981 (at station 119) (Figure 11a  The KMA ASOS station-derived PM-ET data showed a Pearson correlation of 0.914 with Terra/MODIS PET (Figure 10a), and 0.954 with GK2A/AMI-derived PET (Figure 7). While the Terra/MODIS PET algorithm is optimized for global coverage, our MLP model was locally optimized for the Korean Peninsula. Furthermore, since our MLP model used daily remotely sensed and numerical model product not related to cloud, the GK2Aderived PET shows fine temporal resolution and has no masked value due to cloud relative to Terra/MODIS product. Therefore, the GK2A/AMI-derived PET performed better than Terra/MODIS for estimating PET on the Korean Peninsula. Relative to the GK2A/AMI-derived PET and in-situ PM-ET data, the consistency of the Terra/MODIS PET data decreased remarkably for the eastern region of the Korean Peninsula ( Figure 11). In the eastern coastal area of the Korean Peninsula, elevation decreases dramatically ( Figure  2). In contrast to the lack of consistency with the Terra/MODIS PET data, the GK2A/AMIderived PET and in-situ PM-ET were highly correlated (Pearson's R > 0.879), regardless of the topography (Figure 8). This result indicates that Terra/MODIS did not reflect the local terrain characteristics of the Korean Peninsula, due to its global optimization. Thus, for ET monitoring with high spatiotemporal variability on the Korean Peninsula, the realtime daily GK2A/AMI-derived PET was more suitable (due to local optimization) than the 8-days Terra/MODIS PET product.

Previous Studies on the Korean Peninsula
The Korean Peninsula comprises various vegetation cover types and shows specific agrometeorological characteristics, and it is able to perform agrometeorological analysis using ET data. When investigating the ensemble model of virtual water content based on ET, it was found that the ensemble virtual water content and production of rice and maize decreased in future projections, which affected future water consumption on the Korean Peninsula [74]. Birhanu et al. [75], when constructing hydrological models, investigated the effect of model complexity and ET calculation methods on model performance based on the in-situ measurement. Um et al. [76] estimated the spatial distribution of ET based on in-situ measurements using the hybrid Kriging method and revealed various ET characteristics depending on the distance from the coast and elevation level above the ground surface. Jung et al. [77] developed the physiological modules to simulate the canopy photosynthesis and ET process and established the relationship of photosynthesis and ET with crop production based on satellite data and in-situ measurements. Similar to this study, Kim et al. [41] developed the ML model estimating daily PET for the Korean Peninsula using satellite data and NWP data. MODIS-based monthly vegetation index data, multi-microwave satellite-derived precipitation data, and LDAPS data were used as input data of the random forest model. The model showed accuracy (mm day −1 ) of 1.038 (RMSE), 0.790 (MAE), and 0.007 (bias) with Pearson's R of 0.870. The model developed in this study not only has better accuracy but also has the advantage of retrieval in real-time.

Conclusions
This paper presents an ANN model that retrieves daily PET in real-time for the Korean Peninsula, using GK2A/AMI-derived data, microwave composite data, and NWP data. We used the data from 25 July 2019 to 24 July 2020 for model training, and 25 July 2020, to 31 July 2021 for model testing. In comparison with the KMA ASOS station-derived PM-ET, the ANN-based GK2A-derived PET showed high accuracy (mm day −1 ) of 0.649 (RMSE) and −0.134 (bias); Pearson's R of 0.954; and IOA of 0.975. In validating the spatial distribution, the ANN model-estimated daily PET showed high accuracy at all KMA ASOS stations. To assess the efficiency of the GK2A/AMI-derived PET, we verified it using NIFoS flux tower-derived EC-ET, which showed that GK2A/AMI-derived PET overestimated ET. Furthermore, we assessed the performance of our ANN model by comparing it with operational Terra/MODIS PET products with 8-days temporal resolution. Because it was locally optimized, our ANN model outperformed Terra/MODIS PET over the Korean Peninsula. GK2A/AMI-derived PET performed particularly better than the Terra/MODIS PET product for the eastern coastal region of the Korean Peninsula, where elevation changes dramatically.
Although GK2A/AMI-derived PET showed high accuracy, it is necessary to extend its spatial coverage for overcoming its local optimization. When applying the additional in-situ measurements on other areas to the model, it is possible to improve the model in terms of spatial coverage. Furthermore, in order to develop the model estimating ET, we used and optimized the MLP model, but it is able to apply diverse ANN methods such as recurrent neural network, convolutional neural network, and long short-term memory. When applying and validating various ANN methods, it is possible to improve the accuracy of the model estimating ET.
ET is a key indicator to investigate the effects of the meteorological drought on vegetation activities. GK2A/AMI-derived 2-dimensional ET is thought to be a useful tool in examining the drought affecting the Korean Peninsula. In further studies, we will attempt to investigate drought on the Korean Peninsula by examining the relationship of GK2A/AMI-derived ET and precipitation data with vegetation information. This study contributes to understanding air-land interactions, and the development of ANN approaches using satellite and NWP data.