Heat Exposure Information at Screen Level for an Impact-Based Forecasting and Warning Service for Heat-Wave Disasters

Abstract: Impact-based forecasting services that support decision-making are increasingly emphasized, particularly by the World Meteorological Organization, as a means of reducing the damage caused by meteorological disasters. The Korea Meteorological Administration (KMA) began developing impact-based forecasting technology and warning services in 2018. This paper proposes statistical downscaling and bias correction methods for acquiring high-resolution meteorological data for the heat-wave impact forecast system operated by the KMA. To this end, digital forecast data from the KMA, with a spatial resolution of 5 km, were downscaled and corrected to a spatial resolution of 1 km using statistical interpolation methods. Cross-validation indicated the superior performance of the Gaussian process regression model (GPRM) technique, with low root mean square error (RMSE) and percent bias (PBIAS) values and a high correlation coefficient (CC). The GPRM technique had the lowest forecast error, especially during the hottest period in Korea. In addition, temperatures were high for land-use areas with low elevations and high human activity, such as urban, road, and agricultural areas. It is therefore essential to provide accurate heat exposure information at the screen level, where human activity is high. Spatiotemporally accurate heat exposure information can be used more realistically for risk management in agriculture, livestock, and fisheries, and for adjusting the working hours of outdoor workers in construction and shipbuilding.


Introduction
Recently, due to extreme weather phenomena such as heat-waves, the world has experienced setbacks in several sectors, including health, energy, agriculture, dairy, fishery, transportation and tourism [1][2][3]. A global coupled climate model has shown that there is a distinct geographic pattern to future changes in heat-waves. Model results for areas of Europe and North America, associated with the severe heat-waves in Chicago in 1995 and Paris in 2003, show that future heat-waves in these areas will become more intense, more frequent, and longer lasting in the second half of the 21st century. Further observations and modeling results also show that present-day heat-waves over Europe and North America coincide with a specific atmospheric circulation pattern that is intensified by ongoing increases in greenhouse gases, indicating that it will produce more severe heat-waves in those regions in the future [4]. Similar to the global and continental trends, regional temperatures in Central/Eastern Europe increased during the second half of the 20th century. Furthermore, the regional intensity and frequency of extreme precipitation events increased, while the total amount of precipitation decreased, and the mean climate became drier [5]. Changes in related to the spatial distribution of weather conditions [24] and as the basis for forecasting heat-wave impacts within downscaled spaces.
Producing high-resolution meteorological data through numerical modeling yields physically meaningful results; however, it requires considerable computational resources and long calculation times. Statistical interpolation has therefore emerged as an important methodology for producing high-resolution meteorological data when computing power and analysis time are limited.
The number of studies focusing on statistical interpolation has increased with the growing number of weather observation stations [25][26][27][28]. Previously, methods such as inverse distance weighting, kriging, natural neighbor, splines, and triangulated irregular networks were used; these methods are based solely on the distances between observation points [29]. However, because observation data alone cannot fully reproduce the spatial differences in weather conditions that arise from the terrain, approaches that employ additional terrain and spatial data have been suggested [30,31]. These approaches include co-kriging and the parameter-elevation regressions on independent slopes model (PRISM) [32][33][34][35][36][37][38][39][40][41]. Refs. [42,43] proposed a method to parameterize downscaled spatial data, such as buildings in urban areas and land cover, to produce high-resolution meteorological data at an urban scale. Recently, methods such as machine learning and geographically weighted regression analyses have also been employed.
Thus far, studies focusing on statistical interpolation have been conducted as an approach to produce high-resolution weather distribution charts. Furthermore, national-scale research has primarily focused on producing data with a spatial resolution of 1 km, considering the density and simulation efficiency of weather observation networks in South Korea.
Weather forecasts should possess the same spatial resolution as previous weather data in order to forecast the impacts of heat-waves, especially the short-term effects. In several studies, researchers have compared and evaluated interpolation methods in order to identify the approaches suitable for particular factors such as the regions and type of interpolation [44][45][46][47][48][49][50][51]. However, very few studies have attempted to compare and evaluate interpolation methods for spatial downscaling in weather forecasting operations.
Thus, this paper proposes statistical interpolation methods for obtaining high-resolution weather forecast data, in order to forecast the impacts of heat-waves within the downscaled spaces of the heat-wave impact forecast system currently operated by the KMA and to confirm whether the values for non-observation grids have been estimated appropriately. For this purpose, digital forecast data with a spatial resolution of 5 km, which were provided by the KMA, were downscaled to a spatial resolution of 1 km using various statistical interpolation methods; the results thus obtained were compared and evaluated. Additionally, the suitability of the information for areas where people are active at screen level was determined by analyzing heat exposure characteristics based on the type of land use and urban areas in South Korea.

Research Area
For this study, the entire Korean Peninsula was considered as the research area, and the weather data were downscaled and bias-corrected for this region. Seven specific areas in the major cities were selected for the downscaling analyses (Figure 1): Incheon, Seoul, Daejeon, Daegu, Gwangju, Ulsan, and Busan. The characteristics of these areas are listed in Table 1. Geographic elevations and the land cover data for each city were acquired from the National Geographic Information Service and the Ministry of Environment, respectively.

Digital Forecast System of the KMA
The digital forecast system of the KMA is a service that provides weather forecasts every three hours in Eup-, Myeon-, and Dong-district units. There are 149 grids in the east-west direction and 253 grids in the north-south direction, resulting in a total of 37,697 grids (of which 4438 grids are in the inland area). A forecast is announced eight times per day, covering lead times from +4 h to +58-67 h at 3 h intervals. Temperatures (3 h, daily maximum, and daily minimum), wind direction, wind speed, sky conditions (clear, few clouds, many clouds, and cloudy), precipitation, probability of precipitation, amount of precipitation, snowfall, humidity, and waves are forecasted.
The initial data for digital forecasts are acquired from the KMA's unified model, which covers the Korean Peninsula and its surrounding waters, with a spatial resolution of 40 km. These data are corrected using the gridded model output statistics technique and by forecasters in order to produce gridded data at 5 km intervals for the Korean Peninsula domain, which has an extent of 745 km × 1265 km.
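The grid geometry quoted above can be checked with simple arithmetic. The following is a minimal sketch, assuming a regular 5 km grid and an exact five-fold refinement along each axis when moving to 1 km; the variable names are illustrative and do not come from the KMA system.

```python
# Grid dimensions quoted in the text (east-west x north-south).
NX_5KM, NY_5KM = 149, 253
SPACING_5KM = 5  # km, spacing of the digital forecast grid
SPACING_1KM = 1  # km, target resolution of the downscaling

# Total number of 5 km cells and the domain extent they cover.
total_cells_5km = NX_5KM * NY_5KM
extent_km = (NX_5KM * SPACING_5KM, NY_5KM * SPACING_5KM)

# Assumption: every 5 km cell is split into a regular block of 1 km cells.
refine = SPACING_5KM // SPACING_1KM
cells_per_parent = refine * refine
total_cells_1km = total_cells_5km * cells_per_parent

print(total_cells_5km)   # 37697
print(extent_km)         # (745, 1265)
print(cells_per_parent)  # 25
print(total_cells_1km)   # 942425
```

Under this assumption, each forecast value on the 5 km grid has to be distributed over 25 target cells, which is why additional surface information is needed to resolve the differences within a parent cell.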
Digital forecasts can be categorized into the KMA's digital analyzed data and the digital forecast data. Digital analyzed data are live data updated each hour from the standard time (t), using a total of 533 stations (i.e., 91 Automated Synoptic Observation System (ASOS) stations and 442 Automatic Weather System (AWS) stations). On the other hand, digital forecast data are based on the digital analyzed data and are forecasted from 4 h after the standard time (t + 4-48). Gridded digital forecast data in the GRIdded Binary (GRIB) format, provided by the KMA, are converted to the GeoTIFF format using the Geospatial Data Abstraction Library (GDAL) in order to facilitate processing in GIS, and are stored and managed as Image Mosaic layers in GeoServer. In this study, data were collected during the summer (June-August) of 2018, starting at a standard time of 06:00. The daily maximum temperature (daily minimum temperature) data, collected several hours after the standard time, were used as the input data for interpolation.

Ground Observation Data
In this study, verification was performed using observation data that had not been used for downscaling the 5 km resolution digital forecasts to a local scale with a resolution of 1 km. Since 2017, the Korea Telecom (KT) mobile telecommunication company has been collecting high-density particulate matter and meteorological data (temperature and humidity) as part of the Air Map Korea Project, which involves 1500 locations in seven cities: Seoul, Incheon, Daejeon, Daegu, Gwangju, Ulsan, and Busan. The measurement devices are installed on building rooftops (3-15 m above ground level) and on public telephone booths (2.5 m above ground level); only the data acquired at a height of 2.5 m above ground level were used for verification.
The hourly data for the period between June and August 2018 were used. For each city, 20 observation points located more than 1 km apart from each other were selected; these points are represented by the gray dots in Figure 1. A majority of the meteorological data collected by KT were from urban areas and telephone booths located along pedestrian streets; thus, they reflected the weather conditions at the locations and elevations of residential areas. Point verifications of the results downscaled using the support vector machine (SVM), random forest (RF), and Gaussian process regression model (GPRM) techniques were conducted against KT's meteorological observation data from June to August 2018. The verification indices included root mean square error (RMSE), correlation coefficient (CC), and percent bias (PBIAS).
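The three verification indices can be sketched as follows. This is an illustrative implementation rather than the code used in the study: the function names and sample values are hypothetical, CC is taken to be the Pearson correlation coefficient, and PBIAS is written with the sign convention in which negative values mean underestimation, which matches how the results are interpreted in this paper.

```python
import math

def rmse(pred, obs):
    """Root mean square error between predictions and observations."""
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs))

def cc(pred, obs):
    """Pearson correlation coefficient (assumed definition of CC)."""
    n = len(obs)
    mean_p, mean_o = sum(pred) / n, sum(obs) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(pred, obs))
    var_p = sum((p - mean_p) ** 2 for p in pred)
    var_o = sum((o - mean_o) ** 2 for o in obs)
    return cov / math.sqrt(var_p * var_o)

def pbias(pred, obs):
    """Percent bias; assumed convention: negative means underestimation."""
    return 100.0 * sum(p - o for p, o in zip(pred, obs)) / sum(obs)

# Hypothetical daily maximum temperatures (degrees C) at four points.
obs = [30.0, 32.0, 34.0, 36.0]
pred = [29.0, 31.0, 33.0, 35.0]  # uniformly 1 degree too cold

rmse_value = rmse(pred, obs)
cc_value = cc(pred, obs)
pbias_value = pbias(pred, obs)
print(rmse_value, cc_value, pbias_value)
```

In this toy case the prediction is perfectly correlated with the observation but carries a constant cold bias, which illustrates why CC alone is not sufficient and why RMSE and PBIAS are reported alongside it.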
Additionally, ASOS data were used to analyze the extent to which the surface data affected the observation points, including the daily maximum temperature, average temperature, and minimum temperature data from 88 points during the summer from 2003 to 2017. These ASOS data were also used to analyze the correlations among different areas with residential sectors (cities, roads, and farms).

Surface Data
In this study, surface data associated with the spatial distribution of temperatures were used as the independent variables for interpolation, and the daily maximum temperature was used as the dependent variable for spatial interpolation. The independent variables used in this study are the elevation above sea level, slope angle, distance from the shoreline, land cover, hollow depth, north-south azimuth, east-west azimuth, and slope aspect (Figure 2). The topographic data for the independent variables were calculated with a resolution of 1 km using the analysis algorithm in GIS [25]. The initial data for calculating surface data included the sub-divided land cover map from the Ministry of Environment and the digital map from the National Geographic Information Service. All the data used were in UTM-K coordinates. The independent variables were chosen to account for factors such as the temperature reduction rate, difference in heat emissions based on terrain, and duration of the sun's influence (Table 2).

Statistical Model and Evaluation Method
Modeling was performed using real-time temperature analysis data of the KMA (5 km grid that has already been assimilated with observation data) and spatially matching surface data (elevation, slope, aspect, etc.) as learning data. In this modeling approach, the relationship between temperature and surface data is learned, and the unknown temperature value is then predicted for each new 1 km grid. Statistical methods were used for the modeling and prediction processes.
The GPRM technique is a regression analysis method based on Gaussian probability distributions; it is a type of Bayesian nonparametric algorithm. It predicts the label of the data as a probability distribution described by a mean and a variance, and it can achieve a higher predictive performance than other regression models, at the cost of increased complexity. Various approximation methods have been used to handle the computational difficulties that arise with large-scale data. A Gaussian process is defined by its mean and covariance functions, where the covariance function is specified through hyper-parameters. During the learning process, initial values are assigned to the hyper-parameters, and iterative calculations are conducted to determine the optimal hyper-parameters by maximizing the log marginal likelihood (equivalently, minimizing its negative) [52][53][54].
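To make the predictive mean and variance concrete, the following is a minimal sketch of Gaussian process regression with fixed hyper-parameters. It is not the implementation used in this study: the squared-exponential kernel, the noise level, the constant mean, the helper names, and the sample data are all assumptions chosen for illustration, and the hyper-parameter optimization step is omitted.

```python
import math

def rbf(a, b, length=1.0, variance=1.0):
    """Squared-exponential covariance between two input vectors."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return variance * math.exp(-0.5 * d2 / length ** 2)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def gp_predict(X, y, X_new, noise=1e-6, length=1.0):
    """Posterior mean and variance of a GP with a constant prior mean."""
    y_mean = sum(y) / len(y)
    K = [[rbf(a, b, length) + (noise if i == j else 0.0)
          for j, b in enumerate(X)] for i, a in enumerate(X)]
    alpha = solve(K, [v - y_mean for v in y])  # K^-1 (y - mean)
    means, variances = [], []
    for x in X_new:
        k = [rbf(x, a, length) for a in X]
        v = solve(K, k)                        # K^-1 k
        means.append(y_mean + sum(ki * ai for ki, ai in zip(k, alpha)))
        variances.append(rbf(x, x, length) - sum(ki * vi for ki, vi in zip(k, v)))
    return means, variances

# Hypothetical training data: one scaled surface variable -> temperature.
X = [(0.0,), (1.0,), (2.0,), (3.0,)]
y = [36.0, 35.2, 34.1, 33.5]
means, variances = gp_predict(X, y, [(1.0,), (1.5,)])
print(means, variances)
```

The first query point coincides with a training point, so its predictive variance is close to zero, whereas the second lies between training points and has a larger variance. This per-grid uncertainty is what distinguishes the approach from the RF and SVM techniques described in this section.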
The RF technique is a model involving numerous decision trees, whereby multiple training sets are created from a single dataset in order to generate several decision trees and combine their results. In a random forest, each tree is grown by selecting m/3 candidate variables at each split, where m is the number of variables. The predicted values of the individual trees are only weakly correlated, because a random forest is composed of slightly different trees owing to the sampling of both data and variables; this improves its generalization performance [55].
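The two sources of randomness described above can be sketched as follows. This is an illustration only: rounding m/3 down (with a minimum of one variable) is an assumed convention, the variable names are hypothetical labels for the eight surface variables, and only the candidate variables for the first split of each tree are shown rather than a full tree-growing procedure.

```python
import random

def mtry_regression(m):
    """Number of candidate variables per split (assumed: floor(m / 3), at least 1)."""
    return max(1, m // 3)

def bootstrap_indices(n, rng):
    """Draw n row indices with replacement (one bootstrap sample)."""
    return [rng.randrange(n) for _ in range(n)]

# Hypothetical labels for the eight independent variables of this study.
variables = ["elevation", "slope_angle", "shore_distance", "land_cover",
             "hollow_depth", "ns_azimuth", "ew_azimuth", "slope_aspect"]

rng = random.Random(0)
k = mtry_regression(len(variables))  # 8 // 3 = 2 candidates per split

for tree in range(3):
    rows = bootstrap_indices(10, rng)      # training rows for this tree
    candidates = rng.sample(variables, k)  # candidates for its first split
    print(tree, sorted(set(rows)), candidates)
```

Because each tree sees a different bootstrap sample and a different subset of candidate variables at every split, the trees differ from one another, and averaging their predictions reduces the variance of the ensemble.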
The SVM technique is a machine learning approach based on a supervised learning model for pattern recognition and data analyses. It is mainly used for classification and regression analyses. For a given dataset, a non-probabilistic binary linear classification model is generated to determine how new data are classified. The generated model is expressed as a boundary in the space where the data exist, and the SVM algorithm identifies the boundary with the widest margin among the candidates [42,56,57].
In this study, statistical downscaling was employed along with these methods, and the terrain elevation, slope angle, distance from the shoreline, land cover, hollow depth, north-south azimuth, east-west azimuth, and slope aspect were used as independent variables.
Two types of verification were performed on the downscaled daily maximum and minimum temperature results. The first is a statistical technique whereby the KMA's digital analyzed data with a resolution of 5 km were downscaled to digital analyzed data with a resolution of 1 km through spatial interpolation in order to cross-validate the downscaled spatial grids. Cross-validation was conducted by dividing the data into a validation set (10% of the total data) and a learning set (90% of the total data); thereafter, the statistical values were calculated, including RMSE, CC, and PBIAS. This process was iterated by varying the verification and learning sets in order to calculate the verification indices. For each technique, the verification and learning sets were randomly divided with a ratio of 1:9 into five separate folds for the verification.
The second verification involved verifying the KMA's digital forecast data with a resolution of 1 km downscaled using the selected statistical method. For this purpose, two types of data were used: assimilated live digital analyzed data for verifying the digital forecast data; and a portion of KT's meteorological observation data that was not used in the training set to verify the digital forecast data with a resolution of 1 km. RMSE, CC, and PBIAS were used as the verification indices. The flow diagram of this research is presented in Figure 3.
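The splitting scheme used in the first verification can be sketched as follows, under one possible reading of the description: five independent random divisions of the grid cells, each with a 1:9 ratio of verification to learning data. The function name, the seed, and the sample size are hypothetical, and the model fitting and scoring steps are omitted.

```python
import random

def repeated_holdout(n, n_repeats=5, val_fraction=0.1, seed=0):
    """Yield (learning, verification) index lists for repeated random 1:9 splits."""
    rng = random.Random(seed)
    n_val = max(1, round(n * val_fraction))
    for _ in range(n_repeats):
        idx = list(range(n))
        rng.shuffle(idx)
        yield idx[n_val:], idx[:n_val]

# Hypothetical example with 100 grid cells.
splits = list(repeated_holdout(100))
for learn_idx, val_idx in splits:
    # A model would be fitted on learn_idx and scored on val_idx here,
    # and RMSE, CC, and PBIAS would then be averaged over the five repeats.
    print(len(learn_idx), len(val_idx))
```

Averaging the indices over several random divisions reduces the dependence of the reported scores on any single choice of verification cells.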

Digital Analyzed Data Downscaled to a 1 km Resolution for Daily Maximum and Minimum Temperatures
The recent heat-wave events that occurred during the summers of 2016 to 2018 were selected, and the corresponding daily maximum and minimum temperature variables were downscaled to a resolution of 1 km (Figures 4 and 5). K-fold cross-validation was conducted on these downscaled daily maximum and minimum temperatures, which indicated that the GPRM technique yielded superior results in terms of the RMSE, CC, and PBIAS (Tables 3 and 4). In a majority of the events, the RMSE was highest when using the SVM technique and lowest when using the GPRM technique. The GPRM technique exhibited an improvement of 0.35-0.59 °C as compared to the SVM technique; this indicates its superiority in spatial prediction when downscaling the 5 km resolution grids to 1 km resolution grids. The value of CC was high (0.92 or greater) when using the GPRM technique; this also reflects the superiority of this technique. For a majority of the events, the PBIAS values, which express the difference between the predicted and actual values within a space as a percentage, were negative when using the SVM technique, indicating underestimation. PBIAS values of −0.12 (2016), −0.19 (2017), and −0.13 (2018) were regarded as biased and underestimated by 12%, 19%, and 13%, respectively. In contrast, the GPRM and RF techniques did not appear to be particularly biased, as they resulted in values near 0. The RMSE values of the daily minimum temperature were lower than those of the daily maximum temperature, indicating that the downscaling of daily minimum temperatures was more accurate. The CC and PBIAS values were similar to those of the daily maximum temperatures, whereas the RMSE value was approximately 0.25 °C lower. This is believed to be a result of the stable atmospheric conditions at nighttime and the more significant influence of surface data.

Daily Maximum and Minimum Temperatures Downscaled to 1 km Resolution Digital Analyzed Data Using Different Techniques
The performance of predicting high temperatures was compared for the techniques based on the daily maximum and minimum temperature frequencies. It was found that the predicted distributions of the temperature range varied even within the same area. In a majority of the areas, the range of temperature distributions was relatively wider when using the GPRM technique, as compared to those when using the RF and SVM techniques (Figures 6 and 7).
For the daily maximum temperatures, the SVM and RF techniques predicted a low frequency of high temperatures, and none at all in certain areas. In particular, for Seoul, the GPRM technique yielded predictions in the range of 39.5-41.0 °C, whereas the SVM and RF techniques yielded no predictions in this range. The SVM and RF techniques tend to produce distributions concentrated within certain temperature ranges. However, the GPRM technique exhibited a relatively wider temperature distribution range without leaning toward a particular range, thereby resulting in both higher and lower temperature values as compared to the other techniques.
The SVM and RF are representative machine learning models for making predictions. Shmueli (2010) [58] distinguished between explanatory and predictive statistical models. Machine learning aids scientific interpretation but has some limitations: statistical models created via machine learning are predictive models that cannot serve as explanatory models because of their complexity. In contrast, the GPRM is a spatial linear model in which the covariance matrix and spatial correlation generally follow a Gaussian process. This process serves to increase the correlation between the actual values of areas with similar geographical characteristics. For this reason, the range of the spatially predicted temperature distribution is broadened and the performance is considered excellent.
The GPRM technique (red colored line, Figures 6 and 7) indicated a high frequency of high-temperature occurrences, especially in the inland cities such as Seoul, Daejeon, Daegu, and Gwangju. This is because the GPRM technique appears to be more sensitive towards surface data, which affects the calculation of daily maximum temperatures.
The distribution of the daily minimum temperature was similar for the RF and GPRM techniques; both these approaches exhibited better prediction performance for the higher temperatures in all the areas, as compared to the SVM technique.

Verification of KMA's Digital Forecast Data Downscaled to a 1 km Resolution
Point verification was performed on the digital forecast data downscaled using the SVM, RF, and GPRM techniques, against KT's meteorological observation data collected during the summer (June to August) of 2018. RMSE, CC, and PBIAS were used as the verification indices. The RMSE values were found to be 2.8, 2.9, and 3.0 °C when using the SVM, RF, and GPRM techniques, respectively; on average, the CC value was 0.9 or greater for all three techniques. The PBIAS values, ranging from −7.5 to +7.8%, tended toward slight underestimation (negative values). The three techniques exhibited a similar tendency in June, whereas the accuracy of the GPRM technique was relatively higher (low RMSE, high CC, and low PBIAS) than that of the other two techniques when the heat-wave intensity increased during July and August (Figure 8).

Analysis of Spatial Characteristics of Major Cities
The downscaled data for the seven major cities were analyzed using the GPRM technique, which demonstrated better verification indices than the other techniques. The daily maximum and minimum temperatures were extracted by dividing each major city into road, water, grass and bare soil, forest, agricultural, and urban areas.
Seoul, Daejeon, Daegu, and Gwangju are the inland cities investigated, and the daily maximum temperature during the daytime in these cities varies little (interquartile range (IQR) is small), being 38 to 39 °C on average. In Incheon, Ulsan, and Busan, which are coastal cities, the temperature variability is large (IQR is large), even during daytime, with average values of 33 to 35 °C. Based on these data, the inland cities exhibit higher temperatures than the coastal cities during daytime.
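The interquartile range used in this comparison is the difference between the third and first quartiles of the gridded temperatures in a city. The following sketch shows one way to compute it; the quartile method (the default of Python's `statistics.quantiles`) and the sample values are assumptions and are not taken from the study.

```python
import statistics

def iqr(values):
    """Interquartile range: third quartile minus first quartile."""
    q1, _median, q3 = statistics.quantiles(values, n=4)
    return q3 - q1

# Hypothetical daily maximum temperatures (degrees C) on the 1 km grids.
inland_city = [38.1, 38.3, 38.4, 38.6, 38.7, 38.9, 39.0]
coastal_city = [31.0, 32.2, 33.1, 34.0, 34.8, 35.5, 36.0]

inland_iqr = iqr(inland_city)
coastal_iqr = iqr(coastal_city)
print(inland_iqr, coastal_iqr)
```

A small IQR, as in the hypothetical inland sample, means that most grids within the city experience similar temperatures, whereas a large IQR indicates strong spatial contrasts, such as those between the shoreline and the interior of a coastal city.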
With regard to the daily maximum temperature, a range of 37-39 °C was predicted for the inland cities of Daejeon, Daegu, and Gwangju; a range of 37-41 °C was predicted for Seoul; a range of 30-36 °C was predicted for the coastal cities of Ulsan and Busan; and a range of 33-39 °C was predicted for Incheon. Temperature variation ranges of 3 °C and 7 °C were observed for the inland and coastal cities, respectively. Incheon did not exhibit a temperature range similar to that of Busan and Ulsan, despite being a coastal city. This was attributed to the heat sources within the infrastructure of Incheon and its relative location with respect to the sea. A significant proportion of the land cover in Incheon has been urbanized; in contrast, forests account for a major proportion of the land cover in Busan and Ulsan (Table 1).
The temperature range in Seoul is different from that in the other inland cities, such as Daejeon, Daegu, and Gwangju, because of its high urban area ratio. Incheon and Seoul are located in the capital region of South Korea; feature the highest density of buildings, population, and traffic; and experience intensive land use. These factors could possibly affect the prediction results [59][60][61].
Considering the various land-use types, the highest daily maximum temperature values were observed in the urbanized areas of Incheon, Seoul, Daejeon, and Gwangju, in terms of both mean and maximum values (Figure 9b,c,d,f). In Ulsan and Busan, the mean value of agricultural and road areas was higher than that of urban areas (Figure 9g,h). The daily minimum temperatures of Seoul and Daejeon were higher in the urban areas than in the other land areas (Figure 10c,d). Moreover, the daily minimum temperatures of Daegu and Gwangju were higher for the road areas than for the urban areas. Since most of the urban areas of Busan and Ulsan are located on the coast, it was assumed that the heat absorption and emission processes in the three-dimensional structure of these cities are affected by wind blowing from the sea.
In the IQR of the daily minimum temperature distribution by land cover (Figure 10), high temperature trends were predicted for roads and urban areas in most regions, while agricultural areas in Busan were on average 0.3 °C warmer than roads and urban areas. In Seoul, agricultural areas were cooler than urban areas and warmer than roads, which was thought to be due to the fact that agricultural areas in Busan or Seoul occupy a smaller land proportion than in other areas. Given this small contribution of the agricultural areas (3.6% in Seoul and 14.5% in Busan), the 1-km resolution seemed to insufficiently reflect these patterns.
The correlations between surface and meteorological observation parameters over the last 15 summers (2003-2017) were stronger in August than in July. In addition, the strength of the correlation followed a clear order across the daily minimum temperature, daily average temperature, and daily maximum temperature (Table 5). The coefficient of determination was particularly high for the correlation between the surface data and the daily minimum temperature, which indicates that synoptic weather conditions are dominant factors during the day, while surface conditions have a greater effect at night. The correlation between the surface data and the daily minimum temperature was high and indicated that high temperatures appeared in low-altitude urban areas, while the high correlation to the daily maximum temperature indicated that high temperatures appeared in low-altitude agricultural areas.

Correlation between Screen Level (Activity Height of People) and Observation Data
The purpose of generating detailed weather information as part of the heat-wave forecasts is to predict the risk of exposure to weather elements at the specific elevation where people are active. A common tendency observed in each city is the higher temperatures for urban, agricultural, and road areas, as compared to the other land cover areas. To analyze the explanatory power of the applicable land cover data, the overall grids in each city (mountains, streams, urban, and agricultural areas included; I, Figure 11b) and the urban and agricultural grids where people reside (mountains and streams excluded; II, Figure 11c) were separated, and their correlations with the observed daily maximum and minimum temperature data were analyzed (Figure 11).
The overall grids (I) of the digital analyzed data with a resolution of 1 km and the extracted urban and agricultural area data (II) were compared by subtracting (I) from (II), whereby a positive value indicates a higher temperature for the extracted data in the urban and agricultural areas. From June to August of 2016-2018, at the city, county, and district level, the extracted data for urban and agricultural areas were higher than the overall grid data in 85% of the cases for the daily maximum temperature and in 92% of the cases for the daily minimum temperature. In August 2018, the daily maximum and minimum temperatures exhibited a difference of 0.5-1.5 °C in the forest areas, with a more significant difference between (I) and (II) along the East Coast for the daily minimum temperature.
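The comparison of (I) and (II) can be sketched for a single district as follows. The land cover labels, temperatures, and function name are hypothetical; the sketch assumes that each 1 km grid carries one land cover class and that both (I) and (II) are simple averages over the grids of a district.

```python
def district_difference(temps, land_cover, active_classes=("urban", "agricultural")):
    """Return (II) - (I): mean over urban/agricultural grids minus mean over all grids."""
    overall = sum(temps) / len(temps)                      # (I)
    active = [t for t, c in zip(temps, land_cover) if c in active_classes]
    if not active:
        return None                                        # no populated grids
    return sum(active) / len(active) - overall             # (II) - (I)

# Hypothetical district with four 1 km grids.
temps = [36.0, 35.0, 31.0, 30.0]
cover = ["urban", "agricultural", "forest", "water"]

diff = district_difference(temps, cover)
print(diff)  # 2.5
```

A positive difference, as in this toy district, means that averaging over all grids of an administrative unit understates the temperature in the areas where people actually live and work.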
When the overall grid data (I) and the extracted data for urban and agricultural areas (II) were compared with the data from 88 ASOS observation points across the country, the observed daily maximum and minimum temperatures and the values of (II) exhibited a higher coefficient of determination and explanatory power (over 70%) for the analysis of average daily maximum and minimum temperatures during July and August of 2018 (Figure 12).
As observed in the temperature distributions at a downscaled resolution of 1 km, the higher correlation of temperature in urban and agricultural areas, which is associated with people's activity, with the actual observation data representative of these areas implies that downscaling meteorological elements is suitable for spatial heat exposure analyses and predictions. It is also believed that the temperature information applicable to urban and agricultural areas should be included in the detailed meteorological information as a part of the impact forecasts.

Discussion
The detailed heat exposure information calculated in this study was used to forecast the effects of heat-waves. The daily change in the rate of mortality at a given temperature per average summer mortality (MCR) was analyzed as a measure of the damage caused by heat-waves. Age, occupation, household type, chronic disease, and regional temperature distribution were considered. As a result, it was confirmed that MCR depends on socio-economic factors and the regional temperature distribution [16]. The MCR of the elderly, outdoor workers, the chronically ill, and single-person households was found to be relatively high compared to that of other groups. However, current warnings and policies on heat-waves do not appear effective for the elderly and outdoor workers. In particular, the regional temperature distribution was found to be one of the key factors to consider when determining the effects of heat-waves. The regional temperature distribution should be taken into account when establishing heat alert levels and practical and effective policies. To establish a heat-wave policy at the regional level, it is suggested that the heat stress risk assessment in cities should be carried out separately for vulnerable areas and vulnerable groups via a heat stress impact assessment to quantify vulnerability. For example, high-risk areas can focus on reducing heat loads, such as by greening and shading, while vulnerable people can be protected, e.g., by providing shelters for people who are economically vulnerable to heat stress or through special care for the elderly; that is, measures can be focused on specific groups [23]. Through this heat stress impact assessment, the heat stress map and high temperature early warning system by the National Institute of Meteorological Science were used to develop a prototype of an event-based heat-related risk assessment model.
To predict the summer excess mortality rate for six major cities in Korea, the daily maximum air temperature (Tmax), the daily maximum perceived temperature (PTmax), and the daily maximum wet-bulb globe temperature (WBGTmax) were compared to assess the risks associated with heat. In that comparison, PTmax showed the best performance in expressing the thermal stress for Koreans. The use of PTmax was shown to require the collection of temperature, dew point temperature, relative humidity, wind speed, cloud amount, and geographical information data [11]. However, at present, spatially detailed data are not available for all areas of Korea. The detailed weather information system we developed calculates temperature, humidity, insolation (the inverse of cloudiness), and wind speed forecasts for every hour, as well as the daily maximum and minimum temperatures associated with heat exposure information. This information can be used for early warning by connecting the detailed weather information to forecasts of PTmax during summer in Korea.

Conclusions
This paper proposes statistical interpolation methods for acquiring high-resolution meteorological data for the heat-wave impact forecast system currently operated by the KMA. For this purpose, digital forecast data with a spatial resolution of 5 km, provided by the KMA, were downscaled and corrected to a spatial resolution of 1 km using various statistical interpolation methods; the results thus obtained were compared and evaluated. The statistical interpolation methods used included the SVM, RF, and GPRM techniques. Surface data with a resolution of 1 km and the digital analyzed data were used as the learning data for modeling. The values estimated via downscaling to a resolution of 1 km were cross-validated, indicating the superior performance of the GPRM technique with low RMSE and PBIAS values and a high CC value. This is because the GPRM technique involves an iterative estimation process to determine a solution that is closest to the actual value, while accounting for surface data, in order to build a model. The widest range of estimated daily maximum temperature distributions was obtained using the GPRM technique; however, the SVM and RF techniques were more focused on certain temperature ranges. As it is essential for heat-wave impact forecasts to predict high-temperature occurrences when people are more exposed during the day (i.e., the daily maximum temperature) and at night (i.e., the daily minimum temperature), the GPRM technique is considered to be an optimal solution. Based on the verification for urban areas, which was performed using a portion of KT's meteorological data (from 132 locations in major cities) that had not been used as learning data, the GPRM technique demonstrated superior performance and resulted in the lowest prediction errors from mid-July to mid-August, which is the duration of the highest summer temperatures in South Korea.
On analyzing the daily maximum temperature in terms of the land-use type, the highest values were observed in the urban areas of Incheon, Seoul, Daejeon, and Gwangju. Moreover, the mean values were high for the agricultural and road areas in Ulsan and Busan. The daily minimum temperature in the urban areas was higher than that in the other land-use areas of Seoul and Daejeon. In Daegu and Gwangju, by contrast, it was higher in the road areas than in the urban areas. The temperatures for land-use areas with low elevations and high human activity, such as the urban, road, and agricultural areas, were high; these surface variables were highly correlated with the meteorological observation data. The correlation between the urban, road, and agricultural areas and the observed meteorological data for these areas was analyzed. The results of this analysis indicate that, when the heat-wave forecast system provides weather information, the values should not be lowered by the low grid values of areas with mountains or rivers. Currently, the information is provided on an administrative boundary basis and is calculated as an average over urban, road, and agricultural areas with high heat exposure together with mountains and rivers with low heat exposure. It is essential to provide accurate heat exposure information in areas with high human activity. Heat exposure predictions that are more detailed and accurate in both space and time can be used more realistically to protect the health of people engaged in agriculture, livestock, and fisheries, and of outdoor workers in sectors such as construction and shipbuilding.