Next Article in Journal
Effectiveness of Adherence to a Mediterranean Diet in the Management of Overweight Women: The Prospective Interventional Cohort Study
Next Article in Special Issue
Urban Land-Cover Changes in Major Cities in China from 1990 to 2015
Previous Article in Journal
Forest Carbon Reserve Calculation and Comprehensive Economic Value Evaluation: A Forest Management Model Based on Both Biomass Expansion Factor Method and Total Forest Value
Previous Article in Special Issue
Trade-Off Analyses of Multiple Ecosystem Services and Their Drivers in the Shandong Yellow River Basin
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Examining the Relationship between Land Use/Land Cover (LULC) and Land Surface Temperature (LST) Using Explainable Artificial Intelligence (XAI) Models: A Case Study of Seoul, South Korea

1
Department of Environmental Planning, Korea Environment Institute, Sejong 30147, Republic of Korea
2
Technical Research Institute NEGGA Co., Ltd., Seoul 07220, Republic of Korea
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(23), 15926; https://doi.org/10.3390/ijerph192315926
Submission received: 25 October 2022 / Revised: 25 November 2022 / Accepted: 28 November 2022 / Published: 29 November 2022
(This article belongs to the Special Issue Land Use Change and Its Environmental Effects)

Abstract

:
Understanding the relationship between land use/land cover (LULC) and land surface temperature (LST) has long been an area of interest in urban and environmental study fields. To examine this, existing studies have utilized both white-box and black-box approaches, including regression, decision tree, and artificial intelligence models. To overcome the limitations of previous models, this study adopted the explainable artificial intelligence (XAI) approach in examining the relationships between LULC and LST. By integrating the XGBoost and SHAP model, we developed the LST prediction model in Seoul and estimated the LST reduction effects after specific LULC changes. Results showed that the prediction accuracy of LST was maximized when landscape, topographic, and LULC features within a 150 m buffer radius were adopted as independent variables. Specifically, the existence of surrounding built-up and vegetation areas were found to be the most influencing factors in explaining LST. In this study, after the LULC changes from expressway to green areas, approximately 1.5 °C of decreasing LST was predicted. The findings of our study can be utilized for assessing and monitoring the thermal environmental impact of urban planning and projects. Also, this study can contribute to determining the priorities of different policy measures for improving the thermal environment.

1. Introduction

Due to rapid urban growth over the last few decades, soils and vegetation have been converted into impervious materials such as concrete, asphalt, and buildings, which induce changes in the city’s biophysical environment and energy processes [1]. These shifts also affect the degree of solar radiation absorption, albedo, evaporation rate, heat transfer to soil, heat storage, wind turbulence, and can significantly alter atmospheric conditions near the urban surfaces [2].
The urban heat island (UHI) phenomenon refers to how the air (or surface) temperature of urban areas is warmer than their rural surroundings [3]. Because of its negative impact on the ecology and viability of cities, UHI has become a major research focus in various interrelated fields, including urban climatology, urban ecology, urban planning, and urban geography [4]. In particular, an increase in land surface temperature (LST) induced by UHI can impair the composition and distribution of species by increasing the length of the growing season [5,6], and it can worsen air quality [7,8] and increase health risks [9].
LST is a key variable for understanding the impact of land use/land cover (LULC) changes due to urbanization [10]. Accordingly, many researchers have studied the relationship between LULC and LST using various methods. In early studies, researchers utilized multiple regression models to compare LST for each LULC type [2,11]. Using satellite images such as Landsat, they showed that LST is highly associated with adjacent green space and built-up areas [12].
However, as many pointed out that the relationship between LULC and LST is not linear, recent studies have adopted more advanced methods to analyze them. Weng et al. [13] combined the maximum likelihood method and decision tree algorithm in analyzing the relationship between LST, LULC, and NDVI. Rana and Suryanarayana [14] analyzed the contribution of various LULC types on LST by comparing four machine learning models (K-Nearest Neighbor, Artificial Neural Network, Random Tree, and Support Vector Machine). Bakar et al. [15] used a support vector machine and maximum likelihood method to investigate the relationship between LST and LULC. Gage and Cooper [16] evaluated the relative importance of LULC, NDVI, and vertical structure for LST patterns through random forest modeling. Kafy et al. [17] used the Multi-Layer Perceptron-Markov Chain and Artificial Neural Network (ANN) methods to simulate LST and LULC. Bozorgi et al. [18] used an artificial neural network (ANN) algorithm to explore the relationship between LST and green LULC spatial patterns.
While those machine learning (ML) and artificial intelligence (AI) models have improved the overall prediction accuracy of LST, the black-box nature of these techniques has inevitably lowered the interpretability [19]. To overcome these methodological limitations, the explainable artificial intelligence (XAI) approach has been recently highlighted. XAI is a methodology that provides interpretation so that humans can understand the results predicted by the machine learning algorithm [20]. In this regard, XAI models require an additional explainable algorithm, such as Shapley additive explanations, to interpret how the model achieved a specific result [21].
Moreover, a majority of previous studies have focused on examining the effects of certain LULC types (parks, rivers, etc.) or landmarks (buildings, factories, etc.) on increasing (or decreasing) LST [22,23]. To consider various factors that influence LST, however, it is necessary to analyze the associations between LST within a whole region and various LULC types nearby. Furthermore, it is essential to figure out the appropriate range of nearby LULC features for LST prediction, which is not yet fully addressed in the existing literature. This can be achieved by developing multiple prediction models with different buffer distances from each of the LST points.
The research question of this study is which are the surrounding LULC types that affect LST, and what is the extent of these effects. Furthermore, our main concern is whether the advanced AI techniques provide accurate LST prediction that can be applied to the urban and environmental planning sector. To this end, this study aims at examining the impact of various LULC types on LST using XAI models in Seoul, Korea. By integrating the XGBoost and SHAP model, we developed the LST model for several buffer distances and predicted the LST reduction effects after LULC changes. More specifically, an expressway that would be undergrounded was chosen and assumed to be converted into green park areas. By utilizing the developed LST model, we predicted the LST reduction effects on the regions surrounding the expressway. The findings of our study may contribute not only to assessing the thermal and environmental impact of urban planning, but also to determining the priorities of different policy measures in improving the thermal environment.
This paper is organized as follows. Chapter 2 describes the materials and methods of research. The dependent variable was the LST in Seoul, and landscape, topography, and LULC were independent variables in the LST model. In addition, the pre-processing steps, mathematical formulas for building the proposed LST model, and post-processing steps are described. Chapter 3 provides training, validation, and testing results for the LST predictive model. Finally, Chapters 4 and 5 discuss the results of this experiment, and present the conclusion and future tasks.

2. Materials and Methods

2.1. Study Area

Seoul is the capital city of Korea, with an area of 606 km2 and a population of 9.8 million, and it is one of the most densely populated cities in the world. The city has a high proportion of elderly population who are vulnerable to heat waves, and their proportion of the population is expected to continuously increase [24].
For several reasons, Seoul is one of the most suitable cities for examining the relationships between LST and LULC. First, Seoul has a highly mixed pattern of LULC (Figure 1a). A majority of the city’s land is currently covered by urbanized areas with residential and commercial use, while mountainous and agricultural areas are distributed on the outskirts of the city [25]. The surface of Seoul is generally flat and plain, and the Han River runs through the city center.
Second, Seoul is located in the center of Northeast Asia, and thus has a representative continental climate where it is cold and dry in winter but hot and humid in summer periods [26]. Due to densely distributed built-up areas within the city, Seoul has experienced a severe UHI phenomenon every summer, where the LST of urbanized areas is 10 to 13 °C higher than surrounding rural areas [27]. It provides an environment in which the association between LST and LULC can be analyzed more variously from temporal and spatial perspectives.
In order to examine the practical applicability of using the model built through these experiments, the Gyeongbu Expressway underground project site (6.8 km) was selected as the test subject for Seoul when the area, a large-scale transportation facility, was transformed into a green space (Figure 1b). We tried to analyze the LST-reduction effect by comparing and analyzing the existing LST. The area is being promoted for the purpose of resolving extreme traffic congestion and securing self-regulating green space in the ground-level space, and the feasibility of the current plan is being reviewed. By understanding the relationship between the LST and various factors constituting the city, it is judged that it will be very helpful in urban planning and policies to prevent the urban heat island in urban space.

2.2. Data

To analyze LST in Seoul, we constructed landscape, topographic, and LULC feature data for 2019. Table 1 summarizes the dependent and independent variables used in the study.

2.2.1. Dependent Variable

In this study, the dependent variable was LST in Seoul, which was derived from a Landsat 8 satellite image of 13 June 2019 (Figure 2a). Landsat 8 is currently operated by the United States Geological Survey (USGS) and is accessible using Earth Explorer (available online: http://earthexplorer.usgs.gov/ (accessed on 1 June 2022)).
To obtain LST from the Landsat 8 image, we utilized band 10 from the thermal infrared sensor (TIRS) and band 3, 4 from the operational land imager (OLI) sensor, as the USGS suggested [28]. First, top of atmospheric radiance (TOA, L λ ) is calculated as below:
L λ = M L Q c a l + A L
where Q c a l denotes pixel values of band 10 (DN), M L (3.34 × 14−4) and A L (0.1) denote band-specific multiplicative and additive rescaling factor.
Using TOA, we then calculated at-satellite brightness temperature ( T ) as:
T = K 2 l n ( K 1 L λ + 1 )
where K 1 (774.89) and K 2 (1321.08) indicate band-specific thermal conversion constants. To convert values from Kelvin (K) to Celsius degree (°C), we subtracted 273.15 from the brightness temperature ( T ).
To calculate LST using brightness temperature ( T ), the surface emissivity needs to be obtained first. This study adopted the simplified normalized difference vegetation index (NDVI) approach to estimate it [29]. Using NDVI, the proportion of vegetation (PV) and the surface emissivity ( ε ) are calculated as below:
P V = ( N D V I N D V I m i n N D V I m a x N D V I m i n ) 2
ε = 0.004 × P V + 0.986
where N D V I is the actual index while N D V I m a x and N D V I m i n are the NDVI ranges used between soil and vegetated area.
Lastly, using the brightness temperature ( T ) and surface emissivity ( ε ), LST is calculated as follows:
L S T = T 1 + 0.00115 × T 1.4388 × l n ( ε )

2.2.2. Independent Variables

Independent variables of the study consist of (1) landscape, (2) topography, and (3) LULC features. All features were derived from remotely sensed data and have a 10 m spatial resolution.
First, landscape features include normalized difference built-up index (NDBI), normalized difference water index (NDWI), and green normalized difference vegetation index (GNDVI). These indices were derived from a Sentinel-2 satellite image of 23 May 2019, which was provided by the European Space Agency (ESA). Different from urban and vegetation LULC types, the NDBI and GNDVI indices reflect the building density and the vitality of vegetation, respectively [30].
To calculate these, we utilized band 3 (Green), band 8 (NIR), and band 11 (SWIR) of Sentinel-2, as below [31]:
N D B I = ( S W I R N I R ) ( S W I R + N I R )
N D W I = ( N I R S W I R ) ( N I R + S W I R )
G N D V I = ( N I R G r e e n ) ( N I R + G r e e n )
Second, this study utilized elevation and slope as topographic features. To calculate those, a digital elevation model (DEM) dataset was obtained from the National Spatial Data Infrastructure portal (NSDI, available online: http://www.nsdi.go.kr/ (accessed on 1 August 2022)). Using the Surface tool in ArcGIS 10.1 software, the10 m resolution elevation and slope for Seoul were calculated (Figure 2e,f).
Third, for land-cover features, we used a 10 m resolution sub-divided land-cover map of 2019, which was provided by the Environmental Geographic Information Service (EGIS, available online: http://egis.me.go.kr/ (accessed on 1 August 2022)). In this study, land cover in Seoul was classified into 23 types including urbanized, agricultural, forest, grassland, wetland, bareland, and water area.

2.3. Methods

2.3.1. Research Procedure

Figure 3 illustrates the study’s overall research procedure. To derive the LST model, we estimated landscape, topographic, and land-cover features within a 50 to 500 m buffer radius from each LST raster cell. The LST value is increased in 50 m increments from 50 to 500 m by performing a buffer centered on each pixel.
Then, this study combined the extreme gradient boosting (XGBoost) and Sharpley additive explanations (SHAP) to develop the LST model. For each dataset, 80% of samples were utilized to train the model, and 20% of them were left for validation. Among all the LST models that were developed from the XGBoost algorithm, we compared the overall accuracy including RMSE and pseudo R2 and chose the optimal LST model with a certain buffer radius through sensitivity analysis. Last, for the optimal LST model, this study calculated SHAP values for each independent variable and analyzed relative importance and direction of predicting LST.
Using the optimal LST model, this study predicted the effects of LULC change in Seoul on surrounding LST. As the case study area, we selected the expressway in the south-eastern part of the city, which will be undergrounded until 2025. While there are still no final plans for utilizing these areas, we assumed that the existing expressway infrastructures would be converted into green park areas and predicted the LST after the LULC changes.

2.3.2. Extreme Gradient Boosting (XGBoost) Model

Boosting-based machine learning techniques combine a series of weak learners to build a strong learner [32]. Among these, gradient boosted decision trees (GBDT) are performed in a way that sequentially reduces errors in the tree model by reflecting residuals calculated from the previous one [33]. The XGBoost model is an algorithm developed based on GDBT, which improves computational speed and prevents overfitting problems [34]. Thanks to its high predictive precision and ability to deal with both classification and regression problems, the GBDT algorithm has been widely used in various disciplines [35]. This study utilized the “xgboost” package in R (ver. 1.5.2.1.) for the analysis, which was released in February 2022.
Figure 4 illustrates the overall process of the XGBoost model. In the model, decision tree K(X, θ k ) is trained from the residuals of the previous tree K−1(X, θ k 1 ), and additively builds a more powerful learner until the loss function f k ( X ,   θ k ) is minimized [36].
To maximize the model’s prediction power while preventing over-fitting issues, there are several parameters that need to be designated in the XGBoost algorithm. Parameters include the number of iterations, maximum depth, the fraction of observations, and learning rate. For optimal hyper-parameter tuning, this study adopted fivefold cross-validation techniques.

2.3.3. Shapley Additive Explanations (SHAP)

The Shapley additive explanation (SHAP) is one of the representative explainable artificial intelligence (XAI) methods that have been widely used to increase the interpretability of machine learning models [37]. The SHAP value quantifies the relative importance of each independent variable for the model outcome based on its marginal contribution [38]. For n features, the SHAP value ϕ i assigned to each feature i is represented as below:
ϕ i = S F \ { i } | S | ! ( | F | | S | 1 ) ! | F | ! [ f S   { i } ( x S   { i } ) f S ( x S ) ]
where F represents all the combinable features in S, and f S   { i } ( x S   { i } ) f S ( x S ) calculates the difference between the contributions when the feature i is used and when it is not used.

3. Results

3.1. Descriptive Statistics

Table 2 summarizes the descriptive statistics of dependent and independent variables used in the study. First, the average LST of Seoul was 27.35 °C, which ranges from 13.62 °C to 43.13 °C. Areas with relatively high LST were generally covered with urbanized area, while those with low LST were associated with green and water areas (Figure 2a).
The landscape features, including NDBI, NDWI, and GNDVI, show negative average values. The average NDBI, however, was higher than the other two indices, which implies that Seoul’s lands are more occupied by built-up areas compared to natural environment. Average elevation and slope in Seoul were 59.93 and 7.9, respectively, indicating that there are relatively gentle topographic characteristics throughout the city.
The proportion of each LULC type in Seoul is described in Figure 5. The majority of Seoul’s LULC is occupied by urban areas (60%), particularly for transportation (28%), residential (13%), and commercial areas (8%). Among the urban LULC, industrial area (0.18%) covers the smallest ratio. This suggests that urbanized areas within the city are being predominantly utilized for the citizens’ livelihood.
Meanwhile, forest (25%) and grassland (12%) have the largest proportion within the natural LULC in the city, followed by water (5%) and bareland area (5%). More specifically, most of the natural areas in Seoul consist of broadleaf forest (16%), artificial grassland (11%), and inland water (5%). However, there were a few agricultural (2%) and wetland areas (1%) throughout the city.

3.2. Model Results

3.2.1. Hyper-Parameter Tuning

Table 3 summarizes the optimal hyper-parameter values chosen for the XGBoost model developed in this study. First, the number of iterations and maximum depth were 280 and 11, respectively. Among samples, 70% of them were randomly extracted for training the model, with a learning rate of 0.2. Based on the fivefold cross-validation, ‘colsample_bytree’, ‘alpha’, ‘lambda’, and ‘gamma’ values were tuned as 0.9, 1 × 10−6, 0.05, and 0, respectively, to prevent the overfitting issues of the XGBoost model.

3.2.2. Sensitivity Test

Using the derived hyper-parameters, we developed 10 XGBoost models with different buffer distances from each LST raster cell. Figure 6 illustrates the sensitivity of prediction accuracy with respect to a buffer radius from 50 m to 500 m.
For a 50 m buffer, root mean square error (RMSE) and R-squared (R2) values of the model were 1.32 and 0.882, respectively. The prediction accuracy was sharply increased until 150 m of buffer radius, where RMSE and R2 values reached 1.09 and 0.918. After 150 m, however, the RMSE of the LST model steadily increased to 1.35, while R2 decreased to 0.876 until 500 m of buffer radius. As a result, the optimal buffer distance for the LST model was chosen as 150 m in this study.

3.2.3. Factor Importance

Based on the sensitivity test results, this study estimated SHAP values of the LST model for 150 m buffer radius (Figure 7). Figure 7a shows the relative importance of influencing factors on LST, where the x-axis indicates the absolute SHAP value for each variable. As the length of the absolute SHAP value increases, the amount of how much a single feature affected the prediction also increases. Figure 7b represents the original SHAP value, where a high feature value (red color) indicates the direction of influencing factors on LST. When the SHAP value of a certain feature is positively distributed, it implies that a feature positively affects the prediction.
First, NDBI and GNDVI were found to be the most important predictors of LST. More specifically, the LST of a certain raster cell increases as the surrounding region is more urbanized and less vegetated. NDWI also tended to be negatively associated with LST. This corresponds with previous studies’ findings that the existence of nearby impervious and natural environments have opposite effects on LST [39].
With regards to topographic features, adjacent DEM and Slope showed relatively high association with LST, but their direction of effects were not clear. One of the possible reasons for this result is that Seoul has relatively gentle topographic features, and thus there is no significant difference in LST depending on them [40]. For LULC features, the existence of inland water (710) showed strong negative associations with LST. It is not surprising that the average temperature of riverside areas is relatively low [41]. On the other hand, surrounding built-up areas, including commercial (130), residential (110), and transportation areas (150), were found to be positively associated with LST, which is in line with findings from previous works [42]. It is notable that the existence of commercial and residential areas is more important for determining LST compared to transportation areas. Regarding natural LULC, artificial bareland (620), broadleaf forest (310), and artificial grassland (420) were some of the most influencing factors in explaining LST.

3.2.4. LST Prediction

Using the optimal LST model, we predicted the LST before and after the expressway underground project in Seoul. For the comparison, the study assumed that all transportation areas (150) along the expressway would be converted to artificial grassland (420). To examine the effect of LULC changes on the LST itself, we entered the average values of landscape (NDBI, NDWI, GNDVI) and topographic (elevation, slope) features of artificial grassland (420) in Seoul when predicting the LST.
Figure 8 and Figure 9 show the distribution of LST before and after the expressway underground project with several buffer distances from the expressway. Prior to the project (Figure 8a,b), the average LST along the expressway was 27.51 °C, which ranges from 22.50 °C to 30.97 °C. After the greening (Figure 8c,d), however, the average LST within the expressway was decreased by 0.39 °C, and the largest decrease was about 1.59 °C. This LST reduction effect was found to be more pronounced near the expressway intersection, where the proportion of impervious areas is generally high [43].
As the distance from the expressway increases, the LST reduction effect was steadily weakened. It is noteworthy that there is little effect on reducing LST by greening expressway from a radius of 150 m or more. Similarly, the maximum LST reduction was 1.63 °C, and it was sharply decreased after 150 m away from the expressway. This trend corresponds with the LST prediction models the study developed. In addition, our results suggest that when the built-up area is converted to green space, the LST reduction effect can be brought to not only the area itself but also the neighboring area.

4. Discussion

Compared to previous studies, our study is novel in both methodological and theoretical perspectives. First, the study adopted an XAI approach in examining the impact of LULC changes on LST. In early studies, the prediction accuracy of the LST model was relatively low since the associations between influencing factors and LST had been understood to be linear [44]. To capture the non-linear properties of LST modeling, recent studies have adopted several advanced techniques including machine learning (ML) and artificial intelligence (AI) models [18]. As a result, the overall accuracy of LST prediction has been improved, but their black-box nature has inevitably lowered the interpretability of models [45].
To overcome these methodological limitations, we utilized the XAI model by integrating XGBoost and SHAP models. This enabled our study to sufficiently interpret the associations between several remotely sensed features and LST, while guaranteeing the relatively high prediction accuracy. Second, this study developed several LST models and implemented the sensitivity test to derive the optimal buffer radius for landscape, topographic, and LULC features from a certain LST raster cell. The findings of this study showed that the overall accuracy of models peaks at a 150 m buffer radius, and then steadily decreased with longer distances. While there are a number of recent studies that have examined the LST reduction effect at several distances from a specific LULC type, including a park or a river [23,46], only a few studies have examined the sensitivity of model accuracy by differing distances from LST.
Third, our study’s results showed that NDBI and GNDVI are some of the most influencing factors in predicting LST. It is noteworthy that qualitative rather than quantitative aspects of the surrounding environment may contribute more to lowering LST. In addition, by using the developed model, we predicted the LST reduction effects by converting LULC types. Results showed that replacing built-up areas with vegetation can significantly decrease the nearby LST.
Based on this study’s findings, we propose several policy implications for planning and managing the urban thermal environment. First, current environmental planning promotes greening projects and low-density development throughout the city to reduce thermal environmental issues such as UHI. However, the thermal environment of the city is spatially heterogeneous, and thus it is necessary to consider regional characteristics in establishing strategies. To this end, practitioners in the urban and environmental planning sector should evaluate the spatial extent of various factors that affect the air (or surface) temperature within the city. The optimal extent of thermal impact may depend on the city’s built-environment and climate characteristics.
Second, the XAI-based LST prediction model can be utilized for assessing and monitoring the thermal environmental impact of urban planning and projects. More specifically, planners can construct LST prediction maps and can evaluate the effects of multiple urban development (or greening) projects on thermal environments. Moreover, with the high reliability of the model and the derived relative importance between variables, this method can contribute to determine the priorities of different policy measures. Third, urban and environmental planners should take several strategic measures to preserve green areas within the city that are found to be effective in reducing LST. In particular, for the areas with relatively high LST, including commercial, road, and parking areas, it is necessary to establish measures to secure or to link with adjacent green areas. Otherwise, the LST of urban areas can be lowered by constructing narrow green areas in between the roads and buildings.
Finally, this study suggests that both quantitative and qualitative improvement is necessary to increase the city’s thermal comfort. While the current environmental planning has focused on securing a sufficient number of LULC types (park, river, etc.) that reduce thermal environmental issues, the quality of those areas (shape, vitality, etc.) has not been fully considered [47]. In order to improve the thermal environment in the city, it is necessary to evaluate the surrounding environment of vulnerable areas and establish site-specific strategies with both quantitative and qualitative perspectives.

5. Conclusions

Examining the impact of LULC changes on LST has long been an area of interest in urban study and remote sensing fields. By integrating the XGBoost and SHAP model, this study developed the LST model for several buffer distances and predicted the LST reduction effects after LULC changes. The findings of this study showed that the prediction accuracy of LST was maximized when landscape, topographic, and LULC features within a 150 m buffer radius were adopted as independent variables. As a result, the existence of surrounding built-up and vegetation areas were found to be the most influencing factors for explaining LST. Moreover, LULC changes from expressway to green park areas were predicted as more than 1.5 °C of decreasing LST, which supports the reliability of the model.
The findings of this study contribute to the existing literature as it applied advanced XAI models in examining the effects of LULC change on LST. Since dependent and independent variables used in this study are easily accessible, the same analysis can be performed for other regions. Based on its generality, the research process of this study could be widely applied as a preliminary tool for establishing strategies of thermal environment planning.
Despite this study’s contribution, there are still several limitations that need to be improved in future studies. First, this study utilized a single remotely sensed image to analyze the relationships between LST and LULC. To capture the diurnal and seasonal variations of LST, it is necessary to acquire a sufficient number of satellite images in further research [48]. Second, while it is obvious that LST is largely affected by the albedo of land surface [49], the independent variables used in the study could not reflect these characteristics in the model. When future studies include building (or road) materials and reflectivity in modeling LST, it is expected that more in-depth urban planning and design measures can be derived [50]. Finally, there have been a number of XAI models, such as LightGBM and Catboost, developed and applied in recent studies [51,52]. Predicting LST using various XAI models and comparing the results would contribute to improving methodological aspects of thermal environment modeling.

Author Contributions

Conceptualization, M.K. and G.K.; methodology, M.K. and G.K.; software, M.K. and D.K.; validation, D.K.; formal analysis, M.K. and D.K.; investigation, M.K. and D.K.; resources, D.K. and G.K.; data curation, D.K.; writing—original draft preparation, M.K.; writing—review and editing, G.K.; visualization, M.K.; supervision, G.K. project administration, G.K.; funding acquisition, G.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by a Korea Agency for Infrastructure Technology Advancement (KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (Grant 22UMRG-C158194-03).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This paper is based on the findings of the research project (2022-012(R)) which was conducted by the Korea Environment Institute (KEI) and supported by a Korea Agency for Infrastructure Technology Advancement (KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (Grant 22UMRG-C158194-03).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lo, C.P.; Quattrochi, D.A. Land-use and land-cover change, urban heat island phenomenon, and health implications. Photogramm. Eng. Remote Sens. 2003, 69, 1053–1063. [Google Scholar] [CrossRef]
  2. Mallick, J.; Kant, Y.; Bharath, B.D. Estimation of land surface temperature over Delhi using Landsat-7 ETM+. J. Ind. Geophys. Union 2008, 12, 131–140. [Google Scholar]
  3. Kim, M.; Lee, K.; Cho, G.H. Temporal and spatial variability of urban heat island by geographical location: A case study of Ulsan, Korea. Build. Environ. 2017, 126, 471–482. [Google Scholar] [CrossRef]
  4. Estoque, R.C.; Murayama, Y.; Myint, S.W. Effects of landscape composition and pattern on land surface temperature: An urban heat island study in the megacities of Southeast Asia. Sci. Total. Environ. 2017, 577, 349–359. [Google Scholar] [CrossRef] [PubMed]
  5. Yang, L.; Qian, F.; Song, D.-X.; Zheng, K.-J. Research on Urban Heat-Island Effect. Procedia Eng. 2016, 169, 11–18. [Google Scholar] [CrossRef]
  6. Niemelä, J. Ecology and urban planning. Biodivers. Conserv. 1999, 8, 119–131. [Google Scholar] [CrossRef]
  7. Lai, L.-W.; Cheng, W.-L. Air quality influenced by urban heat island coupled with synoptic weather patterns. Sci. Total. Environ. 2009, 407, 2724–2733. [Google Scholar] [CrossRef]
  8. Sarrat, C.; Lemonsu, A.; Masson, V.; Guédalia, D. Impact of urban heat island on regional atmospheric pollution. Atmos. Environ. 2006, 40, 1743–1758. [Google Scholar] [CrossRef]
  9. Patz, J.A.; Campbell-Lendrum, D.; Holloway, T.; Foley, J.A. Impact of regional climate change on human health. Nature 2005, 438, 310–317. [Google Scholar] [CrossRef]
  10. Fu, P.; Weng, Q. A time series analysis of urbanization induced land use and land cover change and its impact on land surface temperature with Landsat imagery. Remote. Sens. Environ. 2016, 175, 205–214. [Google Scholar] [CrossRef]
  11. Xiao, R.; Weng, Q.; Ouyang, Z.; Li, W.; Schienke, E.W.; Zhang, Z. Land Surface Temperature Variation and Major Factors in Beijing, China. Photogramm. Eng. Remote. Sens. 2008, 74, 451–461. [Google Scholar] [CrossRef]
  12. Park, J.-H.; Cho, G.-H. Examining the Association between Physical Characteristics of Green Space and Land Surface Temperature: A Case Study of Ulsan, Korea. Sustainability 2016, 8, 777. [Google Scholar] [CrossRef] [Green Version]
  13. Weng, Q.; Lu, D.; Schubring, J. Estimation of land surface temperature–vegetation abundance relationship for urban heat island studies. Remote Sens. Environ. 2004, 89, 467–483. [Google Scholar] [CrossRef]
  14. Rana, V.K.; Suryanarayana, T.M.V. Performance evaluation of MLE, RF and SVM classification algorithms for watershed scale land use/land cover mapping using sentinel 2 bands. Remote Sens. Appl. Soc. Environ. 2020, 19, 100351. [Google Scholar] [CrossRef]
  15. Abu Bakar, S.B.; Pradhan, B.; Lay, U.S.; Abdullahi, S. Spatial assessment of land surface temperature and land use/land cover in Langkawi Island. IOP Conf. Series Earth Environ. Sci. 2016, 37, 12064. [Google Scholar] [CrossRef] [Green Version]
  16. Gage, E.A.; Cooper, D.J. Relationships between landscape pattern metrics, vertical structure and surface urban Heat Island formation in a Colorado suburb. Urban Ecosyst. 2017, 20, 1229–1238. [Google Scholar] [CrossRef]
  17. Kafy, A.A.; Rahman, M.S.; Hasan, M.M.; Islam, M. Modelling future land use land cover changes and their impacts on land surface temperatures in Rajshahi, Bangladesh. Remote Sens. Appl. Soc. Environ. 2020, 18, 100314. [Google Scholar] [CrossRef]
  18. Bozorgi, M.; Nejadkoorki, F.; Mousavi, M.B. Land surface temperature estimating in urbanized landscapes using artificial neural networks. Environ. Monit. Assess. 2018, 190, 250. [Google Scholar] [CrossRef]
  19. Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef] [Green Version]
  20. Abdollahi, A.; Pradhan, B. Urban Vegetation Mapping from Aerial Imagery Using Explainable AI (XAI). Sensors 2021, 21, 4738. [Google Scholar] [CrossRef]
  21. Dikshit, A.; Pradhan, B. Interpretable and explainable AI (XAI) model for spatial drought prediction. Sci. Total. Environ. 2021, 801, 149797. [Google Scholar] [CrossRef] [PubMed]
  22. Zhang, Y.; Murray, A.T.; Turner, B. Optimizing green space locations to reduce daytime and nighttime urban heat island effects in Phoenix, Arizona. Landsc. Urban Plan. 2017, 165, 162–171. [Google Scholar] [CrossRef]
  23. Chibuike, E.M.; Ibukun, A.O.; Abbas, A.; Kunda, J.J. Assessment of green parks cooling effect on Abuja urban microclimate using geospatial techniques. Remote Sens. Appl. Soc. Environ. 2018, 11, 11–21. [Google Scholar] [CrossRef]
  24. Kim, E.-S.; Yun, S.-H.; Park, C.-Y.; Heo, H.-K.; Lee, D.-K. Estimation of Mean Radiant Temperature in Urban Canyons Using Google Street View: A Case Study on Seoul. Remote Sens. 2022, 14, 260. [Google Scholar] [CrossRef]
  25. Han, Y.; Song, Y.; Burnette, L.; Lammers, D. Spatiotemporal Analysis of the Formation of Informal Settlements in a Metropolitan Fringe: Seoul (1950–2015). Sustainability 2017, 9, 1190. [Google Scholar] [CrossRef] [Green Version]
  26. Choi, Y.Y.; Suh, M.S.; Park, K.H. Assessment of Surface Urban Heat Islands over Three Megacities in East Asia Using Land Surface Temperature Data Retrieved from COMS. Remote Sens. 2014, 6, 5852–5867. [Google Scholar] [CrossRef] [Green Version]
  27. Landsat Missions. Using the USGS Landsat8 product. US Dep. Inter. US Geol. Surv. NASA 2016, 1–3. [Google Scholar]
  28. Sobrino, J.A.; Jimenez-Munoz, J.C.; Soria, G.; Romaguera, M.; Guanter, L.; Moreno, J.; Plaza, A.; Martinez, P. Land Surface Emissivity Retrieval from Different VNIR and TIR Sensors. IEEE Trans. Geosci. Remote. Sens. 2008, 46, 316–327. [Google Scholar] [CrossRef]
  29. Jimenez-Munoz, J.C.; Cristobal, J.; Sobrino, J.A.; Soria, G.; Ninyerola, M.; Pons, X. Revision of the Single-Channel Algorithm for Land Surface Temperature Retrieval from Landsat Thermal-Infrared Data. IEEE Trans. Geosci. Remote Sens. 2009, 47, 339–349. [Google Scholar] [CrossRef]
  30. Lee, G.; Hwang, J.; Cho, S. A novel index to detect vegetation in urban areas using UAV-based multispectral images. Appl. Sci. 2021, 11, 3472. [Google Scholar] [CrossRef]
  31. Kshetri, T. NDVI, NDBI and NDWI calculation using landsat 7 and 8. GeoWorld 2018, 2, 32–34. [Google Scholar]
  32. Zhang, Y.; Haghani, A. A gradient boosting method to improve travel time prediction. Transp. Res. Part C Emerg. Technol. 2015, 58, 308–324. [Google Scholar] [CrossRef]
  33. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  34. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  35. Jabeur, S.B.; Mefteh-Wali, S.; Viviani, J.L.; Viviani, J.L. Forecasting gold price with the XGBoost algorithm and SHAP interaction values. Ann. Oper. Res. 2021, 1–21. [Google Scholar] [CrossRef]
  36. Parsa, A.B.; Movahedi, A.; Taghipour, H.; Derrible, S.; Mohammadian, A.K. Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accid. Anal. Prev. 2020, 136, 105405. [Google Scholar] [CrossRef]
  37. Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. Adv. Neural Inf. Processing Syst. 2017, 30. [Google Scholar]
  38. Shapley, L.S. Stochastic games. Proc. Natl. Acad. Sci. USA 1953, 39, 1095–1100. [Google Scholar] [CrossRef] [Green Version]
  39. Kwak, Y.; Park, C.; Deal, B. Discerning the success of sustainable planning: A comparative analysis of urban heat island dynamics in Korean new towns. Sustain. Cities Soc. 2020, 61, 102341. [Google Scholar] [CrossRef]
  40. Otgonbayar, M.; Atzberger, C.; Mattiuzzi, M.; Erdenedalai, A. Estimation of Climatologies of Average Monthly Air Temperature over Mongolia Using MODIS Land Surface Temperature (LST) Time Series and Machine Learning Techniques. Remote Sens. 2019, 11, 2588. [Google Scholar] [CrossRef] [Green Version]
  41. Xue, Z.; Hou, G.; Zhang, Z.; Lyu, X.; Jiang, M.; Zou, Y.; Shen, X.; Wang, J.; Liu, X. Quantifying the cooling-effects of urban and peri-urban wetlands using remote sensing data: Case study of cities of Northeast China. Landsc. Urban Plan. 2019, 182, 92–100. [Google Scholar] [CrossRef]
  42. Yue, W.; Xu, J.; Tan, W.; Xu, L. The relationship between land surface temperature and NDVI with remote sensing: Application to Shanghai Landsat 7 ETM+ data. Int. J. Remote. Sens. 2007, 28, 3205–3226. [Google Scholar] [CrossRef]
  43. Yuan, F.; Bauer, M.E. Comparison of impervious surface area and normalized difference vegetation index as indicators of surface urban heat island effects in Landsat imagery. Remote Sens. Environ. 2007, 106, 375–386. [Google Scholar] [CrossRef]
  44. Wicki, A.; Parlow, E. Multiple regression analysis for unmixing of surface temperature data in an urban environment. Remote Sens. 2017, 9, 684. [Google Scholar] [CrossRef] [Green Version]
  45. Das, A.; Rad, P. Opportunities and challenges in explainable artificial intelligence (xai): A survey. arXiv 2020, arXiv:cs.CV/2006.11371. [Google Scholar]
  46. Cheval, S.; Popa, A.-M.; Șandric, I.; Iojă, I.-C. Exploratory analysis of cooling effect of urban lakes on land surface temperature in Bucharest (Romania) using Landsat imagery. Urban Clim. 2020, 34, 100696. [Google Scholar] [CrossRef]
  47. Wu, W.; Li, L.; Li, C. Seasonal variation in the effects of urban environmental factors on land surface temperature in a winter city. J. Clean. Prod. 2021, 299, 126897. [Google Scholar] [CrossRef]
  48. Song, J.; Chen, W.; Zhang, J.; Huang, K.; Hou, B.; Prishchepov, A.V. Effects of building density on land surface temperature in China: Spatial patterns and determinants. Landsc. Urban Plan. 2020, 198, 103794. [Google Scholar] [CrossRef]
  49. Chen, W.; Wu, A.N.; Biljecki, F. Classification of urban morphology with deep learning: Application on urban vitality. Comput. Environ. Urban Syst. 2021, 90, 101706. [Google Scholar] [CrossRef]
  50. Shahriar, S.A.; Kayes, I.; Hasan, K.; Hasan, M.; Islam, R.; Awang, N.R.; Hamzah, Z.; Rak, A.E.; Salam, M.A. Potential of Arima-ann, Arima-SVM, dt and catboost for atmospheric PM2.5 forecasting in Bangladesh. Atmosphere 2021, 12, 100. [Google Scholar] [CrossRef]
  51. Zhao, X.; Liu, J.; Bu, Y. Quantitative Analysis of Spatial Heterogeneity and Driving Forces of the Thermal Environment in Urban Built-Up Areas: A Case Study in Xi’an, China. Sustainability 2021, 13, 1870. [Google Scholar] [CrossRef]
  52. Teimouri, R.; Yigitcanlar, T. An approach towards effective ecological planning: Quantitative analysis of urban green space characteristics. Glob. J. Environ. Sci. Manag. 2018, 4, 195–206. [Google Scholar]
Figure 1. Study area (a) Seoul city and (b) Gyeongbu Expressway.
Figure 1. Study area (a) Seoul city and (b) Gyeongbu Expressway.
Ijerph 19 15926 g001
Figure 2. Variables used in the study. (a) LST, (b) NDBI, (c) NDWI, (d) GNDVI, (e) elevation, (f) slope.
Figure 2. Variables used in the study. (a) LST, (b) NDBI, (c) NDWI, (d) GNDVI, (e) elevation, (f) slope.
Ijerph 19 15926 g002
Figure 3. Research process.
Figure 3. Research process.
Ijerph 19 15926 g003
Figure 4. XGBoost model process.
Figure 4. XGBoost model process.
Ijerph 19 15926 g004
Figure 5. Proportion of LULC types in Seoul.
Figure 5. Proportion of LULC types in Seoul.
Ijerph 19 15926 g005
Figure 6. Sensitivity of prediction accuracy.
Figure 6. Sensitivity of prediction accuracy.
Ijerph 19 15926 g006
Figure 7. (a) Relative importance and (b) direction of independent variables.
Figure 7. (a) Relative importance and (b) direction of independent variables.
Ijerph 19 15926 g007
Figure 8. Prediction of LST before (a,b) and after (c,d) expressway underground project.
Figure 8. Prediction of LST before (a,b) and after (c,d) expressway underground project.
Ijerph 19 15926 g008
Figure 9. Average and maximum LST reduction after greening project (50 m~500 m).
Figure 9. Average and maximum LST reduction after greening project (50 m~500 m).
Ijerph 19 15926 g009
Table 1. Variables used in the study.
Table 1. Variables used in the study.
DataSpatial ResolutionSource (Year)
Dependent VariableLand Surface Temperature30 m × 30 mLandsat 8 (2019)
Independent
Variable
Landscape
features
NDBI10 m × 10 mSentinel-2 (2019)
NDWI
GNDVI
Topographic
features
ElevationDigital Elevation Map (2019)
Slope
LULC
features
Urbanized areaLand Cover Map (2019)
Agricultural area
Forest area
Grassland area
Wetland area
Bareland area
Water area
Table 2. Descriptive statistics.
Table 2. Descriptive statistics.
VariablesMeanMinMaxSD
LST27.351413.619843.13084.0636
NDBI−0.0987−0.99610.81490.1847
NDWI−0.2621−0.82640.43360.2284
GNDVI−0.1715−0.95980.60930.2272
Elevation59.93350.1000752.749582.8336
Slope7.91190.000069.81809.3268
Table 3. Parameters of XGBoost model.
Table 3. Parameters of XGBoost model.
ParametersValues
Number of iterations280
Max depth11
Subsample ratio0.7
Learning rate0.2
Colsample_bytree0.9
Alpha1 × 10−6
Lambda0.05
Gamma0
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kim, M.; Kim, D.; Kim, G. Examining the Relationship between Land Use/Land Cover (LULC) and Land Surface Temperature (LST) Using Explainable Artificial Intelligence (XAI) Models: A Case Study of Seoul, South Korea. Int. J. Environ. Res. Public Health 2022, 19, 15926. https://doi.org/10.3390/ijerph192315926

AMA Style

Kim M, Kim D, Kim G. Examining the Relationship between Land Use/Land Cover (LULC) and Land Surface Temperature (LST) Using Explainable Artificial Intelligence (XAI) Models: A Case Study of Seoul, South Korea. International Journal of Environmental Research and Public Health. 2022; 19(23):15926. https://doi.org/10.3390/ijerph192315926

Chicago/Turabian Style

Kim, Minjun, Dongbeom Kim, and Geunhan Kim. 2022. "Examining the Relationship between Land Use/Land Cover (LULC) and Land Surface Temperature (LST) Using Explainable Artificial Intelligence (XAI) Models: A Case Study of Seoul, South Korea" International Journal of Environmental Research and Public Health 19, no. 23: 15926. https://doi.org/10.3390/ijerph192315926

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop