Spatiotemporal Characteristics and Influencing Factors of Urban Heat Island Based on Geographically Weighted Regression Model: A Case Study of Urumqi City

: Rapid urbanization threatens the ecological environment and quality of life by signiﬁcantly altering land use and land cover (LULC) and heat distribution. One of the most signiﬁcant environmental consequences of urbanization is the urban heat island effect (UHI). This study investigated the spatiotemporal characteristics of the SUHI and its relationship with land use types from 2000 to 2020 in Urumqi City, located in an arid and semi-arid region of northwestern China. Additionally, the ordinary least squares (OLS) and geographically weighted regression (GWR) models were used to quantify the relationship between the land surface temperature (LST) and inﬂuencing factors. The results showed that the area of the lower surface temperature classes has decreased signiﬁcantly. In comparison, the area of the higher surface temperature classes has experienced a steady rise over the last two decades. From 2000 to 2020, the share of the area occupied by the temperature range <30 ◦ C decreased by 67.09%. In addition, the LST varied signiﬁcantly from one category of land use to another. The average LST of built-up land and unused land was higher than the average LST of other land use types in all years, while the average LST of grassland, forest land, and water bodies was signiﬁcantly lower. Finally, the results of the GWR model showed that R 2 and adjusted R 2 of the GWR were 0.75 and 0.73, obviously larger than the 0.58 of the OLS models. The GWR model’s higher R 2 and adjusted R 2 compared to the OLS model indicates that the relationship between LST and the inﬂuencing factors underlying the model may exhibit spatial non-stationarity, and the GWR model performs better than the OLS model. The results of both OLS and GWR models show that the normalized difference vegetation index (NDVI) and slope were negatively correlated with LST, while the urban index (UI) and normalized difference built-up index (NDBI) were positively correlated with LST. The ﬁndings of the study indicate that increasing green spaces and limiting the unplanned expansion of urban areas are effective measures to mitigate the UHIs in the study area. The results of the study may provide valuable insights into the spatiotemporal characteristics of the UHI and its drivers. Understanding the spatiotemporal characteristics of the UHI can help urban planners, policymakers, and scientists develop more effective urban cooling strategies and improve the urban thermal environment.


Introduction
With the development of modern society, a large influx of people to cities has led to increasing urbanization around the world [1,2].On the one hand, rapid urbanization has led to significant improvements in basic infrastructure, public services and living conditions for the population [3].On the other hand, it has created many problems, such as environmental pollution, reduction of biological diversity, and human health problems that need to be solved [4].The large number of building facilities in urban centers can significantly increase land surface temperature (LST), which has become a significant concern for geographers and urban planners [5].The urban land surface consists of vegetation (grassland, forest, bush jungle, orchards, farmland), water bodies, impervious materials, soils and rocks, and the increase in urban surface temperature caused by large areas of waterproof materials in the city contributes to forming an urban heat island (UHI).The UHI affects the urban environment, air quality, and growth and development of plants and animals, and directly or indirectly affects people's health [6][7][8].The climate change assessment report published by the United Nations Intergovernmental Panel on Climate Change (IPCC) states that the best estimate of global warming would reach 1.5 • C in the next twenty years, and the world will inevitably face multiple climate hazards.Thus, a better understanding of the spatiotemporal characteristics and influencing factors of UHI effects is necessary to support climate change mitigation efforts and urban planning and management [9][10][11][12].The scholar Manley first formally defined the UHI in the 1950s [13].The UHI is a phenomenon in which the LST of urban areas is higher than that of the surrounding nonurban environment due to urbanization [14].The energy basis for the formation of UHIs is the heat balance, which is influenced by a combination of natural and anthropogenic factors, emphasizing land use and land cover (LULC) changes [15,16].Urban expansion causes dramatic changes in the urban land surface in the urbanization process, and the original rural land undergoes irreversible transformation and gradually becomes urban land.This process replaces natural soil and vegetation with impermeable cement, asphalt, brick, tile, etc. [17,18].This change leads to a drastic change like thermal radiation in the urban substrate, which results in a significant difference in solar radiation between urban and suburban areas.Thus, there is a corresponding quantitative relationship between UHI and LULC.Scholars have recently conducted much research to quantify the relationship between UHI and LULC.Chen et al. have studied the urbanization process through changes in LST and land use using Indiana, USA, as an example [19].Jiang and Zha have analyzed the quantitative relationship between LST and land use in urban areas of Hefei City using Landsat TM images as the data source, and the study showed that the spatial distribution of SUHIs in Hefei City was consistent with the contour of urban construction land.Liu et al. have analyzed the relationship between the SUHI effect and land use changes in 11 jurisdictions in Wuhan city by applying remote sensing and GIS techniques and found that the increase in built-up land was an important factor in the expansion of SUHI intensity [20].
Remote sensing (RS) has proven to be a powerful technology in the domain of environmental studies.It has been widely used in the study of urban climate because of its ability to generate continuous real-time data on a large scale [21,22].RS technology offers a simple and consistent approach for researching the thermal environment of cities, as it can be used to acquire spatially continuous information over a wide region and historical data, thus overcoming the constraints of the traditional process of ground-based observation by a small number of meteorological stations [11].At present, NOAA/AVHRR, EOS/MODIS, ASTER, and Landsat TM/ETM+ are commonly used data sources in the research on the SUHI effect.Among them, the Landsat TM/ETM+ sensors have a relatively high spatial resolution.They can accurately reflect the spatial differences in heat island intensity within cities, so they are widely used in SUHI research.Xu et al. quantitatively analyzed the temporal and spatial characteristics of the SUHI effect in Shenzhen City, China, using Landsat-7 and Landsat-8 images between 2014 and 2019 [23].Tesfamariam et al. assessed the spatiotemporal variation in SUHI effects in Mekelle City, Northern Ethiopia, using Landsat TM, ETM+, and OLI sensor satellite images [24].Welegedara et al. explored the spatiotemporal changes in the SUHI effects in Edmonton City, Canada, using Landsat-5 and Landsat-8 satellite images [25].
Studies on the factors influencing the UHI focus on revealing the intensity of each factor's contribution to the UHI.Many factors affect the LST, including vegetation cover, water bodies, slope, elevation, and urban construction intensity.Rizwan et al. divided the influencing factors into two major categories: controllable and uncontrollable [26].The controllable factors include human-related anthropogenic heat, air pollution, design-related urban sky view factors, green space coverage, building materials, etc. Uncontrollable factors include anticyclone, season, wind speed, cloudiness, etc. Tang et al. have analyzed the relationship between vegetation cover, normalized difference water index (NDWI), normalized difference built-up index (NDBI), and LST in Zhengzhou, China [27].They found that the NDBI was positively correlated with LST, while the NDWI was negatively correlated with LST.Hung et al. have studied the spatial distribution characteristics of the SUHI effect intensity in 18 major Asian cities using TERRA/MODIS data and Landsat image series and pointed out the correlation between LST and vegetation cover [28].Shahfahad et al. conducted a study on the seasonal variation in LST in eight cities located in the semi-arid region of India.Their findings revealed that soil moisture has a greater influence on SUHI than vegetation conditions in semi-arid environments [29].Enete et al. have studied the effect of urban tree species on SUHI effect reduction in Enugu, Nigeria, and found that urban tree species reduced the temperature between 5 and 8 • C [30].Zhang et al. studied the cooling effects of different green spaces and found that the average cooling range of various green spaces ranged from 1.2 to 9.5 • C.They also found that green spaces' SUHI effect mitigation function was related to the type of green space, tree species, species composition, and stand density [31].
Nowadays, a variety of mathematical approaches and models are being used by researchers to examine the connection between UHI changes and influencing factors, which mainly include Pearson correlation analysis, principal component analysis (PCA), ordinary least squares (OLS) regression analysis, the geographical weighted regression (GWR) model, grey correlation analysis, and geo-detector analysis [32][33][34].Among them, the GWR model proposed by Brunsdon et al. has been applied in geography and its related disciplines [35].Wang et al. investigated the relationship between land cover and surface temperature in the Beijing-Tianjin-Tanggu urban agglomeration by establishing a GWR model, and the results showed that the land cover ratio and LST were significantly correlated [36].Kashki et al. evaluated the relationship between LST and influencing factors in Shiraz City, Iran.They found that the GWR model had the spatial distribution of the LST compared with OLS [37].Li et al. analyzed the correlations between SUHI and driving factors in 419 major cities worldwide [38].They found that the GWR model had a higher coefficient of determination (R 2 ) than OLS and stepwise multiple linear regression (SMLR).
Although extensive research has been carried out on the spatiotemporal patterns and influencing factors of LST globally and in China, there are a lack of investigations on cities in arid and semi-arid regions of northwestern China.As the capital of Xinjiang Uygur Autonomous Region, Urumqi is a world-famous route along the ancient Silk Road and a transportation hub between Xinjiang and mainland China [39].In the 21st century, due to the ongoing implementation of the national development strategy, the population of Urumqi has been growing [40].As a result, there has been a growing trend of urban population, leading to a steady increase in urbanization [41].However, this city belongs to a temperate continental arid climate with a fragile ecological environment, and the ecological problems caused by urban heat islands are prominent.Therefore, the purpose of this research was to (1) evaluate the spatial and temporal distribution of UHI intensity between 2000 and 2020 in Urumqi City in the arid and semi-arid areas of northwestern China; (2) explore the relationship between LST and land use; (3) establish a GWR model to investigate the spatial relationship between the LST and relevant driving forces, and recommend possible solutions for mitigating the UHI in the study region.This study attempts to provide insights into the spatiotemporal characteristics of the UHI in Urumqi.By doing so, it tries to provide a theoretical foundation for mitigating the impact of UHI in the city and ultimately enhance the quality of life and living conditions.

Study Area
Urumqi is located at 86 • 37 33 -88 • 58 24 E, 42 • 45 32 -45 • 00 00 N, which is the farthest from the coastline among all cities in the world.It is the capital of the Xinjiang Uyghur Autonomous Region and the core city of the urban agglomeration on the northern slope of Tianshan Mountain (Figure 1).Urumqi is in a mid-temperate continental arid climate zone, with significant variations in temperature, sharp fluctuations in cold and heat, and scant precipitation.The highest temperatures in the area occur in July and August, with an average temperature of 25.7 • C. Winters are cold and long, with average temperatures reaching −15 • C in January.With the rapid social and economic development, the urban and rural residents of Urumqi are increasing year by year, and people's living standards and quality of life are improving.The share of population and employment in the territory is rising, with the resident population exceeding 3 million, according to the sixth national census.The city's per capita GDP exceeds $4000, placing it among the top provincial capitals in western China.
to investigate the spatial relationship between the LST and relevant driving forces, and recommend possible solutions for mitigating the UHI in the study region.This study attempts to provide insights into the spatiotemporal characteristics of the UHI in Urumqi.By doing so, it tries to provide a theoretical foundation for mitigating the impact of UHI in the city and ultimately enhance the quality of life and living conditions.

Study Area
Urumqi is located at 86°37′33″-88°58′24″ E, 42°45′32″-45°00′00″ N, which is the farthest from the coastline among all cities in the world.It is the capital of the Xinjiang Uyghur Autonomous Region and the core city of the urban agglomeration on the northern slope of Tianshan Mountain (Figure 1).Urumqi is in a mid-temperate continental arid climate zone, with significant variations in temperature, sharp fluctuations in cold and heat, and scant precipitation.The highest temperatures in the area occur in July and August, with an average temperature of 25.7 °C.Winters are cold and long, with average temperatures reaching −15 °C in January.With the rapid social and economic development, the urban and rural residents of Urumqi are increasing year by year, and people's living standards and quality of life are improving.The share of population and employment in the territory is rising, with the resident population exceeding 3 million, according to the sixth national census.The city's per capita GDP exceeds $4000, placing it among the top provincial capitals in western China.

Data Sources
Landsat remote sensing data for 2000-2020, land use data for 2000-2020, digital elevation model (DEM) data at a resolution of 30 m, and the administrative boundary vector file were used in this study.Landsat remote sensing data and DEM data and the administrative boundary vector file of the study area were sourced from the Geospatial Data Cloud Platform of the Computer Network Information Center of the Chinese Academy of Sciences (https://www.gscloud.cn(accessed on 10 September 2022)).The spatial resolution of Landsat remote sensing data was 30 m, the selected data were concentrated from June to September, and the cloud cover was less than 10%, which met the requirements of the study (Table 1).The administrative boundary of the study area did not change during the study period.Therefore, the administrative boundary vector file for the year 2018 was used in this study.Land use data were obtained from the Institute of Geographical Sciences and Resources, Chinese Academy of Sciences (https://www.resdc.cn(accessed on 13 September 2022)) with a resolution of 30 m.Based on the LULC classification system of the Chinese Academy of Sciences, the land use types of Urumqi in 2000, 2010, 2015, and 2020 were classified into six categories: cropland, grassland, forest land, built-up land, water bodies, and unused land.Radiometric calibration and atmospheric correction were performed on Landsat satellite image data during pre-processing.The Radiometric Calibration tool in the ENVI5.3software was utilized to convert the digital numbers (DN) of Landsat images to corresponding thermal radiation intensity values (L λ ) [42].The equations can be written as: where L maxλ and L minλ are the minimum and maximum radiation values, respectively, which can be obtained from the metadata of Landsat images.FLAASH Atmospheric Correction under the Atmospheric Correction Module column of Radiometric Correction in the ENVI5.3toolbar was used to perform an atmospheric correction process for the thermal infrared band of Landsat images.

Retrieval of the LST
In this study, the atmospheric correction method (radiative transfer equation) was used for surface temperature inversion of Landsat images of Urumqi City.The atmospheric correction approach is a traditional algorithm derived from the atmospheric radiative transfer model.Firstly, the radiance value was converted into brightness temperature with the following equation: where P(T S ) is the brightness temperature in Kelvin (K).L ↑ is the atmospheric upwelling radiance, and L ↓ is the atmospheric downwelling radiance.τ is the atmospheric transmission, while ε is the surface emissivity.L ↓ , L ↑ and τ were obtained from the query website published by NASA (http://atmcorr.gsfc.nasa.gov(accessed on 20 September 2022)).Surface emissivity can be calculated by the NDVI threshold method proposed by Sobrino [43].The mathematical expression can be written as: where ε is the surface emissivity, while P V is the vegetation cover.Secondly, vegetation cover was calculated based on the formula proposed by Zhang [44] with the following equation: where NDVI stands for the normalized difference vegetation index, NDVI Soil is the NDVI value of bare soil, and NDVI Veg is the NDVI of the regions with full vegetation.The values 0.05 and 0.70, respectively, were used as the values for NDVI Soil and NDVI Veg in the study.Finally, the LST can be obtained by using the Planck function after obtaining the brightness temperature, which is calculated as: where LST represents the land surface temperature measured in degrees Celsius, while K 1 and K 2 are constants whose values can be obtained from the MTL file.

Selection and Calculation of the Influencing Factors
According to the existing studies on the LST driving mechanism, four elements, including NDBI, UI, NDVI in 2020, and slope, were selected in this study (Figure 2).The selection of these drivers was based on their relevance to the research question, the availability of the data, and the principles of validity and reliability.The definitions and equations for calculating these factors are shown in Table 2.
Table 2.The definition of the selected influencing factors.

NDBI
NDBI is a remote sensing index used to measure the density of buildings on the ground surface.The higher its value, the higher the density of buildings in the corresponding area [45].
UI UI is an indicator that describes the size structure of a country or region's cities [46].
NDVI NDVI is the premier indicator for determining vegetation growth and cover [47].

Slope
The slope is the measure of the inclination of the actual ground (D) when compared to the horizontal plane (H).

S=H/D
Figure 2 shows the spatial distribution of each driver in Urumqi.The NDVI is generally consistent with the distribution of vegetation in the study area.The slopes of Urumqi are widely distributed in the mountainous regions on the eastern, southern, and western sides of the city.It is evident that the distribution of NDBI and UI is related to the urban substrate.

OLS and GWR Model
The OLS estimation method, which has global spatial stability, is based on the assumption of a linear regression relationship, thereby generating a unique regression equation that matches all variables.The specific equation is as follows: where Y represents the dependent variable that indicates the surface temperature, while x1-xk are independent variables indicating the influencing factors related to the surface temperature.β is the coefficient to be estimated, responding to the linear correlation between Y and x, where β0 denotes the intercept constant, β1-βk denotes the correlation coefficient between each influence factor of x1-xk and the surface temperature;  is the error term that conforms to the positive terrestrial distribution.The matrix equation for the least squares estimation of the parameters is as follows: where X is a design matrix of the independent variables of dimension n × (k + 1), while Y is a column vector of observed dependent variables of dimension n × 1. n is the number of data points, while T is a vector of n + 1 local regression coefficients.

OLS and GWR Model
The OLS estimation method, which has global spatial stability, is based on the assumption of a linear regression relationship, thereby generating a unique regression equation that matches all variables.The specific equation is as follows: where Y represents the dependent variable that indicates the surface temperature, while x 1 -x k are independent variables indicating the influencing factors related to the surface temperature.β is the coefficient to be estimated, responding to the linear correlation between Y and x, where β 0 denotes the intercept constant, β 1 -β k denotes the correlation coefficient between each influence factor of x 1 -x k and the surface temperature; ε is the error term that conforms to the positive terrestrial distribution.The matrix equation for the least squares estimation of the parameters is as follows: where X is a design matrix of the independent variables of dimension n × (k + 1), while Y is a column vector of observed dependent variables of dimension n × 1. n is the number of data points, while T is a vector of n + 1 local regression coefficients.When OLS is used to estimate parameters, it is assumed that the observations are independent and unaffected by other factors.When the residuals of OLS regression results are analyzed using the Morans' I index, it is necessary to use local regression methods to improve model stability and prediction accuracy if the residuals are spatially autocorrelated.
The OLS model is a global regression method that cannot account for spatial heterogeneity between the dependent and independent variables.The GWR model, on the other hand, improves the OLS model by incorporating spatial variation into the coefficient estimation process of the explanatory variables in the regression model [48].Unlike traditional global regression models, GWR can analyze data characterized by spatially varying relationships (non-stationarity) with dependent variables [49].GWR constructs a local model to explain the spatial relationship between the relevant dependent and predictor variables [50] with the following equation: where y i , X ik and ε i are the dependent variable, the independent variable, and the random error at point i, respectively.The coordinates represent the position (u i , v i ) of the fixed issue i.The coefficient β k (u i , v i ) indicates the different weights at that location, and β 0 (u i , v i ) shows the geographically diverse intercepts.Thus, the GWR technique enhances the global regression model by integrating geographic location parameters, allowing for the production of local coefficients and elucidation of spatial non-stationarity.The β 0 (u i , v i ) and β k (u i , v i ) are estimated by calculating an unbiased set of observations, wherein the weight matrix is employed to assign varying weights to the comments [51].The distinct β k (u i , v i ) values at different locations differentiate the GWR from the OLS.The distance to the fixed point i greatly impacts the weight matrix.
The weight matrix (W(u i ,v i )) is generally generated using a Gaussian function of the following equation: where d is the distance between observed data points, which can usually be computed using Euclidean distance or other distance metrics, and b 2 is the bandwidth parameter, which controls the rate of decay of the weights.A smaller b 2 causes the weights to decay faster, and only nearby observations have a significant effect on the model, and a larger value of b 2 causes the weights to decay more slowly, and more distant observations have some effect on the model.
The GWR model in this study was constructed using the GWR tool in ArcGIS 10.8 software, and the Akaike Information Criterion (AIC) was used to determine the most bandwidth.Compared with OLS, GWR produces more accurate estimates.Also, the model results reflect the local situation because the sample spatial units can correspond precisely to the coefficients and thus can better restore some regional characteristics of the relationship between the original independent variables [50].

SUHI Intensity Classification
The most common method for modeling the effects of urban sprawl and land use change on SUHII is to calculate the difference between urban and rural temperatures [52].However, in the context of local zone (LCZ) concept, SUHII is defined as the surface temperature difference between built-up areas and green spaces [53].Since the LST of greenfield sites also showed variation, the average LST of greenfield sites (i.e., cropland, forest land, and grassland) was used as the reference temperature for SUHII mapping.Therefore, SUHII was calculated for each combined pixel using Equation ( 15) [54].
where SUHII i is the SUHII at built-up pixel i, T bui is the LST at built-up pixel i, and T gs is the average LST of the greenfield pixels.The further derivation of the SUHII for the whole study area is given in the following Equation (16).
where SUHII i denotes the SUHII at pixel i and n denotes the total number of accumulated pixels.

The Spatiotemporal Characteristics of the LST
The inversion results of LST in Urumqi were obtained for 2000, 2010, 2015, and 2020 with the help of ENVI5.3 software (Figure 3).Based on the results of the LST inversion, the LST of Urumqi was divided into five classes: <10 The statistical analysis of the area and percentage of the LST classes for each year are presented in Table 3.The results showed a significant variation in the LST in Urumqi, with a considerable inter-annual variation.From 2000 to 2020, the share of the area occupied by the temperature range <10 • C decreased by 9.51%, while the percentage of the space occupied by the temperature range 10-20 • C and 20-30 • C kept reducing by 25.46% and 32.12%, respectively.On the contrary, the share of the area occupied by the temperature range 30-40 • C increased by 17.48%, while the percentage of the space occupied by the temperature range >40 • C increased by 49.60%.The results indicate that the area of the lower surface temperature classes has decreased significantly.In comparison, the area of higher surface temperature classes has experienced a steady increase over the last two decades.

Analysis of Spatial and Temporal Variation in SUHII
Figure 4 and Table 4 show the spatial distribution and temporal variations of the SUHII with five temperature classes.The results showed that the SUHII is higher in the core area of Urumqi compared to the suburbs, indicating that the area has low vegetation cover and dense urban buildup.The SUHII continued to increase between 2000 and 2020.Between 2000 and 2010, the SUHII was low in all areas of Urumqi, and the proportion of areas with no SUHII, low SUHII, and moderate SUHII increased by 0.07%, 0.37%, and 0.15%, respectively.At the same time, the proportion of areas with high SUHII and areas with very high SUHII decreased by 0.17% and 0.11%, respectively.From 2010 to 2015, the SUHII increased significantly, and it can be seen that the area of very high SUHII increased significantly, mainly in the north, east and west of the built-up area, and the proportion of the increase was 3.04%, the area of high SUHII in the southern part of the built-up area increased significantly, with an overall increase proportion of 0.42%, and the proportion of the area accounted for by no SUHII, low SUHII and moderate SUHII decreased sharply, with decreasing proportions of 1.16%, 0.73% and 0.38%, respectively.From 2015 to 2020, the SUHII still remained high, and there were minor fluctuations in the SUHII for each class.

Analysis of Spatial and Temporal Variation in SUHII
Figure 4 and Table 4 show the spatial distribution and temporal variations of the SU-HII with five temperature classes.The results showed that the SUHII is higher in the core area of Urumqi compared to the suburbs, indicating that the area has low vegetation cover and dense urban buildup.The SUHII continued to increase between 2000 and 2020.Between 2000 and 2010, the SUHII was low in all areas of Urumqi, and the proportion of areas with no SUHII, low SUHII, and moderate SUHII increased by 0.07%, 0.37%, and 0.15%, respectively.At the same time, the proportion of areas with high SUHII and areas with very high SUHII decreased by 0.17% and 0.11%, respectively.From 2010 to 2015, the SUHII increased significantly, and it can be seen that the area of very high SUHII increased  of the increase was 3.04%, the area of high SUHII in the southern part of the built-up are increased significantly, with an overall increase proportion of 0.42%, and the proportion of the area accounted for by no SUHII, low SUHII and moderate SUHII decreased sharply with decreasing proportions of 1.16%, 0.73% and 0.38%, respectively.From 2015 to 2020 the SUHII still remained high, and there were minor fluctuations in the SUHII for each class.

The Relationship between LULC and LST
Figure 5 and Table 5 show the spatial distribution of different LULC classes in Urumqi City and the corresponding area statistics between 2000 and 2020.Spatially, as shown in the figure, the built-up land was mainly distributed in the west and a small part in the southeast of Urumqi City.There has been a clear trend over the last 20 years for built-up land to expand to the north of the city.The cultivated land area showed a distribution pattern around the built-up land, while grassland was widely distributed in most of the study area.There were many small and dispersed water bodies throughout the study area.The unused land was mainly concentrated in the northern region and the mountainous areas of the study area's northeast, southeast, and southernmost parts.Temporally, grassland was the dominant land use type, with a share of 53.42%, 53.10%, 52.55 and 52.20% in 2000, 2010, 2015 and 2020, respectively.There was a decrease of 1.22% in the area of grasslands in 2020 compared with the year 2000.The unused land was the second most dominant land use type in the study area, with its size decreasing by 0.12% over the past 20 years.The percentage of cropland also showed a downward trend, reducing by 0.69%.The percentage of water bodies was the smallest of all land-use types, and its variation was essentially stable.The built-up land area of all land use types increased by 2.11%.
the area of grasslands in 2020 compared with the year 2000.The unused land was the second most dominant land use type in the study area, with its size decreasing by 0.12% over the past 20 years.The percentage of cropland also showed a downward trend, reducing by 0.69%.The percentage of water bodies was the smallest of all land-use types, and its variation was essentially stable.The built-up land area of all land use types increased by 2.11%.The SUHI is closely related to the land cover type and is important in studying the heat island effect.The statistical results for LST between different land use types were obtained by overlaying the land use data and LST of Urumqi City in 2000, 2010, 2015 and 2020, respectively (Figure 6).The results showed that the LST varied significantly from one category of land use to another.First, the average LST of all land uses in Urumqi increased in all years of the study period, with different rates of increase as follows: unused land > cropland > grassland > built-up land > forest land > water bodies.Secondly, the average LST of built-up and unused land was higher than the average LST of other land use types.The average LST of grassland, forest land, and water bodies was significantly lower.Finally, the standard deviation of the LST of water bodies and unused land was more extensive each year, indicating that the LST of these two land use types varied widely and irregularly within the study area.This is mainly because the water bodies in the area studied are minor and include ice and snow, as well as the fact that there is a large area of desert and Gobi in the northern part of the study area, which leads to a significant difference in their LST.Moreover, the standard deviation of the LST was more minor for built-up land and forest land, and they had stable high-and low-temperature environments, respectively.

Results of the OLS and GWR Model
To further explore the relationship between the LST and the influencing factors, th OLS and GWR models were used in this paper, and their results were compared.Th results of the OLS model are presented in Table 6.To avoid the problem of multicolline arity, the variance inflation factor (VIF) was calculated.The variance was removed if th VIF value was greater than 10.In the models used in this paper, the VIF values of th respective variables did not exceed a discount of 10.The regression model's coefficient o determination (R 2 ) was estimated at 0.58, indicating a good fit.For the Koenker (BP) sta tistic, Prob (probability) was less than 0.01, which was statistically significant.The hig value of the joint Chi-square statistic from the reported data, with a likelihood of muc

Results of the OLS and GWR Model
To further explore the relationship between the LST and the influencing factors, the OLS and GWR models were used in this paper, and their results were compared.The results of the OLS model are presented in Table 6.To avoid the problem of multicollinearity, the variance inflation factor (VIF) was calculated.The variance was removed if the VIF value was greater than 10.In the models used in this paper, the VIF values of the respective variables did not exceed a discount of 10.The regression model's coefficient of determination (R 2 ) was estimated at 0.58, indicating a good fit.For the Koenker (BP) statistic, Prob (probability) was less than 0.01, which was statistically significant.The high value of the joint Chi-square statistic from the reported data, with a likelihood of much less than 0.01, indicated that the regression model setup was statistically significant.The above analysis shows that this regression model is statistically significant, and the fit is fair, but it is a non-stationary regression model.Meanwhile, among the explanatory variables, the NDVI and slope were negatively correlated with surface temperature, and UI and NDBI were positively correlated with surface temperature.Table 7 presents the results of the GWR model.The results show that R 2 and adjusted R2 of the GWR were 0.75 and 0.73, obviously larger than the 0.58 of the OLS model described above.The GWR model's higher R 2 and adjusted R 2 compared to the OLS model indicates that the relationship between LST and the influencing factors underlying the model may exhibit spatial non-stationarity, and the GWR model performed better than the OLS model.The results indicate that the GWR model can effectively explain the geographically heterogeneous and nonlinear relationships between LST and influencing factors.The spatial distribution of the coefficient estimates of the influencing factors in the GWR model is shown in Figure 7. Positive values in the figure indicate positive correlations, and negative values indicate negative correlations.The higher the saturation, the higher the absolute regression coefficient.As seen in Figure 7, from a quantitative point of view, the estimates of the GWR coefficients showed a spatially varying pattern.Figure 7a,b show that the slope and the NDVI negatively correlated with the LST in most areas, especially in the northern and eastern regions.Figure 7c,d show that NDBI and UI positively correlated with LST in most areas.The absolute value of the NDBI coefficient was the largest, and its contribution to increased LST was the most pronounced.
Both OLS and GWR indicated that land use type influenced the change in surface temperature during the study period.Based on the coefficient values and t-statistics, GWR further revealed the solid spatial heterogeneity of their relationships.In addition, the coefficients t of NDBI and UI in Urumqi showed that most areas had a significant relationship between surface temperature and built-up area.Overall, the regression analysis showed that the land use composition and topographic morphology of the study area were closely related to the SUHI, and the study confirmed that the increase in built-up area tended to exacerbate the SUHI.In contrast, the increase in vegetation cover intensity mitigated the SUHI.In addition, the surface temperature tended to decrease as the slope increased, since construction activities were not advisable at higher slopes.

Analysis of the Relationship between LULC and LST
Urbanization leads to changes in land use and the dynamic characteristics of the ur ban atmosphere, which affects the formation of the UHI [33].The study results show tha the average LST of built-up land and unused land was higher than the average LST o other land use types in all years.The average LST of grassland, forest land, and wate bodies was significantly lower.The difference in average LST between unused land and greenspace (cropland, forest land, and grassland) was 6.21 °C, while the difference in av erage LST between built-up land and greenspace (cropland, forest land, and grassland was 6.03 °C.The results indicate that grassland, forest land, and water bodies can effec tively mitigate the SUHI.In contrast, the subsurface of built-up land is very complex, and many artificial structures consisting of concrete and asphalt roads alter the thermal prop erties of the subsurface due to their rapid heat absorption and low heat capacity.As a result, the subsurface of built-up land warms faster than the natural subsurface consisting of greenbelt and water under the same solar radiation conditions.In addition, surface temperatures are high in all parts of the built-up land due to the dense, tall, poorly venti lated artificial structures.Unused land has a high surface temperature because it has large bare areas that absorb heat quickly [27].Previous studies have also confirmed that the LST of built-up and unused land is higher than that of other land use types [56][57][58].Ma and Peng studied the relationship between LULC and SUHI in Kunming City, China, and found that unused land had a higher average LST than all other land use types, with a

Analysis of the Relationship between LULC and LST
Urbanization leads to changes in land use and the dynamic characteristics of the urban atmosphere, which affects the formation of the UHI [33].The study results show that the average LST of built-up land and unused land was higher than the average LST of other land use types in all years.The average LST of grassland, forest land, and water bodies was significantly lower.The difference in average LST between unused land and greenspace (cropland, forest land, and grassland) was 6.21 • C, while the difference in average LST between built-up land and greenspace (cropland, forest land, and grassland) was 6.03 • C. The results indicate that grassland, forest land, and water bodies can effectively mitigate the SUHI.In contrast, the subsurface of built-up land is very complex, and many artificial structures consisting of concrete and asphalt roads alter the thermal properties of the subsurface due to their rapid heat absorption and low heat capacity.As a result, the subsurface of built-up land warms faster than the natural subsurface consisting of greenbelt and water under the same solar radiation conditions.In addition, surface temperatures are high in all parts of the built-up land due to the dense, tall, poorly ventilated artificial structures.Unused land has a high surface temperature because it has large bare areas that absorb heat quickly [27].Previous studies have also confirmed that the LST of builtup and unused land is higher than that of other land use types [56][57][58].Ma and Peng studied the relationship between LULC and SUHI in Kunming City, China, and found that unused land had a higher average LST than all other land use types, with a difference in mean LST of 2.45 • C between unused land and green space [59].Cai et al. found in the study of Fuzhou City that built-up land had the highest LST, followed by unused land from 1989 to 2009, with a difference in mean LST of 1.7 • C between built-up land and green space [60].Saha et al. mainly analyzed the spatial relationship between LULC and LST in three urban agglomerations in the eastern part of India, and found that built-up areas exhibited comparatively higher LST and SUHI, with a difference in mean LST of 4.7 • C between built-up land and green space [61].Njoku and Tenenbaum studied the relationship between LULC and LST in Ilorin, Nigeria, and found that LST values exhibited considerable spatial and temporal variation, with the high temperature clusters evident in the built-up areas, with a difference in mean LST of 4.4 • C between built-up land and green space [62].Moisa and Gemeda assessed the urban thermal field variance of Addis Ababa metropolitan city, Ethiopia, and found that the highest LST was recorded in low-vegetation areas, particularly on built-up areas, cropland, and bare land [63].At the same time, Song et al. found a 3.4 • C difference between the average LSTs for built-up land and other land use types, including cropland, grassland, and forestland, using Landsat ETM+ and Quick Bird data in a SUHI study in Beijing, China [58].The SUHI study conducted by Weng et al. in Indianapolis, USA, using Landsat ETM+ data, found that the difference in average LST between built-up land and green spaces was even higher, as much as 5.4 • C [64].

The Model Performance and Influencing Factors of SUHI
In this study, we adopted the OLS and GWR models to quantify the relationship between the SUHI and the relevant driving factors.The results show that the GWR model performed better than the OLS model, and the relationship between the LST and the influencing factors exhibited spatial non-stationarity.The findings are in line with those of other studies.For example, Li et al. investigated the relationship between surface urban heat islands and the driving factors in 419 major cities.They found that the GWR model had higher R 2 than the OLS and stepwise multiple linear regression (SMLR) models [38].Similarly, Gao et al. explored the relationship between the SUHI effect and morphological variables.They found that the GWR model significantly improved modeling fit by capturing spatial heterogeneity compared to the OLS model [65].In addition, the results of both OLS and GWR models show that the NDVI and slope were negatively correlated with the LST, while the UI and NDBI were positively correlated with the LST.The results of the study are consistent with other studies.A study by Ma and Peng found that the LST was negatively correlated with the NDVI and positively correlated with the NDBI in the study of the SUHI in Kunming, China [59].Anniballe et al. studied the spatiotemporal trends of the SUHI over the city of Milan and found that SUHI was highly correlated with NDVI [66].Derdouri et al. studied the thermal variations in eight key Moroccan cities and found that the LST was positively correlated with the NDBI and negatively correlated with the NDVI [67].Similarly, Shahfahad et al. modeled the relationship between UHI and land use indices in Delhi and Mumbai, India.They found that the LST was negatively correlated with the NDVI and positively correlated with the NDBI [68].

Limitations and Future Work
This study investigated the spatiotemporal characteristics of the SUHI and its relationship with land use types from 2000 to 2020 in Urumqi City, in the arid and semi-arid region of northwestern China.Additionally, the OLS and GWR models were used to quantify the relationship between the LST and influencing factors.The study results can provide urban planners and policymakers with a better understanding of the spatiotemporal characteristics of the SUHI and its drivers.However, there are still limitations.First, we could not consider the SUHI's seasonal and daily characteristics due to limited access to the data source and the cloud cover.Future research may focus on the SUHI's seasonal and diurnal daily characteristics through integrating multisource data such as Landsat and Modis.In addition, the city is a complex, dynamic system of social connections, human activities, and infrastructure.UHIs are caused by several factors, including local climate, socioeconomic conditions, urban size, and urban form [19].However, we only considered four influencing factors in this study.Systematic research is needed to determine how these factors contribute to the formation of the UHI.Furthermore, the Landsat TM and Landsat OLI images from different Landsat sensors were used as the main data sources in this study.Remote sensing images from different Landsat sensors have different thermal infrared bands, which have some influence on the accuracy of the surface temperature inversion [69].Therefore, further research is needed to eliminate related uncertainties, especially when precise LST change detection or temporal trends are required.Finally, the LSTs for the year 2005 were not analyzed in this study because the corresponding land use data from the Chinese Academy of Sciences were not available.This may have some impact on the consistency of temporal trends in LST in the study area.

Conclusions
In this study, the spatiotemporal characteristics of UHI intensity were quantified from 2000 to 2020 using Landsat remote sensing data in Urumqi City, located in the arid and semi-arid region of northwestern China.A GWR model was developed to assess the relationship between the UHI and various influencing factors.It was further compared with the OLS model.The results show that the area of the lower surface temperature classes has decreased significantly.In comparison, the area of higher surface temperature classes has experienced a steady rise over the last two decades.In addition, the LST varies considerably from one category of land use to another.The average LST of built-up land and unused land was higher than the average LST of other land use types in all years, while the average LST of grassland, forest land, and water bodies was significantly lower.Finally, the results of the GWR model show that R 2 and adjusted R 2 of the GWR were 0.75 and 0.73, obviously larger than the 0.58 of the OLS model.The GWR model's higher R 2 and adjusted R 2 compared to the OLS model indicated that the relationship between LST and the influencing factors underlying the model may exhibit spatial non-stationarity, and the GWR model performed better than the OLS model.The results of both OLS and GWR models show that the NDVI and slope were negatively correlated with LST, while the UI and NDBI were positively correlated with LST.The results of the study revealed that the increase in the LST was closely related to urban expansion, and the urban heat island effect can be significantly mitigated with the aid of green spaces.The results of the study suggest that increasing the green spaces (grassland, forest, jungle, orchards, etc.) in the city and limiting the disorderly expansion of urban areas are effective means to alleviate the UHI effect in the study area.

Figure 1 .
Figure 1.Geographical location (a) and topographic map (b) of the study area.

2. 2 .
Data Sources Landsat remote sensing data for 2000-2020, land use data for 2000-2020, digital elevation model (DEM) data at a resolution of 30 m, and the administrative boundary vector file were used in this study.Landsat remote sensing data and DEM data and the administrative boundary vector file of the study area were sourced from the Geospatial Data Cloud Platform of the Computer Network Information Center of the Chinese Academy of

Figure 1 .
Figure 1.Geographical location (a) and topographic map (b) of the study area.
Land 2023, 12, 2012 7 of 20 Land 2023, 12, x FOR PEER REVIEW 7 of 21sides of the city.It is evident that the distribution of NDBI and UI is related to the urban substrate.

Figure 2 .
Figure 2. Spatial distribution of selected influencing factors in Urumqi.

Figure 2 .
Figure 2. Spatial distribution of selected influencing factors in Urumqi.

Figure 3 .
Figure 3. Spatial distribution maps of LST for Urumqi City for four consecutive years ((a-d) refers to the spatial distribution of LSTs in 2000, 2010, 2015 and 2020, respectively).

Figure 3 .
Figure 3. Spatial distribution maps of LST for Urumqi City for four consecutive years ((a-d) refers to the spatial distribution of LSTs in 2000, 2010, 2015 and 2020, respectively).

Figure 5 .
Figure 5. Spatial distribution maps of LULC for Urumqi City for four consecutive years ((a-d) refers to the spatial distribution of land use in 2000, 2010, 2015 and 2020, respectively).

Figure 5 .
Figure 5. Spatial distribution maps of LULC for Urumqi City for four consecutive years ((a-d) refers to the spatial distribution of land use in 2000, 2010, 2015 and 2020, respectively).

Land 2023 , 2 Figure 6 .
Figure 6.Differences in LSTs between land use types ((a-d) refers to the, mean LSTs and standar deviations of the different land use types in 2000, 2010, 2015 and 2020, respectively).

Figure 6 .
Figure 6.Differences in LSTs between land use types ((a-d) refers to the, mean LSTs and standard deviations of the different land use types in 2000, 2010, 2015 and 2020, respectively).

Table 1 .
Descriptions of Landsat TM/OLI images used in the study.

Table 3 .
The area and percentage of the LST classes for Urumqi City for four consecutive years.

Table 3 .
The area and percentage of the LST classes for Urumqi City for four consecutive years.

Table 4 .
Spatiotemporal variation in the areal coverage of SUHII zones.

Table 4 .
Spatiotemporal variation in the areal coverage of SUHII zones.

Table 5 .
Proportion of land use area in Urumqi by year.

Table 6 .
The OLS model results for LST in Urumqi City.

Table 7 .
The GWR model results for LST in Urumqi City.