An Application of the LCZ Approach in Surface Urban Heat Island Mapping in Soﬁa, Bulgaria

: This article presents the results of the thermal survey of the capital of Bulgaria (Soﬁa) carried out in August 2019, with the application of an unmanned aerial system (UAS). The study is based on the concept of local climate zones (LCZs), taking into account the inﬂuence of the features of land use/land cover and urban morphology on the urban climate. The basic spatial units used in the study are presented in the form of a regular grid consisting of 3299 cells with sides of 250 × 250 m. A total of 13 types of LCZs were identiﬁed, of which LCZs 6, 5, 8, 4, D, and A form the largest share. In the thermal imaging of the surface, a stratiﬁed sampling scheme was applied, which allowed us to select 74 cells, which are interpreted as representative of all cells belonging to the corresponding LCZ in the urban space. The performed statistical analysis of the thermal data allowed us to identify both the most thermally loaded zones (LCZs 9, 4, and 5) and the cells forming Urban Cool Islands (mainly in LCZs D and C). The average surface temperature in Soﬁa during the study period (in the time interval between 8:00 p.m. and 10:00 p.m.) was estimated at 20.9 ◦ C, and between the different zones it varied in the range 17.2–25.1 ◦ C. The highest maximum values of LST (27.9–30.6 ◦ C) were registered in LCZ 4 and LCZ 5. The relation between the spatial structure of the urban thermal patterns and urban surface characteristics was also analyzed. Regression analysis conﬁrmed the hypothesis that as the proportion of green areas increases, surface temperatures decrease, and, vice versa, as the proportion of built-up and impermeable areas increases, surface temperatures increase. A heat load map (via applying a z-transformation to standardize the temperature values), a map of the average surface temperature, and a map of the average intensity of the heat island on the surface were generated in the GIS environment. The results of the study adequately reﬂect the complex spatial model of the studied phenomenon, which gives grounds to conclude that the research approach used is applicable to similar studies in other cities.


Introduction
The urban heat island (UHI) is one of the most significant examples of the impact of cities on the environment. This phenomenon was first documented in 1818 for the city of London by the "father of urban climatology" Luke Howard [1][2][3].
An UHI is characterized by positive temperature differences between the city and its surroundings, which usually reach their maximum a few hours after sunset [4]. As a traditional measure of the intensity of the phenomenon, the difference between two measurement locations is used-one representing "urban" conditions and the other-"rural" [5]. The magnitude of the temperature difference (∆T u-r ) correlates positively with the size of the city [6], but there are studies that do not establish such a relationship [7]. The study of UHIs is not only of scientific interest, but also has important practical implications for urban planning. The negative effects of an UHI are usually associated with a general deterioration of the urban environment, with increased health risks, increased consumption of energy for cooling, the frequent formation of smog, an increased concentration of ground-level The classification divides urban and rural landscapes into 17 standard classes, each defined by ten structural or "built-up" (LCZs numbered from 1 to 10) and seven land cover or "natural" (LCZ named with letters from A to G) climate-relevant surface properties that influence the air temperature. Thus, according to the building types, LCZs from 1 to 3 are compact high-rise, compact midrise, and compact low-rise; LCZs from 4 to 6 are open high-rise, open midrise, and open low-rise; and LCZs from 7 to 10 are lightweight low-rise, large low-rise, sparsely built-up, and heavy industry. Based on seasonal fluctuations, it is possible to differentiate additional variable subclasses of LCZs (b-bare trees, s-snow cover, d-dry ground, and w-wet ground). LCZs are regions of uniform surface cover, structure, material, and human activity spanning hundreds of meters to several kilometers on a horizontal scale. In the context of the LCZ classification system, the urban heat island intensity is not an urban-rural near-surface air temperature difference (∆Tu-r), but an air temperature difference between pairs of LCZ types (∆T LCZ X-Y ; that is, an inter-zone temperature difference [47]. Available spatial databases on urban structures are used as primary sources of information. This includes digital cadastre data, remotely received LULC and LST data, orthophotos, DEM/DTM data and building geometry (via LiDAR-and SAR-based technologies), data from ground stations for T a , and freely available data from OpenStreetMap, Google Earth Engine, Google Street View, etc. Primary methods for LCZ mapping include in situ measurements, geographic information system (GIS)-based and remote-sensingimage-based calculations (the WUDAPT method), or a combination of them, which are presented in detail in many publications [48][49][50][51][52][53][54][55][56]. To improve the accuracy, some scientists offer an alternative GIS-based workflow to translate Copernicus datasets from Urban Atlas and Corine Land Cover shapefiles and Raster GeoTiff into LCZ maps, which, however, limits the possibilities for practical application of the method outside major European cities [57]. Testing of the methodology for five southern European cities (Athens, Barcelona, Lisbon, Marseille, and Naples) shows promising results [58]. At the same time, methods for GIS-based spatial analysis and UHI map generation are being improved [59], as well as the interpolation techniques for cartographic representation [60]. Correlation analysis and regression modeling are commonly used to detect the influence of LULC and urban morphology on the UHI [61,62]. Thanks to advances in geospatial technologies, UHI/SUHI maps based on the LCZ concept have been created for hundreds of cities around the world today. There are also examples of the application of the LCZ system in studies for which it was not originally intended (e.g., for mapping and assessment of urban ecosystem conditions and services [63] and urban ecohydrological studies [64]).
Although remote thermal studies do not provide direct observations of T a in the lower layer of the urban atmosphere, they give an idea of SUHI, which is the main engine of atmospheric UHI. The various methods for remotely acquiring LST data have both advantages and disadvantages. The main advantage of using data from satellite sensors is that LSTs can be observed simultaneously over a large area and temperature variations can be analyzed depending on the influence of certain factors (LULC and urban morphology). Using appropriate satellite data depends on the individual sensors and their spectral and time resolution. Still, the optimal time to receive data may not coincide with the time the satellite passes over the target object because the temporal resolution is not controlled by the user [65].
Unlike satellite data, data from airborne thermal remote sensing (from airplanes and helicopters) have a much higher level of detail, which allows for the study of microscale variations in LST between the individual elements of the urban landscape. Another significant advantage is that you can choose the most appropriate time of day to receive thermal data. However, the disadvantages are the high costs, the complex organization, and the strict rules of flight management in a country. This type of observation is usually combined with ground temperature measurements from stationary and/or mobile meteorological stations [66][67][68][69].
An alternative approach is the use of unmanned aerial vehicle systems (UAVs, or drones), which are increasingly used in various scientific fields [70]. These platforms can be equipped not only with thermal, but also with multispectral and hyperspectral sensors, as well as with LiDAR, which allows for obtaining very detailed information about the ground cover and urban morphology needed to generate 2-D and 3-D models of the urban environment. This is important for the assessment of the heat load of cities, as the total area of the different types of heat-emitting surfaces in them is much larger than in the suburbs [71]. In recent years, there have been many examples of successful use of UAV-based thermal studies in different parts of the world. For instance, in several studies, Gaitani et al. [72][73][74] presented the results of a thermal survey in a suburb of Athens, and Naughton & McDonald [75] used a UAV quadcopter drone and thermal camera at two university campuses in Milwaukee (Wisconsin) and El Paso (Texas). However, we are not aware of any UAV-based thermal studies of LCZs that have been performed so far throughout the whole city. The main advantage of thermal studies with an UAV is obtaining high-quality data for LST with a very high spatial resolution in a flexible, easy, and accessible way. On the other side, the application of UAVs often is affected by a number of limitations related to urban safety regulations and privacy [76].
Collecting air temperature data with an appropriate spatial resolution for an entire city like Sofia is a complex task that requires expensive equipment and labor-intensive organization. A possible approach is to use remote sensing to provide a snapshot of the LST in time as a proxy for air temperature. This publication presents some of the results of a more extensive study using GIS-based spatial analysis methods, traditional satellite data sources, and high spatial resolution data from unmanned aerial vehicle systems (UAVs) [77]. The paper is organized as follows. After reviewing the literature on the topic (this Section), we present the approach used in the development of the map of LCZs in Sofia (Section 2.2) and UAV-based thermal surveys performed using a stratified sampling scheme (Section 2.3). In Section 3.1, we present the statistical characteristics of LST by LCZs, in Section 3.2 we perform spatial interpretation and analysis of the results obtained from the thermal imaging of the surface, and in Section 3.3 we demonstrate the map of the intensity of SUHI in Sofia. In Section 4, we discuss the results of the research, and in Section 5 we draw some conclusions about future research.

Study Area
The present study is focused on the urbanized area of Sofia-the capital of Bulgaria. The city is located in the largest valley in Bulgaria-Sofia (about 1200 sq. km) at 550 m above sea level. The relief of the valley floor is flat, with a slight slope up to the north. To the north, it is surrounded by the Balkan Mountains and to the south-by the Vitosha Mountain. The climate is temperate continental with four clearly distinguished seasons. According to the climatic classification of Köppen-Geiger, the study area has a climatic index Cfb (C-warm temperate, f-fully humid, b-warm summer), [78,79]. According to the Sofia Station, which has been in operation from 1881 to the present, the average annual temperature is 10.3 • C, and the average yearly rainfall is 612 mm, with a maximum in May-June. The average temperature of the coldest month is -1.7 • C and the warmest month is 21.2 • C. The measured absolute minimum air temperature is -27.5 • C, and the absolute maximum temperature is 37.4 • C ( Figure S1). Sofia's urban area is a complex combination of different functional zones (administrative, residential, industrial, transport, commercial, parks, etc.), which have been developed at different times, over a very long period.

Local Climate Zones (LCZs)
For the study, an integrated spatial model of local climate zones (LCZs) was created, which is based on the classical concept developed by Oke and Stewart. In the present study, preliminary research and fieldwork were performed to determine the location of specific areas within a given LCZ. Metadata sources include information from aerial imagery with an UAV, orthophotos, a DEM, LULC maps, cadastral maps, street maps, Copernicus datasets from Urban Atlas, and data from Google Earth and Google Street View, which were integrated and organized into a target geodatabase. This, in turn, made it possible to determine the general characteristics of the urban landscape and to specify the spatial distribution of different types of local surfaces (the impervious areas and the Blue-Green Infrastructure elements) and the peculiarities of the urban morphology (e.g., street center-lines and width on the streets, building footprints, the density and height of construction).
Usually, the basic spatial units for which LCZ information is collected are regular grids (with a resolution of 100-500 m) [80,81], lot area polygons [82], urban blocks [83], or urban function zones (UFZs) [84]. For this study, a regular grid was used, consisting of 3299 cells with sides of 250 × 250 m, each cell covering an area of 6.25 ha. In this way, both the unification of the range of the basic units, which are the subject of research, is achieved, and the application of the geostatistical and spatial-interpolation methods in the further analysis is facilitated. Based on similarities in land use and morphological characteristics, most of these cells were grouped into different LCZ classes. A basic rule in determining the affiliation of each cell to a particular LCZ is that of the predominant area of its surface (majority rule), i.e., if different morphological and land-use types fall within the scope of a cell, the cell is determined by the predominant type. This approach, although leading to a greater degree of generalization, allows for a better adaptation of spatial data for the purposes of stratified sampling in subsequent geostatistical processing. Based on orthophoto images and three-dimensional models of the surface in the city of Sofia provided in the Google Earth Pro platform, as well as with the help of Google Street View, a detailed analysis of the more complex areas (cells) and validation of already defined group cells were performed. In determining the height of the building, information on the height of buildings with an Urban atlas source with a resolution of 10 m was used, as the height information is based on IRS-P5 stereo images and is presented as a digital surface model ( Figure S2). With the results of this analysis, a map of LCZs was prepared, and, subsequently, some previously unconfirmed areas were validated by the field team. Thus, 13 types of LCZ were identified within the urban area ( Figure 1). Built classes cover around 71.1% of the total study area, compared with 28.9% of the non-built classes. Most classes are in the LCZ types 6, 5, 8, 4, D, and A. The share distribution for each type of LCZ is presented in Table 1.

UAV-Based Thermal Survey of LCZs
LST data within the urbanized space of Sofia were collected in the field for selected representative cells in the grid using an UAV with an installed Thermal Infrared (TIR) sensor. In this case, the Albris platform of the Swiss company Sensefly was used, which is a V-shaped inspection quadcopter with five sensors and three camcorders, allowing for three types of imaging with image detail below 1 cm depending on the subject of study. The TIR camera has parameters of 80 × 60 pixels, with a 50-degree angle and a fixed measuring point, positioned on a three-axis head (gimble). The thermal imager is an uncooled microbolometer type with a thermal sensitivity of 150 mK and an accuracy of ±3 • C or ±5% of reading. The unmanned aerial system enables remote capture of thermal images and video with a dynamically adaptable scale in real-time ( Figure 2).
Stratified sampling (STR) was applied to the selection of representative cells, in which the population comprising all 3299 cells was first divided into non-overlapping subpopulations (13 LCZ classes), called "strata", and sampling was performed independently in each of the strata. As the individual areas of LCZs vary widely, the weights of the areas whose total sum is equal to 1 were also determined; for example, LCZ 6 received a weight of 0.219, as it occupies the most significant space of all zones, and LCZ G recevied a weight of 0.007 because it has the smallest area compared with all other sites. When transmitting the required number of cells (samples) in each zone, the sample size was determined by the method of irreversible selection [85][86][87]. An essential advantage of the stratified sample, compared with other sampling schemes (simple random, systematic, clustered, etc.), is that with the same sample size the results have less stochastic error and higher accuracy due to less scattering in the strata compared with the total scattering in the population [88][89][90].
As a result of the statistical approach used, a total of 74 cells belonging to different local climatic zones were identified from all 3299 cells (Table 1). The locations of these cells play the role of "virtual meteorological stations", and the measured LSTs in the respective cells are interpreted as representative of all cells belonging to a given LCZ in the urban space. The most appropriate study period of the year is August, which is the month with the highest average temperature of the Earth's surface, which is a fundamental prerequisite for the most visible signs of the UHI phenomenon. Its effect is most clearly observed during the early evening hours, immediately after the active sunshine, with a relatively windless and cloudless sky in the situation of anticyclonic synoptic conditions. In this part of the day, there is the best opportunity to maximize the potential for precise identification of micro and local climatic differences between urbanized territories and their response to the UHI. Regarding this, the time interval between 8:00 p.m. and 10:00 p.m. was selected for thermal imaging observations for the 74 predefined grid cells. The UAV-based thermal imaging campaign was conducted over eight days on calendar dates 7, 8, 12, 13,14, 21, 22, and 23 August 2019.
In each of the studied cells, there are different types of land cover and types of construction, which are in different proportions. Therefore, in the range of each cell, the areas of four main categories of surfaces were quantified: (1) Sealed surfaces; (2) Buildings; (3) Vegetation; and (4) Water bodies, that were used in the calculation of the weighted average LST value for the given thermally examined cell.

Results
As a result of the research, a general geoinformation model of LCZs of the urbanized space of Sofia was created in the form of a standardized grid of size 250 × 250 m and various types of attributive information obtained through the integrated application, in situ observations and surveys, mapping of the territory via an UAV, and classical remote sensing and geostatistical and geospatial analytical operations.

LST Statistics by LCZs
For each LCZ, the statistics of the LSTs measured in them were processed, with the exception of LCZ 3 and LCZ G, which are represented by only one sampling cell and contain only one temperature value. Of all 74 cells, 58 cells covered 77.3% of the total area of all cells examined. These are LCZ 4 (7 cells), LCZ 5 (15 cells), LCZ 6 (12 cells), LCZ 8 (14 cells), and LCZ D (10 cells). Table 2 presents the statistical characteristics of the LST of the studied cells. The highest mean and median values of LST (25.1 • C) were registered in LCZ 9, but due to the insignificant relative share of the zone (~1.0%) its contribution to the total heat load of the city is minor. On the other hand, the highest maximum values of LST (27.9-30.6 • C) were observed in LCZ 4 and LCZ 5, which occupy a significant share (about 30%) of the urban area. As expected, the lowest temperatures are measured in the "natural" classes of climatic zones (LCZ A-LCZ G). Intra-LCZ variability in the measured LSTs is best seen in the Box-Plot diagrams of Figure 3. It can be seen that the most extensive range of LST is in LCZ 4 (11.8 • C) and in LCZ 5 (8.9 • C), which can be explained by the more complex and diverse internal spatial structure of these areas-the presence of complex combinations of anthropogenic elements (buildings with different stores and geometries and differences in the type and area of sealed spaces) and natural features (green spaces with diverse vegetation and/or bare permeable soil substrates).

Spatial Interpretation and Analysis of Field Research Data
In this part of the study, we present the results of the performed z-transformation of the temperature values, as well as the generated map based on this transformation, showing the spatial distribution of the different degrees of heat load in Sofia. Then, we present the constructed regression model, revealing the relationship between the main types of land cover and the measured temperatures in them. Finally, we present the map of the spatial distribution of temperature values.
The data for LST were used for the preliminary estimation of the heat load in Sofia. The heat load can be categorized into four stages (most favorable, favorable, less favorable, and most unfavorable) on the basis of the standardized temperature values by the applied z-transformation according to the following formula [91]: where X is the mean temperature of a single test cell, µ is the mean temperature of all cells tested, and σ is the standard deviation of the mean LST of all cells. Table 3 and Figure 4 present the results of the spatial z-transformation of the temperature values. The areas with values of z <−1 are distinguished by the lowest levels of heat load. They cover 16.22% of the urbanized area and include mainly forested areas, areas with grass or shrubs and surface water bodies (LCZ A, LCZ C, LCZ D, and LCZ G), as well as low-rise areas with yards and gardens, with trees scattered between the buildings (LCZ 6).
The lowest value of the z-score in this category is registered in LCZ D (z-score = −2.10). Category 4 covers 12.16% of the urbanized area and shows the highest degree of heat load (hot island) with a value of z of over 1 (i.e., areas with a LST higher than the mean + one std). These are mainly areas with high and medium-high construction (LCZ 4 and LCZ 5), as well as large low-rise buildings with extensive sealed spaces between them (LCZ 8). Even single cells with z-score = 3.11 (LCZ 4) and z-score = 2.24 (LCZ 5) fall into this category. The other categories (2 and 3) occupy the largest share of the urbanized territory (71.62%).
The results from the in-situ UAV-based thermal surveys were used to construct linear regression dependencies through the least-squares method between the areas of the main land-use types (as an independent quantity) and the average measured temperatures by land-use types (as a dependent value). The general conclusions confirm the hypothesis that the greater the proportion of green space, the lower are the measured temperature values, and vice versa-the higher the proportion of built-up and impervious areas, the higher are the surface temperatures ( Figure 5).  Using the created regression dependencies and the spatial distribution of the measurements made by the UAV, a geostatistical model of the spatial distribution of the surface temperature within the city of Sofia was created. For this purpose, an approach for deterministic local interpolation was applied whereby the predicted values are generated within local groups of adjacent points based on the network of measured values and their locations via the UAV. The result of spatial interpolation clearly shows that within the urban space in Sofia a poly-structural (polycentric) urban heat island has been formed ( Figure 6). The great diversity of the territories involved in the different LCZs, as well as the high spatial variability in their spatial structure, determines to a large extent the complex and poly-structural nature of the UHI phenomenon. For example, only one section of 1000 square meters exhibits significant temperature contrasts. The figure below shows a fragment of a field measurement of the surface temperature at 10:00 p.m., carried out in the area of a large hypermarket in Sofia. The image clearly shows that within this confined space, substantial differences in surface temperature are observed-in the grassland, it is within 12.9-13.1 • C, in the parking places made of concrete rosette panels it is 17.1 • C, and on the asphalt road it is 23.6 • C (Figure 7).

Map of SUHI Intensity
The observed temperature differences characterize the complexity of the UHI and represent one of its most important parameters-its intensity. The intensity of SUHI represents the temperature difference (∆T) between two local climatic zones x and y (∆T LCZx -LCZy ). Inter-LCZ differences in measured LST in Sofia are presented in Table 4. Based on the data from the matrix in the table, a statistical surface was created using local polynomial interpolation, representing in a continuous form the UHI magnitude in the urban space of Sofia (Figure 8).

Discussion
The study clearly showed that the "artificial" LCZ classes have significantly higher surface temperatures than the "natural" LCZ classes, which are relatively cooler. It was also found that there are significant temperature differences not only between the individual LCZs, but also within the zones themselves. Therefore, future research should focus on revealing the micro-scale effects caused by the influences of the type of construction, the orientation of the buildings, the ventilation possibilities, the urban canyons, the used building materials, the peculiarities of the green infrastructure, the meteorological conditions and several other factors affecting the climate in Sofia.
The range and spatial configuration of SUHI show significant variability over relatively short distances due to the complex combination of anthropogenic and natural elements forming the urban landscape. Therefore, the SUHI looks more like an "archipelago" than a clearly defined single thermal island. However, the warmest places generally coincide with the residential areas, which also affects the climatic comfort of the people. Therefore, future micro-scale studies should focus mainly on these urban areas.

Conclusions
The results of the study adequately reflect the complex spatial model of the studied phenomenon, which gives grounds to conclude that the research approach used is applicable to similar studies in other cities. The developed geoinformation model of LCZs and the related integrated database constitute a suitable basis for further research related to mapping and assessing the effects of the urban heat island. Because of advancing urbanization and deepening climate change, urban areas will inevitably become the focus of such research. The approaches, methods, and technological solutions used in the present study were fully consistent with the challenges that objectively exist in the study of UHIs, as well as with the complex geospatial nature of this phenomenon. We are taking into account all advantages as well as the disadvantages of the used UAV-based solution, whose major problem is the relatively low resolution of the thermal imager. Therefore, currently, we are moving to a more advanced solution based on the integration of thermal radiometric infrared (FLIR) and and a photogrammetric camera. This is the Duet T sensor of the Sensefly Ebee X platform (https://www.sensefly.com/camera/sensefly-duet-t-thermal-mapping-camera/, accessed on 15 September 2021), which has provided very promising results in our preliminary research experiments.
In our opinion, because of the need for data with an extremely high spatial resolution for the changing urban environment, as well as based on the requirements for the time in which the field mapping activities of the thermodynamic characteristics of urban areas should be carried out, modern technological means, such as unmanned aerial systems, provide a high level of flexibility and operability while providing opportunities for accurate collection of high-resolution geospatial data, objectively characterizing various aspects of the territory, including land cover, land use, urban morphology, and local climatic conditions.
The latest technological innovations related to thermal photogrammetry and hyperspectral sensors installed on UAVs open up new possibilities for detailed studies on the urban climate and the factors that determine it. The integration of these new geospatial technologies with traditional geographic information systems and tools will contribute to a better understanding of the mechanism of manifestation and spatial distribution of UHI effects, which is an essential prerequisite for mitigating its adverse effects on urban areas and their adaptation to climate change.