Inference of Local Climate Zones from GIS Data, and Comparison to WUDAPT Classiﬁcation and Custom-Fit Clusters

: A GIS-based approach is used in this study to obtain a better LCZ map of Berlin in comparison to the remote-sensing-based WUDAPT L0 approach. The LCZ classiﬁcation of land use/cover can be used, among other applications, to characterize the urban heat island. An improved fuzzy logic method is employed for the purpose of classiﬁcation of the zone properties to yield the GIS-LCZ map over 100m × 100m grid tiles covering the Berlin region. The zone properties are calculated from raster and vector datasets with the aids of the urban multi-scale environmental predictor (UMEP), QGIS and Python scripts. The standard framework is modiﬁed by reducing the threshold for the zone property impervious fraction for LCZ E to better detect paved surfaces in urban areas. Another modiﬁcation is the reduction in the window size in the majority ﬁlter during post-processing, compared to the WUDAPT L0 method, to retain more details in the GIS-LCZ map. Moreover, new training areas are generated considering building height information. The result of the GIS-LCZ approach is compared to the new training areas for accuracy assessment, which shows better overall accuracy compared to that of the WUDAPT L0 method. The new training areas are also submitted to the LCZ generator and the resulting LCZ-map gives a better overall accuracy value compared to the previous (WUDAPT) submission. This study shows one shortcoming of the WUDAPT L0 method: it does not explicitly use building height information and that leads to misclassiﬁcation of LCZs in several cases. The GIS-LCZ method addresses this shortcoming effectively. Finally, an unsupervised machine learning method, k-means clustering, is applied to cluster the grid tiles according to their zone properties into custom classes. The custom clusters are compared to the GIS-LCZ classes and the results indicate that k-means clustering can identify more complex city-speciﬁc classes or LCZ transition types, while the GIS-LCZ method always divides regions into the standard LCZ classes.


Introduction
The urban heat island (UHI) effect has been defined as a phenomenon whereby the near-surface urban canopy air temperature is, on average, higher than that of its surrounding countryside [1]. Its intensity is likely to keep increasing in the future due to population and urbanization growth [2]. The UHI intensity characterizes urban climates and is related to a negative impact on the environment by increasing energy demand due to an increase in air conditioning, elevating emission of greenhouse gases and air pollutants, endangering human health and comfort and impairing water quality [3]; therefore, the prediction of UHI is becoming significantly important.
Mesoscale numerical climate simulation software such as the Weather Research and Forecasting (WRF) [4] as well as simplified urban canopy models [5] is used to estimate the UHI intensity; however, they require highly resolved and accurate land use/land cover based on the LCZ scheme [21]. A GIS-LCZ map was also generated for Vienna, Austria by Hammeberg et al. [22]. This work applies 100 m × 100 m grid tiles as the BSU and employs five zone properties: building height, aspect ratio, and the usual three surface fractions. The classification of the zone properties utilizes a naive Bayes classifier. There was no post-processing performed in this study. The evaluation for the classification result was carried out by comparing it to the WUDAPT L0 map.
Wang et al. [14] derived the GIS-LCZ map for Hong Kong applying a 100 m × 100 m grid tile as the BSU. They used three zone properties and one additional land use data. The zone property's building height, building surface fraction, and sky view factor were employed to classify the built-up classes (LCZs 1-10). Additional land use data are used for the classification of the land cover classes (LCZs A-G). The classification was achieved by modifying the standard rules proposed by Stewart and Oke [7]. An accuracy assessment was carried out for the resulting GIS-LCZ map by comparing it to the established validation samples.
Another GIS-LCZ method was conducted by Estacio et al. [23] for Quezon City, Philippines. The study employed seven zone properties: sky view factor, building height, roughness length, surface albedo, and the three usual surface fractions. These zone properties were calculated over 100 m × 100 m grid tiles, which were further classified by applying a fuzzy logic algorithm modified from Lelovics et al. [18]. The result of the classification was aggregated by using cellular automata to derive the LCZ map. The map was validated using expert knowledge. The land surface temperature profile for each LCZ type was also assessed in this research.
A clustering method was also applied in the work of Hidalgo et al. [24] to classify three cities in France (Toulouse, Paris and Nantes) based on the GIS datasets. They used eight parameters to categorize the built up classes (LCZ 1-10), and from those parameters, there was only one parameter that corresponds to the LCZ framework, which is mean building height. The other parameters are building density, building typology, majority building typology within the polygon, mean minimum distance, median minimum distance, main morphological group, and number of buildings. Morphological groups are used to identify LCZs 7 and 8. For the other built up classes, k-means clustering is applied for the classification. LCZs A-G were classified with another set of parameters: urban vegetation data from satellite images (LCZs A and B), road fraction indicator (LCZ E), and water fraction indicator (LCZ G).
In this paper, the GIS-LCZ method was applied with several novelties. It was used to define the LCZs in the city of Berlin, which, to the best of the authors' knowledge, has not been classified with the GIS-based method before. The resulting GIS-LCZ map is expected to improve the existing remote sensing-based LCZ map (WUDAPT L0). The classification of the zone properties employs an improved fuzzy logic algorithm and modification of the standard LCZ framework. The post-processing applied to the result of the GIS-LCZ classification is a majority filter as applied by the WUDAPT L0 approach, but in this paper, the window size in the majority filter is reduced to retain more detail from the GIS-LCZ data. New TAs of LCZ classes for Berlin are also generated considering the building height information, which is not considered by the TAs generated for WUDAPT L0. The resulting GIS-LCZ map and the WUDAPT L0 map are compared to the TAs for assessing the accuracy. Moreover, the TAs are also submitted to the LCZ generator website to derive an improved RS-LCZ. Furthermore, the k-means clustering method is used to cluster the grid tiles according to their zone properties into custom classes. Finally, the custom clusters are compared to the GIS-LCZ classes to investigate their similarities and differences. Section 2 in this paper explains the study area and methodology. Section 3 describes the classification results and discussions. Section 4 contains conclusions and research outlook.

Methodology
In this study, a GIS-based method was employed to derive the LCZs for Berlin. The general GIS-LCZ mapping process is illustrated in Figure 1. The process was initialized by collecting vector and raster data that are later used to calculate the zone properties. The zone properties are calculated based on a basic spatial unit (BSU), which is defined as a grid tile. Each BSU is classified applying a fuzzy logic algorithm to derive the LCZs. During post-processing, a filtering is carried out to the classification result of LCZs. An evaluation was performed on the final GIS-LCZ by comparing it to the new training areas. The new training areas were further used to evaluate the existing WUDAPT L0 map and to generate an LCZ map from the LCZ generator. Furthermore, the zone properties of the grid tiles were clustered, and then compared to the GIS-LCZ result.

Study Area and Datasets
The focus of this study is the city of Berlin. Berlin has an area of 892 km 2 . As of 2019, its population was around 3.8 million, which makes it the most populous city in Germany. One of the reasons for choosing this city is the availability of the GIS data, which we require for deriving the GIS-LCZ map. A Google Earth image of Berlin and the newly created training areas (see Section 2.5) shown in Figure 2.
The DLR dataset is obtained from the work of Heldens et al. [25]. They generated raster data of Berlin for a microclimate simulation. The raster dataset includes rasters of building height, terrain height, vegetation height, water, streets, and bridges. The dataset provides several resolution ranges from 1 m to 16 m. On the other hand, OSM is a vector dataset containing primarily building land use, road, and water features. OSM is an open source data generated by a community of mappers [29].
Satellite imagery form Copernicus is also used in this study. Copernicus is the European Earth monitoring system where data are acquired from different sources, such as in situ sensors and Earth observation satellites. Raster data of land cover and high resolution layers, such as imperviousness density (IMD) are provided by Copernicus [27].
As mentioned previously, WUDAPT data are used for evaluation purposes in this paper. WUDAPT is a project initiated by urban climate research to provide universally coherent and consistent information on form and function of urban morphology for climate studies [9]. WUDAPT information consists of three levels of detail (L) and is gathered using distinct methodologies. Level 0 (L0) data comprises a local climate zone map, which is based on the work of Stewart and Oke [7]. On the other hand, Level 1 (L1) data provide a better representation for each LCZ through sampling by providing information regarding urban form and function in a finer spatial resolution. L1 data have a representation in three-dimensional form. Furthermore, Level 2 (L2) data provide more information by giving detailed descriptions of urban parameter values for boundary layer modeling.
The WUDAPT project provides level 0 (L0) maps for many cities around the world. The WUDAPT L0 map for Berlin was downloaded from the WUDAPT portal [30]. The map was produced in 2016 and was derived from Landsat 8 Images from March and April 2015. The resolution of the map is 100 m and it was resampled from 30 m Landsat 8 input image resolution. The training areas can also be downloaded through the WUDAPT website [31].
The thermal property used for the classification of GIS-LCZ of Berlin is anthropogenic heat flux that is obtained from AH4GUC. This dataset provides global present and future hourly data of anthropogenic heat flux (AHF) with a resolution of 1 km, which is derived from a global population density map, global gridded monthly air temperature, a global nighttime lights map, and a global combustion sources map [28].

Inference of Urban Morphology from GIS Data
According to the standard LCZ methodology, there are 10 zone properties defining the 17 local climate zones. These properties are: sky view factor, aspect ratio, building surface fraction, impervious surface fraction, pervious surface fraction, height of roughness elements (building heights), terrain roughness class, surface admittance, surface albedo, and anthropogenic heat flux. However, due to limited data sources, in practice, only a subset of those properties can be used for the classification of the LCZs.
For this study, seven zone properties are calculated to generate the LCZ map: sky view factor (SVF), mean building height (H) or mean vegetation height (H v ), aspect ratio (H/W), building surface fraction (BSF), impervious surface fraction (ISF), anthropogenic heat flux (AHF), and roughness length (z 0 ). The basic spatial unit for the classification of the zone properties is in the form of grid tiles with the size of 100 m × 100 m. Every zone property will be resampled to this resolution. The polygon of the grid tiles is created in QGIS in the shapefile format with the extent of the area of Berlin with the coordinate reference system (CRS) of European Petroleum Survey Group Geodesy (EPSG) 25833 (ETRS89/UTM zone 33N). This CRS is used as the default CRS for all the calculation of the zone properties. The total number of grid tiles for the area of Berlin is 90517.
The sky view factor (SVF) is calculated by applying raster height data of building, terrain, and vegetation patch derived from the DLR dataset with a resolution of 5 m. The calculation is performed by employing the Urban Multi-scale Environmental Predictor (UMEP) plugin in QGIS by Lindberg et al. [32]. The DLR rasters of building and vegetation patch height with a resolution of 1 m are also used for the calculation of mean building height (H) for urban classes and mean vegetation height (H v ) for natural classes, respectively. The building surface fraction (BSF) is calculated to define the percentage of building area in a grid tile. Because not all building areas in Berlin are covered by the DLR data, additional building polygon data from OSM are used to define the BSF.
The impervious surface fraction (ISF) is the percentage of the area covered by impervious (paved or rock) materials in a grid tile. The Copernicus Impervious Density (IMD) raster is used for the calculation of the ISF by calculating the mean of the 20 m resolution raster data over the grid tile; however, the IMD cannot be directly used to represent the ISF needed by the LCZ framework since the IMD also includes building information. The ISF as a zone property in the LCZ classification excludes the information of buildings since that information is already covered by BSF. Thus, the BSF should be subtracted from the IMD in order to obtain the ISF, which can be formulated as: ISF = IMD − BSF; however, when this formula is implemented, it results in negative ISF values in several grid tiles. This can be due to the fact that the IMD raster is not fully harmonized with the BSF value since they are from different data sources and have different resolutions and acquisition methods. To avoid negative values in the ISF, the IMD is corrected by taking the maximum between the original IMD and the BSF which can be formulated as IMD = max(IMD, BSF).
The aspect ratio (H/W) is the ratio between mean building height (H) and building spacing (or street width). The width of building spacing (W) is estimated by the equation modified from Samsonov and Varentsov [33]: S 0 is the grid tile area, which, in our case, is 10.000 m 2 . N BLD is the number of buildings that is obtained from building data of OSM and DLR. The resulting H/W calculation contains outliers where the grid tiles have a mean building height H but either they have no building width W or the value of it is small (less than 1 m). This results in incorrect values of H/W. These outliers occur, for example, in the grid tiles covered mostly or fully by buildings. To solve this issue, another H/W is calculated from the grid tiles of 250 m × 250 m to obtain a broader perspective of H/W. The new H/W is resampled to 100 m × 100 m grid tiles and assigned to the grid tiles that have incorrect H/W values or having a mean width W of less than 1 m.
The anthropogenic heat flux (AHF) is obtained from the global AHF dataset with the resolution of 1 km. The raster of AHF is resampled to 100 m × 100 m to be further used to calculate the mean AHF in a grid tile. The roughness length (z 0 ) is calculated from the formula suggested by Oke (cited from Estacio et al. [23]): f 0 is 0.1, which is a constant value generally used for surfaces andz H is the average difference of DSM and DTM, which is calculated as the addition of two rasters: building and vegetation patch heights with their 1 m resolution. Water raster data from DLR with a resolution of 1 m are also used in the classification of GIS-LCZ. The raster is applied directly to classify grid tiles that contain mainly water into LCZ G. The calculation is performed in QGIS by applying the Zonal Statistic tool. Table 1 summarizes the data sources and the calculation method to derive the zone properties and LCZ G. The calculation result of the zone properties, which are SVF, H, H v , H/W, BSF, ISF, AHF, and z 0 , are shown in Figure 3. The zone properties are visualized in QGIS with the equal count (quantile) of 5 categories.

Classification to Local Climate Zones
The zone properties used in the classification of the GIS-LCZ method in this study are simplified into 12 classes instead of 17 classes. LCZ 1 (compact high-rise), LCZ 7 (lightweight low-rise), LCZ C (bush and scrub), and LCZ F (soil/sand) are excluded due to the quasi-nonexistence of these LCZ classes in Berlin. For the classification, the grid tiles are divided into nine urban classes (LCZ 2, 3, 4, 5, 6, 8, 9, E) and three natural classes (LCZ A, B, D). The natural classes are categorized as grid tiles having BSF ≤ 10 and ISF ≤ 10 or containing water. The rest of the grid tiles, which are not natural classes, are classified as urban classes. Furthermore, the grid tiles are classified based on their zone properties into LCZ classes based on the range of values adapted from the standard LCZ framework of Stewart and Oke [7], which is shown in Table 2. In this study, LCZ E is considered as an urban class; therefore, its aspect ratio H/W is modified from the standard framework, where the W indicates building spacing instead of tree spacing. Its ISF range value is also modified from ISF ≥ 90 to ISF ≥ 60, so that it can better detect bare rock or paved surfaces. Table 2. Zone properties of LCZ classes (adapted from Stewart and Oke [7]).

LCZ
SVF The zone properties are classified into LCZ classes by applying fuzzy logic with a trapezoidal membership function modified from Estacio et al. [23]. The membership of every zone property for every LCZ class is determined as shown in Figure 4 (example case of LCZ 2 and its ISF property). The property value which is in the specified range will have a membership value of 1 and the membership value will linearly decrease from 1 to 0 when the property value is out of the range.
To understand how this membership function works, an example from Figure 4 is explained here. The zone property of the ISF of LCZ 2 has a range value from 30-50, which implies 30 as the left bound (LB), 50 as the right bound (RB) and length L = RB − LB = 20. When a grid tile has an ISF value, which is in this range (30-50), the membership value will be 1. On the other hand, when a grid tile has an ISF which is not in this range, the membership value will depend on how far it is away from the LB or RB. The membership value will become 0 for ISF values of less than 100 and more than 70. These values will be called as left zero bound (LZB) and right zero bound (RZB), respectively. The value of the RZB and LZB are defined as LZB = LB − L = 10 and RZB = RB + L = 70.
A problem arose for the zone properties that do not imply the RB, for example, the property value of H for LCZ 4, which is bigger than 25 m. The LB is 2 and the RB goes to infinity. It will not be a problem for defining the RZB because it can be set to infinity as well; however, it will be a problem for defining the LZB. The LZB cannot go to negative infinity, as it will overestimate the membership value of the zone property, which is less than LB. For this case, Estacio et al. [23] chose a value of 0 as the LZB; however, when choosing 0 as the LZB, it will still overestimate the membership value of H/W that is less than 25, because the H will obtain membership values from all the urban classes of LCZs which ranges from 0 (LZB) to 25 (LB). This is not desirable since the purpose of the classification is to obtain a relatively distinct classification outcome. To tackle this issue, the LZB is chosen from the second highest RB value of the range values of H defined in Table 2, which is 15 m. The other zone properties, which do not define LB or RB, are SVF, BSF, and ISF. The common value is chosen as the LB or RB. For SVF, the LB would be 0 and the RB would be 1. For BSF and ISF, the LB would be 0 and the RB would be 100.

Filtering
The classification result is further processed by applying a post-processing step. As summarized by Quan and Bansal [17], previous GIS-LCZ studies carry out post-processing steps for two main reasons: diminishing unnecessary heterogeneity of the LCZ classes and maintaining their minimum area requirement.
In this study, during post-processing, a filtering was applied to obtain more homogeneous LCZ areas. The filter is a majority filter and it is applied using SAGA with a filter radius of one pixel and the search mode of square. These parameters yield a window filter of 3 × 3 pixels. The majority filter is also used by the WUDAPT L0 approach but with a greater size of the window filter, which is 5 × 5 pixels. We reduced the window size to retain more details of the GIS-LCZ classification.

New Training Areas
In generating WUDAPT L0 map, training areas (TAs) are digitized by the local experts who know the overview of the urban morphology of the city. Based on its guidelines [34], the WUDAPT L0 approach specifies TAs of LCZ classes in the respective city based on the view on Google Earth imagery. This implies that the approach relies only on the 2D perspective, which is inconsistent with the guidelines of the original framework of LCZ. Stewart and Oke [7] indicate that the geometric properties of the LCZ (mean building height, aspect ratio, and sky view factor) need building height information.
Therefore, in this study, new TAs are generated to improve the TAs of WUDAPT by considering the geometric properties calculated from the GIS-LCZ mapping method. The new TAs are digitized in QGIS by the aid of satellite imagery and, at the same time, by considering the calculated zone properties. Figure 2 shows the new TAs, which are later used for the evaluation of the resulting GIS-LCZ map and the existing WUDAPT L0 map, as well as to derive a new LCZ map from the LCZ generator.

Custom Classification Using k-Means-Clustering
The classification of LCZ based on the standard framework is very general, as it was developed to fit a majority of cities around the world; however, it does not fit any specific city perfectly well and it may be that some classes of the standard framework do not exist in a city, or the city has specific classes not found in the standard framework. As summarized by Quan and Bansal [17], some studies modified the original LCZ classes by removing, mixing, and adding the standard LCZ classes. In the reviewed studies, not all the standard LCZ classes are found and the classified zones do not have zone properties that fit to the range values defined by the standard.
In this study, a custom classification to derive LCZs is introduced, where range values of the zone properties do not have to be defined. Instead, the grid tiles are clustered according to their zone properties into a number of custom classes, for which the average zone properties can be calculated to define the classes. This will give a result of distinct clusters, specific to the city, which contain grid tiles of similar zone properties. From this result, a custom land use/land cover map can be derived, and, together with the average zone properties, the urban morphology of a specific city may be described more accurately. This may also improve the accuracy of mesoscale climate simulation models that need highly resolved and accurate LULC maps.
For the clustering purpose, the k-means clustering method is applied. k-means clustering is one of the most popular methods in cluster analysis. It uses the vector quantization method to partition N observations into k clusters, in which each observation belongs to the cluster with the nearest mean (cluster centers or cluster centroid). k-means clustering minimizes within-cluster variances (squared Euclidean distances) and optimizes squared errors. This method has been widely used in the classification of land use/land cover [35].
We use the k-means clustering method to find clusters of grid tiles according to their zone properties. The clustering is only performed for the urban classes. In cases where H/W is null because of no building height information, the zone property is set to 0. The five zone properties of urban classes are normalized and clustered applying the k-means clustering package of scikit-learn in Python. The number of clusters are set to be the same as the number of GIS-LCZ classes. The clustering result are compared to the GIS-LCZ classification result to investigate their relationship.

Local Climate Zones Classification from GIS Data
The GIS-LCZ of Berlin is shown in Figure 5 The result of the GIS-LCZ method is compared to the new training areas generated for Berlin, which are shown in Figure 2. The comparison is performed using the confusion matrix calculation in SAGA. Using a confusion matrix is a common method to assess the accuracy of a classification, where the classification result is compared with the reference or ground truth data. AccProd or producer accuracy implies the map accuracy from the perspective of the producer (map maker) or the probability that the reference class is correctly classified in the classification result. On the other hand, the AccUser or user accuracy is the probability that the classified class is correctly classified in the reference class. This accuracy specifies map accuracy from the perspective of a map user. Moreover, the overall accuracy value is the number of sites correctly classified divided by the total number of sites. The kappa coefficient can also be calculated. This value describes how well the classification was executed in comparison to just a random classification. The value ranges from 0 to 1, where 1 represents a perfect match between the classification result and the reference data, and 0 is the other way around where the classification result is considered completely random [36]. The confusion matrix of GIS-LCZ compared to the TAs is tabulated in Table 3. The overall accuracy and kappa values (resp. 92.47% and 0.91) are excellent; however, the LCZ 8 does not have a good producer accuracy value since the training areas of LCZ 8 do not detect any LCZ 8 in the classification result. Nevertheless, LCZ 8 only corresponds to only 0.1% of GIS-LCZ, which is also the lowest proportion of the LCZ classes in the GIS-LCZ result. Moreover, the TA for LCZ 3 could not be created, because insufficient training areas were identified.
The WUDAPT L0 is also compared with the TAs in a confusion matrix shown in Table 4. The overall accuracy and kappa values are 74.95% and and 0.69, respectively, which are lower than that of GIS-LCZ. Generally the producer accuracy of each LCZ classes is lower than that of GIS-LCZ. The producer accuracies of WUDAPT L0 of LCZ 5 and 9 have significant discrepancies compared to that of GIS-LCZ; however, the WUDAPT L0 detects the LCZ 8 far better compared to GIS-LCZ. Moreover, LCZ 3 and E are not classified in WUDAPT L0.     It is observed that WUDAPT L0 is not effective at detecting the zone properties related to building geometry (H, H/W, and BSF), which leads to misclassification of some LCZs. This situation is observed in WUDAPT L0 for LCZ 2, LCZ 4, and particularly LCZ 5 as implied in the confusion matrix of Table 4. LCZ 5 has a very low producer accuracy since it classifies most of the TAs of LCZ 5 into LCZ 4. WUDAPT L0 also gives a completely random classification result on LCZ 6 and 9. It is found that the tiles classified in these classes are in reality natural classes instead of urban classes, based on the zone properties and the view from the satellite imagery as shown in Figure 6.
On the other hand, GIS-LCZ considers the building geometry, which enables it to correctly detect LCZs that are misclassified by WUDAPT L0: LCZ 2 instead of LCZ 5, LCZ 5 instead of LCZ 2 and LCZ 4, and LCZ 6 instead of LCZ 5 as indicated in the confusion matrix of Table 3. Figure 7 shows misclassification in WUDAPT L0, where the method classified the tiles as LCZ 4, but the mean building height values of these tiles are actually in the range of LCZ 5. The GIS-LCZ method addresses this shortcoming effectively.
One shortcoming of the GIS-LCZ method is its strong dependency on the availability of input data. It is observed that some tiles cannot be classified correctly due to the inadequacy of the data to calculate the zone properties. GIS-LCZ also does not manage to detect LCZ 8, which is probably due to the unavailability of pervious surface fraction property or the small extent of the grid tile, which is insufficient for defining LCZ 8. The majority filter applied can remove the granular view and produce more homogeneous LCZ classes; however, this filter can also diminish the correctly classified GIS-LCZ classes. Some tiles of GIS-LCZ are found to belong partially to LCZ 6 or 8, which implies that the combination of two LCZs is possible to represent a local climate zone as it was also noted by other researchers [37]. The airport areas are classified as LCZ 5 in GIS-LCZ (rather than LCZ 8) because the building height property suits this class. This highlights a limitation of the LCZ framework, showing that its implementation cannot always be ideal for every city.

LCZ Generator
The new TAs based on GIS-LCZ data, see Figure 2, were also submitted to the LCZ generator and the result of the new LCZ map gives an accuracy value of 84% (see Figure 8). The previous (WUDAPT) submission for Berlin to the LCZ generator obtained a lower accuracy value of 61% (see Figure 9). The accuracy evaluation of the LCZ Generator refers to five indexes [38]: overall accuracy (OA), overall accuracy for the urban LCZ classes only (OAu), overall accuracy of the built vs. natural LCZ classes only (OAbu), a weighted accuracy (OAw), and the class-wise metric F1. The accuracy indexes OA, OAu, OAbu, and OAw of the new LCZ map are 84%, 82%, 98%, and 97%, respectively. The relevant indexes of WUDAPT are 61%, 59%, 94%, and 92%, respectively. The class-wise metric F1 evaluation also shows that in the new LCZ map, except for LCZ 8 and LCZ B, the accuracy of the other eight LCZ classes are better than that of the previous submission.  [39] (see also [15]).

Figure 9.
Boxplot accuracy of the LCZ generator map for the previous WUDAPT training areas of Berlin; data from [40] (see also [15]).

k-Means Clustering and Comparison to GIS-LCZ Classes
First, we compare the result of the k-means clustering to the previously determined GIS-LCZ classes using average cluster properties. The average of the zone properties for each cluster is shown in Table 5. If we analyze each average cluster property separately, we can make the following observations. The clusters 0, 1, 2, 4 and 7 with building height between 10 m and 20 m may be middle-rise areas, the clusters 3, 5, 6 having building heights between 5 m and 10 m may The BSF values of the other clusters are between 10% and 20%, buildings in these areas are sparsely built. This shows that the BSF of most clusters in Berlin is small. The ISF value of cluster 3 is 66.7%, which may correspond to bare rock or paved areas. The ISF value of cluster 2 is 56.0%, which may correspond to compact areas. The ISF values of clusters 0, 1, 4, and 7 are between 20% and 40% indicating open or sparsely built areas. The cluster 5 with an ISF value of 9.7% may belong to sparsely built areas. The clusters 2, 4, and 7 with an AHF more than 10 W m −2 may indicate compact, open or lightweight areas. On the other hand, the other clusters with an AHF less than 10 W m −2 may correspond to sparsely built or paved areas.
Overall, it seems difficult to establish a clear relationship between the clusters and the LCZ classes using only average cluster properties, because, depending on the property analyzed, the most likely LCZ class corresponding to each cluster differs.
Next, we performed a cross-analysis of the average zone-properties between the custom clusters and the GIS-LCZ classification. The resulting agreement values in percent are shown in Table 6. Percentages greater than 20 are marked to identify GIS-LCZ classes that are similar to the k-means clusters.
It can be seen from Table 6 that 71.8% of cluster 1 belong to GIS-LCZ 5, 67.9% of cluster 6 belong to GIS-LCZ 6, and 69.3% of cluster 4 belong to GIS-LCZ 2. Most of the members of cluster 0, 2, and 3 belong to GIS-LCZ 5 and 6. Relatively speaking, the mean building height of cluster 2 is closer to that of a midrise area, while the mean building height of cluster 0 and 3 are closer to that of a low-rise area. In addition, there may be a large number of paved areas in cluster 3. Moreover, cluster 5 is mainly composed of GIS-LCZs 6 and 9, and cluster 7 is distributed among GIS-LCZs 2, 5 and 6. The classification of the GIS-LCZ method is based on the standard LCZ framework; therefore, the GIS-LCZ method is not able to identify zone types outside of the standard LCZ framework, in which the zone types have different range values than that defined in the standard. On the other hand, k-means clustering is based entirely on quantitative properties, which can effectively avoid the constraints of the standard LCZ framework and may distinguish zone types other than that defined by the standard LCZ framework; however, the resulting clusters do not correspond to clearly defined urban topologies, and so the researchers need to name them according to the characteristics of each cluster on a case by case basis.

Conclusions
In this study, it is shown that the GIS-LCZ method can improve the accuracy over the remote sensing-based WUDAPT L0 approach in deriving an LCZ map of Berlin. When compared to the new training areas, a high overall accuracy of 92.47% and a high kappa value of 0.91 were found for the GIS-LCZ map. This indicates a highly accurate classification result and a strong improvement over the previous WUDAPT L0 result with an overall accuracy of 74.95% and a kappa value of 0.69.
It is observed that the WUDAPT L0 method misclassified some LCZs due to its shortcoming in detecting the zone properties related to building geometry. On the other hand, the GIS-LCZ approach calculated the zone properties from vector and raster datasets, which correctly detects LCZs that were misclassified by WUDAPT L0. Nevertheless, the GIS-LCZ approach strongly depends on the availability of the data to calculate the zone properties. It can lead to a misclassification of LCZs when the zone properties are not available. This study also shows the limitation of the LCZ framework in addressing the finer variety of zone property values of the grid tiles in Berlin, where the LCZ classes from the standard do not fit perfectly to the calculated zone properties.
The GIS-LCZ approach classified the zone properties using a fuzzy logic algorithm that was adapted from Estacio et al. [23] and modified in order to solve the membership value problem of the left zero bound of the trapezoidal function. The standard framework for LCZ E is also modified by defining the impervious surface fraction bigger than or equal to 60%, so that it can better detect paved surfaces in urban areas. The majority filter used in the post-processing employs a window filter of 3 × 3 pixels instead of 5 × 5 pixels used in the standard WUDAPT L0, in order to keep the details of the GIS-LCZ classification.
This study also shows that the new training areas generated with the consideration of building height information can increase the accuracy of the LCZ map produced by the LCZ generator. Moreover, the k-means clustering result shows that the GIS-LCZ method tends to divide regions into clearly differentiated LCZ types according to the standard framework, while k-means clustering can identify regions with city-specific characteristics or inter-LCZ transition types. From the cross analysis between the clusters and the GIS-LCZ classes, it is found that some of the GIS-LCZ classes seem to be naturally distinguishable but some are difficult to separate.
Future improvement of the methodology presented here could proceed in several directions. Only seven of the zone properties were used here for the classification of GIS-LCZ. It would be interesting to add other zone properties (pervious surface fraction, surface albedo, and surface admittance) and to see whether the result improves. The WUDAPT L0 map has been inefficient in detecting certain LCZs, where geometric zone properties, such as H and H/W, are critical. The classification in WUDAPT L0 could be improved by introducing building height information to the machine learning algorithm. Integration of the GIS-LCZ and the WUDAPT L0 method could also be possible to obtain advantages from both of these methods by replacing the post-processing step (majority filter) of WUDAPT L0 with an aggregation step applied in GIS-LCZ method by Gál et al. [20].
Existing investigations [21,[41][42][43][44] can be enhanced or reapplied by using the result of the GIS-LCZ map of Berlin. This approach is particularly suitable for practitioners attempting to characterize UHI via climate simulation as it can improve the accuracy of the simulation results by reflecting terrain features more precisely and consistently. This is especially valuable in the case of high horizontal grid resolution. The GIS-LCZ classes can also be used in the evaluation of simulation results from microscale numerical models, such as PALM-4U. Furthermore, studies that correlate UHI intensity with land cover characteristics can be extended to establish a correlation with the more accurate urban classification of the GIS-LCZ method. Another type of correlation can be carried out directly between the GIS-LCZ classes and remotely sensed land surface temperature (LST) or in situ measurements of near-surface air temperature. The GIS-LCZ classes can also be applied to design urban temperature measurement networks for the purpose of understanding spatial and temporal characteristics of urban climate. This network can be further used to analyze the air temperature conditions within the GIS-LCZ classes.