On the Calculation of Urban Morphological Parameters Using GIS: An Application to Italian Cities

: The identiﬁcation of parameters that can quantitatively describe the different characteristics of urban morphology is fundamental to studying urban ventilation and microclimate at the local level and developing parameterizations of the dynamic effect of an urban area in mesoscale models. This paper proposes a methodology to calculate four morphological parameters, namely mean height, aspect ratio, sky view factor, and plan area ratio, of ﬁve cities located in southern (Bari and Lecce), central (Naples and Rome), and northern (Milan) Italy. The calculation is performed using the Geographical Information System (GIS), starting from morphological and land use data collected and analyzed in shapeﬁles. The proposed methodology, which can be replicated in other cities, also presents in detail the procedure followed to properly build input data to calculate the sky view factor using the UMEP GIS tool. The results show a gradual increase in the plan area index, λ p , and mean building height, H , moving from the south to the north of Italy. Maximum values of λ p and H are obtained in the regions of Milan, Rome, and Naples, where the highest spatially-averaged values are also found, i.e., λ p = 0.22, H = 10.9 m in Milan; λ p = 0.19, H = 12.7 m in Rome; λ p = 0.20, H = 12 m in Naples. Furthermore, for all the cities investigated, areas characterized by the Corine Land Cover class as “continuous urban fabric” are those with medium sky view factor SVF values (around 0.6–0.7) and λ p values (around 0.3) typical of intermediate/compact cities. The methodology employed here for calculating morphological parameters using GIS proves to be replicable in different urban contexts. This opens to a better classiﬁcation of cities in local climate zones (LCZ), as shown for the Lecce region, useful for urban heat island (UHI) studies and to the development of parameterizations of the urban effects in global and regional climate models


Introduction
Rapid urbanization and climate change pose new challenges to urban areas. Specifically, the rapid urbanization experienced in the 20th century has caused an increased pressure of urban areas on the surrounding environments. Moreover, urban areas have increased their size and density [1]. These processes affect the atmospheric environment and local climate by impacting air flows and the allocation of radiation and energy on the earth's surface. They are responsible for the formation of the well-known urban heat island (UHI) effect, i.e., the increase in air temperature registered in urban areas compared to that of the surrounding rural areas [2]. authors have derived relevant flow and dispersion parameters related to urban morphology for parts of some northern European cities, such as London, Toulouse, Berlin (e.g., [18]) and for main Chinese and Indian cities (e.g., [19][20][21][22][23]). However, little attention has been paid to cities in southern European and Mediterranean regions (see for example [24][25][26]).
In this context, the present paper is aimed at proposing an approach to calculate some morphological parameters using the Geographical Information System (GIS), with application to five Italian cities. Several GIS models have been used in the literature to quantitatively calculate such parameters, as recently reviewed by Ferreira et al. [4]. They summarized works dealing with the representation of the urban surface in GIS platforms, listing the calculated parameters, type of software (commercial or open source), type of spatial data used (vector or raster), and how the city (as a real or a hypothetical arrangement) and the results (scale of analysis) were represented. The main urban parameters investigated were surface cover and urban structure by area ratios (plan area ratio/plan area index, building coverage ratio), height and volume of buildings, urban density, and roughness variables (frontal area index, roughness length, zero-plane displacement height). In the present research, some of those morphological parameters (i.e., mean building height, aspect ratio, sky view factor, and aspect ratio) are calculated, and results are compared in terms of similarities and differences among the investigated cities located in southern (Bari and Lecce), central (Naples and Rome), and northern (Milan) Italy. The outcomes of such morphological analyses have direct atmospheric-oriented applications in urban climatology. They may be used for the development of parameterizations of the urban effects in global and regional climate models, as well as in the analysis of the UHI effects. To this regard, an application of the morphological analysis performed here is further shown by building an LCZ map of the Lecce region based on calculated morphological parameters and thus achieving a detailed classification in LCZ classes.
The paper is structured as follows. After the Introduction, Section 2 presents the cities investigated, their climate, position, and geometrical structure. The methodology employed for the morphological analysis is presented in Section 3, with a focus on the definition of morphological parameters and their calculation using GIS. Section 4 presents results in maps and tables, summarizing the values of the morphological parameters for different areas of the cities, including the LCZ map. Conclusions are given in Section 5.  24 N-18 • 10 08.9 E) has also been chosen, being the home city of the University of Salento. Based on the Köppen Climate Classification, the climate of the cities of Lecce, Naples, and Rome is "Csa" (Mediterranean Climate), characterized by relatively mild winters and very warm summers. The city of Bari is instead characterized by a humid subtropical climate, classified as "Cfa" in the Köppen Climate Classification, and presents relatively high temperatures and evenly distributed precipitation throughout the year. Finally, for the city of Milan, the climate is classified as "Cfb" (Marine West Coast Climate) that is characterized by equable climates with few extremes of temperature and abundant precipitation in all months (https://www.weatherbase.com, accessed on 14 December 2022).

Description of the Cities Investigated
Overall, Milan (1,349,930 inhabitants) (http://dati.istat.it/index.aspx?queryid=19101, accessed on 10 January 2023) shows a radial city development. Relatively new settlements have been built, radiating from the city's original center. A radial development is also present in Rome (2,749,031 inhabitants), although several nuclei may be distinguished, and the presence of vegetation is more evident. Regarding the city of Lecce (94,783 inhabitants), the radial development starting from the original nucleus occurs along three directions, i.e., the north-west, where there is an industrial area, the east, towards the coast, and the south, where there are mainly residential areas. Finally, Naples (921,142 inhabitants) and Bari (315,948 inhabitants) are both coastal cities. However, while the former developed along the coastline with evident interference from the Vesuvio volcano, the latter expanded inland. Figure 1 shows the central areas of the five Italian cities selected for the study. Lecce presents a very compact and irregular city center surrounded by compact but regular neighborhoods built in the last decades. Regarding Bari, the compact and irregular old town extends into the sea, while the more recent neighborhoods have been built according to an orthogonal grid. The central area of Naples is very compact and presents both irregular and regular structures. In contrast, the regular and compact structure of the central area of Rome is frequently interrupted by gardens and parks. Finally, Milan exhibits a very compact and irregular structure. extremes of temperature and abundant precipitation in all months (https://www.weatherbase.com, accessed on 14 December 2022).
Overall, Milan (1,349,930 inhabitants) (http://dati.istat.it/index.aspx?queryid=19101, accessed on 10 January 2023) shows a radial city development. Relatively new settlements have been built, radiating from the city's original center. A radial development is also present in Rome (2,749,031 inhabitants), although several nuclei may be distinguished, and the presence of vegetation is more evident. Regarding the city of Lecce (94,783 inhabitants), the radial development starting from the original nucleus occurs along three directions, i.e., the north-west, where there is an industrial area, the east, towards the coast, and the south, where there are mainly residential areas. Finally, Naples (921,142 inhabitants) and Bari (315,948 inhabitants) are both coastal cities. However, while the former developed along the coastline with evident interference from the Vesuvio volcano, the latter expanded inland. Figure 1 shows the central areas of the five Italian cities selected for the study. Lecce presents a very compact and irregular city center surrounded by compact but regular neighborhoods built in the last decades. Regarding Bari, the compact and irregular old town extends into the sea, while the more recent neighborhoods have been built according to an orthogonal grid. The central area of Naples is very compact and presents both irregular and regular structures. In contrast, the regular and compact structure of the central area of Rome is frequently interrupted by gardens and parks. Finally, Milan exhibits a very compact and irregular structure.

Description of the Morphological Parameters
Several parameters ( Figure 2) have been used to characterize the selected cities:

Description of the Morphological Parameters
Several parameters ( Figure 2) have been used to characterize the selected cities:  mean building height, (m): the geometric average over a specific area of building heights;  sky view factor, SVF (-): the ratio of the amount of sky hemisphere visible from ground level to that of an unobstructed hemisphere;  aspect ratio, AR (-): the mean height-to-width ratio of street canyons, building spacing;  plan area index, λp (-): the ratio of building plan area to total plan area.

Mean Building Height
is the average of building height over a specific area: where Hi is the height of building i and N is the total number of buildings in the cell of the grid employed to discretize the city area in GIS (see Section 3.2).

Sky View Factor
The sky view factor (SVF) ranges from 0 to 1, representing totally obstructed and free spaces, respectively; 1 means that all the sky is visible-e.g., it characterizes open land areas-whereas 0 means that the sky is not visible from that specific point. Hence, the lower the SVF, the higher the degree of obstruction in the urban canyon. It is also defined as the ratio of the radiation received (or emitted) by a plan surface to the radiation emitted • mean building height, H (m): the geometric average over a specific area of building heights; • sky view factor, SVF (-): the ratio of the amount of sky hemisphere visible from ground level to that of an unobstructed hemisphere; • aspect ratio, AR (-): the mean height-to-width ratio of street canyons, building spacing; • plan area index, λ p (-): the ratio of building plan area to total plan area.

Mean Building Height
H is the average of building height over a specific area: where H i is the height of building i and N is the total number of buildings in the cell of the grid employed to discretize the city area in GIS (see Section 3.2).

Sky View Factor
The sky view factor (SVF) ranges from 0 to 1, representing totally obstructed and free spaces, respectively; 1 means that all the sky is visible-e.g., it characterizes open land areas-whereas 0 means that the sky is not visible from that specific point. Hence, the lower the SVF, the higher the degree of obstruction in the urban canyon. It is also defined as the ratio of the radiation received (or emitted) by a plan surface to the radiation emitted (or received) by the entire hemispheric environment [29]. Middel et al. [30] have calculated, using a human-centric street canyon perspective in Google Street View, typical SVF footprints for 15 cities around the world. Miao et al. [31] have recently reviewed several methods used to estimate the SVF in urban canyons, outlining their differences, including their respective strengths and weaknesses. In the present work, SVF has been calculated using the Urban Multi-scale Environmental Predictor (UMEP) (https://umep-docs.readthedocs.io/en/latest/index.html, accessed on 14 December 2022). UMEP is a climate service tool designed for researchers and service providers (e.g., architects, climatologists, energy, health, and urban planners) presented as a plugin for QGIS. The Sky View Factor plugin can be used to generate pixel-wise SVF using ground and building models. The methodology that is used to generate SVF in this work is described in Lindberg and Grimmond [32].

Aspect Ratio
The aspect ratio, AR, varies from 0 to more than 2 and represents how narrow the urban canyons are proportional to their height. It may either be computed directly if the average width of urban canyons is known or derived from the sky view factor, SVF. In fact, SVF can also be expressed as SVF = cos (arctan (2*AR)) [33], and, consequently, AR may be expressed as follows: Equation (2) is the one used for performing the present study.

Plan Area Index
The plan area index, λ p , ranges from 0 to 1 and is defined as the ratio of the plan area of buildings to the total surface area of a specific region: where A pi is the plan area of the building, i, at ground level, and A T is the total area of the cell. The values of this parameter depend on the size of the area or of the specific land use types included in the calculation. As reported by Burian et al. [14], in most cases the plan area fraction varies significantly from one city block to the next because of the heterogeneous nature of the urban landscape. The appropriate size of the calculation element should be chosen such that the characteristics of interest in the urban area are homogeneous and discernible.

Estimation of Morphological Parameters Using GIS
The morphological analyses have been conducted for the selected cities, employing the Geographic Information System (GIS) software QGIS 3.22.1 (https://www.qgis.org/en/site, accessed on 14 December 2022). Hence, morphological and land use data have been collected in shapefiles and analyzed. In addition, for the city of Lecce, the Digital Elevation Model (DEM) has been employed to calculate building heights. Different institutional sources have been used for collecting the relevant data: Note that a region of interest has been identified for each city selected. The dimensions of the region of interest have been set depending on the selected city size. Therefore, a 10 km × 10 km area has been used for the cities of Lecce and Bari (referred to as "Lecce region" and "Bari region" hereinafter), while a 20 km × 20 km area has been used for the cities of Naples, Rome, and Milan (referred to as "Naples region", and so on).
Initially, the vector layer containing building footprints and heights has been converted into a raster layer. Then, a vector grid with cells of 100 m × 100 m has been created for the region of interest and used to compute the selected morphological parameters by running zonal statistics functions. The cell size of 100 m × 100 m has been chosen as the theoretically mean smallest relevant spatial unit, where the physical properties of the environment significantly affect air temperatures at the local scale (as reported in Merbitz et al. [34] and Geletič and Lehnert [35]). After the processing, the resulting vector layer has been converted into a raster one.
This process has been applied to calculate each parameter, albeit with slight differences due to various degrees of complexity. Calculating the mean building height has been straightforward: averaging the heights of the buildings in each cell. For the region of Lecce, the mean height has been calculated using the DEM, which has been converted into a raster file containing the shape of the buildings.
In contrast, the calculation of SVF and λ p has required additional operations, such as creating attributes indicating the presence of buildings in each cell. Finally, the computation of SVF has been more complex and has required more operations. Therefore, a detailed description of the methodology followed for its calculation has been provided in the following subsection.

Specific Case of SVF
For UHI-oriented studies, SVF should be calculated only within urban canyons without considering the building rooftops. The computation has been performed using the UMEP plugin for QGIS. Two raster layers have been employed: the former, called A, reports buildings' footprint and height, and NoData values have been used to identify the ground. The latter, called B, contains only the buildings' perimeter. The resolution of the two raster layers has been set to 4 m × 4 m to keep the calculation time acceptable.
To adequately calculate the SVF at the ground level, it is essential to ensure that no values are assigned inside the buildings. The first step is to calculate the difference between the two rasters B and A to obtain a new raster, C. In the new raster, negative values are inside the buildings and NoData values correspond to ground/streets. In the second step, the NoData values have been converted into zeros. In the third step, raster C is summed to raster B to obtain a new raster, D. D ranges from -(max height) to + (max height). In the fourth step, the negative values in raster D, corresponding to pixels inside the buildings, have been replaced by NoData values. Consequentially, a raster layer is created with zeros for the ground and NoData values for the pixels inside the buildings, and height information is reported along the building contours. It should be noted that the NoData values of pixels inside the buildings include the rooftops.
Once the fourth step has been completed, it is possible to compute SVF by employing a function embedded in the UMEP plugin, i.e., sky view factor. The resulting raster (called E) still includes SVF values for contours and the inside of the buildings, which must be removed. Therefore, a comparison of raster E with raster D has enabled the replacement of the SVF data (for the inside of the buildings) with NoData. Finally, to remove the SVF data on the contours, a negative of the contour lines raster (neg_B) has been created with NoData on the contours, and 0 everywhere else. Comparing raster neg_B with raster E, the SVF data have thus been removed from the contour lines in raster E.
Finally, the obtained object is a raster with SVF values everywhere (except in the buildings), which has been processed using the zonal mean statistics function over the 100 m × 100 m grid as performed for the other morphological parameters.

Data Analysis Based on CORINE Land Cover Classes
In order to ease the comparison between the different cities taking into account their main characteristics in terms of land cover, Corine Land Cover (CLC) classes have been used. The Corine (Coordination of Information on the Environment) Land Cover project provides information on the biophysical characteristics of European land cover. The CLC project was established in the 1980s to standardize data collection on the state of land in Europe and to support environmental policy and has become the primary spatial data source on land for the European Economic Area. Images are acquired by earth observation satellites. It is currently a product of the Copernicus Land Monitoring Service. Copernicus is the earth observation program of the European Union (https://collections.eurodatacube.com/, accessed on 14 December 2022). The CLC inventory consists of 44 land cover and land use classes derived from satellite data.
For the present study, the three main CLC classes present in the cities have been considered: 1.1.1, 1.1.2, and 1.2.1, which stand for "continuous urban fabric" (hereafter CLC 1), "discontinuous urban fabric" (hereafter CLC 2), and "industrial or commercial units"

Results and Discussion
This section reports and discusses the results of the morphological analyses performed for the five cities in the regions of interest. In Section 4.1, the results are presented in terms of λp and using maps. These two parameters have been selected because they quantify the pressure on the environment of the selected built areas. Comparing building height and plan area density distributions may characterize qualitatively and quantitatively the differences between the selected regions. The morphological characteristics are analyzed separately for each region, and then a crosscomparison is performed. In Section 4.2, the results are referred to the three specific CLCs and are presented using tables. Please note that each value in the table is calculated as the spatially-averaged value over the entire area of the specific CLC for each region. Limitations and future perspectives are given in Section 4.3. Section 4.4. finally reports on an example of an atmospheric-oriented application of the morphological parameters, specifically the building of a detailed LCZ map (for the Lecce region). CLC 1 is assigned when urban structures and transport networks are predominant: above 80% of the land surface is covered by impermeable features such as buildings, roads, and artificially surfaced areas. Non-linear areas of vegetation and bare soil are exceptional. CLC 2 is assigned when urban structures and transport networks associated with vegetated areas and bare surfaces are present and occupy significant surfaces in a discontinuous spatial pattern. Impermeable features such as buildings, roads, and artificially surfaced areas range from 30 to 80% land coverage. In CLC 3, buildings, other built-up structures, and artificial surfaces (with concrete, asphalt, tarmacadam, or stabilized, e.g., beaten earth) occupy most of the area. It can also contain vegetation (most likely grass) or other non-sealed surfaces. This class is assigned for land units that are under industrial or commercial use or serve for public service facilities (https://land.copernicus.eu/user-corner/ technical-library/corine-land-cover-nomenclature-guidelines/html/index.html, accessed on 14 December 2022).
Once the Corine Land Cover raster (https://land.copernicus.eu/pan-european/corineland-cover/clc2018, accessed on 3 November 2022) has been imported in QGIS, a vector layer has been created for each of the selected CLC classes. The zonal "majority" statistical function has been used to obtain values of morphological parameters over the grid of each CLC class. The obtained data have then been extracted to calculate the statistics (in terms of spatial averages and standard deviations) for each CLC class.

Results and Discussion
This section reports and discusses the results of the morphological analyses performed for the five cities in the regions of interest. In Section 4.1, the results are presented in terms of λ p and H using maps. These two parameters have been selected because they quantify the pressure on the environment of the selected built areas. Comparing building height and plan area density distributions may characterize qualitatively and quantitatively the differences between the selected regions. The morphological characteristics are analyzed separately for each region, and then a cross-comparison is performed. In Section 4.2, the results are referred to the three specific CLCs and are presented using tables. Please note that each value in the table is calculated as the spatially-averaged value over the entire area of the specific CLC for each region. Limitations and future perspectives are given in Section 4.3. Section 4.4. finally reports on an example of an atmospheric-oriented application of the morphological parameters, specifically the building of a detailed LCZ map (for the Lecce region).          Finally, it is interesting to note that the city of Lecce and, to a lesser extent, Milan has a symmetrical spatial distribution of λ p and H, with higher values in the city center and decreasing symmetrically moving towards the periphery. On the other hand, the two coastal cities, Bari and Naples, show higher values near the coast, with values gradually decreasing away from the coast. Finally, Rome shows a particular distribution, with higher values in the city center and along spatial radii stretching in all directions, while the values are lower elsewhere.

Morphological Parameter Maps
Results obtained for the investigated cities agree with values found in the literature for other Italian cities. For example, Giovannini et al. [25] and Pappaccogli et al. [26] have obtained a similar range of λ p for the cities of Trento and Bolzano. Similarly, Di Sabatino et al. [24] have analyzed three different urbanized neighborhoods of the city of Lecce and found values of λ p equal to approximately 0.40, in line with the results achieved in this paper.

Comparison of the Morphological Characteristics
Tables 2-4 report the spatially-averaged values, over the entire CLC area, of each parameter and the standard deviations. Please note that CLCs represent homogeneous areas that are mostly covered by buildings or artificial structures (CLC 1 and CLC 3) or a mixture of buildings and vegetated areas. Spatially-averaged values over each CLC thus do not expressively refer to whole non-built-up areas. Table 2. Spatially-averaged values and standard deviations of the morphological parameters over the CLC 1 area.

CLC 1
Spatially-Averaged St. Dev.  Table 4. Spatially-averaged values and standard deviations of the morphological parameters over the CLC 3 area.

CLC 3 Spatially-Averaged
St. Dev.  The Naples and Rome regions are instead characterized by the lowest values (0.26 and 0.28, respectively). It is interesting to note that the Lecce region, even though characterized by the lowest H (9.5 m), presents a high value of λ p (0.34), which is likely due to the fact that CLC 1 mainly covers the compact city center (Figure 3) and not further surrounding areas, as occurs for the other regions. Note that this value confirms the definition of Lecce as a compact city and is in line with the values found by Di Sabatino et al., where three neighborhoods of Lecce were analyzed [24].

H (m) AR (-) SVF (-) λ p (-) H (m) AR (-) SVF (-) λ p (-)
Regarding CLC 2, similarly to CLC 1, there is a gradual increase in H moving from the south to the north of Italy, with the Rome and Milan regions showing the highest spatially-averaged values (12.5 m and 11.6 m, respectively) and high standard deviations. Moreover, the two regions present similar values of AR (0.25 and 0.31, respectively) and SVF (0.88 and 0.84, respectively). Finally, λ p is the highest (0.20) in the Milan region, followed by the Lecce region, with 0.17.
As expected, CLC 3 presents for each city region the lowest values of H and AR and the largest values of SVF. Regarding H, the Naples region presents the highest value, while the Bari region shows the lowest. The Milan region, similar to what occurs in CLC 1 and CLC 2, still shows the highest AR (0.22) and λ p (0.25), followed by the Napoli region (0.19 for AR and 0.22 for λ p ). It should be noted that spatially-averaged values and standard deviations of λ p are slightly larger than those found in CLC 2, but for Lecce, which, given its geometrical symmetrical distribution, shows a symmetrical distribution of λ p .
Comparing the three CLCs, as expected from their definition, the quantitative morphological analysis performed here confirms that, for all the city regions, CLC 1 areas (continuous urban fabric) are those with the highest urbanization, with medium SVF values and λ p values typical of compact cities [36,37], characterized by wake interference or skimming flow regimes [38]. In CLC 2 areas, the plan area density decreases, presenting values typical of sparse cities characterized by isolated roughness or wake interference flow regimes. Finally, results for CLC 3 confirm a decrease in H and AR moving from the city center, but with larger λ p than CLC 2. This is because CLC 2 includes impermeable features covering 30 to 80% of the land, while in CLC 3 built-up structures and artificial surfaces occupy most of the area.

Limitations and Future Morphological Works
An advantage of the methodology presented in this paper is that it employs only two sets of input data (building heights and grid maps) based on the actual shape of buildings to calculate the morphological parameters of a given city or region. Hence, for its application, it is necessary to collect appropriate and accurate datasets of building geometry to apply zonal statistical functions and calculate morphological parameters.
Looking at Figures 4 and 5 and Tables 2-4, larger spatially-averaged values can be expected in the tables, especially for AR and λ p for the Naples, Rome, and Milan regions in CLC 1. First, it should be noted that results do not refer only to the city but to a region of 20 km × 20 km that includes, in some cases, nearby towns, which are less dense than the main city. Furthermore, a minimum mapping unit (MMU) of 25 ha for areal phenomena and a minimum width of 100 m for linear phenomena have been used for obtaining the CLC in the Corine project. In addition, high-resolution satellite imagery is mainly processed by visual interpretation. Boundaries of a specific class may thus cover zones that do not belong appropriately to that class. This aspect has affected the spatially-averaged values of morphological parameters over a given CLC, as shown in Tables 2-4. A grid sensitivity test has been performed using several grid resolutions: 50 m × 50 m, 250 m × 250 m, 500 m × 500 m, and 1000 m × 1000 m. The calculation has been performed for the Lecce region, and the results are shown in Table 5. Taking into account values for the 100 m × 100 m grid resolution as a reference, the table shows that:  focusing on values obtained for CLC 3, larger deviations (than those found for CLC 1 and 2) can be noted using the 50 m × 50 m grid resolution, with a maximum deviation of 60% for λ p . This may be due to the small grid cells (50 m × 50 m), which experience a larger number of values close to 0 (cell without buildings) and 1 (cell fully occupied by buildings) as confirmed by the high standard deviation.
These preliminary results suggest that using a grid resolution finer than 50 m × 50 m will produce lower differences with respect to the 100 m × 100 m grid resolution and thus building a dataset of parameters averaged over cells of 100 m × 100 m may be an appropriate compromise to calculate parameters over large areas. These preliminary findings will be explored in future work, performed by conducting the grid sensitivity test to the other regions of the cities investigated.
Finally, it is worth mentioning that there are several methods in the literature for calculating SVF [31]. An additional aspect of future work is to test the accuracy of the different methods in QGIS.

LCZ Map Based on Morphological Parameters
The calculated morphological parameters can be used to build detailed LCZ maps. Here, as an example, the map of the Lecce region is presented and discussed. As mentioned in the Introduction, the concept of LCZ contributed to an advancement in the thermal analysis of urban areas. Thermal differences, for example in UHI intensities, are no longer confined to urban/rural temperature differences, but they can also be analyzed more closely as differences between LCZs. Thus, the LCZ scheme is a useful tool to compare the thermal features of various neighborhoods within a city, and/or similar neighborhoods between different cities. The LCZ concept has been applied in a wide range of urban climate investigations [39].
As mentioned above, each LCZ is characterized by different morphological and surface cover properties. The morphological parameters are H, SVF, AR, λ p , impervious surface fraction, pervious surface fraction, and terrain roughness class [15]. There are three different levels of LCZ maps. The World Urban Database and Access Portal Tools (WUDAPT, www.wudapt.org, accessed on 19 December 2022), which uses freely available Landsat imagery and Google Earth for creating the training areas (TAs), provides a procedure for generating the LCZ level 0 product. Level 0 represents the first level of information, while Levels 1 and 2 provide more detailed information [40]. The European LCZ level 0 map from Demuzere et al. [41] is available for download as a GeoTiff at the WUDAPT database. There are other approaches for generating LCZ level 0 products that can also lead to detailed Level 1 and 2 data. Where the data are available, administrative data (on building footprints, heights, green spaces, etc.) and GIS can be used (e.g., [19,35,42,43]), as performed here.
Specifically, for building the LCZ map (GIS LCZ map hereinafter) of the Lecce region, all the morphological parameters given by Stewart and Oke [15] have been calculated, except for the roughness class. The GIS LCZ map has then been built considering Stewart and Oke's criterions and employing fuzzy logic to determine the three most probable LCZ for every cell [44]. A grid of 100 m × 100 m cells has been thus obtained, each cell containing those three most probable LCZs. The corresponding map (with only the most probable LCZ drawn) can be seen in Figure 6, which also shows the European LCZ map for comparison. Please note that no fuzzy logic has been used for the pervious surface fraction, used at the beginning of the script to distinguish between urban and non-urban areas, with a threshold at 80% (this choice is a main source of inaccuracy). In the GIS LCZ map, as information on the Land Cover Types (LCZ A, B, C, D, E, F, and G) was missing, all these classes were grouped in a single class called Non-built areas and, for consistency, the same was done for the European LCZ map.
The European LCZ map shows that LCZ 6 (Open low-rise) is the most abundant (35%), characterizing the city outskirts. The city center is characterized by LCZ 2 (Compact mid-rise), while all around the center and in the north-western part of the city there is the presence of LCZ 8 (Large low-rise), the latter corresponding to the industrial/commercial zone of Lecce city. When compared with the GIS LCZ map, it can be noted that the area covered by LCZ 2 is the same (6%), even though more irregularly distributed throughout the region. It is noteworthy that the LCZ 6 now covers 11% of the area; in fact, areas around the center are now mostly reclassified as LCZs 5 (Open mid-rise) and 6, while the LCZ 8 area is reduced and mostly appears in the northern/western part as in the European map. Finally, scattered areas of LCZ 9 (Sparsely built) appears.
To assess the classification accuracy of both the GIS-based and WUDAPT methods, a confusion matrix was produced by comparing the European map to the GIS map. As an example, the overall accuracy (OA) has been calculated as follows [20]: OA is a statistic representing the proportion of pixels that were classified in the same way by both maps, and it is shown in the form of a percentage. An overall accuracy of 73% has been found.
In summary, while the general spatial distribution patterns of different LCZ classes in the two LCZ maps are qualitatively similar (Figure 6), the GIS-based method can improve the accuracy because the WUDAPT L0 method misclassified some LCZs, especially at the core urban cells. On the other hand, the WUDAPT method generates LCZ maps with a more homogeneous pattern than the GIS-based method. Findings from this work can provide a useful reference for researchers who are interested in LCZ classification and mapping work for their cities. Further analysis will be performed in the future to compare the methods, improve the GIS-based method in classifying the zones, and also consider surface cover properties to improve the overall LCZ classification. The same will be applied to the other regions analyzed in the present paper. and Oke's criterions and employing fuzzy logic to determine the three most probable LCZ for every cell [44]. A grid of 100 m × 100 m cells has been thus obtained, each cell containing those three most probable LCZs. The corresponding map (with only the most probable LCZ drawn) can be seen in Figure 6, which also shows the European LCZ map for comparison. Please note that no fuzzy logic has been used for the pervious surface fraction, used at the beginning of the script to distinguish between urban and non-urban areas, with a threshold at 80% (this choice is a main source of inaccuracy). In the GIS LCZ map, as information on the Land Cover Types (LCZ A, B, C, D, E, F, and G) was missing, all these classes were grouped in a single class called Non-built areas and, for consistency, the same was done for the European LCZ map. The European LCZ map shows that LCZ 6 (Open low-rise) is the most abundant (35%), characterizing the city outskirts. The city center is characterized by LCZ 2 (Compact mid-rise), while all around the center and in the north-western part of the city there is the presence of LCZ 8 (Large low-rise), the latter corresponding to the industrial/commercial zone of Lecce city. When compared with the GIS LCZ map, it can be noted that the area covered by LCZ 2 is the same (6%), even though more irregularly distributed throughout the region. It is noteworthy that the LCZ 6 now covers 11% of the area; in fact, areas around the center are now mostly reclassified as LCZs 5 (Open mid-rise) and 6, while the LCZ 8 area is reduced and mostly appears in the northern/western part as in the European map. Finally, scattered areas of LCZ 9 (Sparsely built) appears.
To assess the classification accuracy of both the GIS-based and WUDAPT methods, a confusion matrix was produced by comparing the European map to the GIS map. As an example, the overall accuracy (OA) has been calculated as follows [20]:

Conclusions
Morphological analyses have been conducted for five cities characterized by different climate, structure, and size, located in southern (Lecce and Bari), central (Naples and Rome), and northern (Milan) Italy. The calculation of four morphological parameters, namely mean building height, sky view factor, aspect ratio, and plan area index, has been performed using the GIS software QGIS. The methodology employs only two sets of data as input, i.e., building heights and footprints, and is based on the creation of a vector grid covering the study area to compute zonal statistics thanks to the appropriate module of QGIS. This methodology can be replicable in other urban contexts.
Results have been presented in maps for selected regions of interest, including the cities and the respective surrounding areas. A gradual increase in the plan area index moving from the south to the north of Italy has been obtained. Following the geometrical structure of the city, Lecce and, to a lesser extent, Milan have shown a symmetrical spatial distribution of planar area index and mean building height, with higher values in the city centers. On the other hand, the two coastal cities, Bari and Naples, have shown higher values near the coast, with values gradually decreasing away from the coast. Rome is instead characterized by higher values in the city center and along spatial radii stretching in all directions.
In addition, the morphological parameters have been spatially averaged over three specific CLCs, confirming that CLC 1 areas (continuous urban fabric) have the highest urbanization, with medium sky view factor values and plan area index values typical of compact cities. At the same time, CLC 2 and CLC 3 areas have shown values typical of sparse cities.
Results suggest that it is possible to build datasets of morphological parameters averaged over a grid of a chosen resolution (100 m × 100 m in this case), which can be employed to calculate the parameters over wider areas. The methodology followed here can represent the first step towards (i) building LCZ maps of levels 1 and 2, as performed in this paper for the Lecce region, useful for UHI/microclimate studies; and (ii) developing parameterizations for mesoscale models to represent the dynamic effect of urban areas.