For testing the framework were selected ten Italian cities: Bari, Bologna, Bolzano, Cagliari, Florence, Genoa, Naples, Palermo, Turin, Trieste.
These cities were chosen to test how the framework performs in cities with different urban features. In fact, the selected cities have different histories, building types, and environments, which made them a good test to see how adaptable the method is.
For each city, a structured dataset was compiled based on census data from the 2011 population and building census dataset provided by the Italian National Statistical Institute (ISTAT). These data, collected at the level of subzones as the smallest territorial units used for statistical purposes, provided detailed information on population distribution, household composition, building typologies, and residential structures.
Population and building census data are collected every 10 years and released by the ISTAT; the last census dataset was carried out in 2021, but it does not include information on buildings due to the limited urban-residential development in Italian cities in the last decade. This led to the choice to use the previous census dataset from 2011. Since there have been no significant changes in residential built-up areas in Italian cities, urban changes since 2011 do not impact the framework’s applicability.
4.2. Cluster Characterization and Centroid Analysis
After determining the optimal number of clusters using the six validity indices described previously; FCM is executed, and each subzone was assigned to the cluster for which it had the highest membership value. To assign a building specificity to each cluster, a fuzzification was applied to the feature values of the cluster centroid. This process uses the Ruspini fuzzy partition in
Figure 3. given by three overlapping fuzzy numbers, called Low, Medium, and High.
Each feature value in the centroid was then assigned to the fuzzy set (Low, Medium or High) corresponding to the highest membership degree.
This fuzzy labeling process allows each cluster to be semantically described through the identification of the dominant building typologies and historical construction periods.
For example, a centroid with high values in load-bearing masonry and early construction periods is indicative of a historic urban fabric, while high values in reinforced concrete construction and recent construction periods suggest the presence of recent residential developments.
The cluster labeling process provides a detailed understanding of the spatial articulation of the built environment. Based on the combination of dominant features, each cluster was assigned a descriptive label that captures its most representative urban characteristics. The labels assigned to the clusters were assigned with the broad consensus of experts who participated in the testing activities.
Labels such as “Historical masonry residential area”, “Post-war reinforced concrete area” or “Contemporary reinforced concrete residential area” were adopted to summarize the results and facilitate the communication of spatial patterns to stakeholders and professionals. These semantic labels enrich both the interpretability of the clustering results and their subsequent cartographic representation through GIS-based thematic mapping, allowing for a deeper understanding of the urban morphology in the cities under study.
Finally, the Subzone dissolving process is performed; neighborhood subzones belonging to the same cluster are dissolved in an urban pattern; the thematic map of the urban patterns is generated.
Now the details obtained applying the proposed method to the city of Florence are shown and discussed. The city was segmented into four clusters; analyzing the values of the centroids of each cluster were assigned specific labels which semantically summarize the urban characteristics of each cluster.
Below, for each cluster, the results of the clustering and fuzzification processes are shown.
As can be seen from the results in
Table 4, Cluster 1 is predominantly characterized by high values both in E5—Residential buildings in load-bearing masonry (value: 0.2958, membership degree: High)—and E8—Buildings constructed before 1919 (value: 0.2803, membership degree: High)
At the same time, all other construction period variables (E9–E16) and structural typologies (E6: reinforced concrete, E7: other materials) fall within the Low fuzzy set, with very small or null values.
This suggests that Cluster 1 corresponds to the Historic masonry center of Florence, where the architectural fabric is primarily composed of masonry buildings built before the 1919th.
The results in
Table 5 indicate that all building features in Cluster 2 fall within the Low fuzzy membership set.
This suggests that this cluster corresponds to sparse or transitional residential development with low residential building density.
Table 6 presents the fuzzy labeling of the centroid for Cluster 3. This cluster shows a clear predominance of both E6—Reinforced concrete buildings with a High fuzzy membership (value: 0.2118) and E10 (1946–1960) and E11 (1961–1970) with Medium fuzzy memberships, indicating concentration of buildings constructed during the post-war period. All other variables fall within the Low fuzzy category.
This suggests a residential urban fabric developed primarily in the 1950s–1970s, dominated by reinforced concrete structures.
The fuzzification results of the features of Cluster 4, shown in
Table 7, reveal a clear predominance of buildings constructed with load-bearing masonry techniques, with a high degree of membership to the variable E5 (value: 0.1965). Furthermore, the most representative construction period is 1919–1945, as indicated by the high degree of membership associated with E9 (value: 0.3029), followed by the period 1946–1960 with a medium degree of membership, suggesting some post-war additions. These variables define the primary characteristics of this cluster, indicating a built environment composed largely of masonry structures developed during the interwar period and up to 1960. The remaining periods (from E11 to E16) and the structural categories show a low influence.
These urban features are located just beyond the historic center, forming a first suburban ring that preserves a compact and coherent morphological structure.
Given the structural and temporal attributes observed in the centroid, Cluster 4 was semantically labeled as Suburban Residential Area.
To better visualize the distribution of urban typologies across the city of Florence,
Figure 3 presents the final thematic map of the urban patterns, classified based on its cluster labels.
The map illustrates the spatial distribution of the four urban patterns identified in the city of Florence, based on the results of the fuzzy classification model.
In the map each cluster is associated with the corresponding semantic label; the spatial distribution of the patterns highlights the morphological differentiation within the urban fabric.
The Historic Masonry Nucleus (in green), located mainly in the central part of the city, delineates the oldest portion of the urban fabric. This cluster includes subzones where buildings constructed before 1919 with load-bearing masonry techniques are particularly widespread, reflecting the oldest portions of the urban settlement.
Adjacent to this area, the Residential Zone in Load-bearing Masonry (in blue) extends towards the eastern and north-eastern portions of the city. This zone includes buildings constructed mainly between 1919 and 1960, also with masonry techniques, and corresponds to development phases subsequent to those of the historic center.
The Reinforced Concrete Residential Zone (in red) is mainly located in the outer areas, particularly in the southern and south-eastern portions of the city. The buildings in this cluster were generally built between the 1940s and the 1970s and are characterized by the use of reinforced concrete, in line with post-war building practices.
The remaining areas of the city, assigned to the Peripheral Urban Zones cluster (in beige), are generally located at the urban fringes. These areas show lower values in all structural and temporal indicators of the buildings, suggesting a more heterogeneous or non-predominant pattern in terms of construction techniques and periods.
The results reflect the city’s urban development. Indeed, the subzones identified as Historic Masonry Core correspond to the city’s historic center, and the areas classified as Reinforced Concrete Residential zones are the areas of subsequent development that arose from the building boom following the end of World War II.
This process was performed for all the ten Italian cities; thematic maps of the urban patterns were generated for each city to spatially represent their spatial distribution. These maps allow for an immediate and intuitive reading of the morphological structure of urban settlements, providing visual insight into the spatial extent and concentration of homogeneous building typologies.
For brevity, below are shown the results obtained for three others Italian cities: Genoa, Naples and Turin.
Figure 4,
Figure 5 and
Figure 6 show, respectively, the urban pattern thematic maps obtained for the cities of Genoa, Naples and Turin. In each map, the classified urban patterns are displayed using a distinct color scheme, with legends reflecting the semantic labels derived from the fuzzy centroid analysis. These visualizations serve as practical tools for identifying areas with similar construction characteristics and can support targeted urban regeneration strategies, especially in contexts marked by complex stratifications of building age and technique.
Figure 4 visualizes the spatial distribution of urban pattern of Genoa, in which each cluster is associated with a semantic label that synthetically captures its predominant construction characteristics. The Historic Masonry Core (in red) is distributed mainly along the coastal strip, the subzones that identify this pattern are aligned along the central valleys and hillsides of the city. This cluster includes subzones characterized by buildings constructed before 1919 with load-bearing masonry techniques. The Reinforced Concrete Residential Zones (in blue) appear more dispersed and fragmented, following the post-war urban expansion that occurred in the 1960s. The remaining areas are classified as Peripheral Urban Zones (in beige), located mostly in peripheral or less consolidated parts of the municipality. These zones do not show a predominance of any specific building technique or historical period, suggesting a more heterogeneous urban structure.
The thematic map of the urban patterns of Genoa highlights the development of the city, which is a typical port city and former maritime republic. Development began in the port area and subsequently spread along the coast and into neighboring inland areas. The subzones classified as Peripheral Urban Zones are areas further from the port, where residential construction has been less intense.
Figure 5 displays the spatial distribution of the urban patterns identified in the city of Naples. The classification highlights the structural and historical layering of the built environment.
Like the thematic map of urban patterns for the city of Genoa, the one for the city of Naples highlights that the areas belonging to the Historic Mansory Core (in red) are those is concentrated in the oldest part of the city, particularly around the central and coastal areas. This cluster includes buildings mostly constructed before 1919 and built using traditional load-bearing masonry methods, reflecting the city’s historical urban core. The Reinforced Concrete Residential Zones (in blue) extend across various parts of the city, particularly in areas that underwent expansion during the 1960s and 1970s. The Peripheral Urban Zones (in beige) represent the rest of the city, often located in the outer margins of the urban territory. These zones exhibit a more mixed or less clearly defined building composition, without strong dominance of either specific structural types or time periods.
Unlike Genoa, where the areas distant from the city center are predominantly Peripheral Urban Zones, a significant number of peripheral areas of the city of Naples are classified as Concrete Residential Zones, Unlike Genoa, where the areas distant from the city center are predominantly Peripheral Urban Zones, a significant number of peripheral areas of the city of Naples are classified as X, as they underwent residential development during the decades of Italy’s economic boom in the 1960s and 1970s.
The spatial distribution of urban patterns in Turin is presented in
Figure 6.
The classification outlines three distinct urban typologies: the Historic Masonry Core (in red) extends concentrically from the central part of the city and includes portions of the built environment dating back to before 1919, as well as to the 1946–1960 period. This dual component reflects both the historical center and the masonry-based expansion that followed World War II. The fact that these subzones refer to two different time periods indicates that the buildings constructed in load-bearing masonry refer to two different construction periods, a time interval that reaches 1919 and a subsequent period between 1946 and 1960. Unlike other cities, Turin underwent a building evolution in the post-war period in the central areas but always adopted the load-bearing masonry construction technique.
What is most striking about the thematic map of the urban patterns of Turin is the fact that, unlike Florence and the two port cities of Naples and Genoa, Turin does not have a well-located ancient historic center, but rather a large central zone in which subzones belonging to the Historic Masonry Core are mixed with subzones classified as Reinforced Concrete Residential Zones (in blue) which underwent subsequent building development from 1946 to 1970. These subzones are widely distributed across the municipality. Finally, the Peripheral Urban Zones (in beige) comprise subzones with no dominant structural typology or construction period. They are mainly located in the eastern part of the city and may include mixed or less consolidated residential areas.
4.3. Comparison Results
Unlike DL and ML classification models, which cannot be used without massive training sets, the proposed unsupervised method does not require labeled datasets and is easily scalable and reproducible across different urban settlements. Indeed, it provides the classification of urban residential areas using building census data, which are available for each type of urban settlement. Furthermore, it facilitates better interpretation of the classification results through appropriate user-defined cluster labeling.
Then, since ML and DL algorithms are not applicable in these case studies as they require appropriate training sets that would be very expensive to build, the comparative tests were performed against urban form classifications carried out directly by pools of experts. To this end, a pool of domain experts, made up of two urban technology experts and two urban planners, was asked to assign the correct class from the set of classes obtained after the mapping process to a sample of different subzones in each city, given by about 10% of the residential sections, selected randomly. In order to compare the classifications obtained with those assigned by the experts, for each of the cities studied it was necessary to present the experts with the set of urban form classes obtained after mapping the resulting clusters. This was performed so that the experts’ assessments could be made starting from the same set of classes.
To evaluate the method’s performance, we used the Adjusted Rand Index (ARI) [
37], a measure that evaluates the similarity between cluster-based classifications of data points. The Cohen’s Kappa concordance index was used to measure the agreement between the classifications made by experts; it provides a value of 0.95, which implies significant agreement among experts and justifies the use of the ARI to assess the accuracy of the proposed method.
The ARI is calculated from the contingency table of the two classifications.
Let n be the sample size and L
1, L
2,…, L
C be the class labels obtained after the mapping process applied on the C clusters. The contingency table of two classifications is given by (
Table 8):
The ARI is given by
where n
ij is the number of objects in the sample assigned to the ith class by the first classifier and to the jth class by the second classifier. In the table, a
i represents the total number of objects assigned by the first classifier to the ith cass, and b
j represents the total number of objects assigned by the second classifier to the jth cass.
where in general, . It represents a measure of mean binary classification accuracy over pairs of object. The value of ARI oscilates between −1 and 1. The closer it is to 1, the better the similarity between the two classifications.
Table 9 shows the ARI values calculated for all 10 Italian cities analyzed.
The results in
Table 9 show that the classification performed by the proposed FCM-based method is very similar to that performed by experts, regardless of the type of urban settlement analyzed. The ARI values range in all cases between 0.93 and 1.00.