Next Article in Journal
Robust Geometry–Hue Point Cloud Registration via Hybrid Adaptive Residual Optimization
Previous Article in Journal
Dynamic Evaluation of Urban Park Service Performance from the Perspective of “Vitality-Demand-Supply”: A Case Study of 59 Parks in Gongshu District, Hangzhou
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Method for Recognizing I-Shaped Building Patterns Utilizing Multi-Scale Data and Knowledge Graph

1
Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
2
National-Local Joint Engineering Research Center of Technologies and Applications for National Geographic State Monitoring, Lanzhou 730070, China
3
Key Laboratory of Science and Technology in Surveying & Mapping, Lanzhou 730070, China
4
Department of Surveying and Mapping Engineering, College of Water Conservancy and Hydropower Engineering, Gansu Agricultural University, Lanzhou 730070, China
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2026, 15(1), 23; https://doi.org/10.3390/ijgi15010023
Submission received: 5 November 2025 / Revised: 28 December 2025 / Accepted: 30 December 2025 / Published: 4 January 2026

Abstract

Building pattern recognition is essential for understanding the dynamics of urban development, facilitating automatic map synthesis, and aiding in municipal planning Sefforts. Traditional research methods, which rely solely on geometric feature extraction from isolated objects, struggle to capture the complex and visually significant building patterns within urban environments, often suffering from low accuracy and robustness. This paper proposes a novel approach for recognizing I-shaped building patterns utilizing multi-scale data and knowledge graphs. The process begins by extracting inter-building relationships at and across different scales based on geometric and spatial rules derived from the Smallest Bounding Rectangle (SBR) representation, thereby establishing a comprehensive framework for recognizing I-shaped building patterns. This framework is encoded into a knowledge graph that translates specific scale-based and cross-scale recognition rules into conditions for knowledge graph reasoning. Utilizing rule-based reasoning within this framework, our method effectively identifies I-shaped building patterns that align with human visual principles. Experimental results underscore the efficacy of this approach, with significant enhancements in the recognition of I-shaped patterns being noted. Specifically, when compared to traditional methods that overlook multi-scale data and visual dynamics, our approach achieved a 24% increase in recall rate in Lanzhou and a 52.75% increase in London, while also Amaintaining high precision.

1. Introduction

Building patterns typically refer to visually significant structures formed by a single building or a group of buildings, which often contain rich spatial information and hold considerable social significance in modern urban planning and spatial analysis [1,2,3]. Beyond their geometric appearance, letter-shaped building patterns such as I-, C-, and E-shaped layouts reflect typical spatial organization logics in urban environments. These patterns are commonly associated with residential compounds, campuses, and industrial blocks, and are closely related to land-use intensity, internal circulation structures, and spatial enclosure characteristics. Therefore, identifying such building patterns provides useful intermediate-scale indicators for analyzing urban spatial structure, diagnosing development constraints, and supporting planning decisions regarding future urban expansion [4]. In the construction of smart cities, the automated identification and analysis of building patterns offer a scientific basis for urban decision-making [5]. Automatically extracting, simplifying, and retaining the core information of building patterns enables a more intuitive representation of significant building features in geographic space [6,7,8]. Thus, building pattern recognition is also essential for enhancing the efficiency and accuracy of map synthesis.
Contemporary research on building pattern recognition broadly categorizes into two main approaches: shape recognition for individual buildings and arrangement recognition for clusters of buildings. Shape recognition focuses on analyzing geometric features and parameters of buildings, employing techniques such as corner functions [9], Fourier descriptors [10], and graph convolutional auto encoders [11,12]. Conversely, arrangement recognition leverages template matching [13], structural rules [14], and machine learning techniques [15] to decode the layouts of building groups. Particularly in contemporary research, I-shaped building patterns, noted for their distinct geometric configurations and significant spatial relationships, have emerged as a focal area of study. Their representation of a type of building pattern that is commonly found in real cities and possesses distinct geometric structures, making it highly visible and structural within urban layouts. Additionally, focusing on this specific pattern enables a deeper exploration of the performance and advantages of recognition methods when applied to building groups with evident geometric relationships, with the potential for subsequent extension to the recognition of other building patterns [14]. Nonetheless, traditional techniques primarily focus on the geometric attributes of individual objects, which, while effective at a certain level, often lack the precision and robustness needed for recognizing complex building groupings in urban settings and across varied spatial scales. As illustrated in Figure 1, existing building pattern recognition methods can successfully identify I-shaped patterns that exhibit clear and stable structural characteristics at a single scale, such as the cases shown in Figure 1a,c. In contrast, the patterns shown in Figure 1b,d rely on building configurations that are distributed across multiple scales, where the I-shaped structure is fragmented or ambiguous when observed at any individual scale. As a result, these patterns are difficult to reliably identify by conventional single-scale methods. In this article, the patterns in Figure 1b,c are referred to as potential architectural patterns.
Multi-scale data, which partially reflect human insights into the clustering and segmentation of building groups [16,17,18], can facilitate the identification of latent building patterns that conform to human visual principles. For instance, in the case where the buildings in Figure 1a,c are identified as I-shaped building patterns, the potential I-shaped building pattern in Figure 1b can be recognized through the “one-to-many” relationship between the buildings in Figure 1a,b, or the “many-to-one” relationship between the buildings in Figure 1c,b. Similarly, the potential building pattern in Figure 1d can be recognized using the same method. The use of multi-scale data effectively addresses the limitations of single-scale data in complex pattern recognition. By fusing cross-scale information, it becomes possible to capture a broader range of potential patterns. In urban architecture and cartography, researchers have integrated multi-source, multi-scale building data with geometry-based matching methods and geospatial ontologies to model multi-scale building representations and spatial relationships [19]. Related studies have also investigated changes in building representations induced by map synthesis, distinguishing temporal updates from representational changes across scales [20].
However, multi-scale building data are inherently heterogeneous and structurally complex, which poses significant challenges for their integrated analysis. Existing approaches based on spatial databases are mainly designed for static data storage and geometric queries, and thus have limited capability to represent complex spatial semantics, manage multi-scale relationships, or support dynamic reasoning. As a result, effectively integrating and reasoning over building data across different scales remains difficult using conventional spatial database technologies.
Knowledge graphs (KGs), as graph-based structures for modeling multi-relational networks, are well-suited for representing both the geometric features and semantic relationships of buildings. By organizing entities as nodes and relationships as edges, KGs support explicit and interpretable rule-based reasoning, enabling dynamic inference with fully traceable reasoning processes. These properties make KGs particularly effective for integrating heterogeneous multi-scale building data and reasoning about complex spatial relationships [21,22].
Consequently, this paper utilizes a knowledge graph to articulate the multi-scale shape features, spatial relationships, and semantic attributes of buildings, thus introducing a novel approach for recognizing I-shaped building patterns within a cohesive framework. This method uses knowledge graphs to discern I-shaped arrangement patterns within building clusters and utilizes recognition outcomes from multi-scale data to enhance the identification of potential I-shaped building configurations. Moreover, individual buildings characterized by I-shaped designs have previously been identified through human visual recognition techniques.
The rest of this paper is organized as follows: In Section 2, we introduce related work and concepts. Section 3 details the knowledge graph construction process for recognizing I-shaped building patterns. Section 4 outlines how rule-based reasoning is applied using the constructed knowledge graph. Section 5 presents the experimental results following the workflow. Section 6 explores related topics. Finally, Section 7 offers a conclusion to the paper.

2. Related Work

2.1. Building Pattern Recognition

Building shape recognition focuses on characterizing individual buildings using geometric descriptors such as orientation [23], compactness [24], and isoperimetric quotient [25]. Various methods, including principal component analysis [26], geometric descriptor selection [27], turning functions [9], shape context [28], Fourier descriptors [10], and rectangular coding [29], have been proposed to classify buildings into recognizable geometric forms. Advanced AI techniques, including graph convolutional autoencoders [11] and CNN-based polygon encoders [30], are also increasingly used in shape recognition tasks. In recent years, deep learning-based approaches have made significant progress in building footprint extraction and related urban spatial analysis, particularly in learning building-level representations from large-scale data [31,32,33]. However, these methods primarily focus on building footprints or aggregated urban indicators and generally do not explicitly model pattern-level configurations.
Collectively, these methodologies effectively categorize individual buildings into recognizable shapes; however, shape recognition constitutes only one component of building pattern recognition, which also requires arrangement recognition to identify structured patterns formed by groups of buildings.
Recognition of building arrangement patterns involves categorizing groups of buildings into defined configurations, such as linear, grid, and letter-like forms. This process requires establishing clear distribution rules for these groups. There are primarily three recognition techniques: template-based [1,13,34], machine learning-based, and structural rule-based. The template-based methods utilize predefined templates to standardize patterns, which are effective in recognizing linear or composite formations. However, the rigidity of this approach can limit its ability to capture all possible patterns. Alternatively, machine learning methods, including random forests and graph convolutional neural networks [11,15,35], enable the automatic learning of pattern rules. While these techniques are potent, they demand a considerable amount of prior knowledge and can obscure the logic behind their data integration, potentially limiting broader applicability.
In contrast, methods based on structural rules define and detect patterns using perceptual principles such as proximity graphs and Gestalt principles, and have been widely applied in building arrangement recognition due to their flexibility and interpretability. Representative studies have employed relative neighborhood graphs (RNG) [36], minimum spanning trees [37], and Delaunay triangulation–based (DT) proximity graphs to detect linear, grid, collinear, parallel, and curvilinear building patterns [38,39,40]. More recent work has further incorporated spatial reasoning and building decomposition strategies to enhance the recognition of structured building arrangements [14,29].
In summary, methods based on structural rules are widely adopted due to their flexibility and ability to accommodate user-specific requirements. However, proximity graph–based approaches often rely on extensive search and matching operations, which limit their scalability and may not comprehensively identify all latent building patterns. Given the influence of scale variations on building morphologies in maps, utilizing knowledge graphs to integrate multi-scale data, while building on structural rule approaches, can significantly improve the accuracy and efficiency of building pattern recognition.

2.2. Knowledge Graph and Rule-Based Reasoning

In recent years, the utilization of knowledge graphs for expressing spatial data and modeling complex interactions has increasingly attracted attention [21]. By depicting spatial relationships between buildings through a graph structure, knowledge graphs facilitate robust decomposability during their construction [41]. This process involves initially breaking down the graph into fundamental nodes and relationship types. The subsequent application of rules to these elements enables the execution of inference calculations that generate new insights. Such inferred knowledge is instrumental in advancing the automation of map synthesis processes [42], offering fresh perspectives and tools for building pattern recognition and spatial reasoning. These capabilities are particularly advantageous in managing and analyzing multi-scale spatial datasets, showcasing significant strengths in these complex tasks [22]. This methodological approach not only improves the accuracy and efficiency of spatial data handling but also enhances the scalability and applicability of spatial analysis in various urban and environmental contexts.
Recent studies have explored the use of knowledge graphs for building arrangement recognition. For instance, Wei et al. applied rule-based reasoning on knowledge graphs to identify linear building patterns [43], while Tang et al. constructed geographic knowledge graphs to recognize linear and grid patterns and analyze the sensitivity of recognition results to parameter settings [44]. However, most existing studies concentrate on recognizing patterns like linear or grid arrangements at a single scale. Research that delves into the application of multi-scale data in building pattern recognition is scant, particularly for more complex configurations such as I-shaped patterns. Additionally, from a methodological perspective, deep learning (DL) approaches provide a powerful solution for building-related tasks when large-scale annotated datasets are available. However, most DL-based methods primarily lack explicit representations of spatial relationships and multi-scale interactions. In contrast, the knowledge-graph-based framework emphasizes relational modeling, rule-based reasoning, and interpretability, which are particularly suited for structured building pattern recognition under multi-scale settings. Rather than competing with DL-based approaches, the proposed method is complementary, offering a data-efficient solution for pattern-level analysis. The integration of DL-based feature extraction with knowledge-graph-based reasoning represents a promising direction for future research. The representative research on architectural shape and pattern recognition is shown in Table 1 below.
Consequently, this paper is dedicated to utilizing knowledge graphs to integrate multi-scale data for the identification of I-shaped building patterns, aiming to precisely and comprehensively discern latent pattern characteristics within building groups.

3. Construction of a Knowledge Graph for Recognizing I-Shaped Building Patterns

3.1. Defining Spatial Relationships in I-Shaped Building Group Arrangements

The spatial arrangement of I-shaped building patterns typically comprises three distinct buildings: one centrally located building flanked by two others, forming the I-shape. These side buildings are both parallel to each other and perpendicular to the central building. The definition of an I-shaped arrangement includes “proximity”, “exact parallelism”, and “perfect perpendicularity” between the buildings. To refine these definitions, this paper introduces the concept of “relative positional relationships” to more precisely delineate the spatial positioning of each building relative to others. Moreover, given that the central positioning of the middle building is crucial for pattern recognition quality, a “center distance constraint” is applied to enhance the alignment of recognition outcomes with human visual perception principles.

3.1.1. Proximity Relationships

In the context of building groups, this study addresses the proximity relationships between spatially adjacent yet unconnected buildings using the Constrained Delaunay Triangulation (CDT) skeleton method. This approach employs shared boundaries or corners to establish the proximity relationships between two buildings on a map. As illustrated in Figure 2a, buildings sharing the same CDT skeleton are considered adjacent. Subsequently, a proximity graph is constructed to visually represent these relationships, as depicted in Figure 2b. Within the knowledge graph, the proximity between building Bi and Bj is represented by a ‘Neighbor’ type relationship between the respective entities.

3.1.2. Relative Positional Relationships

This study employs the Smallest Bounding Rectangle (SBR) method to describe the relative positions between two buildings, denoted as the reference building (Ba) and the target building (Bb), with their corresponding SBRs referred to as SBRa and SBRb, respectively [14]. By orienting the longer and shorter axes of SBRa as the X and Y axes, respectively, the eight-directional neighborhood of SBRa is established, as shown in Figure 3. This framework allows for the analysis of Ba and Bb’s relative positions by evaluating where SBRb is located within SBRa’s designated eight-directional neighborhood.
When SBRb is projected onto the long or short axis of SBRa, their spatial relationship can be characterized using 13 fundamental relative positions. These relations include before, meets, overlaps, starts, during, finishes, equal, and their corresponding inverse relations, forming a set of 13 mutually exclusive and jointly exhaustive spatial relations. In this study, these relations are adopted to discretize and model the relative positional relationships between building entities, thereby facilitating rule-based reasoning within the knowledge graph. Given that SBRb may overlap projections on both SBRa’s long and short axes, the relative positions between the two can be denoted as ri,j (a,b). This notation represents the 169 potential directional relationships where the target building Bb is positioned relative to the reference building Ba, as shown in Figure 4. In this figure, the color orange indicates the reference building Ba, while blue signifies the target building Bb.
In this study, buildings with overlapping geometries, as highlighted by the red boxes in the diagram, are excluded from the relative positional analysis due to the ambiguity they introduce in defining spatial relationships. Moreover, buildings with excessively complex shapes are generally not considered suitable for forming regular patterns. Before approximating a building’s shape using SBR, an assessment of the building’s shape complexity is performed by calculating its rectangularity, defined as the ratio of the building’s area to the area of its SBR. A rectangularity greater than 0.6 suggests that the building’s shape approximates a rectangle, which implies simplicity. Buildings with a lower rectangularity, indicative of greater shape complexity, are excluded from the analysis of relative positional relationships and are not considered part of the I-shaped building pattern formation [5].

3.1.3. Fully Parallel Relationships

Fully Parallel Relationships occur when two buildings, Ba and Bb, not only share similar dimensions and shapes but are also aligned parallel to each other along their main orientation, with their long axes partially overlapping [14]. To quantify the extent of this overlap more precisely along the long axes of Ba and Bb, this study introduces the long axis overlap ratio (LR), as described in Equation (1). This ratio is critical for assessing the degree of alignment necessary to classify the relationship as completely parallel.
L R ( B a , B b ) = O v e r l a p ( l a b ) / M e r g e ( l a b , l a )
In this formula, LR(Ba, Bb) quantifies how much the projection of the target building Bb on the reference building Ba’s long axis overlaps with Ba’s axis itself, with ‘Overlap’ indicating the intersection area and ‘Merge’ indicating the combined area.
The criteria for a fully parallel relationship are outlined in Equation (2).
F u l l _ P a l ( a , b ) = S i m A = A a / A b 1 θ 1 P a l O = | O a O b | θ 2 L R ( B a , B b ) θ 3 r ( B a , B b ) { r 1 , j , r 2 , j , r 12 , j , r 13 , j } j = 4 10
In this formula, A is the building area, O is the primary orientation of the building’s SBR, LR is the Long Axis Overlap Ratio, and θ1, θ2, θ3 are the thresholds that determine the precise conditions under which the relationship is considered fully parallel. The ratio Aa/Ab is a positive value, and when this ratio approaches 1, it indicates that the areas of the two buildings are more similar. Subtracting 1 shifts the ratio so that perfect area similarity corresponds to SimA = 0. Taking the absolute value ensures that SimA is non-negative and symmetric with respect to Aa and Ab.

3.1.4. Full Perpendicular Relationships

A full perpendicular relationship is established between buildings Ba and Bb when their principal directions are perpendicular to each other, and the projection of Bb on Ba aligns precisely with Ba’s long axis. This condition is represented as Full_Per(Ba, Bb), indicating a strictly unidirectional relationship from the reference building Ba to the target building Bb. The definition of this relationship is specified in Equation (3).
F u l l _ P e r ( B a , B b ) = P e r O = 90 | O a O b | θ 2 r ( B a , B b ) { r i , 5 , r i , 6 , r i , 7 , r i , 10 } i = 1 , 2 , 12 , 13
In this formula, O is the primary orientation of the building’s SBR, PerO is used to quantify how close the orientations of two buildings are to being perpendicular. When the angle difference between Oa and Ob approaches 90°, the value of PerO decreases and approaches zero, indicating that the two buildings are nearly perpendicular in orientation. When the angle difference deviates from 90°, the value of PerO increases, reflecting a weaker perpendicular relationship between the two buildings.

3.1.5. Center Distance Constraint

The I-shaped building pattern is characterized by parallel wing buildings with a central building that is perpendicular to these wings and positioned at their midpoint. The centrality of this middle building is an essential factor that influences its recognition as part of an I-shaped pattern, based on human visual perception principles. This paper establishes a center distance evaluation system by measuring the relative distance between the centroid of the middle building and that of the entire building group, as specified in Equation (4).
C e n _ D ( A , B 1 , B 2 ) = | d i s ( A , B 1 ) d i s ( A , B 2 ) 1 | + ( d i s ( A , B 1 ) + d i s ( A , B 2 ) d i s ( B 1 , B 2 ) 1 ) θ 4
In this formula, A is the central building, while B1 and B2 are the wing buildings on either side. dis(x,y) denotes the Euclidean distance between the centroids of buildings x and y. This distance is normalized and expressed in a relative form, and the subtraction of 1 is applied to shift the ideal distance condition to zero. As a result, smaller values indicate that the spatial separation between A and B1 is closer to the expected configuration, while larger values reflect a greater deviation from this ideal distance.

3.2. Definitions of Building Matching Relationships at Different Scales

Map data at smaller scales is commonly derived through the generalization of larger-scale data, essentially an information abstraction process. When buildings from a large-scale map are updated to a smaller-scale map, they might be simplified in shape, merged with other buildings, or even removed entirely [1]. Such changes can lead to substantial discrepancies in the shape and size of the same building on different scales, or result in its absence on a smaller scale. To address this, this study defines six matching relationships to associate the same building entities across scales: one-to-one (1:1), one-to-many (1:M), many-to-one (M:1), many-to-many (M:M), one-to-none (1:0), and none-to-one (0:1), as detailed in Figure 5. These relationships facilitate the consistent representation of buildings despite scale-driven alterations in their depiction.
The determination of matching relationships between corresponding buildings at different scales employs methodologies outlined in recent studies [19,45]. Utilizing Equation (5), the overlap ratio, Roverlap, is calculated efficiently. This ratio is crucial for identifying whether two instances of a building, Bb and Bs, at different scales represent the same structure. Specifically, following previous studies [19], if the Roverlap between Bb and Bs is 0.3 or higher, they are considered to represent the same building at different scales. And the type of matching relationship is inferred by the number of corresponding matching nodes between nodes at different scales in the knowledge graph.
R o v e r l a p = A ( B b B s ) M i n ( A b , A s )
In this formula, Ab and As represent the areas of buildings Bb and Bs, respectively, while A(BbBs) denotes the area where Bb and Bs overlap.

3.3. Construction of the Knowledge Graph

Compared to the proximity graph model, the attribute graph model has shown greater prominence and broader adoption in knowledge graph applications, as noted by [46]. This paper utilizes the attribute graph model in the Neo4j graph database to describe the graph structure of building groups with nodes and edges, representing the attribute graph of the building group as: Graph = (B, R).
In this framework, B represents a set of nodes, defined as B = {B1, B2, …, Bn}. Each node corresponds to an independent building entity, and these nodes are categorized based on their scale with labels such as “BuildingBig” for large-scale buildings and “BuildingSmall” for smaller-scale structures. Each building entity within the graph is detailed with four key attributes: “bID” which serves as a unique identifier for each building; “Area” which specifies the building’s total surface area; “Orientation” which describes the principal direction that the building faces; and “Pattern” which indicates whether the building is part of an I-shaped building arrangement.
Represents a set of edges, R = {Rm(Bi, Bj), Bi B, Bj B}, where R(Bi, Bj) indicates the relationship between entities Bi and Bj. The proximity between buildings A and B is represented in the knowledge graph as a relationship between entities Ba and Bb, denoted as Ba-(Neighbor)-Bb. The relative position relationship is represented as a unidirectional relationship from the reference building entity Ba to the target building entity Bb, denoted as Ba-(Position)->Bb. This relative position relationship has two relationship attributes: “P_type” indicates the type of relative position relationship; “LR” represents the degree of overlap of the longitudinal axes of the two buildings. From the size, main direction, proximity, and relative position relationships between buildings, the bidirectional fully parallel relationship and the unidirectional fully vertical relationship between building entities Ba and Bb are inferred, represented as Ba-(Full_Pal)-Bb and Ba-(Full_Per)->Bb, respectively. Additionally, the center distance between building entities can be deduced from the centroid distance between the buildings, with the centroid distance between entities Ba and Bb represented as dis(Ba, Bb).
The matching relationships between buildings across different scales can be represented as a “Match” relationship with an attribute “M_type”, where the value of “M_type” corresponds to the type of relationship, including one-to-one (1:1), one-to-many (1:M), many-to-one (M:1), and many-to-many (M:M) types. However, the previously mentioned one-to-zero (1:0) and zero-to-one (0:1) relationships are not required to be explicitly represented in the knowledge graph.
Entities and relationships in the knowledge graph are shown in Table 2. This paper utilizes Neo4j to store the knowledge graph and uses the Cypher query language to define rules for building pattern recognition, thereby enhancing rule-based reasoning.

4. Rule-Based Reasoning for Recognizing I-Shaped Building Patterns

This paper utilizes rule-based reasoning within the established knowledge graph to identify I-shaped building patterns, a process divided into two key steps. Initially, the method identifies I-shaped patterns at specific spatial scales. Subsequently, it integrates multi-scale data into the recognition process, with the goal of improving the overall effectiveness of pattern identification. This approach not only allows for precise pattern detection at individual scales but also enhances the accuracy and reliability of the results by considering variations across different scales, ultimately achieving a more robust pattern recognition outcome.

4.1. Recognizing I-Shaped Building Patterns at a Single Spatial Scale

This paper utilizes the Neo4j graph database to host the knowledge graph and employs the Cypher language to articulate rules crucial for reasoning and recognizing I-shaped building patterns at designated spatial scales. The identification process involves discerning key relationships such as Neighbor, Full_Pal (Fully Parallel), Full_Per (Fully Perpendicular),and the Cen_D (Center Distance). Essential prerequisites, as specified by Equations (2)–(4), include verifying similarities between building entities, assessing parallelism or perpendicularity in their principal orientations, and measuring centroid distances. These steps facilitate the logical deduction of Fully Parallel and Fully Perpendicular relationships, alongside the Center Distance constraint, which are essential for constructing an accurate I-shaped building pattern recognition framework, as demonstrated in Equation (6).
It is worth mentioning that the specific steps for inferring the Center constraint based on the centroid distances between building entities are as follows: (1) Assume there are three buildings Bm, B1, B2, and infer the possibility of forming an I-shaped pattern based on their proximity relationships, relative positions, and relationships of being fully parallel and fully perpendicular. (2) Calculate their center distances using the distances dis(Bm, B1), dis(Bm, B2), and dis(B1, B2), as specified in Equation (4). (3) Compare the calculated center distance with a threshold; if it is below the threshold, then modify the attribute “Pattern” of the three entities to “True”, indicating that the building entities form an I-shaped pattern after meeting the center distance constraint. Otherwise, they are deemed not to form an I-shaped pattern under human visual recognition principles. Importantly, the center distance constraint is not explicitly represented in the knowledge graph.
I _ P a t t e r n ( a , b 1 , b 2 ) = N e r i g h b o r ( a , b 1 ) N e r i g h b o r ( a , b 2 ) F u l l _ P a l ( b , b 2 ) F u l l _ P e r ( a , b 1 ) F u l l _ P e r ( a , b 2 ) C e n _ D ( a , b 1 , b 2 )
Consequently, the rule-based reasoning process for identifying I-shaped building patterns comprises four sequential steps: (1) Validate relationships concerning area similarity (SimA), parallel orientation (PalO), and perpendicular orientation (PerO); (2) Deduce relationships that are Full_Pal and Full_Per; (3) Confirm compliance with the center Cen_D; (4) Recognize and confirm the I-shaped building pattern configuration. This methodical approach is illustrated in Figure 6.

4.2. Integrating Multi-Scale Data in Recognizing I-Shaped Building Patterns

Research into enhancing pattern recognition with multi-scale data encompasses two primary strategies. The first strategy involves transitioning from larger to smaller scales, essentially treating groups of buildings at larger scales as decomposed into their component structures at smaller scales. For instance, a building or group of buildings identified with an I-shaped pattern at a larger scale may suggest the potential for similar patterns among corresponding smaller-scale buildings [17]. For example, if three buildings (such as B3, B4, B5) at a larger scale are recognized as forming an I-shaped pattern, these may represent a breakdown into constituent buildings at a smaller scale. Specifically, B5 corresponds directly to B2 in a one-to-one (1:1) relationship, while B3 and B4 collectively relate to B1 in a many-to-one (M:1) relationship, potentially leading B1 and B2 to manifest an I-shaped pattern at a smaller scale. This method of enhancing recognition from larger to smaller scales is depicted in Figure 7b, emphasizing the dynamic flow and transformation of building patterns across different scales.
The second approach involves scaling from smaller to larger scales, whereby a building group at a smaller scale is considered as an aggregation of corresponding larger-scale structures. If a group of buildings is identified with an I-shaped pattern at a smaller scale, it suggests that the associated group at a larger scale might also exhibit an I-shaped pattern [13]. For instance, at a smaller scale, if three buildings (such as B3, B4, B5) are recognized as forming an I-shaped configuration, their corresponding entities at a larger scale could be their aggregated counterparts, still likely to form an I-shaped pattern. Specifically, B3 is linked to B6 and B7 in a many-to-one (M:1) relationship, whereas B4 and B5 are each linked to B8 and B9 in one-to-one (1:1) relationships, respectively. This setup suggests that at the larger scale, buildings B6, B7, B8, and B9 could also establish an I-shaped pattern. This process of enhancing recognition from smaller to larger scales is demonstrated in Figure 7c, effectively illustrating how scale transitions influence pattern identification.
Therefore, the process of leveraging multi-scale data to enhance pattern recognition can be delineated into three primary steps: Firstly, cross-scale matching relationships are established for buildings that have been identified as forming I-shaped patterns at specific scales. Secondly, the relationships between these buildings and their corresponding entities at other scales are automatically assessed through rule-based reasoning within the knowledge graph, where the enhancement strategy (large-to-small or small-to-large) is determined based on predefined cross-scale matching types (e.g., one-to-one, one-to-many, many-to-one, and many-to-many), without manual intervention. Finally, the selected enhancement strategy is applied to identify potential I-shaped patterns across scales. This cross-scale matching and pattern transformation process improves the overall accuracy and effectiveness of building pattern recognition.

5. Experiments and Analysis

5.1. Dataset

The experimental data is sourced from the National Basic Geographic Information Database of China, including buildings from the Lanzhou City (LZ), area at spatial scales of 1:10,000 and 1:5000. Although only two vector scales are used in this study due to the limited availability of reliable multi-scale cadastral datasets, they are sufficient to fully instantiate and validate the proposed cross-scale enhancement mechanism, which operates bidirectionally (large-to-small or small-to-large) through rule-based reasoning over predefined cross-scale matching types.
To assess the accuracy and robustness of the experimental method in pattern recognition across urban environments of varying complexity, buildings from London (LD) were also used as a control group, as shown in Figure 8.The selected datasets are categorized into two scale levels, which correspond to two spatial resolutions: large-scale data (SL) at approximately 1:5000 and small-scale data (SS) at approximately 1:10,000. The dataset from Lanzhou encompasses about 2 square kilometers and includes 654 buildings at the larger scale and 395 at the smaller scale. Conversely, the London dataset covers about 1.6 square kilometers and comprises 1343 buildings at the larger scale and 724 at the smaller scale. These datasets provide a comprehensive basis for evaluating the method’s effectiveness in identifying Building patterns within significantly varied urban settings.
The dataset encompasses buildings with a diverse array of shapes, sizes, and orientations, distinctly partitioned by roads, presenting a complex array of relative positions among the structures. Within this dataset, the majority of buildings at both considered scales typically feature fewer than 8 edges, and most possess high simplicity in form, as indicated by a significant measure of rectangularity (often greater than 0.7). The comprehensive analysis of building features, as demonstrated in Figure 9, highlights that complex buildings—characterized by having more than 10 edges and a rectangularity of less than 0.6—make up a minor portion (marked in red in Figure 9). This implies that most buildings in the research area are straightforward in shape, and readily approximated by the SBR, which facilitates the identification of I-shaped patterns. The diversity in orientation and spatial relations suggests a potential for more intricate, “composite” building patterns such as I-shaped configurations, thus enhancing the ability to effectively assess the efficacy of the methodologies described in this study.

5.2. Parameter Settings

The building pattern recognition methodology employed in this study is based on Gestalt principles, with visual variables such as position, shape, size, orientation, and center distance. Variations in the threshold values of these parameters will influence the recognition results differently. As defined in Section 2 of this paper, the spatial relationships between buildings at a given scale are initially extracted. Based on a comprehensive consideration of experimental analysis and relevant literature [14,27], the threshold values in Equations (2) and (3) are determined as follows: θ 1 = 2, θ 2 = 15, θ 3 = 0.4.
At the same time, in order to define the center distance constraint, the performance of the model at different center distance thresholds is evaluated using the ROC-AUC metric under the three threshold settings mentioned above. A set of 100 threshold values ranging from 0 to 1 with a step size of 0.01 is used, and the ROC-AUC curve is plotted, as shown in Figure 10, with an AUC value of 0.90. The threshold θ 4 in Equation (4) is selected as 0.2, which satisfies the human visual principle.
The comparison of recognition effects when threshold 4 is 0.1, 0.2, and 0.5 is shown in Figure 11.

5.3. The Process of Constructing the Knowledge Graph

The building relationships within and across different scales are mapped and represented as a knowledge graph, which is maintained within Neo4j version 4.4.26. An illustrative example of this can be seen in Figure 12, which shows the knowledge graph constructed using data from London City. To facilitate the recognition of I-shaped patterns, the Cypher query language is utilized to perform rule-based reasoning based on the arrangement analysis rules described in Section 4.1.
Using this approach, I-shaped patterns are first identified at specified scales through rule-based verification of building arrangements. Based on these recognized patterns, the subsequent enhancement recognition (ER) further explores potential I-shaped configurations at other scales by leveraging the cross-scale matching relationships represented in the knowledge graph. This process enables comprehensive pattern detection and analysis by combining rule-based reasoning with cross-scale information propagation supported by Neo4j.

5.4. Analysis of Experimental Results and Evaluation of Methodology

The I-shaped patterns identified through the method proposed in this study are depicted in Figure 13. A total of 82 groups of I-shaped patterns were successfully detected across two regions.
To assess the efficacy of the proposed method, outcomes are analyzed across four scenarios: (1) Results from recognizing individual building shapes and their arrangements (SR + AR); (2) Integrated analysis including shape recognition, arrangement assessment, and center distance evaluation (SR + AR + CR); (3) Comprehensive analysis incorporating shape recognition, arrangement determination, and multi-scale enhancement (SR + AR + ER); (4) A full suite of analyses combining shape recognition, arrangement verification, center distance checks, and multi-scale enhancement (SR + AR + CR + ER).
In addition, comparative experiments were conducted on the same dataset using the template matching method (TM) simultaneously. The TM follows the canonical definition of I-shaped building patterns reported in the literature [34] and employs a corresponding template consisting of two parallel wing buildings and one perpendicular central building. A graph-based matching approach was applied to identify building patterns conforming to this template. The results of both the template matching method and artificial visual recognition are shown in Figure 14.
These analyses are benchmarked against human visual recognition outcomes sourced from seven graduate and doctoral students in cartography and geographic information systems, well-versed in how buildings may be decomposed or aggregated to form recognizable patterns. Inconsistencies in recognition are resolved through a voting mechanism. It should be noted that, although the National Basic Geographic Information Database provides authoritative building footprint and cadastral information, it does not explicitly include ground-truth annotations for building pattern configurations (e.g., I-shaped building groups). As a result, such authoritative datasets cannot be directly used as reference ground truth for building pattern recognition. In this context, expert-based visual interpretation is adopted as the gold-standard reference, which is a common and accepted practice in studies on building pattern recognition and cartographic generalization [29,43]. Based on this human visual interpretation process, a total of 50 I-shaped building patterns were identified in the Lanzhou dataset, and 182 I-shaped building patterns were identified in the London dataset as the gold standard. In the reported results, matches with human recognition are indicated as True positives (Tr), discrepancies as False positives (Fr), and omissions as Misses (Mr). It should be noted that for each method configuration, the sum of true positives (Tr) and misses (Mr) equals the total number of gold-standard I-shaped patterns, whereas the number of false positives (Fr) is not fixed and depends on the number of incorrectly identified candidate structures. The effectiveness of the recognition method is quantified by computing precision and recall, utilizing a formula referenced from Zhang [37]. Table 3 below summarizes the effectiveness of different recognition methods applied to datasets from Lanzhou (LZ) and London (LD).
From the data presented in Figure 13 and Figure 14, and Table 3, it is clear that the integration of multi-scale data has substantially enhanced the recall rates. Specifically, the recall rate for Lanzhou City’s data increased by 24%, and for London City’s data by 52.75%, compared to methods that did not account for multiple scales. This improvement suggests that utilizing multi-scale data for pattern enhancement significantly facilitates the detection of more potential building patterns, particularly in London City, where the high density of buildings and complex building forms pose significant challenges. This achievement can be ascribed to the complex spatial relationships often present in urban environments, which promote the emergence of numerous potential building patterns. The approach based on multi-scale data exhibits an enhanced capability to identify these patterns, effectively addressing the deficiencies of traditional methods in handling areas with dense construction and complex building forms.
Furthermore, according to the Gestalt principles of human visual recognition [47], which emphasize the importance of proximity, similarity, continuity, closure, and good form in recognized objects, conventional methods that are weakly linked to physical entities often fall short in identifying all potential patterns. For instance, as illustrated in Figure 13, the I-shaped patterns G2 and G6 cannot be recognized at a larger scale, and G5 and G8 at a smaller scale, while G1 and G4 are only partially recognizable at a certain scale. However, with the application of pattern enhancement techniques, the methodology developed in this study is able to more comprehensively recognize these potential I-shaped patterns, demonstrating its effectiveness and adaptability in complex urban settings.
The unique structural characteristics of I-shaped patterns mean that the positioning of the central building relative to the overall pattern’s center critically affects whether recognition results are consistent with human visual perception. For instance, in Figure 13, the building groups G3 and G7 adhere to the rule-based definitions of I-shaped patterns but do not meet the criteria for human visual recognition. To address this, this study introduces a center distance evaluation system, which has proven effective in enhancing the precision of I-shaped pattern recognition. Implementing this system resulted in a significant improvement in precision: for the data from Lanzhou City, precision increased by 22.45%, and for London City, by 26.98%.
This improvement underscores the importance of considering spatial relationships and central alignment in building pattern recognition, particularly in complex urban settings where conventional rule-based approaches might overlook subtleties recognized by human observers. The center distance evaluation system offers a quantifiable means to align rule-based recognition more closely with human visual standards, thereby bridging a critical gap between automated processes and intuitive human judgments.
Comparison experiments with traditional methods demonstrate that the method presented in this paper successfully tackles the issue of low recall rates in recognizing typical letter-shaped buildings using traditional template matching and similar approaches. When compared to the Lanzhou dataset, the recall rate of traditional template matching methods declined by 18.79% in the London dataset, where regional building density and building form complexity are higher. In contrast, the method proposed here significantly enhances the recognition recall rate while maintaining a high level of precision, and achieves strong recognition results in both datasets.
The outcomes discussed confirm that the method presented in this study effectively identifies the majority of I-shaped patterns within the experimental regions, with recognition results that align closely with human visual perception. The application of this method across two distinct datasets has yielded high precision and recall rates, further underscoring its precision and robustness.

6. Discussion

6.1. Misrecognition Due to Scale Variability

The approach we have developed leverages a knowledge graph to integrate data across multiple scales, significantly improving the precision and adaptability of Building pattern recognition. By associating data from various scales, our method captures the complex, multi-dimensional characteristics of buildings, enhancing the robustness of the recognition process. However, despite its effectiveness, this technique can occasionally lead to errors in pattern recognition. Such errors often arise from the dynamic variations in how buildings are represented at different scales. As identified by Chen, Zhang, and Lin [20], the variability in map objects across multiple scales is influenced primarily by two factors: map generalization, which involves the simplification or abstraction of map information at various scales, and the genuine changes in geographical features, reflecting actual alterations that occur in buildings and other geographic entities over time or due to environmental factors.
Studies involving multi-scale map data reveal that map generalization strategies at various scales significantly influence how geographic entities are represented. At smaller scales, details may be simplified or omitted, leading to a less detailed depiction, whereas larger scales tend to retain and display greater complexity and details, as indicated by the blue arrows in Figure 15. Concurrently, the real-world conditions of building entities are subject to continual evolution due to dynamic environmental changes, such as the demolition or reconstruction of buildings as depicted by the orange arrows in Figure 15. This dynamic nature adds layers of complexity to the task of pattern recognition. Thus, when employing multi-scale data for building recognition, it is essential not only to manage the inconsistencies in information across different scales but also to account for the temporal dynamics of building entities. This sets a direction for future research on how to more effectively merge multi-scale data to capture the dynamic changes in buildings accurately. Future strategies should also explore utilizing advanced knowledge graph technologies to mitigate the impacts of recognition errors, thereby enhancing the reliability and accuracy of building pattern analysis.

6.2. Factors Influencing the Visual Recognition of Building Patterns

The shape, proportion, and spatial arrangement of building clusters are key visual factors in building pattern recognition. Additionally, contextual influences such as the surrounding environment and building density play significant roles in determining recognition accuracy. While much-existing research focuses on the impact of individual object parameters on visual recognition, there is less emphasis on quantitatively describing the morphologies and special structures of building clusters within complex urban settings. Some researchers have integrated principles of spatial cognition and psychology into spatial data mining, incorporating factors like position, orientation, and size. This integration has led to enhancements in the recognition of building distribution patterns by introducing the concept of “visual distance”, thus aligning pattern mining more closely with human visual perception [47]. Nevertheless, there remains a need for further studies on the effects of other Gestalt principles, such as closure and continuity, on spatial pattern recognition.
Figure 16 illustrates that the position and orientation of the central building significantly influence the recognition of I-shaped building patterns. Although the layout on the left side of Figure 16a adheres to the I-shaped arrangement rules identified by [14], it more closely resembles a C-shaped pattern visually. Conversely, a deviation in the central building’s orientation on the right side results in a Z-shaped pattern. Similarly, the E-shaped pattern in Figure 16b is altered due to positional shifts among three parallel buildings.
This study successfully implements a quantitative description of the I-shaped pattern by integrating multi-scale data and applying a center distance constraint to define the recognition rules for this pattern. The findings highlight the centrality of the middle building as a crucial factor influencing recognition accuracy. Quantitatively characterizing this centrality allows for a more thorough and precise capture of the buildings’ visual features. Nonetheless, navigating complex urban settings and responding to dynamic changes remains challenging. Future research should continue to evolve more comprehensive recognition models and methods to address these complexities effectively, thus enhancing the adaptability and accuracy of building pattern recognition in varied urban landscapes.

6.3. Recognition of Other Structural Patterns

Robustness and scalability are critical considerations for rule-based building pattern recognition approaches, particularly when applied to complex urban environments and multi-scale datasets. The proposed method is designed to address these challenges through a combination of explicit relational modeling, multi-constraint reasoning, and cross-scale consistency analysis.
Regarding robustness, rule-based systems are often considered sensitive to ambiguous configurations or local noise when individual rules are applied in isolation. In the proposed framework, however, pattern recognition does not rely on a single geometric or relational constraint. Instead, building patterns are identified only when multiple structural conditions—such as shape characteristics, relative positional relationships, neighborhood configurations, and cross-scale matching relations—are simultaneously satisfied. This multi-constraint reasoning strategy effectively reduces the influence of local irregularities or noisy building geometries. Moreover, cross-scale enhancement recognition contributes to robustness by propagating building patterns identified at one scale to other related scales through cross-scale matching relationships. This mechanism enables the detection of potential building patterns that may be ambiguous or difficult to identify when relying on single-scale analysis alone, thereby improving the recall of pattern recognition in complex urban environments.
To further examine robustness under more challenging conditions, additional experiments were conducted in a newly selected area of the Lanzhou dataset characterized by higher building density, greater structural heterogeneity, and more irregular spatial arrangements. In this complex region, the proposed framework was applied at two map scales (1:5000 and 1:10,000) to recognize multiple building patterns, including C-shaped, and F-shaped configurations. The results indicate that the framework maintains stable performance even in densely built and structurally diverse environments, as visually demonstrated in Figure 17, suggesting that the combined use of relational constraints and cross-scale reasoning provides a degree of robustness against ambiguity and noise commonly encountered in real-world urban data.
In terms of scalability and generality, the proposed framework separates generic relational modeling from pattern-specific semantic constraints. Core components such as multi-scale representation, inter-building relational modeling, knowledge graph construction, and rule-based reasoning are pattern-agnostic and remain unchanged when extending the method to new building patterns. While recognizing additional patterns requires the definition of pattern-specific semantic rules, this process primarily involves recombining existing relational primitives rather than redesigning the overall framework. The successful recognition of C-shaped and F-shaped building patterns in the additional experiments demonstrates that the proposed method provides a scalable foundation for building pattern recognition beyond the I-shaped case.
Nevertheless, it should be acknowledged that the effectiveness of rule-based recognition depends on the completeness and correctness of the encoded pattern semantics. Extremely irregular building morphologies or patterns lacking clear geometric organization may remain challenging. Future work may explore the integration of data-driven or probabilistic components to complement the current rule-based framework. However, within the scope of this study, the proposed method offers a robust, interpretable, and scalable solution for multi-scale building pattern recognition in complex urban environments.

7. Conclusions

Building pattern recognition holds significant value in urban planning and map synthesis, yet traditional approaches often struggle to capture latent and structurally meaningful building patterns. This paper introduces a recognition method that leverages multi-scale data and knowledge graphs, with I-shaped building patterns serving as a representative case. This approach abstracts the spatial relationships between buildings and their cross-scale connections into nodes and edges within the knowledge graph, further enhancing accuracy through a center distance constraint to align more closely with human visual principles. Utilizing rule-based reasoning within the knowledge graph, the method effectively recognizes both I-shaped Building patterns and potential features, showcasing its efficacy across varied datasets and regions. The experimental results demonstrate that integrating knowledge graphs with multi-scale information provides a viable and interpretable solution for building pattern recognition guided by human visual cognition. However, the variability in building features due to scale adjustments poses challenges, which may occasionally lead to misrecognition under a purely rule-based framework. Future work will further investigate the interaction between scale effects and pattern characteristics, extend the framework to additional building pattern types, and explore the integration of data-driven learning with knowledge-graph-based reasoning.

Author Contributions

Conceptualization, Shenglu Xu, Tao Liu and Ping Du; methodology, Shenglu Xu and Bo Qiang; software, Shenglu Xu; validation, Shenglu Xu; formal analysis, Tao Liu, Pengpeng Li and Ping Du; investigation, Shenglu Xu and Ping Du; resources, Wenning Wang; data curation, Shenglu Xu; writing—original draft preparation, Shenglu Xu; writing—review and editing, Tao Liu, Ping Du and Pengpeng Li; visualization, Shenglu Xu; supervision, Bo Qiang and Shuangtong Liu; project administration, Bo Qiang; funding acquisition, Tao Liu, Pengpeng Li and Ping Du. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of Gansu Province [grant number 25JRRA152], the National Natural Science Foundation of China [grant number 42261076; 42461059; 42401564], the Postdoctoral Fellowship Program of CPSF [grant number GZC20231020], the China Postdoctoral Science Foundation [grant number 2023M741496], the Gansu Youth Science and Technology Fund Project [grant number 24JRRA275, 25JRRA376], the Excellent Doctoral Program of Gansu Province [grant number 25JRRA288].

Data Availability Statement

The data and software that support the findings of this study are publicly available at https://doi.org/10.6084/m9.figshare.28429820.v1.

Acknowledgments

The authors would like to thank the editors and the anonymous reviewers for their helpful and constructive comments that greatly contributed to improving the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
KGKnowledge Graph
CNNConvolutional Neural Networks
RNGRelative Neighborhood Graph
CDTConstrained Delaunay Triangulation
MSTMinimum Spanning Tree
SBRSmallest Bounding Rectangle
ROCReceiver Operating Characteristic
AUCArea Under the Curve

References

  1. Yang, W. Identify building patterns. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2008, 37, 391–398. [Google Scholar]
  2. Renard, J.; Duchêne, C. Urban structure generalization in multi-agent process by use of reactional agents. Trans. GIS 2014, 18, 201–218. [Google Scholar] [CrossRef]
  3. Cetinkaya, S.; Basaraner, M.; Burghardt, D. Proximity-based grouping of buildings in urban blocks: A comparison of four algorithms. Geocarto Int. 2015, 30, 618–632. [Google Scholar] [CrossRef]
  4. Carmona, M. Public Places Urban Spaces: The Dimensions of Urban Design; Routledge: London, UK, 2021. [Google Scholar]
  5. Zhang, L.; Deng, H.; Chen, D.; Wang, Z. A spatial cognition-based urban building clustering approach and its applications. Int. J. Geogr. Inf. Sci. 2013, 27, 721–740. [Google Scholar] [CrossRef]
  6. Ruas, A.; Holzapfel, F. Automatic characterisation of building alignments by means of expert knowledge. In Proceedings of the 21th ICC Conference, Durban, South Africa, 10–16 August 2003; pp. 1604–1616. [Google Scholar]
  7. Li, Z.; Yan, H.; Ai, T.; Chen, J. Automated building generalization based on urban morphology and Gestalt theory. Int. J. Geogr. Inf. Sci. 2004, 18, 513–534. [Google Scholar] [CrossRef]
  8. Gong, X. Research on Settlement Generalization Methods Considering Spatial Pattern and Road Networks. Doctoral Dissertation, PLA Strategic Support Force Information Engineering University, Zhengzhou, China, 2017. [Google Scholar]
  9. Arkin, E.M.; Chew, L.P.; Huttenlocher, D.P.; Kedem, K.; Mitchell, J.S. An Efficiently Computable Metric for Comparing Polygonal Shapes; IEEE: New York, NY, USA, 1991. [Google Scholar]
  10. Ai, T.; Cheng, X.; Liu, P.; Yang, M. A shape analysis and template matching of building features by the Fourier transform method. Comput. Environ. Urban Syst. 2013, 41, 219–233. [Google Scholar] [CrossRef]
  11. Yan, X.; Ai, T.; Yang, M.; Yin, H. A graph convolutional neural network for classification of building patterns using spatial vector data. ISPRS J. Photogramm. Remote Sens. 2019, 150, 259–273. [Google Scholar] [CrossRef]
  12. Yan, X.; Ai, T.; Yang, M.; Tong, X. Graph convolutional autoencoder model for the shape coding and cognition of buildings in maps. Int. J. Geogr. Inf. Sci. 2021, 35, 490–512. [Google Scholar] [CrossRef]
  13. Xing, R.; Wu, F.; Gong, X.; Du, J.; Liu, C. The template matching approach to combined collinear pattern recognition in building groups. Acta Geod. Cartogr. Sin. 2021, 50, 800. [Google Scholar]
  14. Du, S.; Shu, M.; Feng, C.C. Representation and discovery of building patterns: A three-level relational approach. Int. J. Geogr. Inf. Sci. 2016, 30, 1161–1186. [Google Scholar] [CrossRef]
  15. Zhao, R.; Ai, T.; Yu, W.; He, Y.; Shen, Y. Recognition of building group patterns using graph convolutional network. Cartogr. Geogr. Inf. Sci. 2020, 47, 400–417. [Google Scholar] [CrossRef]
  16. Yang, M.; Ai, T.; Yan, X.; Yang, W.; Zhou, X.; Zhou, Q. Change detection and updating by using map overlay for buildings on multi-scale maps. Acta Geod. Cartogr. Sin. 2016, 45, 466. [Google Scholar]
  17. Wei, Z.; Xu, W.; Xiao, Y.; Shu, M.; Cheng, L.; Wang, Y.; Liu, C. Enhancing building pattern recognition through multi-scale data and knowledge graph: A case study of C-shaped patterns. Int. J. Digit. Earth 2023, 16, 3860–3881. [Google Scholar] [CrossRef]
  18. Zhang, X.; Yin, W.; Yang, M.; Ai, T.; Stoter, J. Updating authoritative spatial data from timely sources: A multiple representation approach. Int. J. Appl. Earth Obs. Geoinf. 2018, 72, 42–56. [Google Scholar] [CrossRef]
  19. Memduhoglu, A.; Basaraner, M. An approach for multi-scale urban building data integration and enrichment through geometric matching and semantic web. Cartogr. Geogr. Inf. Sci. 2022, 49, 1–17. [Google Scholar] [CrossRef]
  20. Chen, L.; Zhang, X.; Lin, H.; Yang, M. Change analysis and decision tree based detection model for residential objects across multiple scales. Acta Geod. Cartogr. Sin. 2018, 47, 403. [Google Scholar]
  21. Lu, F.; Yu, L.; Qiu, P. On Geographic Knowledge Graph. J. Geo-Inf. Sci. 2017, 19, 723–734. [Google Scholar]
  22. Jiang, B.; Wan, G.; Xu, J.; Li, F.; Wen, H. Geographic knowledge graph building extracted from multi-sourced heterogeneous data. Acta Geod. Cartogr. Sin. 2018, 47, 1051. [Google Scholar]
  23. Duchêne, C.; Bard, S.; Barillot, X.; Ruas, A.; Trevisan, J.; Holzapfel, F. Quantitative and qualitative description of building orientation. In Proceedings of the Fifth Workshop on Progress in Automated Map Generalization, Paris, France, 28–30 April 2003. [Google Scholar]
  24. Li, W.; Goodchild, M.F.; Church, R. An efficient measure of compactness for two-dimensional shapes and its application in regionalization problems. Int. J. Geogr. Inf. Sci. 2013, 27, 1227–1250. [Google Scholar] [CrossRef]
  25. Basaraner, M.; Cetinkaya, S. Performance of shape indices and classification schemes for characterising perceptual shape complexity of building footprints in GIS. Int. J. Geogr. Inf. Sci. 2017, 31, 1952–1977. [Google Scholar] [CrossRef]
  26. Burghardt, D.; Steiniger, S. Usage of principal component analysis in the process of automated generalisation. In Proceedings of the 22nd International Cartographic Conference, A Coruña, Spain, 9–16 July 2005; pp. 9–16. [Google Scholar]
  27. Wei, Z.; Guo, Q.; Wang, L.; Yan, F. On the spatial distribution of buildings for map generalization. Cartogr. Geogr. Inf. Sci. 2018, 45, 539–555. [Google Scholar] [CrossRef]
  28. Belongie, S.; Malik, J.; Puzicha, J. Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 509–522. [Google Scholar] [CrossRef]
  29. Wei, Z.; Ding, S.; Cheng, L.; Xu, W.; Wang, Y.; Zhang, L. Linear building pattern recognition in topographical maps combining convex polygon decomposition. Geocarto Int. 2022, 37, 11365–11389. [Google Scholar] [CrossRef]
  30. Mai, G.; Jiang, C.; Sun, W.; Zhu, R.; Xuan, Y.; Cai, L.; Janowicz, K.; Ermon, S.; Lao, N. Towards general-purpose representation learning of polygonal geometries. GeoInformatica 2023, 27, 289–340. [Google Scholar] [CrossRef]
  31. Oostwegel, L.J.; Schorlemmer, D.; Guéguen, P. From Footprints to Functions: A Comprehensive Global and Semantic Building Footprint Dataset. Sci. Data 2025, 12, 1699. [Google Scholar] [CrossRef]
  32. Le, Q.H.; Shin, H.; Kwon, N.; Ho, J.; Ahn, Y. Deep learning based urban building coverage ratio estimation focusing on rapid urbanization areas. Appl. Sci. 2022, 12, 11428. [Google Scholar] [CrossRef]
  33. Kang, J.; Fernandez-Beltran, R.; Sun, X.; Ni, J.; Plaza, A. Deep learning-based building footprint extraction with missing annotations. IEEE Geosci. Remote Sens. Lett. 2021, 19, 3002805. [Google Scholar] [CrossRef]
  34. Gong, X.; Wu, F. A graph match approach to typical letter-like pattern recognition in urban building groups. Geomat. Inf. Sci. Wuhan Univ. 2018, 43, 159–166. [Google Scholar]
  35. He, X.; Zhang, X.; Xin, Q. Recognition of building group patterns in topographic maps based on graph partitioning and random forest. ISPRS J. Photogramm. Remote Sens. 2018, 136, 26–40. [Google Scholar] [CrossRef]
  36. Riedl, A.; Kainz, W.; Elmes, G.A. (Eds.) Progress in Spatial Data Handling: 12th International Symposium on Spatial Data Handling; Springer Science & Business Media: Berlin, Germany, 2006. [Google Scholar]
  37. Zhang, X.; Ai, T.; Stoter, J.; Kraak, M.J.; Molenaar, M. Building pattern recognition in topographic data: Examples on collinear and curvilinear alignments. Geoinformatica 2013, 17, 1–33. [Google Scholar] [CrossRef]
  38. Gong, X.; Wu, F.; Qian, H.; Ma, K. The parameter discrimination approach to multi-connected linear pattern recognition in building groups. Geomat. Inf. Sci. Wuhan Univ. 2014, 39, 335–339. [Google Scholar]
  39. Pilehforooshha, P.; Karimi, M. An integrated framework for linear pattern extraction in the building group generalization process. Geocarto Int. 2019, 34, 1000–1021. [Google Scholar] [CrossRef]
  40. Wang, X.; Burghardt, D. Using stroke and mesh to recognize building group patterns. Int. J. Cartogr. 2020, 6, 71–98. [Google Scholar] [CrossRef]
  41. Jiang, Y.; Zhang, W.; Wang, P.; Zhang, X.; Mei, H. Knowledge graph construction method via internet-based collective intelligence. J. Softw. 2021, 33, 2646–2666. [Google Scholar]
  42. Gao, X.; Yan, H.; Chen, Z.; Yin, P. Knowledge-Guided Automated Cartographic Generalization Process Construction: A Case Study Based on Map Analysis of Public Maps of China. Trans. GIS 2024, 28, 2492–2508. [Google Scholar] [CrossRef]
  43. Wei, Z.; Xiao, Y.; Tong, Y.; Xu, W.; Wang, Y. Linear building pattern recognition via spatial knowledge graph. Acta Geod. Cartogr. Sin. 2023, 52, 1355–1363. [Google Scholar] [CrossRef]
  44. Tang, Z.; Ai, T.; Xu, H. Reasoning of spatial distribution pattern of building cluster based on geographic knowledge graph. J. Geo-Inf. Sci 2023, 25, 1202–1214. [Google Scholar]
  45. Ali, U.; Shamsi, M.H.; Bohacek, M.; Purcell, K.; Hoare, C.; Mangina, E.; O’Donnell, J. A data-driven approach for multi-scale GIS-based building energy modeling for analysis, planning and support decision making. Appl. Energy 2020, 279, 115834. [Google Scholar] [CrossRef]
  46. Wang, H.; Qi, G.; Chen, H. Knowledge Graph: Method, Practice and Application; Publishing House of Electronics Industry: Beijing, China, 2019. [Google Scholar]
  47. Ai, T.; Guo, R. Polygon cluster pattern mining based on Gestalt principles. Acta Geod. Cartogr. Sin. 2007, 36, 302–308. [Google Scholar]
Figure 1. Multi-scale spatial data. The same set of I-shaped buildings is displayed in four increasing proportions. (a) 1:20,000 scale; (b) 1:10,000 scale; (c) 1:5000 scale; (d) 1:2000 scale.
Figure 1. Multi-scale spatial data. The same set of I-shaped buildings is displayed in four increasing proportions. (a) 1:20,000 scale; (b) 1:10,000 scale; (c) 1:5000 scale; (d) 1:2000 scale.
Ijgi 15 00023 g001
Figure 2. Neighborhood relationship of buildings. (a) Constrained Delaunay triangulation skeleton; (b) Nodes and edges of proximity graph.
Figure 2. Neighborhood relationship of buildings. (a) Constrained Delaunay triangulation skeleton; (b) Nodes and edges of proximity graph.
Ijgi 15 00023 g002
Figure 3. Eight directional neighborhoods of reference buildings. The letters WE, NN… ES in the figure represent eight different directional neighborhoods.
Figure 3. Eight directional neighborhoods of reference buildings. The letters WE, NN… ES in the figure represent eight different directional neighborhoods.
Ijgi 15 00023 g003
Figure 4. The schematic diagram of 169 relative positional relationships between two buildings.
Figure 4. The schematic diagram of 169 relative positional relationships between two buildings.
Ijgi 15 00023 g004
Figure 5. Definition of cross-scale matching relationships in buildings. Different colors represent building entities at different scales.
Figure 5. Definition of cross-scale matching relationships in buildings. Different colors represent building entities at different scales.
Ijgi 15 00023 g005
Figure 6. The recognition process of I-shaped building patterns. (1) Verify basic spatial relationships; (2) Derive deeper relationships; (3) Confirm Cen_D; (4) Recognization.
Figure 6. The recognition process of I-shaped building patterns. (1) Verify basic spatial relationships; (2) Derive deeper relationships; (3) Confirm Cen_D; (4) Recognization.
Ijgi 15 00023 g006
Figure 7. The process for building pattern enrichments across considering the multi-scale data. (a) Cross-scale matching relationships between buildings at different scales; (b) Large-to-small scale enhancement; (c) Small-to-large scale enhancement.
Figure 7. The process for building pattern enrichments across considering the multi-scale data. (a) Cross-scale matching relationships between buildings at different scales; (b) Large-to-small scale enhancement; (c) Small-to-large scale enhancement.
Ijgi 15 00023 g007
Figure 8. Experimental dataset. (a) Lanzhou Small Scale Data (LZSL); (b) Lanzhou Large Scale Data (LZSS); (e) London Large Scale Data (LDSL); (f) London Small Scale Data (LDSS); (c,d) indicate the locations of the study areas within the full extents of Lanzhou and London, providing city-level spatial context.
Figure 8. Experimental dataset. (a) Lanzhou Small Scale Data (LZSL); (b) Lanzhou Large Scale Data (LZSS); (e) London Large Scale Data (LDSL); (f) London Small Scale Data (LDSS); (c,d) indicate the locations of the study areas within the full extents of Lanzhou and London, providing city-level spatial context.
Ijgi 15 00023 g008
Figure 9. Analysis of Building Features in the Dataset. (a) Building Edge Count; (b) Building Rectangle Count.
Figure 9. Analysis of Building Features in the Dataset. (a) Building Edge Count; (b) Building Rectangle Count.
Ijgi 15 00023 g009
Figure 10. ROC-AUC plot. The optimal value for θ 4 should be between 0.19 and 0.26.
Figure 10. ROC-AUC plot. The optimal value for θ 4 should be between 0.19 and 0.26.
Ijgi 15 00023 g010
Figure 11. Comparison of recognition performance when θ 4 is 0.1, 0.2, and 0.5, respectively.
Figure 11. Comparison of recognition performance when θ 4 is 0.1, 0.2, and 0.5, respectively.
Ijgi 15 00023 g011
Figure 12. The constructed knowledge graph, illustrated using the London dataset as an example. For visual clarity, only the cross-scale matching relationships and intra-scale neighbor relationships are displayed, while the complete knowledge graph employed in the reasoning process includes all relationship types defined in Section 4.1. The inset in the upper-right corner illustrates the matching relationships between three building units at a larger scale and their corresponding buildings at a smaller scale.
Figure 12. The constructed knowledge graph, illustrated using the London dataset as an example. For visual clarity, only the cross-scale matching relationships and intra-scale neighbor relationships are displayed, while the complete knowledge graph employed in the reasoning process includes all relationship types defined in Section 4.1. The inset in the upper-right corner illustrates the matching relationships between three building units at a larger scale and their corresponding buildings at a smaller scale.
Ijgi 15 00023 g012
Figure 13. Recognition results of I-shaped building patterns in the dataset. (a) LZSS recognition result; (b) LZSL recognition result; (c) LDSL recognition result; (d) LDSS recognition result.
Figure 13. Recognition results of I-shaped building patterns in the dataset. (a) LZSS recognition result; (b) LZSL recognition result; (c) LDSL recognition result; (d) LDSS recognition result.
Ijgi 15 00023 g013
Figure 14. The results of template matching method and artificial visual recognition. (a) LZSS recognition result; (b) LZSL recognition result; (c) LDSL recognition result; (d) LDSS recognition result.
Figure 14. The results of template matching method and artificial visual recognition. (a) LZSS recognition result; (b) LZSL recognition result; (c) LDSL recognition result; (d) LDSS recognition result.
Ijgi 15 00023 g014
Figure 15. Causes contributing to misrecognition of building patterns. The blue arrow in the figure represents the impact of the map synthesis strategy, while the orange arrow represents the actual change in buildings caused by inconsistent sampling times.
Figure 15. Causes contributing to misrecognition of building patterns. The blue arrow in the figure represents the impact of the map synthesis strategy, while the orange arrow represents the actual change in buildings caused by inconsistent sampling times.
Ijgi 15 00023 g015
Figure 16. Quantitative description of complex shape patterns. (a) The position and direction of the center building in the I-shaped pattern; (b) The positions of three parallel buildings in the E-shaped pattern.
Figure 16. Quantitative description of complex shape patterns. (a) The position and direction of the center building in the I-shaped pattern; (b) The positions of three parallel buildings in the E-shaped pattern.
Ijgi 15 00023 g016
Figure 17. Visualization of multi-scale recognition results for multiple building patterns in a densely built area of Lanzhou.
Figure 17. Visualization of multi-scale recognition results for multiple building patterns in a densely built area of Lanzhou.
Ijgi 15 00023 g017
Table 1. Summary of representative studies on building shape and pattern recognition.
Table 1. Summary of representative studies on building shape and pattern recognition.
CategoryRepresentative StudiesMain Focus/MethodStrengthsLimitations
Building Shape RecognitionGeometric Descriptor–BasedPrincipal Component Analysis [23,24,25,26,27]; Turning function [9]; Fourier descriptors [10]; Shape context [28]; Rectangular encoding [29]Characterize individual building geometry using geometric descriptors and mathematical representationsEffective for identifying basic building shapes; suitable for single-building analysisLimited to individual buildings; cannot capture group-level patterns
Deep learning algorithmsGraph convolutional autoencoders [11]; CNN-based polygon encoders [30]; building coverage estimation [32]; Footprint [31,33]Learn building representations from large-scale data using CNNs or deep encodersHigh accuracy for footprint-level or area-level analysis; strong representation learning capabilityFocus on individual buildings or aggregated indicators; lacks explicit modeling of inter-building spatial relationships
Building Arrangement RecognitionTemplate-Based RecognitionYang [1]; Gong and Wu [34]; Xing et al. [13]Detect specific building arrangements using predefined geometric templatesIntuitive and effective for well-defined patternsRigid templates; poor adaptability to pattern variations and scale changes
Machine Learning-Based RecognitionRandom Forests [11]; Graph Convolutional Neural Networks [15,35]Automatically learn building arrangement patterns from dataReduced manual rule design; flexible pattern learningDepend on labeled data; limited interpretability; weak explicit rule modeling
Structural Rule-Based MethodsProximity Graphs–Based [14,36,37,38,40]Recognize building patterns using spatial relations and perceptual rulesInterpretable; aligned with human spatial cognitionExtensive search operations; limited support for multi-scale patterns
Knowledge Graph–Based [43,44]Model buildings and spatial relations as graph entities and perform rule-based reasoningExplicit relational modeling; high interpretability; efficient rule executionMostly limited to single-scale scenarios and less explored for complex multi-scale pattern structures
Table 2. This is a table representing entities and relationships in a knowledge graph.
Table 2. This is a table representing entities and relationships in a knowledge graph.
TypeElementContent
EntityBuilding BiRepresents a building entity
LabelBuildingBigRepresents data for large-scale buildings
BuildingSmallRepresents data for Small-scale buildings
AttributebIDUnique identifier of the building entity
AreaRepresents the area of the entity
OrientationRepresents the main direction of the entity
PatternIndicates whether the entity forms an I-shaped pattern, boolean value
RelationR(Bi,Bj)Represents the relationship existing between entities Bi and Bj
Relation TypeNeighborIndicates a proximity relationship between entities
PositionRepresents a relative positional relationship between entities, with two attributes P_Type and Fd, representing the type of relative position r(i,j) and the overlap ratio of the long axes of the two entities, respectively
MatchIndicates that two entities correspond to buildings at different scales, with an attribute M_type representing the type of cross-scale matching relationship
disRepresents the centroid distance between entities
Table 3. Recognition effect statistics.
Table 3. Recognition effect statistics.
DataMethodsTrFrMrPrecise = Tr/(Tr + Fr)Recall = Tr/(Tr + Mr)
LZSR + AR38111277.55%76.00%
SR + AR + CR38012100%76.00%
SR + AR + ER5013079.37%100%
SR + AR + CR + ER502096.15%100%
TM30020100%60%
LDSR + AR823610069.49%45.05%
SR + AR + CR82310096.47%45.05%
SR + AR + ER17844480.18%97.80%
SR + AR + CR + ER1788495.70%97.80%
TM75110798.68%41.21%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, S.; Liu, T.; Du, P.; Li, P.; Wang, W.; Liu, S.; Qiang, B. A Method for Recognizing I-Shaped Building Patterns Utilizing Multi-Scale Data and Knowledge Graph. ISPRS Int. J. Geo-Inf. 2026, 15, 23. https://doi.org/10.3390/ijgi15010023

AMA Style

Xu S, Liu T, Du P, Li P, Wang W, Liu S, Qiang B. A Method for Recognizing I-Shaped Building Patterns Utilizing Multi-Scale Data and Knowledge Graph. ISPRS International Journal of Geo-Information. 2026; 15(1):23. https://doi.org/10.3390/ijgi15010023

Chicago/Turabian Style

Xu, Shenglu, Tao Liu, Ping Du, Pengpeng Li, Wenning Wang, Shuangtong Liu, and Bo Qiang. 2026. "A Method for Recognizing I-Shaped Building Patterns Utilizing Multi-Scale Data and Knowledge Graph" ISPRS International Journal of Geo-Information 15, no. 1: 23. https://doi.org/10.3390/ijgi15010023

APA Style

Xu, S., Liu, T., Du, P., Li, P., Wang, W., Liu, S., & Qiang, B. (2026). A Method for Recognizing I-Shaped Building Patterns Utilizing Multi-Scale Data and Knowledge Graph. ISPRS International Journal of Geo-Information, 15(1), 23. https://doi.org/10.3390/ijgi15010023

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop