Next Article in Journal
A Spatial Statistics Methodology for Inspector Allocation Against Fare Evasion
Previous Article in Journal
An Operational Ethical Framework for GeoAI: A PRISMA-Based Systematic Review of International Policy and Scholarly Literature
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

BD-GNN: Integrating Spatial and Administrative Boundaries in Property Valuation Using Graph Neural Networks

by
Jetana Somkamnueng
and
Kitsana Waiyamai
*,†
Department of Computer Engineering, Kasetsart University, Bangkok 10900, Thailand
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
ISPRS Int. J. Geo-Inf. 2026, 15(2), 52; https://doi.org/10.3390/ijgi15020052
Submission received: 11 November 2025 / Revised: 6 January 2026 / Accepted: 19 January 2026 / Published: 23 January 2026
(This article belongs to the Topic Geospatial AI: Systems, Model, Methods, and Applications)

Abstract

GNN approaches to property valuation typically rely on spatial proximity, assuming that nearby properties exhibit similar price patterns. In practice, this assumption often fails as neighborhood and administrative boundaries create sharp price discontinuities, a form of spatial heterophily. This study proposes a Boundary-Aware Dual-Path Graph Neural Network (BD-GNN), a heterophily-oriented GNN specifically designed for continuous regression tasks. The model uses a dual and adaptive message passing design, separating inter- and intra-boundary pathways and combining them through a learnable gating parameter α . This allows it to capture boundary effects while preserving spatial continuity. Experiments conducted on three structurally contrasting housing datasets, namely Bangkok, King County (USA), and Singapore, demonstrate consistent performance improvements over strong baselines. The proposed BD-GNN reduces MAPE by 7.9%, 4.4%, and 4.5% and increases R 2 by 3.2%, 0.7%, and 5.0% for the respective datasets. Beyond predictive performance, α provides a clear picture of how spatial and administrative factors interact across urban scales. GNN Explainer provides local interpretability by showing which neighbors and features shape each prediction. BD-GNN bridges predictive accuracy and structural insight, offering a practical, interpretable framework for applications such as property valuation, taxation, mortgage risk assessment, and urban planning.

1. Introduction

Property valuation represents a fundamental component of economic systems worldwide, directly influencing monetary policy decisions, urban development strategies, and household wealth dynamics. Housing markets play a major role in shaping economic activity. Rising property values can boost household wealth and encourage consumer spending, while sharp fluctuations in prices may worsen affordability problems and widen social inequality. For governments, financial institutions, and individual stakeholders, accurate house price prediction serves as a critical tool for informing investment decisions, taxation frameworks, and urban planning initiatives.
Accurate property valuation is a central challenge in urban analytics, real-estate economics, and spatial decision support. Estimating land and housing prices requires understanding how geographic proximity, functional amenities, and boundary-induced segmentation jointly influence market dynamics. Traditional hedonic and spatial econometric models such as spatial autoregressive (SAR) or geographically weighted regression (GWR) have long provided interpretable spatial insights through distance-based dependence structures [1,2]. However, these methods rely on predefined spatial weight matrices, limiting their ability to capture complex spatial heterogeneity or sharp discontinuities caused by segmentation effects. As cities become more fragmented by zoning regulations, school district boundaries, and developer-defined communities, these rigid assumptions can no longer capture the complex spatial logic that shapes modern property markets.
In parallel, machine learning (ML) approaches including support-vector machines, tree-based ensembles, and deep neural networks have achieved higher predictive accuracy through data-driven learning [3,4,5]. However, these methods typically treat properties as independent observations and therefore overlook spatial interactions altogether.
Graph Neural Networks (GNNs) have recently emerged as powerful alternatives that learn spatial relationships directly from graph structure rather than from fixed kernels [6,7,8]. Their message-passing mechanisms allow flexible aggregation of neighborhood information and have advanced spatial modeling by enabling complex, data-driven relationships between nodes to be learned directly from observed interactions. Recent graph-based valuation models [9,10,11,12] and scalable extensions incorporating transformer layers [13] show that GNNs can match or even surpass traditional spatial models in predictive performance.
Despite their promise, classical GNNs assume homophily—that spatially or topologically connected nodes share similar attributes and outcomes. In housing markets, spatial heterogeneity and heterophily are common rather than exceptional. Boundary-induced segmentation, such as zoning regimes or neighborhood communities, can produce sharp price discontinuities between adjacent parcels. When the homophily assumption fails, message passing across such boundaries propagates misleading signals, thereby degrading predictive performance and obscuring the underlying spatial structure.
To overcome this limitation, a growing body of research on heterophily-aware GNNs has sought to relax the homophily constraint through enhanced message passing, multi-hop aggregation, adaptive filtering, and attention-based mechanisms [14,15,16,17,18,19,20,21,22]. Although these approaches improve learning in weak-homophily contexts, they tend to treat all heterophilous relationships uniformly, overlooking the distinction between structured boundary effects and random variation. Moreover, most existing heterophily-aware GNNs address node-classification problems, whereas property valuation is a continuous regression task that demands finer sensitivity to both spatial smoothness and boundary contrast.
The critical gap in current research lies in the treatment of boundary-induced segmentation. Existing graph-based valuation models [9,10,11,12] typically build graphs using only geographic distance or simple spatial adjacency. As a result, segmentation created by administrative or functional boundaries remains outside the learning process. This is problematic because decades of urban economics research have shown that such boundaries consistently produce clear and persistent price differences [23,24,25,26,27,28].
The key limitation is twofold: First, traditional spatial models and recent GNN approaches fail to explicitly differentiate between intra-boundary similarity (properties within the same administrative unit subject to similar regulations) and inter-boundary discontinuity (neighboring properties belonging to different administrative units with distinct market regimes). Second, while heterophily-aware GNNs address node heterogeneity, they treat all heterophilous relationships uniformly without distinguishing structured boundary effects from random variation. They cannot differentiate between systematic price discontinuities caused by administrative boundaries and random price variations within the same administrative context.
This leads to a fundamental question: How can spatial learning models incorporate boundary-induced segmentation directly into the learning process, while maintaining the flexibility of data-driven representation learning?
To address this gap, this study proposes the Boundary-Aware Dual-Path Graph Neural Network (BD-GNN). The key innovation is the use of two separate learning paths: one for capturing spatial proximity relationships among geographically nearby properties, and another for modeling administrative boundary effects. BD-GNN learns when to emphasize spatial similarity within administrative units and when to capture price discontinuities across boundaries, enabling more accurate and interpretable property valuation in heterogeneous urban markets.
To comprehensively evaluate the proposed framework, we conduct experiments on three datasets representing diverse developmental contexts and spatial configurations across Bangkok (Thailand), King County (USA), and Singapore. These settings differ markedly in urban morphology, data richness, and segmentation structures, providing a rigorous basis for assessing both predictive generalizability and interpretability robustness. The results show that BD-GNN not only improves predictive accuracy but also reveals interpretable spatial–administrative trade-offs consistent with established urban-economic theory.
The main contributions of this paper are threefold:
1.
Conceptual integration: We formalize boundary-induced segmentation as structured sources of geospatial heterophily and embed them directly into a learnable dual-path graph representation, explicitly distinguishing intra-boundary and inter-boundary relationships.
2.
Methodological advancement: We integrate an adaptive gating mechanism that seamlessly fuses spatial and boundary pathways within a unified dual-path framework, enabling the model to dynamically balance information flow between proximity-based and segmentation-based relationships during message passing.
3.
Empirical validation and interpretability: We evaluate the proposed BD-GNN across three cities—Bangkok, King County, and Singapore—demonstrating consistent performance gains and interpretable boundary effects aligned with urban-economic theory.
Together, these contributions advance boundary-aware spatial learning and offer a new direction for interpretable, policy-relevant property valuation in complex urban environments.
The remainder of this paper is organized as follows. Section 2 reviews related work in traditional approaches, ML, GNNs, heterophily-aware GNNs, and real estate-specific GNNs, including explainability. Section 3 describes the experimental setups, datasets, and the proposed architecture, including evaluation metrics. Section 4 presents results and analysis across different urban contexts. Section 5 discusses the findings and their practical implications. Section 6 concludes with future research directions.

2. Related Work

Research on property valuation has progressed through several methodological phases, from traditional econometric models to modern machine learning and graph-based deep learning approaches. Each stage provides a different perspective on how spatial, structural, and institutional factors influence property prices. The hedonic pricing model [29] established the theoretical foundation for linking property values to structural and locational attributes. Spatial econometric models such as the Spatial Autoregressive (SAR) Model and the Spatial Error Model (SEM) extended this framework by explicitly modeling spatial dependence among nearby properties [1,30,31]. Geographically Weighted Regression (GWR) [2] and its multiscale extension, MGWR [32], further enhanced this approach by allowing model parameters to vary locally across space. Although these models remain interpretable and widely used, they rely on predefined spatial weight matrices that impose rigid neighborhood structures. Such assumptions limit their ability to capture complex spatial heterogeneity or sharp price contrasts at neighborhood and administrative boundaries, motivating a shift toward more flexible, data-driven frameworks that can learn spatial relationships directly from data.
Machine learning (ML) approaches address this need for flexibility by prioritizing predictive performance over structural constraints. Ensemble methods such as Random Forests, Gradient Boosting, and XGBoost have demonstrated strong predictive power in property valuation and mass appraisal tasks [33,34], while deep learning techniques have enabled the integration of multiple data modalities, including imagery and tabular information [35,36]. However, most ML models treat each property as an independent observation, overlooking spatial dependence among nearby locations. Although these approaches capture complex nonlinear relationships among features, they fail to model neighborhood interactions that strongly influence price formation in real markets. This limitation has motivated growing interest in frameworks capable of learning spatial relationships directly from observed data rather than relying on fixed distance-based kernels.
Graph Neural Networks (GNNs) provide a flexible way to model spatial dependencies by learning directly from graph structures instead of fixed spatial kernels. Core architectures such as GCN [6], GraphSAGE [7], and GAT [8] employ message passing mechanisms that aggregate neighborhood information adaptively to learn spatial relationships. However, these models rely on the homophily assumption, which suggests that connected nodes tend to share similar features or outcomes. In housing markets, this assumption is often violated, as zoning policies, administrative boundaries, or developer-defined communities create sharp discontinuities between adjacent properties. When message passing occurs across such dissimilar regions, it propagates noisy signals that reduce predictive performance and obscure meaningful spatial patterns.
To address this issue, heterophily-aware GNNs have been proposed to improve representation learning under weak-homophily conditions. MixHop [14] and GPR-GNN [15] extend information aggregation through multi-hop or weighted propagation, while H2GCN [16] separates ego and neighbor embeddings to mitigate noisy neighbor effects. Geom-GCN [17] redefines neighborhoods based on geometric or structural similarity, and FAGCN [18] applies adaptive filtering to balance low- and high-frequency signals. PNA [19] enhances model expressiveness through multiple aggregation functions with degree scaling, whereas ACM [20] and LinkX [21] decouple structural and feature representations, improving generalization to non-homophilous and large-scale graphs. SHGNN [22] explicitly targets spatial heterophily in urban graphs by introducing a Spatial Diversity Score and heterophily-sensitive aggregation modules. Despite these advances, most models treat heterophilous relationships in a uniform way and do not distinguish between structured spatial heterogeneity, such as boundary-induced segmentation, and random local variation. A recent study indicates that the reported progress of heterophily-aware GNNs may be overstated, as many evaluations rely on biased benchmarks and inconsistent experimental setups [37]. This highlights the need for models that can explicitly represent structured spatial heterogeneity. Furthermore, they are primarily developed for node classification tasks rather than continuous regression, which requires finer sensitivity to spatial smoothness and boundary contrast.
Several studies have applied GNNs directly to property valuation, demonstrating their ability to integrate spatial and nonlinear relationships. Spatial Regression GCNs (SRGCNN and A–SRGCNN) [9,10], geo-spatial network embeddings (GSNEs) [11], lifelong valuation over heterogeneous information networks (LUCE) [12], and scalable architectures for large urban markets [13] show that GNNs can outperform traditional spatial econometric models. Nevertheless, these approaches often rely on homophily-based message passing and construct graphs solely from Euclidean proximity, without incorporating administrative or functional boundaries into graph design. Extensive evidence from urban economics shows that institutional boundaries, such as school districts, zoning areas, and developer-defined communities, lead to persistent and measurable price discontinuities [23,24,25,26,27,28]. When boundary effects are used only as additional features instead of being modeled as part of the graph structure, the model becomes less interpretable and its ability to represent local neighborhood patterns is reduced. Explicitly embedding boundary structures into spatial learning can better align model behavior with real-world market segmentation.
Interpretability has long been a core consideration in property valuation, given that valuation models directly inform taxation, investment, and urban planning decisions. Traditional econometric models provide transparent, coefficient-based explanations linking property characteristics to price outcomes [29]. Machine learning models, while offering higher predictive accuracy, rely on post hoc tools such as feature importance [3] and SHAP values [38,39] to interpret predictions. However, these methods largely focus on individual feature effects and do not explain how spatial dependencies shape property prices. Although GNNs are effective at capturing relational structures, they often function as black boxes because their learned spatial embeddings are difficult to interpret. Explainable AI (XAI) techniques such as GNNExplainer [40], PGExplainer [41], and SubgraphX [42] attempt to reveal the most influential nodes, edges, or subgraphs. Recent work by Karamanou et al. [43] applied explainable GNNs to housing markets, illustrating the potential of graph-based interpretation in real estate analysis. Yet, these explanation techniques remain largely data-driven and detached from economic reasoning. Incorporating domain-relevant structures, such as market or administrative boundaries, directly into GNN learning could enhance both predictive performance and interpretability by aligning spatial representations with economic and institutional realities.
In summary, studies across econometric, machine learning, and graph-based approaches reveal several unresolved challenges in modeling spatial dynamics of property markets. The main research gaps that emerge from this body of work are outlined as follows:
  • Econometric models emphasize interpretability through explicit coefficient estimates but impose rigid spatial assumptions.
  • Machine learning models improve predictive accuracy but ignore spatial dependence.
  • GNN-based models learn spatial relationships but often overlook structured heterophily caused by administrative or functional boundaries.
  • Heterophily-aware GNNs represent progress toward handling spatial complexity but still fail to differentiate systematic boundary effects from random noise.
  • Existing property valuation GNNs similarly omit boundary segmentation from their learning architecture.
This gap motivates the present study by embedding structured spatial heterogeneity directly into the graph representation. The proposed framework enhances both predictive performance and interpretability for property valuation in complex, heterophilic urban environments.

3. Materials and Methods

3.1. Study Area and Datasets

This study utilizes three urban housing datasets representing diverse developmental contexts and spatial configurations across Bangkok (Thailand), King County (USA), and Singapore, as shown in Figure 1. The Bangkok dataset is proprietary and subject to data-sharing restrictions, while the King County and Singapore datasets are derived from publicly available sources. Detailed information on data accessibility and reproducibility is provided in the Data Availability declaration statement.

3.1.1. Bangkok Dataset (Proprietary)

Bangkok, the capital city of Thailand, represents the spatial characteristics of horizontal residential developments in a rapidly urbanizing, middle-income economy. The dataset was provided courtesy of the Thai Government Housing Bank (G H BANK), which utilizes housing price information to support credit assessment for home loan applications. It contains 16,522 properties spanning the period 2018–2023 across 50 districts, aggregated at the village level covering 711 villages. Each village corresponds to a private residential development characterized by relatively uniform design and pricing structures, a pattern commonly observed in Southeast Asian urban expansion. Statistical summaries of key housing attributes are provided in Table 1. The spatial distribution of house locations and an example of village boundary units are shown in Figure 1a and Figure 2, respectively.

3.1.2. King County Dataset

King County, Washington, USA, provides an international benchmark for comparative housing analysis in developed economies. The King County dataset (https://www.kaggle.com/datasets/astronautelvis/kc-house-data (accessed on 15 September 2025)) serves as a widely used benchmark for housing price prediction and spatial analysis studies in the real estate domain. It contains residential property sales records covering approximately 21,613 transactions primarily from 2014–2015, representing horizontal residential properties utilizing zip code boundaries as administrative divisions for spatial aggregation. This mature market serves as the standard reference for global real estate research. The spatial distribution of housing transactions is illustrated in Figure 1b, highlighting the spatial extent and administrative boundary structure used in this study. Descriptive statistics of key housing attributes are provided in Table 2.

3.1.3. Singapore Dataset

Singapore represents vertical residential developments in a highly urbanized, developed city-state in Southeast Asia. Housing data were obtained from https://data.gov.sg (accessed on 15 September 2025), the Singapore government’s open data platform, which provides public access to housing transaction records via API. The dataset contains 177,171 resale Housing and Development Board (HDB) units spanning the period from 2017 to mid-2024, distributed across 8887 individual blocks that serve as administrative boundary units. This configuration enables the examination of vertical spatial relationships through floor-level connectivity within high-rise residential complexes. Descriptive statistics of key housing attributes are provided in Table 3, and the spatial distribution of housing blocks is illustrated in Figure 1c.
The three datasets were selected to represent different stages of urban development and contrasting urban forms, allowing us to test the proposed model in varied real-world settings. Bangkok represents a fast-growing city in a developing economy, where housing is organized into private residential villages with horizontal layouts and fragmented administrative structures. King County (USA) reflects a mature housing market in a developed economy, shaped by government-defined zip code regions that capture suburban sprawl and mixed housing types. Singapore illustrates a highly structured urban environment, with state-planned vertical housing blocks (HDB) that create clear and consistent administrative boundaries. Together, these three cases capture a broad range of spatial patterns (horizontal vs. vertical) and boundary systems (village-based, zip code, and state-defined), providing a solid foundation for evaluating the robustness of our approach across different urban and socioeconomic contexts.

3.1.4. Spatial Homophily Analysis

Homophily and heterophily are fundamental concepts in spatial network analysis. Homophily refers to the tendency of spatially adjacent properties to exhibit similar characteristics, such as neighboring houses with comparable market values or physical attributes. In contrast, heterophily arises when adjacent properties differ substantially, producing weak local similarity and often introducing noisy spatial signals. This distinction is particularly relevant for graph-based learning models, since most traditional GNN architectures implicitly assume homophilic relationships among connected nodes [16,21].
To quantify the degree of spatial homophily and heterophily in our datasets, we use a set of complementary metrics. Spatial relationships are evaluated through K-nearest neighbor (kNN) graphs at two representative neighborhood sizes ( K = 5 for local neighborhoods and K = 80 for extended spatial relationships), which are selected to reflect two distinct spatial regimes rather than tuned hyperparameters, as well as boundary-based graphs that reflect each dataset’s urban morphology: private developer villages in Bangkok, government zip code divisions in King County, and individual HDB blocks in Singapore.
Our analysis focuses on three established spatial metrics. First, the assortativity coefficient [44] measures the tendency of nodes with similar property values to be connected, with values ranging from −1 (perfect heterophily) to +1 (perfect homophily). Second, Moran’s I statistic [45], a classic measure of spatial autocorrelation, quantifies the extent to which similar values cluster spatially: positive values indicate clustering, while negative values suggest dispersion. Third, threshold analysis at 5% and 10% similarity levels captures the proportion of directly connected property pairs whose price differences fall below a given relative threshold. Higher values indicate stronger local similarity in property prices, whereas lower values signal greater variability within local neighborhoods.
Although both assortativity and Moran’s I describe spatial dependence, they emphasize different aspects of the underlying structure. Moran’s I focuses on how closely each property value aligns with the average of its neighbors, making it more sensitive to overall cluster cohesion. Assortativity, on the other hand, measures pairwise correlation between connected nodes, capturing finer variation within clusters. In areas where properties are densely connected within administrative boundaries but exhibit only modest differences in value, as seen in Bangkok’s village developments, Moran’s I tends to remain high, whereas assortativity is comparatively lower. This contrast shows how the two metrics complement one another: Moran’s I highlights broader spatial coherence, whereas assortativity captures localized variability.
Beyond metric-based analysis, examining the graph structure itself provides further insight. We distinguish between intra-boundary edges (connections within the same administrative unit) and inter-boundary edges (connections across different administrative units). This structural composition directly shapes spatial dependence patterns, which are reflected in homophily metrics (Table 4). In Bangkok, for example, the dominance of intra-boundary edges leads to a Moran’s I of 0.720 compared to an assortativity of 0.573, indicating strong boundary-level coherence despite moderate pairwise similarity. In King County, both Moran’s I (0.404) and assortativity (0.376) are lower, reflecting weaker boundary alignment as connectivity expands. Singapore shows both high assortativity (0.887) and Moran’s I (0.793), consistent with its highly structured HDB estate design.
As neighborhood size (K) increases, inter-boundary connections become more prevalent, especially in Bangkok and Singapore. This pattern reflects the gradual expansion of spatial reach beyond administrative boundaries and contributes to the observed decline in homophily metrics at larger K levels. For example, in Bangkok, assortativity drops from 0.727 at K = 5 to 0.438 at K = 80 . A complementary view of this underlying edge structure, including the proportion of intra- and inter-boundary connections, is presented in Table 5.
Taken together, these findings suggest that no single graph construction or spatial unit can universally guarantee high homophily. This highlights the need for GNN models that can adapt flexibly to both spatial and administrative heterogeneity, achieving robust performance across diverse urban settings while remaining efficient enough for real-world use.

3.2. Data Preprocessing

The preprocessing workflow was designed to ensure spatial consistency and data quality prior to model training. The steps were standardized across all datasets while accounting for dataset-specific issues.

3.2.1. Spatial Validation

Spatial validation prioritized coordinate accuracy to preserve meaningful graph relationships. In the Bangkok dataset, approximately 3300 residential units (about 20% of total records) were identified as being located anomalously far from their designated village clusters. These records were repositioned programmatically within the correct village boundaries using controlled random sampling to ensure spatial consistency with the underlying administrative structure. Nodes with unverifiable coordinates were excluded from further analysis.
For King County, around 220 units (approximately 1%) were found outside county or zip code boundaries. Given the small proportion, these records were excluded rather than repositioned. In contrast, the Singapore dataset required no adjustment, as all coordinates were already spatially consistent and matched their respective administrative units with high precision.

3.2.2. Feature Engineering

Feature engineering was deliberately minimal. The primary transformation involved applying a logarithmic scale to house prices to reduce skewness and approximate a Gaussian distribution, which is beneficial for regression. This normalization step ensures that extreme values do not dominate the learning process.

3.2.3. Missing Values and Outliers

Data quality was ensured through complete case analysis: records with missing values were removed using listwise deletion. Outliers were identified and removed using the Interquartile Range (IQR) method, thereby improving robustness of the regression models to anomalous values.

3.2.4. Train–Validation–Test Splits

All datasets were partitioned into 70% training, 15% validation, and 15% testing sets. Splits were performed randomly without temporal stratification, as the objective was comparative algorithm evaluation rather than forecasting. This consistent protocol ensured a fair and reproducible benchmark across all baseline and proposed methods.

3.3. Graph Construction

This subsection explains how neighborhood sizes are selected and why specific values are used to represent different spatial regimes. We construct k-nearest neighbor (kNN) graphs to represent spatial proximity among properties, with the neighborhood size K controlling the scale of connectivity. Figure 3 and Figure 4 illustrate examples of horizontal (2D) and vertical (3D) graph structures at K = 5 . In horizontal settings (e.g., Bangkok and King County), edges reflect geographic distance on a 2D plane, while in vertical settings (e.g., Singapore), floor information is incorporated into the distance metric to capture vertical spatial relations.
To systematically examine the effects of neighborhood size, we experiment with K = 5 ,   10 ,   20 ,   40 , and 80. As K increases, the resulting graphs exhibit a clear shift from highly localized structures to broader spatial connectivity. Specifically, smaller K values emphasize fine-scale local neighborhood relations, with a high proportion of intra-boundary edges. Increasing K progressively introduces more inter-boundary edges, reflecting spatial interactions that extend beyond immediate neighborhood clusters. This pattern is evident in Table 5, where the share of intra-boundary edges declines substantially as K grows, indicating a transition from localized to increasingly diffuse spatial structures.
We select K = 80 as the upper bound for our experiments based on both empirical and practical considerations. First, preliminary analyses (Table 4) indicate that homophily metrics such as assortativity and Moran’s I stabilize once this neighborhood size is reached. Beyond this point, increasing K adds little new spatial information but greatly enlarges the graph. For instance, in Singapore, the number of edges grows from 7 million at K = 40 to over 14 million at K = 80 with only limited gains in homophily. Second, this choice aligns with prior work on spatial graph learning, which suggests that overly large neighborhood sizes can dilute meaningful local structure and introduce noise due to excessive inter-boundary connections [16,46,47]. Thus, K = 80 provides a practical balance between capturing extended spatial context and maintaining computational scalability for large-scale urban property datasets.
Table 5 reports edge connection statistics across different neighborhood sizes, demonstrating the evolving balance between intra- and inter-boundary connectivity as K increases.

3.4. The Proposed Architecture

BD-GNN is designed to address structured spatial heterogeneity in property valuation by explicitly separating and adaptively integrating two spatial processes: geometric proximity and boundary-induced segmentation. Instead of relying on a single spatial graph, the model employs two complementary pathways, inter- and intra-boundary, and combines them through an adaptive gating mechanism that dynamically adjusts their influence at each node.
At a glance, the architecture in Figure 5 consists of five key components: (1) spatial aggregation (Figure 5a), which constructs both inter- and intra-boundary graphs; (2) inter-boundary pathway (Figure 5b), which captures geometric proximity effects across boundaries; (3) intra-boundary pathway (Figure 5c), which encodes segmentation within administrative or functional units; (4) adaptive gating mechanism (Figure 5d), which learns context-dependent weights between the two pathways; and (5) prediction layer (Figure 5e), which produces the final price estimate based on the fused representation. This structure allows the model to differentiate between spatial continuity and administrative segmentation, and to adapt its focus according to local heterophily patterns.

3.4.1. Motivation and Design Rationale

Property valuation is governed by two interacting spatial processes: geometric proximity, which captures similarities arising from locational closeness, and boundary-induced segmentation, which introduces price discontinuities across administrative or functional zones. Conventional GNNs, built on a single proximity-based graph, assume homophily and therefore propagate signals indiscriminately across boundaries. Heterophily-aware variants partially relax this assumption but still model segmentation only implicitly. To overcome these limitations, BD-GNN explicitly separates spatial interactions into intra-boundary and inter-boundary pathways, which are adaptively fused through a learnable gating mechanism (Figure 5). This architecture maintains spatial continuity while respecting segmentation boundaries, allowing the model to dynamically prioritize informative signals at each node. As a result, it achieves higher predictive accuracy and more transparent interpretability in heterogeneous spatial environments.

3.4.2. Dual-Pathway Feature Learning

The architecture integrates two parallel three-layer GCN pathways to capture distinct spatial relationship patterns. The choice of three layers balances model expressiveness with computational efficiency, allowing sufficient receptive field expansion while avoiding over-smoothing effects common in deeper GCN architectures. In practice, the number of layers can be adjusted (e.g., 1–3 layers) depending on the scale of the study area, spatial resolution, and graph connectivity characteristics
For the inter-boundary pathway, which captures geometric neighborhood effects:
H s ( 1 ) = ReLU ( W s ( 1 ) · AGG inter ( X , E inter ) + b s ( 1 ) )
H s ( 2 ) = ReLU ( W s ( 2 ) · AGG inter ( H s ( 1 ) , E inter ) + b s ( 2 ) )
H s ( 3 ) = ReLU ( W s ( 3 ) · AGG inter ( H s ( 2 ) , E inter ) + b s ( 3 ) )
Similarly, for the intra-boundary pathway, which models boundary-based spatial relationships:
H v ( 1 ) = ReLU ( W v ( 1 ) · AGG intra ( X , E intra ) + b v ( 1 ) )
H v ( 2 ) = ReLU ( W v ( 2 ) · AGG intra ( H v ( 1 ) , E intra ) + b v ( 2 ) )
H v ( 3 ) = ReLU ( W v ( 3 ) · AGG intra ( H v ( 2 ) , E intra ) + b v ( 3 ) )
where:
  • X R n × d represents the input node features for n properties with d attributes;
  • W s ( l ) R h × h and W v ( l ) R h × h are learnable weight matrices for layer l (with W s ( 1 ) , W v ( 1 ) R h × d for the first layer);
  • b s ( l ) , b v ( l ) R h are bias vectors for layer l;
  • H s ( l ) , H v ( l ) R n × h represent hidden node embeddings at layer l with hidden dimension h;
  • AGG ( · ) denotes the graph convolution aggregation function following the standard GCN formulation [6];
  • E inter N 2 × | E inter | and E intra N 2 × | E intra | represent edge index tensors for inter-boundary and intra-boundary graphs, respectively.
The two edge sets correspond to Figure 5b,c, which depict how geometric proximity and boundary-induced segmentation are modeled as separate graph structures.

3.4.3. Adaptive Gating Mechanism

The core innovation of BD-GNN lies in the adaptive gating mechanism (Figure 5d), which addresses the fundamental question: when should geometric proximity be prioritized over administrative boundaries, and vice versa? This mechanism employs cosine similarity as a coherence measure between the two pathway representations and is fully differentiable and trained end-to-end via backpropagation.
S cos ( i ) = h s , i ( 3 ) · h v , i ( 3 ) h s , i ( 3 ) 2 h v , i ( 3 ) 2
where h s , i ( 3 ) , h v , i ( 3 ) R h represent the final layer representations for node i in the spatial and administrative pathways, respectively, and  · 2 denotes the L2 norm. High cosine similarity ( S cos ( i ) 1 ) indicates consistent signals between pathways, while low similarity ( S cos ( i ) 0 ) reflects divergent spatial patterns across boundaries.
The gating network processes a comprehensive feature vector that includes original features, both pathway representations, and their coherence measure:
G input ( i ) = [ x i , h s , i ( 3 ) , h v , i ( 3 ) , S cos ( i ) ]
The gate weight is computed through a two-layer MLP with ReLU activation:
α ( i ) = σ ( W gate ( 2 ) · ReLU ( W gate ( 1 ) · G input ( i ) + b gate ( 1 ) ) + b gate ( 2 ) )
where:
  • x i R d is the original feature vector for node i;
  • [ · ] denotes vector concatenation operation;
  • G input ( i ) R d + 2 h + 1 is the concatenated input to the gating network;
  • W gate ( 1 ) R h × ( d + 2 h + 1 ) and W gate ( 2 ) R 1 × h are gating network weight matrices;
  • b gate ( 1 ) R h and b gate ( 2 ) R are gating network bias terms;
  • σ denotes the sigmoid activation function ensuring α ( i ) [ 0 , 1 ] .

3.4.4. Final Representation and Prediction

The final node representation combines both pathways through the learned adaptive weights:
h final ( i ) = α ( i ) h s , i ( 3 ) + ( 1 α ( i ) ) h v , i ( 3 )
This weighted combination enables smooth interpolation between spatially continuous and administratively segmented representations. When α ( i ) 1 , the model prioritizes inter-boundary patterns; when α ( i ) 0 , intra-boundary patterns dominate; intermediate values reflect a balanced contribution of both.
The final prediction is obtained through linear transformation:
y ^ ( i ) = w out T h final ( i ) + b out
where w out R h is the output weight vector and b out R is the output bias term.

3.4.5. Complete Model Formulation and Theoretical Properties

The complete BD-GNN can be expressed as a composition of functions:
f ( X , E inter , E intra ) = Linear ( G ( F s ( X , E inter ) , F v ( X , E intra ) ) )
where F s and F v represent the spatial and administrative pathway functions, and  G denotes the adaptive gating function.
This architecture possesses several desirable theoretical properties: (1) Permutation invariance: the model output is invariant to node ordering; (2) Spatial locality: the three-layer design ensures that information propagates within reasonable spatial neighborhoods; and (3) Adaptive expressiveness: the gating mechanism allows the model to automatically adjust its spatial bias based on local data characteristics.
The per-iteration computational complexity of BD-GNN is O h ( | E inter | + | E intra | ) + n d h + n h 2 , where h is the hidden dimension. The first term corresponds to sparse message passing over both inter-boundary and intra-boundary graphs, while the latter two terms arise from the node-wise linear projections in the GCN layers and the gating MLP. By ignoring constant factors from the number of layers and pathways, the model remains scalable for large, sparse geospatial graphs while retaining sufficient capacity to capture complex spatial dependencies that are crucial for accurate property valuation.

3.5. Experimental Setup

3.5.1. Hardware and Software Environment

All experiments were conducted on the Google Colab Pro platform.
For the GNN-based models, GPU acceleration was enabled using an NVIDIA A100 Tensor Core GPU with 80 GB memory (High-RAM environment). The implementation was developed in Python 3.12.11, with PyTorch 2.8.0 serving as the deep learning framework and PyTorch Geometric 2.7.0 used for graph neural network operations. All GNN computations were executed on the GPU unless otherwise specified.
In contrast, the SAR model was estimated using the standard CPU backend on Google Colab with a high-RAM configuration (51 GB). The SAR implementation was also executed in Python 3.12.11, but relied exclusively on CPU-based computation without GPU acceleration.

3.5.2. Model Configuration and Hyperparameters

All GNN architectures were configured with standardized parameter settings as detailed in Table 6. We deliberately chose not to optimize hyperparameters for individual models to ensure fair comparative evaluation across different architectures. This approach prioritizes empirical comparison over model-specific performance maximization, allowing us to assess the relative effectiveness of each architecture under equivalent conditions. The uniform hyperparameter configuration enables direct attribution of performance differences to architectural design choices rather than optimization advantages. This standardized setup provides a controlled environment for evaluating how different GNN architectures handle heterophily regression in real estate valuation tasks without the confounding effects of architecture-specific tuning. Although individual models may achieve higher accuracy when carefully optimized, our comparative framework provides more reliable insights into their fundamental capabilities for boundary-aware property valuation.

3.5.3. Baseline Models

We compare BD-GNN against two categories of baseline methods: traditional spatial econometric models and graph neural network (GNN) architectures.
Spatial Econometric Model:  We include the Spatial Autoregressive (SAR) model [1,48] as a representative and well-established baseline from spatial econometrics. SAR is widely used in applied property valuation to model global spatial dependence through a fixed spatial weight matrix, making it a natural reference point for positioning BD-GNN with respect to classical valuation approaches.
The SAR model is estimated using the maximum likelihood estimator (ML_Lag) implemented in the spreg Python library, which is a standard approach in spatial econometric analysis [48]. In general, spatial econometric models are primarily designed for explanatory analysis. When applied in a predictive setting, careful handling of spatial lag construction is required to avoid information leakage. In this study, SAR is implemented under a strict train–test split with kNN-based spatial weight matrices matched to the GNN baselines.
SAR is also sensitive to multicollinearity, as the joint estimation of regression coefficients ( β ) and the spatial autoregressive parameter ( ρ ) requires a nonsingular design matrix. To ensure stable estimation, we apply automated feature filtering using a pairwise correlation threshold of 0.8, resulting in a reduced feature set. This follows standard econometric practice and allows a stable evaluation of spatial dependence under the SAR framework.
GWR and its multiscale extension (MGWR) are not included in the quantitative benchmark because their spatial influence is determined endogenously through bandwidth selection rather than fixed neighborhood sizes. Their effective spatial support therefore differs fundamentally from the fixed-K graph construction used in this study, which would prevent a consistent and interpretable comparison.
Classical GNNs: GCN serves as the foundational spectral convolution baseline. GAT introduces attention mechanisms to learn adaptive neighbor weights, while GraphSAGE employs inductive, sampling-based aggregation and has demonstrated strong performance in real estate applications [34].
Heterophily-Aware GNNs: MixHop addresses heterophily through higher-order neighborhood mixing. H2GCN separates ego and multi-hop information to mitigate oversmoothing under heterophily. PNA combines multiple aggregators and scalers to capture diverse neighborhood structures, and FAGCN balances low- and high-frequency signals through learnable filters.
To benchmark against recent spatial heterophily-aware GNNs, we additionally include SHGNN, which model spatial heterogeneity through sector–ring graph decomposition and multi-view feature fusion. The original SHGNN implementation is provided in PaddlePaddle (https://github.com/PaddlePaddle/PaddleSpatial/tree/main/research/SHGNN, accessed on 15 September 2025). To ensure compatibility with our PyTorch-based experimental pipeline, we reimplemented SHGNN in PyTorch following the official architecture and training procedure as closely as possible. While minor implementation-level differences across frameworks are unavoidable, this reimplementation represents a faithful reproduction that enables a transparent and meaningful comparison.
All GNN baselines use kNN graphs without boundary information, isolating the architectural contribution of BD-GNN. The SAR model similarly relies on conventional spatial weight matrices without boundary modifications. We conduct 30 independent runs for each GNN with different random seeds and report mean performance with standard deviations to account for training stochasticity [21,49]. In contrast, SAR is deterministic given fixed weights and therefore requires a single estimation per configuration.
Transformer-based GNN variants (e.g., graph transformers) are not included in this study. Most existing graph transformers rely on global or near-global attention mechanisms with quadratic complexity in the number of nodes, which becomes computationally impractical for large spatial property datasets. Moreover, incorporating global attention would shift the focus away from the core objective of this work, which is to isolate the effect of boundary-aware graph construction within a message-passing framework. Extending BD-GNN to transformer-based architectures is a promising direction for future research but is beyond the scope of this study.

3.6. Evaluation Metrics

Given the actual house price y i and the predicted price y ^ i for each sample i, we evaluate the model’s performance using three error metrics: coefficient of determination ( R 2 ), Mean Absolute Percentage Error (MAPE), and General Error Rate (GER).

3.6.1. Coefficient of Determination ( R 2 )

The R 2 score quantifies how well the model captures variance in the target variable, and is defined as:
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2
where y ¯ is the mean of the ground truth values. A higher R 2 (closer to 1) indicates better prediction performance.

3.6.2. MAPE (Mean Absolute Percentage Error)

MAPE is a scale-independent metric that computes the average of absolute percentage errors:
MAPE = 100 n i = 1 n y i y ^ i y i
Lower MAPE values indicate more accurate predictions. However, MAPE can be sensitive to very small ground truth values ( y i close to zero).

3.6.3. GER (General Error Rate)

Following the definition in LUCE [12], we define the signed percentage error as:
GER i = y ^ i y i y i
We then use the absolute value to construct the cumulative distribution of errors:
| GER i | = y ^ i y i y i
Instead of averaging, we report the percentage of predictions where | GER i | is below predefined thresholds (e.g., 5 % ,   10 % ,   15 % ,   20 % ), thereby assessing how often the model produces relatively small errors. This complements MAPE by revealing the distributional shape of error magnitudes across all samples.

4. Results

Across all datasets, BD-GNN consistently achieved the best predictive performance, improving MAPE by up to 7.92% and increasing R 2 by up to 6.05% compared with the best baselines. The model demonstrates particularly strong gains in highly fragmented administrative settings, such as Singapore and Bangkok, while maintaining stable improvements in King County. These results confirm that explicitly modeling boundary effects not only enhances prediction accuracy but also improves robustness under high inter-boundary connectivity.
We note that SAR could not be estimated on the Singapore dataset due to prohibitive memory requirements associated with constructing large spatial weight matrices, even under a high-RAM environment. This reflects a known scalability limitation of classical spatial econometric models in large-scale urban settings.
Table 7 summarizes the MAPE results across all datasets and connectivity levels. BD-GNN achieves the lowest MAPE in every configuration: Bangkok (14.95% at K = 5 , 11.80% at K = 80 ), King County (17.89% at K = 5 , 12.64% at K = 80 ), and Singapore (14.32% at K = 5 , 11.59% at K = 80 ). Relative to the best baselines in case of K = 80 , the reductions in MAPE range from 4.42% to 7.92%, with the largest improvement observed in Bangkok (7.92%), while King County and Singapore exhibit comparable gains of 4.42% and 4.54% respectively. Pairwise t-tests at the 95% confidence level (p-value < 0.05) confirm that these improvements are statistically significant across all datasets and connectivity settings. These results indicate that BD-GNN effectively captures structured discontinuities in property values that conventional and heterophily-aware GNNs often fail to model.
Table 8 presents the R 2 results across all datasets and connectivity levels. BD-GNN consistently outperforms all baseline methods, with performance gains particularly evident at higher connectivity ( K = 80 ) where inter-boundary heterophily is strongest. For K = 80 , the model achieves R 2 = 0.8117 in Bangkok, 0.8958 in King County, and 0.6599 in Singapore. Relative to the best-performing baselines, these correspond to improvements of 3.26%, 0.67%, and 5.01% respectively. Pairwise t-tests at the 95% confidence level (p-value < 0.05) confirm significance across all dataset–connectivity combinations, underscoring the model’s ability to explain more variance in fragmented administrative environments.
For GER, the cumulative distributions in Figure 6 and the tabulated values in Table 9 paint a consistent picture. BD-GNN places a larger share of predictions within low error ranges across all datasets and thresholds. At K = 80 and a 10% tolerance, it reaches 51.70% in Bangkok, 53.78% in King County, and 53.61% in Singapore, corresponding to improvements of 3.31, 3.28, and 2.33 percentage points over the best baselines. The comparison between K = 5 and K = 80 shows that BD-GNN maintains stable performance even when neighborhood size increases and cross-boundary connections become more prevalent. This effect is especially evident in Bangkok and Singapore, where larger neighborhoods introduce substantial boundary-induced heterogeneity, yet the model continues to deliver robust predictive accuracy.
Across all three datasets and both neighborhood sizes, BD-GNN consistently achieves the lowest MAPE, outperforming all baseline methods in Bangkok, King County, and Singapore. This advantage holds under both localized neighborhoods ( K = 5 ) and extended spatial interactions ( K = 80 ), indicating robustness to neighborhood scale.
The strongest baseline differs by dataset and neighborhood size. In Bangkok, the most competitive baselines are SHGNN and MixHop, while in Singapore, PNA is strongest at K = 5 and SHGNN becomes strongest at K = 80 . In King County, SAR provides the best baseline performance at K = 5 , whereas at K = 80 the best baseline shifts to MixHop, suggesting that global linear spatial dependence is already informative at small neighborhoods, while multi-hop mixing becomes more effective as spatial connectivity expands.
Despite these strong baselines—including cases where a classical spatial model (SAR) is most competitive—BD-GNN remains best in every dataset–connectivity setting. These results indicate that explicitly distinguishing spatial proximity from boundary-related effects, and adaptively weighting their contributions, improves predictive performance relative to both classical spatial models and existing GNN approaches.

5. Discussion: Boundary Dynamics, Adaptation, Boundary Robustness, Interpretability, Practical Trade-Off, and Potential Extension to Spatiotemporal

5.1. Interpreting α : Spatial–Boundary Dynamics Across Cities

The adaptive gating parameter α measures how the model balances spatial and boundary information, serving as an interpretable signal of each city’s underlying structural regime. Its distribution provides insight into how different urban morphologies mediate the interaction between spatial proximity and administrative segmentation.
As shown in Figure 7 for Bangkok, α values remain below 0.5, increasing slightly from 0.43 to 0.46 as the neighborhood size (K) expands. This pattern indicates that boundary effects continue to dominate even at broader spatial scales. The trend reflects Bangkok’s village-based urban structure, where private housing developments are internally uniform but separated by distinct price gaps along their boundaries. Bangkok therefore represents a boundary-aligned regime, where spatial continuity exists but remains secondary to administrative or community segmentation.
As shown in Figure 8 for King Country, α averages around 0.44 across scales but shows greater dispersion as K increases, suggesting diverse local dynamics. Suburban areas depend more strongly on spatial proximity, while central zones are shaped by zoning and regulatory boundaries. The coexistence of these conditions produces a hybrid regime, in which spatial and administrative influences interact differently depending on the local context.
As shown in Figure 9 for Singapore, α decreases markedly from 0.51 to 0.33 as K grows, showing that boundary effects intensify with neighborhood expansion. The narrow distribution indicates structural uniformity across the city, consistent with Singapore’s centrally planned housing environment dominated by Housing and Development Board (HDB) estates. These estates exhibit strong internal homogeneity, with prices organized primarily by block-level boundaries. Singapore thus represents a boundary-dominant regime.
Across the three regimes, represented by boundary-aligned Bangkok, hybrid King County, and boundary-dominant Singapore, the results show that α effectively reflects key variations in urban structure. It operates not merely as a model parameter but as a diagnostic indicator of spatial–administrative balance, offering a more interpretable understanding of how governance and morphology shape property markets.

5.2. Boundary Robustness Under Weak and Artificial Segmentation

To assess the robustness of BD-GNN under imperfect boundary definitions, we conduct a controlled boundary-perturbation experiment on the Bangkok dataset, where villages act as explicit boundaries. We evaluate boundary robustness by selectively perturbing village labels for houses located near village borders, while keeping the underlying kNN graph topology unchanged. The perturbation level is increased from 0% to 60% to emulate progressively weaker or partially misaligned segmentation specifically in the most ambiguous boundary regions. Noise levels are limited to 60% because perturbations beyond this threshold no longer represent plausible boundary uncertainty in real-world valuation settings and instead correspond to near-random segmentation, which falls outside the intended scope of this robustness analysis.
Figure 10 (left) shows that prediction error increases gradually as border noise rises for both neighborhood sizes. This pattern indicates that BD-GNN is sensitive to boundary degradation in the expected direction, yet does not exhibit abrupt failure under moderate boundary perturbations. The degradation is smooth and limited in magnitude, suggesting that the model retains useful predictive structure even when boundary labels are partially corrupted at border areas.
Figure 10 (right) reports the behavior of the adaptive gating parameter α . Overall, α tends to move upward under higher noise levels, indicating a shift toward relying more on spatial continuity when boundary information becomes less reliable. At the same time, the response is not strictly monotonic, particularly for larger neighborhoods ( K = 80 ), where broader connectivity increases interactions across villages and introduces more heterogeneous signals. This non-monotonic pattern reflects the fact that the gating mechanism is learned from data and adaptively balances spatial and boundary information under uncertainty, rather than following a fixed or deterministic rule.
Taken together, these results address the concern of weak or imperfect boundaries. BD-GNN does not assume boundary labels to be error-free; instead, it treats segmentation as a soft structural signal whose contribution is adaptively regulated. When border-level boundary information is degraded, the model maintains stable behavior: prediction error increases smoothly, while α adjusts to re-balance boundary-based and spatial aggregation. This supports the robustness of BD-GNN to boundary uncertainty in realistic settings where boundary regions are most prone to ambiguity or misclassification.

5.3. Model Adaptability and Transferability

The variations in α across cities highlight the importance of adaptive modeling. Conventional GNNs typically apply a single message-passing strategy regardless of spatial or administrative context, which can lead to systematic bias when these influences vary widely. In contrast, BD-GNN dynamically adjusts its balance between spatial and boundary pathways through the adaptive gating mechanism.
When α approaches 1, inter-boundary connections dominate the message-passing process, whereas when α approaches 0, intra-boundary relationships prevail. This flexibility allows BD-GNN to respond automatically to local urban morphology without requiring manual tuning or prior zoning assumptions. The model therefore learns the appropriate structural emphasis directly from data, aligning its internal representation with the city’s actual spatial organization.
From a practical perspective, α also functions as a diagnostic tool for model transferability. Before applying BD-GNN to a new context, analyzing α on a validation subset can reveal the city’s dominant regime:
  • Hybrid regimes (for example, King County) may require stronger spatial aggregation to capture inter-boundary diffusion.
  • Boundary-dominant regimes (for example, Singapore) benefit from a higher weighting on boundary pathways to reflect institutional segmentation.
  • Boundary-aligned regimes (for example, Bangkok) represent intermediate conditions in which both spatial and boundary signals exert comparable influence.
This flexibility allows BD-GNN to adapt effectively across cities with different spatial and administrative configurations. Rather than training a separate model for each area, the same architecture can adjust its α -based balance between spatial continuity and boundary segmentation. As a result, BD-GNN functions not only as a predictive tool but also as an adaptive framework for multi-city property valuation.

5.4. Policy and Planning Relevance

Understanding how boundaries influence property values is crucial for designing equitable and efficient urban policies. The α -based regimes identified above offer practical guidance for aligning planning instruments with the structural organization of each area.
Boundary-dominant regimes, such as in Singapore, indicate that property values are largely determined within administrative units. In such cases, block-level taxation, estate-specific development controls, or targeted subsidy programs are likely to be most effective. Hybrid regimes, such as in King County, exhibit mixed spatial interactions, suggesting the need for inter-boundary coordination, including regional infrastructure planning, inter-municipal zoning, or coordinated service delivery. Boundary-aligned regimes, such as in Bangkok, emphasize the internal coherence of residential clusters, where neighborhood-scale interventions, such as localized land-use regulations or community-level infrastructure investments, can yield significant impacts.
Rather than applying uniform, city-wide valuation rules, policy design should align with the dominant spatial–administrative regime. BD-GNN supports this regime-aware formulation by revealing how spatial and boundary signals interact across scales. This integration of analytical insight and governance logic bridges predictive modeling with practical decision-making, enabling data-driven interventions that reflect the spatial realities of urban morphology. In this way, BD-GNN transforms boundary effects, traditionally seen as constraints, into measurable and actionable spatial processes.

5.5. Explainability and Local Insights

While α provides a global perspective on spatial–boundary dynamics, understanding individual predictions requires local interpretability. BD-GNN integrates the GNN Explainer to identify the most influential neighbors and structural attributes that contribute to each property’s estimated value.
Figure 11 shows a representative example from Bangkok. For a selected property, the model identifies key neighbors, two of which lie across administrative boundaries, with a gating value of α = 0.757 . It also ranks the most influential structural features, including building area, number of floors, age, and land size. This node-level interpretation clarifies how spatial and physical attributes interact in determining the predicted price.
The combination of global interpretability (via α ) and local transparency (via GNN Explainer) allows BD-GNN to move beyond black-box prediction. The global signal captures citywide structural regimes, while the local explanation exposes the reasoning behind individual predictions. Together, they enhance transparency, user trust, and the interpretive value of spatial AI in real-world valuation practice.

5.6. Practical Trade-Off: Accuracy and Computational Cost

The computational cost of BD-GNN should be understood in relation to its modeling objective rather than as a direct efficiency comparison with classical spatial econometric models. Spatial regression models such as SAR are designed primarily for explanatory analysis under fixed linear specifications and are estimated through a single model-fitting procedure. However, their computational cost increases rapidly with sample size due to the construction and manipulation of large spatial weight matrices.
As shown in Table A2, SAR exhibits high computational and memory demands, especially under large and dense spatial graphs. In practice, this limitation becomes critical for large-scale datasets. In our experiments, SAR could not be estimated for the Singapore dataset ( n = 177 , 171 ) at either K = 5 or K = 80 , even when using a high-RAM environment (51 GB) on Google Colab, highlighting its limited scalability in nationwide or city-scale mass appraisal settings.
For graph-based valuation models, neighborhood size (K) plays a central role in determining predictive performance. While small neighborhoods preserve local homogeneity, larger neighborhoods increasingly mix information across administrative boundaries. Standard GNNs aggregate neighbor information uniformly, which often leads to signal dilution when boundary-induced heterogeneity is strong.
BD-GNN addresses this issue by explicitly separating spatial proximity and boundary-based neighborhood structure, and by learning how to balance them through the adaptive gating parameter α . This allows the model to maintain stable and improved accuracy at larger neighborhood sizes, particularly in cities where administrative segmentation strongly shapes property values.
In practice, the trade-off is therefore between modeling simplicity and structural fidelity. SAR remains suitable for moderate-sized datasets where fast estimation and global interpretability are prioritized. BD-GNN is most appropriate when accurate valuation under boundary-driven heterogeneity is required, and where a moderate and predictable computational overhead is acceptable.

5.7. Potential Extension to Spatiotemporal Property Valuation

This study focuses on a static property valuation setting in order to isolate the structural effects of administrative boundaries on spatial price formation. This design choice allows the boundary-induced heterogeneity to be examined without interference from temporal market fluctuations, thereby enabling a clearer interpretation of the proposed boundary-aware learning mechanism.
Nevertheless, temporal dynamics are undeniably important in real estate markets, where property values evolve over time in response to macroeconomic conditions, policy changes, and local development processes. Extending BD-GNN to a spatiotemporal setting therefore represents a natural and meaningful direction for future research. Such an extension would require additional modeling considerations, including constructing graphs that evolve over time, ensuring consistent boundary definitions across periods, and using transaction data with adequate temporal coverage.
Importantly, the dual-path architecture of BD-GNN provides a flexible foundation for incorporating temporal information. The explicit separation between spatial proximity and boundary-based relations could be combined with temporal graph neural networks or sequence-based learning frameworks, allowing spatial boundary effects and temporal price dynamics to be modeled jointly rather than conflated. For example, temporal encoders could be applied to node representations while preserving the boundary-aware aggregation mechanism at each time step.
While such spatiotemporal integration lies beyond the scope of the current study, the results presented here demonstrate that explicitly modeling administrative boundaries yields clear benefits even in a static setting. This suggests that boundary-aware spatial learning is a complementary component, rather than a substitute, to temporal modeling, and highlights the potential of BD-GNN as a building block for future spatiotemporal property valuation frameworks.

5.8. Summary of Discussion

This study shows that property values in urban areas are shaped not only by spatial proximity but also by administrative boundaries that organize markets into distinct segments. These boundaries function as socio-economic separators. This means that locations that are geographically close may still differ substantially in value if they lie within different governance or development contexts.
BD-GNN incorporates this segmentation directly by separating intra- and inter-boundary message passing and learning the relative importance of each through the adaptive gating parameter ( α ). The learned α values indicate stable spatial–administrative regimes that reveal where price continuity holds and where boundary-driven differentiation is structurally reinforced. This provides a transparent mechanism for understanding how local market identities form.
Explainable GNN analysis shows that neighborhood features, access to amenities, and local development history work together with boundary segmentation to shape price patterns. In contrast, conventional GNNs tend to diffuse information uniformly and therefore overlook these distinctions.
Although BD-GNN is formulated under a static valuation setting in this study, extending the model to incorporate temporal dynamics is a promising direction for future work. Real estate markets are inherently time-dependent, with major macroeconomic events and policy interventions unfolding over time and influencing price formation. The dual-path, boundary-aware architecture of BD-GNN provides a natural foundation for integration with temporal graph or sequence-based models, enabling future research to jointly model spatial segmentation and temporal market dynamics.
All in all, the findings highlight that structured heterophily arising from governance, zoning, and administrative boundaries is a critical yet underrepresented factor in property valuation. Explicitly modeling this structure improves predictive accuracy and clarifies how urban morphology shapes price identities. In practice, this positions BD-GNN as a suitable choice when valuation accuracy and structural insight are prioritized, rather than as a replacement for faster but more rigid modeling approaches. This establishes a stronger foundation for decision support in urban planning, real estate governance, and valuation practice.

6. Conclusions

This study proposed BD-GNN, a boundary-aware dual-path graph neural network for property valuation in complex urban environments. By explicitly separating intra-boundary and inter-boundary message passing and adaptively balancing spatial proximity and administrative segmentation, BD-GNN addresses a key limitation of existing valuation models that uniformly aggregate neighborhood information and overlook boundary-induced price discontinuities.
Empirical evaluations across three heterogeneous urban datasets demonstrate that BD-GNN consistently improves predictive accuracy and stability in boundary-driven markets, reducing MAPE by up to 7.9% and increasing R 2 by up to 5.0% compared to strong baselines, particularly at larger neighborhood scales where standard GNNs tend to oversmooth local price signals. Additional robustness and scalability analyses further show that BD-GNN maintains linear scalability with respect to graph size and adapts gracefully to uncertainty or misalignment in administrative boundaries.
Beyond predictive performance, BD-GNN contributes a form of structural interpretability through its adaptive gating mechanism, enabling insight into the relative importance of spatial continuity versus boundary-induced segmentation in different urban contexts. This makes the framework particularly relevant for applied valuation tasks where transparency, boundary effects, and regulatory compliance play a central role, including mass appraisal, property taxation, and urban planning.
While this study examines static valuation settings to isolate boundary effects, the dual-path architecture provides a natural foundation for incorporating temporal dynamics. Extending BD-GNN to spatiotemporal valuation represents the most immediate direction for future research, while incorporating alternative or learned boundary definitions and integrating transformer-based architectures offer longer-term extensions. We believe that boundary-aware graph modeling provides a useful foundation for more realistic, interpretable, and policy-relevant property valuation in complex urban systems.

Author Contributions

Conceptualization, Jetana Somkamnueng and Kitsana Waiyamai; methodology, Jetana Somkamnueng and Kitsana Waiyamai; software, Jetana Somkamnueng; validation, Jetana Somkamnueng and Kitsana Waiyamai; writing—original draft preparation, Jetana Somkamnueng; writing—review and editing, Jetana Somkamnueng and Kitsana Waiyamai; supervision, Kitsana Waiyamai All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by a student scholarship from the Faculty of Engineering, Kasetsart University.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The Bangkok dataset used in this study is not publicly available due to data-sharing restrictions and confidentiality agreements. Requests to access this dataset should be directed to the Thai Government Housing Bank (GHB). The King County and Singapore datasets are derived from public sources. The code related to this study is publicly available at https://github.com/jetanaso/bdgnn-property-valuation (accessed on 15 September 2025).

Acknowledgments

During the preparation of this study, the authors used AI language tools, including ChatGPT (GPT-5) and Claude 4.5 Sonnet, for the purposes of language refinement, translation assistance, and content organization. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Computational Positioning and Cost Characteristics

This appendix summarizes the computational characteristics of the models evaluated in this study, with the goal of clarifying their computational positioning rather than ranking raw efficiency. The analysis combines theoretical complexity with empirical runtime and memory profiling to contextualize each model within its intended valuation paradigm.
Table A1 defines the notation used in the complexity analysis. For GNNs, computational complexity is reported per training epoch under full-batch training with a directed kNN graph, where the number of edges scales linearly with the number of nodes ( E N K ). For the SAR model, complexity is reported per end-to-end model fit, reflecting its single-estimation procedure.
Table A2 presents the dominant asymptotic time complexity of all models. SAR is optimized for global linear spatial inference and relies on CPU-based numerical solvers, whereas GNNs are representation-learning models trained through repeated message passing and nonlinear transformations. As a result, their computational costs arise from fundamentally different execution paradigms and are not directly comparable on a per-iteration basis.
Table A1. Notations used in computational complexity analysis.
Table A1. Notations used in computational complexity analysis.
SymbolDescription
NNumber of nodes (properties) in the graph
ENumber of edges in the graph; for a directed KNN graph, E N K
KNumber of nearest neighbors used in graph construction
FNumber of input features per node
HHidden feature dimension in GNN layers
LNumber of message-passing layers
hNumber of attention heads in GAT
| P | Number of hop powers in MixHop
ANumber of aggregators in PNA
SNumber of scalers in PNA
E s Number of spatial (inter-boundary) edges in BD-GNN
E v Number of village/boundary (intra-boundary) edges in BD-GNN
CModel-specific constant factor (e.g., number of sector–ring views in SHGNN)
Table A2. Theoretical computational complexity of spatial econometric and graph-based models. For GNNs, complexity is reported per training epoch under full-batch training with a directed KNN graph ( E N K ). For spatial econometric models, complexity is reported per model fit.
Table A2. Theoretical computational complexity of spatial econometric and graph-based models. For GNNs, complexity is reported per training epoch under full-batch training with a directed KNN graph ( E N K ). For spatial econometric models, complexity is reported per model fit.
ModelArchitecture Used in this StudyDominant Time Complexity
SARExact maximum likelihood estimation O ( n 3 ) (dense) or sparse-dependent
GCN2-layer graph convolutional network O 2 ( N F H + E H )
GAT2-layer, h = 4 attention heads O 2 ( N F ( h H ) + E ( h H ) )
GraphSAGE (mean)2-layer mean aggregator O 2 ( N F H + E H )
MixHop2-layer, hop powers | P | = 3 O 2 | P | ( N F H + E H )
H2GCN2-layer hop-separated aggregation (0-, 1-, and 2-hop) O 2 ( N F H + 3 E H )
FAGCN2-layer low-/high-frequency decomposition O 2 ( N F H + 2 E H )
PNA2-layer, A = 4 aggregators and S = 3 scalers O 2 ( N F H + ( A · S ) E H )
SHGNNSector–ring graph decomposition with multi-view fusion and gating O C ( N F H + E H ) , C 1
BD-GNN (ours)Dual-path (spatial + boundary) GCN, 3 + 3 layers with node-wise gating O 6 N F H + 3 E H + N ( F + 2 H ) H
Note: For SAR, dense exact ML has cubic complexity, while sparse ML implementations (as used in this study) rely on sparse matrix factorization or iterative solvers, with complexity depending on sparsity and the number of likelihood evaluations.
Table A3 reports empirical runtime and memory profiling on the Bangkok dataset under model-appropriate hardware settings. All GNN-based models are trained on an NVIDIA A100 GPU using epoch-based optimization, while SAR is estimated once on a high-RAM CPU environment. In this setup, SAR exhibits substantially higher CPU memory usage and longer end-to-end fitting time, increasing from 293.27 s at K = 5 to 499.27 s at K = 80 , as denser spatial weight matrices increase computational burden. GPU memory and runtime are therefore not reported for SAR.
Table A3. Empirical runtime and memory profiling are conducted on the Bangkok dataset under controlled hardware settings. All GNN-based models are executed on an NVIDIA A100 GPU on Google Colab, while the SAR model is estimated on a standard CPU with high-RAM environment on Google Colab. Reported metrics include peak CPU memory usage, peak GPU memory allocation, average epoch time, and total training time. Bold and underline indicate the best (lowest) and worst (highest) values within the GNN family, respectively.
Table A3. Empirical runtime and memory profiling are conducted on the Bangkok dataset under controlled hardware settings. All GNN-based models are executed on an NVIDIA A100 GPU on Google Colab, while the SAR model is estimated on a standard CPU with high-RAM environment on Google Colab. Reported metrics include peak CPU memory usage, peak GPU memory allocation, average epoch time, and total training time. Bold and underline indicate the best (lowest) and worst (highest) values within the GNN family, respectively.
KModelCPU PeakGPU PeakAvg. EpochTotal Training
(MB)(MB)(s)(s)
5Spatial econometric models (single model fit; no epochs; CPU-based)
SAR4681293.27
Graph neural networks (full-batch training; epoch-based; GPU-based)
FAGCN20491140.01195.87
GAT20233760.00974.63
GCN2045930.00643.41
GraphSAGE2009410.00572.23
H2GCN2068660.00993.82
MixHop20522350.00923.30
PNA20852010.01363.53
SHGNN21459940.068514.00
BD-GNN (ours)20891180.01587.45
80Spatial econometric models (single model fit; no epochs; CPU-based)
SAR11,036499.27
Graph neural networks (full-batch training; epoch-based; GPU-based)
FAGCN21199650.036720.69
GAT209447690.051323.91
GCN21329400.019612.69
GraphSAGE20382360.01024.05
H2GCN20911910.01084.76
MixHop213021170.035114.45
PNA213011070.064711.75
SHGNN223617440.094645.79
BD-GNN (ours)21135770.033626.40
Note: “Avg. Epoch” is only defined for epoch-based training (GNNs). SAR report end-to-end model fitting time. GPU peak is not logged for SAR in this setup (marked as “–”).
Within the GNN family, runtime and memory usage vary primarily due to architectural constant factors rather than asymptotic order. Lightweight models such as GCN and GraphSAGE incur low overhead, while more expressive architectures such as MixHop, PNA, and SHGNN exhibit higher computational cost due to multi-hop, multi-aggregator, or structural decompositions.
BD-GNN introduces a moderate constant-factor overhead by separating spatial and boundary relations into two parallel message-passing paths with node-wise adaptive gating. Importantly, this design preserves linear scalability with respect to both nodes and edges, as the spatial and boundary edges partition the original graph rather than duplicating it. These results position BD-GNN as a boundary-aware representation-learning model suited for capturing market segmentation effects, rather than as a replacement for classical spatial econometric approaches focused on global linear dependence.

References

  1. Anselin, L. Lagrange multiplier test diagnostics for spatial dependence and spatial heterogeneity. Geogr. Anal. 1988, 20, 1–17. [Google Scholar] [CrossRef]
  2. Brunsdon, C.; Fotheringham, A.S.; Charlton, M.E. Geographically weighted regression: A method for exploring spatial nonstationarity. Geogr. Anal. 1996, 28, 281–298. [Google Scholar] [CrossRef]
  3. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  4. Mora-Garcia, R.T.; Cespedes-Lopez, M.F.; Perez-Sanchez, V.R. Housing price prediction using machine learning algorithms in COVID-19 times. Land 2022, 11, 2100. [Google Scholar] [CrossRef]
  5. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
  6. Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2016, arXiv:1609.02907. [Google Scholar] [CrossRef]
  7. Hamilton, W.; Ying, Z.; Leskovec, J. Inductive representation learning on large graphs. arXiv 2017, arXiv:1710.09471. [Google Scholar] [CrossRef]
  8. Veličković, P.; Cucurull, G.; Casanova, A.; Romero, A.; Liò, P.; Bengio, Y. Graph Attention Networks. arXiv 2017, arXiv:1710.10903. [Google Scholar] [CrossRef]
  9. Zhu, D.; Liu, Y.; Yao, X.; Fischer, M.M. Spatial regression graph convolutional neural networks: A deep learning paradigm for spatial multivariate distributions. GeoInformatica 2022, 26, 645–676. [Google Scholar] [CrossRef]
  10. Yang, Z.; Hong, Z.; Zhou, R.; Ai, H. Graph convolutional network-based model for megacity real estate valuation. IEEE Access 2022, 10, 104811–104828. [Google Scholar] [CrossRef]
  11. Das, S.S.S.; Ali, M.E.; Li, Y.F.; Kang, Y.B.; Sellis, T. Boosting house price predictions using geo-spatial network embedding. Data Min. Knowl. Discov. 2021, 35, 2221–2250. [Google Scholar] [CrossRef]
  12. Peng, H.; Li, J.; Wang, Z.; Yang, R.; Liu, M.; Zhang, M.; Yu, P.S.; He, L. Lifelong property price prediction: A case study for the toronto real estate market. IEEE Trans. Knowl. Data Eng. 2021, 35, 2765–2780. [Google Scholar] [CrossRef]
  13. Riveros, E.; Vairetti, C.; Wegmann, C.; Truffa, S.; Maldonado, S. Scalable Property Valuation Models via Graph-based Deep Learning. arXiv 2024, arXiv:2405.06553. [Google Scholar] [CrossRef]
  14. Abu-El-Haija, S.; Perozzi, B.; Kapoor, A.; Alipourfard, N.; Lerman, K.; Harutyunyan, H.; Ver Steeg, G.; Galstyan, A. Mixhop: Higher-order graph convolutional architectures via sparsified neighborhood mixing. In Proceedings of the International Conference on Machine Learning, PMLR, Long Beach, CA, USA, 9–15 June 2019; pp. 21–29. [Google Scholar]
  15. Chien, E.; Peng, J.; Li, P.; Milenkovic, O. Adaptive universal generalized pagerank graph neural network. arXiv 2020, arXiv:2006.07988. [Google Scholar] [CrossRef]
  16. Zhu, J.; Yan, Y.; Zhao, L.; Heimann, M.; Akoglu, L.; Koutra, D. Beyond homophily in graph neural networks: Current limitations and effective designs. Adv. Neural Inf. Process. Syst. 2020, 33, 7793–7804. [Google Scholar]
  17. Pei, H.; Wei, B.; Chang, K.C.C.; Lei, Y.; Yang, B. Geom-gcn: Geometric graph convolutional networks. arXiv 2020, arXiv:2002.05287. [Google Scholar] [CrossRef]
  18. Bo, D.; Wang, X.; Shi, C.; Shen, H. Beyond low-frequency information in graph convolutional networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtually, 2–9 February 2021; Volume 35, pp. 3950–3957. [Google Scholar] [CrossRef]
  19. Corso, G.; Cavalleri, L.; Beaini, D.; Liò, P.; Veličković, P. Principal neighbourhood aggregation for graph nets. Adv. Neural Inf. Process. Syst. 2020, 33, 13260–13271. [Google Scholar]
  20. Luan, S.; Hua, C.; Lu, Q.; Zhu, J.; Zhao, M.; Zhang, S.; Chang, X.W.; Precup, D. Revisiting heterophily for graph neural networks. Adv. Neural Inf. Process. Syst. 2022, 35, 1362–1375. [Google Scholar] [CrossRef]
  21. Lim, D.; Hohne, F.; Li, X.; Huang, S.L.; Gupta, V.; Bhalerao, O.; Lim, S.N. Large scale learning on non-homophilous graphs: New benchmarks and strong simple methods. Adv. Neural Inf. Process. Syst. 2021, 34, 20887–20902. [Google Scholar]
  22. Xiao, C.; Zhou, J.; Huang, J.; Xu, T.; Xiong, H. Spatial heterophily aware graph neural networks. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA, 6–10 August 2023; pp. 2752–2763. [Google Scholar] [CrossRef]
  23. Black, S.E. Do better schools matter? Parental valuation of elementary education. Q. J. Econ. 1999, 114, 577–599. [Google Scholar] [CrossRef]
  24. Brasington, D.M.; Hite, D. Demand for environmental quality: A spatial hedonic analysis. Reg. Sci. Urban Econ. 2005, 35, 57–82. [Google Scholar] [CrossRef]
  25. McMillen, D.P.; McDonald, J.F. Land values in a newly zoned city. Rev. Econ. Stat. 2002, 84, 62–72. [Google Scholar] [CrossRef]
  26. Nelson, A.C.; Pendall, R. Growth Management and Housing Affordability: The Academic Evidence. In Growth Management and Affordable Housing: Do they Conflict? Brookings Institution Press: Washington, DC, USA, 2004; p. 117. [Google Scholar]
  27. Machin, S. Houses and schools: Valuation of school quality through the housing market. Labour Econ. 2011, 18, 723–729. [Google Scholar] [CrossRef]
  28. Gibbons, S.; Machin, S.; Silva, O. Valuing school quality using boundary discontinuities. J. Urban Econ. 2013, 75, 15–28. [Google Scholar] [CrossRef]
  29. Rosen, S. Hedonic prices and implicit markets: Product differentiation in pure competition. J. Political Econ. 1974, 82, 34–55. [Google Scholar] [CrossRef]
  30. LeSage, J.P. The Theory and Practice of Spatial Econometrics; University of Toledo: Toledo, OH, USA, 1999; Volume 28, pp. 1–39. [Google Scholar]
  31. Dubin, R.A. Spatial autocorrelation and neighborhood quality. Reg. Sci. Urban Econ. 1992, 22, 433–452. [Google Scholar] [CrossRef]
  32. Fotheringham, A.S.; Yang, W.; Kang, W. Multiscale geographically weighted regression (MGWR). Ann. Am. Assoc. Geogr. 2017, 107, 1247–1265. [Google Scholar] [CrossRef]
  33. Antipov, E.A.; Pokryshevskaya, E.B. Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics. Expert Syst. Appl. 2012, 39, 1772–1778. [Google Scholar] [CrossRef]
  34. Geerts, M.; vanden Broucke, S.; De Weerdt, J. Graph neural networks for house price prediction: Do or don’t? Int. J. Data Sci. Anal. 2025, 20, 3563–3593. [Google Scholar] [CrossRef]
  35. Law, S.; Paige, B.; Russell, C. Take a look around: Using street view and satellite images to estimate house prices. ACM Trans. Intell. Syst. Technol. (TIST) 2019, 10, 1–19. [Google Scholar] [CrossRef]
  36. Wang, P.Y.; Chen, C.T.; Su, J.W.; Wang, T.Y.; Huang, S.H. Deep learning model for house price prediction using heterogeneous data analysis along with joint self-attention mechanism. IEEE Access 2021, 9, 55244–55259. [Google Scholar] [CrossRef]
  37. Platonov, O.; Kuznedelev, D.; Diskin, M.; Babenko, A.; Prokhorenkova, L. A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress? arXix 2023, arXiv:2302.11640. [Google Scholar] [CrossRef]
  38. Lundberg, S.M.; Lee, S.I. A Unified Approach to Interpreting Model Predictions. Adv. Neural Inf. Process. Syst. 2017, 30, 4768–4777. [Google Scholar]
  39. Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef]
  40. Ying, Z.; Bourgeois, D.; You, J.; Zitnik, M.; Leskovec, J. Gnnexplainer: Generating explanations for graph neural networks. Adv. Neural Inf. Process. Syst. 2019, 32, 9244–9255. [Google Scholar]
  41. Luo, D.; Cheng, W.; Xu, D.; Yu, W.; Zong, B.; Chen, H.; Zhang, X. Parameterized explainer for graph neural network. Adv. Neural Inf. Process. Syst. 2020, 33, 19620–19631. [Google Scholar]
  42. Yuan, H.; Yu, H.; Wang, J.; Li, K.; Ji, S. On explainability of graph neural networks via subgraph explorations. In Proceedings of the International Conference on Machine Learning, PMLR, Virtual, 18–24 July 2021; pp. 12241–12252. [Google Scholar]
  43. Karamanou, A.; Brimos, P.; Kalampokis, E.; Tarabanis, K. Explainable Graph Neural Networks: An Application to Open Statistics Knowledge Graphs for Estimating House Prices. Technologies 2024, 12, 128. [Google Scholar] [CrossRef]
  44. Newman, M.E. Assortative mixing in networks. Phys. Rev. Lett. 2002, 89, 208701. [Google Scholar] [CrossRef] [PubMed]
  45. Moran, P.A. Notes on continuous stochastic phenomena. Biometrika 1950, 37, 17–23. [Google Scholar] [CrossRef] [PubMed]
  46. Li, X.; Zhu, R.; Cheng, Y.; Shan, C.; Luo, S.; Li, D.; Qian, W. Finding global homophily in graph neural networks when meeting heterophily. In Proceedings of the International Conference on Machine Learning, PMLR, Baltimore, MD, USA, 17–23 July 2022; pp. 13242–13256. [Google Scholar] [CrossRef]
  47. Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 4–24. [Google Scholar] [CrossRef]
  48. LeSage, J.; Pace, R.K. Introduction to Spatial Econometrics; Chapman and Hall/CRC: Boca Raton, FL, USA, 2009. [Google Scholar]
  49. Dwivedi, V.P.; Joshi, C.K.; Luu, A.T.; Laurent, T.; Bengio, Y.; Bresson, X. Benchmarking graph neural networks. J. Mach. Learn. Res. 2023, 24, 1–48. [Google Scholar]
Figure 1. Spatial distribution of housing transactions for (a) Bangkok, (b) King County, and (c) Singapore datasets. Different color represents different administrative boundary.
Figure 1. Spatial distribution of housing transactions for (a) Bangkok, (b) King County, and (c) Singapore datasets. Different color represents different administrative boundary.
Ijgi 15 00052 g001
Figure 2. An example of village boundaries in Bangkok. Each color represents a distinct village boundary unit.
Figure 2. An example of village boundaries in Bangkok. Each color represents a distinct village boundary unit.
Ijgi 15 00052 g002
Figure 3. kNN Graph Construction—Horizontal 2D case ( K = 5 ).
Figure 3. kNN Graph Construction—Horizontal 2D case ( K = 5 ).
Ijgi 15 00052 g003
Figure 4. kNN graph construction for the vertical 3D case ( K = 5 ). Different colors represent different building storeys, illustrating vertical neighborhood relationships across floors.
Figure 4. kNN graph construction for the vertical 3D case ( K = 5 ). Different colors represent different building storeys, illustrating vertical neighborhood relationships across floors.
Ijgi 15 00052 g004
Figure 5. The architecture of BD-GNN. (a) Spatial aggregation, (b) Inter-boundary pathway (geometric proximity), (c) Intra-boundary pathway (boundary-induced segmentation), (d) Adaptive gating mechanism, and (e) Prediction layer.
Figure 5. The architecture of BD-GNN. (a) Spatial aggregation, (b) Inter-boundary pathway (geometric proximity), (c) Intra-boundary pathway (boundary-induced segmentation), (d) Adaptive gating mechanism, and (e) Prediction layer.
Ijgi 15 00052 g005
Figure 6. Cumulative General Error Rate (GER, %) at an error threshold of 20% for K = 5 and K = 80 across three datasets. Higher GER values indicate a larger proportion of predictions falling within the threshold, reflecting a more favorable overall error distribution. Dashed-bold entries denote the best performance for each dataset–threshold combination. BD-GNN consistently achieves the highest GER across all settings, showing greater robustness and predictive precision compared to baseline models.
Figure 6. Cumulative General Error Rate (GER, %) at an error threshold of 20% for K = 5 and K = 80 across three datasets. Higher GER values indicate a larger proportion of predictions falling within the threshold, reflecting a more favorable overall error distribution. Dashed-bold entries denote the best performance for each dataset–threshold combination. BD-GNN consistently achieves the highest GER across all settings, showing greater robustness and predictive precision compared to baseline models.
Ijgi 15 00052 g006
Figure 7. Distribution of α values—Bangkok.
Figure 7. Distribution of α values—Bangkok.
Ijgi 15 00052 g007
Figure 8. Distribution of α values—King County.
Figure 8. Distribution of α values—King County.
Ijgi 15 00052 g008
Figure 9. Distribution of α values—Singapore.
Figure 9. Distribution of α values—Singapore.
Ijgi 15 00052 g009
Figure 10. Boundary robustness analysis under border-focused village perturbations (Bangkok dataset). The left panel shows prediction error (MAPE) under increasing levels of border noise, while the right panel shows the corresponding mean adaptive gating response ( α ) on the test set. Results are reported for two representative neighborhood sizes ( K = 5 and K = 80 ).
Figure 10. Boundary robustness analysis under border-focused village perturbations (Bangkok dataset). The left panel shows prediction error (MAPE) under increasing levels of border noise, while the right panel shows the corresponding mean adaptive gating response ( α ) on the test set. Results are reported for two representative neighborhood sizes ( K = 5 and K = 80 ).
Ijgi 15 00052 g010
Figure 11. An example of local interpretability—Bangkok case (price in log-scale).
Figure 11. An example of local interpretability—Bangkok case (price in log-scale).
Ijgi 15 00052 g011
Table 1. Descriptive statistics of Bangkok house data.
Table 1. Descriptive statistics of Bangkok house data.
Feature NameUnitTypeMinMaxMeanMedianStd Dev.
land areaWah2 *Float10.3076.3020.3418.805.10
usage aream2Float50.50276.00119.27114.0028.52
ageYearInt4.0048.0020.3023.009.39
num floorsInt1.005.002.012.000.40
price (log1p)Float13.3115.6913.8014.400.35
Note. * 1 Wah2 = 4 Meters2.
Table 2. Descriptive statistics of King County house data.
Table 2. Descriptive statistics of King County house data.
Feature NameTypeMinMaxMeanMedianStd Dev.
bedroomsInt0.0033.003.373.000.93
bathroomsInt0.008.002.112.250.77
sqft livingFloat290.0013,540.002079.901910.00918.44
sqft lotFloat520.001,651,359.0015,106.977618.0041,420.51
floorsFloat1.003.51.491.500.54
waterfrontInt0.001.000.010.000.09
viewInt0.004.000.230.000.77
conditionInt1.005.003.413.000.65
gradeInt1.0013.007.667.001.18
sqft aboveFloat290.009410.001788.391560.00828.09
sqft basementFloat0.004820.00291.510.00442.58
year builtInt1900.002015.001971.001975.0029.37
year renovatedInt0.002015.0084.400.00401.68
price (log1p)Float11.2315.8613.0513.020.53
Table 3. Descriptive statistics of Singapore house data.
Table 3. Descriptive statistics of Singapore house data.
Feature NameUnitTypeMinMaxMeanMedianStd Dev.
number of roomsInt1.006.004.134.000.91
floor aream2Float31.00243.0097.3593.0023.90
remaining leaseYearFloat41.5097.7574.8774.9213.93
num floorInt2.0050.008.818.005.95
price (log1p)Float7.649.658.528.500.26
Table 4. Homophily metrics under kNN and boundary graphs, showing how spatial dependence weakens as K increases.
Table 4. Homophily metrics under kNN and boundary graphs, showing how spatial dependence weakens as K increases.
Dataset (N)GraphKAssort.Moran’s IThresh@5%/10%
Bangkok (16,522)kNN50.7270.71519.51/36.56
kNN800.4380.43213.16/25.60
Boundary0.5730.72016.04/31.05
King County (21,436)kNN50.6460.63115.95/29.56
kNN800.5150.50210.88/21.33
Boundary0.3760.4049.22/18.25
Singapore (177,171)kNN50.8870.76030.53/51.56
kNN800.8190.74723.40/42.73
Boundary0.8870.79325.81/46.56
Note. Assortativity (pairwise edge correlation) and Moran’s I (correlation between values and their spatial lag under row-standardized weights) capture complementary aspects of spatial dependence. Their magnitudes need not match exactly, but both consistently reflect the same trend as neighborhood scale K increases.
Table 5. Edge connection statistics under kNN graphs with different K values, showing how intra-boundary links decrease and inter-boundary links increase as K grows. Bold values highlight representative low-K and high-K settings (e.g., K = 5 and K = 80 ) used to illustrate local and global neighborhoods, respectively.
Table 5. Edge connection statistics under kNN graphs with different K values, showing how intra-boundary links decrease and inter-boundary links increase as K grows. Bold values highlight representative low-K and high-K settings (e.g., K = 5 and K = 80 ) used to illustrate local and global neighborhoods, respectively.
Dataset (N)GraphKEdgesIntra-Boundary (%)Inter-Boundary (%)
Bangkok (16,522)kNN582,61077.622.4
10165,22070.729.3
20330,44060.139.9
40660,88047.053.0
801,321,76034.365.7
King County (21,436)kNN5107,18096.33.7
10214,36094.85.2
20428,72092.57.5
40857,44089.110.9
801,714,88084.016.0
Singapore (177,171)kNN5885,85599.80.2
101,771,71097.92.1
203,543,42086.113.9
407,086,84057.942.1
8014,173,68031.568.5
Table 6. Hyperparameter configuration for all GNN models.
Table 6. Hyperparameter configuration for all GNN models.
ParameterValueDescription
Network Architecture
Hidden dimensions64Number of neurons per GNN layer
Number of GNN layers2–3 *Graph Neural Network layers
Activation functionReLU, eLU **Non-linear activation
Dropout rate0.2Regularization parameter
Output dimension1Price prediction output
Graph Construction
k-NN neighbors5, 10, 20, 40, 80Spatial connectivity parameter
Edge weightingBinary (0, 1)Spatial weight function
Training Configuration
Learning rate0.001Initial optimizer learning rate
OptimizerAdamGradient descent optimizer
Max epochs1000Maximum training iterations
Early stopping patience30Epochs without improvement
Note. * GNN baselines use 2 layers, while BD-GNN uses 3 layers. ** eLU is applied only for the GAT baseline.
Table 7. Comparison of MAPE (%) at K = 5 and K = 80 across three datasets. Bold values indicate the best overall performance, and underlined values indicate the second best-performing method. The arrow (↓) indicates that lower MAPE values correspond to better predictive accuracy.
Table 7. Comparison of MAPE (%) at K = 5 and K = 80 across three datasets. Bold values indicate the best overall performance, and underlined values indicate the second best-performing method. The arrow (↓) indicates that lower MAPE values correspond to better predictive accuracy.
ModelBangkokKing CountySingapore
K = 5 K = 80 K = 5 K = 80 K = 5 K = 80
SAR (1988) [1]15.338615.823818.092616.0462--
GCN (2016) [6]19.4718 ± 0.100122.3293 ± 0.261725.8342 ± 0.266525.7337 ± 0.300617.1562 ± 0.072117.1382 ± 0.0911
GAT (2017) [8]19.6766 ± 0.448119.7184 ± 1.634025.7910 ± 0.138123.8540 ± 2.248017.9461 ± 0.137116.7682 ± 0.1569
GraphSAGE (2017) [7]16.1375 ± 0.266514.3203 ± 0.283821.1287 ± 0.307416.2221 ± 0.394516.2632 ± 0.412514.8680 ± 0.7081
MixHop (2019) [14]15.3097 ± 0.151213.3017 ± 0.120418.8092 ± 0.338913.2200 ± 0.335516.0210 ± 0.362914.2790 ± 0.3392
H2GCN (2020) [16]16.8942 ± 0.440415.0402 ± 0.421922.5333 ± 0.538618.2042 ± 0.619017.3873 ± 0.269315.6753 ± 0.7471
PNA (2020) [19]15.8619 ± 0.230115.2075 ± 0.497120.6827 ± 0.310415.6680 ± 0.820614.5124 ± 0.210712.8045 ± 0.8114
FAGCN (2021) [18]16.6632 ± 0.305014.7351 ± 0.329821.7958 ± 0.288016.7397 ± 0.354317.0319 ± 0.222415.3650 ± 0.4185
SHGNN (2023) [22]15.3039 ± 0.066512.8123 ± 0.093720.8523 ± 0.138413.2661 ± 0.129116.6109 ± 0.092212.1387 ± 0.2164
BD-GNN (ours)14.9518 ± 0.098711.7971 ± 0.129317.8873 ± 0.252412.6354 ± 0.134714.3226 ± 0.059611.5873 ± 0.3977
Note: SAR results for the Singapore dataset are not reported because model estimation could not be completed due to memory constraints. Despite using a high-RAM environment (51 GB) on Google Colab, the spatial weight matrix required for SAR exceeded available memory at both K = 5 and K = 80.
Table 8. Comparison of R 2 at K = 5 and K = 80 across three datasets. Bold values indicate the best overall performance, and underlined values indicate the second best-performing method. The arrow (↑) indicates that higher R 2 values correspond to better explanatory power.
Table 8. Comparison of R 2 at K = 5 and K = 80 across three datasets. Bold values indicate the best overall performance, and underlined values indicate the second best-performing method. The arrow (↑) indicates that higher R 2 values correspond to better explanatory power.
ModelBangkokKing CountySingapore
K = 5 K = 80 K = 5 K = 80 K = 5 K = 80
SAR (1988) [1]0.70380.66470.80560.8261--
GCN (2016) [6]0.5063 ± 0.00460.3849 ± 0.01400.6303 ± 0.00590.6122 ± 0.00780.3012 ± 0.00730.3126 ± 0.0086
GAT (2017) [8]0.4989 ± 0.02180.5154 ± 0.08350.6440 ± 0.00360.6835 ± 0.06680.2404 ± 0.01120.3198 ± 0.0080
GraphSAGE (2017) [7]0.6674 ± 0.00960.7396 ± 0.01020.7580 ± 0.00610.8469 ± 0.00630.3499 ± 0.02460.4414 ± 0.0397
MixHop (2019) [14]0.7004 ± 0.00530.7742 ± 0.00410.8037 ± 0.00590.8898 ± 0.00380.3746 ± 0.02600.4963 ± 0.0230
H2GCN (2020) [16]0.6363 ± 0.01490.7142 ± 0.01270.7289 ± 0.01320.8096 ± 0.01240.2670 ± 0.01670.3910 ± 0.0490
PNA (2020) [19]0.6649 ± 0.00930.6764 ± 0.03050.7573 ± 0.00840.8428 ± 0.01930.4491 ± 0.01660.5490 ± 0.0598
FAGCN (2021) [18]0.6444 ± 0.01610.7220 ± 0.01310.7435 ± 0.00490.8389 ± 0.00700.2869 ± 0.01440.4145 ± 0.0257
SHGNN (2023) [22]0.7013 ± 0.00180.7860 ± 0.00240.7604 ± 0.00250.8888 ± 0.00190.3044 ± 0.00550.6284 ± 0.0134
BD-GNN (ours)0.7134 ± 0.00370.8117 ± 0.00530.8155 ± 0.00470.8958 ± 0.00180.4763 ± 0.00350.6599 ± 0.0211
Note: SAR results for the Singapore dataset are not reported because model estimation could not be completed due to memory constraints. Despite using a high-RAM environment (51 GB) on Google Colab, the spatial weight matrix required for SAR exceeded available memory at both K = 5 and K = 80.
Table 9. Cumulative General Error Rate (GER, %) at error thresholds of 5–20% for K = 5 and K = 80 across three datasets. Higher GER values indicate a larger share of predictions within each threshold. Bold entries denote the best overall performance, and underlined entries indicate the best baseline methods. BD-GNN consistently achieves the highest GER across all settings.
Table 9. Cumulative General Error Rate (GER, %) at error thresholds of 5–20% for K = 5 and K = 80 across three datasets. Higher GER values indicate a larger share of predictions within each threshold. Bold entries denote the best overall performance, and underlined entries indicate the best baseline methods. BD-GNN consistently achieves the highest GER across all settings.
Dataset/ModelGER at K = 5 GER at K = 80
<5%<10%<15%<20%<5%<10%<15%<20%
Bangkok
GCN (2016) [6]16.9433.6448.1960.6413.7128.0041.3153.07
GAT (2017) [8]17.1333.0147.5460.7217.1933.5548.2260.96
GraphSAGE (2017) [7]20.1738.6955.6369.5622.0542.8361.2675.61
MixHop (2019) [14]21.1040.1358.1272.2323.6545.6964.7178.61
H2GCN (2020) [16]18.7437.0253.5967.3721.1940.7058.3672.92
PNA (2020) [19]20.0138.7855.8769.7920.2539.0955.7970.63
FAGCN (2021) [18]19.3137.9754.0667.7022.1142.4960.3074.06
SHGNN (2023) [22]21.4041.5259.1272.0625.9548.3967.3680.33
BD-GNN21.4242.0059.5573.5127.6651.7070.4683.03
King County
GCN (2016) [6]13.7627.1439.3650.7213.7527.0639.6950.80
GAT (2017) [8]13.1726.6239.1550.5014.7629.2942.9755.03
GraphSAGE (2017) [7]16.1731.4845.7558.9221.4341.0558.0671.15
MixHop (2019) [14]18.2135.5751.1764.3827.3150.6168.5880.73
H2GCN (2020) [16]14.4428.8943.0556.0518.9136.9152.7066.05
PNA (2020) [19]14.8629.7144.3258.0320.7140.4958.1572.22
FAGCN (2021) [18]15.3030.3844.3557.2020.3639.7856.4769.75
SHGNN (2023) [22]17.1033.0547.0159.4327.2150.5068.1380.48
BD-GNN19.6838.0253.7166.5929.6453.7870.8381.79
Singapore
GCN (2016) [6]18.0435.8051.4865.0518.0235.3551.3465.09
GAT (2017) [8]17.6934.3949.5862.1718.3636.2052.6166.32
GraphSAGE (2017) [7]19.8938.3654.4767.2722.4642.6859.5372.21
MixHop (2019) [14]20.5639.5355.6968.3522.6743.7461.3174.64
H2GCN (2020) [16]18.2635.7751.3864.2620.7940.0256.6869.69
PNA (2020) [19]23.1344.0460.6072.8726.3949.1466.4878.73
FAGCN (2021) [18]18.8836.7152.4665.2221.5941.2857.9470.73
SHGNN (2023) [22]20.0038.2754.1166.8327.3051.2869.4681.83
BD-GNN23.7444.9861.9074.2028.9853.6171.5183.08
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Somkamnueng, J.; Waiyamai, K. BD-GNN: Integrating Spatial and Administrative Boundaries in Property Valuation Using Graph Neural Networks. ISPRS Int. J. Geo-Inf. 2026, 15, 52. https://doi.org/10.3390/ijgi15020052

AMA Style

Somkamnueng J, Waiyamai K. BD-GNN: Integrating Spatial and Administrative Boundaries in Property Valuation Using Graph Neural Networks. ISPRS International Journal of Geo-Information. 2026; 15(2):52. https://doi.org/10.3390/ijgi15020052

Chicago/Turabian Style

Somkamnueng, Jetana, and Kitsana Waiyamai. 2026. "BD-GNN: Integrating Spatial and Administrative Boundaries in Property Valuation Using Graph Neural Networks" ISPRS International Journal of Geo-Information 15, no. 2: 52. https://doi.org/10.3390/ijgi15020052

APA Style

Somkamnueng, J., & Waiyamai, K. (2026). BD-GNN: Integrating Spatial and Administrative Boundaries in Property Valuation Using Graph Neural Networks. ISPRS International Journal of Geo-Information, 15(2), 52. https://doi.org/10.3390/ijgi15020052

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop