A Novel Region Similarity Measurement Method Based on Ring Vectors

Zhi Cai; Hongyu Pan; Shuaibing Lu; Limin Guo; Xing Su

doi:10.3390/ijgi14120488

College of Computer Science, Beijing University of Technology, Beijing 100124, China

* Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2025, 14(12), 488; https://doi.org/10.3390/ijgi14120488


Abstract

Spatial distribution similarity analysis has extensive application value in multiple domains including geographic information science, urban planning, and engineering site selection. However, traditional regional similarity analysis methods face three key challenges: high sensitivity to directional changes, limitations in feature interpretability, and insufficient adaptability to multi-type data. Addressing these issues, this paper proposes a rotation-invariant spatial distribution similarity analysis method based on ring vectors. This method comprises three stages. First, the traversal starting point of the ring vector is dynamically selected based on the maximum value point of the regional feature matrix. Next, concentric ring features are extracted according to this starting point to achieve multi-scale characterization. Finally, the bidirectional weighted comprehensive distance of ring vectors between regions is calculated to measure the similarity between regions. Three experimental sets verified the method’s effectiveness in terrain matching, engineering site selection, and urban functional area identification. These results confirm its rotational invariance, feature interpretability, and adaptability to multi-type data. This research provides a new technical approach for spatial distribution similarity analysis, with significant theoretical and practical implications for geographic information science, urban planning, and engineering site selection.

Keywords: rotation invariance; spatial distribution similarity; engineering site selection; urban functional area identification; geographic information science

1. Introduction

Spatial data analysis has emerged as a fundamental research domain within geographic information science, remote sensing, and urban planning, garnering significant attention from both academic and industrial communities in recent years [1,2,3]. The rapid advancement of remote sensing technologies has revolutionized data acquisition processes, enabling unprecedented access to high-resolution satellite imagery (e.g., WorldView-4 with 0.31 m resolution, GF-2 with 0.8 m resolution), precise digital elevation models (DEM) (e.g., SRTM 90 m, TanDEM-X 12 m), and fine-grained points of interest (POI) datasets [4,5]. These multi-type spatial datasets provide a rich foundation for regional analysis. However, they also present novel challenges for data processing and pattern recognition methodologies [6].

Within the diverse landscape of spatial analysis applications, the identification of regions exhibiting similar spatial distribution patterns represents a fundamental task with extensive practical utility. In urban planning contexts, planners frequently seek communities with comparable functional characteristics to optimize the distribution of public service facilities and resource allocation [7]. Environmental scientists routinely identify ecological habitats with the corresponding geographical features to forecast species distribution and migration patterns [8]. Disaster management frameworks benefit from analyzing terrain similarities between historical disaster sites and potential risk zones to formulate more effective evacuation protocols [9]. Agricultural researchers compare soil distribution patterns across various regions to determine optimal locations for crop transplantation and yield enhancement [10]. Healthcare systems optimize the placement of medical facilities by identifying areas exhibiting similar population density distributions [11].

These multifaceted applications underscore the critical importance of developing robust spatial pattern similarity analysis methodologies; however, existing regional similarity analysis methods still face several key challenges in practical applications:

Directional Sensitivity Issues. Traditional grid or pixel-based comparison methods, such as template matching algorithms, exhibit high sensitivity to rotation. The accuracy of these methods decreases significantly when similar regions undergo rotational changes [12,13]. When identical geographical features are present in different orientations (e.g., north-south versus east-west river valleys, as shown in Figure 1), conventional methods struggle to identify their intrinsic similarities. This issue is particularly pronounced in remote sensing data processing, as the orientation of topographical features is often determined by natural evolutionary processes. Functionally similar regions may exhibit significant directional differences, exemplified by the contrasting ridge orientations of the Himalayan and Alpine mountain ranges.

Figure 1. Remote Sensing Imagery of Ili River Valley in Xinjiang, China with Different Orientations. (a) East-West Oriented Remote Sensing Imagery of Ili River Valley. (b) North-South Oriented Remote Sensing Imagery of Ili River Valley.
Feature Interpretability Limitations. Although current deep learning-based spatial pattern recognition methods have enhanced feature representation capabilities, their “black box” nature results in similarity measurements lacking geographic semantic interpretations [14]. For instance, Deep Neural Networks (DNNs) classify output images without providing further explanations of scene-corresponding results [15,16]. Consequently, developing techniques that make black-box processes more transparent and comprehensible is crucial in the remote sensing domain. In medical resource allocation scenarios, similarity models that merely output probability values without revealing the spatial association mechanisms between population density distribution and healthcare needs significantly constrain the scientific validity of policy formulation.
Multi-type Data Adaptation Challenges. Existing methods are predominantly tailored to specific data types, limiting their applicability across diverse spatial datasets [17]. For instance, methods optimized for POI data may fail to perform effectively on road network data or terrain elevation models, and vice versa. This specialization restricts their utility in scenarios where practitioners need to analyze different types of spatial information using a unified analytical framework. Furthermore, the lack of methodological versatility complicates comparative analyses across heterogeneous data sources (such as comparing distribution patterns between satellite imagery and point-based datasets), thereby limiting applications that require consistent similarity assessment across multiple spatial data modalities.

To address the aforementioned challenges, this paper proposes a novel rotation-invariant spatial distribution similarity analysis method based on ring vectors.

The proposed approach overcomes the sensitivity limitations of traditional methods under rotational transformations by incorporating three core components: a dynamic starting point selection mechanism, a multi-layer ring vector representation, and a bidirectional matching strategy. These innovations maintain feature interpretability while ensuring adaptability to multi-type data. The principal contributions of this paper are:

Development of a dynamic starting point selection mechanism that identifies the maximum value coordinates within the feature matrix to dynamically determine the starting point for ring vector traversal, thereby significantly mitigating the effects of rotation on similarity analysis.
Creation of a multi-layer ring vector representation framework that extracts concentric ring vectors from the center outward, establishing a multi-scale feature representation that enhances algorithmic robustness against scale variations.
Implementation of a bidirectional matching mechanism that generates complementary feature representations through both clockwise and counterclockwise traversal methods, substantially improving the algorithm’s capacity to recognize mirror-symmetric or inversely rotated distributions.

The comprehensive advantages of our proposed methodology are manifested in: (1) robust invariance to rotational transformations; (2) intuitive and interpretable feature representations that facilitate decision-making processes in practical applications; and (3) versatility across diverse spatial data types, including Digital Elevation Models (DEM), remote sensing imagery, Points of Interest (POI) distributions, and other spatial datasets, enabling broad application potential.

The remainder of this paper is structured as follows: Section 2 reviews the relevant literature, examining the evolutionary trajectory of regional feature representation and similarity measurement methodologies; Section 3 elucidates fundamental definitions and concepts, including spatial grid representation, coordinate transformation, and essential ring vector principles; Section 4 presents a detailed exposition of the proposed methodology and algorithm, encompassing ring vector extraction procedures, similarity calculation techniques, and efficient search strategies; Section 5 comprehensively validates the algorithm’s efficacy through three distinct experimental applications (dam site terrain analysis, radio telescope site evaluation, and urban functional area identification); finally, Section 6 summarizes the research contributions and delineates directions for future investigation.

2. Related Work

2.1. Regional Feature Expression

The effective representation of regional spatial characteristics forms the cornerstone of geographic pattern recognition and similarity analysis. Over the past decade, the evolution of feature expression methodologies has progressed from simple pixel-based descriptors to sophisticated deep learning architectures, each attempting to capture the intrinsic properties of spatial distributions while addressing challenges such as scale variance, rotational sensitivity, and computational efficiency.

Traditional Feature Transformation Approaches leverage mathematical transformations to derive invariant representations from spatial data. Peng et al. [18] employed the Fourier-Mellin transform to extract rotation-invariant spectral features, exploiting the property that Fourier magnitude remains unchanged under rotation. Lu and Yang [19] applied Zernike moments for image feature extraction, utilizing the orthogonality of moment functions to achieve rotational invariance. However, these frequency-domain methods frequently sacrifice spatial locality information during transformation, showing limited sensitivity to subtle variations in spatial distributions. Furthermore, their performance degrades substantially in the presence of noise and local perturbations, which are common in real-world geographic data.

Local Invariant Feature Descriptors were initially developed for computer vision and are widely used in remote sensing for image registration and matching. SIFT (Scale-Invariant Feature Transform) and SURF (Speeded-Up Robust Features) are classic examples [20]. They generate descriptors by detecting keypoints in images and computing local gradient orientation histograms, making them invariant to scale and rotation. Although they excel at extracting local features, applying them to regional features requires additional frameworks, such as aggregation or Bag-of-Words, to integrate all local descriptors [20]. This complicates the descriptor generation process and may reduce performance when handling non-textured areas or continuous gradient fields.

Template Matching and Geometric Alignment Methods represent another major paradigm in regional feature representation. Zhang and Su [21] developed a fast matching algorithm accommodating angular variations through multi-angle template rotation, while Choi and Kim [22] introduced a two-stage approach achieving both rotation and illumination invariance. Traditional multi-angle matching is computationally prohibitive for large-scale registration. Current efforts focus on improving efficiency and robustness in multimodal remote sensing data. For instance, the Multi-scale Template Matching (MSTM) framework utilizes frequency-domain convolutional maps and omni-directional aggregated feature vectors to significantly speed up matching while improving adaptability to geometric differences and non-linear radiation differences between multimodal images [23]. Another strategy involves integrating feature matching with template matching: a “feature-to-template” approach leverages local structural information via Local Self-Similarity (LSS) templates for accurate fine matching after coarse alignment [24], successfully resisting significant geometric and intensity differences. These works demonstrate that the core concept of template matching is being updated with structural and frequency domain techniques to tackle modern geospatial challenges.

Shape-Based Descriptors and Contour Analysis techniques attempt to characterize regions through their geometric properties. Yang et al. [25] proposed an invariant multi-scale descriptor for shape representation, matching, and retrieval, incorporating both global shape characteristics and local boundary features. Wang et al. [26] developed a curvature saliency descriptor for complete and partial shape matching, emphasizing perceptually significant boundary segments. Nevertheless, these methodologies are primarily designed for objects with well-defined boundaries and struggle to characterize continuous spatial distributions exhibiting gradient properties, such as elevation surfaces or population density fields.

Deep Learning-Based Feature Representations have emerged as powerful alternatives in recent years. Convolutional Neural Networks (CNNs) and their variants excel at processing Euclidean grid data (e.g., remote sensing images and DEMs), automatically learning hierarchical spatial features. However, standard CNN architectures are inherently not rotation-invariant. This limitation must be mitigated through data augmentation or by designing specific rotation-equivariant convolutional kernels (e.g., RIC-CNN) [27]. As recent state-of-the-art architectures, Vision Transformers (ViTs) and their variants are being widely applied in remote sensing image analysis. ViT uses a multi-head attention mechanism to capture long-range contextual relationships between pixels (or image patches) [28]. This mechanism gives ViT greater potential than CNNs for capturing global spatial patterns. For non-Euclidean spatial data (such as Point-of-Interest (POI) distributions, traffic networks, or irregular polygonal land parcels), Graph Neural Networks (GNNs) have emerged as a powerful feature representation tool. GNNs learn node representations by aggregating information from neighboring nodes, thereby explicitly encoding spatial topological relationships (e.g., adjacency, connectivity) into the features [29]. This gives GNNs a distinct advantage in processing irregular spatial data and performing relational reasoning [30].

2.2. Regional Similarity Measurement

Quantifying similarity between spatial regions represents a fundamental analytical task with applications spanning environmental monitoring, urban planning, and resource management. The challenge lies in developing metrics that capture meaningful geographic relationships while remaining robust to variations in scale, orientation, and data modality.

Distance-Based Metrics constitute the most straightforward approach to similarity quantification. Classical measures include Euclidean distance, Manhattan distance, and cosine similarity. Although these approaches offer computational efficiency and simplicity, they perform poorly under rotational transformations and fail to capture the intrinsic similarities between regions [31].

Structural and Topological Approaches explicitly model spatial relationships and organizational patterns. Shape-based matching techniques [25,26] evaluate regions through their boundary properties, medial axes, and skeletal representations. Graph-based methods represent regions as networks of interconnected features, comparing them through graph matching algorithms or spectral properties.

Graph-Based Similarity Learning methods model spatial regions as networks of interconnected features. Zhou et al. [32] proposed GRLSTM for trajectory similarity computation with graph-based residual LSTM, effectively capturing spatial network structures and temporal dependencies simultaneously. Recent work on deep graph similarity learning has developed sophisticated embedding techniques that map input graphs to target spaces where distances approximate structural distances in the input space, enabling more nuanced comparison of complex spatial relationships.

Deep Learning Approaches for Similarity Computation have gained substantial attention, often focusing on learning the similarity metric itself. These methods go beyond a simple distance calculation (like Euclidean) and instead learn a complex, non-linear function to compare feature vectors. A prominent architecture for this is the Siamese Neural Network. This approach uses two or more identical sub-networks (which can be the CNNs or GNNs discussed in Section 2.1) to process two input regions. The sub-networks output two feature vectors, which are then fed into a final set of layers that are trained to output a similarity score (e.g., 0 to 1). This “deep metric learning” approach is widely used in remote sensing for tasks like image patch matching. Furthermore, as mentioned in Section 2.1, Graph Neural Networks (GNNs) are particularly relevant. Models like DeepSIM [33] or trajectory similarity frameworks [34] utilize graph-based architectures not just for feature extraction, but as the core mechanism to compute the similarity between two complex graph structures. Despite achieving state-of-the-art performance in specific tasks, these approaches inherit the limitations of deep learning methods: substantial annotated data requirements (e.g., pairs of “similar” and “dissimilar” regions), limited interpretability, and potential brittleness when encountering distribution shifts between training and deployment environments.

Spatial Context-Aware Methods explicitly model spatial relationships and organizational patterns for similarity assessment. Jin et al. [35] proposed the Context-Aware Region similarity learning (CARE) framework that leverages spatial normalization techniques to measure regional significance within surrounding neighborhoods, enabling zero-shot inference for region similarity based on specific application requirements. This approach addresses the challenge of regions with different point-of-interest distributions sharing similar purposes across applications. Abbasi et al. [36] developed a geospatial semantic similarity measure combining BERT architecture with Moran’s I to improve the correlation between semantic similarity and geographical distance in natural language processing applications. These context-aware methods represent significant advances in capturing spatial dependencies, though they may require careful tuning of neighborhood definitions and spatial weight parameters.

3. Relevant Definition

3.1. Regional Feature Matrices

Regional feature matrices represent a two-dimensional array formulation derived from the spatial discretization of continuous geographic information. This representation method divides the study area into regular grid cells, with each cell storing location-specific geographic attribute information, such as elevation, population density, land use type, or vegetation coverage.

Formally, given a geographic region, we can partition it into a grid matrix $M$ of size $p \times q$. Each element $m_{ij}$ in this matrix (where $1 \leq i \leq p$ and $1 \leq j \leq q$) denotes the feature value of the grid cell located at the $i$-th row and $j$-th column. This value quantifies the geographic attribute characteristics at that spatial position. The dimensions $p$ and $q$ of matrix $M$ correspond to the granularity of spatial division in the two directional axes of the study area, while the scale of division depends on the specific application scenario and data precision requirements.

Regional feature matrices provide a unified data structure that enables quantitative comparison and mathematical operations on spatial characteristics across different geographic regions, establishing a foundation for subsequent similarity calculations and rotation invariance analysis.

3.2. Coordinate Transformation

In ring vector-based regional similarity measurement, we need to establish a mapping relationship between the matrix index coordinate system and the standard Cartesian coordinate system to more effectively define and manipulate the ring structure.

Consider an odd-order $N \times N$ matrix $M$ with row and column indices starting from 0. The center point $C$ of the matrix is located at the row-column coordinates $(\lfloor N / 2 \rfloor, \lfloor N / 2 \rfloor)$. To facilitate the definition and analysis of ring layers, we establish a Cartesian coordinate system with the center point $C$ as the origin, defined as follows:

X-axis direction: Consistent with the column direction of the matrix, with positive direction to the right
Y-axis direction: Opposite to the row direction of the matrix, with positive direction upward (contrary to the traditional rule where matrix row indices increase from top to bottom)

Based on the above definition, there exists a one-to-one conversion relationship between the row-column index coordinates of any element in the matrix and its Cartesian coordinates. For any element $M(i, j)$ in the matrix, where $0 \leq i, j < N$, its coordinates $(x, y)$ in the Cartesian coordinate system with the center point $C$ as the origin are given by Equation (1):

$$\begin{cases} x = j - \lfloor N / 2 \rfloor \\ y = \lfloor N / 2 \rfloor - i \end{cases}, \tag{1}$$

where:

x represents the horizontal offset of the element relative to the center point C (positive to the right, negative to the left);
y represents the vertical offset of the element relative to the center point C (positive upward, negative downward).

Conversely, for any point $(x, y)$ in the Cartesian coordinate system that satisfies the condition $-\lfloor N / 2 \rfloor \leq x, y \leq \lfloor N / 2 \rfloor$, the corresponding matrix row-column indices $(i, j)$ can be calculated using Equation (2):

$$\begin{cases} i = \lfloor N / 2 \rfloor - y \\ j = x + \lfloor N / 2 \rfloor \end{cases}. \tag{2}$$

This coordinate transformation mechanism provides us with the ability to flexibly switch between matrix indices and the Cartesian coordinate system, making ring-based feature extraction and rotation invariance analysis more intuitive and convenient. In particular, using the Cartesian coordinate system allows for a more natural definition of the ring boundary condition $\max(|x|, |y|) = r$, thereby constructing a rotation-invariant ring vector representation.
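Equations (1) and (2) can be sketched as a pair of helper functions. This is a minimal illustration rather than the authors' implementation; the function names `index_to_cartesian` and `cartesian_to_index` are hypothetical, and 0-based indices on an odd-order matrix are assumed, as in the text.

```python
def index_to_cartesian(i: int, j: int, n: int) -> tuple[int, int]:
    """Equation (1): (row, column) -> (x, y) with the matrix center as origin."""
    half = n // 2  # floor(N / 2)
    return j - half, half - i


def cartesian_to_index(x: int, y: int, n: int) -> tuple[int, int]:
    """Equation (2): (x, y) -> (row, column)."""
    half = n // 2
    return half - y, x + half


n = 5
print(index_to_cartesian(0, 0, n))  # top-left cell -> (-2, 2)
print(cartesian_to_index(0, 0, n))  # origin -> center cell (2, 2)
```

The two functions are inverses of each other, so converting an index to Cartesian coordinates and back returns the original index.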

3.3. Ring Vector

The ring vector represents a spatial feature representation method that radiates outward from the matrix center. By organizing matrix elements into concentric rings, this approach provides a rotation-invariant feature extraction mechanism.

Given an odd-order $N \times N$ matrix $M$, where $N = 2k + 1$ and $k \in \mathbb{N}^{+}$, we define the center point $C$ of the matrix at coordinates $(\lfloor N / 2 \rfloor, \lfloor N / 2 \rfloor)$. Taking this center point as the origin of a Cartesian coordinate system, each element in the matrix can be represented by coordinates $(x, y)$, where $x, y \in \{-\lfloor N / 2 \rfloor, \dots, 0, \dots, \lfloor N / 2 \rfloor\}$.

The ring vector consists of $\lfloor N / 2 \rfloor$ concentric square rings, each organizing matrix elements into vectors according to specific rules. The $r$-th ring $(r = 1, 2, \dots, \lfloor N / 2 \rfloor)$ contains all matrix elements that satisfy Equation (3):

$$\max(|x|, |y|) = r, \tag{3}$$

where $(x, y)$ represents the relative coordinates of the element with respect to the center point $C$.

Considering the center point as a one-dimensional vector $V_{0}$, each concentric ring $r$ surrounding the center point contains $8r$ elements. These elements are organized into a vector $V_{r}$ following a predefined traversal order (e.g., clockwise direction). The complete representation of the ring vector is expressed as the sequence $V = \{V_{0}, V_{1}, V_{2}, \dots, V_{\lfloor N / 2 \rfloor}\}$, where each $V_{r}$ captures the spatial feature distribution of the matrix at a distance of $r$ units from the center.

This ring-based representation method provides a structured approach to describe the spatial features of a matrix, establishing a foundation for implementing rotation-invariant similarity measures in subsequent analyses.
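The ring membership rule of Equation (3) can be checked with a short sketch. The element count of $8r$ follows from subtracting the inner square from the outer one, $(2r+1)^2 - (2r-1)^2 = 8r$. The helper names `ring_cells` and `ring_values` are hypothetical, and the cells are listed in plain row-major order for illustration only, not in the clockwise traversal order that the method uses later.

```python
import numpy as np


def ring_cells(n: int, r: int) -> list[tuple[int, int]]:
    """Cartesian (x, y) cells of an n x n grid with max(|x|, |y|) == r.

    Cells are listed row by row (top to bottom, left to right); this is NOT
    the anchor-based clockwise traversal of the method.
    """
    half = n // 2
    return [
        (x, y)
        for y in range(half, -half - 1, -1)  # top row has the largest y
        for x in range(-half, half + 1)
        if max(abs(x), abs(y)) == r
    ]


def ring_values(m: np.ndarray, r: int) -> np.ndarray:
    """Values of ring r, using Equation (2) to map (x, y) back to (row, col)."""
    half = m.shape[0] // 2
    return np.array([m[half - y, x + half] for x, y in ring_cells(m.shape[0], r)])


m = np.arange(25).reshape(5, 5)
for r in range(3):
    print(r, len(ring_cells(5, r)), ring_values(m, r))
```

For the 5 x 5 example, ring 0 holds only the center value, ring 1 holds 8 values, and ring 2 holds 16 values.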

4. Methodology

This section provides a detailed description of the regional similarity measurement method based on ring vector representation. The proposed approach utilizes ring vectors as its foundation to establish a framework for regional feature representation and similarity computation with rotational invariance. This framework effectively captures the intrinsic similarity of regional features under rotational transformations.

As illustrated in Figure 2, the overall framework of this methodology is structured into three core stages:

Figure 2. Overall Framework of the Ring Vector-based Similarity Method.

Data Preprocessing. This stage processes multi-source input data, such as DEM or POI datasets. It employs rasterization to convert unstructured data into standard feature matrices and performs matrix order adjustment to ensure a unique center point, establishing a foundation for subsequent analysis.
Ring Vector Feature Extraction. This stage, central to our method, begins with “Directional Anchor Localization” (i.e., locating the maximum value point) to determine the region’s dominant direction. Subsequently, a dynamic “Starting Point” is calculated for each ring layer, from which a “Raw Ring Vector” is extracted via traversal. Finally, these vectors undergo “Standardization” and “Data Cleaning” to eliminate scale effects and remove invalid padding values.
Similarity Computation. This phase first generates “Bidirectional Ring Vectors” (clockwise and counter-clockwise) to enhance robustness against rotational transformations. A “Distance Calculation” (e.g., Euclidean distance) is then applied to quantify the dissimilarities between corresponding ring layers. These individual distances are ultimately aggregated into a “Weighted Comprehensive Distance”, which serves as the final metric for regional similarity.

4.1. Data Preprocessing

4.1.1. Rasterization

The ring vector construction algorithm requires structured raster data as input, represented as a regular grid matrix $M \in \mathbb{R}^{N \times N}$. However, a large number of spatial datasets are inherently unstructured, including discrete point data (e.g., POIs, GPS trajectories, sensor networks) and linear features (e.g., road networks, river systems). To adapt our methodology to these unstructured data types, a critical rasterization preprocessing step is required to convert them into structured feature matrices. Specifically, for discrete point data, common point rasterization methods such as the Density Method (e.g., point count density), Kernel Density Estimation (KDE), or even the Assignment Method (for attributing single-point values to grid cells) can be employed. For linear features, core line rasterization techniques like the Coverage Raster Method (to identify line presence within grids), Length-Weighted Method (to quantify proportional line length in each cell), or Distance Raster Method (to characterize proximity to linear features) are applicable to quantify feature relevance within each grid cell. Our urban functional area experiment (Section 5.3.2) serves as a concrete example to illustrate the implementation details of this rasterization preprocessing step.
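As an illustration of the simplest option listed above, the point count density method, the following sketch bins point coordinates into an $N \times N$ grid with NumPy. It is one possible realization under stated assumptions, not the procedure used in the experiments: the function name `point_count_raster`, the `bounds` tuple layout, and the convention that row 0 is the northern edge are all choices made here for the example.

```python
import numpy as np


def point_count_raster(xs, ys, bounds, n):
    """Count points per cell on an n x n grid.

    bounds = (xmin, xmax, ymin, ymax) is an assumed layout for this sketch.
    Row 0 of the result is the top (largest y) edge of the region.
    """
    xmin, xmax, ymin, ymax = bounds
    # Passing ys first makes rows follow y and columns follow x.
    counts, _, _ = np.histogram2d(
        ys, xs, bins=n, range=[[ymin, ymax], [xmin, xmax]]
    )
    # histogram2d places the smallest y in row 0; flip so that north is up.
    return np.flipud(counts)


# Three hypothetical points in a 3 x 3 unit-cell region.
m = point_count_raster([0.5, 0.5, 2.5], [0.5, 0.5, 2.5], (0, 3, 0, 3), 3)
print(m)
```

Here the two points near the lower-left corner fall into the bottom-left cell and the remaining point into the top-right cell.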

4.1.2. Matrix Order Adjustment

After obtaining the feature matrix, the proposed algorithm requires the matrix to be of odd order to ensure the uniqueness of the center point. For an input matrix $M \in \mathbb{R}^{N \times N}$, the following order verification and adjustment are performed:

If the matrix order $N$ is odd, the original matrix is used directly: $M_{adj} = M$, with order $N' = N$.
If the matrix order $N$ is even, an additional row and column are inserted at the center position (i.e., at the intersection of the $N / 2$-th row and column), with each fill value set to $\min(M) - 1$, forming an adjusted matrix $M_{adj} \in \mathbb{R}^{(N + 1) \times (N + 1)}$ with order $N' = N + 1$.

The padding value is set to $\min(M) - 1$, an extreme outlier, which ensures it will never be selected as the directional anchor (Section 4.2.1). Crucially, this padding value is explicitly identified and removed during the “Data Cleaning” step (Section 4.2.4) before any distance calculations are performed. Therefore, the padding acts purely as a structural placeholder to establish a unique matrix center and does not contribute to the final feature vector’s magnitude or pattern.
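The order adjustment can be sketched with `numpy.insert`. This is a hedged illustration: the function name `adjust_order` is hypothetical, and it assumes that "the $N/2$-th row and column" refers to 0-based index $N/2$, so that the inserted row and column pass through the center of the adjusted matrix.

```python
import numpy as np


def adjust_order(m: np.ndarray) -> np.ndarray:
    """Return an odd-order matrix, padding an even-order one at the center."""
    n = m.shape[0]
    if n % 2 == 1:
        return m.copy()  # already odd: use the matrix as is
    pad = m.min() - 1  # strictly below every real value
    # Assumption: insert at 0-based index n // 2, which becomes the center
    # row/column of the (n + 1) x (n + 1) result.
    out = np.insert(m, n // 2, pad, axis=0)
    return np.insert(out, n // 2, pad, axis=1)


print(adjust_order(np.array([[1, 2], [3, 4]])))
```

For the 2 x 2 example the padding value is 0, and the result is a 3 x 3 matrix whose middle row and middle column consist entirely of padding.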

4.2. Ring Vector Feature Extraction

In the Ring Vector Feature Extraction stage, the process begins by locating the global maximum value within the matrix, excluding the center point, and defining it as the directional anchor $P$. Then, using the vector from the center point $C$ to this directional anchor $P$ as the directional reference, a dynamic traversal starting point $S_{r}$ is calculated for each concentric ring layer $r$. Subsequently, a clockwise traversal is performed from each ring’s starting point $S_{r}$, collecting the element values along the path to construct the raw ring vector $V_{r}^{raw}$ for that layer. Finally, all raw ring vectors undergo standardization and data cleaning, yielding the final set of ring vectors $V$ for the feature matrix. The detailed steps of this procedure are elaborated in the following subsections.

4.2.1. Directional Anchor Localization

To capture significant features within the matrix, the directional anchor of the adjusted matrix $M_{adj}$ is defined as the global maximum value point $P$, excluding the center point $C$. The center point $C$ has row and column coordinates $(i_{c}, j_{c}) = (\lfloor N' / 2 \rfloor, \lfloor N' / 2 \rfloor)$. The selection is based on the hypothesis that the maximum value point $P$ in the matrix characterizes the most significant feature of the region and can serve as an indicator of regional directionality, providing a directional reference for ring vector extraction.

The row and column coordinates $(i_{P}, j_{P})$ of the maximum value point $P$ are transformed into Cartesian coordinates $(x_{P}, y_{P})$ with the center point $C$ as the origin according to Equation (1). This transformation establishes a unified spatial reference framework, making the analysis of regional directionality more intuitive.
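A minimal sketch of anchor localization follows. The function name `directional_anchor` is hypothetical. The text above does not state how ties between several equal maxima are resolved, so this sketch simply takes the first maximum in row-major order, which is an assumption and may not preserve rotation invariance when the maximum is not unique.

```python
import numpy as np


def directional_anchor(m_adj: np.ndarray):
    """Locate the maximum value outside the center of an odd-order matrix.

    Returns ((i_P, j_P), (x_P, y_P)): matrix indices and Cartesian coordinates.
    """
    c = m_adj.shape[0] // 2  # center index, floor(N' / 2)
    masked = m_adj.astype(float)  # astype returns a copy; input is untouched
    masked[c, c] = -np.inf  # exclude the center point C
    # Assumption: ties are broken by row-major order (first argmax hit).
    i_p, j_p = np.unravel_index(np.argmax(masked), masked.shape)
    i_p, j_p = int(i_p), int(j_p)
    return (i_p, j_p), (j_p - c, c - i_p)  # Equation (1)


m = np.zeros((5, 5))
m[2, 2] = 100.0  # large center value, ignored by construction
m[0, 3] = 9.0
print(directional_anchor(m))  # ((0, 3), (1, 2))
```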

4.2.2. Ring Vector Starting Point Calculation

This part introduces an adaptive mechanism for selecting ring vector starting points that achieves rotation invariance in feature representation by effectively capturing the directional characteristics of spatial structures. The main idea of this algorithm is that, for each ring layer, the starting point selection follows the dominant direction established from the center point C to the maximum value point P, thereby maintaining consistent relative directional relationships under rotational transformations.

In a Cartesian coordinate system with the center point $C$ as the origin, the coordinates $(x_{P}, y_{P})$ of the maximum value point $P$ contain critical directional information. Based on the relative magnitudes and signs of these coordinate components, we establish a precise directional classification framework that maps all possible spatial orientations into four dominant cases:

Case 1 (Y-positive dominance): $y_{P} > 0$ and $| y_{P} | \geq | x_{P} |$ , indicating that the maximum value point resides in the upper half-plane with the vertical upward component being dominant.
Case 2 (X-negative dominance): $x_{P} < 0$ and $| x_{P} | > | y_{P} |$ , indicating that the maximum value point resides in the left half-plane with the horizontal leftward component being dominant.
Case 3 (Y-negative dominance): $y_{P} < 0$ and $| y_{P} | \geq | x_{P} |$ , indicating that the maximum value point resides in the lower half-plane with the vertical downward component being dominant.
Case 4 (X-positive dominance): $x_{P} > 0$ and $| x_{P} | > | y_{P} |$ , indicating that the maximum value point resides in the right half-plane with the horizontal rightward component being dominant.

Following the determination of directional dominance, the algorithm computes the Cartesian coordinates

(x_{r}, y_{r})

of the starting point

S_{r}

for each concentric layer r (where

r = 1, 2, \dots, ⌊ N^{'} / 2 ⌋

). This calculation preserves the directional consistency between the starting points of various ring layers and the maximum value point P, while adapting to the spatial scope of different ring layers:

Case 1: $x_{r} = \operatorname{round} (\frac{x_{P}}{y_{P}} \cdot r), y_{r} = r$ .
Case 2: $x_{r} = - r, y_{r} = \operatorname{round} (\frac{y_{P}}{| x_{P} |} \cdot r)$ .
Case 3: $x_{r} = \operatorname{round} (\frac{x_{P}}{| y_{P} |} \cdot r), y_{r} = - r$ .
Case 4: $x_{r} = r, y_{r} = \operatorname{round} (\frac{y_{P}}{x_{P}} \cdot r)$ .

Here,

\operatorname{round} (\cdot)

denotes the operation of rounding to the nearest integer.
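The four cases can be sketched in Python as follows. This is a minimal sketch rather than the authors' implementation: the function names are hypothetical, and because the text does not say how ties such as 0.5 are rounded, the sketch assumes rounding half away from zero. That rule is symmetric, so rounding -t gives the negative of rounding t.

```python
import math

def round_half_away(t):
    """Round to the nearest integer, ties away from zero (an assumed tie rule)."""
    return int(math.copysign(math.floor(abs(t) + 0.5), t))

def ring_start(x_p, y_p, r):
    """Starting point S_r = (x_r, y_r) on ring r for anchor P = (x_p, y_p) != (0, 0)."""
    if y_p > 0 and abs(y_p) >= abs(x_p):        # Case 1: Y-positive dominance
        return round_half_away(x_p / y_p * r), r
    if x_p < 0 and abs(x_p) > abs(y_p):         # Case 2: X-negative dominance
        return -r, round_half_away(y_p / abs(x_p) * r)
    if y_p < 0 and abs(y_p) >= abs(x_p):        # Case 3: Y-negative dominance
        return round_half_away(x_p / abs(y_p) * r), -r
    if x_p > 0 and abs(x_p) > abs(y_p):         # Case 4: X-positive dominance
        return r, round_half_away(y_p / x_p * r)
    raise ValueError("anchor must differ from the center point")
```

For example, an anchor at (-3, 1) falls under Case 2, and `ring_start(-3, 1, 3)` returns the anchor itself, since P lies on the third ring.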

Figure 3a illustrates this process. The starting points for each layer (e.g.,

S_{1}

,

S_{2}

,

S_{3}

) are all calculated to lie on the directional vector from the center C to the anchor point P. In this specific visual example, the anchor point P happens to be located on the second ring, which means the starting point

S_{2}

and the anchor point P are the same point.

Figure 3. Schematic Diagram of Key Steps in Ring Vector Generation. (a) Starting Point Calculation; lines of different colors represent different layers. (b) Raw Ring Vector Extraction. (c) The Variation Trend of Feature Values.

This direction-adaptive computational mechanism ensures that across different ring layers, the starting points maintain a consistent spatial directional relationship with the maximum value point. When the matrix undergoes rotational transformation, although the absolute position of the maximum value point changes, the ring layer starting points calculated using the aforementioned method maintain their invariant relative positional relationship with the maximum value point, thereby achieving invariance to rotational transformations during the feature extraction process.

4.2.3. Raw Ring Vector Extraction

Starting from the initial point

(x_{r}, y_{r})

of each ring layer r, we traverse the entire layer in a clockwise direction, and collect matrix element values along the traversal path to form the raw ring vector

V_{r}^{r a w} = {v_{1}^{(r)}, v_{2}^{(r)}, \dots, v_{8 r}^{(r)}}

, as illustrated in Figure 3b, where

v_{j}^{(r)}

represents the value of the j-th element in the r-th ring layer. Each ring layer r contains

8 r

elements. During the traversal process, we employ coordinate transformation rules to map points

(x, y)

in the Cartesian coordinate system back to matrix row and column indices

(i, j)

, thereby extracting the corresponding matrix element values. After processing

⌊ N^{'} / 2 ⌋

ring layers, we obtain

⌊ N^{'} / 2 ⌋

ring vectors, which together with the one-dimensional vector formed by the center point, constitute the raw ring vector set

V^{r a w} = {V_{0}^{r a w}, V_{1}^{r a w}, V_{2}^{r a w}, \dots, V_{⌊ N^{'} / 2 ⌋}^{r a w}}

. During traversal starting from the starting point

S_{r}

, the collected feature values follow a distinct trend. They decrease gradually at the beginning, but start to increase again as the traversal nears point

S_{r}

, as shown in Figure 3c.
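A minimal Python sketch of the clockwise traversal is given below. It assumes an odd-order matrix, a y axis pointing up, and the index mapping (i, j) = (i_c - y, j_c + x); these conventions and the function names are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def ring_path(r):
    """Clockwise perimeter of the square ring at layer r (8r points, y axis up)."""
    top = [(x, r) for x in range(-r, r)]            # left to right along the top
    right = [(r, y) for y in range(r, -r, -1)]      # top to bottom along the right
    bottom = [(x, -r) for x in range(r, -r, -1)]    # right to left along the bottom
    left = [(-r, y) for y in range(-r, r)]          # bottom to top along the left
    return top + right + bottom + left

def raw_ring_vector(m, r, start):
    """Collect the 8r values of ring r, beginning at start = (x_r, y_r)."""
    c = m.shape[0] // 2
    path = ring_path(r)
    k = path.index(start)                           # rotate the path to begin at S_r
    ordered = path[k:] + path[:k]
    return [m[c - y, c + x] for x, y in ordered]    # Cartesian point -> matrix index
```

Each edge list contributes 2r points and omits its final corner, which the next edge supplies, so every ring position is visited exactly once.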

4.2.4. Vectors Standardization and Data Cleaning

To eliminate the influence of scale differences among feature values on similarity measurements, the raw ring vectors must undergo standardization, followed by removal of the invalid data introduced during matrix standardization. First, each raw ring vector

V_{r}^{r a w}

is subjected to a translation transformation using Equation (4):

V_{r}^{t r a n s} = {v_{j}^{(r)} - min (M)}_{j = 1}^{8 r} .

(4)

This operation normalizes the data baseline to zero by subtracting the minimum value of the feature matrix

(min (M))

, thereby eliminating differences in absolute numerical scale.

Following standardization, it is necessary to remove padding values inserted during the matrix standardization phase, as their presence would interfere with similarity calculations. These padding values

(min (M) - 1)

consistently transform to

- 1

after standardization. The cleaning operation is defined by Equation (5):

V_{r} = {v | v \in V_{r}^{t r a n s}, v \neq - 1} .

(5)

The cleaned ring vector

V_{r}

retains only the valid features from the original matrix. Notably, center points of even-order matrices (standardization padding values) are eliminated in this step, ultimately generating a refined set of ring vectors:

V = \begin{cases} \{V_{0}, V_{1}, V_{2}, \dots, V_{⌊ N^{'} / 2 ⌋}\}, & N \text{ is odd} \\ \{V_{1}, V_{2}, \dots, V_{⌊ N^{'} / 2 ⌋}\}, & N \text{ is even} \end{cases}

(6)
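The translation and cleaning steps can be sketched as below, assuming the baseline subtracted is min(M), as the text following Equation (4) states, and that padding entries carry the value min(M) - 1. The function name and the use of a floating-point tolerance when testing for -1 are assumptions of this sketch.

```python
import numpy as np

def standardize_and_clean(raw_ring, m_min):
    """Shift a raw ring vector by min(M), then drop padding entries (Eq. (5))."""
    shifted = np.asarray(raw_ring, dtype=float) - m_min   # valid values become >= 0
    is_padding = np.isclose(shifted, -1.0)                # padding min(M) - 1 maps to -1
    return shifted[~is_padding]
```

Because every valid value is at least zero after the shift, only padding entries can land at -1, which is what makes the filter in Equation (5) safe.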

Therefore, for any square matrix of order

N (N \geq 2)

, the cardinality of the final ring vector set

V

is

| V | = ⌊ \frac{(N + 1)}{2} ⌋

. The ring vector set can be uniformly expressed as:

V = {V_{1}, V_{2}, \dots, V_{⌊ \frac{(N + 1)}{2} ⌋}} .

(7)

The cardinality (length) of each ring vector is defined as:

| V_{r} | = \begin{cases} 1, & N \text{ is odd}, r = 1 \\ 8 (r - 1), & N \text{ is odd}, r > 1 \\ 4 (2 r - 1), & N \text{ is even} \end{cases}

(8)
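One way to sanity-check Equation (8) is to confirm that the ring lengths add up to the N^2 elements of the original matrix, since cleaning removes only padding. The short Python sketch below does this; the function names are hypothetical.

```python
def ring_length(r, n):
    """|V_r| from Eq. (8), with r = 1, ..., floor((n + 1) / 2) as in Eq. (7)."""
    if n % 2 == 1:
        return 1 if r == 1 else 8 * (r - 1)
    return 4 * (2 * r - 1)

def total_elements(n):
    """Sum of all ring lengths for a matrix of order n."""
    return sum(ring_length(r, n) for r in range(1, (n + 1) // 2 + 1))
```

For N = 5 the lengths are 1, 8, and 16, and for N = 4 they are 4 and 12; in both cases the total equals N^2.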

Based on the construction process described above, the ring vector representation possesses the following key characteristics:

Directional Consistency: Through maximum value point localization and region classification, we ensure that even when the matrix is rotated, the starting point of the ring vector maintains a relatively consistent spatial direction.
Structural Preservation: The ring layer structure preserves the spatial adjacency relationships among elements in the matrix, enabling the ring vector to effectively capture the spatial structural features of the region.
Scale Invariance: Vector normalization processing eliminates the influence of numerical scale, making the ring vector robust to intensity variations in regional features.

These characteristics collectively ensure that the ring vector remains relatively stable under matrix rotational transformations, providing a solid foundation for implementing rotation-invariant regional similarity metrics. The ring vector not only captures the spatial distribution features of a region but also, through its special construction mechanism, achieves adaptability to regional rotational transformations, enabling similarity calculations based on this approach to identify essentially similar regions under different rotational states. The pseudo-code for ring vector generation is shown in Algorithm 1.

The time complexity of the Ring Vector Generation Algorithm can be analyzed through its two primary phases.

In the standardization phase, the algorithm first checks whether the input

N \times N

matrix has odd or even dimensions, which constitutes an

O (1)

operation. For matrices with even dimensions, the algorithm creates a new

(N + 1) \times (N + 1)

matrix and copies the original data, requiring

O (N^{2})

operations. Subsequently, the algorithm identifies the maximum value point by traversing all

N^{2} - 1

elements (excluding the center point), yielding a time complexity of

O (N^{2})

. The conversion of matrix indices to Cartesian coordinates is performed in constant time,

O (1)

.

During the ring vector generation phase, the algorithm processes

⌊ N^{'} / 2 ⌋

concentric rings, where

N^{'}

represents the adjusted matrix dimension (approximately equal to N). For each ring r, the algorithm processes

8 r

elements, with each element requiring constant time operations for coordinate transformation and value extraction. Consequently, the total processing time for all rings can be expressed as Equation (9),

\begin{aligned} T_{r i n g} &= \sum_{r = 1}^{⌊ N^{'} / 2 ⌋} 8 r = 8 \times \sum_{r = 1}^{⌊ N^{'} / 2 ⌋} r \\ &= 4 ⌊ N^{'} / 2 ⌋ (⌊ N^{'} / 2 ⌋ + 1) . \end{aligned}

(9)

When

N^{'} \approx N

,

T_{r i n g}

can be simplified to approximately

N^{2} + 2 N

, which is

O (N^{2})

. Combining both phases, the overall time complexity of Algorithm 1 is

O (N^{2})

.

Algorithm 1: Ring Vectors Extraction

4.3. Similarity Computation

Given two regional feature matrices of identical dimensions,

M_{A}, M_{B} \in R^{N \times N}

, this section presents a similarity measurement method based on bidirectional ring vectors to quantify the degree of similarity between two regional feature distributions.

For the two input regional feature matrices

M_{A}

and

M_{B}

, we first apply Algorithm 1 to generate their corresponding sets of clockwise traversal ring vectors:

The clockwise traversal ring vector set for $M_{A}$ is defined as $V_{A}^{c w} = {V_{1}^{A}, V_{2}^{A}, \dots, V_{⌊ \frac{(N + 1)}{2} ⌋}^{A}}$
The clockwise traversal ring vector set for $M_{B}$ is defined as $V_{B}^{c w} = {V_{1}^{B}, V_{2}^{B}, \dots, V_{⌊ \frac{(N + 1)}{2} ⌋}^{B}}$

To further strengthen the algorithm’s rotational invariance, we derive the counterclockwise traversal ring vector set for

M_{B}

from

V_{B}^{c w}

as

V_{B}^{c c w} = {R (V_{1}^{B}), R (V_{2}^{B}), \dots, R (V_{⌊ \frac{(N + 1)}{2} ⌋}^{B})}

, where

R (\cdot)

denotes reversing the order of the vector elements while keeping the first element in place, as defined in Equation (10):

R (V) = {v_{1}, v_{| V |}, v_{| V | - 1}, \dots, v_{3}, v_{2}} .

(10)

The distances in both traversal directions are calculated as shown in Equations (11) and (12):

d_{r}^{c w} = \operatorname{DistanceMetric} (V_{r}^{A}, V_{r}^{B}),

(11)

d_{r}^{c c w} = \operatorname{DistanceMetric} (V_{r}^{A}, R (V_{r}^{B})),

(12)

where

\operatorname{DistanceMetric} (\cdot, \cdot)

can be selected based on specific data characteristics, such as Euclidean distance, Earth Mover’s Distance (EMD), or other appropriate metrics. The term

d_{r}^{c w}

represents the clockwise directional distance between ring vectors of layer r from both matrices, while

d_{r}^{c c w}

represents the counterclockwise directional distance.

Based on the distances at each ring layer, we calculate the weighted comprehensive distance for both traversal directions using Equations (13) and (14):

D_{c w} = \sum_{r = 1}^{⌊ \frac{(N + 1)}{2} ⌋} w_{r} \cdot d_{r}^{c w},

(13)

D_{c c w} = \sum_{r = 1}^{⌊ \frac{(N + 1)}{2} ⌋} w_{r} \cdot d_{r}^{c c w},

(14)

where

D_{c w}

represents the comprehensive clockwise directional distance between

M_{A}

and

M_{B}

ring vectors, and

D_{c c w}

represents the comprehensive counterclockwise distance. The coefficient

w_{r}

is the weight assigned to ring layer r, satisfying the normalization condition

\sum_{r = 1}^{⌊ \frac{(N + 1)}{2} ⌋} w_{r} = 1

.

The final regional distance metric D is defined as the minimum value between the clockwise comprehensive distance and the counterclockwise comprehensive distance, as shown in Equation (15):

D = min (D_{c w}, D_{c c w}) .

(15)

This bidirectional ring vector generation strategy enables the algorithm to simultaneously consider feature distribution patterns in both clockwise and counterclockwise directions across the region, effectively enhancing recognition robustness against mirror symmetry transformations and reverse rotation patterns. The complete algorithm procedure is presented in Algorithm 2.
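Equations (10) to (15) can be sketched in Python as follows. The sketch assumes Euclidean distance as the metric and equal weights by default (the choices used later in the experiments), and it assumes the two ring vector sets have matching lengths layer by layer. The function names are hypothetical.

```python
import numpy as np

def reverse_keep_first(v):
    """R(V) from Eq. (10): keep v_1, reverse the remaining elements."""
    v = np.asarray(v, dtype=float)
    return np.concatenate([v[:1], v[1:][::-1]])

def region_distance(rings_a, rings_b, weights=None):
    """D = min(D_cw, D_ccw) over paired ring vectors (Eqs. (11)-(15))."""
    n_rings = len(rings_a)
    if weights is None:
        weights = np.full(n_rings, 1.0 / n_rings)   # equal weights summing to 1
    d_cw = d_ccw = 0.0
    for w, va, vb in zip(weights, rings_a, rings_b):
        va = np.asarray(va, dtype=float)
        vb = np.asarray(vb, dtype=float)
        d_cw += w * np.linalg.norm(va - vb)                        # Eq. (11)
        d_ccw += w * np.linalg.norm(va - reverse_keep_first(vb))   # Eq. (12)
    return min(d_cw, d_ccw)
```

If region B is the mirror image of region A along the anchor direction, the clockwise distance is positive but the counterclockwise distance is zero, so D is zero.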

Algorithm 2: Area Similarity Measurement

For two regional feature matrices

M_{A}

and

M_{B}

to be compared, our algorithm applies Algorithm 1 to generate clockwise ring vectors with a time complexity of

O (N^{2})

. When generating counterclockwise ring vectors, we perform an element reversal operation on the clockwise vectors according to Equation (10). This process visits every element of the matrix once, resulting in the same time complexity of

O (N^{2})

.

In the distance calculation phase, the algorithm’s performance is influenced by the chosen distance metric method. Assuming that the computational complexity of a single element comparison operation is

O (1)

, the overall complexity can be analyzed as follows:

Single-layer Distance Calculation: If the selected distance metric method has a computational complexity $O (L \cdot c_{o p})$ for vectors of length L, where $c_{o p}$ represents the additional operational complexity for each element comparison, then the distance calculation complexity for layer r is at most $O (8 r \cdot c_{o p})$ . This is because the ring vector at layer r contains a maximum of $8 r$ elements (each layer forms a ring path with a “perimeter” of $8 r$ ).
Total Distance Calculation Complexity: By summing the computational complexity across all layers:

\begin{aligned} T_{distance} &= \sum_{r = 1}^{⌊ \frac{N + 1}{2} ⌋} O (8 r \cdot c_{o p}) \\ &= O (8 c_{o p} \cdot \sum_{r = 1}^{⌊ \frac{N + 1}{2} ⌋} r) \\ &= O (8 c_{o p} \cdot \frac{⌊ \frac{N + 1}{2} ⌋ \cdot (⌊ \frac{N + 1}{2} ⌋ + 1)}{2}) \\ &\approx O (8 c_{o p} \cdot \frac{N^{2}}{8}) \\ &= O (c_{o p} \cdot N^{2}) . \end{aligned}

(16)

The algorithm calculates distances in both clockwise and counterclockwise directions, essentially executing the distance calculation process twice. Therefore, the overall time complexity of Algorithm 2 is

T_{a l g 2} = O (N^{2}) + 2 \times O (c_{o p} \cdot N^{2}) = O (c_{o p} \cdot N^{2})

.

In the field of regional distribution pattern similarity search, traditional methods typically employ sliding window techniques to enumerate all possible sub-regions within a given area. These approaches calculate the similarity between each subregion and the target pattern, ultimately identifying the K regions with the highest similarity scores. However, this exhaustive search strategy incurs substantial computational costs when processing large-scale regional data, making it inefficient for many practical applications.

To enhance search efficiency, our research introduces a pre-filtering mechanism based on ring layer features. This approach stems from a key observation about regional distribution patterns: the ring layer containing the maximum value typically reflects the core distribution characteristics of a region (such as terrain peaks or areas of high population density). This observation leads to an important insight: if two regions exhibit significant differences in their maximum value ring layers, their overall distribution patterns are unlikely to possess high similarity. By leveraging this property, we can quickly eliminate many dissimilar regions before performing more computationally intensive similarity calculations.

Let us examine the theoretical basis for this approach. Consider two highly similar regional feature matrices

M_{A}

and

M_{B}

, with their respective ring vector representations:

$V_{A} = \{V_{A}^{1}, V_{A}^{2}, \dots, V_{A}^{⌊ \frac{N + 1}{2} ⌋}\}$
$V_{B} = \{V_{B}^{1}, V_{B}^{2}, \dots, V_{B}^{⌊ \frac{N + 1}{2} ⌋}\}$

According to our definition of similar regions, the weighted comprehensive distance between

M_{A}

and

M_{B}

should approach zero. Consequently, the distance between their ring vectors at each layer

d (V_{A}^{r}, V_{B}^{r})

should also approach zero.

The ring layer

r_{m a x}

that contains the maximum value plays a particularly crucial role in characterizing regional features. Therefore:

d (V_{A}^{r_{m a x}}, V_{B}^{r_{m a x}}) \approx 0 .

(17)

This mathematical relationship indicates that the feature distributions in the ring layers containing the maximum values of both matrices should exhibit high similarity. Since maximum value points typically represent the most significant features in a region, these points should maintain consistent positions within similar regions.

The algorithm can be formally described through the following steps:

Input Definition: Given a target region feature matrix $M_{t a r} \in R^{N \times N}$ and a larger region matrix $M_{c a n d} \in R^{R \times C}$ to be searched $(R, C \geq N)$ .
Target Analysis: Extract the ring layer $L_{t a r}$ containing the maximum value point from the target matrix $M_{t a r}$ .
Candidate Generation: Employ a sliding window technique to traverse all possible $N \times N$ sub-matrices $M_{i j}$ in $M_{c a n d}$ $(0 \leq i \leq R - N, 0 \leq j \leq C - N)$ .
Feature Extraction: For each sub-matrix $M_{i j}$ , calculate the ring layer $L_{M_{i j}}$ containing its maximum value.
Pre-filtering: Apply the ring layer filtering condition: if $L_{t a r} = L_{M_{i j}}$ , add $M_{i j}$ to the candidate set $Ω$ .
Refined Similarity Calculation: For each region in the candidate set $Ω$ , apply Algorithm 2 to calculate its distance from the target region.
Result Ranking: Sort by distance and return the set $T$ containing the K regions with the highest similarity.
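The steps above can be sketched in Python as follows. This is an illustrative sketch under several assumptions: N is odd, the ring layer of a point is its Chebyshev distance from the window center, the center is excluded when locating the maximum (as in anchor localization), and `distance` is a hypothetical callable standing in for Algorithm 2.

```python
import heapq
import numpy as np

def max_layer(m):
    """Ring layer (Chebyshev distance from the center) of the maximum, center excluded."""
    c = m.shape[0] // 2
    masked = m.astype(float)                    # astype returns a copy
    masked[c, c] = -np.inf
    i, j = np.unravel_index(np.argmax(masked), masked.shape)
    return max(abs(int(i) - c), abs(int(j) - c))

def search_similar(m_tar, m_cand, k, distance):
    """Return the k (distance, (i, j)) pairs with the smallest distance."""
    n = m_tar.shape[0]
    l_tar = max_layer(m_tar)                    # target analysis
    scored = []
    for i in range(m_cand.shape[0] - n + 1):    # candidate generation
        for j in range(m_cand.shape[1] - n + 1):
            window = m_cand[i:i + n, j:j + n]
            if max_layer(window) == l_tar:      # pre-filtering
                scored.append((distance(m_tar, window), (i, j)))
    return heapq.nsmallest(k, scored)           # result ranking
```

Only windows that pass the layer test reach the `distance` call, which is where the savings come from when that call is expensive.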

The pseudocode for this algorithm is presented in Algorithm 3.

Algorithm 3: Similar Region Search

To determine the layer

L_{t a r}

containing the maximum value point in the target matrix

M_{t a r}

, we must traverse the entire

N \times N

matrix to identify the maximum value. This step has a time complexity of

O (N^{2})

. Determining the layer to which the maximum value belongs is an

O (1)

operation. Therefore, the total complexity of this preliminary step is

O (N^{2})

.

When using the sliding window approach to traverse all potential sub-matrices, there are

(R - N + 1) \times (C - N + 1)

sub-matrices to examine, where R and C represent the dimensions of the larger search space. For each sub-matrix, the algorithm performs the following operations:

Sub-matrix Extraction: Obtaining the candidate sub-matrix $M_{c a n d} [i : i + N, j : j + N]$ through index slicing, which is a pure indexing operation with a time complexity of $O (1)$ .
Maximum Value Point Localization: Finding the maximum value point within the candidate sub-matrix requires traversing all $N^{2}$ elements, resulting in a time complexity of $O (N^{2})$ .
Layer Assignment and Filtering: Determining the layer to which the maximum value point belongs and comparing it with the target layer $L_{t a r}$ . If they match, the sub-matrix is added to the candidate set $Ω$ . These operations require constant time, with a complexity of $O (1)$ .

Consequently, the complexity of a single iteration is

O (N^{2})

, and the total complexity of the sliding window traversal is

T_{s l i d i n g} = O ((R - N + 1) \times (C - N + 1) \times N^{2})

.

Considering the continuity of the sliding window in both horizontal and vertical directions, we can optimize the maximum value update process through incremental computation:

Horizontal Sliding Optimization: When the window moves from position $(i, j)$ to $(i, j + 1)$ , the new window removes the leftmost column (N elements) and adds a rightmost column (N elements). By maintaining information about the current window’s maximum value and its position, the maximum value can be incrementally updated in $O (N)$ time.
Vertical Sliding Optimization: Similarly, when the window moves from position $(i, j)$ to $(i + 1, j)$ , incremental updates can be performed by removing the top row and adding a bottom row.
Optimized Complexity: Through incremental computation, the complexity of a single window movement for maximum value updates is reduced from $O (N^{2})$ to $O (N)$ , resulting in an optimized total time complexity for the sliding window traversal as $T_{s l i d i n g}^{o p t} = O ((R - N + 1) \times (C - N + 1) \times N)$ .

This optimization delivers significant performance improvements in large-scale search scenarios.
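A minimal sketch of the horizontal incremental update is shown below. The text does not describe what happens when the previous maximum lies in the column that leaves the window; this sketch assumes a full rescan in that case, so the O(N) cost applies only to moves that keep the old maximum. For brevity it also does not exclude the window center. The function name is hypothetical.

```python
import numpy as np

def row_sweep_max(m_cand, i, n):
    """Yield (j, max_value, (row, col)) for windows m_cand[i:i+n, j:j+n] along row i."""
    cols = m_cand.shape[1]
    best = None                                   # (value, row, col) in m_cand coordinates
    for j in range(cols - n + 1):
        if best is None or best[2] < j:           # first window, or old maximum slid out
            win = m_cand[i:i + n, j:j + n]
            r, c = np.unravel_index(np.argmax(win), win.shape)
            best = (win[r, c], i + int(r), j + int(c))    # full rescan of the window
        else:                                     # scan only the new rightmost column
            col = m_cand[i:i + n, j + n - 1]
            r = int(np.argmax(col))
            if col[r] > best[0]:
                best = (col[r], i + r, j + n - 1)
        yield j, best[0], (best[1], best[2])
```

Whether the rescan branch is rare depends on the data; a structure such as a monotonic deque would be one way to bound the cost of every move, at the price of extra bookkeeping.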

Assuming the number of candidate regions retained through the pre-filtering mechanism is

| Ω |

, complete similarity calculations (invoking Algorithm 2) must be executed for each candidate region. The computational complexity for calculating the similarity of a single candidate region is

O (c_{o p} \cdot N^{2})

(as derived above for Algorithm 2). Inserting the calculated similarity results into an ordered result set can be accomplished in

O (log K)

time using appropriate data structures (such as heaps or balanced binary trees), where K represents the number of top-K results to be returned.

Therefore, the complexity for similarity calculation of a single candidate region is

O (c_{o p} \cdot N^{2}) + O (log K) = O (c_{o p} \cdot N^{2})

. Consequently, the total complexity for similarity calculations across all candidate regions is

T_{s i m i l a r i t y} = O (| Ω | \cdot c_{o p} \cdot N^{2})

.

Integrating the above analyses, the total time complexity of Algorithm 3 is

T_{t o t a l} = O (N^{2}) + O ((R - N + 1) \cdot (C - N + 1) \cdot N) + O (| Ω | \cdot c_{o p} \cdot N^{2})

, where the first term represents the preprocessing complexity for the target region, the second term represents the optimized sliding window traversal complexity for pre-filtering, and the third term represents the complexity for similarity calculations across candidate regions.

4.4. Elaboration on Rotational Invariance

The rotational invariance of the proposed similarity measure is ensured by the combination of two core mechanisms: the Dynamic Anchor Localization (Section 4.2.1) and the Bidirectional Matching Strategy (Section 4.3):

Role of the Dynamic Anchor (Primary Invariance): The Dynamic Anchor (the global maximum value point P) acts as an internal, data-driven “compass” for the matrix. Consider a matrix M and its version $M^{'}$ which has been rotated by 45°. In M, the algorithm identifies the max point $P_{A}$ and calculates all starting points $S_{r}$ relative to the vector $C \to P_{A}$ . This generates the ring vector set $V_{A}$ . In $M^{'}$ , the entire internal structure is rotated. The original max point $P_{A}$ is now at a new 45° position, $P_{B}$ . The algorithm, when run on $M^{'}$ , identifies $P_{B}$ as its anchor. It then calculates its starting points $S_{r}$ relative to the vector $C \to P_{B}$ . Because the relative structure is identical (just rotated), the vector $V_{B}$ generated from this process will be identical to the original vector $V_{A}$ . The dynamic anchor effectively “normalizes” the vector generation process, ensuring the final vectors are already aligned, regardless of the original matrix’s orientation.
Role of Bidirectional Matching (Safeguard): The Bidirectional Matching strategy is a safeguard against perfect 180° rotations or mirror-symmetric patterns. A 180° rotation is functionally equivalent to traversing the same ring in the reverse (counter-clockwise) direction. By calculating the distance for both the clockwise vector ( $V_{B}$ ) and its reverse ( $R (V_{B})$ ) and taking the minimum, the method robustly identifies 180° rotations as highly similar.
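The anchor mechanism can be checked numerically for the rotations that map a grid exactly onto itself, namely multiples of 90°. The sketch below is a compact, self-contained illustration rather than Algorithm 1 itself: it assumes an odd-order matrix with a unique maximum, a y axis pointing up, ties rounded away from zero, and it omits standardization. Other angles would require resampling the grid and are outside what this sketch shows.

```python
import math
import numpy as np

def ring_vectors(m):
    """Clockwise ring vectors of an odd-order matrix, anchored at its maximum."""
    c = m.shape[0] // 2
    masked = m.astype(float)
    masked[c, c] = -np.inf                               # anchor P excludes the center
    i, j = np.unravel_index(np.argmax(masked), masked.shape)
    xp, yp = int(j) - c, c - int(i)                      # assumed axes: x right, y up

    def rnd(t):                                          # ties away from zero (assumed)
        return int(math.copysign(math.floor(abs(t) + 0.5), t))

    rings = []
    for r in range(1, c + 1):
        if abs(yp) >= abs(xp):                           # Cases 1 and 3
            start = (rnd(xp / abs(yp) * r), r if yp > 0 else -r)
        else:                                            # Cases 2 and 4
            start = (r if xp > 0 else -r, rnd(yp / abs(xp) * r))
        path = ([(x, r) for x in range(-r, r)] +         # top edge, left to right
                [(r, y) for y in range(r, -r, -1)] +     # right edge, downward
                [(x, -r) for x in range(r, -r, -1)] +    # bottom edge, right to left
                [(-r, y) for y in range(-r, r)])         # left edge, upward
        k = path.index(start)
        rings.append([float(m[c - y, c + x]) for x, y in path[k:] + path[:k]])
    return rings

m = np.arange(25).reshape(5, 5)
base = ring_vectors(m)
# True when all four 90-degree rotations yield the same ring vectors
same = all(ring_vectors(np.rot90(m, k)) == base for k in range(4))
```

Here `np.rot90` rotates the matrix counterclockwise in 90° steps; the anchor moves with the data, so each traversal starts at the corresponding rotated point.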

5. Experimental Evaluation

In this section, we systematically evaluate the effectiveness of the proposed algorithm. We first detail the experimental configuration and justify our selection of validation datasets, followed by three targeted experiments across diverse datasets. This comprehensive approach demonstrates the algorithm’s performance characteristics and computational efficiency in various application scenarios.

5.1. Datasets

To ensure a comprehensive and representative evaluation, we selected three datasets with significant domain differences and diverse spatial characteristics. Each dataset provides specific validation dimensions and challenging scenarios for our algorithm.

5.1.1. SRTM 90M DEM

The SRTM 90M DEM (Shuttle Radar Topography Mission 90 Meter Digital Elevation Model) is a global digital elevation model jointly released by the National Aeronautics and Space Administration (NASA) and the National Geospatial-Intelligence Agency (NGA). This dataset provides global terrain coverage with a spatial resolution of 90 m and is accessible from the CGIAR Consortium for Spatial Information (CGIAR-CSI) website (https://srtm.csi.cgiar.org/ (accessed on 27 September 2024)). It constitutes an important open resource for terrain analysis research.

5.1.2. Beijing WIFI Access Point Dataset

The dataset utilized in this experiment comprises the geospatial distribution information of 23,144,836 WIFI access points across Beijing, comprehensively reflecting the density and distribution characteristics of the city’s digital infrastructure. Each record details the access point’s latitude and longitude coordinates, MAC address, and associated address information. Given our focus on the core metropolitan area, the study scope was confined to the region within Beijing’s Fourth Ring Road, from which 6,982,254 data points were extracted, with only their coordinates used for subsequent analysis. The spatial distribution pattern of WIFI access points is highly correlated with human activity intensity and urban development density, establishing this dataset as an ideal testing scenario for validating the algorithm’s capability to identify similar distribution patterns. The inherent high density and heterogeneity of these data points provide a challenging validation environment for the pattern matching algorithm, effectively assessing the proposed method’s robustness and efficacy in processing complex urban spatial data.

5.1.3. Beijing POI Dataset

The Beijing Points of Interest (POI) dataset contains 633,372 spatial records covering nine major functional categories, including catering services, scenic spots, corporate enterprises, commercial shopping centers, and business residences. Consistent with our WIFI dataset processing approach, we limited the analysis to within Beijing’s Fourth Ring Road, ultimately utilizing 135,659 data points with only their geographical coordinates extracted. POI data typically form functional cluster areas (such as business centers and shopping districts), presenting unique spatial distribution patterns. This provides an ideal testing foundation for evaluating the algorithm’s performance in identifying similarities between urban functional areas.

5.2. Experimental Setup

5.2.1. Experimental Design

Euclidean distance is employed for ring vector similarity measurement and equal weighting is adopted across ring layers in our experimental framework, with detailed justifications provided in Section 5.2.2 and Section 5.2.3. Three experiments were designed to evaluate the algorithm’s rotation invariance, engineering applicability, and multi-type data adaptability.

Experiment 1: Dam Terrain Classification. Eight representative dams worldwide were selected as research subjects, with terrain features extracted from SRTM 90M DEM data. These dams can be classified into two categories based on construction terrain: gravity dams (Three Gorges, Itaipu, Guri, Grand Coulee) located in wide U-shaped valleys spanning one to two kilometers, and arch dams (Baihetan, Xiluodu, Wudongde, Jinping-I) situated in narrow V-shaped canyons with steep slopes. The primary objective of this experiment is to verify the effectiveness of the proposed method in identifying similar terrain features, and to benchmark its performance against established baseline methods, including Normalized Cross-Correlation (NCC), Fourier-Mellin Transformation (FMT) [18], and Zernike Moments (ZM) [19].
For each dam area, a $25 \times 25$ grid size was employed, covering approximately 2.25 km × 2.25 km around each dam. This dimension was determined through preliminary testing with grid sizes ranging from $15 \times 15$ to $35 \times 35$ , which demonstrated stable discrimination performance in the $20 \times 20$ to $30 \times 30$ range. The $25 \times 25$ configuration was selected to optimally balance three requirements: capturing sufficient terrain context beyond the dam structure, maintaining adequate resolution for elevation gradient representation, and avoiding inclusion of irrelevant distant terrain.
Experiment 2: FAST Telescope Site Selection. China’s FAST radio telescope site selection was used as a case study to demonstrate practical engineering application. The telescope requires karst depression terrain with “high periphery, low center” morphology. An $8 \times 8$ grid was extracted from the actual FAST construction site as the template, covering 720 m × 720 m at 90 m resolution.
This grid size was determined to directly correspond to engineering requirements: the $8 \times 8$ configuration at 90 m resolution yields 720 m total coverage, accommodating FAST’s 500 m aperture with approximately 110 m peripheral margin on each side. This margin is essential for capturing the surrounding high-elevation terrain that characterizes suitable sites. Smaller grids would truncate the critical peripheral features, while larger grids would introduce irrelevant distant terrain. The search space encompassed the southern Guizhou karst region ( $5071 \times 1621$ grid cells, approximately 456 km × 146 km), and the top-20 candidate regions were retrieved.
Experiment 3: Urban Functional Area Identification. WiFi access point and POI distributions within Beijing’s Fourth Ring Road were analyzed to demonstrate adaptability to urban spatial data. The study area was partitioned into 200 m × 200 m grid cells, totaling $112 \times 120$ cells.
This cell size was chosen to align with typical urban functional block scales in Beijing, where coherent zones (residential compounds, commercial clusters, parks) span 100–300 m. Finer resolutions would fragment functional areas across excessive cells, while coarser resolutions would merge distinct zones. At 200 m resolution, the grid provides sufficient detail for block-level analysis while maintaining computational tractability for large point datasets. Taoranting Park and its northwestern region were selected as the template, exhibiting a distinct “dense on one side, sparse on the other” pattern characteristic of park-residential interfaces. The top 10 most similar regions ( $K = 10$ ) were retrieved for both datasets.

5.2.2. Distance Measurement Method

Euclidean distance was selected as the fundamental metric for measuring ring vector similarity because of three key advantages. First, it is geometrically intuitive: it measures straight-line distance in multidimensional space and therefore directly reflects differences in spatial distribution patterns. Second, it is computationally efficient, with O(n) complexity for vectors of length n, making the method suitable for large-scale analysis. Third, it satisfies the essential properties of a metric (non-negativity, identity of indiscernibles, symmetry, and the triangle inequality), ensuring reliable and consistent measurement behavior.

For two ring vectors

V_{i}

and

V_{j}

, their Euclidean distance is defined as Equation (18):

d (V_{i}, V_{j}) = \sqrt{\sum_{l = 1}^{n} {(v_{i l} - v_{j l})}^{2}},

(18)

where n is the dimension of the vector, and

v_{i l}

and

v_{j l}

represent the values of the two vectors in the l-th dimension, respectively.

5.2.3. Multi-Layer Ring Vector Weighting Strategy

When calculating comprehensive similarity using multi-layer ring vector distances, we adopted an average weighting strategy. Specifically, each ring layer is assigned an equal weight coefficient as Equation (19):

w_{r} = \frac{1}{R}, r = 1, 2, \dots, R,

(19)

where

R

is the total number of ring layers, and

w_{r}

is the weight coefficient for the r-th ring layer.

In our concentric ring structure, ring layer r contains

8 r

elements, meaning outer rings contain progressively more elements and cover larger geographic areas. When Euclidean distance is computed between corresponding ring layers, larger distance values are naturally produced by outer rings due to their increased element counts, even with similar per-element differences. This natural increase effectively emphasizes large-scale spatial structures without requiring explicit weighting schemes. These natural data distribution characteristics are preserved through equal weighting, allowing the relative importance of different scales to emerge from the data structure itself rather than being imposed through arbitrary parameters. This makes the algorithm more general and applicable across diverse domains without requiring application-specific calibration.
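To make the effect of ring size concrete, suppose (purely as an illustrative assumption) that two regions differ by the same amount δ at every element of ring r. Substituting into Equation (18) with n = 8r gives:

```latex
d_{r} = \sqrt{\sum_{l = 1}^{8 r} \delta^{2}} = \delta \sqrt{8 r},
\qquad
\frac{d_{4}}{d_{1}} = \sqrt{\frac{32}{8}} = 2 .
```

Under equal weights, ring 4 therefore contributes twice as much as ring 1 to the weighted distance, even though the per-element difference is the same.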

5.3. Experiment Results and Analysis

5.3.1. Experiment 1

To validate the proposed algorithm’s performance in terrain feature capture and rotational invariance, this experiment was conducted using eight globally representative dams as research subjects, with NCC, FMT, and ZM applied as baseline methods for comparison. Based on their structural types and terrain adaptability, these dams were classified into two categories:

Gravity Dams: These are distributed across wide, gentle U-shaped valleys or Y/T-type river sections. The river valleys typically span 1–2 km in width with gradual slopes and minimal upstream-downstream elevation differences. The reservoir areas are predominantly characterized by wide valleys, hills, or plateaus. Representative projects include Three Gorges Dam (Yangtze River, China), Itaipu Hydroelectric Dam (Paraná River, Brazil/Paraguay), Guri Hydroelectric Power Station (Caroní River, Venezuela), and Grand Coulee Dam (Columbia River, USA).
Arch Dams: These are primarily concentrated in deeply incised canyon regions with narrow V-shaped or U-shaped valleys. These sites feature steep slopes, high mountain symmetry, significant elevation differences, and dramatic terrain fluctuations. Representative projects include Baihetan Hydropower Station (Jinsha River, China), Xiluodu Hydropower Station (Jinsha River, China), Wudongde Hydropower Station (Jinsha River, China), and Jinping-I Hydropower Station (Yalong River, China).

Figure 4 presents DEM renderings of selected study areas, illustrating the distinct terrain characteristics associated with different dam types. In the experiment, using each dam’s center as the origin point, we extracted $25 \times 25$ grid regions (covering 2.25 km × 2.25 km) from SRTM 90 m DEM data, ensuring the analysis window encompassed both the dam structure and surrounding characteristic terrain features.

Figure 4. Representative dams DEM rendering images. (a) Three Gorges Dam. (b) Itaipu Hydroelectric Dam. (c) Baihetan Hydropower Station. (d) Xiluodu Hydropower Station.

The proposed algorithm was applied to calculate the distance D between the eight dam regions, which was then converted to a normalized similarity score S:

S = 1 - \frac{D}{D_{m a x}},

(20)

where $D_{max}$ represents the maximum distance value among all dam pairs, ensuring that $S \in [0, 1]$, with higher values indicating greater similarity.

To quantitatively assess algorithm performance, we defined three key metrics:

Intra-class Similarity ($S_{\text{intra-class}}$): The average similarity score between dam pairs within a single category. In this experiment, we calculate two such metrics: $S_{\text{intra-gravity}}$ for gravity dams and $S_{\text{intra-arch}}$ for arch dams.
Inter-class Similarity ($S_{\text{inter}}$): The average similarity score between dam pairs across different categories.
Discrimination ($\Delta S$): The overall classification capability, defined as the difference between the average intra-class similarity ($S_{\text{intra-avg}}$) and the inter-class similarity ($S_{\text{inter}}$).
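The normalization in Equation (20) and the three metrics can be sketched as below. This is an illustrative sketch under stated assumptions: the 4 × 4 distance matrix and the labels are made-up toy values (not the dam data), the function names are hypothetical, and the average intra-class similarity is taken as the mean of the per-class values.

```python
import numpy as np
from itertools import combinations

def similarity_matrix(D):
    """Equation (20): S = 1 - D / D_max, with D_max the largest pairwise distance."""
    D = np.asarray(D, dtype=float)
    return 1.0 - D / D.max()

def discrimination(S, labels):
    """Return per-class intra similarity, inter-class similarity, and Delta S."""
    intra, inter = {}, []
    for a, b in combinations(range(len(labels)), 2):
        if labels[a] == labels[b]:
            intra.setdefault(labels[a], []).append(S[a, b])
        else:
            inter.append(S[a, b])
    s_intra = {c: float(np.mean(v)) for c, v in intra.items()}
    s_inter = float(np.mean(inter))
    s_intra_avg = float(np.mean(list(s_intra.values())))
    return s_intra, s_inter, s_intra_avg - s_inter

# Toy pairwise distances for four regions (two per class); values are invented.
labels = ["gravity", "gravity", "arch", "arch"]
D = [[0, 1, 4, 4],
     [1, 0, 4, 4],
     [4, 4, 0, 2],
     [4, 4, 2, 0]]
S = similarity_matrix(D)
s_intra, s_inter, delta_s = discrimination(S, labels)
```

In this toy case the within-class pairs are closer than the cross-class pairs, so the discrimination value is positive; a value near zero would indicate that the measure does not separate the two classes.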

All four methods (our method, NCC, FMT, and ZM) were applied to the eight dam DEM datasets. For NCC, the maximum correlation coefficient was first converted to a distance metric (D = 1 − NCC), which was subsequently normalized using Equation (20) to yield the final similarity score. For FMT, ZM, and our method, similarity was calculated based on the Euclidean distance between their respective feature vectors, which was then normalized using Equation (20). The resulting classification performance metrics are summarized in Table 1.

Table 1. Comparative performance on the Dam Classification task.

The results in Table 1 show that the proposed method outperforms the baselines. It achieved intra-class similarity scores of $S_{\text{intra-gravity}} = 0.8855 \pm 0.0176$ for gravity dams and $S_{\text{intra-arch}} = 0.5654 \pm 0.0652$ for arch dams. These results are notable because dams within the same category differ substantially in orientation (e.g., Itaipu and Guri are rotated almost 180° relative to each other). The ability to identify dams of the same type regardless of their orientation supports the method’s rotation-invariant characteristics.

This rotational robustness stands in sharp contrast to the traditional Template Matching (NCC) method, which failed entirely ($\Delta S \approx 0$) because of its directional sensitivity. The established rotation-invariant methods, FMT and Zernike Moments, performed better (positive $\Delta S$), which is consistent with their known theoretical properties. However, the proposed method achieved a far higher discrimination score of $\Delta S = 0.5267$.

This comparative analysis confirms the advancement of the proposed approach. The method achieves robust rotation invariance while simultaneously preserving the multi-scale spatial structure. This preservation of interpretable spatial patterns allows it to capture the nuanced terrain differences between the “U-shaped” gravity dam valleys and the “V-shaped” arch dam canyons more effectively than the other methods.

5.3.2. Experiment 2

Southern Guizhou is one of China’s most extensively developed karst regions, characterized by widespread dissolution depressions (approximately 300–800 m in diameter). These natural formations inherently conform to the “high periphery, low center” bowl-shaped terrain requirements of the Five-hundred-meter Aperture Spherical Telescope (FAST) [37]. This experiment aims to validate the proposed algorithm’s capability to precisely search for specific terrain patterns in complex topographical regions and to evaluate its practical application value in large-scale engineering site selection.

The experiment used Digital Elevation Model (DEM) data from the actual FAST telescope construction site (Dawodang depression in Pingtang County, Guizhou Province) as a template, extracting an $8 \times 8$ grid (720 m × 720 m, 90 m resolution) that covered its core depression area (approximately 500 m in diameter). SRTM 90 m DEM data from southern Guizhou served as the search space, encompassing $5071 \times 1621$ grid cells (approximately 456 km × 146 km) and covering regions with dense karst topography. The actual FAST site location falls within this search range, facilitating comparison between algorithm-recommended areas and the actual site’s terrain.

By applying Algorithm 3 and traversing the entire search area with an $8 \times 8$ sliding window, we extracted the 20 candidate regions with the highest similarity scores. Table 2 presents the key parameters of these candidate regions and their comparison with the FAST template region. Evaluation metrics include similarity scores and critical engineering parameters, namely peripheral elevation difference and depression diameter.
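The sliding-window traversal and top-K selection can be sketched as follows. This is a brute-force illustration, not Algorithm 3 itself: it omits the pre-filtering and incremental computation, does not suppress overlapping windows, and uses a plain element-wise Euclidean distance as a stand-in where the ring-vector distance would be plugged in. The function name `top_k_windows` and the synthetic data are assumptions.

```python
import heapq
import numpy as np

def top_k_windows(search, template, k, distance):
    """Score every n x n window of `search` against `template`; return the k closest."""
    n = template.shape[0]
    rows, cols = search.shape
    scored = []
    for i in range(rows - n + 1):
        for j in range(cols - n + 1):
            window = search[i:i + n, j:j + n]
            scored.append((distance(window, template), i, j))
    # Smallest distance first; each entry is (distance, top row, left column).
    return heapq.nsmallest(k, scored)

# Synthetic search space with the template cut out of a known location.
rng = np.random.default_rng(0)
search = rng.random((40, 50))
template = search[12:20, 30:38].copy()

# Stand-in distance (assumption): element-wise Euclidean distance between windows.
hits = top_k_windows(search, template, k=3,
                     distance=lambda a, b: float(np.linalg.norm(a - b)))
```

Passing the distance as a parameter keeps the traversal independent of the similarity measure, so a rotation-invariant measure can be substituted without changing the search loop.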

Table 2. FAST Construction Site Similar Area Search Results.

Area	Similarity Score (0–1)	Peripheral Elevation Difference (m)	Depression Diameter (m)	Figure
FAST	1.0	255	600	Figure 5a
1	0.9516	209	560
2	0.9484	228	580	Figure 5b
3	0.9479	231	650
4	0.9468	225	680
5	0.9448	220	650	Figure 5c
6	0.9430	256	500
7	0.9424	201	400
8	0.9419	261	500	Figure 5d
9	0.9418	254	450	Figure 5e
10	0.9417	211	560	Figure 5f
11	0.9416	237	600
12	0.9408	261	650	Figure 5g
13	0.9408	261	650	Figure 5h
14	0.9401	200	500
15	0.9400	237	480
16	0.9391	232	400	Figure 5i
17	0.9391	211	500
18	0.9389	224	650
19	0.9387	315	400	Figure 5j
20	0.9387	205	380
avg	0.9424 ± 0.0036	234.55 ± 23.82	539.5 ± 96.5

The ring vector-based algorithm demonstrated high precision and robustness in identifying specific terrain patterns. All Top-20 candidate regions identified by the algorithm exhibited the typical “high periphery, low center” karst depression morphology, giving a precision of 100% for this search task, with regions 2, 5, 8, 14, and 16 displaying particularly high structural similarity to the actual FAST radio telescope construction site. Quantitative analysis revealed:

The average similarity score of candidate regions reached $0.9424 \pm 0.0036$, indicating the algorithm’s high precision in identifying terrain features. The small standard deviation ($0.0036$) reflects the stability and consistency of the algorithm’s output, an important attribute for large-scale spatial data analysis.
The average peripheral elevation difference was $234.55 \pm 23.82$ m, close to the FAST template region (255 m) and within the acceptable engineering error range.
The mean depression diameter was $539.5 \pm 96.5$ m, with most regions satisfying FAST’s engineering requirements for bowl-shaped depression dimensions (500 m aperture). This demonstrates the algorithm’s capacity to capture terrain features at specific scales.

Despite the algorithm’s effective performance in terrain pattern recognition, actual engineering site selection necessitates comprehensive consideration of multidimensional constraint conditions. The current analysis, based solely on elevation features, presents the following limitations:

Unidimensionality of Features: The algorithm matches solely based on elevation features, without integrating critical engineering parameters such as transportation accessibility, hydrological distribution, and geological stability. Deeper analysis revealed that candidate regions 9 and 19 were traversed by existing roads, while region 12 overlapped with rivers by more than 35%. Despite their high terrain conformity, these regions are unsuitable for constructing large radio telescopes.
Single-Scale Analysis: The experiment employed only an $8 \times 8$ grid (720 m × 720 m) as the template scale, without considering multi-scale fusion analysis, potentially leading to insufficient assessment of terrain stability over larger areas. In practical engineering, the terrain characteristics of the broader area surrounding FAST similarly exert significant influence on engineering stability and electromagnetic environment.

Figure 5. Representative areas among the top 20 regions most similar to the FAST construction site. (a) FAST. (b) area 2. (c) area 5. (d) area 8. (e) area 9. (f) area 10. (g) area 12. (h) area 14. (i) area 16. (j) area 19.

This experiment validated the proposed algorithm’s capability to precisely identify specific spatial patterns within large-scale terrain data. In the FAST radio telescope site selection case study, the algorithm efficiently screened candidate regions meeting the “high periphery, low center” bowl-shaped terrain requirements, providing robust data support for preliminary site selection. Quantitative analysis of key indicators such as similarity scores, peripheral elevation differences, and depression diameters demonstrated high congruence between algorithm-identified candidate regions and engineering requirements.

The case study highlights that in practical engineering applications, our algorithm should function as a pre-filtering component within a larger Multi-Criteria Decision Analysis (MCDA) framework. This framework is necessary because structural similarity alone is insufficient; comprehensive evaluation requires integrating other specialized assessments, such as hydrology, geology, and accessibility. Future work will focus on integrating our ring vector method with diverse geographic information data to construct a formal, integrated evaluation system, thereby maximizing its application value in complex site selection tasks.

5.3.3. Experiment 3

WiFi distribution and Points of Interest (POI) distribution effectively reflect urban spatial structure characteristics and social activity patterns, particularly spatial features related to population mobility, commercial activities, and urban functional layout. This experiment aims to validate the proposed algorithm’s generalization capability and application potential in multi-type urban data analysis, with specific objectives including:

Searching for regions with the highest similarity within Beijing’s Fourth Ring Road based on WiFi access points and POI distribution data, using the same target area.
Comparing matching results from both data sources, analyzing their spatial overlap and differences.
Evaluating the algorithm’s practical value in identifying urban functional zones and its implications for urban planning.

This study divided the area within Beijing’s Fourth Ring Road into a 200 m × 200 m grid network (totaling $112 \times 120$ grid cells), with each cell covering approximately 40,000 square meters, suitable for block-level spatial analysis. Two types of spatial distribution feature matrices were constructed:

WiFi Density Matrix $M_{\text{wifi}}$, where $M_{\text{wifi}}(i, j) = \mathrm{count}(AP_{ij})$, with $\mathrm{count}(AP_{ij})$ representing the number of WiFi access points within grid cell $(i, j)$, reflecting regional human activity intensity and real-time population density.
POI Density Matrix $M_{\text{POI}}$, where $M_{\text{POI}}(i, j) = \mathrm{count}(POI_{ij})$, with $\mathrm{count}(POI_{ij})$ representing the number of POIs of all categories within grid cell $(i, j)$, reflecting regional infrastructure distribution and functional attributes.
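Both matrices are per-cell counts, which can be sketched as below. The sketch assumes point coordinates already projected to metres, a grid origin at the south-west corner, and row indices increasing with the y coordinate; these conventions, the function name `density_matrix`, and the sample points are illustrative assumptions rather than details given in the paper.

```python
import numpy as np

def density_matrix(x, y, x_min, y_min, n_rows, n_cols, cell=200.0):
    """Count points per grid cell; x and y are projected coordinates in metres."""
    col = np.floor((np.asarray(x, dtype=float) - x_min) / cell).astype(int)
    row = np.floor((np.asarray(y, dtype=float) - y_min) / cell).astype(int)
    inside = (row >= 0) & (row < n_rows) & (col >= 0) & (col < n_cols)
    M = np.zeros((n_rows, n_cols), dtype=int)
    # np.add.at accumulates correctly when several points fall in the same cell.
    np.add.at(M, (row[inside], col[inside]), 1)
    return M

# Four hypothetical points on a 2 x 2 grid of 200 m cells; the last lies outside.
xs = [50.0, 150.0, 250.0, 999.0]
ys = [50.0, 199.0, 50.0, 999.0]
M = density_matrix(xs, ys, x_min=0.0, y_min=0.0, n_rows=2, n_cols=2)
```

The same function would serve for either data source, since only the list of point locations changes between access points and POIs.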

The study selected Taoranting Park and its northwestern region as the template area, as shown in Figure 6. This region combines green space (low density) and residential/commercial mixed areas (medium-high density), displaying a distinct “dense on one side, sparse on the other” spatial distribution pattern, making it suitable as a representative sample of mixed functional zones.

Figure 6. Map of Taoranting Park and its northwestern region (the template). Red and green overlays highlight the high-density (built-up) and low-density (park) areas, respectively. The overlaid grid on each map represents 200 m × 200 m cells.

Applying the proposed algorithm, the top-10 similar regions were identified in both the WiFi density matrix and the POI density matrix, denoted as sets $R_{\text{WIFI}}$ and $R_{\text{POI}}$. Through qualitative and quantitative comparison of the matching results from both data sources, the algorithm’s effectiveness in capturing urban spatial structure characteristics was evaluated, with particular attention to its identification capabilities across different spatial orientations.

The algorithm identified “half-dense half-sparse” distribution patterns similar to the target area (Taoranting Park and surroundings) in both the WiFi density matrix (Figure 7) and the POI density matrix (Figure 8).

Figure 7. Top-10 matching results for the Taoranting area on the WiFi dataset. Red and green overlays highlight the corresponding high-density and low-density areas in each matched region. The overlaid grid on each map represents 200 m × 200 m cells. (a) W1. (b) W2. (c) W3. (d) W4. (e) W5. (f) W6. (g) W7. (h) W8. (i) W9. (j) W10.

Figure 8. Top-10 matching results for the Taoranting area on the POI dataset. Red and green overlays highlight the corresponding high-density and low-density areas in each matched region. The overlaid grid on each map represents 200 m × 200 m cells. (a) P1. (b) P2. (c) P3. (d) P4. (e) P5. (f) P6. (g) P7. (h) P8. (i) P9. (j) P10.

In the WiFi dataset matching results, all top 10 regions exhibited the typical “dense on one side, sparse on the other” distribution characteristics, primarily including:

Chaoyang Park Fenghuayuan and its northwestern region (W1): sparse WiFi in the park area, dense WiFi in surrounding residential areas.
Beihai Park and its northern surrounding area (W2): sparse WiFi distribution within the park, dense WiFi distribution in northern commercial and residential areas.
Lianhuachi Park and its western and northern regions (W3): sparse WiFi in the park and railway areas, dense WiFi in surrounding schools and residential areas.
Jingshan Park north of the Forbidden City and surrounding areas (W8): sparse in the park area, dense WiFi in the eastern side.
Zhongnanhai and its western region (W10): Zhongnanhai’s water area has sparse WiFi, while the western downtown residential area has dense WiFi.

In the POI dataset matching results, the top 10 regions similarly reflected “dense on one side” distribution characteristics, primarily including:

South of the Forbidden City and around Tiananmen Square (P1): sparse POI in the northern Forbidden City, more POI around southern Tiananmen Square and the National Museum.
Wanliu Golf Course and its eastern region (P3): sparse POI in the golf course, dense POI in eastern residential areas.
Yuyuantan Park and its northern region (P5): sparse POI in the park area, dense POI in northern residential areas.
Taiyangong Sports and Leisure Park and its southwestern region (P6): sparse POI in the large park area, dense POI in the southwest with residential areas, schools, and companies.
Beihai Park, Jingshan Park, and their surroundings (P10): fewer POI in Beihai Park and Jingshan Park, more POI in northeastern residential and commercial areas.

These matching results demonstrate that the proposed algorithm effectively captures functional zone differences in urban spaces, with significant advantages in identifying typical mixed functional zones such as “ecological-residential” and “park-commercial” areas.

By comparing the spatial directional distribution of the target area with the matching results, we further validated the algorithm’s advantages in rotational invariance. The Taoranting area exhibits a distinct directional distribution of “dense northwest, sparse southeast.” In the matching results, some regions displayed spatial orientations different from the template:

WiFi matching region W6 shows sparse northwest and dense southeast distribution, opposite to the template area; W9 exhibits “northeast-southwest” density gradient distribution; W4, W5, W7, and W8 all display “north-south” density gradient distribution; W10 shows an “east-west” distribution pattern.
POI matching region P1 has its dense area in the southeast, opposite to the template; P2 and P5 display “north-south” density gradient distribution; P3 and P10 show “east-west” distribution trends; P6 and P7 exhibit “northeast-southwest” density gradient distribution.

Despite these directional differences, the algorithm dynamically adjusts the starting point of ring vector traversal through maximum value point positioning, enabling the generated feature vectors to maintain high similarity. This characteristic makes the algorithm applicable to natural urban data distribution without requiring preset directional constraints.
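The starting-point adjustment can be illustrated with a small sketch. It is a simplification under stated assumptions: each ring is traversed clockwise and then cyclically shifted to begin at its own maximum, whereas the method described here anchors the traversal on the maximum value point of the regional feature matrix. The function names are hypothetical, and the check covers only rotations by multiples of 90° on a square grid with a unique maximum per ring.

```python
import numpy as np

def ring(M, r):
    """Clockwise traversal of ring r around the centre of an odd-sized square matrix."""
    c = M.shape[0] // 2
    top = M[c - r, c - r:c + r]            # left to right
    right = M[c - r:c + r, c + r]          # top to bottom
    bottom = M[c + r, c + r:c - r:-1]      # right to left
    left = M[c + r:c - r:-1, c - r]        # bottom to top
    return np.concatenate([top, right, bottom, left])  # 8 r elements

def canonical_ring(M, r):
    """Cyclically shift the ring so that traversal starts at its maximum value."""
    v = ring(M, r)
    return np.roll(v, -int(np.argmax(v)))

rng = np.random.default_rng(1)
M = rng.random((7, 7))      # random values, so each ring maximum is unique
rotated = np.rot90(M)       # the same region rotated by 90 degrees

same = all(
    np.allclose(canonical_ring(M, r), canonical_ring(rotated, r))
    for r in (1, 2, 3)
)
```

A 90° rotation only shifts where the clockwise traversal begins, so re-anchoring each ring at its maximum yields the same vector for both orientations.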

Both WiFi and POI data reflect regional population density distribution to some extent, and the algorithm captured the spatial coupling between the two data types:

Jingshan Park area (W8/P10): selected as a high-similarity area by both the WiFi and POI models; the park forms a low-density zone while surrounding commercial clusters form high-density zones, closely reproducing the mixed functional features of the template.
Beihai Park area (W2/P4): displays a north-south pattern in both the WiFi and POI distributions, with sparse WiFi and POI in the southern park area and dense WiFi and POI in the northern commercial and residential areas.
Yuyuantan Park area (W4/P5): exhibits “north-south” density gradients in both data sources, reflecting the typical functional distribution of park-residential areas.

This spatial coupling of multi-type data verifies the algorithm’s effectiveness and stability in identifying urban functional zones.

This experiment demonstrates that the proposed ring vector-based rotation-invariant spatial distribution similarity analysis method has good applicability and practical value in identifying urban functional zones. The analysis results based on multi-type urban data provide new technical approaches and analytical perspectives for urban spatial structure research, functional zoning planning, and commercial decision-making.

5.4. Computational Performance and Scalability Analysis

To empirically validate the algorithm’s scalability and address its performance on large-scale datasets, a computational performance analysis of the Similar Region Search (Algorithm 3) was conducted. This experiment was specifically designed to measure the practical execution time of the search procedure and verify its relationship with the theoretical time complexity.

The experiment measured the total execution time of Algorithm 3 as a function of the search space size. A target matrix ($M_{tar}$) of fixed size $N \times N$ (where $N = 32$) was used. A series of increasingly large, randomly generated search space matrices ($M_{cand}$) was prepared with dimensions $R \times C$ (where $R = C$). The search space dimension R was varied incrementally, with $R \in \{32, 64, 128, 256, 512, 1024\}$.

For each R, the total runtime required to execute Algorithm 3 (including the pre-filtering and refined similarity calculation for all candidate regions to retrieve the Top-K results) was recorded. All tests were executed on an Intel Core i5-12600KF CPU @ 3.70 GHz with 32 GB RAM (Intel Corp., Santa Clara, CA, USA).

The relationship between the search space dimension (R) and the total execution time (T) is presented in Figure 9a. The plot exhibits a quadratic growth trend. To further clarify this relationship, execution time was also plotted against the total number of pixels in the search space ($R^{2}$), as shown in Figure 9b, which reveals a strong linear relationship.

Figure 9. Scalability analysis of the Similar Region Search (Algorithm 3). (a) Execution time plotted against the search space dimension (R). (b) Execution time plotted against the total number of pixels ($R^{2}$).

This empirical result is consistent with the optimized theoretical time complexity derived in Section 4.3. The total complexity of Algorithm 3 is dominated by the pre-filtering step, which is $O((R - N + 1) \cdot (C - N + 1) \cdot N)$. The number of sub-windows (candidate locations) to be evaluated is $(R - N + 1) \times (R - N + 1)$, which scales as $O(R^{2})$. Due to the incremental computation optimization, the processing time for each window is reduced to $O(N)$. Since N (the target size) is a fixed constant in this experiment, the total time T is theoretically $O(R^{2})$. The observed quadratic curve in Figure 9a confirms this $T \propto R^{2}$ relationship. For instance, searching a $1024 \times 1024$ search space completed in 50.069 s. This demonstrates that the optimized search algorithm is computationally efficient and scales predictably, supporting its viability for the large-scale spatial analysis applications presented in this paper.
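The window counts behind this scaling argument can be worked out directly. The sketch below evaluates $(R - N + 1) \times (C - N + 1)$ for the tested sizes and divides the one reported runtime (50.069 s for the 1024 × 1024 case) by the corresponding window count; the resulting per-window figure is only a rough average over all steps of the search, and the variable names are illustrative.

```python
def window_count(R, C, N):
    """Number of N x N sub-windows in an R x C search space."""
    return (R - N + 1) * (C - N + 1)

N = 32
sizes = [32, 64, 128, 256, 512, 1024]
counts = {R: window_count(R, R, N) for R in sizes}

# Rough average cost per window for the single reported timing (assumption:
# the whole 50.069 s is attributed evenly to the candidate windows).
per_window_seconds = 50.069 / counts[1024]

for R in sizes:
    print(f"R = {R:5d}  windows = {counts[R]:8d}")
```

For the smallest case (R = N) there is exactly one window, and the count approaches $R^{2}$ as R grows relative to N, which is why the runtime curve looks quadratic in R and linear in $R^{2}$.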

6. Conclusions

This study presented a rotation-invariant spatial distribution similarity analysis method based on ring vectors. The proposed approach successfully addresses the sensitivity issues of traditional methods under rotational transformations through its dynamic starting point selection mechanism, multi-layer ring vector representation, and bidirectional matching strategy.

The method’s real-world applicability was validated across three diverse experimental scenarios: robust terrain classification (Experiment 1), high-precision engineering site selection (Experiment 2), and multi-source urban functional area identification (Experiment 3). Together, these experiments demonstrate the method’s broad applicability as a general spatial pattern matching tool in diverse real-world scenarios, including environmental monitoring (e.g., identifying vegetation degradation patterns), geological hazard assessment (e.g., landslide morphology matching), optimal facility placement, and urban dynamics analysis (e.g., tracking functional zone evolution). Moreover, by preserving the multi-scale spatial structure, the method enhances feature interpretability, offering a transparent alternative to the “black-box” nature of deep learning-based models.

Despite these successful results, the authors acknowledge several limitations that define the method’s applicable scenarios and scope:

Rasterization dependency: The method requires structured grid data as input, and the choice of rasterization parameters—particularly grid cell size and density function—is critical and can significantly influence the final similarity results.
Anchor point sensitivity: The single global maximum anchor may be affected by noise or multi-modal distributions, potentially reducing rotational consistency.
Unidimensionality of features: The current method performs matching based on a single spatial feature (e.g., elevation or POI density), without integrating heterogeneous data such as hydrology, geology, or accessibility, which limits its applicability in multi-criteria real-world scenarios.

Building upon these limitations, a clearer roadmap for future work can be established. First, to address the anchor point sensitivity, a more robust anchor mechanism could be developed, such as using the centroid of the highest-value cluster rather than a single pixel. Second, as noted in the FAST experiment limitations (Section 5.3.2), the method should be integrated within Multi-Criteria Decision Analysis (MCDA) frameworks to combine morphological similarity with other practical engineering or planning constraints. Third, explicit multi-scale analysis should be incorporated to evaluate spatial patterns across a spectrum of scales, rather than a single fixed grid size. Finally, future work will also focus on multi-source feature fusion, adaptive weight learning, and the development of an open-source tool to promote wider adoption and application of the algorithm.

Author Contributions

Conceptualization, Zhi Cai; methodology, experiment and data curation, Zhi Cai and Hongyu Pan; writing—original draft preparation, Hongyu Pan; writing—review and editing, Shuaibing Lu, Zhi Cai and Hongyu Pan; supervision, Limin Guo and Xing Su. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62072016 and the Beijing Natural Science Foundation under Grant 4244074.

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhang, A.; Tariq, A.; Quddoos, A.; Naz, I.; Aslam, R.W.; Barboza, E.; Ullah, S.; Abdullah-Al-Wadud, M. Spatio-temporal analysis of urban expansion and land use dynamics using google earth engine and predictive models. Sci. Rep. 2025, 15, 6993.
Dritsas, E.; Trigka, M. Remote Sensing and Geospatial Analysis in the Big Data Era: A Survey. Remote Sens. 2025, 17, 550.
Adeyeye, A.; Lynch, C.; Hester, J.; Tentzeris, M. A machine learning enabled mmWave RFID for rotational sensing in human gesture recognition and motion capture applications. In Proceedings of the 2022 IEEE/MTT-S International Microwave Symposium-IMS 2022, Denver, CO, USA, 19–24 June 2022; pp. 137–140.
Wang, Z.; Ma, D.; Sun, D.; Zhang, J. Identification and analysis of urban functional area in Hangzhou based on OSM and POI data. PLoS ONE 2021, 16, e0251988.
Yang, X.; Xie, F.; Liu, S.; Zhu, Y.; Fan, J.; Zhao, H.; Fu, Y.; Duan, Y.; Fu, R.; Guo, S. Mapping Debris-Covered Glaciers Using High-Resolution Imagery (GF-2) and Deep Learning Algorithms. Remote Sens. 2024, 16, 2062.
Selmy, S.A.; Kucher, D.E.; Yang, Y.; García-Navarro, F.J. Geospatial Data: Acquisition, Applications, and Challenges. In Exploring Remote Sensing-Methods and Applications; IntechOpen Limited: London, UK, 2024.
Le Falher, G.; Gionis, A.; Mathioudakis, M. Where is the Soho of Rome? Measures and algorithms for finding similar neighborhoods in cities. In Proceedings of the International AAAI Conference on Web and Social Media, Oxford, UK, 26–29 May 2015; Volume 9, pp. 228–237.
Kim, G.Y.; Lee, W.H. Prediction of the spatial distribution of vine weevil under climate change using multiple variable selection methods. Sci. Rep. 2025, 15, 7845.
Liu, G.; Dai, E.; Ge, Q.; Wu, W.; Xu, X. A similarity-based quantitative model for assessing regional debris-flow hazard. Nat. Hazards 2013, 69, 295–310.
Nyssen, J.; Tielens, S.; Gebreyohannes, T.; Araya, T.; Teka, K.; Van de Wauw, J.; Degeyndt, K.; Descheemaeker, K.; Amare, K.; Haile, M.; et al. Understanding spatial patterns of soils for sustainable agriculture in northern Ethiopia’s tropical mountains. PLoS ONE 2019, 14, e0224041.
Zhao, Q.; Xiong, Y.; Li, Q.; Cui, X. Spatial Layout Planning of Medical and Health Institutions Based on the Concept of Healthy City: A Case Study of Mianyang. In Proceedings of the International Conference on Urban Climate, Sustainability and Urban Design, Sydney, Australia, 28 August–1 September 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 865–877.
Ylihärsilä, M.; Hirvonen, J. Grid shape descriptor using path integrals for measuring sheet metal parts similarity. Comput. Aided Des. Appl. 2022, 19, 712–721.
Ullah, F.; Kaneko, S. Using orientation codes for rotation-invariant template matching. Pattern Recognit. 2004, 37, 201–209.
Temenos, A.; Temenos, N.; Kaselimi, M.; Doulamis, A.; Doulamis, N. Interpretable deep learning framework for land use and land cover classification in remote sensing using SHAP. IEEE Geosci. Remote Sens. Lett. 2023, 20, 8500105.
Kakogeorgiou, I.; Karantzalos, K. Evaluating explainable artificial intelligence methods for multi-label deep learning classification tasks in remote sensing. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102520.
Temenos, A.; Tzortzis, I.N.; Kaselimi, M.; Rallis, I.; Doulamis, A.; Doulamis, N. Novel insights in spatial epidemiology utilizing explainable AI (XAI) and remote sensing. Remote Sens. 2022, 14, 3074.
Lian, Z.; Zhan, Y.; Zhang, W.; Wang, Z.; Liu, W.; Huang, X. Recent Advances in Deep Learning-Based Spatiotemporal Fusion Methods for Remote Sensing Images. Sensors 2025, 25, 1093.
Peng, T.; Yang, N.; Peng, X.; Chen, Z. A method of weak fault detection based on sparse representation for pmsm. In Proceedings of the 2020 Chinese Automation Congress (CAC), Shanghai, China, 6–8 November 2020; pp. 4412–4417.
Lu, X.; Yang, J. Image analysis with logarithmic Zernike moments. Digit. Signal Process. 2023, 133, 103829.
Ye, Y.; Bruzzone, L.; Shan, J.; Bovolo, F.; Zhu, Q. Fast and robust matching for multimodal remote sensing image registration. IEEE Trans. Geosci. Remote Sens. 2019, 57, 9059–9070.
Zhang, S.; Su, L. A new fast matching algorithm for angle-adaptive grayscale templates. In Proceedings of the 2019 2nd World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM), Shanghai, China, 22–24 November 2019; pp. 672–675.
Choi, M.S.; Kim, W.Y. A novel two stage template matching method for rotation and illumination invariance. Pattern Recognit. 2002, 35, 119–129.
Zhang, Y.; Lan, C.; Zhang, H.; Ma, G.; Li, H. Multimodal remote sensing image matching via learning features and attention mechanism. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5603620.
Lai, G.; Fan, Z.; Zhao, W.; Wang, Y.; Qu, X. An effective feature-to-template matching algorithm for multimodal geospatial data. In Proceedings of the Sixth International Conference on Geoscience and Remote Sensing Mapping (GRSM 2024), Qingdao, China, 25–27 October 2025; Volume 13506, pp. 302–307.
Yang, J.; Wang, H.; Yuan, J.; Li, Y.; Liu, J. Invariant multi-scale descriptor for shape representation, matching and retrieval. Comput. Vis. Image Underst. 2016, 145, 43–58.
Wang, Z.; Xu, G.; Cheng, Y.; Guo, R.; Wang, Z. A curvature salience descriptor for full and partial shape matching. Multimed. Tools Appl. 2018, 77, 27405–27426.
Mo, H.; Zhao, G. RIC-CNN: Rotation-invariant coordinate convolutional neural network. Pattern Recognit. 2024, 146, 109994.
Bazi, Y.; Bashmal, L.; Rahhal, M.M.A.; Dayil, R.A.; Ajlan, N.A. Vision transformers for remote sensing image classification. Remote Sens. 2021, 13, 516.
Kaczmarek, I.; Iwaniak, A.; Świetlicka, A. Classification of spatial objects with the use of graph neural networks. ISPRS Int. J. Geo-Inf. 2023, 12, 83.
Zhan, W.; Datta, A. Neural networks for geospatial data. J. Am. Stat. Assoc. 2025, 120, 535–547.
Markiewicz, J. The comparison of distance metrics in descriptor matching methods utilised in TLS-SfM point cloud registration. Rep. Geod. 2025, 119, 39–61.
Zhou, S.; Li, J.; Wang, H.; Shang, S.; Han, P. GRLSTM: Trajectory similarity computation with graph-based residual LSTM. In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 4972–4980.
Liu, B.; Wang, Z.; Zhang, J.; Wu, J.; Qu, G. DeepSIM: A novel deep learning method for graph similarity computation. Soft Comput. 2024, 28, 61–76.
Zhou, S.; Huang, C.; Wen, Y.; Chen, L. Feature Enhanced Spatial–Temporal Trajectory Similarity Computation. Data Sci. Eng. 2025, 10, 1–11.
Jin, J.; Song, Y.; Kan, D.; Zhang, B.; Lyu, Y.; Zhang, J.; Lu, H. Learning context-aware region similarity with effective spatial normalization over Point-of-Interest data. Inf. Process. Manag. 2024, 61, 103673.
Abbasi, O.R.; Alesheikh, A.A.; Lotfata, A. Semantic similarity is not enough: A novel NLP-based semantic similarity measure in geospatial context. IScience 2024, 27. [Google Scholar] [CrossRef]
Nan, R. Five hundred meter aperture spherical radio telescope (FAST). Sci. China Ser. G 2006, 49, 129–148. [Google Scholar] [CrossRef]

Figure 1. Remote Sensing Imagery of Ili River Valley in Xinjiang, China with Different Orientations. (a) East-West Oriented Remote Sensing Imagery of Ili River Valley. (b) North-South Oriented Remote Sensing Imagery of Ili River Valley.

Figure 2. Overall Framework of the Ring Vector-based Similarity Method.

Figure 3. Schematic Diagram of Key Steps in Ring Vector Generation. (a) Starting Point Calculation; lines of different colors represent different layers. (b) Raw Ring Vector Extraction. (c) Variation Trend of Feature Values.

Figure 4. DEM renderings of representative dams. (a) Three Gorges Dam. (b) Itaipu Hydroelectric Dam. (c) Baihetan Hydropower Station. (d) Xiluodu Hydropower Station.

Figure 6. Map of Taoranting Park and its northwestern region (the template). Red and green overlays highlight the high-density (built-up) and low-density (park) areas, respectively. The overlaid grid on each map represents $200\ \mathrm{m} \times 200\ \mathrm{m}$ cells.

Figure 7. TOP-10 matching results of Taoranting area on WIFI dataset. Red and green overlays highlight the corresponding high-density and low-density areas in each matched region. The overlaid grid on each map represents $200\ \mathrm{m} \times 200\ \mathrm{m}$ cells. (a) W1. (b) W2. (c) W3. (d) W4. (e) W5. (f) W6. (g) W7. (h) W8. (i) W9. (j) W10.

Figure 8. TOP-10 matching results of Taoranting area on POI dataset. Red and green overlays highlight the corresponding high-density and low-density areas in each matched region. The overlaid grid on each map represents $200\ \mathrm{m} \times 200\ \mathrm{m}$ cells. (a) P1. (b) P2. (c) P3. (d) P4. (e) P5. (f) P6. (g) P7. (h) P8. (i) P9. (j) P10.

Figure 9. Scalability analysis of the Similar Region Search (Algorithm 3). (a) Execution time plotted against the search space dimension ($R$). (b) Execution time plotted against the total number of pixels ($R^{2}$).

Table 1. Comparative performance on the Dam Classification task.

Method	$S_{intra\text{-}Gravity}$	$S_{intra\text{-}Arch}$	$S_{inter}$	$\Delta S$
NCC	$0.3550 \pm 0.2178$	$0.4665 \pm 0.1912$	$0.4738 \pm 0.1815$	$0.0631$
FMT	$0.4181 \pm 0.1589$	$0.3507 \pm 0.0776$	$0.2592 \pm 0.1940$	$0.1253$
ZM	$0.3940 \pm 0.2009$	$0.4372 \pm 0.168$	$0.2922 \pm 0.1773$	$0.1234$
Ours	$0.8855 \pm 0.0176$	$0.5654 \pm 0.0652$	$0.1988 \pm 0.1264$	$0.5267$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
