SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering

Guan, Xin; Huang, Zhijun; Yi, Xiao

doi:10.3390/electronics14102064

Open AccessArticle

SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering

by

Xin Guan

,

Zhijun Huang

^* and

Xiao Yi

Naval Aviation University, Yantai 264001, China

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(10), 2064; https://doi.org/10.3390/electronics14102064

Submission received: 21 April 2025 / Revised: 14 May 2025 / Accepted: 17 May 2025 / Published: 20 May 2025

(This article belongs to the Section Systems & Control Engineering)

Download

Browse Figures

Versions Notes

Abstract

The track-to-track association (T2TA) problem is a fundamental and critical challenge in information fusion, situational awareness, and target tracking. Existing algorithms based on statistical mathematics, fuzzy mathematics, gray theory, and artificial intelligence suffer from several limitations that are hard to solve, such as over-idealized models, unrealistic assumptions, insufficient real-time performance, and high computational complexity due to pairwise matching requirements. Considering these limitations, we propose a self-adaptive step-2-based K-nearest neighbor density peak clustering (SAS-KNN-DPC) algorithm to address T2TA problem. Firstly, the step-2 temporal neighborhood affinity matrix under a non-registration framework is defined and the calculation methods for multi-feature track-point fusion similarity matrix are given. Secondly, the proposed self-adaptive multi-feature similarity truncation matrix is defined to measure the multidimensional distance between track points and the self-adaptive step-2 truncation distance is also defined to enhance the adaptivity of the algorithm. Finally, we propose improved definitions of local distance and global relative distance to complete both cluster center selection and association assignment. The proposed algorithm eliminates the need for exhaustive pairwise matching between track sequences and avoids time alignment, significantly improving the real-time performance of T2TA. Simulation results demonstrate that compared to other algorithms, the proposed algorithm achieves higher accuracy, reduced computational time, and better real-time performance in complex scenarios.

Keywords:

track-to-track association; multi-feature track-point fusion similarity; self-adaptive multi-feature similarity truncation matrix; SAS-KNN-DPC

1. Introduction

In distributed multi-sensor systems, track-to-track association (T2TA) plays a fundamental and critical role [1,2,3,4]. With the rapid advancement and widespread adoption of sensor technologies, modern sensors have achieved significant improvements in both measurement accuracy and real-time multi-target information acquisition capability. These enhanced sensors are now extensively employed for critical missions, including surveillance, target detection, and tracking operations [5,6,7,8]. Each sensor is equipped with an independent processing system that accumulates extensive target track data, forming the basis for detection and tracking operations. T2TA is simple under ideal conditions. However, in environments with multiple targets, interference, clutter, measurement noise, or overlapping trajectories, the problem becomes much more complex [9,10,11].

Current approaches for solving the T2TA problem primarily include statistical mathematics-based methods using probabilistic computations and hypothesis testing [12,13,14,15], fuzzy mathematics-based methods, gray theory-based methods, and artificial intelligence-based methods [16]. In [17], a classical algorithm was proposed and has been widely applied to T2TA. An efficient recursive algorithm for solving the generalized S-D assignment problem was presented and its application in localizing an unknown number of emitters via multiple high-frequency direction-finding sensors was implemented. Further, when accounting for diverse interference and other complex operational scenarios, ref. [18] proposed restricted and attenuation memory track correlation algorithms and sequential classic assignment rules and discussed the track correlation mass and multivalency processing methods based on Singer’s and Bar-Shalom’s traditional algorithms. Compared with traditional algorithms, these algorithms exhibited superior performance in complex scenarios, including dense targets, interference noise, and track crossings. To address the impact of measurement biases in multi-sensor scenarios, ref. [19] proposed a reference topology feature (RET)-based multi-radar bias-resistant T2TA algorithm, and an anti-bias T2TA method suitable for more than two radars was developed. In [11], a cost function was constructed based on GCI divergence and a pairwise sensor registration strategy proposed. However, probabilistic approaches often rely on unrealistic assumptions, impractical models, and arbitrary thresholds. Their complexity also grows rapidly as the number of sensors and targets increases. Furthermore, statistical methods require precise assumptions about noise distribution, such as assuming Gaussian white noise. This creates challenges in real-world situations where information is often uncertain.

The advancement of fuzzy mathematics (particularly fuzzy logic) and gray system theory has motivated alternative solutions for T2TA problems. Specifically, fuzzy-logic methods utilize membership functions to rigorously quantify association probabilities, and gray system theory handles incomplete data associations through gray relational analysis. These approaches collectively offer: (1) enhanced expressiveness in modeling uncertainties (e.g., sensor noise, ambiguous observations); and (2) higher computational efficiency than traditional statistical methods (e.g., JPDA or MHT) when processing high-dimensional or sparse data. In [20], a novel gray system theory-based track correlation algorithm was proposed, with its implementation procedures systematically elaborated. This method quantifies the relative correlation degree by analyzing the developmental trends among factors in data sequences, thereby overcoming the limitations of conventional approaches that rely heavily on large samples and assumptions of specific statistical distributions. The influence of the discrimination coefficient was also investigated. Comparative analysis demonstrated that the proposed algorithm achieved superior performance in terms of both accuracy and robustness compared to classical statistical methods, particularly in scenarios involving dense targets, high noise interference, and track crossings. Ref. [21] integrated a density-based fuzzy clustering approach with systematic error compensation, achieving optimized performance under the Bayesian minimum mean square error (MMSE) criterion. However, the reported tracks frequently exhibited temporal misalignment and variable-length track segments, rendering asynchronous track association a persistent challenge. Notably, many existing approaches operate under idealized temporal synchronization assumptions, limiting their practical applicability. Conventional temporal alignment approaches, primarily relying on interpolation and extrapolation techniques, inevitably introduce estimation errors during synthetic point generation. To address the asynchronous challenge, ref. [22] pioneered a segment-wise sequence divergence algorithm for track association. The key innovation lies in its adaptive segmentation rules for uneven-length track sequences, which eliminates the need for temporal alignment while enabling robust association under complex scenarios, including track bifurcation, crossing, and merging. Fuzzy mathematics and gray relational theory demonstrate notable advantages in T2TA, including computational efficiency and model-free operation without requiring precise assumptions, particularly effective in high-uncertainty and small-sample scenarios. However, these methods suffer from critical limitations: heightened sensitivity to manual parameterization, inadequate adaptability, and nonlinearly escalating computational complexity with increasing target dimensions. In contrast, deep learning methods use end-to-end spatiotemporal feature extraction. They can automatically improve association accuracy and noise robustness, even in complex environments. These methods also support real-time processing of large-scale target sets, showing strong adaptability and scalability. Concurrently, the rapid advancement of computer science has facilitated the application of artificial intelligence methodologies to T2TA problems.

Ref. [23] proposed a novel dynamic target network management framework that integrated Dempster–Shafer (DS) evidence theory to achieve adaptive multi-source data fusion and real-time target state evolution. However, this algorithm’s reliance on DS evidence theory necessitates a complete and predefined discernment framework, a requirement often unattainable in practical multi-sensor environments due to dynamic target behaviors and heterogeneous data sources. By synergistically combining a hybrid feature extraction module with a homography estimation module, the novel neural architecture proposed in [24] achieved interpretable track-to-track association through unified radar coordinate transformation while maintaining robust performance. Ref. [25] introduced a neural framework for track association that combined hierarchical feature learning with probabilistic matching. The system first transforms raw positional data into an embedded representation using dense layers with stochastic regularization, capturing nonlinear motion characteristics. In parallel, volumetric convolution operators process spatial target distributions to derive contextual scene information. These dual pathways converge in a unified network that outputs association probabilities through cascaded transformation layers, enabling joint optimization of feature learning and matching confidence. These methods predominantly rely on conventional metrics, failing to quantify the temporal–spatial consistency of association results. Ref. [26] developed an innovative two-module framework consisting of a track fusion module and a track segmentation-mapping module. The track fusion module effectively integrates and extracts multi-track features, significantly reducing dependence on conventional assumptions and manually configured thresholds. The track segmentation-mapping module directly converts track tensors into association matrices, thereby eliminating the computationally expensive process of pairwise track comparisons while maintaining robust performance in complex, high-density tracking scenarios. This approach demonstrates superior computational efficiency and scalability by learning discriminative representations without restrictive constraints and enabling end-to-end trainable association decisions. Nevertheless, this method still requires exhaustive pairwise computations across multiple sensors and operates in an offline processing mode due to its sequential track-sequence implementation.

The clustering approach proposed in [27] provides valuable methodological insights for addressing the T2TA problem and utilizes the clustering algorithm [28,29,30,31,32,33]. However, the applied DBSCAN algorithm demonstrates poor adaptability. Although capable of identifying nonspherical clusters, this method is limited to coordinate-defined data and exhibits high computational costs [34]. In 2014, Alex Rodriguez and Alessandro Laio introduced the density peak clustering (DPC) algorithm, a seminal contribution to the field of machine learning. This algorithm demonstrates remarkable capability in detecting nonspherical clusters and autonomously determining the optimal number of clusters, leading to its widespread adoption across diverse domains with consistently promising results. Although DPC exhibits significant advantages over conventional clustering techniques, it is not without limitations. Firstly, the algorithm’s definitions of local density and distance metrics are oversimplified, impairing its effectiveness when handling complex data structures. Secondly, in scenarios where prior knowledge of the data distribution is insufficient, selecting an appropriate cutoff distance becomes nontrivial. This parameter critically influences the identification of cluster centers, and even minor variations in its value can induce substantial deviations in the clustering outcome. Thirdly, the allocation strategy exhibits high sensitivity and low fault tolerance. Specifically, an erroneous assignment of a single data point to an incorrect cluster can trigger a cascading effect, propagating errors to subsequent points during the allocation process [35,36,37,38]. To address these limitations, ref. [35] proposed the shared-nearest-neighbor-based clustering (SNN-DPC) algorithm, which redefines the local density, distance metric, and allocation strategy in the original DPC framework by incorporating the shared-nearest-neighbor (SNN) concept. This enhanced approach demonstrates superior performance, effectively mitigating the aforementioned drawbacks while retaining the core advantages of DPC. However, it must be acknowledged that despite continuous improvements to the DPC algorithm and its variants, they remain fundamentally constrained by one undeniable limitation: DPC-based methods are predominantly designed for static data analysis (e.g., image segmentation, bioinformatics) rather than real-time applications. More critically, the DPC framework inherently lacks the capability to process spatiotemporally asynchronous data streams—a crucial requirement for dynamic system analysis.

Moreover, nearly all algorithms based on statistical mathematics, fuzzy mathematics, gray theory, and artificial intelligence adopt a pairwise matching approach for all tracks from two sensors. In practical scenarios, this involves two levels of exhaustive comparisons: (1) at the sensor level, requiring pairwise combinations of all sensors, and (2) at the data level, necessitating pairwise matching of tracks reported by each paired sensor combination. These approaches not only significantly increase computational complexity and reduce efficiency but also fail to meet real-time requirements for tracking highly maneuvering targets. Secondly, for asynchronous data processing, avoiding temporal alignment presents a promising strategy to minimize error introduction and accumulation. To address these challenges while leveraging the aforementioned advantages of DPC, we propose the SAS-KNN-DPC algorithm, which integrates clustering techniques. Firstly, this algorithm performs similarity measurement, extracts a step-2 temporal neighborhood affinity matrix based on temporal anchors under a non-registration framework, and derives an adaptive multi-feature similarity truncation matrix through computational procedures. Secondly, it computes the step-2 multi-feature normalized distance matrix to obtain global relative distances, then determines cluster centers by combining KNN-based local density calculation; Finally, assignment association is performed based on K-nearest-neighbor concepts under the metric defined in this work. The main contributions and innovations of this work can be summarized as follows.

A mathematical model is constructed for the T2TA problem, and a step-2 temporal neighborhood affinity matrix under non-registration framework is proposed, which eliminates temporal alignment operations and mitigates the performance degradation caused by asynchronous track points.
An innovative track point similarity metric is proposed, incorporating multidimensional information to improve feature utilization efficiency, significantly enhancing algorithm adaptability.
The SAS-KNN-DPC algorithm is proposed, a novel solution to overcome the pairwise traversal challenge between all tracks reported with real-time association capability.
Experimental simulations were conducted to validate the proposed algorithm, demonstrating its superior accuracy and real-time performance.

The rest of this paper is organized as follows. Section 2 presents the problem formulation and mathematical modeling framework. Section 3 presents formal definitions and the algorithmic procedure. Section 4 presents comprehensive simulation studies validating the proposed algorithm, with Section 5 providing a conclusion.

2. Problem Formulation

The focus of this study was a multi-sensor multi-target scenario. Multi-sensor refers to systems containing more than two sensors, while multi-target indicates scenarios with more than five targets (often significantly more). Consider a specific scenario where multiple sensors simultaneously detect a common area, with each sensor reporting a unique track for each target. A critical constraint is that distinct tracks from the same sensor cannot correspond to the same target. Let the number of sensors be

M

and the S-th sensor (where

S = 1, 2, \dots, M

) reports distinct tracks, forming a track set

Γ_{S} = {T_{S}^{1}, T_{S}^{2}, \dots, T_{S}^{N_{S}}}

:

T_{S}^{i} = [\begin{matrix} ξ_{S}^{i, 1} \\ ξ_{S}^{i, 2} \\ ⋮ \\ ξ_{S}^{i, l_{S}^{i}} \end{matrix}] = [\begin{matrix} t_{S}^{i, 1} & x_{S}^{i, 1} & y_{S}^{i, 1} & θ_{S}^{i, 1} & v_{S}^{i, 1} \\ t_{S}^{i, 2} & x_{S}^{i, 2} & y_{S}^{i, 2} & θ_{S}^{i, 2} & v_{S}^{i, 2} \\ ⋮ \\ t_{S}^{i, l_{S}^{i}} & x_{S}^{i, l_{S}^{i}} & y_{S}^{i, l_{S}^{i}} & θ_{S}^{i, l_{S}^{i}} & v_{S}^{i, l_{S}^{i}} \end{matrix}]

(1)

where

T_{S}^{i}

denotes the i-th track from the S-th sensor, and each track consists of

l_{S}^{i}

data points represented as vector

ξ_{S}^{i, μ} (μ = 1, 2, \dots, l_{S}^{i})

. Each point j in track I contains parameters

t_{S}^{i, j}, x_{S}^{i, j}, y_{S}^{i, j}, θ_{S}^{i, j}, v_{S}^{i, j}

, representing timestamp, x-coordinate, y-coordinate, and heading angle in degrees, with 0° referencing true north and velocity. In practice, asynchronous reporting rates across sensors create challenges for track association. Conventional time-alignment approaches introduce artificial errors that are amplified during mathematical derivation. To address this, our method directly processes asynchronous data without temporal registration, leveraging spatiotemporal clustering for track association. This approach mitigates error propagation while preserving data originality and integrity.

Unlike most existing studies, our track data construction process incorporates both heading angle and velocity. Our rationale is that most current track association algorithms rely solely on static positional data, requiring accumulated track points for position-based association. This accumulation process inherently delays real-time performance. Moreover, relying exclusively on static position data while ignoring dynamic features leads to inadequate information utilization and degraded association performance.

3. SAS-KNN-DPC Algorithm

Before delving into the theoretical details of the algorithm, we provide a brief overview of its operation. It is important to note that although only the trajectories of two targets are illustrated in the figure for clarity (blue dashed lines), the actual application scenario of the proposed algorithm involves many more targets. As shown in Figure 1, points of different colors represent detections from various sensors, with each color corresponding to a different sensor source. For target 1, due to differences in the initial reporting times and reporting rates of each sensor, the data exhibit asynchrony in the temporal domain and positional deviations in the spatial domain caused by measurement errors. Considering the real-time requirements of the proposed algorithm and the definition of the distance metric, the earliest track point we can select for evaluation is the second reported point. For a global and fair comparison, we choose the latest time at which the second point is produced among all sensors as the reference time for data processing. The specific definitions and steps of the algorithm will be detailed in the following sections. For target 2, we treat the black solid points as a special case, where the reporting speed of the corresponding sensor is significantly faster than that of the other sensors. Therefore, we select the third solid point as the association object. This choice is based not on spatial proximity to the second point cluster, but on its reporting time being the closest to the chosen reference time. Furthermore, our algorithm is based on the association between individual measurement points, since each track point is tagged with both a target and a sensor identifier, meaning that every track point uniquely corresponds to one target and one sensor.

The algorithm proposed in this paper is based on sensor data, meaning that it can only be applied to track association in complex environments after target detection has been completed and the number of targets is known.

3.1. Definitions

Definition 1.

Temporal Anchor.

Assuming M sensors simultaneously initiate detection of the same target area at time zero, the first two measurement points of each trajectory are obtained:

T r (2) = [\begin{matrix} Γ_{1} (\leq 2) \\ Γ_{2} (\leq 2) \\ ⋮ \\ Γ_{M} (\leq 2) \end{matrix}] = [\begin{matrix} {T_{1}^{1}, T_{1}^{2}, \dots, T_{1}^{N_{1}}} (\leq 2) \\ {T_{2}^{1}, T_{2}^{2}, \dots, T_{2}^{N_{2}}} (\leq 2) \\ ⋮ \\ {T_{M}^{1}, T_{M}^{2}, \dots, T_{M}^{N_{M}}} (\leq 2) \end{matrix}]

(2)

where:

{T_{S}^{1}, T_{S}^{2}, \dots, T_{S}^{N_{1}}} (\leq 2) = \{\begin{matrix} [\begin{matrix} ξ_{S}^{1, 1} \\ ξ_{S}^{1, 2} \end{matrix}] & [\begin{matrix} ξ_{S}^{2, 1} \\ ξ_{S}^{2, 2} \end{matrix}] & \dots & [\begin{matrix} ξ_{S}^{N_{s}, 1} \\ ξ_{S}^{N_{s}, 2} \end{matrix}] \end{matrix}\}

(3)

The maximum value in Tr(2) is the temporal anchor

t_{c}

.

The concept of the temporal anchor is metaphorically derived from vessel anchoring. While physical anchoring fixes a ship’s spatial position, our temporal anchor establishes a fixed reference point in the time dimension. Subsequently, a step-2 temporal neighborhood affinity matrix is generated based on this temporal anchor.

Definition 2.

Step-2 Temporal Neighborhood Affinity Matrix under Non-Registration Framework, STNAM.

Let M sensors with asynchronous reporting cycles (maximum reporting period

T_{m a x}

), and the STNAM D is defined as:

D = {[\begin{matrix} {(ξ_{1}^{1, α_{1}})}^{T} & {(ξ_{1}^{2, α_{2}})}^{T} & {(ξ_{1}^{3, α_{3}})}^{T} & \dots & {(ξ_{2}^{1, α_{N_{1} + 1}})}^{T} & \dots & {(ξ_{M}^{1, α_{N_{1} + N_{2} + \dots + N_{M - 1} + 1}})}^{T} & \dots \end{matrix}]}_{(\sum_{i}^{M} N_{i}) \times 5}^{T}

(4)

where

0 < α_{i} \leq \max {N_{1}, N_{2}, \dots, N_{M}}

and

| t_{S}^{i, j} - t_{c} | \leq T_{\max}

.

Notably, when multiple timestamped feature values

t_{S}^{i, j_{1}}, t_{S}^{i, j_{2}}

simultaneously satisfy the spatiotemporal constraints in matrix D, only the value closest to the reference time

t_{c}

is retained for inclusion in the STNAM. That is, the STNAM is composed of feature vectors corresponding to the nearest track points at each reporting timestamp within individual trajectories. In this matrix, each row vector represents the features of a selected track point. Distinct rows either originate from different sensors or correspond to different detected targets. The subsequent execution of the SAS-KNN-DPC algorithm operates precisely on such a matrix. It can be seen that the construction of this matrix does not require time registration, and data preprocessing can be performed directly on data reported asynchronously by distributed networked sensors. In essence, the asynchrony problem is equivalent to the problem of missing data, as there may be no eligible points available for association at a certain moment or within a certain time period. Therefore, handling the asynchrony problem is to some extent also a discussion of track association under conditions of missing data.

Definition 3.

K-Nearest Neighbors, KNN.

Given a distance metric

ρ

, a predefined integer

K

, and a dataset X,

\forall x_{i} \in X

, the K-nearest neighbors of

x_{i}

are defined as the top

K

samples in X with the smallest distances to

x_{i}

under metric

ρ

. The set of these points is denoted as:

KNN (x_{i}) = {x_{j} | ρ (x_{i}, x_{j}) \leq ρ_{K}, x_{i} \neq x_{j}, x_{j} \in X}

(5)

where

ρ_{K}

denotes the K-th distance in the sorted distance sequence.

Definition 4.

Multi-Feature Track-Point Fusion Similarity, simi.

Consider M sensors monitoring N targets within the same surveillance region, generating an STNAM D. Given the sensors’ maximum slant range measurement error

r_{e}

and the targets’ maximum possible velocity

v_{\max}

, the multi-feature similarity between any two track points

ξ_{S_{1}}^{i_1, j_1}

and

ξ_{S_{2}}^{i_2, j_2}

(simi) is defined as:

simi (ξ_{S_{1}}^{i_1, j_1}, ξ_{S_{2}}^{i_2, j_2}) = \frac{1}{r_{e} + dist (ξ_{S_{1}}^{i_1, j_1}, ξ_{S_{2}}^{i_2, j_2})} + \frac{1}{360 + | θ_{S_{1}}^{i_1, j_1} - θ_{S_{2}}^{i_2, j_2} |} + \frac{1}{v_{\max} + | v_{S_{1}}^{i_1, j_1} - v_{S_{2}}^{i_2, j_2} |}

(6)

dist (ξ_{S_{1}}^{i_1, j_1}, ξ_{S_{2}}^{i_2, j_2}) = \sqrt{{(x_{S_{1}}^{i_1, j_1} - x_{S_{2}}^{i_2, j_2})}^{2} + {(y_{S_{1}}^{i_1, j_1} - y_{S_{2}}^{i_2, j_2})}^{2}}

(7)

where

ξ_{S_{1}}^{i_1, j_1}

denotes the feature vector of the j_1-th track point from the i_1-th track of sensor S₁, and

ξ_{S_{2}}^{i_2, j_2}

is defined analogously.

Although the definition of simi appears to be a simple summation, the denominator parameter selection is critical. We choose the maximum possible value of each parameter as the parameter in the denominator, preventing disproportionate weighting in simi caused by extreme parameter magnitudes.

Definition 5.

Multi-Feature Track-Point Fusion Similarity Matrix, MTFSM.

Consider M sensors monitoring N targets within the same surveillance region. Calculate the simi between all pairwise feature vectors in STNAM and construct a symmetric matrix of dimension

(\sum_{i = 1}^{M} N_{i}) \times (\sum_{i = 1}^{M} N_{i})

.

\begin{array}{l} S = {[s_{i j}]}_{N_{M} \times N_{M}} \\ = [\begin{matrix} simi (ξ_{1}^{1, α_{1}}, ξ_{1}^{1, α_{1}}) & simi (ξ_{1}^{1, α_{1}}, ξ_{1}^{2, α_{2}}) & \dots & simi (ξ_{1}^{1, α_{1}}, ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}) \\ simi (ξ_{1}^{2, α_{2}}, ξ_{1}^{1, α_{1}}) & simi (ξ_{1}^{2, α_{2}}, ξ_{1}^{2, α_{2}}) & \dots & simi (ξ_{1}^{2, α_{2}}, ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}) \\ ⋮ & ⋮ & ⋮ \\ simi (ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}, ξ_{1}^{1, α_{1}}) & simi (ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}, ξ_{1}^{2, α_{2}}) & \dots & simi (ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}, ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}) \end{matrix}] \end{array}

(8)

Definition 6.

KNN-based Local Density Matrix, LD.

Given an MTFSM, the LD matrix is generated by performing descending sorting on each row and computing the sum of the first K values.

\begin{matrix} L D = {[\begin{matrix} l d_{1} & l d_{2} & \dots & l d_{N_{1} + N_{2} + \dots + N_{M}} \end{matrix}]}^{T} \\ = {[\begin{matrix} ld (ξ_{1}^{1, α_{1}}) & ld (ξ_{1}^{2, α_{2}}) & \dots & ld (ξ_{M}^{N_{M}, α_{N_{1} + N_{2} + \dots + N_{M}}}) \end{matrix}]}^{T} \end{matrix}

(9)

\{\begin{cases} ld (ξ_{S}^{i, j}) = \max \sum_{j = 1}^{N_{1} + N_{2} + \dots + N_{M}} a_{j} s_{i' j} \\ \sum_{j = 1}^{N_{1} + N_{2} + \dots + N_{M}} a_{j} = K \\ a_{j} = 0 or 1 \end{cases}

(10)

Definition 7.

Multi-Feature Step-2 Normalized Distance Matrix, MSNDM.

Given a similarity matrix

S_{0}

derived from a specified metric function, the multi-feature step-2 normalized distance matrix

D_{n}

is defined as:

D_{n} = {(1 - {(S_{0} . / \max (S_{0} (:)))}^{0.1})}^{0.8} + 0.00001

(11)

where

\max (S_{0} (:))

denotes the maximum value in matrix

S_{0}

and the constant 0.0001 prevents the maximum element in S₀ from becoming zero after normalization, ensuring numerical stability for subsequent computations. The larger the value, the farther the distance.

Definition 8.

Global Relative Distance.

Given an STNAM D and its corresponding

L D

, each row (feature vector) in

D

possesses a global relative distance. For the i-th row, its global relative distance

δ_{i}

is defined as:

δ_{i} = \min_{L D (i) < L D (j)} D_{n} (i, j)

(12)

Definition 9.

Decision Value.

Consider M sensors monitoring N targets within the same surveillance region and setting the number of cluster centers K = N, the decision value of the i-th point is computed as:

γ_{i} = l d_{i} \times δ_{i}

(13)

Based on the computed

γ_{i}

values, the largest K values are selected as cluster centers. In practice, each cluster center corresponds to one physical target, and correctly formed clusters represent detection of the same target by different sensors.

3.2. Self-Adaptive Step-2 Truncation Distance

The Hausdorff distance is a classical trajectory similarity metric for periodically batched tracks with accumulated points. The Hausdorff distance is defined as the maximum of all nearest-neighbor distances between two datasets, which captures their global shape similarity without requiring identical point counts or uniform sampling rates. This metric demonstrates particular robustness by maintaining stable similarity evaluation, even with partially missing data or noise contamination. Given two tracks

T_{S}^{i}, T_{S}^{j} (i, j = 1, 2, \dots, N_{S})

, their Hausdorff distance is defined as:

h a u (T_{S}^{i}, T_{S}^{j}) = \max_{ξ_{S}^{i, α} \in T_{S}^{i}} {\min_{ξ_{S}^{j, β} \in T_{S}^{j}} {dist (ξ_{S}^{i, α}, ξ_{S}^{j, β})}}

(14)

where

ξ_{S}^{i, α} = [t_{S}^{i, α}, x_{S}^{i, α}, y_{S}^{i, α}, θ_{S}^{i, α}, v_{S}^{i, α}]

and

ξ_{S}^{i, β} = [t_{S}^{i, β}, x_{S}^{i, β}, y_{S}^{i, β}, θ_{S}^{i, β}, v_{S}^{i, β}]

.

dist (ξ_{S}^{i, α}, ξ_{S}^{j, β})

represents the Euclidean distance between two vectors, with the specific expression given by:

dist (ξ_{S}^{i, α}, ξ_{S}^{j, β}) = \sqrt{{(x_{S}^{i, α} - x_{S}^{j, β})}^{2} + {(y_{S}^{i, α} - y_{S}^{j, β})}^{2}}

(15)

The Hausdorff distance focuses on global shape matching, but cannot effectively capture local similarities or differences between trajectories. This metric is inapplicable for measuring similarity between individual track points. Based on this analysis, we propose a self-adaptive step-2 truncation distance that exclusively considers positional distances between track points while deliberately excluding heading and velocity information. We posit that only when the distance between track points in STNAM is smaller than the self-adaptive step-2 truncation distance can these points qualify for MTFS. Conversely, if the positional distance between two points exceeds the self-adaptive step-2 truncation distance, they are deemed irrelevant with zero similarity.

Since the similarity metric incorporates position, heading, and velocity information of track points, a special case occurs when two tracks are far apart, as shown in Figure 1.

The proposed algorithm operates on STNAM and does not require processing the full set of the whole track points. However, Figure 2 displays additional track points to better illustrate this special case. The three tracks are distinctly represented using black, blue, and red with corresponding labels 1, 2, and 3 respectively. Assuming track 1 and track 2 originate from the same target, they exhibit close spatial proximity, but significant discrepancies in heading and velocity due to measurement errors. Track 1 and track 3 originate from distinct targets, exhibiting significant spatial separation, but minimal differences in heading and velocity parameters. Consequently, according to Equation (6), it becomes challenging to determine whether track 2 or 3 should be associated with track 1. In light of this consideration, this paper proposes the concept of self-adaptive step-2 truncation distance. Track points qualify for both kinematic and positional feature comparison only if their positional distance is smaller than the self-adaptive step-2 truncation distance. Points beyond this threshold are considered uncorrelated, and their spatial relationship is fundamentally invalid. More fundamentally, it is emphasized that the step-truncation distance must be adaptively generated rather than manually specified, enabling autonomous determination of optimal truncation boundary based on the intrinsic distribution characteristics of the reported data.

Definition 10.

Self-Adaptive Step-2 Truncation Distance.

Consider M sensors monitoring N targets within the same surveillance region. In the STNAM, for data originating from the same sensor (denoted as the S-th sensor)

{ξ_{S}^{1, α_{β + 2}}, ξ_{S}^{2, α_{β + 2}}, ξ_{S}^{3, α_{β + 3}}, \dots, ξ_{S}^{N_{S}, α_{β + N_{S}}}} (β = N_{1} + N_{2} + \dots + N_{S - 1})

, obtain the largest distance value

d_{\max_S}

:

d_{\max_S} = \max_{S} \min_{i \neq j} (dist (ξ_{S}^{i, ω}, ξ_{S}^{j, ψ}))

(16)

Definition 11.

Self-Adaptive Multi-Feature Similarity Truncation Matrix, SMSTM.

Given an MTFSM and its corresponding SASTD, for any element simi (a, b),

a, b \in {ξ_{1}^{1, α_{1}}, ξ_{1}^{2, α_{2}}, ξ_{1}^{3, α_{3}}, \dots, ξ_{M}^{1, α_{N_{1} + N_{2} + \dots + N_{M}}}, \dots}

, if:

dist (a, b) > d_{\max_S}

(17)

then simi (a, b) = 0, and the self-adaptive multi-feature similarity truncation matrix F is formed:

F = [\begin{matrix} χ (dist (ξ_{1}^{1, α_{1}}, ξ_{1}^{1, α_{1}}) - d_{\max_S}) s_{11} & χ (dist (ξ_{1}^{1, α_{1}}, ξ_{1}^{2, α_{2}}) - d_{\max_S}) s_{11} & \dots & χ (dist (ξ_{1}^{1, α_{1}}, ξ_{M}^{N_{M}, α_{N_{t}}}) - d_{\max_S}) s_{11} \\ χ (dist (ξ_{1}^{2, α_{2}}, ξ_{1}^{1, α_{1}}) - d_{\max_S}) s_{21} & χ (dist (ξ_{1}^{2, α_{2}}, ξ_{1}^{2, α_{2}}) - d_{\max_S}) s_{22} & \dots & χ (dist (ξ_{1}^{2, α_{2}}, ξ_{M}^{N_{M}, α_{N_{t}}}) - d_{\max_S}) s_{2 N_{t}} \\ ⋮ & ⋮ & ⋮ \\ χ (dist (ξ_{M}^{N_{M}, α_{N_{t}}}, ξ_{1}^{1, α_{1}}) - d_{\max_S}) s_{N_{t} 1} & χ (dist (ξ_{M}^{N_{M}, α_{N_{t}}}, ξ_{1}^{2, α_{2}}) - d_{\max_S}) s_{N_{t} 2} & \dots & χ (dist (ξ_{M}^{N_{M}, α_{N_{t}}}, ξ_{M}^{N_{M}, α_{N_{t}}}) - d_{\max_S}) s_{N_{t} N_{t}} \end{matrix}]

(18)

where

N_{t} = N_{1} + N_{2} + \dots + N_{M}

and:

χ (x) = \{\begin{matrix} 1, x < 0 \\ 0, else \end{matrix}

(19)

The specific implementation steps for the aforementioned definitions can be found in the detailed algorithm framework and pseudocode described later.

3.3. The Framework of SAS-KNN-DPC Algorithm

The structural framework of the SAS-KNN-DPC algorithm, as illustrated in Figure 3, primarily comprises three key operational steps:

Step 1. (Track point similarity metric) For track points reported in real-time within the STNAM D, a similarity metric is applied. This utilizes the MTFSM along with the corresponding SASTD to generate SMSTM F.

Step 2. (Cluster center selection) The SMSTM F generates LD and global relative distance, which are subsequently used to compute decision values for cluster center determination. Each distinct cluster center represents a unique target.

Step 3. (Association assignment) By leveraging the K-nearest neighbor (KNN) relationships within the SMSTM F, each cluster center and its corresponding K-nearest neighbors form a complete cluster. Each cluster represents the collective detection of a single target across all sensors.

3.4. Similarity Metric

Nearly all existing association algorithms operate by exhaustively pairing all tracks between two sensors in a pairwise manner. In practical multi-sensor scenarios, this approach entails two levels of combinatorial complexity: first, all sensors must be paired combinatorially; and second, within each sensor pair, all reported tracks must again be matched pairwise. This dual-layer exhaustive search not only drastically increases computational overhead and reduces efficiency but also fails to meet real-time requirements. Furthermore, the core challenge of track association lies in defining the appropriate distance metric between tracks. Current metrics almost exclusively rely on accumulated track sequences, meaning that a period of track-point accumulation is required to enable inter-track distance measurement before association occurs. This inherent latency fundamentally conflicts with real-time processing demands. Our proposed algorithm addresses this problem by performing distance calculations directly on real-time track reports, eliminating the need for prolonged track accumulation prior to metric evaluation.

The whole similarity metric proposed in this paper utilizes two complementary distance metrics to achieve efficient track association. One calculates

d_{\max_S}

using purely positional information, while the other implements a multi-feature similarity measure incorporating kinematic attributes and positional data.

As previously discussed, we posit that in practical scenarios, when the positional distance between two track points exceeds

d_{\max_S}

, the probability of them originating from the same target becomes statistically negligible. Consequently, further computation of their MTFS can be safely omitted.

Step 1. For real-time track reporting, temporal anchor t_c is extracted and STNAM is constructed. In STNAM, each row vector represents a sensor’s track-point observation of a target, and each column corresponds to a distinct feature parameter of the track points.

Step 2. The MTFS between each row is computed according to Equation (6), yielding MTFSM.

Step 3. Calculate the minimum Euclidean distance between any two track points reported by the i-th sensor. Then iterate through all sensors and select the maximum value among these minimum distances as

d_{\max_S}

.

Step 4. Using the

d_{\max_S}

, process matrix MTFSM by setting elements satisfying Equation (17) to zero, thereby generating matrix SMSTM F.

ξ_{S}^{i, j'}

denotes the j’-th track point of the i-th track reported by the S-th sensor, where this point is selected as the one with a timestamp closest to the temporal anchor t_c among all points in this track. The operation

sum (t e m p, 2)

computes row-wise summation of the temp.

3.5. Cluster Center Selection and Association Assignment

The detailed steps are as follows:

Line 11 implies not only assigning the maximum value to the cluster but also allocating the corresponding label (indicating which target from which sensor) to the cluster. Under normal circumstances, these targets are distinct from each other. Line 12 identifies the nearest neighbors corresponding to each cluster center in the F and assigns them to their respective cluster centers. Under nominal conditions, each cluster corresponds to a distinct target, with the cluster’s data originating from multiple sensors.

3.6. Time Complexity

We assume that M sensors are detecting N targets within the same surveillance region.

According to the previous section, the detailed analysis of Algorithm 1 is as follows.

Lines 2–7: For each track, the point closest to temporal anchor t_c must be extracted, which involves conditional logic (if-statements) and comparison operations, and complexity is O(MN).

Lines 8–13: Extract the track point closest to the anchored timestamp t_c from each track, requiring traversal of tracks. Thus, the complexity is O(MN).

Lines 14–18: Compute pairwise vector for elements in D using MTFS and the positional distance metric, resulting in O((MN)²).

Lines 19: The

d_{\max_S}

requires computing the row-wise summation of the matrix, and the complexity is O((MN)²).

Lines 20–26: Compute SMSTM. After satisfying the if-condition, each element in SMSTM requires value assignment, resulting in O((MN)²).

Algorithm 1 Compute SMSTM

Input: 1. The number of sensors

2. The number of targets

3. The second track points generated by each radar

Output: SMSTM

1 : C \leftarrow 0, D \leftarrow 0, S \leftarrow 0, label \leftarrow 0, t_{c} \leftarrow 0

2 : for each t_{S}^{i, j}

in T r (2)

do

3 : if t_{c} \leq t_{S}^{i, j}

then

4 : t_{c} = t_{S}^{i, j}

and S' = S

and i' = i

5 : label = {S', i', 2}

6: end if

7: end for

8: for each sensor

S

do

9 : for each T_{S}^{i}

in Γ_{S}

do

10 : if | t_{S}^{i, j'} - t_{c} | is the maximum among all the | t_{S}^{i, j} - t_{c} | (1 \leq j \leq N_{S}^{i})

then

11 : D \leftarrow [D; ξ_{S}^{i, j'}]

12: end for

13: end for

14: for each i and j in row(

D

) do

15 : S (i, j) \leftarrow simi (D (i), D (j))

and temp (i, j) \leftarrow sqrt (D (i), D (j))

16: end for

17: end for

18 : d_{\max_S} \leftarrow \min (sum (temp, 2))

19: for each i in row(

S

) do

20: for each j in col(

S

) do

21 : if sqrt (D (i), D (j)) \geq d_{\max_S}

then

22 : F (i, j) \leftarrow 0

23: end if

24: end for

25: end for

According to the previous section, the detailed analysis of Algorithm 2 is as follows.

Lines 2–5: Compute LD and the complexity is O(MN).

Lines 6: There are (MN)² elements in SMSTM, and the complexity is O(MN).

Lines 7–9: Compute the relative distance matrix by traversing each element in

D_{n}

, yielding a time complexity of O((MN)²).

Lines 10–12: All three steps involve numerical computations or comparisons, and the complexity is O((MN)²).

Algorithm 2 Cluster Formation

Input: 1. The number of sensors

2. The number of targets

3. SMSTM

Output: Cluster

1 : C \leftarrow 0, K \leftarrow num_r, temp_1 \leftarrow 0, L D \leftarrow 0, D \leftarrow 0, Θ \leftarrow 0, H \leftarrow 0

2: for each i in row(

F

) do

3 : temp_1 \leftarrow

sort F (:, K)

in descending order

4 : L D (i) \leftarrow sum (temp_1)

5: end for

6 : D_{n} = {(1 - {(F . / \max (F (:)))}^{0 . 1})}^{0 . 8} + 0 . 00001

7 : for each i in row (D_{n}

) do

8 : Θ (i) \leftarrow

\min_{ρ_{j} > ρ_{i}} (D_{n} (i, j))

with j = 1, 2, \dots, col (D_{n})

9: end for

10 : H \leftarrow Θ . * L D

11 : C \leftarrow

select the top

K

largest values

12 : C \leftarrow

KNN based on

F

3.7. Space Complexity

STNAM contains features of track points with O(MN) space complexity. The system involves M heterogeneous sensors monitoring N targets within a common surveillance region, where each sensor generates tracks of varying lengths. For each trajectory, select the track point with the reporting timestamp closest to t_c to participate in clustering operations. The space complexity of matrices

S, F, D_{n}

is O((MN)²), while that of

L D, Θ, H, C

is O(MN).

4. Experiments

4.1. Simulation Scene Setting

The experimental validation was conducted using simulated spatiotemporal track data. We assume multiple sensors simultaneously detect all targets within a common surveillance region and generate corresponding measurements. Real-time association is performed on these track data. The experimental design process and results are discussed in detail. The simulation environment is a 64-bit Windows 10 operating system and the MATLAB R2024 software platform. The processor is an Intel(R) Core(TM) i7-7700HQ with a main frequency of 2.80 GHz.

4.1.1. Scenario Data Visualization Analysis

The trajectory generation program published by Piciarelli et al. has been widely used in simulation studies for trajectory classification algorithms [39]. This program can generate random anomalous trajectories, set the number of trajectory clusters, and configure trajectory distribution parameters according to simulation requirements, including the number of information points per trajectory and the quantity of anomalous trajectories [29]. It serves as an effective validation tool for both anomaly detection and multi-trajectory clustering algorithms. In this section, based on this trajectory generation program, we extracted and modified part of its data while adding parameters such as heading, speed, and operational range to the information points, aiming to create irregular and unpredictable flight trajectories. Figure 4 shows a schematic diagram of 20 tracks.

The specific information of these tracks is shown below. Twenty tracks were generated within the designated area

x \in [0 km, 50 km], y \in [0 km, 50 km]

, with the initial headings of the targets following a uniform distribution between 0 and 360 degrees. The target velocities follow a uniform distribution ranging from 400 m/s to 600 m/s and the maximum turning rate is 30 degrees per second. It is essential to specify the operational area of tracks because during simulation verification, the impact of sensor slant range measurement errors and angular measurement errors on track association results varies with the target distance. Under identical sensor measurement errors, the detection error increases with target distance, as illustrated in Figure 5.

Black hollow points represent the true positions of targets, while blue solid points denote the measured positions. Target 1 and target 2 are aligned along the same azimuth angle of the sensor, but differ in range. Under identical measurement errors, the positional deviations between true and measured positions are

Δ d_{1}, Δ d_{2}

for the two targets respectively. As expected, target 2, at greater sensor range, exhibits larger measurement deviation. This illustrates the measurement deviation of a single target relative to a single sensor. For multi-target multi-sensor scenarios, the operational area boundaries must be explicitly defined in simulations and the impact of the target distance will be studied further in the subsequent section.

The track association problem fundamentally processes sequences of track points. For visualization purposes, Figure 6 demonstrates the simulation results incorporating sensor detection effects. The circular plots in the diagram represent simulated track points with measurement errors added. The simulation configures ten radar coordinates uniformly distributed along the x and y axes.

4.1.2. Experimental Design

Association accuracy and real-time performance are the two most critical advantages of the proposed track association algorithm, which also constitute the primary focus of simulation validation. We apply the proposed algorithm to this simulated tracks, and the details of the experimental projects and design concepts are outlined as follows.

Set the situation in Figure 6 as scenario 1, with the number of targets, initial headings, target velocities, and turn rates consistent with the Figure 4 scenario. The number of radars is set to 10, uniformly distributed along the x and y axes. The total observation time is 5 s. Specific parameters are listed in Table 1.

(1): To validate the association accuracy of the proposed algorithm, comparative experiments with other methods are conducted under scenario 1, while varying error parameters to rigorously evaluate the algorithm’s performance.
(2): To validate the real-time performance of the proposed algorithm, scenario 1 is modified by varying the number of targets from 10 to 100 (in increments of 10). Comparative experiments are conducted against other existing algorithms.
(3): To validate sensor scalability and operational applicability, scenario 2 is configured by dynamically adjusting the number of sensors (baseline: 10) in scenario 1. The sensor count is varied from 6 to 30.
(4): To validate the impact of target distance on the accuracy of the algorithm proposed, an analysis is conducted by varying the target activity range.

4.2. Experiment Based on Scenario 1

To demonstrate the feasibility and superiority of the proposed algorithm, comparative experiments are conducted under scenario 1, focusing on two key factors: association accuracy and real-time performance. The proposed method is evaluated against five novel algorithms: DBSCAN-based [27], PND [40], uncertainty-based [41], TF-TS [26], and CNN-LSTM-based algorithms [42].

To ensure the reliability and superiority of the experimental results, each algorithm was tested 1000 times (these 1000 experiments were divided into groups of 100 experiments each, and the association accuracy is calculated for every group). The track scenarios were generated using the rng( ) random seed function in MATLAB, allowing for precise control of a single variable while maintaining randomness in the distribution of scenarios.

In randomly generated trajectories, there exist overlapping/crossing trajectories and those in close proximity, which significantly impact association accuracy. However, most randomly generated tracks are non-intersecting and well separated. If only the mean association accuracy is used as the evaluation metric, it would fail to comprehensively reflect the effectiveness and robustness of the track association algorithm. Therefore, this study establishes the following four evaluation parameters for systematic comparative analysis:

Worst association accuracy, reflecting the algorithm’s performance lower bound in extreme scenarios.
Best association accuracy, demonstrating the algorithm’s potential under ideal conditions.
Mean value, representing the overall average performance.
Variance value, quantifying stability and consistency.

The variance indicates the stability of different algorithms when dealing with various random tracks, and the mean value represents the general performance of each algorithm. As shown in Table 2, in addition to the algorithm proposed, TF-TS and CNN-LSTM-based algorithm demonstrate superior performance in terms of both association accuracy and stability. The former innovatively integrates a track fusion module and a track segmentation mapping module, autonomously extracting and synthesizing track features without reliance on empirical thresholds or heuristic assumptions. The latter employs a CNN-LSTM hybrid architecture, where the LSTM component specializes in capturing long-term temporal dependencies inherent in sequential data, while the CNN module efficiently processes localized short-term patterns. This synergistic combination enables comprehensive utilization of both long-range and short-range track features, with the final association decision generated through an integrated classifier. These two neural network-based AI algorithms exhibit strong adaptability, delivering better association accuracy and robust stability in performance. The DBSCAN-based algorithm is primarily designed for mechanical circular scanning scenarios and demonstrates limited effectiveness in handling asynchronous data processing tasks. The PND (pseudo-nearest-neighbor distance) method employs an innovative distance metric. This metric definition demonstrates suboptimal association performance and unstable behavior when applied to targets with proximity or intersecting tracks. The uncertainty-based algorithmic model is constructed under the idealized assumption of completed track point preprocessing, exhibiting limited adaptability to highly maneuvering target scenarios with notably higher variance and compromised stability. The proposed algorithm distinguishes itself from comparative methods by leveraging only the initial single-point trajectory measurements from each sensor for real-time data association. This approach circumvents the need for prolonged data accumulation to form complete tracks, thereby achieving more efficient information utilization while minimizing the accumulation of measurement errors. Consequently, the algorithm demonstrates significantly enhanced stability and accuracy in operational scenarios.

Figure 7 shows a comparative analysis of the accuracy achieved by different algorithms under identical scenarios, with varying angular measurement errors from 0° to 0.5°. The experimental results show that when the angular measurement error is small, the TF-TS, CNN-LSTM-based, and SAS-KNN-DPC algorithms all perform well with comparable association accuracy rates. However, as the angular measurement error increases, the accuracy of all algorithms declines, with a particularly noticeable drop when the error reaches 0.5°. Overall, the proposed SAS-KNN-DPC algorithm demonstrates superior accuracy compared to other methods in this study. However, it should be noted that the inherent randomness in target track generation may occasionally lead to counterintuitive association results. Specifically, in scenarios with randomly generated intersecting or adjacent tracks, smaller measurement errors may paradoxically yield lower association accuracy compared to cases with larger errors. This phenomenon, however, does not negate the fundamental influence of random measurement errors on association performance.

Similarly, Figure 8 compares the accuracy of different algorithms under identical scenario with varying distance measurement errors, and the angular measurement error is fixed at 0.3°. The slant range measurement error was systematically varied from 10 to 370 m. It can be observed that when the errors are small, the TF-TS, CNN-LSTM-based, and SAS-KNN-DPC algorithms all achieve relatively high accuracy.

The asynchronous problem of radar data association, caused by differences in reporting cycles and initial data timestamps, fundamentally impacts track matching by introducing time discrepancies. These timing variations lead to significant changes in target information and degraded association accuracy. The experimental configuration sets the shortest data reporting cycle among the ten radars to 0.1 s, while the longest reporting cycle varies uniformly between 1 s and 10 s. The parameter k is defined as the difference between the maximum and minimum reporting cycles. Each radar’s reporting cycle is then assigned a value uniformly spaced between these minimum and maximum bounds. Comparative results are presented in Figure 9.

As shown in Figure 9, as the value of k increases—indicating greater disparity in data reporting cycles among sensors—the asynchronous data reporting problem becomes more severe. Consequently, the accuracy rates of all algorithms exhibit an overall declining trend, which aligns with intuitive expectations. The DBSCAN-based algorithm utilizes the DBSCAN clustering method, which shares a similar innovative starting point with our proposed algorithm. Both approaches aim to mitigate the impact of temporal discrepancies in asynchronous data association to some extent through clustering techniques. However, this algorithm fails to consider the physical motion characteristics of targets, such as their heading direction and speed—crucial pieces of information for accurate association. Moreover, the DBSCAN clustering method has several inherent limitations: it heavily depends on parameter settings (making it less adaptive), performs poorly with manifold structures, and struggles with complex non-convex clusters. These limitations constitute significant factors that restrict the algorithm’s potential for further accuracy improvement. The PND algorithm focuses on addressing the track asynchrony issue. When the ratio of data reporting cycles between two sensors is less than 4, it achieves near-perfect accuracy (approaching 100%). However, when the number of sensors increases and asynchrony occurs among multiple sensors (not just between two or three), the algorithm’s accuracy drops significantly. In multi-sensor and multi-target scenarios, this method requires pairwise matching between both sensors and tracks, which substantially increases processing time for asynchronous data. As a result, its overall performance is poor. The TF-TS and CNN-LSTM-based algorithms leverage neural networks, offering adaptability and flexibility to extract key information and features from track sequences. These approaches demonstrate superior performance in handling asynchronous problems compared to the traditional algorithms; however, when significant asynchrony-induced temporal discrepancies exist in the data, all these algorithms demonstrate notable limitations in accuracy. The proposed SAS-KNN-DPC algorithm also exhibits adaptive capabilities because of the self-adaptive step-2 truncation distance proposed. More importantly, it can mitigate the impact of excessive asynchrony-induced data discrepancies to some extent through STNAM, and additionally incorporates target motion characteristics and imposes almost no requirements on temporal alignment, enabling it to overcome the challenges posed by temporal mismatches in asynchronous data. Consequently, it demonstrates optimal overall performance.

4.3. Real-Time Performance Verification

The real-time performance verification of the proposed algorithm encompasses two key processes: first, evaluating the time complexity of the algorithm itself while excluding track generation time; and second, assessing the complete end-to-end processing duration from data reporting to result output, which includes the actual data generation time. Currently, almost all track association algorithms process data based on sequences of track points. In other words, they first generate complete track data, and then determine whether two given tracks originate from the same target. The more track points are generated, the more operational data and references become available for track association, which may lead to higher accuracy rates in the association process. However, in the real-time simulation verification experiments, it was not feasible to reasonably configure the number of track points due to various constraints. From an intuitive perspective, the method proposed in this paper requires fewer reporting track points and shorter data acquisition time (for instance, generating two track points is faster than producing five track points). Therefore, in this context, only comparing the algorithms’ inherent computational time consumption yields meaningful results.

In scenario 1, excluding the time required for track data generation, we designate the moment when the target track data are ready for processing as the initial timestamp (t = 0) for algorithm verification, and proceed to analyze the algorithm’s execution time, as illustrated in Figure 10.

In scenario 1, starting the analysis with 10 targets, we observe that when the number of targets remains below 40, all algorithms except AI-based methods exhibit relatively low computational overhead. When the number of targets approaches or exceeds 40, the running time of all algorithms increases. Notably, our proposed algorithm consistently demonstrates the lowest computational overhead under these conditions. The uncertainty-based method consumes a certain amount of computation time for noise model construction and it also employs conventional mathematical statistical approaches, which incur computational overhead. The higher computational time of the PND algorithm is attributed to its reliance on pseudo-nearest-neighbor distance for track distance measurement. This method requires calculating the minimum distances between all trajectory points in both tracks and then selecting the maximum value from these computations, resulting in a relatively large computational load and longer processing time. When evaluating neural network-based algorithms, the computational cost must account for the substantial training time expenditure. Consequently, even with a limited number of targets, these algorithms exhibit prolonged execution durations due to their inherent training overhead. However, from a holistic perspective, the algorithm demonstrates both time-efficient and stable performance characteristics. To be specific, TF-TS transforms the track tensor of the tracks into association matrix directly, saving much consumption time.

From a global perspective, whether the target count is large or small, the proposed algorithm consistently maintains low running time. Further, when factoring in data reporting time, the advantages of the proposed algorithm become even more apparent, with STNAM processing.

4.4. Impact of Sensor Quantity

In our algorithm, multi-sensor scenarios are defined not as two or three sensors, but as significantly larger quantities. This section investigates the impact of varying sensor counts on algorithmic performance. By increasing the number of sensors in scenario 1 from 6 to 30 in linearly spaced increments, we analyze the effects on both running time and association accuracy.

Figure 11 and Figure 12 demonstrate that increasing the number of sensors leads to increased running time and reduced algorithm accuracy. The increase in sensor quantity means that given the same number of targets, more track points are generated. This results in a larger dataset for real-time track association calculations, leading to correspondingly increased running time. When the number of targets is fewer than 10, an increase in sensor quantity may not necessarily lead to a decline in association accuracy, as shown in Figure 12. There are two main reasons for this. First, with a smaller number of targets and larger activity areas, the spacing between targets is sufficiently wide. Even with significant sensor measurement errors, the detected track points after clustering cannot be misassigned to other targets. Second, the tracks are randomly generated. When two randomly generated tracks overlap and intersect while having nearly identical headings and speeds, even the smallest measurement errors cannot prevent incorrect associations. When the number of targets exceeds 10, it becomes clearly visible that as the target count increases, the association accuracy declines. Moreover, the more sensors there are, the worse the algorithm performs.

4.5. Impact of the Target Distance

As discussed above, the essence of the impact of target distance on track association lies in its amplification and attenuation of sensor errors due to changes in distance. It is intuitive that the farther the target distance, the greater the impact of the errors on the measurement, as shown in Figure 5. We define the distance from the center of the target activity area to the origin of the two-dimensional coordinate system as the target distance d_t and the area of the generated region remains constant. The range of d_t is from 0 to

250 \sqrt{2} \approx 353 . 5534 km

, with values taken uniformly at

25 \sqrt{2} \approx 35 . 3553 km

intervals. In scenario 1, the original parameters are kept fixed except measurement errors, and we consider the association accuracy under different errors.

As shown in Figure 13, each subfigure shows the impact of different azimuth measurement errors and target distance on accuracy under the corresponding slant range measurement error, and the values of these errors are representative. It can be seen that when the target distance is greater, the measurement errors are amplified, and the decrease in accuracy is particularly noticeable. In long-range detection, the relationship between measurement errors and association accuracy becomes significantly weaker and may even exhibit nonlinear or inversely correlated trends due to geometric error amplification, target sparsity, and resolution limits. Therefore, for the algorithm proposed in this paper, the closer the overall activity area of the target is to the sensors, the better the detection performance, the smaller the spatial error, and the higher the association accuracy.

5. Conclusions

T2TA is a fundamental and critical issue in multi-sensor multi-target information fusion. Particularly in complex electromagnetic environments, achieving real-time, non-exhaustive track association for multi-target data originating from multiple sensors presents significant challenges.

This paper proposes a track association method based on the SAS-KNN-DPC algorithm, which enables real-time clustering in a non-exhaustive manner. By introducing a step-2 temporal neighborhood affinity matrix under a non-registration framework, the algorithm eliminates the need for temporal registration, significantly reducing error propagation while maintaining real-time performance. Multi-feature track-point fusion similarity integrating positional and kinematic data and its corresponding matrix are proposed, enhancing feature utilization efficiency. The self-adaptive step-2 truncation distance is also proposed to adjust to data distribution dynamically, improving robustness in dense and noisy environments. With the self-adaptive multi-feature similarity truncation matrix proposed, cluster centers are determined and assignment association is completed, achieving simultaneous clustering and real-time track association. Experimental results demonstrate that SAS-KNN-DPC outperforms existing methods in both association accuracy and computational efficiency, as discussed in the Experiments section.

In the next phase of research, we will focus on two main tasks. First, we will test and improve the algorithm using more extensive spatiotemporal track datasets. Second, building upon this research and inspired by stability-preserving strategies, we will develop new real-time online track association algorithms and put them into practice.

Author Contributions

Conceptualization, X.G., Z.H. and X.Y.; methodology and implementation, Z.H. and X.Y.; software, Z.H.; validation, X.G. and X.Y.; writing—original draft preparation, Z.H.; writing—review and editing, X.G. and X.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Taishan Scholars Program of Shandong Province, China (grant ts201712072).

Data Availability Statement

The track generation method has been described in the experimental section and the datasets were computer-generated.

Acknowledgments

The authors wish to express their sincere appreciation to the anonymous reviewers for their constructive and insightful suggestions, which have greatly contributed to the clarity and quality of this work.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

STNAM	step-2 temporal neighborhood affinity matrix under non-registration framework
KNN	K-nearest neighbors
simi	multi-feature track-point fusion similarity
MTFSM	multi-feature track-point fusion similarity matrix
LD	KNN-based local density matrix
MSNDM	multi-feature step-2 normalized distance matrix
SMSTM	self-adaptive multi-feature similarity truncation matrix

References

Zhu, H.; Wang, W.; Wang, C. Robust track-to-track association in the presence of sensor biases and missed detections. Inf. Fusion 2016, 27, 33–40. [Google Scholar] [CrossRef]
Zhu, H.; Leung, H.; Yuen, K.-V. A joint data association, registration, and fusion approach for distributed tracking. Inf. Sci. 2015, 324, 186–196. [Google Scholar] [CrossRef]
Sinha, A.; Ding, Z.; Kirubarajan, T.; Farooq, M. Track Quality Based Multitarget Tracking Approach for Global Nearest-Neighbor Association. IEEE Trans. Aerosp. Electron. Syst. 2012, 48, 1179–1191. [Google Scholar] [CrossRef]
Maurer, D.E. Information handover for track-to-track correlation. Inf. Fusion 2003, 4, 281–295. [Google Scholar] [CrossRef]
Yu, Y.; Hou, Q.; Zhang, W.; Zhang, J. A Sequential Two-Stage Track-to-Track Association Method in Asynchronous Bearings-Only Sensor Networks for Aerial Targets Surveillance. Sensors 2019, 19, 3185. [Google Scholar] [CrossRef] [PubMed]
Wang, X.; Xu, Z.; Gou, X. Consensus-Based Track Association with Multistatic Sensors under a Nested Probabilistic-Numerical Linguistic Environment. Sensors 2019, 19, 1381. [Google Scholar] [CrossRef]
Liu, Y.; Yao, L.; Xiong, W.; Zhou, Z. GF-4 Satellite and Automatic Identification System Data Fusion for Ship Tracking. IEEE Trans. Geosci. Remote Sens. Lett. 2019, 16, 281–285. [Google Scholar] [CrossRef]
Li, B.; Dong, Y.; Huang, G.; Chen, X.; Guan, J. Radar/ESM anti-bias track association algorithm based on track distance vector detection. J. Eng. 2019, 21, 7563–7567. [Google Scholar] [CrossRef]
Ping, R.; Munari, A.; Petrova, M. Performance Tradeoffs of Joint Radar-communication Networks. IEEE Wireless Commun. Lett. 2019, 8, 165–168. [Google Scholar]
Hachour, S.; Delmotte, F.; Mercier, D.; Lefèvre, E. Object tracking and credal classification with kinematic data in a multi-target context. Inf. Fusion 2014, 20, 174–188. [Google Scholar] [CrossRef]
Chai, L.; Yi, W.; Kong, L. Joint sensor registration and multi-object tracking with PHD filter in distributed multi-sensor networks. Signal Process. 2023, 206, 108909. [Google Scholar] [CrossRef]
Wang, J.; Zeng, Y.; Wei, S.; Wei, Z.; Wu, Q.; Savaria, Y. Multi-sensor track-to-track association and spatial registration algorithm under incomplete measurements. IEEE Trans. Signal Process. 2021, 69, 3337–3350. [Google Scholar] [CrossRef]
Bar Shalom, Y. On the sequential track correlation algorithm in a multisensor data fusion system. IEEE Trans. Aerosp. Electron. Syst. 2008, 44, 396. [Google Scholar] [CrossRef]
Singer, R.A.; Kanyuck, A.J. Computer control of multiple site track correlation. Automatica 1971, 7, 455–463. [Google Scholar] [CrossRef]
Kanyuck, A.J.; Singer, R.A. Correlation of Multiple-Site Track Data. IEEE Trans. Aerosp. Electron. Syst. 1970, AES-6, 180–187. [Google Scholar] [CrossRef]
He, Y.; Wang, G.; Guan, X. Information Fusion Theory with Applications; Publishing House of Electronics Industry: Beijing, China, 2010; pp. 178–210. [Google Scholar]
Deb, S.; Yeddanapudi, M. A generalized S-D assignment algorithm for multisensor-multitarget state estimation. IEEE Trans. Aerosp. Electron. Syst. 1997, 33, 523–538. [Google Scholar]
You, H.; Zhang, J. New track correlation algorithms in a multisensor data fusion system. IEEE Trans. Aerosp. Electron. Syst. 2006, 42, 1359–1371. [Google Scholar] [CrossRef]
Qi, L.; He, Y.; Dong, K.; Liu, J. Multi-radar anti-bias track association based on the reference topology feature. IET Radar Sonar Navig. 2018, 12, 366–372. [Google Scholar] [CrossRef]
Guan, X.; He, Y.; Yi, X. Gray track-to-track correlation algorithm for distributed multitarget tracking system. Signal Process. 2006, 86, 3448–3455. [Google Scholar]
Mousa, N.; Saeid, P.; Leyli, M. An Adaptive Density-Based Fuzzy Clustering Track Association for Distributed Tracking System. IEEE Access 2019, 7, 135972–135981. [Google Scholar]
Yi, X.; Du, J. Asynchronous track-to-track association algorithm based on discrete degree of segmented sequence. Acta Aeronaut. Astronaut. Sin. 2020, 41, 265. [Google Scholar]
Wu, L.; Xu, Y.; Wang, F.; Lu, Q.; Hu, M. Real-time Ship Track Association: A Benchmark and a Network-Based Methodl. In Proceedings of the 2019 22th International Conference on Information Fusion (FUSION), Ottawa, ON, Canada, 2–5 July 2019. [Google Scholar]
Xiong, W.; Xu, P.; Cui, Y. Unsupervised and interpretable track-to-track association based on homography estimation of radar bias. IET Radar Sonar Navig. 2024, 18, 294–307. [Google Scholar]
Jin, B.; Tang, Y.; Zhang, Z.; Lian, Z.; Wang, B. Radar and AIS Track Association Integrated Track and Scene Features Through Deep Learning. IEEE Sens. J. 2023, 23, 8001–8009. [Google Scholar] [CrossRef]
Xiong, W.; Xu, P.L.; Cui, Y.Q.; Xiong, Z.Y.; Lv, Y.F. TF-TS: Track-to-Track association via track fusion and track segmentation in distributed multi-sensor systems. Expert Syst. Appl. 2024, 255, 124540. [Google Scholar] [CrossRef]
He, S.; Shin, H.; Tsourdos, A. Multi-Sensor Multi-Target Tracking Using Domain Knowledge and Clustering. IEEE Sens. J. 2018, 18, 8074–8084. [Google Scholar] [CrossRef]
Dai, Q.; Xiong, Z.; Xie, J.; Wang, X.; Zhang, Y.; Jia, X. A novel clustering algorithm based on the natural reverse nearest neighbor structure. Inf. Syst. 2019, 84, 1–16. [Google Scholar] [CrossRef]
Jiangrevise, Q.W.; Liu, Y.; Ding, Z.R.; Sun, S. Behavior pattern mining based on spatiotemporal trajectory multidimensional information fusion. Chin. J. Aeronaut. 2023, 36, 387–399. [Google Scholar]
Kiersztyn, A.; Pylak, D.; Horodelski, M.; Kiersztyn, K.; Urbanovich, P. Random clustering-based outlier detector. Inf. Sci. 2024, 667, 120498. [Google Scholar] [CrossRef]
Kim, J.; Lee, H.; Ko, Y.M. Constrained Density-Based Spatial Clustering of Applications with Noise (DBSCAN) using hyperparameter optimization. Knowl.-Based Syst. 2024, 303, 112436. [Google Scholar] [CrossRef]
Nurfalah, A.; Supangkat, S.H.; Mulyana, E. Effective & near real-time track-to-track association for large sensor data in Maritime Tactical Data System. ICT Express 2024, 10, 312–319. [Google Scholar]
García-García, J.C.; García-Ródenas, R. A methodology for automatic parameter-tuning and center selection in density-peak clustering methods. Soft Comput. 2021, 25, 1543–1561. [Google Scholar] [CrossRef]
Rodriguez, A.; Laio, A. Clustering by fast search and find of density peaks. Science 2014, 344, 1492–1496. [Google Scholar] [CrossRef]
Liu, R.; Wang, H.; Yu, X.M. Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf. Sci. 2018, 450, 200–226. [Google Scholar] [CrossRef]
Zhang, X.L.; Liu, X.Y.; Han, Z.; Zhao, Y.Z. Neural membrane computing models with migrating rules for density peak clustering. Inf. Process. Manag. 2025, 62, 104031. [Google Scholar] [CrossRef]
Preeti, N.; Deep, K. Density Peak Clustering Using Grey Wolf Optimization Approach. J. Classif. 2024, 41, 338–370. [Google Scholar] [CrossRef]
Lu, J.Y.; Shao, J.M. Unsupervised contrastive clustering via density cluster representative combination. Expert Syst. Appl. 2025, 266, 125967. [Google Scholar] [CrossRef]
Piciarelli, C.; Micheloni, C.; Foresti, G.L. Trajectory-based anomalous event detection. IEEE Trans. Circuits Syst. Video Technol. 2008, 18, 1544–1554. [Google Scholar] [CrossRef]
Chen, S.Y.; Zhang, H.W.; Ma, J.T.; Xie, H. Asynchronous Track-to-Track Association Based on Pseudo Nearest Neighbor Distance for Distributed Networked Radar System. Electronics 2023, 18, 1794. [Google Scholar] [CrossRef]
Qiu, J.J.; Cai, Y.C.; Li, H.; Han, Y. A New Track Association Model Based on Uncertainty Distribution. IEEE Access 2023, 11, 1. [Google Scholar]
Chen, J.Y.; Wang, X.H.; Chen, X. Track Correlation Algorithm Based on CNN-LSTM for Swarm Targets. J. Syst. Eng. Electron. 2024, 35, 417–429. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of selection of processing points.

Figure 2. Schematic diagram of special cases.

Figure 3. Framework of the SAS-KNN-DPC algorithm.

Figure 4. Plot of 20 tracks.

Figure 5. Deviation effect diagram.

Figure 6. Plot of simulated track points.

Figure 7. Performance comparison under azimuth-dependent measurement errors.

Figure 8. Performance comparison under slant range measurement errors.

Figure 9. Comparison diagram of algorithms under asynchronous conditions.

Figure 10. Algorithm time consumption comparison.

Figure 11. Running time comparison.

Figure 12. Accuracy comparison.

Figure 13. The impact of target distance on accuracy under different measurement errors: (a) slant range measurement error of 50 km; (b) slant range measurement error of 75 km; (c) slant range measurement error of 100 km.

Table 1. The parameter settings of radars.

Contents	Ranges
Radar data reporting cycle (s)	[0.1, 1.2]
Azimuth measurement error (°)	[−0.3, 0.3]
Range measurement error (m)	[−100, 100]
Time of the initial data report (s)	[0, 1.2]

Table 2. Association results of contrast experiments.

Contents	Worst Association Accuracy	Best Association Accuracy	Mean	Variance (×10⁻⁴)
DBSCAN-based	0.81	0.86	0.835	3.3778
PND	0.78	0.84	0.81	5.5111
Uncertainty-based	0.75	0.83	0.79	5.6556
TF-TS	0.85	0.96	0.905	2.8444
CNN-LSTM-based	0.87	0.95	0.910	2.4733
SAS-KNN-DPC (ours)	0.92	0.99	0.955	2.4800

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Guan, X.; Huang, Z.; Yi, X. SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering. Electronics 2025, 14, 2064. https://doi.org/10.3390/electronics14102064

AMA Style

Guan X, Huang Z, Yi X. SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering. Electronics. 2025; 14(10):2064. https://doi.org/10.3390/electronics14102064

Chicago/Turabian Style

Guan, Xin, Zhijun Huang, and Xiao Yi. 2025. "SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering" Electronics 14, no. 10: 2064. https://doi.org/10.3390/electronics14102064

APA Style

Guan, X., Huang, Z., & Yi, X. (2025). SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering. Electronics, 14(10), 2064. https://doi.org/10.3390/electronics14102064

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

SAS-KNN-DPC: A Novel Algorithm for Multi-Sensor Multi-Target Track Association Using Clustering

Abstract

1. Introduction

2. Problem Formulation

3. SAS-KNN-DPC Algorithm

3.1. Definitions

3.2. Self-Adaptive Step-2 Truncation Distance

3.3. The Framework of SAS-KNN-DPC Algorithm

3.4. Similarity Metric

3.5. Cluster Center Selection and Association Assignment

3.6. Time Complexity

3.7. Space Complexity

4. Experiments

4.1. Simulation Scene Setting

4.1.1. Scenario Data Visualization Analysis

4.1.2. Experimental Design

4.2. Experiment Based on Scenario 1

4.3. Real-Time Performance Verification

4.4. Impact of Sensor Quantity

4.5. Impact of the Target Distance

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI