Next Article in Journal
IoT-Enabled Adaptive Traffic Management: A Multiagent Framework for Urban Mobility Optimisation
Previous Article in Journal
Integrated NV Center-Based Temperature Sensor for Internal Thermal Monitoring in Optical Waveguides
Previous Article in Special Issue
Multi-Task Learning for Joint Indoor Localization and Blind Channel Estimation in OFDM Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hyperbolic Spatial Covariance Modeling with Adaptive Signal Filtering for Robust Wi-Fi Indoor Positioning

1
School of Information Engineering, Shaoguan University, Shaoguan 512158, China
2
School of Automation and Intelligent Manufacturing, Southern University of Science and Technology, Shenzhen 518055, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(13), 4125; https://doi.org/10.3390/s25134125
Submission received: 24 May 2025 / Revised: 23 June 2025 / Accepted: 26 June 2025 / Published: 2 July 2025

Abstract

Robust indoor positioning, crucial to modern location-based services, increasingly leverages Channel State Information (CSI) for its superior multipath resolution over the traditional RSSI. However, current CSI-based methods are hampered by three key limitations: susceptibility to skewed, non-Gaussian noise; informational redundancy from multi-AP configurations; and spatial discontinuities arising from Euclidean-based modeling. To address these challenges, we propose a unified framework that synergistically combines three innovations: (1) an adaptive filtering pipeline that uses wavelet decomposition and dynamic Kalman updates to suppress skewed noise; (2) a graph attention network that optimizes AP selection by modeling spatiotemporal correlations; and (3) a hyperbolic covariance model that captures the intrinsic non-Euclidean geometry of signal propagation. Evaluations on experimental data demonstrate that our framework achieves superior positioning accuracy and environmental robustness over state-of-the-art methods. Crucially, the hyperbolic representation enhances resilience to obstructions by preserving the signal manifold’s true structure, thereby advancing the practical deployment of fingerprinting systems.

1. Introduction

The proliferation of smart mobile devices has catalyzed revolutionary advancements in location-based services (LBSs), with precise indoor positioning emerging as a critical enabler for applications ranging from asset tracking to augmented reality. Unlike outdoor environments where Global Navigation Satellite Systems (GNSSs) are effective, indoor localization faces fundamental challenges, including severe multipath propagation, Non-Line-of-Sight (NLOS) conditions, and dynamic environmental interference [1]. Among competing technologies, Wi-Fi infrastructure demonstrates distinct advantages. Bluetooth-based systems [2] encounter limitations in coverage density and synchronization complexity, and ZigBee networks [3] sacrifice data rates for energy efficiency, while RFID [4] and UWB [5] require specialized hardware incompatible with existing infrastructure. Leveraging existing Wi-Fi access points (APs), therefore, provides a cost-effective and scalable foundation for indoor positioning, making it a highly viable solution for widespread deployment without additional specialized hardware [6].
Within Wi-Fi-based positioning, early systems predominantly relied on the Received Signal Strength Indicator (RSSI) as the primary signal feature [7]. The RSSI provides a single scalar value representing the total power of the received signal. However, this simplicity is also its fundamental weakness. As a coarse-grained metric, the RSSI is the superposition of signals from all propagation paths, which suffers from severe temporal fluctuations and spatial ambiguity in complex indoor environments, fundamentally constraining the achievable positioning accuracy [8].
To overcome the inherent limitations of the RSSI, recent advancements have shifted the focus to Channel State Information (CSI), a fine-grained physical layer metric [9]. The superiority of CSI over the RSSI for indoor positioning is threefold [1]. First, CSI offers superior multipath resolution; unlike the RSSI, which aggregates all signal paths into one value, CSI provides frequency-selective measurements across dozens of subcarriers, enabling the discrimination of distinct multipath components. Second, CSI provides richer information; it is a high-dimensional vector containing both amplitude and phase information for each subcarrier, offering a detailed channel fingerprint that is far more descriptive than a single power value. Third, the overall structure of the Channel Frequency Response (CFR) that CSI represents exhibits significantly higher temporal stability compared with the volatile RSSI, making it a more robust feature for creating reliable location fingerprints.
Despite its advantages, leveraging CSI for robust positioning introduces its own set of challenges, which remain inadequately addressed by the existing literature. Current methods exhibit three primary limitations. First, raw CSI data are often corrupted by non-Gaussian, skewed noise induced by dynamic environmental factors, which conventional filters assuming simple noise distributions fail to effectively suppress [10,11]. Second, the high dimensionality of CSI from multiple APs often leads to informational redundancy, and existing feature extraction techniques frequently lack adaptive fusion mechanisms to model the complex spatiotemporal correlations between APs, thereby limiting feature discriminability [12,13]. Third, and most critically, the majority of current positioning algorithms rely on Euclidean distance metrics to model spatial relationships. This assumption of a flat, linear space fundamentally fails to capture the intrinsic non-linear manifold structure of signal propagation in complex indoor environments, leading to significant information loss and degraded performance, especially under NLOS conditions [14,15].
In summary, to address the limitations of conventional fingerprinting-based indoor positioning systems, this paper proposes a comprehensive Wi-Fi fingerprinting framework with three key innovations. The main contributions are as follows:
  • Skewness-Adaptive Signal Filtering: To mitigate non-Gaussian noise characteristics induced by multipath propagation, we develop a multi-modal preprocessing pipeline combining wavelet packet decomposition, dynamic Kalman filtering, and skewness-aware normalization. This approach effectively handles asymmetric signal distributions while preserving transient features through adaptive noise covariance updates, outperforming conventional median/Gaussian filters in computational efficiency.
  • Correlation-Driven AP Selection: We introduce a hierarchical feature extraction mechanism that integrates multi-scale Temporal Convolutional Networks with graph attention fusion. This architecture dynamically selects optimal antenna pairs by modeling spatiotemporal correlations between APs through learned attention weights, resolving redundancy issues in conventional static AP selection methods.
  • Hyperbolic Spatial Covariance Modeling: A novel manifold-aware fingerprinting framework that encodes location relationships in hyperbolic space is proposed. By constructing hybrid covariance kernels combining hyperbolic distance metrics and feature similarity, our method better preserves spatial continuity in complex environments compared with Euclidean-based approaches.
The remainder of this paper is organized as follows: Section 2 systematically reviews foundational concepts of Channel State Information characterization and critically examines existing CSI-based indoor positioning strategies. Section 3 mathematically formalizes the indoor positioning problem while presenting the comprehensive system architecture. Section 4 elaborates on the proposed framework, encompassing three synergistic components: multi-modal signal preprocessing, hierarchical spatiotemporal feature extraction, and hyperbolic manifold-aware optimization. Section 5 provides rigorous quantitative comparisons with state-of-the-art methods, accompanied by error distribution analysis and ablation studies. Finally, Section 6 synthesizes key findings, discusses practical implications, and outlines future research trajectories in non-Euclidean signal modeling and edge computing integration.

2. Related Work

2.1. Channel State Information

Channel State Information (CSI) fundamentally characterizes the propagation medium between wireless transceivers, encapsulating critical parameters including signal attenuation, multipath scattering effects, and environmental interference patterns. As a time-varying descriptor of channel quality, CSI enables the real-time optimization of transmission parameters through adaptive modulation and beamforming techniques, thereby enhancing both spectral efficiency and link reliability. In frequency-domain analysis, the Channel Frequency Response for the k-th subcarrier is mathematically expressed as
H ( f k ) = H ( f k ) e j H ,
where H ( f k ) and H denote the amplitude and phase components at subcarrier frequency f k , respectively. Modern orthogonal frequency-division multiplexing (OFDM) systems facilitate CSI acquisition through commercial network interfaces, generating complex-valued channel matrices that capture spatial–frequency characteristics across multiple antennas. The superiority of CSI over the Received Signal Strength Indicator (RSSI) manifests in three key aspects: (1) frequency-selective resolution through orthogonal subcarriers; (2) phase coherence information for multipath analysis; and (3) spatial diversity via multiple-input multiple-output (MIMO) configurations. Xiao et al. pioneered CSI-based localization through their Fine-Grained Indoor Fingerprinting System (FIFS) [16], employing averaged amplitude features for position estimation. Subsequent research by Chapre et al. [17] enhanced fingerprint dimensionality through MIMO-OFDM implementations, while Wang et al. [12] introduced deep learning architectures for CSI denoising and feature extraction. Further advancements such as PhaseFi [18] demonstrated the viability of calibrated phase information, while other works explored contrastive learning [19] and hybrid CSI-RSSI systems [20]. However, these foundational works, while establishing the richness of CSI features, often overlooked the inherent spatial continuity of the wireless channel—a critical property where signal characteristics vary smoothly across adjacent physical locations. This oversight limits their ability to model gradual signal variations in continuous space, setting the stage for more advanced spatial modeling techniques.

2.2. CSI-Based Indoor Positioning Methods

Indoor positioning leveraging CSI has evolved significantly, with model-enhanced fingerprinting emerging as the dominant paradigm. Unlike geometric methods (e.g., AOA [21] and TOF [22]), which falter under Non-Line-of-Sight (NLOS) conditions [23,24], or traditional fingerprinting [16], which suffers from temporal instability and scalability issues [25,26], deep learning-based models can learn complex, non-linear mappings from CSI to location. However, even recent state-of-the-art methods exhibit critical limitations that motivate our work.
A prominent research trend involves using attention mechanisms to enhance feature robustness. For instance, Wan et al. proposed ACPNet, which incorporates channel and spatial attention to dynamically weigh different subcarriers and antenna streams within a single CSI sample [27]. This allows the model to focus on more informative signal components, improving accuracy. The primary limitation of such approaches, however, is that their attention mechanisms are “inward-facing”—they operate within a single CSI matrix to refine its features. While beneficial, this fails to explicitly model the dynamic spatiotemporal correlations between different access points (APs). This oversight leads to suboptimal feature fusion, as the relative reliability of signals from different APs is not dynamically assessed.
To address the issue of spatial relationships, Graph Neural Networks (GNNs) have been introduced to model the physical topology of the indoor environment. The work by Wang et al. represents a state-of-the-art example, constructing a graph to model the connectivity between reference points and APs, even incorporating an AP selection mechanism to reduce redundancy [28]. While GNNs effectively capture network topology better than methods like DeepFi [12], which treat fingerprints as isolated data, they suffer from a “Euclidean blind spot”. These models implicitly assume that the underlying signal propagation manifold is flat. This assumption breaks down in complex indoor environments where obstacles create a non-Euclidean geometric structure. Using Euclidean distance to measure similarity in these graph models leads to a fundamental misrepresentation of the true “signal distance” between points, particularly degrading performance under NLOS conditions, a challenge also faced by hybrid methods like Gridloc [15].
Another key research direction aims to improve the practicality of fingerprinting by reducing the burden of data collection. The Adapted Mean Teacher (AMT) model proposed by Zhu et al. is a notable example, employing a semi-supervised learning framework to leverage a large number of unlabeled data, thereby reducing the need for laborious manual labeling [29]. While this approach commendably tackles the challenge of data quantity, it does not fully address the issue of data quality. Specifically, such methods often rely on standard data augmentation and do not incorporate sophisticated, physics-aware noise models. They largely fail to account for the highly skewed, non-Gaussian noise distributions that are common in real-world CSI data corrupted by dynamic multipath interference. This gap highlights the need for more advanced signal preprocessing pipelines.
In summary, despite significant progress with methods leveraging contrastive learning [30] and graph networks [31], existing approaches face three critical limitations that our work aims to address. First, they lack robust mechanisms for handling the skewed, non-Gaussian noise prevalent in dynamic indoor environments. Second, their feature fusion strategies are often suboptimal, as they do not dynamically model the spatiotemporal correlations between signals from multiple APs. Third, and most critically, their reliance on Euclidean-based metrics fails to preserve the intrinsic non-Euclidean manifold structure of signal propagation, leading to information loss and reduced accuracy in complex spaces. Our framework overcomes these specific challenges through three synergistic innovations: (1) a skewness-adaptive filtering pipeline for superior noise suppression, (2) a graph attention network for correlation-aware AP feature fusion, and (3) a hyperbolic covariance model that provides a more geometrically faithful representation of the spatial relationships in CSI fingerprints.

3. Problem Description

In the process of indoor positioning based on the fingerprinting method, signal features are collected in two main stages: an offline initialization stage (for building the fingerprint database) and an online positioning stage. During the initialization stage, we collect fingerprint information at I known reference locations within an environment where one or more wireless access points are deployed. At each location p i ( i = 1 , 2 , , I ), we capture M measurements of Channel State Information (CSI), denoted by g i , m R N for m = 1 , 2 , , M . We use p = [ p 1 , p 2 , , p I ] to denote all the reference locations and p m = [ g 1 , m , g 2 , m , , g I , m ] to denote the m-th measurement vector at each location. The mean fingerprint is given by g ¯ = 1 M m = 1 M g m . To provide the requested clarification, each measurement vector g i , m (and similarly the online measurement vector y ) is composed of the real-valued amplitudes of the Channel Frequency Response (CFR) across N subcarriers, as defined in Equation (1):
g i , m = H i , m ( f 1 ) , H i , m ( f 2 ) , , H i , m ( f N ) R N
where H i , m ( f k ) denotes the complex channel response at location p i for measurement m and subcarrier k. The target measurement vector y , captured during the online phase, follows the same structure:
y = H target ( f 1 ) , H target ( f 2 ) , , H target ( f N )
In the positioning stage, our main task is to estimate the receiver’s position coordinates p target based on the received measurement vector y . The schematic diagram of the two stages of fingerprint positioning is shown in Figure 1.

4. Methodology

The proposed indoor positioning methodology consists of an offline phase for database construction and model training and an online phase for estimating the position of a target based on its received signal.

4.1. Offline Phase: Database Preparation and Model Training

4.1.1. Fingerprint Database Acquisition

As described in Section 3, an initial database is collected, containing CSI measurements g i , m R N for m = 1 , , M samples at N f p distinct reference locations p i ( i = 1 , , N f p ). We denote the physical positions by P = { p 1 , , p N f p } . For robustness, the M measurements at each location can be averaged or processed to yield a representative signal vector g ¯ i for each location p i .

4.1.2. Signal Preprocessing

Each representative signal vector g ¯ i (or potentially each raw g i , m ) undergoes a three-stage preprocessing pipeline to mitigate noise and artifacts inherent in CSI measurements. Let s r a w denote a signal vector input to this stage. The sequential design of this pipeline is motivated by the hierarchical nature of noise in CSI signals and empirical validation across diverse environments. Wavelet decomposition first isolates high-frequency noise components, Kalman filtering then tracks low-frequency signal dynamics, and skewness normalization finally addresses asymmetric distortions from multipath propagation. This order prevents distortion amplification (e.g., normalizing before filtering would amplify Kalman’s residual errors) and aligns with the noise characteristics observed in our experimental data.
Wavelet Packet Decomposition
The raw signal s r a w R N is decomposed using a 3-level wavelet packet decomposition with Daubechies-4 basis functions. The Daubechies-4 (db4) wavelet was specifically chosen as it offers an optimal trade-off between compact support and regularity for analyzing CSI signals. Its support is narrow enough to accurately localize the sharp, high-frequency transients caused by multipath effects, while its four vanishing moments provide sufficient smoothness to model the underlying signal structure without introducing artifacts or blurring critical features. This balance is crucial to effectively separating the stable location fingerprint from volatile noise, outperforming both simpler (e.g., Haar) and more complex (e.g., db8) alternatives for this application. The wavelet packet functions are defined as
ψ k , n ( t ) = 2 k / 2 ψ ( 2 k t n ) , k { 1 , 2 , 3 } , n Z
where ψ ( t ) is the mother wavelet. This yields 2 3 = 8 subbands of wavelet coefficients  { d k , n } .
s r a w k = 1 8 n d k , n ψ k , n ( t )
Optimal subbands might be selected based on criteria like Shannon entropy to retain the most informative components:
E ( d k ) = n | d k , n | 2 d k 2 log | d k , n | 2 d k 2
Adaptive Kalman Filtering
The selected detail coefficients (let z t be the vector of coefficients at index/time t) are filtered using a Kalman filter to reduce noise while adapting to signal dynamics. The state s t represents the estimated true coefficients.
s t = s t 1 ( State prediction )
P t = P t 1 + Q ( Covariance prediction )
K t = P t ( P t + R t ) 1 ( Kalman Gain )
s t = s t + K t ( z t s t ) ( State update )
P t = ( I K t ) P t ( Covariance update )
Here, z t is the observed coefficient vector at step t, Q is the process noise covariance (e.g.,  0.01 I ), and R t is the measurement noise covariance, updated adaptively as
R t = ( 1 α ) R t 1 + α ( z t s t ) ( z t s t )
with a forgetting factor α (e.g., 0.1). Let s ^ denote the vector of filtered coefficients after processing all relevant indices/times.
Robust Skewness Normalization
The filtered coefficients s ^ are normalized to handle potential skewness in their distribution by using median and Interquartile Range (IQR), compensated by a skewness factor:
s ˜ = s ^ median ( s ^ ) IQR ( s ^ ) tanh ( γ | s ^ | )
where ⊙ denotes element-wise multiplication and γ depends on the sample skewness of s ^ , which is
skewness ( s ^ ) = 1 N j = 1 N ( s ^ j μ ) 3 σ 3 , γ = c · | skewness ( s ^ ) |
with μ , σ being the mean and standard deviation of s ^ and c being a scaling constant (e.g., 1.5). Let s ˜ i denote the preprocessed signal corresponding to location p i .

4.1.3. Deep Feature Extraction

The preprocessed signals s ˜ i are fed into a deep learning model to extract discriminative feature vectors h i . This model combines temporal and spatial information.
Multi-Scale Temporal Encoding
Temporal dependencies are captured using parallel Temporal Convolutional Network (TCN) branches with different dilation factors (e.g., d { 1 , 2 , 4 } ). Each TCN layer performs a convolution:
h tcn ( l ) = ReLU k = 0 K 1 W k ( l ) s ˜ t d k + b ( l )
Weights W k ( l ) are initialized, e.g., using Xavier initialization:
W k ( l ) N 0 , 2 n in + n out
The outputs from parallel branches can be concatenated or summed.
Graph Attention Fusion
If multiple access points (APs) or receiving antennas are used, spatial relationships are modeled using graph attention networks (GATs). Features from different sources (APs/antennas) corresponding to the same location are fused. Let { h j } be the TCN features from neighboring sources j N i (within communication range or physically adjacent). The attention mechanism computes weights α i j :
e i j = LeakyReLU ( a [ W h i W h j ] )
α i j = exp ( e i j ) k N i exp ( e i k )
The fused feature h i for source i is then
h i = ELU j N i α i j W h j
where W and a are learnable parameters of the GAT layer. The final output of this stage is a feature vector h i R D for each reference location p i . The TCN and GAT models are trained offline using the preprocessed fingerprint data { ( p i , s ˜ i ) } i = 1 N f p , typically aiming to optimize a loss function related to position estimation or feature discriminability. Let the resulting feature database be D h = { ( p i , h i ) } i = 1 N f p .

4.1.4. Manifold-Aware Fingerprint Modeling (GP Training)

We model the relationship between position p and the extracted feature vector h using a Gaussian Process (GP). The GP assumes h = f ( p ) + ϵ , where f ( p ) is a function drawn from a GP prior and ϵ is noise. The key component is the covariance kernel function K ( p i , p j ) , which defines the similarity between features at different locations.
Hybrid Covariance Kernel
Conventional fingerprinting methods typically assume that the spatial relationship between reference points can be adequately modeled in Euclidean space. However, this assumption falters in complex indoor environments characterized by significant multipath propagation and Non-Line-of-Sight (NLOS) conditions. The physical obstacles, such as walls and furniture, impose a natural hierarchical structure on the signal propagation paths. For example, signals from a corridor access point to locations in different adjacent rooms must all traverse their respective doorways, creating a tree-like signal path topology where the corridor acts as a parent node. It has been well-established that such hierarchical or tree-like structures are more faithfully represented in negatively curved hyperbolic spaces than in flat Euclidean spaces, as the exponential volume growth of hyperbolic manifolds allows for low-distortion embeddings  [32,33]. The mathematical prerequisite for employing a hyperbolic metric, therefore, arises directly from the intrinsic geometry of the indoor signal manifold. Using hyperbolic distance allows the model to capture the effective “signal distance” between points, which is more relevant for positioning than the direct Line-of-Sight Euclidean distance. We, therefore, propose a hybrid kernel that combines this global geometric prior with local feature-based information:
K ( p i , p j ; , η ) = exp d H ( p i , p j ) 2 2 2 + η h i h j h i h j + σ n 2 δ i j
where
  • d H ( p i , p j ) is the hyperbolic distance between positions p i and p j . Note: The specific formula depends on the chosen hyperbolic model (e.g., Poincare disk/half-plane) and the mapping from Euclidean coordinates p to hyperbolic space, which needs to be defined.
  • is the characteristic length scale in the hyperbolic space.
  • h i , h j are the feature vectors corresponding to p i , p j . The cosine similarity h i h j h i h j is used for the feature similarity component.
  • η weighs the contribution of feature similarity.
  • σ n 2 is the noise variance, and δ i j is the Kronecker delta.
Kernel Parameter Optimization
The hyperparameters θ = { , η , σ n 2 } are learned offline by maximizing the log marginal likelihood of the observed fingerprint features H = [ h 1 , , h N f p ] given the positions P :
θ * = arg max θ log p ( H | P , θ )
Assuming independent features for simplicity or considering a multi-output GP framework, the log marginal likelihood involves the Gram matrix K , whose entries are K i j = K ( p i , p j ; θ ) . For a single dimension of the feature vector (let h ( d ) be the d-th dimension across all points), this is
log p ( h ( d ) | P , θ ) = 1 2 ( h ( d ) ) K 1 h ( d ) 1 2 log | K | N f p 2 log ( 2 π )
This optimization is typically performed using gradient-based methods.

4.2. Online Phase: Target Positioning

A new CSI measurement y target from an unknown target location p target is given.

4.2.1. Preprocessing and Feature Extraction

The target signal y target is processed through the same pipeline used in the offline phase (Section 4.1.2 and Section 4.1.3) to obtain its feature vector h target .

4.2.2. Bayesian Optimization for Position Estimation

We use the trained GP model h GP ( 0 , K ( p , p ; θ * ) ) and the target feature vector h target to estimate the target’s position p target . We aim to find the position p that maximizes the likelihood of observing h target . The log likelihood (for a single feature dimension h target ( d ) ) is
log p ( h target ( d ) | p , D h , θ * ) = ( h target ( d ) μ ( d ) ( p ) ) 2 2 σ 2 ( p ) 1 2 log ( 2 π σ 2 ( p ) )
where μ ( d ) ( p ) and σ 2 ( p ) are the GP predictive mean and variance for the d-th feature dimension at query location p :
μ ( d ) ( p ) = k ( p ) ( K + σ n 2 I ) 1 h ( d )
σ 2 ( p ) = K ( p , p ) k ( p ) ( K + σ n 2 I ) 1 k ( p )
with k ( p ) = [ K ( p , p 1 ) , , K ( p , p N f p ) ] and K being the Gram matrix computed on the fingerprint locations P . Assuming independence across feature dimensions, the total log likelihood is the sum over dimensions d. We use Bayesian Optimization (BO) to maximize this log likelihood function L ( p ) = d log p ( h target ( d ) | p , D h , θ * ) over the possible location space p .
Momentum-Enhanced Acquisition Function
BO iteratively selects the next point p t + 1 to evaluate by maximizing an acquisition function. We use Expected Improvement (EI) enhanced with a momentum term:
α ( p ) = EI ( p ) + β m t p EI ( p )
where m t is a momentum term accumulating past gradients of EI and β controls its influence. The standard EI for maximizing the objective function L ( p ) is
EI ( p ) = ( μ L ( p ) L + ) Φ ( Z ) + σ L ( p ) ϕ ( Z )
with L + being the maximum log likelihood value observed so far and
Z = μ L ( p ) L + σ L ( p )
Here, μ L ( p ) and σ L ( p ) are the mean and standard deviation of the GP model built for the log likelihood function L ( p ) itself. (Note: This requires a secondary GP model within the BO loop or approximations.) Φ ( · ) and ϕ ( · ) are the standard normal CDF and PDF.
Posterior Updates (Within BO)
After evaluating L ( p t ) at the selected point, the BO’s internal GP model for L ( p ) is updated. The BO procedure continues for a fixed number of iterations T max or until convergence. The final estimated position p ^ target is the location p that yielded the highest log likelihood L ( p ) during optimization.

4.3. Algorithm Summary

The overall process is summarized in Algorithm 1.
Algorithm 1 Manifold-aware Bayesian Positioning
 1:
  Input: Initial fingerprint database D = { ( p i , g i , m ) } , Target signal y target
 2:
  Output: Estimated target position p ^ target
  Offline Phase:
 3:
  Preprocess all g i , m (or averages g ¯ i ) to get s ˜ i (Equations (5)–(13))
 4:
  Train Deep Feature Extractor (TCN/GAT) using { ( p i , s ˜ i ) }
 5:
  Extract features for all reference points: h i = Extractor ( s ˜ i ) . Create D h = { ( p i , h i ) } i = 1 N f p
 6:
  Optimize GP kernel hyperparameters θ * = { * , η * , σ n 2 * } using D h by maximizing marginal likelihood (Equation (21))
 7:
  Construct the final Gram matrix K using θ * and P
  Online Phase:
 8:
  Preprocess target signal y target to get s ˜ target
 9:
  Extract target feature h target = Extractor ( s ˜ target )
10:
  Define the log-likelihood objective function L ( p ) = log p ( h target | p , D h , θ * ) based on GP predictions (Equations (23)–(25))
11:
  Initialize Bayesian Optimization to maximize L ( p ) .
12:
  for  t = 1 to T max do
13:
        Select next query point p t by maximizing acquisition function α ( p ) (Equation (26))
14:
        Evaluate L ( p t )
15:
        Update BO’s internal model of L ( p )
16:
  end for
17:
  Return p ^ target = arg max p { p 1 , , p T max } L ( p )

4.4. Computational Complexity Analysis

In this section, we analyze the computational complexity of the proposed method during its operational online phase. The offline training phase, while computationally intensive, is executed only once to establish the model parameters. Therefore, our analysis focuses exclusively on the inference stage, as its efficiency is paramount for real-time applicability. The computational burden of the online phase is primarily determined by the forward propagation of an input sample through the network. Let us define the key variables that characterize the network’s architecture:
  • D i n : the dimension of the input vector x .
  • L: the total number of hidden layers in the network.
  • N l : the number of neurons in the l-th hidden layer, where l { 1 , 2 , , L } .
  • D o u t : the dimension of the final output vector.
The online computation process for a single input sample x R D i n proceeds as follows: First, the computation between the input layer and the first hidden layer involves a matrix–vector multiplication with the weight matrix W 1 R N 1 × D i n and the addition of a bias vector b 1 R N 1 . The computational cost of this step is dominated by the matrix–vector product, which requires O ( D i n N 1 ) floating-point operations (FLOPs). Next, for each subsequent hidden layer l (where l ranges from 2 to L), the computation involves transforming the activation output from the previous layer, h l 1 R N l 1 , using the weight matrix W l R N l × N l 1 . The cost for each such layer is similarly dominated by the matrix–vector multiplication, amounting to O ( N l 1 N l ) FLOPs. Finally, the output layer computes the final result by transforming the activations of the last hidden layer, h L R N L , using the output weight matrix W o u t R D o u t × N L . This step has a complexity of O ( N L D o u t ) . The costs of bias additions and element-wise activation functions (e.g., ReLU or sigmoid) are O ( N l ) and O ( D o u t ) , respectively. These are lower-order terms compared with the matrix–vector multiplications and can be omitted in the asymptotic analysis. By summing the costs of all layers, the total computational complexity for a single forward pass during the online phase is given by
C o n l i n e = O D i n N 1 + l = 2 L N l 1 N l + N L D o u t
This expression demonstrates that the online computational complexity is linear with respect to the number of parameters in the network. Such a linear relationship ensures that the model can perform inference efficiently, making it well-suited for applications requiring rapid response times, as the latency scales predictably with the model size. This complexity is well within the acceptable range for modern computational platforms.

4.5. Theoretical Guarantees

To theoretically evaluate the fundamental limit of positioning accuracy for the proposed system, we derive the Cramér–Rao Lower Bound (CRLB). The CRLB provides a lower bound on the variance of any unbiased estimator for a deterministic parameter, which in our case is the target’s position vector p target R 2 . For an unbiased position estimator p ^ target , the error covariance matrix is bounded by the inverse of the Fisher Information Matrix (FIM), J :
E ( p ^ target p target ) ( p ^ target p target ) J 1
The trace of this matrix gives the lower bound on the mean squared error (MSE), which was briefly introduced in the original manuscript and is restated here for completeness:
E [ p ^ target p target 2 ] Tr ( J 1 )
The FIM, J , quantifies the amount of information that the observed feature vector h target carries about the unknown position p . For a set of parameters p = [ p 1 , p 2 ] , the elements of the 2 × 2 FIM are given by the negative expectation of the second-order partial derivatives of the log-likelihood function with respect to those parameters:
J i j ( p ) = E h target 2 log p ( h target | p ) p i p j
In our framework, the likelihood p ( h target | p ) is defined by the Gaussian Process posterior predictive distribution. Assuming independence across the D feature dimensions for analytical tractability, the total log likelihood is the sum of the individual log likelihoods from Equation (23):
log p ( h target p ) = d = 1 D log p ( h target ( d ) p ) = d = 1 D ( h target ( d ) μ ( d ) ( p ) ) 2 2 σ d 2 ( p ) 1 2 log ( 2 π σ d 2 ( p ) )
where μ ( d ) ( p ) and σ d 2 ( p ) are the GP predictive mean and variance for the d-th feature dimension, as given in Equations (24) and (25). To compute the FIM, we take the partial derivatives of Equation (33) with respect to the position coordinates p i and p j . Under the assumption that the estimator is evaluated at the true location p target , we have E [ h target ( d ) ] = μ ( d ) ( p target ) . After performing the differentiation and taking the negative expectation, which simplifies the cross-terms involving ( h target ( d ) μ ( d ) ( p ) ) , the FIM elements are found to be
J i j ( p ) = d = 1 D 1 σ d 2 ( p ) μ ( d ) ( p ) p i μ ( d ) ( p ) p j + 1 2 σ d 4 ( p ) σ d 2 ( p ) p i σ d 2 ( p ) p j
This expression connects the theoretical performance limit directly to our model’s components. The terms μ ( d ) ( p ) p i and σ d 2 ( p ) p i are the gradients of the GP predictive mean and variance, which depend on the partial derivatives of the kernel function K ( p , p ) with respect to the coordinates of p . Therefore, the FIM is ultimately determined by the sensitivity of our hybrid covariance kernel to spatial changes. A higher gradient in the kernel function leads to a larger FIM and thus a lower CRLB, indicating higher potential positioning accuracy. In our experimental analysis, we compute this bound to benchmark the performance of the proposed estimator.

5. Experimental Validation

5.1. Experimental Setup and Data Acquisition

To validate the proposed positioning framework, we designed a comprehensive experimental protocol encompassing hardware configuration, data acquisition, and multi-scenario environmental validation. The entire setup was meticulously planned to ensure the collection of high-fidelity data and enable a rigorous evaluation of the algorithm’s performance under diverse and challenging conditions.
The acquisition of Channel State Information (CSI) was performed using a TP-Link WDR4310 router (TP-Link Technologies Co., Ltd., Shenzhen, China) flashed with the OpenWrt open-source firmware (v22.03.3), as illustrated in Figure 2. This platform facilitates the capture of CSI measurements across 56 orthogonal frequency-division multiplexing (OFDM) subcarriers. To align with common real-world deployment, we implemented a 3 × 1 multiple-input single-output (MISO) architecture where the transmitter utilizes three omnidirectional antennas and the receiver employs a single antenna.
The raw CSI measurements, inherently susceptible to significant non-Gaussian noise from multipath propagation and hardware non-linearities, were processed using the adaptive filtering pipeline detailed in Section 4.1.2. This crucial step validates the integrity of the fingerprint database prior to model training and feature extraction. As illustrated in Figure 3, the framework provides a quantifiable improvement in data quality: at a representative test location, it achieved an 8.78 dB reduction in noise variance—equivalent to an 86.8% reduction in noise power—while retaining 98.2% of the original signal energy. By significantly enhancing the signal-to-noise ratio (SNR) without losing the discriminative amplitude characteristics essential to positioning, this preprocessing step establishes the high-fidelity data foundation necessary for the robust performance of the subsequent modeling stages.
To systematically evaluate the robustness and practical performance of our localization methodology, experimental validations were conducted in three distinct scenarios, specifically chosen to represent a spectrum of environmental complexity. The first, an empty chamber (9.6 × 4.4 m), served as a baseline to assess performance under near-ideal, Line-of-Sight (LOS) conditions, using a grid of 72 initialization and 54 test points (Figure 4). To introduce realistic challenges, subsequent evaluations were performed in an office space (9.6 × 4.4 m) with typical furniture and obstructions causing complex multipath propagation (50 initialization and 44 test points; Figure 5) and an adjacent corridor (28.8 × 3 m). The corridor, with its elongated geometry and potential for severe signal attenuation, represents a common yet difficult indoor navigation scenario, for which an expanded dataset of 216 initialization and 81 test points was collected (Figure 6). In all scenarios, the test locations were methodically selected to ensure comprehensive spatial coverage with a consistent 0.5-m separation between adjacent points.
The number of deployed access points (APs) is a critical parameter that dictates the trade-off between positioning accuracy and practical deployment cost. To establish a methodologically sound configuration for our main experiments, a preliminary study was conducted to determine an optimal AP quantity. As shown in Figure 7, we evaluated the average localization error while varying the number of APs from one to four. The results clearly indicate that performance improves significantly up to three APs, after which the accuracy gains diminish. Based on this empirical evidence, a three-AP configuration—with APs strategically deployed as shown in Figure 4, Figure 5 and Figure 6—was adopted for all scenarios. This data-driven decision ensures that the subsequent comparative analysis is conducted under conditions representing an optimal balance between performance and complexity, thereby enabling a fair and robust evaluation against other methods.

5.2. Results and Discussion

In addition to the methodology presented in this work, five state-of-the-art localization techniques were employed for comparative analysis in test point positioning: the conventional fingerprinting technique (FIFS) proposed in Reference [16], which utilizes traditional radio fingerprint matching; the Deep Neural Network-based approach (DeepFi) introduced in Reference [12], employing classical depth learning architectures for signal pattern recognition; the Attention-Aided Deep Learning network (ACPNet) from Reference [27], which leverages attention mechanisms to refine CSI features; the hybrid localization methodology (Gridloc) developed in Reference [15], which synergistically combines Received Signal Strength Indicator (RSSI) measurements with Channel State Information (CSI) data fusion; and the graph convolutional neural network architecture (GCNN) presented in Reference [31], implementing advanced graph-based learning for spatial feature extraction. Quantitative performance metrics, including mean positioning error, standard deviation of the error, and 90% accuracy thresholds (i.e., the error value below which 90% of test points fall), are summarized for each method across the different environments in Table 1, Table 2 and Table 3. The error distributions are further visualized through Cumulative Distribution Functions (CDFs) in Figure 8, Figure 9 and Figure 10. Figure 11 specifically illustrates the error distributions achieved by the method proposed in this paper across the three distinct scenarios. These results visually corroborate the expectation that an increase in environmental complexity, such as the presence of obstacles and the need for signal penetration through walls, generally leads to a reduction in the stability of received signals and, consequently, an escalation in positioning errors. This figure provides a crucial self-benchmark, illustrating how the proposed method’s performance intrinsically responds to increasing environmental challenges, which then serves as a basis for evaluating its relative robustness compared with other techniques.
In the empty room scenario (Table 1), which represents a near-ideal environment with minimal multipath interference, the proposed method achieved superior performance with a mean error of 0.7608 m. This result underscores the fundamental efficacy of our model’s feature extraction and spatial modeling capabilities even under baseline conditions. In contrast, the conventional FIFS method exhibited the highest mean error (1.4557 m), attributable to its reliance on averaged CSI amplitude features that fail to address temporal–spatial signal dynamics. DeepFi (1.1157 m) and the newly introduced ACPNet (0.9034 m) demonstrated the advantages of using deep learning for feature extraction. Notably, ACPNet’s use of intra-sample attention mechanisms to weigh subcarriers and antennas allows it to refine CSI features more effectively than DeepFi’s single-scale architecture, resulting in better accuracy. However, both GCNN (0.8350 m) and Gridloc (0.8746 m) outperformed ACPNet. This is because, as argued in our related work review, ACPNet’s “inward-facing” attention operates only within a single CSI sample and does not explicitly model the spatial relationships between different APs. Methods like GCNN, which model the physical topology, and the proposed method, which models the signal manifold, gain a distinct advantage by capturing this crucial inter-AP spatial information, leading to lower positioning errors even in this relatively simple environment.
In the office environment with obstacles (Table 2), the introduction of complex multipath propagation provides a more rigorous test of each model’s robustness. The proposed method maintained its leading performance with a mean error of 0.8943 m, showing a modest degradation of only 17.5% compared with the empty room. This resilience highlights the effectiveness of our framework’s skewness-adaptive filtering and dynamic Kalman updates in suppressing the non-Gaussian noise prevalent in such environments. The performance gap between our method and others widened significantly. For instance, ACPNet’s mean error increased to 1.1517 m, a performance degradation of 27.5%. This larger increase empirically validates the critique of its architecture: in an environment with obstructions, the reliability of signals from different APs varies drastically. ACPNet’s inability to model these dynamic inter-AP correlations means it cannot adaptively fuse features from the most reliable sources, leading to a substantial drop in accuracy. In contrast, our method’s graph attention fusion mechanism is explicitly designed to learn these correlations, providing superior robustness. While Gridloc’s hybrid approach (0.9538 m) offered some adaptability, its reliance on Euclidean metrics for spatial relationships still resulted in a 6.6% higher mean error than our geometrically aware model.
In the elongated corridor scenario (Table 3), where severe signal attenuation and geometrically constrained propagation paths challenge conventional models, the proposed method’s core innovation demonstrates its full value. It achieved a mean error of 1.0433 m, outperforming the next-best method, Gridloc (1.2716 m), by 21.9%. The performance of Euclidean-based models degraded significantly in this non-Euclidean environment. Notably, GCNN (1.4009 m) performed worse than ACPNet (1.3235 m). This reversal is highly instructive: GCNN’s graph model, which assumes that Euclidean distance is a meaningful proxy for signal similarity, is fundamentally misled by the corridor’s geometry. The actual signal propagation paths are constrained by the walls, creating a hierarchical structure that flat space cannot represent. In this extreme case, GCNN’s flawed spatial model becomes more detrimental than ACPNet’s approach of simply refining features without an explicit inter-AP model. This result provides a powerful justification for our method’s central contribution: the hyperbolic covariance kernel. By representing fingerprints in a negatively curved space that naturally accommodates hierarchical, tree-like structures, our framework preserves the intrinsic manifold of signal propagation. This geometric fidelity prevents the critical information loss suffered by all other methods, enabling superior accuracy in the most challenging indoor environments.
To further assess the statistical efficiency of the proposed method, we compare its empirical performance against the theoretical Cramér–Rao Lower Bound (CRLB) derived in Section 4.5. The CRLB represents the minimum achievable mean squared error for any unbiased estimator and thus serves as an ideal performance benchmark. We calculated the FIM (Equation (34)) and the corresponding CRLB (Equation (31)) at the true location of each test point in our three experimental environments. Table 4 summarizes the average CRLB for each scenario and compares it with the mean positioning error of our method.
As shown in Table 4, our method achieves a mean error that is reasonably close to the CRLB, with the gap increasing from 0.11 m in the empty room to 0.19 m in the corridor. This trend is expected, as unmodeled environmental complexities introduce biases that prevent any real-world estimator from perfectly attaining the bound. In all scenarios, the mean error of our method is close to the CRLB, indicating high efficiency for a majority of the test points. The divergence in the tail of the distribution reflects the challenging NLOS conditions for a subset of points, where the GP model’s assumptions are most stressed. Overall, this analysis confirms that the proposed estimator operates near the fundamental performance limit dictated by the information content of the CSI signals.

6. Conclusions

This paper presents a novel Wi-Fi indoor positioning framework that significantly enhances accuracy and robustness by addressing key limitations in existing methods. The synergy of three innovations is central to our approach: (1) a skewness-adaptive filtering pipeline for effective non-Gaussian noise suppression; (2) a graph attention network for correlation-aware feature fusion from multiple APs; and (3) a hyperbolic covariance model that preserves the intrinsic manifold structure of signal propagation in complex environments. Experimental results validate the superiority of our method, especially in obstructed scenarios where the hyperbolic spatial representation demonstrates exceptional resilience. However, we acknowledge the limitations of this study. Our validation was conducted using a homogeneous hardware setup within single-floor environments, thus the generalizability to large-scale, multi-floor venues and across heterogeneous devices warrants further investigation. Future work will focus on extending the hyperbolic model to 3D space for multi-floor positioning and exploring domain adaptation techniques to create a hardware-agnostic framework. Furthermore, integrating our model with semi-supervised learning and edge computing will be explored to reduce recalibration efforts in dynamic environments and enhance real-time deployment feasibility.

Author Contributions

Conceptualization, W.W.; methodology, W.W.; software, W.W.; validation, W.W.; formal analysis, M.L.; investigation, W.W.; resources, M.L.; data curation, W.W.; writing—original draft preparation, W.W.; writing—review and editing, M.L.; visualization, W.W.; supervision, M.L.; project administration, M.L.; funding acquisition, W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research study was funded by Shaoguan University Doctoral Research Startup Fund grant number 441-9900064601 and Guangdong Key Construction Discipline Research Capacity Enhancement Project grants number 2022ZDJS049 and 2022ZDJS048.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data supporting the reported results in this study are not publicly available due to privacy and ethical restrictions. However, additional details and datasets may be made available upon reasonable request to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ACPNetAttention-Aided Deep Learning Network
AMTAdapted Mean Teacher
AOAAngle of Arrival
APaccess point
BOBayesian Optimization
CDFCumulative Distribution Function
CFRChannel Frequency Response
CRLBCramér–Rao Lower Bound
CSIChannel State Information
DeepFiDeep Neural Network-based Fingerprinting
DNNDeep Neural Network
EIExpected Improvement
FIFSFine-Grained Indoor Fingerprinting System
FIMFisher Information Matrix
FLOPsfloating-point operations
GATgraph attention network
GCNNgraph convolutional neural network
GNNsGraph Neural Networks
GNSSGlobal Navigation Satellite System
GPGaussian Process
Gridlochybrid localization methodology
IQRInterquartile Range
LBSlocation-based service
LOSLine-of-Sight
MIMOmultiple-input multiple-output
MISOmultiple-input single-output
MSEmean squared error
NLOSNon-Line-of-Sight
OFDMorthogonal frequency-division multiplexing
PDFProbability Density Function
RFIDRadio-Frequency Identification
RSSIReceived Signal Strength Indicator
SNRsignal-to-noise ratio
TCNTemporal Convolutional Network
TOFTime of Flight
UWBUltra-Wideband

References

  1. Yang, Z.; Zhou, Z.; Liu, Y. From RSSI to CSI: Indoor localization via channel response. ACM Comput. Surv. CSUR 2013, 46, 1–32. [Google Scholar] [CrossRef]
  2. Bencak, P.; Hercog, D.; Lerher, T. Indoor positioning system based on bluetooth low energy technology and a nature-inspired optimization algorithm. Electronics 2022, 11, 308. [Google Scholar] [CrossRef]
  3. Li, J.; Yu, S.; Wei, Z.; Zhou, Z. An Enhanced ZigBee-Based Indoor Localization Method Using Multi-Stage RSSI Filtering and LQI-Aware MLE. Sensors 2025, 25, 2947. [Google Scholar] [CrossRef] [PubMed]
  4. Shi, W.; Chen, Z.; Zhao, K.; Xi, W.; Qu, Y.; He, H.; Guo, Z.; Ma, Z.; Huang, X.; Wang, P.; et al. 3D target location based on RFID polarization phase model. EURASIP J. Wirel. Commun. Netw. 2022, 2022, 1–19. [Google Scholar] [CrossRef]
  5. Xiang, Z.; Chen, L.; Wu, Q.; Yang, J.; Dai, X.; Xie, X. An Improved UWB Indoor Positioning Approach for UAVs Based on the Dual-Anchor Model. Sensors 2025, 25, 1052. [Google Scholar] [CrossRef]
  6. Eleftherakis, S.; Santaromita, G.; Rea, M.; Costa-Pérez, X.; Giustiniano, D. SPRING+: Smartphone positioning from a single WiFi access point. IEEE Trans. Mob. Comput. 2024, 23, 9549–9566. [Google Scholar] [CrossRef]
  7. Wang, P.; Luo, Y. Research on WiFi indoor location algorithm based on RSSI Ranging. In Proceedings of the 2017 4th International Conference on Information Science and Control Engineering (ICISCE), Changsha, China, 21–23 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1694–1698. [Google Scholar]
  8. Yang, T.; Cabani, A.; Chafouk, H. A survey of recent indoor localization scenarios and methodologies. Sensors 2021, 21, 8086. [Google Scholar] [CrossRef]
  9. Liu, W.; Cheng, Q.; Deng, Z.; Chen, H.; Fu, X.; Zheng, X.; Zheng, S.; Chen, C.; Wang, S. Survey on CSI-based indoor positioning systems and recent advances. In Proceedings of the 2019 International Conference on Indoor Positioning and Indoor Navigation (IPIN), Pisa, Italy, 30 September–3 October 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–8. [Google Scholar]
  10. Chen, M.; Zhang, D.; Zhao, Y.; Wu, T. An Optimized Positioning Algorithm Based on Improved Gaussian Filtering. Proc. J. Phys. Conf. Ser. 2021, 2010, 012047. [Google Scholar] [CrossRef]
  11. Wu, Z.; Jiang, L.; Jiang, Z.; Chen, B.; Liu, K.; Xuan, Q.; Xiang, Y. Accurate indoor localization based on CSI and visibility graph. Sensors 2018, 18, 2549. [Google Scholar] [CrossRef]
  12. Wang, X.; Gao, L.; Mao, S.; Pandey, S. DeepFi: Deep learning for indoor fingerprinting using channel state information. In Proceedings of the 2015 IEEE Wireless Communications and Networking Conference (WCNC), New Orleans, LA, USA, 9–12 March 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1666–1671. [Google Scholar]
  13. Zhou, R.; Hao, M.; Lu, X.; Tang, M.; Fu, Y. Device-free localization based on CSI fingerprints and deep neural networks. In Proceedings of the 2018 15th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON), Hong Kong, China, 11–13 June 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–9. [Google Scholar]
  14. Rocamora, J.M.; Ho, I.W.H.; Mak, M.W. Fingerprint quality classification for CSI-based indoor positioning systems. In Proceedings of the ACM MobiHoc Workshop on Pervasive Systems in the IoT Era, Catania, Italy, 2 July 2019; pp. 31–36. [Google Scholar]
  15. Dai, Q.; Qian, B.; Boateng, G.O.; Guo, X.; Ansari, N. GRIDLoc: A Gradient Blending and Deep Learning based Localization Approach Combining RSS and CSI. IEEE Wirel. Commun. Lett. 2024, 13, 2620–2624. [Google Scholar] [CrossRef]
  16. Xiao, J.; Wu, K.; Yi, Y.; Ni, L.M. FIFS: Fine-grained indoor fingerprinting system. In Proceedings of the 2012 21st International Conference on Computer Communications and Networks (ICCCN), München, Germany, 30 July–2 August 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 1–7. [Google Scholar]
  17. Chapre, Y.; Ignjatovic, A.; Seneviratne, A.; Jha, S. CSI-MIMO: An efficient Wi-Fi fingerprinting using channel state information with MIMO. Pervasive Mob. Comput. 2015, 23, 89–103. [Google Scholar] [CrossRef]
  18. Wang, X.; Gao, L.; Mao, S. PhaseFi: Phase fingerprinting for indoor localization with a deep learning approach. In Proceedings of the 2015 IEEE Global Communications Conference (GLOBECOM), San Diego, CA, USA, 6–10 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1–6. [Google Scholar]
  19. Deng, J.; Shi, W.; Zhang, J.; Zhang, X.; Zhang, C. Supervised contrastive CSI representation learning for massive MIMO positioning. IEEE Commun. Lett. 2022, 26, 1799–1803. [Google Scholar] [CrossRef]
  20. Wang, J.; Park, J. An enhanced indoor positioning algorithm based on fingerprint using fine-grained csi and rssi measurements of ieee 802.11 n wlan. Sensors 2021, 21, 2769. [Google Scholar] [CrossRef]
  21. Chen, L.; Ahriz, I.; Le Ruyet, D. AoA-aware probabilistic indoor location fingerprinting using channel state information. IEEE Internet Things J. 2020, 7, 10868–10883. [Google Scholar] [CrossRef]
  22. Tadayon, N.; Rahman, M.T.; Han, S.; Valaee, S.; Yu, W. Decimeter ranging with channel state information. IEEE Trans. Wirel. Commun. 2019, 18, 3453–3468. [Google Scholar] [CrossRef]
  23. Altarazi, A.; Al-Madi, N.; Awad, F. Geometric-based localization for wireless sensor networks. In Proceedings of the 2020 11th International Conference on Information and Communication Systems (ICICS), Irbid, Jordan, 7–9 April 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 108–112. [Google Scholar]
  24. Kumrai, T.; Cai, Z.; Maekawa, T.; Hara, T.; Ohara, K.; Murakami, T.; Abeysekera, H. AoA-net: Estimating Angle-of-arrival Using Wi-Fi Channel State Information Based on Deep Neural Networks with Subcarrier Selection. J. Inf. Process. 2024, 32, 863–872. [Google Scholar] [CrossRef]
  25. Li, T.; Wang, H.; Shao, Y.; Niu, Q. Channel state information–based multi-level fingerprinting for indoor localization with deep learning. Int. J. Distrib. Sens. Netw. 2018, 14, 1550147718806719. [Google Scholar] [CrossRef]
  26. Basri, C.; El Khadimi, A. Survey on indoor localization system and recent advances of WIFI fingerprinting technique. In Proceedings of the 2016 5th international conference on multimedia computing and systems (ICMCS), Marrakech, Morocco, 29 September–1 October 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 253–259. [Google Scholar]
  27. Wan, J.; Zhang, Z.; Li, W.; Wang, Z.; Wu, S. CSI-Based MIMO Indoor Positioning Using Attention-Aided Deep Learning. IEEE Commun. Lett. 2024, 28, 330–334. [Google Scholar] [CrossRef]
  28. Wang, S.; Chen, P.H.; Wu, H.Y.; Chiu, C.C. Graph Neural Network-Based WiFi Indoor Localization System with Access Point Selection. IEEE Internet Things J. 2024, 11, 28386–28398. [Google Scholar] [CrossRef]
  29. Zhu, Y.; Zhang, P.; Zhang, J.; Jiang, M.; Pan, D. Semi-Supervised Learning-Enhanced Fingerprint Indoor Positioning by Exploiting an Adapted Mean Teacher Model. Electronics 2024, 13, 298. [Google Scholar] [CrossRef]
  30. Zhu, H.; Cheng, L.; Li, X.; Yuan, H. Neural-network-based localization method for Wi-Fi fingerprint indoor localization. Sensors 2023, 23, 6992. [Google Scholar] [CrossRef] [PubMed]
  31. Liu, X.; Wu, R.; Zhang, H.; Chen, Z.; Liu, Y.; Qiu, T. Graph Temporal Convolution Network-based WiFi Indoor Localization Using Fine-grained CSI Fingerprint. IEEE Sens. J. 2025, 25, 9019–9033. [Google Scholar] [CrossRef]
  32. Sala, F.; De Sa, C.; Gu, A.; Ré, C. Representation tradeoffs for hyperbolic embeddings. In Proceedings of the International Conference on Machine Learning, PMLR, Stockholm, Sweden, 10–15 July 2018; pp. 4460–4469. [Google Scholar]
  33. Chami, I.; Ying, Z.; Ré, C.; Leskovec, J. Hyperbolic graph convolutional neural networks. In Proceedings of the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, BC, Canada, 8–14 December 2019; Volume 32. [Google Scholar]
Figure 1. Flowchart of the proposed positioning algorithm. The workflow consists of four sequential stages: signal preprocessing, feature extraction, manifold modeling, and adaptive positioning.
Figure 1. Flowchart of the proposed positioning algorithm. The workflow consists of four sequential stages: signal preprocessing, feature extraction, manifold modeling, and adaptive positioning.
Sensors 25 04125 g001
Figure 2. TP-Link WDR4310 router with OpenWrt firmware.
Figure 2. TP-Link WDR4310 router with OpenWrt firmware.
Sensors 25 04125 g002
Figure 3. Signal preprocessing results: (Top) Raw CSI amplitude measurements showing multipath-induced fluctuations. (Bottom) Denoised signal after wavelet decomposition and skewness normalization.
Figure 3. Signal preprocessing results: (Top) Raw CSI amplitude measurements showing multipath-induced fluctuations. (Bottom) Denoised signal after wavelet decomposition and skewness normalization.
Sensors 25 04125 g003
Figure 4. The experimental setup of the empty environment.
Figure 4. The experimental setup of the empty environment.
Sensors 25 04125 g004
Figure 5. The experimental setup of the office environment.
Figure 5. The experimental setup of the office environment.
Sensors 25 04125 g005
Figure 6. The experimental setup of the corridor environment.
Figure 6. The experimental setup of the corridor environment.
Sensors 25 04125 g006
Figure 7. Variation in average positioning error with number of access points.
Figure 7. Variation in average positioning error with number of access points.
Sensors 25 04125 g007
Figure 8. The CDF of positioning errors in the empty room.
Figure 8. The CDF of positioning errors in the empty room.
Sensors 25 04125 g008
Figure 9. The CDF of positioning errors in the office room.
Figure 9. The CDF of positioning errors in the office room.
Sensors 25 04125 g009
Figure 10. The CDF of positioning errors in the corridor environment.
Figure 10. The CDF of positioning errors in the corridor environment.
Sensors 25 04125 g010
Figure 11. The error distribution of the proposed method in 3 different scenarios.
Figure 11. The error distribution of the proposed method in 3 different scenarios.
Sensors 25 04125 g011
Table 1. The mean positioning errors and corresponding standard deviations across different positioning methods (empty room).
Table 1. The mean positioning errors and corresponding standard deviations across different positioning methods (empty room).
Positioning MethodError Metrics
Mean Error (m)Standard Deviation (m)90% Acc. (m)
Proposed Method0.76080.17611.3844
DeepFi1.11570.39331.9518
FIFS1.45570.72902.6652
GCNN0.83500.21401.5532
Gridloc0.87460.28011.6064
ACPNet0.90340.21791.5446
Table 2. The mean positioning errors and corresponding standard deviations across different positioning methods (office room).
Table 2. The mean positioning errors and corresponding standard deviations across different positioning methods (office room).
Positioning MethodError Metrics
Mean Error (m)Standard Deviation (m)90% Acc. (m)
Proposed Method0.89430.20261.5667
DeepFi1.26450.47252.2875
FIFS1.95011.30353.7244
GCNN1.03230.26671.6978
Gridloc0.95380.25141.6767
ACPNet1.15170.36731.9691
Table 3. The mean positioning errors and corresponding standard deviations across different positioning methods (corridor environment).
Table 3. The mean positioning errors and corresponding standard deviations across different positioning methods (corridor environment).
Positioning MethodError Metrics
Mean Error (m)Standard Deviation (m)90% Acc. (m)
Proposed Method1.04330.38001.9677
DeepFi1.52290.78233.0115
FIFS2.09451.37373.5409
GCNN1.40090.59422.3599
Gridloc1.27160.53522.3692
ACPNet1.32350.49102.1994
Table 4. Comparison of the proposed method’s mean positioning error with the theoretical CRLB.
Table 4. Comparison of the proposed method’s mean positioning error with the theoretical CRLB.
EnvironmentProposed Method’s Mean Error (m)Average CRLB (m)
Empty Room0.76080.6512
Office Room0.89430.7325
Corridor1.04330.8561
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, W.; Liu, M. Hyperbolic Spatial Covariance Modeling with Adaptive Signal Filtering for Robust Wi-Fi Indoor Positioning. Sensors 2025, 25, 4125. https://doi.org/10.3390/s25134125

AMA Style

Wang W, Liu M. Hyperbolic Spatial Covariance Modeling with Adaptive Signal Filtering for Robust Wi-Fi Indoor Positioning. Sensors. 2025; 25(13):4125. https://doi.org/10.3390/s25134125

Chicago/Turabian Style

Wang, Wenxu, and Mingxiang Liu. 2025. "Hyperbolic Spatial Covariance Modeling with Adaptive Signal Filtering for Robust Wi-Fi Indoor Positioning" Sensors 25, no. 13: 4125. https://doi.org/10.3390/s25134125

APA Style

Wang, W., & Liu, M. (2025). Hyperbolic Spatial Covariance Modeling with Adaptive Signal Filtering for Robust Wi-Fi Indoor Positioning. Sensors, 25(13), 4125. https://doi.org/10.3390/s25134125

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop