Indoor UAV 3D Localization Using 5G CSI Fingerprinting

Shahraki, Mohsen; Elamin, Ahmed; El-Rabbany, Ahmed

doi:10.3390/ijgi15010024

Open AccessArticle

Indoor UAV 3D Localization Using 5G CSI Fingerprinting

by

Mohsen Shahraki

^*,

Ahmed Elamin

and

Ahmed El-Rabbany

Department of Civil Engineering, Faculty of Engineering and Architectural Science, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada

^*

Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2026, 15(1), 24; https://doi.org/10.3390/ijgi15010024

Submission received: 4 November 2025 / Revised: 31 December 2025 / Accepted: 1 January 2026 / Published: 5 January 2026

(This article belongs to the Special Issue Indoor Mobile Mapping and Location-Based Knowledge Services)

Download

Browse Figures

Versions Notes

Abstract

Fifth-generation (5G) wireless networks have been widely deployed across various applications, including indoor positioning. This paper presents a model for 3D indoor localization of an unmanned aerial vehicle (UAV) using 5G millimeter-wave technology. Wireless InSite software is used to simulate a real-world environment and extract channel state information from multiple 5G next-generation NodeBs (gNBs), which is then used to generate channel frequency response (CFR) images. These images are employed in a fingerprinting method, where a deep convolutional neural network is trained for accurate position prediction. The model is trained across multiple scenarios involving changes in the number of gNBs, receiver positions, and spacing. In all scenarios, the model is tested using a UAV flying along a trajectory at variable speed. It is shown that a mean positioning error (MPE) of 0.36 m in 2D and 0.43 m in 3D is achieved when twelve gNBs with receivers spaced at 0.25 m are used. In addition, the corresponding root mean square error (RMSE) values of 0.32 m (2D) and 0.33 m (3D) further confirm the stability of the localization performance by indicating a low dispersion of positioning errors. This demonstrates that high positioning accuracy is feasible, even when synchronization errors and hardware imperfections exist.

Keywords:

three-axis indoor positioning; unmanned aerial vehicle (UAV); 5G signal; deep convolutional neural network (DCNN); channel state information (CSI); fingerprinting

1. Introduction

Localization of unmanned aerial vehicles (UAVs) in environments such as indoor spaces, urban canyons, and tunnels poses a major challenge due to the unavailability or unreliability of global navigation satellite systems (GNSS) [1]. In these settings, precise positioning is essential to ensure safe navigation and autonomous flight, motivating the development of alternative localization methods. Consequently, a wide range of technologies and approaches have been proposed to address GNSS-denied positioning scenarios [2,3]. However, directly applying general indoor positioning solutions to UAV platforms remains nontrivial, as UAV operation is characterized by full three-dimensional mobility, highly time-varying signal propagation (dynamic channel conditions), and stringent size, weight, and power (SWaP) constraints. These factors introduce distinct technical challenges that require focused investigation and tailored localization frameworks [4].

Indoor localization systems rely on technologies such as 5G, Ultra-Wideband (UWB), Wi-Fi, Bluetooth, and ZigBee. UWB has been widely adopted for indoor UAV localization by deploying fixed anchors that provide range or time-of-flight measurements to estimate the UAV’s position with high accuracy. While UWB can offer high-precision positioning, it has significant drawbacks, such as the need for dedicated infrastructure, higher power consumption, and increased cost, which can limit its deployment in large-scale or budget-constrained scenarios. For UAVs, the requirement for a pre-calibrated, fixed UWB anchor network severely limits operational flexibility and scalability in large or temporary deployment zones, despite its demonstrated effectiveness for indoor aerial robot localization [5,6].

Wi-Fi-based localization has been explored for indoor UAV positioning primarily through fingerprinting approaches that match the received signal strength or channel characteristics to pre-collected radio maps [7]. While this approach can leverage existing network infrastructure and is therefore cost-effective, its positioning accuracy is often degraded by multipath propagation and temporal signal fluctuations, especially in dynamic indoor environments [8]. Similarly, Bluetooth-based localization has been investigated for UAVs using BLE beacon deployments, where proximity or fingerprinting techniques are applied to estimate position. However, the short communication range and susceptibility to signal interference and reflection limit its achievable accuracy, particularly for precise 3D UAV navigation [9].

ZigBee, known for low power consumption and suitability for mesh networks, has also been used for UAV indoor localization [10]; however, its limited range and low positioning precision make it less suitable for high-accuracy UAV localization applications. These limitations are amplified for UAVs, which operate not only along a floor plane but throughout a volumetric space where radio propagation varies rapidly with altitude and motion, leading to frequent non-line-of-sight (NLOS) conditions and stronger multipath effects. Therefore, selecting an appropriate technology for indoor UAV localization must jointly consider infrastructure requirements, scalability, and robustness under 3D mobility and time-varying propagation conditions [2,11,12,13,14,15].

Using 5G new radio (NR) signals for indoor positioning takes advantage of the advanced features of 5G networks, such as high-frequency millimeter-wave (mmWave) signals, large bandwidth, and low latency [12], to achieve accurate and reliable localization in GNSS challenging environments. The dense deployment of small cells supported by 5G technology provides numerous reference points, enhancing positioning accuracy [16]. Furthermore, the higher frequency bands of 5G allow for finer spatial resolution [17], which is particularly useful for distinguishing between closely spaced receivers in an indoor environment. For UAVs, the potential to leverage existing or future 5G communication infrastructure for dual-purpose positioning is a significant advantage, potentially reducing the need for dedicated sensors. The three primary methods for indoor positioning using 5G signals are triangulation, which determines location based on angles; multilateration, which uses time differences of signal arrival; and fingerprint-based methods, which match pre-collected signal characteristics to estimate a device’s position.

Techniques like triangulation, including angle of departure (AoD) and angle of arrival (AoA), and multilateration methods, such as time-difference-of-arrival (TDoA) and time-of-arrival (ToA), can face significant challenges in indoor environments [18,19]. The suboptimal performance of these methods indoors is primarily due to factors, such as signal obstruction caused by walls, furniture, and other obstacles, which can block or distort the signals. Additionally, these techniques often require precise hardware synchronization across multiple nodes, which can be difficult to achieve in practice [20]. Complications also arise from multipath effects, where signals reflect off surfaces and reach the receiver multiple times, as well as from non-line-of-sight (NLOS) conditions, where no direct path exists between the transmitter and receiver [21]. In UAV applications, these challenges are exacerbated. The UAV’s movement through 3D space creates rapidly changing and often severe NLOS and multipath conditions, especially relative to ceiling-mounted or elevated infrastructure. Maintaining synchronization with a fast-moving aerial node adds another layer of complexity. These challenges can result in inaccuracies when calculating distances or angles, diminishing the reliability of these methods for indoor localization [22], particularly for high-precision UAV navigation [19].

Fingerprint-based localization techniques offer a viable solution for indoor positioning, addressing the challenges posed by NLOS [23,24,25]. These methods utilize a pre-existing database comprising actual location coordinates and either channel state information (CSI) or received signal strength indicator (RSSI) readings. Location estimation is achieved by matching the measured RSSI or CSI values against this database. Despite its utility, RSSI-based localization tends to underperform compared to CSI-based approaches due to inherent limitations [26]. RSSI, derived from radio frequency signals at a packet level, presents difficulties in obtaining precise measurements. In standard indoor settings, the variance of RSSIs recorded from a stationary receiver over a one-minute period can reach up to 5 dB, highlighting the challenge of achieving consistent and accurate readings [26]. Furthermore, RSSI is susceptible to the multipath effect, causing fluctuations in signal strength, which compromises its effectiveness for precise localization. Conversely, CSI offers a detailed perspective on the signal’s condition at the subcarrier level, providing insights into the signal’s behavior, especially concerning its multipath propagation characteristics. This detailed information includes phase and amplitude changes across different subcarriers, enabling a more accurate depiction of the signal environment and thereby improving the accuracy of location estimates [24,25,27]. The robustness of CSI to multipath makes it a particularly attractive candidate for UAV localization in cluttered indoor environments.

Positioning using the CSI feature can be performed either from the transmitter side, such as through the network’s access points (APs), or from the user side, such as with user equipment (UE). When positioning is conducted from the transmitter side, APs are responsible for determining the location of the UE. However, this method comes with several drawbacks. For example, it requires a dense deployment of APs to achieve high accuracy, which is often costly and impractical. Also, precise synchronization between multiple APs is essential for accurate positioning, and even minor synchronization errors can lead to significant inaccuracies.

On the other hand, localizing devices at the UE side benefits from APs transmitting specialized positioning reference signals. These signals allow UEs to calculate CSI, which can improve localization accuracy when data from multiple APs are combined [28,29]. A UE-side (UAV-side) approach is often more practical for autonomous UAVs, as it grants the platform direct control over its position estimate. In scenarios where AP networks are synchronized perfectly, the relative phase information derived from the CSI across different APs is crucial for enhancing positioning accuracy. However, when synchronization among APs is suboptimal, the importance of relative phases diminishes, and they may be excluded during the aggregation of multi-AP CSI. Moreover, it is essential to note that the CSI, whether in the time or frequency domain, produced by a UE fundamentally relies on the UE’s ability to synchronize with the signals received from an AP [30]. This dependency on synchronization highlights a critical challenge in achieving accurate positioning on the UE side. Expanding on the foundational work [31,32,33], recent advancements in indoor positioning have begun to explore the utilization of commercial 5G NR CSI for localization purposes. The paper by [34] proposes a hybrid indoor positioning model that combines convolutional neural networks (CNNs) with a path-loss model. The model leverages multivariable fingerprints, including signal strength and environmental factors, to improve the accuracy of 2D indoor positioning in complex environments. The authors reported an average positioning error of 1.47 m, achieving a 9.26% accuracy improvement compared to the CNN-only approach [34]. The method [30] outperforms those utilizing detailed frequency characteristics from CSI, significantly improving positioning accuracy in indoor environments. By processing frequency-selective CSI, the method achieved an average positioning error of 0.60 m for indoor 2D localization.

Recently, the research in [35] tackled the challenges associated with 2D indoor positioning using standalone 5G next-generation NodeBs (gNBs). They proposed a fingerprinting approach that leveraged the multi-beam capability of 5G downlink signals. This methodology uses an “Extreme Learning Machine” to reduce dimensionality, enhancing both the accuracy and speed of indoor positioning. In parallel, ref. [36] introduced the iPos-5G system, which was evaluated in indoor office scenarios using commercial 5G CSI. Their results demonstrated that 94.45% of test samples achieved positioning errors below 4.01 m—corresponding to the 2σ confidence interval—based on a cumulative distribution function (CDF) analysis of localization errors. These findings underscore the practical potential of CSI-based positioning using a single gNB. Building upon these foundations, ref. [37] further extended the applicability of commercial 5G technologies for robust and scalable indoor positioning solutions. Complementing these findings, ref. [38] reported that 67% of 2D positioning errors were within 1 m, affirming the growing reliability of such methods. These studies highlight the potential of 5G NR CSI and multi-beam signals for providing precise and effective indoor 2D positioning solutions. However, these methods are not well-suited for UAV applications that involve flight at varying altitudes within an environment. Additionally, most of them have not been evaluated in large-scale indoor spaces, underscoring the need for continued research and innovation in this rapidly evolving field.

In this work, we propose a 3D indoor localization framework for UAVs using commercial 5G millimeter-wave (mmWave) signals. The proposed approach formulates UAV positioning as a CSI-based fingerprinting problem and employs a deep convolutional neural network (DCNN) that relies on CSI amplitude features, enabling robust localization under imperfect synchronization and device-level variability in practical 5G deployments [39]. The main contributions of this paper are as follows:

We introduce a 5G CSI-based 3D indoor localization system for UAVs, explicitly addressing altitude variation and full 3D mobility in GNSS-denied environments.
We develop a DCNN-based CSI fingerprinting method that exploits amplitude information, avoiding reliance on phase coherence and improving robustness to synchronization mismatches.
We provide a systematic 3D evaluation using a realistic indoor UAV flight scenario, analyzing the impact of gNB count and training-point distribution on localization accuracy.

The remainder of the paper is organized as follows: Section 2 presents the proposed method, Section 3 describes the simulation setup, Section 4 reports the results, Section 5 discusses the findings, and Section 6 concludes the paper.

2. Proposed Approach

This study investigated the positioning of UE devices (UAV trajectory) in a transmission environment consisting of multiple gNBs (base stations) where each gNB is a multi-beam antenna and the UE is equipped with a single antenna. The localization process, which determines the position of the UE, is performed directly on the UE side, leveraging the signals received from the surrounding gNBs. The methodology consists of multiple steps, including channel modeling, CSI feature extraction, generating a channel frequency response (CFR) image, and finally, developing, training, and testing the DCNN model. In channel modeling, the channel data are initially denoted in the time domain and then converted into the frequency domain using the Fourier transform (FT). This conversion enables the extraction of CSI, represented by the complex H-matrix, which captures key characteristics of the wireless channel, including amplitude and phase information across different subcarriers and antennas. Relevant CSI features are subsequently extracted from this matrix to effectively characterize the signal propagation environment. These CSI features are then used to construct CFR images, providing a visual representation of the channel’s frequency response over the spatial domain. These images serve as inputs to DCNN architectures, supporting accurate positioning and localization tasks.

2.1. Channel Model

In a typical 5G network, a gNB is responsible for managing radio communications with UEs within its designated service area, known as a cell. Each gNB is equipped with M antennas to facilitate communication. The position of a UE, denoted as u, is defined by

x_{u} \in R^{D}

, where D can be either 3 or 2, indicating three-dimensional or two-dimensional space. Similarly, the position of a gNB, represented as t, is given by

x_{t} \in R^{D}

. It is taken for granted that the UE is in sync with one gNB, without considering the need for synchronization at the network level across the gNBs.

The channel, donated as c, between a gNB and a UE is modeled as comprising P multipath components, where each component is characterized by a complex path gain

g_{p} \in c

and a propagation delay

τ_{p} \in [0, τ_{m a x}]

. Here,

g_{p}

represents the amplitude attenuation and phase shift experienced by the signal along the p-th path, while

τ_{p}

denotes the time delay of the p-th multipath component, with being the maximum delay spread.

The continuous-time baseband CIR between the UE and a single antenna of the gNB, accounting for these multipath components, is given by [30]:

c (τ) = \sum_{p = 1}^{P} g_{p} δ (τ - τ_{p}),

(1)

where

c (τ)

represents the CIR, which describes the channel’s response to an impulse at the time

τ

, capturing the effects of all multipath components, and

g_{p}

represents the complex gain of the p-th multipath component. The summation is over all P multipath components, where each term

g_{p} δ (τ - τ_{p})

models the contribution of the p-th path. The Dirac delta function

δ (τ - τ_{p})

represents an impulse arriving at the receiver with a delay of

τ_{p}

. This model effectively captures the impact of multipath propagation in wireless communication, where the transmitted signal reaches the receiver through multiple paths, each with its distinct gain and delay. In contexts involving the Dirac delta function, the CFR is derived from the CIR by transforming it from the time domain to the frequency domain. This transformation is performed using the FT, which allows us to analyze how the different frequency components of a signal are affected by the channel. The CFR is denoted as:

C (f) = \int e^{- j 2 π f τ} c (τ),

(2)

It is assumed that 64 quadrature amplitude modulation (QAM) is used in the creation and decoding of signals within orthogonal frequency division multiplexing (OFDM). In this setup, the gNB transmits pilot signals to the UE across N evenly spaced subcarriers, where N represents the total number of subcarriers, and each subcarrier has a spacing of Δf. These subcarriers are indexed by:

n = - \frac{N - 1}{2}, \dots, \frac{N - 1}{2},

(3)

where n represents the subcarrier index. Given

f_{0}

as the carrier frequency, each subcarrier n has a center frequency defined as:

f_{n} = f_{0} + n \cdot f_{s},

(4)

The vector

C = [C_{1}, \dots, C_{N}]

which represents the subcarrier channels, is obtained by applying the discrete Fourier transform (DFT) to the discrete-time tapped delay line (TDL) channel model. This TDL channel comprises P delay taps, where each tap corresponds to a different multipath component with a specific delay. To perform the DFT, the discrete-time channel c is extended to length N by appending zeros, ensuring it is compatible with the DFT operation. The resulting channel response at the subcarrier frequency given

f_{n}

by:

C_{N} ≜ C (f_{n}) = \sum_{p = 1}^{P} g_{p} e^{- j 2 π f_{n} τ_{p}},

(5)

where

g_{p}

represents the complex gain of the p-th multipath component, and

e^{- j 2 π f_{n} τ_{p}}

accounts for the phase shift due to the delay at the subcarrier frequency

f_{n}

. This transformation from the time domain to the frequency domain enables the analysis of each subcarrier’s response to the transmitted signal, which is crucial for estimating the CSI in wireless communication systems to ensure reliable data transmission.

In this model, the CFR captures how the multipath propagation environment affects each subcarrier’s signal, considering the impact of different path gains and delays. The use of 64-QAM modulation allows for efficient transmission by mapping data onto multiple amplitude and phase states, while the OFDM scheme divides the transmitted signal into multiple subcarriers, each carrying a portion of the data. By leveraging the pilot signals across these subcarriers, the UE can accurately estimate the channel conditions, facilitating reliable decoding of the transmitted information.

Given hardware constraints and synchronization errors, CSI measurements experience various distortions related to timing, phase, and magnitude [30]. The elements of the estimated CSI vector

\hat{C} = {[{\hat{C}}_{1}, \dots, {\hat{C}}_{N}]}^{T} \in C^{N}

are modeled as:

{\hat{C}}_{N} = e^{- j φ_{n}} C_{n} + n_{ε},

(6)

where

φ_{n}

is the phase distortion caused by synchronization errors and other impairments, and

n_{ε}

represents the estimation noise, assumed to be Gaussian and white, which includes interference effects from carrier frequency offset, in-phase/quadrature (I/Q) imbalance, and phase noise. The term

φ_{n} = 2 π {n f}_{s} α + θ_{n}

captures the frequency component at subcarrier n,

α

represents the scaling factor for frequency offset, and

θ_{n}

denotes additional phase noise. Due to the phase distortions affecting the estimation of the CFR, the CSI cannot be directly used as a feature for localization [30,40,41,42].

2.2. CSI Feature

CSI captures extensive channel characteristics, providing rich scene information, and is highly sensitive to environmental changes. This means that both the movement of people and objects in the scene can cause numerical fluctuations in CSI value [43]. In modern communication systems that utilize multi-beam antenna arrays, CSI provides detailed measurements of the signal attenuation and phase shifts along each propagation path between the transmitter and receiver. These characteristics are typically represented as the CIR in the time domain and the CFR in the frequency domain.

In 5G mmWave multi-beam systems, CSI is represented through a beam-space H-matrix that captures the complex channel responses from each directional beam, following the framework established in [44]. Our system employs twelve horn antennas, each transmitting two orthogonally steered beams (X- and Y-axis) as described in Section 3.2, received by a single antenna [45]. The H-matrix is constructed through sequential pilot transmissions using time-division scheduling, where each beam’s contribution is isolated and estimated via least-squares methods similar to [46]. The resulting matrix provides complete channel characterization while leveraging the directional advantages of horn antennas for enhanced signal quality and spatial resolution. This beam-space representation serves as the foundation for precise localization in our system. Once the per-antenna responses are obtained, they are assembled to reconstruct the full H-matrix, representing the channel state at a given time-frequency resource. Based on these estimated responses, the H-matrix can be formally expressed as:

Y_{u}^{a} = H_{u}^{a} X_{u}^{a} + N,

(7)

where

Y_{u}^{a}

is the received signal,

X_{u}^{a}

is the transmitted signal, N is noise, and

H_{u}^{a}

is the H-matrix in the frequency domain, containing both amplitude and phase information [47,48]. The indices a and u correspond to the transmitting antenna and receiving user, respectively. This can be represented by the following equation:

H_{u}^{a} = |H_{u}^{a}| e^{∠ H_{u}^{a}},

(8)

where

|H_{u}^{a}|

and

e^{∠ H_{u}^{a}}

represent the amplitude and phase response, respectively [39]. The amplitude of CSI serves as a unique identifier for indoor positioning in 5G mmWave environments. It fluctuates based on individual movements, affecting signal reception. While the phase offers more detail, it is cyclic and requires calibration. Also, due to fading and frequency deviations, the phase is more susceptible to noise [24]. This study focuses exclusively on the amplitude aspect of CSI, using a setup with 12 multi-beam antennas.

2.3. CFR Image

CFR images are visual representations created by mapping H-matrix amplitude data into a matrix format, where each pixel represents a specific amplitude measurement at a particular point and transmit antenna. This visual representation captures unique patterns and features associated with different locations while maintaining relative stability for CFR at the same location, making it a valuable tool for enhancing positioning accuracy [39]. By comparing CFR images obtained during the testing phase with those stored in the database from the training phase, the system can accurately determine the position of a device or object within an environment.

The H-matrix represents the amplitude of CFR (Equation (9)), where measurements from the S subcarrier are collected at R distinct receiver locations (i.e., sampling points in space). These measurements form an R × S matrix, which is then used to generate the CFR images that serve as input to the positioning system. By leveraging these images, the model can effectively learn spatial signal patterns and estimate the receiver’s location with high accuracy.

H = [\begin{matrix} h_{11} & h_{12} & \dots & h_{1 S} \\ h_{21} & h_{22} & \dots & h_{2 S} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ h_{R 1} & h_{R 2} & \dots & h_{R S} \end{matrix}],

(9)

In this context,

h_{R S}

denotes the CFR amplitude value from the S-th subcarrier in the R-th point. Inspired by the successful use of dictionary learning techniques in image classification, we adopted a similar approach to visualize H-matrix data. To improve the clarity of these visual representations, we first standardized the data by calculating the mean amplitude at each point across all images.

{\hat{h}}_{11} = \frac{h_{11} - m i n (H)}{m a x (H) - m i n (H)},

(10)

Subsequently, the standardized CSI image is created.

\hat{H} = [\begin{matrix} {\hat{h}}_{11} & {\hat{h}}_{12} & \dots & {\hat{h}}_{1 S} \\ {\hat{h}}_{21} & {\hat{h}}_{22} & \dots & {\hat{h}}_{2 S} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\hat{h}}_{R 1} & {\hat{h}}_{R 2} & \dots & {\hat{h}}_{R S} \end{matrix}],

(11)

In a multi-beam system, the observations of high-dimensional CFRs are condensed into two-dimensional CFR images for every antenna pair involved in transmission and reception. These images form the basis of our dataset construction. In Figure 1, a sample of the CFR image is shown, where the color gradient, ranging from blue (indicating low amplitudes) to yellow (representing high amplitudes), visualizes the amplitude values. Such representations highlight the channel’s frequency-dependent characteristics, aiding in understanding its behavior for location determination. The generated CFR images are processed by a DCNN model to ascertain positions.

2.4. Positioning Framework

In the context of processing CFR images for estimating UE locations using DCNNs, the process involves several key steps and considerations. These images are then analyzed by a DCNN model to infer the UE’s position. The DCNN model is designed to handle the increased dimensionality and complexity introduced by combining features from various gNBs. This integration not only enhances the accuracy of location predictions but also necessitates careful management of input features to balance performance improvements against increased computational demands. The model’s architecture begins with data preprocessing, including transforming CFR images into a suitable format and normalizing them to optimize training efficiency. Data augmentation techniques, such as noise injection and random rotations are employed to enrich the dataset, thereby making the model’s capability to adapt to new data.

2.4.1. Deep Learning Model

The proposed DCNN model, as shown in Figure 2, is designed to estimate 3D positions from the amplitude of CSI feature data with high precision. Unlike conventional fingerprint-based localization methods that rely on shallow CNNs or fully connected regressors applied to static RSS or CSI fingerprints, the proposed model explicitly exploits hierarchical spatial–frequency structures embedded in multi-beam CFR feature maps. The architecture is constructed using six residual blocks, each consisting of two convolutional layers followed by batch normalization and Leaky rectified linear unit (ReLU) activation functions. Each block incorporates a shortcut connection that adds the input to the output, forming a residual learning path. This design facilitates improved gradient flow during training, enabling the network to be deeper and more effective at capturing complex spatial patterns without suffering from vanishing gradients. Furthermore, attention mechanisms are integrated after each residual block to adaptively emphasize informative frequency components and antenna-beam correlations, which are typically ignored in baseline fingerprinting approaches. By combining deep residual learning with attention-based feature refinement, the proposed network captures both local and global dependencies within CFR feature maps, enabling more discriminative representations for 3D localization.

The proposed DCNN model is designed to estimate 3D positions from CFR feature maps with high precision. The model receives input in the form of CFR feature images that represent spatial–frequency patterns aggregated from multiple antenna beams. These inputs are processed through a sequence of convolutional (C) and downsampling (D) layers that progressively extract high-level spatial representations while reducing spatial resolution. Each convolutional layer is followed by batch normalization and a Leaky ReLU activation function to enhance feature discrimination and maintain stable gradient propagation during training. In contrast to most of the existing fingerprint-based deep learning models that directly regress location from flattened CSI features, the proposed framework preserves the spatial structure of CFR maps throughout the convolutional pipeline, enabling more robust learning under channel variability. The progressive abstraction of features across network stages allows the model to effectively encode both fine-grained frequency responses and broader spatial patterns. The downsampling layers capture multi-scale features, contributing to improved robustness against positional and channel variations.

To improve the model’s generalization capability, the training data are augmented using several techniques. These include adding zero-mean Gaussian noise (σ = 0.01), flipping the feature patterns, applying random circular shifts (rotations), and scaling the features by a random factor between 0.9 and 1.1. This augmentation emulates moderate receiver-side measurement uncertainty, thereby increasing robustness to signal variability. The testing dataset, in contrast, is generated along continuous UAV trajectories and no additional artificial noise is explicitly injected. Instead, realistic variability is inherently introduced through UAV motion, changing propagation geometry, and time-varying multipath effects captured by the ray-based simulation environment. This design allows the localization performance to be evaluated under physically meaningful channel dynamics while isolating the impact of UAV motion and gNB deployment configurations. This distinction between training-time stochastic augmentation and test-time physics-driven variability differentiates the proposed evaluation framework from prior fingerprinting studies that rely on static or randomly sampled test points. The hierarchical convolution–downsampling architecture inherently enhances resilience to noise and distortion by allowing the model to learn spatially invariant representations.

Following the final convolutional block, the extracted features are flattened and passed to fully connected layers that map the learned high-level representations to the final 3D coordinate output. Unlike single-head regression architectures commonly used in fingerprint-based localization, the proposed network employs three parallel fully connected branches to independently estimate the x-, y-, and z-coordinates, enabling axis-specific feature refinement and improved vertical positioning accuracy. This decoupled regression strategy allows the network to better model anisotropic localization characteristics, particularly along the vertical dimension. Dropout (p = 0.5) and Leaky ReLU activations are applied within the fully connected layers to prevent overfitting and ensure stable convergence.

The careful selection and tuning of hyperparameters—including batch size, dropout rate, weight decay, and learning rate—are critical for optimizing model performance. The training process uses the RAdam optimizer [49] with an initial (base) learning rate of 3 × 10⁻⁴, which provides a stable starting point for convergence. A batch size of 64 was selected to balance computational efficiency and memory usage, while a weight decay of 1 × 10⁻⁵ was applied as a regularization mechanism to mitigate overfitting by penalizing large weights. To further improve convergence and generalization, the training employs the OneCycleLR learning rate scheduler [50]. Under this scheme, the learning rate is warm-started from the base value of 3 × 10⁻⁴, increased to a maximum value of 1 × 10⁻² during the initial phase of training, and then gradually decreased following a cyclic schedule for the remainder of the training process. This strategy enables the model to efficiently explore the loss landscape, avoid suboptimal local minima, and achieve stable convergence. Together, these training and optimization strategies ensure high localization accuracy and strong generalization performance across varying signal environments.

2.4.2. Evaluation Metrics

To rigorously assess the efficiency of the proposed indoor positioning technique, both quantitative and qualitative evaluations were conducted to comprehensively demonstrate its accuracy and performance characteristics. The mean positioning error (MPE) was used as a primary metric to quantify the average magnitude of localization errors across all test scenarios. The MPE is defined as [51]:

M P E = \frac{1}{n} \sum_{i = 1}^{n} \sqrt{{({\hat{x}}_{i} - x_{i})}^{2} + {({\hat{y}}_{i} - y_{i})}^{2} + {({\hat{z}}_{i} - z_{i})}^{2}},

(12)

where

{\hat{x}}_{i}

,

{\hat{y}}_{i}

, and

{\hat{z}}_{i}

are the predicted values, and

x_{i}

,

y_{i}

, and

z_{i}

are the ground truth for the x, y, and z coordinates of sample i, respectively, and n is the number of samples.

In addition to MPE, the root mean square error (RMSE) is reported to capture the dispersion of localization errors and penalize larger deviations more strongly. RMSE provides complementary insight into the robustness and stability of the positioning performance and is defined as [52]:

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {[({\hat{x}}_{i} - x_{i})}^{2} + {({\hat{y}}_{i} - y_{i})}^{2} + {({\hat{z}}_{i} - z_{i})}^{2}]},

(13)

Furthermore, CDFs of the positioning error are presented to illustrate the statistical distribution of localization accuracy and to provide percentile-based performance insights (e.g., median and 90th-percentile errors).

3. Environmental Simulation

Our environmental simulation involves developing a simulated office environment with dimensions 20 m × 30 m × 3 m for experimentation. Initially, the building was modeled using Wireless InSite software [53] and customized to imitate a real-world environment. The resulting layout is shown in Figure 3a. Within this structure, twelve transmitters (TXs), represented by green cubes, were strategically positioned at heights ranging from 1.7 m to 2.8 m above the floor. To train the DCNN model, three horizontal plates of receivers (RXs), represented by red cubes, were positioned at heights of 0.15 m, 1.5 m, and 2.5 m above the floor, as depicted in Figure 3b. These receivers were arranged with different spacing intervals between them across three separate scenarios: 0.25 m in the first, 0.5 m in the second, and 1 m in the third. If a nominal receiver (fingerprint) location overlaps with or is occluded by furniture or other indoor objects, it is shifted to the nearest collision-free position to ensure physically feasible UAV placement. Each scenario was tested independently, including the UAV flight trajectory shown in Figure 3c. Detailed descriptions of both TX and RX antennas are provided in Section 3.2. Furthermore, the study evaluated the influence of various building materials on wave propagation using a 28 GHz frequency band. This multifaceted approach ensures a comprehensive analysis of how these materials affect signal behavior within the simulated environment.

To collect testing data, an indoor office environment—consistent with the simulated building modeled in Wireless InSite—was recreated using the Gazebo software package (version 11) [54]. A UAV equipped with multiple onboard sensors was simulated to fly along a predefined trajectory within this environment. The simulated UAV was modeled as an X500-class quadrotor [55], whose physical dimensions and geometry closely match those of commonly used real-world platforms, ensuring realistic size, mass distribution, and flight characteristics. The UAV motion was designed to emulate practical indoor flight behavior, including full three-dimensional movement and variable velocities, thereby capturing realistic UAV dynamics. The ground-truth UAV trajectory was extracted directly from the Gazebo simulation and subsequently used to place the testing receivers within the Wireless InSite environment, ensuring that the receiver configuration accurately reflected a real UAV flight path. The simulated trajectory was sampled at a rate of 30 Hz over a duration of 38.7 s, resulting in 1161 receiver locations, as illustrated in Figure 3a,c. For improved visualization clarity, the ceiling of the simulated environment is hidden in Figure 3.

3.1. Waveform Simulation

In Wireless InSite, waveforms define the time and frequency characteristics of the signal transmitted by the antenna, as illustrated in Figure 4, enabling users to configure parameters that influence the signal’s behavior during propagation. The 28 GHz band is widely used for 5G mmWave simulations due to its practical benefits, including the availability of large bandwidths that support high data rates and its suitability for indoor environments. This band allows for enhanced spectrum reuse through smaller cell sizes, supports advanced antenna technologies, such as multi-beam, and facilitates low-latency communication essential for real-time applications. Additionally, the 28 GHz band is allocated for 5G use in many countries, supported by industry adoption and regulatory frameworks [56].

In our 5G mmWave simulation for indoor positioning, we used 28 GHz signals with a 100 MHz bandwidth and employed a raised cosine waveform, as suggested in [57,58], due to its performance enhancements. This approach reduced the peak-to-average power ratio, minimized out-of-band emissions, and improved bit error rate performance, ensuring precise localization and reliable real-time indoor positioning with reduced latency.

3.2. Antenna Design

The multi-beam transmitter, developed for high-precision 3D localization, consists of 12 horn antennas mounted at varying heights between 1.7 and 2.8 m above the ground. As illustrated in Figure 5a, the system generates two distinct narrow beams: one oriented in the X–Z plane and another in the Y–Z plane, enabling directional coverage along both horizontal axes and enhancing the spatial resolution of CSI-based positioning. Horn antennas were selected for their ability to provide focused, directional beams, which help enhance signal strength and spatial resolution and are crucial for extracting detailed CSI features. The receiver antenna was omnidirectional, as shown in Figure 5b, allowing it to capture signals from all directions and ensure comprehensive coverage of the indoor environment [59]. This combination of horn and omnidirectional antennas enhances the collection and analysis of CSI features for 5G mmWave indoor positioning by leveraging the directional properties of the horn antenna to focus on specific areas and the broader coverage of the omnidirectional antenna for wider signal reception. This complimentary setup improves the quality and diversity of the captured data, as detailed in Table 1, allowing for more comprehensive and accurate signal analysis.

3.3. Material Properties

The electrical properties of the selected materials were analyzed and are summarized in Table 2. The real part of the relative permittivity Re (

ε_{r}

), representing the material’s capacity to store electrical energy in an electric field, and the conductivity σ, which reflects the material’s ability to conduct electric current, were determined using a curve-fitting method. This process was complemented by simple expressions detailed in [59,60]. Understanding these properties is crucial for predicting how materials will behave at specific frequencies, which is essential for applications in telecommunications and materials science.

4. Results

This section presents the research results, along with an analysis and evaluation of the effectiveness of the proposed indoor positioning method. As detailed in Section 3, the data for the tests were generated from a simulation that accurately recreated a UAV’s flight trajectory within an office environment. The simulation conducted using Gazebo [54] software, depicted typical indoor drone flights with altitudes ranging from 0.15 m to 2.3 m and lasted for 38.7 s, producing a total of 1161 data points. It should be noted that for each experimental configuration, a separate DCNN model was trained to ensure alignment with the corresponding input data and unbiased performance evaluation.

4.1. Effect of Spacing

To optimize the model’s predictive accuracy and robustness, experiments were conducted under three horizontal reference-point spacing configurations of 0.25 m, 0.5 m, and 1 m. For each spacing configuration, reference points were arranged on three horizontal layers located at heights of 0.15 m, 1.5 m, and 2.5 m above the floor, enabling the evaluation of 3D localization performance across representative indoor operating heights. The lowest horizontal layer at 0.15 m was selected to reflect realistic near-floor device and sensor heights commonly encountered in indoor environments, while avoiding direct floor contact and extreme near-field propagation effects that may distort mmWave fingerprints. The higher layers corresponded to typical operating heights of indoor mobile platforms and user equipment. These configurations were evaluated independently to assess the influence of reference-point spacing and height on the training process and localization performance. The results of this investigation are summarized in Table 3 and Figure 6, Figure 7 and Figure 8. These figures collectively present the localization performance and include the CDFs of the localization errors across all test points, providing a comprehensive view of the error distribution and overall positioning accuracy.

The positioning accuracy of the proposed model was noticeably influenced by the spacing between receivers. As demonstrated in the results, the optimal spacing of 0.25 m yielded the highest accuracy, with MPE values of 0.43 m in 3D and 0.36 m in 2D. This fine spacing allows the model to capture more precise measurements due to the closer proximity of the receivers, resulting in better model training and reduced uncertainty in the estimated positions. As shown in Figure 6 (1D and 3D plots), the estimated positions closely aligned with the ground truth, with minimal deviation observed across all axes. This alignment underscores the model’s effectiveness in maintaining high accuracy with closer receiver spacing. As the spacing increased to 0.5 m, a noticeable decline in positioning accuracy occurred. The MPE values increased to 0.61 m in 3D and 0.55 m in 2D. Visual analysis of Figure 7 reveals more significant discrepancies between the estimated and ground truth positions, particularly along the Z-axis. This decrease in accuracy suggests that the greater spacing between receivers reduces the precision of position estimation. Further deterioration in positioning accuracy was observed at a spacing of 1.0 m, with the highest MPE values recorded, 1.06 m in 3D and 1.01 m in 2D. The visual results presented in Figure 8 demonstrate a more substantial divergence between the estimated and ground truth positions, not only along the Z-axis but also across the X- and Y-axes. This significant increase in error indicates that the model struggles to maintain precise localization with larger receiver spacing.

The CDF results underscore the critical impact of receiver spacing on localization accuracy. With a tight spacing of 0.25 m, the 3D Euclidean error exhibited superior performance, with 75% of errors below 0.331 m and a median (50%) error of only 0.188 m. The tail of the distribution was also tightly bounded, with 95% of errors falling under 0.706 m. Expanding the spacing to 0.5 m significantly degraded the precision; the median error nearly doubled to 0.295 m, and the 75th percentile error increased to 0.467 m, while the 95th percentile error escalated to 1.158 m, indicating a broader spread of larger errors. This trend culminated with the poorest performance at a 1.0 m spacing, where the median error jumped to 0.750 m and a substantial 25% of errors exceeded 1.125 m, culminating in a 95th percentile error of 2.498 m. This direct comparison demonstrates that denser receiver configurations yield not only lower average errors but, more importantly, a drastically reduced probability of large localization outliers, which is essential for reliable and safe operation in applications like UAV navigation.

4.2. Effect of Number Training Plate

As outlined in Section 3, the training dataset was generated by placing horizontal arrays of receiver antennas—referred to here as “training plates”—at three distinct elevation levels, as shown in Figure 9. These plates were positioned parallel to the ground at fixed heights to capture spatial signal variations at different vertical layers. To evaluate the effect of the number of training plates and their heights, three combinations of these training plates (with 0.5 m RXs spacing) were tested. In each configuration, one of the horizontal plates was omitted in the training process (as shown in Table 4). The objective was to understand how variations in the number and heights of training data points impact the model’s accuracy and reliability. This analysis aimed to identify the optimal settings for maximizing the effectiveness of the model. The results derived from our simulation of each scenario are summarized in Table 4 and illustrated in Figure 10, Figure 11 and Figure 12.

When one of the training plates was omitted, the deterioration in positioning accuracy was more pronounced in the Z-axis (vertical positioning) compared to the x–y plane (horizontal positioning). This is because each plate, especially those at different heights, provides crucial information for estimating the vertical position of the receiver. In 3D positioning, removing data from one of these heights led to a significant loss of elevation information, making it more difficult for the model to accurately estimate the Z-axis location. For instance, removing the plate at 0.15 m resulted in the largest increase in 3D positioning error, from 0.61 m to 1.39 m. Moreover, in 2D positioning, the impact of omitting any single plate was less severe, with MPE values ranging from 0.55 to 0.98 m. This indicates that each height contributes to overall positioning performance. Our investigation underlines the importance of incorporating all three training plates to achieve the lowest possible errors in both 3D and 2D positioning, emphasizing that the presence of data from multiple heights, particularly ones close to the UE, is key to optimizing the model’s overall performance.

The CDF analysis of omitting training plates demonstrates a clear sensitivity of the model, particularly at lower elevations. Omitting the highest training plate resulted in a modest performance degradation, increasing the median (50%) 3D error to 0.689 m and the 95th percentile error to 2.943 m. However, omitting a mid-height plate showed a more pronounced effect on the error tail, raising the 95th percentile to 2.236 m. The most severe impact occurred when the lowest plate was omitted, which caused a fundamental failure in the model’s vertical estimation, as evidenced by a drastically higher median error of 1.382 m. This result highlights that training data from the lower height plane, which most closely correspond to a significant portion of the UAV’s operating trajectory, are essential for establishing a robust baseline for 3D localization.

4.3. Effect of Number of Transmitters

In Section 2, we illustrated the installation of 12 gNBs at varying heights, ranging from 1.5 to 2.5 m above ground level. The strategic variation in antenna height aims to enhance the Z-axis (vertical) position estimation accuracy. Additionally, we adjusted the network density by gradually reducing the number of active gNBs from 12 to 4, as shown in Table 5, to assess how network density changes affect the positioning system’s accuracy. The results of these different configurations are presented in Table 5 and Figure 13, Figure 14 and Figure 15.

This trend suggests that higher network density (with more gNBs) is critical for improving the precision of vertical positioning, while 2D horizontal positioning is less affected by the reduction in gNBs. The visualization of ground truth versus estimated positions also supports this, where discrepancies in the Z-axis were more noticeable when fewer gNBs were used, further emphasizing the importance of maintaining sufficient antenna coverage for accurate 3D localization.

To place the proposed approach in context, a comparative summary of representative 5G-based indoor positioning methods is provided in Table 6. The table reviews state-of-the-art techniques reported in the literature, highlighting their underlying methodologies, employed 5G signal features, positioning dimensionality (2D or 3D), evaluation type (simulation or real-world), and reported localization accuracy. This comparison emphasizes that most existing studies focus on real-world 2D positioning or simulation-based 3D evaluations, whereas the proposed method was evaluated in both 2D and 3D using real-world measurements.

The effect of reducing the number of active transmitters is evident in the progressive degradation of localization precision. With 12 transmitters, the model achieved robust performance, with a median (50%) 3D error of 0.295 m and 95% of errors below 1.158 m. Reducing the count to 8 transmitters increased the median error to 0.405 m and the 95th percentile to 1.245 m. A further reduction to 6 transmitters showed a continued but less severe decline in median error (0.44 m), but the tail (95th percentile of 1.596 m) remained worse than the 12-transmitter baseline. The most significant performance drop occurred with only 4 transmitters, where the median error rose to 0.369 m and, critically, the 95th percentile error jumped to 2.172 m. This demonstrates that while reducing the transmitter count moderately affects the median accuracy, it disproportionately inflates the occurrence of large, outlier errors, highlighting that a dense network is essential for consistent and reliable positioning, especially for safety-critical operations.

5. Discussion

It is shown that the positioning accuracy of the model is closely tied to the configuration of receiver spacing, the number of active gNBs, and the inclusion of training plates at different heights, demonstrating the complex interplay of factors influencing the model’s performance. The research highlights the importance of receiver spacing in fingerprinting-based positioning methods, where both the distance between the RX antennas and the number of antennas significantly impact accuracy. A spacing of 0.25 m captures distinct signal fingerprints, improving the model’s ability to differentiate between locations. However, reducing the spacing further would increase system complexity and data processing requirements, making it impractical. On the other hand, a spacing of 0.5 m is considered optimal as it balances accuracy with practicality, requiring fewer receivers and generating less data while still achieving reliable positioning. Increasing the spacing beyond 0.5 m led to weaker signals, reduced coverage overlaps, and increased positioning errors. Thus, balancing the number of antennas and their spacing is essential for minimizing errors and maintaining high positioning accuracy in dynamic environments.

The CDF curves provide deeper insight into these error characteristics beyond average metrics such as MPE and RMSE. As shown in the CDF plots, tighter receiver spacing and a higher number of active gNBs consistently shifted the 3D Euclidean error curves to the left, indicating not only lower median errors but also a reduced tail of large localization errors. For instance, configurations with 0.25 m and 0.5 m receiver spacing exhibited steep CDF slopes, where more than 75% of the positioning errors fell below sub-meter levels. This steepness reflects a stable and well-conditioned fingerprint space, where CFR images capture sufficiently distinct spatial–frequency patterns. In contrast, larger receiver spacing or reduced gNB density led to flatter CDF curves with heavier tails, revealing a higher probability of outliers, which is particularly critical for UAV navigation and safety-critical applications.

Additionally, the presence of training plates at multiple heights significantly impacts the model’s ability to estimate vertical positioning (Z-axis) accurately. The exclusion of any plate, especially the one at the lowest height, introduces larger errors in 3D positioning, indicating that data from various elevations are essential for maintaining accuracy in vertical estimates. This is primarily due to the fact that most of the UAV trajectory data are collected while the UAV is on the ground, near the lowest plate. As a result, the height of the horizontal training plates relative to the UAV significantly affects the positioning accuracy, particularly along the vertical axis (Z-axis).

Furthermore, the CDF analysis highlights the sensitivity of vertical (Z-axis) accuracy to both antenna density and the availability of multi-height training plates. When training plates at specific elevations were omitted, the corresponding CDF curves for the 3D Euclidean error shifted noticeably to the right, especially at higher percentiles (90–95%), indicating degraded robustness rather than merely increased mean error. This behavior confirms that CFR images implicitly encode elevation-dependent propagation characteristics, and removing height diversity reduces the model’s ability to generalize vertical positioning.

Moreover, the number of active gNBs strongly influences the model’s performance, particularly on the Z-axis. A gradual reduction in the number of gNBs from 12 to 4 resulted in a sharper increase in 3D positioning error compared to the 2D error. This demonstrates the importance of network density for accurate vertical positioning, as fewer gNBs limit the diversity of signals available for triangulating vertical distances. Similarly, reducing the number of active gNBs disproportionately affects the upper tail of the CDF, demonstrating that network sparsity primarily increases worst-case errors. Our evaluation highlights the importance of maintaining a sufficient number of active gNBs, optimizing receiver spacing, and incorporating comprehensive training data from multiple heights to achieve the desired positioning accuracy in both horizontal and vertical dimensions.

Overall, the CDF-based evaluation confirms that the proposed CFR image representation, combined with sufficient antenna density and elevation-aware training data, yields not only high average accuracy but also reliable and bounded localization performance suitable for onboard UAV deployment.

Our proposed method leverages 5G CSI amplitude data to construct CFR images, a novel representation that exploits rich spatial-frequency information from multiple gNBs together with data from three vertical training plates. This configuration enables high-precision, multi-dimensional localization, achieving a 2D MPE of 0.36 m and a 3D MPE of 0.43 m. A comparison with related methods, summarized in Table 6, underscores the effectiveness of this approach. While previous works predominantly focused on 2D positioning—often within simulation environments or with reported accuracies exceeding one meter in realistic settings—our method achieves superior precision while solving the more complex 3D problem. This performance is directly attributed to the CFR image representation and the explicit incorporation of multi-height training data, which is especially critical for vertical accuracy and overcomes key limitations of traditional fingerprinting approaches. These results position the proposed method as a state-of-the-art solution for precise 3D positioning in 5G-enabled indoor environments.

6. Conclusions

A 3D UAV indoor positioning approach was developed employing 5G mmWave technology using DCNN. We simulated an indoor office environment using the Wireless InSite software, imitating a real-world environment with dimensions of 20 m × 30 m × 3 m. This setup involved the integration of twelve multi-beam gNBs equipped with horn-directional transmitter antennas positioned at varying heights above the floor to emit raised cosine waveforms. Omnidirectional antennas were utilized during both the training and testing phases. The H-matrix, representing the relationship between the received and transmitted signals, was extracted and processed to construct the CFR images. These CFR images were used to train the developed DCNN model. The CFR images constructed from the receivers’ data positioned along the UAV’s trajectory were processed using the DCNN as testing data to localize the UAV. The model, utilizing twelve gNBs and three horizontal plates, demonstrated strong accuracy, achieving a 3D MPE of 0.43 m and a 2D MPE of 0.36 m, with RMSE values of 0.32 m (2D) and 0.33 m (3D). It was shown that reducing the number of gNBs decreased the positioning accuracy. Decreasing the spacing between transmitters in the fingerprinting method could improve accuracy, albeit at the cost of increased complexity. The inclusion of training plates at various heights played a crucial role in enhancing the model’s accuracy in estimating vertical positioning (Z-axis). The proposed method demonstrated resilience to synchronization issues and hardware constraints, achieving high localization accuracy without requiring precise timing alignment between gNBs.

Author Contributions

Conceptualization, Mohsen Shahraki, Ahmed Elamin and Ahmed El-Rabbany; Methodology, Mohsen Shahraki, Ahmed Elamin and Ahmed El-Rabbany; Software, Mohsen Shahraki; Validation, Mohsen Shahraki; Formal analysis, Mohsen Shahraki; Investigation, Mohsen Shahraki; Resources, Mohsen Shahraki; Data curation, Mohsen Shahraki; Writing—original draft, Mohsen Shahraki; Writing—review & editing, Mohsen Shahraki, Ahmed Elamin and Ahmed El-Rabbany; Visualization, Mohsen Shahraki; Supervision, Ahmed Elamin and Ahmed El-Rabbany; Project administration, Ahmed El-Rabbany; Funding acquisition, Ahmed El-Rabbany. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by Toronto Metropolitan University and the Natural Sciences and Engineering Research Council of Canada (NSERC) RGPIN-2022-03822.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The system is currently under active development and is being prepared for the next phase of research.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhou, X.; Zhang, X.; Yang, X.; Zhao, J.; Liu, Z.; Shuang, F. Towards UAV Localization in GNSS-Denied Environments: The SatLoc Dataset and a Hierarchical Adaptive Fusion Framework. Remote Sens. 2025, 17, 3048. [Google Scholar] [CrossRef]
Obeidat, H.; Shuaieb, W.; Obeidat, O.; Abd-Alhameed, R. A Review of Indoor Localization Techniques and Wireless Technologies. Wirel. Pers. Commun. 2021, 119, 289–327. [Google Scholar] [CrossRef]
Sandamini, C.; Maduranga, M.W.P.; Tilwari, V.; Yahaya, J.; Qamar, F.; Nguyen, Q.N.; Ibrahim, S.R.A. A Review of Indoor Positioning Systems for UAV Localization with Machine Learning Algorithms. Electronics 2023, 12, 1533. [Google Scholar] [CrossRef]
Teuliere, C.; Marchand, E.; Eck, L. 3-D Model-Based Tracking for UAV Indoor Localization. IEEE Trans. Cybern. 2015, 45, 869–879. [Google Scholar] [CrossRef]
Queralta, J.P.; Almansa, C.M.; Schiano, F.; Floreano, D.; Westerlund, T. UWB-based system for UAV Localization in GNSS-Denied Environments: Characterization and Dataset. In Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Virtual, 25–29 October 2020. [Google Scholar] [CrossRef]
Tiemann, J.; Schweikowski, F.; Wietfeld, C. Design of an UWB indoor-positioning system for UAV navigation in GNSS-denied environments. In Proceedings of the International Conference on Indoor Positioning and Indoor Navigation (IPIN), Banff, AB, Canada, 13–16 October 2015; IEEE: New York, NY, USA, 2015; pp. 1–7. [Google Scholar]
Li, Z.; Zhang, Y. Constrained ESKF for UAV Positioning in Indoor Corridor Environment Based on IMU and WiFi. Sensors 2022, 22, 391. [Google Scholar] [CrossRef]
Liu, F.; Liu, J.; Yin, Y.; Wang, W.; Hu, D.; Chen, P.; Niu, Q. Survey on WiFi-based indoor positioning techniques. IET Commun. 2020, 14, 1372–1383. [Google Scholar] [CrossRef]
Ponte, S.; Ariante, G.; Greco, A.; Del Core, G. Differential Positioning with Bluetooth Low Energy (BLE) Beacons for UAS Indoor Operations: Analysis and Results. Sensors 2024, 24, 7170. [Google Scholar] [CrossRef]
Chiueh, H.-L.; Wu, C.-H.; Xie, Z.-D.; Xu, H.-W. Implementation of UAV Positioning And Navigation System Using Zigbee Communication. In Proceedings of the 4th International Conference on Electronics, Circuits and Information Engineering (ECIE), Hangzhou, China, 24–26 May 2024; IEEE: New York, NY, USA, 2024; pp. 358–362. [Google Scholar]
Adeyeye-Oshin, M.; Sakpere, W.; Mlitwa, N.B.W. A state-of-the-art survey of indoor positioning and navigation systems and technologies. S. Afr. Comput. J. Suid-Afr. Rekenaartydskrif 2017, 29, 145–197. [Google Scholar] [CrossRef]
Keating, R.; Saily, M.; Hulkkonen, J.; Karjalainen, J. Overview of Positioning in 5G New Radio. In Proceedings of the International Symposium on Wireless Communication Systems (ISWCS), Oulu, Finland, 27–30 August 2019; IEEE: New York, NY, USA, 2019; pp. 320–324. [Google Scholar]
Leitch, S.G.; Ahmed, Q.Z.; Abbas, W.B.; Hafeez, M.; Laziridis, P.I.; Sureephong, P.; Alade, T. On Indoor Localization Using WiFi, BLE, UWB, and IMU Technologies. Sensors 2023, 23, 8598. [Google Scholar] [CrossRef] [PubMed]
Alhafnawi, M.; Salameh, H.B.; Masadeh, A.E.; Al-Obiedollah, H.; Ayyash, M.; El-Khazali, R.; Elgala, H. A Survey of Indoor and Outdoor UAV-based Target Tracking Systems: Current Status, Challenges, Technologies, and Future Directions. IEEE Access 2023, 11, 68324–68339. [Google Scholar] [CrossRef]
Farahsari, P.S.; Farahzadi, A.; Rezazadeh, J.; Bagheri, A. A Survey on Indoor Positioning Systems for IoT-Based Applications. IEEE Internet Things J. 2022, 9, 7680–7699. [Google Scholar] [CrossRef]
Mogyorósi, F.; Revisnyei, P.; Pašić, A.; Papp, Z.; Törös, I.; Varga, P.; Pašić, A. Positioning in 5G and 6G Networks—A Survey. Sensors 2022, 22, 4757. [Google Scholar] [CrossRef]
Rajkumar, S. Precision indoor positioning system for 5G. Int. J. Sci. Res. Eng. Dev. 2022, 5, 411–418. [Google Scholar]
Yanying, G.; Lo, A.; Niemegeers, I. A survey of indoor positioning systems for wireless personal networks. IEEE Commun. Surv. Tutor. 2009, 11, 13–32. [Google Scholar] [CrossRef]
Zeng, Y.; Zhang, R.; Lim, T.J. Wireless communications with unmanned aerial vehicles: Opportunities and challenges. IEEE Commun. Mag. 2016, 54, 36–42. [Google Scholar] [CrossRef]
Widdison, E.; Long, D.G. A Review of Linear Multilateration Techniques and Applications. IEEE Access 2024, 12, 26251–26266. [Google Scholar] [CrossRef]
Javed, Y.; Khan, Z.; Asif, S. Evaluating indoor location triangulation using Wi-Fi signals. In Proceedings of the Advances in Internet, Data and Web Technologies: The 7th International Conference on Emerging Internet, Data and Web Technologies (EIDWT-2019), Fujairah Campus, United Arab Emirates, 26–28 February 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 180–186. [Google Scholar]
Huang, J.; Junginger, S.; Liu, H.; Thurow, K. Indoor positioning systems of mobile robots: A review. Robotics 2023, 12, 47. [Google Scholar] [CrossRef]
Fathalizadeh, A.; Moghtadaiee, V.; Alishahi, M. Indoor Location Fingerprinting Privacy: A Comprehensive Survey. arXiv 2024, arXiv:2404.07345. [Google Scholar] [CrossRef]
Li, Q.; Liao, X.; Liu, M.; Valaee, S. Indoor localization based on CSI fingerprint by siamese convolution neural network. IEEE Trans. Veh. Technol. 2021, 70, 12168–12173. [Google Scholar] [CrossRef]
Zhu, X.; Qu, W.; Qiu, T.; Zhao, L.; Atiquzzaman, M.; Wu, D.O. Indoor intelligent fingerprint-based localization: Principles, approaches and challenges. IEEE Commun. Surv. Tutor. 2020, 22, 2634–2657. [Google Scholar] [CrossRef]
Kaishun, W.; Jiang, X.; Youwen, Y.; Dihu, C.; Xiaonan, L.; Ni, L.M. CSI-Based Indoor Localization. IEEE Trans. Parallel Distrib. Syst. 2013, 24, 1300–1309. [Google Scholar] [CrossRef]
Xiang, C.; Zhang, S.; Xu, S.; Chen, X.; Cao, S.; Alexandropoulos, G.C.; Lau, V.K.N. Robust Sub-Meter Level Indoor Localization With a Single WiFi Access Point-Regression Versus Classification. IEEE Access 2019, 7, 146309–146321. [Google Scholar] [CrossRef]
Arnold, M.; Dorner, S.; Cammerer, S.; Ten Brink, S. On deep learning-based massive MIMO indoor user localization. In Proceedings of the 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Kalamata, Greece, 25–28 June 2018; IEEE: New York, NY, USA, 2018; pp. 1–5. [Google Scholar]
Prasad, K.S.V.; Hossain, E.; Bhargava, V.K. Machine learning methods for RSS-based user positioning in distributed massive MIMO. IEEE Trans. Wirel. Commun. 2018, 17, 8402–8417. [Google Scholar] [CrossRef]
Kazemi, P.; Al-Tous, H.; Studer, C.; Tirkkonen, O. User-Side Indoor Localization Using CSI Fingerprinting. In Proceedings of the 23rd International Workshop on Signal Processing Advances in Wireless Communication (SPAWC), Oulu, Finland, 4–6 July 2022; IEEE: New York, NY, USA, 2022; pp. 1–5. [Google Scholar]
Khalilsarai, M.B.; Stefanatos, S.; Wunder, G.; Caire, G. WiFi-based indoor localization via multi-band splicing and phase retrieval. In Proceedings of the 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Cannes, France, 2–5 July 2019; IEEE: New York, NY, USA, 2019; pp. 1–5. [Google Scholar]
Rabiei, P.; Namgoong, W.; Al-Dhahir, N. Reduced-complexity joint baseband compensation of phase noise and I/Q imbalance for MIMO-OFDM systems. IEEE Trans. Wirel. Commun. 2010, 9, 3450–3460. [Google Scholar] [CrossRef]
Tadayon, N.; Rahman, M.T.; Han, S.; Valaee, S.; Yu, W. Decimeter ranging with channel state information. IEEE Trans. Wirel. Commun. 2019, 18, 3453–3468. [Google Scholar] [CrossRef]
Wang, Y.; Zhao, K.; Zheng, Z.; Ji, W.; Huang, S.; Ma, D. Indoor Positioning with CNN and Path-Loss Model Based on Multivariable Fingerprints in 5G Mobile Communication System. Sensors 2022, 22, 3179. [Google Scholar] [CrossRef]
Zhou, X.; Chen, L.; Ruan, Y.; Zhou, T.; Chen, R. IMPos: Indoor Mobile Positioning with 5G Multibeam Signals From a Single Base Station. IEEE Internet Things J. 2024, 11, 20743–20756. [Google Scholar] [CrossRef]
Ruan, Y.; Chen, L.; Zhou, X.; Liu, Z.; Liu, X.; Guo, G.; Chen, R. iPos-5G: Indoor Positioning via Commercial 5G NR CSI. IEEE Internet Things J. 2023, 10, 8718–8733. [Google Scholar] [CrossRef]
Zhou, X.; Chen, L.; Ruan, Y.; Chen, R. Indoor positioning with multi-beam CSI of commercial 5G signals. Urban Inform. 2024, 3, 1. [Google Scholar] [CrossRef]
Dai, Y.; Chen, L.; Zhou, X.; Ruan, Y.; Chen, R. Indoor Localization in Commercial 5G Environment with Single BS. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2024, XLVIII-4-2024, 619–625. [Google Scholar] [CrossRef]
Cheng, Z.; Zhao, D.; Guo, W.; Li, L. A Channel State Information and Geomagnetic Fused Fingerprint Localisation Algorithm Based on Multi-Input Convolutional Neural Network. IET Wirel. Sens. Syst. 2024, 14, 33–46. [Google Scholar] [CrossRef]
Ferrand, P.; Decurninge, A.; Ordoñez, L.G.; Guillaud, M. Triplet-Based Wireless Channel Charting: Architecture and Experiments. arXiv 2021, arXiv:2005.12242. [Google Scholar] [CrossRef]
Gönültaş, E.; Lei, E.; Langerman, J.; Huang, H.; Studer, C. CSI-Based Multi-Antenna and Multi-Point Indoor Positioning Using Probability Fusion. arXiv 2021, arXiv:2009.02798. [Google Scholar] [CrossRef]
Yiwei, Z.; Hongzi, Z.; Hua, X.; Shan, C. Perceiving accurate CSI phases with commodity WiFi devices. In Proceedings of the IEEE INFOCOM 2017—IEEE Conference on Computer Communications, Atlanta, GA, USA, 1–4 May 2017; IEEE: New York, NY, USA, 2017; pp. 1–9. [Google Scholar]
Liu, W.; Wang, X.; Deng, Z. CSI Amplitude Fingerprinting for Indoor Localization with Dictionary Learning. Entropy 2021, 23, 1164. [Google Scholar] [CrossRef] [PubMed]
Alkhateeb, A.; Leus, G.; Heath, R.W. Limited Feedback Hybrid Precoding for Multi-User Millimeter Wave Systems. IEEE Trans. Wirel. Commun. 2015, 14, 6481–6494. [Google Scholar] [CrossRef]
Mohammadian, R.; Amini, A.; Khalaj, B.H. Deterministic Pilot Design for Sparse Channel Estimation in MISO/Multi-User OFDM Systems. IEEE Trans. Wirel. Commun. 2017, 16, 129–140. [Google Scholar] [CrossRef]
Yang, H.; Geng, X.; Xu, H.; Shi, Y. An improved least squares (LS) channel estimation method based on CNN for OFDM systems. Electron. Res. Arch. 2023, 31, 5780–5792. [Google Scholar] [CrossRef]
Kirthiga, S.; Govindankutty, A.; Krishnan, S.; Nair, S.P. Transmit beamforming using singular value decomposition. In Proceedings of the International Conference on Electronics and Communication Systems (ICECS), Coimbatore, India, 13–14 February 2014; IEEE: New York, NY, USA, 2014; pp. 1–4. [Google Scholar]
Lo, T.K.Y. Maximum ratio transmission. IEEE Trans. Commun. 1999, 47, 1458–1461. [Google Scholar] [CrossRef]
Liu, L.; Jiang, H.; He, P.; Chen, W.; Liu, X.; Gao, J.; Han, J. On the Variance of the Adaptive Learning Rate and Beyond. arXiv 2021, arXiv:1908.03265. [Google Scholar] [CrossRef]
Smith, L.N.; Topin, N. Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates. arXiv 2018, arXiv:1708.07120. [Google Scholar] [CrossRef]
Tao, Y.; Yan, R.; Zhao, L. An Effective Fingerprint-Based Indoor Positioning Algorithm Based on Extreme Values. ISPRS Int. J. Geo-Inf. 2022, 11, 81. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, H.; Zheng, J.; Hua, L.; Zhang, R.; Xie, J. Localization and ambiguity resolution algorithm for time-difference fusion of three satellites based on observation filtering. Sci. Rep. 2025, 15, 26060. [Google Scholar] [CrossRef] [PubMed]
Remcom. Wireless InSite^® 3D Wireless Propagation Software; Version 4.0, 2025. Available online: https://www.remcom.com/wireless-insite-propagation-software (accessed on 15 July 2025).
Open Robotics. Gazebo: Robot Simulation Made Easy; Version 11. Available online: https://gazebosim.org/home (accessed on 6 February 2024).
Holybro. X500 V2 Kits; Manufacturer: Holybro; City: Hong Kong; Country: China. Available online: https://holybro.com/products/x500-v2-kits?srsltid=AfmBOopX7sYanV7c5a6hFt3oNbWk1EZxjnKJFpaVXY-ZLWfkhVcuAlW7 (accessed on 15 December 2025).
Sakaguchi, K.; Haustein, T.; Barbarossa, S.; Strinati, E.C.; Clemente, A.; Destino, G.; Pärssinen, A.; Kim, I.; Chung, H.; Kim, J.; et al. Where, When, and How mmWave is Used in 5G and Beyond. IEICE Trans. Electron. 2017, E100.C, 790–808. [Google Scholar] [CrossRef]
Cui, X.; Chen, X.; Song, H.; Li, J. Study on the Ranging Estimation Based on Millimeter-Wave with Raised-Cosine Carrier. In Proceedings of the International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI), Beijing, China, 20–21 October 2016; IEEE: New York, NY, USA, 2016; pp. 223–226. [Google Scholar]
Yadav, P.K.; Dwivedi, V.K.; Maharaj, B.T.; Karwal, V.; Gupta, J.P. Gupta Mobile Communications; Findings in the Area of Mobile Communications Reported from Jaypee Institute of Information Technology (Performance Enhancement of 5G OFDM Systems Using Modified Raised Cosine Power Pulse). Telecommun. Wkly. 2019, 106, 11. [Google Scholar] [CrossRef]
AlAbdullah, A.A.; Ali, N.; Obeidat, H.; Abd-Alhmeed, R.A.; Jones, S. Indoor millimetre-wave propagation channel simulations at 28, 39, 60 and 73 GHz for 5G wireless networks. In Proceedings of the 2017 Internet Technologies and Applications (ITA), Wrexham, UK, 12–15 September 2017; IEEE: New York, NY, USA, 2017; pp. 235–239. [Google Scholar]
Remcom. Wireless InSite Reference Manual Version 3.4.4.; Remcom Inc.: State College, PA, USA, 2023; pp. 99–125. [Google Scholar]
Chen, L.; Zhou, X.; Chen, F.; Yang, L.-L.; Chen, R. Carrier Phase Ranging for Indoor Positioning with 5G NR Signals. IEEE Internet Things J. 2022, 9, 10908–10919. [Google Scholar] [CrossRef]
Al-Habashna, A.; Wainer, G.; Aloqaily, M. Machine learning-based indoor localization and occupancy estimation using 5G ultra-dense networks. Simul. Model. Pract. Theory 2022, 118, 102543. [Google Scholar] [CrossRef]
Huang, S.; Zhao, K.; Zheng, Z.; Ji, W.; Li, T.; Liao, X.; Zhu, F. An Optimized Fingerprinting-Based Indoor Positioning with Kalman Filter and Universal Kriging for 5G Internet of Things. Wirel. Commun. Mob. Comput. 2021, 2021, 9936706. [Google Scholar] [CrossRef]
El Boudani, B.; Kanaris, L.; Kokkinis, A.; Kyriacou, M.; Chrysoulas, C.; Stavrou, S.; Dagiuklas, T. Implementing Deep Learning Techniques in 5G IoT Networks for 3D Indoor Positioning: DELTA (DeEp Learning-Based Co-operaTive Architecture). Sensors 2020, 20, 5495. [Google Scholar] [CrossRef]

Figure 1. A sample of the CFR images was obtained from the simulation for testing data. The x-axis represents subcarriers, and the y-axis represents the number of samples.

Figure 2. Design of the DCNN Architecture. Here, “C” stands for convolutional, and “D” denotes subsampling layers.

Figure 3. Simulated Environment in Wireless Insite. (a) 2D view: Location of TX antennas (green cubes) and distribution of RX antennas for testing data (red cubes); (b) 3D view: Placement of RX antennas (red cubes) with 1 m spacing in three horizontal plates for training data, and TX antennas (green cubes) are hidden; (c) 3D view of the environment: Positions of TX antennas (green cubes) and RX antennas (red cubes) for testing data.

Figure 4. Waveform used in the simulation: (a) in time domain and (b) frequency domain.

Figure 5. Radiation patterns for (a) directional transmitter antenna and (b) omnidirectional receiver antenna.

Figure 6. Comparison of estimated and ground truth positions with 0.25 m RX spacing: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 7. Comparison of estimated and ground truth positions with 0.5 m RX spacing: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 8. Comparison of estimated and ground truth positions with 1.0 m RX spacing: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 9. 2D side view of the area of interest showing the distribution of TX antennas (green cubes) and RX antenna plates for training data (red cubes).

Figure 10. Comparison of estimated and ground truth positions with 0.5 m RX spacing and omitting the 0.15 m plate: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 11. Comparison of estimated and ground truth positions with 0.5 m RX spacing and omitting the 1.5 m plate: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 12. Comparison of estimated and ground truth positions with 0.5 m RX spacing and omitting the 2.5 m plate: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 13. Comparison of estimated and ground truth positions with 0.5 m RX spacing and using 8 gNBs: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 14. Comparison of estimated and ground truth positions with 0.5 m RX spacing and using 6 gNBs: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Figure 15. Comparison of estimated and ground truth positions with 0.5 m RX spacing and using 4 gNBs: (a) 1D axis-specific comparison; (b) 3D spatial comparison; (c) CDF of localization errors across all test points.

Table 1. Properties of the simulated transmitter and receiver antennas.

Properties	Transmitter Antenna	Receiver Antenna
Type	Horn	Omnidirectional
Polarization	Vertical	Vertical
Gain (dBi)	24.5	1.80
E-plane Half Power Bandwidth (degree)	10.9	90.00
Receiver Threshold (dBm)	−140.00	−140.00
VSWR	1.00	1.00
Temperature (K)	293.00	293.00
Input Power (dBm)	30.0	0

Table 2. Parameter values of materials at 28 GHz frequency.

Properties	Conductivity (σ)	Re (ε_r)
Concrete	0.484	5.310
Wood	0.167	1.990
Glass	0.229	6.270
Ceiling board	0.024	1.500
Metal	10 × 10⁷	1.000
Floorboard	0.398	3.660
Brick	0.038	3.750
Chipboard	0.292	2.580

Table 3. Performance of the proposed model under variable spacing in RXs.

	0.25 m Spacing	0.50 m Spacing	1.00 m Spacing
MPE (3D)	0.43 m	0.61 m	1.06 m
MPE (2D)	0.36 m	0.55 m	1.01 m
RMSE (3D)	0.33 m	0.55 m	1.20 m
RMSE (2D)	0.32 m	0.53 m	1.17 m
Max Error (3D)	0.98 m	1.23 m	3.78 m

Table 4. Performance of the proposed model under omitting the training plate.

	Omitting 0.15-m Plate	Omitting 1.5-m Plate	Omitting 2.5-m Plate
MPE (3D)	1.39 m	1.08 m	1.00 m
MPE (2D)	0.97 m	0.98 m	0.95 m
RMSE (3D)	1.47 m	1.37 m	1.45 m
RMSE (2D)	1.17 m	1.27 m	1.37 m
Max Error (3D)	2.92 m	3.35 m	4.92 m

Table 5. Performance of the proposed model under decreasing the number of active gNBs when the spacing between RXs is 0.5 m.

	12 Active gNBs	8 Active gNBs	6 Active gNBs	4 Active gNBs
MPE (3D)	0.61 m	0.70 m	0.74 m	0.84 m
MPE (2D)	0.55 m	0.65 m	0.70 m	0.78 m
RMSE (3D)	0.55 m	0.66 m	0.76 m	1.03 m
RMSE (2D)	0.53 m	0.65 m	0.75 m	1.01 m
Max Error (3D)	1.23 m	1.97 m	2.78 m	3.14 m

Table 6. Comparison of 5G-based indoor positioning methods.

Methods	Dimension	Environment	Reported Accuracy
Wang et al. (CNN) [34]	2D	Realistic	1.47 m (MPE)
Kazemi et al. (CNN) [30]	2D	Simulation	0.60 m (MPE)
Dai et al. (CNN) [38]	2D	Realistic	67% < 1 m (CDF)
iPos (CNN) [36]	2D	Realistic	2.14 m (MAE)
Chen et al. (CNN) [61]	2D	Realistic	85% < 0.95 m (CDF)
Al-Habashna (KNN) [62]	2D	Simulation	5.72 m (MPE)
Al-Habashna (CNN) [62]	2D	Simulation	5.39 m (MPE)
Huang et al. (KNN) [63]	2D	Realistic	1.58 m < MPE < 2.24 m
El Boudani et al. (KNN, SVM) [64]	2D & 3D	Simulation	MPE Worse than 1.6 m
Our Method	2D & 3D	Simulation	0.36 m (MPE, 2D), 0.43 m (MPE, 3D), 0.32 m (RMSE, 2D), 0.33 m (RMSE, 3D)

K-Nearest Neighbors (KNN), Support Vector Machine (SVM).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.

Share and Cite

MDPI and ACS Style

Shahraki, M.; Elamin, A.; El-Rabbany, A. Indoor UAV 3D Localization Using 5G CSI Fingerprinting. ISPRS Int. J. Geo-Inf. 2026, 15, 24. https://doi.org/10.3390/ijgi15010024

AMA Style

Shahraki M, Elamin A, El-Rabbany A. Indoor UAV 3D Localization Using 5G CSI Fingerprinting. ISPRS International Journal of Geo-Information. 2026; 15(1):24. https://doi.org/10.3390/ijgi15010024

Chicago/Turabian Style

Shahraki, Mohsen, Ahmed Elamin, and Ahmed El-Rabbany. 2026. "Indoor UAV 3D Localization Using 5G CSI Fingerprinting" ISPRS International Journal of Geo-Information 15, no. 1: 24. https://doi.org/10.3390/ijgi15010024

APA Style

Shahraki, M., Elamin, A., & El-Rabbany, A. (2026). Indoor UAV 3D Localization Using 5G CSI Fingerprinting. ISPRS International Journal of Geo-Information, 15(1), 24. https://doi.org/10.3390/ijgi15010024

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Indoor UAV 3D Localization Using 5G CSI Fingerprinting

Abstract

1. Introduction

2. Proposed Approach

2.1. Channel Model

2.2. CSI Feature

2.3. CFR Image

2.4. Positioning Framework

2.4.1. Deep Learning Model

2.4.2. Evaluation Metrics

3. Environmental Simulation

3.1. Waveform Simulation

3.2. Antenna Design

3.3. Material Properties

4. Results

4.1. Effect of Spacing

4.2. Effect of Number Training Plate

4.3. Effect of Number of Transmitters

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI