Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas

Yin, Kun; Fang, Shengliang; Chu, Feihuang

doi:10.3390/electronics14214177

Open AccessArticle

Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas

by

Kun Yin

,

Shengliang Fang

^* and

Feihuang Chu

College of Aerospace Information, Space Engineering University, Beijing 101400, China

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(21), 4177; https://doi.org/10.3390/electronics14214177

Submission received: 8 September 2025 / Revised: 21 October 2025 / Accepted: 23 October 2025 / Published: 26 October 2025

(This article belongs to the Special Issue Advances in Cognitive Radio and Cognitive Radio Networks)

Download

Browse Figures

Versions Notes

Abstract

As a fundamental information carrier in Industrial Internet of Things (IIoT), electromagnetic spectrum data presents critical challenges for efficient spectrum sensing and situational awareness in smart industrial cognitive radio systems. Addressing sparse sampling limitations caused by energy-constrained transceiver nodes in Unmanned Aerial Vehicle (UAV) spectrum monitoring, this paper proposes a compressive sensing-based 3D spectrum tensor completion framework for extrapolative reconstruction in obstructed areas (e.g., building occlusions). First, a Sparse Coding Neural Gas (SCNG) algorithm constructs an overcomplete dictionary adaptive to wide-range spectral fluctuations. Subsequently, a Bag of Pursuits-optimized Orthogonal Matching Pursuit (BoP-OOMP) framework enables adaptive key-point sampling through multi-path tree search and temporary orthogonal matrix dimensionality reduction. Finally, a Neural Gas competitive learning strategy leverages intermediate BoP solutions for gradient-weighted dictionary updates, eliminating computational redundancy. Benchmark results demonstrate 43.2% reconstruction error reduction at sampling ratios r ≤ 20% across full-space measurements, while achieving decoupling of highly correlated overlapping subspaces—validating superior estimation accuracy and computational efficiency.

Keywords:

compressive sensing; tensor completion; spectrum monitoring; subspace decoupling; dictionary learning

1. Introduction

With the development of sixth-generation (6G) communication technologies, space–air–ground integrated networks will provide robust support for wireless communications, the Industrial Internet of Things (IIoT), and aeronautical/maritime communications [1,2,3]. Concurrently, the massive communication and ubiquitous connectivity demands of Internet of Things (IoT) devices [4] will impose higher requirements on spectrum access and applications. Spectrum mapping has consequently emerged as a new research focus in spatial science, aiming to establish spectrum maps capable of efficiently managing spectrum resources in heterogeneous electromagnetic environments. These maps project spectrum situational awareness onto corresponding geographical locations, with the resulting visualization termed spectrum cartography. Particularly in urban scenarios, densely clustered high-rise buildings create greater spatial isolation between primary and secondary users. This environment enables the exploitation of spatial spectrum access opportunities unavailable in two-dimensional (2D) planes, making vertical dimension opportunities highly valuable. Given that 2D radio maps cannot characterize height information, three-dimensional (3D) radio maps become an effective method for representing spectrum distribution in urban spaces, especially regarding the deployment process of sensors inside buildings and related research [5].

The prerequisite for direct spectrum mapping is comprehensive sampling data containing location information. A preferred solution involves dividing the target space into numerous volumetric pixels (voxels) to achieve the highest possible resolution for radio maps [6]. Radio map construction methodologies [7,8,9] fall into two categories: model-driven [10] and data-driven [11] approaches. While ray-tracing propagation models enable rapid radio map generation, they require precise alignment between the applied environment and the model; otherwise, significant accuracy degradation occurs [12]. Moreover, substantial deviations in spectrum situation accuracy within the same area would be detrimental for massive IoT device access and omit critical information such as location coordinates, power intensity, and transmission mechanisms in regions of interest [13]. Compared to model-driven methods, data-driven approaches demand highly accurate spectrum data sources, imposing stricter requirements on spectrum information acquisition [14]. However, urban environments feature dense building clusters, leading to areas inaccessible to spectrum measurement equipment [15]. Achieving comprehensive spatial-spectrum information through extensive deployment of fixed spectrum sensors incurs prohibitive costs. Therefore, optimizing sampling locations and accurately estimating spectrum power are particularly crucial for constructing precise 3D radio maps.

Existing research primarily adopts random sampling for spectrum information collection, which can be divided into fixed and mobile modes. Fixed collection relies on static spectrum sensors [16], while mobile collection mainly utilizes monitoring vehicles, handheld devices, and spectrum-sensing unmanned aerial vehicles (UAVs) [17]. Among them, the UAV platform has been proven to be the most effective means for electromagnetic spectrum mapping [18], despite having many constraints itself. For example, Study [11] discusses the constraints of UAV sampling positions on the completeness of spatial information and studies data measurement methods for indoor and outdoor scenarios; Study [19] uses the dueling double deep Q-network (D3QN) to optimize the sampling positions of UAVs in three-dimensional space.

The core of data-driven methods lies in the prediction and estimation of spectrum data in unsampled areas. Breakthroughs in sparse coding and compressive sensing theories have opened up new avenues for radio map construction [20,21,22,23]. For instance, the cooperative spectrum sensing technology in cognitive radio networks (CRNs) [24] shares similar ideas with this paper in terms of utilizing spatial correlation. Recent studies have also explored the application of UAVs in spatial sensing [25] and tensor-based methods for dynamic radio maps [26,27], which are highly relevant to our 3D tensor completion framework. Work [28] investigates a collaborative spectrum sensing optimization strategy in mobile energy-harvesting cognitive radio networks. It analyzes the impact of two scenarios—insufficient energy and insufficient spectrum—on the final decision threshold k and deduces optimization and trade-off methods to maximize network throughput while protecting the primary user’s communication. Therefore, fully exploiting the intrinsic sparsity characteristics of spatial spectrum situations becomes essential, enabling spectrum map construction with limited sampled data through spectrum situation estimation in accessible regions.

Data-driven methods inherently require the prediction and estimation of unsampled spectrum data. Breakthroughs in sparse coding and compressive sensing theories have opened new avenues for radio map construction [20,21,22,23]. For instance, collaborative spectrum sensing techniques in CRNs [24] share a similar philosophy of exploiting spatial correlations. Recent advancements have also explored the integration of UAVs for spatial sensing [25], and tensor-based methods for dynamic radio cartography [26,27], which are highly relevant to our 3D tensor completion framework. However, these methods often do not fully address the severe signal sparsity and high correlation in samples collected from occluded urban canyons, which is the core challenge tackled in this paper.

It is precisely this gap—the ineffectiveness of existing compressive sensing (CS) methods in handling highly correlated 3D subspaces and learning robust dictionaries for complex urban spectra—that our work aims to bridge. Unlike prior arts, our proposed framework jointly addresses the dictionary learning and sparse recovery challenges in a synergistic manner.

In the sparse representation framework, signals can be approximately expressed as linear combinations of atoms in an overcomplete dictionary. When the number of atoms in the overcomplete dictionary far exceeds the signal dimension, robust optimization algorithms such as L1 regularization or iteratively reweighted least squares need to be adopted to ensure that the solution still has sparsity under noise and model errors. At the same time, to alleviate the high-dimensional non-convex optimization problem caused by the combination of propagation models and dictionary learning, the key challenges lie in overcoming the computational complexity caused by high variable dimensions and the local optimal traps caused by non-convex objective functions.

This paper focuses on the accuracy of spatial power recovery and studies the construction of remote spectrum maps in inaccessible urban areas. We propose a synthetic method based on SCNG, OOMP, and BoP, hereinafter referred to as Synthetic BoP-OOMP framework (SBO). The main contributions of this paper are summarized as follows:

A Novel 3D Spectrum Tensor Completion Framework: We propose a compressive sensing-based framework that effectively addresses the challenge of extrapolating spectrum data into spatially inaccessible urban areas. This is achieved by formulating the 3D spectrum power data as a tensor and vectorizing it, thereby transforming the UAV sampling problem into a sparsity-driven signal recovery problem.
An Adaptive and Robust Dictionary Learning Mechanism: We introduce the Sparse Coding Neural Gas (SCNG) algorithm, coupled with a neural gas competitive learning strategy, to construct an overcomplete dictionary that is highly adaptive to wide-range spectral fluctuations. This approach overcomes the limitations of traditional K-SVD, which often yields underestimated values and converges to poor local minima, thereby ensuring more accurate and robust feature representation.
An Enhanced Sampling and Reconstruction Algorithm: We develop a Bag of Pursuits-optimized Orthogonal Matching Pursuit (BoP-OOMP) framework. This innovation tackles the critical issue of suboptimal atom selection in traditional OMP within highly correlated 3D subspaces. By enabling multi-path tree search and leveraging intermediate solutions for gradient-weighted dictionary updates, our method achieves superior reconstruction accuracy and computational efficiency, effectively decoupling overlapping subspaces.

The rest of this article is organized as follows: Section 2 formulates the model and problem. Section 3 details the sampling position optimization framework. Section 4 presents experimental scenarios, simulations, and comparative results. Section 5 concludes with research prospects.

2. System Model

Due to varying channel conditions caused by differences in signal propagation environments and transmitter locations, we establish the signal propagation space illustrated in Figure 1. First, we discretize the entire 3D space into volumetric pixels (voxels), forming a spectrum tensor

χ \in N_{1} \times N_{2} \times N_{3}

where

N_{1}

,

N_{2}

and

N_{3}

represent the grid counts along the x, y and z axes, respectively. To construct an accurate radio map, spectrum measurements would ideally be performed for every voxel. However, our objective is to minimize measurement requirements by exploiting spatial power correlations between accessible and inaccessible regions. This approach enables estimation of missing spectrum situations, ultimately yielding a complete 3D radio map. Our objective is to reconstruct an accurate radio map with minimal measurement requirements. This can be formulated as an optimization problem that aims to find the optimal sampling positions ξ* which minimize the reconstruction error under a given sampling ratio constraint. Mathematically, the problem is formulated as

ξ^{*} = a r g m i n W_{χ}, ξ \subseteq χ,

(1)

s . t . r = \frac{N_{ξ}}{N_{1} \times N_{2} \times N_{3}},

(2)

W_{χ} = \frac{1}{N_{χ}} \sum_{i} {(\frac{|χ_{i}^{R} - χ_{i}|}{χ_{i}})}^{2},

(3)

where ξ denotes the sampling positions of UAVs at different locations.

W_{χ}

represents the relative mean square error (RMSE) between the estimated power intensity

χ_{i}^{R}

and the actual power intensity

χ_{i}

, r indicates the spectrum sampling rate in 3D space,

N_{ξ}

is the number of sampling points, and

N_{ξ}

denotes the total number of spatial points. Exhaustive sampling to obtain spectrum situation data for all

N_{χ}

points is clearly impractical. Therefore, research on reducing sampling points and optimizing their spatial distribution holds considerable practical value. However, exhaustive sampling is impractical. To solve this underdetermined problem, we leverage the inherent sparsity of the spatial spectrum situation. Based on compressive sensing theory, the problem is transformed into recovering a sparse signal s from under-sampled measurements y = Ds + ε, which is detailed in the following sections.

The optimization objective formulated in (1)—minimizing the reconstruction error

W_{ξ}

under a sampling ratio constraint r—is of paramount practical significance for UAV-assisted spectrum mapping. This formulation directly addresses the two most critical constraints in real-world UAV operations: energy and time. Minimizing the number of sampling points (i.e., keeping r low) directly translates to shorter mission durations and lower energy consumption, thereby extending the UAV’s operational range and viability. Simultaneously, minimizing the reconstruction error

W_{ξ}

ensures the reliability and usability of the generated radio map. A highly accurate map empowers cognitive radio networks to make trustworthy dynamic spectrum access decisions, minimizing interference and maximizing utilization. Therefore, our optimization objective strikes a crucial balance between operational feasibility (low sampling cost) and functional efficacy (high map accuracy), which is the cornerstone for deploying practical UAV-based spectrum monitoring systems.

In this work, we adopt a hover-and-sense mobility model for the UAV spectrum monitoring platform. This model is justified for the following reasons: First, the primary challenge we address is the optimal selection of where to sample in 3D space, rather than the continuous trajectory planning. The proposed BoP-OOMP algorithm is designed to output a set of discrete, informative 3D coordinates ξ*. Second, hovering at a fixed position allows for stable and reliable spectrum measurement, minimizing the Doppler effect and measurement noise induced by mobility. The UAV moves between these pre-optimized sampling points at a constant speed v (5 m/s). Once it arrives at a target waypoint in ξ*, it hovers to perform the spectrum sensing task. This model effectively decouples the sampling position optimization problem from the path planning problem, allowing us to focus on the core algorithmic contribution of spatial extrapolation. The resulting set of sampling points ξ* can subsequently serve as input to any standard UAV path planning algorithm to generate a kinematically feasible route.

The selected experimental scenario is an urban block as shown in Figure 1. Let positions i and j denote two distinct grid indices within the experimental area. The unknown spectrum power values across the 3D scene are represented as vector

s = {[s_{u_{1}}, s_{u_{2}}, \dots, s_{u_{n}}]}^{⊤} \in R^{n \times 1}

. The entire three-dimensional space U is discretized as

U = (u_{1}, u_{2}, \dots, u_{n}), (n = N_{1} \times N_{2} \times N_{3}),

(4)

Assuming there exist P inaccessible regions, denoted as

V = V_{1} \cup V_{1} \cup \cdot \cdot \cdot \cup V_{P}, (V_{p} \cap V_{q} = Ø, p \neq q),

(5)

Thus, the set of accessible regions is

C_{V} U

, where V denotes the complement of V with respect to U.

The sparsity of the source vector is

k = {‖s‖}_{0}

, where k represents the number of non-zero elements in the tensor. Given the constraint m ≪ n (underdetermined sampling and estimation), recovering the original signal s from observations

y

constitutes an ill-posed problem.

Minimizing the

l_{o}

-norm promotes sparsity but results in a non-convex optimization problem. Finding the exact solution for sampling matrix D is NP-hard, particularly in high-dimensional spaces. Therefore, we relax the formulation using the

l_{1}

-norm to yield a convex optimization problem as

{s_{i}}^{O O M P} = a r g \underset{s}{m i n} {‖s‖}_{1},

(6)

{‖y - D s‖}^{2} \leq σ^{2},

(7)

Throughout the sampling process, we only collect data from partial regions within the 3D space. The sampled signal is expressed as

y = D s + ε,

(8)

where ε denotes additive white Gaussian noise (AWGN) with power spectrum density

σ^{2}

, and D is the measurement matrix.

Based on the above analysis and the inherent sparsity of y, the objective function for remote compressive spectrum mapping of inaccessible regions is formulated as

\hat{s} = \arg \min_{y^{s}} {‖y - D s‖}_{2}^{2} + λ {‖s‖}_{1},

(9)

where

\hat{s}

denotes the estimated value. To solve the optimization problem above, we employ an enhanced Orthogonal Matching Pursuit (OMP) algorithm. Achieving accurate 3D electromagnetic spectrum situation reconstruction necessitates both rational sampling position selection and precisely designed spectrum recovery algorithms, which are particularly critical for high-fidelity spatial mapping.

To address the challenge of spectrum data collection in urban areas inaccessible to UAVs, we obtain accurate spectrum samples that enable remote mapping power estimation. We propose an enhanced Orthogonal Optimized Matching Pursuit (OOMP) algorithm that transforms spectrum situation recovery into an optimal coefficient search under constraints. This approach achieves spectrum power intensity reconstruction in highly overlapping subspaces with missing data samples. Consequently, we derive the spatial power estimates for inaccessible regions V and the corresponding Relative Mean Square Error (RMSE) as

z_{V}^{R} = {(\hat{s})}_{V},

(10)

E_{V} = \frac{1}{N_{V}} \sum_{u_{i} ∊ V} {‖\frac{z_{u_{i}}^{R} - z_{u_{i}}}{z_{u_{i}}}‖}_{2}^{2},

(11)

where

{(\cdot)}_{V}

denotes the subset of vectors indexed by set V, and

N_{V}

represents the cardinality of V. Here,

z_{u_{i}}^{R}

and

z_{u_{i}}

denote the estimated and original spectrum power intensity at sampling point

u_{i}

, respectively.

3. Remote Compressed Spectrum Mapping Algorithm

3.1. Optimization of Sampling Matrix Based on Improved Orthogonal Matching Pursuit Algorithm

Building upon existing research on spectrum mapping models, this paper proposes an algorithm comprising two key components: sampling location optimization and power estimation. In three-dimensional space, power intensity typically exhibits strong spatial correlation. Performing spectrum measurements for every cubic unit would incur prohibitively high costs in time and energy, making the selection of sampling locations critically important for estimation accuracy. Random sampling risks losing crucial information and significantly increases the difficulty of energy recovery for unsampled areas. Therefore, adaptive selection of sampling locations is necessary to capture data with stronger spatial correlation. We employ UAVs equipped with spectrum sensing devices, hovering at predetermined positions to collect electromagnetic spectrum data.

The BoP-OOMP algorithm operates as follows: For each signal, it initiates multiple pursuit paths (K_user paths). In each path, it iteratively selects the dictionary atom that is most correlated with the current residual, but after orthogonalizing all atoms with respect to the already selected ones (OOMP step). After all paths are completed, the best solution is chosen. This multi-path approach avoids getting trapped in a suboptimal atom selection sequence. We have provided explanations for some symbols in the chapter, as shown in Table 1.

The K-SVD algorithm has limitations in reconstructing the “true” underlying dictionary, often producing reconstructed values that are too small. This fails to meet the requirements of spectrum mapping where values exhibit substantial fluctuations. To address this issue, we introduce the Sparse Coding Neural Gas (SCNG) algorithm [29]. Unlike K-SVD and Method of Optimal Directions (MOD) algorithms, SCNG is specifically used for determining the sparse coefficient approximation.

To solve the aforementioned problems, we have improved Orthogonal Matching Pursuit (OMP) algorithm. Typically, the rows of dictionary D are not pairwise orthogonal. In standard OMP, the column

D_{l_{w i n}}

selected to be added to the set U is not optimal in the context of minimizing the residual after its inclusion. Therefore, to achieve minimal regularized residuals, the enhanced OOMP algorithm requires a selection criterion: it must evaluate all unused columns in dictionary D and identify the column that produces the smallest residual. The overall algorithmic steps are as

Select an appropriate column $D_{l_{w i n}}$ through $D_{l_{w i n}} = \arg \min_{c_{l}, l \notin U} \min_{s} ‖y - D^{U \cup l} s‖$ ;
Set $U = U \cup l_{w i n}$ ;
Solve the optimization problem: $\hat{y} = a r g \underset{a}{m i n} {‖y - D^{U} s‖}_{2}$ ;
Obtain the residual: $ϵ_{i} = y_{i} - D s^{O O M P}$ ;
Repeat Step 1 until k iterations are completed.

The selection criterion of the Orthogonal Orthogonal Matching Pursuit (OOMP) algorithm involves solving

M - |U|

minimization problems for each unused column in the dictionary D. To reduce the computational complexity of this step, the OOMP algorithm is applied to an orthogonalized temporary dictionary R. R is obtained by removing the projection of dictionary D onto the subspace and normalizing the residual

r_{l}

to unit norm.

R_{n}^{j}

denotes a temporary matrix that has been orthogonalized with respect to the columns selected from D by the index set

U_{n}^{j}

. The residual

ϵ_{i}^{U}

is obtained similarly: the projection of

R_{i}

onto the subspace spanned by

D^{U}

is removed. We have

R = (r_{1}, \dots, r_{l}, \dots, r_{m}) = D

and

ϵ_{i}^{U} = y_{i}

.

During each iteration, the algorithm identifies the column

r_{l}

from dictionary R in the context of the most recent residual

ϵ_{i}^{U}

. Following each gradient update, the column vectors in the dictionary are re-normalized to unit norm. A new training sample is then selected, the coefficients are re-determined, and the next update can be performed. This process is straightforward and computationally efficient, as it entirely avoids the need for singular value decomposition (SVD) or matrix inversion. Furthermore, it utilizes only one sample per learning step, making it suitable even for online learning scenarios. It also eliminates the requirement to store large volumes of training samples.

Due to the broad spectrum coverage in open space, a subset of the sampled data is inherently correlated. The sampled data constitutes a sparse representation of subspaces within the ambient space, resulting in high correlation among the spectrum samples at the sampling points. Basis vectors are shared among these subspaces. We select training data generated by linearly combining a set of dictionary elements with the sampled signals.

Numerous approximation methods have been proposed to address the problem of obtaining the optimal coefficients

s_{i}

. To solve this, we employ a greedy algorithm. Specifically, under the sparsity constraint of a given signal

y_{i}

, this requires that the mutual coherence of the dictionary D be sufficiently small. The mutual coherence of D is defined as

μ (D) = \begin{matrix} m a x \\ 1 \leq i, j \leq M, i \neq j \end{matrix} \frac{|D_{i}^{T} D_{j}|}{{‖D_{i}‖}_{2} {‖D_{j}‖}_{2}} .

(12)

To further enhance the performance of the OOMP algorithm, it is necessary not only to find the optimal coefficients but also to define the optimal approximation

K_{u s e r}

for these coefficients. With the initial conditions

U_{n}^{j} = Ø, R_{n}^{j} = (r_{1}^{0, j}, \dots, r_{M}^{0, j})

, and

ϵ_{0}^{j} = y

, the set

U_{n}^{j}

contains the indices of columns from D that have already been selected up to the n-th iteration during the j-th pursuit for y.

R_{n}^{j}

denotes an orthogonalized temporary matrix, which has been orthogonalized with respect to the columns in D indexed by

U_{n}^{j}

.

r_{l}^{n, j}

represents the l-th column of

R_{n}^{j}

.

ϵ_{n}^{j}

is the residual for the target y after n iterations within the j-th pursuit. The detailed algorithm pseudocode is presented in Algorithm 1.

Algorithm 1 Enhanced Orthogonal Optimized Matching Pursuit (EOOMP)

Input: Signal vector y, dictionary matrix

D = (d_{1}, d_{2}, \dots {, d}_{m})

, maximum iterations per
pursuit: k, user-defined solution count:

K_{u s e r}

;

Output: Sparse approximations: (

s^{1}, s^{2}, \dots, s^{K_{u s e r}}

), residual vectors: (

ϵ^{1}, ϵ^{2}, \dots, ϵ^{K_{u s e r}}

);

1: Initialize

Q (l, n, j) = 0

,

l \in (1, M), n \in (0, k), j \in (1, K_{u s e r})

, completed pursuits
set P

= \emptyset

;

2: for

j = 1

to

K_{u s e r}

do

3: Initialize

j t h

pursuit,

U_{0}^{j} = \emptyset

,

ϵ_{0}^{j} = y

,

R_{0}^{j} = (r_{1}^{0, j}, \dots, r_{M}^{0, j}) = D

;

4: for

n = 0

to k − 1 do

5:

y_{n}^{j} = (\frac{{r_{1}^{n, j}}^{T} ϵ_{n}^{j}}{‖r_{1}^{n, j}‖}, \dots, \frac{{r_{l}^{n, j}}^{T} ϵ_{n}^{j}}{‖r_{l}^{n, j}‖}, \dots, \frac{{r_{M}^{n, j}}^{T} ϵ_{n}^{j}}{‖r_{M}^{n, j}‖})

;

6: Find

l_{w i n} (n, j) = \underset{l, l \notin U_{n}^{j}}{a r g m a x} {(y_{n}^{j})}_{l}^{2}

where

Q (l, n, j) = 0

;

7: Update orthogonal matrix

a s

(14);

8: Update residual as (15);

9: Update

U_{n + 1}^{j} = U_{n}^{j}

and set

Q (l_{w i n}, n, j) = 1

;

10: if

‖ϵ_{n + 1}^{j}‖ = 0

then

11: break inner loop;

12: end if

13: end for

14: Store

j t h

pursuit result

a^{j} = a r g \underset{a}{m i n} {‖y - D^{U} s‖}_{2}

,

ϵ_{n + 1}^{j} = ϵ_{n}^{j}

, P

= P \cup \{j\}

;

15: if

j < K_{u s e r}

then

16: Find

(j_{t a r g e t}, n_{t a r g e t}, l_{t a r g e t}) = \underset{p \in P, n \in \{0, b_{p} - 1\}, l : Q (l, n, p) = 0}{a r g m a x} {(y_{n}^{p})}_{l}^{2}

;

17: Prepare next pursuit starting at pivot

U_{0}^{j} = U_{n_{t a r g e t}}^{j_{t a r g e t}}

,

ϵ_{0}^{j} = ϵ_{n_{t a r g e t}}^{j_{t a r g e t}},

R_{0}^{j} = R_{n_{t a r g e t}}^{j_{t a r g e t}}

;

18: Set

Q (j_{t a r g e t}, n_{t a r g e t}, l_{t a r g e t}) = 1

;

19: end if

20: end for

21: Return

\{s^{1}, \dots, s^{K_{u s e r}}\}

,

\{ϵ^{1}, \dots, ϵ^{K_{u s e r}}\}

.

At the n-th iteration, the next residual

ϵ_{i + 1}^{U}

that minimizes the approximation error is obtained by searching for the best linear superposition within

R_{n}^{j}

, leading to

y_{n}^{j} = (\frac{{r_{1}^{n, j}}^{T} ϵ_{n}^{j}}{‖r_{1}^{n, j}‖}, \dots, \frac{{r_{l}^{n, j}}^{T} ϵ_{n}^{j}}{‖r_{l}^{n, j}‖}, \dots, \frac{{r_{M}^{n, j}}^{T} ϵ_{n}^{j}}{‖r_{M}^{n, j}‖}) .

(13)

The purpose of the above expression is to determine the index of the winning atom:

l_{w i n} (n, j) = \begin{matrix} a r g m a x \\ l, l \notin U_{n}^{j} \end{matrix} {(y_{n}^{j})}_{l}^{2}

. Subsequently, the orthogonal projection of

R_{n}^{j}

onto

r_{l_{w i n} (n, j)}^{n, j}

is subtracted from

R_{n}^{j}

, yielding

R_{n + 1}^{j} = R_{n}^{j} - \frac{r_{l_{w i n} (n, j)}^{n, j} {({R_{n}^{j}}^{T} r_{l_{w i n} (n, j)}^{n, j})}^{⊤}}{{r_{l_{w i n} (n, j)}^{n, j}}^{T} r_{l_{w i n} (n, j)}^{n, j}} .

(14)

The orthogonal projection of

R_{n}^{j}

onto

r_{l_{w i n} (n, j)}^{n, j}

is subtracted from

R_{n}^{j}

, yielding

ϵ_{n + 1}^{j} = ϵ_{n}^{j} - \frac{{ϵ_{n}^{j}}^{T} r_{l_{w i n} (n, j)}^{n, j}}{{r_{l_{w i n} (n, j)}^{n, j}}^{T} r_{l_{w i n} (n, j)}^{n, j}} r_{l_{w i n} (n, j)}^{n, j} .

(15)

The algorithm terminates when

‖ϵ_{n}^{j}‖ = 0

or

n = k

, yielding the coefficients for the j-th pursuit and the k-term optimal approximation. During the j-th greedy iteration, it cyclically evaluates the contribution of each column in dictionary D to obtain

s^{j}

. To approximate

s^{1}, \dots, s^{K}

, K is manually specified to align with the greedy objective. For obtaining greedy solutions at different K values, we implement the following function as

Q (l, n, j) = \{\begin{matrix} 0, there is no pursuit among all pursuits \\ 1, else . \end{matrix} .

(16)

When no suitable value (sampling point) is selected across all greedy processes—specifically, during the j-th greedy iteration within the n-th overall iteration, where the iteration column j is chosen—we then trace all overlaps from the first (recorded) greedy computation process. Concretely, if

a^{1}

is determined, we obtain

y_{0}^{1}, \dots, y_{n}^{1}, \dots, y_{b_{1} - 1}^{1}

, where

b_{1}

denotes the iteration count of the first greedy process. To identify

a^{2}

, we seek the maximum overlap among previously unused atoms from prior greedy processes

n_{t a r g e t} = a r g \max_{n = 0, \dots, b_{1} - 1} \max_{l, Q (l, n, j) = 0} {(y_{n}^{1})}_{l}^{2},

(17)

l_{t a r g e t} = a r g \max_{l} {(y_{n_{t a r g e t}}^{l})}_{l}^{2} .

(18)

Building upon the results and foundations of the first pursuit up to the

n_{t a r g e t}

iteration, we select column

l_{t a r g e t}

to substitute the previous optimal position, continuing the greedy process until the termination criterion is met. If m pursuits have been implemented across all prior processes, we then seek the maximum coverage among unused atoms

j_{t a r g e t} = a r g \max_{j = 1, \dots, m} \max_{n = 0, \dots, b_{j} - 1} \max_{l, Q (l, n, j) = 0} {(y_{n}^{j})}_{l}^{2},

(19)

n_{t a r g e t} = a r g \max_{n = 0, \dots, b_{j_{target}} - 1} \max_{l, Q (l, n, j_{target}) = 0} {(y_{n}^{j_{t a r g e t}})}_{l}^{2},

(20)

l_{t a r g e t} = a r g \underset{l, Q (l, n_{t a r g e t}, j_{t a r g e t}) = 0}{m a x} {(y_{n_{t a r g e t}}^{j_{t a r g e t}})}_{l}^{2} .

(21)

Re-execute the greedy process for

j_{t a r g e t}

over

n_{t a r g e t}

iterations, selecting column

l_{t a r g e t}

to replace the previously identified optimal column. Continue the iterative selection until the stopping criterion is met. Repeat this procedure until the user-specified K is achieved.

In Figure 2, using

K_{u s e r} = 3

as an example, three distinct solutions are identified. First, dictionary elements are sorted according to their projection magnitude onto the residual. Orthogonalization is then performed based on the element exhibiting maximum projection. For instance, when element n is selected, all other dictionary elements and the residual are orthogonalized relative to it—yielding the second solution. This sequential orthogonalization is repeated until a maximum of three dictionary elements are utilized, producing the final solutions: Solution 1: [4, 5, 3]. Solution 2: [1, 6, 3]. Solution 3: [3, 4, 2].

Since dictionary reconstruction must address the overcompleteness problem (over-recovery/over-coverage), the dictionary size inevitably increases. For large-scale dictionaries, previously selected atoms and elements in the temporary dictionary remain mutually orthogonal, as demonstrated in (14). Furthermore, if low-cost computation is required within permissible estimation error bounds, the dictionary orthogonalization step may be omitted. In such cases, the tree-search process computes the OMP solution rather than the OOMP solution.

3.2. Coefficient Determination via Gradient Descent

While subsection A employs Bag of Pursuit (BoP) to enhance learning efficiency, only the optimal pursuit from the tracking set

K_{u s e r}

is utilized in the gradient descent algorithm. The computational efficiency of dictionary learning remains suboptimal for wide-area spectrum estimation due to stringent real-time requirements for spectrum mapping. Furthermore, we observe that intermediate solutions are underutilized in our proposed sparse approximation framework. Leveraging these intermediate solutions could reduce redundant computational overhead.

To address this, we introduce a competitive learning strategy derived from vector quantization—interpreted as a specialized sparse coding scheme where dictionary coefficients satisfy

{‖s_{i}‖}_{0} = 1

and

{‖s_{i}‖}_{2} = 1

. This reformulates the optimization problem as:

{(s_{i})}_{m} = 1, {(s_{i})}_{l} = 0, \forall l \neq m

with

m = \begin{matrix} a r g m i n \\ l \end{matrix} {‖D_{l} - y_{i}‖}_{2}^{2}

. Conventional vector quantization algorithms predominantly update using only the best-matching atom, leading to sensitivity to initialization, slow convergence, and suboptimal quantization.

To address these issues, soft competitive vector quantization algorithms such as the Neural Gas algorithm [30] are employed. At each learning step, all possible encodings

s_{i}^{1}, s_{i}^{2}, \dots, s_{i}^{M}

where

{(s_{i}^{j})}_{j} = 1

are considered. The encodings are then sorted according to their reconstruction error as

‖y_{i} - D s_{i}^{j_{0}}‖ \leq \dots \leq ‖y_{i} - D s_{i}^{j_{p}}‖ \leq \dots \leq ‖y_{i} - D s_{i}^{j_{M}}‖ .

(22)

Compared to hard competitive methods, this approach updates the codebook vectors during each learning iteration, weighting them according to the rank of their encodings. This achieves a gradient descent effect equivalent to a well-defined cost function. The Neural Gas algorithm demonstrates robust convergence, particularly with large datasets.

We aim to apply this ranking approach to learning coefficient codes. For each given

y_{i}

, we consider all K possible coefficient vectors

s_{i}^{j}

, i.e., all non-zero entries. Each element in

s_{i}^{j}

ensures

‖y_{i} - D s_{i}^{j}‖

is minimized. The coefficients are then sorted according to the representation error obtained by approximating sample

y_{i}

, which can be expressed as

‖y_{i} - D s_{i}^{j_{0}}‖ < \dots < ‖y_{i} - D s_{i}^{j_{p}}‖ < \dots < ‖y_{i} - D s_{i}^{j_{M}}‖

(23)

With

r a n k (y_{i}, a_{i}^{j}, D) = p

denoting the ranking position of coefficient vectors, and the neighborhood function defined as

h_{λ_{t}} (v) = e^{\frac{- v}{λ_{t}}}

, the error function is given by

E_{s} = \sum_{i = 1}^{L} \sum_{j = 1}^{K} h_{λ_{t}} (r a n k (y_{i}, s_{i}^{j}, D)) {‖y_{i} - D s_{i}^{j}‖}_{2}^{2} .

(24)

Equivalent to (1), to minimize (24), we consider the gradient of

E_{s}

as

\frac{\partial E_{s}}{\partial D} = - 2 \sum_{i = 1}^{L} \sum_{j = 1}^{K} h_{λ_{t}} (r a n k (y_{i}, s_{i}^{j}, D)) (y_{i} - D s_{i}^{j}) {s_{i}^{j}}^{T} + R,

(25)

where

R = \sum_{i = 1}^{L} \sum_{j = 1}^{K} h_{λ_{t}}^{'} (r a n k (y_{i}, s_{i}^{j}, D)) \cdot \frac{\partial r a n k (y_{i}, s_{i}^{j}, D)}{\partial D} {‖y_{i} - D s_{i}^{j}‖}_{2}^{2},

(26)

In [31], it was proven that R = 0. We omit further proof here. With

e_{i}^{j} = y_{i} - D a_{i}^{j}

,

r a n k (y_{i}, s_{i}^{j}, D)

is rewritten as

r a n k (y_{i}, s_{i}^{j}, D) = \sum_{j = 1}^{K} H ({(e_{i}^{j})}^{2} - {(e_{i}^{m})}^{2}),

(27)

where H(x) is the Heaviside step function. This leads to

R = 2 \sum_{i = 1}^{L} \sum_{j = 1}^{K} h_{λ_{t}}^{'} (r a n k (y_{i}, s_{i}^{j}, D)) {(e_{i}^{j})}^{2} \sum_{m = 1}^{K} (e_{i}^{m} {(s_{i}^{m})}^{2} - e_{i}^{j} {(e_{i}^{j})}^{T}) H ({(e_{i}^{j})}^{2} - {(e_{i}^{m})}^{2}) .

(28)

In (28), each term is non-zero unless

{(e_{i}^{j})}^{2} = {(e_{i}^{m})}^{2}

holds. Therefore, we introduce the ranking neighborhood weighting concept of the Neural Gas algorithm into sparse coefficient optimization, replacing traditional single optimal solution updates. Under the premise of updating D, we apply gradient descent to (30) with

∆ D = α_{t} \sum_{j = 1}^{K} h_{λ_{t}} (r a n k (y_{i}, s_{i}^{j}, D)) (y_{i} - D s_{i}^{j}) s_{i}^{j^{⊤}} .

(29)

For stochastically chosen

y_{i} \in Y

,

λ_{t} = λ_{0} {(\frac{λ_{f i n a l}}{λ_{0}})}^{\frac{t}{t_{m a x}}}

(30)

is an exponentially decaying variable neighborhood size and an exponentially decreasing learning rate. After each update, the column vectors of D are renormalized to 1,

a_{i}^{j}

is recalculated, and the next update is performed.

At this point, for each training sample

y_{i}

, all possible coefficient vectors

a_{i}^{j}

(

j = 1, . . ., K

with

{‖a_{i}^{j}‖}_{0} \leq k

) are considered. K grows exponentially with M and k. In the current case (18), we do not require all possible coefficient vectors.

Components where the rank in (18) exceeds the neighborhood

λ_{t}

are ignored—we focus solely on the solution with the best reconstruction error obtained through the BoP method. The detailed algorithm pseudocode is presented in Algorithm 2.

Algorithm 2 Rank-Weighted Dictionary Learning with Bag of Pursuits

Input: Data samples Y

= {\{y_{i}\}}_{i = 1}^{L}

, dictionary

D_{0} \in R^{n}

, sparsity level

k

, number of
candidates

T

, initial neighborhood size

λ_{0}

, final neihborhood size

λ_{f i n a l}

, initial
leaning rate

α_{0}

, final learning rate

α_{f i n a l}

, maximum iterations

t_{m a x};

Output: Learning dictionary D;

Initialization: set

D = D_{0}

;

1: while

t < T

do

2: Compute annealing parameters:

λ_{t} = λ_{0} {(\frac{λ_{f i n a l}}{λ_{0}})}^{\frac{t}{t_{m a x}}}, α_{t} = \frac{α_{f i n a l}}{α_{0}}

;

3: Randomly pick an index i from

\{1, \dots, L\}

, and set y

= y_{i}

;

4: Use Algorithm 1 to obtain K approximations:

\{s^{1}, \dots, s^{K}\} = B o P (y, D, k, K)

;

5: for

j = 1

to K do

6:

e^{j} = y - D s^{j}

;

7:

{(ϵ^{j})}^{2} = {‖e^{j}‖}_{2}^{2}

;

8: end for

9: Sort

\{(ϵ_{1}, s^{1}), \dots, (ϵ_{K}, s^{K})\}

by

ϵ_{1} \leq ϵ_{2} \leq \dots {\leq ϵ}_{K}

;

10: for

j = 1

to K do

11:

w_{j} = e x p (- \frac{{r a n k}_{j}}{λ_{t}})

;

12:

Δ D \leftarrow Δ D + w_{j} \cdot e^{j} {(s^{j})}^{⊤}

;

13: end for

14:

Δ D \leftarrow α_{t} Δ D

;

15: Updata dictionary:

D \leftarrow D + Δ D

;

16: Renormalize dictionary columns:

d_{l} \leftarrow d_{l} / {‖d_{l}‖}_{2}

;

17: Increment iteration counter:

t \leftarrow t + 1

;

18: end while

The core idea of the proposed SBO framework can be summarized as a sparsity-driven, dictionary-learning-enabled tensor completion process. It first learns an adaptive dictionary (SCNG) that sparsely represents the 3D spectrum environment. Then, it employs a multi-path greedy pursuit (BoP-OOMP) to intelligently select the most informative sampling points and reconstruct the missing data. Finally, a competitive learning strategy refines the dictionary by leveraging intermediate solutions, enhancing both accuracy and efficiency.

4. Experimental Evaluation

4.1. Experimental Scenario Setup

This study utilizes a geometrically precise 3D urban environment (1000 m × 1000 m × 100 m) extracted from real-world geographic data, as illustrated in Figure 3. The spatial domain is discretized into a 100 × 100 × 10 voxel grid with 10 m³ resolution. Key simulation parameters are configured per Table 1. Five isotropic transmitters (30 mW EIRP each) are randomly distributed within the scene. The RF subsystem operates at 2000 MHz carrier frequency with 200 kHz bandwidth, while receiver noise follows a spectrum density of −174 dBm/Hz. The hybrid channel model employs parameters α = 11.95, β = 0.136, τ = 0.1, with path loss deviations

ξ_{L O S} = 20 d B

= 1 dB and

ξ_{N L O S} = 20 d B

= 20 dB. As shown in Figure 3, assuming there are five inaccessible areas in space, centered at (200 m, 50 m, 10 m), (100 m, 750 m, 35 m), (650 m, 150 m, 25 m), (550 m, 600 m, 20 m), and (900 m, 950 m, 10 m), with sizes of 200 m × 100 m × 10 m, 200 m × 300 m × 70 m, 100 m × 200 m × 40 m, 100 m × 100 m × 50 m, and 200 m × 100 m × 20 m, respectively. We set up five omnidirectional antennas as signal sources at locations (100,824,100), (150,500,15), (260,160,35), (650,755,65), (750,260,15).

To validate the algorithm’s accuracy, simulations were conducted using WinProp [29], with the results serving as ground-truth radio maps. It should be noted that these simulations deliberately exclude the effects of environmental material properties on radio wave propagation to isolate the core algorithmic performance.

Given that our proposed radio map construction method relies on sparse-coding-based spectrum intensity estimation, we employ cross-validation to evaluate extrapolation accuracy using synthetic data. For comparative analysis, four benchmark algorithms are introduced: K-SVD+OMP [30], SAFARI [32], FISTA [33], and GCN-LSTM [34]. Detailed simulation results will be discussed in subsequent sections.

4.2. Spectrum Sampling Position Effectiveness Analysis

We construct a spectrum dataset

y = (y_{1}, \dots, y_{10,000})

where each sample is a linear combination of dictionary columns. Three distinct scenarios are designed: random sampling (spatially uncorrelated regions), independent subspaces (building-isolated zones), and dependent subspaces (urban canyons), with 50 Monte Carlo repetitions for SBO algorithm validation. Adopting the sparse representation

y_{i} = D^{t r u e} b_{i}

, coefficient vectors contain strategically positioned non-zero entries. As referenced in (29), sampling position selection governs non-zero entry placement while accounting for dictionary column coherence. Our soft-competitive gradient descent method initializes with

λ_{0} = 50

and converges to

λ_{f i n a l} = 0.1

, applying stochastic gradient descent per pursuit coefficient

K_{u s e r}

during (30) updates. Non-zero entries are uniformly sampled from

[- 0.5,0.5]

and scaled to ensure

E [y_{i}^{T} y_{i}] = 1

. Validation against MOD+OMP, K-SVD+OMP, HC-SGD+BoP, and SCNG+BOP-OOMP executes 100 learning iterations. Dictionary reconstruction fidelity is quantified via the Mean Maximum Overlap (MMO), and it yields

M M O = \frac{1}{50} \sum_{l = 1}^{50} \begin{matrix} m a x \\ k = 1, \dots, 50 \end{matrix} |{(D_{l}^{t r u e})}^{⊤} D_{l}^{l e a e n e d}|,

(31)

where k denotes the number of non-zero elements. The subspace decoupling gain is defined as

Δ M M O = |{M M O}_{I n d e p e n d e n t} - {M M O}_{D e p e n d e n t}|

.

The three scenarios are categorized as follows: Random Sampling. All combinations of sampling positions are possible. The positions of non-zero entries in each coefficient vector

b_{i}

are randomly selected. Independent Subspaces. Sampling positions reside within a small number of independent 3D spatial clusters. This is achieved by defining dictionary elements such that each group contains randomly selected atoms, ensuring spatial independence between sampling locations. Dependent Subspaces. Similar to the preceding scenario, training samples lie within a limited number of multidimensional subspaces. Contrary to independent subspaces, these subspaces exhibit high mutual intersection. Measurements prioritize high gradient variation regions of spectrum power intensity, while low-intensity regions undergo further sparsification to reduce sampling.

In the dependent subspace scenario (Figure 4c), the proposed SBO framework achieves the highest Mean Maximum Overlap (MMO) value, leveraging strong inter-sample correlations to enhance reconstruction fidelity. As the sparsity level k increases, the subspace decoupling gain ΔMMO gradually decreases, indicating convergent behavior between independent and dependent configurations and underscoring the method’s robustness. The peak ΔMMO observed at k = 20 confirms optimal parameterization under the current settings.

The superior performance of SBO is attributed to two key mechanisms: the multi-path search strategy in SCNG-BoP that mitigates atom confusion, and the stochastic gradient method that accelerates convergence. In low-sparsity scenarios—characterized by concentrated sampling distributions—the soft-competitive approach further improves performance, particularly in dependent subspaces where non-sparse conditions significantly enhance dictionary recovery. In contrast, hard-competitive methods excel in independent subspaces (Figure 4b) due to their efficiency in cross-subspace dictionary learning for objective minimization. Importantly, SBO demonstrates statistically significant robustness across all scenarios (Figure 4a–c), exhibiting 23.8% lower MMO variance than benchmark methods under heterogeneous sampling conditions.

Furthermore, while increasing training samples generally improves spectrum situation estimation, the soft-competitive gradient descent method exhibits no further performance gains beyond k > 20 (Figure 4a–c). This plateau occurs because the algorithm has acquired a complete set of discriminative cross-subspace features, beyond which additional samples yield diminishing returns. In independent subspaces, dictionary reconstruction improves with more samples only until k = 20, after which the benefits saturate. Although independent subspace sampling improves efficiency, it may incur a trade-off in reconstruction quality, necessitating careful quantization design. Notably, under random sampling (Figure 4a), SBO maintains competitive performance (ΔMMO ≤ 0.12) even in suboptimal conditions, achieving 89.3% of the theoretical maximum accuracy.

4.3. Comparison of Numerical Performance

As evidenced in Figure 5b, SAFARI achieves superior spatial power estimation at low sampling rates (r < 0.2), owing to its effectiveness in low-sparsity urban environments where compressive sensing requires minimal sampling density. This establishes SAFARI as the preferred method for low-sampling scenarios, although its performance exhibits diminishing returns as r increases [35]. In contrast, GCN-LSTM demonstrates the most significant estimation improvement with additional samples, leveraging dynamic relational modeling to capture complex node associations—making it particularly advantageous in non-uniform urban spectrum distributions. Notably, our proposed SBO framework maintains robust reconstruction of missing spectral components even under challenging conditions where traditional K-SVD+OMP fails to recover accurate intensities at r < 0.3. For higher sampling rates (r > 0.25), SBO_DS delivers optimal spectrum situation estimation by selectively employing the highest-quality pursuits from the

K_{u s e r}

set during stochastic gradient descent. A key innovation lies in the sparse approximation strategy, which incorporates intermediate solution sets into dictionary learning rather than relying exclusively on final outcomes.

Although SBO_DS consumes 23–38% more computational resources than SBO_IS with increasing r (Figure 5a), it achieves significantly better performance in non-independent sampling scenarios (Figure 5b). By effectively exploiting inter-sample correlations under nearly identical time and sampling constraints (t ± 4.2%, r ± 0.05), SBO_DS attains more efficient spectrum situation estimation. This validates the efficacy of our ranking-weighted mechanism, which enables intermediate solution reuse through online per-sample updates. It is worth noting that the faster execution of some benchmark algorithms stems primarily from their use of non-optimized random sampling positions, rather than inherent algorithmic superiority.

Numerical results demonstrate that at a global spatial sampling rate r ≤ 0.2, our method reduces reconstruction error by 43%, enhances decoupling efficacy among highly correlated and overlapping subspaces, and exhibits clear advantages in both estimation accuracy and computational efficiency. In summary, the proposed SBO algorithm consistently outperforms traditional methods in remote spectrum mapping of inaccessible areas. Regarding resource and time consumption, performance is sampling-dependent: SBO_IS is preferable when r ≤ 0.22, while SBO_DS proves more effective for r > 0.22.

The results in Figure 5 reveal a critical trade-off between estimation accuracy and computational cost. While our proposed SBO_DS method achieves the lowest RMSE, it incurs a higher computational time compared to faster benchmarks like FISTA and K-SVD+OMP. This is the direct cost of performing multi-path tree search in BoP and the iterative dictionary learning in SCNG.

4.4. Comparison Performance of Spectrum Situation Estimation

In this subsection, SAFARI, FISTA, GCN-LSTM, and K-SVD+OMP are employed as benchmark algorithms for performance comparison. Figure 6 visually compares the spectrum situation estimation results obtained by each method, providing an intuitive assessment of their reconstruction capabilities under constrained sampling conditions. Due to incomplete spatial coverage—especially in architecturally blocked or physically inaccessible regions—subfigure Figure 6b exhibits significant unobserved areas that correspond to the occluded zones defined earlier in Figure 1 and Figure 3. Among all the methods, the proposed SBO framework achieves the highest structural similarity to the WinProp-generated ground truth. This advantage stems from its ability to effectively exploit inter-sample correlations, transforming unstructured and noisy measurements into coherent and continuous spectrum reconstructions.

A detailed analysis reveals that SBO_DS consistently outperforms both SBO_IS and SBO_RDE. This is primarily attributed to its enhanced sensitivity to high-frequency components in the sampled data, which are characterized by larger non-zero entries in the sparse representation. In contrast, FISTA yields suboptimal results, mainly because its convergence guarantees are restricted to convex problems, while real-world spectrum features often exhibit strong non-convexity—frequently trapping the solution in poor local minima. Similarly, SAFARI underperforms in this setting, as its sparse sampling strategy proves inadequate in handling high-dimensional parameter spaces with non-uniform sample distributions, often resulting in irreversible information loss.

Although GCN-LSTM attains high estimation accuracy by leveraging spatiotemporal correlations through deep learning, noticeable deviations from the WinProp reference highlight a key limitation of data-driven approaches: they typically require large volumes of high-quality training data to generalize well and avoid overfitting. Finally, K-SVD+OMP shows limited robustness under high coherence among sampled data points, confining its utility to low-cost applications with very small sample sizes.

Overall, these results underscore the applicability and limitations of each method under different sampling regimes and environmental constraints, providing practical insights for algorithm selection in real-world spectrum mapping scenarios.

4.5. Computational Complexity Analysis

The complexity of the proposed algorithm is dominated by three parts:

SCNG Dictionary Learning. The cost per iteration is $O (L M n)$ , where L is the number of training samples, M is the dictionary size, and n is the signal dimension. The neural gas ranking introduces an additional $O (M \log M)$ sorting cost but avoids expensive SVD operations.
BoP-OOMP Reconstruction. The standard OOMP has a complexity of $O (k * m * n)$ for recovering a single signal with sparsity k, using a dictionary of size $m \times n$ . Our BoP enhancement, which performs K_user-independent pursuits, increases the complexity by a factor of K_user, i.e., $O (K_{u s e r} * k * m * n)$ . This is the trade-off for achieving higher accuracy and robustness in correlated subspaces.
Gradient Update. The dictionary update via (35) has a complexity of $O (M * n)$ per sample. While the BoP step increases the computational burden compared to single-path OMP, the significant reduction in the required sampling ratio r (as shown in Figure 5) leads to a much smaller set of measurements y that needs to be processed. This offsets the per-sample complexity and results in a net gain in overall system efficiency for achieving a target reconstruction accuracy.

5. Conclusions

This paper investigates 3D remote spectrum mapping via UAV platforms, demonstrating significant application value for spectrum sensing and access in urban IoT deployments. We propose a compressive sensing-based spectrum mapping model that tensorizes spatial-spectrum characteristics of target areas, coupled with a Bag of Pursuits (BoP)-enhanced Orthogonal Matching Pursuit framework to optimize sampling matrices. This approach overcomes correlation-induced reconstruction errors in traditional OMP and resolves sparse representation challenges in 3D subspaces. Results confirm our solution’s superior efficiency, robustness, and adaptability under constrained conditions, enabling extrapolation-enhanced spectrum mapping. Comparative studies reveal estimation accuracy depends critically on signal completeness and algorithmic effectiveness, making selective sampling optimization essential given impractical exhaustive spatial sampling.

Frankly speaking, although the algorithm we proposed has certain advantages, to a certain extent, it still has the following issues:

Computational Complexity: The BoP-OOMP step, involving multiple pursuits, incurs higher computational overhead compared to single-path algorithms like OMP. This may limit its application in strict real-time scenarios without further optimization or hardware acceleration.
Practical Deployment Issues: The current framework assumes ideal UAV operation. Practical challenges such as UAV flight time, positioning errors, and the impact of UAV itself on the radio environment are not considered in this study and warrant future investigation.
Model Generalizability: The performance of the dictionary learning is tied to the training data. Its generalization to entirely unseen urban geometries or rapidly time-varying channels requires further validation.

Although promising, several directions for future work remain:

Global Optimality via Maximum Block Improvement. Joint sampling matrix and sparse vector optimization is investigated using Maximum Block Improvement (MBI) to guarantee global optimality. This coordinate descent approach iteratively updates blocks of variables to escape local optima, particularly effective for non-convex spectrum mapping problems.
Integrated Radio Map Construction. A complete radio map construction methodology is formed by integrating spectral tensor completion with spatial propagation modeling. This fusion enables simultaneous handling of missing data and physical constraints (e.g., shadowing, multipath) in 3D environments.
Online and Real-time Algorithm Implementation. Lightweight versions of the SBO algorithm that can run in real time on the limited computational hardware of a UAV, enabling immediate in situ mapping and decision-making, should be investigated. The framework should be extended to handle mobile transmitters and time-varying channel conditions, which requires the dictionary and sampling strategy to adapt continuously during flight.

Author Contributions

K.Y. wrote the original draft and proposed the idea for this study; F.C. was responsible for data curation; S.F. supervised this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Basic Research Projects of the Basic Strengthening Plan under grant number 2020-JCJQ-ZD-071 and in part by the Talent Introduction Project of Space engineering University under Grant 2021-RCYJ-03.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhu, W.; Deng, X.; Gui, J.; Zhang, H.; Min, G. Cost-Effective Task Offloading and Resource Scheduling for Mobile Edge Computing in 6G Space-Air–Ground Integrated Network. IEEE Internet Things J. 2025, 12, 19428–19442. [Google Scholar] [CrossRef]
Chen, X.; Leng, S.; He, J.; Zhou, L. Deep-Learning-Based Intelligent Intervehicle Distance Control for 6G-Enabled Cooperative Autonomous Driving. IEEE Internet Things J. 2021, 8, 15180–15190. [Google Scholar] [CrossRef]
Wang, C.-X.; You, X.; Gao, X.; Zhu, X.; Li, Z.; Zhang, C.; Wang, H.; Huang, Y.; Chen, Y.; Haas, H.; et al. On the Road to 6G: Visions, Requirements, Key Technologies, and Testbeds. IEEE Commun. Surv. Tutor. 2023, 25, 905–974. [Google Scholar] [CrossRef]
Gabriela, W.; Ożadowicz, A. Building information modeling and digital twins for functional and technical design of smart buildings with distributed iot networks—Review and new challenges discussion. Future Internet 2024, 16, 225. [Google Scholar]
Mobaraki, B.; Lozano-Galant, F.; Soriano, R.P.; Castilla Pascual, F.J. Application of Low-Cost Sensors for Building Monitoring: A Systematic Literature Review. Buildings 2021, 11, 336. [Google Scholar] [CrossRef]
Yin, K.; Fang, S.; Chu, F.; Fan, Y. Compressed Tensor Completion: Approach for UAV-Aided 3-D Radio Map Construction. IEEE Internet Things J. 2024, 11, 40516–40531. [Google Scholar] [CrossRef]
Kaniewski, P.; Romanik, J.; Golan, E.; Zubel, K. Spectrum awareness for cognitive radios supported by radio environment maps: Zonal approach. Appl. Sci. 2021, 11, 2910. [Google Scholar] [CrossRef]
Romero, D.; Ha, T.N.; Shrestha, R.; Franceschetti, M. Theoretical Analysis of the Radio Map Estimation Problem. IEEE Trans. Wirel. Commun. 2024, 23, 13722–13737. [Google Scholar] [CrossRef]
Zhao, Z.; Peng, Z.; Zheng, S.; Shang, J. Cognitive radio spectrum allocation using evolutionary algorithms. IEEE Trans. Wirel. Commun. 2009, 8, 4421–4425. [Google Scholar] [CrossRef]
Liu, Z.; Zhao, P.; Guo, L.; Nan, Z.; Zhong, Z.; Li, J. Three-dimensional ray-tracing-based propagation prediction model for macrocellular environment at sub-6 ghz frequencies. Electronics 2024, 13, 1451. [Google Scholar] [CrossRef]
Zhang, Y.; Yuan, Z.; Tian, L.; Zhang, J. A Novel Random Angular Sampling Method for Spatial and Temporal Channel Emulation. IEEE Wirel. Commun. Lett. 2019, 8, 1381–1385. [Google Scholar] [CrossRef]
Phillips, C.; Sicker, D.; Grunwald, D. Bounding the error of path loss models. In Proceedings of the IEEE International Symposium on Dynamic Spectrum Access Networks, Aachen, Germany, 3–6 May 2011; pp. 71–82. [Google Scholar]
Stine, J.A.; Caicedo Bastidas, C.E. Enabling Spectrum Sharing via Spectrum Consumption Models. IEEE J. Sel. Areas Commun. 2015, 8, 725–735. [Google Scholar] [CrossRef]
Kozakiewicz, K.; Lazarowska, A.; Lisowski, J.; Rybczak, M. A Survey of Machine Learning Methods Applied for Enhancing the Autonomy of Unmanned Underwater Vehicles. In Proceedings of the IEEE EUROCON 2025—21st International Conference on Smart Technologies, Gdynia, Poland, 4–6 June 2025; pp. 1–6. [Google Scholar]
Liang, H.; Wu, J.; Liu, T.; Wang, H.; Cao, W. Efficient Cooperative Spectrum Sensing in UAV-Assisted Cognitive Wireless Sensor Networks. IEEE Sens. Lett. 2024, 8, 7500904. [Google Scholar] [CrossRef]
Sarikhani, R.; Keynia, F. Cooperative Spectrum Sensing Meets Machine Learning: Deep Reinforcement Learning Approach. IEEE Commun. Lett. 2020, 24, 1459–1462. [Google Scholar] [CrossRef]
Zhang, H.; Yang, J.; Gao, Y. Machine Learning Empowered Spectrum Sensing Under a Sub-Sampling Framework. IEEE Trans. Wirel. Commun. 2022, 21, 8205–8215. [Google Scholar] [CrossRef]
Ivanov, A.; Tonchev, K.; Poulkov, V.; Manolova, A.; Vlahov, A. Interpolation Accuracy Evaluation for 3D Radio Environment Maps Construction. In Proceedings of the 2023 26th International Symposium on Wireless Personal Multimedia Communications (WPMC), Tampa, FL, USA, 19–22 November 2023; pp. 1–7. [Google Scholar]
Zhang, S.; Li, Z.; Li, H.; Zha, Y.; Wang, H.; Shen, Z.; Jiang, H.; Wang, J. Novel Radio Environment Map Construction Scheme for 3-D and Full Band for Modern Internet of Things Applications. IEEE Internet Things J. 2025, 12, 12419–12432. [Google Scholar] [CrossRef]
Aubry, A.; De Maio, A.; Zappone, A.; Razaviyayn, M.; Luo, Z.-Q. A new sequential optimization procedure and its applications to resource allocation for wireless systems. IEEE Trans. Signal Process 2018, 66, 6518–6533. [Google Scholar] [CrossRef]
Abdelaziz, D.E.; Kotb, H.; Abbasy, N.H. Improving Power System State Estimation through Physics Informed Deep Learning using Gated Recurrent Units. In Proceedings of the 2024 25th International Middle East Power System Conference (MEPCON), Cairo, Egypt, 17–19 December 2024; pp. 1–8. [Google Scholar]
Sato, K.; Suto, K.; Inage, K.; Adachi, K.; Fujii, T. Space-Frequency-Interpolated Radio Map. IEEE Trans. Veh. Technol. 2021, 70, 714–725. [Google Scholar] [CrossRef]
Schütze, H.; Barth, E.; Martinetz, T. Learning Efficient Data Representations with Orthogonal Sparse Coding. IEEE Trans. Comput. Imaging 2016, 2, 177–189. [Google Scholar] [CrossRef]
Liu, X.; Zheng, K.; Chi, K.; Zhu, Y.H. Cooperative Spectrum Sensing Optimization in Energy-Harvesting Cognitive Radio Networks. IEEE Trans. Wirel. Commun. 2020, 19, 7663–7676. [Google Scholar] [CrossRef]
Shen, F.; Wang, Z.; Ding, G.; Li, K.; Wu, Q. 3D Compressed Spectrum Mapping with Sampling Locations Optimization in Spectrum-Heterogeneous Environment. IEEE Trans. Wirel. Commun. 2022, 21, 326–338. [Google Scholar] [CrossRef]
Li, L.; Xie, W.; Zhou, X. Cooperative Spectrum Sensing Based on LSTM-CNN Combination Network in Cognitive Radio System. IEEE Access 2023, 11, 87615–87625. [Google Scholar] [CrossRef]
Pan, Y.; Da, X.; Hu, H.; Huang, Y.; Cumanan, K. Joint Optimization of Trajectory and Resource Allocation for Time-Constrained UAV-Enabled Cognitive Radio Networks. IEEE Trans. Veh. Technol. 2022, 71, 5576–5580. [Google Scholar] [CrossRef]
Liu, X.; Jia, M.; Tan, X. Threshold optimization of cooperative spectrum sensing in cognitive radio networks. Radio Sci. 2013, 48, 23–32. [Google Scholar] [CrossRef]
Hoppe, R.; Wölfle, G.; Jakobus, U. Wave propagation and radio network planning software WinProp added to the electromagnetic solver package FEKO. In Proceedings of the 2017 International Applied Computational Electromagnetics Society Symposium—Italy (ACES), Firenze, Italy, 26–30 March 2017; pp. 1–2. [Google Scholar]
De, P.; Rai, A.; Chatterjee, A. A Projected OMP-Hybridized Discriminative K-SVD-Based Dictionary Learning Algorithm for Human Activity Recognition from Accelerometer Signals. IEEE Sens. J. 2024, 24, 38222–38231. [Google Scholar] [CrossRef]
Labusch, K.; Barth, E.; Martinetz, T. Sparse coding neural gas: Learning of overcomplete data representations. Neurocomputing 2009, 72, 1547–1555. [Google Scholar] [CrossRef]
Mao, Y.; Zhao, Z.; Yang, M.; Liang, L.; Liu, Y.; Ding, W.; Lan, T.; Zhang, X.-P. SAFARI: Sparsity-Enabled Federated Learning with Limited and Unreliable Communications. IEEE Trans. Mob. Comput. 2024, 23, 4819–4831. [Google Scholar] [CrossRef]
Zhou, R.; Han, J.; Li, T.; Guo, Z. Fast Independent Component Analysis Denoising for Magnetotelluric Data Based on a Correlation Coefficient and Fast Iterative Shrinkage Threshold Algorithm. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5916715. [Google Scholar] [CrossRef]
Gao, X.; Wang, J.; Zhou, M. The Research of Resource Allocation Method Based on GCN-LSTM in 5G Network. IEEE Commun. Lett. 2023, 27, 926–930. [Google Scholar] [CrossRef]
Shen, F.; Ding, G.; Wu, Q. Efficient Remote Compressed Spectrum Mapping in 3-D Spectrum-Heterogeneous Environment with Inaccessible Areas. IEEE Wirel. Commun. Lett. 2022, 11, 1488–1492. [Google Scholar] [CrossRef]

Figure 1. Schematic of remote mapping. (a) illustrates accessible and inaccessible regions within the 3D space, along with the methodology for recovering inaccessible regions through sparse location sampling. (b) displays the vertical cross-section of (a) at z = 0.

Figure 2. Schematic of the tree search process in the Bag of Pursuits (BOP) method for

{‖s‖}_{0} \leq 3

.

Figure 2. Schematic of the tree search process in the Bag of Pursuits (BOP) method for

{‖s‖}_{0} \leq 3

.

Figure 3. 3D and 2D views of the WinProp simulation scenario. (a) Red dots indicate transmitter locations in the 3D view. (b) The overhead 2D view shows architecturally constrained inaccessible regions and antenna positions.

Figure 4. Experimental results of different sampling methods. All experiments were repeated 50 times. (a) Random dictionary element sampling; (b) Non-independent sampling space; (c) Independent sampling space.

Figure 5. Comparative Algorithm Performance. (a) Time consuption vs. Sampling ratio; (b) RSME vs. Sampling ratio.

Figure 6. Visualized 3D Spectrum Map Estimates Using Different Algorithms. (a) Ground truth (WinProp); (b) occluded spectrum map; (c–e) SBO_DS, SBO_IS, SBO_RDE reconstructions; (f–i) SAFARI, FISTA, GCN-LSTM, K-SVD+OMP reconstructions.

Table 1. Explanation of Symbols.

Symbol	Description of Meaning
$D$	Overcomplete Dictionary/Measurement Matrix
$y$	Observed Spectrum Signal (Sampled Data)
$s$	Sparse Source Signal Coefficient Vector to Be Solved
$k$	Sparsity of the Signal (Number of Non-Zero Elements)
$U$	Set of Selected Atom Indices in the OOMP Algorithm
$R_{n}^{j}$	Orthogonalized Temporary Dictionary at the n-th Iteration in the j-th Pursuit
$ϵ_{n}^{j}$	Residual After the n-th Iteration in the j-th Pursuit
$l_{w i n}$	Index of the Optimal Atom Selected in the Current Iteration
$K_{u s e r}$	Number of Parallel Pursuit Paths Set in the Bag of Pursuits (BoP) Algorithm

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yin, K.; Fang, S.; Chu, F. Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas. Electronics 2025, 14, 4177. https://doi.org/10.3390/electronics14214177

AMA Style

Yin K, Fang S, Chu F. Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas. Electronics. 2025; 14(21):4177. https://doi.org/10.3390/electronics14214177

Chicago/Turabian Style

Yin, Kun, Shengliang Fang, and Feihuang Chu. 2025. "Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas" Electronics 14, no. 21: 4177. https://doi.org/10.3390/electronics14214177

APA Style

Yin, K., Fang, S., & Chu, F. (2025). Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas. Electronics, 14(21), 4177. https://doi.org/10.3390/electronics14214177

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Compressive Sensing-Based 3D Spectrum Extrapolation for IoT Coverage in Obstructed Urban Areas

Abstract

1. Introduction

2. System Model

3. Remote Compressed Spectrum Mapping Algorithm

3.1. Optimization of Sampling Matrix Based on Improved Orthogonal Matching Pursuit Algorithm

3.2. Coefficient Determination via Gradient Descent

4. Experimental Evaluation

4.1. Experimental Scenario Setup

4.2. Spectrum Sampling Position Effectiveness Analysis

4.3. Comparison of Numerical Performance

4.4. Comparison Performance of Spectrum Situation Estimation

4.5. Computational Complexity Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI