1. Introduction
Through-the-wall radar imaging (TWRI) as an emerging technology has attracted increasing interest owing to its capability to reveal stationary or moving targets of interest behind an opaque obstacle [
1,
2,
3,
4], and it has been widely used in numerous civil and military applications, such as anti-terrorist reconnaissance, mineral resource exploration, life rescue missions, and security monitoring. TWRI aims at acquiring high-resolution images free from wall clutter and multipath.
A huge number of efforts have been devoted to addressing the high-resolution TWRI issue. The time-reversal MUSIC (TR-MUSIC) was studied for through wall target detection and localization, and the simultaneous localization of multiple targets behind the wall has been achieved by employing the null space of the multistatic response matrix [
5], and the performance analysis of the TR-MUSIC method was also investigated and provided in [
6]. On the other hand, the multipath effect in the propagation of electromagnetic waves severely reduces the accuracy of TWRI. In addition to the directly received target echoes, multiple reflections at the walls and the targets will generate interference echoes, which lead to ghosts in the imaging and make the through-wall radar unable to accurately detect the targets. Previous studies proposed various approaches to suppress the multipath effect. By analyzing the properties of ghosts, Tan et al. [
7] discovered the aspect dependence (AD) of multipath scattering and suppressed the multipath ghosts based on the Hidden Markov model. Inspired by the characteristics of AD, Liu et al. [
8] studied the distribution characteristics of the first-order ghost of MIMO radar and applied phase coherence factor to suppress the ghosts. In contrast, some recent algorithms focused on exploiting multipath to enhance the target signal. Exploiting the information from multipaths can be successfully achieved using compressive sensing (CS) which makes use of the group sparsity of the targets caused by different propagation paths [
9,
10]. The group sparsity represents that members of a certain subset of entries have the same support. This means the entries in each group appear as nonzero at the same time [
10].
TWRI is commonly used to image indoor humans. In indoor scenes, multipath propagations are mainly caused by the reflections on the related walls except the echo directly approaching the receiver. Group sparsity is suitable for describing the scene containing the different propagation paths, which differ in scattering coefficients but not in the nonzero spatial support. In addition to the group sparsity due to multipath, another property is the extended spatial occupancy involving the spatial region where their nonzero entries are clustered. Unfortunately, it is hard to determine the exact size and shape of the target in advance [
10]. A series of algorithms have been applied to address group sparsity caused by multipath propagations [
11,
12,
13,
14]. However, all of the above ignored the benefits of the target structure. Both overlapping group sparse (GS) algorithm [
15] and multi-task clustered Bayesian compressive sensing (BCS) algorithms [
16,
17] are developed to take clustered structure into account. However, the overlapping GS algorithm is prone to produce false targets owing to inappropriate overlapping partition. Furthermore, the multi-task BCS algorithm cannot handle the unresolvable observations caused by the multipath propagation in the TWRI.
Within the hierarchical framework, this paper explores a clustered BCS algorithm in the presence of multipath propagation. High-resolution imaging is achieved by leveraging the aforementioned two types of group sparsity. We employ the group sparsity where multipath scenarios have the same nonzero support but different scattering coefficients to represent the former type. Furthermore, the latter originates from the nonzero area or volume of the target. A hierarchical spike-and-slab prior is explored to model the pattern of the group sparsity of multipath propagations. We develop a spike-and-slab clustered prior to model the relationship of spatial pixels. The proposed algorithm can estimate the prior parameters without supervision and successfully achieve imaging without any information on targets, such as sizes, shapes, and numbers.
Notations: Vectors and matrixes are represented by boldface lower-case and upper-case letters, respectively. The probability density function (pdf) is denoted by 
. Given the random variable 
x, the Gaussian distribution is denoted by 
 with mean 
a and variance 
b, and the Bernoulli distribution is denoted by 
 with the weight of 
. In addition, symbols 
 and 
 denote a transpose and Hermitian transpose, respectively, 
 denotes the N-dimensional identity matrix, and ∘ denotes the Hadamard product of the vectors or matrixes. For clarity, the major acronyms are collected and provided in 
Table 1.
The remainder of the paper is organized as follows. The signal model in the multipath propagation is presented in 
Section 2, and the high-resolution TWRI problem is converted into the sparse reconstruction problem, and then the clustered BCS algorithm is developed in 
Section 3; subsequently, simulation and experimental results are provided in 
Section 4; finally, some conclusions are made in 
Section 5.
  2. Signal Model
The monostatic stepped-frequency radar [
18] is employed to image indoor stationary scenes. A uniform linear array (ULA) is deployed using 
N wideband transceivers with known positions 
, 
, and an inter-element interval of 
d. The stepped-frequency signal, 
, 
, with a uniform distribution over the signal bandwidth 
B, is transmitted by each transceiver. Since most of the front wall returns are assumed to be eliminated, only the target echoes are analyzed here [
19]. Considering the monostatic radar scenario, the target echoes 
 can be formulated as [
20]
      
      where 
 denotes the total number of pixels in the region of interest, 
 denotes the time delay from the 
lth scatterer to the 
nth transceiver, and 
 denotes the complex reflectivity of the 
lth pixel. Assume the thickness and permittivity of the wall are already known or can be accurately estimated, thus the effects of propagation through the wall can be well-compensated, and 
 can be calculated from geometric considerations [
21].
  2.1. Vectorized Measurements
For brief notation, the measurements in (
1) can be transformed into a single column vector 
 by stacking all 
 received echoes, i.e., 
. Therefore, (
1) can be rewritten as
        
        where 
 denotes the target reflectivity vector and 
 denotes the sensing matrix. The 
th element of the sensing matrix can be represented as
        
        where 
 denotes the modulus after 
i over 
M, while 
 denotes the remainder after 
i over 
N for 
.
  2.2. Interior Wall Multipath
While the original signals reflect once or more times at the walls, the target returns at different propagation paths will be received with different delays. As shown in 
Figure 1, in the scenario of radar detection, we consider the dominant multipath component generated by the reflections at the internal surface. As indicated by the black and blue arrows, the signal is directly transmitted from the transceiver to the target, denoted by 
, and then the scattered echo reflects back to the transceiver and vice versa. The direct return path is indicated by the red arrow. Assuming the geometry information is accessible, the time delays corresponding to the different multipath returns can be accurately acquired according to geometric relationships.
When there exists one direct path and 
 indirect paths, the signal propagation model can be considered as the superposition of all propagation paths and formulated as [
20]
        
        where 
 and 
 with 
 represent the sensing matrix and the complex reflectivities mentioned in 
Section 2.1 of the 
kth propagation path, respectively. When 
, 
 and 
 correspond to the direct path. In addition, the remaining 
 and 
 correspond to the indirect path, where the elements of 
 are phase terms, similar to (
3), containing the corresponding time delay. It is worth mentioning that the elements of target reflectivity vectors 
 have the same nonzero support on all propagation paths regardless of the different reflectivity coefficients.
Since the measurements of different propagation paths generally cannot be solved in TWRI, to overcome this difficulty, we rewrite (
4) as
        
        where 
 denotes the measurement data vector, 
 denotes the joint sensing matrix, and 
 denotes the combined complex reflectivity vector. For generalization, (
2) plus a measurement noise vector 
 is considered as the measurements.
Within the CS framework, only a portion of the full measurements is used for high-quality sparse reconstruction. Therefore, we first decrease the amount of data collected. Mathematically, the data reduction operation is equivalent to implementing a measurement or downsampling matrix 
 on the total measurements, where 
 is the number of reduced measurements. A binary measurement matrix 
 is a proper option for the stepped-frequency radar system in this article [
20]. Therefore, the measurement vector after undersampling 
 can be given by
        
        where 
 is the dictionary matrix.
  3. Clustered Bayesian Compressive Sensing
Direct application of the multi-task clustered BCS algorithm [
17] for scene reconstruction in TWRI is not feasible due to the non-resolvability of the measurements corresponding to different propagation paths. To address the limitation of unresolvable measurements, a novel clustered BCS is devised as follows.
  3.1. Sparse Prior
To improve the group sparsity across the multiple propagation paths, a Bernoulli–Gaussian distribution, also named a spike-and-slab prior [
22,
23], is incorporated in the elements of the reflectivity vector 
,
        
        where 
 with 
, 
 is the prior probability of the 
lth pixel taking nonzero value. The entry is therefore more likely to be zero if the mixing weight 
 is small, whereas a larger 
 usually results in a nonzero entry. Additionally, 
 is the precision (inverse variance) of complex Gaussian distribution, and 
 is the Dirac delta function.
To acquire an analytical inference, the product of two latent variables 
 is used to follow the pdf in (
11) and express their joint pdf as [
17],
        
        where 
, 
 is a binary variable, and 
 indicates that the 
lth pixel has a value of zero. It is evident that the group sparsity across the 
K paths is characterized by the same 
 for the 
lth pixel. Consequently, all scattering coefficients in the 
lth pixel share the same ‘on’ or ‘off’ pattern. Furthermore, the values of all coefficients 
 in the 
lth pixel are generally different, due to diverse propagation paths [
20].
A Gamma prior, conjugate to the Gaussian distribution, is incorporated in posterior 
 to make it trackable. Thus, 
 can be expressed as
        
        where 
, and 
a and 
b are hyperparameters. In particular, this nonparametric hierarchical structure makes the model infer the precision 
 automatically. For generalization, the additive noise is supposed to exhibit a Gaussian distribution, i.e., 
 Similarly, a Gamma prior is incorporated in 
 to make the posterior distribution analytical, i.e., 
, where 
c and 
d are hyperparameters.
  3.2. Clustered Prior
To take advantage of the dependency structure between pixels, we consider three kinds of sparsity patterns, namely 
Pattern 0 (“strong rejection”), 
Pattern 1 (“weak rejection”), and 
Pattern 2 (“strong acceptance”). In the underlying TWRI problem, a 2-D imaging scene is considered, as shown in 
Figure 2. An auxiliary variable 
 is defined according to the total number of nonzero pixels adjacent to the 
ith pixel under test and introduces a new nonlinear function 
 in terms of 
. To reduce inference time, we incorporate a Beta prior into weights 
 according to the clustered patterns. The Beta prior exhibits conjugately to the Bernoulli distribution and has three types of parameters 
.
Strong rejection [Pattern 0]: For a small , it implies that most or all of the neighboring pixels are zero. Therefore, there is a high likelihood that the lth test pixel will be equal to zero. In this instance,  is used in prior Beta to recommend a low value of  for rejecting this pixel.
Weak rejection [Pattern 1]: For a comparative , it implies the amounts of adjacent pixels with zero and nonzero values are comparable. In this instance, the likelihood that the lth test pixel is equal to zero or nonzero value is the same. Thus,  is used in the prior  to impose prior without information on .
Strong acceptance [Pattern 2]: For a large , it suggests that most or all of the neighboring pixels are non-zero. To enforce the target scene continuity, the lth pixel is required to be a non-zero value. In this instance,  in the prior  recommends a high value of  to adopt this pixel.
  3.3. Posterior Inference
We use a Markov Chain Monte Carlo (MCMC) sampler to conduct the Bayesian inference of the algorithm. For simplicity, the set of the hyperparameters is defined as 
, where 
 and 
. The set of random variables is defined as 
. Thus, the specific form of the joint pdf can be given by
        
According to the conjugate relationship, an analytical method will be used to determine the posterior distribution of each random variable. Since the difference between the proposed algorithm and the clustered MT-BCS lies only in the measurements, the updated variables independent of the measurements will share the same updated equations. Here we further present the inferences of those variables which are dependent on the measurements and have different updated formulations in [
17].
From the signal model, it is observed that only the variables 
 are related to the observation data 
. We first update the paired variables 
. The conditional pdf is sampled iteratively by the paired Gibbs sampler and given by
        
        where 
 and 
 represent 
 minus the variable vector 
 and 
 minus the variable 
, respectively. The logistic function is employed to calculate the analytical likelihood 
 [
17] which can be expressed as
        
        where 
u is given by
        
        and 
 denotes the collection of the 
lth column in each dictionary matrix 
.
The conditional distribution of 
 can be denoted by
        
As depicted in 
Section 3.2, each element 
 is selected based on the sparsity pattern, namely 
 is allocated as 
 according to different pattern 
. For a specific sparsity pattern 
q, a posterior distribution is obtained analytically owing to the Beta distribution on 
 and given by
        
        where 
, and 
 is the length of 
. In the two-dimensional signal instance shown in 
Figure 2, 
 indicates that each pixel is taken into account along with its 8 closest neighbors.
With the help of the conjugate relationship between the Gaussian and Gamma distributions, the analytical posterior distribution of the precision variable 
 is obtained as
        
        where 
, 
, and 
.
For the noise precision 
, we acquire the posterior distribution as
        
Algorithm 1 provides the entire procedure of the proposed clustered BCS algorithm. We employ the Gibbs sampler with consecutive sampling to infer the model parameters. It is observed that the computational complexity is 
, where 
 is the maximum number of iterations. When the maximum number of iterations is reached, the algorithm stops and outputs the result.
        
| Algorithm 1 Clustered BCS algorithm | 
| Input: | 
| Initialize  and ; | 
| Assign ; | 
| for , do | 
| for , do | 
| update  [Equation (16)]; | 
| update  [Equation (19)]; | 
| update  [Equation (20)]; | 
| update  [Equation (21)]; | 
| end for | 
| update  [Equation (22)]; | 
| end for | 
| Output: | 
| and . | 
  4. Results
This section presents simulations and experimental results to verify the correctness of the proposed radar imaging algorithm. The following values of the parameters are assumed in the experiments: hyperparameters 
a, 
b, 
c and 
d are equal to 
, 
 for strong rejection, 
 for weak rejection, and 
 for strong acceptance. The function 
 is a step function, which takes a zero value with 
 for strong rejection, an infinite value for strong acceptance with 
, or a comparative value for weak rejection. The 
 is set as 200, and the sample with the maximum marginal likelihood among the last 20 samples are selected as the estimation of 
 [
17]. To evaluate the reconstruction results, the normalized mean square error (NMSE) 
, where 
 is the estimate of the 
, is introduced as the performance index. The results of the conventional (BP) algorithm [
24], overlapping GS method [
15], BOMP [
20] and BSBL [
25] are also presented for comparison. All the results are implemented on the PC with CPU (Intel Core i7-10700K) using Matlab R2020a.
  4.1. Simulated Dataset
Simulations were performed for a monostatic radar with a 77-element monostatic ULA. The spacing between elements is set as 
 cm. Each element transmits a stepped-frequency signal and receives the target returns. The transmitted wave consists of 200 equally spaced frequency steps and covers the 1 GHz to 3 GHz. The detailed parameters are given in 
Table 2. The middle of the antenna array is regarded as the original point. The front wall is 
 meters away from the antenna array, but it is not considered in the scenario for simplicity. The locations of the two side walls are around 
 m crossrange, whereas the location of the back wall, parallel to the antenna array, is 
 m downrange. The scattering coefficient of the direct path is determined according to 
. In addition, the signal amplitudes of the received returns decrease as the distance increases. A target occupying 
 contiguous pixels is considered, as shown in 
Figure 3a. The received signal includes a total of 
 paths, consisting of the direct echo as well as indirect paths reflected on the surfaces of the three walls. To simulate the real scenario, the measurements with an SNR of 0 dB plus additive Gaussian noise are employed for simulations. When using the classical BP method, all available data are used; however, only a quarter of all frequencies are chosen at random for the CS-based imaging.
We first present the reconstructed image using the BP algorithm in 
Figure 3b in which three ghosts exist due to the three indirect paths from three interior walls, respectively. It is evident that the BP-based algorithm is unable to properly rebuild the scenario and distinguish the true target from the ghosts. The reconstruction result contains a much higher amount of clutter. In addition, the resolution of the BP algorithm is 
 m owing to the 2 GHz band. Constrained by the Rayleigh boundary, the exact shape of the target is hard to obtain. In these CS-based approaches, it is observed that these ghosts from multipath propagations can be effectively alleviated due to the exploitation of group sparsity structure, as depicted in 
Figure 3b–f. The reconstruction result using the BOMP method is provided in 
Figure 3c, and it is clear that several fake pixels are located around the real positions, even though the true sparsity number is used in advance. 
Figure 3d depicts the imaging result based on the BSBL method. It can be found that the reconstruction result is similar to the ground truth, whereas some spurious isolated pixels occur because of the target structure being neglected. 
Figure 3e gives the reconstructed result using the overlapping group CS algorithm. It is clear that the overlapping group CS algorithm can reconstruct and capture the target shape, whereas it leads to some fake pixels around the real locations owing to the inappropriate overlapping partition. 
Figure 3f suggests that the proposed method can successfully reconstruct the target with a continuously clustered structure and effectively suppress the separated and false targets by enforcing the clustered prior. The improvements of the proposed method are also verified by the NMSE values, which are, respectively, 
 in the BOMP method, 
 in the BSBL method, 
 in the overlapping GS method, and 
 in the proposed algorithm. In addition, we also provide the executive time comparisons among these methods, as shown in 
Table 3. It is observed that the BOMP method has the smallest runtime, and the runtime is less than 1 s, because the used iteration number of this method is exactly the sparsity number. The executive time in the proposed method is comparable to that in the BSBL method.
  4.2. Experimental Dataset
To verify the proposed radar imaging algorithm experimentally, the experimental data were collected at the Radar Imaging Lab, Villanova University. The parameters of the antenna array and the positions of the concrete walls are the same in the simulation. In particular, the difference is that the stepped-frequency signal contains 801 frequency steps evenly distributed from 1 GHz to 3 GHz. An aluminum pipe (length of 61 cm and diameter of 7.6 cm) was set upright on a foam pedestal (height of 1.2 m) at a 3.67 m downrange and 0.31 m crossrange. The protruding corner causing multipath is located at the upper right area at 3.4 m crossrange and 4.57 m downrange. We put RF absorbing material on the surfaces of the side walls to eliminate multipath from these two walls; however, the surfaces of the back wall and protruding corner are uncovered. The wall clutter is well removed so that only target echoes are considered in the experiments [
20]. The overall experimental scene is shown in 
Figure 5.
We first present the conventional beamformed result using all available data in 
Figure 6a. It can be observed that the ghosts caused by the uncovered surfaces are clear and distinct. Therefore, we simply analyze the cases where the reflection points are on the uncovered surfaces in the following. To reduce the time of data collection, one fourth of the array elements and the frequencies are employed in the CS reconstruction. Since the sparsity number in this dataset is hard to know, we do not consider using the BOMP mentioned in the simulation above for comparison. As depicted in 
Figure 6b, There exist a lot of separated pixels in the reconstruction result of the BSBL algorithm. From 
Figure 6c,d, it is evident that the overlapping GS and the proposed algorithm yield fewer cluttered images and better target depictions. However, it appears that there are numerous fake pixels around the real pixels due to the overlapping partition. Similar to [
20], the metric of relative clutter peak (RCP), which is defined as the maximum target amplitude divided by the maximum clutter amplitude, is introduced to evaluate the imaging quality. The clutter region is defined as the space inside the indoor scenes, excluding a circle with a radius of 10 cm at the known target position. The RCP, respectively, evaluates 
 dB, 
 dB, and 48.62 dB for the BSBL, the overlapping GS methods as well as the proposed algorithms.
  5. Conclusions
This paper focused on the problem of multipath exploitation for TWRI in view of sparsity-based reconstruction methods. The underlying idea is to take advantage of both the group sparsity due to the multipath propagations and the continuous clustered structure on the targets. A novel CS-based multipath exploitation clustered BCS approach for image reconstruction was proposed. Benefiting from the data-driven capability of the nonparametric hierarchical Bayesian model, the proposed method achieved high-resolution TWRI without any prior information about the targets, such as shapes, sizes, and numbers. In addition, it is worth mentioning that the proposed algorithm is not restricted to the ULA application, and can be generalized to any other array shapes.
The proposed method has the capability of acquiring high-resolution target images with free ghosts in the indoor multipath propagation environment, and thus it provides opportunities to better capture indoor messages through the wall, such as the shape of indoor stationary items, locations of indoor human targets, etc. It would also prompt further feature extraction and recognition of indoor targets behind the wall. Although the proposed method enables obtaining ghost-free high-resolution indoor targets image, the following challenges remain to be addressed in future work: (1) Prior knowledge of the number of propagation paths and their topology was assumed and provided for multipath exploitation; however, it is difficult to accurately acquire this information in practice. Therefore, it would be a challenging issue to acquire ghost-free high-resolution TWRI for multipath exploitation, when the indoor topology is inaccurate or completely unknown. (2) A centralized scheme with a uniform linear array was used where all measurements are available at a single processing node. However, a decentralized approach would be beneficial for practical purposes. Each of the nodes carries out part of the reconstruction while communicating with the other units. This results in the simpler and more flexible operation of the TWRI system and avoids a single point of failure. Proper algorithm design should ensure that the reconstruction quality is on par with centralized processing.
In summary, it is still quite challenging for the high-resolution TWRI in the multipath-propagation environment for practical purposes due to the diversities and complexities of the indoor configurations.