Deep Unfolding Sparse Bayesian Learning Network for Off-Grid DOA Estimation with Nested Array

Gong, Zhenghui; Su, Xiaolong; Hu, Panhe; Liu, Shuowei; Liu, Zhen

doi:10.3390/rs15225320

Open AccessCommunication

Deep Unfolding Sparse Bayesian Learning Network for Off-Grid DOA Estimation with Nested Array

College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2023, 15(22), 5320; https://doi.org/10.3390/rs15225320

Submission received: 13 September 2023 / Revised: 5 November 2023 / Accepted: 8 November 2023 / Published: 10 November 2023

(This article belongs to the Topic Advanced Array Signal Processing for B5G/6G: Models, Algorithms, and Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Recently, deep unfolding networks have been widely used in direction of arrival (DOA) estimation because of their improved estimation accuracy and reduced computational cost. However, few have considered the existence of a nested array (NA) with off-grid DOA estimation. In this study, we present a deep sparse Bayesian learning (DSBL) network to solve this problem. We first establish the signal model for off-grid DOA with NA. Then, we transform the array output into a real domain for neural networks. Finally, we construct and train the DSBL network to determine the on-grid spatial spectrum and off-grid value, where the loss function is calculated using reconstruction error and the sparsity of network output, and the layers correspond to the steps of the sparse Bayesian learning algorithm. We demonstrate that the DSBL network can achieve better generalization ability without training labels and large-scale training data. The simulation results validate the effectiveness of the DSBL network when compared with those of existing methods.

Keywords:

deep unfolding network; sparse Bayesian learning; off-grid; direction of arrival (DOA) estimation; nested array (NA)

1. Introduction

The direction of arrival (DOA) estimation for UAV emitters has been an important application in the field of array signal processing [1,2,3]. A non-uniform array is an array structure with non-uniform spacing between elements [4,5]. In the case of the same number of elements, the non-uniform array has a larger array aperture than the uniform array, which can improve the resolution of parameter estimation [6,7,8]. In addition, when the array aperture is the same, a non-uniform array requires fewer elements, which can reduce the hardware cost of the signal processing system and suppress the impact of mutual coupling between elements [9].

The sparse representation method divides the spatial domain into discrete grids, and grid mismatch (GM) occurs when the DOA does not fall on the grid [10], which can reduce the estimation performance of signal parameters. In addition, if the estimation accuracy is improved by reducing the spacing of the grid, the dimensionality of the overcomplete dictionary will lead to an increase in computational complexity in the process of sparse reconstruction. According to sparse reconstruction conditions such as the mutual incoherence property (MIP) and the restricted isometric property (RIP) [11,12], the high correlation between different columns in the overcomplete dictionary with small grid spacing will lead to the failure of sparse reconstruction algorithms. In response to grid mismatch, quantization errors are introduced into the signal model, which does not strictly limit the signal to fall on the grid [13,14,15,16,17]. Yang et al., proposed a mathematical model using basis pursuit denoising (BPDN) to jointly solve the nearest grid and corresponding quantization errors [18]. Compared to the sparse global least squares method [19], the regularization parameters in this method can be set through off-grid mathematical models and noise. In addition, using the off-grid mathematical model of the first-order Taylor expansion, Yang et al., proposed off-grid sparse Bayesian inference (OGSBI) for off-grid DOA estimation [19], which is suitable for both single and multiple snapshot situations and can reduce computational complexity through singular value decomposition. Moreover, Jagannath et al., analyzed the performance of the off-grid mathematical model and quantization error estimation with the first-order Taylor expansion [20]. Yang et al., developed iterative algorithm with the off-grid model from a Bayesian perspective [21]. Tan proposed a joint sparse recovery method to solve the problem of overcomplete dictionary mismatching, which can improve the accuracy of off-grid parameter estimation [22]. In addition, Wu et al., utilized the perturbation covariance matrix to improve the convergence of sparse Bayesian learning methods for off-grid parameter estimation [23].

The above-mentioned methods can be summarized as model-driven methods [24]. In recent years, deep learning has been gradually applied to DOA estimation [25,26,27]. However, deep neural networks and convolutional neural networks belong to the black box, and their generalization ability for untrained data is relatively poor. In addition, overfitting may occur during the training process. Noticeably, deep unfolding networks (DUNs) construct the iterative process of the sparse reconstruction method into the hidden layers of networks [28]. Since the hidden layer corresponds to an iterative process of the sparse reconstruction method, a DUN requires fewer layers for convergence than sparse reconstruction methods, which can accelerate DOA estimation. Compared to the traditional deep neural networks and convolutional neural networks, the parameters of the hidden layer in the DUNs have certain mathematical meanings, which correspond to the calculation process of iterative solutions [29,30]. During the training process, the deep unfolding network can learn the regular pattern of data and have generalization ability for untrained samples [31].

Accordingly, the contribution of this work is to construct a deep unfolding network for off-grid DOA estimation with a nested array, which can reduce computational complexity and improve the estimation accuracy. Utilizing the quantization error, we establish a mathematical model for off-grid DOA estimation with the first-order derivative of an overcomplete dictionary. In order to reduce the computational complexity, we transform the complex domain covariance vector of nested arrays into a real domain covariance vector. Considering the dual advantages of model-driven and data-driven methods in deep unfolding networks, we transform the iterative steps of the SBL algorithm into a cascaded form of neural networks and construct a deep SBL network. By alternating the on-grid spatial spectrum and the off-grid quantization error, off-grid angle estimation is achieved. The experimental results show that the computational complexity of the DSBL network is lower than that of the model-driven SBL algorithm. Moreover, the proposed DSBL network can improve the estimation accuracy under a low signal-to-noise ratio.

Notations: Throughout this paper, the italic letters (e.g., a), lowercase boldface letters (e.g., a), and the capital boldface letters (e.g., A) denote variables, vectors, and matrices, respectively.

∥ \cdot ∥_{1}

and

∥ \cdot ∥_{2}

denote

l_{1}

norm and

l_{2}

norm, respectively.

\otimes

and

⊙

denote Kronecker and Khatri–Rao products, respectively.

E (\cdot)

,

vec (\cdot)

, and

diag (\cdot)

denote mathematical expectation, vectorization operator, and diagonal operator, respectively.

{(\cdot)}^{*}

,

{(\cdot)}^{T}

, and

{(\cdot)}^{H}

denote complex conjugate, transpose, and Hermitian transpose, respectively.

ℜ (\cdot)

and

ℑ (\cdot)

denote the real part and imaginary part of a complex value, respectively.

2. Signal Model for Off-Grid DOA with NA

In practice, the geometry of a nested array (NA) contains M elements, where the internal spacing of the first subarray is d, which is located at the following position:

\{ξ_{m} |ξ_{m} = m d, m = 1, 2, …, M / 2\},

(1)

and the internal spacing of the second subarray is (M/2 + 1)d, which is located at the following position:

\{ξ_{m} |ξ_{m} = (m - M / 2) (M / 2 + 1) d, m = M / 2 + 1, M / 2 + 2, …, M\} .

(2)

Considering that the configuration of NAs is impinged by K narrowband signals from different DOAs, the array output at the nth snapshot can be expressed as follows:

\begin{matrix} x (n) & = {[x_{1} (n) x_{1} (n) \dots x_{M} (n)]}^{T} \\ = \sum_{k = 1}^{K} a (θ_{k}) s_{k} (n) + w (n) \\ = A s (n) + w (n), \end{matrix}

(3)

where

s (n) = [s_{1} (n) s_{2} (n) \dots s_{K} (n)]^{T}

denotes the vector of K sources at the nth snapshot, A =

[a (θ_{1}) a (θ_{2}) \dots a (θ_{K})]

stands for the steering matrix,

a (θ_{k}) = {[a_{1} (θ_{k}) a_{2} (θ_{k}) \dots a_{M} (θ_{k})]}^{T}

with

a_{M} (θ_{k}) = \exp (- j (2 π ξ_{m} \sin θ_{k} / λ))

stands for the steering vector of the kth signals,

λ

denotes the wavelength of signals, and w(n) =

{[w_{1} (n) w_{1} (n) \dots w_{M} (n)]}^{T}

denotes the vector of Gaussian white noise at the nth snapshot.

We consider the covariance matrix of array output for NAs, which are calculated as follows:

\begin{matrix} R & = E (x (n) x^{H} (n)) = A diag ({[\begin{matrix} σ_{1}^{2} & σ_{2}^{2} & \dots & σ_{K}^{2} \end{matrix}]}^{T}) A^{H} + σ_{w}^{2} I_{M} \\ \approx \frac{1}{N} \sum_{n = 1}^{N} x (n) x^{H} (n), \end{matrix}

(4)

where

σ_{k}^{2}

denotes the power of the kth signal and I_M denotes the M × M dimensional identity matrix.

Therefore, the vector form of the covariance matrix can be expressed as follows:

\begin{matrix} y & = vec (R) \\ = (A^{*} ⊙ A) {[σ_{1}^{2} σ_{2}^{2} \dots σ_{K}^{2}]}^{T} + σ_{W}^{2} {[η_{1}^{T} η_{2}^{T} \dots η_{M}^{T}]}^{T}, \end{matrix}

(5)

where

η_{m}^{T}

denotes the M dimensional vector, the mth element of

η_{m}^{T}

is 1, and the remaining elements are 0; the equivalent steering matrix

A^{*} ⊙ A

can be calculated as follows:

A^{*} ⊙ A = [a^{*} (θ_{1}) \otimes a (θ_{1}) a^{*} (θ_{2}) \otimes a (θ_{2}) \dots a^{*} (θ_{K}) \otimes a (θ_{K})],

(6)

The scenario of off-grid DOA estimation is shown in Figure 1. In the off-grid case, using the first-order derivative of the overcomplete dictionary on the grid, the sparse representation of the covariance vector in (5) can be constructed as follows:

y = (Φ + F B) z + σ_{w}^{2} {[η_{1}^{T} η_{2}^{T} \dots η_{M}^{T}]}^{T},

(7)

where

σ_{w}^{2}

denotes the noise power,

z = {[z_{1} z_{2} \dots z_{Q}]}^{T}

denotes the spatial spectrum on the grid, and the values of

{θ_{q_{1}}, θ_{q_{2}}, \dots, θ_{q_{K}}}

in the spatial spectrum are

{σ_{1}^{2}, σ_{2}^{2}, \dots, σ_{K}^{2}}

. In addition,

Φ + F B

denotes the off-grid overcomplete dictionary, and

Φ

denotes the on-grid overcomplete dictionary, which can be expressed as follows:

\begin{matrix} Φ & = [φ (θ_{1}) φ (θ_{2}) \dots φ (θ_{Q})] \\ = [a^{*} (θ_{1}) \otimes a (θ_{1}) a^{*} (θ_{2}) \otimes a (θ_{2}) \dots a^{*} (θ_{Q}) \otimes a (θ_{Q})] \end{matrix}

(8)

where

a (θ_{q}) = {[a_{1} (θ_{q}) a_{2} (θ_{q}) \dots a_{M} (θ_{q})]}^{T}

denotes the steering vector corresponding to the qth angle in the angle set

{θ_{1}, θ_{2}, \dots, θ_{Q}}

and F denotes the first derivative of the overcomplete dictionary on the grid, which can be expressed as follows:

\begin{matrix} F & = [f (θ_{1}) f (θ_{2}) \dots f (θ_{Q})] \\ = [φ^{'} (θ_{1}) φ^{'} (θ_{2}) \dots φ^{'} (θ_{Q})] \\ = [{(a^{*} (θ_{1}) \otimes a (θ_{1}))}^{'} {(a^{*} (θ_{2}) \otimes a (θ_{2}))}^{'} \dots {(a^{*} (θ_{Q}) \otimes a (θ_{Q}))}^{'}] \end{matrix}

(9)

In addition, B denotes the quantization error matrix, which can be expressed as the following:

B = diag (b)

(10)

where

b = {[b_{1} b_{2} \dots b_{Q}]}^{T}

represents the quantization error vector, the values of

{θ_{q_{1}}, θ_{q_{2}}, \dots, θ_{q_{K}}}

in the quantization error vector are

{Δ_{1}, Δ_{2}, \dots, Δ_{K}}

, and the values of the remaining elements in the quantization error vector are 0;

Δ_{q_{k}} = θ_{k} - θ_{q_{k}}

denotes the quantization error between the DOA of the kth signal and the closest angle on the grid.

3. Proposed Algorithm

In this section, the DSBL network is used to determine the on-grid spatial spectrum and off-grid quantization error via NA, where the layers of the DSBL network correspond to the steps of the model-driven SBL method. Since the neural networks are suitable for dealing with the real-valued data, we transformed the array output into the real domain in advance.

3.1. Transformation of the Array Output to the Real Domain

Considering that the neural networks are suitable for dealing with real-valued data, we rewrote Equation (7) to the following form:

[\begin{matrix} ℜ (y) \\ ℑ (y) \end{matrix}] = [\begin{matrix} ℜ (Φ + F B) & - ℑ (Φ + F B) \\ ℑ (Φ + F B) & ℜ (Φ + F B) \end{matrix}] [\begin{matrix} ℜ (z) \\ ℑ (z) \end{matrix}] + {[\begin{matrix} ℜ (σ_{w}^{2} {[\begin{matrix} η_{1}^{T} & η_{2}^{T} & \dots & η_{M}^{T} \end{matrix}]}^{T}) \\ ℑ (σ_{w}^{2} {[\begin{matrix} η_{1}^{T} & η_{2}^{T} & \dots & η_{M}^{T} \end{matrix}]}^{T}) \end{matrix}]}_{\cdot}

(11)

Since z is considered as the DOA spatial spectrum with source power, and its imaginary part is zero, Equation (11) can be equivalently rewritten as follows:

\begin{matrix} [\begin{matrix} ℜ (y) \\ ℑ (y) \end{matrix}] & = [\begin{matrix} ℜ (Φ + F B) \\ ℑ (Φ + F B) \end{matrix}] z + [\begin{matrix} σ_{w}^{2} {[η_{1}^{T} η_{2}^{T} \dots η_{M}^{T}]}^{T} \\ 0_{M^{2} \times 1} \end{matrix}] \\ = ([\begin{matrix} ℜ (Φ) \\ ℑ (Φ) \end{matrix}] + [\begin{matrix} ℜ (F) \\ ℑ (F) \end{matrix}] B) z + [\begin{matrix} σ_{w}^{2} {[η_{1}^{T} η_{2}^{T} \dots η_{M}^{T}]}^{T} \\ 0_{M^{2} \times 1} \end{matrix}] \end{matrix}

(12)

where

ℜ (Φ) = [ℜ (φ (θ_{1})) ℜ (φ (θ_{2})) \dots ℜ (φ (θ_{q})) \dots ℜ (φ (θ_{Q}))]

(13)

ℑ (Φ) = [ℑ (φ (θ_{1})) ℑ (φ (θ_{2})) \dots ℑ (φ (θ_{q})) \dots ℑ (φ (θ_{Q}))]

(14)

and

φ (θ_{q})

denotes the qth column of

Φ

. Furthermore,

ℜ (F)

and

ℑ (F)

can be expressed as follows:

ℜ (F) = [ℜ (f (θ_{1})) ℜ (f (θ_{2})) \dots ℜ (f (θ_{q})) \dots ℜ (f (θ_{Q}))]

(15)

ℑ (F) = [ℑ (f (θ_{1})) ℑ (f (θ_{2})) \dots ℑ (f (θ_{q})) \dots ℑ (f (θ_{Q}))]

(16)

where

f (θ_{q})

denotes the qth column of

F

.

3.2. Deep Unfolding Sparse Bayesian Learning Network

In order to accelerate the convergence speed of the SBL algorithm, we expanded the iterative steps of the SBL method into the network of cascade form, where the estimation of off-grid DOA is calculated by the peaks of the on-grid spatial spectrum and the corresponding off-grid quantization errors. As shown in Figure 2, the DSBL network contains L layers for on-grid spatial spectrum estimation and off-grid quantization error estimation, where the previous layer of off-grid quantization error matrix B is used to estimate the current layer of on-grid spatial spectrum Z, and the current layer of on-grid spatial spectrum Z is used to estimate the previous layer of off-grid quantization error matrix B.

In practice, the covariance vector in the real domain can be expressed as follows:

\tilde{y} = (\tilde{Φ} + \tilde{F} B) z + [\begin{matrix} σ_{w}^{2} {[\begin{matrix} η_{1}^{T} & η_{2}^{T} & \dots & η_{M}^{T} \end{matrix}]}^{T} \\ 0_{M^{2} \times 1} \end{matrix}]

(17)

where

\tilde{y} = {[ℜ (y^{T}) ℑ (y^{T})]}^{T}

,

\tilde{Φ} = {[ℜ (Φ^{T}) ℑ (Φ^{T})]}^{T}

, and

\tilde{F} = {[ℜ {(F)}^{T} ℑ {(F)}^{T}]}^{T}

.

Considering that the noise part in Equation (17) makes the optimization problem more nebulous [32], we constructed the following convex relaxation:

\min ({‖\tilde{y} - (\tilde{Φ} + \tilde{F} B) z‖}_{2}^{2} + ζ {‖z‖}_{1}) .

(18)

By integrating the amplitudes of spatial power in the SBL framework [33], the probability of

\tilde{y}

with respect to the hyperparameters z and

σ_{w}^{2}

can be expressed as follows:

\begin{matrix} p (\tilde{y} |z, σ_{w}^{2}) & = \int p (\tilde{y} |z, σ_{w}^{2}) p (z |γ) d z \\ = \frac{\exp (- tr ({\tilde{y}}^{H} Σ_{\tilde{y}}^{- 1} \tilde{y}))}{\det (π Σ_{\tilde{y}})}, \end{matrix}

(19)

where

z = [z_{1} z_{2} \dots z_{Q}]^{T}

denotes the on-grid spatial power spectrum and

Σ_{\tilde{y}} = (\tilde{Φ} + \tilde{F} B) Z {(\tilde{Φ} + \tilde{F} B)}^{H} + σ_{w}^{2} I_{2 M^{2}}

with Z = diag(z) denotes the covariance matrix of the array output.

Therefore, the hyperparameters z and

σ_{w}^{2}

can be estimated by maximizing

p (\tilde{y} |z, σ_{w}^{2})

, which can be considered as the type-II maximum likelihood (ML) problem and is derived from the EM method in [34]. In this study, z and

σ_{w}^{2}

are updated by exploiting the iterative E-steps and M-steps of the EM method.

The estimation of the on-grid spatial spectrum in the DSBL network consists of L layers, where

\tilde{y}

,

\tilde{Φ}

, and

\tilde{F}

are considered as the input of each layer, and the initial hyperparameters are set as

Ζ^{(0)} = eye (Q)

and

{(σ_{w}^{2})}^{(0)} = 1

. Based on the EM method, the E-step in the lth layer for on-grid spatial spectrum estimation is performed to calculate the posteriori mean and posteriori covariance:

μ_{z}^{(l)} = Z^{(l - 1)} {(\tilde{Φ} + \tilde{F} B)}^{H} {(Σ_{\tilde{y}}^{(l - 1)})}^{- 1} \tilde{y},

(20)

Σ_{z}^{(l)} = Z^{(l - 1)} - Z^{(l - 1)} {(\tilde{Φ} + \tilde{F} B)}^{H} {(Σ_{\tilde{y}}^{(l - 1)})}^{- 1} (\tilde{Φ} + \tilde{F} B) Z^{(l - 1)},

(21)

for l = 1, 2, …, L, where

Σ_{\tilde{y}}^{(l - 1)} = (\tilde{Φ} + \tilde{F} B) Z^{(l - 1)} {(\tilde{Φ} + \tilde{F} B)}^{H} + {(σ_{w}^{2})}^{(l - 1)} I_{2 M^{2}}

denotes the array covariance.

Moreover, the M-step in the lth layer for on-grid spatial spectrum estimation is performed to calculate the following:

Z^{(l)} = μ_{z}^{(l)} {(μ_{z}^{(l)})}^{T} + Σ_{z}^{(l)},

(22)

and the corresponding noise variance for on-grid spatial spectrum estimation is derived from the following:

{(σ_{w}^{2})}^{(l)} = \frac{1}{2 M^{2}} {‖\tilde{y} - (\tilde{Φ} + \tilde{F} B) μ_{z}^{(l)}‖}_{2}^{2} + \frac{{(σ_{w}^{2})}^{(l - 1)}}{2 M^{2}} (Q - \sum_{q = 1}^{Q} \frac{{(Σ_{z}^{(l)})}_{q, q}}{{(Z^{(l)})}_{q, q}}) \begin{matrix} , \end{matrix}

(23)

where

{(Σ_{z}^{(l)})}_{q, q}

and

{(Z^{(l)})}_{q, q}

denote the (q,q)th element of

Σ_{z}^{(l)}

and

Z^{(l)}

, respectively.

As for the estimation of the off-grid quantization error matrix B, the sparse representation of the covariance vector in the real domain can be constructed as follows:

\tilde{y} - \tilde{Φ} z = \tilde{F} (B z) + [\begin{matrix} σ_{w}^{2} {[η_{1}^{T} η_{2}^{T} \dots η_{M}^{T}]}^{T} \\ 0_{M^{2} \times 1} \end{matrix}] \begin{matrix} , \end{matrix}

(24)

Similarly, the optimization problem of off-grid quantization error estimation can be expressed as follows:

\min ({‖\tilde{g} - \tilde{F} (B z)‖}_{2}^{2} + ζ_{2} {‖B z‖}_{1}),

(25)

where

\tilde{g} = \tilde{y} - \tilde{Φ} z

. In this study, the estimation of the off-grid quantization error in the DSBL network consists of L layers; the initial hyperparameters are set as

Β^{(0)} = eye (Q)

and

{(σ_{B}^{2})}^{(0)} = 1

.

Based on the EM method, the E-step in the lth layer for off-grid quantization error estimation is performed to calculate the posteriori mean and posteriori covariance:

μ_{B}^{(l)} = Γ^{(l - 1)} {\tilde{F}}^{H} {(Σ_{\tilde{g}}^{(l - 1)})}^{- 1} {\tilde{g}}^{(l - 1)},

(26)

Σ_{B}^{(l)} = Γ^{(l - 1)} - Γ^{(l - 1)} {\tilde{F}}^{H} {(Σ_{\tilde{g}}^{(l - 1)})}^{- 1} \tilde{F} Γ^{(l - 1)},

(27)

for l = 1, 2, …, L, where

{\tilde{g}}^{(l - 1)} = \tilde{y} - \tilde{Φ} diag (Z^{(l - 1)})

, and

Σ_{\tilde{g}}^{(l - 1)} = \tilde{F} Γ^{(l - 1)} {\tilde{F}}^{H} + {(σ_{B}^{2})}^{(l - 1)} I_{2 M^{2}}

denotes the array covariance.

Moreover, the M-step in the lth layer of the off-grid quantization error estimation is performed to calculate the following:

Γ^{(l)} = μ_{B}^{(l)} {(μ_{B}^{(l)})}^{T} + Σ_{B}^{(l)},

(28)

and the corresponding noise variance of the off-grid quantization error estimation is derived from the following:

{(σ_{Β}^{2})}^{(l)} = \frac{1}{2 M^{2}} {‖\tilde{g} - \tilde{F} μ_{Β}^{(l)}‖}_{2}^{2} + \frac{{(σ_{Β}^{2})}^{(l - 1)}}{2 M^{2}} (Q - \sum_{q = 1}^{Q} \frac{{(Σ_{Β}^{(l)})}_{q, q}}{{(Γ^{(l)})}_{q, q}}) \begin{matrix} , \end{matrix}

(29)

where

{(Σ_{B}^{(l)})}_{q, q}

and

{(Γ^{(l)})}_{q, q}

denote the (q,q)th element of

Σ_{Β}^{(l)}

and

Γ^{(l)}

, respectively.

Therefore, by employing the output of the lth layer for the on-grid spatial spectrum

Z^{(l)}

and off-grid quantization error

Γ^{(l)}

, the qth element on the diagonal of the off-grid quantization error matrix

B^{(l)}

can be calculated as follows:

b_{q}^{(l)} = Γ_{q}^{(l)} / z_{q}^{(l)},

(30)

Generally, in the training progress of the proposed DSBL network, a stochastic gradient descent (SGD) is exploited to renew the trainable parameters. Referring to the convex relaxation in Equations (18) and (25), the loss function is defined as follows:

\min (\sum_{t = 1}^{T} ({‖{\tilde{y}}_{t} - (\tilde{Φ} + \tilde{F} B) diag (Z_{t}^{(L)})‖}_{2}^{2} + ζ_{1} {‖diag (Z_{t}^{(L)})‖}_{1} + ζ_{2} {‖B_{t}^{(L)} diag (Z_{t}^{(L)})‖}_{1})) \begin{matrix} . \end{matrix}

(31)

for t = 1, 2, …, T, where T stands for the total number of samples in the dataset,

{\tilde{y}}_{t}

stands for the network input, and

Z_{t}^{(L)}

and

B_{t}^{(L)}

stand for the estimated DOA spectrum from the network output. Moreover, the proposed DSBL network can determine the off-grid DOA without training labels and large-scale training data, which have generalization abilities with interpretable parameters and layers for off-grid DOA estimation.

Based on the output of the lth layer of the on-grid spatial spectrum estimation and the output of the lth layer of the off-grid quantization error estimation, the off-grid DOA of the kth signal can be calculated as follows:

{\hat{θ}}_{k} = θ_{k}^{(L)} + b_{k}^{(L)}

(32)

where

θ_{k}^{(L)}

stands for the angle corresponding to the kth spectral peak in

z^{(L)}

and

b_{k}^{(L)}

stands for the value of the kth spectral peak in

b^{(L)}

, where

b^{(L)} = {[b_{1}^{(L)} b_{2}^{(L)} \dots b_{Q}^{(L)}]}^{T}

.

3.3. Network Implementation of Proposed Method

Overall, the main steps of the trained DSBL network for DOA estimation are summarized in Algorithm 1 (The process of implementing a proposed DSBL algorithm).

Algorithm 1 DSBL Network for Off-Grid DOA Estimation with NA
1:	Calculate the covariance matrix using Equation (4).
2:	Apply the vector form of covariance matrix in Equation (5).
3:	Combine real and imaginary parts in Equation (12) as the input of the DSBL network.
4:	Perform the trained DSBL network to acquire spatial spectrum and off-grid quantization error.
5:	Obtain off-grid DOA from the peaks of the spatial spectrum and the corresponding off-grid quantization error in Equation (3).

As for the computational complexity of the proposed DSBL network, the covariance matrix in Equation (4) requires M²N multiplications and M²(N − 1) additions; the vectorization in Equation (5) and combination in Equation (12) do not require additional calculation. When employing the trained deep unfolded SBL network to obtain the on-grid spatial spectrum and off-grid quantization error, the calculation of

Σ_{\tilde{y}}^{(l)}

and

Σ_{\tilde{g}}^{(l)}

requires 4M⁴Q + 2M²Q multiplications and 4M⁴(Q − 1) additions; the calculation of

μ_{z}^{(l)}

and

μ_{B}^{(l)}

requires 4QM²(M² + 1) multiplications, Q(4M⁴ − 1) additions, and O(2M²) for the inverse operator; the calculation of

Σ_{z}^{(l)}

and

Σ_{B}^{(l)}

requires Q(4M⁴ + 2M² + 2M²Q + Q²) multiplications, Q(4M⁴ − 2M² + 2M²Q + Q² − 2Q) additions, 2M² subtractions, and O(2M²) for the inverse operator; the calculation of

Z^{(l)}

and

B^{(l)}

requires Q² multiplications and 2Q² additions; and the calculation of

{(σ_{w}^{2})}^{(l)}

in Equation (23) requires 2M² (Q + 1) multiplications, 2QM² + Q − 2 additions, 2M² subtractions, and Q + 2 divisions.

4. Computer Simulation Experiments

In this section, a two-level NA with six sensors was exploited to investigate the performance of the DSBL network for off-grid DOA estimation. Specifically, the locations were set as [1,2,3,4,8,12]d, and the angle interval of the overcomplete dictionary was set as 1°. Of the samples, 80% of those in the dataset were used for network training, and 20% were used for network validation. Each sample was generated by two signals, with an off-grid angle between −60° and 60°. The signal-to-noise ratio (SNR) was selected from 0 dB to 20 dB, and the number of snapshots was selected from 100 to 500. During the training process of the network, the batch size, epoch, and learning rate were set to 16, 16, and 0.01, respectively.

4.1. Layer Number Determination

In this subsection, we employed simulation experiments to determine the layer number. During the training process of the DSBL network, the appropriate layer number was determined by the mean square error (MSE). In this research, the MSE is defined as follows:

MSE = \frac{1}{T} \sum_{t = 1}^{T} {((Z_{t}^{(L)} - Z_{t}^{label}) + (B_{t}^{(L)} - B_{t}^{label}))}^{2} \begin{matrix} . \end{matrix}

(33)

where

Z_{t}^{(L)}

and

Z_{t}^{label}

denote the output and label of the on-grid spatial spectrum for the tth data, respectively;

B_{t}^{(L)}

and

B_{t}^{label}

denote the output and label of the off-grid quantization error matrix for the tth data, respectively.

During the training and validation process of the DSBL network, the variation in the RMSE with the epoch is shown in Figure 3. From the figure, it can be seen that the RMSEs of layers 10, 20, 30, and 40 gradually decrease with the increase in epoch during the training process, and the RMSE of the 40-layer network is smaller than that of the other layer networks. This indicates that the estimation accuracy of the 40-layer network is better than that of the other layer networks. Due to insufficient training in the initial stage, when the epoch is less than five, the RMSEs of the networks with more layers are greater than those of networks with fewer layers. As the epoch increases, the RMSEs of networks with more layers gradually decrease compared to networks with fewer layers. After the training process of the DSBL network is completed, the RMSE of the 40-layer network is slightly smaller than that of the 30-layer network. In order to balance the accuracy of the off-grid angle estimation and computational complexity, the DSBL network is set to 30 layers. In addition, as shown in Figure 3b, during the validation process, the RMSEs of the 10-, 20-, 30-, and 40-layer networks gradually decreased with the increase in epoch, indicating that there was no overfitting during the training process of the DSBL network.

4.2. Comparison of Convergence Performance

When the off-grid DOA of the test samples are set to −10.95° and 2.98°, Figure 4 shows the relative error of the off-grid DOA estimation. The red solid line represents the relative error of the DSBL network, and the blue dashed line represents the relative error of the model-driven algorithm. As shown in Figure 4, compared to the 8-layer DSBL network, the model-driven algorithm converges after 16 iterations. Due to the computational complexity of each layer in the DSBL network being the same as in the model-driven algorithm, the DSBL network can converge in a shorter time.

4.3. Generalization Ability for Off-Grid DOA Estimation

In order to verify the generalization ability of the deep unfolding network for off-grid DOAs under different numerical conditions, a total of 120 test samples were generated, with each containing a signal. The angle of the spatial spectrum on the grid was set to −60° to 59° with an interval of 1°; the off-grid quantization error was set to a random value of 0° to 1°; and the signal-to-noise ratio was set to 5 dB. The off-grid DOA estimates obtained through the DSBL network are shown in Figure 5a, and the off-grid estimation error is shown in Figure 5b. It can be seen that the DSBL network has the ability to generalize the off-grid angle under different numerical conditions. Due to the fact that the DSBL network models the iterative steps of the corresponding sparse reconstruction algorithm as hidden layers of the neural network, the network parameters have certain mathematical meanings. During the training process, the deep unfolding network can learn the rules hidden behind the data. Therefore, for untrained data, the DSBL network can also estimate the off-grid angle.

In this study, the DOA estimates and errors are shown in Figure 5a,b, respectively, where the abscissa denotes the testing index and the red dots and blue dots denote the first sources and the second sources, respectively. Therefore, we can conclude that the trained DSBL network has a generalization ability for the DOA estimation of off-grid sources.

4.4. RMSE Comparison of DOA Estimation

In this subsection, root mean square error (RMSE) analysis is performed to investigate the performance of the proposed DSBL network. In this study, the RMSE of DOA estimation is defined as follows:

RMSE (θ) = \sqrt{\frac{1}{V K} \sum_{v = 1}^{V} \sum_{k = 1}^{K} {({\hat{θ}}_{k}^{(v)} - θ_{k})}^{2}},

(34)

where

θ_{k}

denotes the real DOA of the kth source and

{\hat{θ}}_{k}^{(v)}

denotes the estimated DOA of the kth source in the vth Monte Carlo simulation experiment.

The RMSE of the proposed DSBL network was compared with the FOCUSS network in [35], the RVSBL algorithm in [36], the JSR algorithm in [13], and the Cramér–Rao lower bound (CRLB) in [37]. In total, 500 simulation experiments were performed to calculate the RMSEs of two sources, where the DOAs were set as −10.1° and 20.8°, respectively. The RMSE versus the SNR and snapshot number are shown in Figure 6a,b, respectively, where the RMSE gradually decreases with the increase in the SNR and snapshot number. Since the noise power is not updated in the iterative process of the JSR algorithm and RVSBL algorithm, the accuracy of the DSBL network outperforms the existing methods.

5. Conclusions

In this research, the DSBL network was constructed for off-grid DOA estimation using the geometry of NAs with mutual coupling. Firstly, the array covariance of the NA was transformed into an equivalent single snapshot, which can form a continuous array with virtual sensors and increase the degrees of freedom. Then, the vectorization of the array covariance was transformed into the real domain and considered as the input to the DSBL network. Next, the DSBL network was constructed and trained to determine the MCC, where the iterative steps of the EM algorithm were transformed into the layers of the DSBL network, and the loss function was only related to the reconstruction error and the sparsity of the network output. Therefore, the training labels and large-scale training data were not required during the training process of the DSBL network. Finally, the off-grid DOA can be obtained from the peaks of the spatial spectrum and the corresponding off-grid quantization error. The simulation results demonstrate that the proposed DSBL network has better generalization ability with interpretable parameters and layers for off-grid DOA estimation with different source numbers. Compared with the joint sparse recovery method, the SBL method, and the RARE method, the proposed DSBL network achieved a more accurate DOA estimation in the cases of limited snapshot numbers and low SNRs.

Author Contributions

Conceptualization, Z.G. and X.S.; methodology, Z.G. and P.H.; software, X.S. and S.L.; writing—review and editing, Z.G. and X.S.; supervision, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the National Natural Science Foundation of China (62201588, 62022091, 61921001) and, in part, by the research program of the National University of Defense Technology (ZK21-14).

Data Availability Statement

Data sharing not applicable. No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare there are no conflict of interest.

References

Zhang, H.; Yan, J.; Liu, W.; Zhang, Q. Array Scheduling with Power and Bandwidth Allocation for Simultaneous Multibeam Tracking Low-Angle Targets in a VHF-MIMO Radar. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 5714–5730. [Google Scholar] [CrossRef]
Zhang, Z.; Wen, F.; Shi, J. 2D-DOA estimation for coherent signals via a polarized uniform rectangular array. IEEE Signal Process. Lett. 2023, 30, 893–897. [Google Scholar] [CrossRef]
Wang, X.; Guo, Y.; Wen, F. EMVS-MIMO radar with sparse Rx geometry: Tensor modeling and 2D direction finding. IEEE Trans. Aerosp. Electron. Syst. 2023. [Google Scholar] [CrossRef]
Moffet, A. Minimum-Redundancy Linear Arrays. IEEE Trans. Antennas Propag. 1968, 16, 172–175. [Google Scholar] [CrossRef]
Pal, P.; Vaidyanathan, P.P. Nested Arrays: A Novel Approach to Array Processing with Enhanced Degrees of Freedom. IEEE Trans. Signal Process. 2010, 58, 4167–4181. [Google Scholar] [CrossRef]
Vaidyanathan, P.P.; Pal, P. Sparse Sensing with Co-Prime Samplers and Arrays. IEEE Trans. Signal Process. 2011, 59, 573–586. [Google Scholar] [CrossRef]
Qin, S.; Zhang, Y.D.; Amin, M.G. Generalized Coprime Array Configurations for Direction-of-Arrival Estimation. IEEE Trans. Signal Process. 2015, 63, 1377–1390. [Google Scholar] [CrossRef]
Sellone, F.; Serra, A. A novel online mutual coupling compensation algorithm for uniform and linear arrays. IEEE Trans. Signal Process. 2007, 55, 560–573. [Google Scholar] [CrossRef]
BouDaher, E.; Ahmad, F.; Amin, M.G.; Hoorfar, A. Mutual coupling effect and compensation in non-uniform arrays for direction-of-arrival estimation. Digit. Signal Process. 2017, 61, 3–14. [Google Scholar] [CrossRef]
Chen, P.; Cao, Z.; Chen, Z.; Wang, X. Off-Grid DOA Estimation Using Sparse Bayesian Learning in MIMO Radar with Unknown Mutual Coupling. IEEE Trans. Signal Process. 2019, 67, 208–220. [Google Scholar] [CrossRef]
Donoho, D.L. Compressed Sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
Candes, E.J.; Tao, T. Near-Optimal Signal Recovery from Random Projections: Universal Encoding Strategies. IEEE Trans. Inf. Theory 2006, 52, 5406–5425. [Google Scholar] [CrossRef]
Liu, Q.; So, H.C.; Gu, Y. Off-Grid DOA Estimation with Nonconvex Regularization via Joint Sparse Representation. Signal Process. 2017, 140, 171–176. [Google Scholar] [CrossRef]
Zhang, X.; Jiang, T.; Li, Y.; Liu, X. An Off-Grid DOA Estimation Method Using Proximal Splitting and Successive Nonconvex Sparsity Approximation. IEEE Access 2019, 7, 66764–66773. [Google Scholar] [CrossRef]
Das, A. Theoretical and Experimental Comparison of Off-Grid Sparse Bayesian Direction-of-Arrival Estimation Algorithms. IEEE Access 2017, 5, 18075–18087. [Google Scholar] [CrossRef]
Yang, J.; Yang, Y. A Correlation-Aware Sparse Bayesian Perspective for DOA Estimation with Off-Grid Sources. IEEE Trans. Antennas Propag. 2019, 67, 7661–7666. [Google Scholar] [CrossRef]
Chen, F.; Dai, J.; Hu, N.; Ye, Z. Sparse Bayesian Learning for Off-Grid DOA Estimation with Nested Arrays. Digit. Signal Process. 2018, 82, 187–193. [Google Scholar] [CrossRef]
Yang, Z.; Zhang, C.; Xie, L. Robustly Stable Signal Recovery in Compressed Sensing with Structured Matrix Perturbation. IEEE Trans. Signal Process. 2012, 60, 4658–4671. [Google Scholar] [CrossRef]
Zhu, H.; Leus, G.; Giannakis, G.B. Sparsity-Cognizant Total Least-Squares for Perturbed Compressive Sampling. IEEE Trans. Signal Process. 2011, 59, 2002–2016. [Google Scholar] [CrossRef]
Jagannath, R.; Hari KV, S. Block Sparse Estimator for Grid Matching in Single Snapshot DoA Estimation. IEEE Signal Process. Lett. 2013, 20, 1038–1041. [Google Scholar] [CrossRef]
Yang, Z.; Xie, L.; Zhang, C. Off-Grid Direction of Arrival Estimation Using Sparse Bayesian Inference. IEEE Trans. Signal Process. 2013, 61, 38–43. [Google Scholar] [CrossRef]
Tan, Z.; Yang, P.; Nehorai, A. Joint Sparse Recovery Method for Compressed Sensing with Structured Dictionary Mismatches. IEEE Trans. Signal Process. 2014, 62, 4997–5008. [Google Scholar] [CrossRef]
Wu, X.; Zhu, W.; Yan, J. Direction of Arrival Estimation for Off-Grid Signals Based on Sparse Bayesian Learning. IEEE Sens. J. 2016, 16, 2004–2016. [Google Scholar] [CrossRef]
Li, Y.; Tofighi, M.; Monga, V.; Eldar, Y.C. An Algorithm Unrolling Approach to Deep Image Deblurring. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 7675–7679. [Google Scholar]
Borgerding, M.; Schniter, P.; Rangan, S. AMP-Inspired Deep Networks for Sparse Linear Inverse Problems. IEEE Trans. Signal Process. 2017, 65, 4293–4308. [Google Scholar] [CrossRef]
Sun, J.; Li, H.; Xu, Z. Deep ADMM-Net for Compressive Sensing MRI. In Proceedings of the Advanced Neural Information Processing Systems (NIPS), Barcelona, Spain, 5–10 December 2016; pp. 10–18. [Google Scholar]
Yang, Y.; Sun, J.; Li, H.; Xu, Z. ADMM-CSNet: A Deep Learning Approach for Image Compressive Sensing. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 521–538. [Google Scholar] [CrossRef]
Zheng, S.; Jayasumana, S.; Romera-Paredes, B.; Vineet, V.; Su, Z.; Du, D.; Huang, C.; Torr, P.H. Conditional Random Fields as Recurrent Neural Networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1529–1537. [Google Scholar]
Hosseini, S.A.; Yaman, B.; Moeller, S.; Hong, M.; Akçakaya, M. Dense Recurrent Neural Networks for Accelerated MRI: History-Cognizant Unrolling of Optimization Algorithms. IEEE J. Sel. Top. Signal Process. 2020, 14, 1280–1291. [Google Scholar] [CrossRef]
Li, R.; Zhang, S.; Zhang, C.; Liu, Y.; Li, X. Deep Learning Approach for Sparse Aperture ISAR Imaging and Autofocusing Based on Complex-Valued ADMM-Net. IEEE Sens. J. 2021, 21, 3437–3451. [Google Scholar] [CrossRef]
Li, R.; Zhang, S.; Zhang, C.; Liu, Y.; Li, X. A Computational Efficient 2-D Block-Sparse ISAR Imaging Method Based on PCSBL-GAMP-Net. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–14. [Google Scholar] [CrossRef]
Wipf, D.P.; Rao, B.D. An empirical Bayesian strategy for solving the simultaneous sparse approximation problem. IEEE Trans. Signal Process. 2007, 55, 3704–3716. [Google Scholar] [CrossRef]
Liu, Z.; Huang, Z.; Zhou, Y. An efficient maximum likelihood method for direction-of-arrival estimation via sparse Bayesian learning. IEEE Trans. Wirel. Commun. 2012, 11, 1–11. [Google Scholar] [CrossRef]
Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Society. Ser. B Methodol. 1977, 39, 1–38. [Google Scholar] [CrossRef]
Su, X.; Liu, Z.; Shi, J.; Hu, P.; Liu, T.; Li, X. Real-Valued Deep Unfolded Networks for Off-Grid DOA Estimation via Nested Array. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 4049–4062. [Google Scholar] [CrossRef]
Das, A. Real-valued sparse Bayesian learning for off-grid direction-of-arrival (DOA) estimation in ocean acoustics. IEEE J. Ocean. Eng. 2021, 46, 172–182. [Google Scholar] [CrossRef]
Stoica, P.; Larsson, E.G.; Gershman, A.B. The stochastic CRB for array processing: A textbook derivation. IEEE Signal Process. Lett. 2001, 8, 148–150. [Google Scholar] [CrossRef]

Figure 1. The scenario of off-grid DOA estimation.

Figure 2. Scheme of the proposed DSBL network for off-grid DOA estimation.

Figure 3. MSEs of different layer numbers. (a) Training; (b) validation.

Figure 4. Comparison of relative errors.

Figure 5. DOA estimation of two off-grid sources. (a) DOA estimates; (b) estimation error.

Figure 6. RMSEs of different methods. (a) SNR; (b) snapshot.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gong, Z.; Su, X.; Hu, P.; Liu, S.; Liu, Z. Deep Unfolding Sparse Bayesian Learning Network for Off-Grid DOA Estimation with Nested Array. Remote Sens. 2023, 15, 5320. https://doi.org/10.3390/rs15225320

AMA Style

Gong Z, Su X, Hu P, Liu S, Liu Z. Deep Unfolding Sparse Bayesian Learning Network for Off-Grid DOA Estimation with Nested Array. Remote Sensing. 2023; 15(22):5320. https://doi.org/10.3390/rs15225320

Chicago/Turabian Style

Gong, Zhenghui, Xiaolong Su, Panhe Hu, Shuowei Liu, and Zhen Liu. 2023. "Deep Unfolding Sparse Bayesian Learning Network for Off-Grid DOA Estimation with Nested Array" Remote Sensing 15, no. 22: 5320. https://doi.org/10.3390/rs15225320

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep Unfolding Sparse Bayesian Learning Network for Off-Grid DOA Estimation with Nested Array

Abstract

1. Introduction

2. Signal Model for Off-Grid DOA with NA

3. Proposed Algorithm

3.1. Transformation of the Array Output to the Real Domain

3.2. Deep Unfolding Sparse Bayesian Learning Network

3.3. Network Implementation of Proposed Method

4. Computer Simulation Experiments

4.1. Layer Number Determination

4.2. Comparison of Convergence Performance

4.3. Generalization Ability for Off-Grid DOA Estimation

4.4. RMSE Comparison of DOA Estimation

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI