Article

A New Method of Measurement Matrix Optimization for Compressed Sensing Based on Alternating Minimization

1 Institute of Electronic Countermeasure, National University of Defense Technology, Hefei 230000, China
2 Huayin Ordnance Test Center, Weinan 714000, China
* Author to whom correspondence should be addressed.
Mathematics 2021, 9(4), 329; https://doi.org/10.3390/math9040329
Submission received: 5 January 2021 / Revised: 25 January 2021 / Accepted: 26 January 2021 / Published: 7 February 2021

Abstract

In this paper, a new method of measurement matrix optimization for compressed sensing based on alternating minimization is introduced. The optimal measurement matrix is formulated in terms of minimizing the Frobenius norm of the difference between the Gram matrix of the sensing matrix and a target Gram matrix. The method considers the simultaneous minimization of the mutual coherence indexes, including the maximum mutual coherence $\mu_{\max}$, the t-averaged mutual coherence $\mu_{ave}$, and the global mutual coherence $\mu_{all}$, and solves the problem that minimizing a single index usually results in the deterioration of the others. Firstly, the threshold of the shrinkage function is raised above the Welch bound, and the relaxed Equiangular Tight Frame obtained by applying the new function to the Gram matrix is taken as the initial target Gram matrix; this reduces $\mu_{ave}$ and avoids the increase of $\mu_{\max}$ caused by the lower threshold in the known shrinkage function. Then a new target Gram matrix is obtained by sequentially applying rank reduction and eigenvalue averaging to the initial one, leading to a lower $\mu_{all}$. The analytical solutions of the measurement matrix are derived by SVD, and an alternating scheme is adopted in the method. Simulation results show that the proposed method simultaneously reduces the above three indexes and outperforms the known algorithms in terms of reconstruction performance.

1. Introduction

Compressed sensing (CS) [1] can sample sparse or compressible signals at a sub-Nyquist rate, which brings great convenience to data storage, transmission, and processing. By adopting reconstruction algorithms, the signal can be exactly recovered from the sampled data. As a powerful signal processing paradigm, CS is applied in different fields such as image encryption [2], wideband spectrum sensing [3], and wireless sensor network data processing [4].
The original signal $x \in \mathbb{R}^{N \times 1}$ is assumed to have a sparse representation in a known domain $\Psi \in \mathbb{R}^{N \times L}$ ($N \le L$) as $x = \Psi s$, where $\Psi$ is the dictionary matrix and $s$ is a $K$-sparse vector. The incomplete measurement $y \in \mathbb{R}^{M \times 1}$ is obtained through the linear model

$y = \Phi x = D s$    (1)

where $\Phi \in \mathbb{R}^{M \times N}$ ($M < N$) is called the measurement matrix and $D = \Phi \Psi$ is the sensing matrix.
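As a small illustration, the linear model above can be simulated directly; the following sketch (with illustrative dimensions borrowed from the simulations in Section 4) generates a $K$-sparse $s$, forms $x = \Psi s$, and verifies that $y = \Phi x = D s$.

```python
# A minimal sketch of the CS measurement model; the dimensions M, N, L, K
# are illustrative choices, not fixed by the model itself.
import numpy as np

rng = np.random.default_rng(0)
M, N, L, K = 28, 80, 120, 8

Psi = rng.standard_normal((N, L))            # dictionary matrix
Phi = rng.standard_normal((M, N))            # measurement matrix
D = Phi @ Psi                                # sensing matrix

s = np.zeros(L)                              # K-sparse coefficient vector
s[rng.choice(L, K, replace=False)] = rng.standard_normal(K)

x = Psi @ s                                  # original signal
y = Phi @ x                                  # incomplete measurement
assert np.allclose(y, D @ s)                 # y = Phi x = D s
```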
Some specific properties of $\Phi$ have great impact on the reconstruction performance. In [5,6], the spark and the restricted isometry property (RIP) are respectively proposed as sufficient conditions on $\Phi$ for recovery guarantees. However, computing the spark of a matrix has combinatorial complexity and certifying the RIP of a matrix requires combinatorial search; that is to say, these tasks are NP-hard and difficult to accomplish. To a large extent, the coherence between $\Phi$ and $\Psi$ reflects how well the above conditions are met. In fact, this coherence is equivalent to the mutual coherence of $D$. Since the mutual coherence can be easily manipulated to provide recovery guarantees, it is commonly used to measure the performance of $\Phi$. The frequently used mutual coherence indexes include the maximum mutual coherence $\mu_{\max}$ [7], the t-averaged mutual coherence $\mu_{ave}$ [8], and the global mutual coherence $\mu_{all}$ [9], which respectively represent the maximum, the average, and the sum of squares of the correlations between distinct pairs of columns in $D$.
The first attempt to consider the optimal design of $\Phi$ is given in [8]. The simulations carried out in [8] show that the optimized $\Phi$ leads to a smaller $\mu_{ave}$ and a substantially better CS reconstruction performance. Since then, the optimization of $\Phi$ has become an important issue in CS. Recent works seek the optimal $\Phi$, which performs well in reducing the mutual coherence, by minimizing the Frobenius norm of the difference between the Gram matrix $G = D^\top D$ and a target Gram matrix $G_t$. The main work focuses on designing $G_t$ and finding the best $\Phi$.
In [8], $G_t$ is obtained by shrinking the off-diagonal entries of $G$. The shrinkage technique reduces $\mu_{ave}$ but is time-consuming. Furthermore, $\mu_{\max}$ remains large, which ruins the worst-case guarantees of the reconstruction algorithms. In [10], a suitable point between the current solution and the one obtained using a new shrinkage function is chosen to design $G_t$. The approach is strongly competitive in $\mu_{ave}$ and $\mu_{\max}$; however, the optimal point is hard to determine, and an unsuitable point may seriously degrade the algorithm's performance. In [9], $G_t$ is obtained by averaging the eigenvalues of $G$. Simulation results show that $\mu_{all}$ is reduced effectively, but reductions in $\mu_{ave}$ and $\mu_{\max}$ cannot be guaranteed, which means $\mu_{ave}$ and $\mu_{\max}$ may retain large values. Duarte-Carvajalino and Sapiro [11] set $G_t$ to an identity matrix. Since $D$ is overcomplete and $G$ cannot be an identity matrix, simply minimizing the difference between $G$ and $G_t$ does not imply a low $\mu_{\max}$ [12]. In [13,14,15,16,17], $G_t$ is chosen from a set of relaxed Equiangular Tight Frames (ETFs) [18]. The set can be formulated as $S = \{ G_t \in \mathbb{R}^{L \times L} : G_t = G_t^\top,\ \mathrm{diag}(G_t) = \mathbf{1},\ \max_{i \neq j} |G_t(i,j)| \le \mu_{welch} \}$, where $\mu_{welch}$ denotes the Welch bound [19] and $G_t(i,j)$ denotes the $(i,j)$th entry of $G_t$. However, the maximum absolute value of the off-diagonal entries of $G$ is almost always greater than $\mu_{welch}$. In this case, the optimization usually yields a solution $D$ with a low $\mu_{ave}$ but a high $\mu_{\max}$. In summary, the target Gram matrices mentioned above focus on a single mutual coherence index and fail to take $\mu_{ave}$, $\mu_{\max}$, and $\mu_{all}$ into account simultaneously. When a certain index is targeted, the other indexes may not decrease significantly or may even increase. As a result, $\Phi$ is not 'good' enough and the reconstruction performance falls well below par.
After designing $G_t$, the next step is to find the 'best' $\Phi$ by driving $G$ toward $G_t$. In [8,9,10,11,17], $G$ is first obtained by applying SVD to $G_t$, and then a square root $\hat{D}$ of $G$ is built such that $\hat{D}^\top \hat{D} = G$. Finally, $\Phi$ is obtained as $\Phi = \hat{D} \Psi^{\dagger}$, where $(\cdot)^{\dagger}$ denotes the Moore-Penrose pseudoinverse. This kind of method is intuitive, but the generalized pseudoinverse poses problems of calculation accuracy and robustness [15]. In [14,15], a gradient algorithm and a quasi-Newton algorithm are respectively utilized to attain $\Phi$. Firstly, a cost function $F(\Phi)$ with $\Phi$ as the variable is constructed. Then the search direction is determined by the derivative of $F(\Phi)$. Finally, $\Phi$ is obtained with a fixed step size. However, choosing a suitable step size, which greatly influences the accuracy of the solution, requires a lot of comparison work. Moreover, the gradient and quasi-Newton algorithms do not converge until a certain number of iterations is accomplished, resulting in a high computational cost. In [11,16], the method for designing $\Phi$ shares the same concept as K-SVD [20], that is, updating the matrix row by row. Eigenvalue decomposition is required to find the square root of the maximum eigenvalue for each row, which significantly increases the computation. To solve this problem, Hong et al. [16] utilize the power method instead of eigenvalue decomposition. However, the eigenvalue obtained by the power method is the one with the largest absolute value; when that eigenvalue is negative, eigenvalue decomposition is still necessary.
The primary contributions of this paper are threefold:
  • A new target Gram matrix $G_t$ that targets $\mu_{ave}$, $\mu_{\max}$, and $\mu_{all}$ of $D$ simultaneously is designed. Firstly, a new shrinkage function whose threshold exceeds $\mu_{welch}$ is utilized to determine the initial target Gram matrix. Then $G_t$ is obtained by sequentially applying rank reduction and eigenvalue averaging to the initial matrix.
  • Analytical solutions of the measurement matrix $\Phi$ that minimize the difference between $G = \Psi^\top \Phi^\top \Phi \Psi$ and $G_t$ are derived by SVD.
  • Based on alternating minimization, an iterative method is proposed to optimize the measurement matrix. The simulation results confirm the effectiveness of the proposed method in decreasing the mutual coherence indexes and improving reconstruction performance.
The remainder of this paper is organized as follows. Some basic definitions related to mutual coherence indexes and frames are described in the next section. The main results are presented in Section 3, where the solutions to the $G_t$ design are characterized and a class of solutions for the optimal $\Phi$ is derived in detail. The procedure of our method and a discussion can also be found in Section 3. In Section 4, simulations are carried out to confirm the effectiveness of the proposed method. In the end, the conclusion is drawn.

2. Mutual Coherence Indexes and ETFs

2.1. Mutual Coherence Indexes

Rewrite $D = [d_1, d_2, \ldots, d_L] \in \mathbb{R}^{M \times L}$, where $d_i \in \mathbb{R}^{M \times 1}$ and $\|d_i\|_2 = 1$. Denote by $g_{ij} = d_i^\top d_j$ the entry at row $i$ and column $j$ of $G$, where $i, j = 1, 2, \ldots, L$. Here, we quote the definitions of the mutual coherence indexes as presented by Donoho [7], Elad [8], and Zhao [9].
Definition 1.
For a matrix $D$, the maximum mutual coherence $\mu_{\max}$ is defined as the largest absolute normalized inner product between distinct columns of $D$:

$\mu_{\max} = \max_{i \neq j} |d_i^\top d_j| = \max_{i \neq j} |g_{ij}|$    (2)
Definition 2.
For a matrix $D$, the t-averaged mutual coherence $\mu_{ave}$ is defined as the average of all absolute normalized inner products between distinct columns of $D$ that are not below $t$:

$\mu_{ave} = \dfrac{\sum_{i \neq j,\, |g_{ij}| \ge t} |g_{ij}|}{\sum_{i \neq j,\, |g_{ij}| \ge t} 1}$    (3)
Definition 3.
For a matrix $D$, the global mutual coherence $\mu_{all}$ is defined as the sum of squares of the normalized inner products between distinct columns of $D$:

$\mu_{all} = \sum_{i \neq j} g_{ij}^2$    (4)
As shown in [5], the original signal can be exactly reconstructed as long as $K < (1 + 1/\mu_{\max})/2$. This conclusion holds from a worst-case standpoint, which means that $\mu_{\max}$ does not do justice to the actual behavior of sparse representations. Therefore, Elad argues that an "average" measure of mutual coherence, namely $\mu_{ave}$, is more likely to describe the true behavior. Different from the previous two indexes, $\mu_{all}$ reflects the overall property of $D$.
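As a concrete illustration, for the dimensions used later in Section 4 ($M = 28$, $L = 120$), even an ideal $D$ attaining the Welch bound $\mu_{\max} = 0.1662$ only guarantees exact reconstruction for $K < (1 + 1/0.1662)/2 \approx 3.5$, i.e., $K \le 3$, which shows how pessimistic the worst-case guarantee can be.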
In fact, the purpose of reducing the mutual coherence indexes of D is to attain G that meets the following requirements: (1) The maximum absolute value of off-diagonal entries in G is sufficiently small; (2) The number of off-diagonal entries with large absolute value is minimized; (3) The average of off-diagonal entries with large absolute value is as small as possible. However, when a certain mutual coherence index is targeted solely, we cannot guarantee that the obtained G will fully meet the requirements. Therefore, the decrease of a certain index does not always mean better Φ and improved reconstruction performance. When the three indexes are reduced simultaneously, the requirements are better satisfied and better performance is obtained.
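The three indexes are straightforward to compute from a column-normalized $D$; the following sketch (where the threshold argument t is the user-chosen parameter of Definition 2) is one possible implementation.

```python
# A sketch of the mutual coherence indexes of Definitions 1-3.
import numpy as np

def coherence_indexes(D, t):
    Dn = D / np.linalg.norm(D, axis=0)                  # unit-norm columns
    G = Dn.T @ Dn                                       # Gram matrix
    off = np.abs(G[~np.eye(G.shape[0], dtype=bool)])    # |g_ij|, i != j
    mu_max = off.max()                                  # Definition 1
    above = off[off >= t]
    mu_ave = above.mean() if above.size else 0.0        # Definition 2
    mu_all = np.sum(off ** 2)                           # Definition 3
    return mu_max, mu_ave, mu_all
```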

2.2. ETFs

It is shown in [19] that the $\mu_{\max}$ of $D \in \mathbb{R}^{M \times L}$ is lower bounded by

$\mu_{\max} \ge \mu_{welch} = \sqrt{\dfrac{L - M}{M(L - 1)}}$
The bound is achievable for ETF. Here, we recall the definition of ETF [18].
Definition 4.
Let $F$ be an $M \times L$ matrix whose columns are $f_1, f_2, \ldots, f_L$. The matrix $F$ is called an equiangular tight frame if it satisfies three conditions:
(1) Each column has unit norm: $\|f_i\|_2 = 1$ for $i = 1, 2, \ldots, L$.
(2) The columns are equiangular: for some nonnegative $\theta$, $|f_i^\top f_j| = \theta$ for $i, j = 1, 2, \ldots, L$ and $i \neq j$.
(3) The columns form a tight frame: $F F^\top = (L/M) I_M$, where $I_M$ is the $M \times M$ identity matrix.
Sustik et al. [18] show that a real $M \times L$ ($1 < M < L - 1$) ETF can exist only if $L \le \min\{ M(M+1)/2,\ (L-M)(L-M+1)/2 \}$ holds. Furthermore, $\sqrt{M(L-1)/(L-M)}$ and $\sqrt{(L-M)(L-1)/M}$ must be odd integers when $L \neq 2M$, while $M$ must be odd and $2M - 1$ must be the sum of two squares when $L = 2M$. Fickus et al. [21] survey some known constructions of ETFs and tabulate their existence for sufficiently small dimensions. The above studies show that $M$ and $L$ must meet exacting requirements for an ETF to be available for $D$. These requirements are rarely met in practice, which means the maximum absolute value of the off-diagonal entries of $G$ is usually significantly larger than $\mu_{welch}$.
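The Welch bound itself is a one-line computation; for the dimensions $M = 28$ and $L = 120$ used in Section 4, the sketch below returns approximately 0.1662, matching the last column of Tables 1 and 2.

```python
# A sketch of the Welch bound computation.
import numpy as np

def welch_bound(M, L):
    return np.sqrt((L - M) / (M * (L - 1)))

print(welch_bound(28, 120))   # ~0.1662
```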

3. The Proposed Method

The off-diagonal entries of $G$ are the inner products between distinct columns of $D$. Reducing those entries is likely to lead to lower mutual coherence indexes and better performance. The most straightforward approach is to replace large off-diagonal values with small ones. However, with this approach it is impossible to solve for $\Phi$ from such a $G$, because the ranks of $\Psi^\top \Phi^\top \Phi \Psi$ and $G$ would no longer agree. Therefore, a feasible approach is to minimize the difference between $G$ and $G_t$, which can be formulated as

$\min \| G_t - G \|_F^2$    (5)
where $G = \Psi^\top \Phi^\top \Phi \Psi$. This problem can be solved by an alternating minimization strategy [14,16], which iteratively minimizes (5) to find the desired $\Phi$. The idea is to update $G_t$ and $\Phi$ alternately and to repeat this procedure until a stopping criterion is reached. In this section, we first design $G_t$ and then derive the analytical solutions for $\Phi$. Finally, an iterative method is proposed to optimize the measurement matrix based on alternating minimization.

3.1. The Design of G t

It can be seen from (5) that $G_t$ plays an important role in measurement matrix optimization. In recent works, $G_t$ is frequently set to a relaxed ETF matrix, which is obtained by applying the following shrinkage function

$G_t(i,j) = \begin{cases} g_{ij}, & |g_{ij}| \le \varsigma \\ \mathrm{sign}(g_{ij})\,\varsigma, & \text{otherwise} \end{cases}$    (6)

where $\varsigma = \mu_{welch}$, for $i, j = 1, 2, \ldots, L$ and $i \neq j$. Such a scheme for designing $G_t$ guarantees that the off-diagonal entries of $G$ with large values are strongly constrained, which means a lower $\mu_{ave}$ and $\mu_{\max}$.
Recall from Section 2.2 that the Welch bound is not achievable for $G$ in most cases. As shown in [14,16], different $\varsigma$ yield different results, and $\mu_{welch}$ is not the optimal value. Li et al. [22] found that a smaller $\mu_{\max}$ is attainable when $\varsigma$ is slightly larger than $\mu_{welch}$. Inspired by [22], we propose an improved shrinkage function that divides the entries of $G$ into three segments through two thresholds. One of the thresholds is $\mu_{welch}$ and the other is larger than $\mu_{welch}$. The function is as follows:
$G_t(i,j) = \begin{cases} \mathrm{sign}(g_{ij})\,Thr, & |g_{ij}| \ge Thr \\ \mathrm{sign}(g_{ij})\,\mu_{welch}, & \mu_{welch} \le |g_{ij}| < Thr \\ g_{ij}, & |g_{ij}| < \mu_{welch} \end{cases}$    (7)

where $Thr = \mu_{welch} + c$ and $0 < c < \mu_{welch}$. As can be seen from Equation (7), the maximum absolute value of the off-diagonal entries of $G_t$ is raised from $\mu_{welch}$ to $Thr$. According to the previous analysis, the new function is likely to further reduce $\mu_{\max}$ while maintaining the advantage of Equation (6) with respect to $\mu_{ave}$.
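A sketch of the three-segment shrinkage of Equation (7) is given below; the constant c is the user-chosen parameter discussed in Section 4.1.

```python
# A sketch of the improved shrinkage function, Equation (7): large entries
# are clipped at Thr = mu_welch + c, mid-range entries are pulled down to
# mu_welch, and small entries are kept unchanged.
import numpy as np

def shrink(G, mu_welch, c):
    Thr = mu_welch + c
    a = np.abs(G)
    Gt = np.where(a >= Thr, np.sign(G) * Thr,
         np.where(a >= mu_welch, np.sign(G) * mu_welch, G))
    np.fill_diagonal(Gt, 1.0)   # restore the unit diagonal clipped above
    return Gt
```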
After shrinkage, $G_t$ generally becomes full rank [8], that is, $\mathrm{Rank}(G_t) = L$. However, the rank of $G$ is identically equal to $M$. Thus, we consider mending this by forcing a rank of $M$. A new target Gram matrix, denoted $G_{t\_M}$, is obtained by solving

$\min \| G_t - G_{t\_M} \|_F^2, \quad \text{s.t.}\ \mathrm{Rank}(G_{t\_M}) = M$    (8)
The solutions to this problem are given by Theorem 1 below.
Theorem 1.
Let $G_t \in \mathbb{R}^{L \times L}$ be the matrix obtained by applying the shrinkage operation of Equation (7) to $G$, and let $G_t = P \Lambda P^\top$ be the eigendecomposition of $G_t$, where $P$ is orthonormal with dimension $L$ and $\Lambda = \mathrm{diag}(\lambda_1, \lambda_2, \ldots, \lambda_L)$ with $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_L$. The solutions of the minimization problem defined by (8) are characterized by

$G_{t\_M} = P A \Lambda P^\top$    (9)

where $A = \begin{bmatrix} I_M & 0 \\ 0 & 0 \end{bmatrix} \in \mathbb{R}^{L \times L}$.
Proof. 
Denote $G_{t\_M} = X^\top X$, where $X \in \mathbb{R}^{M \times L}$. Let $X = U_X [\Sigma_X \ 0] V_X^\top$ be an SVD of $X$, where $U_X \in \mathbb{R}^{M \times M}$ and $V_X \in \mathbb{R}^{L \times L}$ are unitary. Then $G_{t\_M}$ can be rewritten as

$G_{t\_M} = V_X \begin{bmatrix} \Sigma_X^2 & 0 \\ 0 & 0 \end{bmatrix} V_X^\top$    (10)
Denote $f = \| G_t - G_{t\_M} \|_F^2$. By substituting $G_{t\_M} = X^\top X$, $f$ can be rewritten as a function of the matrix $X$. Let $\partial f / \partial X$ be the derivative of $f$ with respect to $X$. The optimal $X$ should satisfy $\partial f / \partial X = 0$. Equivalently, we have

$X X^\top X = X G_t$    (11)
It then follows from $X = U_X [\Sigma_X \ 0] V_X^\top$ that

$\begin{bmatrix} \Sigma_X^2 & 0 \\ 0 & 0 \end{bmatrix} = A V_X^\top G_t V_X$    (12)
Substituting Equation (12) into Equation (10), we obtain

$G_{t\_M} = V_X A V_X^\top G_t$    (13)
It then follows from the unitary invariance of the Frobenius norm together with Equation (13) that

$f = \| (I_L - V_X A V_X^\top) G_t \|_F^2$    (14)
With a few manipulations, we conclude that solving (8) is equivalent to solving

$\max\ \mathrm{tr}( G_t V_X A V_X^\top G_t )$    (15)
where $\mathrm{tr}(\cdot)$ denotes the matrix trace operation. Noting that $A = A^\top A$ and $G_t = P \Lambda P^\top$, the problem in (15) is equivalent to

$\max\ \| A V_X^\top P \Lambda \|_F^2$    (16)
Denote $B = V_X^\top P$ and rewrite $B$ as $B = [b_1, b_2, \ldots, b_L]$, where $b_i \in \mathbb{R}^{L \times 1}$ for $i = 1, 2, \ldots, L$. Rewrite $A$ as $A = [e_1, e_2, \ldots, e_M, 0, 0, \ldots, 0]$, where $e_j \in \mathbb{R}^{L \times 1}$ denotes the unit vector whose $j$th entry equals 1, for $j = 1, 2, \ldots, M$. Then it is easy to obtain that

$A V_X^\top P \Lambda = \begin{bmatrix} \lambda_1 e_1^\top b_1 & \lambda_2 e_1^\top b_2 & \cdots & \lambda_L e_1^\top b_L \\ \lambda_1 e_2^\top b_1 & \lambda_2 e_2^\top b_2 & \cdots & \lambda_L e_2^\top b_L \\ \vdots & \vdots & & \vdots \\ \lambda_1 e_M^\top b_1 & \lambda_2 e_M^\top b_2 & \cdots & \lambda_L e_M^\top b_L \\ 0 & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 0 \end{bmatrix}$    (17)
Let $b_{ji} = e_j^\top b_i$ be the $j$th entry of $b_i$. It can be shown with some manipulations that the problem in (16) is equivalent to

$\max\ \sum_{i=1}^{L} \sum_{j=1}^{M} \lambda_i^2 b_{ji}^2$    (18)
Since $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_L$ and $\sum_{i=1}^{L} \sum_{j=1}^{M} b_{ji}^2 = M$, it is straightforward that $\sum_{i=1}^{L} \sum_{j=1}^{M} \lambda_i^2 b_{ji}^2$ reaches the maximum value $\lambda_1^2 + \lambda_2^2 + \cdots + \lambda_M^2$ only if $\sum_{j=1}^{M} b_{ji}^2 = 1$ for $i = 1, 2, \ldots, M$. In this case, $\sum_{j=1}^{M} b_{ji}^2 = 0$ and $\sum_{j=1}^{M} b_{ij}^2 = 0$ hold for $i = M+1, M+2, \ldots, L$. Rewrite $B$ as $B = \begin{bmatrix} B_1 & B_2 \\ B_3 & B_4 \end{bmatrix}$, where $B_1 \in \mathbb{R}^{M \times M}$. Accordingly, all of the entries of both $B_2$ and $B_3$ are zero. Noting that $B$ is unitary, it can be shown that $B_1$ and $B_4$ are unitary. Hence, we have
$G_{t\_M} = V_X A V_X^\top P \Lambda P^\top = P B^\top A B \Lambda P^\top = P A \Lambda P^\top$    (19)

As can be seen, the rank of $G_{t\_M}$ is equal to $M$. The proof is then completed. □
After the rank reduction, the rank of $G_{t\_M}$ is equal to that of $G$. Additionally, $G_{t\_M}$ is closest to $G_t$ in terms of the Frobenius norm. Inspired by [9], we further reduce the sum of squares of all off-diagonal values of $G_{t\_M}$, denoted $\hat{\mu}_{all}$, by eigenvalue averaging. When the difference between $G$ and $G_{t\_M}$ is minimized, a smaller $\hat{\mu}_{all}$ is more likely to lead to a smaller $\mu_{all}$. $\hat{\mu}_{all}$ can be formulated as

$\hat{\mu}_{all} = \sum_{i,j=1}^{L} \hat{g}_{ij}^2 - \sum_{i=1}^{L} \hat{g}_{ii}^2$
where $\hat{g}_{ij}$ denotes the $(i,j)$th entry of $G_{t\_M}$ and $\hat{g}_{ii} = 1$ holds for $i = 1, 2, \ldots, L$.
Noting that $\sum_{i,j=1}^{L} \hat{g}_{ij}^2 = \| G_{t\_M} \|_F^2$ and $G_{t\_M} = P \hat{\Lambda} P^\top$, where $\hat{\Lambda} = \mathrm{diag}(\lambda_1, \lambda_2, \ldots, \lambda_M, 0, \ldots, 0)$, $\hat{\mu}_{all}$ can be rewritten as

$\hat{\mu}_{all} = \sum_{i=1}^{M} \lambda_i^2 - \sum_{i=1}^{L} \hat{g}_{ii}^2$    (20)
Assuming that $\sum_{i=1}^{M} \lambda_i$ is invariant, it follows from the Cauchy-Bunyakovsky-Schwarz inequality that $\sum_{i=1}^{M} \lambda_i^2$ takes its minimum value only if $\lambda_i = \frac{1}{M} \sum_{i=1}^{M} \lambda_i$ for $i = 1, 2, \ldots, M$. Let $\hat{\lambda} = \frac{1}{M} \sum_{i=1}^{M} \lambda_i$ and redefine $\hat{\Lambda} = \mathrm{diag}(\hat{\lambda}, \ldots, \hat{\lambda}, 0, \ldots, 0)$ with $M$ copies of $\hat{\lambda}$. A new target Gram matrix, denoted $G_{t\_opt}$, with $M$ equal non-zero eigenvalues is then given by

$G_{t\_opt} = P \hat{\Lambda} P^\top$    (21)
Recall that $G_{t\_M}$ is closest to $G_t$ in terms of the Frobenius norm, which means that $G_{t\_M}$ is highly competitive in $\mu_{ave}$ and $\mu_{\max}$. Furthermore, as a variant of $G_{t\_M}$, $G_{t\_opt}$ reduces the sum of squares of all off-diagonal values of $G_{t\_M}$, leading to better performance in minimizing $\mu_{all}$. Therefore, $G_{t\_opt}$ is more likely to be an ideal target Gram matrix that leads to better $\mu_{ave}$, $\mu_{\max}$, and $\mu_{all}$ simultaneously. A sketch of this two-step construction is given below.
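The following sketch realizes the two-step target design: the eigendecomposition of $G_t$ is truncated to its $M$ largest eigenvalues as in Theorem 1, and the retained eigenvalues are then replaced by their average as in Equation (21).

```python
# A sketch of the target Gram matrix construction: rank reduction (Theorem 1)
# followed by eigenvalue averaging (Equation (21)).
import numpy as np

def target_gram(Gt, M):
    lam, P = np.linalg.eigh(Gt)        # eigenvalues in ascending order
    lam, P = lam[::-1], P[:, ::-1]     # reorder to descending
    lam_hat = lam[:M].mean()           # average of the top-M eigenvalues
    lam_new = np.zeros_like(lam)
    lam_new[:M] = lam_hat              # M equal non-zero eigenvalues
    return P @ np.diag(lam_new) @ P.T, P, lam_hat
```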

3.2. The Analytical Solutions of Φ

After obtaining the target Gram matrix $G_{t\_opt}$, the next step of the optimization is to find the best $\Phi$. To handle the problem, we seek the optimal solution by minimizing the difference between $\Psi^\top \Phi^\top \Phi \Psi$ and $G_{t\_opt}$:

$\min \| G_{t\_opt} - \Psi^\top \Phi^\top \Phi \Psi \|_F^2$    (22)
Let $D = U_D [\Sigma_D \ 0] V_D^\top$ be an SVD of $D$, where $U_D \in \mathbb{R}^{M \times M}$ and $V_D \in \mathbb{R}^{L \times L}$ are unitary. Similarly, let $\Psi = U_\Psi [\Sigma_\Psi \ 0] V_\Psi^\top$ be an SVD of $\Psi$, where $U_\Psi \in \mathbb{R}^{N \times N}$ and $V_\Psi \in \mathbb{R}^{L \times L}$ are unitary. The solutions to this problem are given by Theorem 2 below.
Theorem 2.
Let $G_{t\_opt}$ be the matrix given by Equation (21) and $\Lambda_M \in \mathbb{R}^{M \times M}$ be the $M$th principal submatrix of $\hat{\Lambda}$. Then the solutions of the minimization problem defined by (22) are characterized by

$\Phi_{opt} = U_Z [\Lambda_M^{1/2} \ 0] P^\top V_\Psi \begin{bmatrix} \Sigma_\Psi^{-1} \\ 0 \end{bmatrix} U_\Psi^\top$    (23)

where $U_Z \in \mathbb{R}^{M \times M}$ is an arbitrary unitary matrix.
Proof. 
Assuming that the diagonal entries of $\Sigma_D$ and $\Sigma_\Psi$ are non-zero, $\Phi$ can be written as

$\Phi = U_D [\Sigma_D \ 0] V_D^\top V_\Psi \begin{bmatrix} \Sigma_\Psi^{-1} \\ 0 \end{bmatrix} U_\Psi^\top$    (24)
By substituting Equation (24) for $\Phi$ in (22), it can be shown with some manipulations that the solutions of the problem in (22) are equivalent to the solutions of

$\min \left\| V_D^\top P \hat{\Lambda} P^\top V_D - \begin{bmatrix} \Sigma_D^2 & 0 \\ 0 & 0 \end{bmatrix} \right\|_F^2$    (25)
Let $Z = V_D^\top P \hat{\Lambda} P^\top V_D$ and let $z_i$ be the $i$th diagonal entry of $Z$. Denote $\Lambda_Z = \mathrm{diag}(z_1, z_2, \ldots, z_M)$. With further manipulations, we simplify (25) to

$\min\ \| G_{t\_opt} \|_F^2 + \| \Lambda_Z - \Sigma_D^2 \|_F^2 - \| \Lambda_Z \|_F^2$    (26)
Obviously, the minima are achievable only if $\Lambda_Z = \Sigma_D^2$ holds and $\| \Lambda_Z \|_F^2$ takes its maximum value. Let $U = V_D^\top P$ and let $u_{ij}$ be the $(i,j)$th entry of $U$, where $i, j = 1, 2, \ldots, L$. Noting that the top $M$ diagonal entries of $\hat{\Lambda}$ are all equal to $\hat{\lambda}$, we have

$z_i = \hat{\lambda} \left( u_{i1}^2 + u_{i2}^2 + \cdots + u_{iM}^2 \right)$    (27)
It is worth noting that $\| \Lambda_Z \|_F^2 = \sum_{i=1}^{M} z_i^2$ and $\sum_{i=1}^{L} z_i = M \hat{\lambda}$, where $0 \le z_i \le \hat{\lambda}$. Hence, it is clear that the maximum of $\sum_{i=1}^{M} z_i^2$ is reached when $z_i$ takes the maximum value $\hat{\lambda}$ for $i = 1, 2, \ldots, M$. Rewrite $U$ as $U = \begin{bmatrix} U_1 & U_2 \\ U_3 & U_4 \end{bmatrix}$, where $U_1 \in \mathbb{R}^{M \times M}$. Since $z_i = \hat{\lambda}$ holds for $i = 1, 2, \ldots, M$, we have $u_{i1}^2 + u_{i2}^2 + \cdots + u_{iM}^2 = 1$, and accordingly $U_2 = 0$ and $U_3 = 0$. As $U$ is a unitary matrix, it is easy to verify that $U_1$ and $U_4$ are both unitary matrices. It then follows that
$V_D = P \begin{bmatrix} U_1 & 0 \\ 0 & U_4 \end{bmatrix}^\top$    (28)
Substituting Equation (28) into Equation (24), the optimal solution is obtained as

$\Phi_{opt} = U_Z [\Lambda_M^{1/2} \ 0] P^\top V_\Psi \begin{bmatrix} \Sigma_\Psi^{-1} \\ 0 \end{bmatrix} U_\Psi^\top$    (29)
The proof is then completed. □
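A direct sketch of the closed-form update of Equation (23) is shown below; $P$ and $\hat{\lambda}$ come from the target Gram matrix construction above, and $U_Z$ is taken as the identity for simplicity (any unitary matrix would do, per Theorem 2).

```python
# A sketch of Theorem 2, Equation (23):
# Phi_opt = U_Z [Lambda_M^(1/2) 0] P^T V_Psi [Sigma_Psi^-1; 0] U_Psi^T.
import numpy as np

def optimal_phi(Psi, P, lam_hat, M):
    N, L = Psi.shape
    U, sig, Vt = np.linalg.svd(Psi)        # Psi = U [diag(sig) 0] Vt
    sqrt_block = np.zeros((M, L))          # [Lambda_M^(1/2)  0]
    sqrt_block[:M, :M] = np.sqrt(lam_hat) * np.eye(M)
    inv_block = np.zeros((L, N))           # [Sigma_Psi^-1; 0]
    inv_block[:N, :N] = np.diag(1.0 / sig)
    return sqrt_block @ P.T @ Vt.T @ inv_block @ U.T   # U_Z = I_M
```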

3.3. Comments

According to Section 3.1 and Section 3.2, the procedure for measurement matrix optimization is summarized in Algorithm 1.
Algorithm 1. The proposed optimization method.
Input: Dictionary matrix $\Psi \in \mathbb{R}^{N \times L}$, which has the SVD form $\Psi = U_\Psi [\Sigma_\Psi \ 0] V_\Psi^\top$; number of iterations $Iter$; constant $c$; Welch bound $\mu_{welch}$.
Output: Measurement matrix $\Phi_{opt}$.
Initialization: Initialize $\Phi_0 \in \mathbb{R}^{M \times N}$ to a random matrix and $U_Z \in \mathbb{R}^{M \times M}$ to a unitary matrix.
For $l = 1$ to $Iter$ do
  1. Compute the sensing matrix $D = \Phi_{l-1} \Psi$ and normalize the columns of $D$.
  2. Compute the Gram matrix $G = D^\top D$.
  3. Shrink $G$ to obtain $G_t$ via Equation (7): $G_t(i,j) = \mathrm{sign}(g_{ij})(\mu_{welch} + c)$ if $|g_{ij}| \ge \mu_{welch} + c$; $G_t(i,j) = \mathrm{sign}(g_{ij})\,\mu_{welch}$ if $\mu_{welch} \le |g_{ij}| < \mu_{welch} + c$; $G_t(i,j) = g_{ij}$ otherwise.
  4. Apply eigenvalue decomposition to $G_t$ to obtain $G_t = P \Lambda P^\top$.
  5. Compute the average of the top $M$ diagonal entries of $\Lambda$, denoted $\hat{\lambda}$.
  6. Construct $\Lambda_M \in \mathbb{R}^{M \times M}$ as $\Lambda_M = \mathrm{diag}(\hat{\lambda}, \ldots, \hat{\lambda})$.
  7. Update $\Phi$ by $\Phi_l = U_Z [\Lambda_M^{1/2} \ 0] P^\top V_\Psi [\Sigma_\Psi^{-1}; 0] U_\Psi^\top$.
end
return $\Phi_{Iter}$
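Assembling the pieces, the following sketch is one possible end-to-end implementation of Algorithm 1; the default parameter values are illustrative.

```python
# A compact sketch of Algorithm 1 with the helper steps inlined.
import numpy as np

def optimize_phi(Psi, M, Iter=100, c=0.01, seed=0):
    rng = np.random.default_rng(seed)
    N, L = Psi.shape
    mu_w = np.sqrt((L - M) / (M * (L - 1)))        # Welch bound
    Thr = mu_w + c
    U, sig, Vt = np.linalg.svd(Psi)                # SVD of the dictionary
    inv_block = np.zeros((L, N))
    inv_block[:N, :N] = np.diag(1.0 / sig)         # [Sigma_Psi^-1; 0]
    Phi = rng.standard_normal((M, N))              # random initialization
    for _ in range(Iter):
        D = Phi @ Psi
        D /= np.linalg.norm(D, axis=0)             # step 1: normalize columns
        G = D.T @ D                                # step 2: Gram matrix
        a = np.abs(G)                              # step 3: shrinkage, Eq. (7)
        Gt = np.where(a >= Thr, np.sign(G) * Thr,
             np.where(a >= mu_w, np.sign(G) * mu_w, G))
        np.fill_diagonal(Gt, 1.0)
        lam, P = np.linalg.eigh(Gt)                # step 4: eigendecomposition
        lam, P = lam[::-1], P[:, ::-1]
        lam_hat = lam[:M].mean()                   # step 5: eigenvalue average
        sqrt_block = np.zeros((M, L))              # step 6: [Lambda_M^(1/2) 0]
        sqrt_block[:M, :M] = np.sqrt(lam_hat) * np.eye(M)
        Phi = sqrt_block @ P.T @ Vt.T @ inv_block @ U.T   # step 7, U_Z = I
    return Phi
```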
Noting that $G_t$ plays an important role in measurement matrix optimization, Algorithm 1 takes $\mu_{ave}$, $\mu_{\max}$, and $\mu_{all}$ into consideration simultaneously when designing $G_t$. By minimizing (5), the Gram matrix becomes closest to $G_t$ in terms of the Frobenius norm, thereby retaining the advantage of $G_t$ in reducing the mutual coherence indexes. Therefore, Algorithm 1 is effective in reducing $\mu_{ave}$, $\mu_{\max}$, and $\mu_{all}$.
In the shrinkage function, different thresholds yield different results. Inspired by [22], we propose the shrinkage function shown in (7), which has the new threshold $\mu_{welch} + c$. We have not derived the optimal value of $c$ in theory, but setting $c$ to a proper value still leads to good results.
After averaging the eigenvalues of $G_{t\_M}$, the first term on the right-hand side of (20) is minimized. However, the diagonal entries of $G_{t\_M}$ change accordingly. Hence, we cannot ensure that $\hat{\mu}_{all}$ reaches its minimum; that is to say, $G_{t\_opt}$ may not be the optimal solution in terms of $\mu_{all}$. Fortunately, we find that the change in $\sum_{i=1}^{M} \lambda_i^2$ is much greater than that in $\sum_{i=1}^{L} \hat{g}_{ii}^2$ in (20), which means our approach is effective in reducing $\mu_{all}$.
The proposed algorithm is iterative. The main complexity of Algorithm 1 in each iteration is located at steps 1, 2, 4, and 7, which require $O(MNL)$, $O(ML^2)$, $O(L^3)$, and $O(L^3)$ flops respectively. Hence, the complexity of Algorithm 1 is approximately $O(Iter \cdot L^3)$. Since the complexity of the similar algorithms in [8,16,17], which apply eigenvalue decomposition or SVD, is no less than $O(Iter \cdot L^3)$, the proposed algorithm does not increase the complexity significantly.

4. Simulation Results and Discussion

In this section, we first conduct simulations to predetermine a suitable $c$. Then, we examine the mutual coherence indexes and the reconstruction performance of the proposed method and compare them with the well-established similar algorithms given in [8,16,17] by presenting empirical results. Last, we verify the effectiveness of our method with various measurement matrices and dictionary matrices. The iteration number $Iter$ is set to 100 and $t$ is set to $\mu_{welch}$. For a given dictionary matrix $\Psi \in \mathbb{R}^{80 \times 120}$, $x \in \mathbb{R}^{120 \times 1}$ has a sparse representation $x = \Psi s$, where $s$ is $K$-sparse and each non-zero entry is randomly positioned and drawn i.i.d. from a zero-mean, unit-variance Gaussian distribution. The Orthogonal Matching Pursuit (OMP) [23] algorithm is employed for signal reconstruction. Denote by $\varepsilon = \| x_e - x \|_2 / \| x \|_2$ the reconstruction error, where $x_e$ is the reconstructed signal. The reconstruction is identified as a success, called exact reconstruction, provided $\varepsilon \le 10^{-6}$. Denote by $P_{suc}$ the percentage of successful reconstructions. In Section 4.1, Section 4.2 and Section 4.3, $\Phi_0$ and $\Psi$ are both Gaussian random matrices. A sketch of this evaluation protocol follows.
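The following sketch outlines the evaluation protocol, reusing the optimize_phi sketch from Section 3.3; the basic OMP below is a plain implementation for illustration, not the exact routine of [23].

```python
# A sketch of the Section 4 protocol: random K-sparse ensembles are measured
# and reconstructed with a basic OMP; P_suc is the fraction of trials with
# relative error below 1e-6.
import numpy as np

def omp(D, y, K):
    Dn = D / np.linalg.norm(D, axis=0)                     # unit-norm atoms
    r, idx = y.copy(), []
    for _ in range(K):
        idx.append(int(np.argmax(np.abs(Dn.T @ r))))       # best-matching atom
        s_hat, *_ = np.linalg.lstsq(D[:, idx], y, rcond=None)
        r = y - D[:, idx] @ s_hat                          # update residual
    s = np.zeros(D.shape[1])
    s[idx] = s_hat
    return s

rng = np.random.default_rng(1)
N, L, M, K, trials = 80, 120, 28, 8, 1000
Psi = rng.standard_normal((N, L))
Phi = optimize_phi(Psi, M)                 # sketch from Section 3.3
success = 0
for _ in range(trials):
    s = np.zeros(L)
    s[rng.choice(L, K, replace=False)] = rng.standard_normal(K)
    x = Psi @ s
    x_e = Psi @ omp(Phi @ Psi, Phi @ x, K)
    success += np.linalg.norm(x_e - x) / np.linalg.norm(x) <= 1e-6
print("P_suc =", success / trials)
```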

4.1. The Choice of c

Since an analytical solution for $c$ is extremely difficult, here we conduct a series of simulations to find a suitable $c$. Figure 1 illustrates the change tendency of the mutual coherence indexes and $P_{suc}$ with the argument $c$. We fix the number of rows $M$ to 28 and the sparsity to 8, and vary $c$ from 0 to 0.16. The experiment is performed for 1000 random sparse ensembles and the results are recorded.
When $c = 0$, the shrinkage function in (7) is the same as (6). As can be seen from the graphs, when $c$ increases, $\mu_{ave}$ increases, $\mu_{\max}$ and $\mu_{all}$ decrease first and then increase, and $P_{suc}$ increases first and then decreases. $\mu_{\max}$ and $\mu_{all}$ reach their minima at $c = 0.02$ and $c = 0.03$ respectively. It is worth noting that an appropriate increase of $c$ leads to a decrease of $\mu_{\max}$ and $\mu_{all}$ but an increase of $\mu_{ave}$. When $c = 0.01$, better $\mu_{\max}$ and $\mu_{all}$ are obtained and the loss in $\mu_{ave}$ is tolerable. Moreover, $P_{suc}$ reaches its maximum. Therefore, 0.01 is a moderate value for $c$, and $c$ is set to 0.01 in the simulations in Section 4.2, Section 4.3 and Section 4.4.

4.2. Comparing the Mutual Coherence Indexes

This section presents a series of simulations comparing our method with the algorithms given in [8,16,17] on the three mutual coherence indexes of $D = \Phi_{opt} \Psi$, where $\Phi_{opt}$ is the optimized measurement matrix. For convenience, the methods are denoted Propose, Elad, Hong, and Entezari. The down-scaling factor for Elad is set to 0.95. The inner iteration number for Hong is set to 2, which means K-SVD is applied twice in every update of $\Phi$. The point for updating $G_t$ in Entezari is set to 0.5.
Figure 2 illustrates the change tendency of the mutual coherence indexes with the iteration number for $M = 28$. As can be seen from the figure, the indexes corresponding to the different algorithms all change monotonically with the iteration number. For the convergence of $\mu_{\max}$ and $\mu_{ave}$, the number of iterations required by our method is almost equal to that of Hong and significantly less than that of Elad. For the convergence of $\mu_{all}$, the number of iterations required by our method is equivalent to that of Entezari and significantly less than that of Hong and Elad.
Figure 3 presents the histogram of the absolute off-diagonal values of $(\Phi_{opt} \Psi)^\top (\Phi_{opt} \Psi)$ for $M = 28$. It is seen from the figure that Elad and Entezari have long tails, showing that the number of off-diagonal values exceeding 0.34 is relatively large. The tail of Hong is shorter than that of Elad and Entezari and reaches a maximum of 0.34. Compared with Hong, our method has a shorter tail, which reaches a maximum of 0.32, and has more off-diagonal values below $\mu_{welch}$ (0.1662).
Table 1, Table 2 and Table 3 respectively present $\mu_{\max}$, $\mu_{ave}$, and $\mu_{all}$ obtained by Elad, Hong, Entezari, and our method versus the measurement dimension $M$. From Table 1, it can be observed that the $\mu_{\max}$ of our method is significantly less than that of Elad and Entezari, and less than that of Hong, which means our method is effective in reducing the maximum mutual coherence. In Table 2, we see that the $\mu_{ave}$ of our method is significantly less than that of Elad and Entezari, with an advantage of more than 0.03; it is worth noting that the $\mu_{ave}$ of our method is slightly larger than that of Hong. In Table 3, the $\mu_{all}$ of our method is significantly less than that of Elad and Hong, with advantages of more than 70 and 20 respectively. On the other hand, the $\mu_{all}$ of our method is almost the same as that of Entezari.
In conclusion, while effectively reducing $\mu_{\max}$ and $\mu_{all}$, our method maintains a small $\mu_{ave}$ at the same time. Additionally, the number of iterations required for the convergence of each index of our method is significantly less than that of Elad. Therefore, from the viewpoint of the mutual coherence indexes, the measurement matrix obtained by our method has better properties than those of the other three methods. This coincides with the theoretical results obtained in Section 3.

4.3. Comparing the Reconstruction Performance

Case 1.
Comparison of $P_{suc}$ in the noiseless case.
In this case, we conduct two separate CS experiments: first fixing $K = 8$ and varying $M$ from 12 to 44, and second fixing $M = 28$ and varying $K$ from 4 to 20. Each experiment is performed for 1000 random sparse ensembles and the number of exact reconstructions is recorded.
Figure 4 and Figure 5 reveal that the $P_{suc}$ of our method is the highest, which indicates its superiority over the other three methods.
Case 2.
Comparison of $\varepsilon$ in the noisy case.
To show the robustness of the proposed method in noisy cases, we consider the noisy model $y = \Phi x + v$, where $v$ is a vector of additive zero-mean Gaussian noise. We conduct the experiment by fixing $M = 28$ and $K = 8$ and varying the SNR from 10 to 50 dB. The experiment is performed for 1000 random sparse ensembles and the average reconstruction error is recorded. From Figure 6, we can see that the reconstruction errors decrease as the SNR increases, and the error of the proposed method is smaller than that of the others. A sketch of the noise generation is given below.
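The following sketch shows one way to generate the noisy measurement: the Gaussian noise vector is rescaled so that the measurement reaches a prescribed SNR in dB.

```python
# A sketch of the noisy measurement y = Phi x + v at a prescribed SNR (dB).
import numpy as np

def noisy_measurement(Phi, x, snr_db, rng):
    y = Phi @ x
    v = rng.standard_normal(y.shape)
    v *= np.linalg.norm(y) / (np.linalg.norm(v) * 10 ** (snr_db / 20))
    return y + v
```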
Table 3 shows that the $\mu_{all}$ of Entezari is slightly larger than that of our method, and it is interesting to note from Figure 3 that the number of off-diagonal entries with smaller absolute values in Entezari is significantly larger than that of our method. Moreover, it can be seen from Table 2 that the $\mu_{ave}$ of Hong is slightly lower than that of our method. However, the simulation results show that our method outperforms the others in terms of reconstruction performance. It is also worth noting that our method reduces $\mu_{ave}$, $\mu_{\max}$, and $\mu_{all}$ simultaneously, leading to better reconstruction performance in CS. This implies that a single mutual coherence index cannot accurately reflect the actual performance of the methods, and verifies the necessity of using multiple indexes simultaneously in measurement matrix optimization.

4.4. Different Kinds of Φ and Ψ Optimized by the Proposed Methods

To analyze the performance of our method with various measurement matrices and dictionary matrices, a series of simulations are carried out in this section. We choose the measurement matrix as a Gaussian random matrix or a Bernoulli random matrix, and the dictionary matrix as a Gaussian random matrix or the DCT matrix, respectively. We compare the mutual coherence indexes and the reconstruction performance before and after optimization. When $\Psi$ is the Gaussian random matrix, $\Phi \in \mathbb{R}^{M \times 80}$ and $\Psi \in \mathbb{R}^{80 \times 120}$. When $\Psi$ is the DCT matrix, $\Phi \in \mathbb{R}^{M \times 120}$ and $\Psi \in \mathbb{R}^{120 \times 120}$. Each experiment is performed for 1000 random sparse ensembles.
The mutual coherence indexes of the different measurement matrices $\Phi$ with different dictionary matrices $\Psi$ are shown in Figure 7. As seen from the simulations, all the optimized measurement matrices produce smaller $\mu_{\max}$, $\mu_{ave}$, and $\mu_{all}$ than the random ones.
Figure 8 and Figure 9 present the reconstruction performance of OMP with the optimized measurement matrices and the random ones. It is seen from the graphs in these figures that all the optimized matrices outperform the random ones in terms of the percentage of exact reconstruction.

5. Conclusions

This paper focused on the optimization of the measurement matrix for compressed sensing. To decrease $\mu_{\max}$, $\mu_{ave}$, and $\mu_{all}$ simultaneously, we designed a new target Gram matrix, which was obtained by applying a new shrinkage function to the Gram matrix and then updated by rank reduction and eigenvalue averaging. We then characterized the analytical solutions of the measurement matrix by SVD. Based on alternating minimization, we proposed an iterative method to optimize the measurement matrix. The simulation results show that the proposed method reduces $\mu_{\max}$, $\mu_{ave}$, and $\mu_{all}$ simultaneously and outperforms the existing algorithms in terms of reconstruction performance. In addition, the proposed method is computationally less expensive than some existing algorithms in the literature.
As detailed above, we obtained the optimal value of $c$ for a fixed matrix scale through simulation. When the scale changes, the value of $c$ found in Section 4.1 may no longer be applicable. Therefore, it is meaningful to find the theoretical 'optimal value' of $c$. Furthermore, noting that lower mutual coherence indexes mean potentially higher reconstruction performance, further efforts are needed to decrease the indexes simultaneously.

Author Contributions

Conceptualization: R.Y.; Methodology: C.C.; Software: R.Y.; Validation: B.W. and Y.G.; Formal analysis: C.C.; Data curation: R.Y.; Writing—original draft preparation: R.Y. and B.W.; Writing—review and editing: C.C. and Y.G. All authors have read and agreed to the published version of the manuscript.

Funding

No funding was received for conducting this study.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306.
  2. Xie, Y.; Yu, J.; Guo, S.; Ding, Q.; Wang, E. Image Encryption Scheme with Compressed Sensing Based on New Three-Dimensional Chaotic System. Entropy 2019, 21, 819.
  3. Fang, Y.; Li, L.; Li, Y.; Peng, H.; Yang, Y. Low Energy Consumption Compressed Spectrum Sensing Based on Channel Energy Reconstruction in Cognitive Radio Network. Sensors 2020, 20, 1264.
  4. Martinez, J.A.; Ruiz, P.M.; Skarmeta, A.F. Evaluation of the Use of Compressed Sensing in Data Harvesting for Vehicular Sensor Networks. Sensors 2020, 20, 1434.
  5. Donoho, D.L.; Elad, M. Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ1 minimization. Proc. Natl. Acad. Sci. USA 2003, 100, 2197–2202.
  6. Candes, E.J.; Tao, T. Decoding by linear programming. IEEE Trans. Inf. Theory 2005, 51, 4203–4215.
  7. Donoho, D.L.; Stark, P.B. Uncertainty Principles and Signal Recovery. SIAM J. Appl. Math. 1989, 49, 906–931.
  8. Elad, M. Optimized Projections for Compressed Sensing. IEEE Trans. Signal Process. 2007, 55, 5695–5702.
  9. Hu, S. An Optimization Method for Measurement Matrix Based on Eigenvalue Decomposition. Signal Process. 2012.
  10. Yan, W.; Wang, Q.; Shen, Y. Shrinkage-Based Alternating Projection Algorithm for Efficient Measurement Matrix Construction in Compressive Sensing. IEEE Trans. Instrum. Meas. 2014, 63, 1073–1084.
  11. Duarte-Carvajalino, J.M.; Sapiro, G. Learning to Sense Sparse Signals: Simultaneous Sensing Matrix and Sparsifying Dictionary Optimization. IEEE Trans. Image Process. 2009, 18, 1395–1408.
  12. Lu, C.; Li, H.; Lin, Z. Optimized Projections for Compressed Sensing via Direct Mutual Coherence Minimization. Signal Process. 2018, 151, 45–55.
  13. Xu, J.; Pi, Y.; Cao, Z. Optimized projection matrix for compressive sensing. EURASIP J. Adv. Signal Process. 2010, 2010, 560349.
  14. Abolghasemi, V.; Ferdowsi, S.; Sanei, S. A gradient-based alternating minimization approach for optimization of the measurement matrix in compressive sensing. Signal Process. 2012, 92, 999–1009.
  15. Zheng, H.; Li, Z.; Huang, Y. An Optimization Method for CS Projection Matrix Based on Quasi-Newton Method. Acta Electron. Sin. 2014, 42, 1977–1982.
  16. Hong, T.; Bai, H.; Li, S.; Zhu, Z. An efficient algorithm for designing projection matrix in compressive sensing based on alternating optimization. Signal Process. 2016, 125, 9–20.
  17. Entezari, R.; Rashidi, A. Measurement matrix optimization based on incoherent unit norm tight frame. AEU Int. J. Electron. Commun. 2017, 82, 321–326.
  18. Sustik, M.A.; Tropp, J.A.; Dhillon, I.S.; Heath, R.W. On the existence of equiangular tight frames. Linear Algebra Its Appl. 2007, 426, 619–635.
  19. Welch, L. Lower bounds on the maximum cross correlation of signals. IEEE Trans. Inf. Theory 1974, 20, 397–399.
  20. Aharon, M.; Elad, M.; Bruckstein, A.M. K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation. IEEE Trans. Signal Process. 2006, 54, 4311–4322.
  21. Fickus, M.; Mixon, D.G. Tables of the existence of equiangular tight frames. arXiv 2015, arXiv:1504.00253.
  22. Li, G.; Zhu, Z.; Yang, D.; Chang, L.; Bai, H. On Projection Matrix Optimization for Compressive Sensing Systems. IEEE Trans. Signal Process. 2013, 61, 2887–2898.
  23. Tropp, J.A.; Gilbert, A.C. Signal Recovery from Random Measurements Via Orthogonal Matching Pursuit. IEEE Trans. Inf. Theory 2007, 53, 4655–4666.
Figure 1. (a) $\mu_{\max}$ and $P_{suc}$ versus $c$; (b) $\mu_{ave}$ and $P_{suc}$ versus $c$; (c) $\mu_{all}$ and $P_{suc}$ versus $c$, all with $M = 28$ and $K = 8$.
Figure 2. The convergence results: the evolution of (a) $\mu_{\max}$, (b) $\mu_{ave}$, and (c) $\mu_{all}$, all versus the iteration number, where $M = 28$.
Figure 3. Histogram of the absolute off-diagonal values of $(\Phi_{opt} \Psi)^\top (\Phi_{opt} \Psi)$ for $M = 28$.
Figure 4. The change tendency of $P_{suc}$ with $M$ while $K = 8$ in the noiseless case.
Figure 5. The change tendency of $P_{suc}$ with $K$ while $M = 28$ in the noiseless case.
Figure 6. The change tendency of $\varepsilon$ with SNR while $M = 40$ and $K = 8$.
Figure 7. The evolution of (a) $\mu_{\max}$, (b) $\mu_{ave}$, and (c) $\mu_{all}$, all versus the number of measurements with different $D$.
Figure 8. The change tendency of $P_{suc}$ with $M$ while $K = 8$ in the noiseless case.
Figure 9. The change tendency of $P_{suc}$ with $K$ while $M = 28$ in the noiseless case.
Table 1. $\mu_{\max}$ by Elad, Hong, Entezari, and the proposed method versus measurement dimension $M$.

M     Elad     Hong     Entezari   Propose   $\mu_{welch}$
12    0.8971   0.8884   0.8465     0.8072    0.2750
16    0.8157   0.6488   0.7517     0.5211    0.2337
20    0.7610   0.4625   0.6710     0.4218    0.2050
24    0.6972   0.3920   0.5874     0.3618    0.1833
28    0.6186   0.3403   0.5351     0.3223    0.1662
32    0.5460   0.3069   0.4911     0.2919    0.1520
36    0.4879   0.2804   0.4472     0.2659    0.1400
40    0.4454   0.2591   0.4177     0.2487    0.1296
44    0.3802   0.2404   0.3927     0.2302    0.1205
Table 2. $\mu_{ave}$ by Elad, Hong, Entezari, and the proposed method versus measurement dimension $M$.

M     Elad     Hong     Entezari   Propose   $\mu_{welch}$
12    0.4168   0.4528   0.4141     0.3608    0.2750
16    0.3507   0.2964   0.3532     0.2979    0.2337
20    0.3016   0.2566   0.3111     0.2581    0.2050
24    0.2431   0.2272   0.2772     0.2288    0.1833
28    0.2212   0.2042   0.2512     0.2060    0.1662
32    0.2056   0.1854   0.2302     0.1879    0.1520
36    0.1937   0.1700   0.2118     0.1727    0.1400
40    0.1841   0.1567   0.1961     0.1600    0.1296
44    0.1760   0.1454   0.1824     0.1491    0.1205
Table 3. $\mu_{all}$ by Elad, Hong, Entezari, and the proposed method versus measurement dimension $M$.

M     Elad      Hong      Entezari   Propose
12    1220.20   2638.47   1081.18    1080.19
16    898.80    802.77    780.92     780.16
20    721.41    630.91    605.72     600.13
24    571.34    512.24    480.67     480.12
28    474.76    425.25    394.86     394.39
32    404.35    358.49    330.55     330.11
36    353.96    307.11    280.55     280.10
40    315.03    265.40    240.55     240.10
44    285.51    231.43    207.79     207.38
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
