Abstract
It is challenging to recover a real sparse signal using one-bit compressive sensing. Existing methods work well when there is no noise (sign flips) in the measurements or when the noise level or a priori information about the signal sparsity is known. However, the noise level and a priori information about the signal sparsity are not always available in practice. In this paper, we propose a robust model with a non-smooth and non-convex objective function. The model accounts for the noise without requiring the noise level or a priori information about the signal sparsity. We develop an alternating proximal algorithm and prove that the sequence generated by the algorithm converges to a local minimizer of the model. Our algorithm possesses high time efficiency and recovery accuracy. Moreover, it performs better than the other algorithms tested in our experiments even when the noise level and the sparsity of the signal are known.
Keywords:
one-bit compressive sensing; sparse signal reconstruction; alternating proximal algorithm; local convergence
MSC:
49M37; 65K10; 65K15; 90C90
1. Introduction
One-bit compressive sensing (CS) was originally introduced in [1]. It has been applied extensively, for example in [2,3,4,5]. Let x be a sparse signal in $\mathbb{R}^n$ and let B be an $m \times n$ measurement matrix. The measurement vector of the signal x is calculated via
$$ b = \operatorname{sign}(Bx), \qquad (1) $$
where $\operatorname{sign}(\cdot)$ operates componentwise, with $\operatorname{sign}(t) = 1$ if $t \ge 0$ and $\operatorname{sign}(t) = -1$ otherwise. The measurement operator is called the one-bit scalar quantizer. The prime target of one-bit CS is to recover the sparse signal x from the one-bit observation b and the measurement matrix B. We now review existing models for the one-bit CS problem. The ideal optimization model is the following $\ell_0$-norm minimization:
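As a small illustration of the one-bit scalar quantizer described above, here is a minimal NumPy sketch (the function name is ours):

```python
import numpy as np

def one_bit_measure(B, x):
    """One-bit measurements b = sign(Bx), applied componentwise with
    sign(t) = 1 for t >= 0 and -1 otherwise."""
    return np.where(B @ x >= 0, 1.0, -1.0)
```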
where $\|x\|_0$ is the $\ell_0$-norm of x, counting the number of its non-zero entries. Since model (2) is NP-hard, it must be approximated in practice. The earliest attempt can be traced back to [1], in which the ideal model (2) was relaxed by
where Y is a diagonal matrix with diagonal entries taken from b, and $\|x\|_1$ is the sum of the absolute values of the components of x. This model is valid for the noiseless case. For the case in which the measurements are contaminated by noise (sign flips), instead of solving model (3), ref. [1] relaxed it as follows:
where is a parameter, and represents the projection operator onto the convex set . Since both models (3) and (4) minimize convex objective functions over the non-convex unit sphere, to overcome the difficulties resulting from non-convexity, the following model was proposed in [6]:
where r is an arbitrary positive number. Obviously, the set determined by the two constraints is convex. In addition, the model can be solved efficiently by linear programming. However, it was shown numerically that the solutions of model (5) are not sparse enough when the signal is known to be s-sparse, that is, $\|x\|_0 \le s$. A newer model for the one-bit CS problem,
was proposed in [7], where the function h is either the one-sided $\ell_1$ function or the one-sided $\ell_2$ function. An algorithm called binary iterative hard thresholding (BIHT) was developed for solving model (6). Based on BIHT, the adaptive outlier pursuit (AOP) technique was introduced in [8] for solving the following one-bit CS model:
where L is the noise level, that is, at most L measurements in y are wrongly detected (with sign flips), and the additional variable is an $m \times m$ diagonal matrix whose diagonal entries are either 1 or 0. If the ith diagonal entry equals 1, then the ith component of y is correct; otherwise, it is incorrect. If this diagonal matrix is specified, model (7) reduces to model (6) by discarding the incorrect measurements. However, in model (7), the noise level L must be given in advance, a condition that is hardly ever satisfied in practice. The following one-sided $\ell_0$ (OSL0) model was proposed in [9] for the case in which L is unavailable:
where the three parameters are positive, and e denotes the all-ones vector of the appropriate dimension. In [9], a fixed-point proximity algorithm was proposed for solving model (8). It was shown that the sequence generated by the fixed-point proximity algorithm converges to a local minimizer of the objective function of model (8), and that it converges to a global minimizer of that function as long as the initial estimate is sufficiently close to some global minimizer. In [10], the authors took advantage of regularized least squares to address the one-bit CS problem regardless of the sign information of the measurements, namely,
where the regularization weight is a given positive parameter. A primal dual active set algorithm was proposed to solve model (9) and was proved to converge within one step under two assumptions: the submatrix of B indexed by the nonzero components of the sparse solution has full row rank, and the initial point is sufficiently close to the sparse solution. Therefore, the generated sequence again has only a local convergence property. To eliminate the assumptions on the data B and obtain better convergence results, Xiu et al. [11] proposed the following double-sparsity constrained model:
where the penalty parameter is positive, and the two positive integers represent prior upper bounds on the signal sparsity and on the number of sign flips, respectively. An algorithm called gradient projection subspace pursuit was proposed in [11] to solve model (10).
To solve the models reviewed above, either a priori knowledge is required or the algorithm lacks a convergence analysis, which excludes these methods from many potential applications. In this study, we introduce a model for the noisy one-bit CS problem that requires neither a priori knowledge of the signal sparsity nor a priori knowledge of the noise level of the measurements. We develop an algorithm to solve the proposed model and analyze its convergence. Our algorithm possesses high time efficiency and recovery accuracy. Moreover, it performs better than other existing algorithms even when the noise level and the sparsity of the signal are known. To summarize, our algorithm is suitable for many more practical scenarios.
2. Elementary Facts
The Euclidean scalar product on $\mathbb{R}^n$ and its corresponding norm are denoted by $\langle \cdot, \cdot \rangle$ and $\|\cdot\|$, respectively. For a symmetric matrix A, the maximum eigenvalue is denoted by $\lambda_{\max}(A)$. The M-norm of a vector x is defined by $\|x\|_M := \sqrt{\langle Mx, x \rangle}$ whenever the matrix M is positive definite.
Definition 1
(Rockafellar and Wets [12]). Let $f : \mathbb{R}^n \to (-\infty, +\infty]$ be a proper lower semicontinuous function.
- (i) The domain of f is defined and denoted by $\operatorname{dom} f := \{x \in \mathbb{R}^n : f(x) < +\infty\}$;
- (ii) For each $x \in \operatorname{dom} f$, the Fréchet subdifferential of f at x, written $\hat{\partial} f(x)$, is the set of vectors $v \in \mathbb{R}^n$ that satisfy $\liminf_{y \to x,\, y \neq x} \big(f(y) - f(x) - \langle v, y - x \rangle\big) / \|y - x\| \ge 0$. If $x \notin \operatorname{dom} f$, then $\hat{\partial} f(x) := \emptyset$;
- (iii) The limiting-subdifferential (Mordukhovich [13]), or simply the subdifferential for short, of f at $x \in \operatorname{dom} f$, written $\partial f(x)$, is defined as $\partial f(x) := \{ v \in \mathbb{R}^n : \text{there exist } x_k \to x,\ f(x_k) \to f(x),\ \text{and } v_k \in \hat{\partial} f(x_k) \text{ with } v_k \to v \}$.
A necessary (but not sufficient) condition for a point to be a minimizer of a proper lower semicontinuous function f is that 0 belongs to the subdifferential of f at that point. For a proper lower semi-continuous function f, the proximity operator of f is defined by [14]
$$\operatorname{prox}_f(z) := \operatorname*{argmin}_{x} \left\{ f(x) + \tfrac{1}{2}\|x - z\|^2 \right\}.$$
Clearly, for any $u \in \operatorname{prox}_f(z)$, by the calculus of the limiting-subdifferential, we have that $z - u \in \partial f(u)$.
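For intuition, the proximity operator has a simple closed form for $f = \lambda\|\cdot\|_1$, namely componentwise soft thresholding; a minimal sketch follows (this particular f is only an illustration, not the function used later in the paper):

```python
import numpy as np

def prox_l1(z, lam):
    """Proximity operator of f(x) = lam * ||x||_1 at z, i.e., the minimizer of
    lam * ||x||_1 + 0.5 * ||x - z||^2, given by soft thresholding."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)
```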
3. The Model
A classical approach to problem (1) is the least squares (LS) approach [15], in which the estimator is chosen to minimize the data error:
Notice that the LS solution may have a huge norm and is thus meaningless. Regularization methods are needed to stabilize the solution. The basic concept of regularization is to replace the original problem with a “nearby” problem whose solution approximates the required solution. A popular regularization technique is Tikhonov regularization [16], in which a quadratic penalty is added to the objective function:
The second term in the above minimization problem is a regularization term that controls the norm of the solution. Since x is a sparse signal, $\ell_1$ regularization (see, e.g., [17,18]) can be used instead to induce sparsity in the optimal solution.
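For reference, one common instantiation of these three formulations reads as follows (an illustrative sketch on our part, treating the one-bit observations b as ordinary data and writing $\lambda > 0$ for a generic regularization weight; the paper's exact data term may differ):

```latex
\min_{x\in\mathbb{R}^n} \|Bx-b\|_2^2 \quad \text{(LS)}, \qquad
\min_{x\in\mathbb{R}^n} \|Bx-b\|_2^2+\lambda\|x\|_2^2 \quad \text{(Tikhonov)}, \qquad
\min_{x\in\mathbb{R}^n} \|Bx-b\|_2^2+\lambda\|x\|_1 \quad \text{($\ell_1$)}.
```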
Notice that for any and , the following identity holds:
It follows from [9] (Proposition 3.1) that the function can be majorized by a lower semi-continuous function . We therefore substitute expression in model (16) by the lower semi-continuous function . For notational simplicity, we set and for a fixed . As a consequence, model (16) can be recast as
where . The objective function of model (18) is lower semi-continuous and coercive; therefore, model (18) attains its minimum. By introducing an additional variable , we can rewrite problem (18) as
Furthermore, model (19) can be approximated by the following unconstrained optimization problem (see, e.g., [19], model (2)):
where the relaxation parameter is positive. Model (20) has more than one local minimizer. We are now ready to estimate an upper bound on the number of local minimizers of model (20).
Proposition 1.
The number of local minimizers for model (20) is no more than .
Proof.
Let be a local minimizer of model (20). Then, there is a positive number such that
holds for all satisfying . For the vector , we define . In association with this set, we further define
which is a convex set in . Obviously, we have and for all Hence,
holds for all u satisfying . That is, the function attains its local minimum at the vector . Since the function is a strictly convex function on , it actually attains its global minimum at the vector . Notice that the number of all possible sets is . Therefore, there are a total functions of the form , each of which has at most one minimizer. As a result, the number of local minimizers for model (20) is no more than . □
4. Alternating Proximal Algorithm
In this section, we describe an alternating minimization algorithm (see [19,20,21,22]) for finding local minimizers of the objective function of model (20). The alternating discrete dynamical system to be studied is of the following form:
where and are positive definite operators. Next, we address the computation of and . To update more easily, we take with . Then, the x-subproblem can be solved by
where the operator with at is defined by
We take with . Then, the y-subproblem can be solved by
The proximity operator with at can be presented as follows [9] (Proposition 7.2):
Thus, a complete algorithm for solving model (20) can be presented as follows:
Next, we establish the convergence results of the sequence generated by Algorithm 1, where denotes all natural numbers. We use to denote the set of limit points of the sequence and to denote the set of critical points of the function L.
Algorithm 1: Alternating proximal algorithm (APA) for model (20).
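A minimal Python sketch of the alternating structure of Algorithm 1 follows (the function and the generic callables step_x and step_y are ours; they stand in for the closed-form proximal updates of the x- and y-subproblems derived above):

```python
import numpy as np

def apa(step_x, step_y, x0, y0, max_iter=200, tol=1e-6):
    """Alternating proximal iteration: x is updated with y held fixed via a
    proximal step, then y is updated with the new x held fixed."""
    x = np.asarray(x0, dtype=float).copy()
    y = np.asarray(y0, dtype=float).copy()
    for _ in range(max_iter):
        x_new = step_x(x, y)       # x-subproblem with y fixed
        y_new = step_y(x_new, y)   # y-subproblem with the new x fixed
        done = max(np.linalg.norm(x_new - x), np.linalg.norm(y_new - y)) < tol
        x, y = x_new, y_new
        if done:
            break
    return x, y
```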
Proposition 2.
Let the sequence be generated by Algorithm 1. Then, the following hold:
- (i)
- The sequence is nonincreasing and convergent;
- (ii)
- The sequence is bounded and
- (iii)
- For all , definewe then have
- (iv)
- is a nonempty compact connected set and ;
- (v)
- L is finite and constant on , equal to .
Proof.
(i) It follows from (25), (27), and the definition of that
which implies that
Since and are positive definite, the sequence does not increase. Since , the sequence is convergent.
(ii) Since the function is coercive, proper lower semi-continuous, and bounded below, the sequence is bounded. In addition, we have from inequality (33) and the convergence of .
(iii) By the very definition of and , we have that for all ,
Because of the definition of , we have
Hence, we can obtain from (34) and (35) that
This yields with [19] (Proposition 2.1). Furthermore, it follows from (29) that .
(iv) It follows from (ii) and results of point-set topology that is nonempty, compact, and connected. Let be a point in ; then there exists a subsequence of converging to . Furthermore, by the definition of , we have for all and ,
It follows from (29) that by replacing k with in (37) and letting , we can deduce
In particular, for , we have that . Since the function is lower semicontinuous, we have . There is no loss of generality in assuming that the whole sequence converges to , i.e.,
Notice that the function is continuous, so we have
Thus, . It follows from (31) that and . Owing to the closedness of , we determine that . Hence, .
(v) Let be a point in so that there is a subsequence of , with . Since the sequence is convergent, we have independent of , i.e., L is finite and constant on . □
The next result shows that the sequence generated by Algorithm 1 converges to a local minimizer of (20).
Theorem 1.
The sequence generated by Algorithm 1, with the initial point , converges to a local minimizer of (20).
Proof.
Let be a point in . Then, there exists a subsequence of converging to , and (39) holds. Define
For all satisfying , we deduce that the entries of both y and are all less than on the index set and are all greater than on the index set . Hence, by the definition of , we have
On the one hand, there exists at least one index such that . Then, . Thus, we have
Further, we denote the function by . Since is continuous, there is a constant such that
holds for all satisfying . Choose . It follows from (44) and (43) that for all satisfying , there exists at least one index such that , and we have .
On the other hand, for all satisfying and , we have
By the definition of , we get
which, together with (29) and (39), implies that
Thus, we have that
For any , define , then and . We denote the function by . Then, is a convex function and
Hence, we have
which can be reduced to
By letting , it yields that
Furthermore, by the definition of , we have
It follows from (29) and the continuity of the operator that
which is equivalent to
Hence, we obtain
It follows from (52) that
which, together with (56), implies that
We use to denote the function . Then is a convex function. For any , define . Then, and . It follows from (58) that
which can be reduced to
i.e.,
By letting , we can obtain
It follows from (45) and (62) that we have for all and . Thus, is a local minimizer of (20). By Proposition 1 and (iv) of Proposition 2, we have that the sequence converges to a local minimizer of (20). □
Remark 1.
It follows from the proof of Theorem 1 that if , then we have . At this point, we have that is also a local minimizer of (19).
The next theorem shows that if the initial point of Algorithm 1 is sufficiently close to any one of the global minimizers of the function L given in (20), then the sequence generated by Algorithm 1 converges to a global minimizer of model (20).
Theorem 2.
If the initial point of Algorithm 1 is sufficiently close to a global minimizer of the function L given in (20), then the sequence generated by Algorithm 1 converges to a global minimizer of model (20).
Proof.
We know from Proposition 1 that model (20) has a finite number of local minimizers. If the function has a unique local minimal value, then by Theorem 1, we have that the sequence converges to a global minimizer of (20). Otherwise, we suppose that has at least two local minimal values, and we denote the second smallest minimal value by M. Then, we have that . Since the function is continuous, we can determine that if is sufficiently close to . It follows from (42) that if with , we have . By (i) of Proposition 2, it holds that for all k. By Theorem 1, we know that the sequence converges to which is a local minimizer of . By the lower semi-continuity of , we get . This implies that must be a global minimizer of (20). This completes the proof of the desired result. □
5. Numerical Simulations
In this section, we describe simulation experiments conducted to demonstrate the effectiveness of the proposed Algorithm 1. Our code was written in MATLAB R2015b and executed on a Dell personal computer with an 11th Gen Intel(R) Core(TM) i5-1135G7 CPU (2.40 GHz) and 16 GB of memory.
We generated an $m \times n$ matrix B whose entries were independent and identically distributed (i.i.d.) samples of the standard Gaussian distribution. We then generated an s-sparse true signal whose nonzero components were drawn as i.i.d. samples of the standard Gaussian distribution. To avoid tiny nonzero entries, we increased the magnitude of each nonzero component by a fixed offset and then normalized the resulting vector to be a unit vector. The ideal one-bit measurement vector y is obtained by applying the one-bit quantizer to the product of B and the true signal. To simulate sign flips in actual one-bit measurements, we randomly selected a fraction of the components of y and flipped their signs, denoting the resulting vector by b. This fraction is the flipping ratio of the measurements.
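A minimal NumPy sketch of this data-generation procedure (the offset 0.1 used to avoid tiny nonzero entries and all names are illustrative assumptions on our part):

```python
import numpy as np

def make_instance(m, n, s, flip_ratio, seed=0):
    """Generate a random one-bit CS test instance as described above."""
    rng = np.random.default_rng(seed)
    B = rng.standard_normal((m, n))                # i.i.d. Gaussian sensing matrix
    x = np.zeros(n)
    support = rng.choice(n, size=s, replace=False)
    g = rng.standard_normal(s)
    x[support] = np.sign(g) * (np.abs(g) + 0.1)    # avoid tiny nonzero entries
    x /= np.linalg.norm(x)                         # normalize to a unit vector
    y = np.where(B @ x >= 0, 1.0, -1.0)            # ideal one-bit measurements
    b = y.copy()
    flips = rng.choice(m, size=int(round(flip_ratio * m)), replace=False)
    b[flips] *= -1.0                               # simulate sign flips
    return B, x, y, b
```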
Three metrics were chosen to evaluate the quality of the signals reconstructed by one-bit compressive algorithms. They are the signal-to-noise ratio (SNR), Hamming error (HE), and Hamming distance (HD), defined, respectively, by
where the reconstructed signal is normalized to have unit norm. The higher the SNR value, the better the reconstructed signal. The values of HE and HD lie in the range [0, 1]; the smaller these values, the better the reconstructed signal.
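A sketch of the three metrics as they are commonly defined in the one-bit CS literature follows (an assumption on our part; the paper's exact formulas may differ in minor details):

```python
import numpy as np

def metrics(B, b, x_true, x_hat):
    """SNR in dB, Hamming error (vs. the noisy observations b), and Hamming
    distance (vs. the clean measurements sign(B x_true))."""
    sgn = lambda t: np.where(t >= 0, 1.0, -1.0)
    snr = 20.0 * np.log10(np.linalg.norm(x_true) / np.linalg.norm(x_true - x_hat))
    he = np.mean(sgn(B @ x_hat) != b)
    hd = np.mean(sgn(B @ x_hat) != sgn(B @ x_true))
    return snr, he, hd
```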
The larger the relaxation parameter in model (20), the closer model (20) is to model (19). Hence, in the subsequent numerical experiments, we initialized the relaxation parameter in Algorithm 1, doubled it every ten iterations, and kept it fixed once the total number of iterations exceeded 60. This is the same continuation strategy as described in [9]. For the other parameters, we chose fixed values, and the initial estimates of x and y were also fixed.
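The continuation rule just described can be written compactly as follows (a minimal sketch; the variable names and the initial value beta0 are ours):

```python
def beta_schedule(k, beta0):
    """Relaxation-parameter continuation: start from beta0, double it every
    ten iterations, and keep it fixed once the iteration count reaches 60."""
    return beta0 * 2 ** min(k // 10, 6)
```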
First, we explored the performance of Algorithm 1 without knowledge of the noise level or a priori information about the signal sparsity. We tried problems of several sizes, with n up to 5000. For each n, we generated 100 input vectors and reported the average results over 100 runs in Table 1. We used two stopping conditions with fixed tolerances. In all tests, once either condition was satisfied, Algorithm 1 terminated and output a thresholded version of the final iterate, where the thresholding operator is defined by
The parameter r is set to if ; otherwise, .
Table 1.
Numerical results of Algorithm 1.
From Table 1, we can see that Algorithm 1 can effectively recover the sparse signal x from the one-bit observation b when the flipping ratio is not too large.
Second, we assumed that the sparsity s was known and compared the performance of Algorithm 1 (APA) with two state-of-the-art algorithms, namely, OSL0 from [9] and AOP from [8]. In our experiments, the parameters for OSL0, AOP, and their variants were set as suggested in [8,9]. All methods started from the same initial points. In Algorithm 1, we projected x onto the set of s-sparse vectors every 25 iterations and output the projected final iterate.
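The sparsity projection used here can be sketched as follows (assuming, as is standard, that the set is {z : ||z||_0 <= s}, so the projection keeps the s entries of largest magnitude):

```python
import numpy as np

def project_sparse(x, s):
    """Euclidean projection onto {z : ||z||_0 <= s}: keep the s largest-magnitude
    entries of x and zero out the rest."""
    z = np.zeros_like(x)
    idx = np.argpartition(np.abs(x), -s)[-s:]
    z[idx] = x[idx]
    return z
```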
Three configurations were considered to test the robustness of Algorithm 1 for the one-bit compressive sensing problem. They were designed to examine different levels of noise in the measurements, different sizes of the sensing matrix B, and different values of the true sparsity s of the signal. In the first configuration, we fixed the problem size, set the sparsity to 10, and varied the noise level. In the second configuration, we fixed n and the noise level, set the sparsity to 10, and varied m so that the ratio m/n ranged over a prescribed interval. In the third configuration, we fixed the problem size and the noise level and varied the sparsity s.
For the first configuration, the average values of SNR, HE, and HD over 50 trials for the signals reconstructed by all algorithms are plotted against the noise level in Figure 1a, Figure 1b, and Figure 1c, respectively. We can observe from Figure 1 that Algorithm 1 performs better than the other algorithms once the noise level becomes sufficiently high.
Figure 1.
Average values of (a) SNR, (b) HE, and (c) HD over 50 trials vs. the noise level for all tested algorithms. We fix the problem size and set the sparsity to 10.
For the second configuration, the average values of SNR, HE, and HD over 50 trials for the signals reconstructed by all algorithms are plotted against the ratio m/n in Figure 2a, Figure 2b, and Figure 2c, respectively. We can see that Algorithm 1 performs better than the other algorithms when the ratio m/n is sufficiently large.
Figure 2.
Average values of (a) SNR, (b) HE, and (c) HD over 50 trials vs. the ratio m/n for all tested algorithms. We fix n and the noise level and set the sparsity to 10.
For the third configuration, the average values of SNR, HE, and HD over 50 trials for the signals reconstructed by all algorithms against the sparsity of the ideal signals are depicted in Figure 3a, Figure 3b, and Figure 3c, respectively. We can see that Algorithm 1 performs best among all the algorithms tested in our experiments.
Figure 3.
Average values of (a) SNR, (b) HE, and (c) HD over 50 trials vs. the true sparsity s for all tested algorithms. We fix the problem size and the noise level.
6. Conclusions
In this paper, we propose a robust model for the one-bit CS problem and prove that the model has a finite number of local minimizers. We propose an alternating proximal algorithm for solving the model and prove the following: the sequence of objective function values is nonincreasing and convergent; the set of limit points of the sequence generated by the algorithm is contained in the set of critical points of the objective function; the sequence generated by the algorithm converges to a local minimizer of the objective function of the proposed model; and the sequence converges to a global minimizer of the function as long as the initial estimate is sufficiently close to any global minimizer of the function. The proposed algorithm is suitable for practical scenarios in which neither the noise level nor the sparsity of the signal is known in advance. Our algorithm possesses high time efficiency and recovery accuracy. Moreover, it performs better than the other algorithms tested in our experiments even when the noise level and the sparsity of the signal are known.
Author Contributions
J.-J.W. is responsible for conceptualization, data curation, formal analysis, investigation, software, visualization, and writing—original draft. Y.-H.H. is responsible for conceptualization, funding acquisition, methodology, validation, writing—original draft, and writing—review & editing. All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by the National Natural Sciences Grant (No. 11871182) and the program for scientific research start-up funds of Guangdong Ocean University (060302102004 and 060302102005).
Data Availability Statement
The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.
Conflicts of Interest
The authors have no competing interests to declare that are relevant to the content of this article.
References
- Boufounos, P.T.; Baraniuk, R.G. 1-bit Compressive Sensing. In Proceedings of the Conference on Information Science and Systems (CISS), Princeton, NJ, USA, 19 March 2008. [Google Scholar] [CrossRef]
- Zhou, Z.; Chen, X.; Guo, D.; Honig, M.L. Sparse channel estimation for massive MIMO with 1-bit feedback per dimension. In Proceedings of the 2017 IEEE Wireless Communications and Networking Conference (WCNC), San Francisco, CA, USA, 19–22 March 2017. [Google Scholar] [CrossRef]
- Chen, C.H.; Wu, J.Y. Amplitude-aided 1-bit compressive sensing over noisy wireless sensor networks. IEEE Wirel. Commun. Lett. 2015, 4, 473–476. [Google Scholar] [CrossRef]
- Fu, N.; Yang, L.; Zhang, J. Sub-nyquist 1-bit sampling system for sparse multiband signals. In Proceedings of the 2014 22nd European Signal Processing Conference (EUSIPCO), Lisbon, Portugal, 1–5 September 2014. [Google Scholar]
- Li, Z.; Xu, W.; Zhang, X.; Lin, J. A survey on one-bit compressed sensing: Theory and applications. Front. Comput. Sci. 2018, 12, 217–230. [Google Scholar] [CrossRef]
- Plan, Y.; Vershynin, R. One-bit compressed sensing by linear programming. Commun. Pure Appl. Math. 2013, 66, 1275–1297. [Google Scholar] [CrossRef]
- Jacques, L.; Laska, J.; Boufounos, P.T.; Baraniuk, R.G. Robust 1-bit compressive sensing via binary stable embeddings of sparse vectors. IEEE Trans. Inf. Theory 2013, 59, 2082–2102. [Google Scholar] [CrossRef]
- Yan, M.; Yang, Y.; Osher, S. Robust 1-bit compressive sensing using adaptive outlier pursuit. IEEE Trans. Signal. Process. 2012, 60, 3868–3875. [Google Scholar] [CrossRef]
- Dai, D.Q.; Shen, L.; Xu, Y.; Zhang, N. Noisy 1-bit compressive sensing: Models and algorithms. Appl. Comput. Harmon. Anal. 2016, 40, 1–32. [Google Scholar] [CrossRef]
- Huang, J.; Jiao, Y.; Lu, X.; Zhu, L. Robust decoding from 1-bit compressive sampling with ordinary and regularized least squares. SIAM J. Sci. Comput. 2018, 40, 2062–2786. [Google Scholar] [CrossRef]
- Zhou, S.; Luo, Z.; Xiu, N.; Li, G.Y. Computing one-bit compressive sensing via double-sparsity constrained optimization. IEEE Trans. Signal. Process. 2022, 70, 1593–1608. [Google Scholar] [CrossRef]
- Rockafellar, R.T.; Wets, R. Variational Analysis, Grundlehren der Mathematischen Wissenschaften; Springer: Berlin, Germany, 1998; Volume 317. [Google Scholar]
- Mordukhovich, B. Variational Analysis and Generalized Differentiation. I. Basic Theory; Grundlehren der Mathematischen Wissenschaften; Springer: Berlin, Germany, 2006; Volume 330. [Google Scholar]
- Bauschke, H.L.; Combettes, P.L. Convex Analysis and Monotone Operator Theory in Hilbert Spaces; AMS Books in Mathematics; Springer: New York, NY, USA, 2011. [Google Scholar]
- Björck, A. Numerical Methods for Least Squares Problems; SIAM: Philadelphia, PA, USA, 1996. [Google Scholar]
- Tikhonov, A.N.; Arsenin, V.Y. Solution of Ill-Posed Problems; V. H. Winston: Washington, DC, USA, 1977. [Google Scholar]
- Daubechies, I.; Defrise, M.; Mol, C.D. An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Comm. Pure Appl. Math. 2004, 57, 1413–1457. [Google Scholar] [CrossRef]
- Figueiredo, M.A.T.; Nowak, R.D.; Wright, S.J. Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems. IEEE J.-STSP. 2007, 1, 586–597. [Google Scholar] [CrossRef]
- Attouch, H.; Bolte, J.; Redont, P.; Soubeyran, A. Proximal alternating minimization and projection methods for nonconvex problems: An approach based on the Kurdyka-Łojasiewicz inequality. Math. Oper. Res. 2010, 35, 438–457. [Google Scholar] [CrossRef]
- Attouch, H.; Soubeyran, A. Inertia and reactivity in decision making as cognitive variational inequalities. J. Convex. Anal. 2006, 13, 207–224. [Google Scholar]
- Attouch, H.; Redont, P.; Soubeyran, A. A new class of alternating proximal minimization algorithms with costs to move. SIAM J. Optimiz. 2007, 18, 1061–1081. [Google Scholar] [CrossRef]
- Attouch, H.; Bolte, J.; Redont, P.; Soubeyran, A. Alternating proximal algorithms for weakly coupled convex minimization problems. Applications to dynamical games and PDE’s. J. Convex. Anal. 2008, 15, 485–506. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).