1. Introduction
Because phase information is lost, reconstructing the underlying image from Fourier transform magnitudes is known as the phase retrieval (PR) problem. Phase retrieval problems are ill-posed: different images can produce the same Fourier transform magnitudes, so the solution is not unique. Prior information about the image, such as its support domain, nonnegativity, or a sparse representation [1,2], is needed to constrain the numerical process toward the globally optimal solution. Readers can consult [1,2,3] for applications such as regularizing seismic images beyond aliasing via gradient and spectral techniques, and for additional background.
Gerchberg and Saxton [4] proposed the projection-based error reduction (ER) method; in 1982, Fienup improved it with the hybrid input–output (HIO) algorithm [1]. Iterative projection algorithms derived from this line of work include hybrid projection–reflection [3,5], the iterative difference map [6] and the relaxed averaged alternating reflections (RAAR) algorithm [7]. In addition, Marchesini adopted a saddle-point optimization method [8] to solve the PR problem. Alternating projection methods lack a convergence guarantee because the constraint sets they project onto are nonconvex; the iterates may therefore converge to a local stationary point rather than the global optimum. Gradient-type methods have also become popular; one example is the Wirtinger Flow (WF) method [9] of Candès, Li and Soltanolkotabi, a gradient scheme with novel update rules that is carefully initialized by means of a spectral method. Gradient-based approaches usually exhibit first-order convergence. Farrell solved the PR problem over a network in which every agent holds only a subset of the measurements [10]. Convex methods are characterized by convex relaxations of the quadratic equations or by semidefinite programming. Candès, Strohmer and Voroninski proposed PhaseLift [11], which uses SDP lifting techniques to formulate a convex trace (nuclear) norm minimization. Moretta studied the impact of constraints in PhaseLift [12]. PhaseCut [13], proposed by Waldspurger, is a convex method that separates the phase information from the magnitudes. Yin and Xin proposed PhaseLiftOff [14], a nonconvex variant of PhaseLift which subtracts the Frobenius norm from the trace norm to retrieve phase information from fewer measurements. Xia studied a sparse phase retrieval method to recover a k-sparse signal [15]. Gao studied an adaptive sparse signal reconstruction algorithm [16].
In recent years, total variation (TV) regularization has been used successfully to solve image blind deblurring optimization problems with certain specific blurring kernels. Framelet-based regularization has likewise been adopted to solve deblurring problems with motion blur kernels. However, both approaches hold great potential for phase retrieval problems from Fourier transform magnitudes. In this paper, we propose a phase retrieval algorithm based on TV regularization and an analysis-based sparsity framelet transform, which recovers the image through an analysis matrix. We explicitly focus on the sparsity of the image representation in phase retrieval problems from Fourier transform magnitudes. Total variation regularization effectively imposes a sparsity prior on the gradient domain of ground truth images with a small TV semi-norm, and the framelet transform enforces a sparsity prior on target images under a redundant tight frame.
In the fields of image deblurring and phase recovery, a single Gaussian noise source is usually used to test the robustness of phase recovery algorithms against noise, whereas in image denoising, robustness against salt-and-pepper noise is studied. Both are typical noise types, and they coexist in actual measurements. In order to simulate more realistic measurement conditions, this paper studies, for the first time, the reconstruction ability of the algorithm under complex noise in which the two kinds of noise are present simultaneously. This is a very challenging problem.
Phase retrieval from Fourier transform magnitudes is an ill-posed problem, and it is more difficult than the phase retrieval problems derived from motion blur. Readers can consult the work of E. J. Candès [17,18], who studies the relationship between the number of measurements and the possibility of reconstructing an image. For research on further applications for reconstructing images from partial measurements, see the work of Chang Huibin [19]. On that basis, our research shows that when the illumination source meets a certain parameter setting, three sets of measured values suffice to reconstruct an image.
2. Foundations of Phase Retrieval
2.1. Phase Retrieval Model
Phase retrieval is a branch of deconvolution problems and can be expressed as follows:
$$ b = |\mathcal{F}(u)| + \epsilon, $$
where $\mathcal{F}$ is the two-dimensional discrete Fourier transform, $b$ is the measurement magnitude and $\epsilon$ is the noise. For convenience of calculation and implementation, we use vectors to represent two-dimensional discrete images in lexicographical order (stacked by columns); to keep the presentation of the moduli simple, the symbols are not changed. The image $u$ is defined on a discrete lattice $\Omega$ of size $n_1 \times n_2$ and is vectorized by stacking its columns. The corresponding discrete Fourier transform is expressed as follows:
$$ \mathcal{F}(u)(k_1,k_2) = \sum_{t_1=0}^{n_1-1}\sum_{t_2=0}^{n_2-1} u(t_1,t_2)\, e^{-2\pi i \left(\frac{k_1 t_1}{n_1} + \frac{k_2 t_2}{n_2}\right)}. $$
2.2. Total Variation
The total variation (TV) model of image processing, based on the partial differential equation (PDE) variational method, is known as the classic Rudin–Osher–Fatemi (ROF) model [20]. It is one of the most common models used for image restoration and generally consists of a fidelity term, a regularization penalty term and regularization parameters. In recent years, the TV model has been widely used in image denoising and other fields; readers can consult [21,22,23] for details. Total variation regularization effectively imposes a sparsity prior on the gradient domain of underlying images that have a small TV semi-norm.
In the following equations, $u$ denotes the image for phase retrieval, represented as a discrete two-dimensional matrix of size $n_1 \times n_2$. The total variation in the discrete domain is computed from the gradient of $u$, which is denoted by a gradient operator $\nabla$:
$$ (\nabla u)_{i,j} = \big((\nabla_x u)_{i,j},\, (\nabla_y u)_{i,j}\big), \qquad (\nabla_x u)_{i,j} = u_{i+1,j} - u_{i,j}, \qquad (\nabla_y u)_{i,j} = u_{i,j+1} - u_{i,j}, $$
where $u_{i,j}$ is an element of $u$, $(\nabla_x u)_{i,j}$ is the gradient in the horizontal direction and $(\nabla_y u)_{i,j}$ is the gradient in the vertical direction.

The total variation (TV) regularization term of $u$ is indicated as follows:
$$ \mathrm{TV}(u) = \sum_{i,j} \left| (\nabla u)_{i,j} \right|. $$

For the restoration model with data polluted by additive noise, it can be indicated as follows:
$$ \min_{u} \; \frac{1}{2}\| u - f \|_2^2 + \lambda\, \mathrm{TV}(u), $$
where $f$ is the observed image and $\lambda > 0$ is the parameter of the total variation regularization term. The total variation regularization penalty term is a semi-norm of the image gradient. One TV variant, namely the isotropic TV, is defined by
$$ \mathrm{TV}_{\mathrm{iso}}(u) = \sum_{i,j} \sqrt{ (\nabla_x u)_{i,j}^2 + (\nabla_y u)_{i,j}^2 }. $$
Another type is the anisotropic TV, which is defined by
$$ \mathrm{TV}_{\mathrm{aniso}}(u) = \sum_{i,j} \left( \left|(\nabla_x u)_{i,j}\right| + \left|(\nabla_y u)_{i,j}\right| \right). $$
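To make the two discrete TV variants concrete, the following minimal NumPy sketch computes both; the forward-difference convention and the zero handling at the last row/column are assumptions for illustration, not taken from the paper.

```python
import numpy as np

def gradients(u):
    """Forward differences with zero boundary (assumed convention)."""
    gx = np.zeros_like(u, dtype=float)
    gy = np.zeros_like(u, dtype=float)
    gx[:-1, :] = u[1:, :] - u[:-1, :]   # horizontal (row) direction
    gy[:, :-1] = u[:, 1:] - u[:, :-1]   # vertical (column) direction
    return gx, gy

def tv_isotropic(u):
    gx, gy = gradients(u)
    return np.sum(np.sqrt(gx**2 + gy**2))

def tv_anisotropic(u):
    gx, gy = gradients(u)
    return np.sum(np.abs(gx) + np.abs(gy))

if __name__ == "__main__":
    u = np.random.rand(8, 8)
    print(tv_isotropic(u), tv_anisotropic(u))
```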
2.3. Wavelet
Images have sparse representations or approximations under redundant transformations, such as tight frame transforms [24,25]. The wavelet [26] is one type of tight frame. A large number of studies have shown that sparsity priors and low-rank priors enable the corresponding algorithms to produce high-quality solutions [27,28].
2.3.1. Tight Framework
A tight frame in a Hilbert space is introduced as follows; interested readers can consult [27] for a more in-depth study. Let $\|\cdot\|$ denote the norm in a Hilbert space $\mathcal{H}$. A sequence $\{x_n\} \subset \mathcal{H}$ constitutes a tight frame in $\mathcal{H}$ when
$$ \|x\|^2 = \sum_{n} \left| \langle x, x_n \rangle \right|^2 \quad \text{for all } x \in \mathcal{H}, $$
where $\langle \cdot, \cdot \rangle$ indicates the inner product and $\|\cdot\|$ indicates the norm of $\mathcal{H}$.

For the bounded sequence $\{x_n\}$, let $W$ denote the analysis operator and let $W^{\top}$ denote its adjoint (synthesis) operator; they are defined by
$$ W: x \mapsto \{\langle x, x_n \rangle\}_{n}, \qquad W^{\top}: \{c_n\}_{n} \mapsto \sum_{n} c_n x_n. $$
The sequence constitutes a tight frame when $W^{\top} W = I$; therein, $I$ is the identity operator.
In the following equations, $u$ denotes the image for phase retrieval and $g = Wu$, where $g$ indicates the tight framelet transform coefficients. The phase retrieval deconvolution optimization model based on a sparse representation can be posed in two ways: the analysis-based model penalizes $\|Wu\|_1$ and optimizes directly over the image $u$, whereas the synthesis-based model penalizes $\|g\|_1$ and optimizes over the coefficients $g$, synthesizing the image as $u = W^{\top}g$.
The tight frame is a generalization of an orthonormal basis, and redundant frames have been found to be useful in image and signal processing [25,29,30]. The analysis operator $W$ is redundant when its column dimension is smaller than its row dimension, and in that case the two methods generate different results. The synthesis-based method seeks the sparsest result among all possible transform coefficient vectors, while the analysis-based method seeks the sparsest solution among all canonical framelet coefficient vectors; thus, the analysis-based method searches over a strict subset of the coefficients considered by the synthesis-based method. The solutions of analysis-based methods are close to the underlying image and have increased smoothness, which has been demonstrated empirically in many experiments. Since a solution with a certain smoothness has better visual quality, the analysis-based approach was chosen for this study. The two approaches are equivalent only if $W W^{\top} = W^{\top} W = I$; then, the tight frame becomes a canonical orthogonal transformation.
2.3.2. Wavelet Tight Framework
A one-dimensional wavelet frame is constructed from a finite set of generators $\Psi = \{\psi_1, \dots, \psi_r\}$ through shifts and dilations, $X(\Psi) = \{2^{j/2}\psi_l(2^{j}x - k) : 1 \le l \le r;\; j,k \in \mathbb{Z}\}$, where $\phi$ is called the father wavelet (scaling function) and the $\psi_l$ are called the wavelets. A refinable function $\phi$ is usually used to construct a tight wavelet frame. It satisfies the two-scale (refinement) equations $\hat{\phi}(2\xi) = \hat{a}_0(\xi)\hat{\phi}(\xi)$ and $\hat{\psi}_l(2\xi) = \hat{a}_l(\xi)\hat{\phi}(\xi)$, where the masks $\hat{a}_l$ are $2\pi$-periodic trigonometric polynomials which satisfy $\hat{a}_0(0) = 1$.
The unitary extension principle (UEP) [31] states that $X(\Psi)$ forms a tight frame when
$$ \sum_{l=0}^{r} \hat{a}_l(\xi)\,\overline{\hat{a}_l(\xi + \nu\pi)} = \delta_{\nu,0}, \qquad \nu \in \{0, 1\}, \;\; \text{for a.e. } \xi. $$
Image processing uses two-dimensional information, so two-dimensional wavelet frames are needed; indeed, a two-dimensional wavelet frame can be obtained as a tensor product of one-dimensional wavelets. Mallat and Meyer proposed MRA theory [31,32,33], which studies the multi-resolution analysis properties of wavelets from the perspective of function spaces and provides both a unified theory for constructing wavelet frames and a fast algorithm for the orthogonal wavelet transform. In this study, a two-level piecewise linear B-spline tight frame system with tensor product filters was adopted. The piecewise linear B-spline tight frame is the simplest system in this family; it employs the piecewise linear B-spline as the refinable function $\phi$. The corresponding one-dimensional masks are
$$ a_0 = \tfrac{1}{4}[1,\,2,\,1], \qquad a_1 = \tfrac{\sqrt{2}}{4}[1,\,0,\,-1], \qquad a_2 = \tfrac{1}{4}[-1,\,2,\,-1], $$
and the two-dimensional filters are obtained as their tensor products.
MATLAB R2016b was used for the calculations. We used the wavelet frame decomposition algorithm of [25], whose construction varies with the boundary conditions; in this research, Neumann (symmetric) boundary conditions were adopted. Interested readers can refer to [27,31,34] for the principles of generating such matrices.
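As an illustration of how such a tensor-product framelet decomposition can be implemented, the NumPy sketch below performs one undecimated decomposition level with the piecewise linear B-spline masks and verifies the tight-frame (perfect reconstruction) property. It is a minimal sketch for intuition, not the authors' MATLAB implementation; in particular, it uses periodic extension for simplicity, whereas the paper adopts symmetric (Neumann) boundary conditions.

```python
import numpy as np
from scipy.ndimage import correlate, convolve

# One-dimensional piecewise linear B-spline framelet masks.
a = [np.array([1.0, 2.0, 1.0]) / 4.0,
     np.sqrt(2.0) / 4.0 * np.array([1.0, 0.0, -1.0]),
     np.array([-1.0, 2.0, -1.0]) / 4.0]

# Two-dimensional filters as tensor products a_i (rows) x a_j (columns).
filters_2d = [np.outer(ai, aj) for ai in a for aj in a]

def framelet_decompose(u):
    """One undecimated decomposition level (periodic extension for simplicity)."""
    return [correlate(u, f, mode="wrap") for f in filters_2d]

def framelet_reconstruct(coeffs):
    """Synthesis: sum of convolutions with the same filters (tight frame property)."""
    return sum(convolve(c, f, mode="wrap") for c, f in zip(coeffs, filters_2d))

if __name__ == "__main__":
    u = np.random.rand(64, 64)
    coeffs = framelet_decompose(u)
    u_rec = framelet_reconstruct(coeffs)
    print("perfect reconstruction error:", np.max(np.abs(u - u_rec)))  # close to machine precision
```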
3. Proposed Model and Numerical Algorithm
3.1. Formulation of Minimization Model
Our work is closely related to article [17], where E. J. Candès discussed the possibility of reconstructing the target from incomplete sampling and suggested the number of measurements required to uniquely recover real-valued images of size $n_1 \times n_2$. Chang Huibin proposed a TV regularization model [19] which can recover images from noisy measurements. In this research, a model in which TV and wavelet-based regularization coexist is proposed to recover underlying images from $3n_1 n_2$ noisy measurements. The measurements were obtained by structured light illumination, with the image values subject to a box constraint. The model is introduced as follows, where $\mathcal{F}$ denotes the Fourier transform.
The data $R(u)$ are obtained using three-light-field illumination. The research in [19,35] showed that the least-squares method produces a unique result for the PR problem with these three sets of data. The model is referred to as the least-squares minimization problem with a box constraint (LSB) model [35].
This study focused on the sparse prior of objects; we incorporated TV and wavelet transform regularization into (17) to guarantee an exact solution. The analysis-based sparsity approach under wavelet tight frame decomposition was selected for this study. The proposed TV and framelet-based minimization problem of the least-squares type with a box constraint (TFLSB) model is presented as follows, where we adopt anisotropic TV regularization as the TV(u) term, $Wu$ is the wavelet decomposition of $u$, and $\|\cdot\|_1$ denotes the $\ell_1$ norm.
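For orientation, a schematic form of such an objective, written with illustrative symbols ($I_j$ for the three illumination fields and $[0, c]$ for the box bounds; these symbols and the fidelity weighting are assumptions rather than the paper's exact notation), is:

$$ \min_{u \in [0,\,c]^{n_1 n_2}} \; \frac{1}{2} \sum_{j=0}^{2} \big\| \, |\mathcal{F}(I_j \odot u)| - b_j \, \big\|_2^{2} \; + \; \lambda_1 \, \mathrm{TV}(u) \; + \; \lambda_2 \, \| W u \|_1 . $$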
3.2. Uniqueness Analysis
Hayes [36] and Sanz [37] proved that doubling the number of measurements uniquely determines the solution of PR when the underlying signal is nonnegative and finitely supported. The results were further extended with random oversampling [38,39,40], where random illumination guarantees absolute uniqueness and resolves all types of ambiguities. We found that when $s_1 = s_2 = N + 0.5$ (where $N$ is a positive integer), $3n_1 n_2$ measurements, together with additional constraints such as the underlying object $u$ being real and non-negative, allow the algorithm to produce a unique solution.
Theorem 1. Assume that the DFT of $u$ and of $D_s u$ is nonvanishing, that $s_1$ and $s_2$ are prime with $m$ and $n$, and that $s_1 = s_2 = N + 0.5$ ($N$ is a positive integer). Then, $u$ can be recovered from the $3mn$ measurements in (15).
Proof. This theorem applies to both the one-dimensional and two-dimensional cases. The complete proof is given in Appendix A. □
3.3. Solution Existence Analysis
Theorem 2. Let $\Omega$ denote a bounded set in a Lipschitz regular domain and let the comprehensive data $b = (b_0, b_1, b_2)$ be non-negative; then, the TFLSB model (19) has at least one minimizer $u^{*} \in BV(\Omega)$.

Proof. Denote the objective functional of (19) by $E(u) \ge 0$. Take a minimizing sequence $\{u_k\}_{0 \le k \le \infty}$ such that (s.t.) $E(u_k) \to \inf_u E(u)$. Since $E(u_k)$ is bounded and the set $\Omega$ is bounded in a Lipschitz regular domain, there exists a positive constant $C$ s.t. $\|u_k\|_{BV(\Omega)} \le C$. Rellich's compactness theorem states that there exist $u^{*} \in BV(\Omega)$ and a subsequence $\{u_{k_j}\}$ which converges to $u^{*}$ in the norm of $L^{1}(\Omega)$ as $j \to \infty$. Using the continuity of the fidelity term and the lower semi-continuity of the TV and wavelet regularization terms, one obtains $E(u^{*}) \le \liminf_{j} E(u_{k_j}) = \inf_u E(u)$. Then, $u^{*}$ is a solution of (19). □
3.4. Numerical Model
Several numerical methods exist for constrained optimization problems. At present, Projected Gradient Descent (PGD) and the Alternating Direction Method of Multipliers (ADMM) are widely used. ADMM was first proposed by Glowinski and Gabay, and was further developed by Boyd in 2011, who demonstrated that ADMM is applicable to large-scale distributed optimization problems [41]. ADMM is effective in solving distributed convex optimization problems, especially statistical learning problems [42,43]. Through a decomposition–coordination process, ADMM splits a large, complicated global problem into several smaller, solvable local sub-problems, which can be computed more easily and can converge to the global optimal solution through the coordination of the sub-problems. The disadvantages of the Projected Gradient Descent method are as follows: (1) it may converge to a local optimum; (2) the gradient at a saddle point is 0, yet a saddle point is not an optimal solution; (3) because of its computational complexity, it is time-consuming, especially for large-scale data. Thus, ADMM was selected to solve the constrained optimization problem in this paper.
The above minimization model can be rewritten in the form (22). The augmented Lagrangian of (22) includes a penalty term enforcing the box constraint on the image, and its weight parameters are positive. Within the ADMM framework, this saddle-point problem is solved by minimizing the augmented Lagrangian with respect to the primal variables alternately and then updating the dual variables. The algorithm is summarized in Algorithm 1.
According to the ADMM algorithm, the solution is decomposed into the following steps.
Algorithm 1 ADMM method for solving the TFLSB model (22)
Initialization: set the initial image, the auxiliary variables, the dual variables and the parameters; k = 0
While the loop stop conditions are not satisfied, do
    update u by solving its subproblem
    update the auxiliary variables associated with the TV, framelet and box-constraint terms by solving their subproblems
    Update dual variables
    k = k + 1
end while
output the solution
The subproblem for the saddle point of $u$ is solved by setting the derivative of the augmented Lagrangian with respect to $u$ to zero, which yields the $u$-update in closed form.
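To illustrate the kinds of sub-problem updates that appear in such an ADMM scheme, the sketch below implements three standard building blocks: soft-thresholding for the $\ell_1$/framelet term, projection onto a box constraint, and the magnitude-fitting step that keeps the measured Fourier modulus while retaining the current phase. These are generic, well-known operators, not the authors' exact update rules; the variable names and parameters are illustrative.

```python
import numpy as np

def soft_threshold(x, tau):
    """Proximal operator of tau * ||x||_1 (used for framelet/TV auxiliary variables)."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def project_box(u, lo=0.0, hi=1.0):
    """Projection onto the box constraint lo <= u <= hi."""
    return np.clip(u, lo, hi)

def fit_magnitude(Fu, b):
    """Replace the modulus of the current Fourier data Fu by the measured magnitude b,
    keeping the current phase (a common magnitude-projection step in phase retrieval)."""
    return b * np.exp(1j * np.angle(Fu))

if __name__ == "__main__":
    u = np.random.rand(64, 64)
    b = np.abs(np.fft.fft2(np.random.rand(64, 64)))  # synthetic measured magnitudes
    y = fit_magnitude(np.fft.fft2(u), b)             # magnitude-consistent Fourier data
    d = soft_threshold(np.random.randn(64, 64), 0.1) # sparsified auxiliary variable
    u = project_box(u + 0.01 * np.random.randn(64, 64))
    print(np.allclose(np.abs(y), b))                 # True: modulus now matches b
```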
4. Numerical Experiments
The initialization method for the proposed Algorithm 1 is as follows. The measurements $b_i$ containing noise are used directly in the calculations without any preprocessing, and the initial phase is chosen randomly without any specific requirement: it is drawn from the standard uniform distribution on the open interval (0, 1).
The quality of the reconstructed image involves two aspects: the visual effect and objective evaluation indices that quantify the difference between the reconstructed image and the original image. Since the evaluation of visual effects varies between observers, objective evaluation indicators are very important. The evaluation indices commonly used in the field of image reconstruction were adopted in this study to compare the proposed algorithm with other algorithms.
The peak signal-to-noise ratio (PSNR), structural similarity (SSIM) and signal-to-noise ratio (SNR) were used to measure the quality of the reconstruction, and the relative error was used to measure the convergence speed. PSNR is a comprehensive evaluation index of reconstructed image quality:
$$ \mathrm{PSNR} = 10 \log_{10} \frac{(2^{n}-1)^{2}}{\mathrm{MSE}}, \qquad \mathrm{MSE} = \frac{1}{HV} \sum_{i=1}^{H} \sum_{j=1}^{V} \left( u_{i,j} - u^{*}_{i,j} \right)^{2}, $$
where MSE is the mean square error of the currently computed image with respect to the ground truth image, $H$ and $V$ represent the number of rows and columns, respectively, and $n$ is the number of bits of storage per pixel.

SSIM (structural similarity) is a comprehensive evaluation of the image restoration quality in terms of brightness, contrast and image structure:
$$ \mathrm{SSIM}(u, u^{*}) = \frac{(2\mu_{u}\mu_{u^{*}} + c_1)(2\sigma_{u u^{*}} + c_2)}{(\mu_{u}^{2} + \mu_{u^{*}}^{2} + c_1)(\sigma_{u}^{2} + \sigma_{u^{*}}^{2} + c_2)}. $$
In general, let $c_1 = (k_1 L)^2$ and $c_2 = (k_2 L)^2$ with $k_1 = 0.01$, $k_2 = 0.03$ and $L$ the dynamic range of the pixel values.

The signal-to-noise ratio (SNR) is calculated as
$$ \mathrm{SNR} = 10 \log_{10} \frac{\|u^{*}\|_2^{2}}{\|u - u^{*}\|_2^{2}}. $$
The relative error is defined as follows:
$$ \text{relative-error} = \frac{\|u - u^{*}\|_2}{\|u^{*}\|_2}. $$
In all the above formulas, $u$ represents the current reconstructed image and $u^{*}$ represents the ground truth image.
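For reference, a minimal NumPy implementation of these indices could look like the following; it uses the global, single-window form of SSIM rather than the usual windowed average, which is an assumption made here for brevity.

```python
import numpy as np

def psnr(u, u_star, bits=8):
    mse = np.mean((u - u_star) ** 2)
    return 10 * np.log10((2 ** bits - 1) ** 2 / mse)

def ssim_global(u, u_star, dynamic_range=255.0, k1=0.01, k2=0.03):
    """Single-window (global) SSIM; published results typically use a windowed average."""
    c1, c2 = (k1 * dynamic_range) ** 2, (k2 * dynamic_range) ** 2
    mu_u, mu_v = u.mean(), u_star.mean()
    var_u, var_v = u.var(), u_star.var()
    cov = ((u - mu_u) * (u_star - mu_v)).mean()
    return ((2 * mu_u * mu_v + c1) * (2 * cov + c2)) / \
           ((mu_u ** 2 + mu_v ** 2 + c1) * (var_u + var_v + c2))

def snr(u, u_star):
    return 10 * np.log10(np.sum(u_star ** 2) / np.sum((u - u_star) ** 2))

def relative_error(u, u_star):
    return np.linalg.norm(u - u_star) / np.linalg.norm(u_star)
```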
4.1. Numerical Results
We also compared the performance of the proposed TFLSB model with three other related phase retrieval algorithms: the error reduction algorithm (ER) [3], the TVB method [18] and the Wirtinger Flow (WF) method [9]. The data were contaminated by Gaussian noise, $b = |\hat{u}| + \alpha\,\eta$, where $|\hat{u}|$ denotes the Fourier transform magnitudes of the original image, $\alpha$ represents the noise weight, $\eta$ represents the white Gaussian noise, and $b$ is the measurement of the real object's Fourier transform magnitude.
The test simulation images are available on the Internet (no copyright restrictions). Our research focused on the phase retrieval problem from Fourier transform magnitudes. Its application field covers remote target imaging. Therefore, the selected images have obvious geometric features and simple textures, with a size of 256 × 256.
In order to study the algorithms' robustness to noise, Gaussian noise at an SNR of 60 dB was added to the measured values. The noise level was set according to the SNR formula and the rand function.
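One plausible way to generate measurement noise at a prescribed SNR, consistent with the description above (the exact scaling used in the experiments is not specified, so this is an assumption), is:

```python
import numpy as np

def add_gaussian_noise_at_snr(b, snr_db, rng=np.random.default_rng(0)):
    """Add white Gaussian noise to the magnitude data b so that the result has the given SNR (dB)."""
    noise = rng.standard_normal(b.shape)
    # Scale the noise so that 10*log10(||b||^2 / ||alpha*noise||^2) = snr_db.
    alpha = np.linalg.norm(b) / (np.linalg.norm(noise) * 10 ** (snr_db / 20))
    return b + alpha * noise

if __name__ == "__main__":
    b = np.abs(np.fft.fft2(np.random.rand(256, 256)))
    b_noisy = add_gaussian_noise_at_snr(b, 60.0)
```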
The TVB method and the TFLSB algorithm use the same structured illumination pattern in this study. The relevant parameters were set as follows: the number of iterations was 1000, with fixed regularization and penalty parameters. Since $s_1 = s_2 = N + 0.5$ ($N$ is a positive integer) guarantees that the underlying image has an optimal solution, $s_1$ and $s_2$ can be set to 0.5, 1.5 or 2.5; in this experiment we fixed $s_1$ and $s_2$ accordingly. Other value settings, such as 1.5, 2.5 and 3.5, could also obtain visually good reconstructed images and can be modified according to different images. The results are shown in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6.
The numerical evaluation indices of a phase retrieval method reflect how closely the distribution characteristics of the computed image approximate those of the real object, while visual quality includes sharpness, smoothness and similarity. The comparison of the reconstructed image quality for the three images in Figure 1, Figure 3 and Figure 5 is as follows. Firstly, the ER and TVB algorithms introduced obvious defects in the originally smooth background areas. Secondly, the reconstructed image of the ER algorithm was blurry overall, while the reconstructed image of WF was sharper; these algorithms do not strike a good balance between image smoothness and boundary sharpness. Although the constrained performance of the TVB algorithm was better, its staircase effect produced obvious defects that cannot be ignored. Overall, compared with the other algorithms, the proposed TFLSB model solved with the ADMM algorithm performed better, and the reconstructed images were more visually pleasing with few noticeable artifacts.
The comparison results in Figure 2, Figure 4 and Figure 6 show that the proposed algorithm was more stable, converged to the optimal value and always had the lowest relative error when dealing with images of different complexities.

Table 1, Table 2 and Table 3 show the image restoration quality in terms of the numerical evaluation indicators PSNR, SSIM, SNR and processing time. With 1000 iterations, our algorithm required more time due to its computational complexity; however, given the speed of iterative convergence, the solving time could be shortened by reducing the number of iterations. The PSNR, SSIM and SNR indices directly reflect the pixel correspondence between the reconstructed image and the original image. According to the results of the three reconstructed images, the TFLSB model proposed in this paper improved the PSNR index by 111.59%, 108.87% and 57.14% compared with the ER, WF and TVB algorithms, respectively; the SSIM index improved by 174.49%, 194.74% and 84.37%, and the SNR index improved by 674.89%, 517.72% and 241.47%, respectively. The comparison results in Table 1, Table 2 and Table 3 directly show that our proposed TFLSB model is more robust to noise and more efficient at reconstructing images than the other methods.
4.2. Sensitivity with Complex Noise
Image acquisition is affected not only by Gaussian noise, but also by electromagnetic interference in the environment and internal sensor errors, which introduce salt–pepper noise. Salt–pepper noise, also known as impulse noise, appears in the image as discretely distributed pure white or black pixels. The image reconstruction ability of the TFLSB model was studied on images in which Gaussian noise and salt–pepper noise coexist. Given the good performance of the proposed algorithm in the previous task, in this experiment we challenged it with images with more complex textures to study its robustness in the presence of complex noise. The images are public images released by the Kodak Company; for consistency in this paper, the images used below are 256 × 256 crops of the original Kodak images, named "Bird", "Hat" and "Tower".
The data were contaminated with Gaussian noise and salt–pepper noise, i.e., $b = |\hat{u}| + \alpha\,\eta + p$, where $|\hat{u}|$ is the magnitude of the spatial spectrum of the original image, $\alpha$ represents the noise weight, $\eta$ represents the white Gaussian noise, $p$ denotes the salt–pepper noise, and $b$ is the measurement of the Fourier transform magnitude of the real object. The salt–pepper noise was generated with a random function and a threshold. First, a random matrix was generated, whose values were drawn from the standard uniform distribution on the open interval (0, 1). Then, using the threshold, the elements of the random matrix were converted to the integers 0 or 1.
Thus, the complex noise is described by the formula above. In this experiment, the noise weight $\alpha = 1$ and the threshold of the salt–pepper noise was 0.9: when the value generated by the random function exceeds 0.9, white (salt) noise is produced. This salt–pepper setting means that 10% of the measurements were corrupted, which is more severe than in a real experiment. The results are shown in Figure 7.
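A possible implementation of the noise-generation procedure described above (the variable names and the way the 0/1 mask is scaled onto the measurements are assumptions for illustration) is:

```python
import numpy as np

def salt_pepper_mask(shape, threshold=0.9, rng=np.random.default_rng(0)):
    """Uniform(0,1) random matrix thresholded to a 0/1 mask: 1 marks a corrupted entry."""
    return (rng.uniform(size=shape) > threshold).astype(float)

def add_complex_noise(b, alpha=1.0, threshold=0.9, rng=np.random.default_rng(0)):
    """Combine white Gaussian noise (weight alpha) with salt noise on ~10% of the measurements."""
    gaussian = alpha * rng.standard_normal(b.shape)
    salt = salt_pepper_mask(b.shape, threshold, rng) * b.max()  # assumed salt amplitude
    return b + gaussian + salt

if __name__ == "__main__":
    b = np.abs(np.fft.fft2(np.random.rand(256, 256)))
    b_noisy = add_complex_noise(b)
    print("fraction corrupted:", np.mean(salt_pepper_mask(b.shape) > 0))  # about 0.1
```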
By comparing the reconstructed images with the original images, we can observe that the images reconstructed under complex noise retain good visual clarity. The numerical values in Table 4 show that the PSNR values of the restored images varied from 23.606 to 29.4759, the SSIM values varied from 0.355219 to 0.443401, and the SNR values varied from 24.715 dB to 27.188 dB. These data suggest that the proposed sparse prior regularization model TFLSB is robust even under complex noise and is also effective at processing images with complex textures. In the future, its application range could be determined by studying the relationship between noise levels, parameter settings and image complexity.
5. Experimental Results
Experimental equipment was set up to simulate the reconstruction of an underlying image from Fourier transform magnitudes. The process consisted of two parts. The first was data collection, during which the spatial spectrum modulus of the target was obtained through correlation computing. The second was the use of the phase recovery algorithm to compute the reconstructed image.
The schematic diagram of the experiment is shown in Figure 8. The working wavelength selected for the laser was 532 nm. The laser became a pseudothermal light source after passing through the rotating glass. In order to reduce extra stray light caused by reflection, this experiment adopted a transmission target, made by hollowing out the target pattern on a metal plate, with an image size of 3 × 3 mm; the structure is shown in Figure 9.

The pixel size of the CCD camera was 6.5 μm, and the number of detection units was 2048 × 2048. In the experiment, the CCD detection frequency was 20 Hz, the exposure time was 30 ms, and the glass rotation speed was 0.3°/s. A SIM structured light modulator using an internal DMD array was set up to control the structured light. The light field distribution of structured light mode 1 is a two-dimensional sinusoidal distribution of phases constrained to the interval (0, π). The light field distribution of structured light mode 2 is the same pattern with an additional phase shift relative to pattern 1.
In order to reduce the influence of the environment on the phase retrieval experiment, data acquisition was divided into two steps. First, with the target not illuminated by the light source, the detector recorded the measurement values of the experimental environment and the inherent defects of the measurement system, representing the background noise. Then, the measurement values were recorded with the light source illuminating the target. Taking the difference between the two recordings offsets part of the noise.
The TFLSB algorithm proposed in this paper was used for phase retrieval, and the results are shown in Figure 10.

Due to the limited experimental conditions, the acquisition frequency of the CCD camera was limited, as was the target spatial spectrum information obtainable from the pseudothermal light source, making the acquired target information very scarce and increasing the difficulty of phase recovery. When structured light illuminates an area, the two optical paths should have the same frequency and no phase delay, but in practical applications it is difficult to ensure that the optical path difference between the two paths is exactly 0; this error reduces the reconstructed image quality. From a visual evaluation, the resolution of the reconstructed image is not as good as in the previous numerical simulations, and the evaluation indices PSNR, SSIM and SNR are not ideal; additional research on phase retrieval should be carried out in the future. Nevertheless, the numerical results in Table 5 show that our proposed TFLSB model can reconstruct images in practice.
6. Conclusions
In this paper, we introduced an innovative TV and framelet-based regularization minimization phase retrieval model with a box constraint (TFLSB) for image recovery from magnitudes degraded by Gaussian and salt–pepper noise. Our proposed model incorporates TV and analysis-based wavelet regularization, enabling the enforcement of sparse priors in the gradient domain and the spatial structure domain simultaneously. Through heuristic analysis, we identified the key parameter setting s1 = s2 = N + 0.5 (N being a positive integer) that contributes to stable phase recovery with 3n1n2 measurements; such structured light can easily be obtained using a structured light modulator. The TFLSB model effectively reconstructs high-quality latent images from corrupted measurement data obtained with a structured lighting model. Comparative evaluations against the ER, WF and TVB algorithms demonstrate that the images reconstructed by the TFLSB model exhibit superior quality, with clearer edges and fewer artifacts. The evaluation indices PSNR, SSIM and SNR further confirm the significant enhancement in image quality achieved by the numerical theory and the TFLSB model. Additionally, our study investigated the robustness of the TFLSB model against Gaussian and salt–pepper noise, revealing its resilience against complex noise. This provides a useful direction for the practical implementation of phase retrieval from Fourier transform magnitudes.
Furthermore, in practice, the proposed method is able to reconstruct the underlying image from Fourier transform magnitudes, helping to solve the phase retrieval problem in practical environments. Under the existing experimental conditions, the reconstructed images obtained with the TFLSB algorithm are not clear enough, but an optimal solution exists, which verifies the stability and feasibility of the proposed algorithm.
It is important to note that phase retrieval remains a challenging deconvolution problem, and several open questions require further exploration. The proposed algorithm was the most time-consuming of the algorithms compared, which is not conducive to real-time imaging applications. One potential improvement is the use of a more powerful industrial computer to reduce the computation time. Future work will focus on developing faster algorithms with a second-order convergence rate to reduce processing time. Additionally, we aim to incorporate more priors to enhance the quality of the reconstructed solutions for a broader range of image categories. While the proposed phase retrieval model is currently applicable to oversampling scenarios (structured illumination patterns), our future research will explore additional patterns to enable the exact recovery of latent images in various settings.