Next Article in Journal
Study on the Removal Depth of the Surface Plastic Domain of Silicon-Modified Silicon Carbide
Previous Article in Journal
A Concise and Adaptive Sidelobe Suppression Algorithm Based on LMS Filter for Pulse-Compressed Signal of Φ-OTDR
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complex Noise-Based Phase Retrieval Using Total Variation and Wavelet Transform Regularization

1
Xi’an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi’an 710119, China
2
University of Chinese Academy of Sciences, Beijing 100049, China
3
Beijing Institute of Tracking and Telecommunications Technology, Beijing 100094, China
*
Author to whom correspondence should be addressed.
Photonics 2024, 11(1), 71; https://doi.org/10.3390/photonics11010071
Submission received: 26 October 2023 / Revised: 15 December 2023 / Accepted: 18 December 2023 / Published: 10 January 2024
(This article belongs to the Section Optical Communication and Network)

Abstract

:
This paper presents a phase retrieval algorithm that incorporates sparsity priors into total variation and framelet regularization. The proposed algorithm exploits the sparsity priors in both the gradient domain and the spatial distribution domain to impose desirable characteristics on the reconstructed image. We utilize structured illuminated patterns in holography, consisting of three light fields. The theoretical and numerical analyses demonstrate that when the illumination pattern parameters are non-integers, the three diffracted data sets are sufficient for image restoration. The proposed model is solved using the alternating direction multiplier method. The numerical experiments confirm the theoretical findings of the lighting mode settings, and the algorithm effectively recovers the object from Gaussian and salt–pepper noise.

1. Introduction

Because of the loss of phase information, reconstruct the underlying image from Fourier transform magnitudes is defined as a phase retrieval (PR) problem. Phase retrieval problems are ill-posed, because different images can have the same Fourier transform results, so the solution is not unique. It is necessary to know the image’s prior information, such as domain, nonnegative, sparse representation [1,2], etc., to constrain the numerical process to obtain the global saddle-point solution. Readers can consult [1,2,3] for applications in regularizing seismic image beyond aliasing via gradient and spectral techniques and additional information.
Gerchberg and Saxton [4] proposed an error reduction method (ER) based on projection; later, in 1982, this method was improved by Fienup’s hybrid inputs–outputs (HIO) in [1]. Derived iterative projection algorithms include hybrid projective–reflection [3,5], the iterative difference map (DF) [6] and the relaxed averaged alternation reflection (RAAR) algorithms [7]. In addition, Marchesini adopted the saddle-point optimization method [8] to solve the PR problem. The alternate projection method lacks convergent guarantee because of its alternating projection process that does not have nonconvex constraint sets. Thus, the solution converges to a local stationary point rather than the global optimal solution. Moreover, gradient-type methods have become popular; one example is the Wirtinger Flow (WF) method [9] proposed by Candès, Li and Soltanolkotabi who put forward a gradient scheme with novel update rules that are carefully initialized by the means of a spectral method. Gradient-based approaches usually have first-order convergence. Farrell solved the phase retrieval (PR) problem with a network where every agent only contains a subset of the measurements [10]. A convex method is characterized by the use of convex relationships of the quadratic equations or semidefinite programming. Candès, Strohmer and Voroninski proposed Phaselift in [11], which uses SDP lifting techniques to formulates convex trace (nuclear) norm minimization. Moretta study the impact of constraints in Phaselift in [12]. PhaseCut [13], proposed by Waldspurger, is a convex method that separates phase information and magnitudes. Yin and Xin proposed PhaseLiftOff [14], a nonconvex variant of Phaselift which cut out the Frobenius norm from the trace norm to retrieve phase information with fewer measurements. Xia studied the sparse phase retrieval method to recover a k-sparse signal in [15]. Gao studied the adaptive sparse signal reconstruction algorithm in [16].
In recent years, total variation (TV) regularization has been successfully used to solve image blind deblurring optimization problems with certain specific blurring kernels. In addition, framelet-based regularization has been adopted to solve deblurring problem with motion blur kernels. However, there is huge potential area for them to be used in phase retrieval problems from Fourier transform magnitudes. In this paper, we propose a phase retrieval algorithm based on TV regularization and the analysis-based sparsity framelet transform method to recover the image through an analysis matrix. We explicitly focus on the sparsity of image representation in phase retrieval problems from Fourier transform magnitudes. Total variation regularization can effectively guarantee a sparsity prior for the gradient domain of ground truth images with a small TV semi-norm and framelet transform can enforce a sparsity prior of target images under the redundant tight frame.
In the field of image deblurring and phase recovery, a single Gaussian noise is usually used to test the robustness of phase recovery algorithms against noise. In the field of image denoising, the robustness of the algorithm to salt-and-pepper noise is studied. Gaussian noise and salt-and-pepper noise are typical types of noise, which exist simultaneously in actual measurements. In order to simulate realistic measurement conditions, the reconstruction ability against two kinds of noise is studied for the first time. In order to simulate more realistic measurement conditions, this paper innovatively proposes to study the robustness of the algorithm with complex noise when two kinds of noise exist simultaneously. This is a very challenging problem.
Phase retrieval problems from Fourier transform magnitudes are an ill-posed problem, and these problems are more difficult than phase retrieval problems derived from motion blur. Readers can consult the work [17,18] of E. J. Candes, who studies the relationship between the number of measurements and the possibility of reconstructing an image. For research on further applications for reconstructing images from partial measurements, please see the work of Chang Huibin [19]. On that basis, our research shows that when the illumination source meets a certain parameter setting, three sets of measured values can reconstruct an image.

2. Foundations of Phase Retrieval

2.1. Phase Retrieval Model

Phase retrieval is a branch of deconvolution problems and can be expressed as follows:
b = | u | + η
  :   R e a l n 1 × n 2     C o m p l e x n 1 × n 2  is the two-dimensional discrete Fourier transform.  b  is the measurement magnitude with noise  η . For the convenience of calculation and implementation, we use vectors to represent two-dimensional discrete images in lexicographical order (connected by columns). To more easily present the modules, the symbols do not change.  u  is defined on a discrete lattice  Ω = { 0 , 1 , , n 1 × n 2 1 } R n 1 × n 2  of size  n 1 × n 2 , which denotes that image u is connected by columns. The corresponding discrete Fourier transform is expressed as follows:
u ( ω 1 + ω 2 n 2 ) = 1 n 1 n 2 0 t 1 n k 1 t 2 n k 1 u ( t 1 + n 1 t 2 ) exp ( i 2 π ( ω 1 t 1 n 1 + ω 2 t 2 n 2 ) 0 ω 1 n k 1 ,   0 ω 2 n k 1

2.2. Total Variation

The total variation (TV) model of image processing based on partial differential equation (PDE) variation method is known as the classic Rudin Osher Fatemi (ROF) model in [20]. It is one of the most common models used for image restoration, and generally consists of a fidelity term, regularization penalty items and regularization parameters. In recent years, the TV model is widely used in image denoising and other fields; readers can consult [21,22,23] for detail. Total variation regularization can effectively guarantee a sparsity prior for the gradient domain of the underlying images which have a small norm.
In the following equations,  u  denotes the image for phase retrieval which is represented as discrete 2-dimensionl matrices with a size of  m × n . The total variation in the discrete domain calculates the gradient of u, which is denoted by a gradient operator  u :
u p , q = ( 1 u p , q , 2 u p , q ) = ( d x ( u p , q ) , dy ( u p , q ) ) 1 u p , q = { u p , q u p , q + 1       i f   q < n 0                          i f   q = n 2 u p , q = { u p , q u p + 1 , q       i f   p < m 0                         i f   p = m
where  u p , q  is an element of u,  1 u p , q  is the gradient of the horizontal direction and  2 u p , q  is the gradient of the vertical direction.
The total variation (TV) regularization term of u is indicated as follows:
T V ( u ) = p m , q n | u p , q |
For the restoration model with additive noise pollution data, it can be indicated as follows:
u = arg min u L ( u ) ε L ( u ) = λ 2 | u | b Ω 2 + T V ( u )
where λ > 0 is the parameter of the total variation regular term. The total variation regularization penalty term is the semi-norm of the image gradient. One type of TV variant, namely the isotropic TV, is defined by
T V ( u ) 1 = ( d x ( u ) ) 2 + ( d y ( u ) ) 2 = p = 1 m 1 q = 1 n 1 ( u p , q u p , q + 1 ) 2 + ( u p , q u p + 1 , q ) 2 + p = 1 m 1 | u p , n u p + 1 , n | + q = 1 n 1 | u m , n u q + 1 |
Another type is anisotropic TV, which is defined by
T V ( u ) 2 = d x ( u ) + d y ( u ) = p = 1 m 1 q = 1 n 1 { | u p , q u p , q + 1 | + | u p , q u p + 1 , q | } + q = 1 n 1 | u m , q u q + 1 | + p = 1 m 1 | u p , n u p + 1 , n |

2.3. Wavelet

Images have sparse representation or approximation in redundant transformation, such as tight frame transform [24,25]. Wavelet [26] is one type of tight frame. A large number of studies have shown that sparsity prior and low rank prior regularization enable the corresponding algorithm to produce a high-quality solution [27,28].

2.3.1. Tight Framework

A tight frame in a Hilbert space is introduced as follows, and the interested readers can consult [27] for more in-depth study. Let       denote the norm of a variable in a Hilbert space  . The sequence  { ϕ n } n L  constructs a tight frame in  , when
f = ϕ n f , ϕ n ϕ n   ,   f f 2 2 = n L | f , ϕ n | 2 ,   f
where  f , ϕ n  indicates an inner product and       indicates the norm of  .
For the bounded sequence  { ϕ n } n L , let  W  denote the analysis operator and let  W  denote its adjoining operator; they are defined by
W :   f { f , ϕ n } l 2 ( N ) ,   W : { a n } l 2 ( N ) n L a n ϕ n
The sequence  { ϕ n } n L  constructs a tight frame when  W W = I ; therein,  I   :    is the identity operator.
In the following equations, u denotes the image for phase retrieval.
g = W u
u = W ( W u ) = W g
where g indicates the tight framelet transform coefficients.
The phase retrieval deconvolution optimization model based on synthetic sparsity representation is
W :   u { u , ϕ n } l 2 ( N ) ,   W : { a n } l 2 ( N ) n L a n ϕ n
The deconvolution optimization model based on analytical sparsity representation is
g ¯ = arg min u R m × n   Φ ( k u f ) + λ g 1
The deconvolution optimization model based on synthesis sparsity representation is
g ¯ = R u ¯ u ¯ = arg min u R m × n   Φ ( k u f ) + λ W u 1
The tight frame is one type of orthonormal basis generalization; redundant frame has been found to be useful in image and signal processing [25,29,30]. Analysis operator W is redundant when its column dimension is smaller than the row dimension and the two methods generate different results. The synthesis-based method aims to obtain the most spars result among all possible transform coefficient vectors, while the analysis-based method seeks the most spar solutions among all possible canonical framelet coefficient vectors. Thus, the analysis-based method is a strict subset of synthesized-based methods. The solutions of analysis-based methods are close to the underlying image have increased smoothness. This has been proven empirically in many experiments. Since a solution with a certain smoothness will have a better visual quality, the analysis-based approach was chosen for this study. The two approaches are equivalent only if  W W = W W = I . Then, the tight frame becomes a canonical orthogonal transformation.

2.3.2. Wavelet Tight Framework

A one-dimensional wavelet frame is constructed by a finite set of generators with shifts and dilations,  Ψ = { ψ 1 , ψ r } L 2 , where  ψ  is called the father wavelet and  ψ j , k  is called the wavelet. A refinable function is usually used to construct a tight wavelet frame,  φ ( t ) L 2 ( R ) , which is also known as the scale function or mother wavelet. This satisfies the two-scale equation  ψ ^ i ( 2 ω ) = h i ϕ ^ ( ω ) , where  h i  are  2 π  periodic trigonometric polynomials, which satisfies  h 0 ( 0 ) = 1 .
The unitary extension principle (UEP) [31] declares that  X ( Ψ )  forms a tight frame when
h 0 ( ω ) h 0 ( ω + γ π ) ¯ + i = 1 r h i ( ω ) h i ( ω + γ π ) ¯ = δ γ , 0   ,   γ = 0 , 1
Image processing uses two-dimensional information, so the two-dimensional wavelet frames are needed. Indeed, a two-dimensional wavelet is a tensor product of a one-dimensional wavelet. S. Malat and Meyer proposed MRA theory [31,32,33], which studies the multi-resolution analysis properties of wavelets from the perspective of a function space, and provides a unified theory for constructing a wavelet framework and a fast algorithm for orthogonal wavelet transform. In this study, a two-level piecewise linear B-spline tight frame system with tensor product filters was adopted. The piecewise linear B-spline tight frame is the simplest system in this family, which employs piecewise linear B-spline functions as  φ . The corresponding tensor product filter is:
a 0 , 0 = 1 16   ( 1 2 1 2 4 2 1 2 1 ) a 0 , 1 = 2 16 ( 1 0 1 2 0 2 1 0 1 ) a 0 , 2 = 1 16 ( 1 2 1 2 4 2 1 2 1 ) a 1 , 0 = 2 16 ( 1 2 1 0 0 0 1 2 1 ) a 1 , 1 = 1 8   ( 1 0 1 0 0 0 1 0 1 ) a 1 , 2 = 2 16 ( 1 2 1 0 0 0 1 2 1 ) a 2 , 0 = 1 16   ( 1 2 1 2 4 2 1 2 1 ) a 2 , 1 = 2 16 ( 1 0 1 2 0 2 1 0 1 ) a 2 , 2 = 1 16 ( 1 2 1 2 4 2 1 2 1 )
MATLAB software R2016b was used for the calculations. We used the wavelet frame decomposition algorithm [25], whose construction varies with boundary setting conditions. In this research, Neumann (symmetric) boundary conditions were adopted. Interested readers can refer to [27,31,34] for the principles of generating such matrices.

3. Proposed Model and Numerical Algorithm

3.1. Formulation of Minimization Model

Our work is closely related to article [17]. where E. J. Candes discussed the possibility of reconstructing the target from incomplete sampling, and  7 n 1 n 2  measurements (the size of ground truth image is  n 1 × n 2 ) were suggested in order to uniquely recover real-valued images. Chang Huibin proposed a TV regularization [19] model which can recover images from noisy measurements. In this research, a TV and wavelet-based co-exist regularization model is proposed to recover underlying images from  3 n 1 n 2  measurements with noise. The measurements were obtained by structured light illumination and image value with box constraint  u o b j e c t [ 0 , 1 ] . The model is introduced as follows:
f i n d   u s . t .   b 0 : = | ( u o b j e c t ) | ,   b 1 : = | ( u o b j e c t + D s u o b j e c t ) | ,   b 2 : = | ( u i D s u o b j e c t ) |
where   denotes the Fourier transform. Let
R ( u o b j e c t ) : = { R 0 , R 1 , R 2 }
The data R( u o b j e c t ) are obtained using three-light-field illumination
| R ( u o b j e c t ) | = { 1 ,   | 1 + exp ( i 2 π ( s 1 t 1 n 1 + s 2 t 2 n 2 ) ) | ,   | 1 + exp ( i 2 π ( s 1 t 1 n 1 + s 2 t 2 n 2 ) π 2 ) | }
where
D s u o b j e c t ( t 1 + n t 2 ) = exp ( 2 π s 1 + t 1 n 1 + 2 π s 2 + t 2 n 2 ) u o b j e c t ( t 1 + t 2 n 1 ) ,   0 t 1 n k 1 ,   0 t 2 n k 1
The research in [19,35] showed that the least square method produces a unique result for the PR problem with these three sets of data. The model is referred to as the least-square minimization problem with a box-constrained (LSB) model [35].
min 0 u 1 | R 0 | b 0 Ω 0 2 + | R | 1 b 1 Ω 1 2 + | R 2 | b 2 Ω 2 2
This study focused on the sparse prior of objects; we incorporated TV and wavelet transform regularization into (17) to guarantee an exact solution. The analysis-based sparsity approach under wavelet tight frame decomposition was selected for this study. The proposed TV and framelet-based minimization problem of the least-square type with a box constraint (TFLSB) model is presented as follows:
min 0 u 1 ε T F L S B = λ 2 ( | R 0 | b 1 Ω 0 2 + | R 1 | b 2 Ω 1 2 + | R 2 | b 3 Ω 2 2 ) + T V ( u ) + W u 1
where we adopt anisotropic TV regularization to be the term of TV(u),  W u  is the wavelet decomposition of  u , and      1  denotes the L-1 norm.

3.2. Uniqueness Analysis

Hayes [36] and Sanz [37] proved that using double the number of measurements can uniquely determine the solution of PR when the underling signal is nonnegative and finitely supported. The results are further extended with random oversampling [38,39,40], where random illumination guarantees the absolute uniqueness and resolves all types of problems. We found that when  s 1 = s 2 = N + 0.5  (where N is a positive integer), 3n1n2 measurements and additional constraints such as the underlying object  u  is real and has a non-negative value, the algorithm can produce a unique solution.
Theorem 1.
Assume that  u m , n [ 0 , 1 ]  and the DFT of u and Ds u are nonvanishing, when s1 and s2 are prime with m and n, and s1 = s2 = N + 0.5 (N is a positive integer). Then, u can be recovered with 3 mn measurements in (15).
Proof. 
This theorem applies to one-dimensional and two-dimensional cases. The complete proof is shown in Appendix A. □

3.3. Solution Existence Analysis

Theorem 2.
Let Ω denote a bounded set in Lipschitz regular domain and comprehensive data b = (b0, b1, b2) is non-negative; then, the TFLSB model (19) has at least one minimum solution  u  * ∈ BV (Ω).
Proof. 
It should be noted that  ε TFLSB   ( u )     0 . A minimizing sequence {uk}0 ≤ k ≤ ∞ satisfies (s.t.)  ε TFLSB   ( u 0 )     ε TFLSB   ( u 1 ) . It is known that  v a l u e ( u k ) [ 0 , 1 ] . The set Ω is bounded in a Lipschitz regular domain, C is a positive constant and exists as the upper limit, and s.t.  T V ( u k ) + W u k + u k 1 + + W u k 1 C . As Rellich’s compactness theorem state that there is  u B V ( Ω )  and a subsequence  { u n k } k 1 , which meet  u n k u  in the norm of  L 1 ( Ω )  when  k . Using the continuity of the fidelity term of  ε T F L S B  and lower semi-continuity of TV and wavelet regularization, one obtains  lim sup u n k u ε T F L S B ( u k ) T F L S B ( u ) . Then, u* is the one solution of  ε T F L S B . □

3.4. Numerical Model

There are several numerical methods that are used for the constrained optimization problem. At present, the Projected Gradient Descent (PGD) and Alternating Direction Method of Multipliers (ADMM) are widely used. The Alternating Direction Method of Multipliers (ADMM) was first proposed by Glowinski and Gabay, and further improved by Boyd in 2011, who demonstrated that ADMM is applicable in large-scale distributed optimization problems [41]. ADMM is a computational method for optimization problems, and is effective in solving distributed convex optimization problems, especially statistical learning problems [42,43]. Through the process of decomposition–coordination, ADMM disassembles the large complicated global problem into several smaller solvable local sub-problems, which can be computed more easily and could converge to the global optimal solution through the coordination of the sub-problems. The disadvantages of the Projected Gradient Descent method are as follows: (1) it may converge to the local optimal solution; (2) the differential at saddle point is 0, but it is not the optimal solution; (3) because of its computational complexity, it is a time-consuming method, especially when the data is large-scale. Thus, ADMM was selected to solve the constrained optimization problem in this paper.
The above minimum problem model can be described by the formula:
min p 1 1 + p 2 1 + λ 2 | z 0 | b 0 Ω 0 2 + λ 2 | z 1 | b 1 Ω 1 2 + λ 2 | z 2 | b 2 Ω 2 2 + χ ( v ) s . t . v = u       p 1 = u   p 2 = w u     z 0 = R 0   z 1 = R 1   z 2 = R 2
where
χ ( μ ) = { 0 ,   0 v 1 ,   otherwise
The augmented Lagrangian equation of  T F L S B  reads
( u , z 0 , z 1 , z 2 , p , v ) = p 1 1 + p 2 1 + λ 2 | z 0 | b 0 Ω 0 2 + λ 2 | z 1 | b 1 Ω 1 2 + λ 2 | z 2 | b 2 Ω 2 2 + χ ( v ) + R d 0 , z 0 R 0 + ρ 1 2 z 0 R 0 Ω 0 2 + R d 1 , z 1 R 1 + ρ 1 2 z 1 R 1 Ω 1 2 + R d 2 , z 2 R 2 + ρ 1 2 z 2 R 2 Ω 2 2 + q 1 , p 1 u   + ρ 3 2 p 1 u 2 2 + q 2 , p 2 W u   + ρ 4 2 p 2 W u 2 2 + w , u v + ρ 2 2 u v 2 2 + t H v 2 = p 1 1 + p 2 1 + λ 2 | z 0 | b 0 Ω 0 2 + λ 2 | z 1 | b 1 Ω 1 2 + λ 2 | z 2 | b 2 Ω 2 2 + χ ( v ) + ρ 1 2 z 0 R 0 + d 1 ρ 1 Ω 0 2 + ρ 1 2 z 1 R 1 + d 2 ρ 1 Ω 1 2 + ρ 1 2 z 2 R 2 + d 3 ρ 1 Ω 2 2 + ρ 2 2 u v + w ρ 2 2 2 + ρ 3 2 p 1 u + q 1 ρ 3 2 2 + ρ 4 2 p 2 W u + q 2 ρ 4 2 2 + t H v 2
where  q 1 , q 1 : Ω R 2 H v  is the position penalty term of the image, and  λ , ρ 1 , ρ 2 , ρ 3 , ρ 4  are weight parameters and are positive. The ADMM solution framework for the above saddle-point solution, minimizes  T F L S B  with regardto  u , z , p , v  alternately and then update the dual variables  d 0 d 1 d 2 d 3  and  q 1 , q 2 . The algorithm is summarized in the Algorithm 1.
According to the ADMM algorithm, the solution is decomposed into the following steps.
Algorithm 1 ADMM method for solving the TFLSB model (22)
Initialization:
         k = 0 ,   z i 0 = b i ( ω ) exp ( 2 π i θ i ) , p 0 = 0 ,   v 0 = 0 ,   d i 0 = 0 ,   q 1 0 = 0 ,   q 2 0 = 0
While the loop stop conditions are not satisfied, do
         u k + 1 = arg min u L T F L S B ( u , z i k , p 1 k , p 2 k , v k , d i k , w k , q 1 k , q 2 k )  
         z i k + 1 = arg min z i L T F L S B ( u k , z i , p 1 k , p 2 k , v k , d i k , w k , q 1 k , q 2 k )
         p 1 k + 1 = arg min p 1 L T F L S B ( u k , z i k , p 1 , p 2 k , v k , d i k , w k , q 1 k , q 2 k )
         p 2 k + 1 = arg min p 2 L T F L S B ( u k , z i k , p 1 k , p 2 , v k , d i k , w k , q 1 k , q 2 k )
         v k + 1 = arg min v L T F L S B ( u k , z i k , p 1 k , p 2 k , v , d i k , w k , q 1 k , q 2 k )
        Update dual variables
         { d 1 k + 1 = d 1 k + ρ 1 ( z 0 k + 1 R 0 k + 1 ) d 2 k + 1 = d 2 k + ρ 1 ( z 1 k + 1 R 1 k + 1 ) d 3 k + 1 = d 3 k + ρ 1 ( z 2 k + 1 R 2 k + 1 ) w k + 1 = w k + ρ 2 ( u k + 1 v k + 1 ) q 1 k + 1 = q 1 k + ρ 3 ( p k + 1 u k + 1 ) q 2 k + 1 = q 2 k + ρ 4 ( p 2 k + 1 w u k + 1 )
        k = k + 1
end while
output the solution  u = u k + 1  
The subproblem to solve the saddle point of u is
min u ρ 3 2 p u + q 1 ρ 3 2 + ρ 4 2 p 2 W u + q 2 ρ 4 2 2 + ρ 2 2 u v + q 2 ρ 2 2 + ρ 1 2 z 0 u + d 1 ρ 1 Ω 0 2 + ρ 1 2 z 1 ( u + D s u ) + d 2 ρ 1 Ω 1 2 + ρ 1 2 z 2 ( u i D s u ) + d 3 ρ 1 Ω 2 2
By calculating the derivative, the solution is
Δ u + ρ 2 ρ 3 u + ρ 4 ρ 3 u + ρ 1 ρ 3 ( 5 I + 2 R ( D s ) + 2 E ( D s ) ) u = ρ 1 ρ 3 ( R ( ( z 0 + d 1 ρ 1 ) ) + R ( ( z 1 + d 2 ρ 1 ) + D s ( z 1 + d 2 ρ 1 ) ¯ ) + R ( ( z 2 + d 3 ρ 1 ) i D s ( z 2 + d 3 ρ 1 ) ¯ ) ) + ρ 2 ρ 3 ( v q 2 ρ 2 ) d i v ( p 1 + q 1 ρ 3 ) + ρ 4 ρ 3 W T ( p 2 + q 2 ρ 4 )

4. Numerical Experiments

The initialization method for the proposed Algorithm 1 is as follows.
The measurements bi containing noise are used directly for the calculations without any processing. The initial phase is given randomly without any specific request.
The initialization for variables  v 0  and  z i 0  was chosen to be
z i 0 ( ω ) = { b i ( ω ) exp ( 2 π i θ 0 ) if ω Ω 0 otherwise
and  v 0 = z 0 0 = ( b 0 ( ω ) exp ( 2 π i θ 0 )   ) , where initial phase  θ 0  is derived from the standard uniform distribution in the open interval (0, 1).
The quality of the reconstructed image includes two aspects: one is the visual effect and the other is the evaluation index to analyze the difference between the reconstructed image and original image. The evaluation of visual effects varies with different people’s visual conditions. Objective evaluation indicators are very important. The affluent evaluation index of the reconstruction image evaluation used in the field of image reconstruction was adopted in this study to compare the proposed algorithm with other algorithms.
The peak signal-to-noise ratio (PSNR), structural similarity (SSIM) and signal-to-noise ratio (SNR) were used to measure the quality of the reconstruction, and the relative error (relative-error) was used to measure the convergence speed. PSNR is a comprehensive evaluation index of reconstruct image quality. MSE is the mean square error of the current calculated image compared to the ground truth image. H and V represent the number of the row and column, respectively, and n is the number of bits of storage per pixel.
M S E = 1 H v p = 1 H q = 1 v ( u ( p , q ) u g ( p , q ) ) 2 P S N R = 10 log 10 ( ( 2 H × V 1 ) 2 M S E )
SSIM (structural similarity) is a comprehensive evaluation of the image restoration quality in terms of brightness, contrast and image structure.
μ u = 1 H V i = 1 H j = 1 V u ( p , q ) σ u = 1 H V 1 ( i = 1 H j = 1 V ( u ( p , q ) μ u ) 2 ) 1 2 σ u g = 1 H W 1 i = 1 H j = 1 W ( u ( p , q ) μ u ) ( u g ( p , q ) μ u ) l ( u , u g ) = 2 μ u μ u g + C 1 μ u 2 μ u g 2 + C 1 c ( u , u g ) = 2 σ u σ u g + C 2 σ u 2 σ u g 2 + C 2 s ( u , u g ) = σ u , u g + C 3 σ u σ u g + C 3 S S I M = l ( u , u g ) c ( u , u g ) s ( u , u g )
In general, let  C 1 = 6.5   C 2 = 58.5   C 3 = 29 .
Signal-to-noise ratio (SNR) is calculated as
R M S E ( u , u g ) = j Ω | u ( p , q ) u g ( p , q ) | 2 j Ω | u ( p , q ) | 2 S N R ( u , u g ) = 10 log 10 R M S E ( u , u g )
Relative-error is defined as follows:
r e l a t i v e e r r o r = u ( p , q ) u g ( p , q ) u g ( p , q )
In all the above formulas, u represents the current reconstructed image and  u g  represents the ground truth image.

4.1. Numerical Results

We also compared the PR results for the performance of the proposed TFLSB model with three other related phase retrieval algorithms: the error reduction algorithm (ER) [3], TVB method [18] and Wirtinger flow (WTF) method [9]. The data were contaminated by the Gaussian noise  b i = b 0 + σ n i , where  b 0  is magnitudes of the original image,  σ  represents the noisy weight,  n i  represents the white Gaussian noise, and  b i  is the measurement of the real object’s Fourier transform magnitude.
The test simulation images are available on the Internet (no copyright restrictions). Our research focused on the phase retrieval problem from Fourier transform magnitudes. Its application field covers remote target imaging. Therefore, the selected images have obvious geometric features and simple textures, with a size of 256 × 256.
In order to study the algorithm’s robustness to noise, 60 dB Gaussian noise was added to the measured value to verify the robustness of the algorithms. The noise level was set according to the SNR formula and rand function.
S N R = 10 log 10 j Ω | u ( i , j ) u g ( i , j ) | 2 j Ω | u ( i , j ) | 2 = 10 log 10 j Ω | | n o i s e ( i , j ) | | 2 j Ω | u ( i , j ) | 2
The TVB method and the TFLSB algorithm use the same structured illumination pattern in this study; the relevant parameters were set as follows: the iteration numbers were 1000,  λ = 100 ρ 1 = 0.07 ρ 2 = 0.02  and  ρ 3 = 0.08 . As  s 1 = s 2 = N + 0.5  (N is a positive integer) proves that the underlying image has an optimal solution, then s1 and s2 can be set to 0.5, 1.5 and 2.5. Here, in this experiment, we set  s 1 = s 2 = 0.5 . Other value settings, such as 1.5, 2.5 and 3.5, could also obtain visually good reconstructed images and can be modified according to different graphics. The results are shown in the following figures, Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6.
The numerical evaluation index performance of the phase retrieval method reflects the approximate degree of the distribution characteristics of the calculated image and the real object. Image vision includes sharpness, smoothness and similarity. The comparison results of the reconstructed image quality from the three images in Figure 1, Figure 3 and Figure 5 were as follows. Firstly, the ER and TVB algorithms introduced obvious defects in the originally smooth background area. Secondly, the reconstructed image of the ER algorithm was fuzzy overall, while the reconstructed image of WF was sharper. This is because these algorithms do not have a good balance between image smoothness and boundary sharpness. Although the constrained performance of the TVB algorithm was better, its staircase effect can produce obvious defects which could not be omitted. Overall, compared with the other algorithms, the proposed TFLSB model using the ADMM algorithm performed better, and the reconstructed images were more visually pleasant with few noticeable image artifacts.
The comparison results in Figure 2, Figure 4 and Figure 6 show that the proposed algorithm was more stable, converged to the optimal value and always had the lowest relative error value when dealing with images with different complexities. Table 1, Table 2 and Table 3 show the image restoration quality with the numerical evaluation indicators, PSNR, SSIM, SNR and processing time. When the iteration number was 1000, our algorithm required more time due to the algorithm’s computing complexity. However, according to the speed of iterative convergence, the solving time could be shortened by reducing the iteration number. The PSNR, SSIM and SNR indexes directly reflect the pixel correspondence between the reconstructed image and the original image. According to the results of the three reconstructed images, for the TFLSB model proposed in this paper, compared with ER, WF and TVB algorithm, the PSNR index was improved by 111.59%, 108.87% and 57.14%, respectively. The SSIM index improved by 174.49%, 194,74% and 84.37%, and the SNR index improved by 674.89%, 517.72% and 241.47%, respectively. The comparison results in Table 1, Table 2 and Table 3 directly show that our proposed FLSB model is more robust to noise and more efficient in reconstructing images compared to the other methods.

4.2. Sensitivity with Complex Noise

Image acquisition not only contains Gaussian noise, but also electromagnetic interference in the environment, and sensor internal errors, which will introduce salt–pepper noise. Salt–pepper noise, also known as pulse noise, is represented in the image as discrete distributions of pure white or black pixels. The image reconstruction ability of the TFLSB model was studied in images where Gaussian noise and salt–pepper noise coexist. Due to the excellent performance of the proposed algorithm in the previous task, in this experiment, we challenged the proposed algorithm to deal images with more complex textures to study its robustness in the presence of complex noise. The images are public images released by the Kodak Company. For the sake of consistency in this paper, the images used below are part of the original Kodak images, with a size of 256 × 256. Images were named—“Bird”, “Hat”, “Tower”.
The data were contaminated with Gaussian noise and salt–pepper noise, i.e.,  b i = b 0 + σ n i + p , where  b 0  is the magnitudes of the spatial spectrum of the original image,  σ  represents the noisy weight,  n i  represents the white Gaussian noise, p denotes the salt–pepper noise, and  b i  is the measurement of the Fourier transform magnitude of the real object. The salt–pepper noise was generated by a random function and threshold setting. First, a random matrix was generated, whose values were derived from the standard uniform distribution in the open interval (0, 1). Then, using the threshold setting, the elements of a random matrix were converted to integers 0 or 1.
Thus, the complex noise is described using the following formula:
n c o m p l e x = σ n i + p
In this experiment,  σ  = 1 and the threshold setting of salt–pepper noise is 0.9. When the value generated by the random function is greater than 0.9, white noise is obtained. The salt–pepper noise setting means that 10% of the measurements were corrupted, which is more serious than a real experiment. The results are shown in the following Figure 7.
By comparing the reconstructed images with the original images, we can observe that the reconstructed images with complex noise have significant clarity regarding visual quality. The numerical values in Table 4 show that the PSNR values of the restored images varied from 23.606 to 29.4759, the SSIM values varied from 0.355219 to 0.443401, and the SNR values varied from 24.715 dB to 27,188 dB. These data suggest that the proposed sparse prior regularization model TFLSB is robust even with complex noise, and it is also effective in processing images with complex textures. In the future, its application range could be determined by studying the relationship between noise standards, parameter settings and image complexity.

5. Experimental Results

Experimental equipment was set up to simulate reconstruction of the underlying image using Fourier transform magnitudes. The process consisted of two parts. The first part was data collection, during which the spatial spectrum modulus of the target underwent correlation computing. The second is to use the phase recovery algorithm to calculate the reconstructed image.
The schematic diagram of the experiment is shown in Figure 8. The working wavelength selected for the laser for the purpose of the experiment was 532 nm. The laser turned into a pseudothermal light source after passing through the rotating glass. In order to reduce the extra stray light caused by reflection, this experiment adopted a transmission target, which was made by hollowing out a target pattern on a metal plate, with an image size of 3 × 3 mm. The structure is shown in Figure 9.
The pixel size of the CCD camera was 6.5 μm, and the number of detection units is 2048 × 2048. In the experiment, the CCD detection frequency was 20 Hz, and the exposure time was 30 ms. The glass rotation speed was 0.3°/s. A SIM structured light modulator was set up using the internal DMD array to realize the control of the structured light. The light field distribution of structured light mode 1 is  exp ( i 2 π ( s 1 t 1 n 1 + s 2 t 2 n 2 ) . When  s 1 = s 2 = 0.5 , it is a two-dimensional sinusoidal distribution of phases in box constrained interval (0-Π). The light field distribution of structured light mode 1 is  exp ( i 2 π ( s 1 t 1 n 1 + s 2 t 2 n 2 ) π 2 ) , and compared to pattern 1, it has a phase shift.
In order to reduce the influence of the environment on the phase retrieval experiment, data acquisition was divided into two steps. First, when the measurement was not illuminated by a light source, the measurement values  I d a r k  of the experimental environment used for imaging and the inherent defects of the measurement system recorded with the detector, representing the background noise. Then, when the light source illuminates the target, the measurement values  I 1 , I 2 , I 3  were recorded. The difference between these two points can offset the impact of some of the noise.
The TFLSB algorithm in this paper is used for phase retrieval, and the results are shown in Figure 10.
Due to the limited experimental conditions, the acquisition frequency of the CCD camera was limited, as was the target spatial spectrum information that could be obtained by the pseudothermal light source, making the obtained target information very scarce and increasing the difficulty of phase recovery. When structured light lit an area, the two optical paths should be at the same frequency and there should be no phase delay, but in practical applications, it is difficult to ensure that the optical path difference between the two optical paths is 0. This error reduces reconstructed image quality. From the visual evaluation, the resolution of the reconstruction image is not good enough when compared the previous numerical simulation. The evaluation indexes PSNR, SSIM and SNR used for image restoration are not ideal, and additional research on phase retrieval should be carried out in the future. The numerical results in Table 5 directly show that our proposed FLSB model could efficiently reconstruct images in practice.

6. Conclusions

In this paper, we introduced an innovative TV and framelet-based regularization minimization phase retrieval model with a box constraint (TFLSB) for image recovery from magnitudes degraded by Gaussian and salt–pepper noise. Our proposed model incorporates isotropic TV and analysis-based wavelet regularization, enabling the enforcement of sparse priors in both the gradient domain and spatial structure domain simultaneously. Through heuristic analysis, we identified the key parameters s1 = s2 = N + 0.5 (N being a positive integer) that contribute to stable phase recovery with 3n1n2 measurements. This structural light can be easily obtained by using a structured light modulator. The TFLSB model effectively reconstructs high-quality latent images from corrupted measurement data obtained with a structured lighting model. Comparative evaluations against the ER, WF, and TVB algorithms demonstrate that the degraded images reconstructed by the TFLSB model exhibit superior image quality, with clearer edges and fewer artifacts. The evaluation indices PSNR, SSIM and SNR further confirm the significant enhancement in image quality achieved by the numerical theory and TFLSB model. Additionally, our study investigated the robustness of the TFLSB model against Gaussian and salt–pepper noise, revealing its resilience against complex noise. This provides a certain direction for the implementation of phase retrieval from Fourier transform magnitudes in practice.
Furthermore, in practice, the proposed method is able to reconstruct the underlying image from Fourier transform magnitudes, helping us to solve the phase retrieval problem in environments. Under the existing experimental conditions, the reconstructed images solved using the TFLSB algorithm are not clear enough, but there is an optimal solution, that can verify the stability and feasibility of the proposed algorithm.
It is important to note that phase retrieval remains a challenging deconvolution problem, and there are several open questions that require further exploration. The proposed algorithm was the most time consuming compared to the other algorithms, which is not conducive to real-time imaging applications. One potential improvement is the use of a more powerful industrial computer to reduce the computation time. Future work will focus on developing faster algorithms with a second-order convergence rate to reduce processing time. Additionally, we aim to incorporate more priors to enhance the quality of the reconstructed solutions for a broader range of image categories. While the proposed phase retrieval model is currently applicable to oversampling scenarios (structured illuminated patterns), our future research will explore additional patterns to enable the exact recovery of latent images in various settings.

Author Contributions

X.Q. developed the theoretical formalism. X.G. and X.Y. conceived the presented idea. X.Q. and M.X. collected and analyzed the data. X.Q. described the results and wrote the paper. X.G. supervised the work. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to thank Bin Wang from the Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Science, for helping us develop the theoretical formalism The insightful comments by Bin Wang is greatly appreciated and helped to improve this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Proof of Theorem 1

Proof. 
Case 1 is for a one-dimensional signal, i.e.,  u = ( u 0 , u 1 , u n 1 ) R n , and  u i [ 0 , 1 ] , 0 i n 1 . The measurements are represented by  R ( u ) : = { | R 0 | , | R 1 | , | R 2 | }  denotes a discrete Fourier transformation, and  D s u = exp { i 2 π k n ( N + 0.5 ) } u = exp ( i π k n ) u ,   0 k n 1 .
Define  U = ( U 0 , U 1 , U n 1 ) : = u U = ( U 0 , U 1 , U n 1 , U n 1 ¯ U 2 ¯ , U 1 ¯ )  can obtained, where  U 1 ¯  identifies the complex conjugate operator of U.
Define  v = ( v 0 , v 1 , v n 1 ) : = D s u  and  V = ( V 0 , V 1 , V n 1 ) : = v ; then,  V = ( V 0 , V 1 , V n 1 , V n 1 ¯ V 2 ¯ , V 1 ¯ )  can obtained.
Then, two triples,  M ( u ) : = { | U | , | U + V | , | U i V | }  and  M ( u ) ¯ : = { | U ¯ | , | U ¯ + V ¯ | , | U ¯ i V ¯ | } , can be obtained.
If  U 0  and  V 0  are real-valued and non-negative, the set  U 0  and  V 0  could be solved with the triple set  { | U 0 | , | U 0 + V 0 | , | U 0 i V 0 | } :
Solve  U 1  with the triple  { | U 1 ¯ | , | U 1 ¯ + V 0 ¯ | , | U 1 ¯ i V 0 ¯ | }  if  U 1 0 .
Since  V 1 = V 0 ¯ ,
Solve  V 2  with the triple  { | U 1 ¯ | , | U 1 ¯ + V 2 ¯ | , | U 1 ¯ i V 2 ¯ | }  if  V 2 0 .
When  U = ( U 0 , U 1 , U n 1 )  has been solved, through the inverse Fourier transform of U, the exact minimum solution  u = U . □
Case 2 is for two dimension images,  u , v R n 1 × n 2 ; here, matrix forms are used to represent the images.
Similar to the 1D case, the solution process is as follows:
U i 0 , j 0 V i 0 , j 0 V i 1 , j 1 U i 1 , j 1 U i 2 , j 2
when
U i , j = U ¯ mod ( n 1 i , n 1 ) mod ( n 2 j , n 2 ) V i , j = V ¯ mod ( n 1 + 1 i , n 1 ) mod ( n 2 + 1 j , n 2 )
where
i 2 k + 2 = mod ( n 1 i 2 k + 1 , n 1 )   j 2 k + 2 = mod ( n 2 j 2 k + 1 , n 2 ) i 2 k + 1 = mod ( n 1 + 1 i 2 k , n 1 ) j 2 k + 1 = mod ( n 2 + 1 j 2 k , n 2 )

References

  1. Fienup, J.R. Phase retrieval algorithms: A comparison. Appl. Opt. 1982, 21, 2758–2769. [Google Scholar] [CrossRef] [PubMed]
  2. Lu, Y.M.; Vetterli, M. Sparse spectral factorization: Unicity and reconstruction algorithms. In Proceedings of the ICASSP 2011–2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; pp. 5976–5979. [Google Scholar] [CrossRef]
  3. Bauschke, H.H.; Combettes, P.L.; Luke, D.R. Phase retrieval, error reduction algorithm, and Fienup variants: A view from convex optimization. J. Opt. Soc. Am. A 2002, 19, 1334–1345. [Google Scholar] [CrossRef] [PubMed]
  4. Gerchberg, R.W. A practical algorithm for the determination of phase from image and diffraction plane pictures. Optik 1972, 35, 237–250. [Google Scholar]
  5. Bauschke, H.H.; Combettes, P.L.; Luke, D.R. Hybrid projection–reflection method for phase retrieval. J. Opt. Soc. Am. A 2003, 20, 1025–1034. [Google Scholar] [CrossRef] [PubMed]
  6. Elser, V. Phase retrieval by iterated projections. J. Opt. Soc. Am. A 2003, 20, 40–55. [Google Scholar] [CrossRef]
  7. Luke, D.R. Relaxed averaged alternating reflections for diffraction imaging. Inverse Probl. 2004, 21, 37–50. [Google Scholar] [CrossRef]
  8. Marchesini, S. Phase retrieval and saddle-point optimization. J. Opt. Soc. Am. A 2007, 24, 3289–3296. [Google Scholar] [CrossRef]
  9. Candes, E.J.; Li, X.; Soltanolkotabi, M. Phase Retrieval via Wirtinger Flow: Theory and Algorithms. IEEE Trans. Inf. Theory 2015, 61, 1985–2007. [Google Scholar] [CrossRef]
  10. Farrell, S.M.; Veeraraghavan, A.; Sabharwal, A.; Uribe, C.A. Distributed Generalized Wirtinger Flow for Interferometric Imaging on Networks. IFAC-PapersOnLine 2022, 55, 258–263. [Google Scholar] [CrossRef]
  11. Candès, E.J.; Strohmer, T.; Voroninski, V. PhaseLift: Exact and Stable Signal Recovery from Magnitude Measurements via Convex Programming. Commun. Pure Appl. Math. 2012, 66, 1241–1274. [Google Scholar] [CrossRef]
  12. Moretta, R.; Maisto, M.A.; Pierri, R. Numerical experiments on the impact of constraints in Phaselift. In Proceedings of the 41st Photonics & Electromagnetics Research Symposium (PIERS), Xiamen, China, 17–20 December 2019; IEEE: New York, NY, USA. [Google Scholar]
  13. Waldspurger, I.; D’aspremont, A.; Mallat, S. Phase recovery, MaxCut and complex semidefinite programming. Math. Program. 2013, 149, 47–81. [Google Scholar] [CrossRef]
  14. Yin, P.; Xin, J. PhaseLiftOff: An accurate and stable phase retrieval method based on difference of trace and Frobenius norms. Commun. Math. Sci. 2015, 13, 1033–1049. [Google Scholar] [CrossRef]
  15. Xia, Y.; Xu, Z. Sparse Phase Retrieval Via PhaseLiftOff. IEEE Trans. Signal Process. 2021, 69, 2129–2143. [Google Scholar] [CrossRef]
  16. Gao, M.; Fang, S.; Wang, J.; Zhang, X.; Cao, Y. A Dual Frequency Predistortion Adaptive Sparse Signal Reconstruction Algorithm. Teh. Vjesn.-Tech. Gaz. 2022, 29, 580–589. [Google Scholar] [CrossRef]
  17. Candes, E.J.; Eldar, Y.C.; Strohmer, T.; Voroninski, V. Phase Retrieval via Matrix Completion. SIAM Rev. 2015, 57, 225–251. [Google Scholar] [CrossRef]
  18. Chia, C.; Sesia, M.; Ho, C.-S.; Jeffrey, S.S.; Dionne, J.; Candes, E.J.; Howe, R.T. Interpretable Classification of Bacterial Raman Spectra With Knockoff Wavelets. IEEE J. Biomed. Health Inform. 2021, 26, 740–748. [Google Scholar] [CrossRef] [PubMed]
  19. Chang, H.; Lou, Y.; Ng, M.K.; Zeng, T. Phase Retrieval from Incomplete Magnitude Information via Total Variation Regularization. SIAM J. Sci. Comput. 2016, 38, A3672–A3695. [Google Scholar] [CrossRef]
  20. Leonid, I. Rudin and Stanley Osher and Emad Fatemi. Nonlinear total variation based noise removal algorithms. Phys. D Nonlinear Phenom. 1992, 60, 259–268. [Google Scholar] [CrossRef]
  21. Bessas, K. Fractional total variation denoising model with L1 fidelity. Nonlinear Anal. Int. Multidiscip. J. 2022, 222, 112926. [Google Scholar] [CrossRef]
  22. Corentin, C.; Antonin, C. Error estimates for finite differences approximations of the total variation. IMA J. Numer. Anal. 2023, 43, 692–736. [Google Scholar]
  23. Li, X.; Li, Y.; Chen, P.; Li, F. Combining convolutional sparse coding with total variation for sparse-view CT reconstruction. Appl. Opt. 2022, 61, C116–C124. [Google Scholar] [CrossRef] [PubMed]
  24. Ron, A.; Shen, Z. Affine Systems inL2(Rd): The Analysis of the Analysis Operator. J. Funct. Anal. 1997, 148, 408–447. [Google Scholar] [CrossRef]
  25. Daubechies, I.; Han, B.; Ron, A.; Shen, Z. Framelets: MRA-based constructions of wavelet frames. Appl. Comput. Harmon. Anal. 2003, 14, 1–46. [Google Scholar] [CrossRef]
  26. Mallat, S. A Wavelet Tour of Signal Processing, 2nd ed.; Academic Press: San Diego, CA, USA, 1999. [Google Scholar]
  27. Chan, R.; Shen, L.; Shen, Z. A framelet-based approach for image inpainting. Preprint 2005, 4, 325. [Google Scholar]
  28. Nnolim, U.A. Fourth-Order Partial Differential Equation Framelet Fusion-Based Colour Correction and Contrast Enhancement for Underwater Images. Int. J. Image Graph. 2022, 23, 235044. [Google Scholar] [CrossRef]
  29. Ron, A.; Shen, Z. Affine systems in L_2((R^d) II: Dual systems. J. Fourier Anal. Appl. 1997, 3, 617–637. [Google Scholar] [CrossRef]
  30. Abdollahi, F.; Lakestani, M.; Razzaghi, M. Hybrid Vessel Extraction Method Based on Tight-Frame and EM Algorithms by Using 2D Dual Tree Complex Wavelet. Informatica 2021, 32, 1–22. [Google Scholar] [CrossRef]
  31. Shen, Z. Wavelet frames and image restorations. In Proceedings of the International Congress of Mathematicians, Hyderabad, India, 19–27 August 2010; Vol. I: Plenary Lectures and Ceremonies Vols. II–IV: Invited Lectures. pp. 2834–2863, ISBN 0-12-466605-1. [Google Scholar]
  32. Mallat, S.G. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 11, 494–513. [Google Scholar] [CrossRef]
  33. Chang, D.; Wang, Y.; Fan, R. Forecast of Large Earthquake Emergency Supplies Demand Based on PSO-BP Neural Network. Teh. Vjesn.-Tech. Gaz. 2022, 29, 561–571. [Google Scholar] [CrossRef]
  34. Cai, J.-F.; Chan, R.H.; Shen, Z. A framelet-based image inpainting algorithm. Appl. Comput. Harmon. Anal. 2008, 24, 131–149. [Google Scholar] [CrossRef]
  35. Wang, Y.; Yang, J.; Yin, W.; Zhang, Y. A New Alternating Minimization Algorithm for Total Variation Image Reconstruction. SIAM J. Imaging Sci. 2008, 1, 248–272. [Google Scholar] [CrossRef]
  36. Hayes, M. The reconstruction of a multidimensional sequence from the phase or magnitude of its Fourier transform. IEEE Trans. Acoust. Speech Signal Process 1982, 30, 140–154. [Google Scholar] [CrossRef]
  37. Sanz, J.L.C. Mathematical Considerations for the Problem of Fourier Transform Phase Retrieval from Magnitude. SIAM J. Appl. Math. 1985, 45, 651–664. [Google Scholar] [CrossRef]
  38. Fannjiang, A. Absolute uniqueness of phase retrieval with random illumination. Inverse Probl. 2012, 28, 075008. [Google Scholar] [CrossRef]
  39. Fannjiang, A.; Liao, W. Phase retrieval with random phase illumination. J. Opt. Soc. Am. A 2012, 29, 1847–1859. [Google Scholar] [CrossRef]
  40. Fannjiang, A.; Liao, W. Fourier phasing with phase-uncertain mask. Inverse Probl. 2013, 29, 125001. [Google Scholar] [CrossRef]
  41. Wen, Z.; Yang, C.; Liu, X.; Marchesini, S. Alternating direction methods for classical and ptychographic phase retrieval. Inverse Probl. 2012, 28, 115010. [Google Scholar] [CrossRef]
  42. Han, D.R. A Survey on Some Recent Developments of Alternating Direction Method of Multipliers. J. Oper. Res. Soc. China 2022, 10, 1–52. [Google Scholar] [CrossRef]
  43. Chou, H.-H.; Tsai, F.-S. Technology-Enabled Mobilization in the Emergence of a Value Co-Creating Ecosystem. J. Organ. End User Comput. 2022, 34, 1–17. [Google Scholar] [CrossRef]
Figure 1. The phase retrieval (PR) results and comparisons. (a) Original image of pine-tree (b) Fourier transform magnitudes of the image; (c) PR by ER; (d) PR by WF; (e) PR by TVB; (f) PR by TFLSB.
Figure 1. The phase retrieval (PR) results and comparisons. (a) Original image of pine-tree (b) Fourier transform magnitudes of the image; (c) PR by ER; (d) PR by WF; (e) PR by TVB; (f) PR by TFLSB.
Photonics 11 00071 g001
Figure 2. Relative error vs. iteration for image of pine-tree using different methods.
Figure 2. Relative error vs. iteration for image of pine-tree using different methods.
Photonics 11 00071 g002
Figure 3. The phase retrieval (PR) results and comparisons. (a) Original image of six-claw satellite; (b) Fourier transform magnitudes of the image; (c) PR by ER; (d) PR by WF; (e) PR by TVB; (f) PR by TFLSB.
Figure 3. The phase retrieval (PR) results and comparisons. (a) Original image of six-claw satellite; (b) Fourier transform magnitudes of the image; (c) PR by ER; (d) PR by WF; (e) PR by TVB; (f) PR by TFLSB.
Photonics 11 00071 g003
Figure 4. Relative error vs. iteration for image of six-claw satellite using different methods.
Figure 4. Relative error vs. iteration for image of six-claw satellite using different methods.
Photonics 11 00071 g004
Figure 5. The phase retrieval (PR) results and comparisons. (a) Original image of satellite; (b) Fourier transform magnitudes of the image; (c) PR by ER; (d) PR by WF; (e) PR by TVB; (f) PR by TFLSB.
Figure 5. The phase retrieval (PR) results and comparisons. (a) Original image of satellite; (b) Fourier transform magnitudes of the image; (c) PR by ER; (d) PR by WF; (e) PR by TVB; (f) PR by TFLSB.
Photonics 11 00071 g005aPhotonics 11 00071 g005b
Figure 6. Relative error vs. iteration for image of satellite using different methods.
Figure 6. Relative error vs. iteration for image of satellite using different methods.
Photonics 11 00071 g006
Figure 7. The phase retrieval (PR) results with complex noise. The images in the first column are original images: (a) “Bird”; (d) “Hat”; (g) “Tower”. The content in middle column, (b,e,h) are corresponding magnitudes of the image with Gaussian and salt–pepper noise; (c,f,i) the PR results obtained viaTFLSB.
Figure 7. The phase retrieval (PR) results with complex noise. The images in the first column are original images: (a) “Bird”; (d) “Hat”; (g) “Tower”. The content in middle column, (b,e,h) are corresponding magnitudes of the image with Gaussian and salt–pepper noise; (c,f,i) the PR results obtained viaTFLSB.
Photonics 11 00071 g007
Figure 8. Structured light illumination phase retrieval experimental system.
Figure 8. Structured light illumination phase retrieval experimental system.
Photonics 11 00071 g008
Figure 9. Target structure. (a) Target 1; (b) Target 2.
Figure 9. Target structure. (a) Target 1; (b) Target 2.
Photonics 11 00071 g009
Figure 10. The phase retrieval (PR) results. (a) “Target 1”; (b) “Target 2”. In the second row, (c,d) are the real spatial spectrum mode value distribution of the target. In the third row, (e,f) are the measured spatial spectrum mode value distribution of the target with noise. In the last row (g,h) are the images recovered by TFLSB algorithm.
Figure 10. The phase retrieval (PR) results. (a) “Target 1”; (b) “Target 2”. In the second row, (c,d) are the real spatial spectrum mode value distribution of the target. In the third row, (e,f) are the measured spatial spectrum mode value distribution of the target with noise. In the last row (g,h) are the images recovered by TFLSB algorithm.
Photonics 11 00071 g010
Table 1. Phase retrieval image quality evaluation parameters for different reconstruction algorithms for pine-tree image. The best data are shown in bold.
Table 1. Phase retrieval image quality evaluation parameters for different reconstruction algorithms for pine-tree image. The best data are shown in bold.
MethodTime (s)PSNRSSIMSNR (dB)
ER15.412113.82150.0627453.31793
WF23.886817.57210.12329.45358
TVB18.109823.31890.12107215.7823
TFLSB36.555644.80560.16649947.5823
Table 2. Phase retrieval image quality evaluation parameters for different reconstruction algorithms for six-claw image. The best data are shown in bold.
Table 2. Phase retrieval image quality evaluation parameters for different reconstruction algorithms for six-claw image. The best data are shown in bold.
MethodTime (s)PSNRSSIMSNR (dB)
ER15.78319.38880.1202049.33326
WF35.1416.29470.1540686.28477
TVB25.190523.45790.18483614.2892
TFLSB46.717236.63480.53554345.7626
Table 3. Phase retrieval image quality evaluation parameters for different reconstruction algorithm for satellite image. The best data are shown in bold.
Table 3. Phase retrieval image quality evaluation parameters for different reconstruction algorithm for satellite image. The best data are shown in bold.
MethodTime (s)PSNRSSIMSNR (dB)
ER12.001823.04350.27173110.5373
WF24.947119.09690.3014726.78469
TVB18.744822.77060.24309610.4753
TFLSB39.758428.2640.30523532.5247
Table 4. Parameters of reconstruction result with complex noise.
Table 4. Parameters of reconstruction result with complex noise.
NameTime (s)PSNRSSIMSNR (dB)
Bird35.63223.6050.42093524.7175
Hat36.216226.31380.35521925.1869
Tower38.192429.47590.44340127.188
Table 5. Parameters of reconstruction result.
Table 5. Parameters of reconstruction result.
NamePSNRSSIMSNR (dB)
Object18.717750.0093978112.2024
Object27.27050.0030437811.3587
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Qin, X.; Gao, X.; Yang, X.; Xie, M. Complex Noise-Based Phase Retrieval Using Total Variation and Wavelet Transform Regularization. Photonics 2024, 11, 71. https://doi.org/10.3390/photonics11010071

AMA Style

Qin X, Gao X, Yang X, Xie M. Complex Noise-Based Phase Retrieval Using Total Variation and Wavelet Transform Regularization. Photonics. 2024; 11(1):71. https://doi.org/10.3390/photonics11010071

Chicago/Turabian Style

Qin, Xing, Xin Gao, Xiaoxu Yang, and Meilin Xie. 2024. "Complex Noise-Based Phase Retrieval Using Total Variation and Wavelet Transform Regularization" Photonics 11, no. 1: 71. https://doi.org/10.3390/photonics11010071

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop