1. Introduction
Interferometric measurements are highly sensitive to atmospheric fluctuations and mechanical vibrations, which introduce noise into the measured interferogram. In interferometric methods affected by speckle noise and low signal strength, it is common to observe a very low signal-to-noise ratio (SNR). An example of such a system is the digital holographic interferometric (DHI) dosimeter developed by our group [
1], which determines the absorbed radiation dose by measuring phase shifts resulting from refractive index changes caused by heat energy transferred to transparent media. Noise during the phase unwrapping step of image reconstruction can lead to considerable errors. This challenge is not unique to DHI dosimetry; it also arises in other domains where accurate phase unwrapping is critical, including optical imaging [
2], magnetic resonance imaging [
3], and synthetic aperture radar [
4]. Due to the reconstruction method, the recovered phase is wrapped to the interval
. In low-signal phase maps, the measured phase may be less than
. This results in no phase wrapping occurring from the phase signal. Instead, various noise sources, such as atmospheric turbulence, mechanical vibrations, and detector noise in the CCD camera, can induce phase wraps [
2]. Consequently, in low-SNR environments typical of interferometric measurements, phase wrapping arises primarily from noise rather than from the underlying physical phase signal. This noise is present before the phase is wrapped. However, additional noise will also be introduced after wrapping due to phase decorrelation arising from the subtraction of the pre- and post-irradiation phase images [
5]. These compounded effects underscore the critical need for robust phase unwrapping algorithms in interferometric imaging applications.
There are many analytical phase unwrapping methods currently available. The main two types used for phase unwrapping can be divided into localand global unwrappers. A widely used local algorithm is the Herraez unwrapper [
6,
7], known for its robust, path-independent 2D strategy that minimizes phase errors in noisy or discontinuous regions. This method was also employed during the prototype development of the DHI dosimeter under joint research by the University of Canterbury and the University of Washington [
8], where it demonstrated promising performance but remained sensitive to noise. The Herraez algorithm performs phase unwrapping by sorting pixels according to a reliability metric and progressively unwrapping them in order of decreasing reliability, ensuring that high-confidence regions guide the process and reduce error propagation. To enable comparison with a global phase unwrapping strategy, we selected the Costantini algorithm [
9]. This method operates by solving a system of equations over the entire phase map, making it inherently more robust to noise compared with local approaches. The Herraez and Costantini algorithms were selected as they represent two of the most robust and widely validated approaches, quality-guided and minimum-norm global optimization methods, respectively, providing strategies for reliable phase unwrapping [
6].
Numerous deep learning methods have been developed for phase unwrapping. These approaches are typically leveraging architectures such as the Residual U-Net (Res-UNet) [
10] and the Separable-Residual-Dense-Inverted U-Net (SRDU-Net) [
11]. Other regression-based models for general phase unwrapping include PHU-Net [
12], PU-GAN [
13], and BCNet [
14]. However, a critical limitation shared across these models is their reliance on simplistic Gaussian noise assumptions. They typically do not account for the complex noise characteristics present in holographic measurement systems. This assumption leads to unrealistically high performance metrics and limits their practical applicability on real world data [
15]. Gaussian noise is additive and follows a normal distribution. Typically, it arises from electronic components, thermal fluctuations, or quantization errors in sensors and systems. On the other hand, Rayleigh noise is multiplicative and follows a Rayleigh distribution, which is skewed and only defined for non-negative values. It arises in coherent imaging systems like holography due to random interference of multiple wavefronts and is more relevant in speckle noise scenarios, especially in off-axis holography or scattering media [
16]. It can dominate in reconstructions where interference patterns are strong, and is therefore more challenging to filter due to its non-Gaussian nature and multiplicative behavior.
The aim of this work was to develop and characterize a robust, open-source AI-based phase unwrapping model that extends beyond published approaches by explicitly accounting for realistic noise conditions typical of low-SNR environments. Unlike previous approaches [
13,
17,
18,
19], the proposed framework incorporates phase decorrelation and Rayleigh-distributed speckle noise to better reflect practical interferometric data for holography [
5]. Model testing also includes a systematic evaluation of artifacts present in the wrapped phase images, providing deeper insight into performance under challenging measurement conditions.
3. Results
The AI model was trained under realistic interferometric measurement conditions characterized by a low SNR. Noise was introduced prior to phase wrapping to simulate acquisition-related distortions, and additional phase decorrelation noise was applied post-wrapping to reflect the effects of image subtraction. This aimed to replicate the complex noise environment typical of interferometric systems. To determine suitable noise amplitudes () and correlation factors () for the phase decorrelation noise in the dataset, various combinations of these parameters were tested to identify the conditions that produced the desired phase-wrapping behavior. was determined as the threshold above which noise-induced wrapping was observed across all correlation factors.
Two noise values were selected near the midpoint of the training datasets noise amplitude range, along with one value double that amount, to evaluate the model’s performance at both a well-trained noise level and a level beyond the training data. A representative image with
,
, and
is illustrated in
Figure A1. Furthermore,
Figure A2 highlights how the model generalizes to previously unseen noise levels, specifically at
,
, and
. As shown, the model is able to perform phase unwrapping effectively even at noise levels well above those seen during training. During model training, only noise levels up to
were utilized. The results corresponding to
Figure A1 and
Figure A2 are summarized in
Table 3.
To assess the stability and performance of the AI model against the analytical methods, PI values were plotted against noise level to summarize overall trends and the identification of performance thresholds under increasing noise. These values are shown in
Figure 3. The 95% confidence interval (CI) is also shown around the mean values. This represents the range within which the true mean is expected to lie with 95% confidence.
The AI model demonstrates generalization beyond the noise levels encountered during training. Although trained only up to a noise level of
, it continues to produce reasonable results beyond this. It is again emphasized that this analysis reflects pure unwrapping performance shown in
Figure 3a where analytical methods are evaluated relative to the noisy phase image, while the AI-based method is evaluated relative to the clean phase image. Comparing analytical methods directly to the clean phase is also done as it shows the utility of the analytical methods in recovering a clean ground-truth phase in
Figure 3b.
Table 3 and
Figure 3 provides evidence to suggest that the P2P model is effective as a phase unwrapper for complex speckle noise scenarios, including phase decorrelation noise, allowing recovery of clean, noiseless phase maps up to at least approximately
when trained up to
.
To examine how robust the trained model is to phase decorrelation noise,
Figure 4 was generated to show PI values across a range of correlation factors and noise levels.
Wrapping Artifacts
The artifact simulating a dead detector area is shown in
Figure 5. It is simulated as a circular region of radius 50 pixels with its center at pixels (350, 350). Two aspects can be analyzed using both the analytical and P2P-based unwrapping methods: (i) the effect of artifact position and (ii) the effect of artifact size on unwrapping performance. By varying the position of the artifact, it can be assessed whether its location within the phase map influences the ability of the unwrappers to recover the true phase. For instance, placing the artifact near the center of the phase peak may present a more challenging scenario compared to positioning it in lower-gradient regions. The influence of artifact position was first investigated, as illustrated in
Figure 6, before examining the effect of artifact size.
Figure 6 shows a RMSE and SSIM heatmap of different artifact positions. The artifact center is translated across the image grid in increments of 50 pixels along both the horizontal and vertical axes. Since the artifact size has a radius of 50 pixels, translating the center by 50 pixels will allow the artifact to span the entire image plane. The
x and
y-axis tick intervals correspond to 50 pixels each.
From this, it can be observed that there is a clear relationship between the phase amplitude and artifact positioning. Specifically, when the artifact is placed in a region with a steeper wrapping gradient, both RMSE and SSIM values deteriorate. This effect is evident for the P2P unwrapping method. To better visualize the regions where performance is most affected, a performance index heatmap was generated, as shown in
Figure 7. The corresponding wrapped images and predictions for the AI, Herraez, and Costantini methods are shown in
Figure A3,
Figure A4 and
Figure A5, respectively.
As seen in
Figure 7, the minimum Costantini PI is less than Herraez, indicating poor performance. Notably, there is no correlation between the wrapping gradient and the unwrapping performance for the Costantini analytical method.
In contrast, for the Herraez and P2P methods, there is a clear correlation between the phase information and unwrapping performance. However, the PI for Herraez is <0, indicating poor unwrapping performance. This demonstrates the AI model’s ability to learn underlying relationships present in the training data. Furthermore, its performance in these regions remains considerably better than that of Herraez, while also retrieving clean phase images devoid of noise characteristics, whereas Herraez and Costantini preserve the noise present in the predicted phase. Given the relationship between phase information and unwrapping performance, it is evident that larger artifacts would further degrade performance.
4. Discussion
This work investigates the feasibility and performance of a Pix2Pix-based conditional generative adversarial network for phase unwrapping in interferometric imaging under realistic and challenging noise conditions.
The study demonstrates that P2P can serve as a robust phase unwrapping framework under noise conditions representative of real interferometric measurements, including Rayleigh-distributed speckle noise and speckle phase decorrelation. In contrast to analytical approaches such as the Herraez and Costantini algorithms, the proposed AI-based method simultaneously unwraps and denoises the phase, producing outputs that are more suitable for downstream image reconstruction tasks.
A key distinction between the analytical and AI-based approaches lies in how noise is treated. In this work, noise was introduced prior to phase wrapping to reflect realistic holographic acquisition conditions. Consequently, analytical unwrappers recover phase continuity while preserving the noise present in the wrapped input, whereas the AI model learns to jointly unwrap and suppress noise. To isolate pure unwrapping performance, analytical methods were evaluated against the noisy phase image rather than the clean ground truth. Utility performance was then assessed by evaluating all predictions against the clean ground-truth phase.
This distinction is explicitly illustrated by the two subplots in
Figure 3.
Figure 3b presents a utility-based comparison, in which all methods are evaluated against the clean phase, reflecting the practical usefulness of the reconstructed output for downstream applications. Under this comparison, analytical methods perform poorly due to their inability to remove noise, whereas the P2P model maintains high reconstruction fidelity by jointly denoising and unwrapping the phase.
Figure 3a presents an unwrapping-only comparison, in which analytical methods are evaluated against the noisy phase image to assess their intended function of phase continuity recovery without penalization for residual noise. This dual presentation ensures methodological fairness while highlighting the fundamentally different objectives of analytical and AI-based approaches.
This evaluation strategy is conservative with respect to the analytical methods. As illustrated in
Figure A1 and
Figure 3, comparing analytical unwrappers directly to the clean phase would lead to substantially larger RMSE values and lower SSIM scores (RMSE
, SSIM
), as these methods are not designed to remove noise. By instead evaluating them against the noisy phase image, their reported performance reflects only their ability to correctly unwrap phase, rather than being penalized for residual noise. Even under this favorable comparison, the proposed AI model outperformed the analytical methods across higher noise levels.
The advantages of the AI-based approach become most apparent under low-SNR conditions, where analytical methods exhibit rapid performance degradation and increased variability. This behavior is reflected in the widening confidence intervals of the analytical performance indices at higher noise levels shown in
Figure 3, indicating instability and heightened sensitivity to noise-induced wrapping errors. In contrast, the AI model maintains narrow confidence intervals, suggesting more consistent and reliable behavior. While post-processing denoising could be applied to successfully analytically unwrapped images (i.e., those without unwrapping artifacts, such as those shown in
Figure A2) to improve visual quality, this approach introduces additional processing complexity and does not address the fundamental noise sensitivity of the unwrapping step itself.
While a full comparison with a state-of-the-art neural network–based phase unwrapping method is beyond the scope of this work, an initial comparison was performed between the proposed Pix2Pix model and the state-of-the-art SRDU-Net unwrapper. SRDU-Net achieved a mean SSIM and RMSE of and , respectively, across a range of noise levels. Although direct quantitative comparison is limited by differences in noise modeling and evaluation protocols, the Pix2Pix framework achieved higher SSIM and lower RMSE under substantially more challenging noise conditions, including Rayleigh and decorrelation noise not considered in prior work. Importantly, the proposed approach is fully open-source and relies on a relatively simple and well-established architecture, improving accessibility and reproducibility compared to more complex network designs.
Generalization beyond the training distribution was observed in multiple respects. The model maintained reasonable performance at noise levels exceeding those seen during training and demonstrated robustness to detector artifacts that were not explicitly included in the training data. Performance degradation beyond highlights the dependence of AI-based methods on training data coverage and realism. Similarly, the observed sensitivity to the phase decorrelation coherence factor suggests that extending the training dataset to include a range of coherence factors, rather than training solely at , would further improve model robustness.
The present study did not aim to exhaustively optimize model performance. Instead, the emphasis was placed on demonstrating the feasibility of the proposed reconstruction framework using a standard pix2pix configuration and established training practices. As such, key learning parameters, including learning rates, loss weighting terms, network depth, and adversarial training schedules, were not systematically tuned, nor was the effect of dataset size explored beyond achieving stable training behavior. Consequently, the reported performance should be interpreted as a conservative baseline rather than an upper bound. Further gains are likely achievable through targeted optimization, including hyperparameter tuning, adaptive learning rate schedules, alternative adversarial or perceptual loss formulations, and systematic exploration of dataset scaling.
The primary limitation of this and related AI-based approaches is the reliance on simulated training data. In real interferometric measurements, obtaining a true noiseless ground-truth phase is infeasible, necessitating realistic forward modeling of the imaging system. While the simulation framework used here captures key noise mechanisms relevant to holography, extension to experimental datasets will require careful validation and domain-specific retraining. The focus of this work has been on examining the potential of neural networks trained on complex noise characteristics present in holography, establishing a foundational framework upon which further holographic reconstruction research can be built.