Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration

Zhang, Zhongyuan; Wang, Hao; Li, Yiming; Li, Zinan; Gui, Weihua; Wang, Xiaohao; Zhang, Chaobo; Liang, Xiaojun; Li, Xinghui

doi:10.3390/s25206296

Open AccessReview

Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration

by

Zhongyuan Zhang

^1,2

,

Hao Wang

^1,2

,

Yiming Li

²

,

Zinan Li

¹

,

Weihua Gui

^2,3

,

Xiaohao Wang

¹

,

Chaobo Zhang

²

,

Xiaojun Liang

^2,*

and

Xinghui Li

^1,*

¹

Shenzhen International Graduate School, Tsinghua University, Shenzhen 518000, China

²

Pengcheng Laboratory, Shenzhen 518000, China

³

School of Automation, Central South University, Changsha 410083, China

^*

Authors to whom correspondence should be addressed.

Sensors 2025, 25(20), 6296; https://doi.org/10.3390/s25206296

Submission received: 12 August 2025 / Revised: 2 October 2025 / Accepted: 8 October 2025 / Published: 11 October 2025

(This article belongs to the Section Sensing and Imaging)

Download

Browse Figures

Versions Notes

Abstract

Structured-light 3D reconstruction is an active measurement technique that extracts spatial geometric information of objects by projecting fringe patterns and analyzing their distortions. It has been widely applied in industrial inspection, cultural heritage digitization, virtual reality, and other related fields. This review presents a comprehensive analysis of mainstream fringe-based reconstruction methods, including Fringe Projection Profilometry (FPP) for diffuse surfaces and Phase Measuring Deflectometry (PMD) for specular surfaces. While existing reviews typically focus on individual techniques or specific applications, they often lack a systematic comparison between these two major approaches. In particular, the influence of different projection schemes such as Digital Light Processing (DLP) and MEMS scanning mirror–based laser scanning on system performance has not yet been fully clarified. To fill this gap, the review analyzes and compares FPP and PMD with respect to measurement principles, system implementation, calibration and modeling strategies, error control mechanisms, and integration with deep learning methods. Special focus is placed on the potential of MEMS projection technology in achieving lightweight and high-dynamic-range measurement scenarios, as well as the emerging role of deep learning in enhancing phase retrieval and 3D reconstruction accuracy. This review concludes by identifying key technical challenges and offering insights into future research directions in system modeling, intelligent reconstruction, and comprehensive performance evaluation.

Keywords:

fringe structured light; fringe projection profilometry; phase measuring deflectometry; deep learning; 3D measurement

1. Introduction

Three-dimensional reconstruction technology is a key approach for recovering the spatial structure of objects from images or sensor data, and it has been widely applied in various fields such as industrial inspection, medical imaging, cultural heritage digitization, and virtual reality [1,2,3,4,5]. Based on the method of acquiring depth information, 3D reconstruction can be categorized into passive and active approaches. Passive methods rely on natural illumination and image matching—typical examples include stereo vision and multi-view geometry. However, their reconstruction accuracy is often limited by factors such as texture richness and occlusions, making them unsuitable for high-precision measurement tasks [6,7,8,9,10]. In comparison, active 3D measurement techniques maintain high reconstruction accuracy even in regions with weak or absent texture features. By introducing an additional structured light source, they provide phase information to the measured area, thereby improving the accuracy and completeness of the 3D surface data. The laser triangulation method relies on the principle of triangulation rather than phase information for reconstruction [11,12]; however, due to its line-scanning nature, its speed is generally lower than that of area-based structured-light methods. The Time-of-Flight (TOF) method estimates depth information by measuring the travel time of laser pulses between the detector and the object, and it is typically applied in large-scale scenarios on the order of hundreds of meters [13,14].

Among various active techniques, structured light has emerged as a mainstream approach for high-precision 3D reconstruction at close range [15,16,17,18], owing to its high resolution, accuracy, and system flexibility [19,20,21]. It is widely applied in scenarios such as industrial surface inspection [22,23], facial recognition [24,25], and 3D modeling [26,27]. Most structured-light systems are based on phase encoding principles and can be broadly categorized into two representative methods: FPP and PMD. FPP is suitable for diffuse surfaces and reconstructs 3D shapes by projecting multiple phase-shifted fringe patterns and extracting their phase. In contrast, PMD is designed for specular or highly reflective surfaces, acquiring gradient information by analyzing the phase variations of reflected fringe patterns, from which the 3D structure is reconstructed [28,29,30]. Depending on the projection mechanism, FPP systems can be implemented in several ways, with the most common being DLP projectors and MEMS-based micromirror systems. DLP systems offer high pattern quality and fast refresh rates, making them the dominant solution. While DLP projectors have been extensively studied and widely applied in structured-light systems, discussions often focus on their optical design and depth-of-field characteristics. In comparison, micro-electro-mechanical systems (MEMS)-based projection has received relatively less attention, despite offering distinctive advantages. By generating patterns through laser scanning, MEMS projectors naturally enable large depth-of-field projection without the need for additional focusing optics. Moreover, their compactness and lightweight design make them well-suited for complex environments and mobile platforms [31,32]. In recent years, MEMS projection has attracted increasing attention as a promising direction for lightweight structured-light systems.

A number of scholars have conducted systematic reviews and studies focusing on key components of the structured-light 3D reconstruction pipeline. Tobias Möller provided an early overview of all-solid-state PMD range imaging, highlighting its feasibility, the 2005 “Hermes Award” commercial product, and key challenges such as background illumination and temperature variations that demand robust solutions [33]. Building upon these foundations, Xu et al. categorized and summarized the system architecture of PMD, analyzing critical issues such as measurement accuracy, system complexity, and calibration difficulty [34]. He et al. systematically compared three common temporal-phase unwrapping methods in FPP—namely, Temporal Filtering, Phase Coding, and Gray-Code—and evaluated their error characteristics and reconstruction performance under different system configurations [35]. Lv et al. optimized fringe orientation, pixel matching, and 3D reconstruction models from a theoretical perspective, proposing an FPP method that balances accuracy, efficiency, and implementation simplicity [36]. Bai et al. reviewed key techniques in full-field phase-based 3D measurement, including phase error compensation, high-speed image acquisition, and the application of deep learning in complex scenarios [37]. Kulkarni and Rastogi surveyed mainstream fringe denoising algorithms, comparing their performance in terms of phase accuracy and edge preservation [38]. In parallel, Liu et al. reviewed the progress of deep learning in fringe projection, summarizing representative methods, network structures, datasets, and application scenarios, and providing a structured overview of key technical advances and future research trends in this rapidly evolving domain [39].

Although multiple technical modules of structured-light 3D reconstruction systems have been extensively studied, most existing reviews focus on a single method or specific application, and a systematic comparison between the two mainstream approaches—FPP and PMD—is still lacking. In particular, there is no unified understanding of how different projection schemes, such as DLP and MEMS, affect system performance. To address this issue, this paper starts from the general paradigm of structured-light 3D reconstruction and provides a comprehensive review and comparison of FPP, PMD, and emerging MEMS technologies, focusing on key aspects such as measurement principles, system implementation, calibration and modeling, error control, and integration with deep learning. The paper emphasizes the differences in practical adaptability and the potential for integration among these approaches. Representative reviews and studies are summarized in Table 1.

As illustrated in Figure 1, Section 1 introduces the research background and significance, while Section 2 starts from the general paradigm of fringe-structured-light 3D reconstruction, systematically presenting the principles of wrapped-phase extraction, phase unwrapping, and 3D shape recovery from phase, thereby laying the theoretical foundation for subsequent system evolution. Building on this paradigm, Section 3 focuses on the development of PMD systems, tracing their evolution from single-screen single-camera configurations to multi-screen direct PMD and multi-camera stereo PMD, gradually revealing their applicability and limitations in complex scenarios. In parallel, Section 4 shifts to FPP systems, analyzing the differences among mainstream projection technologies and examining calibration strategies and error modeling under MEMS-based projection, thereby highlighting challenges in accuracy and robustness. As traditional approaches increasingly reveal their shortcomings, Section 5 introduces the integration of deep learning into fringe-structured light, covering learning paradigms, network architecture innovations, supervision strategies, and input design, along with a discussion of evaluation metrics. Building upon these insights, Section 6 summarizes current challenges and outlines future research directions, including HDR imaging, extended depth of field, high-speed and real-time reconstruction, as well as the transferability and interpretability of deep learning methods. Finally, Section 7 concludes the paper by summarizing research progress and providing an outlook on future trends.

2. Fringe-Structured-Light 3D Reconstruction Approach

FPP and PMD are the two mainstream approaches in fringe-structured-light measurement, respectively, suited for 3D measurements of diffuse and specular surfaces. Although their system architectures differ, both methods fundamentally rely on projecting or displaying sinusoidal fringe patterns and utilizing the modulation effect imposed by the target object to recover 3D shape information [6,40,41]. As the fringe patterns undergo deformation on the object surface, their phase information directly reflects the spatial geometry of the surface. Therefore, a deterministic physical mapping exists between the phase and either depth (in FPP) or surface gradient (in PMD) [42,43]. With high-precision phase retrieval and phase unwrapping algorithms, FPP systems can construct depth maps, while PMD systems can reconstruct surface gradients and further recover the shape. Overall, the core pipeline of different fringe-structured-light 3D reconstruction methods can be abstracted as a physical sequence of “fringe modulation-phase retrieval-shape mapping.” The following sections will provide a step-by-step explanation of this reconstruction process. A representative experimental setup and workflow of FPP are illustrated in Figure 2, where the projector and camera are arranged to acquire deformed fringe patterns from the object. The subsequent processing pipeline includes phase retrieval, phase unwrapping, and mapping the recovered phase to 3D geometry, providing a concrete example of the general reconstruction process.

2.1. Wrapped-Phase Extraction

In the fringe analysis process, the primary task is to obtain the wrapped phase of the fringe pattern. Commonly used methods for wrapped-phase extraction include the phase-shifting method [45,46,47], wavelet transform method [48], and Fourier transform method [49]. Among these, the phase-shifting method has become the most widely adopted technique due to its high computational accuracy, strong robustness, and low sensitivity to environmental changes and noise [50]. In structured-light projection, sinusoidal fringe patterns are commonly adopted instead of binary patterns. The reason is that sinusoidal fringes provide smoother intensity transitions, leading to higher measurement accuracy and stronger robustness against noise and nonlinear response of the projector or camera. A typical implementation is the N-step phase-shifting method [51], where the generation of sinusoidal fringes can be described by the following equation:

I (x, y) = I_{0} + I_{m} cos (\frac{2 π}{P} x)

(1)

where

I (x, y)

denotes the projected fringe intensity at pixel

(x, y)

;

I_{0}

is the minimum projection intensity, representing the lowest brightness level of the sinusoidal fringe;

I_{m}

is the peak projection intensity, corresponding to the maximum brightness level; P is the fringe period; and x denotes the spatial coordinate along the fringe direction.

The projected fringe pattern from the projector can be described as follows:

I_{n} (x, y) = I_{A} + I_{B} cos [φ (x, y) - n \frac{2 π}{N}]

(2)

where

(x, y)

denotes the coordinates of a pixel in the 2D image;

I_{n} (x, y)

represents the intensity value at that pixel, i.e., the brightness or grayscale value of the image;

I_{A}

is the background intensity, which includes ambient light and the unmodulated portion of the signal;

I_{B}

denotes the modulated intensity, which is related to the reflectivity of the object’s surface;

n = 0, 1, 2, \dots, N - 1

is the number of phase shifts; and

φ (x, y)

is the phase at the pixel to be retrieved. According to the least squares method, the wrapped phase of the object can be calculated as follows:

φ (x, y) = arctan (\frac{\sum_{n = 0}^{N - 1} I_{n} (x, y) sin (\frac{2 π n}{N})}{\sum_{n = 0}^{N - 1} I_{n} (x, y) cos (\frac{2 π n}{N})})

(3)

The Fourier transform method is a single-frame phase extraction technique based on frequency-domain analysis. In this approach, a sinusoidal fringe pattern with a specific frequency is projected onto the object. The captured image is then transformed into the frequency domain, where filtering operations are applied to isolate the fundamental frequency component. An inverse Fourier transform is subsequently performed to recover the phase information of the fringe pattern. The primary advantage of this method lies in its ability to compute the phase from just a single image, making it well-suited for dynamic objects or real-time measurement scenarios.

According to Euler’s formula, the fringe image can be expressed as follows:

\begin{matrix} I (x, y) & = I_{A} + I_{B} cos [φ (x, y) + 2 π f_{0} x] \\ = I_{A} + I_{c} + I_{c}^{'} \\ = I_{A} + \frac{1}{2} I_{B} e^{i (φ (x, y) + 2 π f_{0} x)} + \frac{1}{2} I_{B} e^{- i (φ (x, y) + 2 π f_{0} x)} \end{matrix}

(4)

Applying the Fourier transform to Equation (3) along the x-direction yields the following:

I (f) = I_{A} (f) + I_{c} (f - f_{0}) + I_{c}^{'} (f + f_{0})

(5)

The Fourier spectrum of the fringe image primarily consists of three frequency bands: the

- 1

st order

I_{c} (f - f_{0})

, the 0th order

I_{A} (f)

, and the

+ 1

st order conjugate component

I_{c}^{'} (f + f_{0})

. Among these, the 0th order component represents the zero-frequency term and reflects the background intensity distribution, while the

\pm 1

st order components contain the essential phase information of the fringe pattern.

In practical applications, a band-pass filter is typically applied to retain the +1st order component and suppress the other frequency components, thereby enhancing the accuracy of phase extraction. The retained component is then subjected to an inverse Fourier transform, yielding

I_{c} = \frac{1}{2} I_{B} (cos (φ (x, y) + 2 π f_{0} x) + i sin (φ (x, y) + 2 π f_{0} x))

(6)

The real and imaginary parts of the Fourier spectrum of the fringe image can be expressed as follows:

\begin{matrix} Re {I_{c}} & = \frac{1}{2} I_{B} cos (φ (x, y) + 2 π f_{0} x) \end{matrix}

(7)

\begin{matrix} Im {I_{c}} & = \frac{1}{2} I_{B} sin (φ (x, y) + 2 π f_{0} x) \end{matrix}

(8)

Therefore, the wrapped phase of the object can be expressed as follows:

φ (x, y) = arctan (\frac{Im {I_{c}}}{Re {I_{c}}})

(9)

It is important to note that the obtained

φ (x, y)

is the wrapped phase, with values that are confined within the range

(- π, π]

and exhibit periodic discontinuities. Therefore, a subsequent phase unwrapping step is required to eliminate these discontinuities and recover the true absolute phase, which is essential for accurate 3D reconstruction.

2.2. Phase Unwrapping Algorithms

According to the dimensional source of information utilized during the phase unwrapping process, phase unwrapping methods in structured-light 3D reconstruction can be broadly categorized into temporal-phase unwrapping (TPU) and spatial-phase unwrapping (SPU).

2.2.1. Temporal-Phase Unwrapping

TPU refers to a class of methods that project multiple fringe patterns with different frequencies or encodings, and compute the absolute phase independently for each pixel based on the temporal variation in grayscale intensity. These methods do not rely on spatial continuity of the phase map, making them highly robust for surfaces with steep variations, discontinuities, or occlusions [52]. Depending on the type of modulation encoding used, TPU methods can be further classified into the following: Gray-code Phase Unwrapping, Multi-frequency Phase Unwrapping, Multi-wavelength Phase Unwrapping.

Gray-code Phase Unwrapping is a typical temporal-phase unwrapping method that combines structured encoding projection with the phase-shifting technique. It is widely used for absolute phase reconstruction tasks. The fundamental principle is as follows: a set of Gray-code patterns is first projected to encode the fringe periods pixel by pixel, allowing for the precise determination of each pixel’s fringe order. Subsequently, a set of sinusoidal phase-shifted fringe patterns is projected, from which the wrapped phase is extracted using a phase-shifting algorithm [53]. By integrating the encoded fringe order from the Gray-code and the wrapped phase from the phase-shifting method, the wrapped phase within the interval

(- π, π]

can be converted into a globally continuous absolute phase, enabling accurate 3D shape reconstruction. The encoding and decoding process is illustrated in Figure 3a.

Multi-frequency Phase Unwrapping is a representative temporal-phase unwrapping method. As illustrated in Figure 3b, this method utilizes the phase information obtained from low-frequency fringe patterns to assist in unwrapping the wrapped phase of high-frequency fringe patterns, thereby achieving a balance between high measurement accuracy and a large measurement range. Typically, this method involves projecting two or more sets of sinusoidal fringe patterns with different spatial frequencies and extracting the wrapped phase from each set independently [40].

\{\begin{matrix} Φ_{h} (x, y) & = φ_{h} (x, y) + 2 π k_{h} (x, y) \\ Φ_{l} (x, y) & = φ_{l} (x, y) + 2 π k_{l} (x, y) \\ Φ_{h} (x, y) & = \frac{f_{h}}{f_{l}} Φ_{l} (x, y) \end{matrix}

(10)

where

Φ_{h} (x, y)

and

Φ_{l} (x, y)

represent the unwrapped absolute phases of the high- and low-frequency fringes, respectively;

φ_{h} (x, y)

and

φ_{l} (x, y)

denote the wrapped phases extracted from the high- and low-frequency fringe patterns using the phase-shifting method;

k_{h}

and

k_{l}

are the fringe orders of the high- and low-frequency patterns, respectively; and

f_{h}

and

f_{l}

are the corresponding spatial frequencies of the projected fringe patterns.

To further resolve the fringe order

k_{h} (x, y)

, Equation (10) provides a rounding-based formulation that exploits the relationship between the high- and low-frequency wrapped phases. Specifically, the difference between the scaled low-frequency phase

\frac{f_{h}}{f_{l}} Φ_{l} (x, y)

and the high-frequency wrapped phase

φ_{h} (x, y)

is normalized by

2 π

and then rounded to the nearest integer. This process effectively determines the correct fringe order by constraining the phase discrepancy within a

2 π

range, thereby enabling the reliable recovery of the absolute high-frequency phase.

k_{h} (x, y) = Round (\frac{\frac{f_{h}}{f_{l}} Φ_{l} (x, y) - φ_{h} (x, y)}{2 π})

(11)

Once the fringe order

k_{h}

is determined, the absolute phase can be progressively recovered across different frequencies.

Multi-wavelength phase Unwrapping is a temporal technique that leverages the principle of beat frequency. As illustrated in Figure 3c, its core idea is to project multiple sets of sinusoidal fringe patterns with closely spaced spatial frequencies (or equivalently, wavelengths) to synthesize a phase map with a significantly extended equivalent wavelength. This synthetic phase greatly increases the unambiguous measurement range and effectively mitigates phase ambiguity, thereby improving the robustness and accuracy of the final reconstruction.Typically, two sets of fringe patterns with closely spaced frequencies are used, denoted by spatial frequencies

f_{1}

and

f_{2}

. The resulting synthetic phase map

φ_{e q} (x, y)

and equivalent wavelength

λ_{e q}

can be expressed as follows:

φ_{e q} (x, y) = φ_{2} (x, y) - φ_{1} (x, y)

(12)

λ_{e q} = \frac{1}{f_{e q}} = \frac{λ_{1} λ_{2}}{λ_{1} - λ_{2}}

(13)

Therefore, the fringe order

k_{2}

can be expressed as follows:

k_{2} (x, y) = Round (\frac{\frac{λ_{e q}}{λ_{2}} φ_{e q} (x, y) - φ_{2} (x, y)}{2 π})

(14)

2.2.2. Spatial-Phase Unwrapping

Unlike temporal-phase unwrapping, spatial-phase unwrapping techniques utilize phase information from neighboring pixels in space. By comparing phase differences between adjacent pixels, the method progressively removes the periodic discontinuities in the wrapped phase and recovers the true surface phase of the object. However, phase unwrapping errors in this approach tend to propagate from high-noise regions to low-noise areas and beyond. The computational strategies for spatial-phase unwrapping are generally divided into two categories: path-following local methods and path-independent global methods [54]. Among them, quality-guided unwrapping and branch-cut algorithms are representative local methods, while unweighted and weighted least-squares methods belong to the global category. Global phase unwrapping methods are typically based on the least-squares principle, which transforms the phase unwrapping problem into algebraic equations or matrix solutions to obtain a globally optimal result [55,56,57,58,59]. The basic idea is to convert the measured phase gradient field into a system of linear equations and recover the unwrapped phase through least-squares solutions (e.g., QR decomposition, i.e., orthogonal–triangular decomposition, or algebraic number theory methods). Although such methods are theoretically well-supported by algebraic and statistical tools, in practice, they tend to be sensitive to noise, less accurate in the presence of occlusions or fringe discontinuities, and computationally demanding, making them unsuitable for real-time applications. In contrast, local methods demonstrate greater robustness in handling noise, discontinuities, and complex surfaces, and thus remain the mainstream approaches in current research and applications.

Quality-Guided Phase Unwrapping has been widely studied due to its efficiency and speed [51,60]. This method evaluates the quality of the wrapped phase using a quality map, and applies a flood-fill algorithm to initiate unwrapping from high-quality regions. This strategy effectively limits the propagation of unwrapping errors into low-quality areas, thereby enhancing both accuracy and stability. Su et al. proposed a reliability-guided phase unwrapping method based on parameter mapping, in which one or more parameters—such as modulation of the fringe pattern, spatial frequency, phase differences between neighboring pixels, and signal-to-noise ratio—are used to construct a parameter map. The phase unwrapping path is then guided by the high-reliability regions of this map. As illustrated in Figure 4, this approach effectively confines phase unwrapping errors to localized areas and demonstrates strong robustness [61].

Branch-Cut Phase Unwrapping, also known as the Goldstein algorithm, was first proposed by Goldstein in 1988 [62], and is a commonly used path-dependent phase unwrapping algorithm. The main steps include the following: (1) identifying and labeling the polarity of phase residues; (2) constructing branch cuts to connect all residues and ensuring that the sum of the polarity values on each branch cut is zero; (3) bypassing the branch cuts during the unwrapping process and using the phase information from neighboring unwrapped pixels to unwrap the residues. Compared with quality-guided phase unwrapping, the branch-cut method offers stronger noise resistance. By constructing branch cuts and preventing error propagation, it effectively reduces the impact of noise on phase unwrapping.

However, the branch-cut method also has some limitations. In regions where phase residues are densely distributed, incorrect branch cuts may be generated, or the constructed branch cuts may not be globally shortest, which could lead to unwrapping errors. In addition, branch cuts may form closed loops, resulting in the “island effect,” which further aggravates local error accumulation. Therefore, the performance of the branch-cut method is highly dependent on the placement of cuts. If the noise level is high, significant unwrapping errors may occur. To address these problems, subsequent research has introduced several improvements to the Goldstein algorithm. For example, Huntley proposed placing artificial barriers or using independent unwrapping paths to avoid noise propagation and obtain unique and accurate phase unwrapping results [63]. Zheng introduced a random search-based method for locating branch cuts, which improves computational speed and solves the inaccuracy issue of branch cut construction in the Goldstein algorithm [64]. Gdeisat et al. proposed increasing the number of residues in the wrapped-phase map to improve unwrapping accuracy, but this method is computationally intensive and time-consuming [65]. To address this issue, Du et al. proposed a simplified algorithm that significantly speeds up computation, reducing processing time by more than 50% and effectively improving measurement efficiency [66].

2.3. 3D Shape Reconstruction from Phase

The recovery of 3D surface shape relies on the mapping relationship between phase and spatial geometry. In general, the phase information reflects the geometric modulation of fringe patterns on the surface of the measured object, and the degree of modulation depends on the optical path variation caused by the surface geometry. To reconstruct the 3D coordinates from the phase, PMD and FPP techniques each establish distinct geometric mapping models.

2.3.1. 3D Shape Recovery in PMD

PMD is an optical measurement technique specifically designed for 3D reconstruction of specular or highly reflective surfaces. As shown in the top part of Figure 5a, a typical PMD system consists of a liquid crystal display (LCD), a camera, and a computer. The computer generates sinusoidal fringe patterns and displays them on the LCD. These patterns are reflected by the mirror-like surface of the object and then captured by the camera. Because the specular surface geometrically modulates the fringe pattern, the captured image contains phase distortion information caused by variations in the surface normal [67,68]. After extracting the wrapped phase from the captured fringe images using techniques such as phase-shifting, the system uses a geometric model and calibration parameters to convert the phase information into the surface gradient data of the object [69]. Since the phase is proportional to the deflection angle of the surface normal vector, PMD essentially measures a gradient field that reflects the surface slope. To reconstruct the full 3D shape of the object, this gradient field must be numerically integrated over the 2D image plane to recover the relative height at each pixel, thereby producing the complete 3D surface profile [70].

In recent years, researchers have proposed a Direct Phase-Measuring Deflectometry (DPMD) system based on a dual-LCD and dual-camera setup. This architecture is designed to bypass the complex gradient integration process required in traditional PMD, enabling direct height reconstruction of specular objects [71,72]. As shown in the bottom part of Figure 5a, this method captures sinusoidal fringe patterns reflected from both a reference plane and the measured specular surface, using two LCD screens and two cameras. Each camera simultaneously acquires the distorted fringe images along two different optical paths, thereby recording the phase variations corresponding to these paths. When the fringe patterns are reflected by the object and the reference plane, the images captured by the cameras contain the phase difference between the two reflection paths. Through system calibration, this phase difference can be directly mapped to height differences on the object surface, effectively eliminating the gradient integration step required in traditional PMD. The modeling principles and technical details of this method will be further discussed in Section 3.2.

2.3.2. 3D Shape Recovery in FPP

FPP is an active optical 3D measurement technique based on phase encoding, widely used for measuring diffuse reflective surfaces. It has attracted significant attention due to its simple structure, high accuracy, and broad applicability. The core principle of FPP is to project periodic sinusoidal fringe patterns onto the surface of the measured object under known geometric relationships between the projection direction and the camera’s viewing angle. The fringe patterns are distorted by the surface geometry of the object. After being captured by the camera, the 3D shape of the surface can be reconstructed through phase decoding.

As illustrated in the top part of Figure 5b, a typical FPP system consists of a projector, the object being measured, and a camera. A geometric imaging model is established among these three components through spatial calibration. The computer controls the projector to display a sequence of phase-shifted sinusoidal fringe patterns onto the object’s surface, while the camera synchronously captures the deformed fringe images. According to the procedure described in Section 2.2, the absolute phase of the object can be retrieved. Once phase unwrapping is completed and phase discontinuities are removed, the phase information becomes spatially continuous [73]. After obtaining the absolute phase, the system must convert the phase values into the actual 3D coordinates of the object surface using a calibration model. The core task of this model is to establish a mapping between the absolute phase and the spatial geometric information. Depending on the modeling approach, these calibration models are generally classified into two categories: the phase-height model and the triangulation model [74].

The phase-height model is a method that establishes a direct functional relationship between phase and height using multiple reference planes with known elevations. It is well-suited for scenarios where the object is located near the reference plane and the surface variation is relatively smooth. Common phase-height models can be generally classified into three categories, linear models [75], inverse linear models [76], and polynomial models [77,78].

A classic phase-height model is illustrated in the bottom part of Figure 5b, where

Δ Φ_{D E} (x, y)

denotes the phase difference between the object and the reference plane,

O_{p}

,

O_{c}

represent the optical centers of the projector and camera, respectively, l denotes the baseline distance between them, d is the vertical distance between the camera and the reference plane, and p is the width of a projected stripe on reference plane. Let B be a point on the surface of the measured object, and let h denote the height of point B relative to the reference plane. According to the principle of triangulation, the height h of point B can be expressed as [79].

h = \frac{Δ Φ_{D E} \cdot p \cdot d}{Δ Φ_{D E} \cdot p + 2 π l}

(15)

where p, l, and d are the parameters that need to be calibrated in the phase-height model.

If the measurement system satisfies

l ≫ \bar{D E}

, and the actual height distribution of the object is not uniform, then according to Equation (14), the linear phase-height relationship can be expressed as follows:

h (x, y) = \frac{Δ Φ_{D E} (x, y) \cdot p (x, y) \cdot d}{2 π l} = k (x, y) Δ Φ_{D E} (x, y)

(16)

where

k (x, y)

is a proportional coefficient to be calibrated, which can be obtained through least-squares fitting using known heights from a set of reference planes. To improve modeling accuracy, phase values are typically collected at multiple height levels, and pixel-wise fitting is performed to determine

k (x, y)

, thereby yielding more accurate local reconstruction results. The linear phase-height model is simple to implement and computationally efficient, making it suitable for fast measurement tasks. However, when the system’s structural parameters do not satisfy the approximation condition (

l ≫ \bar{D E}

), the accuracy of the linear model degrades significantly.

To relax the strict geometric assumptions required by the traditional linear model, researchers have proposed the inverse linear phase-height model. This model introduces a reciprocal relationship between phase and height, establishing a linear mapping between the reciprocal of height and the reciprocal of the phase difference.

\frac{1}{h (x, y)} = a (x, y) + b (x, y) \cdot \frac{1}{Δ Φ_{D E} (x, y)}

(17)

where

a (x, y)

and

b (x, y)

are the calibration coefficients to be determined for each pixel. This model allows for more flexible configurations of the camera and projector, requiring only a shared field of view for measurement, without the need for strict coplanarity between the projection path and the reference plane. By applying least-squares fitting using multiple reference planes with known heights, the coefficients

a (x, y)

and

b (x, y)

can be efficiently determined, thus completing the system calibration. It is worth noting that Equation (16) can be rearranged as follows:

Δ Φ_{D E} (x, y) = h (x, y) Δ ϕ_{D E} (x, y) a (x, y) + h (x, y) b (x, y)

(18)

Although the two equations mentioned above appear to be different forms of the same expression, in practical applications, Equation (16) is more susceptible to noise, which can lead to significant error amplification in regions with large object height, indicating its dependency on object height [77,80]. In contrast, Equation (17) demonstrates stronger robustness against noise.

By further rearranging Equation (18), we obtain the following:

h (x, y) = \frac{Δ Φ_{D E} (x, y)}{a (x, y) Δ ϕ_{D E} (x, y) + b (x, y)}

(19)

This equation reflects the nonlinear relationship between the phase difference

Δ Φ_{D E} (x, y)

and the object height

h (x, y)

[77]. However, the nonlinear fitting process depends heavily on the initial values of

a (x, y)

and

b (x, y)

, which can affect the overall calibration accuracy and system stability. To address this issue, some researchers have proposed using polynomial fitting to model the nonlinear relationship more flexibly [78]. In this case, the height

h (x, y)

can be expressed as a polynomial function of the phase difference:

h (x, y) = \sum_{i = 0}^{n} a_{i} (x, y) {[Δ Φ_{D E} (x, y)]}^{i}

(20)

It is worth noting that although increasing the polynomial order can improve the accuracy of fitting the nonlinear relationship, an excessively high order may lead to Runge’s phenomenon [81]. Therefore, the degree of the polynomial should be carefully selected to balance fitting accuracy and model stability.

In the phase-height models described above, the system typically does not perform geometric modeling or calibration of the camera and projector. Instead, it fits a functional relationship between phase and height through empirical calibration. In contrast, the triangulation model requires precise calibration of both the camera and the projector in order to recover the 3D coordinates of the object’s surface using the principle of triangulation. A projector can be treated as an inverse camera, and its geometric parameters can be calibrated using methods similar to those used for cameras. However, unlike a camera, the projector cannot directly form an image. Therefore, it requires the assistance of a reflective surface—either the measured object or a reference plane—to reflect fringe patterns, and relies on phase encoding to establish the correspondence between projector pixels and camera pixels. In this process, phase information plays a key role in pixel matching.

As illustrated in Figure 6, projector calibration typically involves projecting vertical and horizontal fringe patterns. Using phase-shifting and temporal-phase unwrapping algorithms, the absolute phase in the vertical direction,

Φ_{v} (x_{c}, y_{c})

, and the absolute phase in the horizontal direction,

Φ_{h} (x_{c}, y_{c})

, can be obtained for each pixel.

Assuming the projector resolution is

H_{p} \times W_{p}

, with

n_{v}

vertical fringes and

n_{h}

horizontal fringes in the projected patterns, a camera pixel at

{(x_{c}, y_{c})}^{T}

corresponds to a point

{(x_{p}, y_{p})}^{T}

on the projector pixel plane. The coordinates can be computed as follows:

\begin{matrix} x_{p} & = \frac{Φ_{v} (x_{c}, y_{c}) W_{p}}{2 π n_{v}} \end{matrix}

(21)

\begin{matrix} y_{p} & = \frac{Φ_{h} (x_{c}, y_{c}) H_{p}}{2 π n_{h}} \end{matrix}

(22)

The projector can then be calibrated by following the same procedure as camera calibration [82].

3. Evolution and Advances of PMD Systems

PMD is a 3D measurement technique based on the laws of optical reflection, specifically designed for reconstructing the 3D shape of highly smooth, specular surfaces. The fundamental idea is to project sinusoidal phase-shifted fringe patterns onto a display screen and to use a camera to capture the modulated images of these patterns reflected from the object’s surface. Phase information is then extracted from the captured images to infer the surface normals or height distribution of the object. PMD is essentially a reflection-based structured-light method, and it is closely related in principle to Moiré deflectometry [83,84,85,86] while offering stronger advantages in terms of measurement accuracy, dynamic range, and system adaptability [87,88,89,90,91,92]. Depending on the system configuration, existing PMD systems can be categorized into three types: Single-screen and single-camera PMD systems, Multi-screen direct PMD systems, and Multi-camera stereo PMD systems [34].

3.1. Single-Screen and Single-Camera Systems

Among all PMD configurations, the single-screen and single-camera system has been widely adopted in both early and contemporary research on specular surface 3D measurement, due to its compact structure and minimal construction complexity [93]. This system typically consists of an LCD, a camera, and a computer. The computer controls the screen to project a sequence of sinusoidal fringe patterns onto the surface of the specular object. The camera, positioned in the reflection direction, captures the modulated fringe patterns. Through phase extraction and surface reconstruction algorithms, the 3D geometry of the surface is recovered.

As shown in Figure 7, typical single-screen single-camera PMD systems can be modeled using three different approaches, paraxial approximation model, reference-plane-based model, and surface estimation and reprojection model. The paraxial approximation model assumes small incidence and reflection angles, making it well-suited for standard specular surface measurement tasks but less accurate for large-angle scenarios. In contrast, the planar reference-based model introduces a physical reference plane to extend the applicable range, though its accuracy depends on precise calibration. The reprojection model further relaxes the small-angle constraint by incorporating full geometric relationships, thereby achieving higher accuracy in complex or large-angle measurement conditions.

The paraxial approximation model has been widely used in standard specular surface measurement tasks. This approach typically assumes that the angle between the reflected fringe direction and the surface normal is small, allowing a simplified phase-to-height mapping to be established. Based on this assumption, Häusler et al. proposed a compact single-screen single-camera PMD system suitable for objects with relatively small surface variations [94]. Later, Liu et al. further optimized the geometry of this model to maintain high measurement accuracy even when measuring mildly curved surfaces [95]. Due to its mathematical simplicity and ease of implementation, the paraxial model has been adopted in many studies and has become a classical configuration in early PMD research and industrial applications. However, this model struggles to maintain accuracy when measuring complex specular surfaces with high curvature or sharp geometric variations, limiting its applicability in high-precision tasks.

To overcome the limited measurement flexibility inherent in the paraxial approximation model, researchers have proposed the reference-plane-based model, as illustrated in Figure 7b. This method assumes that the measured specular object is adjacent or approximately parallel to a known reference plane in space. By leveraging a geometric relationship among three key points—the image point P, the object point S, and the projection point Q—the surface gradient at point S can be derived. Compared to the paraxial approximation model, this model places fewer constraints on the geometric configuration of system components, offering greater flexibility. Huang et al. developed a fast measurement system based on this structure and used the Windowed Fourier Transform algorithm to achieve dynamic 3D reconstruction from a single-frame image, successfully capturing temporal deformations of water surface perturbations [96]. Li et al. further investigated the impact of reference plane positioning errors on measurement accuracy and introduced dual-laser-assisted positioning and confocal white-light distance sensors to improve spatial localization of the reference plane [97]. However, this model is mainly applicable to nearly flat surfaces. For objects with significant curvature or large deviations from the reference plane, its measurement accuracy degrades noticeably.

The surface estimation and reprojection model, as illustrated in Figure 7c, represents a more advanced modeling framework for PMD systems, specifically developed to address the challenges associated with measuring highly curved and complex surfaces.Unlike previous models, it does not rely on a flat reference plane or paraxial assumptions. Instead, it uses a coarsely estimated surface shape as a substitute for the reference plane and iteratively refines both surface shape and normal vectors based on reflective geometry principles. Within this framework, Bothe et al. achieved high-precision measurements for various highly reflective objects, including metals, transparent plastics, and glass, and demonstrated the model’s broad applicability to complex targets [98]. Su et al. developed the Software Configurable Optical Test System, which is used for 3D measurement of large curved mirrors in astronomical telescopes. The system iteratively improves measurement accuracy through reprojection optimization [99]. It is worth noting, however, that this model relies heavily on the accuracy of the initial surface estimate. Significant estimation errors can lead to substantial reconstruction deviations. To address this issue, some studies have used external devices such as coordinate measuring machines (CMM) to acquire coarse surface data. Nevertheless, achieving high-precision registration between the coordinate system of the CMM and the PMD system remains a critical challenge in practical deployment. In response, Xu et al. proposed a calibration method that integrates the manufacturing system and the PMD measurement system, directly establishing the spatial relationship between the two for real-time surface estimation in online measurement environments [100,101].

3.2. Multi-Screen Direct PMD

To overcome the limitations of single-screen PMD systems in terms of surface normal estimation accuracy and visible measurement area, multi-screen configurations in direct PMD have been developed. As shown in Figure 8, such systems incorporate two or more display screens into the scene, allowing the viewing ray reflected from the measured point to pass through multiple known fringe patterns. This enables more stable and accurate reconstruction of surface normal [102,103,104].

A typical working principle of multi-screen PMD is illustrated in Figure 8a. Assume the camera’s viewing ray is reflected from a surface point S and sequentially passes through pixel positions

Q_{1}

and

Q_{2}

on two display screens. Given the known camera intrinsics and screen calibration data, the surface normal

\vec{n}

at point S can be derived based on a ray reflection model. However, in practical implementations, the first screen may obstruct part of the optical path, preventing the camera from seeing the second screen directly. As a result, early systems often suffered from limited visibility and required specific geometric arrangements to overcome occlusion issues. Although this approach is effective, it significantly increases system complexity and measurement time, making it unsuitable for dynamic or real-time applications. To address this issue, Li et al. proposed an improved multi-screen PMD system based on a transparent display [105]. The core idea is to use a transparent screen as the front display, allowing the camera’s line of sight to pass through it and directly observe the fringe patterns on the second screen behind. This design enables simultaneous observation of two fixed screens without any mechanical movement, greatly simplifying the system structure, improving measurement efficiency, and enhancing adaptability for wide field-of-view measurements.

DPMD is an innovative specular surface 3D measurement technique proposed in recent years. Unlike traditional PMD, which relies on gradient field integration to reconstruct 3D shape, DPMD constructs symmetric reflection paths and directly obtains the phase difference in the surface under two different optical paths. This eliminates the need for complex integration and allows for direct computation of the object’s surface height [71,72]. In this method, the camera ray is sequentially reflected by a reference plane and the measured specular surface, intersecting fringe patterns displayed on two parallel screens. When fringe patterns are displayed on two parallel screens and reflected by a specular surface, four key phase values can be obtained:

Φ_{1}

and

Φ_{2}

along the reference path, and

Φ_{1}^{'}

and

Φ_{2}^{'}

along the object path. A schematic diagram (Figure 8b) is provided to illustrate the baseline distance d between the two parallel screens and the correction factor

Δ d

accounting for possible system misalignment. Based on this geometry, the depth value h can be calculated as follows:

h = \frac{d [(Φ_{1} - Φ_{2}) - (Φ_{1}^{'} - Φ_{2}^{'})] - Δ d (Φ_{1}^{'} - Φ_{1})}{(Φ_{2} - Φ_{1}) + (Φ_{2}^{'} - Φ_{1}^{'})}

(23)

In practical systems, to achieve symmetric phase acquisition, a parallel configuration is typically constructed using one physical screen and one virtual screen created by a beam splitter. However, ensuring strict parallelism between the virtual and physical screens remains challenging and may affect the overall system accuracy. Compared to traditional PMD, DPMD exhibits better adaptability and stability when measuring specular objects with large slope variations or discontinuities, making it particularly suitable for targets with step edges or abrupt surface changes. On the other hand, since DPMD does not rely on a complete gradient field, its measurement accuracy for smooth continuous surfaces is slightly lower than that of traditional PMD methods based on gradient integration.

3.3. Multi-Camera Stereo PMD

Stereo PMD is a specular surface 3D measurement technique based on multi-sensor collaborative imaging, first introduced by Knauer et al. in 2004 [93]. This method enables multiple cameras to observe the specular object synchronously from different viewpoints. By combining the phase information of the projected fringe patterns, surface normal is estimated from each viewpoint and then matched to reconstruct the 3D shape of the target object.

As illustrated in Figure 9, a typical Stereo PMD system operates as follows: one primary camera selects a spatial point

S_{1}

, and, based on the system calibration parameters, determines its corresponding image point

P_{1}

on the screen. The corresponding phase value at this location can then be retrieved from the screen’s phase map, allowing the reflected fringe point

Q_{1}

to be identified. Using the three points

Q_{1}

,

S_{1}

, and

P_{1}

, the surface normal at point

S_{1}

can be computed according to the law of reflection. Meanwhile, a secondary (auxiliary) camera also captures the same target point

S_{1}

, producing its own image point

P_{2}

. Following the same process, a second reflection point

Q_{2}

is obtained, providing an independent estimation of the surface normal. Theoretically, the surface normals estimated from the two views should converge, allowing the recovery of the surface gradient through normal vector matching, and thereby enabling full 3D shape reconstruction. The main advantage of this method lies in its strong adaptability to surfaces with complex curvature, and its ability to achieve high reconstruction accuracy through normal matching. Studies have shown that Stereo PMD can achieve nanometer-level relative depth accuracy [106,107].

Furthermore, since it does not rely on a reference plane or prior surface estimation, it offers greater generalizability for applications involving large-scale specular surfaces or free-form reflective geometries.

4. Evolution and Advances of FPP Systems

In FPP systems, the projection module is the core front-end component for generating fringe patterns, and its performance directly determines key system metrics, including spatial resolution, projection speed, measurement depth of field, and environmental adaptability. With the continuous evolution of 3D measurement requirements—from static scenes to highly dynamic environments, from bulky setups to miniaturized devices, and from shallow-range measurements to large-depth tasks—traditional projection methods have exposed clear drawbacks. Specifically, focusing optics constrain the available depth of field, bulky hardware limits portability, and high sensitivity to ambient light reduces robustness in practical applications. In recent years, laser scanning projection technologies based on MEMS micromirrors as have attracted widespread attention due to their advantages in high precision, high speed, low power consumption, and compact structure. As shown in Figure 10, this technology achieves rapid deflection of laser beams and the generation of fringe patterns through MEMS micromirrors. Notably, the continuous advancement of MEMS technology has not only driven the development of novel projection architectures but also demonstrated strong compatibility with mainstream DLP-based systems. Given its high synergy with existing solutions and its tremendous potential in next-generation FPP systems, this chapter focuses on MEMS-based projection technologies and their applications in advanced structured-light systems.

4.1. Comparison of Mainstream Fringe Projection Technologies

In structured-light 3D measurement systems, the method of fringe pattern generation and the optical quality are among the most critical factors influencing reconstruction accuracy, robustness, and overall system performance. Different projection techniques exhibit significant differences in terms of fringe contrast, spatial resolution, refresh rate, and system size, all of which directly affect the stability of phase calculation and the system’s adaptability in dynamic or complex environments. As illustrated in Figure 11a, current mainstream fringe generation approaches can be roughly categorized into the following types: (1) optical interferometric projection, based on interference principles; (2) physical grating projection, using static optical gratings; (3) LCD-based pixel modulation projection, utilizing liquid crystal displays; (4) DLP digital projection, based on digital micromirror devices (DMD); (5) MEMS micromirror-based laser scanning projection, using micro-electro-mechanical systems. Each method has its own characteristics in terms of pattern flexibility, system complexity, cost, and suitable application scenarios. Among them, DLP projection has become the most widely adopted technique due to its high pattern flexibility and strong grayscale modulation capability. However, it typically relies on projection lenses for focusing, which limits the depth of field of the system. In contrast, MEMS micromirror projection generates fringe patterns by directly scanning a laser beam in space. This approach requires no focusing optics, and offers distinct advantages such as large depth of field, compact size, low power consumption, and mechanical simplicity, making it particularly well-suited for embedded systems, dynamic scenes, and mobile platform-based 3D measurement applications.

To further compare and analyze the practical performance of the five aforementioned structured-light projection methods, Figure 11b presents a radar chart evaluating their capabilities across five key dimensions: resolution, speed, depth of field, system size, and cost. Additionally, Table 2 summarizes the representative technical specifications of each system.

From the chart and table, it can be observed that interference-based structured-light systems generate fringe patterns through coherent beam interference, achieving sub-micron spatial resolution and excellent depth-of-field performance. These characteristics make them particularly suitable for measurements of micro/nano-scale structures and the characterization of curved surface topographies. However, such systems lack pattern programmability, impose strict requirements on environmental stability, involve complex system construction, and incur high costs, all of which limit their practical applicability. Structured-light systems based on physical gratings generate periodic fringe patterns by combining fixed grid structures with illumination sources. These systems are characterized by simple configuration and stable fringe quality, making them suitable for static measurement scenarios requiring high accuracy [109]. However, their fringe patterns are not programmable, which limits their ability to implement multi-frequency modulation or adaptive pattern adjustments. Consequently, their flexibility is significantly constrained. Furthermore, similar to interference-based projection methods, physical grating systems are non-digital and are thus inadequate for applications requiring a high diversity of fringe encodings or precise modulation in dynamic and complex environments. LCD-based structured-light systems modulate patterns by controlling the transmittance of liquid crystal elements. These systems offer advantages such as low cost, design flexibility, and low power consumption, making them well-suited for mass production at scale [110]. Nevertheless, the slow response speed of liquid crystal elements and their limited grayscale control capability result in insufficient sharpness and refresh rates for high-speed and high-precision measurements.

In addition, the pixelated structure of LCD panels introduces non-ideal responses under high-frequency fringe patterns, which negatively affects phase demodulation accuracy, thereby limiting their applicability in industrial precision inspection. DLP projection systems are currently the most mainstream digital implementation of structured-light technology. Their core component is the digital micromirror device (DMD), which enables high refresh rates, support for arbitrary pattern projection, and multi-level grayscale control [111]. In standard 8-bit mode, DLP projectors can achieve projection rates on the order of hundreds of frames per second while maintaining excellent pattern consistency and spatial resolution, making them suitable for most static and low-speed dynamic 3D reconstruction tasks. To overcome speed limitations, some studies have proposed the use of one-bit binary defocused projection strategies, enabling projection rates of over one thousand frames per second while maintaining acceptable pattern fidelity. However, DLP systems are typically equipped with front-end focusing lenses, which limit their depth of field and make them unsuitable for targets with significant depth variation or pronounced surface curvature. Moreover, the complex optical layout, large physical footprint, and high cost of DMD components present challenges for integration into portable or embedded systems. MEMS-based micromirror projection technology has steadily matured in recent years. By using single- or dual-axis resonant micromirrors to scan laser beams and generate two-dimensional fringe patterns, MEMS systems offer a significant advantage in that they can form sharp patterns without the need for focusing optics. This enables designs with ultra-large depth of field, compact size, and low power consumption. Additionally, MEMS projectors can dynamically control laser power via high-speed TTL or analog modulation, supporting wide dynamic range and frequency-controllable pattern generation. These features enable excellent real-time performance and high frame rates, making MEMS systems particularly well-suited for mobile platforms, robotic grasping, and dynamic 3D perception tasks. Owing to their beam controllability and miniaturized structure, MEMS-based solutions provide essential hardware support for the development of lightweight and intelligent structured-light systems.

In summary, MEMS-based micromirror projection technology demonstrates exceptional system integrability and environmental adaptability, owing to its lens-free configuration, large depth of field, compact size, low power consumption, and high-speed performance. These characteristics make it particularly well-suited for space-constrained, mobile, or dynamic 3D reconstruction scenarios. By employing laser beam scanning to directly render fringe patterns, MEMS projectors achieve a seamless integration of pattern precision and flexibility, effectively overcoming the trade-off constraints among volume, depth of field, and resolution typically encountered in traditional lens-based projection systems. With ongoing advancements in MEMS device fabrication precision and control algorithms, MEMS-based structured-light projection is emerging as a strong contender to DLP technology, driving the evolution of 3D reconstruction systems toward higher precision, greater miniaturization, and enhanced intelligence.

4.2. System Calibration Strategies for MEMS-Based Structured-Light Systems

Conventional structured-light systems typically employ the Phase-Height Model and the Triangulation Model for system calibration, as thoroughly reviewed in Section 2.3.2 [74]. However, due to the fundamental differences in physical mechanisms, calibration models must be adapted accordingly [112]. In particular, MEMS-scanned structured-light systems differ significantly in their projection principles and fringe generation mechanisms, rendering traditional pinhole-based projector models unsuitable for direct application. Specifically, in MEMS systems, fringe patterns are generated by a laser beam rapidly scanned by micromirrors, resulting in a dynamically varying beam incidence direction rather than a fixed projection center as in conventional projectors. This dynamic “point–line–plane” projection mechanism necessitates calibration models that incorporate the nonlinear relationship among laser scanning angles, power modulation, and camera imaging.

The design of calibration models for MEMS-based structured-light systems must take into account two essential characteristics of their projection modules: (1) the absence of focusing lenses and (2) unidirectional scanning projection [113,114]. To address these challenges, several studies have proposed calibration models tailored to MEMS micromirror scanning mechanisms. The following section introduces three representative modeling approaches: the unified model, the iso-phase surface model, and the phase-angle model.

4.2.1. Joint Calibration Model

To address the projection characteristics of MEMS micromirror-based structured-light systems, the unified model provides a physically grounded and high-precision calibration strategy. As illustrated in Figure 12a, the core idea of the unified model is to couple the spatial coordinates of the MEMS laser scanning system with the camera imaging model under a common coordinate framework, thereby establishing an analytical mapping from phase values to 3D point coordinates [32].

Since the MEMS projection process essentially involves laser beam scanning along a defined plane, it can be assumed that the spatial positions of the projected points lie on a scanning plane subject to linear constraints. Given the known distance d between the reference plane and the initial projection point, and incorporating the geometric constraints of the laser scanning trajectory, the position of the projected point in the projector coordinate system can be derived as follows:

[\begin{matrix} x_{p} \\ y_{p} \end{matrix}] = \frac{d}{Z_{p}} [\begin{matrix} X_{p} \\ Y_{p} \end{matrix}]

(24)

Meanwhile, the camera imaging process can be described by the standard pinhole projection model [82]:

[\begin{matrix} x_{c} \\ y_{c} \end{matrix}] = \frac{1}{Z_{c}} [\begin{matrix} X_{c} \\ Y_{c} \end{matrix}]

(25)

By applying the rotation matrix

R_{p c}

and translation vector

T_{p c}

to align the coordinate systems of the projector and the camera, a unified representation in the world coordinate system can be obtained:

[\begin{matrix} X_{c} \\ Y_{c} \\ Z_{c} \end{matrix}] = R_{p c} [\begin{matrix} X_{p} \\ Y_{p} \\ Z_{p} \end{matrix}] + T_{p c}

(26)

Combining Equations (23)–(25) and eliminating the intermediate variables

y_{p}

,

X_{p}

,

Y_{p}

,

Z_{p}

,

X_{c}

,

Y_{c}

, one can derive the following expression:

Z_{c} = \frac{\begin{matrix} (r_{22} r_{33} - r_{23} r_{32}) t_{1} + (r_{13} r_{32} - r_{12} r_{33}) t_{2} + (r_{12} r_{23} - r_{13} r_{22}) t_{3} + \\ (r_{12} r_{23} - r_{13} r_{22} + (r_{22} r_{33} - r_{23} r_{33}) x_{c} + (r_{12} r_{21} - r_{11} r_{22}) x_{p} / d \end{matrix}}{\begin{matrix} ((r_{22} r_{31} - r_{21} r_{32}) t_{1} + (r_{11} r_{32} - r_{12} r_{31}) t_{2} + (r_{12} r_{21} - r_{11} r_{22}) t_{3}) x_{p} / d + \\ (r_{22} r_{31} - r_{21} r_{32}) x_{c} x_{p} / d + (r_{13} r_{32} - r_{12} r_{33}) y_{c} + (r_{11} r_{32} - r_{12} r_{31}) y_{c} x_{p} / d \end{matrix}}

(27)

where

r_{i j}

and

t_{i}

represent the elements of the rotation matrix and translation vector, respectively. Each scanning position

x_{p}

corresponds to a phase value

φ

, with

φ = 2 π x_{p} / c

, where c denotes the preset fringe period constant. By substituting this phase relationship into the geometric expression of the unified model and consolidating the constant terms, two interpretable calibration models for MEMS-based structured-light systems can be further derived.

The first type is the global calibration model, which assumes that all pixels in the system share the same set of geometric and system parameters. In this model, all constant terms are incorporated into a single expression, providing a concise formulation that describes the pixel depth

Z_{c}

as a function of image coordinates

(x_{c}, y_{c})

and phase value

φ

Z_{c} = \frac{a_{1} + a_{2} Φ}{a_{3} + a_{4} x_{c} + a_{5} Φ + a_{6} x_{c} Φ + a_{7} y_{c} + a_{8} y_{c} Φ}

(28)

The second type is the per-pixel calibration model, which relaxes the unified parameter constraints imposed by the global model. This approach assumes that each pixel possesses an independent set of calibration parameters. Accordingly, in practical modeling, the constants associated with each pixel can be extracted and combined with other terms to form the following per-pixel expression [32].

Z_{c} = \frac{a_{1}^{'} + a_{2}^{'} Φ}{a_{3}^{'} + a_{4}^{'} Φ}

(29)

In both models, the unknown calibration parameters are typically solved using a linear least-squares method in conjunction with a system of homogeneous equations [74,78]. Since the above derivations are based on ideal image coordinates, while real imaging processes inevitably introduce camera distortion, it is necessary to first use the camera calibration results to convert the distorted image coordinates into ideal ones in order to ensure modeling accuracy. Moreover, MEMS-based structured-light systems generally adopt a lensless projection design, which eliminates projection distortions caused by optical lenses in the projection path, thereby simplifying the geometric modeling process.

4.2.2. Equal-Phase Surface Model

To address the calibration challenges arising from the unique imaging structure of MEMS-based projection systems, Miao et al. proposed the isophase plane model [115]. This method constructs a series of isophase light planes formed by laser scanning and leverages the geometric relationship between the camera’s imaging center and the surface reflection point to achieve pixel-wise 3D coordinate estimation. As illustrated in Figure 12b, the isophase planes can be regarded as a set of approximately parallel light planes generated at specific scanning angles. Each plane is associated with a unique phase value. By correlating the phase information received at a particular camera pixel, the intersection point between the reflected light ray and the corresponding isophase plane can be determined. Subsequently, the 3D coordinate of the measured point is derived by fitting the reflection path between the camera and the isophase planes, resulting in a mapping function between the phase and spatial coordinates.

X_{c} = \frac{1}{\sum_{n = 0}^{N} a_{n} Φ^{n}} + a_{X}, Y_{c} = \frac{1}{\sum_{n = 0}^{N} b_{n} Φ^{n}} + b_{Y}, Z_{c} = \frac{1}{\sum_{n = 0}^{N} c_{n} Φ^{n}} + c_{Z}

(30)

This method fully accounts for image distortion effects in the calibration modeling process; as a result, higher-order polynomials are often introduced in the denominator of the mapping expressions. However, to prevent overfitting caused by excessive model complexity, it is essential to carefully select the polynomial order [74]. Currently, two primary approaches are used to mitigate the impact of image distortion on system calibration. The first is based on the actual image coordinates and employs polynomial fitting to suppress image noise. The second approach involves converting all image coordinates into ideal coordinates using the intrinsic camera parameters before performing modeling. Experimental results have shown that both strategies can achieve satisfactory calibration performance in MEMS-based structured-light systems.

4.2.3. Phase-Angle Model

As shown in Figure 12c, the phase-angle model is a calibration method that directly relates the geometry of laser beam propagation to phase information, and it is particularly well-suited for laser scanning projection mechanisms commonly found in MEMS-based structured-light systems [116,117]. During the scanning process of a MEMS micromirror, each isophase position corresponds to a unique scanning angle. Therefore, given a known phase value and combined with geometric constraints, the three-dimensional spatial coordinates of a specific reflection point can be inferred.

In this model, the laser beam at a specific phase value

φ

corresponds to a unique scanning direction, and the reflected rays associated with different phase values exhibit a strictly linear relationship along the projection path. Based on this, and in combination with the camera imaging model, a direct mapping can be established between the pixel coordinates

(u, v)

, the phase

Φ

, and the spatial coordinates

(X_{c}, Y_{c}, Z_{c})

of the reflection point on the measured object. By introducing auxiliary parameters A, B, C, and D, the complex geometric computation can be simplified into the following expression:

X_{c} = \frac{- u D}{u A + v B + C}, Y_{c} = \frac{- v D}{u A + v B + C}, Z_{c} = \frac{- D}{u A + v B + C}

(31)

The primary advantage of this method lies in its independence from the need for an explicit calibration board that fully covers the camera’s field of view, as well as from the complex gradient integration process required in traditional models. This significantly simplifies the system calibration workflow. Additionally, the phase-angle model demands relatively low quantities and precision of image acquisition, allowing for complete system calibration using only a subset of captured images. This makes it particularly suitable for embedded systems or online measurement scenarios where computational resources and time are constrained.

4.3. Analysis of Systematic and Random Errors

Similar to conventional structured-light systems, MEMS-based structured-light systems are also subject to a range of common error sources. However, due to their distinctive laser scanning principles and physical implementation mechanisms, MEMS systems exhibit a series of unique error factors. These errors manifest throughout various stages of the 3D reconstruction pipeline, spanning from fringe pattern projection and image acquisition to phase extraction and the final generation of 3D point clouds [118,119,120,121,122,123]. Specifically, the error sources in MEMS structured-light systems include unstable motion of the scanning mirror, non-ideal line width of the laser stripe [124], noise from the laser source, initial phase shift errors in the mechanical rotation of the scanning mirror, and misalignment between the laser optical axis and the scanning mirror’s rotational axis [116].

4.3.1. Random Errors

In the process of structured-light 3D reconstruction, fringe pattern projection and image acquisition are two core components. Random noise, as an inevitable source of disturbance, can significantly impact the accuracy of phase extraction and 3D reconstruction. In MEMS-based structured-light systems, the primary sources of random errors include intensity fluctuations of the laser (source noise), imaging noise during camera acquisition (such as readout noise and dark current noise), and temporal or spatial jitter induced by instability during the resonant scanning of the MEMS micromirror. These noise sources manifest in the captured images as localized or global grayscale disturbances, leading to random deviations in the computed phase. In phase calculation, when using the N-step phase-shifting method to extract wrapped phase, random noise directly affects the brightness distribution of each captured frame, as illustrated in Figure 13a.

The corresponding modulation model can be described as follows [125]:

I_{n} = A + B cos (φ - δ_{n}) = A_{0} + Δ I + B cos (φ - δ_{n})

(32)

where

A_{0}

denotes the background intensity, B represents the modulation depth,

φ

is the ideal phase,

δ_{n}

is the phase shift, and

Δ I

refers to additive Gaussian white noise with zero mean and standard deviation

σ_{n}

. Based on least-squares derivation, it can be shown that the noise introduces phase errors, with a standard deviation given by the following [125,126]:

σ_{φ} = \sqrt{\frac{2}{N} \cdot \frac{σ_{n}}{B}}

(33)

If the fringe frequency is f, then after phase unwrapping, the phase range expands from

2 π

to

2 π f

. When the absolute phase is compressed back to the range of

[- π, π)

, the standard deviation of the phase error becomes the following:

σ_{φ} = \sqrt{\frac{2}{N} \cdot \frac{σ_{n}}{B f}}

(34)

To effectively mitigate the impact of random noise on the 3D reconstruction accuracy of MEMS-based structured-light systems, three optimization strategies can be considered. First, increasing the number of phase shifts can significantly enhance the robustness of phase computation and reduce noise-induced fluctuations; however, this also leads to longer acquisition times, which may compromise system efficiency. Second, improving the modulation depth of the fringe pattern is another effective means of reducing phase errors. It is important to note that the line laser used in MEMS systems is not an ideal infinitesimal beam but possesses a finite width—referred to as the “non-ideal line width”—which differs from the fringe period. This non-ideal width causes a “window smoothing effect” on the fringe pattern, thereby reducing the modulation depth. As illustrated in Figure 13b, this effect significantly degrades fringe contrast and phase quality. To address this issue, Han et al. proposed a window smoothing model and developed an optimal fringe number recommendation algorithm that can automatically determine the most suitable fringe frequency combination based on system parameters to achieve optimal reconstruction performance [124]. Finally, reducing image-level random noise is also crucial for improving phase stability. In recent years, with advances in deep learning, convolutional neural network (CNN)-based image denoising techniques [127] have been widely applied in structured-light systems. These methods can effectively suppress random noise while preserving image details, thereby further enhancing the accuracy and robustness of 3D reconstruction.

4.3.2. Impact of Line Laser Intensity Fluctuations

The standard phase-shifting method typically assumes that the background intensity and modulation amplitude remain constant across different phase-shifted fringe patterns at the same pixel location. However, in practical measurement environments, this assumption often does not hold due to fluctuations in the intensity of the line laser source. Such intensity fluctuations cause variations in both the background illumination and the modulation amplitude across phase-shifted images, which in turn lead to phase errors. The influence of light intensity fluctuations on phase-shifted fringes can be modeled using the following equation:

I_{n} = p_{n} [A + B cos (ϕ - δ_{n})] + q_{n}

(35)

where

p_{n}

denotes the proportional coefficient of the line laser intensity fluctuation, and

q_{n}

represents the additive component of the fluctuation. These two factors cause variations in the background intensity or modulation amplitude across different phase-shifted fringe images, thereby introducing phase errors. By substituting the phase-shifted images—with both background intensity offsets and modulation amplitude deviations—into the N-step phase-shifting expression, the resulting phase error can be derived [128]:

Δ ϕ \approx \frac{2}{N B} \sum_{n = 1}^{N} \frac{(p_{n} - 1) A + q_{n}}{p_{n}} sin (δ_{n} - ϕ)

(36)

To mitigate the impact of line laser intensity fluctuations on 3D reconstruction accuracy, the most direct hardware-level solution is to employ a laser source with stable output. A stable laser can fundamentally reduce intensity variations at the source, thereby avoiding phase extraction errors caused by light source instability. On the software level, post-processing techniques can effectively compensate for errors induced by intensity fluctuations. For instance, Liu et al. proposed an iterative self-calibration algorithm that rapidly extracts the phase components from fringe images and accurately compensates for deviations in background intensity and modulation amplitude [129]. This method enhances phase extraction accuracy through iterative optimization and maintains robust reconstruction performance even under unstable illumination. In addition, Lu et al. developed a histogram-based segmentation approach, in which each phase-shifted image is segmented and corrected via a linear gray-level transformation to compensate for background intensity and modulation amplitude shifts [130]. By adjusting the gray levels, this method effectively eliminates deviations caused by intensity fluctuations, thereby improving phase accuracy. Chen et al. proposed two real-time correction methods specifically designed to address source instability [131]. These techniques utilize dynamic mapping functions to correct phase errors in real time as they evolve over time. Such correction strategies not only counteract the influence of an unstable light source but also enable adaptive adjustment in dynamic environments, ultimately enhancing the precision of 3D reconstruction.

4.3.3. High-Order Harmonics

In traditional structured-light systems, the Gamma effect or system nonlinearities typically introduce higher-order harmonic errors [132,133]. This issue becomes even more pronounced in emerging structured-light systems based on MEMS micromirror scanners, where mechanical rotational errors of the mirror lead to the coupling of higher-order harmonics into the projected fringe patterns. Furthermore, when the input–output characteristics of the laser source are not accurately calibrated, similar harmonic distortions may arise. The presence of higher-order harmonics contaminates the captured images, distorting the fringe patterns and thus compromising the accuracy of phase extraction. These distortions can be mathematically described using a Fourier series expansion [118]. Let

a_{i}

denote the amplitude of the i-th harmonic component; then, the distorted fringe image containing higher-order harmonics can be expressed as follows:

I_{n}^{c} (x, y) = a_{0} + \sum_{m = 1}^{\infty} a_{i} cos (i (ϕ (x, y)) + φ_{n})

(37)

When using the N-step phase-shifting method for phase extraction, the phase error introduced by higher-order harmonics can be derived using the following expression [119,120,121]:

Δ ϕ = {tan}^{- 1} [\frac{\sum_{m = 1}^{\infty} (a_{m N + 1} - a_{m N - 1}) sin (m N ϕ (x, y))}{a_{1} + \sum_{m = 1}^{\infty} (a_{m N + 1} - a_{m N - 1}) cos (m N ϕ (x, y))}]

(38)

To mitigate the impact of higher-order harmonics on phase accuracy, one commonly adopted strategy is to increase the number of phase-shifting steps, which can effectively suppress harmonic interference. However, this inevitably leads to an increased number of required images, thereby reducing the overall reconstruction speed [122]. Therefore, the most meaningful approach is to suppress higher-order harmonic effects without significantly compromising the reconstruction efficiency.

Harmonic suppression strategies can generally be categorized into two types: active methods and passive methods. Active methods involve pre-calibration before pattern projection, whereas passive methods are implemented after the projection has occurred [123]. Specifically, Huang et al. proposed a dual three-step phase-shifting technique that enhances phase measurement accuracy by optimizing the conventional three-step phase-shifting method [134]. Cai et al. derived phase error models in both the spatial domain and the Hough Transform (HT) domain, which are used to analyze and compensate for the effects of higher-order harmonics on phase extraction [118]. Zhang et al. employed a lookup-table-based approach to correct the nonlinear errors in projectors [135]. Furthermore, Pan et al. conducted theoretical analysis on phase errors caused by non-sinusoidal waveforms and developed an iterative phase compensation algorithm to effectively reduce the impact of higher-order harmonics [136]. Song et al. proposed a system nonlinearity correction method based on mask information, where harmonic coefficients are determined using a mask image and the true phase is recovered through Gauss–Newton iteration [137].

While these methods have been extensively applied in conventional DLP-based structured-light systems, harmonic suppression techniques specifically designed for MEMS-based systems remain relatively scarce. To address this, Han et al. proposed a layered phase-shifting method based on a phase-shifting superposition framework, leveraging the fact that MEMS scanning speed is typically higher than that of the camera [138]. While these methods have been extensively applied in conventional DLP-based structured-light systems, harmonic suppression techniques specifically designed for MEMS-based systems remain relatively scarce. To address this, Han et al. proposed a layered phase-shifting method based on a phase-shifting superposition framework, leveraging the fact that MEMS scanning speed is typically higher than that of the camera.

As illustrated in Figure 14a, the internal phase-shifting method projects 12 phase-shifted patterns within a single camera exposure period. These patterns are temporally superimposed into a single image using the camera’s exposure integration, effectively suppressing harmonic distortions. The external phase-shifting method then extracts the wrapped phase from these harmonic-free composite images. Experimental results demonstrate that this approach achieves the same accuracy as a conventional 12-step phase-shifting method, while requiring only three captured images. Figure 14b illustrates the sensitivity of various harmonic orders to different internal phase-shifting step counts. Figure 14c compares the 3D reconstruction results obtained by the traditional three-step phase-shifting method and the proposed layered phase-shifting method with 3 external and 12 internal steps.

Despite the continuous progress of traditional FPP methodologies—including innovations in phase extraction, unwrapping, and calibration—these approaches remain constrained by hardware limitations, sensitivity to ambient noise, and reduced robustness in low-contrast or large-depth-of-field scenarios. At present, deep learning techniques have already been validated and applied in many domains [139,140,141]. They are capable of complementing or even surpassing traditional models by automatically learning robust representations from large-scale datasets. In this context, deep learning has emerged not only as a tool for improving accuracy and efficiency [142,143,144,145], but also as a transformative paradigm for addressing longstanding issues in fringe-structured-light reconstruction. The following section provides a systematic overview of how deep learning frameworks have been designed and adapted to meet these challenges.

5. Application of Deep Learning in Fringe-Structured Light

In traditional fringe-structured-light systems, measurement accuracy often faces significant challenges when dealing with objects that exhibit non-uniform surface reflectivity [146], complex geometries, or severe occlusions [147]. In recent years, deep learning techniques have been extensively validated and successfully applied across various fields, demonstrating powerful capabilities in feature extraction and nonlinear modeling [44,140,148,149,150]. Specifically, for fringe-based structured-light systems, deep learning offers novel solutions to improve measurement accuracy, accelerate reconstruction speed, and enhance system robustness. This chapter provides a detailed overview of the applications of deep learning in fringe projection-based structured-light systems. However, since PMD primarily targets specular objects and is constrained by its specific application scenarios, the use of deep learning in PMD remains limited. Existing studies are usually centered on single-shot approaches [151,152,153,154,155,156,157,158]. Therefore, this chapter mainly focuses on deep learning-driven FPP methods.

5.1. Learning Paradigm for Deep Learning-Driven FPP

Deep learning-based approaches can be categorized into two types—single-frame methods and multi-frame methods—following the classification of traditional phase-shifting [45,46] and Fourier-based algorithms [49,159]. In traditional multi-frame phase-shifting techniques, multiple fringe images are acquired to enhance the accuracy and robustness of phase recovery by leveraging temporal redundancy, As illustrated in Figure 15. Here,

I_{0}

denotes the original scanned object,

I_{1 - 1}

denotes the first step of the first frequency, and

I_{4 - 12}

denotes the 12th step of the fourth frequency. These methods have been thoroughly discussed in Section 2.1.

In the context of multi-frame fringe projection 3D reconstruction, deep learning models aim to learn the mapping between multi-frequency fringe patterns and depth information. Unlike conventional methods that explicitly extract phase information from the fringe images [45,46], deep learning approaches use neural networks to automatically establish this mapping, thus reducing the need for handcrafted feature design and enabling more efficient and accurate phase recovery. Such models are particularly beneficial in handling complex measurement scenarios, as they reduce acquisition redundancy while improving reconstruction speed and precision.

In contrast, single-frame approaches are inherently more challenging, as the deep learning model must infer phase information from only one fringe image. Traditional single-frame phase retrieval techniques rely on frequency domain analysis to extract phase [49,159]. However, deep learning-based single-frame models do not depend on explicit geometric constraints or analytical markers; instead, they utilize implicit features learned from large-scale datasets to recover phase information. By capturing local phase distributions and the inherent structure of the fringe image, these models can robustly estimate phase even under challenging lighting conditions, such as shadows and occlusions.

In summary, the integration of deep learning into fringe projection-based 3D reconstruction has led to significant performance improvements for both single-frame and multi-frame scenarios. Whether enhancing traditional multi-frame phase-shifting methods or addressing the complexities of single-frame phase recovery, deep learning models enable more efficient, robust, and accurate solutions for phase retrieval and depth estimation.

5.2. Deep Learning Framework Design and Advancements

Current fringe-to-phase/depth methods are primarily distinguished by three technical dimensions: network architecture, supervision strategy, and input paradigm.

5.2.1. Network Architecture Innovations

In deep learning-driven fringe-structured-light 3D reconstruction, designing an effective framework to map fringe images into phase information is of paramount importance. Current research has primarily focused on innovations in neural network architectures, particularly models tailored for fringe-to-phase regression tasks. Since this mapping is essentially a regression problem, U-Net and its variants have become the dominant approaches. By leveraging skip connections for hierarchical feature integration, U-Net effectively captures both local and global context. Comparative studies [160,161] have demonstrated that U-Net achieves higher prediction accuracy and stability than traditional CNNs and GANs. However, these benefits often come at the cost of increased computational complexity and limited cross-domain adaptability, which has motivated further architectural refinements.

Recent advances have extended the U-Net backbone with new design concepts, hybrid strategies, and pretrained modules to improve accuracy, reduce training time, and enhance generalization. For example, Wang et al. [162] proposed MSUNet++, which incorporates additional nested pathways to fuse features across multiple levels, thereby enhancing representational power for complex mappings. This improvement, however, comes with longer training and inference times. Zhu et al. [163] developed PCTNet, a CNN–Transformer hybrid network that combines local texture extraction with global context modeling. Recognized as a state-of-the-art (SOTA) method in 2023, PCTNet achieved a

43.62 %

reduction in RMSE compared with U-Net, highlighting the advantages of hybrid architectures and further advancing research in the field.

Another promising direction is the integration of pretrained models. Li et al. [164] and Cai [165] introduced pretrained ResNet and Vision Transformer initializations into U-Net variants, both of which outperformed the conventional U-Net. Notably, Cai et al. employed pretrained Vision Transformers to extract semantically rich contour features and coarse depth cues, reducing MAE by

65 %

. These results indicate that pretrained models not only accelerate convergence but also substantially improve data efficiency, which is especially valuable in FPP systems where dataset sizes are typically limited.

Overall, existing evidence suggests that hybrid architectures combining the local feature extraction capabilities of CNNs with the global context modeling strength of Transformers deliver the most balanced performance. Although such models generally incur higher computational costs, they are particularly effective in handling complex scenarios where robustness and data efficiency are crucial. Consequently, pretrained visual models have emerged as an important tool for improving the performance of fringe-structured-light 3D reconstruction under data-constrained conditions.

Architectural innovations have thus laid a solid foundation for FPP and delivered significant improvements in accuracy and robustness. Nevertheless, relying solely on architectural advances remains insufficient to fully overcome the bottlenecks caused by limited samples and restricted input information. In recent years, researchers have begun to explore more sophisticated supervision strategies, introducing multi-level supervisory signals or incorporating physical priors during training to further enhance generalization and stability. The next section will focus on the latest developments in these supervision mechanisms.

5.2.2. Supervision Strategies

Although architectural innovations have improved baseline performance, conventional end-to-end learning still faces difficulties when addressing the inherent challenges of FPP, such as limited input information and small-scale datasets. As shown in Figure 16b,c, recent studies have increasingly incorporated physical priors in combination with tailored supervision strategies, which have proven to be effective in overcoming these bottlenecks and enhancing model performance.

Representative of these efforts is deep supervision, which introduces supervisory signals at multiple levels. This approach not only alleviates shortcut learning but also regularizes hierarchical feature representations, thereby significantly improving model generalization and enabling more robust reconstruction in complex scenarios [166,167,168]. For instance, Nguyen et al. [169] implemented multi-level supervision in the hNet architecture by injecting supervisory signals at each decoding stage, which markedly improved feature learning and consistently outperformed the conventional U-Net across most applications. Inspired by MFTPU, Li et al. [164] proposed the DSAS architecture, which applies joint supervision between sub-high-frequency absolute phase and high-frequency wrapped phase. Compared with standard end-to-end methods, DSAS reduced mean absolute error (MAE) in absolute phase reconstruction by

34 %

. Extending this idea, Zhu et al. [170] proposed a triple-supervision mechanism, which added an additional supervisory branch beyond the dual-branch design. This method lowered mean squared error (MSE) by

52 %

relative to end-to-end learning, further pushing the performance boundary.

Another line of research is branch-wise supervision, which combines network predictions with physical equations. Unlike deep supervision, branch-wise methods usually require post-processing with traditional physical models at the final stage. A typical approach is to predict both the wrapped phase (or its equivalent representation) and a coarse absolute phase, which is then refined using physical equations and rounding operations to compensate for small errors and improve point cloud quality [171,172,173]. Recently, such methods have increasingly emphasized explicit integration of FPP physical principles. For example, Jiang et al. [174] proposed a “1-to-6” architecture capable of predicting three pairs of numerators and denominators. It is important to note, however, that although output designs vary, directly predicting fringe order should be avoided, as fringe order is a discrete variable and thus not well suited for regression-based deep learning models.

Overall, supervision strategies play a crucial role in enhancing the robustness and accuracy of fringe-structured-light 3D reconstruction. Deep supervision improves generalization through multi-level regularization, while branch-wise supervision tightly integrates physical priors with network predictions, further improving point cloud quality. Although these approaches differ in mechanisms and computational costs, both demonstrate strong potential to overcome the limitations of conventional end-to-end learning. It is also worth noting that the effectiveness of supervision strategies largely depends on the design of input features. Therefore, the next section will focus on the evolution of input paradigms and their impact on network performance.

5.2.3. Input Design

In FPP systems, the design of input features is of critical importance, as certain features cannot be efficiently learned by neural networks in an automatic manner. In recent years, researchers have sought to overcome this limitation through innovations in input engineering. Nguyen et al. [160] compared various fringe patterns, including speckle patterns, high-frequency fringes, low-frequency fringes, and natural images, and found that high-frequency fringes delivered the best reconstruction performance. However, in traditional Fourier transform-based methods, key parameters that strongly influence accuracy—such as fringe patterns and projection angles—have not yet been fully optimized.

Among different input designs, composite approaches such as color-composite fringes and frequency-composite fringes have demonstrated promising performance. For example, Wang et al. [162] incorporated discrete wavelet transform (DWT) components into the input features and showed that this improved RMSE accuracy by

4 %

across the entire test dataset. Li et al. [173] employed composite three-frequency fringes as input and simultaneously predicted three intermediate components: unwrapped phase, numerator, and denominator. Their proposed CDLP method outperformed traditional FT approaches, although comparisons with other benchmarks such as sinusoidal fringes remain insufficient. Zhu et al. [175] introduced the SCFPP method, which surpassed CDLP and DCFPP on their self-collected dataset, improving MAE by

20.4 %

. It is worth noting that while color-composite fringes show potential, their application in measuring colored objects still suffers from limitations and has therefore not been widely adopted.

In summary, further optimization of input feature design remains a key factor for improving both the accuracy and robustness of FPP. The performance variations observed across different fringe patterns and composite approaches not only highlight the importance of input engineering but also point to future research directions. In particular, greater emphasis should be placed on fully exploiting multi-frequency fringe information and integrating multimodal fringe features to further enhance reconstruction accuracy and adaptability.

5.3. Evaluation Metrics

In deep learning-driven FPP, designing appropriate evaluation metrics is crucial to ensure the effectiveness and robustness of models in real-world applications. To comprehensively assess model performance, a multi-dimensional evaluation framework that integrates both visualization and quantitative metrics is needed to systematically analyze network behavior across diverse scenarios.

Visual analysis plays a key role in identifying systematic errors and failure modes. In FPP tasks, effective visualization not only intuitively reflects the network’s performance under varying conditions but also reveals limitations in handling specific challenges. However, most existing studies only present phase or point cloud errors in limited scenarios, which cannot fully reflect global performance. Therefore, scene-specific fine-grained testing is recommended to highlight artifacts more clearly and reduce dataset bias.

Figure 17 comprehensively illustrates the evaluation framework of our method, which is organized into four complementary perspectives: quantitative scene evaluation, standard object validation, generalization testing on industrial materials, and unified metric reporting. Per-pixel error heatmaps, as shown in Figure 17a, are a commonly used visualization method. We present quantitative results across diverse test scenes, including multi-object scenarios, isolated targets, low-light conditions, complex textures, and single objects. The left column shows representative fringe or intensity images, while the right column displays average phase error maps (in radians). These heatmaps can clearly present the local error distribution in phase or depth reconstruction, helping to uncover systematic biases. For instance, a model trained on specific objects may generalize poorly to new objects with different textures or reflectance. Taking the FP672 dataset [160], as an example, its data mainly originate from a single statue under uniform lighting conditions. As such, it is insufficient to evaluate the network’s robustness under complex objects and varying illumination. Therefore, it is recommended to include industrially common materials such as metals during the testing phase to more effectively challenge and assess the model’s generalization capabilities.

In addition, non-uniform surface reflectance remains a major challenge, often leading to errors in fringe order prediction. Complex reflective surfaces may hinder the network’s ability to accurately infer fringe orders. To address this issue, it is suggested to augment testing with surfaces that exhibit strong reflectivity, enabling a more realistic assessment of model robustness. In Figure 17c, we validate the model’s generalization capabilities on industrially relevant materials, especially metallic and highly reflective objects. The absolute depth-error maps (unit: mm) highlight the reconstruction challenges posed by strong reflections, while the reported RMSE values under each case quantitatively measure the model’s accuracy in such non-ideal scenarios.

Beyond static scene evaluations, dynamic scene evaluation has gained increasing attention. Temporal consistency visualization offers a valuable extension to traditional static metrics, especially in revealing motion artifacts and the impact of temporal variations [176,177]. Dynamic evaluation is critical for validating model stability and consistency under continuously changing environments, making it particularly applicable to long-term industrial deployments.

To establish a robustness benchmark that more closely reflects real-world scenarios, it is recommended to introduce variable conditions such as background light intensity and changes in ambient illumination. These variations simulate common real-world disturbances (e.g., lighting fluctuations, object occlusion) and enable systematic assessment of the model’s adaptability under non-ideal conditions.

Quantitative metrics provide an objective basis for standardized model performance comparison. However, existing studies lack consensus on metric selection, making direct comparisons across methods difficult. Currently, mean absolute error (MAE) and root mean square error (RMSE) are the most widely adopted basic metrics. In recent years, Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), and

3 σ

deviation have also been introduced to characterize prediction errors from different dimensions. These metrics reflect mean error, variance, structural fidelity, and signal-to-noise characteristics, respectively, as shown in Figure 17d. Meanwhile, standard geometric objects, such as spheres and planes, are particularly effective for evaluating the accuracy of the predicted results, as illustrated in Figure 17b. Also, to address the lack of standardization, we propose a unified set of integrated metrics, encompassing phase- and depth-level accuracy (MAE, RMSE, PSNR, SSIM), point-cloud-level fidelity (e.g., Hausdorff distance), and network efficiency (e.g., parameter count, FLOPs, inference time). Therefore, reporting such a comprehensive set of metrics is recommended to ensure more robust and holistic performance evaluation across different approaches.

In FPP systems, quantitative evaluation of point clouds has long been relatively weak. As prediction accuracy improves, relying solely on visual artifacts is no longer sufficient to comprehensively reflect model performance. Thus, it is suggested to incorporate point cloud-specific evaluation metrics such as the Hausdorff distance [178] and Iterative Closest Point (ICP) registration error [179], which accurately measure geometric deviations between predicted and ground-truth point clouds. Moreover, as shown in Figure 17b, standard objects (e.g., spheres and planes) are widely used for systematic comparison across methods due to their geometric simplicity and clearly quantifiable errors, making them valuable benchmarks.

Finally, when reporting overall performance, in addition to traditional accuracy metrics, efficiency-related metrics such as the number of network parameters (Parameters), floating-point operations per second (FLOPs), training time, and inference time should also be included. It is worth noting that parameter count and FLOPs do not always directly correlate with inference speed. Therefore, it is recommended to comprehensively report all relevant metrics to provide a more complete basis for evaluating the trade-offs between accuracy and efficiency.

6. Challenges and Perspectives

In the development of fringe-structured-light systems, addressing the challenges of reconstruction under complex environments and dynamic scenes has always been a central research focus. In recent years, as application demands continue to expand, fringe-structured-light systems are required not only to maintain measurement accuracy in high dynamic range (HDR) scenarios but also to achieve stable reconstruction under conditions such as limited depth of field and rapid target motion [179,180,181,182]. Meanwhile, real-time monitoring capabilities have become increasingly essential in fields like intelligent manufacturing and industrial inspection. As deep learning emerges as a key tool for enhancing system performance, issues related to its interpretability and transferability have also attracted growing attention.

6.1. HDR Issues

In complex surface 3D measurement tasks, high dynamic range (HDR) imaging poses a significant challenge. Highly reflective regions are prone to image saturation, while low-reflectivity areas may suffer from low signal-to-noise ratios (SNR), leading to unstable phase estimation and, consequently, degraded 3D reconstruction accuracy. Traditional FPP systems often struggle to achieve ideal imaging quality across all regions under such conditions using a single fixed exposure setting [183].

In recent years, deep learning has offered new solutions for 3D measurement under HDR conditions. Zhang et al. were the first to introduce deep neural networks into HDR 3D reconstruction, using the results of a 12-step phase-shifting method as supervision to train a 3-step model, thereby increasing the dynamic range by 4.8 times [184]. However, the 12-step images still suffer from saturation in highly reflective regions, limiting their validity as ground truth. Subsequent studies have shown that deep models can learn the mapping between fringe patterns and phase, significantly reducing the number of required projections and enabling fast reconstruction under HDR conditions [184,185,186,187].

Nevertheless, deep learning models are highly sensitive to the distribution of training data, and publicly available HDR 3D reconstruction datasets remain extremely scarce. For example, Y-FFCNet, trained on simulated data, achieved separation of specular and diffuse reflections in highly reflective regions, significantly improving reconstruction performance for metallic objects [188]. However, accurate decoupling of reflection components remains challenging, and the generalization gap caused by synthetic data has yet to be resolved. Liu et al. proposed the SP-CAN method, which simulates a multi-exposure process using a neural network to enhance feature reconstruction in HDR regions [189]. However, its reliance on low-exposure fringe images may lead to insufficient feature representation, and the optimal exposure time still requires manual tuning.

In summary, future research should focus on building HDR 3D measurement datasets that better reflect real-world scenarios to improve model robustness to illumination and reflectivity variations. Moreover, the development of self-supervised or weakly supervised learning strategies can help reduce reliance on high-quality ground truth data. Lastly, designing network architectures with improved interpretability and generalizability will be essential to promote the practical deployment of HDR 3D reconstruction technologies.

6.2. Extended Depth of Field

In structured-light 3D measurement systems, extending the depth of field (DOF) is a key research direction for enhancing system adaptability and reconstruction accuracy. In practical scenarios, variations in object height often exceed the system’s focal range, leading to severely blurred regions that cause phase errors and reconstruction deviations—commonly referred to as the “local blur problem” [190].

To tackle the challenges posed by limited DOF, traditional approaches have proposed a variety of strategies to improve robustness. For instance, Drouin et al. [191] introduced an iterative deconvolution-based pattern segmentation method that enhances image sharpness to detect blurred edges. In their subsequent work [192], they estimated spatially varying point spread functions (PSFs) by projecting dot patterns within a calibrated measurement volume, enabling more accurate modeling and compensation of blur effects. Chen et al. [193] proposed a technique that combines polarization with high-frequency fringe patterns to reduce errors caused by subsurface scattering, and further developed the Modulated PS method [194], which enables 3D reconstruction without explicitly separating direct and indirect light components. Additional strategies, such as MicroPS [195], unstructured-light techniques [196], and embedded phase-shifting methods [197], aim to suppress the influence of indirect light through high-frequency encoding, thereby enhancing the system’s ability to manage blurred regions.

While these methods have mitigated defocus-related issues to some extent, they largely remain within the realm of traditional image processing paradigms, relying heavily on pattern design and physical modeling. They have yet to fully exploit the powerful feature extraction and nonlinear modeling capabilities offered by deep learning. Future research should explore the integration of deep neural networks to build end-to-end frameworks for blur region detection and error correction. In particular, a jointly optimized approach that combines high-frequency pattern design, PSF estimation in blurred areas, and end-to-end phase error correction could substantially improve the robustness and accuracy of structured-light systems under conditions of large depth-of-field and complex surface geometries.

6.3. High-Speed Deployment and Real-Time Reconstruction

In high-speed dynamic or transient measurement scenarios, the performance of FPP is constrained by the refresh rates of projection and acquisition hardware, as well as the computational efficiency of reconstruction algorithms [198]. These limitations significantly hinder its ability to meet the demands of real-time 3D reconstruction tasks that require high speed, high accuracy, and low latency. Traditional FPP methods typically rely on capturing multiple 8-bit sinusoidal fringe patterns to extract absolute phase information. However, the system’s frame rate is often limited by the flipping speed of digital micromirror devices and the camera’s exposure time, making high-frame-rate operation difficult to achieve [199,200].

To overcome these constraints, researchers have proposed the binary defocusing technique, which generates quasi-sinusoidal fringe patterns by projecting slightly defocused 1-bit binary images [201,202,203]. This approach fully leverages the high-speed switching capability of DMDs, enabling fringe projection rates in the kilohertz range. Moreover, by reducing the imaging window, high-speed cameras can achieve acquisition rates of up to 100,000 frames per second. When combined with deep learning methods—such as image super-resolution and single-frame phase decoding—this enables ultra-fast single-frame 3D imaging, as demonstrated in the SSSR-FPP method [204]. This line of research indicates that deep learning has great potential to significantly accelerate imaging speed without compromising measurement accuracy.

Despite these promising advances, deep learning-integrated FPP systems still face several challenges in practical deployment. Future research should focus on lightweight network architecture design, platform-aware optimization strategies, and multi-task end-to-end integration. Additionally, system-level co-design of hardware and software will be essential for developing real-time 3D reconstruction systems that are high in accuracy, low in latency, and energy-efficient for real-world applications.

6.4. Transferability, Generalization, and Interpretability of Deep Learning Methods

Although traditional structured-light systems offer strong customization and high measurement accuracy, they often rely on fixed configuration parameters, limiting their adaptability across different devices and environments [205,206]. This limitation has driven researchers to explore transfer learning strategies [207] to bridge the gap between simulation and reality and to achieve cross-configuration generalization. Currently, such methods are primarily applied in speckle-based structured-light systems or downstream tasks like eye tracking [208], while their application in line-structured-light 3D measurement remains underexplored.

In recent years, the emergence of large-scale models has significantly improved generalization and transfer capabilities across various fields, providing new opportunities for the intelligent development of FPP systems. By introducing foundation models or developing domain-adaptive fine-tuning mechanisms tailored to FPP data characteristics, future systems are expected to exhibit enhanced robustness and accuracy in unseen scenarios while reducing the need for repeated task-specific training, thereby enabling more efficient cross-task adaptation.

At the same time, deep learning has reshaped the development landscape of single-frame FPP systems, often surpassing traditional methods in terms of speed, accuracy, and robustness—particularly in dynamic scenes and complex surfaces. However, the physical mechanisms underlying these advantages remain poorly understood, and deep networks are still largely treated as “black boxes.” As a result, improving the interpretability of deep models has become a research priority. Some studies have explored methods such as feature map visualization [209] to reveal how networks extract and process complex fringe patterns, aiming to shift from “black box” to “gray box” modeling and improve transparency. Nevertheless, systematic investigations into interpretability methods within FPP systems remain limited, and their impact on model reliability, tunability, and generalization performance still demands further exploration.

Future research should focus on building diverse, high-quality cross-domain FPP datasets and introducing few-shot learning and domain adaptation strategies to improve transferability and generalization. In parallel, efforts to enhance model interpretability—such as visualizing internal features—are essential for uncovering model mechanisms and improving system transparency, reliability, and controllability.

7. Conclusions

As application demands continue to grow, structured-light 3D reconstruction systems are evolving toward higher precision, greater portability, enhanced intelligence, and stronger robustness. This paper provides a systematic comparison between the two mainstream methods, namely FPP and PMD, from the perspective of system architecture and fundamental principles. It highlights their differences and complementary advantages in terms of measurement mechanisms, applicable surface types, modeling strategies, and error control approaches. At the same time, MEMS-based micromirror scanning technology is becoming a promising direction for next-generation structured-light systems because of its lens-free configuration, large depth of field, compact structure, and high-speed operation. For system calibration, unified models, isophase surface models, and phase-angle models specifically developed for MEMS systems provide effective tools for modeling nonlinear optical paths. To mitigate issues such as projection errors, light source fluctuations, and high-order harmonic distortions, researchers have introduced various compensation strategies at both the hardware and algorithmic levels, significantly improving the robustness of the overall system.

The integration of deep learning has introduced a paradigm shift in structured-light measurement. Whether for single-frame reconstruction or multi-frame nonlinear mapping, deep neural networks consistently outperform traditional algorithms. Advances in network architecture, incorporation of physical priors, input feature engineering, and evaluation metric design have opened new paths for accurate reconstruction under complex scenes.

Looking ahead, structured-light 3D reconstruction still faces several challenges and opportunities. At the hardware level, improvements are required in the precision, power stability, and cost efficiency of MEMS projectors. At the modeling level, it is important to integrate geometric priors, optical imperfections, and learning-based approaches to enhance system adaptability across different platforms and complex environments. At the intelligence level, further exploration of deep learning techniques is needed in areas such as few-shot learning, weak supervision, and multimodal fusion, with a particular focus on developing end-to-end models that incorporate physical constraints.

Moreover, to facilitate the practical deployment of structured-light systems in industrial and service settings, building generalized and portable evaluation datasets and performance metrics will be a crucial step. In summary, structured-light 3D reconstruction is undergoing a pivotal transformation through the deep integration of traditional methods and intelligent technologies, and is expected to play an increasingly important role in fields such as precision manufacturing, soft robotics, cultural preservation, and intelligent interaction. We believe this review can serve as a valuable reference for researchers and engineers, providing both a clear understanding of current advances and a forward-looking perspective on future development in the field.

Author Contributions

Conceptualization and Investigation, Z.Z., H.W., Y.L. and Z.L.; Writing—Original Draft, Z.Z., H.W., Y.L. and Z.L.; Writing—Review and Editing, Z.Z., H.W., Y.L. and Z.L.; Supervision, W.G., X.W., C.Z., X.L. (Xiaojun Liang) and X.L. (Xinghui Li); Project Administration, X.L. (Xiaojun Liang) and X.L. (Xinghui Li); Funding Acquisition, X.L. (Xiaojun Liang) and X.L. (Xinghui Li). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Shenzhen Science and Technology Program (Grant JCYJ20240813112003005) and the Major Key Project of Pengcheng Laboratory (Grant PCL2025A03-2).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The authors would like to express their sincere gratitude to all individuals and organizations who supported this research.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

FPP	Fringe Projection Profilometry
PMD	Phase Measuring Deflectometry
DLP	Digital Light Processing
TPU	Temporal-Phase Unwrapping
SPU	Spatial-Phase Unwrapping
LCD	Liquid-Crystal Display
DMD	Digital Micromirror Device
MEMS	Micro-Electro-Mechanical Systems
CMM	Coordinate Measuring Machines
SOTA	State-of-the-Art
CNNs	Convolutional Neural Networks
MAE	Mean Absolute Error
MSE	Mean Squared Error
RMSE	Root Mean Square Error
PSNR	Peak Signal-to-Noise Ratio
SSIM	Structural Similarity Index Measure
HDR	High Dynamic Range
SNR	Signal-to-Noise Ratios
DOF	Depth of Field
PSF	Point Spread Function

References

Salvi, J.; Fernandez, S.; Pribanic, T.; Llado, X. A state of the art in structured light patterns for surface profilometry. Pattern Recognit. 2010, 43, 2666–2680. [Google Scholar] [CrossRef]
Van der Jeught, S.; Dirckx, J.J. Real-time structured light profilometry: A review. Opt. Lasers Eng. 2016, 87, 18–31. [Google Scholar] [CrossRef]
Lv, S.; Kemao, Q. Modeling the measurement precision of fringe projection profilometry. Light. Sci. Appl. 2023, 12, 257. [Google Scholar] [CrossRef]
Zhou, Q.; Qiao, X.; Ni, K.; Li, X.; Wang, X. Depth detection in interactive projection system based on one-shot black-and-white stripe pattern. Opt. Express 2017, 25, 5341–5351. [Google Scholar] [CrossRef]
Han, M.; Xing, Y.; Wang, X.; Li, X. Projection superimposition for the generation of high-resolution digital grating. Opt. Lett. 2024, 49, 4473–4476. [Google Scholar] [CrossRef] [PubMed]
Juarez-Salazar, R.; Esquivel-Hernandez, S.; Diaz-Ramirez, V.H. Optical Fringe Projection: A Straightforward Approach to 3D Metrology. Metrology 2025, 5, 47. [Google Scholar] [CrossRef]
Gao, W.; Kim, S.W.; Bosse, H.; Haitjema, H.; Chen, Y.; Lu, X.; Knapp, W.; Weckenmann, A.; Estler, W.; Kunzmann, H. Measurement technologies for precision positioning. CIRP Ann. 2015, 64, 773–796. [Google Scholar] [CrossRef]
Li, X.; Shimizu, Y.; Ito, T.; Cai, Y.; Ito, S.; Gao, W. Measurement of six-degree-of-freedom planar motions by using a multiprobe surface encoder. Opt. Eng. 2014, 53, 122405. [Google Scholar] [CrossRef]
Wu, J.; Hong, Y.; Shin, D.W.; Sato, R.; Quan, L.; Matsukuma, H.; Gao, W. On-machine calibration of pitch deviations of a linear scale grating by using a differential angle sensor. Int. J. Autom. Technol. 2024, 18, 4–10. [Google Scholar] [CrossRef]
Gao, W.; Kim, S.; Bosse, H.; Minoshima, K. Dimensional metrology based on ultrashort pulse laser and optical frequency comb. CIRP Ann. 2025, 74, 993–1018. [Google Scholar] [CrossRef]
Ding, D.; Ding, W.; Huang, R.; Fu, Y.; Xu, F. Research progress of laser triangulation on-machine measurement technology for complex surface: A review. Measurement 2023, 216, 113001. [Google Scholar] [CrossRef]
Chen, R.; Li, Y.; Xue, G.; Tao, Y.; Li, X. Laser triangulation measurement system with Scheimpflug calibration based on the Monte Carlo optimization strategy. Opt. Express 2022, 30, 25290–25307. [Google Scholar] [CrossRef]
Boesl, U. Time-of-flight mass spectrometry: Introduction to the basics. Mass Spectrom. Rev. 2017, 36, 86–109. [Google Scholar] [CrossRef]
Hansard, M.; Lee, S.; Choi, O.; Horaud, R.P. Time-of-Flight Cameras: Principles, Methods and Applications; Springer Science & Business Media: New York, NY, USA, 2012. [Google Scholar]
Ma, R.; Li, C.; Xing, Y.; Wang, S.; Ma, R.; Feng, F.; Qian, X.; Wang, X.; Li, X. Defect focused Harris3D & boundary fine-tuning optimized region growing: Lithium battery pole piece defect segmentation. Measurement 2025, 242, 116147. [Google Scholar] [CrossRef]
Li, J.; Zhou, Q.; Li, X.; Chen, R.; Ni, K. An improved low-noise processing methodology combined with PCL for industry inspection based on laser line scanner. Sensors 2019, 19, 3398. [Google Scholar] [CrossRef] [PubMed]
Chen, R.; Li, X.; Wang, X.; Li, J.; Xue, G.; Zhou, Q.; Ni, K. A planar pattern based calibration method for high precision structured laser triangulation measurement. In Proceedings of the Optical Metrology and Inspection for Industrial Applications VI, Hangzhou, China, 20–23 October 2019; SPIE: Washington, DC, USA, 2019; Volume 11189, pp. 212–218. [Google Scholar] [CrossRef]
Han, M.; Wang, X.; Li, X. Fast and accurate fringe projection based on a MEMS micro-vibration mirror. In Proceedings of the Optical Metrology and Inspection for Industrial Applications XI, Nantong, China, 12–14 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13241, pp. 12–19. [Google Scholar] [CrossRef]
Almaraz-Cabral, C.C.; Gonzalez-Barbosa, J.J.; Villa, J.; Hurtado-Ramos, J.B.; Ornelas-Rodriguez, F.J.; Cordova-Esparza, D.M. Fringe projection profilometry for panoramic 3D reconstruction. Opt. Lasers Eng. 2016, 78, 106–112. [Google Scholar] [CrossRef]
Nguyen, H.; Liang, J.; Wang, Y.; Wang, Z. Accuracy assessment of fringe projection profilometry and digital image correlation techniques for three-dimensional shape measurements. J. Phys. Photonics 2021, 3, 014004. [Google Scholar] [CrossRef]
Muyshondt, P.G.; Van der Jeught, S.; Dirckx, J.J. A calibrated 3D dual-barrel otoendoscope based on fringe-projection profilometry. Opt. Lasers Eng. 2022, 149, 106795. [Google Scholar] [CrossRef]
Stavroulakis, P.; Leach, R.K. Invited review article: Review of post-process optical form metrology for industrial-grade metal additive manufactured parts. Rev. Sci. Instrum. 2016, 87, 041101. [Google Scholar] [CrossRef]
Forbes, A.; De Oliveira, M.; Dennis, M.R. Structured light. Nat. Photonics 2021, 15, 253–262. [Google Scholar] [CrossRef]
Gibelli, D.; Dolci, C.; Cappella, A.; Sforza, C. Reliability of optical devices for three-dimensional facial anatomy description: A systematic review and meta-analysis. Int. J. Oral Maxillofac. Surg. 2020, 49, 1092–1106. [Google Scholar] [CrossRef]
Antonacci, D.; Caponio, V.C.A.; Troiano, G.; Pompeo, M.G.; Gianfreda, F.; Canullo, L. Facial scanning technologies in the era of digital workflow: A systematic review and network meta-analysis. J. Prosthodont. Res. 2022, 67, 321–336. [Google Scholar] [CrossRef] [PubMed]
Zong, Y.; Duan, M.; Yu, C.; Li, J. Robust phase unwrapping algorithm for noisy and segmented phase measurements. Opt. Express 2021, 29, 24466–24485. [Google Scholar] [CrossRef] [PubMed]
Rosell-Polo, J.R.; Cheein, F.A.; Gregorio, E.; Andújar, D.; Puigdomènech, L.; Masip, J.; Escolà, A. Advances in structured light sensors applications in precision agriculture and livestock farming. Adv. Agron. 2015, 133, 71–112. [Google Scholar] [CrossRef]
Burke, J.; Pak, A.; Höfer, S.; Ziebarth, M.; Roschani, M.; Beyerer, J. Deflectometry for specular surfaces: An overview. Adv. Opt. Technol. 2023, 12, 1237687. [Google Scholar] [CrossRef]
Rolland, J.P.; Davies, M.A.; Suleski, T.J.; Evans, C.; Bauer, A.; Lambropoulos, J.C.; Falaggis, K. Freeform optics for imaging. Optica 2021, 8, 161–176. [Google Scholar] [CrossRef]
Wang, Y.; Liu, L.; Wu, J.; Chen, X.; Wang, Y. Spatial binary coding method for stripe-wise phase unwrapping. Appl. Opt. 2020, 59, 4279–4285. [Google Scholar] [CrossRef]
Yang, S.P.; Seo, Y.H.; Kim, J.B.; Kim, H.; Jeong, K.H. Optical MEMS devices for compact 3D surface imaging cameras. Micro Nano Syst. Lett. 2019, 7, 8. [Google Scholar] [CrossRef]
Han, M.; Lei, F.; Shi, W.; Lu, S.; Li, X. Uniaxial MEMS-based 3D reconstruction using pixel refinement. Opt. Express 2022, 31, 536–554. [Google Scholar] [CrossRef]
Möller, T.; Kraft, H.; Frey, J.; Albrecht, M.; Lange, R. Robust 3D Measurement with PMD Sensors; Range Imaging Day, Zürich, Switzerland; Springer Business Media: New York, NY, USA, 2005; Volume 7, p. 8. [Google Scholar]
Xu, Y.; Gao, F.; Jiang, X. A brief review of the technological advancements of phase measuring deflectometry. PhotoniX 2020, 1, 14. [Google Scholar] [CrossRef]
He, X.; Kemao, Q. A comparative study on temporal phase unwrapping methods in high-speed fringe projection profilometry. Opt. Lasers Eng. 2021, 142, 106613. [Google Scholar] [CrossRef]
Lv, S.; Tang, D.; Zhang, X.; Yang, D.; Deng, W.; Kemao, Q. Fringe projection profilometry method with high efficiency, precision, and convenience: Theoretical analysis and development. Opt. Express 2022, 30, 33515–33537. [Google Scholar] [CrossRef]
Bai, Y.; Zhang, Z.; Fu, S.; Zhao, H.; Ni, Y.; Gao, N.; Meng, Z.; Yang, Z.; Zhang, G.; Yin, W. Recent progress of full-field three-dimensional shape measurement based on phase information. Nanomanuf. Metrol. 2024, 7, 9. [Google Scholar] [CrossRef]
Kulkarni, R.; Rastogi, P. Fringe denoising algorithms: A review. Opt. Lasers Eng. 2020, 135, 106190. [Google Scholar] [CrossRef]
Liu, H.; Yan, N.; Shao, B.; Yuan, S.; Zhang, X. Deep learning in fringe projection: A review. Neurocomputing 2024, 581, 127493. [Google Scholar] [CrossRef]
Burnes, S.; Villa, J.; Moreno, G.; de la Rosa, I.; Alaniz, D.; González, E. Temporal fringe projection profilometry: Modified fringe-frequency range for error reduction. Opt. Lasers Eng. 2022, 149, 106788. [Google Scholar] [CrossRef]
Lei, F.; Ma, R.; Li, X. Use of phase-angle model for full-field 3d reconstruction under efficient local calibration. Sensors 2024, 24, 2581. [Google Scholar] [CrossRef] [PubMed]
Kim, J.; Lee, J.; Park, Y.H. Highly accurate three-dimensional measurement of large structures using multiple stereo vision with improved two-step calibration algorithm. Measurement 2024, 234, 114886. [Google Scholar] [CrossRef]
Zhou, W.; Jia, Y.; Fan, L.; Fan, G.; Lu, F. A MEMS-based real-time structured light 3-D measuring architecture on FPGA. J. Real-Time Image Process. 2024, 21, 98. [Google Scholar] [CrossRef]
Wang, H.; Zhang, C.; Qian, X.; Wang, X.; Gui, W.; Gao, W.; Liang, X.; Li, X. HDRSL Net for Accurate High Dynamic Range Imaging-based Structured Light 3D Reconstruction. IEEE Trans. Image Process. 2025, 34, 5486–5499. [Google Scholar] [CrossRef]
Srinivasan, V.; Liu, H.C.; Halioua, M. Automated phase-measuring profilometry of 3-D diffuse objects. Appl. Opt. 1984, 23, 3105–3108. [Google Scholar] [CrossRef]
Su, X.Y.; Von Bally, G.; Vukicevic, D. Phase-stepping grating profilometry: Utilization of intensity modulation analysis in complex objects evaluation. Opt. Commun. 1993, 98, 141–150. [Google Scholar] [CrossRef]
Ishikawa, K.; Yatabe, K.; Ikeda, Y.; Oikawa, Y.; Onuma, T.; Niwa, H.; Yoshii, M. Interferometric imaging of acoustical phenomena using high-speed polarization camera and 4-step parallel phase-shifting technique. In Proceedings of the Selected Papers from the 31st International Congress on High-Speed Imaging and Photonics, Osaka, Japan, 7–10 November 2017; SPIE: Washington, DC, USA, 2017; Volume 10328, pp. 93–99. [Google Scholar] [CrossRef]
Jaganathan, K.; Eldar, Y.C.; Hassibi, B. Phase retrieval: An overview of recent developments. In Optical Compressive Imaging; CRC Press: Boca Raton, FL, USA, 2016; pp. 279–312. [Google Scholar] [CrossRef]
Su, X.; Chen, W. Fourier transform profilometry: A review. Opt. Lasers Eng. 2001, 35, 263–284. [Google Scholar] [CrossRef]
Zhang, Z.; Jing, Z.; Wang, Z.; Kuang, D. Comparison of Fourier transform, windowed Fourier transform, and wavelet transform methods for phase calculation at discontinuities in fringe projection profilometry. Opt. Lasers Eng. 2012, 50, 1152–1160. [Google Scholar] [CrossRef]
Balasubramaniam, B.; Li, J.; Liu, L.; Li, B. 3d imaging with fringe projection for food and agricultural applications—A tutorial. Electronics 2023, 12, 859. [Google Scholar] [CrossRef]
Saldner, H.O.; Huntley, J.M. Temporal phase unwrapping: Application to surface profiling of discontinuous objects. Appl. Opt. 1997, 36, 2770–2775. [Google Scholar] [CrossRef]
Sansoni, G.; Carocci, M.; Rodella, R. Three-dimensional vision based on a combination of gray-code and phase-shift light projection: Analysis and compensation of the systematic errors. Appl. Opt. 1999, 38, 6565–6573. [Google Scholar] [CrossRef] [PubMed]
Zhong, J.; Zhang, Y. Absolute phase-measurement technique based on number theory in multifrequency grating projection profilometry. Appl. Opt. 2001, 40, 492–500. [Google Scholar] [CrossRef]
Hung, K.M.; Yamada, T. Phase unwrapping by regions using least-squares approach. Opt. Eng. 1998, 37, 2965–2970. [Google Scholar] [CrossRef]
Zebker, H.A.; Lu, Y. Phase unwrapping algorithms for radar interferometry: Residue-cut, least-squares, and synthesis algorithms. J. Opt. Soc. Am. A 1998, 15, 586–598. [Google Scholar] [CrossRef]
McKilliam, R.G.; Quinn, B.G.; Clarkson, I.V.L.; Moran, B.; Vellambi, B.N. Polynomial phase estimation by least squares phase unwrapping. IEEE Trans. Signal Process. 2014, 62, 1962–1975. [Google Scholar] [CrossRef]
Juarez-Salazar, R.; Robledo-Sanchez, C.; Guerrero-Sanchez, F. Phase-unwrapping algorithm by a rounding-least-squares approach. Opt. Eng. 2014, 53, 024102. [Google Scholar] [CrossRef]
Li, Y.; Zhang, Y.; Jia, D.; Zhang, M.; Ji, X.; Li, Y.; Wu, Y. Experimental Study on the Reconstruction of a Light Field through a Four-Step Phase-Shift Method and Multiple Improvement Iterations of the Least Squares Method for Phase Unwrapping. Photonics 2024, 11, 716. [Google Scholar] [CrossRef]
Asundi, A.; Wensen, Z. Fast phase-unwrapping algorithm based on a gray-scale mask and flood fill. Appl. Opt. 1998, 37, 5416–5420. [Google Scholar] [CrossRef] [PubMed]
Su, X.; Chen, W. Reliability-guided phase unwrapping algorithm: A review. Opt. Lasers Eng. 2004, 42, 245–261. [Google Scholar] [CrossRef]
Goldstein, R.M.; Zebker, H.A.; Werner, C.L. Satellite radar interferometry: Two-dimensional phase unwrapping. Radio Sci. 1988, 23, 713–720. [Google Scholar] [CrossRef]
Huntley, J. Noise-immune phase unwrapping algorithm. Appl. Opt. 1989, 28, 3268–3270. [Google Scholar] [CrossRef] [PubMed]
Zheng, D.; Da, F. A novel algorithm for branch cut phase unwrapping. Opt. Lasers Eng. 2011, 49, 609–617. [Google Scholar] [CrossRef]
Gdeisat, M.A.; Burton, D.R.; Lilley, F.; Arevalillo-Herráez, M.; Ammous, M.M. Aiding phase unwrapping by increasing the number of residues in two-dimensional wrapped-phase distributions. Appl. Opt. 2015, 54, 10073–10078. [Google Scholar] [CrossRef]
Du, G.; Wang, M.; Zhou, C.; Si, S.; Li, H.; Lei, Z.; Li, Y. A simple spatial domain algorithm to increase the residues of wrapped phase maps. J. Mod. Opt. 2017, 64, 231–237. [Google Scholar] [CrossRef]
Cheng, N.J.; Su, W.H. Phase-shifting projected fringe profilometry using binary-encoded patterns. Photonics 2021, 8, 362. [Google Scholar] [CrossRef]
Xie, X.; Tian, X.; Shou, Z.; Zeng, Q.; Wang, G.; Huang, Q.; Qin, M.; Gao, X. Deep learning phase-unwrapping method based on adaptive noise evaluation. Appl. Opt. 2022, 61, 6861–6870. [Google Scholar] [CrossRef]
Yu, J.; Da, F. Absolute phase unwrapping for objects with large depth range. IEEE Trans. Instrum. Meas. 2023, 72, 1–10. [Google Scholar] [CrossRef]
Yue, M.; Wang, J.; Zhang, J.; Zhang, Y.; Tang, Y.; Feng, X. Color crosstalk correction for synchronous measurement of full-field temperature and deformation. Opt. Lasers Eng. 2022, 150, 106878. [Google Scholar] [CrossRef]
Li, Z.; Gao, N.; Meng, Z.; Zhang, Z.; Gao, F.; Jiang, X. Aided imaging phase measuring deflectometry based on concave focusing mirror. Photonics 2023, 10, 519. [Google Scholar] [CrossRef]
Wang, Y.; Xu, Y.; Zhang, Z.; Gao, F.; Jiang, X. 3D measurement of structured specular surfaces using stereo direct phase measurement deflectometry. Machines 2021, 9, 170. [Google Scholar] [CrossRef]
Ri, S.; Takimoto, T.; Xia, P.; Wang, Q.; Tsuda, H.; Ogihara, S. Accurate phase analysis of interferometric fringes by the spatiotemporal phase-shifting method. J. Opt. 2020, 22, 105703. [Google Scholar] [CrossRef]
Feng, S.; Zuo, C.; Zhang, L.; Tao, T.; Hu, Y.; Yin, W.; Qian, J.; Chen, Q. Calibration of fringe projection profilometry: A comparative review. Opt. Lasers Eng. 2021, 143, 106622. [Google Scholar] [CrossRef]
Zhang, S.; Yau, S.T. High-resolution, real-time 3D absolute coordinate measurement based on a phase-shifting method. Opt. Express 2006, 14, 2644–2649. [Google Scholar] [CrossRef]
Zhou, W.S.; Su, X.Y. A direct mapping algorithm for phase-measuring profilometry. J. Mod. Opt. 1994, 41, 89–94. [Google Scholar] [CrossRef]
Huang, L.; Chua, P.S.; Asundi, A. Least-squares calibration method for fringe projection profilometry considering camera lens distortion. Appl. Opt. 2010, 49, 1539–1548. [Google Scholar] [CrossRef]
Zhang, Z.; Ma, H.; Zhang, S.; Guo, T.; Towers, C.E.; Towers, D.P. Simple calibration of a phase-based 3D imaging system based on uneven fringe projection. Opt. Lett. 2011, 36, 627–629. [Google Scholar] [CrossRef] [PubMed]
Takeda, M.; Mutoh, K. Fourier transform profilometry for the automatic measurement of 3-D object shapes. Appl. Opt. 1983, 22, 3977–3982. [Google Scholar] [CrossRef]
Jia, P.; Kofman, J.; English, C. Comparison of linear and nonlinear calibration methods for phase-measuring profilometry. Opt. Eng. 2007, 46, 043601. [Google Scholar] [CrossRef]
Guo, H.; He, H.; Yu, Y.; Chen, M. Least-squares calibration method for fringe projection profilometry. Opt. Eng. 2005, 44, 033603. [Google Scholar] [CrossRef]
Zhang, Z. A Flexible New Technique for Camera Calibration. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1330–1334. Available online: https://api.semanticscholar.org/CorpusID:1150626 (accessed on 3 October 2025). [CrossRef]
Kafri, O.; Glatt, I. Moiré deflectometry: A ray deflection approach to optical testing. Opt. Eng. 1985, 24, 944–960. [Google Scholar] [CrossRef]
Servin, M.; Rodriguez-Vera, R.; Carpio, M.; Morales, A. Automatic fringe detection algorithm used for moiré deflectometry. Appl. Opt. 1990, 29, 3266–3270. [Google Scholar] [CrossRef]
Wang, B.; Luo, X.; Pfeifer, T.; Mischo, H. Moire deflectometry based on Fourier-transform analysis. Measurement 1999, 25, 249–253. [Google Scholar] [CrossRef]
Legarda-Saenz, R. Robust wavefront estimation using multiple directional derivatives in moiré deflectometry. Opt. Lasers Eng. 2007, 45, 915–921. [Google Scholar] [CrossRef]
Lee, H.J.; Kim, S.W. Precision profile measurement of aspheric surfaces by improved Ronchi test. Opt. Eng. 1999, 38, 1041–1047. [Google Scholar] [CrossRef]
Butel, G.P.; Smith, G.A.; Burge, J.H. Binary pattern deflectometry. Appl. Opt. 2014, 53, 923–930. [Google Scholar] [CrossRef]
Schulz, M.; Ehret, G.; Fitzenreiter, A. Scanning deflectometric form measurement avoiding path-dependent angle measurement errors. J. Eur. Opt. Soc. Rapid Publ. 2010, 5, 10026. Available online: https://api.semanticscholar.org/CorpusID:54037587 (accessed on 1 October 2025). [CrossRef]
Hao, Q.; Zhu, Q.; Wang, Y. Deflectometer with synthetically generated reference circle for aspheric surface testing. Opt. Laser Technol. 2005, 37, 375–380. [Google Scholar] [CrossRef]
van Amstel, W.D.; Baumer, S.M.; Horijon, J.L. Optical figure testing by scanning deflectometry. In Proceedings of the Optical Manufacturing and Testing III, Berlin, Germany, 26–28 May 1999; SPIE: Washington, DC, USA, 1999; Volume 3782, pp. 320–327. [Google Scholar] [CrossRef]
Miks, A.; Novak, J.; Novak, P. Method for reconstruction of shape of specular surfaces using scanning beam deflectometry. Opt. Lasers Eng. 2013, 51, 867–872. [Google Scholar] [CrossRef]
Huang, L.; Idir, M.; Zuo, C.; Asundi, A. Review of phase measuring deflectometry. Opt. Lasers Eng. 2018, 107, 247–257. [Google Scholar] [CrossRef]
Häusler, G.; Richter, C.; Leitz, K.H.; Knauer, M.C. Microdeflectometry—A novel tool to acquire three-dimensional microtopography with nanometer height resolution. Opt. Lett. 2008, 33, 396–398. [Google Scholar] [CrossRef]
Liu, Y.; Lehtonen, P.; Su, X. High-accuracy measurement for small scale specular objects based on PMD with illuminated film. Opt. Laser Technol. 2012, 44, 459–462. [Google Scholar] [CrossRef]
Huang, L.; Ng, C.S.; Asundi, A.K. Dynamic three-dimensional sensing for specular surface with monoscopic fringe reflectometry. Opt. Express 2011, 19, 12809–12814. [Google Scholar] [CrossRef] [PubMed]
Li, W.; Sandner, M.; Gesierich, A.; Burke, J. Absolute optical surface measurement with deflectometry. In Proceedings of the Interferometry XVI: Applications, San Diego, CA, USA, 13–15 August 2012; SPIE: Washington, DC, USA, 2012; Volume 8494, pp. 129–135. [Google Scholar] [CrossRef]
Bothe, T.; Li, W.; von Kopylow, C.; Juptner, W.P. High-resolution 3D shape measurement on specular surfaces by fringe reflection. In Proceedings of the Optical Metrology in Production Engineering, Strasbourg, France, 26–30 April 2004; SPIE: Washington, DC, USA, 2004; Volume 5457, pp. 411–422. Available online: https://ui.adsabs.harvard.edu/link_gateway/2004SPIE.5457..411B/doi:10.1117/12.545987 (accessed on 1 October 2025).
Su, P.; Parks, R.; Angel, R.; Wang, L.; Burge, J. A new test for optical surfaces. Spie Newsroom 2011, 20. [Google Scholar] [CrossRef]
Xu, X.; Zhang, X.; Niu, Z.; Wang, W.; Zhu, Y.; Xu, M. Self-calibration of in situ monoscopic deflectometric measurement in precision optical manufacturing. Opt. Express 2019, 27, 7523–7536. [Google Scholar] [CrossRef] [PubMed]
Xu, X.; Zhang, X.; Niu, Z.; Wang, W.; Xu, M. Extra-detection-free monoscopic deflectometry for the in situ measurement of freeform specular surfaces. Opt. Lett. 2019, 44, 4271–4274. [Google Scholar] [CrossRef] [PubMed]
Tang, Y.; Su, X.; Liu, Y.; Jing, H. 3D shape measurement of the aspheric mirror by advanced phase measuring deflectometry. Opt. Express 2008, 16, 15090–15096. [Google Scholar] [CrossRef] [PubMed]
Petz, M.; Tutsch, R. Measurement of optically effective surfaces by imaging of gratings. In Proceedings of the Optical Measurement Systems for Industrial Inspection III, Munich, Germany, 23–26 June 2003; SPIE: Washington, DC, USA, 2003; Volume 5144, pp. 288–294. [Google Scholar] [CrossRef]
Guo, H.; Feng, P.; Tao, T. Specular surface measurement by using least squares light tracking technique. Opt. Lasers Eng. 2010, 48, 166–171. [Google Scholar] [CrossRef]
Li, C.; Li, Y.; Xiao, Y.; Zhang, X.; Tu, D. Phase measurement deflectometry with refraction model and its calibration. Opt. Express 2018, 26, 33510–33522. [Google Scholar] [CrossRef]
Ren, H.; Gao, F.; Jiang, X. Iterative optimization calibration method for stereo deflectometry. Opt. Express 2015, 23, 22060–22068. [Google Scholar] [CrossRef]
Xu, Y.; Gao, F.; Zhang, Z.; Jiang, X. A holistic calibration method with iterative distortion compensation for stereo deflectometry. Opt. Lasers Eng. 2018, 106, 111–118. [Google Scholar] [CrossRef]
Han, M.; Zhang, C.; Zhang, Z.; Li, X. Review of MEMS vibration-mirror-based 3D reconstruction of structured light. Opt. Precis. Eng. 2025, 33, 1065–1090. [Google Scholar] [CrossRef]
Yang, T.; Gu, F. Overview of modulation techniques for spatially structured-light 3D imaging. Opt. Laser Technol. 2024, 169, 110037. [Google Scholar] [CrossRef]
Zhang, Q.; Su, X. Research progress of dynamic three-dimensional shape measurement. Laser Optoelectron. Prog. 2013, 50, 4–17. [Google Scholar] [CrossRef]
Zhang, Z. Review of single-shot 3D shape measurement by phase calculation-based fringe projection techniques. Opt. Lasers Eng. 2012, 50, 1097–1106. [Google Scholar] [CrossRef]
Yang, D.; Qiao, D.; Xia, C. Curved light surface model for calibration of a structured light 3D modeling system based on striped patterns. Opt. Express 2020, 28, 33240–33253. [Google Scholar] [CrossRef]
Yang, S.; Yang, T.; Wu, G.; Wu, Y.; Liu, F. Flexible and fast calibration method for uni-directional multi-line structured light system. Opt. Lasers Eng. 2023, 164, 107525. [Google Scholar] [CrossRef]
Zhang, S. Flexible and high-accuracy method for uni-directional structured light system calibration. Opt. Lasers Eng. 2021, 143, 106637. [Google Scholar] [CrossRef]
Yang, Y.; Miao, Y.; Liu, X.; Pedrini, G.; Tang, Q.; Osten, W.; Peng, X. Intrinsic parameter-free calibration of FPP using a ray phase mapping model. Opt. Lett. 2022, 47, 3564–3567. [Google Scholar] [CrossRef] [PubMed]
Lei, F.; Han, M.; Jiang, H.; Wang, X.; Li, X. A phase-angle inspired calibration strategy based on MEMS projector for 3D reconstruction with markedly reduced calibration images and parameters. Opt. Lasers Eng. 2024, 176, 108078. [Google Scholar] [CrossRef]
Li, Y.; Wu, Z.; Zhang, Q. Phase Error Compensation Technique Based on Phase-Shifting Fringe Analysis: A Review. Laser Optoelectron. Prog. 2024, 61, 0211008. [Google Scholar] [CrossRef]
Cai, Z.; Liu, X.; Jiang, H.; He, D.; Peng, X.; Huang, S.; Zhang, Z. Flexible phase error compensation based on Hilbert transform in phase shifting profilometry. Opt. Express 2015, 23, 25171–25181. [Google Scholar] [CrossRef]
Wang, Y.; Cai, J.; Zhang, D.; Chen, X.; Wang, Y. Nonlinear correction for fringe projection profilometry with shifted-phase histogram equalization. IEEE Trans. Instrum. Meas. 2022, 71, 1–9. [Google Scholar] [CrossRef]
Zhang, W.; Yu, L.; Li, W.; Xia, H.; Deng, H.; Zhang, J. Black-box phase error compensation for digital phase-shifting profilometry. IEEE Trans. Instrum. Meas. 2017, 66, 2755–2761. [Google Scholar] [CrossRef]
Wang, Y.; Xu, H.; Zhu, H.; Rao, Y.; Wang, Y. Nonlinear high-order harmonics correction for phase measuring profilometry. Opt. Laser Technol. 2024, 170, 110248. [Google Scholar] [CrossRef]
Wang, J.; Yang, Y. Triple N-step phase shift algorithm for phase error compensation in fringe projection profilometry. IEEE Trans. Instrum. Meas. 2021, 70, 1–9. [Google Scholar] [CrossRef]
Zhang, S. Comparative study on passive and active projector nonlinear gamma calibration. Appl. Opt. 2015, 54, 3834–3841. [Google Scholar] [CrossRef]
Han, M.; Jiang, H.; Lei, F.; Xing, Y.; Wang, X.; Li, X. Modeling window smoothing effect hidden in fringe projection profilometry. Measurement 2025, 242, 115852. [Google Scholar] [CrossRef]
Li, J.; Hassebrook, L.G.; Guan, C. Optimized two-frequency phase-measuring-profilometry light-sensor temporal-noise sensitivity. J. Opt. Soc. Am. A 2003, 20, 106–115. [Google Scholar] [CrossRef]
Zuo, C.; Huang, L.; Zhang, M.; Chen, Q.; Asundi, A. Temporal phase unwrapping algorithms for fringe projection profilometry: A comparative review. Opt. Lasers Eng. 2016, 85, 84–103. [Google Scholar] [CrossRef]
Yan, K.; Yu, Y.; Huang, C.; Sui, L.; Qian, K.; Asundi, A. Fringe pattern denoising based on deep learning. Opt. Commun. 2019, 437, 148–152. [Google Scholar] [CrossRef]
Zhao, Y.; Yu, H.; Bai, L.; Zheng, D.; Han, J. Accurate fringe projection profilometry using instable projection light source. Opt. Commun. 2022, 507, 127643. [Google Scholar] [CrossRef]
Liu, Q.; Wang, Y.; He, J.; Ji, F. Phase shift extraction and wavefront retrieval from interferograms with background and contrast fluctuations. J. Opt. 2015, 17, 025704. [Google Scholar] [CrossRef]
Lu, Y.; Zhang, R.; Guo, H. Correction of illumination fluctuations in phase-shifting technique by use of fringe histograms. Appl. Opt. 2015, 55, 184–197. [Google Scholar] [CrossRef]
Chen, C.; Wan, Y.; Cao, Y. Instability of projection light source and real-time phase error correction method for phase-shifting profilometry. Opt. Express 2018, 26, 4258–4270. [Google Scholar] [CrossRef]
Zheng, Z.; Gao, J.; Mo, J.; Zhang, L.; Zhang, Q. A fast self-correction method for nonlinear sinusoidal fringe images in 3-D measurement. IEEE Trans. Instrum. Meas. 2021, 70, 1–9. [Google Scholar] [CrossRef]
Wu, Z.; Guo, W.; Lu, L.; Zhang, Q. Generalized phase unwrapping method that avoids jump errors for fringe projection profilometry. Opt. Express 2021, 29, 27181–27192. [Google Scholar] [CrossRef] [PubMed]
Huang, P.S.; Hu, Q.J.; Chiang, F.P. Double three-step phase-shifting algorithm. Appl. Opt. 2002, 41, 4503–4509. [Google Scholar] [CrossRef] [PubMed]
Zhang, S.; Yau, S.T. Generic nonsinusoidal phase error correction for three-dimensional shape measurement using a digital video projector. Appl. Opt. 2006, 46, 36–43. [Google Scholar] [CrossRef]
Pan, B.; Kemao, Q.; Huang, L.; Asundi, A. Phase error analysis and compensation for nonsinusoidal waveforms in phase-shifting digital fringe projection profilometry. Opt. Lett. 2009, 34, 416–418. [Google Scholar] [CrossRef]
Song, H.; Kong, L. Mask information-based gamma correction in fringe projection profilometry. Opt. Express 2023, 31, 19478–19490. [Google Scholar] [CrossRef]
Han, M.; Shi, W.; Lu, S.; Lei, F.; Li, Y.; Wang, X.; Li, X. Internal–External Layered Phase Shifting for Phase Retrieval. IEEE Trans. Instrum. Meas. 2023, 73, 1–13. [Google Scholar] [CrossRef]
Li, K.; Zhang, Z.; Lin, J.; Sato, R.; Matsukuma, H.; Gao, W. Angle measurement based on second harmonic generation using artificial neural network. Nanomanuf. Metrol. 2023, 6, 28. [Google Scholar] [CrossRef]
Sato, R.; Li, X.; Fischer, A.; Chen, L.C.; Chen, C.; Shimomura, R.; Gao, W. Signal processing and artificial intelligence for dual-detection confocal probes. Int. J. Precis. Eng. Manuf. 2024, 25, 199–223. [Google Scholar] [CrossRef]
Gao, W.; Haitjema, H.; Fang, F.; Leach, R.; Cheung, C.; Savio, E.; Linares, J.M. On-machine and in-process surface metrology for precision manufacturing. CIRP Ann. 2019, 68, 843–866. [Google Scholar] [CrossRef]
Wang, S.; Luo, L.; Li, X. Design and parameter optimization of zero position code considering diffraction based on deep learning generative adversarial networks. Nanomanuf. Metrol. 2024, 7, 2. [Google Scholar] [CrossRef]
Li, C.; Pan, X.; Zhu, P.; Zhu, S.; Liao, C.; Tian, H.; Qian, X.; Li, X.; Wang, X.; Li, X. Style Adaptation module: Enhancing detector robustness to inter-manufacturer variability in surface defect detection. Comput. Ind. 2024, 157, 104084. [Google Scholar] [CrossRef]
Li, C.; Yan, H.; Qian, X.; Zhu, S.; Zhu, P.; Liao, C.; Tian, H.; Li, X.; Wang, X.; Li, X. A domain adaptation YOLOv5 model for industrial defect inspection. Measurement 2023, 213, 112725. [Google Scholar] [CrossRef]
Liu, C.; Zhang, C.; Liang, X.; Han, Z.; Li, Y.; Yang, C.; Gui, W.; Gao, W.; Wang, X.; Li, X. Attention Mono-depth: Attention-enhanced transformer for monocular depth estimation of volatile kiln burden surface. IEEE Trans. Circuits Syst. Video Technol. 2024, 35, 1686–1699. [Google Scholar] [CrossRef]
Li, Y.; Li, Z.; Liang, X.; Huang, H.; Qian, X.; Feng, F.; Zhang, C.; Wang, X.; Gui, W.; Li, X. Global phase accuracy enhancement of structured light system calibration and 3D reconstruction by overcoming inevitable unsatisfactory intensity modulation. Measurement 2024, 236, 114952. [Google Scholar] [CrossRef]
Li, Y.; Li, Z.; Zhang, C.; Han, M.; Lei, F.; Liang, X.; Wang, X.; Gui, W.; Li, X. Deep learning-driven one-shot dual-view 3-D reconstruction for dual-projector system. IEEE Trans. Instrum. Meas. 2023, 73, 1–14. [Google Scholar] [CrossRef]
Caggiano, A.; Zhang, J.; Alfieri, V.; Caiazzo, F.; Gao, R.; Teti, R. Machine learning-based image processing for on-line defect recognition in additive manufacturing. CIRP Ann. 2019, 68, 451–454. [Google Scholar] [CrossRef]
Wang, H.; He, X.; Zhang, C.; Liang, X.; Zhu, P.; Wang, X.; Gui, W.; Li, X.; Qian, X. Accelerating surface defect detection using normal data with an attention-guided feature distillation reconstruction network. Measurement 2025, 246, 116702. [Google Scholar] [CrossRef]
Nguyen, H.; Novak, E.; Wang, Z. Accurate 3D reconstruction via fringe-to-phase network. Measurement 2022, 190, 110663. [Google Scholar] [CrossRef]
Qiao, G.; Huang, Y.; Song, Y.; Yue, H.; Liu, Y. A single-shot phase retrieval method for phase measuring deflectometry based on deep learning. Opt. Commun. 2020, 476, 126303. [Google Scholar] [CrossRef]
Fan, L.; Wu, Z.; Wang, J.; Wei, C.; Yue, H.; Liu, Y. Deep learning-based Phase Measuring Deflectometry for single-shot 3D shape measurement and defect detection of specular objects. Opt. Express 2022, 30, 26504–26518. [Google Scholar] [CrossRef]
Fan, X.; Ma, T.; Li, C.; Li, Y.; Liu, S.; Chen, H. A deep learning-based approach to solve the height-slope ambiguity in phase measuring deflectometry. Meas. Sci. Technol. 2023, 34, 095007. [Google Scholar] [CrossRef]
Nguyen, M.T.; Ghim, Y.S.; Rhee, H.G. DYnet++: A deep learning based single-shot phase-measuring deflectometry for the 3-D measurement of complex free-form surfaces. IEEE Trans. Ind. Electron. 2023, 71, 2112–2121. [Google Scholar] [CrossRef]
Ghim, Y.S.; Rhee, H.G. Deep learning-based phase measuring deflectometry for one-shot measurement and inspection of specular free-form surfaces. In Proceedings of the Interferometry and Structured Light 2024, San Diego, CA, USA, 21–22 August 2024; SPIE: Washington, DC, USA, 2024; Volume 13135, pp. 4–7. [Google Scholar] [CrossRef]
Chen, M.; Li, Y.; Li, X.; Liang, X.; Li, Z.; Chen, W.; Wang, H.; Zhang, C.; Wang, X.; Gui, W. Single-frame structured light depth map reconstruction with absolute phase-aided supervision. In Proceedings of the Optoelectronic Imaging and Multimedia Technology XI, Nantong, China, 13–15 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13239, pp. 172–177. [Google Scholar] [CrossRef]
Chen, M.; Li, Y.; Li, X.; Li, Z.; Chen, W.; Zhang, C.; Liang, X. An end-to-end structured light depth prediction approach using Mamba networks. In Proceedings of the Optoelectronic Imaging and Multimedia Technology XI, Nantong, China, 13–15 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13239, pp. 178–183. [Google Scholar] [CrossRef]
Chen, W.; Li, Y.; Ma, R.; Wang, S.; Li, Z.; Zhang, C.; Chen, M.; Wang, X.; Gui, W.; Liang, X. X+ 1+ 1: A fast three-frequency heterodyne absolute phase measurement method integrating modified Fourier transform. In Proceedings of the Optical Metrology and Inspection for Industrial Applications XI, Nantong, China, 12–14 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13241, pp. 27–34. [Google Scholar] [CrossRef]
Su, X.; Zhang, Q. Dynamic 3-D shape measurement method: A review. Opt. Lasers Eng. 2010, 48, 191–204. [Google Scholar] [CrossRef]
Nguyen, H.; Wang, Y.; Wang, Z. Single-shot 3D shape reconstruction using structured light and deep convolutional neural networks. Sensors 2020, 20, 3718. [Google Scholar] [CrossRef]
Wang, F.; Wang, C.; Guan, Q. Single-shot fringe projection profilometry based on deep learning and computer graphics. Opt. Express 2021, 29, 8024–8040. [Google Scholar] [CrossRef]
Wang, C.; Zhou, P.; Zhu, J. Deep learning-based end-to-end 3D depth recovery from a single-frame fringe pattern with the MSUNet++ network. Opt. Express 2023, 31, 33287–33298. [Google Scholar] [CrossRef]
Zhu, X.; Han, Z.; Zhang, Z.; Song, L.; Wang, H.; Guo, Q. PCTNet: Depth estimation from single structured light image with a parallel CNN-transformer network. Meas. Sci. Technol. 2023, 34, 085402. [Google Scholar] [CrossRef]
Li, Z.; Li, Y.; Chen, W.; Zhang, C.; Chen, M.; Wang, X.; Gui, W.; Liang, X. DSAS-S2APNet: A dual-stage auxiliary supervision network for single-frame to absolute phase prediction. In Proceedings of the Optical Metrology and Inspection for Industrial Applications XI, Nantong, China, 12–14 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13241, pp. 255–260. [Google Scholar] [CrossRef]
Cai, Y.; Guo, M.; Wang, C.; Lu, X.; Zeng, X.; Sun, Y.; Ai, Y.; Xu, S.; Li, J. Ttfdnet: Precise depth estimation from single-frame fringe patterns. Sensors 2024, 24, 4733. [Google Scholar] [CrossRef] [PubMed]
Li, R.; Wang, X.; Huang, G.; Yang, W.; Zhang, K.; Gu, X.; Tran, S.N.; Garg, S.; Alty, J.; Bai, Q. A comprehensive review on deep supervision: Theories and applications. arXiv 2022, arXiv:2207.02376. [Google Scholar] [CrossRef]
Liu, X.; Xu, X.; Rao, A.; Gan, C.; Yi, L. Autogpart: Intermediate supervision search for generalizable 3d part segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 11624–11634. [Google Scholar] [CrossRef]
Li, C.; Zia, M.Z.; Tran, Q.H.; Yu, X.; Hager, G.D.; Chandraker, M. Deep supervision with intermediate concepts. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 41, 1828–1843. [Google Scholar] [CrossRef]
Nguyen, H.; Ly, K.L.; Tran, T.; Wang, Y.; Wang, Z. hNet: Single-shot 3D shape reconstruction using structured light and h-shaped global guidance network. Results Opt. 2021, 4, 100104. [Google Scholar] [CrossRef]
Zhu, X.; Zhao, H.; Song, L.; Wang, H.; Guo, Q. Triple-output phase unwrapping network with a physical prior in fringe projection profilometry. Appl. Opt. 2023, 62, 7910–7916. [Google Scholar] [CrossRef] [PubMed]
Nguyen, A.H.; Ly, K.L.; Lam, V.K.; Wang, Z. Generalized fringe-to-phase framework for single-shot 3D reconstruction integrating structured light with deep learning. Sensors 2023, 23, 4209. [Google Scholar] [CrossRef]
Qian, J.; Feng, S.; Li, Y.; Tao, T.; Han, J.; Chen, Q.; Zuo, C. Single-shot absolute 3D shape measurement with deep-learning-based color fringe projection profilometry. Opt. Lett. 2020, 45, 1842–1845. [Google Scholar] [CrossRef]
Li, Y.; Qian, J.; Feng, S.; Chen, Q.; Zuo, C. Composite fringe projection deep learning profilometry for single-shot absolute 3D shape measurement. Opt. Express 2022, 30, 3424–3442. [Google Scholar] [CrossRef]
Jiang, Y.; Qin, J.; Liu, Y.; Yang, M.; Cao, Y. Deep-Learning-Based Single-Shot Fringe Projection Profilometry Using Spatial Composite Pattern. IEEE Trans. Instrum. Meas. 2024, 73, 1–14. [Google Scholar] [CrossRef]
Zhu, X.; Lan, T.; Zhao, Y.; Wang, H.; Song, L. End-to-end color fringe depth estimation based on a three-branch U-net network. Appl. Opt. 2024, 63, 7465–7474. [Google Scholar] [CrossRef]
Shen, S.; Lu, R.; Wan, D.; Yin, J.; He, P. Real-Time 3-D Measurement With Dual-Frequency Fringes by Deep Learning. IEEE Sens. J. 2024, 24, 16576–16586. [Google Scholar] [CrossRef]
Yin, W.; Che, Y.; Li, X.; Li, M.; Hu, Y.; Feng, S.; Lam, E.Y.; Chen, Q.; Zuo, C. Physics-informed deep learning for fringe pattern analysis. Opto-Electron. Adv. 2024, 7, 230034-1. [Google Scholar] [CrossRef]
Nawaz, M.; Uvaliyev, A.; Bibi, K.; Wei, H.; Abaxi, S.M.D.; Masood, A.; Shi, P.; Ho, H.P.; Yuan, W. Unraveling the complexity of Optical Coherence Tomography image segmentation using machine and deep learning techniques: A review. Comput. Med. Imaging Graph. 2023, 108, 102269. [Google Scholar] [CrossRef]
Li, Z.; Chen, W.; Liu, C.; Lu, S.; Qian, X.; Wang, X.; Zou, Y.; Li, X. An efficient exposure fusion method for 3D measurement with high-reflective objects. In Proceedings of the Optoelectronic Imaging and Multimedia Technology XI, Nantong, China, 13–15 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13239, pp. 348–356. [Google Scholar] [CrossRef]
Wang, H.; Zhang, Z.; Ma, R.; Zhang, C.; Liang, X.; Li, X. Correction of grating patterns for high dynamic range 3D measurement based on deep learning. In Proceedings of the Optoelectronic Imaging and Multimedia Technology XI, Nantong, China, 13–15 October 2024; SPIE: Washington, DC, USA, 2024; Volume 13239, pp. 317–325. [Google Scholar] [CrossRef]
Wang, H.; Lu, Z.; Huang, Z.; Li, Y.; Zhang, C.; Qian, X.; Wang, X.; Gui, W.; Liang, X.; Li, X. A High-Accuracy and Reliable End-to-End Phase Calculation Network and Its Demonstration in High Dynamic Range 3D Reconstruction. Nanomanuf. Metrol. 2025, 8, 5. [Google Scholar] [CrossRef]
Li, Y.; Chen, W.; Li, Z.; Zhang, C.; Wang, X.; Gui, W.; Gao, W.; Liang, X.; Li, X. SL3D-BF: A Real-World Structured Light 3D Dataset with Background-to-Foreground Enhancement. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 9850–9864. [Google Scholar] [CrossRef]
Zhang, S.; Yau, S.T. High dynamic range scanning technique. Opt. Eng. 2009, 48, 033604. [Google Scholar] [CrossRef]
Zhang, L.; Chen, Q.; Zuo, C.; Feng, S. High-speed high dynamic range 3D shape measurement based on deep learning. Opt. Lasers Eng. 2020, 134, 106245. [Google Scholar] [CrossRef]
Yu, H.; Chen, X.; Zhang, Z.; Zuo, C.; Zhang, Y.; Zheng, D.; Han, J. Dynamic 3-D measurement based on fringe-to-fringe transformation using deep learning. Opt. Express 2020, 28, 9405–9418. [Google Scholar] [CrossRef]
Yao, P.; Gai, S.; Chen, Y.; Chen, W.; Da, F. A multi-code 3D measurement technique based on deep learning. Opt. Lasers Eng. 2021, 143, 106623. [Google Scholar] [CrossRef]
Yao, P.; Gai, S.; Da, F. Coding-Net: A multi-purpose neural network for Fringe Projection Profilometry. Opt. Commun. 2021, 489, 126887. [Google Scholar] [CrossRef]
Song, X.; Wang, L. Y-ffc net for 3d reconstruction of highly reflective surfaces. IEEE Trans. Ind. Inform. 2024, 20, 13966–13974. [Google Scholar] [CrossRef]
Liu, X.; Chen, W.; Madhusudanan, H.; Ge, J.; Ru, C.; Sun, Y. Optical measurement of highly reflective surfaces from a single exposure. IEEE Trans. Ind. Inform. 2020, 17, 1882–1891. [Google Scholar] [CrossRef]
Nayar, S.K.; Krishnan, G.; Grossberg, M.D.; Raskar, R. Fast separation of direct and global components of a scene using high frequency illumination. In ACM SIGGRAPH 2006 Papers; Association for Computing Machinery: New York, NY, USA, 2006; pp. 935–944. [Google Scholar] [CrossRef]
Drouin, M.A.; Godin, G. Deconvolution-based structured light system with geometrically plausible regularization. In Proceedings of the 2008 Congress on Image and Signal Processing, Sanya, China, 27–30 May 2008; Volume 3, pp. 557–564. [Google Scholar] [CrossRef]
Drouin, M.A.; Godin, G.; Blais, F. Efficient representation of the variant PSF of structured light system. In Proceedings of the 2010 IEEE International Conference on Image Processing, Hong Kong, China, 26–29 September 2010; pp. 1693–1696. [Google Scholar] [CrossRef]
Chen, T.; Lensch, H.P.; Fuchs, C.; Seidel, H.P. Polarization and phase-shifting for 3D scanning of translucent objects. In Proceedings of the 2007 IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA, 17–22 June 2007; pp. 1–8. [Google Scholar] [CrossRef]
Chen, T.; Seidel, H.P.; Lensch, H.P. Modulated phase-shifting for 3D scanning. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; pp. 1–8. [Google Scholar] [CrossRef]
Gupta, M.; Nayar, S.K. Micro phase shifting. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 813–820. Available online: https://api.semanticscholar.org/CorpusID:14927216 (accessed on 3 October 2025).
Couture, V.; Martin, N.; Roy, S. Unstructured light scanning to overcome interreflections. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011; pp. 1895–1902. [Google Scholar] [CrossRef]
Moreno, D.; Son, K.; Taubin, G. Embedded phase shifting: Robust phase shifting with embedded signals. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 2301–2309. [Google Scholar] [CrossRef]
Zuo, C.; Tao, T.; Feng, S.; Huang, L.; Asundi, A.; Chen, Q. Micro Fourier transform profilometry (μFTP): 3D shape measurement at 10,000 frames per second. Opt. Lasers Eng. 2018, 102, 70–91. [Google Scholar] [CrossRef]
Zhang, S.; Van Der Weide, D.; Oliver, J. Superfast phase-shifting method for 3-D shape measurement. Opt. Express 2010, 18, 9684–9689. [Google Scholar] [CrossRef]
Gong, Y.; Zhang, S. Ultrafast 3-D shape measurement with an off-the-shelf DLP projector. Opt. Express 2010, 18, 19743–19754. [Google Scholar] [CrossRef] [PubMed]
Lei, S.; Zhang, S. Flexible 3-D shape measurement using projector defocusing. Opt. Lett. 2009, 34, 3080–3082. [Google Scholar] [CrossRef] [PubMed]
Zuo, C.; Chen, Q.; Feng, S.; Feng, F.; Gu, G.; Sui, X. Optimized pulse width modulation pattern strategy for three-dimensional profilometry with projector defocusing. Appl. Opt. 2012, 51, 4477–4490. [Google Scholar] [CrossRef] [PubMed]
Heist, S.; Lutzke, P.; Schmidt, I.; Dietrich, P.; Kühmstedt, P.; Tünnermann, A.; Notni, G. High-speed three-dimensional shape measurement using GOBO projection. Opt. Lasers Eng. 2016, 87, 90–96. [Google Scholar] [CrossRef]
Wang, B.; Chen, W.; Qian, J.; Feng, S.; Chen, Q.; Zuo, C. Single-shot super-resolved fringe projection profilometry (SSSR-FPP): 100,000 frames-per-second 3D imaging with deep learning. Light. Sci. Appl. 2025, 14, 70. [Google Scholar] [CrossRef]
Wang, Y.; Zhou, C.; Qi, X.; Li, H. UHRNet: A deep learning-based method for accurate 3D reconstruction from a single fringe-pattern. J. Mod. Opt. 2023, 70, 707–722. [Google Scholar] [CrossRef]
Li, Y.; Shen, J.; Wu, Z.; Wang, Y.; Zhang, Q. Real-time 3d imaging based on roi fringe projection and a lightweight phase-estimation network. Adv. Imaging 2024, 1, 021004. [Google Scholar] [CrossRef]
Pan, S.J.; Yang, Q. A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 2009, 22, 1345–1359. [Google Scholar] [CrossRef]
Zheng, Y.; Chao, Q.; An, Y.; Hirsh, S.; Fix, A. Fringe projection-based single-shot 3d eye tracking using deep learning and computer graphics. In Proceedings of the Optical Architectures for Displays and Sensing in Augmented, Virtual, and Mixed Reality (AR, VR, MR) IV, San Diego, CA, USA, 20–24 August 2023; SPIE: Washington, DC, USA, 2023; Volume 12449, pp. 265–275. [Google Scholar] [CrossRef]
Xu, M.; Zhang, Y.; Wan, Y.; Luo, L.; Peng, J. Single-shot multi-frequency 3D shape measurement for discontinuous surface object based on deep learning. Micromachines 2023, 14, 328. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Flowchart of the article structure.

Figure 2. Typical experimental setup and workflow of FPP-based structured-light 3D reconstruction [44].

Figure 3. Illustration of three common phase unwrapping (TPU) methods. (a) Gray-code PU showing the binary-to-decimal decoding process. (b) Multi-frequency PU using different fringe periods for coarse-to-fine unwrapping. (c) Multi-wavelength PU leveraging synthetic wavelengths to extend the unwrapping range.

Figure 4. Schematic diagram of phase unwrapping based on directed parallel mapping [61].

Figure 5. Comparison of 3D shape recovery methods. (a) PMD reconstructs surface gradients, which can be further integrated into object shape [37]. (b) FPP directly recovers depth maps via phase-based reconstruction.

Figure 6. Illustration of how the projector observes the measurement point with the aid of cameras [74].

Figure 7. Illustration of three typical phase-to-height mapping models. (a) Paraxial approximation model. (b) Planar reference-based model. (c) Shape estimation and reprojection model [34].

Figure 8. Schematic diagram of a multi-screen configuration in direct PMD systems. (a) Model based on screen movement. (b) Model based on DPMD [34].

Figure 9. Illustration of stereo deflectometry [34].

Figure 10. MEMS scanning mirror-based laser scanning FPP system.

Figure 11. (a) Comparison of typical structured-light projection methods and their performance. (b) Radar charts comparing the performance of each projection method across multiple criteria [108].

Figure 12. Three calibration models of MEMS-based structured-light systems. (a) Joint calibration model. (b) Equal-phase surface model. (c) Phase-angle model [108].

Figure 13. Illustration of two factors affecting phase accuracy in structured-light systems. (a) Random intensity noise. (b) Non-ideal line width smoothing effect [108].

Figure 14. Layered phase-shifting method proposed by Han et al. (a) Principle of the layered phase-shifting method. (b) Sensitivity of the internal phase-shifting method to harmonic distortions. (c) Experimental results of the nested (internal–external) phase-shifting method [108].

Figure 15. Schematic diagram of the traditional phase-shifting measurement process.

Figure 16. Different network supervision mechanisms. (a) End-to-End supervision. (b) Deep supervision. (c) Branch-wise design.

Figure 17. Evaluation metrics for deep learning-enabled FPP. (a) Quantitative results over test scenes under different conditions such as isolated object, dark lighting, and textured background. (b) Standard object test including 3D reconstruction and error histogram analysis. (c) Generalization validation on challenging objects like metallic surfaces. (d) Unified quantitative metrics involving phase/depth accuracy, point cloud accuracy, and network complexity.

Table 1. Representative reviews and studies on fringe-based structured-light 3D reconstruction.

Author	Year	FPP	PMD	MEMS	Deep Learning	Description
Tobias Möller et al. [33]	2005	×	✓	×	×	Early review of PMD-range imaging
Xu et al. [34]	2020	×	✓	×	×	PMD for 3D specular-surface measurement
Lv et al. [36]	2020	✓	×	×	×	FPP measurement theory
Kulkarni et al. [38]	2020	✓	×	✓	×	Fringe denoising algorithms
He et al. [35]	2021	✓	×	×	×	Temporal-phase unwrapping methods
Liu et al. [39]	2024	✓	×	×	✓	Deep learning in fringe projection
Bai et al. [37]	2024	✓	✓	×	✓	Three-dimensional shape measurement
Our article	2025	✓	✓	✓	✓	First comprehensive review systematically summarizing FPP, PMD, MEMS, and deep learning integration

Table 2. Performance comparison of typical structured-light systems.

Parameter	Interference	Physical Grating	LCD	DLP	MEMS
Accuracy	$10^{- 1}$ mm	$10^{- 3}$ mm	$10^{- 2}$ mm	$10^{- 3}$ mm	$10^{- 3}$ mm
Speed	∼50 fps	∼100 fps	∼50 fps	∼120 fps	>1000 fps
Resolution	<1 K	<1 K	∼1 K	∼1 K	>4 K
Programmable	No	No	Yes	Yes	Yes
Power Consumption	∼100 W	∼300 W	∼40 W	∼50 W	∼5 W
Cost	> $$ 10, 000$	> $$ 10, 000$	∼ $$ 1500$	∼ $$ 2000$	∼ $$ 500$
Optical Efficiency	Medium	Low	Medium	Low	High

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, Z.; Wang, H.; Li, Y.; Li, Z.; Gui, W.; Wang, X.; Zhang, C.; Liang, X.; Li, X. Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration. Sensors 2025, 25, 6296. https://doi.org/10.3390/s25206296

AMA Style

Zhang Z, Wang H, Li Y, Li Z, Gui W, Wang X, Zhang C, Liang X, Li X. Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration. Sensors. 2025; 25(20):6296. https://doi.org/10.3390/s25206296

Chicago/Turabian Style

Zhang, Zhongyuan, Hao Wang, Yiming Li, Zinan Li, Weihua Gui, Xiaohao Wang, Chaobo Zhang, Xiaojun Liang, and Xinghui Li. 2025. "Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration" Sensors 25, no. 20: 6296. https://doi.org/10.3390/s25206296

APA Style

Zhang, Z., Wang, H., Li, Y., Li, Z., Gui, W., Wang, X., Zhang, C., Liang, X., & Li, X. (2025). Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration. Sensors, 25(20), 6296. https://doi.org/10.3390/s25206296

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Fringe-Based Structured-Light 3D Reconstruction: Principles, Projection Technologies, and Deep Learning Integration

Abstract

1. Introduction

2. Fringe-Structured-Light 3D Reconstruction Approach

2.1. Wrapped-Phase Extraction

2.2. Phase Unwrapping Algorithms

2.2.1. Temporal-Phase Unwrapping

2.2.2. Spatial-Phase Unwrapping

2.3. 3D Shape Reconstruction from Phase

2.3.1. 3D Shape Recovery in PMD

2.3.2. 3D Shape Recovery in FPP

3. Evolution and Advances of PMD Systems

3.1. Single-Screen and Single-Camera Systems

3.2. Multi-Screen Direct PMD

3.3. Multi-Camera Stereo PMD

4. Evolution and Advances of FPP Systems

4.1. Comparison of Mainstream Fringe Projection Technologies

4.2. System Calibration Strategies for MEMS-Based Structured-Light Systems

4.2.1. Joint Calibration Model

4.2.2. Equal-Phase Surface Model

4.2.3. Phase-Angle Model

4.3. Analysis of Systematic and Random Errors

4.3.1. Random Errors

4.3.2. Impact of Line Laser Intensity Fluctuations

4.3.3. High-Order Harmonics

5. Application of Deep Learning in Fringe-Structured Light

5.1. Learning Paradigm for Deep Learning-Driven FPP

5.2. Deep Learning Framework Design and Advancements

5.2.1. Network Architecture Innovations

5.2.2. Supervision Strategies

5.2.3. Input Design

5.3. Evaluation Metrics

6. Challenges and Perspectives

6.1. HDR Issues

6.2. Extended Depth of Field

6.3. High-Speed Deployment and Real-Time Reconstruction

6.4. Transferability, Generalization, and Interpretability of Deep Learning Methods

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI