Article

Single-Shot, Monochrome, Spatial Pixel-Encoded, Structured Light System for Determining Surface Orientations

1 College of Intelligent Systems, Science and Engineering, Harbin Engineering University, Harbin 150001, China
2 College of Mathematical Sciences, Harbin Engineering University, Harbin 150001, China
* Authors to whom correspondence should be addressed.
Photonics 2024, 11(11), 1046; https://doi.org/10.3390/photonics11111046
Submission received: 20 September 2024 / Revised: 27 October 2024 / Accepted: 4 November 2024 / Published: 7 November 2024
(This article belongs to the Special Issue Optical Sensors and Devices)

Abstract

This study introduces a technique for determining surface orientations by projecting a monochrome, spatial pixel-encoded pattern and calculating the surface normals from a single-shot measurement. Our method differs from traditional methods, such as shape from shading and shape from texture, in that it does not require relating the local surface orientations of adjacent points. We propose a multi-resolution system incorporating symbols of 8 × 8, 10 × 10, 12 × 12, 14 × 14, and 16 × 16 pixels. Compared to previous methods, we achieved a denser reconstruction and obtained a 5.2 mm resolution using an 8 × 8 pattern at a depth of 110 cm. Unlike previous methods, which used local point orientations of grid intersections and multiple colors, we used a monochrome pattern and deterministic centroid positions to compute the unit (direction) vector between neighboring symbols. The light plane intersections are used to calculate the tangent vectors on the surface, and the surface normals are determined by the cross-product of two tangent vectors. A real experiment was conducted to measure simple plane surfaces, circular surfaces, and complex sculptures. The results show that the process of calculating surface normals is fast and reliable; for complex surfaces such as sculptures, we computed 1654 surface normals in 29.4 milliseconds.

1. Introduction

Structured light (SL) is a useful and low-cost method for 3D measurements, and various SL methods have evolved over the past few decades [1,2]. SL can provide high accuracy and dense reconstruction; however, it is sensitive to object color and texture [3,4]. SL-based systems work well for measuring objects at close range but cannot be employed for far-distance measurement. SL techniques are mainly classified into temporal and spatial encoding schemes [5]. Temporal coding schemes include binary and gray coding, phase shift, and fringe patterns [6,7]. Spatial encoding schemes include 1D and 2D grid-indexed-based methods [8]. The advantage of spatial encoding schemes is that they are fast and single-shot [9]. They include the De Bruijn sequence [10], pseudo-random sequences or M-arrays [11,12,13,14,15,16], non-formal coding schemes [17,18], and speckle projection techniques [19].
The two principal descriptors for defining shapes or objects in 3D are their position and orientation profiles. The orientation profile of the shapes is generally represented by surface normals, which describe the orientation of the neighborhood points on the surface. A surface normal at a certain point is a vector perpendicular to the tangential plane of the surface at that point. It defines the orientation of the surface at that point. Orientation can be realized by correlating the neighboring position data. However, this may not be independent information and depends on a particular abstraction of position data. Different densities of the same position data may result in various orientations, necessitating an independent determination of orientation.
The orientation profile has been determined using several methods in the computer vision community, including photometric stereo [20], texture [21,22], shading [23], specular flow [24], and polarization [25]. Traditional methods, such as texture and shading, require specific object properties to determine the surface orientation. These properties include consistent albedo or uniform texture distribution on the surface. This may restrict the application of these methods to a range of objects that lack such attributes. This work employs a monochrome spatial pixel-encoded symbol-based pattern to determine the orientation profile regardless of the position on the surface. Thus, it overcomes the limitations of traditional methods, as spatial SL encoding can produce artificial and uniform textures on the object surface regardless of position. The monochrome pattern also allows us to measure colored surfaces with non-uniform albedo.
The computation of surface normals is essential for many visualization and object registration [26], 3D geometry [27], semantic segmentation [28], classification [29], recognition [30], and representation [31] problems in artificial intelligence (AI) and robotics applications. Fast and accurate estimation of surface normals is required in many automotive applications [32].
The existing literature on SL is mainly related to determining the position profiles of a surface; fewer works calculate surface normals through SL. For instance, a grid pattern is employed in [33] by using two orthogonal stripe patterns for static and smooth planar surfaces to find the orientation and structure of a surface. The grid pattern has the advantages of both simple point and line patterns. However, sharp discontinuities may appear as abrupt changes on the surface, so grid coding imposes weak constraints on the surface of 3D objects [34]. In another approach, the target object is illuminated with a grid pattern [35]. The surface normals at the grid points (where the intersections of the grid lines occur) are determined by analyzing the variations in the lengths of the grid edges. Significant inaccuracies occur in some regions, mainly where the surface orientation differs significantly from the reference plane or where there is non-zero surface curvature.
In [36], surface normals were determined by projecting stripe patterns and estimating the slope and intervals between the stripes for a smooth planar surface. Labeling the intersecting points of the grid is also time consuming, particularly if some parts of the object are occluded. In [37], a pseudo-random array of color dots was used as an illumination pattern. It was assumed that each circular dot on the surface would appear as an ellipse in the image. The center points of such ellipses were regarded as feature points, and a method was used to calculate the surface normals from the shear and scaling factors of the ellipse along the epipolar lines. The approach assumes that the object surface is planar, not only in the immediate vicinity of the elliptical pattern elements but also in the far neighborhood, so the method is not suitable for measuring objects with large curvatures. Moreover, multiple dots of the same color may appear on the same epipolar line, causing ambiguous decoding.
In [38], two strip patterns were projected at different angles to compute two surface tangents for the same position. In this approach, the angle of the strips depends upon the orientation of the 3D object and gradient operation, thus resulting in outliers and erroneous noisy measurements. A method for determining surface normals using the slope and widths of the stripes was introduced in [39]. This approach relies on the assumption that the surface patches between the two stripe edges are either planar or highly smooth, and the inclination angle can be estimated from the deformed width of the stripe by comparing it to the measured width on the reference plane. In this approach, it was assumed that the pattern lighting and imaging system has a parallel orientation. Therefore, the intrinsic parameters of the camera and projector are neglected. Hence, errors due to overly simplified projection and image models are unavoidable.
Among all the mentioned attempts, the most important is [40], which utilized grid-point intersections of rhombic-shaped color elements, spread through a robust pseudo-random sequence, as grid-line feature points. Due to surface curvature or texture, the neighboring rhombic-shaped symbols forming grid intersections may become disconnected, and the exact location of the grid-intersection points of the rhombic-shaped color symbols cannot be determined in the presence of image noise. The grid-intersection points in the image are determined by calculating the differentials of raw intensities between neighboring locations through a mask. This process amplifies image noise and decreases accuracy. Moreover, because multiple colors are used, the projected pattern is prone to noise and segmentation errors influenced by the surface color: with highly saturated surface colors, the projected illumination can easily be distorted by the intrinsic shades of the measured surface. This method utilized local point orientation angles to compute the normals of the intersecting planes and, from them, the surface tangents. When the surface curvature modulates the projected pattern such that the neighboring rhombic-shaped elements become disconnected, the exact location of the grid-point intersections and the accurate estimation of local point orientations are impeded.
In contrast, we utilized the deterministic centroid positions of adjacent symbols to compute the surface normals. We propose an improvement to the method presented in [40] by implementing it through a monochromatic, multi-resolution system. The monochromatic geometric symbol-based method is an alternative to the color-encoding technique; monochrome geometric symbol-based methods are more resilient than color encoding to the effects of ambient light, uneven albedos, and surface colors [41]. Instead of using local point orientation angles in the computation, we used the deterministic centroid positions to determine the unit vector or direction vector between neighboring symbols. These unit vectors are utilized to calculate the normals of the intersecting planes, which in turn determine the surface tangents. The unit-vector calculations simplify the process of determining the normals to the projection-side light planes, as all the unit vectors on the projection side are parallel to the X and Y axes. Our previous works [14,42] introduced pixel-encoded patterns for encoding SL to find the position information. We presented a flexible method to design patterns according to the surface-area requirements and measurement precision by using a multi-resolution system through single-shot measurement. This was achieved by employing robust pseudo-random sequences of any required size, depending upon the projector's resolution. In this work, we propose a grid-indexed SL method to determine the orientation regardless of the surface position.

2. Materials and Methods

2.1. Designing of a Pattern

This section explains how to form a projection pattern for determining the surface orientation using spatially encoded SL patterns. These patterns use pixel-encoded monochrome symbols of 8 × 8, 10 × 10, 12 × 12, 14 × 14, and 16 × 16 pixels, spread through corresponding robust pseudo-random sequences or M-arrays.

2.1.1. Defining Symbols

We propose a multi-resolution system in which the symbol sizes are 8 × 8, 10 × 10, 12 × 12, 14 × 14, and 16 × 16 pixels in symbol space. The symbols used to form the projection pattern are depicted in Figure 1.
These symbols were proposed in our previous work, where their physical properties were analyzed and recorded [14]. To implement the algorithm for surface normal computation, we selected fourteen symbols whose centroid positions are exactly at the center of their symbol spaces. Symbols with a centroid at the center of the symbol space are chosen to simplify the computation of surface normals, since the neighborhood symbols in the projection pattern then align in horizontal rows or vertical columns, as will be explained in Section 2.2.
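As a minimal illustration of this selection criterion (assuming each symbol is available as a binary NumPy tile; the actual symbol bitmaps of [14] are not reproduced here), the following sketch checks whether a symbol's centroid coincides with the center of its symbol space:

```python
import numpy as np

def centroid_is_centered(symbol: np.ndarray, tol: float = 0.5) -> bool:
    """Return True if the binary symbol's centroid lies at the center of its tile.

    symbol : 2D array of 0/1 values (e.g., an 8x8 ... 16x16 pixel tile).
    tol    : allowed deviation from the geometric center, in pixels.
    """
    ys, xs = np.nonzero(symbol)                    # coordinates of lit pixels
    cy, cx = ys.mean(), xs.mean()                  # intensity centroid
    h, w = symbol.shape
    center_y, center_x = (h - 1) / 2.0, (w - 1) / 2.0
    return abs(cy - center_y) <= tol and abs(cx - center_x) <= tol

# Example: a plus-shaped 8x8 symbol is symmetric, so the check passes.
plus = np.zeros((8, 8), dtype=np.uint8)
plus[3:5, :] = 1
plus[:, 3:5] = 1
print(centroid_is_centered(plus))  # True
```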

2.1.2. Robust Pseudo-Random Sequences or M-Arrays

To spread the symbols in a projection pattern in a controllable manner, we generated five M-arrays of different sizes, formed using the method described in our previous work [14]. We used a projector with a resolution of 800 × 1280 and proposed a multi-resolution system with patterns designed from the fourteen symbols. We generated five M-arrays, one for each symbol size. Each M-array was built on a four-symbol alphabet; therefore, four of the fourteen symbols were used simultaneously in a projection pattern.
The robustness of the codewords in all the M-arrays was ensured by computing the Hamming distances between each codeword. The dimensions, number of symbols, Hamming distances, percentile of the robust codeword, and number of feature points contributed by each M-array are shown in Table 1.
The Hamming distance profiles of each M-array formed for use in the projection pattern are represented in Figure 2. The profiles reflect that most of the codewords have Hamming distances greater than 3, and about 60% of the codewords have Hamming distances of 7, 8, and 9, which shows that the generated M-arrays are robust.
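The robustness check can be reproduced with a short sketch. This is a simplified illustration, assuming the M-array is stored as a 2D NumPy array over a four-symbol alphabet and that each codeword is a 3 × 3 window; it is not the optimized implementation used to build Figure 2:

```python
import numpy as np
from itertools import combinations
from collections import Counter

def hamming_profile(m_array: np.ndarray, window: int = 3) -> Counter:
    """Histogram of pairwise Hamming distances between all window x window codewords."""
    rows, cols = m_array.shape
    codewords = [
        m_array[r:r + window, c:c + window].ravel()
        for r in range(rows - window + 1)
        for c in range(cols - window + 1)
    ]
    profile = Counter()
    for a, b in combinations(codewords, 2):
        profile[int(np.sum(a != b))] += 1
    return profile

# Toy example: a small random 4-symbol array (a real M-array is, e.g., 90 x 144).
rng = np.random.default_rng(0)
toy = rng.integers(0, 4, size=(10, 12))
print(sorted(hamming_profile(toy).items()))
```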

2.1.3. Formation of a Projection Pattern

We used the five M-arrays with four symbols to generate five projection patterns using the symbols proposed in [14]. The number of feature points in the pattern increases with smaller symbol sizes, as the M-arrays for smaller symbols have larger dimensions while the projector resolution remains constant at 800 × 1280. Conversely, as the symbol size in the pattern increases, both the number of feature points and the M-array dimensions decrease. This trend is illustrated in the seventh column of Table 1. The number of feature points for an 8 × 8 symbol resolution is 12,496, while for a 10 × 10 resolution it decreases to 8352, and for a 12 × 12 resolution it is further reduced to 5187. Similarly, the 14 × 14 pattern resolution comprises 4000 feature points, while the 16 × 16 symbol resolution possesses the fewest feature points, 3124. So, the number of feature points decreases from 12,496 for the 8 × 8 pattern to almost one-fourth, specifically 3124, for the 16 × 16 pattern resolution. Consequently, smaller symbol sizes enable denser reconstruction of 3D points and the computation of a greater number of surface normals on the measured object. Parts of the five generated projection patterns using single-centroid symbols are shown in Figure 3.

2.2. Computation of Surface Normals

To compute surface normals, we used the principle described in [40] with necessary modifications. Figure 4 explains the mechanism of calculating surface normals.

2.2.1. Defining of Points Geometry

Let us consider three points, P1 (X1, Y1, Z1), P2 (X2, Y2, Z2), and P3 (X3, Y3, Z3), on the surface of a 3D object in the world coordinate system. The corresponding points on the camera image plane are Pc1 (xc1, yc1, fc), Pc2 (xc2, yc2, fc), and Pc3 (xc3, yc3, fc). Similarly, the corresponding points on the projector pattern plane are Pp1 (xp1, yp1, fp), Pp2 (xp2, yp2, fp), and Pp3 (xp3, yp3, fp). The projector induces these points on the surface of the 3D object, while the camera records them. P1 is the point where we compute the surface normal or, more specifically, the centroid position of the symbol under consideration induced by the projected pattern; P1 is thus the projection of Pp1, and it is observed in the camera image as Pc1. P2 is the right-side neighborhood point or, more specifically, the centroid of the right-side neighborhood symbol induced on the surface by the projected pattern; P2 is the projection of Pp2 and is observed in the camera image as Pc2. P3 is the down-side neighborhood point or, more specifically, the centroid of the down-side neighborhood symbol induced on the surface by the projected pattern; P3 is the projection of Pp3 and is observed in the camera image as Pc3. The points in the projected pattern are fully accessible since they are entities under system design, and the points in the camera image are the entities under observation. Point P1 is selected where the correspondence between the projected pattern and the camera image matches exactly.

2.2.2. Defining Device Parameters, Projection, and Image Planes

C is the camera’s optical center, and P is the projector’s optical center. The optical centers of the projector and camera are at the distance of focal lengths from the midpoints of the projector pattern plane and camera image plane. The camera focal length is denoted by fc, and the projector focal length is denoted by fp. Xc and Yc are the dimensions of the camera image plane, and Xp and Yp are the dimensions of the projector pattern plane. Oc (xoc = Xc/2, yoc = Yc/2, fc) and Op (xop = Xp/2, yop = Yp/2, fp) are the midpoints of the camera image plane and projector pattern plane, respectively, and they are exactly half of the camera and projector X and Y dimensions.

2.2.3. Defining Light Planes and Their Normals

npx is the normal to the light plane defined by the projector optical center, P, and the world coordinates P1 and P2, i.e., ∏PP1P2. The corresponding light plane on the camera side, from the camera's optical center, C, through the points P1 and P2, is ∏CP1P2, and its normal is denoted by ncx. The two common vertices of these two light planes are formed by the current symbol considered for surface orientation calculation and its right-side neighborhood symbol, which are induced on the surface by the projected pattern. The edges of these light planes pass through the centroids of the current symbol and its right-side neighborhood symbol in both the projected pattern and the camera image. The two light planes, i.e., ∏PP1P2, which passes through the projected pattern, and ∏CP1P2, which passes through the camera image, intersect each other along the curve between the two surface points P1 and P2. Similarly, npy is the normal to the light plane defined by the projector's optical center, P, and the world coordinates P1 and P3, i.e., ∏PP1P3. The corresponding light plane on the camera side, from the camera's optical center, C, through the points P1 and P3, is ∏CP1P3, and its normal is denoted by ncy. The two common vertices of these two light planes are formed by the current symbol and its down-side neighborhood symbol, which are induced on the surface by the projected pattern. The edges of these light planes pass through the centroids of the current symbol and its down-side neighborhood symbol in both the projected pattern and the camera image. The two light planes, i.e., ∏PP1P3, which passes through the projected pattern, and ∏CP1P3, which passes through the camera image, intersect each other along the curve between the two surface points P1 and P3.

2.2.4. Underlying Principle

The smaller planes on the projector and camera sides, i.e., the light planes from the optical centers (P or C) to the coordinate points on the projector (Pp1, Pp2) or (Pp1, Pp3) and on the camera (Pc1, Pc2) or (Pc1, Pc3), are part of the bigger planes, i.e., the light planes from the optical centers of the projector and camera to the 3D world coordinate points (P1, P2) or (P1, P3) on the measured object. The smaller light plane ∏PPp1Pp2 is part of the bigger light plane ∏PP1P2. Similarly, the smaller light plane ∏PPp1Pp3 is part of the bigger light plane ∏PP1P3. On the camera side, the smaller light plane ∏CPc1Pc2 is part of the bigger light plane ∏CP1P2, and the smaller light plane ∏CPc1Pc3 is part of the bigger light plane ∏CP1P3. Thus, any normal of the smaller plane is also a normal of the bigger plane. From the smaller planes of the projector and camera, we can easily determine the normals to the bigger planes that intersect on the 3D object, since all the parameters of the smaller light planes are known.

Computation of Light Planes Normals to the Projector Side

On the projector side, the normal to the light plane ∏PP1P2 can be calculated from the smaller plane ∏PPp1Pp2. The normal to this plane is simply the vector cross-product of the position vector at point Pp1 and the unit vector between the points Pp1 and Pp2. Similarly, the normal to the light plane ∏PP1P3 can be calculated from the smaller plane ∏PPp1Pp3; it is the vector cross-product of the position vector at point Pp1 and the unit vector between the points Pp1 and Pp3.
$$\mathbf{n}_p = \begin{bmatrix} x_{p1}-x_{op} \\ y_{p1}-y_{op} \\ f_p \end{bmatrix} \times \begin{bmatrix} \hat{c}_{px} \\ \hat{c}_{py} \\ 0 \end{bmatrix} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_{p1}-x_{op} & y_{p1}-y_{op} & f_p \\ \hat{c}_{px} & \hat{c}_{py} & 0 \end{vmatrix} \qquad (1)$$
where i, j, and k are the unit vectors along the X, Y, and Z axes, and ĉpx and ĉpy are the components of the unit vector between the points (Pp1 to Pp2) or (Pp1 to Pp3). If the unit vector is calculated between Pp1 and its right-side neighborhood point Pp2, it can be written as:
$$\hat{c}_p = \frac{(x_{p2}-x_{p1})\,\mathbf{i} + (y_{p2}-y_{p1})\,\mathbf{j}}{\sqrt{(x_{p2}-x_{p1})^2 + (y_{p2}-y_{p1})^2}} = \hat{c}_{px}\,\mathbf{i} + \hat{c}_{py}\,\mathbf{j} \qquad (2)$$
Similarly, if the unit vector is calculated between Pp1 and its down-side neighborhood point Pp3, it can be written as:
$$\hat{c}_p = \frac{(x_{p3}-x_{p1})\,\mathbf{i} + (y_{p3}-y_{p1})\,\mathbf{j}}{\sqrt{(x_{p3}-x_{p1})^2 + (y_{p3}-y_{p1})^2}} = \hat{c}_{px}\,\mathbf{i} + \hat{c}_{py}\,\mathbf{j} \qquad (3)$$
The advantage of using symbols whose centroid lies exactly at the center of the symbol space becomes evident here: the two neighborhood symbols in the projection pattern are horizontally aligned for the right-side neighborhood and vertically aligned for the down-side neighborhood. The unit-vector computation therefore simplifies the determination of the projection-side planes, as all the unit vectors on the projection side are parallel to either the X- or the Y-axis. For the right-side neighborhood, which is aligned with the X-axis, (yp2 − yp1) = 0, so ĉpx = 1 and ĉpy = 0; for the down-side neighborhood, which is aligned with the Y-axis, (xp3 − xp1) = 0, so ĉpy = 1 and ĉpx = 0. Therefore, the normals to the light planes aligned with the X-axis and the Y-axis can be easily determined as:
$$\mathbf{n}_{px} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_{p1}-x_{op} & y_{p1}-y_{op} & f_p \\ 1 & 0 & 0 \end{vmatrix} = f_p\,\mathbf{j} - (y_{p1}-y_{op})\,\mathbf{k} = \begin{bmatrix} 0 \\ f_p \\ -(y_{p1}-y_{op}) \end{bmatrix} \qquad (4)$$
$$\mathbf{n}_{py} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_{p1}-x_{op} & y_{p1}-y_{op} & f_p \\ 0 & 1 & 0 \end{vmatrix} = -f_p\,\mathbf{i} + (x_{p1}-x_{op})\,\mathbf{k} = \begin{bmatrix} -f_p \\ 0 \\ x_{p1}-x_{op} \end{bmatrix} \qquad (5)$$
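For illustration, the projector-side normals of Equations (4) and (5) can be evaluated directly from the centroid coordinates, as in the following sketch (variable names and the example values are hypothetical):

```python
import numpy as np

def projector_plane_normals(p_p1, f_p, principal_point):
    """Normals of the projector-side light planes through the symbol centroid Pp1.

    p_p1            : (x_p1, y_p1) centroid of the current symbol in the pattern.
    f_p             : projector focal length (pixels).
    principal_point : (x_op, y_op) midpoint of the pattern plane.
    Returns (n_px, n_py) as in Equations (4) and (5).
    """
    x_p1, y_p1 = p_p1
    x_op, y_op = principal_point
    pos = np.array([x_p1 - x_op, y_p1 - y_op, f_p], dtype=float)
    n_px = np.cross(pos, [1.0, 0.0, 0.0])   # right-side neighbour: unit vector along X
    n_py = np.cross(pos, [0.0, 1.0, 0.0])   # down-side neighbour: unit vector along Y
    return n_px, n_py

# Example: n_px = [0, f_p, -(y_p1 - y_op)], n_py = [-f_p, 0, x_p1 - x_op]
print(projector_plane_normals((420.0, 300.0), f_p=1400.0, principal_point=(400.0, 640.0)))
```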

Computation of Light Plane Normals to the Camera Side

On the camera side, the normal to the light plane ∏CP1P2 can be calculated from the smaller plane ∏CPc1Pc2. The normal to this plane is simply the vector cross-product of the position vector at point, Pc1, and the unit vector between points, Pc1 and Pc2. Similarly, the normal to the light plane ∏CP1P3 can be calculated from the smaller plane ∏CPc1Pc3. The normal to this plane is also the vector cross-product of the position vector at point, Pc1, and the unit vector between the points, Pc1 and Pc3.
$$\mathbf{n}_c = \begin{bmatrix} x_{c1}-x_{oc} \\ y_{c1}-y_{oc} \\ f_c \end{bmatrix} \times \begin{bmatrix} \hat{c}_{cx} \\ \hat{c}_{cy} \\ 0 \end{bmatrix} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_{c1}-x_{oc} & y_{c1}-y_{oc} & f_c \\ \hat{c}_{cx} & \hat{c}_{cy} & 0 \end{vmatrix} \qquad (6)$$
where ĉcx and ĉcy are the components of the unit vector between the points (Pc1 to Pc2) or (Pc1 to Pc3). The arc between the real-world points, P1 to P2 and P1 to P3, on the 3D object surface will appear as the curves in the camera image points, Pc1 to Pc2 and Pc1 to Pc3. So, if the unit vector is calculated between Pc1 and right-side neighborhood point, Pc2, it can be computed as:
$$\hat{c}_c = \frac{(x_{c2}-x_{c1})\,\mathbf{i} + (y_{c2}-y_{c1})\,\mathbf{j}}{\sqrt{(x_{c2}-x_{c1})^2 + (y_{c2}-y_{c1})^2}} = \hat{c}_{cx}\,\mathbf{i} + \hat{c}_{cy}\,\mathbf{j} \qquad (7)$$
where $\hat{c}_{cx} = \dfrac{x_{c2}-x_{c1}}{\sqrt{(x_{c2}-x_{c1})^2 + (y_{c2}-y_{c1})^2}}$ and $\hat{c}_{cy} = \dfrac{y_{c2}-y_{c1}}{\sqrt{(x_{c2}-x_{c1})^2 + (y_{c2}-y_{c1})^2}}$.
Since the unit vector is calculated between Pc1 and the right-side neighborhood point Pc2, (xc2, yc2) can be written as (xRNB, yRNB), while the initial point (xc1, yc1) is the point where the surface normal is computed. Therefore, Equation (7) becomes:
$$\hat{c}_c^{\,RNB} = \frac{(x_{RNB}-x_{c1})\,\mathbf{i} + (y_{RNB}-y_{c1})\,\mathbf{j}}{\sqrt{(x_{RNB}-x_{c1})^2 + (y_{RNB}-y_{c1})^2}} = \hat{c}_{cx}^{\,RNB}\,\mathbf{i} + \hat{c}_{cy}^{\,RNB}\,\mathbf{j} \qquad (8)$$
where $\hat{c}_{cx}^{\,RNB} = \dfrac{x_{RNB}-x_{c1}}{\sqrt{(x_{RNB}-x_{c1})^2 + (y_{RNB}-y_{c1})^2}}$ and $\hat{c}_{cy}^{\,RNB} = \dfrac{y_{RNB}-y_{c1}}{\sqrt{(x_{RNB}-x_{c1})^2 + (y_{RNB}-y_{c1})^2}}$.
Therefore, the normal to the light plane ∏CP1P2 can be calculated using Equation (6):
$$\mathbf{n}_{cx} = \begin{bmatrix} x_{c1}-x_{oc} \\ y_{c1}-y_{oc} \\ f_c \end{bmatrix} \times \begin{bmatrix} \hat{c}_{cx}^{\,RNB} \\ \hat{c}_{cy}^{\,RNB} \\ 0 \end{bmatrix} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_{c1}-x_{oc} & y_{c1}-y_{oc} & f_c \\ \hat{c}_{cx}^{\,RNB} & \hat{c}_{cy}^{\,RNB} & 0 \end{vmatrix} = \begin{bmatrix} -\hat{c}_{cy}^{\,RNB} f_c \\ \hat{c}_{cx}^{\,RNB} f_c \\ (x_{c1}-x_{oc})\hat{c}_{cy}^{\,RNB} - (y_{c1}-y_{oc})\hat{c}_{cx}^{\,RNB} \end{bmatrix} \qquad (9)$$
Similarly, if the unit vector is calculated between Pc1 and down-side neighborhood point, Pc3, it can be computed as:
$$\hat{c}_c = \frac{(x_{c3}-x_{c1})\,\mathbf{i} + (y_{c3}-y_{c1})\,\mathbf{j}}{\sqrt{(x_{c3}-x_{c1})^2 + (y_{c3}-y_{c1})^2}} = \hat{c}_{cx}\,\mathbf{i} + \hat{c}_{cy}\,\mathbf{j} \qquad (10)$$
where $\hat{c}_{cx} = \dfrac{x_{c3}-x_{c1}}{\sqrt{(x_{c3}-x_{c1})^2 + (y_{c3}-y_{c1})^2}}$ and $\hat{c}_{cy} = \dfrac{y_{c3}-y_{c1}}{\sqrt{(x_{c3}-x_{c1})^2 + (y_{c3}-y_{c1})^2}}$.
Since the unit vector is calculated between Pc1 and the down-side neighborhood point Pc3, (xc3, yc3) can be written as (xDNB, yDNB), while the initial point (xc1, yc1) is the point where the surface normal is computed. Therefore, Equation (10) becomes:
$$\hat{c}_c^{\,DNB} = \frac{(x_{DNB}-x_{c1})\,\mathbf{i} + (y_{DNB}-y_{c1})\,\mathbf{j}}{\sqrt{(x_{DNB}-x_{c1})^2 + (y_{DNB}-y_{c1})^2}} = \hat{c}_{cx}^{\,DNB}\,\mathbf{i} + \hat{c}_{cy}^{\,DNB}\,\mathbf{j} \qquad (11)$$
where $\hat{c}_{cx}^{\,DNB} = \dfrac{x_{DNB}-x_{c1}}{\sqrt{(x_{DNB}-x_{c1})^2 + (y_{DNB}-y_{c1})^2}}$ and $\hat{c}_{cy}^{\,DNB} = \dfrac{y_{DNB}-y_{c1}}{\sqrt{(x_{DNB}-x_{c1})^2 + (y_{DNB}-y_{c1})^2}}$.
Therefore, the normal to the light plane ∏CP1P3 can be calculated using Equation (6):
$$\mathbf{n}_{cy} = \begin{bmatrix} x_{c1}-x_{oc} \\ y_{c1}-y_{oc} \\ f_c \end{bmatrix} \times \begin{bmatrix} \hat{c}_{cx}^{\,DNB} \\ \hat{c}_{cy}^{\,DNB} \\ 0 \end{bmatrix} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_{c1}-x_{oc} & y_{c1}-y_{oc} & f_c \\ \hat{c}_{cx}^{\,DNB} & \hat{c}_{cy}^{\,DNB} & 0 \end{vmatrix} = \begin{bmatrix} -\hat{c}_{cy}^{\,DNB} f_c \\ \hat{c}_{cx}^{\,DNB} f_c \\ (x_{c1}-x_{oc})\hat{c}_{cy}^{\,DNB} - (y_{c1}-y_{oc})\hat{c}_{cx}^{\,DNB} \end{bmatrix} \qquad (12)$$
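The camera-side computation of Equations (6)–(12) can be summarized in a short sketch (a simplified illustration with hypothetical coordinates; the same routine yields ncx for the right-side neighbor and ncy for the down-side neighbor):

```python
import numpy as np

def camera_plane_normal(p_c1, p_neighbour, f_c, principal_point):
    """Normal of the camera-side light plane through Pc1 and a neighbouring centroid.

    p_c1, p_neighbour : image coordinates of the current centroid and of its
                        right-side (RNB) or down-side (DNB) neighbour.
    f_c               : camera focal length (pixels).
    principal_point   : (x_oc, y_oc) midpoint of the image plane.
    Evaluates Equations (9) and (12) via the cross product of Equation (6).
    """
    x_c1, y_c1 = p_c1
    x_n, y_n = p_neighbour
    x_oc, y_oc = principal_point
    d = np.array([x_n - x_c1, y_n - y_c1], dtype=float)
    c_hat = d / np.linalg.norm(d)                      # unit vector (Eq. 8 or 11)
    pos = np.array([x_c1 - x_oc, y_c1 - y_oc, f_c], dtype=float)
    return np.cross(pos, [c_hat[0], c_hat[1], 0.0])    # n_cx or n_cy

# n_cx from the right-side neighbour, n_cy from the down-side neighbour:
n_cx = camera_plane_normal((812.0, 640.0), (830.0, 642.0), f_c=2100.0, principal_point=(800.0, 600.0))
n_cy = camera_plane_normal((812.0, 640.0), (813.0, 661.0), f_c=2100.0, principal_point=(800.0, 600.0))
print(n_cx, n_cy)
```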

Computation of Surface Tangents

The two light planes ∏PP1P2 (with normal npx), which passes through the projected pattern plane, and ∏CP1P2 (with normal ncx), which passes through the camera image plane, intersect on the 3D object surface along the arc formed between P1 and P2, the centroids of the current symbol and of the right-side neighborhood symbol in a horizontal location. These light planes are called the x-planes of the projector and camera sides. Similarly, the two light planes ∏PP1P3 (with normal npy), which passes through the projected pattern plane, and ∏CP1P3 (with normal ncy), which passes through the camera image plane, intersect on the 3D object surface along the arc between P1 and P3, the centroids of the current symbol and of the down-side neighborhood symbol in a vertical location. These light planes are called the y-planes of the projector and camera sides. After accommodating the translation and rotation on both the camera and the projector sides, the surface tangents can be calculated. The vectors ncx, ncy, npx, and npy, after rotation and translation, take the following form:
$$T_c R_c\,\mathbf{n}_{cx} = \begin{bmatrix} n_{cx1} \\ n_{cx2} \\ n_{cx3} \end{bmatrix}, \quad T_c R_c\,\mathbf{n}_{cy} = \begin{bmatrix} n_{cy1} \\ n_{cy2} \\ n_{cy3} \end{bmatrix} \qquad (13)$$
$$T_p R_p\,\mathbf{n}_{px} = \begin{bmatrix} n_{px1} \\ n_{px2} \\ n_{px3} \end{bmatrix}, \quad T_p R_p\,\mathbf{n}_{py} = \begin{bmatrix} n_{py1} \\ n_{py2} \\ n_{py3} \end{bmatrix} \qquad (14)$$
where Rp and Rc are the 3 × 3 rotation matrices and Tp and Tc are the 3 × 1 translation vectors with respect to the world (3D object) coordinates for the projector and camera, respectively. The rotation and translation are known as the external or extrinsic calibration parameters of the camera and projector.
The tangent along the X-direction on the object surface will be the vector cross-product of the normals of the two corresponding intersecting x-planes:
$$\mathbf{t}_x = (T_c R_c\,\mathbf{n}_{cx}) \times (T_p R_p\,\mathbf{n}_{px}) \qquad (15)$$
The tangent along the Y-direction on the object surface will be the vector cross-product of the normals of the two corresponding intersecting y-planes:
$$\mathbf{t}_y = (T_c R_c\,\mathbf{n}_{cy}) \times (T_p R_p\,\mathbf{n}_{py}) \qquad (16)$$
The tangents on the object surface, tx and ty, will then be calculated as follows:
$$\mathbf{t}_x = \begin{bmatrix} n_{cx1} \\ n_{cx2} \\ n_{cx3} \end{bmatrix} \times \begin{bmatrix} n_{px1} \\ n_{px2} \\ n_{px3} \end{bmatrix} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ n_{cx1} & n_{cx2} & n_{cx3} \\ n_{px1} & n_{px2} & n_{px3} \end{vmatrix} = \begin{bmatrix} n_{cx2}n_{px3} - n_{cx3}n_{px2} \\ n_{cx3}n_{px1} - n_{cx1}n_{px3} \\ n_{cx1}n_{px2} - n_{cx2}n_{px1} \end{bmatrix} = \begin{bmatrix} t_{x1} \\ t_{x2} \\ t_{x3} \end{bmatrix} \qquad (17)$$
$$\mathbf{t}_y = \begin{bmatrix} n_{cy1} \\ n_{cy2} \\ n_{cy3} \end{bmatrix} \times \begin{bmatrix} n_{py1} \\ n_{py2} \\ n_{py3} \end{bmatrix} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ n_{cy1} & n_{cy2} & n_{cy3} \\ n_{py1} & n_{py2} & n_{py3} \end{vmatrix} = \begin{bmatrix} n_{cy2}n_{py3} - n_{cy3}n_{py2} \\ n_{cy3}n_{py1} - n_{cy1}n_{py3} \\ n_{cy1}n_{py2} - n_{cy2}n_{py1} \end{bmatrix} = \begin{bmatrix} t_{y1} \\ t_{y2} \\ t_{y3} \end{bmatrix} \qquad (18)$$
The surface normal will be the vector cross-product of the two tangent vectors and will be computed as:
$$\mathbf{n}_{X,Y,Z} = \mathbf{t}_x \times \mathbf{t}_y = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ t_{x1} & t_{x2} & t_{x3} \\ t_{y1} & t_{y2} & t_{y3} \end{vmatrix} = \begin{bmatrix} t_{x2}t_{y3} - t_{x3}t_{y2} \\ t_{x3}t_{y1} - t_{x1}t_{y3} \\ t_{x1}t_{y2} - t_{x2}t_{y1} \end{bmatrix} = \begin{bmatrix} n_x \\ n_y \\ n_z \end{bmatrix} \qquad (19)$$
The computation of surface normals using Equation (19) is a fully deterministic process that requires only the image and projection information local to a particular point. Unlike traditional methods, such as shape from shading or shape from texture, it does not require assumptions about the relationship between the orientations of neighborhood points. The surface normals are computed from the finitely small light planes, and their values are normalized as unit vectors as follows:
$$\hat{\mathbf{n}}_{X,Y,Z} = \frac{n_x\,\mathbf{i} + n_y\,\mathbf{j} + n_z\,\mathbf{k}}{\sqrt{n_x^2 + n_y^2 + n_z^2}} \qquad (20)$$
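Given the four light-plane normals already expressed in world coordinates through the extrinsic parameters of Equations (13) and (14), the tangents and the normalized surface normal of Equations (15)–(20) reduce to three cross products, as in the following sketch:

```python
import numpy as np

def surface_normal(n_cx, n_cy, n_px, n_py):
    """Unit surface normal from the four light-plane normals (Equations 15-20).

    n_cx, n_cy : camera-side plane normals, already expressed in world
                 coordinates through the extrinsic parameters (Eq. 13).
    n_px, n_py : projector-side plane normals in world coordinates (Eq. 14).
    """
    t_x = np.cross(n_cx, n_px)          # tangent along the X-direction (Eq. 17)
    t_y = np.cross(n_cy, n_py)          # tangent along the Y-direction (Eq. 18)
    n = np.cross(t_x, t_y)              # surface normal (Eq. 19)
    return n / np.linalg.norm(n)        # normalization as in Equation (20)

# Example with arbitrary (hypothetical) plane normals:
print(surface_normal([0.0, 1.0, 0.1], [-1.0, 0.0, 0.2], [0.0, 1.0, -0.1], [-1.0, 0.0, -0.2]))
```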

2.3. Computation of 3D World Coordinates [43,44,45,46]

The projective transformation from 3D world coordinates to the camera and projector coordinates can be written as:
$$\begin{bmatrix} X_C \\ Y_C \\ Z_C \\ 1 \end{bmatrix} = \begin{bmatrix} R_c & T_c \\ 0 & 1 \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} = M_c \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} \qquad (21)$$
$$\begin{bmatrix} X_P \\ Y_P \\ Z_P \\ 1 \end{bmatrix} = \begin{bmatrix} R_p & T_p \\ 0 & 1 \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} = M_p \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} \qquad (22)$$
After including the internal or intrinsic camera parameters, the following transformation will be obtained:
$$Z_c \begin{bmatrix} u_c \\ v_c \\ 1 \end{bmatrix} = \begin{bmatrix} f_c & 0 & x_{oc} \\ 0 & f_c & y_{oc} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} R_c & T_c \\ 0 & 1 \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} \qquad (23)$$
$$Z_p \begin{bmatrix} u_p \\ v_p \\ 1 \end{bmatrix} = \begin{bmatrix} f_p & 0 & x_{op} \\ 0 & f_p & y_{op} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} R_p & T_p \\ 0 & 1 \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} \qquad (24)$$
The camera and projector coordinates are transformed into a 2D image and pattern plane (un, vn) with the following relationship:
$$Z \begin{bmatrix} u_n \\ v_n \\ 1 \end{bmatrix} = \begin{bmatrix} X \\ Y \\ Z \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} X \\ Y \\ Z \\ 1 \end{bmatrix} \qquad (25)$$
The camera and projector lens distortions, including radial distortion and tangential distortion, can be defined as:
$$m_{cd} = \begin{bmatrix} u_{cd} \\ v_{cd} \end{bmatrix} = \left(1 + k_{c1} r_c^2 + k_{c2} r_c^4\right) \begin{bmatrix} u_{cn} \\ v_{cn} \end{bmatrix} + d_{xc} \qquad (26)$$
$$m_{pd} = \begin{bmatrix} u_{pd} \\ v_{pd} \end{bmatrix} = \left(1 + k_{p1} r_p^2 + k_{p2} r_p^4\right) \begin{bmatrix} u_{pn} \\ v_{pn} \end{bmatrix} + d_{xp} \qquad (27)$$
where (ucd, vcd) and (upd, vpd) are the camera and projector digital image coordinates after accommodating the distortion effects, and (ucn, vcn) and (upn, vpn) are the coordinates of the camera and projector image planes after transformation from 3D to 2D. Here, rc² = ucn² + vcn² and rp² = upn² + vpn²; kc1, kc2 and kp1, kp2 are the radial distortion parameters of the camera and projector. The tangential distortions for the camera and projector are defined as:
$$d_{xc} = \begin{bmatrix} 2 p_{1c} u_{cn} v_{cn} + p_{2c}\left(r_c^2 + 2 u_{cn}^2\right) \\ p_{1c}\left(r_c^2 + 2 v_{cn}^2\right) + 2 p_{2c} u_{cn} v_{cn} \end{bmatrix} \qquad (28)$$
$$d_{xp} = \begin{bmatrix} 2 p_{1p} u_{pn} v_{pn} + p_{2p}\left(r_p^2 + 2 u_{pn}^2\right) \\ p_{1p}\left(r_p^2 + 2 v_{pn}^2\right) + 2 p_{2p} u_{pn} v_{pn} \end{bmatrix} \qquad (29)$$
where p1c, p2c and p1p, p2p are the tangential distortion parameters of the camera and projector, respectively.
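A minimal sketch of the distortion model in Equations (26)–(29), applied to normalized coordinates, is given below; the coefficient values in the example are hypothetical placeholders:

```python
def distort(u_n, v_n, k1, k2, p1, p2):
    """Apply the radial (k1, k2) and tangential (p1, p2) distortion model
    of Equations (26)-(29) to normalized image coordinates (u_n, v_n)."""
    r2 = u_n ** 2 + v_n ** 2
    radial = 1.0 + k1 * r2 + k2 * r2 ** 2
    dx = 2.0 * p1 * u_n * v_n + p2 * (r2 + 2.0 * u_n ** 2)   # tangential, x term
    dy = p1 * (r2 + 2.0 * v_n ** 2) + 2.0 * p2 * u_n * v_n   # tangential, y term
    return radial * u_n + dx, radial * v_n + dy

# Hypothetical coefficients, for illustration only:
print(distort(0.12, -0.05, k1=-0.21, k2=0.04, p1=1e-3, p2=-5e-4))
```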
The complete transformation from 3D world coordinates to the digital image and pattern after accommodation of the camera’s extrinsic and intrinsic parameters and lens distortion for the camera and projector will be:
$$Z_p \begin{bmatrix} u_p \\ v_p \\ 1 \end{bmatrix} = m_{pd} \begin{bmatrix} f_p & 0 & x_{op} \\ 0 & f_p & y_{op} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} R_p & T_p \\ 0 & 1 \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} = \begin{bmatrix} n_{11} & n_{12} & n_{13} & n_{14} \\ n_{21} & n_{22} & n_{23} & n_{24} \\ n_{31} & n_{32} & n_{33} & n_{34} \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} \qquad (30)$$
$$Z_c \begin{bmatrix} u_c \\ v_c \\ 1 \end{bmatrix} = m_{cd} \begin{bmatrix} f_c & 0 & x_{oc} \\ 0 & f_c & y_{oc} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} R_c & T_c \\ 0 & 1 \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} = \begin{bmatrix} m_{11} & m_{12} & m_{13} & m_{14} \\ m_{21} & m_{22} & m_{23} & m_{24} \\ m_{31} & m_{32} & m_{33} & m_{34} \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix} \qquad (31)$$
Eliminating Zp and Zc using the plane line triangulation method [47], the 3D world coordinates can be calculated as:
$$\begin{bmatrix} m_{31}u_c - m_{11} & m_{32}u_c - m_{12} & m_{33}u_c - m_{13} \\ m_{31}v_c - m_{21} & m_{32}v_c - m_{22} & m_{33}v_c - m_{23} \\ n_{31}u_p - n_{11} & n_{32}u_p - n_{12} & n_{33}u_p - n_{13} \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \end{bmatrix} = \begin{bmatrix} m_{14} - m_{34}u_c \\ m_{24} - m_{34}v_c \\ n_{14} - n_{34}u_p \end{bmatrix} \qquad (32)$$
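Given the 3 × 4 matrices of Equations (30) and (31), Equation (32) can be solved per matched point with a standard linear solver, as in the following sketch (matrix and variable names follow the notation above):

```python
import numpy as np

def triangulate(M, N, u_c, v_c, u_p):
    """Solve Equation (32) for the world point (Xw, Yw, Zw).

    M, N      : 3x4 camera and projector transformation matrices (Eqs. 30-31).
    u_c, v_c  : digital image coordinates of the matched centroid.
    u_p       : corresponding pattern-plane coordinate of the decoded symbol.
    """
    A = np.array([
        [M[2, 0] * u_c - M[0, 0], M[2, 1] * u_c - M[0, 1], M[2, 2] * u_c - M[0, 2]],
        [M[2, 0] * v_c - M[1, 0], M[2, 1] * v_c - M[1, 1], M[2, 2] * v_c - M[1, 2]],
        [N[2, 0] * u_p - N[0, 0], N[2, 1] * u_p - N[0, 1], N[2, 2] * u_p - N[0, 2]],
    ])
    b = np.array([
        M[0, 3] - M[2, 3] * u_c,
        M[1, 3] - M[2, 3] * v_c,
        N[0, 3] - N[2, 3] * u_p,
    ])
    return np.linalg.solve(A, b)   # use np.linalg.lstsq instead if A is ill-conditioned
```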

2.4. Decoding of Patterns

2.4.1. Preprocessing, Segmentation, and Labeling

Preprocessing is carried out as described in our previous works [14,42]: thresholding and segmentation are performed using the Otsu method [48], and segmentation and labeling are carried out by applying the algorithm specified by Haralick [49].
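A minimal sketch of these preprocessing steps, using scikit-image in place of the original implementation of [14,42], is shown below; parameter choices are illustrative only:

```python
import numpy as np
from skimage.filters import threshold_otsu, median
from skimage.measure import label, regionprops

def preprocess_and_label(image: np.ndarray):
    """Threshold a grayscale capture with Otsu's method and label the symbol blobs."""
    smoothed = median(image)                      # light noise filtering
    binary = smoothed > threshold_otsu(smoothed)  # Otsu global threshold
    labels = label(binary, connectivity=2)        # connected-component labeling
    return labels, regionprops(labels)            # per-blob shape descriptors
```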

2.4.2. Decoding, Classification, and Computation of Parameters

The decoding and classification of the symbols are carried out using the techniques described in our previous works [14,42], using area functions and shape description parameters. The only difference in this work is that we use four symbols in the pattern, compared to the three symbols and stripes used previously. The four symbols we employed in the experiment are butterfly-shaped diagonally arranged squares, square-shaped symbols, horizontal strips, and plus-shaped (+) symbols, as shown in Figure 5.
The square-shaped and plus (+)-shaped symbols are differentiated from the horizontal strip and the diagonally arranged squares based on area, eccentricity ratio, and aspect ratio. The square has the largest area, and the plus sign also has a greater area than the horizontal strips and diagonally arranged squares. The aspect ratio and eccentricity of the square and plus sign are lower, while the strip and diagonally arranged squares have higher values. The squares are then differentiated from the plus sign based on the solidity and rectangularity ratios: the square-shaped symbols have higher solidity and rectangularity ratios, whereas the plus sign has lower values. The horizontal strip and diagonally arranged squares are then differentiated based on their orientations, as horizontal strips have an orientation around zero degrees while diagonally arranged squares have an orientation around forty-five degrees.
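The decision cascade described above can be sketched as follows. The descriptor thresholds are illustrative placeholders rather than the calibrated values of [14,42], and the orientation is converted to an angle measured from the horizontal axis to match the convention used here:

```python
import numpy as np

def classify_symbol(props) -> str:
    """Rule-based classification of one labeled blob using shape descriptors.

    `props` is a skimage.measure RegionProps object; all thresholds are
    illustrative placeholders, not the calibrated values of [14,42].
    """
    minr, minc, maxr, maxc = props.bbox
    aspect = (maxc - minc) / max(maxr - minr, 1)
    rectangularity = props.area / ((maxr - minr) * (maxc - minc))

    if props.eccentricity < 0.6 and aspect < 1.5:
        # compact symbols: square vs. plus, separated by solidity/rectangularity
        return "square" if props.solidity > 0.95 and rectangularity > 0.9 else "plus"

    # elongated symbols: angle of the major axis measured from the horizontal axis
    angle_from_horizontal = 90.0 - abs(np.degrees(props.orientation))
    return "horizontal_strip" if angle_from_horizontal < 20.0 else "diagonal_squares"
```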

2.4.3. Calibration

Before applying the projection patterns to the surface to be measured, the projector and the camera must be calibrated by using any of the techniques available in [50,51,52,53] by computing the intrinsic and extrinsic parameters.

2.5. Experiment and Devices

2.5.1. Camera and Projector Devices

The experimental setup comprised a digital camera (DH-HV2051UC, Daheng Imaging, Beijing, China) and a DLP projector (DELL M110, Dell Technologies, Beijing, China). The camera has a pixel resolution of 1600 × 1200. It is a progressive-scan CMOS device with an 8-bit pixel depth and 10-bit analog-to-digital conversion accuracy. The pixel size is 4.2 µm × 4.2 µm, and it can acquire ten frames per second. The projector has a pixel resolution of 800 × 1280, with a projection or throw ratio of 1.5:1 and a contrast ratio of 10,000:1 (typical, full on/off).

2.5.2. Target Surfaces

We experimented on three surfaces: (1) a simple plane surface approximately 800 mm wide and 600 mm long, (2) a cylindrical surface with a radius of 150 mm and a height of 406 mm, and (3) a sculpture with a height of 223 mm, a width of 190 mm, and a depth of 227 mm. The standard deviation of the cylindrical surface is equal to its radius of 150 mm, and the estimated standard deviation of the textured surface, i.e., the sculpture, is approximately 200 to 225 mm. The surfaces to be measured are white to avoid the influence of spatially varying reflectivity.

2.5.3. Experiment Setup

Our method was validated through a real experiment, and we evaluated the system's performance. The surface to be measured was placed 110 cm from the projector and camera, whereas the camera and projector were separated by 18 cm, with the target surface positioned midway between them. The projector and camera were tilted at 85.3 degrees to project the pattern and record the observation, and the angle of incidence between the projector and camera was 9.4 degrees. The ambient light was strictly controlled during the recording of images so that errors could be minimized.

Pattern Employed in the Experiment

We tested and validated our method using five projection patterns with resolutions of 8 × 8, 10 × 10, 12 × 12, 14 × 14, and 16 × 16 pixels. The five projection patterns differed only in resolution and in their corresponding M-arrays. Table 1 shows that all the M-arrays had four symbols and a 3 × 3 window feature, the only difference being their dimensions. The 8 × 8 resolution projection pattern employed a 90 × 144 M-array, the 10 × 10 pattern a 75 × 117 M-array, the 12 × 12 pattern a 60 × 93 M-array, the 14 × 14 pattern a 51 × 81 M-array, and the 16 × 16 pattern a 45 × 72 M-array. For the 8 × 8 and 10 × 10 resolutions, 1-pixel spacing between the symbols was used, whereas for the 12 × 12, 14 × 14, and 16 × 16 resolutions, 2-pixel spacing was used. Figure 5 shows the texture of the five projection patterns used in the experiment.
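For reference, the following sketch assembles a projection pattern from an M-array and a symbol library under the stated sizes and spacings; the M-array contents and symbol bitmaps are placeholders, and tiles that would exceed the 800 × 1280 canvas are simply skipped:

```python
import numpy as np

def build_pattern(m_array, symbols, tile, spacing, canvas=(800, 1280)):
    """Tile symbol bitmaps over the projector canvas according to the M-array.

    m_array : 2D integer array over {0..3} selecting one of four symbols.
    symbols : dict mapping symbol index -> binary bitmap of size tile x tile.
    tile    : symbol size in pixels (8, 10, 12, 14, or 16).
    spacing : gap between symbols in pixels (1 or 2 in the experiments).
    """
    pattern = np.zeros(canvas, dtype=np.uint8)
    step = tile + spacing
    for r in range(m_array.shape[0]):
        for c in range(m_array.shape[1]):
            y, x = r * step, c * step
            if y + tile <= canvas[0] and x + tile <= canvas[1]:
                pattern[y:y + tile, x:x + tile] = symbols[int(m_array[r, c])]
    return pattern
```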

3. Results

3.1. Comparison of Measured Resolution

The system is designed to operate within a depth range of 40 to 250 cm. Table 2 compares the XY-plane resolution of the projector and the corresponding area covered for depths of 40 to 250 cm on a flat surface. In the experiment, the objects or surfaces were 110 cm from the projector and camera. The XY-plane projector resolutions were 5.2 mm for 8 × 8, 6.3 mm for 10 × 10, 8.0 mm for 12 × 12, 9.2 mm for 14 × 14, and 10.3 mm for 16 × 16. This work achieved the same resolution as our previous work [14] (see Table 2). We compared our technique with position-based and orientation-based methodologies. The position-based methods were Zhou (2023) [16], Yin (2021) [19], and Nguyen (2020) [7], whereas Song (2010) [40], Davies (1998) [37], and our proposed method determine both position and orientation, while Winkelbach (2001, 2002) [38,39] used the projection only to determine the orientation. Despite achieving the same resolution as in our previous work [14], our method surpasses the other approaches in terms of resolution, and the data presented in Table 2 confirm this assertion. More precisely, we conducted measurements with a resolution of 5.2 mm using an 8 × 8 pattern at a depth of 110 cm. In contrast, the earlier position-based methods, such as that of Zhou [16], which used a large-sized M-array, achieved 11.8 mm; Yin [19], who employed speckle and fringe projection, achieved 11.3 mm; and Nguyen [7], who used a fringe-based method, achieved 18.2 mm. Among the earlier orientation-based methods, Song [40], which utilized a pseudo-random sequence of diamond-shaped colored elements, had a resolution of 11.3 mm; Winkelbach [38,39] employed a stripe pattern with a measured resolution of 38.5 mm; and Davies [37], who used color circles as the projection, had a measurement resolution of 25.3 mm.
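The reported XY-plane resolutions are consistent with a simple geometric estimate based on the projector's 1.5:1 throw ratio and 1280-pixel width, assuming one feature cell spans the symbol size plus its spacing; the sketch below reproduces the values at a depth of 110 cm under that assumption:

```python
def lateral_resolution_mm(depth_cm, symbol_px, spacing_px,
                          throw_ratio=1.5, projector_width_px=1280):
    """Estimated XY-plane resolution: footprint of one symbol cell at a given depth.

    Assumes the projected image width equals depth / throw_ratio and that one
    feature cell spans (symbol_px + spacing_px) projector pixels.
    """
    projected_width_mm = depth_cm * 10.0 / throw_ratio
    pixel_pitch_mm = projected_width_mm / projector_width_px
    return (symbol_px + spacing_px) * pixel_pitch_mm

# Reproduces the reported values at 110 cm: ~5.2 mm (8+1 px) up to ~10.3 mm (16+2 px)
for size, gap in [(8, 1), (10, 1), (12, 2), (14, 2), (16, 2)]:
    print(size, round(lateral_resolution_mm(110, size, gap), 1))
```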

3.2. Classification or Decoding of Symbols or Feature Points in a Pattern

We applied the methods described in the preceding section and [14] to classify and decode the patterns used in the experiment. The application of our process to the original projection resulted in the successful decoding of 100% of the pattern symbols, thereby proving the reliability of the applied method. Once we had verified the algorithm’s accuracy on the original pattern, we used the technique for the pattern obtained from the planar surface, circular surface, and sculpture. Table 3 displays the number of symbols or feature points detected, decoded, and classified when the experiment utilized five different patterns on the measured surfaces. Our method successfully interpreted 100% of the symbols on a planar or flat surface for all five resolutions, which further confirms its reliability. In contrast, the previous work [14] achieved a classification of 98% of primitives. This indicates an improvement in the decoding process. Our classification method demonstrates better performance than other approaches. For instance, Petriu [11] was able to decode 59% of primitives, Albitar [12] was able to decode 95%, and Ahsan [14] was able to decode 97% of primitives on circular surfaces and 85% of primitives on a sculpture. We can accurately classify 98.4% of primitives for a 16 × 16 size, 98.7% of feature points on a 14 × 14 size, 99.0% for a 12 × 12 size, 99.3% for a 10 × 10 size, and 99.5% of primitives for an 8 × 8 size when dealing with circular surfaces. Moreover, we can classify 96.2% of primitives for a 16 × 16 size, 97.1% for a 14 × 14 size, 97.6% for a 12 × 12 size, 98.4% for a 10 × 10 size, and 98.9% for an 8 × 8 size when our method is applied to a sculpture. Therefore, smaller symbol sizes exhibit a noticeable and progressive improvement in classification. The classification of smaller symbols improved due to the reduced exposure to distortion caused by surface curvature. Since more shape descriptor parameters are utilized for the classification of symbols, there are no instances of wrong classification or false detection.
Figure 6 displays the parts of decoded patterns for the original projection and their results from captured images when applied to a flat planar surface. Figure 7 illustrates the decoded patterns of the acquired images from cylindrical and sculpture surfaces. To distinguish the symbols from each other, the centroid positions of each symbol type were highlighted using distinct colors. This was performed on all five patterns used in the experiment, i.e., the 8 × 8, 10 × 10, 12 × 12, 14 × 14, and 16 × 16 resolutions. The centroid positions of the square-shaped symbol are indicated with a black star (*). The centroid positions of the butterfly-shaped diagonally arranged squares are marked with a blue star (*). The centroid positions of a horizontal strip are reflected through a green star (*). Similarly, the centroid positions of the plus sign symbol are marked with a pink star (*).

3.3. Point Clouds and Surface Normals of the Measured Objects

After applying the method explained in the previous section, the surface normals and point clouds of the measured surfaces, i.e., cylinder and sculpture, were obtained for the five patterns used in the experiment. The measured resolution for the point clouds was 5.2 mm for an 8 × 8, 6.3 mm for a 10 × 10, 8.0 mm for a 12 × 12, 9.2 mm for a 14 × 14, and 10.3 mm for a 16 × 16 pattern. Figure 8 shows the point clouds obtained from the cylinder and sculpture for the five patterns employed in the experiment. Figure 9 shows the surface normals plot of the measured surfaces for the five projection patterns.
In our previous work [14], we implemented a 16 × 16 resolution, whereas in [42], we implemented two resolutions, 14 × 14 and 16 × 16. In this work, we implemented five resolutions, i.e., 8 × 8, 10 × 10, 12 × 12, 14 × 14, and 16 × 16. We obtained a denser point cloud and more surface normals with the smaller 8 × 8 resolution, and the density of the point cloud decreased as the pattern resolution increased from 8 × 8 to 16 × 16. All the points in the point clouds lay within a depth (Z-axis) range of 1040 mm to 1120 mm.
The points in the point clouds were calculated on the basis of the corresponding matching points in the M-array. We may miss at least two rows or two columns near the boundaries of the measuring surface due to the window feature (3 × 3) of the M-array. Table 4 shows the number of points for which the correspondence has been established with respect to the total number of points decoded for each pattern and surface.
The corresponding points were compared with our previous work [14], which shows an improvement in the process of attaining the corresponding points. In the process of matching corresponding points for the original patterns, the result remained the same for the same resolution, i.e., 16 × 16. As the symbol size or pattern resolution decreased, the percentage of corresponding points increased, reaching 96.4% for 8 × 8 compared to 92.8% for 16 × 16. A smaller symbol size resulted in a greater number of corresponding points, indicating that smaller symbols increase the possibility of matching points in close proximity to the boundaries. This phenomenon was witnessed in all three types of surfaces: plane or flat, cylinder, and sculpture. We have made significant progress in the process of matching points compared to our previous work [14]. In the previous work on a plane surface, we achieved 82.2% matching points; with the same 16 × 16 resolution, we have now achieved 87.6% corresponding points. The number of matching points increases as the resolution decreases, rising to 93.6% for an 8 × 8 resolution. For a cylindrical surface, the matching points increased from the previous 76.1% to 85.5% for a 16 × 16 resolution and rose to 92.4% for an 8 × 8 resolution. For a sculpture, the matching points increased from the previous 67.9% to 74.5% for a 16 × 16 resolution and reached 78.7% for an 8 × 8 resolution.

3.4. Time Durations of the Different Processes

Table 5 displays the time durations of the different processes in the decoding, computation of the surface normals, and computation of the 3D point clouds, and compares these values with those in [14]. The time durations were measured on a mid-range Core i5 computer; each process was run repeatedly and the average time was recorded. The time measurements are specified in milliseconds. All the processes, such as filtering and thresholding, labeling, parameter calculation, classification, correspondence matching, computation of surface normals, and computation of point clouds, show an increase in time with the complexity and texture of the surface and the number of detected primitives. With the decrease in resolution for each type of surface, the number of primitives increases, which increases the time durations. Compared to [14], the preprocessing, labeling, classification, correspondence matching, and point cloud computation times decreased; this may be due to the better specification of the computing machine compared to that used in [14]. However, the time for parameter calculation increased, since four symbols were employed in the pattern compared to three in [14], and more shape descriptor parameters must be calculated for symbol classification. The computation time of the surface normals is a few milliseconds, which signifies that the method is fast and can be applied to real-time applications. For instance, for the sculpture using an 8 × 8 resolution, 1654 surface normals were calculated in 29.4 milliseconds; similarly, for the cylindrical surface using an 8 × 8 resolution, 2517 surface normals were computed in 68.9 milliseconds.

4. Conclusions

We have described a method to find surface orientations through a single-shot, spatially encoded, monochromatic, symbol-based, pixel-level design. Our technique is more robust against image noise and surface color variations due to the monochromatic design. We presented multi-resolution patterns to obtain the dense reconstruction needed for calculating the surface orientation. We calculated orientations through surface normals and position profiles for different resolutions and compared these results quantitatively and qualitatively with previous methods. Our method differs from shape from shading and shape from texture in that it does not necessitate an assumption about the relationship between local surface orientations at neighborhood points; therefore, we introduced a position-agnostic approach to identify the orientation independently. We applied the technique to real objects such as a plane surface, a circular surface, and a complex sculpture. We obtained a higher resolution compared to previous methods and computed the surface normals in a matter of milliseconds. We improved the correspondence matching and the estimation of feature points in a pattern: a higher percentage of feature points was decoded, resulting in an increased number of matching points in the correspondence, and the processing duration was decreased. By employing a monochromatic pattern and a straightforward decoding approach, we obtained a greater number of decoded primitives and correspondence-matched points.

Author Contributions

Conceptualization, A.E., Q.Z., J.L. and U.F.; Methodology, A.E., Q.Z. and J.L.; Software, A.E.; Validation, A.E., J.L. and G.F.; Formal analysis, A.E. and M.B.; Investigation, A.E.; Resources, Q.Z. and Y.L.; Data curation, A.E.; Writing—original draft, A.E.; Writing—review & editing, A.E., J.L. and Y.L.; Visualization, A.E.; Supervision, Q.Z.; Project administration, Q.Z.; Funding acquisition, Q.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data will be made publicly available in a GitHub repository.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Salvi, J.; Fernandez, S.; Pribanic, T.; Llado, X. A State of the Art in Structured Light Patterns for Surface Profilometry. Pattern Recognit. 2010, 43, 2666–2680. [Google Scholar] [CrossRef]
  2. Zhang, S. High-Speed 3D Shape Measurement with Structured Light Methods: A Review. Opt. Lasers Eng. 2018, 106, 119–131. [Google Scholar] [CrossRef]
  3. Webster, J.G.; Bell, T.; Li, B.; Zhang, S. Structured Light Techniques and Applications; Wiley: Hoboken, NJ, USA, 2016; pp. 1–24. [Google Scholar]
  4. Geng, J. Structured-Light 3D Surface Imaging: A Tutorial. Adv. Opt. Photonics 2011, 3, 128–160. [Google Scholar] [CrossRef]
  5. Salvi, J.; Pagès, J.; Batlle, J. Pattern Codification Strategies in Structured Light Systems. Pattern Recognit. 2004, 37, 827–849. [Google Scholar] [CrossRef]
  6. Rusinkiewicz, S.; Hall-Holt, O.; Levoy, M. Real-Time 3D Model Acquisition. ACM Trans. Graph. 2002, 21, 438–446. [Google Scholar] [CrossRef]
  7. Nguyen, H.; Wang, Y.; Wang, Z. Single-Shot 3D Shape Reconstruction Using Structured Light and Deep Convolutional Neural Networks. Sensors 2020, 20, 3718. [Google Scholar] [CrossRef]
  8. Ahsan, E.; QiDan, Z.; Jun, L.; Yong, L.; Muhammad, B. Grid-Indexed Based Three-Dimensional Profilometry. In Coded Optical Imaging; Liang, D.J., Ed.; Springer Nature: Cham, Switzerland, 2023. [Google Scholar]
  9. Wang, Z. A Tutorial on Single-Shot 3D Surface Imaging Techniques. IEEE Signal Process. Mag. 2024, 41, 71–92. [Google Scholar] [CrossRef]
  10. Pagès, J.; Salvi, J.; Collewet, C.; Forest, J. Optimised de Bruijn Patterns for One-Shot Shape Acquisition. Image Vis. Comput. 2005, 23, 707–720. [Google Scholar] [CrossRef]
  11. Petriu, E.M.; Sakr, Z.; Spoelder, H.J.W.; Moica, A. Object Recognition Using Pseudo-Random Color Encoded Structured Light. In Proceedings of the 17th IEEE Instrumentation and Measurement Technology Conference, Baltimore, MD, USA, 1–4 May 2000; IEEE: Piscataway, NJ, USA, 2000; pp. 1237–1241. [Google Scholar]
  12. Albitar, C.; Graebling, P.; Doignon, C. Robust Structured Light Coding for 3D Reconstruction. In Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, Rio De Janeiro, Brazil, 14–21 October 2007; IEEE: Piscataway, NJ, USA, 2007; pp. 1–6. [Google Scholar]
  13. Morano, R.A.; Ozturk, C.; Conn, R.; Dubin, S.; Zietz, S.; Nissanov, J. Structured Light Using Pseudorandom Codes. IEEE Trans. Pattern Anal. Mach. Intell. 1998, 20, 322–327. [Google Scholar] [CrossRef]
  14. Elahi, A.; Lu, J.; Zhu, Q.D.; Yong, L. A Single-Shot, Pixel Encoded 3D Measurement Technique for Structure Light. IEEE Access 2020, 8, 127254–127271. [Google Scholar] [CrossRef]
  15. Lu, J.; Han, J.; Ahsan, E.; Xia, G.; Xu, Q. A Structured Light Vision Measurement with Large Size M-Array for Dynamic Scenes. In Proceedings of the 35th Chinese Control Conference (CCC), Chengdu, China, 27–29 July 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 3834–3839. [Google Scholar]
  16. Zhou, X.; Zhou, C.; Kang, Y.; Zhang, T.; Mou, X. Pattern Encoding of Robust M-Array Driven by Texture Constraints. IEEE Trans. Instrum. Meas. 2023, 72, 5014816. [Google Scholar] [CrossRef]
  17. Maruyama, M.; Abe, S. Range Sensing by Projecting Multiple Slits with Random Cuts. IEEE Trans. Pattern Anal. Mach. Intell. 1993, 15, 647–651. [Google Scholar] [CrossRef]
  18. Ito, M.; Ishii, A. A Three-Level Checkerboard Pattern (TCP) Projection Method for Curved Surface Measurement. Pattern Recognit. 1995, 28, 27–40. [Google Scholar] [CrossRef]
  19. Yin, W.; Hu, Y.; Feng, S.; Huang, L.; Kemao, Q.; Chen, Q.; Zuo, C. Single-Shot 3D Shape Measurement Using an End-to-End Stereo Matching Network for Speckle Projection Profilometry. Opt. Express 2021, 29, 13388. [Google Scholar] [CrossRef]
  20. Woodham, R.J. Photometric Method For Determining Surface Orientation From Multiple Images. Opt. Eng. 1980, 19, 139–144. [Google Scholar] [CrossRef]
  21. Knill, D.C. Surface Orientation from Texture: Ideal Observers, Generic Observers and the Information Content of Texture Cues. Vision Res. 1998, 38, 1655–1682. [Google Scholar] [CrossRef]
  22. Garding, J. Direct Estimation of Shape from Texture. IEEE Trans. Pattern Anal. Mach. Intell. 1993, 15, 1202–1208. [Google Scholar] [CrossRef]
  23. Zhang, R.; Tsai, P.-S.; Cryer, J.E.; Shah, M. Shape-from-Shading: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 1999, 21, 690–706. [Google Scholar] [CrossRef]
  24. Adato, Y.; Vasilyev, Y.; Zickler, T.; Ben-Shahar, O. Shape from Specular Flow. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 2054–2070. [Google Scholar] [CrossRef]
  25. Cheng, Y.; Hu, F.; Gui, L.; Wu, L.; Lang, L. Polarization-Based Method for Object Surface Orientation Information in Passive Millimeter-Wave Imaging. IEEE Photonics J. 2016, 8, 5500112. [Google Scholar] [CrossRef]
  26. Lu, J.; Guo, C.; Fang, Y.; Xia, G.; Wang, W.; Elahi, A. Fast Point Cloud Registration Algorithm Using Multiscale Angle Features. J. Electron. Imaging 2017, 26, 033019. [Google Scholar] [CrossRef]
  27. Nehab, D.; Rusinkiewicz, S.; Davis, J.; Ramamoorthi, R. Efficiently Combining Positions and Normals for Precise 3D Geometry. ACM Trans. Graph. 2005, 24, 536–543. [Google Scholar] [CrossRef]
  28. Fan, R.; Wang, H.; Cai, P.; Liu, M. SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vedaldi, A., Bischof, H., Brox, T., Frahm, J., Eds.; Springer: Cham, Switzerland, 2020; Volume 12375 LNCS, pp. 340–356. ISBN 9783030585761. [Google Scholar]
  29. Grilli, E.; Menna, F.; Remondino, F. A Review of Point Clouds Segmentation and Classification Algorithms. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2017, 42, 339–344. [Google Scholar] [CrossRef]
  30. Campbell, R.J.; Flynn, P.J. A Survey of Free-Form Object Representation and Recognition Techniques. Comput. Vis. Image Underst. 2001, 81, 166–210. [Google Scholar] [CrossRef]
  31. Pomerleau, F.; Colas, F.; Siegwart, R. A Review of Point Cloud Registration Algorithms for Mobile Robotics. Found. Trends® Robot. 2015, 4, 1–104. [Google Scholar] [CrossRef]
  32. Badino, H.; Huber, D.; Park, Y.; Kanade, T. Fast and Accurate Computation of Surface Normals from Range Images. In Proceedings of the IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 3084–3091. [Google Scholar]
  33. Wang, Y.F.; Mitiche, A.; Aggarwal, J.K. Computation of Surface Orientation and Structure of Objects Using Grid Coding. IEEE Trans. Pattern Anal. Mach. Intell. 1987, PAMI-9, 129–137. [Google Scholar] [CrossRef]
  34. Hu, G.; Stockman, G. 3-D Surface Solution Using Structured Light and Constraint Propagation. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 11, 390–402. [Google Scholar] [CrossRef]
  35. Shrikhande, N.; Stockman, G. Surface Orientation from a Projected Grid. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 11, 650–655. [Google Scholar] [CrossRef]
  36. Asada, M.; Ichikawa, H.; Tsuji, S. Determining Surface Orientation by Projecting a Stripe Pattern. IEEE Trans. Pattern Anal. Mach. Intell. 1988, 10, 2–7. [Google Scholar] [CrossRef]
  37. Davies, C.J.; Nixon, M.S. A Hough Transform for Detecting the Location and Orientation of Three-Dimensional Surfaces Via Color Encoded Spots. IEEE Trans. Syst. Man Cybern. B Cybern. 1998, 28, 90–95. [Google Scholar] [CrossRef]
  38. Winkelbach, S.; Wahl, F.M. Shape from 2D Edge Gradients. In Lecture Notes in Computer Science, Proceedings of the 23rd DAGM Symposium, Munich, Germany, 12–14 September 2001; Radig, B., Florczyk, S., Eds.; Springer: Berlin, Germany, 2001; pp. 377–384. [Google Scholar]
  39. Winkelbach, S.; Wahl, F.M. Shape from Single Stripe Pattern Illumination. In Lecture Notes in Computer Science, Proceedings of the 24th DAGM Symposium, Zurich, Switzerland, 16–18 September 2002; Van Gool, L., Ed.; Springer: Berlin, Germany, 2002; pp. 240–247. [Google Scholar]
  40. Song, Z.; Chung, R. Determining Both Surface Position and Orientation in Structured-Light-Based Sensing. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 1770–1780. [Google Scholar] [CrossRef] [PubMed]
  41. Shi, G.; Li, R.; Li, F.; Niu, Y.; Yang, L. Depth Sensing with Coding-Free Pattern Based on Topological Constraint. J. Vis. Commun. Image Represent. 2018, 55, 229–242. [Google Scholar] [CrossRef]
  42. Elahi, A.; Zhu, Q.; Lu, J.; Hammad, Z.; Bilal, M.; Li, Y. Single-Shot, Pixel-Encoded Strip Patterns for High-Resolution 3D Measurement. Photonics 2023, 10, 1212. [Google Scholar] [CrossRef]
  43. Savarese, S. Lecture 2: Camera Models; Stanford University: Stanford, CA, USA, 2015; p. 18. [Google Scholar]
  44. Hata, K.; Savarese, S. CS231A Course Notes 1 Camera Models; Stanford University: Stanford, CA, USA, 2015; p. 16. [Google Scholar]
  45. Collins, R. CSE486, Penn State Lecture 12: Camera Projection; Penn State University: University Park, PA, USA, 2007. [Google Scholar]
  46. Collins, R. CSE486, Penn State Lecture 13: Camera Projection II; Penn State University: University Park, PA, USA, 2020. [Google Scholar]
  47. Meza, J.; Vargas, R.; Romero, L.A.; Zhang, S.; Marrugo, A.G. What Is the Best Triangulation Approach for a Structured Light System? In Proceedings of SPIE Volume 11397, Dimensional Optical Metrology and Inspection for Practical Applications IX, Bellingham, WA, USA, 18 May 2020; SPIE: Bellingham, WA, USA, 2020; p. 113970D. [Google Scholar]
  48. Sezgin, M.; Sankur, B. Survey over Image Thresholding Techniques and Quantitative Performance Evaluation. J. Electron. Imaging 2004, 13, 146–165. [Google Scholar] [CrossRef]
  49. Haralick, R.M.; Shapiro, L.G. Computer and Robot Vision; Addison-Wesley Publishing Company: San Francisco, CA, USA, 1992; Volume 1, ISBN 9780201569438. [Google Scholar]
  50. Xie, Z.; Wang, X.; Chi, S. Simultaneous Calibration of the Intrinsic and Extrinsic Parameters of Structured-Light Sensors. Opt. Lasers Eng. 2014, 58, 9–18. [Google Scholar] [CrossRef]
  51. Nie, L.; Ye, Y.; Song, Z. Method for Calibration Accuracy Improvement of Projector-Camera-Based Structured Light System. Opt. Eng. 2017, 56, 074101. [Google Scholar] [CrossRef]
  52. Huang, B.; Ozdemir, S.; Tang, Y.; Liao, C.; Ling, H. A Single-Shot-Per-Pose Camera-Projector Calibration System for Imperfect Planar Targets. In Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), LMU Munich, Munich, Germany, 16–20 October 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 15–20. [Google Scholar]
  53. Moreno, D.; Taubin, G. Simple, Accurate, and Robust Projector-Camera Calibration. In Proceedings of the 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission, Zurich, Switzerland, 13–15 October 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 464–471. [Google Scholar]
Figure 1. Symbols varying in size from (a) 8 × 8, (b) 10 × 10, (c) 12 × 12, (d) 14 × 14 and (e) 16 × 16 pixels.
Figure 2. The Hamming distance profiles of various M-arrays. Note: Each M-array has four symbols.
Figure 3. Parts of projection patterns for symbol sizes varying from (a) 8 × 8, (b) 10 × 10, (c) 12 × 12, (d) 14 × 14 and (e) 16 × 16 pixels.
Figure 4. Computation of surface normals.
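To make the step illustrated in Figure 4 concrete, the following minimal sketch (Python/NumPy; the function name and variables are illustrative and not taken from the paper) computes a unit surface normal as the normalized cross product of two tangent vectors estimated at a surface point from neighboring light-plane intersections.

```python
import numpy as np

def surface_normal(t1, t2):
    """Unit surface normal from two tangent vectors at a surface point."""
    t1 = np.asarray(t1, dtype=float)
    t2 = np.asarray(t2, dtype=float)
    n = np.cross(t1, t2)               # perpendicular to both tangent directions
    norm = np.linalg.norm(n)
    if norm < 1e-12:
        raise ValueError("tangent vectors are (nearly) parallel")
    return n / norm

# Toy check: tangents along the x- and y-axes give the z-axis as the normal.
print(surface_normal([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]))   # [0. 0. 1.]
```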
Figure 5. The texture of the patterns employed in the experimental work: (a) 8 × 8; (b) 10 × 10; (c) 12 × 12; (d) 14 × 14; (e) 16 × 16 pixels.
Figure 6. Classification of symbols for each resolution: (a) part of the original pattern; (b) part of a plane or flat surface.
Figure 7. Classification of symbols on the measured surfaces for each resolution: (a) cylinder; (b) sculpture.
Figure 8. Point clouds of the measured surfaces for each resolution: (a) cylinder; (b) sculpture.
Figure 9. Surface normals of the measured surfaces for each resolution: (a) cylinder; (b) sculpture.
Table 1. Properties of the M-arrays at each resolution and the number of feature points contributed to the projected pattern.

| Symbol Size | Spacing Between Consecutive Symbols | No. of Symbols Used in M-Array | M-Array Dimensions (m × n) | Average Hamming Distance | Robust Codewords (%) | No. of Feature Points in the Projected Pattern |
|---|---|---|---|---|---|---|
| 8 × 8 | 1 | 4 | 90 × 144 | 6.7517 | 99.8683 | 12,496 |
| 10 × 10 | 1 | 4 | 75 × 117 | 6.7524 | 99.8667 | 8352 |
| 12 × 12 | 2 | 4 | 60 × 93 | 6.7503 | 99.8713 | 5187 |
| 14 × 14 | 2 | 4 | 51 × 81 | 6.7489 | 99.8734 | 4000 |
| 16 × 16 | 2 | 4 | 45 × 72 | 6.7495 | 99.8452 | 3124 |
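The code-quality figures reported in Table 1 (average Hamming distance and the share of robust codewords) can, in principle, be reproduced with a routine like the sketch below (Python/NumPy). The 3 × 3 symbol window, the robustness threshold, and all names are assumptions made for illustration only; the paper's exact window size and robustness criterion are not restated here.

```python
import numpy as np

def codewords(m_array, w=3):
    """Extract every w x w window (codeword) from an M-array of symbol labels."""
    rows, cols = m_array.shape
    return np.array([m_array[r:r + w, c:c + w].ravel()
                     for r in range(rows - w + 1)
                     for c in range(cols - w + 1)])

def hamming_stats(codes, robust_min_dist=2):
    """Average pairwise Hamming distance and the percentage of 'robust'
    codewords, i.e. codewords whose minimum distance to every other
    codeword meets an assumed threshold."""
    n = len(codes)
    diff = codes[:, None, :] != codes[None, :, :]    # pairwise symbol mismatches
    dist = diff.sum(axis=2)                          # Hamming distance matrix
    off_diag = ~np.eye(n, dtype=bool)
    avg = dist[off_diag].mean()
    min_dist = np.where(off_diag, dist, np.inf).min(axis=1)
    robust_pct = 100.0 * (min_dist >= robust_min_dist).mean()
    return avg, robust_pct

rng = np.random.default_rng(0)
toy = rng.integers(0, 4, size=(20, 20))   # toy 4-symbol array, not a true M-array
avg, robust = hamming_stats(codewords(toy))
print(f"average Hamming distance: {avg:.2f}, robust codewords: {robust:.2f}%")
```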
Table 2. Comparison of the measurement resolution (in mm) and the covered area of the proposed patterns with previous methods at various depths. The proposed method, Song [40], and Davies [37] recover position and orientation; Zhou [16], Yin [19], and Nguyen [7] are position-based; Winkelbach [38,39] recovers orientation only.

| Depth z (cm) | Area (cm × cm) | Proposed 8 × 8 | Proposed 10 × 10 | Proposed 12 × 12 | Proposed 14 × 14 | Proposed 16 × 16 | Zhou (2023) [16] | Yin (2021) [19] | Nguyen (2020) [7] | Song (2010) [40] | Winkelbach (2001, 2002) [38,39] | Davies (1998) [37] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 250 | 103.8 × 166 | 11.7 | 14.3 | 18.3 | 20.9 | 23.5 | 26.7 | 25.9 | 41.4 | 25.7 | 87.4 | 57.6 (area reduced to 63.0 × 166) |
| 200 | 83 × 132.8 | 9.4 | 11.5 | 14.6 | 16.7 | 18.8 | 21.3 | 20.7 | 33.1 | 20.6 | 69.9 | 46.1 (area reduced to 50.4 × 132.8) |
| 150 | 62.3 × 99.6 | 7.0 | 8.6 | 11.0 | 12.5 | 14.1 | 16.1 | 15.6 | 24.9 | 15.4 | 52.4 | 34.5 (area reduced to 37.8 × 99.6) |
| 120 | 49.8 × 79.7 | 5.6 | 6.9 | 8.8 | 10.0 | 11.3 | 12.7 | 12.3 | 19.7 | 12.3 | 41.9 | 27.6 (area reduced to 30.2 × 79.7) |
| 110 | 45.7 × 73.0 | 5.2 | 6.3 | 8.0 | 9.2 | 10.3 | 11.8 | 11.3 | 18.2 | 11.3 | 38.5 | 25.3 (area reduced to 27.7 × 73.0) |
| 100 | 41.5 × 66.4 | 4.7 | 5.7 | 7.3 | 8.3 | 9.4 | 10.7 | 10.3 | 16.5 | 10.3 | 35.0 | 23.0 (area reduced to 25.2 × 66.4) |
| 80 | 33.2 × 53.1 | 3.8 | 4.6 | 5.8 | 6.7 | 7.5 | 8.5 | 8.2 | 13.2 | 8.2 | 28.0 | 18.4 (area reduced to 20.1 × 53.1) |
| 60 | 24.9 × 39.8 | 2.8 | 3.4 | 4.4 | 5.0 | 5.6 | 6.4 | 6.2 | 10.0 | 6.2 | 21.0 | 13.8 (area reduced to 15.1 × 39.8) |
| 40 | 16.6 × 26.6 | 1.9 | 2.3 | 2.9 | 3.3 | 3.8 | 4.2 | 4.1 | 7.5 | 4.1 | 14.0 | 9.2 (area reduced to 10.1 × 26.6) |
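A rough consistency check on Table 2, sketched under the assumption that the reported lateral resolution is approximately the covered length divided by the number of symbols along that axis and that both scale linearly with depth (the input figures come from Tables 1 and 2):

```python
# Rough consistency check for Table 2 (assumption: lateral resolution is
# approximately covered length / number of symbol columns, both linear in depth).
def resolution_mm(covered_len_cm, n_symbols):
    return 10.0 * covered_len_cm / n_symbols

# 8 x 8 pattern: 144 symbol columns (Table 1), 73.0 cm covered width at 110 cm depth.
print(round(resolution_mm(73.0, 144), 1))    # ~5.1 mm, close to the reported 5.2 mm

# Linear scaling with depth: 5.2 mm at 110 cm extrapolated to 250 cm.
print(round(5.2 * 250 / 110, 1))             # ~11.8 mm, close to the reported 11.7 mm
```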
Table 3. Detected and decoded primitives for each pattern and surface, with comparison to Ahsan (2020) [14].

| Pattern Type & Depth | Primitives | Original Pattern | Plane | Cylinder | Sculpture |
|---|---|---|---|---|---|
| Pattern 1 (8 × 8 resolution), depth 110 cm | Detected | 12,496 | 4205 | 2739 | 2127 |
| | Decoded | 12,496 | 4205 | 2724 | 2103 |
| | % | 100 | 100 | 99.5 | 98.9 |
| Pattern 2 (10 × 10 resolution), depth 110 cm | Detected | 8352 | 2810 | 1967 | 1449 |
| | Decoded | 8352 | 2810 | 1954 | 1426 |
| | % | 100 | 100 | 99.3 | 98.4 |
| Pattern 3 (12 × 12 resolution), depth 110 cm | Detected | 5187 | 1717 | 1159 | 901 |
| | Decoded | 5187 | 1717 | 1148 | 879 |
| | % | 100 | 100 | 99.0 | 97.6 |
| Pattern 4 (14 × 14 resolution), depth 110 cm | Detected | 4000 | 1281 | 951 | 723 |
| | Decoded | 4000 | 1281 | 939 | 702 |
| | % | 100 | 100 | 98.7 | 97.1 |
| Pattern 5 (16 × 16 resolution), depth 110 cm | Detected | 3124 | 1031 | 749 | 558 |
| | Decoded | 3124 | 1031 | 737 | 537 |
| | % | 100 | 100 | 98.4 | 96.2 |
| Ahsan (2020) [14] (16 × 16 resolution), depth 200 cm | Detected | 3124 | 1650 | 1161 | 689 |
| | Decoded | 3124 | 1617 | 1128 | 585 |
| | % | 100 | 98.0 | 97.1 | 84.9 |
Table 4. Percentage of corresponding points in the M-arrays with respect to decoded primitives.

| Pattern Type & Depth | Primitives | Original Pattern | Plane | Cylinder | Sculpture |
|---|---|---|---|---|---|
| Pattern 1 (8 × 8 resolution), depth 110 cm | Correspondence | 12,040 | 3937 | 2517 | 1654 |
| | Decoded | 12,496 | 4205 | 2724 | 2103 |
| | % | 96.4 | 93.6 | 92.4 | 78.7 |
| Pattern 2 (10 × 10 resolution), depth 110 cm | Correspondence | 7980 | 2596 | 1780 | 1076 |
| | Decoded | 8352 | 2810 | 1954 | 1426 |
| | % | 95.6 | 92.4 | 91.1 | 75.5 |
| Pattern 3 (12 × 12 resolution), depth 110 cm | Correspondence | 4895 | 1550 | 1014 | 662 |
| | Decoded | 5187 | 1717 | 1148 | 879 |
| | % | 94.4 | 90.3 | 88.3 | 75.3 |
| Pattern 4 (14 × 14 resolution), depth 110 cm | Correspondence | 3744 | 1139 | 819 | 521 |
| | Decoded | 4000 | 1281 | 939 | 702 |
| | % | 93.6 | 88.9 | 87.2 | 74.2 |
| Pattern 5 (16 × 16 resolution), depth 110 cm | Correspondence | 2898 | 903 | 631 | 400 |
| | Decoded | 3124 | 1031 | 737 | 537 |
| | % | 92.8 | 87.6 | 85.6 | 74.5 |
| Ahsan (2020) [14] (16 × 16 resolution), depth 200 cm | Correspondence | 2898 | 1329 | 859 | 397 |
| | Decoded | 3124 | 1617 | 1128 | 585 |
| | % | 92.8 | 82.2 | 76.1 | 67.9 |
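The percentages in Tables 3 and 4 appear to be simple ratios of the listed counts. The short sketch below reproduces the cylinder, 8 × 8 entries (counts taken from the tables; one-decimal rounding is assumed, matching the tabulated values).

```python
# Cylinder measured with the 8 x 8 pattern at 110 cm (counts from Tables 3 and 4).
detected, decoded, matched = 2739, 2724, 2517

decode_rate = 100.0 * decoded / detected   # Table 3 entry: 99.5 %
match_rate = 100.0 * matched / decoded     # Table 4 entry: 92.4 %
print(f"decoded: {decode_rate:.1f}%  correspondences: {match_rate:.1f}%")
```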
Table 5. Computation times (in milliseconds) of the different processing stages.

| Surface Type | Method | Resolution | Preprocessing (Filtering + Thresholding) | Labeling | Parameter Calculation | Classification | Correspondence | Rate of Correspondence | Surface Normals | 3D Point Cloud |
|---|---|---|---|---|---|---|---|---|---|---|
| Original Pattern | Ahsan (2020) [14], depth 200 cm | 16 × 16 | 566 | 42 | 587 | 3.3 | 485 | 0.19 | - | - |
| | Proposed method, depth 110 cm | 16 × 16 | 307.6 | 18.9 | 1105.4 | 1.2 | 836.9 | 0.27 | - | - |
| | | 14 × 14 | 322.3 | 21.9 | 1366.4 | 1.4 | 1278.1 | 0.32 | - | - |
| | | 12 × 12 | 342.4 | 26.3 | 1644.6 | 1.6 | 1969.6 | 0.38 | - | - |
| | | 10 × 10 | 352.6 | 34.4 | 2651.8 | 2.6 | 4604.0 | 0.55 | - | - |
| | | 8 × 8 | 363.7 | 59.2 | 4011.4 | 4.4 | 8437.8 | 0.68 | - | - |
| Plane Surface | Ahsan (2020) [14], depth 200 cm | 16 × 16 | 611 | 53 | 365.6 | 2.2 | 480 | 0.3 | - | 24.7 |
| | Proposed method, depth 110 cm | 16 × 16 | 380.9 | 17.7 | 472.6 | 0.52 | 265.3 | 0.26 | 18.5 | 22.4 |
| | | 14 × 14 | 390.4 | 24.0 | 594.1 | 0.63 | 388.9 | 0.30 | 19.3 | 25.6 |
| | | 12 × 12 | 401.3 | 25.3 | 739.9 | 0.75 | 587.0 | 0.34 | 28.5 | 38.2 |
| | | 10 × 10 | 412.3 | 33.2 | 1048.1 | 0.92 | 1403.1 | 0.50 | 69.3 | 89.6 |
| | | 8 × 8 | 422.6 | 36.5 | 1492.5 | 1.4 | 2301.1 | 0.55 | 174.1 | 207.7 |
| Cylinder | Ahsan (2020) [14], depth 200 cm | 16 × 16 | 649 | 41.5 | 361 | 2.7 | 331.1 | 0.29 | - | 24.3 |
| | Proposed method, depth 110 cm | 16 × 16 | 360.8 | 17.8 | 326.1 | 0.25 | 178.7 | 0.25 | 10.6 | 14.4 |
| | | 14 × 14 | 371.0 | 19.1 | 415.8 | 0.31 | 292.5 | 0.31 | 15.0 | 21.2 |
| | | 12 × 12 | 379.6 | 20.8 | 463.4 | 0.38 | 372.5 | 0.33 | 16.4 | 21.8 |
| | | 10 × 10 | 386.7 | 23.2 | 746.8 | 0.62 | 904.9 | 0.46 | 29.3 | 39.4 |
| | | 8 × 8 | 397.3 | 26.9 | 977.6 | 0.92 | 1505.2 | 0.55 | 68.9 | 89.4 |
| Sculpture | Ahsan (2020) [14], depth 200 cm | 16 × 16 | 644 | 38 | 271 | 2.7 | 318 | 0.5 | - | 15.4 |
| | Proposed method, depth 110 cm | 16 × 16 | 372.0 | 12.2 | 260.1 | 0.18 | 196.2 | 0.37 | 7.0 | 10.4 |
| | | 14 × 14 | 384.5 | 16.2 | 307.2 | 0.25 | 321.1 | 0.46 | 7.4 | 11.4 |
| | | 12 × 12 | 393.6 | 17.5 | 326.7 | 0.29 | 474.1 | 0.54 | 11.4 | 16.2 |
| | | 10 × 10 | 401.0 | 18.6 | 524.1 | 0.46 | 934.5 | 0.66 | 19.3 | 26.2 |
| | | 8 × 8 | 411.9 | 20.4 | 764.6 | 0.68 | 2097.8 | 1.00 | 29.4 | 42.1 |
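As a closing worked example, the sculpture / 8 × 8 row of Table 5 can be summarized as follows (figures taken from Tables 4 and 5, times in milliseconds). Interpreting the rate-of-correspondence column as correspondence time per decoded symbol is an inference from the tabulated values, not a definition stated in the tables.

```python
# Sculpture with the 8 x 8 pattern at 110 cm; stage times (ms) from Table 5.
stage_ms = {
    "preprocessing": 411.9,
    "labeling": 20.4,
    "parameter calculation": 764.6,
    "classification": 0.68,
    "correspondence": 2097.8,
    "surface normals": 29.4,
    "3D point cloud": 42.1,
}
decoded_symbols = 2103   # decoded primitives (Tables 3 and 4, sculpture, 8 x 8)
normals = 1654           # correspondences (Table 4); 1654 normals in 29.4 ms

print(f"total pipeline: {sum(stage_ms.values()):.1f} ms")
print(f"correspondence per decoded symbol: "
      f"{stage_ms['correspondence'] / decoded_symbols:.2f} ms")
print(f"normals computed per millisecond: {normals / stage_ms['surface normals']:.0f}")
```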