DRM-Based Colour Photometric Stereo Using Diffuse-Specular Separation for Non-Lambertian Surfaces

Li, Boren; Furukawa, Tomonari

doi:10.3390/jimaging8020040

by Boren Li ¹,† and Tomonari Furukawa ²,*,†,‡

¹ Beijing Institute for General Artificial Intelligence, Beijing 100124, China

² School of Engineering and Applied Science, University of Virginia, Charlottesville, VA 22904, USA

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

‡ Current address: 675 Old Reservoir Rd, Charlottesville, VA 22903, USA.

J. Imaging 2022, 8(2), 40; https://doi.org/10.3390/jimaging8020040

Submission received: 29 November 2021 / Revised: 3 January 2022 / Accepted: 5 January 2022 / Published: 8 February 2022

(This article belongs to the Special Issue Photometric Stereo)

Abstract

This paper presents a photometric stereo (PS) method based on the dichromatic reflectance model (DRM) using colour images. The proposed method estimates surface orientations for surfaces with non-Lambertian reflectance using diffuse-specular separation and consists of two steps. The first step, referred to as diffuse-specular separation, initialises surface orientations in a specular-invariant colour subspace and further separates the diffuse and specular components in the RGB space. In the second step, the surface orientations are refined by first initialising the specular parameters via a log-linear regression problem, which the separation makes possible, and then fitting the DRM using the Levenberg-Marquardt algorithm. Since the initialisations rely on information from diffuse reflection that is free from specularities, the proposed method is robust and feasible with fewer observations. At pixels where dense non-Lambertian reflectances appear, signals from specularities are exploited to refine the surface orientations, and the additionally acquired specular parameters are potentially valuable for further applications, such as digital relighting. The effectiveness of the newly proposed surface normal refinement step was evaluated, and including this step improved the accuracy of the estimated surface orientations by around 30% on average. The proposed method was also proven effective in an experiment using synthetic input images comprising twenty-four different reflectances of dielectric materials. A comparison with nine other PS methods on five representative datasets further proves the validity of the proposed method.

Keywords: photometric stereo; dichromatic reflectance model; diffuse-specular separation; non-Lambertian surfaces

1. Introduction

Photometric stereo (PS) estimates surface orientations using images captured from a fixed viewpoint under various illuminations and is especially powerful in acquiring fine surface details at pixel level [1]. Surface orientation is important in a variety of fields, such as geometry segmentation for three-dimensional (3D) object recognition [2] and digital re-rendering in computer graphics. Surface geometries, which can be obtained by integrating surface orientations, have also proven useful for applications such as industrial quality control and reverse engineering. Due to the strength of PS and the importance of surface orientation acquisition, PS has drawn increasing interest since its debut [3]. However, applying PS to a general real scene remains challenging due to the diverse reflectance properties of different materials that appear non-Lambertian [4]. This has given rise to the need for reliable estimation of surface orientations for a wide range of non-Lambertian reflectance, which essentially requires a proper imaging photometry model characterizing the forward problem and a subsequent PS method that inversely derives surface orientations.

Existing PS methods dealing with non-Lambertian reflectance can be classified into three categories. The first approximates surface reflectance using an analytical bidirectional reflectance distribution function (BRDF) [5] and formulates the estimation of surface orientations as a nonlinear fitting problem. Nayar et al. [6] derived surface orientations using the Torrance-Sparrow BRDF [7] and incorporated extended sources to ensure sufficient information from specularities. Georghiades [8] inverted the same BRDF to simultaneously estimate surface orientations and resolve the generalized bas-relief ambiguity. Goldman et al. [9] assumed that general material reflectance can be represented by a convex combination of fundamental materials characterized by the Ward BRDF [10] and recovered surface orientations, fundamental material BRDFs and weight maps simultaneously for further scene editing. Methods in this category, which exploit information from surface reflectance, are capable of deriving not only surface orientations but also the other parameters in the analytical BRDFs, allowing more functionalities, such as digital relighting and material classification. However, due to the nonlinearity of analytical BRDFs and the larger number of parameters to be estimated, these methods are sensitive to initialisations, numerically unstable under heavily corrupted outliers (e.g., shadows), and inapplicable when the number of observations is limited.

The second category of methods infers surface orientations by adopting the general properties of the BRDF, such as isotropy, monotonicity and reciprocity. Alldrin et al. [11] developed a non-parametric PS method using a bi-variate approximation of the isotropy property. Higo et al. [12] analyzed the general BRDF constraints and employed the BRDF properties of monotonicity, isotropy and visibility to vote for the most probable surface orientations for single-lobed reflectance. Shi et al. [13] proposed a bi-polynomial representation for low-frequency reflectance that was especially well suited to inverse problems such as PS, while Ikehata et al. [14] developed another general isotropic BRDF as a sum of lobes with unknown center directions. Methods in this category capitalize on the most fundamental properties of the BRDF and therefore have the potential to deal with a broader range of reflectance. However, these methods can derive only surface orientations, with limited other functionalities, and require an even larger set of observations compared with the first category. To widen the applicability, recent years have seen non-parametric BRDFs based on machine learning. Santo et al. [15,16] used a deep neural network for the first time, whereas Taniai and Maehara [17] estimated surface normals and BRDFs by unsupervised learning. Ikehata [18] estimated surface normals more directly by deriving the so-called observation maps and using convolutional neural networks. Such data-driven approaches clearly improve the accuracy, as observations directly create the BRDF or estimate surface normals. The use of a significantly large set of observations, which is key to the accuracy, and the expensive training effort using the observations place these approaches outside the scope of this paper.

Methods in the third category assume that non-Lambertian effects appear sparsely among observations and treat them as outliers. A substantial corpus of methods relies on robust statistical techniques for outlier rejection. Wu et al. [19] formulated the PS with outlier rejection as a global rank minimization problem, while Ikehata et al. [20] employed sparse Bayesian regression instead. Barsky et al. [21] initiated another line of PS research using colour images, where they first identified the significance of using the specular colour in the dichromatic reflectance model (DRM) [22] for specularity rejection. Using the cue from known specular colour under the same theoretical foundation, Zickler et al. [23] derived a PS method in a novel two-dimensional (2D) specular invariant colour subspace. The major advantages of methods in the third category are their robustness and their requirement for fewer images, while they are inefficient in the presence of dense non-Lambertian effects.

This paper presents a colour PS method to estimate surface orientations using diffuse-specular separation dealing with non-Lambertian reflectance. The proposed method models the colour imaging photometry using the DRM as the forward problem and inverts it following a two-step procedure: the diffuse-specular separation and the surface normal refinement. The first step, initialising surface orientations using known specular colour similarly to [23], separates the diffuse and specular components in the RGB space and identifies outliers to reject in the UV space. In the second step, the parameters characterizing the specularity are initialised by solving a log-linear regression problem, and the surface orientations are finally refined by fitting the nonlinear DRM using the Levenberg-Marquardt algorithm (LMA). The proposed method robustly initialises surface orientations in a specular-free colour subspace and further fits the nonlinear DRM with a newly derived parameter initialisation strategy. The proposed method preserves the robustness of the third category of methods, while it also benefits from the separated specular component to tackle dense non-Lambertian reflectance as in the first category. Furthermore, DRM parameters besides the surface orientations can be additionally obtained where dense non-Lambertian reflectance appears, making more potential applications feasible, such as digital relighting and material classification.

The rest of the paper is organized as follows. The next section presents the problem formulation of colour PS incorporating the DRM and the relevant PS methods. Section 3 first gives an overview of the flow and original contribution of the proposed colour DRM-based PS method and then elucidates the two steps in the subsequent subsections. Section 4 presents comprehensive evaluations of surface orientation estimation using both synthetic and real images, while the last section summarizes the conclusions and proposes future work.

2. Colour PS Incorporating Dichromatic Reflectance Model

2.1. Generic Colour PS Problem Formulation

Figure 1 shows the schematic diagram of the hardware components of the sensor and the necessary data for the generic colour PS. The hardware components are an RGB camera and a set of directional point sources whose positions are known a priori. Assuming that ambient light is completely blocked, the colour PS requires that the RGB camera captures a colour image of the target surface with one source lit at a time. Given the N colour images illuminated under N different sources, the objective of the colour PS is to identify surface orientations pixel-wise. More specifically, given the colour image irradiance measurements at pixel $(i, j)$, $\{e_{k}^{i, j *}\}_{k \in [1, N]}$, under the unit illumination directions, $\{\bar{l}_{k}^{i, j}\}_{k \in [1, N]}$, estimated from the light positions, the colour PS aims to derive the unit surface normal, $\bar{n}^{i, j}$, by minimizing the sum of the N corresponding squared residuals:

$$\epsilon^{i, j} = \sum_{k = 1}^{N} \left[ (e_{k}^{i, j *})^{\top} w - f(\bar{n}^{i, j}, \bar{l}_{k}^{i, j}) \right]^{2} \to \min_{\bar{n}^{i, j}}, \tag{1}$$

where $w$ is a vector of weights for the colour channels, whereas $f$ is a forward function of $\bar{n}^{i, j}$ having $\bar{l}_{k}^{i, j}$ as parameters, which corresponds to the weighted measurements. Note that $\bar{()}$ and ${()}^{*}$ denote the unit version and the measurement of $()$, respectively, throughout the paper.

2.2. Colour Image Formation Model

Figure 2 shows the coordinate setups that are necessary for the colour PS to model the imaging geometry and the light configuration. The coordinate frames are the 3D camera frame, $\{C\}$, the 2D pixel frame, $\{P\}$, and the world frame, $\{W\}$, whose XY plane is known and referred to as the reference plane. These coordinate frames follow the convention given in [24]. A 3D surface point, ${}^{\{C\}} \tilde{X}$, is projected to a 2D image point, ${}^{\{P\}} x$, in the RGB camera using the perspective projection as:

$${}^{\{P\}} x = \frac{1}{{}^{\{C\}} Z} M_{p} \hat{K} \, {}^{\{C\}} \tilde{X} = \frac{1}{{}^{\{C\}} Z} \begin{bmatrix} m_{x} & 0 & o_{x} \\ 0 & m_{y} & o_{y} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} f & 0 & 0 \\ 0 & f & 0 \\ 0 & 0 & 1 \end{bmatrix} {}^{\{C\}} \tilde{X}, \tag{2}$$

where $m_{x}$ and $m_{y}$ are the numbers of pixels per unit distance along the x and y directions, respectively, $o_{x}$ and $o_{y}$ give the principal point location in $\{P\}$, and $f$ is the effective focal length. Note that ${}^{\{P\}} x = [{}^{\{P\}} x, {}^{\{P\}} y, 1]^{\top}$ and ${}^{\{C\}} \tilde{X} = [{}^{\{C\}} X, {}^{\{C\}} Y, {}^{\{C\}} Z]^{\top}$. ${}^{\{P\}} x$ is rasterized into a regular grid in the camera as:

$$\begin{bmatrix} j \\ i \end{bmatrix} = \operatorname{ceil} \left( \begin{bmatrix} {}^{\{P\}} x + 0.5 \\ {}^{\{P\}} y + 0.5 \end{bmatrix} \right), \tag{3}$$

where $\operatorname{ceil}(\cdot)$ denotes the ceiling operator. $\{W\}$ and $\{C\}$ are related by a 3D rigid transformation: ${}^{\{C\}} \tilde{X} = R_{W} \, {}^{\{W\}} \tilde{X} + t_{W}$.
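As an illustration of Equations (2) and (3), the following minimal sketch projects a camera-frame point to pixel coordinates and rasterises it. The function names and the camera parameters in the usage lines are hypothetical and chosen only for the example; they are not values from the paper.

```python
import math

def project_to_pixel(X_c, f, m_x, m_y, o_x, o_y):
    """Project a camera-frame point (X, Y, Z) to continuous pixel coordinates, as in Equation (2)."""
    X, Y, Z = X_c
    if Z <= 0:
        raise ValueError("point must lie in front of the camera (Z > 0)")
    # M_p * K_hat applied to [X, Y, Z]^T, then divided by the depth Z.
    x_p = (m_x * f * X + o_x * Z) / Z
    y_p = (m_y * f * Y + o_y * Z) / Z
    return x_p, y_p

def rasterise(x_p, y_p):
    """Map continuous pixel coordinates to integer (i, j) indices, as in Equation (3)."""
    j = math.ceil(x_p + 0.5)
    i = math.ceil(y_p + 0.5)
    return i, j

# Hypothetical camera: focal length 8, 200 pixels per unit distance, principal point (320, 240).
x_p, y_p = project_to_pixel((0.0, 0.0, 500.0), f=8.0, m_x=200.0, m_y=200.0, o_x=320.0, o_y=240.0)
i, j = rasterise(x_p, y_p)
```

A point on the optical axis lands on the principal point, and the ceiling in Equation (3) then assigns it to the pixel whose index is one above the integer part of each coordinate.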

Let the kth light position be ${}^{\{C\}} \tilde{S}_{k} = [{}^{\{C\}} X_{k}, {}^{\{C\}} Y_{k}, {}^{\{C\}} Z_{k}]^{\top}$. ${}^{\{C\}} \tilde{S}_{k}$ is expressed in spherical coordinates as:

$$\begin{cases} {}^{\{C\}} X_{k} = {}^{\{C\}} r_{k} \sin {}^{\{C\}} \theta_{k} \cos {}^{\{C\}} \phi_{k}, \\ {}^{\{C\}} Y_{k} = {}^{\{C\}} r_{k} \sin {}^{\{C\}} \theta_{k} \sin {}^{\{C\}} \phi_{k}, \\ {}^{\{C\}} Z_{k} = {}^{\{C\}} r_{k} \cos {}^{\{C\}} \theta_{k}, \end{cases} \tag{4}$$

where ${}^{\{C\}} r_{k}$, ${}^{\{C\}} \theta_{k}$ and ${}^{\{C\}} \phi_{k}$ are the radius, zenith angle and azimuth angle, respectively. The N lights are configured such that they have the same zenith angle and radius, and uniformly distributed azimuth angles around the optical axis, i.e., ${}^{\{C\}} \theta_{k} = \theta$, ${}^{\{C\}} r_{k} = r$ and ${}^{\{C\}} \phi_{k} = \frac{360^{\circ} (k - 1)}{N} + \phi_{0}$.
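The ring configuration of Equation (4) can be generated with a few lines of code. The sketch below is illustrative only: the function name and the values used in the usage line (eight lights, unit radius, 30° zenith angle) are assumptions made for the example.

```python
import math

def ring_light_positions(n_lights, radius, zenith_deg, azimuth0_deg=0.0):
    """Return camera-frame light positions on a ring, following Equation (4)."""
    theta = math.radians(zenith_deg)
    positions = []
    for k in range(1, n_lights + 1):
        # Uniformly spaced azimuth angles, offset by azimuth0_deg (phi_0 in the text).
        phi = math.radians(360.0 * (k - 1) / n_lights + azimuth0_deg)
        positions.append((
            radius * math.sin(theta) * math.cos(phi),
            radius * math.sin(theta) * math.sin(phi),
            radius * math.cos(theta),
        ))
    return positions

lights = ring_light_positions(n_lights=8, radius=1.0, zenith_deg=30.0)
```

All positions share the same Z coordinate, $r \cos \theta$, so the lights lie on a circle centred on the optical axis.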

Having established the imaging geometry and light configuration, the imaging photometry is then modeled using the physically based DRM [22]. The DRM properly characterizes reflectance for dielectric materials [25] and is represented by a linear combination of the diffuse and specular components. Assuming that pixel $(i, j)$ is shadow-free, the image irradiance for colour channel c after light strength normalisation and vignetting correction is written as:

$$e_{c}^{i, j} = e_{d, c}^{i, j} + e_{s, c}^{i, j} = d_{c}^{i, j} f_{d}^{i, j} + s_{c} f_{s}^{i, j}, \tag{5}$$

where $e_{d, c}^{i, j}$ and $e_{s, c}^{i, j}$ are the diffuse and specular components, and $d_{c}^{i, j}$ and $s_{c}$ are the diffuse and specular colours, which are wavelength-dependent and represented by:

$$\begin{cases} d_{c}^{i, j} = \int_{0}^{\infty} \Phi(\lambda) R^{i, j}(\lambda) C_{c}(\lambda) \, d\lambda, \\ s_{c} = \int_{0}^{\infty} \Phi(\lambda) C_{c}(\lambda) \, d\lambda. \end{cases} \tag{6}$$

In the equation, $\lambda$ is the wavelength, $\Phi(\lambda)$ is the source spectral power density (SPD), $R^{i, j}(\lambda)$ is the spectral body reflectance, and $C_{c}(\lambda)$ is the camera spectral sensitivity for the colour channel. $f_{d}^{i, j}$ and $f_{s}^{i, j}$ represent the diffuse and specular geometrical scaling factors that are characterized by the BRDF.

Since the scaling factors are illumination-dependent, the image irradiance Equation (5), generalized to the RGB colour space under the kth illumination, is written as:

$$e_{k}^{i, j} = e_{d, k}^{i, j} + e_{s, k}^{i, j} = \bar{d}_{RGB}^{i, j} f_{d, k}^{i, j} + \bar{s}_{RGB} f_{s, k}^{i, j} = \begin{bmatrix} \bar{d}_{R}^{i, j} \\ \bar{d}_{G}^{i, j} \\ \bar{d}_{B}^{i, j} \end{bmatrix} f_{d, k}^{i, j} + \begin{bmatrix} \bar{s}_{R} \\ \bar{s}_{G} \\ \bar{s}_{B} \end{bmatrix} f_{s, k}^{i, j}, \tag{7}$$

where $\bar{d}_{RGB}^{i, j} = \{d_{c}^{i, j}\}_{c = R, G, B}$ and $\bar{s}_{RGB} = \{s_{c}\}_{c = R, G, B}$ are the unit diffuse and specular colours, respectively. The diffuse geometrical scaling factor is characterized with the kth unit illumination direction $\bar{l}_{k}^{i, j}$ by the Lambertian model as:

$$f_{d, k}^{i, j} = k_{d}^{i, j} (\bar{n}^{i, j})^{\top} \bar{l}_{k}^{i, j}, \tag{8}$$

where $k_{d}^{i, j}$ is the diffuse reflectance factor. Since the surface deviation is small compared with the light-surface distance, $\bar{l}_{k}^{i, j}$ can be approximated by the direction of the vector from the point where ray $(i, j)$ intersects the reference plane to ${}^{\{C\}} \tilde{S}_{k}$. One common choice for the specular geometrical scaling factor, on the other hand, is the Blinn-Phong BRDF [26] because of its computational efficiency and high performance in characterizing a wide range of isotropic reflectance [27]:

$$f_{s, k}^{i, j} = k_{s}^{i, j} \left( (\bar{n}^{i, j})^{\top} \bar{h}_{k}^{i, j} \right)^{\beta^{i, j}}, \tag{9}$$

where $k_{s}^{i, j}$ is the specular reflectance factor, $\bar{h}_{k}^{i, j}$ is the unit half vector, and $\beta^{i, j}$ is the shininess coefficient. The unit half vector is written as:

$$\bar{h}_{k}^{i, j} = \frac{\bar{l}_{k}^{i, j} + \bar{v}^{i, j}}{\| \bar{l}_{k}^{i, j} + \bar{v}^{i, j} \|}, \tag{10}$$

where $\bar{v}^{i, j}$ is the unit viewer direction, determined by:

$$\begin{cases} v^{i, j} = - \left[ \frac{j - o_{x}}{m_{x}}, \frac{i - o_{y}}{m_{y}}, f \right]^{\top}, \\ \bar{v}^{i, j} = v^{i, j} / \| v^{i, j} \|. \end{cases} \tag{11}$$
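To make the forward model concrete, the following sketch evaluates Equations (7) to (10) for one pixel under one light using NumPy. It is an illustration under stated assumptions rather than the authors' implementation: the function names are hypothetical, all vectors are assumed to be expressed in one common frame, and the clamping of negative dot products to zero is added here for safety, whereas the text assumes shadow-free pixels.

```python
import numpy as np

def unit(v):
    """Return v scaled to unit length."""
    v = np.asarray(v, dtype=float)
    return v / np.linalg.norm(v)

def render_drm(n, l, v, d_rgb, s_rgb, k_d, k_s, beta):
    """RGB irradiance of one pixel under one light, Equations (7)-(10)."""
    n, l, v = unit(n), unit(l), unit(v)
    h = unit(l + v)                          # unit half vector, Equation (10)
    f_d = k_d * max(n @ l, 0.0)              # Lambertian factor, Equation (8), clamped (assumption)
    f_s = k_s * max(n @ h, 0.0) ** beta      # Blinn-Phong factor, Equation (9), clamped (assumption)
    return unit(d_rgb) * f_d + unit(s_rgb) * f_s

# Illustrative values only: a red surface under a white source, lit and viewed head-on.
e = render_drm(n=[0, 0, 1], l=[0, 0, 1], v=[0, 0, 1],
               d_rgb=[1, 0, 0], s_rgb=[1, 1, 1], k_d=0.8, k_s=0.5, beta=20.0)
```

With the light, viewer and normal aligned, both dot products equal one, so the result is simply $0.8\,\bar{d}_{RGB} + 0.5\,\bar{s}_{RGB}$.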

2.3. Colour PS Methods Using DRM

Existing colour PS methods using the DRM contain two major variants, representative works of which were developed by Barsky et al. [21] and by Zickler et al. [23]. Barsky’s PS method finds the unit surface normal $\bar{n}^{i, j}$ and the diffuse reflectance factor $k_{d}^{i, j}$ such that the residual between the colour image irradiances weighted by $\bar{d}_{RGB}^{i, j}$ and the diffuse geometrical scaling factor is minimised:

$$\epsilon^{i, j} = \sum_{k = 1}^{N_{b}} \left[ (e_{k}^{i, j *})^{\top} \bar{d}_{RGB}^{i, j} - f_{d, k}^{i, j} \right]^{2} = \sum_{k = 1}^{N_{b}} \left[ (e_{k}^{i, j *})^{\top} \bar{d}_{RGB}^{i, j} - k_{d}^{i, j} (\bar{n}^{i, j})^{\top} \bar{l}_{k}^{i, j} \right]^{2} \to \min_{\bar{n}^{i, j}, k_{d}^{i, j}}, \tag{12}$$

where $N_{b} (\leq N)$ is the number of specular-free and shadow-free images. $N_{b}$ is used instead of N because Barsky’s PS method assumes that non-Lambertian reflectance appears sparsely among observations and should be rejected as outliers. As the first process for outlier rejection, the direct principal component analysis (DPCA) [28] on $[E_{RGB}^{i, j *}]^{\top} [E_{RGB}^{i, j *}]$ is performed to estimate $\bar{d}_{RGB}^{i, j}$, where $[E_{RGB}^{i, j *}] = [e_{1}^{i, j *}, e_{2}^{i, j *}, \dots, e_{N}^{i, j *}]^{\top}$. Given $\bar{s}_{RGB}$ and $\bar{d}_{RGB}^{i, j}$, the method then determines $f_{s, k}^{i, j}$ as:

$$f_{s, k}^{i, j} = \frac{(e_{k}^{i, j *})^{\top} \bar{s}_{RGB} - \left( (e_{k}^{i, j *})^{\top} \bar{d}_{RGB}^{i, j} \right) \left( (\bar{d}_{RGB}^{i, j})^{\top} \bar{s}_{RGB} \right)}{1 - \left( (\bar{d}_{RGB}^{i, j})^{\top} \bar{s}_{RGB} \right)^{2}}. \tag{13}$$

A threshold on $f_{s, k}^{i, j}$ is finally used to determine pixels to be rejected as outliers, and this subsequently determines $N_{b}$.
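Equation (13) follows from projecting Equation (7) onto $\bar{s}_{RGB}$ and $\bar{d}_{RGB}^{i, j}$ and eliminating $f_{d, k}^{i, j}$. The short NumPy sketch below checks this on a synthetic observation; the function name and the numerical values are illustrative assumptions, and both colours are assumed to be unit vectors.

```python
import numpy as np

def specular_factor(e, d_rgb, s_rgb):
    """Solve e = d*f_d + s*f_s for f_s with unit colours d and s, as in Equation (13)."""
    e, d, s = (np.asarray(x, dtype=float) for x in (e, d_rgb, s_rgb))
    c = d @ s                                  # cosine of the angle between the two colours
    if 1.0 - c ** 2 < 1e-12:
        raise ValueError("diffuse and specular colours are parallel; not separable")
    return (e @ s - (e @ d) * c) / (1.0 - c ** 2)

d = np.array([1.0, 0.0, 0.0])                  # unit diffuse colour (illustrative)
s = np.ones(3) / np.sqrt(3.0)                  # unit specular colour of a white source
e = 0.7 * d + 0.2 * s                          # synthetic observation with f_d = 0.7, f_s = 0.2
f_s = specular_factor(e, d, s)
```

The denominator vanishes when the two colours coincide, which is why the expression is only usable when the diffuse and specular colours are sufficiently distinct.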

The objective function (12) of this PS method is no different from that of the conventional Lambertian-based PS method [3]. Given $\{\bar{l}_{k}^{i, j}\}_{k \in [1, M]}$ a priori, $\bar{n}^{i, j}$ together with $k_{d}^{i, j}$, which minimizes the objective function (12), is analytically determined as:

$$\begin{cases} k_{d}^{i, j} = \left\| \left( [L^{i, j}]^{\top} [L^{i, j}] \right)^{- 1} [L^{i, j}]^{\top} [E_{RGB}^{i, j *}] \bar{d}_{RGB}^{i, j} \right\|, \\ \bar{n}^{i, j} = \dfrac{\left( [L^{i, j}]^{\top} [L^{i, j}] \right)^{- 1} [L^{i, j}]^{\top} [E_{RGB}^{i, j *}] \bar{d}_{RGB}^{i, j}}{k_{d}^{i, j}}, \end{cases}$$

where $[L^{i, j}] = [\bar{l}_{1}^{i, j}, \bar{l}_{2}^{i, j}, \dots, \bar{l}_{N_{b}}^{i, j}]^{\top}$ and $\operatorname{rank}([L^{i, j}]) = 3$. While the use of $\bar{s}_{RGB}$ is effective for specularity rejection, the estimation of $\bar{d}_{RGB}^{i, j}$ using DPCA is erroneous due to the existence of specularities. Furthermore, all the specularities are rejected as outliers even though they may contain useful information.

Unlike Barsky’s PS method, which performs PS in the RGB colour space, Zickler’s PS method estimates surface orientations in the SUV colour space, or more precisely the UV colour subspace, as S is not considered:

$$\epsilon^{i, j} = \sum_{k = 1}^{N} \left[ (e_{UV, k}^{i, j *})^{\top} \bar{d}_{UV}^{i, j} - \alpha^{i, j} (\bar{n}^{i, j})^{\top} \bar{l}_{k}^{i, j} \right]^{2} \to \min_{\bar{n}^{i, j}}, \tag{14}$$

where $\alpha^{i, j}$ is a scaling factor. The UV colour subspace includes more observations than the RGB colour space in surface orientation estimation. To operate in the SUV space, the colour image irradiance is transformed from RGB to SUV as:

$$e_{SUV, k}^{i, j} \triangleq R_{s} e_{k}^{i, j} = \bar{d}_{SUV}^{i, j} f_{d, k}^{i, j} + \bar{s}_{SUV} f_{s, k}^{i, j}, \tag{15}$$

where $\bar{d}_{SUV}^{i, j} = [\bar{d}_{U}^{i, j}, \bar{d}_{V}^{i, j}, \bar{d}_{S}^{i, j}]^{\top}$, $\bar{s}_{SUV} = [0, 0, 1]^{\top}$, and $R_{s} \in SO(3)$ is any rotation that yields $\bar{s}_{SUV} = R_{s} \bar{s}_{RGB} = [0, 0, 1]^{\top}$. UV forms a specular-free colour subspace that is invariant to $f_{s, k}^{i, j}$. Zickler’s PS method first estimates $\bar{d}_{UV}^{i, j}$ of unit length using DPCA on $[E_{UV}^{i, j *}]^{\top} [E_{UV}^{i, j *}]$. The problem then becomes the conventional Lambertian-based PS problem, and the solution is obtained in a form similar to Equation (14). Compared with Barsky’s PS method, Zickler’s PS method utilizes more observations and advantageously performs PS in the specular-free UV colour subspace. Shape information from specularities is, however, again neglected, and the method can only provide the diffuse component up to the normalised RGB space. The diffuse-specular separation in the RGB space remains open.
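Since any rotation that maps $\bar{s}_{RGB}$ to $[0, 0, 1]^{\top}$ is admissible, one simple construction stacks an orthonormal basis whose last vector is $\bar{s}_{RGB}$. The NumPy sketch below is one such choice and is not claimed to be the construction used in [23]; the function name and the sample colours are assumptions for illustration.

```python
import numpy as np

def suv_rotation(s_rgb):
    """Return a rotation R_s with R_s @ s_unit = [0, 0, 1], built from an orthonormal basis."""
    s = np.asarray(s_rgb, dtype=float)
    s = s / np.linalg.norm(s)
    # Start from the coordinate axis least aligned with s to avoid a degenerate cross product.
    a = np.eye(3)[np.argmin(np.abs(s))]
    u = np.cross(a, s)
    u /= np.linalg.norm(u)
    v = np.cross(s, u)                         # u x v = s, so (u, v, s) is right-handed
    return np.vstack([u, v, s])                # rows are the U, V and S axes

R_s = suv_rotation([1.0, 1.0, 1.0])            # white source, an illustrative choice
e_rgb = np.array([0.9, 0.4, 0.3])
e_uv = (R_s @ e_rgb)[:2]                       # first two channels are specular-free
e_uv_with_highlight = (R_s @ (e_rgb + 0.5 * np.ones(3)))[:2]
```

Adding any multiple of the specular colour to the RGB observation changes only the S channel, so `e_uv` and `e_uv_with_highlight` coincide.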

3. DRM-Based Colour PS Method Using Diffuse-Specular Separation

3.1. Overview of the Proposed Colour PS Method

Figure 3 shows the flow and original contributions of the proposed DRM-based colour PS method using the diffuse-specular separation. The proposed method consists of two processes: Step 1, which is diffuse-specular separation using a PS, and Step 2, which is the surface normal refinement. Step 1 identifies good approximate surface normals, whereas Step 2 further refines and finalizes them with maximum accuracy. The steps are marked by red boxes.

Step 1 is composed of four sub-processes: diffuse colour estimation, diffuse-specular separation in RGB space, PS in UV space and outlier estimation. After completing shadow rejection as a preprocess and generating the shadow-free image irradiance matrix $[\underline{E}_{RGB}^{i, j *}]$, Step 1 begins with the diffuse colour estimation using the robust principal component analysis (RPCA) proposed in this paper and derives $\bar{d}_{RGB}^{i, j}$. Note that $\underline{()}$ denotes the quantity $()$ after shadow rejection. The diffuse colour estimation also outputs a specularity map, which describes the distribution of specular reflection. The diffuse-specular separability is then checked for each pixel by deriving the diffuse-specular chromatic angle $\psi^{i, j}$ in RGB space. For separable pixels, the proposed method performs PS in the specular-free UV colour subspace similarly to Zickler’s PS method and derives the initial guess of the unit surface normal $(\bar{n}^{i, j})_{1}$ and the diffuse reflectance factor $(k_{d}^{i, j})_{1}$ similarly to Barsky’s PS method, where ${()}_{1}$ denotes the initial guess of $()$. In parallel, the specular geometrical scaling factors, $(\underline{f}_{s}^{i, j})_{1}$, are derived through the diffuse-specular separation process using Equation (13) for the subsequent surface normal refinement. Observations not in the specularity map are clamped to zero in $(\underline{f}_{s}^{i, j})_{1}$. The surface normals and the other parameters are then refined by fitting the nonlinear DRM using LMA with regularisation.

In Step 2, the initial guesses of the specular parameters, $(k_{s}^{i, j})_{1}$ and $(\beta^{i, j})_{1}$, are derived from $(\bar{n}^{i, j})_{1}$ and $(\underline{f}_{s}^{i, j})_{1}$. Finally, PS for the surface normal refinement derives $\bar{n}^{i, j}$ and $k_{d}^{i, j}$ with enhanced accuracy while additionally updating $k_{s}^{i, j}$ and $\beta^{i, j}$. The strength of the proposed method lies in this PS with the diffuse-specular separation. For pixels where sparse non-Lambertian reflectances appear, the first step is sufficient to reliably estimate the surface orientations. To tackle more general problems with dense non-Lambertian reflectances, the second step is additionally required to exploit surface normal information from specularities in the DRM with the designed parameter initialisation strategy. The rest of this section elucidates the proposed method in detail.

3.2. Diffuse-Specular Separation

3.2.1. Diffuse Colour Estimation

The proposed RPCA derives $\bar{d}_{RGB}^{i, j}$ and the specularity map as follows:

1. Perform principal component analysis (PCA) on $[\underline{E}_{RGB}^{i, j *}]^{\top} [\underline{E}_{RGB}^{i, j *}]$ to estimate $\bar{d}_{RGB}^{i, j}$.
2. Compute the residual matrix: $[R_{d}^{i, j}] = [\underline{E}_{RGB}^{i, j *}] \bar{d}_{RGB}^{i, j} (\bar{d}_{RGB}^{i, j})^{\top} - [\underline{E}_{RGB}^{i, j *}]$.
3. Compute the residual vector: $r_{d}^{i, j} = \sqrt{(r_{d, 1}^{i, j})^{\circ 2} + (r_{d, 2}^{i, j})^{\circ 2} + (r_{d, 3}^{i, j})^{\circ 2}}$, where $[R_{d}^{i, j}] = [r_{d, 1}^{i, j}, r_{d, 2}^{i, j}, r_{d, 3}^{i, j}]$ and $\circ 2$ denotes the Hadamard square.
4. If the mean of $r_{d}^{i, j}$, $\operatorname{mean}(r_{d}^{i, j})$, is smaller than the threshold $T_{d}$, terminate the RPCA and output the current estimate of $\bar{d}_{RGB}^{i, j}$ and the specularity map. Otherwise, find the element that gives the maximal value of $\frac{r_{d}^{i, j} - \operatorname{mean}(r_{d}^{i, j})}{\operatorname{std}(r_{d}^{i, j})}$ and register the corresponding observation at pixel $(i, j)$ in the specularity map. Remove the corresponding row vector from $[\underline{E}_{RGB}^{i, j *}]$ and repeat from step 1.

The RPCA eliminates image irradiances that are inappropriate for diffuse colour estimation.
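The four steps listed above can be sketched for a single pixel as follows. This is a minimal illustration in NumPy, not the authors' code: the function name, the `min_rows` guard that stops the loop before too few observations remain, and the sign convention for the eigenvector are assumptions added for the example.

```python
import numpy as np

def rpca_diffuse_colour(E, T_d, min_rows=3):
    """Iteratively estimate the unit diffuse colour from E (N x 3), dropping suspected specular rows."""
    E = np.asarray(E, dtype=float)
    keep = np.arange(E.shape[0])               # indices of observations still in use
    specular = []                              # indices registered in the specularity map
    while True:
        A = E[keep]
        # Step 1: the dominant eigenvector of A^T A is the diffuse colour estimate.
        _, vecs = np.linalg.eigh(A.T @ A)      # eigh returns eigenvalues in ascending order
        d = vecs[:, -1]
        d = d if d.sum() >= 0 else -d          # fix the sign so the colour is non-negative overall
        # Steps 2-3: residual of each observation after projection onto d.
        r = np.linalg.norm(A @ np.outer(d, d) - A, axis=1)
        # Step 4: stop when the mean residual is small, otherwise reject the worst observation.
        if r.mean() < T_d or len(keep) <= min_rows or r.std() == 0:
            return d, sorted(specular)
        worst = np.argmax((r - r.mean()) / r.std())
        specular.append(int(keep[worst]))
        keep = np.delete(keep, worst)

# Synthetic pixel: six diffuse observations, one of which carries a white highlight.
d_true = np.array([0.8, 0.6, 0.0])
s = np.ones(3) / np.sqrt(3.0)
E = np.outer([0.9, 0.7, 0.5, 0.8, 0.6, 0.4], d_true)
E[2] += 0.5 * s
d_hat, spec_idx = rpca_diffuse_colour(E, T_d=1e-3)
```

In this synthetic case the highlighted observation is rejected in the first pass, after which the remaining rows are exactly rank one and the loop terminates.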

The validity of the RPCA can be explained as follows. The shadow-free image irradiance matrix $[\underline{E}_{RGB}^{i, j *}]$ is expanded from Equation (7) and decomposed as

$$\begin{aligned} [\underline{E}_{RGB}^{i, j *}] &= \underline{f}_{d}^{i, j} (\bar{d}_{RGB}^{i, j})^{\top} + \underline{f}_{s}^{i, j} (\bar{s}_{RGB})^{\top} \\ &= \begin{bmatrix} f_{d, 1}^{i, j} \\ f_{d, 2}^{i, j} \\ \vdots \\ f_{d, N_{p}}^{i, j} \end{bmatrix} \begin{bmatrix} \bar{d}_{R}^{i, j} & \bar{d}_{G}^{i, j} & \bar{d}_{B}^{i, j} \end{bmatrix} + \begin{bmatrix} f_{s, 1}^{i, j} \\ f_{s, 2}^{i, j} \\ \vdots \\ f_{s, N_{p}}^{i, j} \end{bmatrix} \begin{bmatrix} \bar{s}_{R} & \bar{s}_{G} & \bar{s}_{B} \end{bmatrix}, \end{aligned} \tag{16}$$

where most entries in $\underline{f}_{s}^{i, j}$ are near zero since images are often nearly specular-free. In such a case, a non-zero residual occurs if $\underline{f}_{s}^{i, j}$ contains non-negligible entries. The RPCA algorithm robustly and accurately estimates $\bar{d}_{RGB}^{i, j}$ by repeatedly rejecting specularities until the residual becomes smaller than $T_{d}$. Unlike Barsky’s PS method, which identifies specularities after $\bar{d}_{RGB}^{i, j}$ estimation, the proposed algorithm derives $\bar{d}_{RGB}^{i, j}$ and the specularity map simultaneously.

While the originality and advantages of the proposed RPCA algorithm have been identified, its performance depends on the value of $T_{d}$. If $T_{d}$ is too large, the RPCA algorithm is tolerant to specularities and performs similarly to the DPCA algorithm. If $T_{d}$ is too small, the RPCA algorithm rejects valid observations as specularities and makes the estimate sensitive to noise.

3.2.2. Diffuse-Specular Separability Check

With $\bar{d}_{RGB}^{i, j}$ estimated, the diffuse-specular chromatic angle, used to perform the separability check, is derived as

$$\cos \psi^{i, j} = (\bar{d}_{RGB}^{i, j})^{\top} \bar{s}_{RGB}. \tag{17}$$

Asymptotically, the chromatic angle gives

$$\lim_{\psi^{i, j} \to 0} \bar{d}_{RGB}^{i, j} = \bar{s}_{RGB}. \tag{18}$$

This means that $\psi^{i, j}$ should be sufficiently larger than 0 in order for $\bar{d}_{RGB}^{i, j}$ and $\bar{s}_{RGB}$ to be distinct and separable. The threshold of the chromatic angle to determine the separability is $T_{c}$. If $\psi^{i, j} \geq T_{c}$, $\bar{d}_{RGB}^{i, j}$ and $\bar{s}_{RGB}$ are sufficiently distinct and pixel $(i, j)$ is considered separable. Similarly to $T_{d}$, $T_{c}$ should be chosen carefully. A small $T_{c}$ results in false separation. Once $\bar{d}_{RGB}^{i, j}$ and $\bar{s}_{RGB}$ have been identified, $(\underline{f}_{s}^{i, j})_{1}$ is then initialised using Equation (13).
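The separability check of Equation (17) amounts to thresholding the angle between two colour vectors. The sketch below uses only the standard library; the function names are hypothetical, and the threshold of 10° is an arbitrary value chosen for the example, not one taken from the paper.

```python
import math

def chromatic_angle_deg(d_rgb, s_rgb):
    """Angle in degrees between the diffuse and specular colours, from Equation (17)."""
    dot = sum(a * b for a, b in zip(d_rgb, s_rgb))
    norm_d = math.sqrt(sum(a * a for a in d_rgb))
    norm_s = math.sqrt(sum(b * b for b in s_rgb))
    cos_psi = max(-1.0, min(1.0, dot / (norm_d * norm_s)))   # guard against rounding
    return math.degrees(math.acos(cos_psi))

def is_separable(d_rgb, s_rgb, T_c_deg):
    """A pixel is treated as separable when the chromatic angle reaches the threshold."""
    return chromatic_angle_deg(d_rgb, s_rgb) >= T_c_deg

T_c_deg = 10.0    # hypothetical threshold for illustration
red_ok = is_separable([1.0, 0.0, 0.0], [1.0, 1.0, 1.0], T_c_deg)
grey_ok = is_separable([0.58, 0.57, 0.58], [1.0, 1.0, 1.0], T_c_deg)
```

A saturated red surface under a white source is far from the specular colour and passes the check, whereas a near-grey surface is almost parallel to it and does not, in which case Equation (13) would be ill-conditioned.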

3.2.3. PS in UV Space

Since the PS is performed in the UV colour space, the unit diffuse colour $\bar{d}_{RGB}^{i, j}$ should be converted into that in the UV colour space, $\bar{d}_{UV}^{i, j}$. The PS in UV colour space allows more observations, as described for Zickler’s PS method. With $R_{s}$, the estimated $\bar{d}_{RGB}^{i, j}$ is transformed to the SUV space as $\hat{d}^{i, j} = [\hat{d}_{U}^{i, j}, \hat{d}_{V}^{i, j}, \hat{d}_{S}^{i, j}]^{\top}$. The diffuse colour in UV space, $\hat{d}_{UV}^{i, j}$, is then given by

$$\hat{d}_{UV}^{i, j} = [\hat{d}_{U}^{i, j}, \hat{d}_{V}^{i, j}]^{\top} = \kappa_{z}^{i, j} \bar{d}_{UV}^{i, j}, \tag{19}$$

where $\bar{d}_{UV}^{i, j} = [\bar{d}_{U}^{i, j}, \bar{d}_{V}^{i, j}]^{\top}$ is a unit vector, and

$$\kappa_{z}^{i, j} = \sqrt{(\hat{d}_{U}^{i, j})^{2} + (\hat{d}_{V}^{i, j})^{2}} > 0. \tag{20}$$

Image irradiances of separable pixels are also transformed to the SUV colour space as $[\underline{E}_{SUV}^{i, j}]$ using the same $R_{s}$. $[\underline{E}_{UV}^{i, j}]$ is then formed by picking the first two columns of $[\underline{E}_{SUV}^{i, j}]$. Due to the presence of image noise, the PS in UV space is modified from Equation (15) to reject noise-corrupted image irradiances as outliers using studentised residuals [29]:

$$\begin{aligned} \epsilon^{i, j} &= \sum_{k = 1}^{N_{p}} \left[ (e_{UV, k}^{i, j *})^{\top} \bar{d}_{UV}^{i, j} - \kappa_{z}^{i, j} f_{d, k}^{i, j} \right]^{2} \\ &= \sum_{k = 1}^{N_{p}} \left[ (e_{UV, k}^{i, j *})^{\top} \bar{d}_{UV}^{i, j} - \kappa_{z}^{i, j} (k_{d}^{i, j})_{1} (\bar{n}^{i, j})_{1}^{\top} \bar{l}_{k}^{i, j} \right]^{2} \to \min_{(\bar{n}^{i, j})_{1}, (k_{d}^{i, j})_{1}}, \end{aligned} \tag{21}$$

where $N_{p}$ is the number of shadow-free images, so $N_{b} \leq N_{p} \leq N$. The derivation of $(k_{d}^{i, j})_{1}$ in addition to $(\bar{n}^{i, j})_{1}$, similarly to Barsky’s PS method, enables full recovery of the colour diffuse component. $N_{p}$ is used instead of N and $N_{b}$ because all shadow-free images provide reliable observations.

After the PS, $\bar{n}^{i, j}$ and $k_{d}^{i, j}$ are initialised as $(\bar{n}^{i, j})_{1}$ and $(k_{d}^{i, j})_{1}$, respectively. The diffuse component matrix, $[E_{d, RGB}^{i, j}]$, is then re-rendered as:

$$[E_{d, RGB}^{i, j}] = \max \left( (k_{d}^{i, j})_{1} [L^{i, j}] (\bar{n}^{i, j})_{1}, 0_{N} \right) (\bar{d}_{RGB}^{i, j})^{\top}, \tag{22}$$

where $0_{N}$ is an $N \times 1$ vector with all zero entries. The specular component matrix, $[E_{s, RGB}^{i, j}]$, is obtained by $\max \left( (\underline{f}_{s}^{i, j})_{1} \bar{s}_{RGB}^{\top}, 0_{N} \right)$. With the separated colour diffuse and specular components, the proposed method allows the functionality of specular removal and intrinsic image decomposition.

3.2.4. Outlier Estimation

Let the residual vector

{\underset{̲}{r}}^{i, j}

be written as:

{\underset{̲}{r}}^{i, j} = [{\underset{̲}{E}}_{U V}^{i, j}] {\bar{d}}_{U V}^{i, j} - [{\underset{̲}{L}}^{i, j}] n^{i, j} = [{\underset{̲}{E}}_{U V}^{i, j}] {\bar{d}}_{U V}^{i, j} - [{\underset{̲}{L}}^{i, j}] ρ^{i, j} {\bar{n}}^{i, j},

(23)

where

n^{i, j} = ρ^{i, j} {\bar{n}}^{i, j}

is the scaled normal,

[{\underset{̲}{L}}^{i, j}] = {[{\bar{l}}_{1}^{i, j}, {\bar{l}}_{2}^{i, j}, \dots, {\bar{l}}_{M}^{i, j}]}^{T}

, and

ρ^{i, j} = k_{d}^{i, j} κ_{z}^{i, j}

. The hat matrix

[{\underset{̲}{H}}_{a}^{i, j}]

[29] is represented by:

[{\underset{̲}{H}}_{a}^{i, j}] = [{\underset{̲}{L}}^{i, j}] {({[{\underset{̲}{L}}^{i, j}]}^{T} [{\underset{̲}{L}}^{i, j}])}^{- 1} {[{\underset{̲}{L}}^{i, j}]}^{T} .

(24)

Diagonal entries in

[{\underset{̲}{H}}_{a}^{i, j}]

are leverages and the kth leverage is denoted as

h_{a, k}^{i, j}

. These leverages quantify the influence that the observed values

[{\underset{̲}{E}}_{U V}^{i, j}] {\bar{d}}_{U V}^{i, j}

have on their predicted values

[{\underset{̲}{L}}^{i, j}] n^{i, j}

. Each entry in the studentised residual vector

{\tilde{\underset{̲}{r}}}^{i, j}

is then approximated using:

{\tilde{r}}_{k}^{i, j} = \frac{r_{k}^{i, j}}{\sqrt{{(M S E)}^{i, j} (1 - h_{a, k}^{i, j})}},

(25)

where

{(MSE)}^{i, j}

represents the mean squared error of

{\underset{̲}{r}}^{i, j}

. If any entry in

| {\tilde{\underset{̲}{r}}}^{i, j} |

exceeds

T_{o}

, this entry is classified as an outlier. The proposed PS rejects outliers until no more can be detected or

{(MSE)}^{i, j}

is smaller than a tolerance,

T_{m}

. Following the general rule of thumb for detecting outliers using studentised residuals,

T_{o}

and

T_{m}

can be chosen as

2.5

and

9 σ_{n}^{2}

, where

σ_{n}

is the standard deviation of the additive white Gaussian noise, which can be estimated using the method given in [30].
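The rejection loop of Equations (23)–(25) can be sketched in Python as follows. This is an illustrative sketch, not the authors' implementation: the helper names `solve3` and `reject_outliers` are hypothetical, the MSE is assumed to be the residual sum of squares divided by the degrees of freedom (M − 3), and only the single worst observation is dropped per iteration, which is one conservative reading of the procedure. The inputs are the light directions and the scalar observations given by the projected irradiances.

```python
import math

def solve3(a, b):
    """Solve the 3x3 linear system a x = b by Gaussian elimination."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for c in range(3):
        p = max(range(c, 3), key=lambda r: abs(m[r][c]))  # partial pivoting
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, 3):
            f = m[r][c] / m[c][c]
            for j in range(c, 4):
                m[r][j] -= f * m[c][j]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):  # back substitution
        x[r] = (m[r][3] - sum(m[r][j] * x[j] for j in range(r + 1, 3))) / m[r][r]
    return x

def reject_outliers(lights, obs, t_o=2.5, t_m=0.0036):
    """Fit obs ~ lights . n and drop observations flagged as outliers.

    Returns the scaled normal and the indices of the retained observations.
    """
    keep = list(range(len(obs)))
    while True:
        rows = [lights[k] for k in keep]
        y = [obs[k] for k in keep]
        # Normal equations (L^T L) n = L^T y.
        ltl = [[sum(r[a] * r[b] for r in rows) for b in range(3)] for a in range(3)]
        lty = [sum(r[a] * v for r, v in zip(rows, y)) for a in range(3)]
        n = solve3(ltl, lty)
        res = [v - sum(c * x for c, x in zip(r, n)) for r, v in zip(rows, y)]
        dof = len(keep) - 3
        if dof <= 0:
            return n, keep
        mse = sum(e * e for e in res) / dof  # assumed definition of the MSE
        if mse <= t_m:
            return n, keep
        worst, worst_t = None, t_o
        for pos, (r, e) in enumerate(zip(rows, res)):
            # Leverage h_k = l_k^T (L^T L)^{-1} l_k, Equation (24).
            h = sum(c * x for c, x in zip(r, solve3(ltl, r)))
            # Studentised residual, Equation (25).
            t = abs(e) / math.sqrt(mse * max(1.0 - h, 1e-12))
            if t > worst_t:
                worst, worst_t = pos, t
        if worst is None:
            return n, keep
        del keep[worst]
```

The default `t_m = 0.0036` corresponds to 9σ_n² with σ_n = 0.02, the setting reported in Section 4.1. Note that an internally studentised residual cannot exceed the square root of M − 3, so with a threshold of 2.5 at least ten observations are needed before any single point can be flagged.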

3.3. Surface Normal Refinement

3.3.1. Specular Parameter Initialisation

If dense non-Lambertian reflectance appears at pixel

(i, j)

, which implies that more than one entry in

{({\underset{̲}{f}}_{s}^{i, j})}_{1}

is in the specularity map, surface normal refinement is necessary since the sparse non-Lambertian reflectance assumption in the diffuse-specular separation is violated. With the estimated

{({\underset{̲}{f}}_{s}^{i, j})}_{1}

, the next objective is to initialise the specular parameters,

{(k_{s}^{i, j})}_{1}

and

{(β^{i, j})}_{1}

. The specific cost functional is given by:

ϵ_{s}^{i, j} = \sum_{k = 1}^{N_{q}} {({(f_{s, k}^{i, j})}_{1} - {(k_{s}^{i, j})}_{1} {({({\bar{h}}_{k}^{i, j})}^{⊤} {({\bar{n}}^{i, j})}_{1})}^{{(β^{i, j})}_{1}})}^{2} \to min_{{(k_{s}^{i, j})}_{1}, {(β^{i, j})}_{1}},

(26)

where

N_{q}

is the number of specularity maps showing pixel

(i, j)

and

N_{q} \geq 2

. Equation (26) poses a nonlinear least-squares problem, but it can be transformed into the natural-logarithm domain, where it becomes a linear problem given by:

ϵ_{l s}^{i, j} = \sum_{k = 1}^{N_{q}} {(ln {(f_{s, k}^{i, j})}_{1} - ln {(k_{s}^{i, j})}_{1} - {(β^{i, j})}_{1} ln ({({\bar{h}}_{k}^{i, j})}^{⊤} {({\bar{n}}^{i, j})}_{1}))}^{2} \to min_{{(k_{s}^{i, j})}_{1}, {(β^{i, j})}_{1}} .

(27)

Let

{({\underset{̲}{\underset{̲}{f}}}_{s}^{i, j})}_{1}

with

N_{q}

rows consist of the entries of

{({\underset{̲}{f}}_{s}^{i, j})}_{1}

that lie in the specularity map and

{[\underset{̲}{\underset{̲}{H}}]}^{i, j} = {[{\bar{h}}_{1}^{i, j}, {\bar{h}}_{2}^{i, j}, \dots, {\bar{h}}_{N_{q}}^{i, j}]}^{T}

be the matrix composed of the corresponding unit half vectors. Then, the solution of

ln {(k_{s}^{i, j})}_{1}

and

{(β^{i, j})}_{1}

to Equation (27) is given analytically by:

[\begin{matrix} {(β^{i, j})}_{1} \\ ln {(k_{s}^{i, j})}_{1} \end{matrix}] = {({[{\underset{̲}{\underset{̲}{S}}}^{i, j}]}^{⊤} [{\underset{̲}{\underset{̲}{S}}}^{i, j}])}^{- 1} {[{\underset{̲}{\underset{̲}{S}}}^{i, j}]}^{⊤} ln {({\underset{̲}{\underset{̲}{f}}}_{s}^{i, j})}_{1},

(28)

where

[{\underset{̲}{\underset{̲}{S}}}^{i, j}] = [ln ({[\underset{̲}{\underset{̲}{H}}]}^{i, j} {({\bar{n}}^{i, j})}_{1}), 1_{N_{q}}]

(29)

and

1_{N_{q}}

is an

N_{q} \times 1

vector with all entries equal to 1.
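Because the design matrix in Equation (29) has only two columns, the closed-form solution of Equation (28) reduces to an ordinary straight-line fit of ln f_s against ln(hᵀn). The Python sketch below shows this under the assumption that every selected specular intensity and every dot product hᵀn is strictly positive; the function name `init_specular_params` is hypothetical.

```python
import math

def init_specular_params(f_s, half_vectors, normal):
    """Fit ln f_s = ln k_s + beta * ln(h^T n) by ordinary least squares.

    f_s          : N_q specular intensities in the specularity map (> 0)
    half_vectors : N_q unit half vectors, each 3 floats
    normal       : initial unit surface normal, 3 floats
    Returns (k_s, beta).
    """
    if len(f_s) < 2:
        raise ValueError("at least two specular observations are required")
    # First column of S: ln(h_k^T n); the second column is all ones.
    x = [math.log(sum(h_c * n_c for h_c, n_c in zip(h, normal)))
         for h in half_vectors]
    y = [math.log(f) for f in f_s]
    x_mean = sum(x) / len(x)
    y_mean = sum(y) / len(y)
    s_xx = sum((xi - x_mean) ** 2 for xi in x)
    if s_xx == 0.0:
        raise ValueError("half vectors must give distinct values of h^T n")
    s_xy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    beta = s_xy / s_xx                # slope: specular exponent
    ln_k_s = y_mean - beta * x_mean   # intercept: ln k_s
    return math.exp(ln_k_s), beta
```

Fitting in the logarithmic domain weights small intensities more heavily than the original cost in Equation (26) does, which is acceptable here because the result only serves as an initial guess for the later refinement.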

3.3.2. Surface Normal Refinement in DRM

All parameters in the DRM have been initialised up to this point. The LMA is then employed to iteratively refine these parameters by solving the optimization problem:

\begin{aligned} ϵ^{i, j} = \sum_{k = 1}^{N} \Big( & {(e_{k}^{i, j})}^{⊤} {\bar{s}}_{R G B} - k_{d}^{i, j} {({\bar{n}}^{i, j})}^{⊤} {\bar{l}}_{k}^{i, j} {({\bar{d}}_{R G B}^{i, j})}^{⊤} {\bar{s}}_{R G B} - k_{s}^{i, j} {({({\bar{n}}^{i, j})}^{⊤} {\bar{h}}_{k}^{i, j})}^{β^{i, j}} \\ & + T_{α} (1 - {({\bar{n}}^{i, j})}^{⊤} {\bar{n}}^{i, j}) \Big)^{2} \to \min_{{\bar{n}}^{i, j}, k_{d}^{i, j}, k_{s}^{i, j}, β^{i, j}}, \end{aligned}

(30)

where the first three terms come from the generalised formulation and the fourth, a regularisation term with constant

T_{α}

, is added. The regularisation term limits how far the refined surface normal departs from the initial guess

{({\bar{n}}^{i, j})}_{1}

and prevents overfitting. A larger value of

T_{α}

strengthens the robustness but limits the capability of surface normal refinement.
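To make the structure of Equation (30) concrete, the following Python sketch evaluates the cost for one pixel. It only evaluates the objective and does not perform the Levenberg-Marquardt iterations; a library minimiser (for example SciPy's `least_squares` with `method="lm"`) could be applied to the per-image residuals. The function names are hypothetical, and clamping nᵀh at zero before exponentiation is an assumption added here to keep the power real-valued.

```python
def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))

def drm_cost(obs_rgb, lights, halves, s_rgb, d_rgb,
             normal, k_d, k_s, beta, t_alpha=3.0):
    """Evaluate the cost of Equation (30) for one pixel.

    obs_rgb : N observed RGB irradiances e_k
    lights  : N unit light directions l_k
    halves  : N unit half vectors h_k
    s_rgb   : unit specular colour; d_rgb : unit diffuse colour
    normal, k_d, k_s, beta : parameters being refined
    """
    # Regulariser T_alpha * (1 - n^T n): zero when the normal has unit length.
    reg = t_alpha * (1.0 - dot(normal, normal))
    d_dot_s = dot(d_rgb, s_rgb)
    cost = 0.0
    for e_k, l_k, h_k in zip(obs_rgb, lights, halves):
        diffuse = k_d * dot(normal, l_k) * d_dot_s
        # Clamp is an assumption of this sketch (avoids a negative base).
        specular = k_s * max(dot(normal, h_k), 0.0) ** beta
        cost += (dot(e_k, s_rgb) - diffuse - specular + reg) ** 2
    return cost
```

With noise-free observations generated from the same model, the cost is zero at the true parameters and grows as the normal drifts away from unit length, which is the behaviour the regularisation term is meant to enforce.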

The strength of the proposed PS method lies in determining the surface normals while separating the diffuse and specular components in RGB space, eliminating outliers and then tuning the diffuse and specular reflectance factors simultaneously. Considering diffuse and specular reflections simultaneously makes the surface normal estimation more accurate. In addition, the proposed method can be used for specular removal [31] and intrinsic image decomposition [32]. Parameters characterizing

f_{s, k}

are determined at pixels, so the proposed method could also be used for digital relighting and material classification where dense non-Lambertian reflectance appears.

4. Performance Evaluation on Surface Orientation Estimation

4.1. Evaluations Using Synthetic Images

The first experiment aims to evaluate the effectiveness of surface normal refinement for dense non-Lambertian reflectance using synthetic input images. A scene with six different-coloured spheres was rendered under 32 illuminants using the Blinn-Phong model. The sphere was adopted as the scene geometry since it samples all possible surface orientations of a visible surface. Different sphere colours were introduced for a more comprehensive evaluation of the proposed PS method on surfaces with various spectral reflectances. The 32 illuminants were chosen to provide a sufficient number of pixels with dense non-Lambertian reflectance for evaluation.

{\bar{s}}_{R G B}

was set at

{[0.5774, 0.5774, 0.5774]}^{T}

. Three of the spheres were red, green and blue, with the same

ψ^{i, j}

of

57.74^{\circ}

, while the other three spheres were yellow, cyan and magenta, with

ψ^{i, j}

of

35.26^{\circ}

. The 32 light positions were configured with

r = 442

mm and

θ = 20^{\circ}

. The reference plane was located where

R_{W} = d i a g ([1, - 1, - 1])

and

t_{W} = {[0, 0, 678]}^{T}

.

k_{d}^{i, j}

,

k_{s}^{i, j}

and

β^{i, j}

were set the same across the field of view as

0.4

,

0.2

and 100, respectively. Additive Gaussian noise was introduced with

σ_{n} = 0.02

. The image irradiance under the first illuminant is shown in Figure 4a. The parameters of the proposed PS method were configured as:

T_{d} = 0.01

,

T_{c} = 5^{\circ}

,

T_{o} = 2.5

,

T_{m} = 0.0036

,

T_{α} = 3

,

T_{x} = 0.02

and

T_{y} = 0.02

. 100 repeated tests were conducted and the mean value was adopted for evaluation.
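For orientation, a single noisy pixel of such a synthetic image could be generated as in the Python sketch below, which combines a diffuse term and a Blinn-Phong specular term under the DRM and adds Gaussian noise. The default values follow the settings quoted above; the function names, the explicit view direction and the clamping of negative dot products are assumptions of this sketch and are not taken from the authors' renderer.

```python
import math
import random

def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))

def normalise(v):
    """Scale a non-zero vector to unit length."""
    norm = math.sqrt(dot(v, v))
    return [c / norm for c in v]

def render_pixel(normal, light, view, d_rgb, s_rgb,
                 k_d=0.4, k_s=0.2, beta=100.0, sigma_n=0.02, rng=None):
    """Return one noisy RGB irradiance under the DRM with Blinn-Phong."""
    rng = rng or random.Random(0)
    n, l, v = normalise(normal), normalise(light), normalise(view)
    # Half vector between the light and view directions
    # (undefined if they are exactly opposite).
    h = normalise([a + b for a, b in zip(l, v)])
    f_d = k_d * max(dot(n, l), 0.0)            # diffuse intensity
    f_s = k_s * max(dot(n, h), 0.0) ** beta    # specular intensity
    return [f_d * dc + f_s * sc + rng.gauss(0.0, sigma_n)
            for dc, sc in zip(d_rgb, s_rgb)]
```

Averaging an error metric over repeated calls with different random seeds mirrors the repeated-test protocol described above.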

Figure 4b shows the angular error of surface orientation estimation without surface normal refinement. The central regions of the spheres, where dense non-Lambertian reflectances appear, have larger errors. By including the proposed surface normal refinement step, the angular errors of surface orientations around the detected dense non-Lambertian reflectance region, shown by a ring, are significantly reduced, as shown in Figure 4c. This is where the reflectance parameters are tuned most through the surface normal refinement. Errors in such regions are even smaller than those with sparse non-Lambertian reflectance, which further supports the view that treating specularities as meaningful signals benefits surface orientation estimation. Figure 4d shows the improvement percentage for quantitative evaluation. The improvement percentage is defined as the reduction of the error divided by the error without surface normal refinement. The accuracy of surface orientations was enhanced by

34.33\%

in median and

32.25\%

on average. The first and third quartile values of the improvement were

15.76\%

and

54.23\%

, respectively. The improvements on the six different-coloured spheres are shown in Table 1. The improvements were significant for all spheres, implying that the method is applicable to a wide range of surfaces with different spectral reflectances. For spheres that had a larger chromatic angle,

ψ^{i, j}

, the

{\bar{d}}_{R G B}^{i, j}

estimations were more accurate. The better estimated

{\bar{d}}_{R G B}^{i, j}

led to the more accurate recovery of

{({\underset{̲}{f}}_{s}^{i, j})}_{1}

, resulting in the more reliable initialisation of

{({\bar{n}}^{i, j})}_{1}

. Such initialisations of

{\bar{n}}^{i, j}

ultimately led to the different improvements. Overall, the accuracy was enhanced by including the additional surface normal refinement step, and the improvement was more pronounced for spectral reflectances with larger chromatic angles.
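The evaluation metrics used in this section can be sketched in Python as follows: the angular error between an estimated and a reference normal, and the summary statistics of the improvement percentage as defined above. The function names are hypothetical, and the quartile convention (the "inclusive" method of the standard `statistics` module) is an assumption, since the text does not state which convention was used.

```python
import math
import statistics

def angular_error_deg(n_est, n_ref):
    """Angle in degrees between an estimated and a reference normal."""
    dot = sum(a * b for a, b in zip(n_est, n_ref))
    norm = (math.sqrt(sum(a * a for a in n_est))
            * math.sqrt(sum(b * b for b in n_ref)))
    # Clamp to [-1, 1] to guard against rounding before acos.
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

def improvement_summary(err_before, err_after):
    """Summarise per-pixel improvement percentages.

    Improvement = 100 * (error without refinement - error with refinement)
                  / error without refinement.
    Pixels with zero error before refinement are skipped.
    """
    imp = [100.0 * (b - a) / b
           for b, a in zip(err_before, err_after) if b > 0.0]
    q1, median, q3 = statistics.quantiles(imp, n=4, method="inclusive")
    return {"mean": statistics.fmean(imp), "median": median,
            "q1": q1, "q3": q3}
```

A negative improvement for a pixel indicates that refinement increased the error there, so the first quartile is a useful indicator of how often refinement fails to help.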

In the second experiment, the performance improvement due to surface normal refinement was verified on a wide range of reflectances. Experimental settings and parameters of the proposed method were the same as in the first experiment, except that different values of

k_{d}^{i, j}

,

k_{s}^{i, j}

and

β^{i, j}

were applied to represent various surface reflectances when generating synthetic images. The ratio of

κ_{d s}^{i, j} = k_{d}^{i, j} / k_{s}^{i, j}

represents the relative strength between the diffuse and specular components, whereas

β^{i, j}

indicates the width of the specular lobe. The mean, median, and first and third quartile values of the improvement were adopted for evaluation, and the angular error of

{\bar{d}}_{R G B}^{i, j}

estimation was used for analyses. As shown in Table 2, the improvements on all the different reflectances were significant and more pronounced for larger values of

β^{i, j}

and

κ_{d s}^{i, j}

. A larger value of

β^{i, j}

, suggesting a narrower specular lobe, made the

{\bar{d}}_{R G B}^{i, j}

estimation more accurate since the sparse non-Lambertian reflectance assumption was valid. Similarly, a larger value of

κ_{d s}^{i, j}

, indicating stronger diffuse components, gave a better

{\bar{d}}_{R G B}^{i, j}

estimation due to stronger inliers. A more accurate

{\bar{d}}_{R G B}^{i, j}

estimation led to a bigger improvement from the surface normal refinement. In summary, with the additional surface normal refinement step, the accuracy of surface orientation estimation is enhanced by around

30\%

on average, and the improvement is more pronounced for reflectances that have a narrower specular lobe and a stronger diffuse component.

The third experiment aims to evaluate the overall performance of the proposed colour PS method on a wide variety of dielectric material reflectances. Twenty-four material BRDFs divided into six categories were evaluated, with the BRDFs taken from the MERL database [33]. Experimental settings and the method parameters were exactly the same as in the first experiment, except that the BRDFs were replaced. The image irradiance for the first six materials under the first illuminant is shown in Figure 5a. The angular errors of the surface orientation estimation are given in Figure 5b. The error is larger not only near the central regions where specularities overlap but also at the boundaries. Boundary pixels of the spheres have shallow grazing angles, and the error there is due to the lack of Fresnel reflection modeling in the Blinn-Phong model. Figure 5c shows the performance of surface normal estimation on the 24 materials using a box-and-whisker plot. The red and black dots represent the mean and median values, respectively. The lower and upper bounds of the box indicate the first and third quartile values. The six colours of the boxes indicate the six material categories. The proposed method can accurately estimate

{\bar{n}}^{i, j}

for most dielectric materials within 5 degrees on average. The estimation performances on phenolic, plastic and rubber were consistent, whereas those on the wood stain, fabric and acrylic were also acceptable with two exceptions, the violet acrylic and the green fabric. The degradation for the violet acrylic was due to its wide specular lobe with small

κ_{d s}^{i, j}

, while that for the green fabric was due to its small

ψ^{i, j}

.

4.2. Evaluations Using Real Images

The proposed method was evaluated on five different datasets composed of real images in the DiLiGenT database [4]. Parameters of the proposed method were set the same as in the first experiment.

{\bar{s}}_{R G B}

was estimated as

{[0.5774, 0.5774, 0.5774]}^{T}

. The first to fifth columns of Figure 6 show the results on BUDDHA, BEAR, POT2, READING and GOBLET, respectively. Figure 6a shows the image irradiance under the first illuminant, while Figure 6b,c show the estimated normal map and the angular error of surface orientation estimation, respectively. As the READING result shows, the proposed method is only feasible for surfaces whose

{\bar{d}}_{R G B}^{i, j}

and

{\bar{s}}_{R G B}

are distinct. Figure 6c shows that the dense non-Lambertian problem, where the specularities overlap, is mostly solved for BUDDHA, BEAR and POT2, while deficits remain on GOBLET because the DRM is not suited to characterising metallic reflectance.

The results were compared with those of nine other PS methods, which are listed in Table 3. The results of the nine methods were provided by the DiLiGenT database, and these methods are representative and well recognised. Note that the results of the recent machine-learning-based techniques reviewed in the Introduction are not shown for comparison, since this paper investigates the effect of physics-based models. Owing to data availability, refs. [21,23] were not included in the comparison, although the essence of their work in using the DRM with known

{\bar{s}}_{R G B}

was well-inherited by the proposed method.

Figure 6d shows the results from ten PS methods on the five datasets. The X-axis represents the method index and the Y-axis indicates the angular error of

{\bar{n}}^{i, j}

. The five colours of the bar represent the different datasets. The angular error of

{\bar{n}}^{i, j}

is also shown using a box-and-whisker plot similar to Figure 5. The proposed method performs well in estimating surface orientations for all five datasets except GOBLET. The results suggest that modeling reflectance using the DRM is appropriate for surfaces made of dielectric materials. The strength of the proposed method lies in the reliable estimation of

{\bar{d}}_{R G B}^{i, j}

with simultaneous specularity detection. The advantage is also attributable to estimating

{\bar{n}}^{i, j}

not only from the diffuse components but also from the specularities owing to the diffuse-specular separation. In spite of its advantages, the proposed method is not effective for metallic surfaces compared with methods such as [14]. The limitation originates from the fundamental assumption of the DRM that

{\bar{s}}_{R G B}

is the same across the surface. This assumption is violated for metallic surfaces since the wavelength and geometry exhibit inter-dependency.

5. Conclusions and Future Works

A PS method using colour images to deal with non-Lambertian reflectance has been proposed. The method formulates the imaging photometry using the DRM with known specular colour. It extracts surface orientations not only from the diffuse components but also from the specularities owing to the diffuse-specular separation. By introducing the additional surface normal refinement step using information from specularities, the proposed method particularly improves the accuracy of surface orientation estimation at pixels where dense non-Lambertian reflectance appears. The simultaneously acquired specular parameters can support further applications, such as digital relighting and material classification.

From the experiment validating the effectiveness of the newly proposed surface normal refinement step on surfaces where dense non-Lambertian reflectance appears, the results indicate that, with the additional step, the accuracy is enhanced by around

30\%

on average, and the improvement is more pronounced for surfaces with larger chromatic angles, stronger diffuse components and narrower specular lobes owing to the better estimated unit diffuse colour. The result investigating the applicability of the proposed method to 24 different reflectances of dielectric materials suggests that the average angular error of surface orientation estimation for most materials is within

5^{\circ}

and that bigger errors occur at surface patches with shallower grazing angles owing to the limitation of the Blinn-Phong model, which does not incorporate Fresnel reflection. From the systematic comparison on five datasets with nine other representative methods, the proposed method shows decent performance on reflectances of dielectric materials and degradation on metallic surfaces due to the limitation of the DRM. The result has also demonstrated that the proposed method is only feasible for surfaces whose diffuse and specular colours are distinct.

This paper presented the first set of results on estimating surface orientations using colour images for non-Lambertian reflectance, and the work can be extended in a variety of ways. First, other analytical BRDFs can be employed instead of the Blinn-Phong model to better account for Fresnel reflection. Second, machine-learning-based models can be incorporated to learn and reduce the errors of analytical models. Further, an alternative method can be proposed specifically for surfaces whose diffuse and specular colours are close, to extend the method’s applicability. Making the method adaptive is one possible solution. Last but not least, it is also of particular interest to infer surface properties from the estimated specular parameters for the purpose of material classification.

Author Contributions

Conceptualization, B.L. and T.F.; methodology, B.L. and T.F.; software, B.L.; validation, B.L.; formal analysis, B.L.; investigation, B.L.; resources, B.L.; data curation, B.L.; writing—original draft preparation, B.L.; writing—review and editing, T.F.; visualization, B.L.; supervision, T.F.; project administration, T.F.; funding acquisition, T.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work is sponsored in part by the US Office of Naval Research (N00014-20-1-2468).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to express their sincere gratitude to Tom McKenna at the Office of Naval Research for his invaluable advice.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Ackermann, J.; Goesele, M. A survey of photometric stereo techniques. J. Found. Trends Comput. Graph. Vis. 2015, 9, 149–254.
2. Klasing, K.; Althoff, D.; Wollherr, D.; Buss, M. Comparison of surface normal estimation methods for range sensing applications. In Proceedings of the 2009 IEEE International Conference on Robotics and Automation, Kobe, Japan, 12–17 May 2009; pp. 3206–3211.
3. Woodham, R.J. Photometric method for determining surface orientation from multiple images. Opt. Eng. 1980, 19, 139–144.
4. Shi, B.; Zhe, W.; Mo, Z.; Duan, D.; Yeung, S.; Tan, P. A benchmark dataset and evaluation for non-Lambertian and uncalibrated photometric stereo. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 3707–3716.
5. Shell, J.R. Bidirectional Reflectance: An Overview with Remote Sensing Applications & Measurement Recommendations. 2004, pp. 37–50. Available online: http://web.gps.caltech.edu/~vijay/Papers/BRDF/shell-04.pdf (accessed on 4 January 2022).
6. Nayar, S.K.; Ikeuchi, K.; Kanade, T. Determining shape and reflectance of hybrid surfaces by photometric sampling. IEEE Trans. Robot. Autom. 1990, 6, 418–431.
7. Torrance, K.E.; Sparrow, E.M. Theory for off-specular reflection from roughened surfaces. J. Opt. Soc. Am. 1967, 57, 1105–1114.
8. Georghiades, A. Incorporating the Torrance and Sparrow model of reflectance in uncalibrated photometric stereo. In Proceedings of the Ninth IEEE International Conference on Computer Vision, Nice, France, 13–16 October 2003; pp. 816–823.
9. Goldman, D.B.; Curless, B.; Hertzmann, A.; Seitz, S.M. Shape and spatially-varying BRDFs from photometric stereo. IEEE Trans. PAMI 2010, 32, 1060–1071.
10. Ward, G.J. Measuring and modeling anisotropic reflection. In SIGGRAPH92: Proceedings of the 19th Annual Conference and Exhibition on Computer Graphics and Interactive Techniques; Association for Computing Machinery: New York, NY, USA, 1992; pp. 265–272.
11. Alldrin, N.G.; Zickler, T.; Kriegman, D.J. Photometric stereo with non-parametric and spatially-varying reflectance. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; pp. 1–8.
12. Higo, T.; Matsushita, Y.; Ikeuchi, K. Consensus photometric stereo. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13–18 June 2010; pp. 1157–1164.
13. Shi, B.; Tan, P.; Matsushita, Y.; Ikeuchi, K. Bipolynomial modeling of low-frequency reflectances. IEEE Trans. PAMI 2014, 36, 1078–1091.
14. Ikehata, S.; Aizawa, K. Photometric stereo using constrained bivariate regression for general isotropic surfaces. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28 June 2014; pp. 23–28.
15. Santo, H.; Samejima, M.; Sugano, Y.; Shi, B.; Matsushita, Y. Deep photometric stereo network. In Proceedings of the International Workshop on PBDL in Conjunction with IEEE ICCV, Venice, Italy, 22–29 October 2017; pp. 1–9.
16. Santo, H.; Samejima, M.; Sugano, Y.; Shi, B.; Matsushita, Y. Deep photometric stereo networks for determining surface normal and reflectances. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 114–128.
17. Taniai, T.; Maehara, T. Neural inverse rendering for general reflectance photometric stereo. In Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018.
18. Ikehata, S. CNN-PS: CNN-based photometric stereo for general non-convex surfaces. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–18.
19. Wu, L.; Ganesh, A.; Shi, B.; Matsushita, Y.; Wang, Y.; Ma, Y. Robust photometric stereo via low-rank matrix completion and recovery. In Proceedings of the 10th Asian Conference on Computer Vision, Queenstown, New Zealand, 8–12 November 2010; pp. 703–717.
20. Ikehata, S.; Wipf, D.; Matsushita, Y.; Aizawa, K. Robust photometric stereo using sparse regression. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 318–325.
21. Barsky, S.; Petrou, M. The 4-source photometric stereo technique for three-dimensional surfaces in the presence of highlights and shadows. IEEE Trans. PAMI 2003, 25, 1239–1252.
22. Shafer, S.A. Using colour to separate reflection components. Color Res. Appl. 1985, 10, 210–218.
23. Zickler, T.; Mallick, S.P.; Kriegman, D.J.; Belhumeur, P.N. Color subspace as photometric invariants. Int. J. Comput. Vis. 2008, 79, 13–30.
24. Hartley, R.; Zisserman, A. Chapter 6: Camera Models. In Multiple View Geometry; Cambridge University Press: Cambridge, UK, 2003; pp. 153–177.
25. Angelopoulou, E. Specular highlight detection based on the fresnel reflection coefficient. In Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, Rio de Janeiro, Brazil, 14–21 October 2007; pp. 1–8.
26. Blinn, J.F. Models of light reflection for computer synthesized pictures. In Proceedings of the 4th Annual Conference on Computer Graphics and Interactive Techniques, San Jose, CA, USA, 20–22 July 1977; pp. 192–198.
27. Ngan, A.; Durand, F.; Matusik, W. Experimental validation of analytical BRDF models. In Proceedings of the ACM SIGGRAPH 2004 Sketches, Los Angeles, CA, USA, 8–12 August 2004; p. 90.
28. Kao, Y.H.; Roy, B.V. Directed Principal Component Analysis. Oper. Res. 2014, 62, 957–972.
29. Pope, A.J. The Statistics of Residuals and the Detection of Outliers; NOAA Technical Report; 1976; Volume 1, p. 65. Available online: https://www.ngs.noaa.gov/PUBS_LIB/TRNOS65NGS1.pdf (accessed on 4 January 2022).
30. Liu, C.; Freeman, W.T.; Szeliski, R.; Kang, S.B. Noise estimation from a single image. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), New York, NY, USA, 17–22 June 2006; pp. 901–908.
31. Artusi, A.; Banterle, F.; Chetverikov, D. A survey of specularity removal methods. Comput. Graph. Forum 2011, 30, 2208–2230.
32. Barrow, H.; Tanenbaum, J. Recovering intrinsic scene characteristic from images. Proc. Comput. Vis. Syst. 1978, 2, 3–26.
33. Matusik, W.; Pfister, H.; Brand, M.; McMillan, L. A Data-Driven Reflectance Model. ACM Trans. Graph. 2003, 22, 759–769.
34. Shi, B.; Tan, P.; Matsushita, Y.; Ikeuchi, K. Elevation angle from reflectance monotonicity: Photometric stereo for general isotropic reflectances. In Proceedings of the European Conference on Computer Vision, Florence, Italy, 7–13 October 2012; pp. 455–468.

Figure 1. Schematic diagram for generic colour PS.

Figure 2. Coordinate setups for modeling imaging geometry and light configuration.

Figure 3. Flows and the original contribution of the DRM-based colour PS.

Figure 4. Evaluation of surface normal refinement: (a) Image irradiance under the first illuminant; (b) Angular error of surface orientations without surface normal refinement in degrees; (c) Angular error of surface orientations with surface normal refinement in degrees; (d) Improvement of surface orientation estimation by including surface normal refinement in percentage.

Figure 5. (a) Image irradiance for the first six material BRDFs under the first illuminant; (b) Angular error of surface orientation estimation for the first six material BRDFs; (c) Angular error of surface orientation for the twenty-four dielectric materials.

Figure 6. Evaluations on surface orientation estimation: (a) Image irradiance under the first illuminant; (b) Estimated normal map; (c) Angular error of surface orientations; (d) Angular error of surface orientations on five datasets using ten different PS methods.

Table 1. Improvement on different-coloured spheres.

Index	Colour	$ψ^{i, j}$	Error of ${\bar{d}}_{RGB}^{i, j}$	Mean Improvement
1	red	$57.74^{\circ}$	$1.15^{\circ}$	$23.39\%$
2	yellow	$35.26^{\circ}$	$1.33^{\circ}$	$32.85\%$
3	green	$57.74^{\circ}$	$1.15^{\circ}$	$23.42\%$
4	cyan	$35.26^{\circ}$	$1.32^{\circ}$	$32.04\%$
5	blue	$57.74^{\circ}$	$1.15^{\circ}$	$23.15\%$
6	magenta	$35.26^{\circ}$	$1.33^{\circ}$	$32.24\%$

Table 2. Evaluation of the improvement for different reflectances.

$k_{d} / k_{s}$	$β$	Mean	Median	First Quartile	Third Quartile	Error of ${\bar{d}}_{RGB}$
$0.4 / 0.2$	100	$32.25\%$	$34.33\%$	$15.76\%$	$54.23\%$	$1.23^{\circ}$
$0.4 / 0.4$	100	$31.14\%$	$32.97\%$	$15.21\%$	$53.75\%$	$1.58^{\circ}$
$0.4 / 0.8$	100	$30.48\%$	$32.77\%$	$14.27\%$	$52.90\%$	$2.34^{\circ}$
$0.4 / 0.2$	20	$27.49\%$	$26.70\%$	$6.80\%$	$51.66\%$	$2.99^{\circ}$
$0.4 / 0.4$	20	$25.92\%$	$25.98\%$	$6.67\%$	$51.25\%$	$5.06^{\circ}$
$0.4 / 0.8$	20	$25.02\%$	$24.07\%$	$6.57\%$	$50.81\%$	$8.30^{\circ}$

Table 3. Comparative studies of PS methods.

Method Index	1	2	3	4	5	6	7	8	9	10
Reference	[3]	[9]	[11]	[12]	[19]	[20]	[13]	[34]	[14]	proposed

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
