On Generating Synthetic Datasets for Photometric Stereo Applications

Crabu, Elisa; Rodriguez, Giuseppe

doi:10.3390/computers14050166

Open AccessArticle

On Generating Synthetic Datasets for Photometric Stereo Applications

by

Elisa Crabu

^†

and

Giuseppe Rodriguez

^*,†

Departmentof Mathematics and Computer Science, University of Cagliari, Via Ospedale, 72, 09124 Cagliari, Italy

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Computers 2025, 14(5), 166; https://doi.org/10.3390/computers14050166

Submission received: 27 March 2025 / Revised: 26 April 2025 / Accepted: 28 April 2025 / Published: 29 April 2025

(This article belongs to the Special Issue Advanced Image Processing and Computer Vision (2nd Edition))

Download

Browse Figures

Versions Notes

Abstract

The mathematical model for photometric stereo makes several restricting assumptions, which are often not fulfilled in real-life applications. Specifically, an object surface does not always satisfies Lambert’s cosine law, leading to reflection issues. Moreover, the camera and the light source, in some situations, have to be placed at a close distance from the target, rather than at infinite distance from it. When studying algorithms for these complex situations, it is extremely useful to have at disposal synthetic datasets with known exact solutions, to assert the accuracy of a solution method. The aim of this paper is to present a Matlab package which constructs such datasets on the basis of a chosen exact solution, providing a tool for simulating various real camera/light configurations. This package, starting from the mathematical expression of a surface, or from a discrete sampling, allows the user to build a set of images matching a particular light configuration. Setting various parameters makes it possible to simulate different scenarios, which can be used to investigate the performance of reconstruction algorithms in several situations and test their response to lack of ideality in data. The ability to construct large datasets is particularly useful to train machine learning based algorithms.

Keywords:

photometric stereo; shape from shading; reflection; close light; non-lambertian surface; synthetic dataset

1. Introduction

Computer vision techniques find application in many different fields. Among these, the digital reconstruction of objects is an extensively studied topic, for which various mathematical models are available. The idea behind Shape from Shading (SfS) is to derive shape and color information from a single image of the object [1,2,3]. Since this problem is ill-posed, an alternative SfS technique denoted photometric stereo (PS) [4,5] is often used to determine the three-dimensional shape of an observed target from a set of images. The original formulation requires that images are obtained by light sources placed at different known positions, theoretically at infinite distance from the target, and that the object surface is Lambertian. Under these assumptions, it was proved by Kozera [6] that for the problem to have a unique solution at least two images with different lighting must be available. In this case, the solution method is based on the resolution of a system of first-order Hamilton-Jacobi partial differential equations, which have been studied by different numerical approaches; see, e.g., [7]. However, the assumptions of the model are very limiting, especially in real-life applications.

In [8], PS was applied to an archaeological scenario, with the aim of reconstructing the digital shape of rock art carvings found in various sites; see, e.g., [9]. In this kind of applications, light source has often to be placed close to the surface, due to the structure of the excavation sites. For similar light configurations different methods have been developed, including learning-based models as those presented in [10,11]. An additional issue in archaeology is that the rock is not an ideal Lambertian reflector. This condition have been largely studied using different algorithmic approaches: some based on a statistical model [12], others on learning-based methods [13,14,15,16]. A pre-processing of non-Lambertian data has been discussed in [17], while in [8] a method to extract from dataset a subset of pictures which satisfy the ideal assumptions of the model has been introduced.

Various benchmark datasets for PS are available: some are obtained from real objects [18,19,20,21], some are synthetic datasets, as CyclePS [22], PSWild [23], and PSMix [24], built from online 3D models, for which different reflectance functions have been chosen to replicate real objects. In [25], the Blobby Shape Dataset was presented. This synthetic dataset is obtained by considering 10 fixed surfaces in 10 lighting environments, with a physically-based renderer. A software package to construct a dataset of 100 images of these surfaces, under fixed lighting conditions, is available.

The aim of this paper is to present a Matlab package which allows the user to construct arbitrarily large synthetic datasets that reflect particular observation conditions, starting from a chosen model surface, either known by its mathematical representation or by a discrete sampling. A limited choice of symbolic surfaces is available in the package, but its number can be easily extended. The user can choose the number of light sources and their position, and the package deals with two deviations from the standard Lambertian model: the “close-light configuration”, which allows the user to position selected light sources at a finite distance from the observed object, and the reflection phenomenon, that is, the possibility for the surface to reflect light specularly. Setting specific parameters, the user can produce datasets which combine such issues. Differently from available datasets, where the observation conditions are fixed, with our tool one can freely choose the scenario configuration and generate as many “pictures” as needed, by suitable changing some parameters. Two demonstration programs show how to use the package to produce datasets with different features.

Our study is not oriented to a particular reconstruction method, but produces general datasets that can be used to ascertain the performance of any method, by comparing the results with the exact solution.

The paper is organized as follows: in Section 2 we resume the photometric stereo mathematical model. Section 3 describes the algebraic setting for generating datasets with lights at a finite distance, while Section 4 shows how we deal with reflective surfaces. The software implementation and its use are discussed in Section 5, where some example datasets are also displayed. A sensitivity analysis for datasets produced by the package is performed in Section 6, where an example of its use as a data generator for reconstruction algorithms is also illustrated. Section 7 contains final considerations.

2. The Mathematical Model for Photometric Stereo

In this section we briefly review the mathematical setting introduced in [5], with the notation adopted in [26].

The photometric stereo (PS) technique provides the reconstruction of object surfaces on the basis of information contained in a set of digital pictures. The camera position is theoretically fixed at infinite distance from the object, and the light source is positioned far from the object at different directions, which are assumed to be known, in order to obtain images under different lighting conditions; see Figure 1. The assumption of infinite distance from the target for both the camera and the light sources implies that orthographic projection can be used to describe the scene and that the light rays are all parallel to a fixed vector, different for each picture.

A reference system is fixed in

R^{3}

, so that the observed object is located at the origin and the optical axis of the camera coincides with the z-axis. Each image has resolution

(r + 2) \times (s + 2)

, and is associated to a rectangular domain

Ω = [- \frac{A}{2}, \frac{A}{2}] \times [- \frac{B}{2}, \frac{B}{2}]

, where A is the horizontal size of the image and

B = (s + 1) h

is its vertical size, with

h = A / (r + 1)

.

If the surface is represented by the bivariate function

z = u (x, y), (x, y) \in Ω

, the normalized normal vector can be expressed as

n (x, y) = \frac{{[- u_{x}, - u_{y}, 1]}^{T}}{\sqrt{1 + {∥ \nabla u ∥}^{2}}},

(1)

where

u_{x}

,

u_{y}

denote the partial derivatives of u,

\nabla u

its gradient, and

∥ \cdot ∥

the 2-norm.

We consider a discretization of the domain, i.e., a grid of points with coordinates

(x_{i}, y_{j})

,

i = 0, \dots, r + 1

,

j = 0, \dots, s + 1

, and we sort the pixels lexicographically. Then, the symbols

u (x_{i}, y_{j})

,

u_{i, j}

,

u_{k}

, indifferently represent the value assumed by u at the points of the grid, where

k : = (i - 1) s + j = 1, \dots, p

indexes the internal points and

p = r s

represents their number.

Assuming that q images are available, the vector that spreads from the object to the light source is denoted by

ℓ_{t} = {[ℓ_{1, t}, ℓ_{2, t}, ℓ_{3, t}]}^{T}

,

t = 1, \dots, q

. Its norm is chosen proportional to the light intensity

Saying that the surface of the object is a Lambertian reflector means that it satisfies Lambert’s cosine law

ρ (x, y) 〈 n (x, y), ℓ_{t} 〉 = I_{t} (x, y), t = 1, \dots, q,

(2)

where the albedo

ρ (x, y)

represents the partial absorption of the light at each point of the surface and

I_{t} (x, y)

is the perceived light intensity, that is, the pixel value at the point

(x, y)

of the tth image.

By discretizing Formula (2) and ordering pixels lexicographically, Lambert’s law can be expressed in the matrix form as

D N^{T} L = M .

(3)

In this expression,

D = diag (ρ_{1}, \dots, ρ_{p})

, where

ρ_{k}

,

k = 1, \dots, p

, is the albedo at the internal points,

N = [n_{1}, \dots, n_{p}]

and

L = [ℓ_{1}, \dots, ℓ_{q}]

contain respectively the normal vectors and the lighting directions, and the tth column of

M = [m_{1}, \dots, m_{q}]

represents the vectorized image

I_{t}

of the dataset.

The standard PS formulation assumes the knowledge of lighting positions, i.e., the matrix L. Then, the solution can be easily found by setting

\tilde{N} = N D

and computing

\tilde{N} = M L^{†}

, with

L^{†}

the pseudoinverse matrix of L [27]. By normalizing the columns of

\tilde{N}

one finds an approximation of the normal vector field N and the albedo matrix D. Finally, the solution of the Poisson problem

Δ u (x, y) = f (x, y)

, where f is obtained by numerically differentiating the normal field, provides the surface of the object; see [8] for details.

In some applications, the request for known lighting is a huge limitation, as it might be rather difficult to detect the exact position of the light source during the acquisition of the images. In [26], a method to solve the photometric stereo problem under unknown lighting was analyzed, based on Hayakawa’s procedure for unknown lighting PS [28]. Other approaches for light localization have been considered; see, e.g., [29].

In this paper, we will use Equation (3) as a direct model to generate the data matrix M, given the position of the light sources, a continuous or discrete representation of the surface, and its albedo. If the continuous representation is chosen, than the functions u,

u_{x}

, and

u_{y}

must be supplied. If one prefers a discrete representation, the coordinates

(x_{i}, y_{i}, z_{i}

),

i = 1, \dots, p

, of each point of the surface are needed. In this case, the partial derivatives

u_{x}

and

u_{y}

are approximated by second order finite differences. The albedo can be either a grayscale image, or a RGB color image.

This model only approximates real observation scenes. Indeed, many studies showed that some of its assumptions are not met in experimental settings. As remarked in [8], the request on the light position is particularly limiting in archaeological applications, since several sites do not allow to position the light sources at sufficient distance from the target.

Also, real-world surfaces rarely satisfy Lambert’s law, causing distortions in collected data. Differently from the Lambertian case, whose model is discussed in this section, there is not a general model for non-Lambertian surfaces, so the available reconstruction approaches generally consider specific aspects of non-Lambertianity.

Designing algorithms that can handle these situations require the availability of realistic datasets reproducing specific scene configurations. Synthetic datasets has the advantage to allow a developer to test an algorithm performance by comparing the reconstructed solution with the exact object which generated the data.

In the next two section we focus on two particular situations, namely, “close-lights” and reflection, which have already been discussed in the Introduction and are quite common in real-world applications. The aim is to provide reliable datasets for testing the performance of reconstruction algorithms.

3. The Case of Close Lights

When a light is positioned at a finite distance from the observation, every point of the surface is illuminated from a different direction, as light rays diverge from the source. If we denote by

v_{j}

the position vector of the jth point of the surface, then the relative position of the ith light from that point is

ℓ_{i} - v_{j}

, for

i = 1, \dots, q

and

j = 1, \dots, p

. The light intensity is damped from an attenuation factor

δ_{i, j}

, which depends on the distance between the illuminated point and the light source. Its theoretical value is

δ_{i, j} = {∥ ℓ_{i} - v_{j} ∥}^{- 2}

, but it is often chosen as

δ_{i, j} = {∥ ℓ_{i} - v_{j} ∥}^{- 1}

; see Section 9.7 from [30]. Our software allows the adoption of both values.

With this notation, the light intensity produced by the ith light at the jth point is given by

m_{j, i} = ρ_{j} n_{j}^{T} {\tilde{ℓ}}_{i, j},

(4)

where

{\tilde{ℓ}}_{i, j} = δ_{i, j} (ℓ_{i} - v_{j})

;

m_{j, i}

represents the jth pixel value of the ith image.

To obtain an efficient algorithm for the generation of a synthetic dataset, we express the computation in matrix form. Let us first consider the block-diagonal matrix

N_{s} = [\begin{matrix} n_{1} \\ n_{2} \\ ⋱ \\ n_{p} \end{matrix}] \in R^{(3 p) \times p} .

Even if its size is quite large, it can be efficiently created as a sparse matrix, which requires a storage space slightly larger than the original matrix N.

We gather the position vectors of the surface points in the matrix

W = [v_{1}, \dots, v_{p}],

and define, for

i = 1, \dots, q

,

V_{i} = (ℓ_{i} u_{p}^{T} - W) Δ_{i} = [ℓ_{i} - v_{1}, \dots, ℓ_{i} - v_{p}] Δ_{i},

with

u_{p} = {(1, \dots, 1)}^{T} \in R^{p}

and

Δ_{i} = diag (δ_{i, 1}, \dots, δ_{i, p})

. The matrix

V_{i}

contains the vectors joining each surface point to the ith light source, damped by the attenuation factors

δ_{i, j}

.

We vectorize each

V_{i}

in lexicographic order in the matrix

V = [vec (V_{1}), \dots, vec (V_{q})] .

It is then immediate to verify that Equation (4) gives the generic entry of the matrix M defined by

M = D N_{s}^{T} V,

(5)

where D is the diagonal albedo matrix. Equation (5) allows for a fast and efficient construction of the dataset corresponding to a PS scenario with lights at a finite distance.

4. The Reflection Phenomenon

The basic property of Lambertian surfaces is that they diffuse light in the same way in every direction, i.e., their brightness does not depend on the point of view. This describes a class of ideal Lambertian reflectors which includes few real surfaces: actually, the majority of existing objects present characteristics for which Lambert’s law is not satisfied.

Non-ideal reflectors are such that the brightness at each point depends on the viewing angle. For this reason, some of them may present the phenomenon of reflection, which leads to distortion in the ideal PS data.

This phenomenon implies that, when the reflected light ray is parallel to the optical axis of the camera, the corresponding point presents a lighting intensity much larger than expected according to the PS model, leading to errors in the reconstruction.

Since for each pixel the angle between the normal vector

n

and the lighting vector ℓ is equal to the one between

n

and the reflected vector

r

, as in Figure 2, reflection occurs when

r

is parallel to the optical axis of the camera, that is, the z axis.

Starting from this consideration, using elementary geometric notions, the reflected vector at the point j of the ith image is given by

r_{j}^{(i)} = 2 (n_{j}^{T} ℓ_{i}) n_{j} - ℓ_{i}, i = 1, \dots, q, j = 1, \dots, p .

The above relation can be expressed in the following matrix form

R_{i} = N diag (2 N^{T} ℓ_{i}) - ℓ_{i} u_{p}^{T},

(6)

where

R_{i} = [r_{1}^{(i)}, \dots, r_{p}^{(i)}]

and

u_{p} = {(1, \dots, 1)}^{T} \in R^{p}

.

Reflection occurs at the jth point when the angle

θ_{j}^{(i)}

between

r_{j}^{(i)}

and the vector

z = {[0, 0, 1]}^{T}

is equal to zero. For detecting such situation, we consider

\cos θ_{j}^{(i)} = \frac{〈 r_{j}^{(i)}, z 〉}{∥ r_{j}^{(i)} ∥ ∥ z ∥} = \frac{{(r_{j}^{(i)})}_{3}}{∥ r_{j}^{(i)} ∥},

where

{(r_{j}^{(i)})}_{3}

is the third component of the reflected vector.

To construct a new dataset

\tilde{M}

which keeps into account reflection, we modify a given dataset M by the simple rule

{\tilde{m}}_{i, j} = \{\begin{matrix} max (m_{i, j}, κ_{R}), & if \arccos θ_{j}^{(i)} < τ_{R}, \\ m_{i, j}, & otherwise, \end{matrix}

(7)

where

τ_{R}

is a chosen tolerance, and

κ_{R}

the pixel value which characterizes reflection for the given surface. Setting a value of

τ_{R}

significantly larger than zero results in a wide reflecting area in the neighborhood of the reflection point. The choice of

κ_{R}

may be useful to specify different surface materials.

When the lights are positioned at a finite distance from the target, the computation in (6) is suitably modified according to the notation used in Section 3.

5. Software Description

This section describes a Matlab software (developed on version R2024b) to generate synthetic datasets for the photometric stereo problem. It can manage different PS configurations: the ideal one, the case of lights at finite distance, and the case of reflecting surfaces.

The psdatasynth Matlab package is available at the web page https://bugs.unica.it/cana/software.html (accessed on 27 April 2025). It consists of 6 functions and 2 demonstration scripts, listed in Table 1.

The demo program psdatagen shows how to set various parameters and call the other routines. The function makepsimages is the one that actually constructs the synthetic dataset by generating the matrix M starting from a chosen surface. The surface may be a symbolic function, selected by the choosefun, or a discretized one. The albedo matrix D is constructed by the makealbedo function. Both functions may be easily extended to add more symbolic surfaces and albedo types to the package.

The remaining functions are createlights, which constructs the light matrix L, and the visualization functions plotlights3d and psimshow. The last two functions are used internally and will rarely be called directly by the user.

The functions choosefun, createlights, makealbedo, and plotlights3d, were included without documentation in the ps3d Matlab package, introduced in [26]. In this package, we introduce few minor changes in the makealbedo function. The new functions in psdatasynth will become part of ps3d in a future release.

We now briefly describe the main functions of the package.

5.1. `choosefun`

This function allows the user to choose among 5 different model surfaces. It can be easily extended to include more examples. It takes as input an index to select the chosen example. It returns Matlab function handles that define the bivariate function

u (x, y)

representing the surface, its partial derivatives

u_{x}

,

u_{y}

, necessary to compute the normal field, and the Laplacian

Δ u = u_{x x} + u_{y y}

. The Laplacian is useful for those PS solution methods which integrate the normal field by solving a Poisson differential problem.

5.2. `makealbedo`

The function requires in input an index corresponding to the type of albedo and the size of the image, and returns the albedo image. Three types of albedo are provided and more can be added by the user.

5.3. `createlights`

This function generates the light matrix

L = [ℓ_{1}, \dots, ℓ_{q}]

containing the directions of light sources. The input consists of a vector of q angles, determining the angular coordinate of the projection of the light directions on the

x y

plane, and a vector of the same size containing their z coordinates.

5.4. `makepsimages`

This is the function which actually constructs the dataset. The input parameters are the following:

surface: contains the struct variable which defines the surface;
L, A, r, s, D: the light matrix, the horizontal size of each image, the image sizes in pixels, and the albedo matrix, respectively;
reflectau: this parameter sets the reflection constants, see text;
intens: lights intensity, default value 1 for all lights;
autoexp: automatic or manual camera exposure, allows to normalize the pixel values to 1, default value is 0 for a fixed exposure;
atten: attenuation factor for lights at finite distance; see text.

The function returns the following variables:

M: dataset matrix of size $(r s) \times q$ , containing one image per column;
X, Y, Z: coordinates of surface points;
N: matrix containing normal vectors;
h, B: discretization step and vertical size of images.

Initially, some constants are either extracted from input data or computed: the number of pixel

p = r s

, the number of images q, the discretization step h, etc.

If a symbolic representation of the surface is available, the slope of the surface points and the gradients components are directly computed. If the surface is discretized, the gradient is numerically approximated by second order finite differences.

When light sources are at infinite distance, each lighting direction is either a vector in

R^{3}

, or a vector in

R^{4}

in homogeneous coordinates, whose fourth entry is set to zero. In this scenario, the model (3) is used to construct the dataset matrix M. If, otherwise, the light vectors have a non-zero fourth component, that is, if lights are at a finite distance, the procedure described in Section 3 is implemented.

Figure 3 displays a synthetic dataset generated by the package, corresponding to a surface available in the choosefun function; see Figure 3a. The lighting directions, plotted in Figure 3b, are obtained by considering 12 angles between 0 and

11 π / 6

. Here, we consider a constant albedo and set

A = 2

,

r = s = 101

, obtaining the dataset displayed in Figure 3c.

The presence of a color RGB albedo causes the final data matrix M to have dimensions

p \times q \times 3

. The three layers of the matrix contain RGB information for each data image. In the computation, the matrices containing normals and lights vectors are the same, only the albedo matrix changes according to red, green, and blue color channels. Figure 4 shows two datasets, corresponding to the same surface and light positions as in Figure 3, with two different albedos, both generated by the function makealbedo.

If the input parameter reflectau is zero, the surface is assumed to be a perfect Lambertian reflector. If it is nonzero, it must be a vector containing the constants

τ_{R}

and

κ_{R}

of Equation (7), and the construction described in Section 4 is employed. Figure 5 shows an example of a discretized reflecting surface. The lighting directions are the same used in Figure 3, the albedo is constant. Setting

τ_{R} = 0.1

and

κ_{R} = 5

, we obtain the dataset displayed in Figure 5c, which presents reflection points. Such points are visualized in Figure 5d, reporting the difference between the reflective dataset and a nonreflective one.

The entries of the parameter intens, if provided, are used as 2-norms of the columns of L, that is, as lights intensity. If autoexp is set to 1, each image in the dataset M is divided by the largest pixel value in that image, ensuring that the brightness reaches its maximum. The parameter atten selects the attenuation factor

δ_{i, j}

for close lights: if its value is 1,

δ_{i, j} = {∥ ℓ_{i} - v_{j} ∥}^{- 2}

, if it is 0, then

δ_{i, j} = {∥ ℓ_{i} - v_{j} ∥}^{- 1}

; see the discussion in Section 3 and Section 9.7 from [30].

5.5. `psdatagen`

The scripts psdatagen provides a practical example of the construction of a dataset. We give an algorithmic description of the process.

The script initially sets some constants that characterize the type of PS scenario, and some visualization parameters. Such constants have easily comprehensible names and are extensively commented in the code. Their names resemble, as much as possible, the notation introduced in previous sections.
The first step consists of constructing the model surface. As already noted, the software accepts two formats for the surface of the target: its analytical expression or a discretization. Such alternatives are implemented in the code by a struct variable, called surface. In the first case, it contains the functions u, $u_{x}$ , and $u_{y}$ returned by choosefun, in the second one, three matrices X, Y, and Z, containing the coordinates of each point of the discretization.
The variable ray is then introduced for the purpose of selecting the distance of the light sources from the target. If its value is zero, the standard setting with lights at infinite distance is selected. A non-zero value represents the distance between the light sources and the reference origin. Similarly, the reflectau parameter is set to activate or deactivate surface reflection, as discussed in the notes to the makepsimages function.
Then, the position of the light sources is set. Various choices for the angles around the z axis are contained in the script, and the vector $z$ contains the z coordinate of each source. At this point, the light matrix L is constructed by calling the createlights function. The matrix can be optionally perturbed by Gaussian noise.
If the variable ray is nonzero, homogeneous coordinates for each source are stored in the column of the matrix $L_{h}$ . This notation signals to the function makepsimages that a scenario with “close” lights has been selected.
Finally, the albedo is obtained by the makealbedo function, and the dataset is constructed by a call to makepsimages.
Results are plotted in different figures: the model surface, light directions, and the dataset. If reflection is activated, a further figure displays the difference between images with and without reflection.

The variable produced by psdatagen are the following:

M: PS data matrix;
$N, L, D$ : normal vectors, light positions, and albedo;
$X, Y, Z$ : discrete coordinates of the synthetic object points;
$r, s$ : image size;
$A, B, h$ : physical size of the observed scene and pixels size.

The computed data can be used as a test dataset for any method intended to solve the photometric stereo problem. In particular, given the possibility to set the parameters at will, the user can create datasets of any size, under very different observation conditions, to be used as training data for machine learning methods. Furthermore, since the algorithms are coded through high-level vector and matrix operations, their Matlab implementation is quite efficient and is automatically parallelized in a multi-core or multi-processor environment, making the software considerably fast.

To illustrate this aspect, the script genbigdataset has been included in the package. It allows to select:

a list of model functions $f_{1}, \dots, f_{n_{1}}$ ;
a list of reflection tolerances $τ_{1}, \dots, τ_{n_{2}}$ ;
a list of noise levels $σ_{1}, \dots, σ_{n_{3}}$ ;
the number of light sources $n_{L}$ .

The script constructs

n_{L}

random light directions (sources at infinite distances) and

n_{L}

light positions at a finite distance. Then, it constructs synthetic pictures corresponding to all possible combinations of such parameters, generating

2 n_{1} n_{2} n_{3} n_{L}

test images.

As an experiment, we fix

n_{1} = 5

,

n_{2} = n_{3} = 2

, and

n_{L} = 100

(see the code of the function genbigdataset.m) to generate 4000 images at resolution

101 \times 101

. The scripts runs in less than 4 seconds on a desktop computer. Figure 6 shows 20 images extracted from the resulting dataset.

6. Some Numerical Simulations

To investigate the quality of the datasets produced by the package proposed in this paper, we present here some numerical experiments.

We start with a sensitivity analysis. A picture is generated by illuminating a model surface from a direction identified by a chosen vector. Then, the vector is progressively rotated around the z axis up to the angle

π / 20

radiants. For each rotation step, a new picture is generated and compared to the initial one.

Figure 7 shows the root mean squared error (RMSE) and the structural similarity (SSIM) for each image of the sequence, as computed by Matlab functions rmse and ssim, versus the angle measured in radiants. It can be seen that the RMSE linearly converges to zero as the angle decreases.

The results of a similar sensitivity analysis for the lighting distance are displayed in Figure 8. In this case a light source is initially placed at distance A (the horizontal size of the observed scene) from the target. Then, the light is progressively moved away from the subject, up to the distance

40 A

. Each picture is compared to the image corresponding to a source at infinite distance. In this case, the RMSE converges to zero exponentially as the distance increases.

We now test the performance of a dataset produced by our package as input for a reconstruction algorithm. We consider the model surface of Figure 3 and construct a sequence of datasets, each one of 7 images of size

401 \times 401

. Each dataset is produced with light sources at fixed directions and distance

γ A

from the observed target, with

γ \in {1, 3, 10, 30, 100, 300, 1000, \infty},

where

γ = \infty

means that the lights are positioned at infinite distance from the surface.

Each dataset is used as input for the reconstruction algorithm coded in the package ps3d from [26], available at https://bugs.unica.it/cana/software.html (accessed on 27 April 2025). Such algorithm first computes the normal vectors as columns of the matrix

\tilde{N}

, approximating N in (3), and then reconstructs the surface by a finite difference approach applied to a Poisson differential problem, as outlined in Section 2. This method assumes that the surface is Lambertian and that the light sources are at infinity, so we expect small errors when

γ = \infty

and increasing errors as

γ

takes smaller values.

To evaluate the quality of the reconstruction we consider the relative errors on the normal vectors and the surface

\frac{∥ N - \tilde{N} ∥_{F}}{{∥ N ∥}_{F}}, \frac{∥ U - \tilde{U} ∥_{F}}{{∥ U ∥}_{F}},

where U contains the values of the model function on the grid,

\tilde{U}

its numerical approximation, and

{∥ \cdot ∥}_{F}

denotes the Frobenius norm.

Such relative errors are displayed in the left graph of Figure 9. We can see that when

γ = \infty

the normal vectors are approximated at machine precision, showing that the synthetic dataset produced by the package is perfectly compliant with the assumptions of the standard PS model. The reconstruction error for the surface is about

10^{- 5}

, in line with the central differences approximation used to solve the Poisson problem, which ensures an error or order

O (h^{2})

, where

h ≃ 5 \cdot 10^{- 3}

in this particular setting.

When the light sources get closer to the observed surface, the errors increase. Indeed, the dataset become “non ideal”, as it progressively violates more and more the assumptions of the standard model.

We show here how a synthetic dataset may be used to simulate particular experimental conditions. When pictures are taken with a camera set for automatic exposure, the light conditions are automatically optimized for each picture. The default option for the function makepsimages is to use a “fixed” exposure for all images, but a simulation of automatic exposure can be activated by setting the input variable autoexp to the value 1, as discussed in Section 5.

The graph on the right of Figure 9 reports the results obtained by repeating the previous experiment with “automatic exposure”. Most of the errors slightly increase, showing that it is preferable to disable automatic exposure in the camera before collecting images. In particular, we observe a strong worsening of both the normal vector computation and the surface reconstruction when light sources are at infinity, situation that is practically encountered when sunlight can be used for illuminating the observed scene.

7. Conclusions

This paper presents an open source software tool to construct datasets for shape from shading problems under specific conditions. In addition to the standard photometric stereo ideal setting, light sources can be placed at any distance from the target, and surfaces with specular reflection can be simulated by the software. The chosen object surface is represented either by explicit formulae or by a discretization, and the albedo can be a grayscale or a color image. Large datasets can be constructed reproducing several observation conditions by simply setting some parameters, allowing to construct large datasets for training neural networks. Differently from available datasets, which only reproduce fixed configurations, one can freely recreate any specific condition. This allows to employ the package in different fields, e.g., in archaeology, where is extremely useful to be able to simulate particular scenarios. Having the exact solution at disposal is essential to estimate the accuracy of new algorithms, allowing to understand the reason for failures, test which algorithmic changes lead to an improvement in accuracy, and experiment with particular shooting techniques.

Author Contributions

All authors equally contributed to the research that led to this paper. All authors have read and agreed to the published version of the manuscript.

Funding

The research of GR is partially supported by European Union-Next Generation EU, Mission 4 Component 1, CUP F53D23002700006 through the PRIN 2022 project “Inverse Problems in the Imaging Sciences (IPIS)” and by Fondazione di Sardegna, Progetto biennale bando 2021, “Computational Methods and Networks in Civil Engineering (COMANCHE)”. EC and GR acknowledge partial support from the PRIN-PNRR 2022 project “AQuAInt–Approximation and Quadrature for Applicative Integral Models” (P20229RMLB). EC and GR are members of the GNCS group of INdAM and are partially supported by INdAM-GNCS 2024 Project “Algebra lineare numerica per problemi di grandi dimensioni: aspetti teorici e applicazioni”.

Data Availability Statement

Data used in numerical experiments were produced by the software package discussed in this paper. The reader can easily reproduce all datesets using the information provided in the text.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhang, R.; Tsai, P.S.; Cryer, J.; Shah, M. Shape from Shading: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 1999, 21, 690–706. [Google Scholar] [CrossRef]
Durou, J.D.; Falcone, M.; Sagona, M. Numerical methods for Shape-from-Shading: A new survey with benchmarks. Comput. Vis. Image Underst. 2008, 109, 22–43. [Google Scholar] [CrossRef]
Cristiani, E.; Falcone, M.; Tozza, S. An overview of some mathematical techniques and problems linking 3D vision to 3D printing. In Mathematical Methods for Object Reconstruction: From 3D Vision to 3D Printing; Cristiani, E., Falcone, M., Tozza, S., Eds.; Springer INdAM Series; Springer: Cham, Switzerland, 2023; Volume 54, pp. 1–34. [Google Scholar]
Woodham, R.J. Photometric stereo: A reflectance map technique for determining surface orientation from image intensity. In Proceedings of the Image Understanding Systems and Industrial Applications I, San Diego, CA, USA, 28–31 August 1978; SPIE: Bellingham WA, USA, 1979; Volume 155, pp. 136–143. [Google Scholar]
Woodham, R.J. Photometric method for determining surface orientation from multiple images. Opt. Eng. 1980, 19, 139–144. [Google Scholar] [CrossRef]
Kozera, R. Existence and uniqueness in photometric stereo. Appl. Math. Comput. 1991, 44, 1–103. [Google Scholar] [CrossRef]
Mecca, R.; Falcone, M. Uniqueness and approximation of a photometric shape-from-shading model. SIAM J. Imaging Sci. 2013, 6, 616–659. [Google Scholar] [CrossRef]
Crabu, E.; Pes, F.; Rodriguez, G.; Tanda, G. Ascertaining the ideality of photometric stereo datasets under unknown lighting. Algorithms 2023, 16, 375. [Google Scholar] [CrossRef]
Dessì, R.; Mannu, C.; Rodriguez, G.; Tanda, G.; Vanzi, M. Recent improvements in photometric stereo for rock art 3D imaging. Digit. Appl. Archaeol. Cult. Herit. (DAACH) 2015, 2, 132–139. [Google Scholar] [CrossRef]
Lichy, D.; Sengupta, S.; Jacobs, D.W. Fast light-weight near-field photometric stereo. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 12612–12621. [Google Scholar]
Logothetis, F.; Mecca, R.; Budvytis, I.; Cipolla, R. A CNN based approach for the point-light photometric stereo problem. Int. J. Comput. Vis. 2023, 131, 101–120. [Google Scholar] [CrossRef]
Ikehata, S.; Wipf, D.; Matsushita, Y.; Aizawa, K. Robust photometric stereo using sparse regression. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 318–325. [Google Scholar]
Ju, Y.; Shi, B.; Jian, M.; Qi, L.; Dong, J.; Lam, K.M. Normattention-psn: A high-frequency region enhanced photometric stereo network with normalized attention. Int. J. Comput. Vis. 2022, 130, 3014–3034. [Google Scholar] [CrossRef]
Logothetis, F.; Budvytis, I.; Mecca, R.; Cipolla, R. Px-net: Simple and efficient pixel-wise training of photometric stereo networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; pp. 12757–12766. [Google Scholar]
Abada, L.; Hannachi, I.; Laallam, M.W.; Aouat, S. Enhanced three-dimensional reconstruction by photometric stereo. In Proceedings of the 2023 5th International Conference on Pattern Analysis and Intelligent Systems (PAIS), Setif, Algeria, 25–26 October 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–5. [Google Scholar]
Wang, K.; Qi, L.; Qin, S.; Luo, K.; Ju, Y.; Li, X.; Dong, J. Image Gradient-Aided Photometric Stereo Network. In Proceedings of the Pacific Rim International Conference on Artificial Intelligence, Kyoto, Japan, 18–24 November 2024; Springer: Cham, Switzerland, 2024; pp. 284–296. [Google Scholar]
Radow, G.; Rodriguez, G.; Mansouri Yarahmadi, A.; Breuß, M. Photometric stereo with non-Lambertian preprocessing and Hayakawa lighting estimation for highly detailed shape reconstruction. In Mathematical Methods for Object Reconstruction: From 3D Vision to 3D Printing; Cristiani, E., Falcone, M., Tozza, S., Eds.; Springer INdAM Series; Springer: Singapore, 2023; Volume 54, pp. 35–56. [Google Scholar]
Guo, H.; Ren, J.; Wang, F.; Shi, B.; Ren, M.; Matsushita, Y. DiLiGenRT: A Photometric Stereo Dataset with Quantified Roughness and Translucency. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 17–18 June 2024; pp. 11810–11820. [Google Scholar]
Mecca, R.; Logothetis, F.; Budvytis, I.; Cipolla, R. Luces: A dataset for near-field point light source photometric stereo. arXiv 2021, arXiv:2104.13135. [Google Scholar]
Ren, J.; Wang, F.; Zhang, J.; Zheng, Q.; Ren, M.; Shi, B. Diligent102: A photometric stereo benchmark dataset with controlled shape and material variation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 12581–12590. [Google Scholar]
Shi, B.; Wu, Z.; Mo, Z.; Duan, D.; Yeung, S.K.; Tan, P. A benchmark dataset and evaluation for non-Lambertian and uncalibrated photometric stereo. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 3707–3716. [Google Scholar]
Ikehata, S. CNN-PS: CNN-based photometric stereo for general non-convex surfaces. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–18. [Google Scholar]
Ikehata, S. Universal photometric stereo network using global lighting contexts. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 12591–12600. [Google Scholar]
Ikehata, S. Scalable, detailed and mask-free universal photometric stereo. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 18–22 June 2023; pp. 13198–13207. [Google Scholar]
Santo, H.; Samejima, M.; Sugano, Y.; Shi, B.; Matsushita, Y. Deep photometric stereo network. In Proceedings of the IEEE International Conference on Computer Vision Workshops, Venice, Italy, 22–29 October 2017; pp. 501–509. [Google Scholar]
Concas, A.; Dessì, R.; Fenu, C.; Rodriguez, G.; Vanzi, M. Identifying the lights position in photometric stereo under unknown lighting. In Proceedings of the 2021 21st International Conference on Computational Science and Its Applications (ICCSA), Cagliari, Italy, 13–16 September 2021; pp. 10–20. [Google Scholar]
Björck, Å. Numerical Methods for Least Squares Problems; SIAM: Philadelphia, PA, USA, 1996. [Google Scholar]
Hayakawa, H. Photometric stereo under a light source with arbitrary motion. J. Opt. Soc. Am. A—Opt. Image Sci. Vis. 1994, 11, 3079–3089. [Google Scholar] [CrossRef]
Jin, W.; Zhu, M.; Liu, J.; He, B.; Yu, J. Shadow-based lightsource localization with direct camera–lightsource geometry. IEEE Trans. Instrum. Meas. 2023, 73, 5005512. [Google Scholar] [CrossRef]
Brown, C.W. Learn WebGL. 2015. Available online: https://learnwebgl.brown37.net/ (accessed on 27 April 2025).

Figure 1. Configuration of the problem: the camera is fixed and a light source moves around the object, taking different positions.

Figure 2. Configuration of the light direction, the normal vector, and the reflected vector. The angles between each pair of vectors are equal. Reflection occurs when

r

is parallel to the z axis.

Figure 2. Configuration of the light direction, the normal vector, and the reflected vector. The angles between each pair of vectors are equal. Reflection occurs when

r

is parallel to the z axis.

Figure 3. An example of a synthetic dataset generated by the package under an ideal configuration.

Figure 4. Same dataset of Figure 3 with two different albedos.

Figure 5. An example of a surface with reflection points.

Figure 6. Large training dataset: each row shows 5 images for a particular test surface, with different lighting conditions, extracted from a dataset of 4000 images.

Figure 7. Sensitivity analysis with respect to the light direction: RMSE on the left, SSIM on the right; the angular perturbation of the light source is measured in radiants.

Figure 8. Sensitivity analysis with respect to the light distance: RMSE on the left, SSIM on the right, both versus the distance of the light source as a multiple of A.

Figure 9. Error analysis for the reconstruction error produced by the algorithm coded in [26]: on the left, results for an ideal dataset, on the right, results for a particular dataset non compliant with the assumptions of the model.

Table 1. Functions of the psdatasynth package.

`psdatagen`	Demo program to illustrate the use of the package.
`makepsimages`	Generate a synthetic dataset based on a chosen configuration.
`choosefun`	Return symbolic descriptions of 5 test surfaces.
`makealbedo`	Create the albedo matrix D.
`createlights`	Create the matrix containing the light source directions.
`plotlights3d`	Plot the light directions.
`psimshow`	Display images with chosen resize factor and color limits.
`genbigdataset`	Demo program to create a large dataset

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Crabu, E.; Rodriguez, G. On Generating Synthetic Datasets for Photometric Stereo Applications. Computers 2025, 14, 166. https://doi.org/10.3390/computers14050166

AMA Style

Crabu E, Rodriguez G. On Generating Synthetic Datasets for Photometric Stereo Applications. Computers. 2025; 14(5):166. https://doi.org/10.3390/computers14050166

Chicago/Turabian Style

Crabu, Elisa, and Giuseppe Rodriguez. 2025. "On Generating Synthetic Datasets for Photometric Stereo Applications" Computers 14, no. 5: 166. https://doi.org/10.3390/computers14050166

APA Style

Crabu, E., & Rodriguez, G. (2025). On Generating Synthetic Datasets for Photometric Stereo Applications. Computers, 14(5), 166. https://doi.org/10.3390/computers14050166

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On Generating Synthetic Datasets for Photometric Stereo Applications

Abstract

1. Introduction

2. The Mathematical Model for Photometric Stereo

3. The Case of Close Lights

4. The Reflection Phenomenon