Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy

Tomás, María-Baralida; Ferrer, Belén; Mas, David

doi:10.3390/s20226596

Open AccessArticle

Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy^†

by

María-Baralida Tomás

,

Belén Ferrer

and

David Mas

^*

University Institute of Physics Applied to the Sciences and Technologies, University of Alicante, P.O. Box 99, 03080 Alicante, Spain

^*

Author to whom correspondence should be addressed.

^†

This paper is an extended version of the conference paper: Tomás, M.B.; Mas, D.; Ferrer, B. Peak-locking minimization by three adjustment methods. In Proceedings of the Optics, Photonics and Digital Technologies for Imaging Applications VI; Schelkens, P., Kozacki, T., Eds.; SPIE: Jakarta, France, 2020; p. 53.

Sensors 2020, 20(22), 6596; https://doi.org/10.3390/s20226596

Submission received: 1 October 2020 / Revised: 10 November 2020 / Accepted: 16 November 2020 / Published: 18 November 2020

(This article belongs to the Special Issue Object Tracking and Motion Analysis)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

A known technique to obtain subpixel resolution by using object tracking through cross-correlation consists of interpolating the obtained correlation function and then refining peak location. Although the technique provides accurate results, peak location is usually biased toward the closest integer coordinate. This effect is known as the peak-locking error and it strongly limits this calculation technique’s experimental accuracy. This error may differ depending on the scene and algorithm used to fit and interpolate the correlation peak, but in general, it may be attributed to a sampling problem and the presence of aliasing. Many studies in the literature analyze this effect in the Fourier domain. Here, we propose an alternative analysis on the spatial domain. According to our interpretation, the peak-locking error may be produced by a non-symmetrical sample distribution, thus provoking a bias in the result. According to this, the peak interpolant function, the size of the local domain and low-pass filters play a relevant role in diminishing the error. Our study explores these effects on different samples taken from the DIC Challenge database, and the results show that, in general, peak fitting with a Gaussian function on a relatively large domain provides the most accurate results.

Keywords:

peak-locking; cross-correlation; subpixel; Gaussian fitting; thin-plate splines; polynomial fitting

1. Introduction

Cross-correlation is a useful technique for establishing similarity between two signals. As correlation can derive from minimizing the mean square error between two signals or images [1], it is a robust tool for comparing images corrupted by Gaussian noise, which is the normal case in most circumstances during image processing where illumination is good enough. Apart from giving a similarity metric, peak location describes the position where the reference image and template match present maximum coincidence and is, thus, often used for image aligning or object tracking in a scene.

Despite the many advantages and applications of the cross-correlation [2,3,4,5,6], its standard formulation presents two main drawbacks: the dependence of the correlation result on image and template amplitudes to, thus, produce high peaks when a dark template is compared to a bright object or vice versa, even though their similarity is minimum [7] and the limited resolution, which is set, by construction, to one pixel.

The first issue is solved by using a normalized cross-correlation algorithm, which is the common approach when dealing with images. Regarding the second issue, subpixel resolution can be achieved by interpolation, which can be applied to either image before calculating their cross-correlation [8], the correlation function itself, to increase the accuracy in the location of its maximum [9]. The first approach is frequently followed to analyze deformations in solid materials as it allows for deformation mappings to be easily implemented [8]. The second one, i.e., peak interpolation, has faster and easier applications in non-deforming scenes and is, thus, adequate for aligning and tracking isolated objects. In our case, we pay attention to the second technique.

Briefly, the technique consists of interpolating the correlation function over a local area around the maximum peak and then refining the search by fitting the peak neighborhood to an analytical function [8]. This procedure may increase peak location accuracy by almost two orders of magnitude [10,11]. Despite the evident improvement, the procedure also introduces a bias error, which limits its performance. The error, known as the peak-locking or pixel-locking effect, means that the peak location obtained through local fitting is always biased toward the closest integer coordinate [12].

The origin of peak-locking has been usually attributed to an aliasing effect due to a poor image texture [13] combined with an inadequate choice of the interpolant function [14]. The problem of the aliasing can be avoided with adequate sensors and lenses. In general, according to the Nyquist limit, a pseudospeckle scene will be well sampled when the dot unit is larger than 2 px [15].

Properly choosing the interpolating function is a more delicate issue and can be better explained in the spatial domain. Consider a scene with a low-noise object and a template containing a shifted version of the object. The finer the details of the object, the narrower the correlation peak will be, since a small displacement will degrade the correspondence between object and scene [15]. If one considers a narrow local domain around the maximum of the correlation function, there will be a reduced number of samples to fit to the interpolant function. Therefore, this maximum may have an excessive weight in the fitting, and it may pull the recalculated maximum to its position in the original grid. Consequently, a bias area towards the nearer integer corresponding to the original maximum position is introduced. Notice that when the maximum is exactly in the middle of two pixels the weight distribution is balanced, and the error is 0.

One can take a larger neighborhood and thus a larger number of samples to fit in order to compensate for the excessive weight of the correlation maximum, but this would eventually include information of non-correlated positions and thus distort the final result. Additionally, since the number of samples is still not very large, any secondary peak in the neighborhood will also unbalance the fitting and pull the fitted maximum towards it.

According to this, fitting functions that only consider the peak area, i.e., quadratic functions, may show a good fitting with a small neighborhood, but would be affected by the peak-locking effect. An extensive function that considers the peak and the region around, i.e., Gaussian fitting, would compensate for this effect but would need a larger neighborhood. Therefore, a proper selection of the interpolant function together with the interpolation domain is critical to decrease the peak-locking error.

In [16], the authors proposed different interpolation algorithms in an

8 \times 8

neighborhood, and showed that the Gaussian function provided better results than the bilinear one, third-order polynomial or bicubic splines.

An additional strategy consists of filtering out the finer details of the sequence being analyzed, so that the correlation peak is softer and thus more suitable to a fitting by an analytical and derivable function. In [17], the authors introduced a defocus into the image-capturing process with an effect of reducing the peak-locking effect. This blurring was introduced experimentally by manipulating the objective. By doing so, and by adjusting the correlation peak through a Gaussian function, the results were slightly improved compared with the non-filtered results. This effect was thoroughly analyzed in [18]. Michaelis et al. [19] tested a different configuration for blurring an image using an optical diffuser. They also implemented two different interpolation functions (splines and bicubic), which gave good results for very small particle sizes.

Another strategy to diminish the peak error consists of maximizing the dynamic range of the image and template [20]. It has been shown that the accuracy in object-tracking tasks is directly related to the number of gray levels of the image [10,19]. Nevertheless, images and their luminance dynamic range are linked to the experimental setup. Hence, albeit important, it is not often a parameter that we can modify at will.

In this manuscript, we propose a combined analysis of three factors that may help to compensate for the peak-locking error and increase the accuracy of the proposed methods. Therefore, we will analyze the interaction between the two mentioned fitting functions, Gaussian and quadratic, in order to analyze their dependence with the neighborhood and its capability to reduce the error. The analysis will be complemented with the analysis of the results obtained using spline fitting. These fitting functions are more adaptable than the other two, so they may be useful in a wide range of neighborhood sizes. Additionally, the effect of a Gaussian defocus on the scene and the template is also analyzed together with the interaction with the fitting function.

The final aim of this paper is to determine which is the best method, which includes fitting function, application domain and amount of defocus, for obtaining object displacement with reduced peak-locking error. Due to the large amount of variability in images, we took a set of speckle sequences from the 2D-DIC public images bank from the Society for Experimental Mechanics (SEM) [21]. We selected the first image as a reference and the texture on it was tracked through the sequence.

Preliminary results of this analysis were presented at the SPIE Photonics Meeting 2020 [22], concluding that quadratic functions are more suitable for small neighborhoods, while Gaussian functions stand for large neighborhoods without further analysis. In what follows, we will explain the reasons for that and a final rule of thumb.

2. Methods

The purpose of the simulation is to analyze the relationship between three different peak-fitting methods and the neighborhood size together with the influence of the low-pass filters on the peak-locking method. In order to make our conclusions more general and facilitate the reproducibility of our results, we have checked our method with synthetical images taken from an image bank. Therefore, the variability due to the setup, or noise in the image, is excluded.

We tested five different sequences taken from the 2D-DIC image bank provided by the Society for Experimental Dynamics [21]. These sequences, which were synthetically generated, come with a full description of the movement and subpixel displacement of the texture, and are often used as testing benches for tracking algorithms. The sequences from 2D-DIC are images of random dots whose contrast, noise and shift differ in the distinct sequences. Figure 1 depicts the selected samples and their properties. Sequences contain horizontally and vertically shifted versions of the first image according to the specified steps. The number of frames in each sequence is the amount needed to accomplish a one-pixel accumulated displacement.

From each sequence, the provided reference was selected as the template and its position was tracked throughout the sequence by using the cross-correlation operation which is implemented here through the normalized cross-correlation algorithm, normxcorr2, in Matlab [23]:

γ (u, v) = \frac{\sum_{x, y} [f (x, y) - \bar{f_{u, v}}] [t (x - u, y - v) - \bar{t}]}{\sqrt{\sum_{x, y} {[f (x, y) - \bar{f_{u, v}}]}^{2} \sum_{x, y} {[t (x - u, y - v) - \bar{t}]}^{2}}},

(1)

where f is the image taken as a reference and t the template,

\bar{t}

is the mean value of the template and

\bar{f}

is the mean value of f(x,y) in the region under the template.

The test was carried out at full field, i.e., taking all the image as the template. Nevertheless, a frame of 8 pixels on all sides was imposed on the template in order to prevent the shifted image moving outside the boundaries of the reference image [24].

After obtaining the correlation function, a small region around the peak is selected and fitted to a soft function. These operations eventually relocate the peak inside a pixel region so that its maximum can be recalculated with incremented accuracy. As stated in the Introduction, we use a Gaussian function, a second-order polynomial and cubic spline [19,25] as fitting functions on different neighborhood sizes around the correlation peak. Cubic splines have been implemented through the thin-plate spline algorithm provided by Matlab, which provides a smoother fitting function [26,27].

The fitting was calculated on different neighborhood areas around the maximum of the correlation peak (

N b d

). This size was taken from

3 \times 3

to

11 \times 11

pixels, with the peak centered in the region, so only odd sizes were considered. Average speckle size in the samples used was estimated through autocorrelation, showing an average radius larger than 5 pixels. Therefore, areas larger than

11 \times 11

would include unmatched results, which may distort the error estimation. We also tested the influence of defocusing on the accuracy of the tracking results. In mathematical terms, blurring can be described by a convolution (3) of the image with a Gaussian function (2):

G_{r_{b}} (x, y) = e x p (- \frac{x^{2} + y^{2}}{2 \cdot r_{b}^{2}}),

(2)

f_{r_{b}} (x, y) = f ⨀ G_{r_{b}} (x, y) = \sum_{u, v} f (u, v) G_{r_{b}} (x - u, y - v),

(3)

where

r_{b}

is the blur radius.

In principle, if the camera is defocused throughout the capturing process, both the reference image and template will be blurred, so the cross-correlation between the blurred image and template can be written as (4):

C_{r_{b}} = f_{r_{b}} ⨂ t_{r_{b}} = \sum_{u, v} f (u, v) G_{r_{b}} (x + u, y + v),

(4)

where, for simplicity’s sake, we used the general definition of correlation instead of the normalized one. In any case, generalization is straightforward.

According to the basic properties of both correlation and convolution, we can rewrite Equations (4) as (5):

C_{r_{b}} = f_{r_{b}} ⨂ t_{r_{b}} = f_{r_{b}} ⨂ (t ⨀ G_{r_{b}}) = (f_{r_{b}} ⨂ t) ⨀ G_{r_{b}} = [(f ⨀ G_{r_{b}}) ⨂ t] ⨀ G_{r_{b}},

(5)

As the Gaussian function is symmetric, the above-written expression can be finally expressed as (6):

C_{r_{b}} = f_{r_{b}} ⨂ t_{r_{b}} = (f_{r_{b}} ⨂ t) ⨀ G_{r_{b}} = [(f ⨂ t) ⨀ G_{r_{b}}] ⨀ G_{r_{b}},

(6)

Thus, we can see that the effect of the blurred reference and template is a double blurring of the correlation peak. As blurring was symmetrical, the main effect was to soften the peak to, thus, make a more adequate profile for accurate fitting. Unfortunately, a double defocus can introduce excess blurring, and can also degrade the function and mask the peak, which would cancel out the obtained advantages. Therefore, it is worth analyzing the amount of blurring that provides the best possible results. Accordingly, an analysis to compare a sharp reference with a blurred template was done and is presented in the Discussion in order to check whether the double-blurring filter was redundant or not.

The calculation process started by introducing a Gaussian filter with radius

r_{b}

to both the image and template before calculating the normalized cross-correlation. The value of the radius was varied from

r_{b} = 0

(delta function, no blur) to

r_{b} = 5

. Figure 2 shows a flow chart with the algorithm implemented in Matlab. The background of the program is depicted in gray, whereas the specific parts of the fitting algorithms are represented in green (Gaussian fit), yellow (thin-plate splines) and pink (second-order polynomial fit) as they are shown in the Results. The depicted sequence was repeated for each sequence with a different

r_{b}

.

Briefly, each frame was compared with the first one in the sequence and the correlation function was obtained. Then a region around the correlation peak was fitted to three different functions over distinct neighborhood areas around the peak. The new peak position of the fitted function was then obtained. The new maximum was obtained through a minimum search of the inverse of the fitted functions, following the algorithm developed in [28]. Although it can be analytically calculated for the Gaussian and the quadratic case, we preferred to use the same method for all the functions in order to avoid distortions introduced by the calculation algorithms.

As the real movement was provided by the 2D-DIC image bank, our results were compared to the theoretical displacement to evaluate the errors and to obtain the best combination to measure target movement. The error was calculated through the mean error (

μ

) with its standard deviation (STD) and the maximum error (MaxErr) of each sequence:

μ = \frac{\sum_{i = 1}^{N} (x_{i_{c a l c}} - x_{i_{t e o r}})}{N},

(7)

S T D = \sqrt{\frac{\sum_{i = 1}^{N} {((x_{i_{c a l c}} - x_{i_{t e o r}}) - μ)}^{2}}{N}},

(8)

M a x E r r = \max (| x_{i_{c a l c}} - x_{i_{t e o r}} |) .

(9)

where

x_{i_{t e o r}}

is the reference value provided by the DIC Challenge site,

x_{i_{c a l c}}

is the value calculated through the different methods and N is the number of samples. In our case, as the measurement extended throughout the sequence, N refers to the number of frames in the evaluated sequence.

The main programs, subroutines for calculation of the maxima and obtained results can be downloaded from [29] as Supplementary Materials.

3. Results

We obtained the parameters in Equations (7)–(9) for all five samples with the three peak interpolation methods applied to neighborhoods of all odd sizes ranging from

3 \times 3

to

11 \times 11

, and with six different Gaussian filters with radii ranging from 0 (no blur) to 5, which totaled 90 tests per sample. The obtained displacement values were compared to the data provided by the DIC database and the error was evaluated. As presenting all the results would be extensive, we selected the results according to the maximum error values by simply selecting the best and worst cases for each sample and interpolation method, which corresponded to the minimum and maximum MaxError, respectively. The results are summarized in Table 1. Note that, for Table 1, for remaining calculations and graphs presented in the manuscript, errors only refer to vertical shifts. The errors obtained from the horizontal displacements were similar, but their analysis was omitted to avoid a redundant analysis.

The results showed that the best results for all the methods had mean errors below 0.005 px and standard deviations below 0.006 px. Additionally, note that the maximum error (MaxErr) for the best result in each fitting function, which can be taken as a measure of the peak-locking effect, was at least one order of magnitude smaller than the sample shift. This means that the tracking of samples was very good, provided that the parameters were well selected.

The results presented in Table 1 can serve to set the extreme results that were obtained through the different methods herein presented but did not indicate the influence of the different parameters. In order to better understand the influence of the different parameters, we depicted the error for the three fitting functions for a fixed defocus parameter

r_{b} = 2

with all the possible neighborhood sizes in Figure 3. As the

3 \times 3

neighborhood provided such bad results by the Gaussian and spline-fitting methods (see Table 1), the corresponding results were deleted from the graph. The lines for samples 3 to 5 corresponding to the

5 \times 5

area of the Gaussian fit were also deleted for the same reason. Note that a complete figure was added in the Matlab format as a Supplemental File.

The first noticeable fact in the graphs is that not all the error curves presented the typical sigmoid shape with symmetry around the 0.5 pixel shift value due to the pixel-locking effect (see the errors for Sample 1 in Figure 3). This is especially noticeable in the Gaussian case, where three of the five samples do not even show any clear trend. This unusual behavior does not imply large errors since the graphs depicted for the Gaussian case are of the same order or lower than the error obtained by the other fitting functions. Moreover, for this fitting function, and except for the non-depicted cases, the error does not strongly depend on neighborhood size, provided that it is large enough.

In the splines case, the typical sigmoid shape only appears in Samples 1, 3 and 5. The curve for

N b d = 5 \times 5

presents a very different behavior to that of the other curves. Once again, if we do not consider the anomalous cases, i.e.,

N b d = 3 \times 3

and

N b d = 5 \times 5

, there seems no systematic error dependence on neighborhood size: although we can see some differences in each individual sample, it is relatively small and dependence on size is not the same in all cases.

Finally, the curves representing the errors calculated with the second-order polynomials present the typical shape due to the peak-locking effect. In this case, the error is bigger than that obtained with the other two functions and is of the same order in all the samples. Unlike what happened with the other two fitting functions, here, we notice a marked dependence of neighborhood size, where the bigger the error, the larger the interpolation area.

Figure 4 depicts our analysis of the influence of defocusing on the error, along with the error for fixed neighborhood

N b d = 7 \times 7

and changing defocus parameter

r_{b}

. At first glance, the curves share some similarities with the curves in Figure 3, i.e., lack of the typical sigmoid shape in the same cases as before. It is also noticeable that, in all those cases, there was no significant dependence on the defocusing parameter. Notwithstanding, and as before, the error obtained by fitting the peak with a Gaussian function is equal to or lower than the error obtained by other fitting functions.

Regarding the influence of the defocus, note that the more marked the defocus, the lower the error for all the cases calculated by the quadratic polynomial. This can also be stated for the spline method, although in this case, the larger difference lies between

r_{b} = 0

and all the other cases.

According to the depicted figures, it would seem that the results obtained using the Gaussian function for fitting the peak were independent of neighborhood size and image blurring, provided that the calculation area was large enough. Moreover, the results obtained by this method did not present the typical peak-locking shape, and errors were similar to or lower than in the other methods.

Despite the obtained results, we have only analyzed the error due to the calculation area for one fixed defocusing filter and the effect of the defocus for a fixed neighborhood, respectively. Therefore, in order to gain more insight into the error dependence on the analyzed parameters, Figure 5, Figure 6 and Figure 7 reveal the plots of the variation of the three error parameters for the row shifting expressed in Equations (7)–(9) (mean error, standard deviation and maximum error, respectively) with the defocus parameter for all the neighborhood sizes and for the three fitting functions herein analyzed.

Figure 5 illustrates the mean value of the error obtained for each case according to the radius of the blurring Gaussian filter. As we were interested only in the error magnitude, the absolute value of the error is represented. A different line is depicted for each neighborhood size. As we can see in Table 1, the Gaussian function does not provide good results with small interpolation areas. Therefore, the

3 \times 3

neighborhoods graphs were deleted to facilitate the visualization of the other curves because of their large errors, with values higher than 0.5 px (see Table 1). For the same reason, the

5 \times 5

neighborhood error curve obtained for the Gaussian function was also deleted for Sample 5, with a peak close to 0.1 px. Apart from the reported cases, the mean value of the error was below 0.05 px in most cases, which went below the imposed shift between frames. The complete figure was added as Supplemental Materials to better allow the interpretation of the results.

At first glance, it would seem that the defocus increased the error when Gaussian or spline functions were used. In both cases, a defocus with radius

r_{b} = 4

seemed to produce an error reduction in some samples, but we hypothesize that this happened because the particular texture of this sequence and is not a general rule. Nevertheless, we can see that the error change due to the blurring filter was less than 1% of the shift (0.05 px in the first two samples and 0.1 in the other three). So the effect on the mean value could not be considered very strong, but could be important when using the quadratic fitting function with large interpolation areas.

Regarding the

N b d

parameter, we can see that, for the depicted Gaussian and polynomial cases, and for

N b d = 7 \times 7

,

9 \times 9

and

11 \times 11

, the larger the interpolation area, the bigger the error, but not in all cases. Note also that the

N b d = 3 \times 3

case was somewhat anomalous because in the Gaussian and spline functions, the errors obtained for the smaller case were huge in all the samples. The

5 \times 5

domain also produced large errors in Samples 3 to 5 with the Gaussian function and erratic behavior with splines. Finally, when quadratic functions are used, these two particular domains seemed to provide opposite results according to the general trend described for this case.

Despite this analysis, the mean value was a poor parameter for measuring the error. It described trends in the result but, as these sequences were artificially generated, no a strong bias was expected here, as previously seen in Figure 3 and Figure 4. In any case, we discovered that using very small interpolation areas may be problematic in the majority of cases.

Figure 6 offers the graph for the standard deviation (STD) for all the discussed cases. This parameter indicated the variability of the results. As in the previous case, the graphs with the highest values were deleted. This happened for all the curves corresponding to

N b d = 3 \times 3

in the Gaussian and spline cases, and to the

5 \times 5

curves for the Gaussian case and Samples 3 to 5. Once again, the complete graph is included as Supplemental Materials.

In the graphs, we can see that blurring may help to narrow the variability in the results. The strength of this effect depends very much on the fitting function. In the Gaussian case, the effect strongly depends on the sample as the benefit is observable only in Sample 1, while the error increases with blurring for the other samples. In the polynomial case, the effect is general, with better results for large

r_{b}

. For the spline method, the improvement is also noticeable, albeit very weak. Note that the effect on some samples is nonexistent, or even negative.

Regarding the influence of the neighborhood size, once again,

N b d = 3 \times 3

combined with Gaussian or spline functions resulted in wide variability and a large error, as mentioned above. For the Gaussian case, this dependence strongly depended on the sample as we observed the opposite behavior in different samples. With the polynomic function, the results were less dispersed the smaller the area was, which agrees with what is deduced from the table, but is the opposite to what happened with the mean error. However, the relation was very weak in that case. When spline fitting was applied, neighborhood size displays no clear dependence, except for the

3 \times 3

case.

When considering the absolute value of the standard deviation, we find that, accordingly with the previous results, the Gaussian function generally gives lower values than the other two methods.

Another useful parameter for determining the performance of each fitting function is the maximum error, which may correspond to the peak of the peak-locking error. Figure 7 displays the graphs with the maximum error for each sequence, fitting method and neighborhood in front of the blurring radius. As in previous cases, the curves corresponding to the Gaussian and spline fitting methods in a

3 \times 3

region were deleted from all the samples. The

5 \times 5

regions in Samples 3 to 5 was also deleted for the Gaussian method as they gave values around 1 px which would not, thus, allow the other cases to be visualized.

We can see that the curves depicted for the maximum error are similar to those with the standard deviation, which implies that the maximum value of the peak-locking error is probably the main source of the errors in the calculation. Hence, the depicted results confirmed the conclusions drawn from the other graphs and no further comments will be added.

4. Discussion

The results depicted above confirm the hypotheses posed in the Introduction: quadratic functions provide good results for peak fitting, but are prone to peak-locking error, while Gaussian functions give more robust results, provided that the fitting area is large enough (see Figure 4 and Figure 7). As we said, narrow peaks are supported by a few samples, so the maximum weight determines the fitting result, pulling the recalculated maximum location towards the location of the sample where the original maximum is located. Including more samples would compensate the result but would also include areas outside the peak. Eventually, the fitting neighborhood would include the peak skirt and curvature changes which would no longer fit to a paraboloidal surface, and thus the error will increase.

Gaussian functions are capable of reproducing both the peak and the planar surrounding area, although more samples are required. Because of this, the weight of the maximum is compensated, and the location error is less prone to being affected by peak-locking error. Among both situations, spline functions provide reasonable results in all neighborhood sizes, but because of their adaptability, the fitting is also biased towards the peak maximum and, thus, it is also affected by the peak-locking effect.

In order to illustrate the adaptability of each fitting function to the peak surface, we have represented in Figure 8 the correlation peak corresponding to a shift of 0.4 px in the two extreme cases calculated for Sample 1; i.e., with

3 \times 3

and

11 \times 11

neighborhoods and no blur. Although the curves may have wide variability from frame to frame and for different samples, they serve to illustrate the point herein explained. We notice there that the Gaussian function cannot adapt to the skewed shape of the local area around the maximum, while the polynomic function correctly fits to it. In the case of the larger neighborhood, the Gaussian function can reproduce the curvature change while the quadratic function just reproduces a paraboloidal dome. As we said, the spline function can adapt to both situations.

The results here shown also explain the reason why the Gaussian function seems to be more insensitive to the neighborhood size once it is large enough (see Figure 3). Because of its particular shape, the weight of the samples farthest from the center is very low and therefore will not affect the location results. In the case of the spline function, there is not such distance compensation and the result may be affected. Finally, in the quadratic case, it is clear that larger neighborhoods produce larger errors.

Regarding the blurring filter, it has a double effect in the correlation surface. On the one hand, the peak gets softer, so the fitting by smooth functions is more accurate. This affects more noticeably the results obtained with the quadratic function which are very dependent on the central samples (see Figure 4 and Figure 7). On the other hand, blurring also decreases the weight of the eccentric samples, increasing the relative weight of the central sample and thus may slightly increment the location error obtained through Gaussian fitting. In any case, the effect introduced by the Gaussian filter is similar to that produced by the Gaussian fitting function, hence explaining why the error obtained through this fitting method is not very much affected by blurring or even increases in some cases. At this point, we wish to recall Equation (6) where the mathematical formulation of the blurring filter is explained. By comparing a blurred reference to a blurred template, we obtained a double-blurring of the correlation function. As this double application of the blurring filter is redundant, we hypothesized that it is possible to compare a sharp reference to a blurred template (or vice versa) without increasing errors. Thus, we tested the results when a sharp reference was compared to a blurred template. In Figure 9, we plotted the curves with the maximum errors.

As we can see, the results are similar to those obtained using a double defocus. This means that blurring is not a decisive parameter in the peak-locking error. Although it can help to improve the results, adding a Gaussian filter (or an experimental defocus) in both the images used for the normalized cross-correlation was redundant.

This “invariability” in the template defocus proved most convenient for some experimental implementations: long-time experiments may cause small mechanical drifts in the camera or the sample and, therefore, some frames may appear slightly blurred. A defocus may take place during experiments using short depth-of-field lenses and samples whose size may change due to heat dilation, tension or swelling [30]. The herein shown results demonstrate that these changes applied to one of the images being compared had no marked effect on the final result, which remained valid and, thus, indicated that the subpixel tracking through local interpolation was a robust method.

According to the results herein presented, we can reach several conclusions. Regarding the fitting function, we found that, except for the smallest fitting region, the Gaussian fit gave the smallest errors that are, in some cases, almost one order of magnitude smaller than the errors introduced by the other methods. However, the best results in all the functions were similar, which means that all the fitting functions would display similar performance under optimal conditions.

When Gaussian or spline functions are used, small areas around the peak should be avoided. With these two functions, and except for areas

3 \times 3

and

5 \times 5

, the errors did not seem to depend on either neighborhood size or the blurring filter radius. This result agrees with the partial results represented in Figure 3 and Figure 4. On the contrary, when employing quadratic polynomials as fitting functions, both the neighborhood size and blurring strongly impacted the results, and the smaller the errors, the smaller the fitting area and the larger the defocus parameter.

The effect of blurring to improve the results was noticeable in many cases, but there are many exceptions. Hence, we cannot state that the use of blurring to diminish the error is a general benefit because it depends on both the fitting function and the sample. In any case, except for the Gaussian fitting functions, a minor defocus could help to narrow the variability of the results (taken as the standard error) and to slightly reduce the peak error. So, introducing it could be advisable.

Thus, in summary, we found the smallest error with a Gaussian fitting applied in a large neighborhood around the peak and with no blurring.

The results here obtained agree with those that appear in the literature [16], where the authors obtained better results for the Gaussian fitting than for the other methods by using an

8 \times 8

neighborhood. In [17,19], the authors reported a minor improvement when using a slight defocus and a Gaussian fitting.

5. Conclusions

In this manuscript, we tested the accuracy of the commonest subpixel tracking methods based on cross-correlation. We focused on the methods that employ local interpolation in a small area around the correlation peak to refine the maximum location. To this end, we tested the influence of neighborhood size around the peak with three different fitting functions: Gaussian, thin-plate splines and second-order polynomials. We also checked the use of defocus as a strategy for diminishing the peak-locking error and how it was affected when that defocus changed along the image sequence. All the tests were carried out in five sequences taken from the DIC Challenge site [21].

We generally noticed that the three functions provided good accuracy, and slight blurring helped to increase accuracy, despite us finding slight variation among samples. The fitting functions provided different results depending on neighborhood size. Therefore, the Gaussian function provided the best results with large neighborhoods (

11 \times 11

), while the second-order polynomial seemed to work better with small areas (

3 \times 3

). The thin-plate function apparently worked correctly with any neighborhood size. Our tests reveal that the best result was obtained for the Gaussian function with a neighborhood of

11 \times 11

and no defocus. However, as the worst adjustment was also achieved with the Gaussian function, it is important to correctly select the values of both the focus and the neighborhood depending on the fitting function. Blurring significantly improved the error for a second-order polynomial fitting, but had no clear trend for the other two studied functions.

Additionally, in the Methods section, we show that comparing two blurred images by a cross-correlation operation is the equivalent to comparing two sharp images and then introducing double deblurring into the correlation function. Accordingly, we recalculated the errors by comparing a sharp reference with a blurred template. The results showed that the performance of the three methods with all the different parameters were similar to the double deblurring case. So we can conclude that the methods presented herein are blur invariant, which means that the subpixel technique based on the interpolation of the correlation peak is robust for experimental implementations in which the focus may change due to drifts in the optical system or to the sample’s position change.

As a consequence of this, the rule of thumb that can be derived from our tests is that Gaussian fitting applied on large neighborhoods around the maximum of the correlation function may provide the most accurate results in pseudospeckle images without the need for blurring filters.

Supplementary Materials

The following are available online at http://rua.ua.es/dspace/handle/10045/110141, Matlab code: Calculate_subpixel_errors.m; Results: all numerical results in Matlab binary format.

Author Contributions

D.M. and B.F. conceived the presented idea and developed the theory. M.-B.T. implemented the programs, ran the tests and proposed some interesting conclusions. D.M. and M.-B.T. presented the final data. All authors have supervised the findings of this work. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been supported by the Generalitat Valenciana and the European Social Fund (FSE) through the Recruitment of Predoctoral Research Staff ACIF/2018/211 included in the FSE Operational Program 2014–2020 of the Valencian Community. Belén Ferrer and María-Baralida Tomás acknowledge the support of the Generalitat Valenciana through Project GV/2020/077.

Conflicts of Interest

The authors declare no conflict of interest.

References

Maragos, P. Morphological correlation and mean absolute error criteria. In Proceedings of the International Conference on Acoustics Speech, and Signal Processing, Glasgow, UK, 23–26 May 1989; pp. 1568–1571. [Google Scholar]
Stanier, S.A.; Blaber, J.; Take, W.A.; White, D.J. Improved image-based deformation measurement for geotechnical applications. Can. Geotech. J. 2016, 53, 727–739. [Google Scholar] [CrossRef]
Dias-da-Costa, D.; Valença, J.; Júlio, E.; Araújo, H. Crack propagation monitoring using an image deformation approach. Struct. Control Health Monit. 2017, 24, e1973. [Google Scholar] [CrossRef]
Vora, S.R.; Bognet, B.; Patanwala, H.S.; Young, C.D.; Chang, S.-Y.; Daux, V.; Ma, A.W.K. Global strain field mapping of a particle-laden interface using digital image correlation. J. Colloid Interface Sci. 2018, 509, 94–101. [Google Scholar] [CrossRef]
Bai, R.; Wei, Y.; Lei, Z.; Jiang, H.; Tao, W.; Yan, C.; Li, X. Local zone-wise elastic-plastic constitutive parameters of Laser-welded aluminium alloy 6061 using digital image correlation. Opt. Lasers Eng. 2018, 101, 28–34. [Google Scholar] [CrossRef]
Peña, J.A.; Corral, V.; Martínez, M.A.; Peña, E. Over length quantification of the multiaxial mechanical properties of the ascending, descending and abdominal aorta using Digital Image Correlation. J. Mech. Behav. Biomed. Mater. 2018, 77, 434–445. [Google Scholar] [CrossRef] [Green Version]
Lewis, J.P. Fast Normalized Cross-Correlation. Industrial Light & Magic 1995, 10, 7. [Google Scholar]
Schreier, H.; Orteu, J.-J.; Sutton, M.A. Image Correlation for Shape, Motion and Deformation Measurements; Springer: Boston, MA, USA, 2009; ISBN 978-0-387-78746-6. [Google Scholar]
Lei, X.; Jin, Y.; Guo, J.; Zhu, C. Vibration extraction based on fast NCC algorithm and high-speed camera. Appl. Opt. 2015, 54, 8198. [Google Scholar] [CrossRef] [Green Version]
Ferrer, B.; Mas, D. Parametric evaluation of errors using isolated dots for movement measurement by image cross-correlation. Sensors 2018, 18, 525. [Google Scholar] [CrossRef] [Green Version]
Ferrer, B.; Espinosa, J.; Mas, D. A method to measure small local strains in concrete surfaces using its natural texture and image cross-correlation. Struct. Control Health Monit. 2019, 26. [Google Scholar] [CrossRef] [Green Version]
Nogueira, J.; Lecuona, A.; Nauri, S.; Legrand, M.; Rodríguez, P.A. Quantitative evaluation of PIV peak locking through a multiple Δt strategy: Relevance of the rms component. Exp. Fluids 2011, 51, 785–793. [Google Scholar] [CrossRef]
Sjödahl, M.; Benckert, L.R. Systematic and random errors in electronic speckle photography. Appl. Opt. 1994, 33, 7461. [Google Scholar] [CrossRef]
Murray, C.A.; Hoult, N.A.; Take, W.A. Dynamic measurements using digital image correlation. Int. J. Phys. Model. Geotech. 2017, 17, 41–52. [Google Scholar] [CrossRef]
Stanier, S.; Dijkstra, J.; Leśniewska, D.; Hambleton, J.; White, D.; Muir Wood, D. Vermiculate artefacts in image analysis of granular materials. Comput. Geotech. 2016, 72, 100–113. [Google Scholar] [CrossRef] [Green Version]
Nobach, H.; Damaschke, N.; Tropea, C. High-precision sub-pixel interpolation in particle image velocimetry image processing. Exp. Fluids 2005, 39, 299–304. [Google Scholar] [CrossRef]
Overmars, E.F.J.; Warncke, N.G.W.; Poelma, C.; Westerweel, J. Bias Errors in PIV: The Pixel Locking Effect Revisited 15th Int Symp on Applications of Laser Techniques to Fluid Mechanics; ITCES: Lisbon, Portugal, 2010; p. 10. [Google Scholar]
Zhou, Y.; Sun, C.; Song, Y.; Chen, J. Image pre-filtering for measurement error reduction in digital image correlation. Opt. Lasers Eng. 2015, 65, 46–56. [Google Scholar] [CrossRef]
Michaelis, D.; Neal, D.R.; Wieneke, B. Peak-locking reduction for particle image velocimetry. Meas. Sci. Technol. 2016, 27, 104005. [Google Scholar] [CrossRef]
Mas, D.; Perez, J.; Ferrer, B.; Espinosa, J. Realistic limits for subpixel movement detection. Appl. Opt. 2016, 55, 4974. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Challenge Dataset 1.0:2D-DIC. Available online: https://sem.org/2ddic (accessed on 10 July 2020).
Tomás López, M.B.; Mas, D.; Ferrer, B. Peak-locking minimization by three adjustment methods. In Proceedings of the Optics, Photonics and Digital Technologies for Imaging Applications VI; Schelkens, P., Kozacki, T., Eds.; SPIE: Jakarta, France, 2020; p. 53. [Google Scholar]
Matlab, version R2020a; MathWorks: Natick, MA, USA, 2020.
Sjödahl, M. Accuracy in electronic speckle photography. Appl. Opt. 1997, 36, 2875. [Google Scholar] [CrossRef] [PubMed]
Roesgen, T. Optimal subpixel interpolation in particle image velocimetry. Exp. Fluids 2003, 35, 252–256. [Google Scholar] [CrossRef] [Green Version]
Thin Plate Spline. Wikipedia. Available online: https://en.wikipedia.org/wiki/Thin_plate_spline (accessed on 16 October 2020).
Thin-Plate Smoothing Spline—MATLAB Tpaps—MathWorks España. Available online: https://es.mathworks.com/help/curvefit/tpaps.html (accessed on 16 October 2020).
D’Errico, J. Fminsearchbnd, Fminsearchcon. Available online: https://es.mathworks.com/matlabcentral/fileexchange/8277-fminsearchbnd-fminsearchcon (accessed on 16 October 2020).
Tomás, M.B.; Ferrer, B.; Mas, D. Supplementary Materials on Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy. Available online: http://rua.ua.es/dspace/handle/10045/110141 (accessed on 5 November 2020).
Yang, D.S.; Bornert, M.; Chanchole, S.; Gharbi, H.; Valli, P.; Gatmiri, B. Dependence of elastic properties of argillaceous rocks on moisture content investigated with optical full-field strain measurement techniques. Int. J. Rock Mech. Min. Sci. 2012, 53, 45–55. [Google Scholar] [CrossRef]

Figure 1. Images of each sample selected from 2-D DIC from the Society for Experimental Mechanics (SEM) with random dots and their properties [22].

Figure 2. Flow chart with the algorithm implemented in Matlab to reduce the peak-locking error.

Figure 3. Location error curves obtained for

r^{b} = 2

. Curves for the parameter

N b d = 3 \times 3

in Gaussian and spline cases have not been represented for better visualization purposes. Curves for the parameter

N b d = 5 \times 5

have also not been represented for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 3. Location error curves obtained for

r^{b} = 2

. Curves for the parameter

N b d = 3 \times 3

in Gaussian and spline cases have not been represented for better visualization purposes. Curves for the parameter

N b d = 5 \times 5

have also not been represented for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 4. Location error curves obtained for

N b d = 7 \times 7

. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 4. Location error curves obtained for

N b d = 7 \times 7

. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 5. Mean errors calculated for all the samples, fitting functions and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 5. Mean errors calculated for all the samples, fitting functions and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 6. Standard deviations calculated for all the samples, fitting functions and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 6. Standard deviations calculated for all the samples, fitting functions and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 7. Maximum error calculated for all the samples, fitting functions and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 7. Maximum error calculated for all the samples, fitting functions and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 8. Peak adjustment for Sample 1 with a 0.4 shift using no blur and (a)

3 \times 3

and (b)

11 \times 11

neighborhoods. For each neighborhood, we show the correlation peak and the reconstructed surface by employing the fitting functions specified above each plot.

Figure 8. Peak adjustment for Sample 1 with a 0.4 shift using no blur and (a)

3 \times 3

and (b)

11 \times 11

neighborhoods. For each neighborhood, we show the correlation peak and the reconstructed surface by employing the fitting functions specified above each plot.

Figure 9. Maximum error calculated when comparing sharp references with blurred templates for all the samples, fitting methods and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Figure 9. Maximum error calculated when comparing sharp references with blurred templates for all the samples, fitting methods and different neighborhood sizes versus the Gaussian filter radius. The graphs for the

3 \times 3

region in the Gaussian and spline fitting surfaces were deleted for better visualization purposes. The graphs for the

5 \times 5

region were also deleted for Samples 3 to 5 in the Gaussian case. The complete graph in Matlab format can be downloaded as a Supplemental Materials from [29].

Table 1. Minimum (best adjustment) and maximum (worst adjustment) values in pixels of

μ

, STD and maximum error (MaxErr) in each adjustment type with the values of

N b d

and

r_{b}

that provided the results. Below the sample number, the shift between successive frames is specified. The best and worst adjustments in the table are highlighted in red, and both cases were produced with the Gaussian function, with the best in Sample 3 and the worst in Sample 2.

Table 1. Minimum (best adjustment) and maximum (worst adjustment) values in pixels of

μ

, STD and maximum error (MaxErr) in each adjustment type with the values of

N b d

and

r_{b}

that provided the results. Below the sample number, the shift between successive frames is specified. The best and worst adjustments in the table are highlighted in red, and both cases were produced with the Gaussian function, with the best in Sample 3 and the worst in Sample 2.

		Gaussian Fit		Thin-Plate Splines		2^nd-Order Polynomial Fit
		Best	Worst	Best	Worst	Best	Worst
	$µ \pm σ$	4 × 10⁻⁴ ± 6 × 10⁻⁴	0.7 ± 0.7	−0.001 ± 7 × 10⁻⁴	0.004 ± 0.09	−9 × 10⁻⁴ ± 9 × 10⁻⁴	0.009 ± 0.1
Sample	MaxErr	0.0012	2.3436	0.0021	0.1105	0.0033	0.1638
1	Nbd	$11 \times 11$	$3 \times 3$	$9 \times 9$	$3 \times 3$	$3 \times 3$	$11 \times 11$
0.05px	r_b	0	3	5	0	5	0
	$µ \pm σ$	9 × 10⁻⁴ ± 0.005	1 ± 0.7	0.001 ± 0.004	0.006 ± 0.09	8 × 10⁻⁴ ± 0.005	0.006 ± 0.07
Sample	MaxErr	0.0122	1.9149	0.0096	0.1287	0.0111	0.1152
2	Nbd	$11 \times 11$	$3 \times 3$	$9 \times 9$	$3 \times 3$	$3 \times 3$	$11 \times 11$
0.05px	r_b	3	4	5	0	5	0
	$µ \pm σ$	−3 × 10⁻⁴ ± 3 × 10⁻⁴	0.8 ± 0.7	3 × 10⁻⁴ ± 4 × 10⁻⁴	−0.008 ± 0.09	2 × 10⁻⁴ ± 7 × 10⁻⁴	−0.006 ± 0.04
Sample	MaxErr	7.67 × 10⁻⁴	2.3761	0.0010	0.1155	0.0012	0.0596
3	Nbd	$7 \times 7$	$3 \times 3$	$9 \times 9$	$3 \times 3$	$3 \times 3$	$11 \times 11$
0.1px	r_b	4	2	5	0	5	0
	$µ \pm σ$	4 × 10⁻⁴ ± 0.003	0.7 ± 0.6	−0.001 ± 0.006	−0.006 ± 0.1	−0.003 ± 0.004	−0.02 ± 0.09
Sample	MaxErr	0.0063	2.0000	0.0125	0.1537	0.0094	0.1436
4	Nbd	$7 \times 7$	$3 \times 3$	$9 \times 9$	$3 \times 3$	$3 \times 3$	$11 \times 11$
0.1px	r_b	0	5	2	0	2	0
	$µ \pm σ$	9 × 10⁻⁴ ± 0.001	0.7 ± 0.7	0.002 ± 0.001	0.008 ± 0.1	0.002 ± 0.001	0.02 ± 0.1
Sample	MaxErr	0.0026	1.9000	0.0033	0.1273	0.0034	0.1651
4	Nbd	$11 \times 11$	$3 \times 3$	$9 \times 9$	$3 \times 3$	$3 \times 3$	$11 \times 11$
0.1px	r_b	0	4	4	0	5	0

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tomás, M.-B.; Ferrer, B.; Mas, D. Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy. Sensors 2020, 20, 6596. https://doi.org/10.3390/s20226596

AMA Style

Tomás M-B, Ferrer B, Mas D. Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy. Sensors. 2020; 20(22):6596. https://doi.org/10.3390/s20226596

Chicago/Turabian Style

Tomás, María-Baralida, Belén Ferrer, and David Mas. 2020. "Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy" Sensors 20, no. 22: 6596. https://doi.org/10.3390/s20226596

APA Style

Tomás, M.-B., Ferrer, B., & Mas, D. (2020). Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy. Sensors, 20(22), 6596. https://doi.org/10.3390/s20226596

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy^†

Abstract

1. Introduction

2. Methods

3. Results

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy †

Abstract

1. Introduction

2. Methods

3. Results

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Influence of Neighborhood Size and Cross-Correlation Peak-Fitting Method on Location Accuracy^†