You are currently viewing a new version of our website. To view the old version click .
Sensors
  • Article
  • Open Access

23 July 2020

Efficient Similarity Point Set Registration by Transformation Decomposition

,
and
1
Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China
2
Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Shanghai 200032, China
3
Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
*
Author to whom correspondence should be addressed.
This article belongs to the Section Remote Sensors

Abstract

Point set registration is one of the basic problems in computer vision. When the overlap ratio between point sets is small or the relative transformation is large, local methods cannot guarantee the accuracy. However, the time complexity of the branch and bound (BnB) optimization used in most existing global methods is exponential in the dimensionality of parameter space. Therefore, seven-Degrees of Freedom (7-DoF) similarity transformation is a big challenge for BnB. In this paper, a novel rotation and scale invariant feature is introduced to decouple the optimization of translation, rotation, and scale in similarity point set registration, so that BnB optimization can be done in two lower dimensional spaces. With the transformation decomposition, the translation is first estimated and then the rotation is optimized by maximizing a robust objective function defined on consensus set. Finally, the scale is estimated according to the potential correspondences in the obtained consensus set. Experiments on synthetic data and clinical data show that our method is approximately two orders of magnitude faster than the state-of-the-art global method and more accurate than a typical local method. When the outlier ratio with respect to the inliers is up to 1.0, our method still achieves accurate registration.

1. Introduction

Point set registration is a fundamental problem in computer vision. It is widely used in three-dimensional reconstruction [,], medical image analysis [,,], mobile robots [,] and autonomous driving []. The goal of point set registration is to estimate a transformation to align two point sets, namely moving point set and reference point set. This paper focuses on the scenarios of estimating a similarity transformation between two 3D point sets.
A large body of methods have been proposed to solve the three-dimensional rigid point set registration problem [,]. Similarity point set registration has only increased a scale factor compared to rigid registration, so many studies have extended the rigid registration method to estimate similarity transformation. The iterative closest point (ICP) [] is the most typical rigid registration method; it has been extended for similarity point set registration in many researches [,,,,,,]. However, the optimization scheme of ICP is the expectation maximization (EM) [] type so it can only converge to a local optimum. Another line of research represents each point set with a probability density and registers the two point sets by aligning the two probability densities [,,]. Some methods use similar ideas to achieve non-rigid registration between two point sets []. Though these non-rigid point set registration methods can also be used for similarity registration, these methods can also only converge to a local optimum.
In recent years, some global point set registration methods have emerged, such as Go-ICP [], GOGMA [] and so on [,,,]. These methods parameterize rigid point set registration with the rigid transformation, which consists of a three-Degrees of Freedom (3-DoF) rotation and 3-DoF translation between the two point sets to be registered, but they cannot be simply extended to estimate 7-DoF similarity transformation, which consists of another DoF scale between the two point sets. In addition, most of these global methods use branch and bound (BnB) optimization framework, but the time complexity of BnB optimization is exponential in the dimensionality of the problem. It is already very slow for these global methods to perform 6-DoF rigid registration, and it will be very inefficient for them to do 7-DoF similarity registration. Asymmetric point matching (APM) [] parameterizes point set registration with the correspondence between points from each set and develops a deterministic global method to solve affine point set registration problem. APM can also be used to solve the similarity point set registration problem, but it assumes a point-to-point correspondence, which makes it difficult to register partially overlapping point sets or point sets with outliers.
In this paper, we propose a global similarity point set registration method by using BnB optimization framework. To avoid the inefficiency of the BnB-based method in solving high-dimensional problems, we introduce a transformation decomposition approach so that the translation, rotation and scale can be estimated separately. Two BnB-based algorithms are used to globally estimate the 3-DoF translation and the 3-DoF rotation, and the two algorithms are fast because the problem dimensionality is low. To the best of our knowledge, this is the first global similarity point set registration method. In addition, a consensus set-based objective function is used for the translation and rotation estimation, so the proposed method is robust to outlier and partial-overlap. Extensive experiments show that the proposed method is approximately two order of magnitudes faster than state-of-the-art global method and much more accurate than local methods in similarity point set registration.
The rest of this paper is organized as follows. In Section 2, the related work is reviewed. The proposed method is introduced in Section 3. In Section 4, results on synthetic and clinical data are presented and discussed. Finally, conclusions and limitations are discussed in Section 5.

3. Method

Suppose there is a similarity transformation between the moving point set X = x i i = 1 X and the reference point set Y = y j j = 1 Y . For a pair of corresponding points y j Y and x i X , we have
y j = s R * x i + t *
where t * R 3 and R * S O ( 3 ) are the translation and rotation from X to Y , respectively, and s is the scale.
Most existing BnB-based global methods jointly search every parameter. In this scenario, the 7-DoF similarity transformation is a big challenge for BnB, because the time complexity of BnB optimization is exponential in the dimensionality of the problem. In this paper, an efficient similarity point set registration method based on transformation decomposition is proposed. The RSIF is first introduced to decouple similarity transformation in Section 3.1. Then in Section 3.2, the BnB-based global optimal translation search algorithm is given to align the RSIF sets generated from the two original point sets to be registered. In Section 3.3, the complete similarity point set registration algorithm is given.

3.1. Rotation and Scale Invariant Feature

For a set of three points x i 1 , x i 2 , x i 3 from the moving point set, a triple ( x i 1 , x i 2 ) ( x i 1 , x i 3 ) ( x i 2 , x i 3 ) is constructed, where , denotes the angular distance between two vectors. This triple is invariant with respect to scale and rotation of x i 1 , x i 2 and x i 3 around the origin. Thus,
( s Rx i 1 , s Rx i 2 ) ( s Rx i 1 , s Rx i 3 ) ( s Rx i 2 , s Rx i 3 ) = ( x i 1 , x i 2 ) ( x i 1 , x i 3 ) ( x i 2 , x i 3 )
where R S O ( 3 ) and s R . This triple is called a RSIF.
Let p i 1 , i 2 , i 3 = ( x i 1 , x i 2 ) ( x i 1 , x i 3 ) ( x i 2 , x i 3 ) denote the RSIF constructed from a set of moving points x i 1 , x i 2 , x i 3 and q j 1 , j 2 , j 3 = ( y j 1 , y j 2 ) ( y j 1 , y j 3 ) ( y j 2 , y j 3 ) denote the RSIF constructed from a set of reference points y j 1 , y j 2 , y j 3 . According to (1) and (2),
q j 1 , j 2 , j 3 = ( y j 1 , y j 2 ) ( y j 1 , y j 3 ) ( y j 2 , y j 3 ) = ( s R ( x i 1 + t ) , s R ( x i 2 + t ) ) ( s R ( x i 1 + t ) , s R ( x i 3 + t ) ) ( s R ( x i 2 + t ) , s R ( x i 3 + t ) ) = ( x i 1 + t , x i 2 + t ) ( x i 1 + t , x i 3 + t ) ( x i 2 + t , x i 3 + t ) : = F ( p i 1 , i 2 , i 3 , t )
A function F ( p , t ) is defined to express that the two RSIFs are related by the translation t .
P = p m m = 1 M is used to denote the set of RSIFs constructed from point sets x i 1 , x i 2 , x i 3 , where x i 1 , x i 2 , x i 3 X and i 1 i 2 i 3 . Q = q n n = 1 N is used to denote the set of RSIFs constructed from point sets y j 1 , y j 2 , y j 3 , where y j 1 , y j 2 , y j 3 Y and j 1 j 2 j 3 . The problem of finding the optimal translation from X to Y is changed to be finding the optimal translation from P to Q . In practice, the cardinalities of P and Q are very large, so we match a subset of each of them as P = p m m = 1 M and Q = q n n = 1 N . The steps to obtain the subsets to match are as follows.
  • All the three-point combinations are screened from moving and reference point set. Each three-point combination can construct a RSIF. This paper chooses 300 RSIFs with the largest angular distance, and denotes them as P 1 and Q 1 .
  • For p i 1 , i 2 , i 3 = ( x i 1 , x i 2 ) ( x i 1 , x i 3 ) ( x i 2 , x i 3 ) , it is constructed from a set x i 1 , x i 2 , x i 3 . Take the difference between these three vectors and obtain a new vector g i 1 , i 2 , i 3 = x i 1 x i 2 x i 1 x i 3 x i 2 x i 3 . Take the angular distance for every two dimensions of g i 1 , i 2 , i 3 , and we have g i 1 , i 2 , i 3 = ( x i 1 x i 2 , x i 1 x i 3 ) ( x i 1 x i 2 , x i 2 x i 3 ) ( x i 1 x i 3 , x i 2 x i 3 ) .
  • Without loss of generality, it assumes that q j 1 , j 2 , j 3 = ( y j 1 , y j 2 ) ( y j 1 , y j 3 ) ( y j 2 , y j 3 ) is the corresponding RSIF of p i 1 , i 2 , i 3 . Following the previous step, we have k j 1 , j 2 , j 3 = y j 1 y j 2 y j 1 y j 3 y j 2 y j 3 and k j 1 , j 2 , j 3 = ( y j 1 y j 2 , y j 1 y j 3 ) ( y j 1 y j 2 , y j 2 y j 3 ) ( y j 1 y j 3 , y j 2 y j 3 ) = ( s R ( x i 1 + t ) s R ( x i 2 + t ) , s R ( x i 1 + t ) s R ( x i 3 + t ) ) ( s R ( x i 1 + t ) s R ( x i 2 + t ) , s R ( x i 2 + t ) s R ( x i 3 + t ) ) ( s R ( x i 1 + t ) s R ( x i 3 + t ) , s R ( x i 2 + t ) s R ( x i 3 + t ) ) = ( x i 1 x i 2 , x i 1 x i 3 ) ( x i 1 x i 2 , x i 2 x i 3 ) ( x i 1 x i 3 , x i 2 x i 3 ) .
  • According to the previous steps, g i 1 , i 2 , i 3 and k j 1 , j 2 , j 3 should be equal when the data is clean, and the Euclidean distance between them is close to 0, if there exists noise in data.
  • Finally, all the RSIFs that satisfy the condition g i 1 , i 2 , i 3 k j 1 , j 2 , j 3 ε f are chosen from P 1 and Q 1 , and denoted as P and Q , respectively.
On the basis of the consensus set, our objective function of similarity transformation is defined by
E t ( t ) = m max n F ( p m , t ) q n ε t
where is an indicator function that returns 1 if the inner condition is true and 0 otherwise, denotes the Euclidean norm in R 3 , and ε t is the translation search inlier threshold. The optimal translation is obtained by maximizing E t t .

3.2. Global Translation Search

When BnB-based algorithm is used to globally solve the maximization problem (4), the key is to find a way to calculate the upper bound of E t in a branch T of the parameter space of translation, which means we need to find a function E t ( T ) ¯ satisfying
E t ( T ) ¯ m max n F ( p m , t ) q n ε t , t T
Define D ( t ) = F ( p m , t ) q n and if it is possible to find a lower bound function
D ( T ) ̲ D t , t T
then use E t ( T ) ¯ = m max n D ( T ) ̲ ε t as the upper bound of E t in the branch T . Therefore, the problem becomes how to find D ( T ) ̲ given a translation cube T . Here, this problem is addressed by natural interval extension [,], in which we need to first calculate the bound of each element of F ( p m , t ) when t T . Since F ( p m , t ) is computed identically for each dimension, the bounds used in this paper are described using the first dimension as an example.
For p i 1 , i 2 , i 3 = ( x i 1 , x i 2 ) ( x i 1 , x i 3 ) ( x i 2 , x i 3 ) , without loss of generality, derive the bound of its first element x i 1 + t , x i 2 + t , when t falls in a branch T . It turns out to be difficult to calculate a tight angular distance bound, but a simple yet loose bound can be directly obtained from previous studies [,].
When a point x is translated by a translation in a cubic branch T , the vector corresponding to the translated point falls in a confined range, which is called the uncertainty angle bound in []. The uncertainty angle bound is the maximum deviation of the vector in the range from the one corresponding to point x translated by the center of T . The bound proposed by [] is directly used in this paper. Given a 3D point x and a cubic translation branch T centered at t 0 with half space diagonal δ t , then
( x + t 0 , x + t ) { arcsin ( δ t x + t 0 ) i f δ t x + t 0 π o t h e r w i s e : = β ( x , T )
The proof of (7) can be found in [].
Thus, the angular distance bound of x i 1 + t , x i 2 + t when t T can be determined directly by using the results of (7):
max ( ( x i 1 + t 0 , x i 2 + t 0 ) β ( x i 1 , T ) β ( x i 2 , T ) , 0 )
( x i 1 + t , x i 2 + t ) min ( ( x i 1 + t 0 , x i 2 + t 0 ) + β ( x i 1 , T ) + β ( x i 2 , T ) , π )
After calculating the bounds of x i 1 + t , x i 2 + t , where t T , the second and third elements of F p m , t can also be obtained. Then obtain D ( T ) ̲ by natural interval extension and the upper bound E t ( T ) ¯ for a translation branch T . By utilizing the upper bound E t ( T ) ¯ , we can search the translation space to find the globally optimal translation that maximizes the objective function (4) by BnB. The algorithm is outlined in Algorithm 1.
Algorithm 1:Globally Optimal Translation Search Based on RSIFs.
Sensors 20 04103 i001
After the global optimal translation t * has been found, the moving point set X = x i i = 1 X is translated to X * = x i * i = 1 X where x i * = x i + t * . Thus, there are a relative rotation and scale between X * and Y . We first estimate the optimal R * .

3.3. Similarity Point Set Registration Algorithm

After the global optimal translation t * has been found, another BnB algorithm similar to the rotation search method in [,] is used to search the global optimal rotation about the origin. Again, define the objective function on the basis of the cardinality of the inlier set:
E r ( R ) = i max j ( Rx i * , y j ) ε r
where ε r is the inlier threshold. Please note that this objective function is not influenced by the relative scale between the two point sets, so the rotation can be calculated without considering the scale factor.
To maximize (9) with BnB, it is required to find an upper bound for a branch of its parameter space. Here, the axis-angle representation of the rotation is used, and then all rotations are contained in a ball of radius π . As in many similar works [,], we enclose the ball with a cube with side length 2 π as the initial branch of BnB. Given two rotation vectors a and b in the parameter space of rotation and a 3D vector u , it was established in [] that
( R a u , R b u ) a b
where R a and R b are the matrix forms of rotation corresponding to a and b , respectively. Furthermore, given a cube B in the parameter space of rotation, let v p and v q be points at two opposite corners of B . Then, c : = ( v p + v q ) / 2 is the center of B , and its corresponding rotation matrix is denoted as R c . For any rotation a situated in the cube B ,
( R c u , R a u ) min { c a , π } min { v p v q 2 , π } : = μ B
Then, for R B ,
R c x i * , y j μ B Rx i * , y j R c x i * , y j + μ B
Then, the upper bound of the objective function can be defined as
E r ( B ) ¯ = i max j ( R c x i * , y j ) ( ε r + μ B )
Utilizing the upper bound given by Equation (13), we can search the rotation space to find the globally optimal rotation that maximizes the objective function (9). The algorithm is outlined in Algorithm 2.
Algorithm 2:Globally Optimal Rotation Search.
Sensors 20 04103 i002
By using Algorithm 2, the global optimal rotation can be obtained. At this point, there is only a scale relationship between the two sets, and the scale can be calculated according to the potential correspondences established in Algorithm 2. Concretely, apply the obtained rotation and translation to the original moving point set, and calculate the angle between any two points of the transformed moving point set and reference point set and the origin. For each point of the moving point set, find the point with the smallest angular distance in the reference point set, which is regarded as the corresponding point. Each pair of corresponding points calculates a scale, and takes the median of these scales as the scale factor estimated by our method.
The full similarity point set registration algorithm is composed of a translation search by Algorithm 1, a rotation search by Algorithm 2, and a final calculation of the scale factor. It should be noted that the decomposed problem is not exactly the same as the original problem formulated in Equation (1), but in practice, it is feasible to find the translation, the rotation and the scale separately.

4. Results and Discussion

In this section, we evaluate the performance of our method and compare it against state-of-the-art global and local methods in similarity point set registration. Because there are few new methods dedicated for similarity point set registration, we choose to compare to two recent methods, APM [] and CPD [], which can be used for similarity registration. APM and CPD are the state-of-the-art methods for global affine registration and local deformable registration, respectively. In this experiment, we study the runtimes and accuracy of these three methods on synthetic data and clinical data. Our method is implemented in MATLAB R2017b, except the parts of objective function calculation and collinear points screening, which were implemented in C. The code for the comparing methods was obtained from the authors. To avoid excessive runtimes, we set a limit for every method: if it was unable to converge and return a final solution within 1200 s, it will be terminated by force. In the experiment with synthetic data, the ground truth translation, rotation and scale between the moving point set and the reference point set are known. The accuracy of each method is mainly evaluated by success rate, where success means that the angle between the ground truth rotation and the output rotation is less than 0.1 radian and the translation error relative to the ground truth t t g t / t g t is less than 0.1, where t is the estimated translation and t g t is the ground truth translation, and the absolute error between the output scale and the ground truth scale is less than 0.1. All experiments were performed on a laptop with a 2.21 GHz Intel(R) Core(TM) i7-8750H CPU and 8 GB of RAM.

4.1. Synthetic Data

We experimented on point sets from four different data sets, which were Chef data set from University of Western Australia [,], Lucy data set from the Stanford 3D Scanning Repository [], Archer and Kids and Owl data sets from EPFL Computer Graphics and Geometry Laboratory [] and random point data uniformly distributed in [ 1 , 1 ] 3 . The original 3D models used in this experiment are shown in Figure 1. For our method, the inlier threshold of RSIFs screening was set to 0.01, the inlier threshold of translation search was set to 1, and the inlier threshold of rotation search was set to 10°. The iterations and tolerance error of CPD were set to 10,000 and 1 × 10 8 , respectively. For APM, we used three different tolerance errors: 0.1, 0.2 and 0.3.
Figure 1. The original 3D models. Archer, Chef, Kids, Lucy and Owl.

4.1.1. Runtime Comparison with APM and CPD

In this experiment, the runtimes of the three methods with respect to the number of points are studied. The original 3D models were first randomly down-sampled to different numbers of points, and then these points were uniformly scaled to fit in a cube 1 , 1 3 . For random point data, different numbers of uniformly distributed 3D points from 1 , 1 3 were randomly generated. The down-sampled model data sets and random points were used as moving point sets. For each moving point set, a random similarity transformation ( R , t , s ) (where 1 s 5 ) was applied to generate a reference point set. For each number of points, we performed 20 registrations with different relative random transformations. Figure 2 shows the median runtime and success rate of APM, CPD and our method with respective to the number of points. We use five models and random data to evaluate our method, APM and CPD. The left column of Figure 2 shows the median runtime of APM, CPD and our method. We set three different tolerance errors, 0.1, 0.2 and 0.3, to APM as denoted APM(0.1), APM(0.2) and APM(0.3). APM exceeded 1200 s in most experiments. On the contrary, as a global method, ours can converge within 1200 s in all experiments. The median runtime of our method on Archer and Kids data sets are less than 40 s, which is 30 times faster than APM. This is because we decouple the translation, rotation, and scale, and we optimize the translation and rotation separately by using two BnB algorithms. The right column of Figure 2 shows the success rate of CPD and our method, and the success rate of APM is not plotted because it cannot terminate in the time limit in most cases. We can see our method achieved 100% success rate on random data (first row), Archer (2nd row), Lucy (5th row) and Owl (last row), although our method failed once in 20 registrations of Chef data with 600 points and twice in 20 registrations of Kids data with 800 points. Meanwhile, the success rate of CPD is very low, because it can only converge to a local optimum. From these results we can see that though the local method CPD is much faster than global methods, its success rate is very low without a proper initialization. Global methods tend to be slow, but our method is much faster than APM.
Figure 2. Median runtime and success rate with respective to the number of points. Random data (first row), Archer (2nd row), Chef (3rd row), Kids (4th row), Lucy (5th row) and Owl (last row). Left column: The median runtimes of our method, asymmetric point matching (APM) and coherent point drift (CPD) with respect to the number of points. Right column: The mean success rates of our method and CPD with respect to the number of points.

4.1.2. Robustness to Outliers

In this experiment, we compare the robustness of our method, APM and CPD to outliers using the same raw data as in Section 4.1.1. These original model data sets were first randomly down-sampled to 200 points, and then each model was scaled uniformly to make it fit in a cube of 1 , 1 3 . For random point data, 200 points uniformly distributed in 1 , 1 3 were randomly generated. The down-sampled model data sets and random points were used as moving point sets. For each moving point set, a random similarity transformation ( R , t , s ) (where 1 s 5 ) was applied and random gross outlier points with different outlier percentages with respect to the inliers were added to obtain the reference point sets. For each outlier ratio, 20 registrations under different random transformations were performed. Figure 3 shows the median runtime and success rate of CPD and our method. Please note that the results of APM are not shown in Figure 3, because APM was unable to terminate within 1200 s in most experiments. The left column of Figure 3 shows the median runtime of CPD and our method. The right column of Figure 3 shows the success rate of CPD and our method. Just as mentioned above, global method APM is very slow, while our method can converge within 20 s in all experiments. Although CPD is fast, it cannot return an accurate result in most cases. Our method achieved 100% success rate, which indicates that our method is very robust to gross outliers. From Figure 3, we can see our method is much faster than the global method APM and more accurate than local method CPD. The reason is that our method uses BnB to optimize the translation and rotation separately, which can guarantee the global optimal, and the decomposition strategy greatly improve the efficiency of BnB.
Figure 3. Median runtime and success rate with respective to the outlier ratio. Random data (first row), Archer (2nd row), Chef (3rd row), Kids (4th row), Lucy (5th row) and Owl (last row). Left column: The median runtimes of our method and CPD with respect to the outlier ratio. Right column: The mean success rates of our method and CPD with respect to the outlier ratio.

4.1.3. Robustness to Missing Points

In this experiment, we study the robustness of the three methods with respect to the missing point ratio using the same raw data as in Section 4.1.1. These original model data sets were first randomly down-sampled to 200 points, and then each model was scaled uniformly to make it fit in a cube of 1 , 1 3 . For random point data, 200 points uniformly distributed in 1 , 1 3 were randomly generated. Different ratios of data points were deleted from the down-sampled model data sets and random points to generate the moving point sets, while the reference point sets were constructed by applying random similarity transformation ( R , t , s ) (where 1 s 5 ) to the entire down-sampled model data sets and random points. For each missing point ratio, 20 registrations under different random transformations were performed. Figure 4 shows the median runtime and success rate of CPD and our method. Please note that the results of APM are not shown in Figure 4, because APM was unable to terminate within 1200 s in most experiments. The left column of Figure 4 shows the median runtime of CPD and our method. The right column of Figure 4 shows the success rate of CPD and our method. From Figure 4, we can see the median runtime of our method is less than 10 s in most cases, which is much faster than the global method APM. The reason may be that the assumption of one-to-one correspondence adopted by APM is not valid in these experiments and transformation decomposition in our method can greatly improve the efficiency of BnB. Our method could successfully register the moving and reference point sets except two failed cases in 20 registrations of the Archer data with 0.2 missing point ratio and one failed case in 20 registrations of the Archer data with 0.4 missing point ratio. Although CPD is fast, it cannot return an accurate result in most cases.
Figure 4. Median runtime and success rate with respective to the missing point ratio. Random data (first row), Archer (2nd row), Chef (3rd row), Kids (4th row), Lucy (5th row) and Owl (last row). Left column: The median runtimes of our method and CPD with respect to the missing point ratio. Right column: The mean success rates of the our method and CPD with respect to the missing point ratio.

4.2. Clinical Data

In this section, we compare our method, APM and CPD in registering clinical data. Two scans from 3D MRI volume data and a laser range scanner were used. The point clouds used in these experiments are very dense, and they were down-sampled by using the pcdownsample function of MATLAB with a specified gridsize 20 to 223 and 241 points, respectively. Then these points were scaled uniformly to make it fit in a cube of 1 , 1 3 . Since there is no ground truth for transformation, in order to calculate the registration error, three targets were selected from the data to calculate the target registration error (TRE). It should be noted that due to the manual selection of the target and the weak correspondence between the data, there is a certain error between the corresponding targets. For our method, the inlier threshold of RSIFs screening was set to 0.1, the inlier threshold of translation search was set to 1, and the inlier threshold of rotation search was set to 10. The iterations and tolerance error of CPD were set to 10,000 and 1 × 10 8 , respectively. For APM, we used three different tolerance errors: 0.1, 0.2 and 0.3. Since APM requires that the number of moving points to be less than the number of reference points, two point sets were exchanged and input into APM. However, APM cannot converge within 1200 s. Then we randomly delete 18 points from the moving point set so that the number of moving points is the same as the number of reference points, but APM still cannot converge in 1200 s. Figure 5 shows the clinical data before and after registration. There is approximately relative rotation of 90 degrees between two point sets in Figure 5a. The result of our method is shown in Figure 5b, and our method could successfully register two point sets. Figure 5c shows the result of CPD. The running time of each method, TRE, mean, and standard deviation are listed in Table 1. The scale of CPD is 0.893, and ours is 1.040. CPD is much faster than our method, but it failed, while our method succeeded in 19 s.
Figure 5. Clinical data before and after registration. (a) The initial relative position of the two point sets to be registered. (b) The result of our method. (c) The result of CPD.
Table 1. The results of registration on the clinical data.

5. Conclusions

This paper focuses on the similarity point set registration problem and proposes an efficient global method by using BnB and transformation decomposition. Because the time efficiency of BnB optimization is exponential in the dimensionality of the problem, a novel rotation and scale invariant feature is proposed to decouple the optimization of translation, rotation and scale. The optimal translation is first globally optimized based on the two sets of rotation and scale invariant features constructed from the original point sets. Then, the optimal rotation between the original point sets is calculated after applying the obtained optimal translation to the moving point set. Finally, the scale is estimated through potential correspondences. Decoupling the optimization of the translation, the rotation and the scale makes the proposed algorithm much more efficient than the existing global method. When the outlier ratio with respect to inliers is up to 1.0 or missing point ratio is 0.5, our method still successfully registers the moving point set and reference point set. Extensive experiments show that the proposed method is approximately two orders of magnitude faster than state-of-the-art global method and much more accurate than local methods in similarity point set registration.
There are several limitations of this study. First of all, we have to admit that the decomposed problem is not exactly the same as the original problem. However, in practice, it is feasible to find the translation, the rotation and the scale separately. Secondly, when the outlier ratio is higher, the method may fail. Finally, the application experiment is relatively simple. In the future, we will further optimize the algorithm and enhance its applicability.
Overall, this paper proposes an efficient global similarity point registration algorithm based on transformation decomposition and BnB optimization framework. Two BnB-based algorithms are used to globally estimate translation and rotation in two three-dimensional spaces, which improve the efficiency of BnB. Thus, our method is faster than the state-of-the-art global method. At the same time, our method uses BnB optimization framework to guarantee the global optimality, which makes it more accurate than local method in similarity point set registration.

Author Contributions

Conceptualization, C.W.; Methodology, C.W. and M.W.; Software, C.W.; Validation, X.C. and M.W.; Formal analysis, C.W.; Investigation, M.W.; Resources, X.C. and M.W.; Data curation, C.W.; Writing–original draft preparation, C.W. and M.W.; Visualization, C.W.; Supervision, X.C. and M.W.; Project administration, M.W.; Funding acquisition, X.C. and M.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (grants 81701795).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Weiss, A.; Hirshberg, D.; Black, M.J. Home 3D Body Scans from Noisy Image and Range Data. In Proceedings of the 13th IEEE International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011; pp. 1951–1958. [Google Scholar]
  2. Zhu, J.; Zhu, L.; Jiang, Z.; Li, Z.; Li, C.; Zhang, F. Scaling registration of multiview range scans via motion averaging. J. Electron. Imaging 2016, 25, 43021. [Google Scholar] [CrossRef]
  3. Rasoulian, A.; Rohling, R.; Abolmaesumi, P. Group-Wise Registration of Point Sets for Statistical Shape Models. IEEE Trans. Med. Imaging 2012, 31, 2025–2034. [Google Scholar] [CrossRef]
  4. Shen, D.G.; Davatzikos, C. HAMMER: Hierarchical Attribute Matching Mechanism for Elastic Registration. IEEE Trans. Med. Imaging 2002, 21, 1421–1439. [Google Scholar] [CrossRef] [PubMed]
  5. Wu, G.; Kim, M.; Wang, Q.; Shen, D. S-HAMMER: Hierarchical Attribute-Guided, Symmetric Diffeomorphic Registration for MR Brain Images. Hum. Brian Mapp. 2014, 35, 1044–1060. [Google Scholar] [CrossRef] [PubMed]
  6. Pomerleau, F.O.; Colas, F.; Siegwart, R. A Review of Point Cloud Registration Algorithms for Mobile Robotics. Found. Trends Robot. 2015, 4, 1–104. [Google Scholar] [CrossRef]
  7. Ma, L.; Zhu, J.; Zhu, L.; Du, S.; Cui, J. Merging grid maps of different resolutions by scaling registration. Robotica 2016, 34, 2516–2531. [Google Scholar] [CrossRef]
  8. Wang, R.; Xu, Y.; Sotelo, M.A.; Ma, Y.; Sarkodie-Gyan, T.; Li, Z.; Li, W. A Robust Registration Method for Autonomous Driving Pose Estimation in Urban Dynamic Environment Using LiDAR. Electronics 2019, 8, 43. [Google Scholar] [CrossRef]
  9. Maiseli, B.; Gu, Y.; Gao, H. Recent developments and trends in point set registration methods. J. Vis. Commun. Image Represent. 2017, 46, 95–106. [Google Scholar] [CrossRef]
  10. Tam, G.K.L.; Cheng, Z.; Lai, Y.; Langbein, F.C.; Liu, Y.; Marshall, D.; Martin, R.R.; Sun, X.; Rosin, P.L. Registration of 3D Point Clouds and Meshes: A Survey from Rigid to Nonrigid. IEEE Trans. Vis. Comput. Graph. 2013, 19, 1199–1217. [Google Scholar] [CrossRef]
  11. Besl, P.J.; McKay, N.D. A Method For Registration of 3-D Shapes. IEEE Trans. Pattern Anal. Mach. Intell. 1992, 14, 239–256. [Google Scholar] [CrossRef]
  12. Chen, H.; Zhang, X.; Du, S.; Wu, Z.; Zheng, N. A Correntropy-based Affine Iterative Closest Point Algorithm for Robust Point Set Registration. IEEE-CAA J. Autom. Sin. 2019, 6, 981–991. [Google Scholar] [CrossRef]
  13. Wu, Z.; Chen, H.; Du, S.; Fu, M.; Zhou, N.; Zheng, N. Correntropy Based Scale ICP Algorithm for Robust Point Set Registration. Pattern Recogn. 2019, 93, 14–24. [Google Scholar] [CrossRef]
  14. Yang, Y.; Fan, D.; Du, S.; Wang, M.; Chen, B.; Gao, Y. Point Set Registration With Similarity and Affine Transformations Based on Bidirectional KMPE Loss. IEEE Trans. Cybern. 2019, 1–12. [Google Scholar] [CrossRef] [PubMed]
  15. Du, S.; Cui, W.; Wu, L.; Zhang, S.; Zhang, X.; Xu, G.; Xu, M. Precise Iterative Closest Point Algorithm with Corner Point Constraint for Isotropic Scaling Registration. Multi-Media Syst. 2019, 25, 119–126. [Google Scholar] [CrossRef]
  16. Li, C.; Xue, J.; Zheng, N.; Du, S.; Zhu, J.; Tian, Z. Fast and Robust Isotropic Scaling Iterative Closest Point Algorithm. In Proceedings of the IEEE International Conference on Image Processing, Brussels, Belgium, 11–14 September 2011; pp. 1485–1488. [Google Scholar]
  17. Ying, S.; Peng, J.; Du, S. A Scale Stretch Method Based on ICP for 3D Data Registration. IEEE Trans. Autom. Sci. Eng. 2009, 3, 559–565. [Google Scholar] [CrossRef]
  18. Zha, H.B.; Ikuta, M.; Hasegawa, T. Registration of range images with different scanning resolutions. In Proceedings of the IEEE International Conference on Systems Man and Cybernetics Conference, Nashville, TN, USA, 8–11 October 2000; pp. 1495–1500. [Google Scholar]
  19. Li, H.; Hartley, R. The 3D-3D Registration Problem Revisited. In Proceedings of the IEEE International Conference on Computer Vision, Minneapolis, MN, USA, 18–23 June 2007; pp. 1947–1954. [Google Scholar]
  20. Tsin, Y.; Kanade, T. A Correlation-Based Approach to Robust Point Set Registration. In Proceedings of the European Conference on Computer Vision, Prague, Czech Republic, 11–14 May 2004; pp. 558–569. [Google Scholar]
  21. Chui, H.L.; Rangarajan, A. A Feature Registration Framework using Mixture Models. In Proceedings of the IEEE Workshop on Mathematical Methods in Biomedical Image Analysis, Hilton Head Island, SC, USA, 11–12 June 2000; pp. 190–197. [Google Scholar]
  22. Jian, B.; Vemuri, B.C. Robust Point Set Registration Using Gaussian Mixture Models. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 1633–1645. [Google Scholar] [CrossRef]
  23. Myronenko, A.; Song, X. Point Set Registration: Coherent Point Drift. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 2262–2275. [Google Scholar] [CrossRef]
  24. Yang, J.; Li, H.; Campbell, D.; Jia, Y. Go-ICP: A Globally Optimal Solution to 3D ICP Point-Set Registration. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 2241–2254. [Google Scholar] [CrossRef]
  25. Campbell, D.; Petersson, L. GOGMA: Globally-Optimal Gaussian Mixture Alignment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 5685–5694. [Google Scholar]
  26. Olsson, C.; Kahl, F.; Oskarsson, M. Branch-and-Bound Methods for Euclidean Registration Problems. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 31, 783–794. [Google Scholar] [CrossRef]
  27. Straub, J.; Campbell, T.; How, J.P.; Fisher, J.W.I. Efficient Global Point Cloud Alignment using Bayesian Nonparametric Mixtures. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 September 2017; pp. 2403–2412. [Google Scholar]
  28. Liu, Y.; Wang, C.; Song, Z.; Wang, M. Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8–14 September 2018; pp. 460–474. [Google Scholar]
  29. Bustos, A.P.; Chin, T.; Eriksson, A.; Li, H.; Suter, D. Fast Rotation Search with Stereographic Projections for 3D Registration. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 2227–2240. [Google Scholar] [CrossRef]
  30. Lian, W.; Zhang, L.; Yang, M. An Efficient Globally Optimal Algorithm for Asymmetric Point Matching. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 7, 1281–1293. [Google Scholar] [CrossRef]
  31. Du, S.Y.; Zhu, J.H.; Zheng, N.N.; Zhao, J.Z.; Li, C. Isotropic Scaling Iterative Closest Point Algorithm for Partial Registration. Electron. Lett. 2011, 47, 784–799. [Google Scholar] [CrossRef]
  32. Liu, W.; Pokharel, P.P.; Principe, J.C. Correntropy: Properties and Applications in Non-Gaussian Signal Processing. IEEE Trans. Signal Process. 2007, 55, 5286–5298. [Google Scholar] [CrossRef]
  33. Chen, B.; Xing, L.; Wang, X.; Qin, J.; Zheng, N. Robust Learning with Kernel Mean p-Power Error Loss. IEEE Trans. Cybern. 2018, 48, 2101–2113. [Google Scholar] [CrossRef] [PubMed]
  34. Consolini, L.; Laurini, M.; Locatelli, M.; Rizzini, D.L. Globally Optimal Registration based on Fast Branch and Bound. arXiv 2019, arXiv:1901.09641. [Google Scholar]
  35. Dym, N.; Kovalsky, S.Z. Linearly Converging Quasi Branch and Bound Algorithms for Global Rigid Registration. In Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea, 27 October–2 November 2019; pp. 1628–1636. [Google Scholar]
  36. Yu, C.; Da, J. A Maximum Feasible Subsystem for Globally Optimal 3D Point Cloud Registration. Sensors 2018, 18, 544. [Google Scholar] [CrossRef] [PubMed]
  37. Li, X.; Liu, Y.; Wang, M.; Song, Z. GO-APSR: A Globally Optimal Affine Point Set Registration Method. IEEE Access 2019, 7, 137232–137240. [Google Scholar] [CrossRef]
  38. Hartley, R.I.A.Z. Multi-View Geometry in Computer Vision; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
  39. Yang, H.; Carlone, L. A Polynomial-time Solution for Robust Registration with Extreme Outlier Rates. In Proceedings of the Robotics: Science and Systems, Breisgau, Germany, 22–26 June 2019. [Google Scholar]
  40. Yang, H.; Shi, J.; Carlone, L. TEASER: Fast and Certifiable Point Cloud Registration. arXiv 2020, arXiv:2001.07715. [Google Scholar]
  41. Moore, R.E.; Kearfott, R.B.; Cloud, M.J. Introduction to Interval Analysis; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2009. [Google Scholar]
  42. Moore, R.E. Methods and Applications of Interval Analysis; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1979. [Google Scholar]
  43. Brown, M.; Windridge, D.; Guillemaut, J. Globally Optimal 2D-3D Registration from Points or Lines Without Correspondences. In Proceedings of the IEEE International Conference on Computer Vision, Boston, MA, USA, 8–12 June 2015; pp. 2111–2119. [Google Scholar]
  44. Campbell, D.J.; Petersson, L.; Kneip, L.; Li, H. Globally-Optimal Inlier Set Maximisation for Camera Pose and Correspondence Estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 328–342. [Google Scholar] [CrossRef]
  45. Hartley, R.I.; Kahl, F. Global Optimization through Rotation Space Search. Int. J. Comput. Vision 2009, 82, 64–79. [Google Scholar] [CrossRef]
  46. Mian, A.S.; Bennamoun, M.; Owens, R.A. A Novel Representation and Feature Matching Algorithm for Automatic Pairwise Registration of Range Images. Int. J. Comput. Vision 2006, 66, 19–40. [Google Scholar] [CrossRef]
  47. Mian, A.S.; Bennamoun, M.; Owens, R. Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes. IEEE Trans. Pattern Anal. Mach. Intell. 2006, 28, 1584–1601. [Google Scholar] [CrossRef] [PubMed]
  48. Available online: http://graphics.stanford.edu/data/3Dscanrep/ (accessed on 1 June 2020).
  49. Available online: https://lgg.epfl.ch/statues_dataset.php (accessed on 1 June 2020).

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.