Next Article in Journal
Dynamic Analysis and Optimal Prevention Strategies for Monkeypox Spread Modeled via the Mittag–Leffler Kernel
Next Article in Special Issue
Estimation of Fractal Dimension and Super-Resolution Reconstruction for Person Re-Identification in Images from Infrared Surveillance Camera
Previous Article in Journal
HR-UMamba++: A High-Resolution Multi-Directional Mamba Framework for Coronary Artery Segmentation in X-Ray Coronary Angiography
Previous Article in Special Issue
Fractal Dimension-Based Multi-Focus Image Fusion via AGPCNN and Consistency Verification in NSCT Domain
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Fractional-Order Spatiotemporal Unified Energy Framework for Non-Repetitive LiDAR Point Cloud Registration

1
The School of Mechanical Engineering, Shenyang Ligong University, Shenyang 110158, China
2
The School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China
*
Authors to whom correspondence should be addressed.
Fractal Fract. 2026, 10(1), 42; https://doi.org/10.3390/fractalfract10010042
Submission received: 8 December 2025 / Revised: 29 December 2025 / Accepted: 6 January 2026 / Published: 9 January 2026

Abstract

Non-repetitive scanning LiDARs provide high coverage yet exhibit irregular sampling patterns, which destabilize local features and correspondences. To address this, we propose a novel spatiotemporal unified energy framework that integrates fractional calculus into rigid pose estimation. Spatially, we introduce a Riesz fractional regularization term to impose non-local smoothness constraints on the residual field, mitigating structural inconsistencies. Temporally, we design a Grünwald–Letnikov fractional dynamics solver that leverages long-memory effects of historical gradients to reduce the risk of being trapped in local minima. Comparative experiments on the Stanford 3D, MVTec ITODD, and HomebrewedDB (HB) datasets demonstrate that our method significantly outperforms state-of-the-art geometric and learning-based approaches. Specifically, it maintains a success rate exceeding 90% even under severe sampling perturbations where traditional methods fail. Ablation studies further validate that the introduction of non-local spatial constraints and historical gradient memory significantly reshapes the energy landscape, ensuring robust convergence. This work provides a rigorous theoretical foundation for applying fractional operators to point cloud processing.

1. Introduction

Point cloud registration is a fundamental problem in autonomous navigation, robotic vision, and industrial metrology. Recently, non-repetitive scanning LiDARs (e.g., the Livox family) have been deployed on mobile platforms. Their spiral or pseudo-random patterns can achieve near full field-of-view coverage after temporal accumulation, which helps detect small objects and capture fine details, but also leads to frame-wise inconsistent sampling, sparsity, and noise. Local structures are therefore hard to reuse across frames and stable correspondences become difficult to construct, so that robust registration and multi-frame fusion under non-repetitive sampling remain challenging [1].
Rigid registration is dominated by the Iterative Closest Point (ICP) algorithm and probabilistic variants such as Generalized-ICP (G-ICP), which iteratively refine a rigid transform from nearest-neighbor matches [2,3]. However, ICP-type methods solve a non-convex problem and are highly sensitive to the initial pose, overlap ratio, occlusions, and partial views, so that small local errors can easily accumulate into global drift. Non-rigid approaches such as Coherent Point Drift (CPD) estimate a smooth deformation field and perform well in medical imaging and deformable reconstruction, but may yield implausible distortions when data are severely missing [4]. Certifiable robust methods such as TEASER and TEASER++ tolerate extreme outlier rates, yet still rely on sufficiently good putative correspondences and incur non-negligible computational cost [5]. Overall, these methods remain difficult to tune for non-repetitive LiDAR scans with weak overlap and viewpoint-dependent missing data.
Fractional calculus offers an alternative modeling and optimization paradigm by introducing non-integer-order derivatives and integrals that naturally encode non-local couplings and long-memory effects. In control and multi-agent systems, fractional-order models have been used for singular system analysis, consensus, and event-triggered or distributed control [6,7,8,9]. In image processing and computer vision, fractional-order differentiators and enhancement models can sharpen structures, suppress noise, and adapt to varying local smoothness [10,11,12], while recent fractional prescribed-performance controllers for wheeled mobile robots and time-delay nonlinear systems show the benefit of embedding performance guarantees directly into robotics-related dynamics [13,14]. Despite these advances, standard registration frameworks remain bound by integer-order calculus. Such methods rely on local gradients that are ill-defined within the spatial voids of non-repetitive scanning, and typically employ optimization schemes with short-term (exponentially decaying) memory that struggle to navigate the rugged, non-convex energy landscapes. Consequently, fractional spatio-temporal formulations—with their inherent non-locality and heavy-tailed memory—provide a rigorous mathematical basis for overcoming these geometric and dynamic bottlenecks.
Motivated by these observations, this paper proposes a fractional-order spatio-temporal energy framework for rigid registration of non-repetitive LiDAR point clouds. On the spatial side, we introduce a Riesz fractional regularization term defined on the residual field. Unlike local integer-order operators, this term imposes non-local, power-law weighted smoothness to suppress local outliers and mismatches while preserving important geometric details. On the temporal side, we embed a Grünwald–Letnikov (GL) fractional derivative into the gradient-based optimization. This accumulates past gradients with binomial-type weights, yielding an explicit long-memory update that acts as a generalized momentum term superior to classical short-memory approaches. By coupling these two ingredients into a unified variational formulation, the proposed method jointly encodes non-local geometric constraints and historical gradient memory, aiming to improve robustness and convergence under irregular sampling, occlusions, and noise, without relying on large task-specific training datasets or handcrafted features. Figure 1 presents the qualitative registration results of a conventional baseline and our method under non-repetitive sampling perturbations.

2. Preliminaries

2.1. Overview of the Point Cloud Registration Problem

In the fields of 3D environmental perception and geometric reconstruction, the fundamental task of point cloud registration is to estimate the rigid body pose between a given source point cloud and a target point cloud, thereby aligning them as closely as possible within a unified coordinate system. Let the source point cloud and the target point cloud be denoted as P s and P m , respectively:
P s = { x i R 3 i = 1 , , N s } , P m = { y j R 3 j = 1 , , N m } ,
where x i and y j represent the 3D spatial positions of the points. The variable to be solved in the registration process is the rigid transformation
T = ( R , t ) S E ( 3 ) , R S O ( 3 ) , t R 3 ,
whose action on a point x i is defined as
T ( x i ) = R x i + t .
Upon determining a specific correspondence model (such as nearest-neighbor correspondence or feature-based soft correspondence), the registration problem can be formulated as an energy minimization problem with the pose T as the independent variable. A typical formulation is given by
E class ( R , t ) = i = 1 N s ρ y π ( i ) R x i + t 2 2 ,
where π ( i ) denotes the index of the target point selected for the i-th source point (e.g., the nearest neighbor), and ρ ( · ) represents a potential robust loss function designed to suppress the influence of outliers and noise. Equation (2) encapsulates the fundamental principle underlying numerous registration methods, typified by the Iterative Closest Point (ICP) algorithm: it measures the degree of alignment between point clouds via the sum of squared weighted residuals and subsequently solves for the optimal pose using numerical techniques such as gradient-based methods or iterative closest point schemes.

2.2. Grünwald–Letnikov Fractional Derivative and Its Discretization

Unlike integer-order calculus, which involves only finite-order derivatives, fractional calculus relates the change at the current moment to the historical trajectory over a past period through non-integer-order differential operators, thereby introducing long-memory characteristics at the operator level.
Given a function f ( t ) and an order α ( 0 , 1 ) , a commonly used definition of the fractional derivative is the Riemann–Liouville (RL) type. With the left endpoint t 0 as the starting point, it is defined as
D t α t 0 RL f ( t ) = 1 Γ ( 1 α ) d d t t 0 t f ( τ ) ( t τ ) α d τ ,
where Γ ( · ) is the Gamma function. As seen from Equation (3), the value of the fractional derivative at time t is jointly determined by the function values over the interval [ t 0 , t ] through a weighted integral. The integral kernel ( t τ ) α exhibits a power-law decay, reflecting a non-local memory of historical information.
In numerical computation and discrete-time modeling, the Grünwald–Letnikov (GL) fractional derivative is more frequently employed. Let the time step be h > 0 and t n = t 0 + n h ; the GL fractional derivative is defined as
D t α t 0 GL f ( t n ) = lim h 0 1 h α k = 0 n ( 1 ) k α k f t n k ,
where
α k = α ( α 1 ) ( α k + 1 ) k ! = Γ ( α + 1 ) Γ ( k + 1 ) Γ ( α k + 1 )
are the generalized binomial coefficients. Under appropriate regularity conditions, Equation (4) is equivalent to the Riemann–Liouville definition in function space and can thus be regarded as a discrete approximation form of the fractional derivative.
With a fixed time step h, truncating the limit in Equation (4) to a finite sum yields the commonly used discretized form of the GL fractional derivative:
D t α t 0 GL f ( t n ) 1 h α k = 0 L mem 1 α k ( α ) f t n k ,
where L mem denotes the memory length, and α k ( α ) represents the corresponding discrete weights. To facilitate numerical implementation, α k ( α ) is typically generated via a binomial recurrence relation:
α 0 ( α ) = 1 , α k ( α ) = α k 1 ( α ) · α ( k 1 ) k , k = 1 , 2 , , L mem 1 .
It can be observed that { α k ( α ) } k 0 is precisely the sequence of generalized binomial coefficients for order α . This sequence possesses the following typical properties: First, the weights decay according to a power law as k increases, thereby assigning greater weight to recent history while gradually weakening the influence of the distant past. Second, when 0 < α < 1 , the weights exhibit alternating signs, which helps numerically balance the effects across different time scales.
Viewing Equation (5) as a generalization of the integer-order forward difference operator, it is evident that when α 1 and L mem = 1 , the GL fractional derivative degenerates into a first-order difference, further corresponding to the classical first-order derivative. Conversely, when α 0 , the fractional derivative gradually degenerates into the identity operator, implying no additional memory effect is introduced to the time series. Therefore, by adjusting the order α and the memory length L mem , a smooth transition between the “memoryless” integer-order case and the “long-memory” fractional-order case can be achieved, enabling the description of temporal correlations of varying intensities within a unified framework.
In subsequent sections, we will construct a fractional-order temporal dynamics update strategy based on the aforementioned GL fractional derivative. Specifically, the sequence of pose iterations will be treated as a state sequence evolving over time, and the weights from Equation (5) will be used to perform a weighted superposition of historical updates, achieving non-local temporal modulation of the optimization path.

2.3. Fractional Laplacian and Graph Representation

The previous subsection reviewed the basic forms of fractional calculus from the temporal dimension. This subsection introduces the fractional Laplacian operator and its representation on discrete graph structures from the spatial dimension, laying the foundation for constructing the fractional spatial regularization term for the point cloud residual field in the subsequent sections.
In the continuous case, given a scalar field u: Ω R defined on Ω R d , the classical (integer-order) Laplacian operator is defined as
Δ u ( x ) = · u ( x ) , x Ω ,
and the corresponding Dirichlet energy is given by
E loc ( u ) = 1 2 Ω u ( x ) 2 2 d x .
The Euler–Lagrange equation of (8) corresponds to the elliptic partial differential equation involving the Laplacian operator. Since Equation (8) involves only local gradient information, it corresponds to an integer-order, local smoothness constraint.
To introduce non-local spatial correlations, one can consider the Riesz-type fractional Laplacian operator ( Δ ) s / 2 , where s ( 0 , 1 ) is the fractional order. A common equivalent characterization is through the Gagliardo seminorm:
[ u ] H s ( Ω ) 2 = Ω Ω u ( x ) u ( y ) 2 x y 2 d + 2 s d x d y .
This energy performs a weighted accumulation over all point pairs ( x , y ) , where the weight kernel x y 2 ( d + 2 s ) exhibits a power-law decay, reflecting the non-local characterization of long-range interactions. Formally, [ u ] H s ( Ω ) 2 can be regarded as the fractional Dirichlet energy, whose variational derivative is associated with the Riesz fractional Laplacian operator ( Δ ) s / 2 u . As s 1 , this seminorm converges in an appropriate sense to the local gradient energy (8), thereby achieving a unified formulation of smoothness constraints from local to non-local within the interval s ( 0 , 1 ] .
In the discrete case, a point cloud can be viewed as a set of sampling points from the continuous domain Ω . Let { z i } i = 1 N be N sampling points in the point cloud. We construct an undirected weighted graph
G = ( V , E , W )
based on these points, where V = { 1 , , N } is the set of vertices, E V × V is the set of edges, and W = ( w i j ) R N × N is a non-negative symmetric weight matrix. A typical approach establishes edges ( i , j ) E via k-nearest neighbors or a distance threshold, and sets
w i j = φ z i z j 2 , ( i , j ) E ,
where φ ( · ) is a monotonically decreasing weight function, such as an inverse distance or Gaussian kernel, reflecting the spatial proximity between adjacent nodes on the graph.
On the graph structure, the discrete form of the integer-order (local) Dirichlet energy can be written as
E loc G ( u ) = 1 2 ( i , j ) E w i j u i u j 2 ,
where u i denotes the value of function u at vertex i. Equation (11) is closely related to the graph Laplacian matrix
L = D W , D = diag j w i j ,
and can be written in matrix form as E loc G ( u ) = 1 2 u L u , corresponding to integer-order Laplacian smoothing on the graph.
To characterize fractional smoothing on a graph structure, one can proceed from two complementary perspectives: one is via the spectral decomposition of the graph Laplacian, raising the eigenvalues { λ } of L to λ s to define the spectral fractional graph Laplacian L s ; the other is to draw inspiration from the continuous Gagliardo seminorm and discretize Equation (9) into a non-local energy on the graph. Following the second approach, we consider defining
E frac G ( u ; s ) = 1 2 ( i , j ) E w i j u i u j 2 s , 0 < s 1 ,
where u i can be a scalar field, or a specific component or norm of a vector field at vertex i. Compared to Equation (11), the fractional energy (13) generalizes the quadratic form of differences to a power function of order 2 s . When s = 1 , E frac G ( u ; 1 ) = E loc G ( u ) , degenerating into the classical graph Dirichlet energy; when s takes a value close to 0, the penalty for large differences is relatively mitigated, demonstrating stronger non-local tolerance.
From a variational perspective, the first-order variation of the fractional graph energy (13) corresponds to a certain “Riesz-type” fractional Laplacian operator on the graph structure, the specific form of which depends on the selection of the weight matrix W and the exponent 2 s . By adjusting s, one can continuously regulate between strong smoothing (s close to 1) and weak smoothing (s close to 0), thereby controlling the constraint strength on the non-local structure of graph signals within a unified framework.
In subsequent sections, we will treat the residual field in point cloud registration as a vector field defined on the target point cloud graph and specialize u i in Equation (13) as the residual or its norm, constructing a fractional spatial regularization term suitable for non-repetitive LiDAR point clouds.

3. Unified Spatiotemporal Fractional Energy Framework

This section articulates the theoretical formulation of the Unified Spatiotemporal Fractional Energy Framework. We first construct a fractional spatial regularization term via graph-based Dirichlet energy, enforcing non-local constraints on the residual field to mitigate structural sparsity. Concurrently, a fractional-order temporal dynamic solving strategy is derived based on the Grünwald–Letnikov derivative, incorporating long-term memory to modulate the optimization trajectory and enhance convergence robustness. By synthesizing these fractional mechanisms with geometric priors into a unified energy functional, we establish a rigorous minimization paradigm on the SE ( 3 ) manifold. This framework theoretically generalizes integer-order methods and effectively addresses the ill-posedness inherent in non-repetitive LiDAR point cloud registration. Figure 2 illustrates the overall pipeline of our registration method.

3.1. Fractional Spatial Regularization Term

In this paper, we introduce a fractional spatial regularization term on the residual field of point cloud registration. Unlike classical Laplacian smoothing, which imposes quadratic penalties only on local gradients, we treat the residual as a vector field defined on the target point cloud graph based on the fractional Laplacian and graph representation discussed in Section 2.3. By characterizing its non-local spatial structure through fractional Dirichlet energy, we achieve robust global constraints suitable for non-repetitive LiDAR data within a unified energy framework.

3.1.1. Residual Vector Field and Target Point Cloud Graph

Continuing the notation from Section 2.3, let the target point cloud be
P m = { y i R 3 i = 1 , , N m } .
We construct an undirected weighted graph G = ( V , E , W ) on it, where V = { 1 , , N m } is the set of vertices corresponding to the target points y i ; E V × V is the set of edges established via k-nearest neighbors or a distance threshold; and W = ( w i j ) is a non-negative symmetric weight matrix reflecting the mutual proximity of y i and y j in geometric space.
Given the current pose T = ( R , t ) S E ( 3 ) , we consider defining a 3D residual vector field on the target point cloud:
r i ( T ) R 3 , i = 1 , , N m .
The specific form of this field is determined by the matching model. For instance, in the case of hard correspondence, r i ( T ) can be defined as the difference between the target point and its corresponding transformed source point; in the case of soft correspondence, it can be defined as the linear combination of differences between the weighted mean of transformed source points and y i . Regardless of the specific correspondence mechanism employed, Equation (14) can be interpreted as “the local alignment error of the target point y i under the current pose T”.
In classical registration frameworks, the optimization process typically minimizes the weighted sum of residual norms directly, such as
i ρ r i ( T ) 2 2 .
This form utilizes only the local residual information of each point and does not explicitly exploit the spatial correlation structure of residuals on the target point cloud. For sparse and non-uniform point clouds generated by non-repetitive LiDAR, such purely local residual modeling is often insufficient to suppress high-frequency disturbances caused by viewpoint differences, occlusions, and sampling biases.

3.1.2. From Continuous Fractional Dirichlet Energy to Graph Discrete Form

To introduce non-local smoothing constraints spatially, we extend the fractional Dirichlet energy on the continuous domain to the residual vector field on the graph structure. As described in Section 2.3, for a scalar field u: Ω R , the Gagliardo-type fractional seminorm
[ u ] H s ( Ω ) 2 = Ω Ω u ( x ) u ( y ) 2 x y 2 d + 2 s d x d y , 0 < s < 1 ,
provides the fractional Dirichlet energy associated with the Riesz fractional Laplacian ( Δ ) s / 2 . For a vector field u : Ω R 3 , one can naturally replace the scalar difference with the Euclidean norm of the vector difference, yielding
[ u ] H s ( Ω ) 2 = Ω Ω u ( x ) u ( y ) 2 2 x y 2 d + 2 s d x d y , 0 < s < 1 .
Clearly, as s 1 , Equation (16) converges in an appropriate sense to the local fractional energy based on u 2 2 ; when s is smaller, the integral kernel assigns more significant weights to distant point pairs, thereby reinforcing non-local spatial interactions.
Discretizing Equation (16) onto the target point cloud graph G yields an approximation of the fractional Dirichlet energy on the graph structure. Letting u i u ( y i ) and approximating the continuous kernel and voxel volume factors with weights w i j , we have
E frac G ( u ; s ) 1 2 ( i , j ) E w i j u i u j 2 2 s , 0 < s 1 .
Compared to the graph fractional energy for the scalar case, Equation (17) acts directly on the Euclidean norm difference of the vector field. Specifically, when s = 1 ,
E frac G ( u ; 1 ) = 1 2 ( i , j ) E w i j u i u j 2 2 ,
which is structurally consistent with the classical graph Dirichlet energy. When 0 < s < 1 , the power exponent 2 s weakens the penalty for large residual differences, allowing the energy to tolerate local anomalies to some extent while maintaining global smoothness through non-local accumulation.

3.1.3. Fractional Spatial Regularization Term for the Residual Field

Based on the above derivation, we specialize the vector field in Equation (17) to the residual field from the registration process defined in Equation (14). Letting
u i = r i ( T ) , i = 1 , , N m ,
the fractional spatial regularization energy for a given pose T is defined as
E frac ( T ; s ) = 1 2 ( i , j ) E w i j r i ( T ) r j ( T ) 2 2 s , 0 < s 1 .
Physically, E frac ( T ; s ) measures the fractional “roughness” of the residual vector field along the target point cloud graph. If a pose T results in generally small differences in residuals between adjacent points, the corresponding energy is low. Conversely, if there are drastic changes or isolated mismatched pairs in local regions, their contribution to the energy will be amplified through the power function form. By adjusting the order s, a smooth transition between two extreme cases can be achieved:
  • When s approaches 1, Equation (18) approximates a quadratic penalty on r i ( T ) r j ( T ) 2 2 , imposing strong smoothness constraints on the residual field. This is beneficial for obtaining highly smooth solutions in scenarios with relatively low noise and good coverage.
  • When s approaches 0, the power exponent 2 s decreases, making the energy less sensitive to large differences on individual edges. This emphasizes evaluating the global structure through the cumulative effect of multiple edges, resulting in stronger robustness against local anomalies and incomplete viewpoint overlaps.
It is worth emphasizing that Equation (18) is a key innovation in the registration framework proposed in this paper. Existing point cloud registration methods mostly rely on weighted sums of point-to-point or point-to-plane local residuals, at most superimposing integer-order local smoothing terms in the objective function. In contrast, this paper proceeds from the perspective of fractional Laplacians and graph signal processing, treating residuals as a vector field on a graph and introducing fractional Dirichlet energy. Through this construction, we incorporate a non-local spatial regularization mechanism with a tunable order into a unified energy model, enabling the algorithm to more fully utilize global structural information on sparse, non-uniform, and partially overlapping point clouds generated by non-repetitive LiDAR.
In the following subsections, we will combine E frac ( T ; s ) with the data term and normal consistency term to form a unified registration energy, and introduce fractional dynamics in the temporal dimension to establish a complete unified spatiotemporal fractional solving framework.

3.2. Fractional-Order Temporal Dynamics Solving Strategy

In the previous subsection, we constructed the residual field regularization term E frac ( T ; s ) from a spatial perspective based on fractional graph Dirichlet energy. Simultaneously, this paper introduces a temporal dynamics solving strategy driven by fractional calculus in the temporal (iteration) dimension. We treat the pose update as a state sequence evolving over “time” and utilize the Grünwald–Letnikov fractional derivative to reconstruct historical updates with weights, thereby achieving long-memory modulation of the optimization path within a unified framework. This strategy differs fundamentally from integer-order iterative methods that rely solely on the current gradient, making it particularly suitable for non-repetitive LiDAR registration scenarios characterized by highly non-convex energy landscapes and sensitivity to initial values.

3.2.1. From Integer-Order Gradient Flow to Fractional-Order Gradient Flow

Let T ( t ) S E ( 3 ) denote the pose trajectory in continuous time t 0 , and represent its local coordinates using an appropriate parameterization (e.g., a vector ξ ( t ) R 6 in the Lie algebra se ( 3 ) ), such that
T ( t ) = exp ξ ^ ( t ) T 0 ,
where ξ ^ is the matrix representation of ξ in the Lie algebra, and T 0 is the initial pose. In classical integer-order optimization theory, the process of minimizing an energy function E ( T ) can be viewed as a gradient flow in the parameter space, with its continuous form given by
d ξ ( t ) d t = μ ξ E T ( t ) ,
where μ > 0 is the step size scale. Equation (19) indicates that the rate of change of the state is entirely determined by the energy gradient at the current moment, with historical information reflected only indirectly through the current state.
To explicitly introduce historical memory effects, we consider replacing the integer-order derivative with a fractional-order derivative. Let the order be α ( 0 , 1 ) and adopt the notation D t α 0 for the Riemann–Liouville or Grünwald–Letnikov fractional derivative given in Section 2.2. The fractional gradient flow can then be written as
D t α 0 ξ ( t ) = μ ξ E T ( t ) , 0 < α < 1 .
Compared to the integer-order case, the fractional derivative D t α 0 ξ in Equation (20) depends by definition on the entire trajectory ξ ( τ ) within the interval [ 0 , t ] (see Equations (3) and (4)). Consequently, past update history continuously influences current state evolution through power-law decay; this constitutes the mathematical basis for introducing long memory in the temporal dimension using fractional calculus in this paper.

3.2.2. Discrete Time Update Based on GL Form

In numerical implementation, we are concerned with the discrete iteration sequence { ξ n } n 0 , corresponding to time grid points t n = n h , where h > 0 is a fixed time step. Utilizing the Grünwald–Letnikov discretization formula (Equation (5) and the weight recurrence relation Equation (6)) from Section 2.2, the fractional derivative can be approximated as
D t α 0 GL ξ ( t n ) 1 h α k = 0 L mem 1 α k ( α ) ξ n k ,
where L mem is the truncated memory length, and α k ( α ) is generated by Equation (6). Substituting Equation (21) into the fractional gradient flow (20) and discretizing at t = t n , we obtain
k = 0 L mem 1 α k ( α ) ξ n k = h α μ ξ E T n ,
where T n = exp ( ξ ^ n ) T 0 . Equation (22) provides the coupling relationship between the current gradient and historical states: given a fixed α and L mem , it can be rewritten as an update rule for the current increment.
To facilitate integration with common “incremental” optimization frameworks, we express the state iteration as
ξ n + 1 = ξ n + Δ ξ n ,
where Δ ξ n R 6 is the state increment at step n, which can be further decomposed into a rotational part Δ ω n R 3 and a translational part Δ t n R 3 . Substituting (23) into Equation (22) and performing algebraic rearrangement, we derive a fractional weighted form based on historical increments. To highlight the structure of fractional memory, we write it as
Δ ξ n eff = η ξ E T n + τ k = 1 L mem 1 β k ( α ) Δ ξ n k ,
where η > 0 is the base step size, τ > 0 is the overall scaling factor for the memory term, and { β k ( α ) } k = 1 L mem 1 is the normalized memory weight sequence linearly transformed from { α k ( α ) } . Here, Δ ξ n eff in Equation (24) represents the “effective increment” actually used to update the state at step n, comprising two parts: the first term is the instantaneous update driven by the current gradient, and the second term is the fractional memory weighted superposition of the increments from the past L mem 1 steps.
Equation (24) represents one of the key innovative forms in the temporal dimension of this paper. It concretizes the non-local time convolution structure introduced by the GL fractional derivative into a power-law weighted accumulation of historical increments, such that the optimization path is no longer instantaneously determined solely by the current gradient but comprehensively considers the direction and magnitude of recent history updates. With different values of α , τ , and L mem , Equation (24) allows for a continuous transition between the following scenarios:
  • When α is close to 0, τ is small, and L mem is short, the contribution of β k ( α ) is negligible. Δ ξ n eff is primarily controlled by the current gradient term, approximating classical Gradient Descent or Gauss–Newton updates.
  • When α is close to 1, and τ and L mem take larger values, the response of memory weights to historical increments is enhanced. The algorithm exhibits significant temporal long-memory characteristics, which helps smooth the optimization path and suppress high-frequency oscillations in scenarios with complex energy landscapes and numerous local minima.

3.2.3. Geometric Implementation of Pose Update with Fractional Time Memory

Based on the aforementioned fractional increment update strategy, we execute pose updates on S E ( 3 ) in Lie algebra form. Let
Δ ξ n eff = Δ ω n eff Δ t n eff ,
where Δ ω n eff corresponds to the rotational part and Δ t n eff to the translational part. The pose update from step n to step n + 1 can be expressed as
T n + 1 = exp Δ ξ n eff ^ T n = exp Δ ω n eff ^ Δ t n eff 0 1 T n .
Here, Δ ω n eff ^ is the skew-symmetric matrix representation of Δ ω n eff , and exp ( · ) is the matrix exponential map on S O ( 3 ) . It is evident that fractional time memory acts indirectly on the updates of rotation and translation via Δ ξ n eff : the current pose depends not only on the instantaneous gradient direction but also on the combined influence of the update directions and scales from several historical steps.
From a geometric perspective, Equation (25) preserves the standard structure of updates via exponential mapping on S E ( 3 ) , while Equation (24) redefines the temporal evolution of the “step size” at the Lie algebra level. This design allows the fractional temporal dynamics proposed in this paper to be naturally embedded into existing registration frameworks without altering the pose representation and update paradigm, providing a mechanism for temporal modulation with long-memory characteristics for the subsequently constructed unified energy solver.
The distinction between this paper and existing point cloud registration methods lies in the introduction of the GL fractional derivative into pose iterative updates on S E ( 3 ) and the construction of optimization dynamics through power-law weighted accumulation of historical increments. Existing works typically employ only integer-order momentum or adaptive step size strategies, whereas this paper proceeds from fractional calculus theory, explicitly parameterizing the intensity and range of temporal memory as the order α and memory length L mem . This establishes a unified theoretical connection between integer-order optimization and long-memory optimization. In the following subsections, we will integrate the spatial fractional regularization term (18) with the temporal fractional dynamics (24) and (25) into a unified energy framework, presenting the complete registration energy model and solution process.

3.3. Construction and Solution of the Unified Energy Model

In Section 3.1 and Section 3.2, we introduced fractional regularization and fractional temporal dynamics from spatial and temporal dimensions, respectively. This subsection presents the complete unified energy model and describes how to construct a registration solving framework that incorporates both fractional spatial constraints and fractional temporal memory. The unified energy integrates existing priors—such as data consistency, entropy regularization, normal consistency, and spherical normal distribution alignment—into a single objective function. Furthermore, it employs the fractional residual field energy as a novel spatial regularization term and organizes the iterative process via the fractional time update strategy. This constitutes the overall innovation of our method compared to existing point cloud registration works.

3.3.1. Unified Energy Model and Definition of Terms

Let the current pose be T = ( R , t ) S E ( 3 ) , and the source and target point clouds be
P s = { x i } i = 1 N s , P m = { y j } j = 1 N m ,
respectively. Let G = ( V , E , W ) be the graph constructed from the target point cloud, and let { r i ( T ) } i = 1 N m be the residual vector field as defined in Equation (14). We consider a sparse set of source-target correspondence edges on G:
C loc ( T ) { ( i , j ) i { 1 , , N s } , j { 1 , , N m } } ,
which is determined jointly by local nearest neighbor search and a scale gating mechanism. For each edge ( i , j ) C loc ( T ) , let the scale-normalized geometric cost be c i j ( T ) 0 . This typically corresponds to the truncated, normalized residual under non-repetitive LiDAR settings.
On the local edge set C loc ( T ) , we obtain sparse coupling weights { π i j ( T ) } ( i , j ) C loc ( T ) via an entropy-regularized local optimal transport model, satisfying
π i j ( T ) 0 , ( i , j ) C loc ( T ) π i j ( T ) = 1 ,
which are solved numerically using Sinkhorn iterations. Based on this, we define the following four categories of local or global energy terms:
  • Data Term E data ( T ) :
    E data ( T ) = 1 N s ( i , j ) C loc ( T ) π i j ( T ) c i j ( T ) .
    This term measures the scale-normalized residual between the transformed source points and target points, serving as the dominant term reflecting geometric consistency in the unified energy.
  • Entropy Regularization Term E ent ( T ) :
    E ent ( T ) = ( i , j ) C loc ( T ) π i j ( T ) log π i j ( T ) + ε , ε > 0 .
    This term imposes entropy regularization on the local transport plan, encouraging correspondences to maintain a certain degree of spread within the local neighborhood. This prevents degeneracy into extremely few edges, thereby enhancing stability under sparse sampling conditions.
  • Normal Consistency Term E norm ( T ) : Let n i s and n j m be the unit normals of the source and target points, respectively. We calculate their alignment on each edge as
    δ i j ( T ) = R n i s n j m , ( i , j ) C loc ( T ) ,
    and define
    E norm ( T ) = 1 ( i , j ) C loc ( T ) π i j ( T ) δ i j ( T ) ( i , j ) C loc ( T ) π i j ( T ) + ε .
    When the normals of most weighted correspondence edges are nearly aligned, E norm ( T ) takes a small value; conversely, when there are numerous mismatched edges with inconsistent normals, this term increases, thereby suppressing registration results with inconsistent geometric structures.
  • Spherical Normal Distribution Alignment (SNDA) Term E snda ( T ) : We construct direction histograms h s and h m on the unit sphere for the normals of the source and target point clouds, respectively. Under pose T, h s is mapped to h s ( R ) via rotation R . The normalized correlation coefficient between them is defined as
    κ ( T ) = h m , h s ( R ) , 0 κ ( T ) 1 ,
    where the inner product is integrated discretely over the unit sphere. The SNDA energy is then defined as
    E snda ( T ) = 1 κ ( T ) .
    This term measures the consistency of normal distributions at a global scale and is capable of counteracting misleading local matches, making it particularly suitable for scenarios with uneven local coverage caused by non-repetitive scanning.
Additionally, for the residual field { r i ( T ) } , Section 3.1 provided the fractional graph Dirichlet energy (18):
E frac ( T ; s ) = 1 2 ( i , j ) E w i j r i ( T ) r j ( T ) 2 2 s , 0 < s 1 ,
which imposes non-local smoothness constraints on the residual field over the target point cloud graph via a fractional Laplacian-type kernel. This is one of the core innovative terms in the unified energy proposed herein.
In summary, we linearly combine the five sub-terms above to construct the unified spatiotemporal fractional energy model:
E uni ( T ; s ) = λ data E data ( T ) + λ ent E ent ( T ) + λ norm E norm ( T ) + λ snda E snda ( T ) + λ frac E frac ( T ; s ) ,
where the weights λ data , λ ent , λ norm , λ snda , λ frac 0 are normalized such that their sum equals 1. In this unified model, E data , E ent , E norm , and E snda comprehensively characterize local geometric consistency and global normal distribution priors, while E frac ( T ; s ) introduces non-local spatial constraints on the residual field in the form of fractional graph regularization. Specifically, when λ frac 0 and s 1 , the model degenerates into a unified energy containing only integer-order local/global geometric priors. Conversely, when the weight λ frac is significant, fractional spatial regularization plays a dominant role in non-repetitive point cloud scenarios.

3.3.2. Solving Framework Based on Fractional Temporal Dynamics

Under the unified energy (30), the registration task can be formulated as
min T S E ( 3 ) E uni ( T ; s ) .
Executing integer-order gradient descent or Gauss–Newton updates directly on S E ( 3 ) is often limited by the non-convexity of the energy landscape and local minima. Therefore, we adopt the fractional temporal dynamics strategy proposed in Section 3.2, parameterizing the pose as a Lie algebra vector ξ R 6 and organizing the solution process via the fractional increment update Equation (24) and the pose update Equation (25).
Specifically, let the pose at the n-th iteration be T n , with the corresponding Lie algebra coordinate ξ n . In each iteration step:
  • Local Linearization and Base Step Construction: In the vicinity of pose T n , we perform a first-order or second-order approximation of the local residual terms (including E data and E norm ) in the unified energy E uni ( T ; s ) . By constructing normal equations, we obtain a base increment Δ ξ n . This increment can be viewed as a one-step approximation of the integer-order gradient flow (19) in discrete time.
  • Introduction of Fractional Time Memory: We input Δ ξ n and its historical steps { Δ ξ n k } k = 1 L mem 1 into the fractional increment update formula (24) to obtain the effective increment with long memory:
    Δ ξ n eff = η ξ E uni ( T n ; s ) + τ k = 1 L mem 1 β k ( α ) Δ ξ n k ,
    where α ( 0 , 1 ) is the fractional order, { β k ( α ) } is the memory kernel generated by GL weights, and η and τ control the relative influence of the instantaneous gradient and historical memory, respectively. This step embodies our fractional innovation in the temporal dimension: unlike traditional momentum methods, the memory kernel { β k ( α ) } is derived from the rigorous mathematical definition of fractional derivatives, causing the influence intensity and time scale of historical steps to decay according to a power law rather than simple exponential decay.
  • Pose Update and Energy-Driven Line Search: We execute the update on S E ( 3 ) using the effective increment Δ ξ n eff :
    T n + 1 = exp Δ ξ n eff ^ T n ,
    and perform a 1D line search along this direction. We evaluate E uni exp ( s Δ ξ n eff ^ ) T n ; s on a set of scaling factors { s } to select a step size that satisfies sufficient decrease conditions. To enhance robustness in complex energy landscapes, we employ a non-monotonic line search with a finite window. This allows the energy to undergo finite non-monotonic changes within an iteration window and terminates the iteration at the current scale early upon detecting a “plateau” (via a threshold on the max/min energy difference within the window).

3.3.3. Complementarity of Spatial and Temporal Components

The robustness of the proposed framework stems from the fundamental complementarity between its spatial and temporal constituents. To appreciate this synergy, it is instructive to delineate the limitations of each component when functioning in isolation.
The Spatial Fractional Regularization acts as a static structural constraint. By enforcing non-local smoothness via the Riesz potential, it effectively “repairs” the jagged energy landscape caused by sparse LiDAR sampling. However, spatial regularization alone is local in nature; while it ensures that the local geometry is consistent, it cannot prevent the optimizer from converging to a wrong pose if the initial alignment is poor (i.e., it creates a smooth basin, but potentially the wrong one).
Conversely, the Temporal Fractional Dynamics function as a kinetic driver. The heavy-tailed memory provides the necessary inertia to escape shallow local minima, addressing the non-convexity of the problem. Yet, without spatial regularization, the temporal dynamics would operate on a degenerate, noise-filled landscape. In such cases, the “long memory” might drive the solution towards spurious geometric inconsistencies rather than the true alignment.
Therefore, the two components are fundamentally complementary: the spatial term ensures the geometric validity of the convergence basin, while the temporal term provides the navigational capability to reach that basin globally.
The above steps constitute a unified solving framework driven jointly by fractional spatial regularization and fractional temporal memory. Spatially, E frac ( T ; s ) in the unified energy (30) imposes non-local constraints on the target point cloud graph via fractional graph Dirichlet energy. Temporally, the update process accumulates historical increments via power-law weighting through fractional temporal dynamics, smoothing the optimization trajectory and suppressing high-frequency oscillations while maintaining local geometric consistency. In the extreme case where λ frac 0 , α 0 , and the memory length L mem takes a minimal value, this framework degenerates into traditional integer-order local optimization methods. However, in the general setting, the fractional spatiotemporal mechanism provides stronger convergence stability and global robustness for non-repetitive LiDAR point cloud registration.

3.3.4. Verification of Consistency Between Unified Energy and Pose Variation

To verify the consistency between the constructed unified energy and the actual pose error, we experimentally observe the variations in energy under different pose changes.
First, an asymmetric 3D model point cloud containing 1000 points is randomly generated and utilized as both the source and target point clouds. For each given translation magnitude | t | { 0.0 , 0.1 , 0.2 , 0.3 } m , a corresponding ground truth rigid transformation T t = ( R = I , t ) is constructed, yielding the source point cloud S t = T t ( M ) .
Subsequently, while keeping the translation t fixed, we scan different rotation errors RRE = | δ θ | [ 0 ° , 30 ° ] solely within the neighborhood of the true pose. We then calculate the unified energy using an implementation identical to that of the registration algorithm (incorporating Optimal Transport (OT), normal consistency, SNDA, and the fractional regularization term). As shown in Figure 3, the overall energy increases with the magnitude of pose variation. This phenomenon demonstrates that the proposed unified energy exhibits excellent consistency with rotation and translation errors.
In conclusion, this paper integrates graph fractional residual regularization (18) and GL-type fractional temporal dynamics (24), (25) into a single registration energy and solving framework. Theoretically, this framework treats integer-order registration methods as special cases of the fractional model; practically, it offers a new viable path for processing sparse, locally inconsistent point clouds generated by non-repetitive LiDAR.

4. Experiments and Analysis

This section aims to comprehensively validate the effectiveness and robustness of the proposed fractional spatiotemporal unified energy registration algorithm. First, a data generation pipeline simulating non-repetitive scanning characteristics is constructed to establish quantitative evaluation benchmarks. Subsequently, through comparative experiments on datasets such as Stanford 3D and MVTec ITODD, we demonstrate the significant accuracy advantages of our method over traditional geometric and advanced learning-based methods in handling point position jitter and structural missingness. Finally, ablation studies are conducted to quantitatively analyze the necessity of temporal memory, spatial regularization, and the three-stage solving strategy.

4.1. Experimental Data Preparation

The core characteristic of non-repetitive scanning LiDAR (e.g., Livox) lies in its scanning pattern (such as rosette or spiral) that does not repeat over short durations, resulting in significant local density drift and point position jitter between consecutive frames. Traditional registration datasets (e.g., 3DMatch) are typically constructed based on structured light or repetitive scanning LiDARs and lack such characteristics. To quantitatively evaluate the algorithm’s performance under non-repetitive sampling in a controlled environment, we constructed a simulation data generation pipeline. This pipeline aims to reproduce two key features of non-repetitive sampling: (1) partial overlap caused by occlusion and viewpoint limitations; and (2) local point position perturbations caused by sampling inconsistency.

4.1.1. Viewpoint Simulation and Partial Point Cloud Generation

The data generation starts from a complete, high-density model point cloud (e.g., a model from the Stanford 3D or ITODD datasets). To simulate the locality of real scanning, we perform the following steps:
  • Model Preprocessing: First, the model point cloud M is centered (setting its centroid to the origin) and normalized (scaling it within a unit bounding box) to eliminate scale and translation ambiguities.
  • Virtual Camera Scanning: We define a virtual camera pose, including its position p c a m , observation target p t g t (usually the origin), and up vector u c a m .
  • Visibility and Occlusion Judgment: By simulating a pinhole camera model, we calculate the visibility of each point in M . A point y j is considered visible if and only if it satisfies the following conditions simultaneously:
    Frustum Constraint: Point y j lies within the preset depth range [ d m i n , d m a x ] of the camera, i.e.,
    d m i n y j p c a m 2 d m a x .
    Forward Constraint: Point y j must be located in front of the camera, i.e.,
    ( y j p c a m ) · ( p t g t p c a m ) > 0 .
    Back-face Culling: To simulate occlusion by opaque surfaces, we utilize the normal vector m j at point y j . The point is deemed visible only if the angle between the normal and the viewing vector ( p c a m y j ) is less than a certain threshold θ t h r e s h (e.g., 90°).
  • Partial Point Cloud Basis: All points passing the above filtering constitute the “visible model point cloud” M v i s M . This forms the basis for subsequent perturbation.

4.1.2. Ground Truth Pose and Source Point Cloud Generation

To formulate the registration problem, we define a known rigid transformation (i.e., ground truth) T g t = ( R g t , t g t ) S E ( 3 ) . This transformation is composed of preset Euler angles ( ϕ , θ , ψ ) and a translation vector ( t x , t y , t z ) . By applying this transformation to M v i s , we generate the “original source point cloud” S o r i g :
S o r i g = x i x i = R g t y j + t g t , y j M v i s .
At this stage, we possess the model point cloud M , the source point cloud S o r i g , the perfect correspondence between them (indices of points in M v i s ), and the ground truth transformation T g t .

4.1.3. Adaptive Gaussian Perturbation (Simulating Non-Repetitive Sampling)

To simulate the central feature of non-repetitive sampling—local point position jitter—we apply an adaptive Gaussian perturbation to S o r i g (as shown in Figure 1b), generating the final “perturbed source point cloud” S d i s t . The design of this perturbation process aims to correlate the intensity of the perturbation with the local density of the point cloud.
  • Local Density Estimation: Point position jitter (in terms of absolute distance) caused by non-repetitive sampling is typically greater in sparse regions than in dense regions. We employ the K-Nearest Neighbor (K-NN) distance to quantify the local sparsity of each point x i in S o r i g . We calculate the Euclidean distance d K ( x i ) from x i to its K-th nearest neighbor. The choice of K (e.g., K = 28 ) simulates the scale of the perturbation influence. A larger d K ( x i ) indicates that the region where x i is located is sparser.
  • Adaptive Perturbation Radius: We treat d K ( x i ) as the adaptive perturbation radius r d i s t ( x i ) for point x i . This implies that points in sparse regions are allowed to jitter within a larger range.
  • Gaussian Perturbation Generation: We randomly select a certain proportion (e.g., 100%) of points x i from S o r i g for perturbation. For each selected point x i , we generate a 3D Gaussian perturbation vector δ i N ( 0 , σ i 2 I 3 ) .
    Adaptive Standard Deviation: The standard deviation σ i of the perturbation is proportional to the perturbation radius r d i s t ( x i ) of that point, i.e.,
    σ i = r d i s t ( x i ) 3 .
    The choice of σ = r / 3 is based on the 3 σ rule of the Gaussian distribution, ensuring that approximately 99.7 % of the random perturbation vectors δ i naturally fall within the radius r d i s t ( x i ) .
    Perturbation Truncation: To strictly ensure that the perturbation remains within the radius, we truncate the generated δ i :
    δ i final = δ i , if δ i 2 r d i s t ( x i ) , δ i · r d i s t ( x i ) δ i 2 , if δ i 2 > r d i s t ( x i ) .
    Generating Final Source Point Cloud:
    S d i s t = x i x i = x i + δ i final , x i S o r i g .
Through the above pipeline, we generated the registration pairs ( M , S d i s t ) for the experiments. S d i s t simulates a source point cloud observed from a specific viewpoint (causing partial overlap) that has undergone non-repetitive scanning (causing point position jitter). The selection of the K value (e.g., from K = 8 to K = 24 ) allows us to control the severity of the simulated perturbation systematically, thereby evaluating the algorithm’s performance under different perturbation levels.The datasets utilized in this paper include the Stanford 3D dataset [15], the MVTec ITODD dataset [16], and the HomebrewedDB (HB) dataset [17]. As illustrated in Figure 4, we present examples of non-repetitively sampled point clouds generated using selected models from these datasets.

4.2. Registration Evaluation Criteria

To quantitatively evaluate the registration accuracy and robustness of the proposed algorithm, we adopt two standard metrics widely used in the field of 3D registration: Rotation Error (RRE) and Relative Translation Error (RTE). Given a registration pair, the ground truth rigid transformation is known as T g t = ( R g t , t g t ) S E ( 3 ) , where R g t S O ( 3 ) is the ground truth rotation matrix and t g t R 3 is the ground truth translation vector. For the estimated transformation T ^ = ( R ^ , t ^ ) output by the algorithm, we define RRE and RTE as follows:
  • Rotation Error (RRE): RRE measures the angular difference between the estimated rotation R ^ and the ground truth rotation R g t . It is defined as the rotation angle of the relative rotation matrix R err = R g t R ^ . This angle θ can be calculated via the trace of R err . The RRE converts this radian value (or uses it directly in calculation) into degrees:
    RRE = 180 π θ = 180 π arccos tr R g t R ^ 1 2 ,
    where tr ( · ) denotes the trace of a matrix (the sum of diagonal elements), R g t denotes the transpose (inverse) of R g t , and arccos is the arccosine function.
  • Relative Translation Error (RTE): RTE measures the Euclidean distance between the estimated translation t ^ and the ground truth translation t g t :
    RTE = t ^ t g t 2 ,
    where · 2 denotes the L 2 norm (Euclidean distance) of a vector.
  • Success Rate: In non-convex registration problems, comparing average RRE/RTE alone can be misleading, as a few extreme failure cases (where RRE approaches 180°) can severely skew the mean. Therefore, we introduce the Success Rate as a more robust macroscopic evaluation metric. A registration experiment is considered successful if and only if both its RRE and RTE are below preset thresholds τ R and τ t . In the experiments of this paper, we adopt a set of strict thresholds:
    Success ( RRE 3.0 ° ) ( RTE 0.03 m ) .
    The Success Rate is defined as the percentage of successful trials out of all test cases. This metric is a key criterion for measuring the robustness of the algorithm under challenging conditions (such as the non-repetitive sampling perturbations simulated in this experiment).

4.3. Bayesian Parameter Optimization

In the fractional spatiotemporal registration framework proposed above, several critical hyperparameters significantly influence performance. These include: the spatial fractional order s, which controls the sensitivity of the Riesz fractional regularization to outlier residuals; the temporal fractional order α time and the memory scale factor κ GL , which jointly regulate the inertia and convergence trajectory of the fractional temporal dynamics; and the weight vector for the unified energy terms
λ = λ data , λ ent , λ norm , λ snda , λ frac ,
where the five components correspond to the data term, entropy regularization term, local normal consistency term, spherical normal distribution term, and fractional spatial regularization term, respectively.
Since the mapping from the aforementioned hyperparameters to the final registration error is a highly nonlinear, strongly non-convex black-box function lacking analytical gradients, relying on manual tuning or regular grid search makes it difficult to find a reasonable parameter configuration within a limited budget. Therefore, we model the hyperparameter identification problem as a global optimization problem on a bounded domain and employ Bayesian Optimization (BO) for the solution [18,19].
We denote the set of hyperparameters to be optimized as
Θ = s , α time , κ GL , λ ,
where the energy weights must satisfy the simplex constraint
λ i > 0 , i λ i = 1 ,
to ensure the physical clarity of the unified energy. For the fractional orders s and α time , we restrict their search intervals to sub-intervals of ( 0 , 1 ) (specifically, s , α time [ 0.30 , 0.95 ] in our implementation) to verify the performance advantages of non-integer derivatives compared to integer-order cases. The memory scale factor κ GL is confined to a finite positive interval (e.g., [ 0.50 , 2.50 ] ) to control the decay rate of the GL memory kernel.
The optimization objective is defined as the sum of the average Relative Rotation Error (RRE) and Relative Translation Error (RTE) on a validation dataset D val :
Θ * = arg min Θ Ω J ( Θ ) , J ( Θ ) = 1 | D val | k D val RRE k ( Θ ) + RTE k ( Θ ) ,
where ( RRE k ( Θ ) , RTE k ( Θ ) ) represent the rotation and relative translation errors obtained by running the complete registration process on the k-th validation sample under the current hyperparameters Θ . If a registration fails to output a valid pose or encounters numerical anomalies, the error for that sample is assigned a sufficiently large constant to impose a penalty in the objective function. Since the magnitudes of RRE (in degrees) and RTE (in meters) are comparable at the error scale of interest in this paper, their direct sum effectively reflects registration quality in both rotation and translation dimensions. It is important to note that subsequent comparative and ablation experiments utilize the optimal hyperparameter configuration obtained by minimizing J ( Θ ) in this section, without re-tuning for each experimental scenario.
To efficiently search for the optimum given the high evaluation cost of the objective function, we employ a Gaussian Process (GP) as the surrogate model. Assuming the scalar objective function J ( Θ ) follows a Gaussian Process prior
J ( Θ ) GP m ( Θ ) , k ( Θ , Θ ) ,
where m ( · ) is the mean function and k ( · , · ) is the covariance kernel function. Given the observed parameter-error pairs { ( Θ i , y i ) } i = 1 n , the GP posterior provides the predicted mean and variance of the objective value for any candidate point Θ , thereby characterizing the model’s comprehensive judgment of that point’s quality and uncertainty.
Based on the GP posterior, we adopt Expected Improvement (EI) as the acquisition function to select the sampling point Θ n + 1 for the next iteration. Let the current best observed objective value be
J min = min 1 i n y i ,
then EI is defined as
EI ( Θ ) = E max 0 , J min J ( Θ ) .
A larger EI value indicates a more favorable balance between “potential to achieve a smaller objective value” and “high current model uncertainty.” By repeatedly maximizing EI ( Θ ) in the search space and evaluating the new point, Bayesian optimization achieves an adaptive trade-off between exploiting known good regions and exploring unknown areas.
Under the above framework, we statistically identify a set of more suitable fractional orders s * , α time * , unified energy weights, and memory scale factor for the current data distribution. In the parameter tuning process, targeting the minimization of RRE and RTE, we set the default initial iterations to 20 and Bayesian optimization iterations to 60. We selected models from the Stanford 3D dataset and generated registration datasets according to the method described in Section 4.1. For all models in the dataset, we averaged the results over 30 iterations as the basis for evaluation. As shown in Figure 5, the data fluctuations stabilized after 40 iterations. Furthermore, we observed that for the overall parameters, the spatial fractional order parameter (s) performed well in the range of 0.4 0.6 , and the temporal fractional order ( α time ) in the range of 0.4 0.7 . In subsequent experiments, we will select the parameter combination that achieved the lowest RRE and RTE during optimization as the default hyperparameters, keeping them unchanged to ensure the objectivity of the conclusions in comparative and ablation studies.

4.4. Comparative Experiments

This section aims to systematically compare the proposed fractional spatiotemporal unified energy framework with representative traditional methods and learning-based methods to verify the effectiveness and superiority of our algorithm.

4.4.1. Comparison Methods and Characteristics

To cover different paradigms ranging from traditional geometric approaches to learning-based front-ends, we selected the following five representative methods for comparison:
  • RANSAC+FPFH+ICP: A classic three-stage pipeline comprising feature extraction (FPFH), global estimation (RANSAC), and point-to-plane ICP for superior local refinement. It serves as a traditional baseline but relies heavily on feature stability.
  • FGR+ICP: An advanced non-learning framework that optimizes global pose via robust cost functions (e.g., GNC) followed by ICP. It offers higher efficiency and stability than RANSAC under moderate outlier ratios [20].
  • RANSAC+FPFH+TEASER++: A state-of-the-art geometric solver employing Truncated Least Squares (TLS) with provable robustness bounds. It excels in scenarios with high outlier ratios and severe occlusions [21,22].
  • PREDATOR: An advanced Transformer-based network that simultaneously predicts overlap and inliers. We utilize its pre-trained model to evaluate cross-dataset generalization and robustness under non-repetitive sampling [23].
  • GeoTransformer: Encodes multi-scale geometry using GNNs and Transformers. It serves both as a strong learning-based baseline and as the front-end generator for our method, enabling a direct evaluation of different back-end solving strategies [24].
The methods mentioned above cover a range from classic geometric baselines and newer generations of fast/provably robust solvers (RANSAC+FPFH+ICP, FGR, TEASER++) to representative learning-based registration models (PREDATOR, GeoTransformer). This selection comprehensively reflects the performance upper bounds of current non-learning and learning paradigms in the scenarios focused on in this work. The fractional spatiotemporal unified energy registrar proposed in this paper introduces spatial fractional regularization and temporal fractional memory within a unified energy framework and employs a three-stage decoupled solving strategy. We focus on examining its robustness and accuracy advantages under conditions of non-repetitive sampling and structural missingness.

4.4.2. Experimental Settings

Comparative evaluations were conducted on the Stanford 3D, ITODD, and HB datasets. To ensure sufficient geometric complexity, simple primitives such as cubes and cylinders were excluded. The curated subset comprises 8 models from Stanford 3D, 11 typical industrial parts from ITODD, and 18 deformable or occluded models from HB, all exhibiting rich structural details.To simulate non-repetitive sampling, we followed the established protocol where the target point cloud remains fixed while the source undergoes viewpoint cropping, nearest-neighbor perturbation, and density unevenness processing. These perturbations are governed by the neighborhood size K { 8 , 12 , 16 , 20 , 24 } , where a larger K introduces severe local sparsity and structural missingness to mimic complex non-repetitive LiDAR characteristics.
For a fair comparison, all methods utilized identical input data regarding coordinate systems and scales. The learning-based baselines, PREDATOR and GeoTransformer, employed official pre-trained weights for inference without fine-tuning. Geometric hyperparameters were adaptively set based on the object bounding box diagonal D: voxel size δ = 0.01 D , normal estimation radius 2 δ , FPFH calculation radius 5 δ , and ICP distance threshold 1.5 δ . All stochastic methods underwent 30 independent trials per setting. Our framework adopted fixed energy weights and fractional orders derived from Bayesian optimization without dataset-specific tuning, with the solver energy threshold linearly decaying from 1.2 to 0.35 . All experiments were executed on an Intel Core i5-12400KF CPU (Intel Corporation, Santa Clara, CA, USA) with an NVIDIA RTX 4060 GPU (NVIDIA Corporation, Santa Clara, CA, USA).The implementation was based on Python 3.8.20, PyTorch 2.3.0, CUDA 12.1, and FAISS 1.7.1 (GPU enabled) for k-NN acceleration.

4.4.3. Results and Analysis

The quantitative results in Figure 6 and Table 1 demonstrate the superiority of our method.
As shown in Figure 6, while traditional geometric pipelines (e.g., FGR+ICP) deteriorate markedly as sparsity increases ( K 16 ), our method consistently achieves the lowest median errors with the tightest error distributions. Table 1 further corroborates this robustness: under severe conditions ( K = 24 ), the success rates of PREDATOR and GeoTransformer drop to 14.24% and 31.52%, respectively, whereas our method maintains a success rate above 90% (e.g., 90.61% on ITODD). Notably, although all hyperparameters and weights are obtained via Bayesian optimization on the Stanford 3D dataset, our method still performs well on datasets with markedly different point densities and geometric scales, including the industrial-part data in ITODD and the everyday-object data in HB. Together, these results indicate strong generalization and confirm that the proposed fractional spatiotemporal framework effectively mitigates the adverse effects of irregular sampling.
Empirical observations also reveal a degradation in performance under conditions of higher neighborhood sizes (K). An in-depth inspection of the experimental data indicates that registration failures are primarily attributed to three specific models: obj_000001 from the HB dataset, and obj_000015 and obj_000018 from the ITODD dataset, as shown in Figure 7. In the respective experimental groups, failures on these instances resulted in a marked escalation in the RRE and RTE metrics for our method, occurring predominantly at larger neighborhood settings ( K = 20 , K = 24 ). Notably, these models exhibit pronounced geometric characteristics: geometric symmetry (HB obj_000001, ITODD obj_000015, and obj_000018).
For symmetric objects, the inherent symmetry renders the gradient of the Spatially-Normalized Density-Aware (SNDA) energy effectively zero along the axis of symmetry, thereby inducing unconstrained rotational drift. Similarly, for repetitive structures, the failure arises from the non-local Riesz regularization when coupled with a large neighborhood K. While a large K enhances robustness against sparsity, it simultaneously functions as a low-pass filter that over-smooths the high-frequency boundaries separating repeating units. When combined with the “heavy-tailed” inertia of the Grünwald–Letnikov dynamics, the consequent reduction in energy barriers allows the solver to inadvertently overshoot the optimal alignment and settle into a phase-shifted local minimum. These cases underscore an intrinsic trade-off: the very long-memory and non-local mechanisms designed to bridge sparse voids can, in the presence of perfect symmetry or periodicity, precipitate unconstrained sliding or periodic skipping.

4.5. Ablation Studies

To quantitatively analyze the role of each key module in the fractional spatiotemporal unified energy framework, this section conducts ablation experiments on several representative models that exhibit distinct characteristics compared to other front-ends. We selected bunny from the Stanford 3D dataset, as well as obj_000001 and obj_000013 from the ITODD dataset as research objects. These three types of objects correspond to free-form surfaces with rich details, mechanical parts with planar and hole structures, and industrial components with complex contours, respectively, allowing for a comprehensive reflection of the impact of each module on registration performance under different geometric features. Moreover, we further introduce an order-oriented ablation study. Specifically, while keeping all other hyperparameters fixed, we perform a controlled sweep over the spatial fractional order and the temporal fractional order. This design enables a direct and interpretable characterization of how the registration error varies with fractional orders in space and time, thereby complementing the module-wise ablations with explicit evidence on the error–order relationship.

4.5.1. Experimental Purpose and Settings for Module-Wise Ablations

The purpose of the module-wise ablations experiments is to verify the necessity of each module in coping with non-repetitive sampling, local structural missingness, and initial pose deviations by turning off or replacing specific modules one by one while keeping the overall structure of the fractional spatiotemporal unified energy framework unchanged, and observing the changes in RRE, RTE distributions, and success rates. Specifically, in this section, we examine the following five ablation configurations:
  • no_time_frac: Disables the temporal fractional memory, relying solely on geometric increments (based on the current gradient and energy descent) to update the pose in the main processing stage;
  • no_riesz_frac: Removes the fractional spatial regularization term from the unified energy, ceasing to couple point cloud residuals across neighborhoods via the Riesz-type non-local operator;
  • no_ot: Replaces the soft correspondence term in the unified energy with 1-NN weighted SVD/ICP edges, i.e., relying only on nearest-neighbor rigid registration without explicitly solving for sparse optimal transport weights;
  • no_bracket: Disables the bracketing search mechanism in the three-stage solver (including the bootstrap stage and the main processing/polishing stages), using only fixed step sizes or local line searches for updates;
  • no_snda_boot: Skips the global rotation search based on the Spherical Normal Distribution Alignment (SNDA) prior in the bootstrap stage, entering the subsequent optimization directly from the given initial pose.
The five configurations mentioned above were evaluated on non-repetitively sampled point clouds identical to those used in the full model, with the data construction process consistent with the comparative experiments. For each object, each ablation configuration, and each non-repetitive level, independent trials were repeated 20 times. We statistically analyzed the distributions of rotation error (RRE) and relative translation error (RTE).
In addition, we provide the maximum values of RRE and RTE for each configuration to characterize the error long-tail and overall stability. All hyperparameters (including the weights of the sub-terms in the unified energy and the fractional orders) were fixed to the optimal configuration obtained via Bayesian optimization in Section 4.3. Only the corresponding modules were “turned on/off” or replaced to ensure the fairness of the ablation comparison.

4.5.2. Module-Wise Ablation Results and Analysis

The quantitative results of the module-wise ablations studies are presented in the boxplots of Figure 8 (covering three representative models) and the worst-case statistics in Table 2 and Table 3. These data comprehensively verify the necessity of each module in the proposed framework.
First, regarding the fractional temporal dynamics (no_time_frac), its absence leads to a catastrophic degradation in stability under high non-repetitive levels. As shown in Table 2 and Table 3, when K = 24 , the maximum RRE of no_time_frac surges to 177.73 ° , compared to only 0.96 ° for the full model. The boxplots further illustrate that without historical memory, the error distribution becomes extremely dispersed with numerous outliers, confirming that the fractional temporal memory effectively suppresses oscillation and prevents divergence in complex energy landscapes.
Second, the fractional spatial regularization (no_riesz_frac) plays a crucial role in maintaining geometric consistency. While its impact on the success rate is less severe than removing the temporal module, Figure 8 shows that removing this term results in a systematic “lifting” of the error medians and a widening of the interquartile ranges, especially on the ITODD industrial parts. Table 2 indicates that the maximum RRE increases from 0.96 ° to 1.44 ° at K = 24 , proving that the non-local spatial coupling effectively inhibits local drift caused by sampling jitter.
Third, the Optimal Transport module (no_ot) is vital for handling sparsity. The substitution of soft correspondences with hard nearest neighbors causes the algorithm to fail rapidly as K increases. In Table 2, no_ot exhibits a high maximum translation error of 1.568  m at K = 24 , and the boxplots show extremely long tails, indicating that explicit modeling of many-to-many relationships is essential for non-repetitive point clouds.
Finally, the bracketing search (no_bracket) and SNDA bootstrap (no_snda_boot) ensure convergence quality. Without the bracketing strategy, the solver often gets stuck in local minima, leading to a maximum RRE of 178.02 ° at high perturbation levels. Similarly, skipping the SNDA global initialization results in failure to correct large initial deviations, yielding errors comparable to random guessing ( 178.05 ° ). In summary, the full model (Ours) integrates these mechanisms to achieve the lowest worst-case errors ( Max RRE < 1 ° ) and the most compact error distributions across all scenarios.

4.5.3. Experimental Purpose and Settings for Order-Oriented Ablations

The goal of the order-oriented ablation experiments is to explicitly characterize the relationship between registration error and the fractional orders in our spatiotemporal framework. Unlike the module-wise ablations that disable specific components, this study performs a controlled scan over the fractional orders while keeping all other hyperparameters unchanged, thereby isolating the effect of fractionalization from other factors.
We consider two fractional orders: the spatial order s and the temporal order α time . For the temporal-order scan, we fix the spatial order to s = 0.5 and vary α time [ 0.5 , 1.0 ] with a step size of 0.1 . For the spatial-order scan, we fix the temporal order to α time = 0.5 and vary s [ 0.5 , 1.0 ] with the same step size. In both scans, the integer-order limit corresponds to the endpoint value 1.0 .
To obtain a single error measure that can clearly reflect the relationship between fractional orders and registration accuracy, we employ a unified evaluation metric in the order-oriented ablations to integrate the two standard error terms, RRE and RTE. Since these two quantities have different physical units, we unify them by normalizing with the same success criteria used in our evaluation. Specifically, following the success thresholds RRE 3 ° and RTE 0.03 m , we define the dimensionless composite error
J = RRE 3 + RTE 0.03 ,
where RRE is measured in degrees and RTE in meters. In this way, both terms are expressed on a comparable scale with respect to the success standard, and a smaller J indicates a better registration result.
To ensure consistency with the module-wise ablations, the order-oriented ablations are conducted on the same three representative objects: bunny from the Stanford 3D dataset, and obj_000001 and obj_000013 from the ITODD dataset. For each object and each scanned order setting, we perform the ablation only under the most challenging non-repetitive level ( K = 24 ). All other experimental settings follow exactly the protocol used in the module-wise ablations, and the evaluation is repeated with the same number of trials. The final result for each configuration is reported as the mean value of the composite metric J over repeated runs.

4.5.4. Order-Oriented Ablation Results and Analysis

The order-oriented scans reveal a clear and repeatable pattern: the composite error J is minimized at moderate fractional orders, whereas the integer-order limit is consistently suboptimal. In other words, the error–order relationship is not monotonic; pushing the order to 1.0 does not improve registration under the hardest setting, and instead increases J across all tested objects.
As shown in Table 4 and Table 5, the order-oriented ablation scans over the fractional orders exhibit clear performance trends for both α time and s. For the temporal order (with s = 0.5 fixed), the best performance concentrates around α time [ 0.6 , 0.7 ] : the minimum J is achieved at α time = 0.6 for bunny ( 0.5458 ) and obj_000013 ( 0.2387 ), and at α time = 0.7 for obj_000001 ( 0.2413 ). Moving to the integer-order limit α time = 1.0 consistently degrades the error for all three objects. A similar conclusion holds for the spatial order (with α time = 0.5 fixed): the lowest J occurs at s = 0.8 for bunny ( 0.4906 ), at s = 0.6 for obj_000001 ( 0.2367 ), and at s = 0.7 for obj_000013 ( 0.2430 ), while s = 1.0 yields the largest error in every case.
Overall, these results provide direct evidence that fractional orders are not merely interchangeable with their integer-order counterparts. Instead, there exists a practical operating interval (roughly 0.6 0.8 in this study) where the fractional spatiotemporal mechanism is most effective, offering a stable accuracy advantage across different object geometries.

4.6. Real-World Scenario Testing

To validate the registration capability of the proposed method on genuine non-repetitive LiDAR data, we conducted experiments using a Livox Mid-360 LiDAR sensor (Livox Technology Company Limited, Shenzhen, China). The experimental procedure was as follows: First, in a stationary scene, continuous frames of point clouds were acquired and accumulated to form a dense target point cloud. Subsequently, the position of the target objects was altered, and a single frame was captured by the LiDAR to serve as the sparse source point cloud for registration.
As illustrated in Figure 9, the test subject consisted of two televisions positioned side-by-side with an offset. This setup introduces significant challenges due to the presence of repetitive geometric structures combined with the irregular sparsity inherent to non-repetitive scanning. Figure 9a displays the experimental setup. Figure 9b visualizes the relative position between the source (red) and target (blue) point clouds prior to registration.
Upon obtaining the data, we performed registration using both the baseline and our method. Figure 9c presents the result using GeoTransformer. It is evident that GeoTransformer fails to align the point clouds correctly; this failure is attributed to the ambiguity caused by the coupling of non-repetitive sampling patterns and repetitive geometric features, leading to feature mismatching. In contrast, Figure 9d displays the result obtained by our method. The source and target point clouds overlap almost perfectly, achieving highly accurate alignment. This outcome demonstrates the superior performance of our method in real-world scenarios, effectively overcoming the limitations of existing learning-based approaches in such complex environments.

5. Discussion

The experimental results presented in Section 4 demonstrate that the proposed fractional spatiotemporal unified energy framework significantly outperforms state-of-the-art geometric and learning-based methods in registering non-repetitive LiDAR point clouds. This performance advantage is attributed to two core mechanisms rooted in fractional calculus. Spatially, unlike integer-order gradients that are sensitive to high-frequency jitter in irregular sampling, the Riesz fractional regularization imposes non-local constraints. By coupling residuals across a wider neighborhood via power-law kernels, it effectively mitigates the structural discontinuities inherent in non-repetitive patterns. Temporally, the Grünwald–Letnikov dynamics introduce a “heavy-tailed” memory of historical increments. As verified in the ablation studies, this long-memory effect is crucial for traversing the highly non-convex energy landscape, preventing the solver from being trapped in local minima caused by partial overlaps.
However, compared with standard local methods, the introduction of non-local operators inevitably increases computational complexity. Table 6 reports the runtime and accuracy statistics on the Stanford Bunny model under varying non-repetitive levels and sampling densities. For each configuration, experiments were repeated 30 times to report the mean results. In terms of accuracy, our method is markedly superior to both the conventional ICP-style pipeline (FGR+ICP) and the representative learning-based method (GeoTransformer). It maintains mean RRE values around 1° (max 2.248 ° ) and mean RTE close to 0.001 m (max 0.0138 m ). Nevertheless, the associated overhead is evident. Compared to the faster GeoTransformer, our method typically requires approximately 10 s per registration. The primary costs arise from: (i) residual-field evaluation of the spatial fractional non-local term over graph edges; (ii) energy-driven candidate screening in the polishing stage; and (iii) history accumulation induced by the temporal Grünwald–Letnikov update.

5.1. Analysis of Solver Convergence Properties

Given the computational overhead highlighted above, a fundamental question arises: Is this complexity justified, and why not employ classical Nesterov momentum or second-order optimization methods to accelerate convergence? The answer lies in the distinct mathematical properties required to traverse the pathological energy landscapes of non-repetitive LiDAR.
(1)
Heavy-tailed vs. Exponential Memory (Comparison with Nesterov). Classical momentum methods, including Nesterov, rely on an exponentially decaying history ( w t e λ t ). This implies a “short-term memory” mechanism. While efficient for smooth convex basins, this momentum dissipates rapidly when the solver encounters extended “flat” plateaus or continuous jittery regions characteristic of sparse LiDAR scans. In contrast, the Grünwald-Letnikov fractional definition introduces a memory kernel that follows a power-law decay ( w t t ( 1 + α ) ). This “heavy-tailed” long-term memory allows the solver to accumulate and retain directional inertia over a significantly longer horizon. It provides the sustained “kinetic energy” necessary to escape deep, wide basins of attraction where classical exponentially-decaying momentum would stagnate.
(2)
Infeasibility of Second-Order Methods. Second-order methods (e.g., Gauss-Newton) require the inversion of the Hessian matrix to normalize the gradient. As discussed in Section 3.2, the sampling pattern of non-repetitive LiDAR is spatially sparse and anisotropic, frequently rendering the Hessian matrix ill-conditioned or singular (degenerate geometry) due to the lack of constraints in directions perpendicular to the scan lines. Consequently, second-order updates become numerically unstable and computationally prohibitive ( O ( N 3 ) ), whereas our fractional first-order approach maintains numerical stability with linear complexity while effectively mimicking curvature-aware benefits through the non-local Riesz operator.

5.2. Future Horizons: Optimization and Generalization

While the current implementation prioritizes solvability over speed, the comparative results indicate that our improvement is not merely obtained by trading time for accuracy; rather, when other methods exhibit pronounced failure, our framework remains capable of delivering high-accuracy registration. To bridge the gap towards real-time deployment, future work will focus on developing fast numerical approximations, such as frequency-domain acceleration or recursive-filter approximations, alongside deeper GPU-centric implementations.
Beyond optimization for speed, we envision that the proposed framework holds significant promise for generalization to broader perception tasks. Although explicitly validated on non-repetitive LiDAR, the underlying mathematical principles—specifically the handling of data sparsity via non-local operators and optimization instability via fractional dynamics—are applicable to other domains:
  • Generalization to Sparse Modalities: The core efficacy of the fractional Riesz regularization lies in imposing structural constraints across disconnected regions. This property suggests immediate applicability to other sparse sensing modalities, such as 4D Imaging Radar or sparse visual feature matching in textureless environments, where non-local potentials can effectively “densify” geometric constraints better than integer-order Laplacians.
  • Continuous-Time Estimation: The temporal component, currently employed for iterative updates, shares a theoretical isomorphism with continuous-time trajectory estimation (e.g., in CT-SLAM). The Grünwald–Letnikov memory kernel can be reinterpreted as a trajectory smoothing prior that enforces long-range temporal coherence, potentially reducing drift in large-scale mapping tasks.
  • Beyond Rigid Transformation: Finally, the fractional Laplacian’s capability to maintain global topology while tolerating local irregularities offers a mathematically elegant avenue for non-rigid registration. In deformable scenarios, the global coupling introduced by the fractional order naturally enforces elastic constraints, preventing topological tearing during local warping.

6. Conclusions

This paper proposes a novel spatiotemporal unified energy framework for the registration of non-repetitive LiDAR point clouds, successfully bridging fractional calculus theory with 3D rigid pose estimation. By integrating Riesz fractional spatial regularization and Grünwald–Letnikov temporal dynamics, the proposed method effectively overcomes the convergence instability and geometric inconsistency caused by irregular sampling and partial overlaps. Extensive comparative experiments on the Stanford 3D, ITODD, and HB datasets demonstrate that our approach achieves state-of-the-art accuracy and robustness, maintaining a success rate of over 90% even under severe sampling perturbations where traditional methods fail. The ablation studies further confirm that the introduction of non-local spatial constraints and historical gradient memory significantly reshapes the energy landscape, avoiding local minima. This work provides a rigorous theoretical foundation for applying fractional operators to point cloud processing, offering a robust solution for high-precision perception in complex dynamic environments. Future research will explore the extension of this fractional framework to continuous-time trajectory estimation and its integration into real-time pose estimation.

Author Contributions

Conceptualization, Q.Y. and D.L.; data curation, D.L.; formal analysis, M.L.; funding acquisition, Q.Y.; investigation, L.L.; methodology, D.L.; project administration, L.L.; resource, Q.Y.; software D.L.; supervision, Q.Y.; validation, Q.Y.; visualization, M.L.; writing—original draft, D.L.; writing—review and editing, Q.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Liaoning Provincial Graduate Education and Teaching Reform Research Project [grant number LNYJG2023077]; by the Fundamental Research Funds for the Provincial Universities of Liaoning Province [grant numbers LJ212410144022 and LJ232410144074]; and by the Liaoning Provincial Science and Technology Plan Joint Plan (Natural Science Foundation—General Program) project “Research on robot active perception technology for intelligent assembly” [project number 2025-MSLH-589].

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Aijazi, A.K.; Checchin, P. Non-repetitive scanning LiDAR sensor for robust 3D point cloud registration in localization and mapping applications. Sensors 2024, 24, 378. [Google Scholar] [CrossRef]
  2. Besl, P.J.; McKay, N.D. Method for registration of 3-D shapes. In Proceedings of the Sensor Fusion IV: Control Paradigms and Data Structures; Spie: Bellingham, WA, USA, 1992; Volume 1611, pp. 586–606. [Google Scholar]
  3. Segal, A.; Haehnel, D.; Thrun, S. Generalized-icp. Robot. Sci. Syst. 2009, 2, 435. [Google Scholar]
  4. Myronenko, A.; Song, X. Point set registration: Coherent point drift. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 2262–2275. [Google Scholar] [CrossRef] [PubMed]
  5. Yang, H.; Shi, J.; Carlone, L. Teaser: Fast and certifiable point cloud registration. IEEE Trans. Robot. 2020, 37, 314–333. [Google Scholar] [CrossRef]
  6. Zhang, X.; Chen, Y. Admissibility and robust stabilization of continuous linear singular fractional order systems with the fractional order α: The 0 < α < 1 case. ISA Trans. 2018, 82, 42–50. [Google Scholar] [PubMed]
  7. Zhang, X.; Chen, S.; Zhang, J.X. Adaptive sliding mode consensus control based on neural network for singular fractional order multi-agent systems. Appl. Math. Comput. 2022, 434, 127442. [Google Scholar] [CrossRef]
  8. Liu, T.; Jiang, Z.P. Event-based control of nonlinear systems with partial state and output feedback. Automatica 2015, 53, 10–22. [Google Scholar] [CrossRef]
  9. Liu, T.; Qin, Z.; Hong, Y.; Jiang, Z.P. Distributed optimization of nonlinear multiagent systems: A small-gain approach. IEEE Trans. Autom. Control 2021, 67, 676–691. [Google Scholar] [CrossRef]
  10. Zhang, X.; Liu, R.; Ren, J.; Gui, Q. Adaptive fractional image enhancement algorithm based on rough set and particle swarm optimization. Fractal Fract. 2022, 6, 100. [Google Scholar] [CrossRef]
  11. Zhang, X.; Dai, L. Image enhancement based on rough set and fractional order differentiator. Fractal Fract. 2022, 6, 214. [Google Scholar] [CrossRef]
  12. Zhang, X.; Boutat, D.; Liu, D. Applications of fractional operator in image processing and stability of control systems. Fractal Fract. 2023, 7, 359. [Google Scholar] [CrossRef]
  13. Zhang, J.X.; Ding, J.; Chai, T. Fault-tolerant prescribed performance control of wheeled mobile robots: A mixed-gain adaption approach. IEEE Trans. Autom. Control 2024, 69, 5500–5507. [Google Scholar]
  14. Zhang, J.X.; Xu, K.D.; Wang, Q.G. Prescribed performance tracking control of time-delay nonlinear systems with output constraints. IEEE/CAA J. Autom. Sin. 2024, 11, 1557–1565. [Google Scholar] [CrossRef]
  15. Turk, G.; Levoy, M. Zippered polygon meshes from range images. In Proceedings of the 21st Annual Conference on Computer Graphics and Interactive Techniques, Orlando, FL, USA, 24–29 July 1994; pp. 311–318. [Google Scholar]
  16. Drost, B.; Ulrich, M.; Bergmann, P.; Hartinger, P.; Steger, C. Introducing mvtec itodd-a dataset for 3d object recognition in industry. In Proceedings of the IEEE International Conference on Computer Vision Workshops, Venice, Italy, 22–29 October 2017; pp. 2200–2208. [Google Scholar]
  17. Kaskman, R.; Zakharov, S.; Shugurov, I.; Ilic, S. Homebreweddb: Rgb-d dataset for 6d pose estimation of 3d objects. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, Seoul, Republic of Korea, 27–28 October 2019. [Google Scholar]
  18. Jones, D.R.; Schonlau, M.; Welch, W.J. Efficient global optimization of expensive black-box functions. J. Glob. Optim. 1998, 13, 455–492. [Google Scholar] [CrossRef]
  19. Snoek, J.; Larochelle, H.; Adams, R.P. Practical bayesian optimization of machine learning algorithms. In Proceedings of the Advances in Neural Information Processing Systems 25, Lake Tahoe, Nevada, 3–6 December 2012. [Google Scholar]
  20. Zhou, Q.Y.; Park, J.; Koltun, V. Fast global registration. In Proceedings of the European Conference on Computer Vision; Springer: Berlin/Heidelberg, Germany, 2016; pp. 766–782. [Google Scholar]
  21. Fischler, M.A.; Bolles, R.C. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
  22. Rusu, R.B.; Blodow, N.; Beetz, M. Fast point feature histograms (FPFH) for 3D registration. In Proceedings of the 2009 IEEE International Conference on Robotics and Automation, Kobe, Japan, 12–17 May 2009; IEEE: Piscataway Township, NJ, USA, 2009; pp. 3212–3217. [Google Scholar]
  23. Huang, S.; Gojcic, Z.; Usvyatsov, M.; Wieser, A.; Schindler, K. Predator: Registration of 3d point clouds with low overlap. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 4267–4276. [Google Scholar]
  24. Qin, Z.; Yu, H.; Wang, C.; Guo, Y.; Peng, Y.; Ilic, S.; Hu, D.; Xu, K. Geotransformer: Fast and robust point cloud registration with geometric transformer. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 9806–9821. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Demonstration of registration robustness under non-repetitive sampling perturbations. (a) Clean source point cloud (green) and target point cloud (blue) without perturbations. (b) Simulation setup with perturbations: perturbed input point cloud (red), where pink lines indicate non-repetitive jitter trajectories. (c) Registration results of the GeoTransformer method. (d) Accurate alignment results achieved by our proposed fractional spatiotemporal framework.
Figure 1. Demonstration of registration robustness under non-repetitive sampling perturbations. (a) Clean source point cloud (green) and target point cloud (blue) without perturbations. (b) Simulation setup with perturbations: perturbed input point cloud (red), where pink lines indicate non-repetitive jitter trajectories. (c) Registration results of the GeoTransformer method. (d) Accurate alignment results achieved by our proposed fractional spatiotemporal framework.
Fractalfract 10 00042 g001
Figure 2. Overview of the proposed fractional-order spatiotemporal unified energy framework for robust non-repetitive LiDAR point cloud registration.
Figure 2. Overview of the proposed fractional-order spatiotemporal unified energy framework for robust non-repetitive LiDAR point cloud registration.
Fractalfract 10 00042 g002
Figure 3. Unified energy E ˜ ( T ) versus rotation error (RRE) at the ground-truth pose for different translation magnitudes | t | on a synthetic non-symmetric point cloud. For each | t | , the full unified energy (including data, entropy, normal, SNDA, and fractional terms) is evaluated using the same implementation as in the registration pipeline. In all cases the energy attains its minimum at RRE = 0 ° and increases as the rotation error grows, while larger translation magnitudes lead to consistently higher energy levels, demonstrating that lower energy reliably corresponds to a more accurate pose.
Figure 3. Unified energy E ˜ ( T ) versus rotation error (RRE) at the ground-truth pose for different translation magnitudes | t | on a synthetic non-symmetric point cloud. For each | t | , the full unified energy (including data, entropy, normal, SNDA, and fractional terms) is evaluated using the same implementation as in the registration pipeline. In all cases the energy attains its minimum at RRE = 0 ° and increases as the rotation error grows, while larger translation magnitudes lead to consistently higher energy levels, demonstrating that lower energy reliably corresponds to a more accurate pose.
Fractalfract 10 00042 g003
Figure 4. Non-repetitive sampling point clouds generated using parts of the Stanford 3D dataset, ITODD dataset, and HB dataset. (a) Stanford 3D dataset; (b) ITODD dataset; (c) HB dataset. From top to bottom, (1), (2), and (3) represent non-repetitive sampling conditions for K = 8 , 16, and 24, respectively. Blue points denote the target point cloud, and red points denote the non-repetitive point cloud with injected non-repetitive perturbations.
Figure 4. Non-repetitive sampling point clouds generated using parts of the Stanford 3D dataset, ITODD dataset, and HB dataset. (a) Stanford 3D dataset; (b) ITODD dataset; (c) HB dataset. From top to bottom, (1), (2), and (3) represent non-repetitive sampling conditions for K = 8 , 16, and 24, respectively. Blue points denote the target point cloud, and red points denote the non-repetitive point cloud with injected non-repetitive perturbations.
Fractalfract 10 00042 g004
Figure 5. Results of Bayesian optimization. (a) Iteration process of Bayesian optimization. (b) Distribution of spatial and temporal fractional orders.
Figure 5. Results of Bayesian optimization. (a) Iteration process of Bayesian optimization. (b) Distribution of spatial and temporal fractional orders.
Fractalfract 10 00042 g005
Figure 6. RRE and RTE distributions of different datasets under different comparison methods. (a) RRE distribution; (b) RTE distribution. (1) Stanford 3D dataset; (2) ITODD dataset; (3) HB dataset.
Figure 6. RRE and RTE distributions of different datasets under different comparison methods. (a) RRE distribution; (b) RTE distribution. (1) Stanford 3D dataset; (2) ITODD dataset; (3) HB dataset.
Fractalfract 10 00042 g006
Figure 7. Point-cloud models with geometric symmetry and repetitive structures: (a) HB obj_000001; (b) ITODD obj_000015; (c) ITODD obj_000018.
Figure 7. Point-cloud models with geometric symmetry and repetitive structures: (a) HB obj_000001; (b) ITODD obj_000015; (c) ITODD obj_000018.
Fractalfract 10 00042 g007
Figure 8. Statistical distributions of RRE and RTE under varying ablation configurations. (a) RRE distribution; (b) RTE distribution. The results are reported on three specific models: (1) bunny from Stanford 3D; (2) obj_000001 from ITODD; and (3) obj_000013 from ITODD.
Figure 8. Statistical distributions of RRE and RTE under varying ablation configurations. (a) RRE distribution; (b) RTE distribution. The results are reported on three specific models: (1) bunny from Stanford 3D; (2) obj_000001 from ITODD; and (3) obj_000013 from ITODD.
Fractalfract 10 00042 g008
Figure 9. Registration results in a real-world scenario using Livox Mid-360 (Livox Technology Company Limited, Shenzhen, China). (a) The experimental setup showing the LiDAR and the target objects (two offset televisions). (b) The initial relative position of the target (blue) and source (red) point clouds. (c) Failed registration result by GeoTransformer, misaligned due to repetitive features and sparse sampling. (d) Successful registration result by our method, showing precise alignment.
Figure 9. Registration results in a real-world scenario using Livox Mid-360 (Livox Technology Company Limited, Shenzhen, China). (a) The experimental setup showing the LiDAR and the target objects (two offset televisions). (b) The initial relative position of the target (blue) and source (red) point clouds. (c) Failed registration result by GeoTransformer, misaligned due to repetitive features and sparse sampling. (d) Successful registration result by our method, showing precise alignment.
Fractalfract 10 00042 g009
Table 1. Success rates (RRE < 3°, RTE < 0.03 m) of different registration methods on Stanford 3D, ITODD, and HB datasets under varying non-repetitive sampling levels (K). The highest success rate for each dataset at the same K level is highlighted in bold.
Table 1. Success rates (RRE < 3°, RTE < 0.03 m) of different registration methods on Stanford 3D, ITODD, and HB datasets under varying non-repetitive sampling levels (K). The highest success rate for each dataset at the same K level is highlighted in bold.
DatasetKSuccess Rate (RRE < 3°, RTE < 0.03 m) of the Corresponding Method %
OursGeoTrans
Former
RANSAC+
FPFH+
ICP
PREDATORFGR+
ICP
RANSAC+
FPFH+
TEASER++
Stanford
3D
8100.0068.3386.1152.7876.6711.11
12100.0066.1176.6749.4471.6713.89
16100.0067.7850.5654.4471.6711.67
20100.0054.4451.6749.4466.112.22
24100.0055.0045.0042.2260.565.00
ITODD8100.0091.5279.0920.0062.730.30
12100.0083.3363.3317.2750.910.00
16100.0070.0053.6413.9445.760.00
2099.7053.3346.6712.7337.271.82
2490.6131.5233.9414.2433.940.00
HB8100.0088.1592.7817.2283.330.00
12100.0053.5285.0015.1976.850.00
1699.4453.3377.5914.8173.700.19
2096.8544.6355.9313.7068.3313.70
2496.1130.5644.4412.7866.110.37
Table 2. Maximum Rotation Error (RRE) across all test samples under different non-repetitive levels (K). Bold indicates the lowest maximum error.
Table 2. Maximum Rotation Error (RRE) across all test samples under different non-repetitive levels (K). Bold indicates the lowest maximum error.
KMaximum Rotation Error
Oursno_riesz_fracno_time_fracno_bracketno_otno_snda_boot
80.170.940.690.940.810.64
120.381.511.871.536.301.53
160.371.7835.900.7841.962.83
200.920.9638.102.15129.76113.25
240.961.44177.73178.02167.78178.05
Table 3. Maximum Relative Translation Error (RTE) across all test samples under different non-repetitive levels (K). Bold indicates the lowest maximum error.
Table 3. Maximum Relative Translation Error (RTE) across all test samples under different non-repetitive levels (K). Bold indicates the lowest maximum error.
KMaximum Relative Translation Error
Oursno_riesz_fracno_time_fracno_bracketno_otno_snda_boot
80.0020.0140.0110.0140.0140.010
120.0040.0210.0260.0230.0750.023
160.0050.0190.7900.0160.6720.032
200.0120.0190.5710.0360.9360.925
240.0120.0201.4051.6881.5681.712
Table 4. Parameter sensitivity analysis on different datasets by varying the temporal fractional order α time with a fixed spatial order s = 0.5 .
Table 4. Parameter sensitivity analysis on different datasets by varying the temporal fractional order α time with a fixed spatial order s = 0.5 .
Fixed Spatial Order s = 0.5
Parameter SelectionBunnyObj_000001Obj_000013
α time = 0.5 0.55890.24370.2519
α time = 0.6 0.54580.24220.2387
α time = 0.7 0.54610.24130.2503
α time = 0.8 0.54980.24790.2540
α time = 0.9 0.58710.24830.2552
α time = 1.0 0.59310.24990.2609
Table 5. Parameter sensitivity analysis on different datasets by varying the spatial order s with a fixed temporal fractional order α time = 0.5 .
Table 5. Parameter sensitivity analysis on different datasets by varying the spatial order s with a fixed temporal fractional order α time = 0.5 .
Fixed Temporal Order α time = 0.5
Parameter SelectionBunnyObj_000001Obj_000013
s = 0.5 0.55890.24370.2519
s = 0.6 0.57430.23670.2436
s = 0.7 0.52600.24390.2430
s = 0.8 0.49060.24480.2465
s = 0.9 0.50840.24560.2484
s = 1.0 0.59730.25280.2521
Table 6. Results of accuracy and time under different perturbation levels with varying sampling point counts. Bold indicates the best performance in each metric category.
Table 6. Results of accuracy and time under different perturbation levels with varying sampling point counts. Bold indicates the best performance in each metric category.
PointsKFGR + ICP GeoTransformer Ours
RRE/°RTE/mTime/sRRE/°RTE/mTime/sRRE/°RTE/mTime/s
6000862.5760.52492.1802 1.60870.02010.9203 0.21020.00198.2590
1248.9910.30112.2303 3.36710.04080.9144 0.25580.00297.5648
1650.0680.40152.3084 3.52910.04590.9817 0.29520.00329.6769
2091.1590.75892.2940 3.83930.05410.9973 0.49970.00509.4807
2466.8210.41992.2051 5.41060.07761.0118 0.65610.008610.351
8000813.1250.11662.1044 0.82580.00800.8663 0.12640.00138.2137
1233.9050.21562.0067 1.17210.01060.9291 0.08720.00059.3213
1658.5240.67872.3554 1.64680.01560.8666 0.21370.00087.6903
2082.6540.68422.2675 2.57650.02860.8688 0.25350.00128.1750
2492.7400.11662.1044 3.76790.04060.8650 0.37730.00358.8328
10,00084.49650.06722.0245 0.91900.01401.0462 0.03680.000511.456
1253.9470.43992.3068 1.76170.02400.8860 0.11000.00158.0418
1637.9230.32592.0984 1.57120.01890.8604 0.13430.00208.7841
2068.4860.54592.2597 1.32390.01510.8446 0.18020.00089.3973
2468.1950.52742.3169 3.00850.03740.8918 2.24800.013811.374
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, Q.; Li, D.; Li, M.; Liu, L. A Fractional-Order Spatiotemporal Unified Energy Framework for Non-Repetitive LiDAR Point Cloud Registration. Fractal Fract. 2026, 10, 42. https://doi.org/10.3390/fractalfract10010042

AMA Style

Yang Q, Li D, Li M, Liu L. A Fractional-Order Spatiotemporal Unified Energy Framework for Non-Repetitive LiDAR Point Cloud Registration. Fractal and Fractional. 2026; 10(1):42. https://doi.org/10.3390/fractalfract10010042

Chicago/Turabian Style

Yang, Qi, Dongwei Li, Minghao Li, and Lu Liu. 2026. "A Fractional-Order Spatiotemporal Unified Energy Framework for Non-Repetitive LiDAR Point Cloud Registration" Fractal and Fractional 10, no. 1: 42. https://doi.org/10.3390/fractalfract10010042

APA Style

Yang, Q., Li, D., Li, M., & Liu, L. (2026). A Fractional-Order Spatiotemporal Unified Energy Framework for Non-Repetitive LiDAR Point Cloud Registration. Fractal and Fractional, 10(1), 42. https://doi.org/10.3390/fractalfract10010042

Article Metrics

Back to TopTop