Next Article in Journal
PSO-Based Ensemble Learning Enhanced with Explainable Artificial Intelligence for Breast Glandular Dose Estimation in Mammography
Previous Article in Journal
Data-Driven Offline Compensation of Robotic Welding Trajectories Using 3D Optical Metrology in Industrial Manufacturing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Vehicle Speed Estimation Using Infrastructure-Mounted LiDAR via Rectangle Edge Matching

Department of Electronics Engineering, Korea National University of Transportation, 50 Daehak-ro, Daesowon-myeon, Chungju-si 27469, Chungcheongbuk-do, Republic of Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(5), 2513; https://doi.org/10.3390/app16052513
Submission received: 6 February 2026 / Revised: 27 February 2026 / Accepted: 3 March 2026 / Published: 5 March 2026

Abstract

Smart transportation infrastructure is increasingly deployed, and cooperative perception using stationary Light Detection and Ranging (LiDAR) sensors installed at intersections and along roadsides is becoming more important. However, infrastructure LiDAR often suffers from sparse point-cloud data (PCD) at long ranges and frequent occlusions, which can degrade the stability of inter-frame displacement and speed estimation. This paper proposes a real-time vehicle speed estimation method that operates robustly under sparse and partially observed conditions. The proposed approach extracts boundary points from clustered vehicle PCD and removes outliers, and then fits a 2D rectangle to the vehicle contour via Gauss–Newton optimization by minimizing distance-based residuals between boundary points and rectangle edges. To further improve robustness, we incorporate Hessian augmentation terms that account for boundary states and size variations, thereby alleviating excessive boundary violations and abnormal deformation of the width and height parameters during iterations. Next, from the fitted rectangles in consecutive frames, we construct a nearest corner with respect to the LiDAR origin and an auxiliary point, and perform 2D SVD-based alignment using only these two representative points. This enables efficient computation of inter-frame displacement and speed without full point-cloud registration (e.g., iterative closest point (ICP)). Experiments conducted at an intersection in K-City (Hwaseong, Republic of Korea) using a 40-channel LiDAR, a test vehicle (Genesis G70), and a real-time kinematic (RTK) system (MRP-2000) show that the proposed method stably preserves representative points and fits rectangles, even in sparse regions where only about two LiDAR rings are observed. Using CAN-based vehicle speed as the reference, the proposed method achieves an MAE of 0.76–1.37 kph and an RMSE of 0.90–1.58 kph over the tested speed settings (30, 50, and 70 kph, as well as high speed (~90 kph)) and trajectory scenarios. Furthermore, per-object processing-time measurements confirm the real-time feasibility of the proposed algorithm.

1. Introduction

With the proliferation of smart transportation infrastructure, an increasing number of studies have investigated traffic perception using fixed sensors deployed at intersections and along roadsides. In roadside LiDAR-based perception, preprocessing techniques such as background removal and ground removal have been actively explored to suppress false detections caused by static structures (e.g., poles, curbs, and roadside facilities) [1,2,3]. Building on these preprocessing steps, several methods have been proposed to extract object trajectories and estimate vehicle speed [4,5]. Recently, deep-learning-based approaches for object detection and tracking using roadside LiDAR have also been reported [6].
In parallel, vehicle speed estimation and traffic surveillance/analysis based on infrastructure-mounted traffic cameras have been continuously studied [7,8,9]. LiDAR-based infrastructure systems for object detection and tracking have also been introduced [10]. Furthermore, comparative evaluations of camera and LiDAR approaches have been conducted [3], and sensor-fusion-based perception and tracking—incorporating radar—have been actively investigated [11].
Despite these advances, LiDAR measurements become increasingly sparse as the sensor-target distance grows, leading to reduced point-cloud density for each object. Under long-range observation conditions typical of intersection infrastructure, such sparsity can significantly degrade the reliability of object detection and geometric modeling. Studies that explicitly address sparse point clouds and partial observations remain relatively limited. In this context, separate efforts have focused on object detection and estimation using low-channel roadside LiDAR sensors [12]. In addition, deep-learning-based classification and speed estimation using traffic camera imagery have also been reported [13].
Cooperative perception, in which infrastructure sensors provide estimated object states (e.g., position and velocity) to connected vehicles or traffic management systems, is gaining importance because it can compensate for the limitations of vehicle-only sensing, including occlusions, limited field of view, and sparsification at long range [14]. In such applications, vehicle speed is a key input to higher-level modules such as collision-risk assessment, trajectory prediction, and safe-distance decision-making. Therefore, robust real-time speed estimation under real-world infrastructure conditions is essential.
In practical roadside deployments, vehicle speed can also be measured or inferred using other sensing modalities, including inductive loops, microwave radar, video cameras, and RFID-based infrastructure. Video-based approaches are attractive due to low hardware cost and wide-area coverage; however, they are sensitive to illumination and weather, depend on calibration quality, and may raise privacy concerns [9]. RFID-based systems can provide reliable event-driven identification or coarse motion cues in instrumented zones, but require tag/reader deployment and offer limited spatial coverage [15,16]. Radar and loop detectors are mature and widely adopted in practice; nevertheless, they are often lane-dependent and provide limited shape information, which can complicate multi-object separation in dense traffic. In contrast, infrastructure-mounted LiDAR provides explicit 3D geometry for robust object separation and trajectory extraction, yet in infrastructure scenarios, frequent occlusions among vehicles, changes in viewing geometry, and range-induced sparsification can make representative-point estimation unstable, which is a key challenge targeted in this work.
Therefore, we focus on stable geometric modeling and lightweight matching that remain reliable under sparse and partially observed LiDAR returns.

2. Related Works

With the growing body of research on vehicle detection and tracking using stationary LiDAR sensors, continuous efforts have been made to provide vehicle state information from infrastructure sensors and to derive vehicle speed from tracking results. In roadside LiDAR-based detection and tracking, speed is typically computed from the inter-frame displacement of an object centroid or a representative point, and this estimated speed is then used to update trajectories and to support inter-frame association [2,6,10,17]. However, in stationary LiDAR settings, vehicle-to-vehicle occlusion occurs frequently, and point clouds tend to become sparse as the sensor–vehicle distance increases. As a result, representative-point estimation can be unreliable; in particular, when the true inter-frame displacement is small, even minor localization noise may be amplified during speed computation, causing large fluctuations in the estimated speed.
To mitigate these limitations, registration-based speed estimation methods have been proposed to more accurately estimate inter-frame correspondences in point clouds. Among point-cloud registration techniques, ICP and normal distributions transform (NDT) are widely used [18,19]. Nevertheless, due to iterative optimization and the cost of correspondence search or distribution modeling, these methods are computationally expensive, and numerous variants and efficient implementations have been studied to accelerate them [20,21]. Zhang et al. proposed a method that tracks vehicle clusters, projects 3D point clouds onto a 2D plane, and performs inter-frame template matching to precisely estimate subtle motion, thereby improving speed accuracy [5]. In addition, approaches that exploit multi-frame information within a time window and combine detection and tracking (e.g., joint detection and tracking) can be interpreted as strengthening temporal consistency across frames, ultimately improving the stability of displacement estimation required for speed computation [22]. Since computational efficiency is also critical for real-time processing, especially when speed estimation is included in the pipeline, methods for low-channel roadside LiDAR have been reported that estimate motion using limited features such as corner points or sparse keypoint matching [12].
Separately, approximating a vehicle’s boundary with a simple geometric model and estimating its heading and contour via rectangle (or L-shape) fitting has been a representative approach for LiDAR cluster-based modeling. Zhang et al. proposed an efficient L-shape fitting method for 2D laser scanner clusters based on angular search [23], and Zhao et al. presented an L-shape-based pose estimation and tracking approach for 3D LiDAR environments to improve performance [24]. However, in infrastructure scenarios, sparse observations and occlusions often prevent two orthogonal edges of a vehicle from being sufficiently observed at the same time; instead, only a single dominant line component may be captured. In such cases, if the fitting results fluctuate across frames, noise can be amplified in displacement and speed estimation. Therefore, there is a demand for a speed estimation method that remains stable under sparse and partial observations while computing inter-frame correspondences in a computationally efficient manner.
While L-shape/rectangle fitting and lightweight feature matching have been explored for LiDAR-based vehicle modeling and tracking, their performance can degrade when the observation degenerate observations (e.g., a near single-line contour) as often encountered in long-range infrastructure sensing [23,24]. Compared with prior approaches, this work introduces two key elements. First, we introduce Hessian-level stabilization for Gauss–Newton rectangle fitting, which directly regulates ill-conditioned updates near boundary states and suppresses abnormal width/height deformation without adding extra penalty terms to the objective. Second, instead of relying on dense point-cloud registration or multiple independent control points, we design a deterministic two-point correspondence (nearest corner plus an auxiliary point) and perform 2D SVD-based alignment on this minimal set, enabling stable inter-frame matching with low computational overhead under sparse and partially observed returns.
In our infrastructure LiDAR setting, observations are often sparse and partially occluded, and the visible boundary may be dominated by a single edge. To mitigate instability in representative-point displacement estimation under such conditions, we (i) robustly fit a rectangle from extracted boundary points with Hessian-level stabilization, and (ii) perform computationally efficient inter-frame alignment using a deterministic two-point correspondence derived from the fitted rectangle, avoiding full point-cloud registration.

3. Methods

Infrastructure-mounted LiDAR sensors cannot directly measure a vehicle’s state; therefore, it is essential to robustly estimate vehicle motion from sequentially acquired point clouds. Conventional tracking approaches often represent a vehicle’s position using a single point derived from a clustered point cloud, such as the centroid or a small set of boundary/feature points. However, due to LiDAR observation characteristics, point loss frequently occurs because of occlusions by surrounding vehicles and changes in viewing geometry, which introduces significant noise into the estimated feature-point locations. This issue is particularly critical for infrastructure LiDAR, where the scan interval is relatively short and the inter-frame displacement is consequently small; under such conditions, noise contained in the representative points can be greatly amplified during speed computation, leading to large errors.
As an alternative, one may consider estimating vehicle motion by accurately registering the entire point cloud (point-cloud matching). However, LiDAR typically produces thousands to tens of thousands of points per frame, and precise registration generally requires substantial computation time. This limits practical applicability in real-world road environments where real-time processing is required.
To address these challenges, this paper proposes a method that fits a rectangle to each vehicle and then performs inter-frame matching at the rectangle level. Compared with point-wise matching, the proposed approach significantly reduces computational cost, and it is designed to achieve matching accuracy comparable to feature-point-based estimation while enabling real-time operation.
In addition, LiDAR data are commonly provided to the proposed method after standard preprocessing steps for vehicle estimation. In the preprocessing stage, background removal (or ground removal and dynamic object separation), clustering, and inter-frame object association are performed, and the subsequent processing takes as input the point cloud of each object with an assigned ID.

3.1. Point-Cloud Filtering

Accurate displacement estimation generally requires point-cloud matching; however, performing matching over the entire object point cloud leads to excessive computational cost, making it impractical for real-world roadside deployment. To address this issue, we approximate each vehicle point cloud with a rectangle so that the key information required for registration can be represented in a compact geometric form.
To fit a rectangle to the input vehicle point cloud, we first apply filtering to extract only boundary points. Specifically, we compute the relative position and bearing of each point with respect to the centroid of the vehicle cluster, and partition the angular space into fixed-size sectors. For each sector, we select the point with the maximum radial distance from the centroid, and these selected points constitute the boundary point set.
The filtered boundary points may still include outliers, such as isolated points that are excessively distant from neighboring points, sensor noise, or points that do not match the true vehicle shape due to transient events (e.g., door opening). To remove such outliers, we first compute, for each point p i = ( x i , y i ) , the distance to its neighboring points.
d i = p i p i 1 ,   d i + = p i + 1 p i
In Equation (1), d i and d i + denote the Euclidean distances from the current point p i to the previous point p i 1 and the next point p i   +   1 , respectively. If both distances simultaneously exceed the distance threshold τ d ,
d i > τ d   d i + > τ d  
Points p i   that satisfy the condition in (2) are designated as outlier candidates.
Then, for each candidate point, we define the angle formed with its two neighboring points p i 1   and p i   +   1   as follows.
θ i = cos 1 p i p i 1 p i p i 1 · p i + 1 p i p i + 1 p i
In Equation (3), θ i represents the local change in boundary curvature. Using the angle threshold τ θ , we keep the point when the contour is smoothly connected ( θ i τ θ ), and remove it only when a sharp bend is observed ( θ i < τ θ ).

3.2. Rectangle Fitting

In this study, we approximate the vehicle’s 2D boundary as a rectangle, define the corresponding rectangle parameters, and estimate them by solving a nonlinear least-squares optimization problem.
x = c x c y θ w h
Equation (4) represents the rectangle parameters, where c = [ c x , c y ]   is the rectangle center, θ is the rotation angle, and w and h denote the width and height, respectively. The filtered point p i = [ x i , y i ] is transformed into the rectangle’s local coordinate frame as p i = [ x i , y i ] , defined as follows.
p i = R ( θ ) ( p i c ) , R ( θ ) = cos θ sin θ s i n θ cos θ
Therefore, it can be expressed as
x i = cos θ ( x i c x ) + sin θ ( y i c y ) , y i = sin θ ( x i c x ) + cos θ ( y i c y )
For each point, we select the closest one among the four rectangle edges (edge index k i { 0,1 , 2,3 } ) and construct the corresponding residual. Here, k i { 0,2 } corresponds to the width-direction edges (left/right), and k i { 1,3 } corresponds to the height-direction edges (top/bottom). The residual is defined in a normalized form such that it becomes zero on the boundaries | x i | = w / 2 or | y i | = h / 2 , as follows.
d i ( x ) = x i w 2 1 4 , k i { 0,2 } , y i h 2 1 4 , k i { 1,3 } .
Assuming a Gaussian error model with standard deviation σ , the nonlinear least-squares problem is formulated as follows.
m i n x   E ( x ) , E ( x ) = i = 1 N d i ( x ) 2 2 σ 2
To minimize (8), we adopt the Gauss-Newton method. The gradient and Hessian of the objective function are approximated from the residual Jacobian as follows.
E ( x ) = i = 1 N 2 σ 2 d i ( x ) d i ( x ) , H o b j ( x ) i = 1 N 2 σ 2 d i ( x ) d i ( x ) .
Here, d i x   is composed of the partial derivatives with respect to the local coordinates x i y i and the rectangle parameters c x c y θ w h . It is computed in a case-wise manner depending on whether each point is associated with a width-direction edge or a height-direction edge.
In addition, to stabilize the updates so that the points remain within the rectangle and to mitigate abnormal shrinkage or growth of the width and height parameters during iterations, we do not introduce any additional penalty term into the objective function. Instead, we apply a Hessian-level augmentation term constructed from the gradients of boundary-related functions. Specifically, for each point, we define the following boundary-related functions.
g x , i ( x ) = x i w 2 1 4 , g y , i ( x ) = y i h 2 1 4 .
These functions are normalized such that they become zero on the rectangle boundaries | x i | = w / 2 or | y i | = h / 2 . They are designed so that g ( x ) < 0 when a point lies inside the rectangle and g ( x ) > 0 when it lies outside. Therefore, the sign of g ( x ) indicates the inside/outside status, and | g ( x ) | reflects the relative degree of deviation from the boundary.
In addition, to modulate the convergence behavior of the update depending on the inside/outside status and the degree of deviation, we define a sign-inverted weight λ ( x ) as follows and use it in the Hessian augmentation term.
λ x , i x = sign g x , i x c l i p ( | g x , i ( x ) | w , λ m i n , λ m a x ) ,
λ y , i ( x ) = s i g n ( g y , i ( x ) ) c l i p ( | g y , i ( x ) | h , λ m i n , λ m a x )
Equations (11) and (12) limit the magnitude of the weight to the range λ m i n λ m a x to prevent numerical instability. By including s i g n ( g ) , the Hessian augmentation term is designed to act differently depending on whether a point lies inside the boundary ( g < 0 ) or outside ( g > 0 ). As a result, during iterative optimization, it suppresses updates that would excessively violate the boundary or drive abnormal deformation of the width and height parameters, thereby assisting more stable boundary alignment.
Accordingly, based on the above definitions, we add the following augmentation term to the Hessian.
H b n d ( x ) = 2 i = 1 N λ x , i ( x ) g x , i ( x ) g x , i ( x ) + λ y , i ( x ) g y , i ( x ) g y , i ( x )
In Equation (13), H b n d is an augmentation term that prevents the point cloud from excessively deviating beyond the rectangle boundary.
Next, to prevent excessive updates of w and h , we introduce an additional size-stabilization term. In the rectangle coordinate frame, let p x m a x = ( x m a x , y x m a x ) and p x m i n = ( x m i n , y x m i n ) denote the points attaining the maximum and minimum x i , respectively, and let p y m a x = ( x y m a x , y m a x ) and p y m i n = ( x y m i n , y m i n ) denote the points attaining the maximum and minimum y i , respectively. Accordingly, we define the point-cloud extents as
Δ x = | x m a x x m i n | , Δ y = | y m a x y m i n |
In Equation (14), x m a x and x m i n denote the maximum and minimum values of x i , respectively, and y m a x and y m i n denote the maximum and minimum values of y i , respectively.
To mitigate the tendency of the rectangle parameters to change excessively during the iterative process once the point cloud lies within the rectangle region, we construct the following vectors only when Δ x w or Δ y h .
j w ( x ) = 0 0 ϕ w ( x ) 1 0 , j h ( x ) = 0 0 ϕ h ( x ) 0 1
Here, ϕ w ( x ) and ϕ h ( x ) are defined as the partial derivatives with respect to θ , computed using the coordinate differences between the extremal points in the local frame:
ϕ w ( x ) = sin θ ( x m a x x m i n ) cos θ ( y x m a x y x m i n ) ,
ϕ h ( x ) = cos θ ( x y m a x x y m i n ) + sin θ ( y m a x y m i n )
Accordingly, the additional Hessian stabilization term is defined as follows.
H size ( x ) = 2 ( j w ( x ) j w ( x ) + j h ( x ) j h ( x ) )
Note that j w is activated only when Δ x w , and j h is activated only when Δ y h ; otherwise, the corresponding term is set to zero. These conditions indicate that the point cloud is sufficiently covered by the current w and h . In this case, H size limits excessive updates of w and h , thereby improving numerical stability.
Finally, the Hessian is given by the following.
H ( x ) H o b j ( x ) + H b n d ( x ) + H s i z e ( x )
The Gauss-Newton iterative update is computed as follows.
H ( x t ) Δ x t = E ( x t ) , x t 1 = x t + Δ x t
For the initial yaw, we consider two candidates, ψ 0 { 0 , 45 } , and initialize the size parameters from the min–max extent of the point cloud as w 0 = x m a x x m i n and h 0 = y m a x y m i n . For each initialization, we run the Gauss–Newton optimization and select the solution with the smaller final objective value as the final estimate.
Overall, the proposed optimization is based on a residual-driven Gauss–Newton update, while incorporating a boundary-state-based Hessian augmentation term ( H b n d ) and a size-stabilization term ( H s i z e ) to limit excessive updates near the boundary and in the w , h components, thereby ensuring numerical stability and robust convergence behavior.

3.3. Rectangle Matching

Finally, we compute accurate displacement and speed via matching between rectangles. This stage performs rectangle-to-rectangle alignment using the rectangles already fitted in each frame by the optimization procedure in Section 3.2, and does not rely on ICP-based registration of the full point cloud. In addition, the SVD-based alignment using two representative points can be interpreted as estimating the rotation R by aligning the direction vectors defined by the two point pairs, and then computing the translation t from the resulting rotation.
It is computed from the two point sets p and q as follows.
H = j = 1   N p ~ j q ~ j T , H = U Σ V T
R = V U T , t = q - R p -
In Equation (21), H denotes the cross-covariance matrix, and (22) computes the optimal rotation R by performing SVD on H .
For computational efficiency, we perform matching in 2D using only two points. The matching is carried out between two rectangles from consecutive frames, R k and R k 1 .
Under normal vehicle trajectories, the nearest corner does not switch to the opposite corner between adjacent frames; therefore, we consider only transitions to adjacent corners. First, for each rectangle, we define the corner closest to the LiDAR origin as a feature point, denoted by f k 1 and f k . Then, based on the relationship between the corner indices of the selected feature points in the two frames, we construct an auxiliary point s . Finally, the two-point pair f s is used as the representative set for matching.
First, when the selected corners in the two frames correspond to the same index, we determine the corner index of the auxiliary point s   from the corners adjacent to f according to a predefined rule (e.g., the next corner in the counterclockwise direction), as illustrated in Figure 1. The same indexing rule is applied to both rectangles to form the correspondence. In addition, to prevent unstable correspondence of the auxiliary point caused by differences in side lengths between the two rectangles, the auxiliary points s k and s k 1 are placed at the same relative position using the shorter of the two side lengths along the corresponding direction as a reference.
Second, when the selected corner indices differ between the two frames, we fix the feature point f k selected in frame k as the reference point f , as shown in Figure 2, and reassign the corresponding corner in frame k 1 as the auxiliary point s k 1 to form the representative pair f s . We then define the auxiliary point s k in frame k using the same adjacent-corner rule so that a consistent pair of representative points is obtained for both rectangles. Again, to reduce correspondence errors caused by differences in side lengths, the offsets of s k and s k 1 are normalized to the same relative position using the shorter of the two side lengths along the corresponding direction as the reference.
Using the two representative point sets constructed as above, we apply SVD-based alignment to estimate R k and t k , and then compute the 2D inter-frame displacement and speed from the resulting transformation.

4. Results

To validate the proposed algorithm, we conducted experiments at K-City, located in Hwaseong-si, Gyeonggi-do, Republic of Korea. The processing pipeline consisted of background subtraction via static point removal, clustering using density-based spatial clustering of applications with noise (DBSCAN) [25], and inter-frame association using the Hungarian algorithm (linear assignment) [26]; the proposed method was then applied as the final stage of the pipeline. The test site was a bidirectional five-lane intersection with a traffic island, and the infrastructure LiDAR was installed near roadside structures at approximately 6 m above the ground (Figure 3). The experimental setup included a 40-channel 3D LiDAR (Pandar40P, Hesai Technology Co., Ltd., Shanghai, China), an industrial PC (sourced from SignTelecom Co., Ltd., Seoul, Republic of Korea), a test vehicle (Genesis G70, Hyundai Motor Company, Seoul, Republic of Korea), and an RTK system (MRP-2000, MBC Broadcasting Co., Ltd., Seoul, Republic of Korea). Detailed specifications of the sensors and equipment are summarized in Table 1. The parameter settings used in the experiments are listed in Table 2.
The industrial PC used in the experiments was equipped with a GPU for potential future deep-learning-based classification; however, the speed estimation algorithm proposed in this paper was executed solely on the CPU without GPU acceleration.
Because the infrastructure LiDAR observes objects from a long distance, the shape of the acquired point cloud can vary significantly depending on the object’s position and orientation. Typically, objects are observed in the representative forms shown in Figure 4.
In Figure 4a, the vehicle is observed under nominal conditions with sufficient returns. Figure 4b illustrates a partial-observation case in which only the side surface is predominantly visible; nevertheless, the representative points are preserved and the rectangle remains stable. Figure 4c demonstrates that the proposed method can still fit a plausible rectangle even when both the front (bonnet) and rear (trunk) portions are occluded by another vehicle or an obstacle, as long as enough side-boundary evidence is retained. In contrast, Figure 4d shows a challenging case where the visible boundary becomes severely degraded (e.g., the side surface is largely missing), making the geometry underconstrained and resulting in an unstable or incorrect fit.
Following the qualitative fitting results, we conduct an ablation study to validate the numerical stabilization effect of the proposed Hessian-level augmentation. We compare the proposed method against a variant in which the Hessian stabilization terms are removed. A fitting failure is defined as the case where the Gauss–Newton optimization reaches the maximum number of iterations without convergence, or when the optimization encounters a numerical issue and the update cannot be computed. In addition, we report the average point-to-rectangle distance between the observed point cloud and the fitted rectangle to quantify fitting quality. As summarized in Table 3, removing the Hessian stabilization leads to higher failure rates and larger mean point-to-rectangle distances.
The system clocks of all devices were synchronized to a common reference time using NTP (Network Time Protocol) with a Google NTP server, and the logs were aligned based on the synchronized timestamps. To evaluate performance under various speed conditions, data were collected while driving along a straight segment at 30, 50, 70, and 90 km/h. In addition, to examine whether speed estimation remains feasible under high curvature, we included an extra scenario in which the vehicle passed the traffic island and then executed a sharp right turn. As the reference speed, we used the vehicle speed obtained from the CAN bus, while the RTK-based speed—computed by differentiating RTK position over time—was used as an auxiliary metric.
Performance evaluation was conducted only within the region of interest (ROI) of the infrastructure LiDAR because clustering and tracking reliability can degrade outside the ROI due to reduced point density at longer ranges. During data collection, other vehicles were either absent or present up to a maximum of two, located in front of/behind the target vehicle or in non-adjacent lanes. The ROI was defined as the region where the vehicle is continuously intersected by at least two LiDAR rings. Figure 5 illustrates the scenario trajectories and the LiDAR ROI.
In Figure 5, the blue marker indicates the LiDAR installation location, the red solid line denotes the ROI, the red dashed line represents the trajectory for the straight-driving scenario, and the green solid line represents the trajectory for the turning scenario.
The straight-line segment available within the ROI was approximately 90 m. Due to site constraints, each speed setting was tested once. The nominal speed settings were 30, 50, and 70 kph, as well as high-speed (~90 kph); for the high-speed run, the driver was asked to drive as fast as feasible, resulting in a larger speed variation.
Table 3 reports the root mean square error (RMSE) and mean absolute error (MAE) of the vehicle speed estimated by the proposed method with respect to the CAN-based reference speed.
Table 4 summarizes the speed estimation performance on the straight segment under driving conditions of 30, 50, 70, and ~90 kph. All metrics were computed using only the samples within the ROI, and the reference speed was obtained from the vehicle CAN. Under the tested speed conditions, the proposed method achieved an MAE of 0.76–1.37 kph and an RMSE of 0.90–1.58 kph.
Table 5 further compares the proposed method with a centroid-based baseline, in which the inter-frame correspondence is computed from the cluster centroid displacement while the rest of the pipeline is kept unchanged. The centroid baseline shows larger errors than the proposed method under the same ROI and CAN-referenced evaluation. The centroid baseline also results in wider 95% confidence intervals, which suggests higher variability and sensitivity under sparse and partially occluded observations. Temporal smoothing could be applied to either method; in this study, we report the unfiltered estimates to focus on the stability of the underlying displacement measurements.
Figure 6a–e present the speed estimation results for straight driving at 30, 50, 70, and ~90 kph and for the turning scenario, respectively, showing that the estimated speed generally follows the CAN-based reference. Figure 7 further summarizes the absolute error distributions for these scenarios using histograms. As speed increases, Figure 7 indicates a more dispersed error distribution with larger outliers; the corresponding histograms appear flatter, which may also be influenced by the relatively smaller sample size in high-speed segments.
The RTK-based speed is used only as an auxiliary reference and is obtained by time-differentiating the RTK positions. Because the RTK update rate is relatively low and measurements are missing in some segments, the resulting speed signal can be delayed and the number of valid samples can be limited. In this study, the RTK-based speed is computed as the displacement between two consecutive RTK positions divided by the corresponding time interval, and the errors with respect to RTK are summarized for reference in Table 6. Due to the limited number of valid RTK samples, we do not report confidence intervals for the RTK-referenced errors.
The errors with respect to RTK showed an MAE of 0.53–3.10 km/h and an RMSE of 0.62–3.75 km/h.
To assess cross-vehicle generalization, we conducted an additional experiment using the same cornering scenario with an SUV (Seltos, Kia Corporation, Seoul, Republic of Korea). Due to practical constraints, the additional evaluation was limited to a single scenario, and the vehicle was assessed using RTK only. The results are summarized in Table 7. Although the scope is limited, the proposed method did not exhibit catastrophic degradation when applied to a different vehicle type, indicating the potential for applicability beyond a single vehicle model.
In addition, to assess the real-time feasibility of the proposed algorithm, we measured the processing time of the tracking stage (association and speed estimation) using infrastructure LiDAR data collected at the Hwaseong intersection shown in Figure 8, excluding preprocessing steps such as background subtraction and clustering. All experiments were conducted on the same industrial PC.
The measurement was conducted during the evening rush hour when traffic volume was high, and the sample data, as shown in Figure 9, depict more than ten vehicles traveling through the scene.
The processing-time distribution of the tracking stage is shown in Figure 10. The maximum and average processing times were 34.736 ms and 18.458 ms, respectively, confirming real-time operation at the LiDAR rate of 10 Hz.

5. Discussion

Experimental results confirm that the proposed method consistently preserves representative points and robustly fits a rectangle even when the shape of point clusters varies due to changes in infrastructure LiDAR observation conditions. In particular, under geometrically challenging observation conditions—such as when the vehicle is observed from an elevated infrastructure viewpoint at longer ranges—the returned points often become sparse and biased toward a side view, producing point clouds that frequently resemble a single line segment. In addition, the method remains effective when the target vehicle is partially occluded by other objects or obstacles. Even in such cases, the proposed approach maintains the representative points and fits a rectangle, thereby improving the stability of speed estimation. Moreover, the proposed method reduces the computational cost of matching by leveraging rectangle-based representative points, while achieving high speed accuracy with a low error of approximately 1 kph in terms of MAE. These results demonstrate that the proposed method can stably estimate vehicle speed across the tested speed and trajectory scenarios, even when the object is observed in sparse regions where only about two LiDAR rings are returned, as in our experimental setup. In addition, a supplemental experiment using an SUV (Kia Seltos) was included, and the method did not show catastrophic degradation under a limited vehicle-type change. The tracking stage also achieved real-time performance during high-traffic commuting hours, with an average processing time of approximately 18 ms.
While the CAN-referenced errors tend to increase with speed, the RTK-referenced errors do not necessarily follow a monotonic trend. This difference is mainly because the RTK-based speed is obtained by time-differentiating discrete position measurements, which amplifies position noise and becomes sensitive to the RTK update rate and missing samples. In addition, when the vehicle speed is not perfectly constant within the ROI (e.g., slight acceleration/deceleration), the differencing-based RTK speed can reflect an interval-averaged value over the time step, whereas the CAN signal is closer to an instantaneous speed reference. Therefore, the RTK-referenced errors can appear to increase and then decrease across nominal speed settings, and we report RTK results only as an auxiliary reference.
In future work, we plan to extend the proposed algorithm to dynamic sensing environments, including not only fixed infrastructure LiDAR but also moving sensors such as vehicle-mounted LiDAR.

6. Conclusions

In this paper, we proposed a method for estimating vehicle speed in an infrastructure LiDAR environment by modeling a vehicle point cloud as a rectangle and performing computationally efficient, feature-point-based matching. Experiments were conducted at a designated intersection in K-City, demonstrating that the proposed method can stably fit a rectangle while consistently preserving representative points and predicting vehicle speed even under sparse point-cloud conditions comparable to our test environment, where only about two LiDAR rings are observed. The method remained effective even when the target vehicle was partially occluded by other objects or obstacles, and a supplemental experiment using an SUV (Kia Seltos) did not show catastrophic degradation under a limited vehicle-type change. The reference speed was obtained from the vehicle’s instrument-cluster speed via CAN communication, and the error between the CAN-based speed and the estimated speed achieved approximately 1 kph in terms of MAE. We also measured the per-object processing time to verify the real-time feasibility of the proposed algorithm. In future work, we plan to extend the method to dynamic sensing environments, enabling speed estimation even when the sensor is moving, such as in vehicle-mounted LiDAR systems.

Author Contributions

Conceptualization, I.H. and M.P.; methodology, I.H.; formal analysis, I.H.; investigation, I.H.; writing—original draft preparation, I.H.; writing—review and editing, M.P.; supervision, M.P.; project administration, M.P.; funding acquisition, M.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Trade, Industry and Energy (MOTIE, Republic of Korea) (Project Name: Development of Fail-Operation technology in Lv.4 autonomous driving systems; Project Number: 20018055), as well as by the Korea Agency for Infrastructure Technology Advancement (KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (Grant RS-2021-KA160501).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to restrictions associated with the funded project and data-sharing policies.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wu, J.; Xu, H.; Sun, Y.; Zheng, J.; Yue, R. Automatic Background Filtering Method for Roadside LiDAR Data. Transp. Res. Rec. 2018, 2672, 106–114. [Google Scholar] [CrossRef]
  2. Zhang, T.; Jin, P.J. Roadside LiDAR Vehicle Detection and Tracking Using Range and Intensity Background Subtraction. J. Adv. Transp. 2022, 2022, 2771085. [Google Scholar] [CrossRef]
  3. Chen, Z.; Xu, H.; Zhao, J.; Liu, H. A Novel Background Filtering Method with Automatic Parameter Adjustment for Real-Time Roadside-LiDAR Sensing System. IEEE Trans. Instrum. Meas. 2023, 72, 5022310. [Google Scholar] [CrossRef]
  4. Chen, J.; Tian, S.; Xu, H.; Yue, R.; Sun, Y.; Cui, Y. Architecture of Vehicle Trajectories Extraction with Roadside LiDAR Serving Connected Vehicles. IEEE Access 2019, 7, 100406–100415. [Google Scholar] [CrossRef]
  5. Zhang, J.; Xiao, W.; Coifman, B.; Mills, J.P. Vehicle Tracking and Speed Estimation From Roadside Lidar. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5597–5608. [Google Scholar] [CrossRef]
  6. Lin, C.; Sun, K.; Wu, D.; Xie, Y. Vehicle Detection and Tracking with Roadside LiDAR Using Improved ResNet18 and the Hungarian Algorithm. Sensors 2023, 23, 8143. [Google Scholar] [CrossRef]
  7. Berviller, C.; Rousseau, J.; Gruyer, D.; Lambert, A. Road User Position and Speed Estimation via Deep Learning from Calibrated Fisheye Videos. Sensors 2023, 23, 2637. [Google Scholar] [CrossRef]
  8. Macko, A.; Gajdošech, L.; Kocur, V. Efficient Vision-Based Vehicle Speed Estimation. J. Real-Time Image Process. 2025, 22, 123. [Google Scholar] [CrossRef]
  9. Sochor, J.; Juránek, R.; Herout, A. Traffic Surveillance Camera Calibration by 3D Model Bounding Box Alignment for Accurate Vehicle Speed Measurement. Comput. Vis. Image Underst. 2017, 161, 87–98. [Google Scholar] [CrossRef]
  10. Zhao, J.; Xu, H.; Liu, H.; Wu, J.; Zheng, Y.; Wu, D. Detection and tracking of pedestrians and vehicles using roadside LiDAR sensors. Transp. Res. Part C Emerg. Technol. 2019, 100, 68–87. [Google Scholar] [CrossRef]
  11. Li, S.; Yoon, H.-S. Sensor Fusion-Based Vehicle Detection and Tracking Using a Single Camera and Radar at a Traffic Intersection. Sensors 2023, 23, 4888. [Google Scholar] [CrossRef] [PubMed]
  12. Lin, C.; Wang, Y.; Gong, B.; Liu, H. Vehicle Detection and Tracking Using Low-Channel Roadside LiDAR. Measurement 2023, 218, 113159. [Google Scholar] [CrossRef]
  13. Liu, B.; Li, Y.; Li, D.; Chen, L.; Zhang, J. A Vision-Based Pipeline for Vehicle Counting, Speed Estimation, and Classification. IEEE Trans. Intell. Transp. Syst. 2021, 22, 7547–7560. [Google Scholar] [CrossRef]
  14. Caillot, A.; Ouerghi, S.; Vasseur, P.; Boutteau, R.; Dupuis, Y. Survey on Cooperative Perception in an Automotive Context. IEEE Trans. Intell. Transp. Syst. 2022, 23, 14204–14223. [Google Scholar] [CrossRef]
  15. Huang, Y.; Qian, L.; Feng, A.; Wu, Y.; Zhu, W. RFID Data-Driven Vehicle Speed Prediction via Adaptive Extended Kalman Filter. Sensors 2018, 18, 2787. [Google Scholar] [CrossRef]
  16. Wang, J.; Ni, D.; Li, K. RFID-Based Vehicle Positioning and Its Applications in Connected Vehicles. Sensors 2014, 14, 4225–4238. [Google Scholar] [CrossRef]
  17. Zhang, Y.; Luo, Q.; Zhang, S.; Liu, H.; Xie, S. Anchor-free real-time vehicle tracking using LiDAR data. PLoS ONE 2025, 20, e0323287. [Google Scholar] [CrossRef]
  18. Besl, P.J.; McKay, N.D. A Method for Registration of 3-D Shapes. IEEE Trans. Pattern Anal. Mach. Intell. 1992, 14, 239–256. [Google Scholar] [CrossRef]
  19. Biber, P.; Straßer, W. The Normal Distributions Transform: A New Approach to Laser Scan Matching. In Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 27–31 October 2003; IEEE: New York, NY, USA, 2003; pp. 2743–2748. [Google Scholar] [CrossRef]
  20. Magnusson, M.; Lilienthal, A.; Duckett, T. Scan Registration for Autonomous Mining Vehicles Using 3D-NDT. J. Field Robot. 2007, 24, 803–827. [Google Scholar] [CrossRef]
  21. Rusinkiewicz, S.; Levoy, M. Efficient Variants of the ICP Algorithm. In Proceedings of the Third International Conference on 3-D Digital Imaging and Modeling, Quebec City, QC, Canada, 28 May–1 June 2001; IEEE: New York, NY, USA, 2001; pp. 145–152. [Google Scholar] [CrossRef]
  22. Zhang, J.; Xiao, W.; Mills, J.P. Optimizing Moving Object Trajectories from Roadside Lidar Data by Joint Detection and Tracking. Remote Sens. 2022, 14, 2124. [Google Scholar] [CrossRef]
  23. Zhang, X.; Xu, W.; Dong, C.; Dolan, J.M. Efficient L-shape fitting for vehicle detection using laser scanners. In Proceedings of the 2017 IEEE Intelligent Vehicles Symposium (IV), Los Angeles, CA, USA, 11–14 June 2017; IEEE: New York, NY, USA, 2017; pp. 54–59. [Google Scholar] [CrossRef]
  24. Zhao, C.; Fu, C.; Dolan, J.M.; Wang, J. L-shape fitting-based vehicle pose estimation and tracking using 3D-LiDAR. IEEE Trans. Intell. Veh. 2021, 6, 787–798. [Google Scholar] [CrossRef]
  25. Ester, M.; Kriegel, H.-P.; Sander, J.; Xu, X. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, OR, USA, 2–4 August 1996; AAAI Press: Washington, DC, USA, 1996; pp. 226–231. [Google Scholar]
  26. Kuhn, H.W. The Hungarian method for the assignment problem. Nav. Res. Logist. Q. 1955, 2, 83–97. [Google Scholar] [CrossRef]
Figure 1. Selection of points on the rectangle for matching when the feature points are identical.
Figure 1. Selection of points on the rectangle for matching when the feature points are identical.
Applsci 16 02513 g001
Figure 2. Selection of points on the rectangle for matching when the feature points are different.
Figure 2. Selection of points on the rectangle for matching when the feature points are different.
Applsci 16 02513 g002
Figure 3. LiDAR sensor used in the experiments. The red rectangle is the LiDAR sensor.
Figure 3. LiDAR sensor used in the experiments. The red rectangle is the LiDAR sensor.
Applsci 16 02513 g003
Figure 4. Detected objects and rectangle fitting results from the infrastructure LiDAR: (a) full observation; (b) partial observation; (c) successful fitting under severe front–rear occlusion; (d) failure when the side surface is additionally occluded. The left red circle is the target object.
Figure 4. Detected objects and rectangle fitting results from the infrastructure LiDAR: (a) full observation; (b) partial observation; (c) successful fitting under severe front–rear occlusion; (d) failure when the side surface is additionally occluded. The left red circle is the target object.
Applsci 16 02513 g004
Figure 5. Map showing the test scenarios, LiDAR location, and the ROI. The arrows indicate the test scenarios.
Figure 5. Map showing the test scenarios, LiDAR location, and the ROI. The arrows indicate the test scenarios.
Applsci 16 02513 g005
Figure 6. Speed estimation results: (a) straight driving at 30 kph; (b) straight driving at 50 kph; (c) straight driving at 70 kph; (d) straight driving at ~90 kph; (e) turning scenario.
Figure 6. Speed estimation results: (a) straight driving at 30 kph; (b) straight driving at 50 kph; (c) straight driving at 70 kph; (d) straight driving at ~90 kph; (e) turning scenario.
Applsci 16 02513 g006aApplsci 16 02513 g006b
Figure 7. Error distributions of the speed estimation results: (a) straight driving at 30 kph; (b) straight driving at 50 kph; (c) straight driving at 70 kph; (d) straight driving at ~90 kph; (e) turning scenario.
Figure 7. Error distributions of the speed estimation results: (a) straight driving at 30 kph; (b) straight driving at 50 kph; (c) straight driving at 70 kph; (d) straight driving at ~90 kph; (e) turning scenario.
Applsci 16 02513 g007aApplsci 16 02513 g007bApplsci 16 02513 g007c
Figure 8. Infrastructure LiDAR installation used for processing-time analysis. The red rectangle is the LiDAR sensor.
Figure 8. Infrastructure LiDAR installation used for processing-time analysis. The red rectangle is the LiDAR sensor.
Applsci 16 02513 g008
Figure 9. Sample PCD acquired during the experiment time period.
Figure 9. Sample PCD acquired during the experiment time period.
Applsci 16 02513 g009
Figure 10. Processing-time distribution of the tracking algorithm.
Figure 10. Processing-time distribution of the tracking algorithm.
Applsci 16 02513 g010
Table 1. Specifications of the equipment used in the experiments.
Table 1. Specifications of the equipment used in the experiments.
Specification
Industrial PCCPU: Intel® Core™ i7-10700 (8 cores/16 threads)
RAM: 32 GB DDR4
GPU: NVIDIA GeForce RTX 4060 (8 GB VRAM)
OS: Ubuntu 20.04.6 LTS (Linux kernel 5.15.0-139-generic)
Pandar40P LiDAROperating principle: Time of Flight (ToF)
Scanning method: Mechanical rotation
Number of channels: 40
Measurement range: 0.3 m to 200 m (@ 10% reflectivity)
Returns (configurable): Single/Dual return (Strongest, Last)
Rotation rate: 10 Hz/20 Hz
Vertical field of view (FOV): 40° (−25° to +15°)
Horizontal field of view (FOV): 360°
Horizontal angular resolution: 0.2° (@ 10 Hz)/0.4° (@ 20 Hz)
RTK GNSS Receiver (MRP-2000)Product type: RTK GNSS receiver (LTE/DMB correction)
GNSS signals: GPS L1/L2; GLONASS L1/L2; BeiDou B1/B2; Galileo E1/E5; QZSS L1/L2
Correction input: DMB/LTE (RTCM 3.x)
Output rate: 8 Hz (multi-GNSS)/20 Hz (GPS-only)
RTK positioning accuracy: Horizontal 0.010 m + 1 ppm (CEP); Vertical 0.010 m + 1 ppm (CEP)
Note: ToF, time of flight; FOV, field of view; GNSS, global navigation satellite system; DMB, Digital Multimedia Broadcasting; CEP, circular error probable.
Table 2. Parameters used in the experiments.
Table 2. Parameters used in the experiments.
Specification
Point-Cloud Filtering Bin:   a n g l e = 0.2 ° ,   d i s t a n c e = 5.0   m
τ d = 0.5
τ θ = 25 °
Rectangle Fitting σ = 1
Max iter = 30
λ m i n = 1
λ m a x = 30
ψ 0 = 0 °   a n d   45 °
Table 3. Ablation results (with vs. without Hessian stabilization).
Table 3. Ablation results (with vs. without Hessian stabilization).
Fail Rate (%)Mean Distance (m)Median (m)
With Hessian stabilization (Proposed)0.540.0320.028
Without Hessian stabilization15.080.2530.284
Table 4. Experimental results with CAN-based reference speed.
Table 4. Experimental results with CAN-based reference speed.
Straight-Line Driving ScenarioCornering Scenario
Speed (kph)305070~90 30
MAE (kph)0.7638580.8289840.9809991.3798281.041829
MAE 95% CI (kph)0.577–0.8100.828–0.8280.638–1.3421.094–1.6670.905–1.171
RMSE (kph)0.9057800.9592461.3856961.5883111.211845
RMSE 95% CI (kph)0.663–0.9560.959–0.9590.947–1.7581.288–1.8831.075–1.343
Samples9546292784
Table 5. Comparison with a centroid baseline.
Table 5. Comparison with a centroid baseline.
MAE (kph)MAE 95% CI (kph)RMSE (kph)RMSE 95% CI (kph)
Proposed1.0418290.905–1.1711.2118451.075–1.343
Centroid baseline1.4210901.346–1.8261.7948331.730–2.111
Table 6. Experimental results with RTK-based reference speed.
Table 6. Experimental results with RTK-based reference speed.
Straight-Line Driving ScenarioCornering Scenario
Speed (kph)305070~90 30
MAE (kph)0.5357351.1591363.1088441.3874830.827692
RMSE (kph)0.6220141.4291243.7507711.4140941.100518
Table 7. Experimental results with RTK-based reference speed for comparison.
Table 7. Experimental results with RTK-based reference speed for comparison.
MAE (kph)RMSE (kph)
G70 (RTK)0.8276921.100518
Seltos (RTK)0.7440310.893958
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hong, I.; Park, M. Vehicle Speed Estimation Using Infrastructure-Mounted LiDAR via Rectangle Edge Matching. Appl. Sci. 2026, 16, 2513. https://doi.org/10.3390/app16052513

AMA Style

Hong I, Park M. Vehicle Speed Estimation Using Infrastructure-Mounted LiDAR via Rectangle Edge Matching. Applied Sciences. 2026; 16(5):2513. https://doi.org/10.3390/app16052513

Chicago/Turabian Style

Hong, Injun, and Manbok Park. 2026. "Vehicle Speed Estimation Using Infrastructure-Mounted LiDAR via Rectangle Edge Matching" Applied Sciences 16, no. 5: 2513. https://doi.org/10.3390/app16052513

APA Style

Hong, I., & Park, M. (2026). Vehicle Speed Estimation Using Infrastructure-Mounted LiDAR via Rectangle Edge Matching. Applied Sciences, 16(5), 2513. https://doi.org/10.3390/app16052513

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop