Article

Rapid Motion Segmentation of LiDAR Point Cloud Based on a Combination of Probabilistic and Evidential Approaches for Intelligent Vehicles

1 Department of Smart Vehicle Engineering, Konkuk University, Seoul 05029, Korea
2 Department of Automotive Engineering, Hanyang University, Seoul 04763, Korea
* Author to whom correspondence should be addressed.
Sensors 2019, 19(19), 4116; https://doi.org/10.3390/s19194116
Submission received: 23 August 2019 / Revised: 16 September 2019 / Accepted: 18 September 2019 / Published: 23 September 2019
(This article belongs to the Special Issue 3D Point Clouds for Intelligent Road Transportation Systems)

Abstract

Point clouds from light detection and ranging (LiDAR) sensors represent increasingly important information for environmental object detection and classification in automated and intelligent vehicles. Objects in the driving environment can be classified as either dynamic or static depending on their movement characteristics, and a LiDAR point cloud can likewise be segmented into dynamic and static points based on the motion of the measured objects. The segmented motion information of a point cloud is useful for various functions in automated and intelligent vehicles. This paper presents a fast motion segmentation algorithm that segments a LiDAR point cloud into dynamic and static points in real-time. The segmentation algorithm classifies the motion of the latest point cloud based on the LiDAR's laser beam characteristics and the geometrical relationship between consecutive LiDAR point clouds. To estimate the motion state of each LiDAR point accurately and reliably under measurement uncertainty, both probability theory and evidence theory are employed in the segmentation algorithm. The probabilistic and evidential algorithm segments the point cloud into three classes: dynamic, static, and unknown. Points are placed in the unknown class when the LiDAR point cloud is not sufficient for motion segmentation. The point motion segmentation algorithm was evaluated quantitatively and qualitatively through experimental comparisons with previous motion segmentation methods.

1. Introduction

LiDAR systems are rapidly becoming an integral part of automated and intelligent vehicles for environmental awareness. The price of LiDAR sensors is decreasing, and automakers are increasingly installing LiDAR in production vehicles for advanced intelligent functions [1,2]. LiDAR measures the distance and direction of the surrounding environment by emitting laser pulses in certain directions and measuring the time-of-flight (ToF) of each laser pulse reflected by the environment. The directions and distances can be converted to a digital 3D representation called a point cloud to express the spatial information of the surrounding environment. Because LiDAR uses light waves, the measured point cloud can represent spatial information very accurately. In addition, LiDAR point clouds can be fused with data from other sensors, such as radars and cameras, to gain more meaningful information about the vehicle's environment.
All objects in the driving environment can be classified as dynamic or static according to their motion. Therefore, the LiDAR points measured on detected objects can also be classified point-wise as dynamic or static depending on the motion state of the object. Such point-wise classification of the point cloud can be used for safety and convenience functions in automated and intelligent vehicles. For instance, points classified as static are measured from the surfaces of static objects, such as curbs, poles, buildings, and parked vehicles. Such a static point cloud can be applied to various automated and intelligent driving functions, such as mapping, localization, and collision avoidance systems [3,4,5]. Points classified as dynamic are detected from objects moving above a certain speed, such as nearby vehicles, motorcycles, and pedestrians. These points can be used for object tracking or motion prediction, which are necessary functions for automated and intelligent vehicles in tasks such as autonomous emergency braking (AEB), lane keeping, traffic jam assistance, and adaptive cruise control (ACC) systems.
As these examples show, point cloud information classified according to motion is useful for automated and intelligent vehicles. This paper proposes an algorithm to rapidly segment the motion states of a point cloud detected by LiDAR in real-time. The overall process of the proposed algorithm is shown in Figure 1. The rapid motion segmentation algorithm takes as inputs the LiDAR's 3D point cloud and the 3D pose (position and orientation) of the LiDAR sensor. The sensor pose can be estimated from an inertial measurement unit (IMU) or the vehicle's on-board motion sensors (such as wheel speed sensors or a steering angle sensor). Point motion segmentation is then performed by applying the laser beam characteristics to the pose relationship between consecutive LiDAR point clouds. A combination of probability theory and evidence theory is applied to accurately and reliably update the motion state of points. The algorithm performs point-wise segmentation of the point cloud into three states: dynamic, static, and unknown. Dynamic points are detected from objects moving above a certain speed, and static points are detected from stationary objects. If there are insufficient consecutive LiDAR point clouds for motion classification, points are classified as unknown. The performance of the proposed algorithm was evaluated quantitatively and qualitatively through comparison with existing methods.
This research has three main contributions: (1) reflecting the laser characteristics of LiDAR, (2) applying a combination of probabilistic and evidential approaches to update the motion state of points, and (3) online motion updating for real-time applications. This paper focuses on the characteristics of lasers, such as multi-echo, beam divergence, and horizontal and vertical resolution, so that it can segment the motion of points more accurately than existing algorithms, such as occupancy grid mapping. In addition, when updating the state information, a combination of probabilistic and evidential modeling is applied to more accurately reflect the actual LiDAR characteristics and update motion in a point-wise manner. Because all of the proposed updating processes run in real-time, they are suitable for real-time application in automated and intelligent vehicle systems.

2. Previous Studies

With the development of autonomous vehicles, LiDAR is being more widely used, and many studies of LiDAR are accordingly being conducted. However, there have not been many studies that classify the point-wise motion of LiDAR measurements themselves in real-time. In the field of autonomous and intelligent vehicle systems, studies related to the proposed algorithm aim to generate static environment maps or remove nonstatic points based on tracking results.
An occupancy grid map is a typical method for generating a static environment map using LiDAR measurements. The occupancy grid map divides the environment around the vehicle into 2D grid or 3D voxel cells of uniform size. The occupancy level of each cell can be updated through ray tracing of the laser beams. The occupancy level of a grid (or voxel) cell that the laser passes through becomes lower because the physical space of the cell is likely to be free. Conversely, the levels of cells located on a reflecting surface become higher. Based on these principles, static objects are classified as occupied when the occupancy level exceeds a certain threshold. In contrast, for a moving object, the occupancy level of the cell is not accumulated consistently, so it is not classified as occupied. The numerical value of each cell's occupancy level is updated based on probability theory [6] or belief theory [7]. Static maps in large-scale traffic environments have been constructed using LiDAR sensors with probabilistic and belief approaches [8,9,10]. Moras et al. presented an occupancy grid framework that generates a global static map and classifies local moving objects simultaneously [11,12,13]. Classification of traffic objects (such as vehicles, pedestrians, road curbs, and poles) has also been used to classify the motion of a point cloud [14,15,16].
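For context, the occupancy level in these grid-based methods is typically maintained as a per-cell log-odds value updated by an inverse sensor model [6]. The following Python sketch illustrates that standard recursion; the probabilities `p_hit` and `p_miss` are illustrative placeholder values, and this update belongs to the baseline approaches, not to the proposed algorithm.

```python
import math

def log_odds(p):
    """Convert a probability to log-odds form."""
    return math.log(p / (1.0 - p))

def update_cell(cell_log_odds, is_hit, p_hit=0.7, p_miss=0.4):
    """Standard log-odds occupancy update for a cell traversed by a laser ray [6].

    Cells containing a return (is_hit=True) accumulate occupancy evidence, while
    cells the ray merely passes through accumulate free-space evidence. The values
    p_hit and p_miss are placeholder inverse-sensor-model parameters.
    """
    return cell_log_odds + log_odds(p_hit if is_hit else p_miss)
```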
The advantages of occupancy grid-based static point cloud classification are that its implementation is relatively straightforward and its performance is stable because it has been studied for a long time in various applications. However, occupancy grid mapping has several disadvantages for use in real-time automated and intelligent vehicle applications. A large amount of memory is required because the driving environment must be represented by grid or voxel cells. Also, ray tracing takes a long time because all cells related to every LiDAR beam must be updated. In addition, because space is represented by discrete cells, a discretization error occurs when the resolution is coarse.
Research on the detection and tracking of moving objects using LiDAR has been conducted to recognize the driving environment of automated and intelligent vehicles [17,18]. Object detection algorithms detect surrounding objects by clustering the point cloud and generate a bounding box for each detected object. Tracking algorithms generate tracks for detected objects to estimate their position, direction, velocity, and acceleration. Using the tracking results, the motion of a point cloud can be classified into dynamic and static states: points in a track bounding box above a certain speed are classified as dynamic, and the remaining points are classified as static. This tracking-based point motion classification is straightforward, and the tracking results can be reused. However, it has some limitations. In the object detection step, clustering groups the detected points belonging to the same object. Although many clustering methods have been studied, it is difficult to obtain accurate results using point cloud information alone, and incorrect clustering causes incorrect point motion classification. In addition, because tracking requires an initialization time to generate a new track, it struggles to satisfy real-time motion classification. Furthermore, because it is difficult for a tracker to accurately estimate the speed of slow objects, the point motion of objects near the threshold speed is likely to be misclassified.
The point motion classification algorithm presented in this paper has many advantages over previously proposed ones. First, the proposed method directly segments the point cloud into dynamic and static states using the laser beam model. Therefore, there is no misclassification due to the discretization error of the occupancy grid method or the erroneous clustering of the tracking-based method. In addition, the proposed method does not require a large amount of memory like the occupancy grid approach, because it simply buffers recent point clouds. Finally, the proposed method is able to satisfy the real-time requirements of point motion classification because it does not need to update all of the grid cells or initialize new tracks.

3. System Architecture

The objective of the proposed point motion segmentation algorithm is to classify the latest LiDAR points, $z_t^{n,m}$, into motion states, $motion_t^{n,m}$, in real-time. The notation of the LiDAR point and its motion state is $z_{time\ index}^{order}$ and $motion_{time\ index}^{order}$, respectively. $n$ describes the index of the laser beam from 1 to $N$. $m$ represents the order of the multi-echo of the laser beam and usually has a value of 2 or less. $t$ represents the time of the LiDAR scan measurement. The motion state has three possible values: $motion = \{dynamic, static, unknown\}$. The dynamic state indicates points detected on moving objects, and the static state represents points detected on stationary objects. The unknown state means that there is not sufficient evidence to classify the motion state as dynamic or static.
The basic principle of the proposed algorithm is to classify the motion state of the current point cloud by applying the laser characteristic model to the registration relationship between the previously measured point clouds and the current point cloud. The inputs of the proposed algorithm are the current point cloud, $Z_t = \{z_t^{1,m}, z_t^{2,m}, \ldots, z_t^{N,m}\}$; the previously buffered point clouds, $Z_{t-1}, \ldots, Z_{t-W+1}, Z_{t-W}$; and the sensor poses, $x_t, x_{t-1}, \ldots, x_{t-W+1}, x_{t-W}$, for each point cloud. $W$ denotes the time window size of the previous data buffer to be used for motion classification of the current point cloud. There are several methods for obtaining the sensor poses $x_t, \ldots, x_{t-W}$, such as inertial measurement unit (IMU) dead reckoning, scan matching, high-definition (HD) map-based localization, and simultaneous localization and mapping (SLAM). To avoid loss of generality, we assume that the sensor's pose and its uncertainty are provided. The output of the algorithm is the motion state $Motion_t = \{motion_t^{1,m}, motion_t^{2,m}, \ldots, motion_t^{N,m}\}$ of $Z_t = \{z_t^{1,m}, z_t^{2,m}, \ldots, z_t^{N,m}\}$.
The point motion segmentation algorithm consists of two steps: (1) probabilistic modeling of point motion and (2) evidential point motion classification. In the first step, the probability of $Motion_{(t-k)t}$, which is the motion classification of $Z_t$ against $Z_{t-k}$, is updated. Figure 2 illustrates the concept of the probability update of $Motion_{(t-k)t}$ based on the geometrical relationship between $(Z_t, x_t)$ and $(Z_{t-k}, x_{t-k})$. The likelihood field of the motion can be updated using $(Z_{t-k}, x_{t-k})$ and the characteristics of the laser (such as beam divergence and multi-echo). The dynamic probability for $z_t^{m,1}$ and $z_t^{m+3,1}$ will be higher when they are located in the path of the laser (green region) for the previous point cloud $Z_{t-k}$, and the static probability for $z_t^{m+2,1}$ will be higher if it is located near a previous point (red region).
In the second step, the probabilities of each motion classification $Motion_{(t-1)t}, \ldots, Motion_{(t-W)t}$ are integrated to estimate the final motion classification $Motion_t$, as shown in Figure 3. However, the probability of $Motion_{(t-k)t}$ cannot be updated by previous points if the current points are not in the likelihood field of the previous points. In this case, the motion should be classified as unknown. Because unknown cannot be expressed clearly using probability theory, evidence theory, which can handle the unknown state explicitly, is employed. The probabilities of $Motion_{(t-1)t}, \ldots, Motion_{(t-W)t}$ are converted into masses (degrees of belief) with consideration of the LiDAR and sensor pose uncertainty and then integrated into a mass of $Motion_t$ using Dempster's combination rule. The motion states $Motion_t = \{motion_t^{1,m}, motion_t^{2,m}, \ldots, motion_t^{N,m}\}$ of each point in $Z_t = \{z_t^{1,m}, z_t^{2,m}, \ldots, z_t^{N,m}\}$ are determined using the integrated mass of $Motion_t$.
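To make the data flow of this section concrete, the sketch below shows one way the W-scan buffer of point clouds and sensor poses could be organized; the variable names and the buffering strategy are illustrative, not a prescribed implementation.

```python
from collections import deque

W = 50  # time window size (number of buffered scans); W = 50 gave the best accuracy in the experiments

# Each entry of the buffer stores one previous scan together with the sensor pose at which it was taken.
scan_buffer = deque(maxlen=W)  # elements: (Z, x) = (point cloud, sensor pose)

def push_scan(Z, x):
    """Buffer the latest scan after it has been segmented; the oldest scan is dropped automatically."""
    scan_buffer.append((Z, x))

def previous_scans():
    """Yield (k, Z_{t-k}, x_{t-k}), newest first, for motion classification of the current scan."""
    for k, (Z_prev, x_prev) in enumerate(reversed(scan_buffer), start=1):
        yield k, Z_prev, x_prev
```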

4. Probabilistic Modeling of LiDAR Point Motion

4.1. Characteristics of LiDAR Point Cloud

LiDAR uses rotating laser beams to measure the distances and angles of surrounding objects. A laser pulse is emitted at a specific angle, and the distance to the object at that angle can be measured using the time-of-flight (ToF) principle, as demonstrated in Figure 4. The ToF is the difference between the time the laser pulse is emitted from the diode and the time it returns after being reflected by the object. The distance is calculated by multiplying half of this time by the speed of light. Using the horizontal and vertical emission angles and the measured distances, 3D information of the surrounding objects can be reconstructed in the form of point data.
The actual LiDAR laser is not emitted as an ideal straight line, as shown in Figure 4a. The laser exhibits "beam divergence," which increases the beam's cross section with distance. Because of this characteristic, the farther an object is from the laser source, the wider the area over which it can be detected. In addition, beam divergence enables multi-echo returns of the emitted laser pulse, as shown in Figure 4b, and the multi-echo allows simultaneous measurement of distances to several objects. Another important characteristic of LiDAR is the uncertainty of the distance and angular measurements. Despite the use of lasers, the distance measurement is not infinitely accurate; its accuracy depends on how precisely the return time of the laser pulse can be measured. In contrast, the angular accuracy (vertical and horizontal) is determined by the discrete emission angles, because the LiDAR controls and configures the emission angle. Many previous studies that used LiDAR measurements did not properly account for these characteristics (i.e., beam divergence and distance uncertainty); they treated LiDAR measurements as points with no volume and constant 3D Gaussian uncertainty. To classify LiDAR point motion with high accuracy and reliability, the proposed algorithm explicitly reflects these characteristics of LiDAR.
The LiDAR point cloud measurement $Z_t = \{z_t^{1,m}, z_t^{2,m}, \ldots, z_t^{N,m}\}$ must be representable in both spherical and Cartesian coordinates for processing by the point motion segmentation algorithm. The measurement can be represented in spherical coordinates as $Z_{r\theta\phi,t} = \{z_{r\theta\phi,t}^{1,m}, \ldots, z_{r\theta\phi,t}^{i,m}, \ldots, z_{r\theta\phi,t}^{N,m}\}$. The point $z_{r\theta\phi,t}^{i,m}$ is represented as $z_{r\theta\phi,t}^{i,m} = \{r_t^{i,1}, \ldots, r_t^{i,m}, \theta_t^{i}, \phi_t^{i}\}$, where $r$ is the distance to the point for each echo, $\theta$ is the vertical (polar) angle of the point, $\phi$ is the horizontal (azimuthal) angle of the point, and $m$ is the number of echoes. The Cartesian coordinate representation is $Z_{xyz,t} = \{z_{xyz,t}^{1,m}, \ldots, z_{xyz,t}^{i,m}, \ldots, z_{xyz,t}^{N,m}\}$, where $z_{xyz,t}^{i,m} = \{x_t^{i,m}, y_t^{i,m}, z_t^{i,m}\}$.
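For reference, the two representations can be converted with a few lines of code. The sketch below assumes $\theta$ is the elevation angle measured from the sensor's horizontal plane and $\phi$ the azimuth, both in radians; if the polar angle is measured from the zenith instead, the trigonometry must be adjusted accordingly.

```python
import numpy as np

def spherical_to_cartesian(r, theta, phi):
    """Convert LiDAR range/angle measurements to Cartesian coordinates (elevation-angle convention)."""
    x = r * np.cos(theta) * np.cos(phi)
    y = r * np.cos(theta) * np.sin(phi)
    z = r * np.sin(theta)
    return np.stack([x, y, z], axis=-1)

def cartesian_to_spherical(x, y, z):
    """Inverse conversion, used when re-expressing the current points in the previous sensor frame."""
    r = np.sqrt(x**2 + y**2 + z**2)
    theta = np.arcsin(z / np.maximum(r, 1e-9))  # elevation angle
    phi = np.arctan2(y, x)                      # azimuth angle
    return r, theta, phi
```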

4.2. Probabilistic Modeling for LiDAR Point Motion

The point motion segmentation classifies the latest $N$ LiDAR points $z_t^{1,m}, z_t^{2,m}, \ldots, z_t^{N,m}$ into the motion states $motion_t^{1,m}, motion_t^{2,m}, \ldots, motion_t^{N,m}$, respectively, in real-time, where $motion$ consists of three states $\{dynamic, static, unknown\}$. The motion state of the latest point cloud is segmented based on the registration relationship with the previously buffered point clouds. In other words, the latest point cloud $Z_t = \{z_t^{1,m}, z_t^{2,m}, \ldots, z_t^{N,m}\}$ is segmented into $Motion_t = \{motion_t^{1,m}, motion_t^{2,m}, \ldots, motion_t^{N,m}\}$ based on the $W$ buffered point clouds $Z_{t-1}, \ldots, Z_{t-W+1}, Z_{t-W}$ and the sensor poses $x_t, x_{t-1}, \ldots, x_{t-W+1}, x_{t-W}$ of each point cloud. The sensor poses $x_t, \ldots, x_{t-W}$ can be obtained using several methods, such as IMU dead reckoning, scan matching, HD map-based localization, and SLAM. However, the proposed algorithm assumes that the sensor's pose information and its uncertainty are abstracted regardless of the type of pose estimation method.
Probabilistic motion modeling of the point cloud $Z_t$ can be performed using the LiDAR sensor pose $x_t$ of $Z_t$ together with a previously detected LiDAR point cloud $Z_{t-k}$ and its sensor pose $x_{t-k}$, as shown in Figure 2. The probabilistic motion model of $Z_t$ with respect to $x_t$, $Z_{t-k}$, and $x_{t-k}$ can be described as $p(Motion_{(t-k)t})$. The probability $p(Motion_{(t-k)t})$ can be represented by a conditional probability for the given conditions, the past and present point cloud pair $(Z_t, Z_{t-k})$ and their sensor poses $(x_t, x_{t-k})$, as represented in Equation (1).
$$p(Motion_{(t-k)t}) = p(Motion_t \mid Z_t, Z_{t-k}, x_t, x_{t-k}) \qquad (1)$$
$Motion_t$ is composed of the independent point motions $\{motion_t^{1,m}, motion_t^{2,m}, \ldots, motion_t^{N,m}\}$, so $p(Motion_{(t-k)t})$ can be represented by the set of conditional probabilities of each point, as described by
$$p(Motion_{(t-k)t}) = \{p(motion_t^{1,m} \mid Z_t, Z_{t-k}, x_t, x_{t-k}), \ldots, p(motion_t^{N,m} \mid Z_t, Z_{t-k}, x_t, x_{t-k})\}. \qquad (2)$$
$motion_t^{i,m}$ consists of two states $\{dynamic, static\}$, and the sum of $p(dynamic)$ and $p(static)$ is always one.
The conditional probability of one point motion can be reorganized using Bayes' rule, as represented by Equation (3).

$$p(motion_t^{i,m} \mid z_t^{i,m}, Z_{t-k}, x_t, x_{t-k}) = \frac{p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})\, p(motion_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})}{p(z_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})} \qquad (3)$$
$p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})$ is the likelihood of the LiDAR point measurement for the given motion state. $p(motion_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})$ is the prior (predicted) probability density function. The motion prior for given $Z_{t-k}$, $x_t$, and $x_{t-k}$ is modeled as a uniform distribution, so $p(motion_t^{i,m} = \{\text{static or dynamic}\} \mid Z_{t-k}, x_t, x_{t-k}) = 0.5$. The evidence $p(z_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})$ becomes a normalization factor by applying marginalization, as described by Equation (4).
$$p(motion_t^{i,m} \mid z_t^{i,m}, Z_{t-k}, x_t, x_{t-k}) = \frac{p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})\, p(motion_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})}{\sum_{motion} p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})\, p(motion_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})} \qquad (4)$$
By summarizing the above equations, the posterior probability of one point motion can be represented by the following equation,
$$p(motion_t^{i,m} \mid z_t^{i,m}, Z_{t-k}, x_t, x_{t-k}) = \eta\, p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k}) \qquad (5)$$
where $\eta$ is the normalization factor of Equation (4), which absorbs the uniform prior and the marginal $p(z_t^{i,m} \mid Z_{t-k}, x_t, x_{t-k})$. Equation (5) expresses the conditional probability of one LiDAR point motion in terms of the likelihood of the measurement for the given point motion. Therefore, the motion probability estimation problem is converted into a likelihood estimation problem for the LiDAR point cloud.
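As a concrete illustration of Equations (3)-(5), with the uniform prior the posterior reduces to a normalization of the two likelihood values. The snippet below is a minimal sketch; `lik_static` and `lik_dynamic` stand for the likelihoods defined in Section 4.3.

```python
def motion_posterior(lik_static, lik_dynamic):
    """Posterior p(static), p(dynamic) from the two likelihoods under the uniform 0.5/0.5 prior (Eq. (5))."""
    evidence = lik_static + lik_dynamic  # marginalization over the motion states (Eq. (4), up to the prior)
    if evidence <= 0.0:
        # Outside the likelihood field: probability theory cannot decide (handled as unknown in Section 5).
        return 0.5, 0.5
    return lik_static / evidence, lik_dynamic / evidence
```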

4.3. Likelihood of LiDAR Point Measurement

We know that the point cloud motion probability $p(Motion_{(t-k)t})$ can be obtained from the likelihood of the point cloud, $p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})$, as described by Equation (5). The likelihood $p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})$ represents a statistical state when $motion_t^{i,m}$ is determined as static or dynamic for given $Z_{t-k}$, $x_t$, and $x_{t-k}$. The likelihood field of one point $z_t^{i,m}$ can be represented intuitively, as shown in Figure 5. The likelihood field of the LiDAR point $z_t^{i,m}$, measured at sensor pose $x_t$ at time $t$, is constructed from the points $z_{t-k}^{j,l}$ in the point cloud $Z_{t-k}$ measured at pose $x_{t-k}$ at the previous time $t-k$. The intensity of the green color indicates the likelihood of the point $z_t^{i,m}$ in the green region being in dynamic motion, and the intensity of the red region represents the likelihood of the point $z_t^{i,m}$ in the red region being in static motion. The likelihood field is represented in the local spherical coordinates of the previous sensor pose $x_{t-k}$. The point cloud $Z_{t-k}$ is represented in spherical coordinates as $Z_{t-k} = Z_{r\theta\phi,t-k} = \{z_{r\theta\phi,t-k}^{1,l}, \ldots, z_{r\theta\phi,t-k}^{j,l}, \ldots, z_{r\theta\phi,t-k}^{J,l}\}$. The likelihood field for the point $z_t^{i,m}$ is constructed in a triangular-pyramid form by each previous point $z_{r\theta\phi,t-k}^{j,l}$ with consideration of the beam divergence characteristics of the LiDAR laser, as shown in Figure 5.
The 3D likelihood distribution of $p(z_t^{i,m} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})$ can be divided into two 2D likelihood fields. The first is a likelihood field in the distance-horizontal angle ($r$-$\phi$) plane, $p(r_t^{i,m}, \phi_t^{i} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})$, and the second is a likelihood field in the distance-vertical angle ($r$-$\theta$) plane, $p(r_t^{i,m}, \theta_t^{i} \mid motion_t^{i,m}, Z_{t-k}, x_t, x_{t-k})$. Figure 6 shows the 2D likelihood fields in the ($r$-$\phi$) and ($r$-$\theta$) planes for the previous measurements $z_{t-k}^{j,1}$, $z_{t-k}^{j,2}$, $z_{t-k}^{j+1,1}$, and $z_{t-k}^{j+2,1}$. For each horizontal and vertical angle, the likelihood field is distributed discretely based on the resolution of the horizontal and vertical laser pulses. Due to the beam divergence of the laser, the likelihood fields gradually disperse with distance.
The cross section of the likelihood field for one laser beam $z_{t-k}^{j,l}$ in Figure 6a can be represented by the likelihood values in Figure 7. Through this figure, we can more precisely analyze the distribution of the likelihood for each motion. Figure 7a shows the likelihood $p(z_t^{i,m} \mid motion_t^{i,m} = static, z_{t-k}^{j,l}, x_t, x_{t-k})$ when a measured point $z_t^{i,m}$ is static for the given previous measurement $z_{t-k}^{j,l}$ and the given poses $x_t$ and $x_{t-k}$. Here, the previous measurement $z_{t-k}^{j,l}$ can be expressed in spherical coordinates as $\{r_{t-k}^{j,1}, r_{t-k}^{j,2}, \phi_{t-k}^{j}\}$. The region where the previous LiDAR point was detected is likely to be static. LiDAR ranges are measured using ToF, so the uncertainty of the measured distance, $r_t^{i,m}$, depends on the accuracy of the ToF measurement. Considering this uncertainty, the likelihood of static motion can be expressed as a Gaussian distribution, as described by Figure 7a and Equation (6).
$$p(r_t^{i,m} \mid motion_t^{i,m} = static,\ z_{t-k}^{j,l}, x_t, x_{t-k}) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-(r_{t-k}^{j,l} - r_t^{i,m})^2 / 2\sigma^2} \qquad (6)$$
$\sigma$ is the standard deviation of the distance measurement, which differs depending on the LiDAR. Figure 7b shows the likelihood $p(z_t^{i,m} \mid motion_t^{i,m} = dynamic, z_{t-k}^{j,l}, x_t, x_{t-k})$ when a measured point $z_t^{i,m}$ is dynamic for given $z_{t-k}^{j,l}$, $x_t$, and $x_{t-k}$. The region that the previous LiDAR beam $z_{t-k}^{j,l}$ passed through is likely to be free space, so a current LiDAR point $z_t^{i,m}$ located in this region is more likely to have been detected from a dynamic object. This characteristic can be represented by the following equation:
$$p(r_t^{i,m} \mid motion_t^{i,m} = dynamic,\ z_{t-k}^{j,l}, x_t, x_{t-k}) = \begin{cases} MaxLikelihood_{t-k}^{i,m} - p(r_t^{i,m} \mid motion_t^{i,m} = static,\ z_{t-k}^{j,l}, x_t, x_{t-k}), & r_t^{i,m} \le r_{t-k}^{j,\max(m)} \\ 0, & \text{otherwise} \end{cases} \qquad (7)$$
Here, $MaxLikelihood_{t-k}^{i,m}$ denotes the maximum likelihood value for the previous point measurement $z_{t-k}^{j,l}$ and can be represented by the following equation:
$$MaxLikelihood_{t-k}^{i,m} = p(r_{t-k}^{j,l} \mid motion_t^{i,m} = static,\ z_{t-k}^{j,l}, x_t, x_{t-k}) \qquad (8)$$
When the point $z_t^{i,m}$ is located in the likelihood field for the given $\{z_{t-k}^{j,l}, x_{t-k}, x_t\}$, we can obtain the likelihood through Equations (6) and (7), and the probability of the point motion can then be calculated through Equation (5). However, if the point $z_t^{i,m}$ is located outside of the likelihood field, we cannot obtain the likelihood using the above equations. The area outside of the likelihood field must be treated as unknown, but probability theory cannot handle the unknown state explicitly. Therefore, in the next section, we apply evidence theory to deal with the unknown state explicitly.
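A minimal numerical sketch of Equations (6)-(8) for a single previous beam is shown below. It treats only the range dimension (the angular gating by beam divergence and the multi-echo bookkeeping are omitted), and `sigma` corresponds to the range standard deviation of the LiDAR (3 cm in the experiments of Section 6).

```python
import math

def static_likelihood(r_curr, r_prev, sigma=0.03):
    """Eq. (6): Gaussian likelihood that the current range coincides with the previous return."""
    return math.exp(-(r_prev - r_curr) ** 2 / (2.0 * sigma ** 2)) / (sigma * math.sqrt(2.0 * math.pi))

def dynamic_likelihood(r_curr, r_prev, r_prev_max, sigma=0.03):
    """Eq. (7): free space traversed by the previous beam (up to its farthest echo) supports the dynamic hypothesis."""
    if r_curr <= r_prev_max:
        max_likelihood = static_likelihood(r_prev, r_prev, sigma)  # Eq. (8): peak of the static model
        return max_likelihood - static_likelihood(r_curr, r_prev, sigma)
    return 0.0
```

Feeding these two values into the `motion_posterior` sketch from Section 4.2 yields the point motion probability of Equation (5).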

5. Evidential Point Motion Classification

5.1. Evidential Modeling of LiDAR Point Motion

The process of point-wise probabilistic motion estimation through the likelihood field was described in the previous section. However, the probabilistic method has a limitation when the point $z_t^{i,m}$ is not located within the likelihood field. This limitation is illustrated in Figure 8. In Figure 8a, the measurement $z_t^{*}$ represents a normal case because the point is located inside the likelihood field. Its point motion probability $p(motion_t^{*} \mid z_t^{*}, z_{t-k}^{j,l}, x_t, x_{t-k})$ can be $\{static, dynamic\} = \{0.5, 0.5\}$ because the likelihoods for dynamic and static are the same. However, the motion probabilities for $z_t^{\#}$ in Figure 8a and $z_t^{\$}$ in Figure 8b, which are not included in the likelihood field, are also close to $\{0.5, 0.5\}$, because both of their likelihoods for dynamic and static are zero. This means that the probabilistic approach cannot distinguish between $z_t^{*}$, $z_t^{\#}$, and $z_t^{\$}$. To overcome this limitation, an evidential approach (Dempster-Shafer theory) is applied to explicitly distinguish the motion of points that are not located in the likelihood field.
Both the probabilistic and evidential approaches are based on the concept of assigning weights to hypothesized states of the measurement. However, the evidential approach allows sets of alternatives, which means new states can be created by combining existing states. The probabilistic approach deals with the two states $\{static, dynamic\}$. In the evidential approach, the two states form a frame of discernment $\Omega = \{static, dynamic\}$. Dempster-Shafer theory can manage additional states explicitly ($\Omega$, $\emptyset$) by extending the frame of discernment $\Omega$ to the power set $2^{\Omega} = \{static, dynamic, \Omega, \emptyset\}$. The state $\Omega$ is the set $\{static, dynamic\}$, which means that the point motion is static or dynamic. However, because the point motion cannot be static and dynamic simultaneously, the state $\Omega$ indicates an unknown state. $\emptyset$ is the empty set, which would mean that the point motion is neither static nor dynamic. Because this situation is physically impossible, the state $\emptyset$ indicates a conflict situation. For each state of the power set $2^{\Omega} = \{static, dynamic, unknown, conflict\}$ in the evidential approach, a mass function $Mass$ is used to quantify the belief in the hypothesis. The mass functions $Mass_t^{i,m}(static)$ and $Mass_t^{i,m}(dynamic)$ represent the belief that the point $z_t^{i,m}$ is static and dynamic, respectively. The mass function $Mass_t^{i,m}(unknown)$ is the union of the beliefs of static and dynamic, and $Mass_t^{i,m}(conflict)$ represents the belief that the point is conflicted by different measurements. The sum of the mass functions over the power set must be one based on its definition in the evidential framework.
Based on the evidential approach, we can explicitly handle the points located outside the likelihood field of a given point $z_{t-k}^{j,l}$ as an unknown state. The boundary between the inside and outside is $k\sigma$, as shown in Figure 8, where $k$ is a tuning factor that determines the size of the likelihood boundary; we used $k = 3$. The point $z_t^{*}$ is located inside the likelihood field, but $z_t^{\#}$ and $z_t^{\$}$ are located outside the likelihood field. The mass of the point $z_t^{i,m}$ motion for the given $z_{t-k}^{j,l}$, $x_t$, and $x_{t-k}$ is denoted by $mass_{(t-k)t,\,j,l}^{i,m}(state)$ for each $state = \{static, dynamic, unknown, conflict\}$. $mass_{(t-k)t,\,j,l}^{i,m}(state)$ can be calculated based on whether the point $z_t^{i,m}$ is located inside or outside of the likelihood field using the following equation.
$$\begin{aligned} mass_{(t-k)t,\,j,l}^{i,m}(static) &= \begin{cases} \lambda_k\, p(motion_t^{i,m} = static \mid z_t^{i,m}, z_{t-k}^{j,l}, x_t, x_{t-k}), & \text{inside} \\ 0, & \text{outside} \end{cases} \\ mass_{(t-k)t,\,j,l}^{i,m}(dynamic) &= \begin{cases} \lambda_k\, p(motion_t^{i,m} = dynamic \mid z_t^{i,m}, z_{t-k}^{j,l}, x_t, x_{t-k}), & \text{inside} \\ 0, & \text{outside} \end{cases} \\ mass_{(t-k)t,\,j,l}^{i,m}(unknown) &= \begin{cases} 1 - mass_{(t-k)t,\,j,l}^{i,m}(static) - mass_{(t-k)t,\,j,l}^{i,m}(dynamic), & \text{inside} \\ 1, & \text{outside} \end{cases} \\ mass_{(t-k)t,\,j,l}^{i,m}(conflict) &= 0 \end{aligned} \qquad (9)$$
The mass values of the static and dynamic states are calculated from the motion probability $p(motion_t^{i,m} \mid z_t^{i,m}, z_{t-k}^{j,l}, x_t, x_{t-k})$ and its confidence, $\lambda_k$. The confidence $\lambda_k$ can be determined using Equation (10).
$$\lambda_k = \lambda_{reg}\, \exp(-k/\tau) \qquad (10)$$
$\lambda_{reg}$ describes the confidence of the pose registration between $x_t$ and $x_{t-1}$. This value is determined by the performance of the registration method, such as IMU dead reckoning, scan matching, or HD map-based localization. If the registration is very accurate, the value is close to one; if it is not, it is close to zero. The confidence $\lambda_k$ is also affected by the time difference $k$. Because the confidence of the probabilistic model decreases as the time difference $k$ increases, the confidence $\lambda_k$ decays by $\exp(-k/\tau)$, where $\tau$ is the time constant that determines the decay rate.
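The conversion from probabilities to evidential masses in Equations (9) and (10) can be sketched as follows. Here `lambda_reg` is the registration confidence (0.9 in the experiments), the `inside` flag encodes whether the current point falls within the $k\sigma$ likelihood boundary, and `tau` is an illustrative decay constant, since the text only states that the confidence decays as $\exp(-k/\tau)$.

```python
import math

def motion_mass(p_static, p_dynamic, k, inside, lambda_reg=0.9, tau=20.0):
    """Eq. (9)-(10): evidential mass for one (current point, previous beam) pair."""
    lam_k = lambda_reg * math.exp(-k / tau)  # Eq. (10): confidence decays with the scan-index difference k
    if not inside:
        # Outside the likelihood field: all belief goes to the unknown state.
        return {"static": 0.0, "dynamic": 0.0, "unknown": 1.0, "conflict": 0.0}
    m_static = lam_k * p_static
    m_dynamic = lam_k * p_dynamic
    return {"static": m_static, "dynamic": m_dynamic,
            "unknown": 1.0 - m_static - m_dynamic, "conflict": 0.0}
```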

5.2. Point Motion Segmentation by Integrating the Point Motion Masses

For a given point $z_{t-k}^{j,l}$ and given poses $x_t$ and $x_{t-k}$, the point motion of $z_t^{i,m}$ can be described by the mass function $mass_{(t-k)t,\,j,l}^{i,m}(state)$. For all of the previous scan points $Z_{t-k} = \{z_{t-k}^{1,l}, \ldots, z_{t-k}^{j,l}, \ldots, z_{t-k}^{N,l}\}$ and the given poses $x_t$ and $x_{t-k}$, several mass functions $mass_{(t-k)t,\,1,l}^{i,m}(state)$ to $mass_{(t-k)t,\,N,l}^{i,m}(state)$ can be calculated, and these must be integrated into one mass function $mass_{(t-k)t}^{i,m}(state)$. In addition, for the previously buffered point clouds $Z_{t-1}, \ldots, Z_{t-W+1}, Z_{t-W}$ in the time window $W$, several mass functions $Mass_{(t-1)t}^{i,m}(state), \ldots, Mass_{(t-W)t}^{i,m}(state)$ can be obtained, and these mass functions should be integrated into a single mass function $Mass_t^{i,m}(state)$ to represent the motion of one point $z_t^{i,m}$. To integrate two different mass values from different laser scans and times, Dempster's combination rule (Equation (11)) is applied.
$$Mass_1 \oplus Mass_2 = Mass_{1 \oplus 2}(A) = \frac{Mass_{1 \cap 2}(A)}{1 - Mass_{1 \cap 2}(\emptyset)}, \qquad Mass_{1 \oplus 2}(\emptyset) = 0, \qquad (11)$$
$$\forall A \subseteq \Omega, \quad A \neq \emptyset \qquad (12)$$
Dempster’s combination rule is based on the conjunctive combination rule described by Equation (13).
$$Mass_{1 \cap 2}(A) = \sum_{B \cap C = A \mid B, C \subseteq \Omega} Mass_1(B) \cdot Mass_2(C) \qquad (13)$$
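For the two-element frame of discernment used here, the conjunctive combination and Dempster normalization of Equations (11)-(13) can be enumerated explicitly, as in the sketch below (the fallback to total ignorance under complete conflict is an added safeguard, not something prescribed by the paper).

```python
def dempster_combine(m1, m2):
    """Dempster's rule (Eq. (11)-(13)) for the frame {static, dynamic} with unknown = {static, dynamic}."""
    s = m1["static"] * m2["static"] + m1["static"] * m2["unknown"] + m1["unknown"] * m2["static"]
    d = m1["dynamic"] * m2["dynamic"] + m1["dynamic"] * m2["unknown"] + m1["unknown"] * m2["dynamic"]
    u = m1["unknown"] * m2["unknown"]
    conflict = m1["static"] * m2["dynamic"] + m1["dynamic"] * m2["static"]  # mass assigned to the empty set
    if conflict >= 1.0:
        return {"static": 0.0, "dynamic": 0.0, "unknown": 1.0, "conflict": 0.0}
    norm = 1.0 - conflict  # Eq. (11): redistribute the conflicting mass
    return {"static": s / norm, "dynamic": d / norm, "unknown": u / norm, "conflict": 0.0}
```

Applying this combination first across all previous beams of one scan and then across the W buffered scans yields the fused mass $Mass_t^{i,m}(state)$ for each point.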

6. Experiments

6.1. Experimental Environments

An autonomous vehicle (A1) was used in the experiments to evaluate the proposed algorithm. A1 was equipped with two LiDARs (Velodyne VLP-16) and an IMU, as shown in Figure 9. The LiDARs provided point cloud data at a 10 Hz sampling frequency, and their maximum detection range was 100 m.
LiDAR laser beams exhibit beam divergence, which means the beam cross section increases with distance. We designed a beam divergence model based on the specifications of the LiDAR sensors, as shown in Figure 10; the horizontal and vertical beam divergence characteristics were different. The standard deviation $\sigma$ of the distance accuracy was set to 3 cm. Because the distance accuracy can vary with factors such as temperature and target reflectivity, the selection of the standard deviation $\sigma$ for the probabilistic LiDAR model must account for this uncertainty. The horizontal field of view (FoV) was 360°, and the horizontal angular resolution was set to 0.2°. The vertical FoV was 30°, and the vertical resolution was 2°. Because the LiDAR controls the laser emission angle, it was assumed that the angular uncertainties (vertical and horizontal) were negligible.
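A simple linear divergence model of the kind shown in Figure 10b can be written as follows; the exit aperture and divergence angle used here are placeholder values, and the actual VLP-16 figures should be taken from the sensor data sheet.

```python
def beam_diameter(distance_m, divergence_rad=3.0e-3, exit_diameter_m=0.01):
    """Approximate beam diameter at a given range under a linear divergence model (placeholder parameters)."""
    return exit_diameter_m + distance_m * divergence_rad
```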
Dead reckoning was implemented using the IMU to estimate the LiDAR pose at each time step. For the experiments, raw IMU data (acceleration and gyro) from the RT3002 sensor were used without real-time kinematic (RTK) GNSS correction. The specifications of the MEMS IMU are listed in Table 1. Although the IMU was not sufficiently accurate to estimate the long-term pose of the LiDAR sensor, it provided stable performance over short windows of approximately 50 scans or less (five seconds or less). In addition, the evidential integration algorithm was able to account for the inaccuracy of the IMU-based pose estimation by tuning the registration confidence $\lambda_{reg}$. Considering the installed MEMS IMU, we set $\lambda_{reg}$ to 0.9.
The synchronization between the LiDAR, the IMU, and the point motion classification algorithm was performed using a pulse per second (PPS) signal from the RT3002. The LiDARs and IMU were precisely calibrated into a common coordinate system.

6.2. Segmentation Performance Evaluation through Comparative Analysis

To evaluate the performance of the point-wise motion segmentation, experiments were conducted under various scenarios (e.g., cities and highways). The total length of the experimental route was more than two kilometers. Figure 11a shows a single scene of the experimental conditions, in which moving cars and stationary road structures are mixed. The segmentation result of the proposed algorithm is shown in Figure 11b. The RGB value of each point is set using the proposed motion beliefs: the red channel represents the static belief, the green channel represents the dynamic belief, and the blue channel represents the unknown belief. Therefore, objects with a high static belief appear red, moving objects appear green, and unsegmented objects appear blue. As shown in Figure 11b, traffic signs and roadside trees are segmented as red, and moving cars are classified as green.
To quantitatively analyze the segmentation performance, confusion matrices for the tracking-based classification algorithm and the proposed algorithm with various time window configurations were created, as shown in Table 2. Using the occupancy grid map, static point segmentation is possible, but dynamic point classification is not; therefore, the performance of the occupancy grid map-based segmentation is not included in the confusion matrix. For the proposed algorithm, a point is classified as static or dynamic when the corresponding belief exceeds 0.8. The true class of the points used as a reference for evaluation was labeled manually. Although public data would be more appropriate for comparing performance with other algorithms, there is no public dataset labeled with point-wise motion to verify the performance of real-time motion classification. The object tracking-based point motion segmentation algorithm segments a point as dynamic when it is located inside the bounding box of a track moving above a certain speed, and the remaining points not included in a moving track are segmented as static. However, its point-wise segmentation performance suffers from incorrect bounding boxes and inaccurate speed estimation by the tracker. The segmentation accuracy of the proposed algorithm is better than that of the tracking-based algorithm for time windows W of 20, 30, 50, and 100, as shown in Table 2. The segmentation performance for a time window of 50 is better than that for 100 because the drift error of the pose estimation degrades the segmentation performance for longer windows.
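For reference, the per-point decision rule used in this evaluation (assign static or dynamic only when the corresponding belief exceeds 0.8, otherwise keep the point unknown) could look like the following sketch.

```python
def decide_motion_state(mass, threshold=0.8):
    """Assign the final label from the fused mass; points below the belief threshold remain unknown."""
    if mass["static"] > threshold:
        return "static"
    if mass["dynamic"] > threshold:
        return "dynamic"
    return "unknown"
```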

6.3. Real-Time Performance Evaluation

The algorithm was verified in an RTMaps environment on a quad-core Intel Core i5-3570K CPU running at 3.6 GHz. Hard real-time performance could not be fully evaluated because this is not an embedded environment, but the soft real-time performance measured in RTMaps can guide later optimization. To evaluate the real-time performance of the algorithm, the occupancy grid map-based segmentation algorithm was compared with the proposed algorithm. As shown in Figure 12, the occupancy grid map-based algorithm took a long time because all cells along the path of every LiDAR beam had to be updated continuously. For the proposed algorithm, the larger the window size, the more computation is required. The most appropriate time window setting for the proposed algorithm is 50, as indicated by the confusion matrices, and its computation time is below the sampling period of the Velodyne LiDAR (100 ms).

7. Conclusions

This paper proposed a segmentation algorithm to rapidly classify the motion states of a LiDAR point cloud in real-time. The motion segmentation algorithm requires the point cloud and the 3D pose (position and orientation) of the LiDAR sensor as inputs. The point-wise motion segmentation is performed based on the laser beam characteristics and the 3D pose relationship between consecutive LiDAR point clouds. A combination of probability theory and evidence theory is used to accurately and reliably segment the motion state of points into dynamic, static, and unknown.
(1) The point motion segmentation algorithm considers the characteristics of the LiDAR laser beam, such as multi-echo, beam divergence, and horizontal and vertical resolution. Therefore, the point motions are segmented more accurately and reliably than by conventional algorithms (e.g., occupancy grid mapping and tracking-based segmentation algorithms).
(2) To update the motion state of each LiDAR point, a combination of probability theory and evidence theory is applied to point motion modeling to accurately reflect the LiDAR characteristics. Probability theory is used to model the likelihood fields of LiDAR point clouds, taking into account the uncertainty of LiDAR measurements. Evidence theory is used to integrate multiple point motion probabilities, taking into account the pose uncertainty and the unknown state.
(3) The proposed point motion segmentation algorithm was evaluated experimentally. The segmentation accuracy was 86% for a time window of W = 50, which is better than the accuracy of the tracking-based algorithm (73%). Because the proposed algorithm processes one LiDAR point cloud in a single step and operates in under 100 ms, it is suitable for real-time applications in automated and intelligent vehicle systems.
The performance of the algorithm depends on the positioning algorithm that estimates the pose of the LiDAR sensor. In future research, we plan to analyze the quantitative relationship between LiDAR sensor positioning and the performance of the proposed algorithm, and to study a SLAM algorithm that classifies point motion and estimates the pose of the LiDAR sensor simultaneously.

Author Contributions

Conceptualization, K.J.; Methodology, K.J.; Software, K.J., S.L., and C.K.; Validation, K.J. and S.L.; Formal Analysis, S.L.; Investigation, K.J.; Resources, K.J.; Data Curation, S.L.; Writing—Original Draft Preparation, K.J.; Writing—Review & Editing, K.J. and M.S.; Visualization, S.L.; Supervision, K.J. and M.S.; Project Administration, K.J. and M.S.; Funding Acquisition, K.J.

Funding

This paper was supported by Konkuk University in 2018.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Goodin, C.; Carruth, D.; Doude, M.; Hudson, C. Predicting the Influence of Rain on LIDAR in ADAS. Electronics 2019, 8, 89. [Google Scholar] [CrossRef]
  2. Takai, I.; Matsubara, H.; Soga, M.; Ohta, M.; Ogawa, M.; Yamashita, T. Single-Photon Avalanche Diode with Enhanced NIR-Sensitivity for Automotive LIDAR Systems. Sensors 2016, 16, 459. [Google Scholar] [CrossRef] [PubMed]
  3. Cui, T.; Ji, S.; Shan, J.; Gong, J.; Liu, K. Line-Based Registration of Panoramic Images and LiDAR Point Clouds for Mobile Mapping. Sensors 2016, 17, 70. [Google Scholar] [CrossRef] [PubMed]
  4. Im, J.H.; Im, S.H.; Jee, G.I. Extended Line Map-Based Precise Vehicle Localization Using 3D LIDAR. Sensors 2018, 18, 3179. [Google Scholar] [CrossRef] [PubMed]
  5. An, J.; Choi, B.; Sim, K.B.; Kim, E. Novel Intersection Type Recognition for Autonomous Vehicles Using a Multi-Layer Laser Scanner. Sensors 2016, 16, 1123. [Google Scholar] [CrossRef] [PubMed]
  6. Thrun, S.; Burgard, W.; Fox, D. Probabilistic Robotics (Intelligent Robotics and Autonomous Agents); The MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
  7. Pagac, D.; Nebot, E.M.; Durrant-Whyte, H. An evidential approach to probabilistic map-building. In Proceedings of the IEEE International Conference on Robotics and Automation, Minneapolis, MN, USA, 22–28 April 1996; Volume 1, pp. 745–750. [Google Scholar] [CrossRef]
  8. Alsayed, Z.; Bresson, G.; Nashashibi, F.; Verroust-Blondet, A. PML-SLAM: A Solution for Localization in Large-Scale Urban Environments; HAL: Bengaluru, India, 2015. [Google Scholar]
  9. Jo, K.; Cho, S.; Kim, C.; Resende, P.; Bradai, B.; Nashashibi, F.; Sunwoo, M. Cloud Update of Tiled Evidential Occupancy Grid Maps for the Multi-Vehicle Mapping. Sensors 2018, 18, 4119. [Google Scholar] [CrossRef] [PubMed]
  10. Trehard, G.; Pollard, E.; Bradai, B.; Nashashibi, F. On line mapping and global positioning for autonomous driving in urban environment based on evidential SLAM. In Proceedings of the 2015 IEEE Intelligent Vehicles Symposium (IV), Seoul, Korea, 28 June–1 July 2015. [Google Scholar] [CrossRef]
  11. Moras, J.; Cherfaoui, V.; Bonnifait, P. Moving objects detection by conflict analysis in evidential grids. In Proceedings of the IEEE Intelligent Vehicles Symposium, Baden-Baden, Germany, 5–9 June 2011; pp. 1122–1127. [Google Scholar] [CrossRef]
  12. Moras, J.; Cherfaoui, V.; Bonnifait, P. A lidar perception scheme for intelligent vehicle navigation. In Proceedings of the 11th International Conference on Control Automation Robotics Vision, Singapore, 7–10 December 2010; pp. 1809–1814. [Google Scholar] [CrossRef]
  13. Moras, J.; Cherfaoui, V.; Bonnifait, P. Credibilist occupancy grids for vehicle perception in dynamic environments. In Proceedings of the IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 84–89. [Google Scholar] [CrossRef]
  14. Huang, R.; Chen, J.; Liu, J.; Liu, L.; Yu, B.; Wu, Y. A Practical Point Cloud Based Road Curb Detection Method for Autonomous Vehicle. Information 2017, 8, 93. [Google Scholar] [CrossRef]
  15. Dimitrievski, M.; Veelaert, P.; Philips, W. Behavioral Pedestrian Tracking Using a Camera and LiDAR Sensors on a Moving Vehicle. Sensors 2019, 19, 391. [Google Scholar] [CrossRef] [PubMed]
  16. Kim, C.; Cho, S.; Sunwoo, M.; Jo, K. Crowd-Sourced Mapping of New Feature Layer for High-Definition Map. Sensors 2018, 18, 4172. [Google Scholar] [CrossRef] [PubMed]
  17. Kim, D.; Jo, K.; Lee, M.; Sunwoo, M. L-Shape Model Switching-Based Precise Motion Tracking of Moving Vehicles Using Laser Scanners. IEEE Trans. Intell. Transp. Syst. 2018, 19, 598–612. [Google Scholar] [CrossRef]
  18. Petrovskaya, A.; Thrun, S. Model based vehicle detection and tracking for autonomous urban driving. Auton. Robot. 2009, 26, 123–139. [Google Scholar] [CrossRef]
Figure 1. System architecture of the rapid point motion segmentation algorithm.
Figure 2. Probability update of $Motion_{(t-k)t}$. The motion probability of the point cloud $Z_t$ can be updated based on the geometric relationship with the previous point cloud $Z_{t-k}$.
Figure 3. Motion probabilities $p(Motion_{(t-1)t}), \ldots, p(Motion_{(t-W)t})$ are integrated into the latest motion probability $p(Motion_t)$.
Figure 4. Light detection and ranging (LiDAR) measurement characteristics: (a) Time-of-flight (ToF) and (b) laser beam divergence and multi-echo.
Figure 5. The likelihood field for the point $z_t^{i,m}$ is constructed from the previous points $z_{r\theta\phi,t-k}^{j,l}$ with consideration of the beam divergence.
Figure 6. 2D likelihood field for the distance ($r$) and angle (horizontal $\phi$ (a) and vertical $\theta$ (b)) of the previous measurements $z_{t-k}^{j,1}$, $z_{t-k}^{j,2}$, $z_{t-k}^{j+1,1}$, and $z_{t-k}^{j+2,1}$.
Figure 7. Likelihood of static (a) and dynamic (b) motion for one laser beam.
Figure 8. Problems of probabilistic motion segmentation. The probabilities of motion for $z_t^{\#}$ (a) and $z_t^{\$}$ (b) are close to $\{0.5, 0.5\}$, because both likelihoods for dynamic and static are zero.
Figure 9. Test vehicle and sensor (LiDAR and positioning system) configuration.
Figure 10. LiDAR beam divergence model: (a) divergence characteristics from VLP-16 specification and (b) divergence length over detection distance.
Figure 11. (a) Test site with moving cars and stationary road structures and (b) classification results of point-wise LiDAR motion segmentation.
Figure 12. (a) Table and (b) plot of the computation time for the occupancy grid map and the proposed algorithm.
Table 1. Specification of IMU sensor. An IMU was used inside the GNSS/INS (RT3002) to estimate the pose of the LiDAR sensor.
Table 2. Confusion matrices of the tracking-based algorithm and the proposed algorithm with various time window configurations.
