8.1. Experimental Data
Stairs, windows, and traffic signs, as common indoor and outdoor objects, are common experimental subjects in current point cloud processing research. The long and straight contour lines and repetitive geometric structure of stairs can effectively test the capability of the proposed method in this paper to process regular geometric shapes. Windows, featuring both inner and outer boundaries with a hollow interior, can effectively evaluate the proposed algorithm’s ability to balance noise resistance and detail preservation. Traffic signs, whose point clouds may be relatively sparse in real-world scenarios, can assess the robustness of the algorithm under conditions of sparse data. To verify the representational capability of the proposed method for point cloud density and the feasibility/detection efficacy of using the skewness metric in statistical data to identify regions of non-uniform point cloud distribution, this study designed simulated uniform distribution data and selected the ISPRS public dataset for experimental analysis.
Figure 7 displays the ISPRS staircase point cloud data, where the red-highlighted areas represent the experimental data selected for this study—specifically regions containing 199 points (referred to as Region 7-2) and 412 points (referred to as Region 5-3).
The experimental environment configuration for deep learning-based point cloud boundary extraction is presented in
Table 1.
Deep learning training and testing experiments were conducted on a single platform running the Windows 10 operating system. Python was used as the programming language for the experiments, with the TensorFlow deep learning framework primarily employed to construct the training network. Point cloud visualization was mainly performed using CloudCompare software (Version: 2.12.0 (Kyiv) [Windows 64-bit]). The training dataset consisted of 600 pairs of skewness statistics for the distances and angles between the geometric center of each unit grid and its constituent points. Within each square unit grid, 10–200 randomly distributed points were generated. The distances and skewness values under uniform distribution were calculated for different point quantities as reference metrics to evaluate the uniformity or deviation of randomly distributed points. The training set and validation set were split in an 80:20 ratio. The experimental data consist of point clouds of windows from buildings within the campus of Wuhan University (
Figure 8a), as well as point clouds of traffic signs on urban roads in Wuhan from the publicly available dataset “WHU-Urban3D Dataset” (
Figure 9). Due to the large number of point cloud files,
Figure 9 displays the point cloud of a representative file and a road image, while
Figure 10 presents schematic diagrams of Traffic Sign No. 1 and No. 2. Window 1 and Window 2 were obtained by cropping from the original point cloud data (
Figure 8b,c). Window 1 has a shape close to a square and contains 4974 points, while Window 2 is rectangular in shape and comprises 5655 points. Traffic Sign 1 has a flat surface and consists of 326 points, whereas Traffic Sign 2 represents the back-side point cloud of a large traffic sign, featuring an uneven surface with 6450 points. The point cloud acquisition device was the FARO Focus S150 3D laser scanner, featuring a resolution of 976 kpts/s (at 307 m), an accuracy of 1 mm, and an optimal measurement range of 0.6–150 m. The 3D scanner’s scanning size was set as 8192 × 3413, the resolution was set as 1/5, and the quality was set as 2 times. The average scanning time was set to 5–6 min.
The simulated uniformly distributed point clouds (designated as Point Cloud 1, 2, and 3) were generated within 1-cm square grids containing 121, 289, and 441 points, respectively. Skewness values of angular and distance distributions were computed for each configuration, with quantitative results documented in
Table 2. To visually characterize the spatial distribution patterns of Point Clouds 1–3, this study employs bar charts and statistical distribution plots to illustrate the frequency distributions of both distance and angular measurements.
- (1)
The three distance bar charts exhibit identical variation trends with point numbers, showing symmetric distributions. When point numbers approach maximum or minimum values, the distance from the center point increases; when approaching the median value, points cluster closer to the center. The minimum distance occurs when points approach the grid center, where the median point number corresponds to the lowest point in the bar chart.
- (2)
The three distance distribution charts demonstrate fundamentally similar trends. Points equidistant from the grid center show even-numbered distributions. The skewness of the distance count distributions is all negative, with absolute values around 0.3.
- (3)
The three angle bar charts display identical variation patterns with point numbers, presenting symmetric distributions characterized by higher central values and lower extremes. Every 10–20 data points form a small segment, each maintaining internal symmetry. The angle value approaches zero when point numbers are near the median.
- (4)
The three angle distribution charts share essentially identical trends. Angles predominantly concentrate within 10 degrees. For instance, in
Figure 13’s 441-point angle statistics, 288 angles fall between 0–6 degrees. The skewness of angle count distributions is all positive, with values increasing proportionally to point quantities.
From the above three types of statistical graphs, it can be concluded that the distance and angle from the point cloud to the center point of the grid follow the following pattern: The distribution trends of distance and angle quantity bar charts for uniform point clouds are similar. The statistical skewness of distances is approximately −0.3, while that of angles is positive and increases with point quantity. The closer real point clouds conform to this distribution pattern, the more uniform their distribution becomes. These examples demonstrate that integrating statistical information of distances and angles within the grid can accurately characterize the point distribution in grid regions, thereby enabling the description of point density distribution in those areas.
8.3. Detection of Non-Uniform Regions in Point Clouds
Figure 14 and
Figure 15 present bar charts and statistical plots for Subregion 7-2 grid and the 14 × 14 reference point cloud, depicting the distribution of inliers, their distances to the center point, and their angles. Comparison with the reference point cloud reveals that the distance bar chart in
Figure 14 exhibits an asymmetric distribution, with the minimum point skewed to the right. This indicates that the point cloud within this grid is not center-symmetric relative to the grid center point. The distance statistical plot shows a higher concentration of points within the 0.08–0.10 m range, suggesting the presence of numerous points distributed near the grid edges. The angle bar chart also displays an asymmetric distribution. Both the angle bar chart and statistical plot indicate a higher density of obtuse angles, predominantly clustered in regions with higher point numbers (higher indices). This observation suggests the point cloud likely contains more rows, with continuous sequences of single-point rows concentrated primarily in these higher-indexed regions.
Table 3 presents a statistical comparison between the 196 uniformly distributed reference points in the 14 × 14 grid and the point cloud of Subregion 7-2. The actual data shows that both the mean and median angles are smaller than those of the reference point cloud. Furthermore, the angular skewness value of 4.08 is 50.6% higher than the reference value of 2.71, exceeding the 30% threshold and indicating abnormal angular skewness. This suggests that adjacent point angles within Subregion 7-2 are generally smaller, and the point density in the northwest quadrant is lower compared to the southwest quadrant. While the mean and median distances in Subregion 7-2 are close to the reference values, the distance skewness is 143% lower than the standard value, exceeding the 40% threshold and signifying abnormal distance skewness. Therefore, it can be inferred that the Subregion 7-2 grid exhibits significant inhomogeneity.
As
Table 3 shows, the Grid 7-2 shows smaller mean and median values for angles compared to the reference point cloud, along with lower angular skewness. This indicates that the angular mode in the 7-2 grid is smaller than both its own mean and the mean of the reference uniform point cloud, implying that adjacent points in the 7-2 area exhibit smaller inter-point angles. Additionally, the distance mean and median value in the 7-2 grid are smaller than those of the reference point cloud, while the distance skewness is negative and less than −0.3. This suggests that the distance mean of the actual data exceeds its mode and is also greater than the reference data mean. Consequently, it can be inferred that points in the 7-2 grid are distributed farther from the grid center.
Figure 16 and
Figure 17 present the statistical distribution diagrams of distances and angles for the 5-3 area point cloud and the 20 × 20 reference point cloud. Through comparative analysis, the distance bar chart reveals a dual-minimum-point pattern, indicating the presence of two points near the grid center in the 5-3 area without central symmetry. The distance statistical diagram aligns with the reference point cloud’s overall trend but shows fewer counts in the 0.04–0.06 m range, suggesting a near-uniform distribution of distances from points to the grid center in the 5-3 area and sparse clustering around the center. Meanwhile, the angle bar chart exhibits multiple obtuse angles approaching 180°, reflecting greater dispersion of the point cloud. The angle statistical diagram further highlights more obtuse angles in the 5-3 area than the reference point cloud, implying a higher concentration of row-structured data within this grid region.
Table 4 compares the statistical data of 400 uniformly distributed reference points with those of the actual point cloud. The results indicate that the angular skewness of the actual data is smaller than that of the reference data, while its mean value is larger, suggesting that adjacent point angles in the actual point cloud are more dispersed. Meanwhile, the angular skewness difference reaches 58%, indicating an angular skewness anomaly. The distance skewness difference (22%) falls within the normal range. These findings demonstrate that Area 5-3 is a non-uniform region.
Based on the two sets of simulation experiments conducted, decision thresholds for determining uniform distribution were established as follows: Due to typically greater fluctuations in distance skewness, its threshold was relaxed to 50%, while angular skewness, being more sensitive to variations, was set at 40%. Consequently, the relative difference thresholds are defined as = 40% for angular skewness and = 50% for distance skewness.
8.4. Deep Learning-Based Object Boundary Extraction
After data training and parameter tuning, our deep learning model achieved final evaluation metrics of 0.4880 validation loss and 85.96% validation accuracy.
Figure 18 documents the training dynamics: In
Figure 18a, the solid blue line (training accuracy) shows a rapid initial ascent followed by stabilization, indicating effective learning on training data where accuracy consistently improved with increasing epochs. Conversely, the dashed red line (validation accuracy) exhibits slower improvement with marginal decline after stabilization, suggesting suboptimal generalization and model performance approaching its limit.
Figure 18b reveals complementary loss patterns: The solid blue line (training loss) demonstrates a sharp early decline converging to stability, reflecting continuous error reduction, while the dashed red line (validation loss) displays a slower descent with a slight post-stabilization increase, collectively demonstrating model convergence.
To validate the accuracy of the proposed boundary detection method, this experiment compares its performance against traditional density-based boundary detection approaches. The traditional density-based boundary extraction method mentioned in this paper employs a point count statistics approach within a neighborhood radius to calculate point density and subsequently identify boundary regions. By setting a search radius and a minimum point count threshold, this method counts the number of points in the neighborhood of each point as the density value. Regions with a density below the threshold are determined to be boundary areas. The miss rate
and false positive rate
serve as evaluation metrics for both methods.
Table 5 presents a comparative analysis of boundary extraction results on Window 1 point cloud data:
As shown in
Table 5, compared with traditional methods, the proposed approach exhibits fewer false negatives and a lower false negative rate (0.47%) at the boundary of Window 1. However, it shows more false positives with a higher false positive rate (43.44%). The analysis suggests this may result from including irrelevant points during the point cloud meshing process. When the grid size is set smaller, the reduced number of points per grid makes it difficult to accurately assess point distribution uniformity. Therefore, to ensure both distribution assessment accuracy and boundary extraction integrity, this experiment deliberately included a small number of adjacent irrelevant points when configuring the grid size.
Table 6 compares the boundary extraction results of the two methods for the point cloud of Window 2. As shown in
Table 6, compared with traditional methods, the proposed approach exhibits fewer false negatives and a lower false negative rate (13.69%) at the boundary of Window 2. However, it shows more false positives with a higher false positive rate (21.81%), further validating the hypothesis that the proposed method includes irrelevant points during point cloud meshing.
Collectively analyzing both tables, the proposed method demonstrates an overall lower false negative rate and achieves higher boundary integrity compared to traditional density-based boundary extraction methods. While the traditional method yields boundaries with closer spatial alignment to the ideal boundary, it suffers from poorer completeness.
Figure 19 presents experimental results of two boundary extraction attempts and the ideal boundary for Window 1. For comparative analysis, extracted boundaries are highlighted with bold points in the figures.
Both
Figure 19b,c demonstrate roughly rectangular boundary extraction results. However, the boundary points in
Figure 19b exhibit discontinuous distribution along the rectangular edges with poor connectivity, resulting in fragmented and visibly broken edges. In contrast, the boundary extraction in
Figure 19c yields smoother, more continuous points with enhanced connectivity, forming a more complete and well-defined rectangular contour without noticeable discontinuities.
Figure 20 presents experimental results of two boundary extraction attempts alongside the ideal boundary for Window 2, where critical performance differences are observed: while the traditional method fails to capture the upper and right edges of the window sill in
Figure 20b, our proposed approach (
Figure 20c) successfully extracts the complete boundary—visually confirming the quantitative superiority established in prior analyses.
Collectively evaluating the results, the proposed boundary extraction method demonstrates superior positional accuracy and continuity over traditional point cloud density-based approaches.
To verify the applicability and robustness of the method proposed in this study, boundary extraction of traffic signs was also conducted.
Figure 21 presents the ideal boundary, the boundary extraction results of Traffic Sign 1 obtained by the traditional method and the method proposed in this study. For comparative analysis, extracted boundaries are highlighted with bold points in the figures.
As can be seen from
Figure 21, compared with the ideal boundary, the boundaries extracted by the above two methods are relatively wider. The method proposed in this paper can fully extract the boundary of Traffic Sign 1, whereas traditional methods perform poorly in extracting the non-smooth bottom edge.
Table 7 presents a comparison of the results of boundary extraction for Traffic Sign No. 1 using the two methods. The traditional method has a missed detection rate (
) of 16.13%, whereas the method proposed in this paper did not miss any true boundary points, indicating that our method significantly outperforms the traditional method in detecting true boundary points of Traffic Sign 1. In terms of false positive rate (
), our method has reduced it by approximately 21% compared to the traditional method, demonstrating its superior ability to distinguish between boundaries and non-boundaries.
To verify the extraction effectiveness of proposed method under complex conditions such as protrusions and textures on object surfaces, we selected the back-side point cloud of a traffic sign as Traffic Sign 2 for experimental purposes (shown in
Figure 10c).
Figure 22 displays the ideal boundary, the boundary extracted by the traditional method, and the boundary extracted by proposed method for Traffic Sign 2. Overall, both methods yielded unsatisfactory extraction results for non-smooth surfaces. The traditional method sometimes identified the support frame of the traffic sign while failing to recognize its panel. In contrast, the proposed method occasionally identified both the support frame and the panel simultaneously.
Table 8 presents a comparison of the extraction results of the two methods for Traffic Sign No. 2. In terms of the missed detection rate (
), the traditional method stands at 16.78%, while our method achieves 0%, indicating its exceptional capability in capturing boundaries. Regarding the false detection rate (
), proposed method has significantly reduced it from 327.69% to 110.92%. Although the new method has reduced the false detection rate, the 110.92% false detection rate still indicates that a significant number of non-boundary points are being misclassified. This suggests that the proposed method lacks the utilization of spatial distribution or texture features, and relying solely on density information may not be sufficient to accurately distinguish between complex boundaries and non-boundaries on the surface.
Table 9 presents the summary of the miss rate. From the perspective of the average miss rate, the traditional density-based boundary detection method has an average miss rate of 40.73%, while the proposed method boasts an average miss rate of merely 3.54%. This indicates that, on the whole, the proposed method can more effectively capture actually existing boundaries in boundary detection, significantly reducing instances of missed detections, and its performance is markedly superior to that of the traditional method. Although the miss rates of the proposed method fluctuate to some extent across different scenarios, the overall range of fluctuation is relatively small, and all miss rates remain at a low level. This demonstrates that the method presented in this paper exhibits a certain degree of stability and adaptability. In contrast, the traditional method shows significant variations in miss rates across different scenarios, indicating its poor adaptability to varying conditions and suggesting that it may not be able to maintain satisfactory boundary detection performance in diverse scenarios.
Table 10 presents the summary of the false positive rate. The traditional method suffers from an alarmingly high average false positive rate of 101.21%, implying that, on the whole, the boundaries detected by this method significantly interfere with subsequent analysis and processing, potentially leading to erroneous decisions and conclusions. In contrast, the proposed method in this paper achieves an average false positive rate of 54.12%, which, while lower than that of the traditional method, still remains at a relatively high level. The proposed method also encounters the issue of excessive false detections in scenarios with complex surfaces, necessitating further algorithm optimization by incorporating features such as point cloud texture, semantics, and local characteristics to reduce the false positive rate.