4.4. Comparative Experiments
We compare the proposed method with representative point cloud completion methods, including PCN, GRNet, TopNet, PointTr, SnowflakeNet, AnchorFormer, ODG, and SymmCompletion. These baselines cover voxel/grid-based, point-based generative, Transformer-based, dictionary-guided, and symmetry-guided completion methods. The comparisons are conducted on PCN, MVP, and KITTI to evaluate completion performance under synthetic single-view incompleteness, synthetic multi-view occlusion, and real-world LiDAR sparsity, respectively.
Results on PCN. As shown in
Table 2, we evaluate the proposed method on the PCN dataset and compare it with representative baselines, including PCN, GRNet, TopNet, PointTr, SnowflakeNet, AnchorFormer, ODG, and SymmCompletion. Following the standard PCN evaluation protocol, we report the L1 Chamfer Distance (CD-L1,
; lower is better) and the F-score at the 1% threshold (higher is better).
The results in
Table 2 show that the proposed method achieves the lowest average CD-L1 of 6.26 and the highest F-score of 0.852 on the PCN dataset. Compared with earlier methods such as PCN, TopNet, and GRNet, our method substantially improves both geometric accuracy and overlap quality. Compared with recent competitive methods, including SnowflakeNet, AnchorFormer, ODG, and SymmCompletion, our method remains consistently competitive and achieves the best average performance.
It should be noted that the performance margin over SymmCompletion on PCN is relatively small. Since repeated independent training of the full model and the strongest baselines requires substantial computational resources, we report single-run results following the common benchmark protocol in this revision. Therefore, the PCN comparison should be interpreted as demonstrating competitive performance against the strongest symmetry-guided baseline rather than statistically significant superiority. Multi-run mean and standard deviation results will be included in future extended evaluations when sufficient computational resources are available.
At the category level, our method obtains the best or tied-best CD-L1 on Airplane, Car, Sofa, Table, and Watercraft, and remains close to the best results on Cabinet, Chair, and Lamp. These results suggest that the proposed structure-aware symmetry-guided refinement is effective not only for highly regular categories such as Airplane and Car, but also for categories with more diverse local structures.
Qualitative comparisons in
Figure 4 further corroborate the quantitative trends. Across all four categories, early baselines such as GRNet, TopNet, and PCN tend to produce over-smoothed or incomplete structures under heavy missing regions. Transformer-based methods such as PointTr, AnchorFormer, and ODG improve global plausibility, but may still exhibit point sparsity, local holes, or blurred boundaries in thin structures. By contrast, the proposed method generates more compact and structurally consistent reconstructions with clearer contours and more uniform point distributions. In particular, on
Car and
Boat, our results better preserve the overall hull/body shape and reduce missing regions; on
Airplane, the wings and tail are reconstructed with improved continuity; and on
Sofa, the seat and back surfaces are more complete with fewer artifacts.
A detailed comparison on the
Car category is further provided in
Figure 5. The zoomed-in regions show that our method produces denser and more continuous local structures, especially around boundary and surface regions, while better preserving the global vehicle shape.
Results on MVP. We further evaluate the proposed method on the MVP dataset, which contains more diverse partial views and more complex multi-view occlusion patterns than PCN. We compare our method with five representative baselines, including PointTr, SnowflakeNet, AnchorFormer, SymmCompletion, and ODG.
Table 3 reports the category-wise CD-L2 and the overall F-score.
As shown in
Table 3, the proposed method achieves the lowest average CD-L2 of 2.17 and the highest F-score of 0.874. Compared with SymmCompletion and ODG, our method improves both average CD-L2 and F-score, indicating that the proposed graph-based structural reasoning and confidence-weighted fusion are beneficial under diverse multi-view occlusion patterns.
At the category level, the proposed method achieves the best CD-L2 in seven out of 16 categories, including Chair, Table, Sofa, Lamp, Watercraft, Bench, and Bookshelf, while remaining competitive in the remaining categories. The results also show that the method performs well on both strongly structured categories and categories with weaker or more local symmetry. This suggests that SymGraphNet does not merely learn strict mirror symmetry, but captures more general symmetry-guided structural correspondences.
Qualitative comparisons on MVP are shown in
Figure 6. Compared with the baseline methods, our method produces more complete object structures and more continuous local surfaces under complex multi-view occlusions. The improvements are particularly visible in categories with missing legs, thin structures, or incomplete planar surfaces, where the proposed progressive refinement strategy helps reduce local holes and improve point distribution uniformity.
Results on KITTI. To further evaluate real-world generalization, we test the proposed method on the KITTI dataset. Unlike PCN and MVP, KITTI contains sparse and noisy LiDAR scans from driving scenarios, making it more challenging for point cloud completion. Following common practice, we report Fréchet Distance (FD) and Maximum Mean Discrepancy (MMD) for quantitative evaluation.
Table 4 reports the quantitative results on KITTI. The proposed method achieves the best FD of 2.518 and the best MMD of 1.123 among the compared methods. These results indicate that the proposed framework generalizes reasonably well from synthetic training data to sparse and noisy real-world LiDAR scans.
The qualitative results in
Figure 7 further show the visual comparison with the reference KITTI scan. Since KITTI does not provide complete ground truth shapes in the same way as synthetic datasets, this reference scan is used only for qualitative visualization. SnowflakeNet and SymmCompletion may produce missing regions or uneven point distributions around key vehicle structures such as the roof and body boundaries. In contrast, our method generates smoother and more complete vehicle shapes with more uniform point distributions. This suggests that the structure-aware coarse prior and progressive refinement strategy are helpful for real-world point cloud completion.
Efficiency analysis. In addition to reconstruction quality, we evaluate the inference efficiency of the proposed method.
Table 5 reports the number of parameters and inference latency under the same test setting with batch size 1. Although our method has a comparable parameter count to SymmCompletion, its latency is higher. This indicates that parameter count and runtime are not strictly correlated in point cloud completion networks.
The higher latency mainly comes from three sources. First, SymGraphNet constructs an on-the-fly feature-space k-NN graph, which introduces additional neighbor search and indexing operations. Second, GAT-based message passing involves irregular memory access and scatter/gather operations, which are usually less GPU-friendly than dense MLP or Transformer computations. Third, the two-stage Cross-Aware Decoder performs dual cross-attention and token-wise confidence-gated fusion at each refinement stage, introducing additional computation during progressive refinement.
Nevertheless, the increased computational cost is associated with the improved structural modeling ability of the proposed method. Compared with purely point-based or Transformer-based refinement, the feature-space graph reasoning provides additional structural correspondence modeling, while the confidence-weighted decoder improves the fusion of partial-observation and symmetry-guided cues. Therefore, the proposed method provides a trade-off between reconstruction accuracy and inference efficiency.
For applications with strict real-time constraints, the current graph construction and graph attention operations may become the main computational bottleneck. Future optimization can include lightweight graph construction, cached neighborhood computation, or replacing GATConv with more efficient local aggregation operators.