Robust Point Cloud Registration via Rotation-Equivariant Geometric Encoding and State Space Models
Abstract
1. Introduction
- We propose MG-Conv, a rotation-equivariant convolution that explicitly computes and aggregates intrinsic geometric attributes. It significantly enriches the discriminative power of local features without compromising the network’s global equivariance.
- We construct HGSAM, a hybrid module that decouples geometric modeling from context aggregation. By combining the precision of Transformers with the linear efficiency of Mamba, this module enables the model to focus more on overlapping regions while maintaining low computational complexity.
- We develop PCRH-P, incorporating spatial diversity sampling, consistency pruning, and soft-weighted refinement to substantially improve the reliability and stability of pose estimation.
- Extensive experiments on indoor and outdoor datasets demonstrate that our method outperforms existing approaches.
2. Related Works
2.1. Feature Extraction
2.2. Transformer on Registration
2.3. State Space Models in 3D Vision
3. Methods
3.1. Multivariate Geometric Feature Extraction
3.1.1. Multivariate Geometric Combination Encoder
3.1.2. Multivariate Geometry-Guided Rotation-Equivariant Convolution
3.2. Hybrid Geometry-State Aggregation Module
3.2.1. Local Geometric Self-Attention
3.2.2. Mamba Encoder
3.2.3. Cross-Attention
3.3. Physically Consistent Robust Hypothesis Proposer
3.3.1. Spatial Diversity Sampling
3.3.2. Point Matching
3.3.3. Loss Function
3.3.4. Feature Norm Consistency Pruning
3.3.5. Gaussian Soft-Weighted Refinement
4. Results
4.1. Implementation Details
4.2. Indoor Benchmark: 3DMatch and 3DLoMatch
4.2.1. Evaluation Metrics for Indoor Benchmarks
4.2.2. Registration Results for Indoor Benchmarks
4.2.3. Qualitative Visualization
4.3. Outdoor Benchmark: KITTI
4.3.1. Evaluation Metrics for Outdoor Benchmarks
4.3.2. Registration Results for Outdoor Benchmarks
4.4. Ablation Study
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| MGCE | Multivariate Geometric Combination Encoding |
| HGSAM | Hybrid Geometric-State Aggregation Module |
| PCRH-P | Physically Consistent Robust Hypothesis Proposer |
| VN | Vector Neuron |
| PPF | Point Pair Feature |
| SSM | State Space Model |
| RANSAC | Random Sample Consensus |
| Local-GSA | Local Geometric Self-Attention |
| PIR | Patch Inlier Ratio |
| IR | Inlier Ratio |
| RR | Registration Recall |
| FMR | Feature Matching Recall |
| RTE | Relative Translation Error |
| RRE | Relative Rotation Error |
| RMSE | Root Mean Square Error |
References
- Xu, Z.; Jiang, D. AI-Powered Plant Science: Transforming Forestry Monitoring, Disease Prediction, and Climate Adaptation. Plants 2025, 14, 1626. [Google Scholar] [CrossRef] [PubMed]
- Cheng, Y.C.; Lee, H.Y.; Tulyakov, S.; Schwing, A.G.; Gui, L.Y. Sdfusion: Multimodal 3d shape completion, reconstruction, and generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 4456–4465. [Google Scholar] [CrossRef]
- Heiselman, J.S.; Collins, J.A.; Ringel, M.J.; Peter Kingham, T.; Jarnagin, W.R.; Miga, M.I. The image-to-physical liver registration sparse data challenge: Comparison of state-of-the-art using a common dataset. J. Med. Imaging 2024, 11, 015001. [Google Scholar] [CrossRef] [PubMed]
- Brightman, N.; Fan, L.; Zhao, Y. Point cloud registration: A mini-review of current state, challenging issues and future directions. Aims Geosci. 2023, 9, 68–85. [Google Scholar] [CrossRef]
- Wang, Y.; Solomon, J.M. Deep closest point: Learning representations for point cloud registration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 3523–3532. [Google Scholar] [CrossRef]
- Lu, W.; Wan, G.; Zhou, Y.; Fu, X.; Yuan, P.; Song, S. Deepvcp: An end-to-end deep neural network for point cloud registration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 12–21. [Google Scholar] [CrossRef]
- Pais, G.D.; Ramalingam, S.; Govindu, V.M.; Nascimento, J.C.; Chellappa, R.; Miraldo, P. 3dregnet: A deep neural network for 3d point registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 7193–7203. [Google Scholar] [CrossRef]
- Choy, C.; Park, J.; Koltun, V. Fully convolutional geometric features. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 8958–8966. [Google Scholar] [CrossRef]
- Slimani, K.; Achard, C.; Tamadazte, B. RoCNet++: Triangle-based descriptor for accurate and robust point cloud registration. Pattern Recognit. 2024, 147, 110108. [Google Scholar] [CrossRef]
- Han, T.; Zhang, R.; Kan, J.; Dong, R.; Zhao, X.; Yao, S. A point cloud registration framework with color information integration. Remote Sens. 2024, 16, 743. [Google Scholar] [CrossRef]
- Dey, E.K.; Tarsha Kurdi, F.; Awrangjeb, M.; Stantic, B. Effective selection of variable point neighbourhood for feature point extraction from aerial building point cloud data. Remote Sens. 2021, 13, 1520. [Google Scholar] [CrossRef]
- Wang, Y.; Sun, Y.; Liu, Z.; Sarma, S.E.; Bronstein, M.M.; Solomon, J.M. Dynamic graph cnn for learning on point clouds. ACM Trans. Graph. 2019, 38, 1–12. [Google Scholar] [CrossRef]
- Thomas, H.; Qi, C.R.; Deschaud, J.E.; Marcotegui, B.; Goulette, F.; Guibas, L.J. Kpconv: Flexible and deformable convolution for point clouds. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 6411–6420. [Google Scholar]
- Xu, M.; Ding, R.; Zhao, H.; Qi, X. Paconv: Position adaptive convolution with dynamic kernel assembling on point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 3173–3182. [Google Scholar]
- Wang, H.; Liu, Y.; Hu, Q.; Wang, B.; Chen, J.; Dong, Z.; Guo, Y.; Wang, W.; Yang, B. RoReg: Pairwise point cloud registration with oriented descriptors and local rotations. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 10376–10393. [Google Scholar] [CrossRef]
- Wang, H.; Liu, Y.; Dong, Z.; Wang, W. You only hypothesize once: Point cloud registration with rotation-equivariant descriptors. In Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, Portugal, 10–14 October 2022; pp. 1630–1641. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. arXiv 2017, arXiv:1706.03762. [Google Scholar] [CrossRef]
- Zhang, C.; Wan, H.; Shen, X.; Wu, Z. Patchformer: An efficient point transformer with patch attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 11799–11808. [Google Scholar] [CrossRef]
- Gu, A.; Dao, T. Mamba: Linear-time sequence modeling with selective state spaces. arXiv 2024, arXiv:2312.00752. [Google Scholar] [CrossRef]
- Liang, D.; Zhou, X.; Xu, W.; Zhu, X.; Zou, Z.; Ye, X.; Tan, X.; Bai, X. Pointmamba: A simple state space model for point cloud analysis. Adv. Neural Inf. Process. Syst. 2024, 37, 32653–32677. [Google Scholar]
- Fischler, M.A.; Bolles, R.C. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
- Tarsha-Kurdi, F.; Landes, T.; Grussenmeyer, P. Extended RANSAC algorithm for automatic detection of building roof planes from LiDAR data. Photogramm. J. Finl. 2008, 21, 97–109. [Google Scholar]
- Wang, Y.; Solomon, J.M. Prnet: Self-supervised learning for partial-to-partial registration. arXiv 2019, arXiv:1910.12240. [Google Scholar] [CrossRef]
- Aoki, Y.; Goforth, H.; Srivatsan, R.A.; Lucey, S. Pointnetlk: Robust & efficient point cloud registration using pointnet. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 7163–7172. [Google Scholar] [CrossRef]
- Yew, Z.J.; Lee, G.H. Rpm-net: Robust point matching using learned features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 11824–11833. [Google Scholar] [CrossRef]
- Yu, H.; Hou, J.; Qin, Z.; Saleh, M.; Shugurov, I.; Wang, K.; Busam, B.; Ilic, S. Riga: Rotation-invariant and globally-aware descriptors for point cloud registration. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 3796–3812. [Google Scholar] [CrossRef]
- Ao, S.; Hu, Q.; Wang, H.; Xu, K.; Guo, Y. Buffer: Balancing accuracy, efficiency, and generalizability in point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 1255–1264. [Google Scholar] [CrossRef]
- Deng, C.; Litany, O.; Duan, Y.; Poulenard, A.; Tagliasacchi, A.; Guibas, L.J. Vector neurons: A general framework for so (3)-equivariant networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; pp. 12200–12209. [Google Scholar] [CrossRef]
- Ao, S.; Hu, Q.; Yang, B.; Markham, A.; Guo, Y. Spinnet: Learning a general surface descriptor for 3d point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 11753–11762. [Google Scholar] [CrossRef]
- Yao, R.; Du, S.; Cui, W.; Tang, C.; Chengwu, Y. PARE-Net: Position-aware rotation-equivariant networks for robust point cloud registration. In Proceedings of the European Conference on Computer Vision, Milan, Italy, 29 September–4 October 2024; pp. 287–303. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhang, W.; Li, J. Partial-to-partial point cloud registration by rotation invariant features and spatial geometric consistency. Remote Sens. 2023, 15, 3054. [Google Scholar] [CrossRef]
- Fu, K.; Luo, J.; Luo, X.; Liu, S.; Zhang, C.; Wang, M. Robust Point Cloud Registration Framework Based on Deep Graph Matching. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 45, 6183–6195. [Google Scholar] [CrossRef]
- Huang, S.; Gojcic, Z.; Usvyatsov, M.; Wieser, A.; Schindler, K. Predator: Registration of 3d point clouds with low overlap. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 4267–4276. [Google Scholar] [CrossRef]
- Yu, H.; Li, F.; Saleh, M.; Busam, B.; Ilic, S. Cofinet: Reliable coarse-to-fine correspondences for robust pointcloud registration. Adv. Neural Inf. Process. Syst. 2021, 34, 23872–23884. [Google Scholar]
- Yew, Z.J.; Lee, G.H. Regtr: End-to-end point cloud correspondences with transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 6677–6686. [Google Scholar] [CrossRef]
- Yu, H.; Qin, Z.; Hou, J.; Saleh, M.; Li, D.; Busam, B.; Ilic, S. Rotation-invariant transformer for point cloud matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 5384–5393. [Google Scholar] [CrossRef]
- Qin, Z.; Yu, H.; Wang, C.; Guo, Y.; Peng, Y.; Ilic, S.; Hu, D.; Xu, K. Geotransformer: Fast and robust point cloud registration with geometric transformer. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 9806–9821. [Google Scholar] [CrossRef]
- Wu, X.; Wei, X.; Xu, H.; Li, C.; Hou, Y.; Yin, Y.; He, W. PointCNT: A One-Stage Point Cloud Registration Approach Based on Complex Network Theory. Remote Sens. 2023, 15, 3545. [Google Scholar] [CrossRef]
- Ma, T.; Han, G.; Chu, Y.; Ren, H. Sparse-to-Dense Point Cloud Registration Based on Rotation-Invariant Features. Remote Sens. 2024, 16, 2485. [Google Scholar] [CrossRef]
- Xu, Z.; Gao, X.; Jiang, X.; Cheng, S.; Zhang, Q.; Li, W.; Gao, X. S2Reg: Structure-semantics collaborative point cloud registration. Pattern Recognit. 2025, 161, 111290. [Google Scholar] [CrossRef]
- Liu, J.; Wang, G.; Liu, Z.; Jiang, C.; Pollefeys, M.; Wang, H. Regformer: An efficient projection-aware transformer network for large-scale point cloud registration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 8451–8460. [Google Scholar] [CrossRef]
- He, Q.; Zhang, J.; Peng, J.; He, H.; Li, X.; Wang, Y.; Wang, C. Pointrwkv: Efficient rwkv-like model for hierarchical point cloud learning. AAAI Conf. Artif. Intell. 2025, 39, 3410–3418. [Google Scholar] [CrossRef]
- Zhu, L.; Liao, B.; Zhang, Q.; Wang, X.; Liu, W.; Wang, X. Vision mamba: Efficient visual representation learning with bidirectional state space model. arXiv 2024, arXiv:2401.09417. [Google Scholar] [CrossRef]
- Wang, Z.; Chen, Z.; Wu, Y.; Zhao, Z.; Zhou, L.; Xu, D. Pointramba: A hybrid transformer-mamba framework for point cloud analysis. arXiv 2024, arXiv:2405.15463. [Google Scholar]
- Qi, C.R.; Yi, L.; Su, H.; Guibas, L.J. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Adv. Neural Inf. Process. Syst. 2017, 30, 5105–5114. [Google Scholar]
- Arun, K.S.; Huang, T.S.; Blostein, S.D. Least-squares fitting of two 3-D point sets. IEEE Trans. Pattern Anal. Mach. Intell. 1987, PAMI-9, 698–700. [Google Scholar] [CrossRef]
- Ran, H.; Liu, J.; Wang, C. Surface representation for point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 18942–18952. [Google Scholar]
- Sarlin, P.E.; DeTone, D.; Malisiewicz, T.; Rabinovich, A. Superglue: Learning feature matching with graph neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 4938–4947. [Google Scholar] [CrossRef]
- Zeng, A.; Song, S.; Nießner, M.; Fisher, M.; Xiao, J.; Funkhouser, T. 3dmatch: Learning local geometric descriptors from rgb-d reconstructions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1802–1811. [Google Scholar] [CrossRef]
- Bai, X.; Luo, Z.; Zhou, L.; Fu, H.; Quan, L.; Tai, C.L. D3Feat: Joint learning of dense detection and description of 3D local features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 6359–6367. [Google Scholar] [CrossRef]
- Geiger, A.; Lenz, P.; Urtasun, R. Are we ready for autonomous driving? The kitti vision benchmark suite. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 3354–3361. [Google Scholar] [CrossRef]








| Category | Parameter | Symbol | Value |
|---|---|---|---|
| Architecture | Neighborhood Size | K | 35 |
| Initial Voxel Size | 0.025 m | ||
| Feature Dimensions | 32/256 | ||
| HGSAM | Neighborhood Size | ||
| Number of Iterations | L | 3 | |
| Number of Attention Heads | h | 4 | |
| Input Dimension | 768 | ||
| Output Dimension | 192 | ||
| PCRH-P | Perturbation Magnitude | ||
| Pruning Threshold | 0.08 | ||
| Acceptance Radius | 0.1 m | ||
| Number of Hypotheses | 1000 | ||
| Refinement Steps | 5 | ||
| Optimizer | Adam Hyperparameters | 0.9, 0.999, | |
| Weight Decay | - | ||
| Learning Rate Decay | - | 0.95 | |
| Fixed Random Seed | - | 7351 |
| Method | 3DMatch | 3DLoMatch | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 5000 | 2500 | 1000 | 500 | 250 | 5000 | 2500 | 1000 | 500 | 250 | |
| Feature Matching Recall (%) ↑ | ||||||||||
| FCGF [8] | 97.4 | 97.3 | 97.0 | 96.7 | 96.6 | 76.6 | 75.4 | 74.2 | 71.7 | 67.3 |
| D3Feat [50] | 95.6 | 95.4 | 94.5 | 94.1 | 93.1 | 67.3 | 66.7 | 67.0 | 66.7 | 66.5 |
| Predator [33] | 96.6 | 96.6 | 96.5 | 96.3 | 96.5 | 78.6 | 77.4 | 76.3 | 75.7 | 75.3 |
| YOHO [16] | 98.2 | 97.6 | 97.5 | 97.7 | 96.0 | 79.4 | 78.1 | 76.3 | 73.8 | 69.1 |
| CoFiNet [34] | 98.1 | 98.3 | 98.1 | 98.2 | 98.3 | 83.1 | 83.5 | 83.3 | 83.1 | 82.6 |
| GeoTransformer [37] | 97.9 | 97.9 | 97.9 | 97.9 | 97.6 | 88.3 | 88.6 | 88.8 | 88.6 | 88.3 |
| PARENet [30] | 98.5 | 98.5 | 98.5 | 98.5 | 98.7 | 87.3 | 87.3 | 87.3 | 87.4 | 87.1 |
| Ours | 99.0 | 99.0 | 99.0 | 98.8 | 98.8 | 88.2 | 88.2 | 88.2 | 88.0 | 87.8 |
| Inlier Ratio (%) ↑ | ||||||||||
| FCGF [8] | 56.8 | 54.1 | 48.7 | 42.5 | 34.1 | 21.4 | 20.0 | 17.2 | 14.8 | 11.6 |
| D3Feat [50] | 39.0 | 38.8 | 40.4 | 41.5 | 51.8 | 13.2 | 13.1 | 14.0 | 14.6 | 15.0 |
| Predator [33] | 58.0 | 58.4 | 57.1 | 54.1 | 49.3 | 26.7 | 28.1 | 28.3 | 27.5 | 25.8 |
| YOHO [16] | 64.4 | 60.7 | 55.7 | 46.4 | 41.2 | 25.9 | 23.3 | 22.6 | 18.2 | 15.0 |
| CoFiNet [34] | 49.8 | 42.2 | 51.9 | 52.2 | 52.2 | 24.4 | 25.9 | 26.7 | 26.8 | 26.9 |
| GeoTransformer [37] | 71.9 | 75.2 | 76.0 | 82.2 | 85.1 | 43.5 | 45.3 | 46.2 | 52.9 | 57.7 |
| PARENet [30] | 75.3 | 75.3 | 75.3 | 77.7 | 79.4 | 45.2 | 45.3 | 45.3 | 47.6 | 49.3 |
| Ours | 76.9 | 76.9 | 76.9 | 79.2 | 80.7 | 47.2 | 47.2 | 47.2 | 49.3 | 51.3 |
| Registration Recall (%) ↑ | ||||||||||
| FCGF [8] | 85.1 | 84.7 | 83.3 | 81.6 | 71.4 | 40.1 | 41.7 | 38.2 | 35.4 | 26.8 |
| D3Feat [50] | 81.6 | 84.5 | 83.4 | 82.4 | 77.9 | 37.2 | 42.7 | 46.9 | 43.8 | 39.1 |
| Predator [33] | 89.0 | 89.9 | 90.6 | 88.5 | 86.6 | 59.8 | 61.2 | 62.4 | 60.8 | 58.1 |
| YOHO [16] | 90.2 | 90.3 | 89.1 | 88.6 | 84.5 | 65.2 | 65.5 | 63.2 | 56.5 | 48.0 |
| CoFiNet [34] | 89.3 | 88.9 | 88.4 | 87.4 | 87.0 | 67.5 | 66.2 | 64.2 | 63.1 | 61.0 |
| GeoTransformer [37] | 92.0 | 91.8 | 91.8 | 91.4 | 91.2 | 75.0 | 74.8 | 72.2 | 74.1 | 73.5 |
| PARENet [30] | 92.5 | 92.9 | 92.7 | 91.8 | 91.4 | 75.1 | 74.9 | 73.2 | 74.3 | 71.8 |
| Ours | 94.2 | 94.1 | 93.4 | 93.0 | 92.9 | 75.9 | 76.0 | 75.7 | 74.8 | 73.3 |
| Model | Estimator | Samples | Size (MB) | RR (%) ↑ | RRE (°) ↓ | RTE (m) ↓ |
|---|---|---|---|---|---|---|
| FCGF [8] | RANSAC-50k | 5000 | 8.76 | 87.6 | – | – |
| RoReg [15] | RANSAC-50k | 5000 | 10.06 | 93.0 | – | – |
| Predator [33] | LGR | all | 7.43 | 89.0 | 2.029 | 0.064 |
| GeoTrans [37] | LGR | all | 9.83 | 92.5 | 1.772 | 0.061 |
| PARENet [30] | FHP | all | 3.84 | 95.0 | 1.888 | 0.062 |
| Ours | PCRH-P | all | 5.40 | 96.3 | 1.705 | 0.062 |
| Model | GFLOPs ↓ | Model Time (ms) ↓ | Pose Time (ms) ↓ | Peak Mem (GB) ↓ | Throughput (p/s) ↑ |
|---|---|---|---|---|---|
| GeoTrans [37] | 154.614 | 351.87 | 13.75 | 3.81 | 2.84 |
| PARENet [30] | 67.562 | 445.55 | 15.10 | 3.59 | 2.24 |
| Ours | 14.482 | 658.20 | 15.35 | 3.63 | 1.52 |
| Model | Kitchen | Home_1 | Home_2 | Hotel_1 | Hotel_2 | Hotel_3 | Study | MIT_Lab | Mean |
|---|---|---|---|---|---|---|---|---|---|
| Registration Recall (%) ↑ | |||||||||
| Predator [33] | 97.9 | 97.2 | 74.5 | 98.5 | 96.2 | 88.6 | 86.1 | 73.4 | 89.1 |
| GeoTransformer [37] | 98.2 | 98.1 | 83.6 | 97.8 | 92.3 | 88.5 | 90.2 | 91.1 | 92.5 |
| PARENet [30] | 99.6 | 98.1 | 85.5 | 99.5 | 97.4 | 92.3 | 88.5 | 93.3 | 94.3 |
| Ours | 99.1 | 98.1 | 85.5 | 98.9 | 100.0 | 100.0 | 90.6 | 97.8 | 96.3 |
| Rotation Error (°) ↓ | |||||||||
| Predator [33] | 1.861 | 1.806 | 2.473 | 2.045 | 1.600 | 2.458 | 2.067 | 1.926 | 2.029 |
| GeoTransformer [37] | 1.829 | 1.534 | 2.076 | 1.569 | 1.553 | 1.715 | 1.914 | 1.986 | 1.772 |
| PARENet [30] | 2.482 | 1.598 | 2.253 | 1.641 | 1.632 | 1.628 | 2.105 | 1.765 | 1.888 |
| Ours | 2.304 | 1.371 | 1.995 | 1.439 | 1.328 | 1.685 | 1.876 | 1.647 | 1.705 |
| Translation Error (m) ↓ | |||||||||
| Predator [33] | 0.048 | 0.055 | 0.070 | 0.073 | 0.060 | 0.065 | 0.080 | 0.063 | 0.064 |
| GeoTransformer [37] | 0.047 | 0.052 | 0.062 | 0.057 | 0.061 | 0.051 | 0.080 | 0.078 | 0.061 |
| PARENet [30] | 0.043 | 0.049 | 0.082 | 0.058 | 0.060 | 0.048 | 0.079 | 0.073 | 0.062 |
| Ours | 0.040 | 0.051 | 0.080 | 0.059 | 0.060 | 0.053 | 0.081 | 0.075 | 0.062 |
| Model | Estimator | Size (MB) | RR (%) ↑ | RRE (°) ↓ | RTE (cm) ↓ |
|---|---|---|---|---|---|
| FCGF [8] | RANSAC-50k | 8.76 | 96.6 | 0.30 | 9.5 |
| D3Feat [50] | RANSAC-50k | 14.08 | 98.8 | 0.30 | 7.2 |
| Predator [33] | LGR | 22.77 | 99.8 | 0.27 | 6.8 |
| CoFiNet [34] | LGR | 5.48 | 99.8 | 0.41 | 8.2 |
| GeoTransformer [37] | LGR | 25.50 | 99.8 | 0.27 | 6.8 |
| Ours | PCRH-P | 2.53 | 99.8 | 0.22 | 5.2 |
| Model | Backbone | Feature Interaction | Estimator | PIR (%) ↑ | FMR (%) ↑ | IR (%) ↑ | RR (%) ↑ | Time (s) ↓ |
|---|---|---|---|---|---|---|---|---|
| Model A | PARE-Conv | Self + Cross | FHP | 84.1 | 98.4 | 72.0 | 94.3 | 0.165 |
| Model B | MG-Conv | Self + Cross | FHP | 85.3+1.2 | 98.6+0.2 | 73.6+1.6 | 94.9+0.6 | 0.176 |
| Model C | MG-Conv | Self + Self + Cross | FHP | 85.4+1.3 | 98.6+0.2 | 75.4+3.4 | 95.0+0.7 | 0.649 |
| Model D | PARE-Conv | HGSAM | FHP | 84.8+0.7 | 98.6+0.2 | 76.7+4.7 | 95.2+0.9 | 0.182 |
| Model E | MG-Conv | HGSAM | FHP | 85.3+1.2 | 99.0+0.6 | 76.9+4.9 | 95.8+1.5 | 0.190 |
| Model F | MG-Conv | HGSAM | PCRH-P | 85.3+1.2 | 99.0+0.6 | 76.9+4.9 | 96.3+2.0 | 0.192 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Li, J.; Liu, J.; Chen, A.; Shen, H.; Yuan, J. Robust Point Cloud Registration via Rotation-Equivariant Geometric Encoding and State Space Models. J. Imaging 2026, 12, 214. https://doi.org/10.3390/jimaging12050214
Li J, Liu J, Chen A, Shen H, Yuan J. Robust Point Cloud Registration via Rotation-Equivariant Geometric Encoding and State Space Models. Journal of Imaging. 2026; 12(5):214. https://doi.org/10.3390/jimaging12050214
Chicago/Turabian StyleLi, Junjie, Jiajun Liu, Anqi Chen, Huifang Shen, and Jianya Yuan. 2026. "Robust Point Cloud Registration via Rotation-Equivariant Geometric Encoding and State Space Models" Journal of Imaging 12, no. 5: 214. https://doi.org/10.3390/jimaging12050214
APA StyleLi, J., Liu, J., Chen, A., Shen, H., & Yuan, J. (2026). Robust Point Cloud Registration via Rotation-Equivariant Geometric Encoding and State Space Models. Journal of Imaging, 12(5), 214. https://doi.org/10.3390/jimaging12050214

