3D Point Cloud Instance Segmentation Considering Global Shape Contour Constraints
Abstract
1. Introduction
- We propose Global Shape Attention (GSA), a Transformer network module that efficiently encodes global shape contour information.
- We fuse the output of the GSA module with convolutional features to obtain instance features that carry both global and local information, which alleviates the problem that similar instances cannot be separated from one another.
- We design an end-to-end point cloud instance segmentation network that achieves competitive results on both indoor (S3DIS, ScanNet) and outdoor (STPLS3D) point cloud instance segmentation tasks.
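To make the first two contributions concrete, the following is a minimal sketch of the general idea only: every point attends to every other point (global context), and the result is fused with per-point local features. It is not the GSA module itself, whose details are not given in this outline. It assumes plain single-head scaled dot-product self-attention and fusion by concatenation followed by a linear projection; all names (`global_attention`, `fuse`, `w_q`, `w_out`, the feature sizes) are hypothetical, and random arrays stand in for convolutional features.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def global_attention(feats, w_q, w_k, w_v):
    """Single-head self-attention over all N points: (N, C) -> (N, D)."""
    q, k, v = feats @ w_q, feats @ w_k, feats @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])  # (N, N) pairwise affinities
    return softmax(scores, axis=-1) @ v      # each point aggregates all others


def fuse(local_feats, global_feats, w_out):
    """Assumed fusion: concatenate local and global features, then project."""
    return np.concatenate([local_feats, global_feats], axis=-1) @ w_out


rng = np.random.default_rng(0)
n_points, c_local, d_attn, c_out = 128, 32, 16, 32  # illustrative sizes

# Stand-in for per-point convolutional (local) features.
local_feats = rng.standard_normal((n_points, c_local))
w_q, w_k, w_v = (rng.standard_normal((c_local, d_attn)) * 0.1 for _ in range(3))
w_out = rng.standard_normal((c_local + d_attn, c_out)) * 0.1

global_feats = global_attention(local_feats, w_q, w_k, w_v)
fused = fuse(local_feats, global_feats, w_out)
print(fused.shape)  # (128, 32)
```

Because the attention matrix is N × N, a sketch like this is only practical on a subsampled or voxelized point set; how the actual module keeps the global encoding efficient is a design detail this example does not attempt to reproduce.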
2. Related Work
2.1. Bottom–Up
2.1.1. Clustering Based on High-Dimensional Features
2.1.2. Vote Clustering Based on the Geometric Center
2.2. Top–Down
2.3. Directly Predicting Instance Masks
3. Method
3.1. Whole Network Framework
3.2. Feature Fusion Module
3.3. Global Shape Attention Module
3.4. Instance Query Module
3.5. Instance Mask Module
3.6. Loss Function
4. Experiments and Discussion
4.1. Experiments
4.1.1. S3DIS Dataset
4.1.2. ScanNet Dataset
4.1.3. STPLS3D Dataset
4.2. Discussion
4.2.1. Comparison
4.2.2. Visualization of Shape Contour Features
4.2.3. Model Size and Complexity
4.2.4. Influence of Initial Sampling Position
4.2.5. Position Encoding
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Yi, L.; Zhao, W.; Wang, H.; Sung, M.; Guibas, L.J. GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point Cloud. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 3942–3951.
- Hou, J.; Dai, A.; Nießner, M. 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 4416–4425.
- Yang, B.; Wang, J.; Clark, R.; Hu, Q.; Wang, S.; Markham, A.; Trigoni, N. Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: New York, NY, USA, 2019; Volume 32.
- Wang, W.; Yu, R.; Huang, Q.; Neumann, U. SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 2569–2578.
- Wang, X.; Liu, S.; Shen, X.; Shen, C.; Jia, J. Associatively Segmenting Instances and Semantics in Point Clouds. arXiv 2019, arXiv:1902.09852.
- Elich, C.; Engelmann, F.; Kontogianni, T.; Leibe, B. 3D-BEVIS: Bird’s-Eye-View Instance Segmentation. Pattern Recognit. 2019, 11824, 48–61.
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-End Object Detection with Transformers. arXiv 2020, arXiv:2005.12872.
- Zhu, X.; Su, W.; Lu, L.; Li, B.; Wang, X.; Dai, J. Deformable DETR: Deformable Transformers for End-to-End Object Detection. arXiv 2021, arXiv:2010.04159.
- Schult, J.; Engelmann, F.; Hermans, A.; Litany, O.; Tang, S.; Leibe, B. Mask3D for 3D Semantic Instance Segmentation. arXiv 2022, arXiv:2210.03105.
- Kuhn, H. The Hungarian Method for the Assignment Problem. Nav. Res. Logist. NRL 1955, 2, 83–97.
- Kuhn, H.W. Variants of the Hungarian Method for Assignment Problems. Nav. Res. Logist. 1956, 3, 253–258.
- Armeni, I.; Sener, O.; Zamir, A.R.; Jiang, H.; Brilakis, I.; Fischer, M.; Savarese, S. 3D Semantic Parsing of Large-Scale Indoor Spaces. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 1534–1543.
- Dai, A.; Chang, A.X.; Savva, M.; Halber, M.; Funkhouser, T.; Nießner, M. ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2432–2443.
- Chen, M.; Hu, Q.; Hugues, T.; Feng, A.; Hou, Y.; McCullough, K.; Soibelman, L. STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset. arXiv 2022, arXiv:2203.09065.
- Ding, Z.; Han, X.; Niethammer, M. VoteNet: A Deep Learning Label Fusion Method for Multi-Atlas Segmentation. arXiv 2019, arXiv:1904.08963.
- Vu, T.; Kim, K.; Luu, T.M.; Nguyen, X.T.; Yoo, C.D. SoftGroup for 3D Instance Segmentation on Point Clouds. arXiv 2022, arXiv:2203.01509.
- Qi, C.R.; Su, H.; Mo, K.; Guibas, L.J. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. arXiv 2017, arXiv:1612.00593.
- Qi, C.R.; Yi, L.; Su, H.; Guibas, L.J. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. arXiv 2017, arXiv:1706.02413.
- Xv, J.; Deng, F.; Liu, H. Point Cloud Convolution Network Based on Spatial Location Correspondence. ISPRS Int. J. Geo-Inf. 2022, 11, 591.
- Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. arXiv 2019, arXiv:1711.05101.
- Smith, L.N.; Topin, N. Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates. arXiv 2018, arXiv:1708.07120.
- Jiang, L.; Zhao, H.; Shi, S.; Liu, S.; Fu, C.-W.; Jia, J. PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 4866–4875.
- Zhong, M.; Chen, X.; Chen, X.; Zeng, G.; Wang, Y. MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation. arXiv 2022, arXiv:2203.14662.
- Liang, Z.; Li, Z.; Xu, S.; Tan, M.; Jia, K. Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 11–17 October 2021; pp. 2763–2772.
- Engelmann, F.; Bokeloh, M.; Fathi, A.; Leibe, B.; Nießner, M. 3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 9028–9037.
- He, T.; Shen, C.; van den Hengel, A. DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 354–363.
- Han, L.; Zheng, T.; Xu, L.; Fang, L. OccuSeg: Occupancy-Aware 3D Instance Segmentation. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 2937–2946.
- Chen, S.; Fang, J.; Zhang, Q.; Liu, W.; Wang, X. Hierarchical Aggregation for 3D Instance Segmentation. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 11–17 October 2021; pp. 15447–15456.

| Method | mAP | mAP50 | Prec50 | Rec50 | 
|---|---|---|---|---|
| SGPN [4] | - | - | 36.0 | 28.7 | 
| ASIS [5] | - | - | 55.3 | 42.4 | 
| 3D-BoNet [3] | - | - | 57.5 | 40.2 | 
| PointGroup [22] | - | 57.8 | 61.9 | 62.1 | 
| MaskGroup [23] | - | 65.0 | 62.9 | 64.7 | 
| SSTNet [24] | 42.7 | 59.3 | 65.5 | 64.2 | 
| Mask3D [9] | 56.6 | 68.4 | 68.7 | 66.3 | 
| Ours | 52.4 | 66.9 | 63.6 | 64.3 | 

| Method | ScanNet Val mAP | ScanNet Val mAP50 | ScanNet Test mAP | ScanNet Test mAP50 |
|---|---|---|---|---|
| SGPN [4] | - | - | 4.9 | 14.3 | 
| GSPN [1] | 19.3 | 37.8 | - | 30.6 | 
| 3D-SIS [2] | - | 18.7 | 16.1 | 38.2 | 
| 3D-BoNet [3] | - | - | 25.3 | 48.8 | 
| MTML [25] | 20.3 | 40.2 | 28.2 | 54.9 | 
| 3D-MPA [25] | 35.5 | 59.1 | 35.5 | 61.1 | 
| DyCo3D [26] | 35.4 | 57.6 | 39.5 | 64.1 | 
| PointGroup [22] | 34.8 | 57.6 | 40.7 | 63.6 | 
| MaskGroup [23] | 42.0 | 63.3 | 43.4 | 66.4 | 
| OccuSeg [27] | 44.2 | 60.7 | 48.6 | 67.2 | 
| SSTNet [24] | 49.4 | 64.3 | 50.6 | 69.8 | 
| HAIS [28] | 43.5 | 64.1 | 45.7 | 69.9 | 
| SoftGroup [16] | 46.0 | 67.6 | 50.4 | 76.1 | 
| Mask3D [9] | 55.2 | 73.7 | 50.6 | 78.0 | 
| Ours | 54.5 | 70.5 | 49.8 | 77.3 | 

| Class | AP | AP50 | AP25 |
|---|---|---|---|
| Building | 0.822 | 0.905 | 0.918 | 
| Low vegetation | 0.329 | 0.617 | 0.731 | 
| Middle vegetation | 0.361 | 0.539 | 0.676 | 
| High vegetation | 0.488 | 0.922 | 0.985 | 
| Car | 0.831 | 0.912 | 0.985 | 
| Trucks | 0.815 | 0.900 | 0.946 | 
| Aircraft | 0.603 | 0.808 | 0.850 | 
| Military vehicles | 0.816 | 0.818 | 0.886 | 
| Bikes | 0.218 | 0.547 | 0.695 | 
| Motorcycle | 0.604 | 0.819 | 0.912 | 
| Light pole | 0.602 | 0.816 | 0.902 | 
| Street sign | 0.213 | 0.431 | 0.541 | 
| Clutter | 0.601 | 0.733 | 0.792 | 
| Fence | 0.442 | 0.627 | 0.860 | 
| Average (mAP) | 0.551 | 0.717 | 0.815 | 

| Method | Model Size | Run Time (ms) | 
|---|---|---|
| HAIS [28] | 30.856 M | 339 | 
| SoftGroup [16] | 30.858 M | 345 | 
| Mask3D [9] | 39.617 M | 339 | 
| Ours | 15.7 M | 253 | 

| Setting | mAP | mAP50 |
|---|---|---|
| Same initial location | 50.1 | 63.4 | 
| Different initial location | 50.4 | 63.9 | 

| Setting | mIoU |
|---|---|
| With PE | 64.0 | 
| Without PE | 63.9 | 

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Xv, J.; Deng, F. 3D Point Cloud Instance Segmentation Considering Global Shape Contour Constraints. Remote Sens. 2023, 15, 4939. https://doi.org/10.3390/rs15204939