A Multi-Perspective Recursive Slice Framework with Cross-Slice Attention for Plant Point Cloud Instance Segmentation
Abstract
1. Introduction
1.1. Analysis of Existing Work
1.2. Contributions of Our Work
- (1)
- We propose a novel multi-directional recursive slicing strategy that systematically converts unstructured plant point clouds into ordered sequences, providing a robust foundation for sequential feature learning.
- (2)
- We develop a bidirectional LSTM-based architecture to recursively aggregate slice-level features, enabling the extraction of global representations that encapsulate critical local contextual cues.
- (3)
- We introduce an attention-embedding module that explicitly and bidirectionally fuses semantic and instance features, strengthening their discriminative capacity and improving the model’s ability to resolve instance boundaries.
- (4)
- We validate the proposed framework through comprehensive experiments on multiple public plant point cloud datasets. The results demonstrate that our method outperforms state-of-the-art approaches in terms of segmentation accuracy, boundary preservation, and adaptability to complex vegetation scenarios.
2. Materials and Methods
2.1. Multi-Directional Recursive Slicing and Feature Encoding
2.1.1. Multi-Directional Recursive Slicing Strategy
2.1.2. Intra-Slice Feature Encoding and Sequential Representation
2.1.3. Contextual Modeling Across Slices with Bidirectional LSTM
2.2. Bidirectional Fusion Module with Cross-Slice Attention Embedding
2.2.1. Parallel Decoding Branches and Feature Initialization
2.2.2. Dual Parallel Attention Fusion Mechanism
2.2.3. Multi-Scale Cross-Slice Attention
2.3. Semantic-Aware Instance Clustering and Joint Optimization
2.3.1. Semantic-Aware Mean Shift Clustering
2.3.2. Bandwidth Adaptation Mechanism
2.3.3. Multi-Task Joint Loss Function
3. Results
3.1. Datasets Description
3.2. Implementation Details
3.3. Result Evaluation and Analysis
3.3.1. Result Display and Evaluation
3.3.2. Ablation Study Analysis
3.3.3. Running Time
3.4. Performance Comparison
3.5. Limitations
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Jin, S.; Li, D.; Yun, T.; Tang, J.; Wang, K.; Li, S.; Yang, H.; Yang, S.; Xu, S.; Cao, L.; et al. Deep learning for three-dimensional (3D) plant phenomics. Plant Phenomics 2025, 7, 100107. [Google Scholar] [CrossRef]
- Roggiolani, G.; Bailey, B.N.; Behley, J.; Stachniss, C. Generation of labeled leaf point clouds for plants trait estimation. Plant Phenomics 2025, 7, 100071. [Google Scholar] [CrossRef] [PubMed]
- Li, D.; Shi, G.; Kong, W.; Wang, S.; Chen, Y. A leaf segmentation and phenotypic feature extraction framework for Multiview stereo plant point clouds. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 2321–2336. [Google Scholar] [CrossRef]
- Rana, S.; Hensel, O.; Nasirahmadi, A. From vineyard to vision: Multi-domain analysis and mitigation of grape cluster detection failures in complex viticultural environments. Results Eng. 2026, 29, 108833. [Google Scholar] [CrossRef]
- Jin, S.; Su, Y.; Wu, F.; Pang, S.; Gao, S.; Hu, T.; Liu, J.; Guo, Q. Stem-Leaf Segmentation and Phenotypic Trait Extraction of Individual Maize Using Terrestrial LiDAR Data. IEEE Trans. Geosci. Remote Sens. 2019, 57, 1336–1346. [Google Scholar] [CrossRef]
- Yang, X.; Shang, Z.; Huang, H.; Liu, C.; Xu, T. SMFCA-Net: Sparse Multifrequency Cross-Attention Network for Single-Plant Point Cloud Classification and Segmentation. IEEE Trans. Geosci. Remote Sens. 2026, 64, 5700917. [Google Scholar] [CrossRef]
- Shin, S.; Zhou, K.; Vankadari, M.; Markha, A.; Trigoni, N. Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024. [Google Scholar]
- Wang, P.; Tang, Y.; Liao, Z.; Yan, Y.; Dai, L.; Liu, S.; Jiang, T. Road-Side Individual Tree Segmentation from Urban MLS Point Clouds Using Metric Learning. Remote Sens. 2023, 15, 1992. [Google Scholar] [CrossRef]
- Roh, W.; Jung, H.; Nam, G.; Yeom, J.; Park, H.; Yoon, S.; Kim, S. Edge-Aware 3D Instance Segmentation Network with Intelligent Semantic Prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024. [Google Scholar]
- Fang, L.; You, Z.; Shen, G.; Chen, Y.; Li, J. A joint deep learning network of point clouds and multiple views for roadside object classification from lidar point clouds. ISPRS J. Photogramm. Remote Sens. 2022, 193, 115–136. [Google Scholar] [CrossRef]
- Lei, H.; Akhtar, N.; Mian, A. Spherical kernel for efficient graph convolution on 3D point clouds. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 3664–3680. [Google Scholar] [CrossRef]
- Jiang, T.; Sun, J.; Liu, S.; Zhang, X.; Wu, Q.; Wang, Y. Hierarchical semantic segmentation of urban scene point clouds via group proposal and graph attention network. Int. J. Appl. Earth Obs. Geoinf. 2021, 105, 102626. [Google Scholar] [CrossRef]
- Huang, S.; Ma, Z.; Mu, T.; Fu, H.; Hu, S. Supervoxel convolution for online 3D semantic segmentation. ACM Trans. Graph. 2021, 40, 1–15. [Google Scholar] [CrossRef]
- Charles, R.Q.; Su, H.; Kaichun, M.; Guibas, L.J. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
- Jiang, T.; Wang, Y.; Liu, S.; Cong, Y.; Dai, L.; Sun, J. Local and global structure for urban ALS point cloud semantic segmentation with ground-aware attention. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5702615. [Google Scholar] [CrossRef]
- Luo, H.; Chen, C.; Fang, L.; Khoshelham, K.; Shen, G. MS-RRFSegNet: Multiscale regional relation feature segmentation network for semantic segmentation of urban scene point clouds. IEEE Trans. Geosci. Remote Sens. 2020, 58, 8301–8315. [Google Scholar] [CrossRef]
- Landrieu, L.; Simonovsky, M. Large-scale point cloud semantic segmentation with superpoint graphs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 4558–4567. [Google Scholar]
- Li, D.; Ahmed, F.; Wang, Z. 3D-NOD: 3D new organ detection in plant growth by a spatiotemporal point cloud deep segmentation framework. Plant Phenomics 2025, 7, 100002. [Google Scholar] [CrossRef]
- Yu, Q.; Du, H.; Liu, C.; Yu, X. When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 3–8 January 2024. [Google Scholar]
- Yan, M.; Zhang, J.; Zhu, Y.; Wang, H. MaskClustering: View Consensus Based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024. [Google Scholar]
- Jiang, L.; Zhao, H.; Shi, S.; Liu, S.; Fu, C.W.; Jia, J. PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020. [Google Scholar]
- Vu, T.; Kim, K.; Nguyen, T.; Luu, T.M.; Kim, J.; Yoo, C.D. Scalable SoftGroup for 3D Instance Segmentation on Point Clouds. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 1981–1995. [Google Scholar] [CrossRef]
- Chen, S.; Fang, J.; Zhang, Q.; Liu, W.; Wang, X. Hierarchical Aggregation for 3D Instance Segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021. [Google Scholar]
- Zhao, L.; Tao, W. JSNet++: Dynamic Filters and Pointwise Correlation for 3D Point Cloud Instance and Semantic Segmentation. IEEE Trans. Circuit Syst. Video Technol. 2025, 33, 1854–1867. [Google Scholar] [CrossRef]
- Fang, Z.; Zhuang, C.; Lu, Z.; Wang, Y.; Liu, L.; Xiao, J. BGPSeg: Boundary-Guided Primitive Instance Segmentation of Point Clouds. IEEE Trans. Image Process. 2025, 34, 1454–1468. [Google Scholar] [CrossRef]
- Li, D.; Shi, G.L.; Li, J.S.; Chen, Y.L.; Zhang, S.Y.; Xiang, S.Y.; Jin, S.C. PlantNet: A dual-function point cloud segmentation network for multiple plant species. ISPRS J. Photogramm. Remote Sens. 2022, 184, 243–263. [Google Scholar] [CrossRef]
- Yu, F.; Liu, K.; Zhang, Y.; Zhu, C.; Xu, K. PartNet: A Recursive Part Decomposition Network for Fine-grained and Hierarchical Shape Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019. [Google Scholar]
- Jiang, T.; Liu, S.; Zhang, Q.; Zhao, L.; Sun, J.; Wang, Y. ShrimpSeg: A local-global structure for mantis shrimp point cloud segmentation network with contextual reasoning. Appl. Opt. 2023, 62, 97–103. [Google Scholar] [CrossRef]
- Huang, Q.; Wang, W.; Neumann, U. Recurrent Slice Networks for 3D Segmentation of Point Clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018. [Google Scholar]
- Luo, Z.; Liu, D.; Li, J.; Chen, Y.; Xiao, Z.; Juniorc, J.; Gonçalvesc, W.; Wang, C. Learning sequential slice representation with an attention-embedding network for 3D shape recognition and retrieval in MLS point clouds. ISPRS J. Photogramm. Remote Sens. 2021, 176, 237–249. [Google Scholar] [CrossRef]
- Chen, X.; Wu, P.; Wu, Y.; Aboud, L.; Postolache, O.; Wang, Z. Ship trajectory prediction via a transformer-based model by considering spatial-temporal dependency. Intell. Robot. 2025, 5, 562–578. [Google Scholar] [CrossRef]
- Wang, D.; Takoudjou, S.M.; Casella, E. LeWoS: A universal leaf-wood classification method to facilitate the 3D modelling of large tropical trees using terrestrial LiDAR. Methods Ecol. Evol. 2020, 11, 376–389. [Google Scholar] [CrossRef]
- Li, L.; Li, Q.; Xu, G.; Zhou, P.; Tu, J.; Li, J.; Li, M.; Yao, J. A boundary-aware point clustering approach in Euclidean and embedding spaces for roof plane segmentation. ISPRS J. Photogramm. Remote Sens. 2024, 218, 518–530. [Google Scholar] [CrossRef]
- Qi, C.R.; Yi, L.; Su, H.; Guibas, L.J. PointNet++: Deep hierarchical feature learning on point sets in a metric space. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 5099–5108. [Google Scholar]
- Wang, D. Unsupervised semantic and instance segmentation of forest point clouds. ISPRS J. Photogramm. Remote Sens. 2020, 165, 86–97. [Google Scholar] [CrossRef]
- Li, D.; Liu, L.; Xu, S.; Jin, S. TrackPlant3D: 3D organ growth tracking framework for organ-level dynamic phenotyping. Comput. Electron. Agric. 2024, 226, 109435. [Google Scholar] [CrossRef]
- Yang, X.; Miao, T.; Tian, X.; Wang, D.; Zhao, J.; Lin, L.; Zhu, C.; Yang, T.; Xu, T. Maize stem–leaf segmentation framework based on deformable point clouds. ISPRS J. Photogramm. Remote Sens. 2024, 211, 49–66. [Google Scholar] [CrossRef]
- Du, R.; Ma, Z.; Xie, P.; He, Y.; Cen, H. PST: Plant segmentation transformer for 3D point clouds of rapeseed plants at the podding stage. ISPRS J. Photogramm. Remote Sens. 2023, 195, 380–392. [Google Scholar] [CrossRef]
- Liu, D.; Zhang, Y.; Ren, Y.; Pan, D.; He, X.; Cong, M.; Yu, G. Multi-directional attention: A lightweight attention module for slender structures. Intell. Robot. 2025, 5, 827–843. [Google Scholar] [CrossRef]
- Zhao, L.; Tao, W. JSNet: Joint instance and semantic segmentation of 3d point clouds. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), New York, NY, USA, 7–12 February 2020; pp. 12951–12958. [Google Scholar]
- Jiang, T.; Wang, Y.; Liu, S.; Zhang, Q.; Zhao, L.; Sun, J. Instance recognition of street trees from urban point clouds using a three-stage neural network. ISPRS J. Photogramm. Remote Sens. 2023, 199, 305–334. [Google Scholar] [CrossRef]
- Wang, Y.; Jiang, T.; Yu, M.; Tao, S.; Sun, J.; Liu, S. Semantic-Based Building Extraction from LiDAR Point Clouds Using Contexts and Optimization in Complex Environment. Sensors 2020, 20, 3386. [Google Scholar] [CrossRef] [PubMed]
- Sun, Y.; Zhang, Z.; Sun, K.; Li, S.; Yu, J.; Miao, L.; Zhang, Z.; Li, Y.; Zhao, H.; Hu, Z.; et al. Soybean-MVS: Annotated Three-Dimensional Model Dataset of Whole Growth Period Soybeans for 3D Plant Organ Segmentation. Agriculture 2023, 13, 1321. [Google Scholar] [CrossRef]
- Zhao, L.; Wu, S.; Fu, J.; Fang, S.; Liu, S.; Jiang, T. Panoptic Plant Recognition in 3D Point Clouds: A Dual-Representation Learning Approach with the PP3D Dataset. Remote Sens. 2025, 17, 2673. [Google Scholar] [CrossRef]
- Li, Z.; Li, M.; Shi, L.; Li, D. A novel fatigue driving detection method based on whale optimization and Attention-enhanced GRU. Intell. Robot. 2024, 4, 230–243. [Google Scholar] [CrossRef]
- Kolodiazhnyi, M.; Vorontsova, A.; Konushin, A.; Rukhovich, D. OneFormer3D: One Transformer for Unified Point Cloud Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024. [Google Scholar]
- Liu, S.; Wang, G.; Fang, H.; Huang, M.; Jiang, T.; Wang, Y. MRSliceNet: Multi-Scale Recursive Slice and Context Fusion Network for Instance Segmentation of Leaves from Plant Point Clouds. Plants 2025, 14, 3349. [Google Scholar] [CrossRef]









| Methods | Soybean-MVS | PP3D | ||||
|---|---|---|---|---|---|---|
| AP (%) | AP25 (%) | AP50 (%) | AP (%) | AP25 (%) | AP50 (%) | |
| PointGroup [21] | 25.98 | 28.26 | 27.08 | 21.67 | 39.55 | 35.32 |
| SCNet [44] | 44.01 | 50.49 | 47.45 | 46.75 | 60.18 | 55.22 |
| MRSliceNet [47] | 46.75 | 50.63 | 48.95 | 48.95 | 69.61 | 60.56 |
| MPRSF-CSA (ours) | 53.80 | 62.94 | 57.34 | 53.02 | 72.11 | 65.37 |
| Methods | AP (%) |
|---|---|
| Model A | 46.22 |
| Model B | 48.02 |
| Model B * | 48.42 |
| Model C | 49.42 |
| Model D | 50.82 |
| Model E | 51.52 |
| Model F | 51.69 |
| Full Model | 53.02 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, S.; Fang, S.; Zhang, L.; Wang, P.; Cheng, X.; Xu, L.; Sun, J.; Jiang, T. A Multi-Perspective Recursive Slice Framework with Cross-Slice Attention for Plant Point Cloud Instance Segmentation. Agriculture 2026, 16, 956. https://doi.org/10.3390/agriculture16090956
Liu S, Fang S, Zhang L, Wang P, Cheng X, Xu L, Sun J, Jiang T. A Multi-Perspective Recursive Slice Framework with Cross-Slice Attention for Plant Point Cloud Instance Segmentation. Agriculture. 2026; 16(9):956. https://doi.org/10.3390/agriculture16090956
Chicago/Turabian StyleLiu, Shan, Shilin Fang, Luhao Zhang, Pengcheng Wang, Xiaorong Cheng, Lei Xu, Jian Sun, and Tengping Jiang. 2026. "A Multi-Perspective Recursive Slice Framework with Cross-Slice Attention for Plant Point Cloud Instance Segmentation" Agriculture 16, no. 9: 956. https://doi.org/10.3390/agriculture16090956
APA StyleLiu, S., Fang, S., Zhang, L., Wang, P., Cheng, X., Xu, L., Sun, J., & Jiang, T. (2026). A Multi-Perspective Recursive Slice Framework with Cross-Slice Attention for Plant Point Cloud Instance Segmentation. Agriculture, 16(9), 956. https://doi.org/10.3390/agriculture16090956

