Long-Tail Learning for Three-Dimensional Pavement Distress Segmentation Using Point Clouds Reconstructed from a Consumer Camera
Highlights
- A workflow encompassing low-cost and readily deployable acquisition, multi-class pavement distress automated segmentation, and geometric information extraction, based on point clouds.
- An end-to-end network for segmenting distress from pavement point clouds, incorporating a long-tail class imbalance mitigation strategy and a dual-stream feature fusion module.
- Enables scalable, low-cost 3D pavement inspection and monitoring using consumer-grade imaging, reducing reliance on expensive scanning systems and improving deploy ability for routine inspections.
- Provides engineering-ready 3D distress outputs to support condition assessment, maintenance prioritization, and integration into intelligent pavement management workflows.
Abstract
1. Introduction
2. Data Acquisition and Processing
2.1. Point Cloud Generation
2.2. Point Cloud Processing
2.3. Dataset Construction
3. Methodology
3.1. Network Structure
3.2. Re-Sampling
3.3. Re-Weighting
3.4. Post-Processing and Geometric Measurement
4. Experimental Setup
5. Results
5.1. Re-Sampling and Re-Weighting
5.2. Model Testing
5.3. Ablation Test
5.4. Comparative Test
5.5. Field Evaluation
6. Discussion
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Kothai, R.; Prabakaran, N.; Srinivasa Murthy, Y.V.; Reddy Cenkeramaddi, L.; Kakani, V. Pavement Distress Detection, Classification, and Analysis Using Machine Learning Algorithms: A Survey. IEEE Access 2024, 12, 126943–126960. [Google Scholar] [CrossRef]
- Mathavan, S.; Kamal, K.; Rahman, M. A Review of Three-Dimensional Imaging Technologies for Pavement Distress Detection and Measurements. IEEE Trans. Intell. Transp. Syst. 2015, 16, 2353–2362. [Google Scholar] [CrossRef]
- Faisal, A.; Gargoum, S. Cost-effective LiDAR for pothole detection and quantification using a low-point-density approach. Autom. Constr. 2025, 172, 106006. [Google Scholar] [CrossRef]
- Zhang, D.; Zou, Q.; Lin, H.; Xu, X.; He, L.; Gui, R.; Li, Q. Automatic pavement defect detection using 3D laser profiling technology. Autom. Constr. 2018, 96, 350–365. [Google Scholar] [CrossRef]
- Chen, X.; Li, J.; Huang, S.; Cui, H.; Liu, P.; Sun, Q. An Automatic Concrete Crack-Detection Method Fusing Point Clouds and Images Based on Improved Otsu’s Algorithm. Sensors 2021, 21, 1581. [Google Scholar] [CrossRef]
- Liu, R.; Yang, J.; Ren, H.; Cong, B.; Chang, C. Research on a pavement pothole extraction method based on vehicle-borne continuous laser scanning point cloud. Meas. Sci. Technol. 2022, 33, 115204. [Google Scholar] [CrossRef]
- Song, H.; Zhang, J.; Zuo, J.; Liang, X.; Han, W.; Ge, J. Subsidence Detection for Urban Roads Using Mobile Laser Scanner Data. Remote Sens. 2022, 14, 2240. [Google Scholar] [CrossRef]
- Fan, R.; Ozgunalp, U.; Wang, Y.; Liu, M.; Pitas, I. Rethinking Road Surface 3-D Reconstruction and Pothole Detection: From Perspective Transformation to Disparity Map Segmentation. IEEE Trans. Cybern. 2022, 52, 5799–5808. [Google Scholar] [CrossRef] [PubMed]
- Khan, N.H.R.; Kumar, S.V. Terrestrial LiDAR derived 3D point cloud model, digital elevation model (DEM) and hillshade map for identification and evaluation of pavement distresses. Results Eng. 2024, 23, 102680. [Google Scholar] [CrossRef]
- Sun, Q.; Qiao, L.; Shen, Y. Pavement Potholes Quantification: A Study Based on 3D Point Cloud Analysis. IEEE Access 2025, 13, 12945–12955. [Google Scholar] [CrossRef]
- Li, J.; Liu, T.; Wang, X. Advanced pavement distress recognition and 3D reconstruction by using GA-DenseNet and binocular stereo vision. Measurement 2022, 201, 111760. [Google Scholar] [CrossRef]
- Zhang, H.; Wang, C.; Tian, S.; Lu, B.; Zhang, L.; Ning, X.; Bai, X. Deep learning-based 3D point cloud classification: A systematic survey and outlook. Displays 2023, 79, 102456. [Google Scholar] [CrossRef]
- Tang, X.; Huang, F.; Li, C.; Ban, D. A survey on end-to-end point cloud learning. IET Image Process. 2023, 17, 1307–1321. [Google Scholar] [CrossRef]
- Su, H.; Maji, S.; Kalogerakis, E.; Learned-Miller, E. Multi-View Convolutional Neural Networks for 3D Shape Recognition. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 945–953. [Google Scholar]
- Yu, T.; Meng, J.; Yuan, J. Multi-View Harmonized Bilinear Network for 3D Object Recognition. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 186–194. [Google Scholar]
- Li, Y.; Pirk, S.; Su, H.; Qi, C.R.; Guibas, L.J. FPNN: Field Probing Neural Networks for 3D Data. Adv. Neural Inf. Process. Syst. 2016, 29, 9. [Google Scholar]
- Wang, L.; Huang, Y.; Shan, J.; He, L. MSNet: Multi-Scale Convolutional Network for Point Cloud Classification. Remote Sens. 2018, 10, 612. [Google Scholar] [CrossRef]
- Qi, C.R.; Su, H.; Mo, K.; Guibas, L.J. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation; Cornell University Library: Ithaca, Greece, 2017. [Google Scholar]
- Qi, C.R.; Yi, L.; Su, H.; Guibas, L.J. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. arXiv 2017, arXiv:1706.02413. [Google Scholar]
- Wang, Y.; Sun, Y.; Liu, Z.; Sarma, S.E.; Bronstein, M.M.; Solomon, J.M. Dynamic Graph CNN for Learning on Point Clouds. ACM Trans. Graph. 2018, 38, 1–12. [Google Scholar] [CrossRef]
- Guo, M.; Cai, J.; Liu, Z.; Mu, T.; Martin, R.R.; Hu, S. PCT: Point cloud transformer. Comput. Vis. Media 2021, 7, 187–199. [Google Scholar] [CrossRef]
- Qian, G.; Li, Y.; Peng, H.; Mai, J.; Hammoud, H.A.A.K.; Elhoseiny, M.; Ghanem, B. PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies. Adv. Neural Inf. Process. Syst. 2022, 35, 23192–23204. [Google Scholar]
- Shim, S.; Kim, J.; Cho, G.; Lee, S. Stereo-vision-based 3D concrete crack detection using adversarial learning with balanced ensemble discriminator networks. Struct. Health Monit. 2023, 22, 1353–1375. [Google Scholar] [CrossRef]
- Ma, L.; Li, J. SD-GCN: Saliency-based dilated graph convolution network for pavement crack extraction from 3D point clouds. Int. J. Appl. Earth Obs. 2022, 111, 102836. [Google Scholar] [CrossRef]
- Feng, H.; Ma, L.; Yu, Y.; Chen, Y.; Li, J. SCL-GCN: Stratified Contrastive Learning Graph Convolution Network for pavement crack detection from mobile LiDAR point clouds. Int. J. Appl. Earth Obs. 2023, 118, 103248. [Google Scholar] [CrossRef]
- Faris, A.; Charlotte, S.; Michael, M.; Thierry, P. PointCrack3D: Crack Detection in Unstructured Environments using a 3D-Point-Cloud-Based Deep Neural Network. arXiv 2021, arXiv:2111.11615. [Google Scholar] [CrossRef]
- Fan, J.; Song, W.; Zhang, J.; Sun, S.; Jia, G.; Jin, G. PAN: Improved PointNet++ for Pavement Crack Information Extraction. Electronics 2024, 13, 3340. [Google Scholar] [CrossRef]
- Pascucci, N.; Dominici, D.; Habib, A. LiDAR-Based Road Cracking Detection: Machine Learning Comparison, Intensity Normalization, and Open-Source WebGIS for Infrastructure Maintenance. Remote Sens. 2025, 17, 1543. [Google Scholar] [CrossRef]
- Feng, H.; Li, W.; Luo, Z.; Chen, Y.; Fatholahi, S.N.; Cheng, M.; Wang, C.; Junior, J.M.; Li, J. GCN-Based Pavement Crack Detection Using Mobile LiDAR Point Clouds. IEEE Trans. Intell. Transp. Syst. 2022, 23, 11052–11061. [Google Scholar] [CrossRef]
- Feng, H.; Li, W.; Ma, L.; Chen, Y.; Guan, H.; Yu, Y.; Junior, J.M.; Li, J. Crack-U2 Net: Multiscale Feature Learning Network for Pavement Crack Detection From Large-Scale MLS Point Clouds. IEEE Trans. Intell. Transp. Syst. 2024, 25, 17952–17964. [Google Scholar] [CrossRef]
- Wang, N.; Dong, J.; Fang, H.; Li, B.; Zhai, K.; Ma, D.; Shen, Y.; Hu, H. 3D reconstruction and segmentation system for pavement potholes based on improved structure-from-motion (SFM) and deep learning. Constr. Build. Mater. 2023, 398, 132499. [Google Scholar] [CrossRef]
- Dong, J.; Wang, N.; Fang, H.; Lu, H.; Ma, D.; Hu, H. Automatic augmentation and segmentation system for three-dimensional point cloud of pavement potholes by fusion convolution and transformer. Adv. Eng. Inform. 2024, 60, 17. [Google Scholar] [CrossRef]
- Zhang, Y.; Kang, B.; Hooi, B.; Yan, S.; Feng, J. Deep Long-Tailed Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 10795–10816. [Google Scholar] [CrossRef]
- Chen, W.; Yang, K.; Yu, Z.; Shi, Y.; Chen, C.L.P. A survey on imbalanced learning: Latest research, applications and future directions. Artif. Intell. Rev. 2024, 57, 51. [Google Scholar] [CrossRef]
- Zizhao, Z.; Tomas, P. Learning Fast Sample Re-weighting Without Reward Data. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 11–17 October 2021; pp. 705–714. [Google Scholar]
- Mazzini, D.; Napoletano, P.; Piccoli, F.; Schettini, R. A Novel Approach to Data Augmentation for Pavement Distress Segmentation. Comput. Ind. 2020, 121, 103225. [Google Scholar] [CrossRef]
- Xu, Z.; Dai, Z.; Sun, Z.; Zuo, C.; Song, H.; Yuan, C. Enhancing Pavement Distress Detection Using a Morphological Constraints-Based Data Augmentation Method. Coatings 2023, 13, 764. [Google Scholar] [CrossRef]
- Wei, T.; Shi, J.; Tu, W.; Li, Y. Robust Long-Tailed Learning under Label Noise. Front. Comput. Sci. 2026, 20, 2001321. [Google Scholar] [CrossRef]
- Panella, F.; Lipani, A.; Boehm, J. Semantic segmentation of cracks: Data challenges and architecture. Autom. Constr. 2022, 135, 104110. [Google Scholar] [CrossRef]
- Fan, Y.; Hu, Z.; Li, Q.; Sun, Y.; Chen, J.; Zhou, Q. CrackNet: A Hybrid Model for Crack Segmentation with Dynamic Loss Function. Sensors 2024, 24, 7134. [Google Scholar] [CrossRef]
- Shi, J.; We, T.; Zhou, Z.; Shao, J.; Han, X.; Li, Y. Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts. arXiv 2024, arXiv:2309.10019. [Google Scholar]
- Li, M.; Liu, Y.; Lu, Y.; Zhang, Y.; Cheung, Y.; Huang, H. Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition. Adv. Neural Inf. Process. Syst. 2024, 37, 103985–104009. [Google Scholar]
- Zhang, J.; Sun, S.; Song, W.; Li, Y.; Teng, Q. A novel convolutional neural network for enhancing the continuity of pavement crack detection. Sci. Rep. 2024, 14, 20. [Google Scholar] [CrossRef]
- Lin, T.; Goyal, P.; Girshick, R.; He, K.; Dollar, P. Focal Loss for Dense Object Detection. IEEETrans. Pattern Anal. Mach. Intell. 2020, 42, 318–327. [Google Scholar] [CrossRef] [PubMed]
- Haeyong, K.; Thang, V.; Yoo, C.D. Learning Imbalanced Datasets With Maximum Margin Loss. In Proceedings of the 2021 IEEE International Conference on Image Processing (ICIP), Anchorage, AK, USA, 19–22 September 2021; pp. 1269–1273. [Google Scholar]
- Liang, D.; Zhou, X.; Xu, W.; Zhu, X.; Zou, Z.; Ye, X.; Tan, X.; Bai, X. PointMamba: A Simple State Space Model for Point Cloud Analysis. Adv. Neural Inf. Process. Syst. 2024, 37, 32653–32677. [Google Scholar]
















| Sampling Strategy | Points Sampled in Testing Set | Accuracy (%) | mIoU (%) | ||
|---|---|---|---|---|---|
| Potholes | Cracks | ||||
| Random sampling | 41,962 | 135,020 | 96.70 | 67.84 | |
| Adaptive interval sampling | k = 10, β = 3 | 74,436 | 194,452 | 96.21 | 67.55 |
| k = 15, β = 3 | 75,720 | 203,616 | 95.88 | 68.26 | |
| k = 20, β = 3 | 76,778 | 202,289 | 95.32 | 68.03 | |
| k = 10, β = 5 | 77,445 | 213,447 | 96.92 | 67.96 | |
| k = 15, β = 5 | 79,746 | 217,162 | 94.56 | 68.81 | |
| k = 20, β = 5 | 78,844 | 209,452 | 95.48 | 67.82 | |
| Number | Network | mIoU (%) | F1-Pot (%) | F1-Crack (%) | Accuracy (%) |
|---|---|---|---|---|---|
| ① | Baseline | 71.30 | 82.75 | 66.32 | 94.47 |
| ② | +Geo-Sem DSFF | 75.03 | 85.72 | 71.11 | 94.54 |
| ③ | +Geo-Sem DSFF + GradientConv | 76.89 | 87.19 | 73.79 | 95.27 |
| ④ | +Geo-Sem DSFF + GradientConv + Edgeconv | 77.62 | 88.51 | 74.08 | 95.06 |
| ⑤ | +Geo-Sem DSFF + GradientConv + Edgeconv + Global interaction | 78.45 | 88.93 | 74.20 | 95.43 |
| (a) Comparative tests without long-tail class imbalance mitigation strategy | ||||||
| Network | mIoU (%) | F1-Pot (%) | F1-Crack (%) | Accuracy (%) | ||
| PointNet | 64.25 | 74.15 | 52.01 | 93.52 | ||
| PointNet++ | 66.89 | 76.62 | 62.15 | 93.99 | ||
| PointNeXt | 69.60 | 78.98 | 66.06 | 94.49 | ||
| PointTransformer | 70.49 | 80.39 | 66.65 | 94.61 | ||
| PointMamba | 72.27 | 84.76 | 68.10 | 94.51 | ||
| PointPaveSeg | 73.64 | 83.03 | 71.51 | 94.64 | ||
| (b) Comparative tests with long-tail class imbalance mitigation strategy | ||||||
| Network | mIoU (%) | F1-Pot (%) | F1-Crack (%) | Accuracy (%) | Parameters (M) | Inference Time (s/item) |
| PointNet | 66.41 | 80.49 | 54.17 | 94.95 | 3.625 | 1.04 |
| PointNet++ | 71.47 | 82.12 | 67.26 | 94.70 | 1.744 | 0.87 |
| PointNeXt | 74.65 | 85.13 | 69.38 | 94.66 | 1.807 | 0.94 |
| PointTransformer | 73.91 | 85.72 | 68.43 | 95.09 | 3.094 | 3.26 |
| PointMamba | 77.46 | 89.74 | 70.22 | 95.65 | 12.558 | 2.06 |
| PointPaveSeg | 78.45 | 88.93 | 74.20 | 95.43 | 3.880 | 1.11 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cheng, P.; Yi, J.; Pei, Z.; Liu, Z.; Jiang, D.; Abdukadir, A. Long-Tail Learning for Three-Dimensional Pavement Distress Segmentation Using Point Clouds Reconstructed from a Consumer Camera. Remote Sens. 2026, 18, 1008. https://doi.org/10.3390/rs18071008
Cheng P, Yi J, Pei Z, Liu Z, Jiang D, Abdukadir A. Long-Tail Learning for Three-Dimensional Pavement Distress Segmentation Using Point Clouds Reconstructed from a Consumer Camera. Remote Sensing. 2026; 18(7):1008. https://doi.org/10.3390/rs18071008
Chicago/Turabian StyleCheng, Pengjian, Junyan Yi, Zhongshi Pei, Zengxin Liu, Dayong Jiang, and Abduhaibir Abdukadir. 2026. "Long-Tail Learning for Three-Dimensional Pavement Distress Segmentation Using Point Clouds Reconstructed from a Consumer Camera" Remote Sensing 18, no. 7: 1008. https://doi.org/10.3390/rs18071008
APA StyleCheng, P., Yi, J., Pei, Z., Liu, Z., Jiang, D., & Abdukadir, A. (2026). Long-Tail Learning for Three-Dimensional Pavement Distress Segmentation Using Point Clouds Reconstructed from a Consumer Camera. Remote Sensing, 18(7), 1008. https://doi.org/10.3390/rs18071008

