An Improved YOLOv8 Model for Pavement Distress Detection Under Low-Computing-Power Conditions
Highlights
- The proposed model shows advantages over 10 SOTA models in pavement detection.
- Integrates LSKA, DIoU loss, and Soft-NMS into YOLOv8n, achieving 78.3% mAP@0.5.
- Pothole detection AP↑22.1%, strip patch detection AP↑17.7%.
- Maintains 160 FPS (GPU) and 68 FPS (low-cost CPU laptop).
Abstract
1. Introduction
1.1. Background
1.2. Related Works
2. Methods
2.1. YOLOv8
2.2. Improved YOLOv8
2.3. LSKA Mechanism
2.4. Loss Function
2.5. Soft-Non-Maximal Suppression (Soft-NMS)
3. Experiment
3.1. Dataset
3.2. Experiment Setting and Evaluation Metrics
4. Results and Discussion
4.1. Evaluation of Testing Results
4.2. Ablation Test Analysis
4.3. Comparison with Different Model Frameworks
4.4. CPU-Based Model Deployment Inference Experiments
5. Conclusions
- (1)
- The improved model achieves 78.3% mAP@0.5 (+8.1%) and 49.0% mAP@0.5:0.95 (+7.1%) compared to the baseline YOLOv8n, accompanied by a 5% improvement in F1-score. Notably, AP gains of 22.1% for potholes and 17.7% for strip patches validate the model’s enhanced adaptability to small target detection and complex background environments.
- (2)
- Ablation studies quantify the contribution of each module: the LSKA mechanism serves as the primary driver for performance gains (+3.8% mAP@0.5) by suppressing background noise; Soft-NMS significantly reduces false negatives (+3.3% mAP@0.5); and the DIoU loss optimizes bounding box regression accuracy.
- (3)
- While the model experiences marginal increases in computational complexity (FLOPs: 8.3 G, +0.2 G) and parameter size (6.8 M, +0.5 M), it sustains high real-time efficiency, maintaining 160 FPS on GPU and 68 FPS on CPU. This confirms that the proposed model achieves a superior balance between detection precision and computational cost compared to the baseline.
- (4)
- Extensive benchmarking against state-of-the-art models—including YOLOv3-tiny, YOLOv5n, YOLOv6n, YOLOv7-tiny, YOLOv8n, YOLOv8s, YOLOv9-c, YOLOv13n, YOLOv26n, and RTDETR-l—confirms the method’s effectiveness in balancing accuracy and speed.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Al-Sabaeei, A.M.; Souliman, M.I.; Jagadeesh, A. Smartphone applications for pavement condition monitoring: A review. Constr. Build. Mater. 2024, 410, 134207. [Google Scholar] [CrossRef]
- Chen, C.; Zheng, Z.; Xu, T.; Guo, S.; Feng, S.; Yao, W.; Lan, Y. YOLO-Based UAV Technology: A Review of the Research and Its Applications. Drones 2023, 7, 190. [Google Scholar] [CrossRef]
- El Hakea, A.H.; Fakhr, M.W. Recent computer vision applications for pavement distress and condition assessment. Autom. Constr. 2023, 146, 104664. [Google Scholar] [CrossRef]
- Ren, M.; Zhang, X.; Chen, X.; Zhou, B.; Feng, Z. YOLOv5s-M: A deep learning network model for road pavement damage detection from urban street-view imagery. Int. J. Appl. Earth Obs. Geoinf. 2023, 120, 103335. [Google Scholar] [CrossRef]
- Wan, F.; Sun, C.; He, H.; Lei, G.; Xu, L.; Xiao, T. YOLO-LRDD: A lightweight method for road damage detection based on improved YOLOv5s. EURASIP J. Adv. Signal Process. 2022, 2022, 98. [Google Scholar] [CrossRef]
- Ning, Z.; Wang, H.; Li, S.; Xu, Z. YOLOv7-RDD: A Lightweight Efficient Pavement Distress Detection Model. IEEE Trans. Intell. Transp. Syst. 2024, 25, 6994–7003. [Google Scholar] [CrossRef]
- Zhu, J.; Zhong, J.; Ma, T.; Huang, X.; Zhang, W.; Zhou, Y. Pavement distress detection using convolutional neural networks with images captured via UAV. Autom. Constr. 2022, 133, 103991. [Google Scholar] [CrossRef]
- Greenwood, W.W.; Lynch, J.P.; Zekkos, D. Applications of UAVs in Civil Infrastructure. J. Infrastruct. Syst. 2019, 25, 04019002. [Google Scholar] [CrossRef]
- Alonso, P.; de Gordoa, J.A.I.; Ortega, J.D.; García, S.; Iriarte, F.J.; Nieto, M. Automatic UAV-based airport pavement inspection using mixed real and virtual scenarios. In Proceedings of the SPIE 12701, Fifteenth International Conference on Machine Vision (ICMV 2022), Rome, Italy, 18–20 November 2022; p. 1270118. [Google Scholar] [CrossRef]
- Amieghemen, G.E.; Sherif, M.M. Deep convolutional neural network ensemble for pavement crack detection using high elevation UAV images. Struct. Infrastruct. Eng. 2023, 21, 1008–1023. [Google Scholar] [CrossRef]
- Ma, D.; Fang, H.; Wang, N.; Zhang, C.; Dong, J.; Hu, H. Automatic Detection and Counting System for Pavement Cracks Based on PCGAN and YOLO-MF. IEEE Trans. Intell. Transp. Syst. 2022, 23, 22166–22178. [Google Scholar] [CrossRef]
- He, X.; Tang, Z.; Deng, Y.; Zhou, G.; Wang, Y.; Li, L. UAV-based road crack object-detection algorithm. Autom. Constr. 2023, 154, 105014. [Google Scholar] [CrossRef]
- Zhang, Y.; Zuo, Z.; Xu, X.; Wu, J.; Zhu, J.; Zhang, H.; Wang, J.; Tian, Y. Road damage detection using UAV images based on multi-level attention mechanism. Autom. Constr. 2022, 144, 104613. [Google Scholar] [CrossRef]
- Wang, W.; Xu, X.; Peng, J.; Hu, W.; Wu, D. Fine-Grained Detection of Pavement Distress Based on Integrated Data Using Digital Twin. Appl. Sci. 2023, 13, 4549. [Google Scholar] [CrossRef]
- Zou, Z.; Chen, K.; Shi, Z.; Guo, Y.; Ye, J. Object detection in 20 years: A survey. Proc. IEEE 2023, 111, 257–276. [Google Scholar] [CrossRef]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 2014, 2014, 580–587. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. NIPS 2016, 39, 1137–1149. [Google Scholar] [CrossRef]
- Liu, C.; Li, J.; Gao, J.; Gao, Z.; Chen, Z. Combination of pixel-wise and region-based deep learning for pavement inspection and segmentation. Int. J. Pavement Eng. 2022, 23, 3011–3023. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar] [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. SSD: Single shot multibox detector. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Proceedings, Part I 14, 2016. pp. 21–37. [Google Scholar] [CrossRef]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-end object detection with transformers. In European Conference on Computer Vision; Springer International Publishing: Cham, Switzerland, 2020; pp. 213–229. [Google Scholar] [CrossRef]
- Dai, Z.; Cai, B.; Lin, Y.; Chen, J. Unsupervised pre-training for detection transformers. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 12772–12782. [Google Scholar] [CrossRef] [PubMed]
- Sun, C.; Ai, Y.; Qi, X.; Wang, S.; Zhang, W. A single-shot model for traffic-related pedestrian detection. Pattern Anal. Appl. 2022, 25, 853–865. [Google Scholar] [CrossRef]
- Lin, Z.; Wang, H.; Li, S. Pavement anomaly detection based on transformer and self-supervised learning. Autom. Constr. 2022, 143, 104544. [Google Scholar] [CrossRef]
- Du, Y.; Pan, N.; Xu, Z.; Deng, F.; Shen, Y.; Kang, H. Pavement distress detection and classification based on YOLO network. Int. J. Pavement Eng. 2021, 22, 1659–1672. [Google Scholar] [CrossRef]
- Manjusha, M.; Sunitha, V. A review of advanced pavement distress evaluation techniques using unmanned aerial vehicles. Int. J. Pavement Eng. 2023, 24, 2268796. [Google Scholar] [CrossRef]
- Lei, X.; Liu, C.; Li, L.; Wang, G. Automated Pavement Distress Detection and Deterioration Analysis Using Street View Map. IEEE Access 2020, 8, 76163–76172. [Google Scholar] [CrossRef]
- Lee, J.; Hwang, K. YOLO with adaptive frame control for real-time object detection applications. Multimed. Tools Appl. 2022, 81, 36375–36396. [Google Scholar] [CrossRef]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. 2023, 2023, 7464–7475. [Google Scholar] [CrossRef]
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 2017, 2017, 2117–2125. [Google Scholar] [CrossRef]
- Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path aggregation network for instance segmentation. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 2018, 2018, 8759–8768. [Google Scholar] [CrossRef]
- Lau, K.W.; Po, L.M.; Rehman, Y.A.U. Large separable kernel attention: Rethinking the large kernel attention design in cnn. Expert Syst. With Appl. 2024, 236, 121352. [Google Scholar] [CrossRef]
- Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU loss: Faster and better learning for bounding box regression. Proc. AAAI Conf. Artif. Intell. 2020, 34, 12993–13000. [Google Scholar] [CrossRef]
- Bodla, N.; Singh, B.; Chellappa, R.; Davis, L.S. Soft-NMS–Improving object detection with one line of code. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 5561–5569. [Google Scholar] [CrossRef]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Lucas, B.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; et al. An image is worth 16x16 words: Transformers for image recognition at scale. In Proceedings of the ICLR 2021, Virtual, 3–7 May 2021. [Google Scholar]
- Arya, D.; Maeda, H.; Ghosh, S.K. RDD2022: A multi-national image dataset for automatic road damage detection. Geosci. Data J. 2024, 11, 846–862. [Google Scholar] [CrossRef]
- Ren, M.; Zhang, X.; Zhi, X.; Wei, Y.; Feng, Z. An annotated street view image dataset for automated road damage detection. Sci. Data 2024, 11, 407. [Google Scholar] [CrossRef]











| Mechanism | Focus | Receptive Field | Complexity | Suitability for YOLOv8n |
|---|---|---|---|---|
| SE [35] | Channel | Global (Pool) | Low | Limited (ignores spatial structure) |
| CBAM [36] | Channel + Spatial | Local (7 × 7) | Moderate | Sub-optimal for elongated cracks |
| Transformer [37] | Spatial | Global (MHSA) | High (O(N2)) | Too heavy for real-time edge devices |
| LSKA (Ours) | Spatial | Global | Low (Linear) | Optimal (Efficiency + Receptive Field) |
| Model | mAP@0.5 (%) | mAP@0.5:0.95 (%) | F1 (%) | Params (M) | FLOPs (G) | Latency (ms) | FPS | Model Size (M) |
|---|---|---|---|---|---|---|---|---|
| YOLOv8n | 68.7 ± 1.5 | 40.8 ± 1.1 | 67.7 ± 1.3 | 3.007 | 8.1 | 6.15 ±0.31 | 162.6 | 6.3 |
| Ours (Improved YOLOv8n) | 76.7 ± 1.3 | 48.3 ± 0.7 | 72.9 ± 1.1 | 3.28 | 8.3 | 6.24 ± 0.11 | 160.3 | 6.8 |
| Model | AP@0.5 (%) | |||||||
|---|---|---|---|---|---|---|---|---|
| Longitude Cracks | Transverse Cracks | Net Crack | Pothole | Zebra Crossing Loss | Manhole | STRIP Patch | Block Patch | |
| YOLOv8n | 64.2 | 62 | 80.9 | 53.4 | 93.5 | 85.8 | 52.1 | 69.5 |
| Ours | 76.5 | 67 | 81.5 | 75.5 | 93 | 87.4 | 69.8 | 75.4 |
| Increase | +12.3 | +5 | +0.6 | +22.1 | −0.5 | +1.6 | +17.7 | +5.9 |
| Distress type | Images | Instances | YOLOv8 AP50 | YOLOv8 mAP50–95 | Ours AP50 | Ours AP50–95 |
|---|---|---|---|---|---|---|
| longitudinal_crack | 1000 | 567 | 0.451 | 0.248 | 0.471 | 0.259 |
| transverse_crack | 1000 | 473 | 0.462 | 0.238 | 0.476 | 0.254 |
| alligator_crack | 1000 | 260 | 0.508 | 0.290 | 0.545 | 0.301 |
| pothole | 1000 | 131 | 0.387 | 0.126 | 0.362 | 0.161 |
| manhole_cover | 1000 | 452 | 0.666 | 0.379 | 0.672 | 0.388 |
| longitudinal_patch | 1000 | 961 | 0.600 | 0.366 | 0.593 | 0.351 |
| transverse_patch | 1000 | 406 | 0.471 | 0.237 | 0.484 | 0.245 |
| Average (Mean) | - | - | 0.506 | 0.269 | 0.515 | 0.280 |
| Model No. | Modules | mAP@0.5 (%) | mAP@0.5:0.95 (%) | F1 (%) | FLOPs (G) | ||
|---|---|---|---|---|---|---|---|
| LSKA | DIoU | Soft-NMS | |||||
| 1 | × | × | × | 70.2 | 41.9 | 69 | 8.1 |
| 2 | √ | × | × | 74 | 43.2 | 74 | 8.3 |
| 3 | × | √ | × | 72.2 | 42 | 70 | 8.1 |
| 4 | × | × | √ | 73.5 | 46.4 | 70 | 8.1 |
| 5 | × | √ | √ | 76.8 | 47.9 | 72 | 8.1 |
| 6 (Our model) | √ | √ | √ | 78.3 | 49 | 74 | 8.3 |
| Model | mAP@0.5 (%) | mAP@0.5:0.95 (%) | F1 (%) | Params (M) | FLOPs (G) | Latency (ms) | FPS | TT (h) | Model Size (M) |
|---|---|---|---|---|---|---|---|---|---|
| RTDETR-l | 62.7 | 35.6 | 60 | 32 | 103.5 | 19.07 ± 0.47 | 52.4 | 2.425 | 66.2 |
| YOLOv3-tiny | 63.2 | 33.1 | 62 | 12.13 | 18.9 | 2.63 ± 0.04 | 379.8 | 0.421 | 24.4 |
| YOLOv5n | 71.4 | 41.2 | 72 | 2.50 | 7.1 | 6.87 ± 0.07 | 145.5 | 0.446 | 5.3 |
| YOLOv6n | 66 | 37.6 | 63 | 4.23 | 11.8 | 6.02 ± 0.12 | 166.1 | 0.374 | 8.7 |
| YOLOv7-tiny | 64 | 35.7 | 62 | 6.03 | 13.1 | 6.60 ± 0.39 | 151.4 | 0.772 | 12.3 |
| YOLOv8n | 70.2 | 41.9 | 69 | 3.01 | 8.1 | 6.15 ± 0.31 | 162.6 | 0.396 | 6.3 |
| YOLOv13n | 57.5 | 30.4 | 57 | 2.83 | 6.4 | / | / | / | 5.7 |
| YOLOv26n | 69.2 | 43.8 | 70 | 2.38 | 5.2 | / | / | / | 4.8 |
| YOLOv8s | 75.4 | 45.6 | 73 | 11.13 | 28.5 | 6.91 ± 0.13 | 144.7 | 0.547 | 22.5 |
| YOLOv9-c | 76.3 | 48.7 | 75 | 50.71 | 236.7 | 28.72 ± 0.67 | 34.8 | / | 98.1 |
| Ours | 78.3 | 49 | 74 | 3.28 | 8.3 | 6.24 ± 0.11 | 160.3 | 0.423 | 6.8 |
| Model | Latency (ms) | FPS | Model Size (M) | |||
|---|---|---|---|---|---|---|
| Median | Average | Min | Max | |||
| YOLOv3-tiny | 5601.65 | 7040.15 | 900.34 | 32,692.22 | 17.95 | 46.3 |
| YOLOv5n | 497.55 | 891.18 | 14.02 | 9495.56 | 71.49 | 9.7 |
| YOLOv6n | 507.85 | 1894.55 | 115.76 | 9597.07 | 33.66 | 16.3 |
| YOLOv8n | 596.57 | 989.59 | 200.72 | 7291.52 | 64.4 | 11.7 |
| YOLOv8s | 504.96 | 1520.58 | 105.48 | 18,200.6 | 41.9 | 42.6 |
| Our model | 506.57 | 934.76 | 114 | 8701.4 | 68.07 | 12.7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tang, Y.; Yang, Z.; Xu, Z.; Zhou, Y.; Wang, H. An Improved YOLOv8 Model for Pavement Distress Detection Under Low-Computing-Power Conditions. Sensors 2026, 26, 3373. https://doi.org/10.3390/s26113373
Tang Y, Yang Z, Xu Z, Zhou Y, Wang H. An Improved YOLOv8 Model for Pavement Distress Detection Under Low-Computing-Power Conditions. Sensors. 2026; 26(11):3373. https://doi.org/10.3390/s26113373
Chicago/Turabian StyleTang, Yi, Ziyi Yang, Zhoucong Xu, You Zhou, and Hui Wang. 2026. "An Improved YOLOv8 Model for Pavement Distress Detection Under Low-Computing-Power Conditions" Sensors 26, no. 11: 3373. https://doi.org/10.3390/s26113373
APA StyleTang, Y., Yang, Z., Xu, Z., Zhou, Y., & Wang, H. (2026). An Improved YOLOv8 Model for Pavement Distress Detection Under Low-Computing-Power Conditions. Sensors, 26(11), 3373. https://doi.org/10.3390/s26113373

