Application of Improved YOLOv8 Image Model in Urban Manhole Cover Defect Management and Detection: Case Study
Abstract
1. Introduction
2. Related Work
3. Urban Manhole Cover Intelligent Operation and Maintenance Management System
3.1. The Dilemma of Traditional Management Modes
3.2. Intelligent Management System Architecture
4. The Proposed Method
4.1. YOLOv8 Network Architecture
4.2. The Improved YOLOv8 Manhole Cover Defect Detection Model
4.2.1. Lightweight Backbone Network
4.2.2. Attention Mechanism
4.2.3. Feature Fusion Optimization
4.3. Loss Function Design
5. Experimental Design
5.1. Dataset Construction
5.1.1. Data Acquisition
5.1.2. Data Analysis
5.1.3. Data Augmentation
5.2. Experimental Setup
5.2.1. Experimental Environment Configuration
5.2.2. Evaluation Criteria for Experiments
5.3. Results and Analysis
5.3.1. Performance Assessment
5.3.2. Ablation Study
5.3.3. Comparison of Experiments with Different Models
6. Case Study
- Inspection module: This includes the setting of daily inspection routes, task allocation, and inspection personnel information. Real-time inspection personnel positions can be seen in the system. Inspection personnel upload manhole cover images to the system by taking photos. The inspection routes are updated on a weekly basis based on last week’s manhole cover defects and repair locations.
- Manhole cover defect recognition module: Embedded with a smart recognition system for manhole cover images, this classifies and recognizes defects in manhole cover images uploaded during inspections, filters out and reports pictures with defects, and regularly uploads manhole cover images uploaded during inspections to the manhole cover health image database.
- Maintenance work order distribution module: This reviews the accuracy of identified defect images, determines the defect repair plan, and determines the work order distribution time and maintenance cycle based on the urgency of the manhole cover defect.
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhong, Z.; Sun, L.; Huo, Q. An anchor-free region proposal network for Faster R-CNN-based text detection approaches. Int. J. Doc. Anal. Recognit. 2019, 22, 315–327. [Google Scholar] [CrossRef]
- Wu, X.; Luo, C.; Zhang, Q.; Zhou, J.; Yang, H.; Li, Y. Text detection and recognition for natural scene images using deep convolutional neural networks. Comput. Mater. Contin. 2019, 61, 289–300. [Google Scholar] [CrossRef]
- Hinton, G.E.; Salakhutdinov, R.R. Reducing the dimesionality of data with neural networks. Science 2006, 313, 504–507. [Google Scholar] [CrossRef]
- Chen, R.B.; Hou, W.; Li, Y.; Ye, X.E.; Ma, W.Y. Recognition of lanes on complex roads based on video image. Adv. Mater. Res. 2014, 950, 298–305. [Google Scholar] [CrossRef]
- Wei, H.; Tan, J.L. Detection of incomplete ellipse in images with strong noise by iterative randomized Hough transform (IRHT). Pattern Recognit. 2007, 41, 1268–1279. [Google Scholar]
- Kong, T.; Dai, J. Disease detection of pavement manhole cover based on improved YOLOv5. Remote Sens. Inf. 2023, 38, 40–46. [Google Scholar]
- Chen, J.; Zou, C. Research on rapid detection method of pavement defects by improving YOLOv5. Electron. Meas. Technol. 2023, 46, 129–135. [Google Scholar]
- Cheng, Y.; Xiong, Z.; Wang, Y.-H. Improved classical hough transform applied to the manhole cover s detection and location. Opt. Tech. 2006, 32, 504–508. [Google Scholar]
- Huang, W.; Zhao, H. VRD-YOLO: Real-time vehicle-mounted-images road damage detection based on improved YOLOv5s. J. Nanjing Univ. Inf. Sci. Technol. 2024, 1–21. [Google Scholar]
- Xu, M.; Han, Y. Research on high-precision manhole cover extraction and settlement disease detection method based on laser point cloud. Chin. J. Lasers 2021, 48, 63–77. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal. Mach. Intell. 2015, 37, 1904–1916. [Google Scholar] [CrossRef]
- Pang, J.; Li, C.; Shi, J.; Xu, Z.; Feng, H. R2-CNN: Fast tiny object detection in large-scale remote sensing images. IEEE Trans Geosci. Remote Sens. 2019, 57, 5512–5524. [Google Scholar] [CrossRef]
- Binelo, M.O.; de Almeida, A.L.F.; Cavalcanti, F.R.P. MIMO array capacity optimization using a genetic algorithm. IEEE Trans Veh. Technol. 2011, 60, 2471–2481. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- Zou, Q.; Zhang, Z.; Li, Q.; Qi, X.; Wang, Q.; Wang, S. Deep Crack: Learning hierarchical convolutional features for crack detection. IEEE Trans. Image Process. 2019, 28, 1498–1512. [Google Scholar] [CrossRef]
- Hussain, M. YOLO-v1 to YOLO-v8, the Rise of YOLO and Its Complementary Nature toward Digital Manufacturing and Industrial Defect Detection. Machines 2023, 11, 677. [Google Scholar] [CrossRef]
- Xu, D.; Wu, Y. Improved YOLO-V3 with DenseNet for Multi-Scale Remote Sensing Target Detection. Sensors 2020, 20, 4276. [Google Scholar] [CrossRef]
- Adibhatla, V.A.; Chih, H.-C.; Hsu, C.-C.; Cheng, J.; Abbod, M.F.; Shieh, J.-S. Applying deep learning to defect detection in printed circuit boards via a newest model of you-only-look-once. Math. Biosci. Eng. 2021, 18, 4411–4428. [Google Scholar] [CrossRef]
- Zhou, X.; Yi, J.; Xie, G.; Jia, Y.; Xu, G.; Sun, M. Human detection algorithm based on improved YOLO v4. Inf. Technol. Control. 2022, 51, 485–498. [Google Scholar] [CrossRef]
- Yu, Q.; Han, Y.; Gao, X.; Lin, W.; Han, Y. Comparative Analysis of Improved YOLO v5 Models for Corrosion Detection in Coastal Environments. J. Mar. Sci. Eng. 2024, 12, 1754. [Google Scholar] [CrossRef]
- Mao, M.; Hong, M. YOLO object detection for real-time fabric defect inspection in the textile industry: A review of YOLOv1 to YOLOv11. Sensors 2025, 25, 2270. [Google Scholar] [CrossRef] [PubMed]
- Dang, F.; Chen, D.; Lu, Y.; Li, Z. HA-FPN: Hierarchical Attention Feature Pyramid Network for Object Detection. Sensors 2023, 23, 4508. [Google Scholar] [CrossRef] [PubMed]
- Qiao, S.; Chen, L.-C.; Yuille, A. DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 10213–10224. [Google Scholar]
- Dai, X.; Chen, Y.; Xiao, B.; Chen, D.; Liu, M.; Yuan, L.; Zhang, L. Dynamic Head: Unifying Object Detection Heads with Attentions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 7373–7382. [Google Scholar]
- Tan, M.; Pang, R.; Le, Q.V. EfficientDet: Scalable and Efficient Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 10781–10790. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Liu, K.; Li, A.; Wen, X.; Chen, H.; Yang, P. An Adaptive Image Segmentation Network for Surface Defect Detection. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 6062–6074. [Google Scholar] [CrossRef]
- Guan, H.; Li, J.; Yu, Y.; Wang, C.; Chapman, M.; Yang, B. Real-time detection of road manhole covers with a deep learning model. Sci. Rep. 2023, 13, 16730. [Google Scholar] [CrossRef]
- Li, H.; Song, D.; Liu, Y.; Li, B. Automatic Pavement Crack Detection by Multi-Scale Image Fusion. IEEE Trans. Intell. Transp. Syst. 2019, 20, 2025–2036. [Google Scholar] [CrossRef]
- Tran, T.S.; Tran, V.P.; Lee, H.J.; Flores, J.M.; Le, V.P. A two-step sequential automated crack detection and severity classification process for asphalt pavements. Int. J. Pavement Eng. 2022, 23, 2019–2033. [Google Scholar] [CrossRef]
- Wang, W.; Wang, M.; Li, H.; Zhao, H.; Wang, K.; He, C.; Wang, J.; Zheng, S.; Chen, J. Pavement crack image acquisition methods and crack extraction algorithms: A review. J. Traffic Transp. Eng. (Engl. Ed.) 2019, 6, 535–556. [Google Scholar] [CrossRef]
- Wu, Z.Y.; Kalfarisi, R.; Kouyoumdjian, F.; Taelman, C. Applying deep convolutional neural network with 3D reality mesh model for water tank crack detection and evaluation. Urban Water J. 2020, 17, 682–695. [Google Scholar] [CrossRef]
- Yu, Y.; Li, J.; Guan, H.; Wang, C.; Yu, J. Automated Detection of Road Manhole and Sewer Well Covers From Mobile LiDAR Point Clouds. IEEE Geosci. Remote Sens. Lett. 2014, 11, 1549–1553. [Google Scholar]
- Yu, Y. Automated extraction of manhole covers using mobile lidar data. Remote Sens. Lett. 2014, 5, 1042–1050. [Google Scholar]
- Yang, M.M.; Wan, Y.C.; Liu, X.L.; Xu, J.Z.; Chen, M.L.; Sheng, P. Laser data based automatic recognition and maintenance of road markings from MLS system. Opt. Laser Technol. 2018, 107, 192–203. [Google Scholar] [CrossRef]
- Liu, J.H. Research on algorithm of automatically recognizing and positioning road manhole covers based on vehicle-mounted sensors. Appl. Res. Comput. 2011, 28, 3138–3140. [Google Scholar]
- Yang, B.; Fang, L.; Li, J. Semi-automated extraction and delineation of 3D roads of street scene from mobile laser scanning point clouds. ISPRS J. Photogram. Remote Sens. 2013, 79, 80–93. [Google Scholar] [CrossRef]
- Xu, Z.S.; Lei, X.D.; Guan, H.Y. Multi-scale local feature enhanced transformer network for pavement crack detection. J. Image Graph. 2023, 28, 1019–1028. [Google Scholar] [CrossRef]
- LI, L.F.; Xue, B. Detection method of manhole cover theft based on improved DTW algorithm. Audio Eng. 2022, 46, 111–116. [Google Scholar]
- Gong, Z.; Liang, L.; Yang, Z.; You, J.; Cai, Y.; Liu, H. A detection method of stealing and destroying manhole cover based on sound signal processing. Mach. Tool Hydraul. 2019, 47, 39–43. [Google Scholar]
- Canny, J. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 1986, 8, 679–698. [Google Scholar] [CrossRef] [PubMed]
- Hu, F.; Xia, G.-S.; Hu, J.; Zhang, L. Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sens. 2015, 7, 14680–14707. [Google Scholar] [CrossRef]
- Zou, W.Y.; Wang, X.; Sun, M.; Lin, Y. Generic object detection with dense neural patterns and regionlets. arXiv 2014, arXiv:1404.4316. [Google Scholar]
- Donahue, J.; Jia, Y.; Vinyals, O.; Hoffman, J.; Zhang, N.; Tzeng, E.; Darrell, T. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. In Proceedings of the 31st International conference on machine learning, Beijing, China, 21–26 June 2014; Curran Associates: Red Hook, NY, USA, 2014; pp. 988–996. [Google Scholar]
- Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
- Sun, L.; Huo, Q.; Jia, W.; Chen, K. A robust approach for text detection from natural scene images. Pattern Recognit. J. Pattern Recognit. Soc. 2015, 48, 2906–2920. [Google Scholar] [CrossRef]
- Wang, P.; Li, Q.; Gong, W. Wavelet-Based Pavement Distress Image Edge Detection with À Trous Algorithm. Transp. Res. Rec. 2007, 2024, 73–81. [Google Scholar] [CrossRef]
- Subirats, P.; Dumoulin, J.; Legeay, V.; Barba, D. Automation of Pavement Surface Crack Detection using the Continuous Wavelet Transform. In Proceedings of the 2006 IEEE International Conference on Image Processing, Atlanta, GA, USA, 8–11 October 2006; Institute of Electrical and Electronics Engineers: Piscataway, NJ, USA, 2006; pp. 3037–3040. [Google Scholar]
- Tanaka, N.; Mouri, M. A Detection Method of Cracks and Structural Objects of the RoadSurface Image. In Proceedings of the IAPR Workshop on Machine Vision Applications (MVA2000), Tokyo, Japan, 28–30 November 2000; pp. 387–390. [Google Scholar]
Environment Item | Configuration |
---|---|
CPU | Intel Core i9-13900K |
GPU | NVIDIA RTX 4090 |
Memory | 64 GB DDR5 |
Operating System | Ubuntu 20.04 LTS |
Python Version | Python 3.11.5 |
CUDA Version | CUDA 12.1 |
Pytorch Version | Pytorch 2.1 |
Item | Configuration |
---|---|
Optimizer | AdamW |
Learning Rate | 0.001 |
Batch Size | 64 |
Training Epochs | 200 |
Early Stopping Patience | 100 |
Image Dimensions | 512 × 512 |
Model | YOLOv8 | EfficientNetV2 | Outlook Attention | NAS-FPN | Accuracy | Precision | Recall | F1-Score | Inference Time on RTX 4090 (ms) |
---|---|---|---|---|---|---|---|---|---|
YOLOv8-Base | √ | 0.951 | 0.9609 | 0.9405 | 0.9506 | 8.5 | |||
YOLOv8-OAN | √ | √ | √ | 0.962 | 0.97 | 0.954 | 0.973 | 12.8 | |
YOLOv8-EN | √ | √ | √ | 0.967 | 0.963 | 0.954 | 0.965 | 7.2 | |
YOLOv8-EA | √ | √ | √ | 0.965 | 0.975 | 0.96 | 0.963 | 8.9 | |
YOLOv8-EAN | √ | √ | √ | √ | 0.9812 | 0.98796 | 0.97095 | 0.9794 | 9.8 |
Model | Accuracy | Precision | Recall | F1-Score | Inference Time on RTX 4090 (ms) |
---|---|---|---|---|---|
YOLOv8-Base | 0.951 | 0.9609 | 0.9405 | 0.9506 | 8.5 |
YOLOv8-ResNet50 | 0.923 | 0.925 | 0.900 | 0.912 | 15.2 |
YOLOv8-MobileNetV3 | 0.895 | 0.898 | 0.883 | 0.890 | 4.8 |
YOLOv8-DenseNet121 | 0.908 | 0.910 | 0.895 | 0.902 | 11.5 |
YOLOv8-EfficientNetV2 | 0.965 | 0.962 | 0.951 | 0.957 | 7.3 |
Model | mAP@0.5/% | mAP@[0.5–0.95]/% | F1-Score/% | Params/M | GFLOPs | Inference Time on RTX 4090 (ms) |
---|---|---|---|---|---|---|
MobileNet | 91.36 | 90.7 | 91.56 | 4.23 | 5.8 | 8.2 |
Faster R-CNN | 87.87 | 86.5 | 87.24 | 34.5 | 120 | 125.8 |
EfficientNet | 91.27 | 90.85 | 91.57 | 7.8 | 4.0 | 12.5 |
RetinaNet | 85.68 | 85 | 85.74 | 41.2 | 100 | 115.4 |
Improved YOLOv8 | 98.12 | 97.10 | 97.94 | 2.12 | 7.3 | 9.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ding, Y.; Han, B.; Jiang, H.; Hu, H.; Xue, L.; Weng, J.; Tang, Z.; Liu, Y. Application of Improved YOLOv8 Image Model in Urban Manhole Cover Defect Management and Detection: Case Study. Sensors 2025, 25, 4144. https://doi.org/10.3390/s25134144
Ding Y, Han B, Jiang H, Hu H, Xue L, Weng J, Tang Z, Liu Y. Application of Improved YOLOv8 Image Model in Urban Manhole Cover Defect Management and Detection: Case Study. Sensors. 2025; 25(13):4144. https://doi.org/10.3390/s25134144
Chicago/Turabian StyleDing, Yanqiong, Baojiang Han, Hua Jiang, Hao Hu, Lei Xue, Jiasen Weng, Zhili Tang, and Yuzhang Liu. 2025. "Application of Improved YOLOv8 Image Model in Urban Manhole Cover Defect Management and Detection: Case Study" Sensors 25, no. 13: 4144. https://doi.org/10.3390/s25134144
APA StyleDing, Y., Han, B., Jiang, H., Hu, H., Xue, L., Weng, J., Tang, Z., & Liu, Y. (2025). Application of Improved YOLOv8 Image Model in Urban Manhole Cover Defect Management and Detection: Case Study. Sensors, 25(13), 4144. https://doi.org/10.3390/s25134144