A Contrast-Enhanced Feature Reconstruction for Fixed PTZ Camera-Based Crack Recognition in Expressways
Abstract
1. Introduction
- Deep learning-based models typically excel at recognizing objects with well-defined shapes and structures, whereas cracks exhibit random and indistinct morphologies.
- Varying shooting angles and lighting conditions introduce randomly distributed road texture shadows in the images, causing visual confusion between cracks and the background (Figure 2a).
- In early-stage pavement cracking, discontinuous crack structures lead to ambiguous boundaries and increased misclassification (Figure 2b).
2. Related Works
2.1. Review of Crack Pre-Processing
2.1.1. Crack Pre-Processing Based on Threshold Segmentation
2.1.2. Crack Pre-Processing Based on Transform Domains
2.1.3. Crack Pre-Processing Based on Filtering Technique
2.1.4. Other Crack Pre-Processing Methods
2.2. Review of Crack Recognition Methods
2.2.1. Challenges in Pavement Crack Recognition Using Fixed PTZ Vision
2.2.2. Existing Approaches and Their Limitations
- Low Visibility of Cracks: crack pixels constitute only a small percentage of the overall image, rendering them difficult to distinguish from the background.
- Complex and Random Background Textures: cracks are often obscured by pavement textures, reducing detection effectiveness.
2.2.3. Contrast-Enhanced Feature Reconstruction
3. Methodology
3.1. Comparison-Based Pixel Transformation of CEFR
3.2. Nonlinear Stretching of CEFR
3.3. Generation of CEFR Feature Salient Map
3.3.1. Feature Reconstruction Strategy
- Feature Embedding: crack features from are embedded into the original image I, such that the reconstructed crack edge features of converge with the original image to obtain , as shown in Equation (6).
3.3.2. CEFR-Based Crack Pre-Processing Results
- The crack pixel values become uniform, meaning all crack pixels are set to 0.
- The crack boundaries appear clearly discernible, generating a strong pixel value step and rendering the pavement background and cracks easily distinguishable.
- Blurring in the original crack image is eliminated, crack image dithering is suppressed, and the reconstructed crack image exhibits more robust pixel relationships. Hence, the useful information in the original crack image is clearly and fully preserved.
- Although the background texture shadows are also enhanced, they lack a clear directional trend, unlike cracks. This is due to their small size and non-aggregate distribution. Consequently, in CEFR-enhanced images, distinct differences exist between background texture shadows and cracks, ensuring that the background texture shadows do not interfere with crack feature representation.
4. Experiment and Results
4.1. Data Preparation
4.2. Experiment Model
4.3. Parameters Setting
4.4. Model Evaluation
4.5. Experiment Results and Analysis
4.5.1. Effect of CEFR Parameter Settings on Crack Recognition Performance
4.5.2. Effect of CEFR Transforming Window Scale
4.5.3. Optimizing YOLOv5 for Crack Recognition
- CEFR Parameters: , , , .
- YOLOv5 Configuration: Hard swish activation function.
4.5.4. Comparison with Existing Methods
4.5.5. Generalisability of CEFR to Other Object Recognition Models, Datasets, and Tasks
5. Discussion
5.1. Contributions of the Paper
5.2. Limitations and Future Research
- Ground Truth Misalignment: In our experiments, labels were based on the original images and applied to both original and CEFR-enhanced images to maintain consistency. However, CEFR enhancement can reveal previously unclear cracks, causing bounding box coordinates to deviate from the ground truth. This may lead to higher leakage rates or reduced confidence levels in the enhanced dataset.
- Diverse Road Conditions and Scenarios: Expressway crack recognition involves various scenarios and complex operating conditions. A large-scale dataset with diverse crack samples is needed to enhance dataset completeness and improve the robustness of neural network prediction weights.
6. Conclusions
- Effectively enhances crack images and improves expressway crack recognition performance under fixed-PTZ vision.
- Shows strong generalizability across different datasets and object detection models.
- Works effectively in crack segmentation tasks, further proving CEFR versatility.
- CEFR image pre-processing has less computational cost.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- mot.gov. Give Full Play to the Key Role of Investment to Promote the Overall Improvement of Economic Performance. 2023. Available online: https://www.mot.gov.cn/jiaotongyaowen/202303/t20230306_3769450.html (accessed on 6 March 2023).
- Obunguta, F.; Matsushima, K.; Susaki, J. Probabilistic management of pavement defects with image processing techniques. J. Civ. Eng. Manag. 2024, 30, 114–132. [Google Scholar] [CrossRef]
- Chun, P.-J.; Yamane, T.; Tsuzuki, Y. Automatic detection of cracks in asphalt pavement using deep learning to overcome weaknesses in images and GIS visualization. Appl. Sci. 2021, 11, 892. [Google Scholar] [CrossRef]
- Luo, W.; Liu, L.; Li, L. Measuring rutting dimension and lateral position using 3D line scanning laser and inertial measuring unit. Autom. Constr. 2020, 111, 103056. [Google Scholar] [CrossRef]
- Li, Y.; Liu, C.; Yue, G.; Gao, Q.; Du, Y. Deep learning-based pavement subsurface distress detection via ground penetrating radar data. Autom. Constr. 2022, 142, 104516. [Google Scholar] [CrossRef]
- Zhang, L.; Xu, W.; Zhu, L.; Yuan, X.; Zhang, C. Study on pavement defect detection based on image processing utilizing UAV. J. Phys. Conf. Ser. 2019, 1168, 042011. [Google Scholar] [CrossRef]
- Zhu, J.; Zhong, J.; Ma, T.; Huang, X.; Zhang, W.; Zhou, Y. Pavement distress detection using convolutional neural networks with images captured via UAV. Autom. Constr. 2022, 133, 103991. [Google Scholar] [CrossRef]
- Li, Y.; Ma, J.; Zhao, Z.; Shi, G. A Novel Approach for UAV Image Crack Detection. Sensors 2022, 22, 3305. [Google Scholar] [CrossRef]
- Shao, C.; Feng, X.; Li, J. Crack Detection and Measurement Using PTZ Camera–Based Image Processing Method on Expressways. J. Transp. Eng. Part B Pavements 2022, 148, 04022043. [Google Scholar] [CrossRef]
- Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
- Quan, Y.; Sun, J.; Zhang, Y.; Zhang, H. The method of the road surface crack detection by the improved Otsu threshold. In Proceedings of the 2019 IEEE International Conference on Mechatronics and Automation (ICMA), Tianjin, China, 4–7 August 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1615–1620. [Google Scholar]
- Oliveira, H.; Correia, P.L. Automatic road crack segmentation using entropy and image dynamic thresholding. In Proceedings of the 2009 17th European Signal Processing Conference, Glasgow, UK, 24–28 August 2009; pp. 622–626. [Google Scholar]
- Chen, C.; Seo, H.; Jun, C.; Zhao, Y. A potential crack region method to detect crack using image processing of multiple thresholding. Signal Image Video Process. 2022, 16, 1673–1681. [Google Scholar] [CrossRef]
- Wang, S.; Tang, W. Pavement crack segmentation algorithm based on local optimal threshold of cracks density distribution. In Proceedings of the Advanced Intelligent Computing: 7th International Conference, ICIC 2011, Zhengzhou, China, 11–14 August 2011; Revised Selected Papers 7. Springer: Berlin/Heidelberg, Germany, 2012; pp. 298–302. [Google Scholar]
- Akagic, A.; Buza, E.; Omanovic, S.; Karabegovic, A. Pavement crack detection using Otsu thresholding for image segmentation. In Proceedings of the 2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia, 21–25 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1092–1097. [Google Scholar]
- Safaei, N.; Smadi, O.; Safaei, B.; Masoud, A. Efficient road crack detection based on an adaptive pixel-level segmentation algorithm. Transp. Res. Rec. 2021, 2675, 370–381. [Google Scholar] [CrossRef]
- Peng, L.; Chao, W.; Shuangmiao, L.; Baocai, F. Research on Crack Detection Method of Airport Runway Based on Twice-Threshold Segmentation. In Proceedings of the 2015 Fifth International Conference on Instrumentation and Measurement, Computer, Communication and Control (IMCCC), Qinhuangdao, China, 18–20 September 2015; pp. 1716–1720. [Google Scholar] [CrossRef]
- Ma, C.; Wang, W.; Zhao, C.; Di, F.; Zhu, Z. Pavement Cracks Detection Based on FDWT. In Proceedings of the 2009 International Conference on Computational Intelligence and Software Engineering, Wuhan, China, 11–13 December 2009; pp. 1–4. [Google Scholar] [CrossRef]
- Ying, L.; Salari, E. Beamlet transform-based technique for pavement crack detection and classification. Comput.-Aided Civ. Infrastruct. Eng. 2010, 25, 572–580. [Google Scholar] [CrossRef]
- Sun, L.; Qian, Z. Multi-scale wavelet transform filtering of non-uniform pavement surface image background for automated pavement distress identification. Measurement 2016, 86, 26–40. [Google Scholar] [CrossRef]
- Hamici, Z.; Obaidat, T.I.A.S. Pavement Images Denoising with Cracks Detection and Classification Using 2D Discrete Wavelet Transform and Savitzky-Golay Filters. In Proceedings of the 2019 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), Kuala Lumpur, Malaysia, 17–19 September 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 100–105. [Google Scholar]
- Matarneh, S.; Elghaish, F.; Al-Ghraibah, A.; Abdellatef, E.; Edwards, D.J. An automatic image processing based on Hough transform algorithm for pavement crack detection and classification. Smart Sustain. Built Environ. 2025, 14, 1–22. [Google Scholar] [CrossRef]
- Li, S.; Cao, Y.; Cai, H. Automatic pavement-crack detection and segmentation based on steerable matched filtering and an active contour model. J. Comput. Civ. Eng. 2017, 31, 04017045. [Google Scholar] [CrossRef]
- Liang, J.; Gu, X.; Chen, Y. Fast and robust pavement crack distress segmentation utilizing steerable filtering and local order energy. Constr. Build. Mater. 2020, 262, 120084. [Google Scholar] [CrossRef]
- Wang, W.; Li, H.; Wang, K.; He, C.; Bai, M. Pavement crack detection on geodesic shadow removal with local oriented filter on LOF and improved Level set. Constr. Build. Mater. 2020, 237, 117750. [Google Scholar] [CrossRef]
- Sun, M.; Zhao, H.; Li, J. Road crack detection network under noise based on feature pyramid structure with feature enhancement (road crack detection under noise). IET Image Process. 2022, 16, 809–822. [Google Scholar] [CrossRef]
- Sun, L.; Kamaliardakani, M.; Zhang, Y. Weighted neighborhood pixels segmentation method for automated detection of cracks on pavement surface images. J. Comput. Civ. Eng. 2016, 30, 04015021. [Google Scholar] [CrossRef]
- Ullah, A.; Sun, Z.; Elahi, H.; Afzal, F.; Khatoon, A.; Sayed, N.; Ahmad, I. Experimental and Numerical Research of Paved Microcrack Using Histogram Equalization for Detection and Segmentation. Math. Probl. Eng. 2022, 2022, 684983. [Google Scholar] [CrossRef]
- Liu, J.; Zhao, Z.; Lv, C.; Ding, Y.; Chang, H.; Xie, Q. An image enhancement algorithm to improve road tunnel crack transfer detection. Constr. Build. Mater. 2022, 348, 128583. [Google Scholar] [CrossRef]
- Qu, Z.; Cao, C.; Liu, L.; Zhou, D.Y. A deeply supervised convolutional neural network for pavement crack detection with multiscale feature fusion. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 4890–4899. [Google Scholar] [CrossRef]
- Li, B.L.; Qi, Y.; Fan, J.S.; Liu, Y.F.; Liu, C. A grid-based classification and box-based detection fusion model for asphalt pavement crack. Comput.-Aided Civ. Infrastruct. Eng. 2023, 38, 2279–2299. [Google Scholar] [CrossRef]
- Qi, Y.; He, Y.; Qi, X.; Zhang, Y.; Yang, G. Dynamic snake convolution based on topological geometric constraints for tubular structure segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 6070–6079. [Google Scholar]
- Yao, H.; Liu, Y.; Li, X.; You, Z.; Feng, Y.; Lu, W. A detection method for pavement cracks combining object detection and attention mechanism. IEEE Trans. Intell. Transp. Syst. 2022, 23, 22179–22189. [Google Scholar] [CrossRef]
- Ranyal, E.; Sadhu, A.; Jain, K. Enhancing pavement health assessment: An attention-based approach for accurate crack detection, measurement, and mapping. Expert Syst. Appl. 2024, 247, 123314. [Google Scholar] [CrossRef]
- Xiang, X.; Wang, Z.; Qiao, Y. An improved YOLOv5 crack detection method combined with transformer. IEEE Sens. J. 2022, 22, 14328–14335. [Google Scholar] [CrossRef]
- Zhu, G.; Liu, J.; Fan, Z.; Yuan, D.; Ma, P.; Wang, M.; Sheng, W.; Wang, K.C. A lightweight encoder–decoder network for automatic pavement crack detection. Comput.-Aided Civ. Infrastruct. Eng. 2024, 39, 1743–1765. [Google Scholar] [CrossRef]
- Huyan, J.; Li, W.; Tighe, S.; Zhai, J.; Xu, Z.; Chen, Y. Detection of sealed and unsealed cracks with complex backgrounds using deep convolutional neural network. Autom. Constr. 2019, 107, 102946. [Google Scholar] [CrossRef]
- Huang, Z.; Kang, W.; Wu, Q.; Chen, X. A new descriptor resistant to affine transformation and monotonic intensity change. Comput. Vis. Image Underst. 2014, 120, 117–125. [Google Scholar] [CrossRef]
- Shao, C.; Ding, Q.; Luo, H.; Chang, Z.; Zhang, C.; Zheng, T. Step-by-step pipeline processing approach for line segment detection. IET Image Process. 2017, 11, 416–424. [Google Scholar] [CrossRef]
- Shao, C.; Zhang, L.; Pan, W. PTZ camera-based image processing for automatic crack size measurement in expressways. IEEE Sens. J. 2021, 21, 23352–23361. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Scale | P | R | F1 | mAP@0.5 | mAP 0.5:0.95 |
---|---|---|---|---|---|
POD | 0.942 | 0.935 | 0.938 | 0.956 | 0.646 |
3 | 0.979 | 0.959 | 0.969 | 0.982 | 0.692 |
5 | 0.961 | 0.967 | 0.964 | 0.98 | 0.71 |
7 | 0.948 | 0.965 | 0.956 | 0.977 | 0.714 |
Activate Function | P | R | F1 | mAP 0.5 | mAP 0.5:0.95 |
---|---|---|---|---|---|
POD | 0.942 | 0.935 | 0.938 | 0.956 | 0.646 |
Hardswish | 0.979 | 0.959 | 0.969 | 0.982 | 0.692 |
ReLU | 0.952 | 0.927 | 0.939 | 0.972 | 0.684 |
SiLU | 0.961 | 0.948 | 0.954 | 0.975 | 0.708 |
Sigmoid | 0.937 | 0.925 | 0.931 | 0.963 | 0.661 |
Classification | P | R | F1 | AP 0.5 | AP 0.5:0.95 |
---|---|---|---|---|---|
Longitudinal | 0.966 | 0.94 | 0.953 | 0.982 | 0.692 |
Oblique | 0.974 | 0.98 | 0.977 | 0.973 | 0.67 |
Transversal | 0.995 | 1 | 0.997 | 0.995 | 0.629 |
Repaired | 0.98 | 0.916 | 0.947 | 0.991 | 0.825 |
Methods | P | R | F1 | mAP0.5 | mAP0.5:0.95 |
---|---|---|---|---|---|
YOLOv5 with Hardswish ((w/o)CEFR)) | 0.942 | 0.935 | 0.938 | 0.956 | 0.646 |
YOLOv5 with Hardswish ((w)CEFR)) | 0.979 | 0.959 | 0.969 | 0.982 | 0.692 |
Yao’ method ((w/o)CEFR)) | 0.954 | 0.934 | 0.944 | 0.953 | 0.684 |
Liu’ method ((w/o)CEFR)) | 0.953 | 0.918 | 0.935 | 0.957 | 0.668 |
Model | Dataset | P | R | F1 | mAP 0.5 | mAP 0.5:0.95 |
---|---|---|---|---|---|---|
YOLOv5 | PTZD (W/O(CEFR)) | 0.942 | 0.935 | 0.938 | 0.956 | 0.646 |
PTZD (W(CEFR)) | 0.961 | 0.948 | 0.954 | 0.975 | 0.708 | |
RT -DETR | PTZD (W/O(CEFR)) | 0.882 | 0.89 | 0.878 | 0.916 | 0.645 |
PTZD (W(CEFR)) | 0.95 | 0.908 | 0.971 | 0.963 | 0.687 | |
YOLOv8 | PTZD (W/O(CEFR)) | 0.981 | 0.958 | 0.969 | 0.984 | 0.776 |
PTZD (W(CEFR)) | 0.974 | 0.954 | 0.964 | 0.99 | 0.79 | |
YOLOv5 -DySnake | PTZD (W/O(CEFR)) | 0.956 | 0.949 | 0.952 | 0.966 | 0.719 |
PTZD (W(CEFR)) | 0.949 | 0.951 | 0.950 | 0.968 | 0.741 | |
YOLOv5 | CrackF (W/O(CEFR)) | 0.484 | 0.516 | 0.50 | 0.518 | 0.222 |
CrackF (W(CEFR)) | 0.566 | 0.629 | 0.596 | 0.602 | 0.269 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Feng, X.; Shao, C. A Contrast-Enhanced Feature Reconstruction for Fixed PTZ Camera-Based Crack Recognition in Expressways. Electronics 2025, 14, 2617. https://doi.org/10.3390/electronics14132617
Feng X, Shao C. A Contrast-Enhanced Feature Reconstruction for Fixed PTZ Camera-Based Crack Recognition in Expressways. Electronics. 2025; 14(13):2617. https://doi.org/10.3390/electronics14132617
Chicago/Turabian StyleFeng, Xuezhi, and Chunyan Shao. 2025. "A Contrast-Enhanced Feature Reconstruction for Fixed PTZ Camera-Based Crack Recognition in Expressways" Electronics 14, no. 13: 2617. https://doi.org/10.3390/electronics14132617
APA StyleFeng, X., & Shao, C. (2025). A Contrast-Enhanced Feature Reconstruction for Fixed PTZ Camera-Based Crack Recognition in Expressways. Electronics, 14(13), 2617. https://doi.org/10.3390/electronics14132617