MDPI - Publisher of Open Access Journals

17 pages, 1416 KiB

Open AccessArticle

A Transformer-Based Pavement Crack Segmentation Model with Local Perception and Auxiliary Convolution Layers

by Yi Zhu, Ting Cao and Yiqing Yang

Electronics 2025, 14(14), 2834; https://doi.org/10.3390/electronics14142834 - 15 Jul 2025

Viewed by 374

Crack detection in complex pavement scenarios remains challenging due to the sparse small-target features and computational inefficiency of existing methods. To address these limitations, this study proposes an enhanced architecture based on Mask2Former. The framework integrates two key innovations. A Local Perception Module [...] Read more.

Crack detection in complex pavement scenarios remains challenging due to the sparse small-target features and computational inefficiency of existing methods. To address these limitations, this study proposes an enhanced architecture based on Mask2Former. The framework integrates two key innovations. A Local Perception Module (LPM) reconstructs geometric topological relationships through a Sequence-Space Dynamic Transformation Mechanism (DS2M), enhancing neighborhood feature extraction via depthwise separable convolutions. Simultaneously, an Auxiliary Convolutional Layer (ACL) combines lightweight residual convolutions with shallow high-resolution features, preserving critical edge details through channel attention weighting. Experimental evaluations demonstrate the model’s superior performance, achieving improvements of 3.2% in mIoU and 2.7% in mAcc compared to baseline methods, while maintaining computational efficiency with only 12.8 GFLOPs. These results validate the effectiveness of geometric relationship modeling and hierarchical feature fusion for pavement crack detection, suggesting practical potential for infrastructure maintenance systems. The proposed approach balances precision and efficiency, offering a viable solution for real-world applications with complex crack patterns and hardware constraints. Full article

► Show Figures

Figure 1

25 pages, 11680 KiB

Open AccessArticle

ETAFHrNet: A Transformer-Based Multi-Scale Network for Asymmetric Pavement Crack Segmentation

by Chao Tan, Jiaqi Liu, Zhedong Zhao, Rufei Liu, Peng Tan, Aishu Yao, Shoudao Pan and Jingyi Dong

Appl. Sci. 2025, 15(11), 6183; https://doi.org/10.3390/app15116183 - 30 May 2025

Cited by 1 | Viewed by 703

Abstract

Accurate segmentation of pavement cracks from high-resolution remote sensing imagery plays a crucial role in automated road condition assessment and infrastructure maintenance. However, crack structures often exhibit asymmetry, irregular morphology, and multi-scale variations, posing significant challenges to conventional CNN-based methods in real-world environments. [...] Read more.

Accurate segmentation of pavement cracks from high-resolution remote sensing imagery plays a crucial role in automated road condition assessment and infrastructure maintenance. However, crack structures often exhibit asymmetry, irregular morphology, and multi-scale variations, posing significant challenges to conventional CNN-based methods in real-world environments. Specifically, the proposed ETAFHrNet focuses on two predominant pavement-distress morphologies—linear cracks (transverse and longitudinal) and alligator cracks—and has been empirically validated on their intersections and branching patterns over both asphalt and concrete road surfaces. In this work, we present ETAFHrNet, a novel attention-guided segmentation network designed to address the limitations of traditional architectures in detecting fine-grained and asymmetric patterns. ETAFHrNet integrates Transformer-based global attention and multi-scale hybrid feature fusion, enhancing both contextual perception and detail sensitivity. The network introduces two key modules: the Efficient Hybrid Attention Transformer (EHAT), which captures long-range dependencies, and the Cross-Scale Hybrid Attention Module (CSHAM), which adaptively fuses features across spatial resolutions. To support model training and benchmarking, we also propose QD-Crack, a high-resolution, pixel-level annotated dataset collected from real-world road inspection scenarios. Experimental results show that ETAFHrNet significantly outperforms existing methods—including U-Net, DeepLabv3+, and HRNet—in both segmentation accuracy and generalization ability. These findings demonstrate the effectiveness of interpretable, multi-scale attention architectures in complex object detection and image classification tasks, making our approach relevant for broader applications, such as autonomous driving, remote sensing, and smart infrastructure systems. Full article

(This article belongs to the Special Issue Object Detection and Image Classification)

► Show Figures

Figure 1

20 pages, 24073 KiB

Open AccessArticle

Comparison of Directional and Diffused Lighting for Pixel-Level Segmentation of Concrete Cracks

by Hamish Dow, Marcus Perry, Jack McAlorum and Sanjeetha Pennada

Infrastructures 2025, 10(6), 129; https://doi.org/10.3390/infrastructures10060129 - 25 May 2025

Viewed by 475

Abstract

Visual inspections of concrete infrastructure in low-light environments require external lighting to ensure adequate visibility. Directional lighting sources, where an image scene is illuminated with an angled lighting source from one direction, can enhance the visibility of surface defects in an image. This [...] Read more.

Visual inspections of concrete infrastructure in low-light environments require external lighting to ensure adequate visibility. Directional lighting sources, where an image scene is illuminated with an angled lighting source from one direction, can enhance the visibility of surface defects in an image. This paper compares directional and diffused scene illumination images for pixel-level concrete crack segmentation. A novel directional lighting image segmentation algorithm is proposed, which applies crack segmentation image processing techniques to each directionally lit image before combining all images into a single output, highlighting the extremities of the defect. This method was benchmarked against two diffused lighting crack detection techniques across a dataset with crack widths typically ranging from 0.07 mm to 0.4 mm. When tested on cracked and uncracked data, the directional lighting method significantly outperformed other benchmarked diffused lighting methods, attaining a 10% higher true-positive rate (TPR), 12% higher intersection over union (IoU), and 10% higher F1 score with minimal impact on precision. Further testing on only cracked data revealed that directional lighting was superior across all crack widths in the dataset. This research shows that directional lighting can enhance pixel-level crack segmentation in infrastructure requiring external illumination, such as low-light indoor spaces (e.g., tunnels and containment structures) or night-time outdoor inspections (e.g., pavement and bridges). Full article

(This article belongs to the Section Infrastructures Inspection and Maintenance)

► Show Figures

Figure 1

17 pages, 5756 KiB

Open AccessArticle

PPDD: Egocentric Crack Segmentation in the Port Pavement with Deep Learning-Based Methods

by Hyemin Yoon, Hoe-Kyoung Kim and Sangjin Kim

Appl. Sci. 2025, 15(10), 5446; https://doi.org/10.3390/app15105446 - 13 May 2025

Viewed by 639

Abstract

Road infrastructure is a critical component of modern society, with its maintenance directly influencing traffic safety and logistical efficiency. In this context, automated crack detection technology plays a vital role in reducing maintenance costs and enhancing operational efficiency. However, previous studies are limited [...] Read more.

Road infrastructure is a critical component of modern society, with its maintenance directly influencing traffic safety and logistical efficiency. In this context, automated crack detection technology plays a vital role in reducing maintenance costs and enhancing operational efficiency. However, previous studies are limited by the fact that they provide only bounding box or segmentation mask annotations for a restricted number of crack classes and use a relatively small size of datasets. To address these limitations and advance deep learning-based crack segmentation, this study introduces a novel crack segmentation dataset that reflects real-world road conditions. The proposed dataset includes various types of cracks and defects—such as slippage, rutting, and construction-related cracks—and provides polygon-based segmentation masks captured from an egocentric, vehicle-mounted perspective. Using this dataset, we evaluated the performance of semantic and instance segmentation models. Notably, SegFormer achieved the highest Pixel Accuracy (PA) and mean Intersection over Union (mIoU) for semantic segmentation, while YOLOv7 exhibited outstanding detection performance for alligator crack class, recording an

{A P}_{50}

of 87.2% and

A P

of 57.5%. In contrast, all models struggled with the reflection crack type, indicating the inherent segmentation challenges. Overall, this study provides a practical and robust foundation for future research in automated road crack segmentation. Additional resources including the dataset and annotation details can be found at our GitHub repository. Full article

(This article belongs to the Section Computing and Artificial Intelligence)

► Show Figures

Figure 1

19 pages, 31846 KiB

Open AccessArticle

Proposal of an Integrated Method of Unmanned Aerial Vehicle and Artificial Intelligence for Crack Detection, Classification, and PCI Calculation of Airport Pavements

by Valerio Perri, Misagh Ketabdari, Stefano Cimichella, Maurizio Crispino and Emanuele Toraldo

Sustainability 2025, 17(7), 3180; https://doi.org/10.3390/su17073180 - 3 Apr 2025

Viewed by 962

Abstract

Assessing the condition of airport pavements is essential to ensure operational safety and efficiency. This study presents an innovative, fully automated approach to calculate the Pavement Condition Index (PCI) by combining UAV-based aerial photogrammetry with advanced Artificial Intelligence (AI) techniques. The method follows [...] Read more.

Assessing the condition of airport pavements is essential to ensure operational safety and efficiency. This study presents an innovative, fully automated approach to calculate the Pavement Condition Index (PCI) by combining UAV-based aerial photogrammetry with advanced Artificial Intelligence (AI) techniques. The method follows three key steps: first, analyzing orthophotos of individual pavement sections using a custom-trained AI model designed for precise crack detection and classification; second, utilizing skeletonization and semantic mask analysis to measure crack characteristics; and third, automating the PCI calculation for faster and more consistent evaluations. By leveraging high-resolution Unmanned Aerial Vehicle (UAV) imagery and advanced segmentation models, this approach achieves superior accuracy in detecting transverse and longitudinal cracks. The automated PCI calculation minimizes the need for human intervention, reduces errors, and supports more efficient, data-driven decision-making for airport pavement management. This study demonstrates the transformative potential of integrating UAV and AI technologies to facilitate infrastructure maintenance and enhance safety protocols. Full article

(This article belongs to the Special Issue Next-Generation Roads: The Impact of Artificial Intelligence and Virtual Reality on the Planning, Design and Management of Sustainable Infrastructures)

► Show Figures

Figure 1

19 pages, 3527 KiB

Open AccessArticle

A Lightweight and High-Accuracy Model for Pavement Crack Segmentation

by Yuhui Yu, Wenjun Xia, Zhangyan Zhao and Bin He

Appl. Sci. 2024, 14(24), 11632; https://doi.org/10.3390/app142411632 - 12 Dec 2024

Viewed by 2552

Abstract

Pavement cracks significantly affect road safety and longevity, making accurate crack segmentation essential for effective maintenance. Although deep learning methods have demonstrated excellent performance in this task, their large network architectures limit their applicability on resource-constrained devices. To address this challenge, this paper [...] Read more.

Pavement cracks significantly affect road safety and longevity, making accurate crack segmentation essential for effective maintenance. Although deep learning methods have demonstrated excellent performance in this task, their large network architectures limit their applicability on resource-constrained devices. To address this challenge, this paper proposes a lightweight, fully convolutional neural network model, enhanced with spatial information. First, the backbone network structure is optimized to improve the efficiency of spatial information utilization. Second, by incorporating adaptive feature reassembly and wavelet transforms, the up-sampling and down-sampling processes are refined, enhancing the model capacity to capture spatial information. Lastly, a dynamic combined loss function is employed during training to further improve model attention on crack edge details. To validate the model performance, we trained and tested it on the Crack500 dataset and applied the trained model directly to the AsphaltCrack300 dataset. Experimental results indicate that the proposed model achieved an MIoU of 80.37% and an F1-score of 78.22% on the Crack500 dataset, representing increases of 3.08% and 5.62%, respectively, compared to EfficientNet. On the AsphaltCrack300 dataset, the model exhibited strong robustness, significantly outperforming other mainstream models. Additionally, its lightweight design provides clear advantages, making it well suited for realworld applications with limited computational resources. Full article

► Show Figures

Figure 1

21 pages, 10985 KiB

Open AccessArticle

A Novel Multi-Scale Feature Enhancement U-Shaped Network for Pixel-Level Road Crack Segmentation

by Jing Wang, Benlan Shen, Guodong Li, Jiao Gao and Chao Chen

Electronics 2024, 13(22), 4503; https://doi.org/10.3390/electronics13224503 - 16 Nov 2024

Cited by 1 | Viewed by 1112

Abstract

Timely and accurate detection of pavement cracks, the most common type of road damage, is essential for ensuring road safety. Automatic image segmentation of cracks can accurately locate their pixel positions. This paper proposes a Multi-Scale Feature Enhanced U-shaped Network (MFE-UNet) for pavement [...] Read more.

Timely and accurate detection of pavement cracks, the most common type of road damage, is essential for ensuring road safety. Automatic image segmentation of cracks can accurately locate their pixel positions. This paper proposes a Multi-Scale Feature Enhanced U-shaped Network (MFE-UNet) for pavement crack detection. This network model uses a Residual Detail-Enhanced Block (RDEB) instead of a conventional convolution in the encoder–decoder process. The block combines Efficient Multi-Scale Attention to enhance its feature extraction performance. The Multi-Scale Gating Feature Fusion (MGFF) is incorporated into the skip connections, enhancing the fusion of multi-scale features to capture finer crack details while maintaining rich semantic information. Furthermore, we created a pavement crack image dataset named China_MCrack, consisting of 1500 images collected from road surfaces using smartphone-mounted motorbikes. The proposed network was trained and tested on the China_MCrack, DeepCrack, and Crack-Forest datasets, with additional generalization experiments on the BochumCrackDataset. The results were compared with those of the U-Net model, ResUNet, and Attention U-Net. The experimental results show that the proposed MFE-UNet model achieves accuracies of 82.95%, 91.71%, and 69.02% on three datasets, namely, China_MCrack, DeepCrack, and Crack-Forest datasets, respectively, and the F1_score is improved by 1–4% compared with other networks. Experimental results demonstrate that the proposed method is effective in detecting cracks at the pixel level. Full article

(This article belongs to the Special Issue Emerging Technologies in Computational Intelligence)

► Show Figures

Figure 1

21 pages, 2380 KiB

Open AccessArticle

Crack Detection, Classification, and Segmentation on Road Pavement Material Using Multi-Scale Feature Aggregation and Transformer-Based Attention Mechanisms

by Arselan Ashraf, Ali Sophian and Ali Aryo Bawono

Constr. Mater. 2024, 4(4), 655-675; https://doi.org/10.3390/constrmater4040036 - 16 Oct 2024

Cited by 4 | Viewed by 3468

Abstract

This paper introduces a novel approach to pavement material crack detection, classification, and segmentation using advanced deep learning techniques, including multi-scale feature aggregation and transformer-based attention mechanisms. The proposed methodology significantly enhances the model’s ability to handle varying crack sizes, shapes, and complex [...] Read more.

This paper introduces a novel approach to pavement material crack detection, classification, and segmentation using advanced deep learning techniques, including multi-scale feature aggregation and transformer-based attention mechanisms. The proposed methodology significantly enhances the model’s ability to handle varying crack sizes, shapes, and complex pavement textures. Trained on a dataset of 10,000 images, the model achieved substantial performance improvements across all tasks after integrating transformer-based attention. Detection precision increased from 88.7% to 94.3%, and IoU improved from 78.8% to 93.2%. In classification, precision rose from 88.3% to 94.8%, and recall improved from 86.8% to 94.2%. For segmentation, the Dice Coefficient increased from 80.3% to 94.7%, and IoU for segmentation advanced from 74.2% to 92.3%. These results underscore the model’s robustness and accuracy in identifying pavement cracks in challenging real-world scenarios. This framework not only advances automated pavement maintenance but also provides a foundation for future research focused on optimizing real-time processing and extending the model’s applicability to more diverse pavement conditions. Full article

► Show Figures

Figure 1

22 pages, 7018 KiB

Open AccessArticle

PAN: Improved PointNet++ for Pavement Crack Information Extraction

by Jiakai Fan, Weidong Song, Jinhe Zhang, Shangyu Sun, Guohui Jia and Guang Jin

Electronics 2024, 13(16), 3340; https://doi.org/10.3390/electronics13163340 - 22 Aug 2024

Cited by 2 | Viewed by 1630

Abstract

Maintenance and repair of expressways are becoming increasingly important due to the growing frequency of their use. Accurate pavement crack information extraction helps with routine maintenance and reduces the risk of traffic accidents. The traditional 2D crack image detection method has limitations and [...] Read more.

Maintenance and repair of expressways are becoming increasingly important due to the growing frequency of their use. Accurate pavement crack information extraction helps with routine maintenance and reduces the risk of traffic accidents. The traditional 2D crack image detection method has limitations and cannot effectively obtain depth information. Three-dimensional crack extraction from 3D point cloud has become a new solution that can capture pavement crack information more comprehensively and accurately. However, the existing algorithms are not effective in the feature extraction of cracks due to the different and irregular shapes and sizes of pavement cracks and interference from the external environment. To solve this, a new method for detecting pavement cracks in point clouds, namely point attention net (PAN), is herein proposed. It uses a two-branch attention fusion module to focus on space and feature information in the cloud and capture features of crack points at different scales. It also uses the Poly Loss function to solve the imbalance of foreground and background points in pavement point cloud data. Experiments on the LNTU-RDD-LiDAR dataset were carried out to verify the effectiveness of the proposed method. Compared with the traditional method and the latest point cloud segmentation technology, the performance indexes of mIoU, Acc, F1, and Rec achieved significant improvement, reaching 75.4%, 91.5%, 75.4%, and 67.1%, respectively. Full article

(This article belongs to the Special Issue Fault Detection Technology Based on Deep Learning)

► Show Figures

Figure 1

14 pages, 7304 KiB

Open AccessArticle

Automatic Detection of Urban Pavement Distress and Dropped Objects with a Comprehensive Dataset Collected via Smartphone

by Lin Xu, Kaimin Fu, Tao Ma, Fanlong Tang and Jianwei Fan

Buildings 2024, 14(6), 1546; https://doi.org/10.3390/buildings14061546 - 27 May 2024

Cited by 3 | Viewed by 1445

Abstract

Pavement distress seriously affects the quality of pavement and reduces driving comfort and safety. The dropped objects from vehicles have increased the risks of traffic accidents. Therefore, automatic detection of urban pavement distress and dropped objects is an effective method to timely evaluate [...] Read more.

Pavement distress seriously affects the quality of pavement and reduces driving comfort and safety. The dropped objects from vehicles have increased the risks of traffic accidents. Therefore, automatic detection of urban pavement distress and dropped objects is an effective method to timely evaluate pavement condition. Firstly, this paper utilized a portable platform to collect pavement distress and dropped objects to establish a high-quality dataset. Six types of pavement distresses: transverse crack, longitudinal crack, alligator crack, oblique crack, potholes, and repair, and three types of dropped objects: plastic bottle, metal bottle, and tetra pak were included in this comprehensive dataset. Secondly, the real-time YOLO series detection models were used to classify and localize the pavement distresses and dropped objects. In addition, segmentation models W-segnet, U-Net, and SegNet were utilized to achieve pixel-level detection of pavement distress and dropped objects. The results show that YOLOv8 outperformed YOLOv5 and YOLOv7 with a MAP of 0.889. W-segnet showed an overall MIoU of 70.65% and 68.33% on the training set and test set, respectively, being superior to the comparison model and being able to achieve high-precision pixel-level segmentation. Finally, the trained models were performed on the holdout dataset for the generalization test. The proposed methods integrated the detection of urban pavement distress and dropped objects, which could significantly contribute to driving safety. Full article

(This article belongs to the Special Issue Urban Infrastructure Construction and Management)

► Show Figures

Figure 1

17 pages, 5439 KiB

Open AccessArticle

Pavement Crack Detection Based on the Improved Swin-Unet Model

by Song Chen, Zhixuan Feng, Guangqing Xiao, Xilong Chen, Chuxiang Gao, Mingming Zhao and Huayang Yu

Buildings 2024, 14(5), 1442; https://doi.org/10.3390/buildings14051442 - 16 May 2024

Cited by 13 | Viewed by 3084

Abstract

Accurate pavement surface crack detection is crucial for analyzing pavement survey data and the development of maintenance strategies. On the basis of Swin-Unet, this study develops the improved Swin-Unet (iSwin-Unet) model with the developed skip attention module and the residual Swin Transformer block. [...] Read more.

Accurate pavement surface crack detection is crucial for analyzing pavement survey data and the development of maintenance strategies. On the basis of Swin-Unet, this study develops the improved Swin-Unet (iSwin-Unet) model with the developed skip attention module and the residual Swin Transformer block. Based on the channel attention mechanism, the pavement crack region can be better captured while the crack feature channels can be assigned more weights. Taking advantage of the developed residual Swin Transformer block, the encoder architecture can globally model the pavement crack feature. Meanwhile, the crack feature information can be efficiently exchanged. To verify the pavement crack detection performance of the proposed model, we compare the training performance and visualization results with the other three models, which are Swin-Unet, Swin Transformer, and Unet, respectively. Three public benchmarks (CFD, Crack500, and CrackSC) have been adopted for the purpose of training, validation, and testing. Based on the test results, it can be found that the developed iSwin-Unet achieves a significant increase in mF1 score, mPrecision, and mRecall compared to the existing models, thereby establishing its efficacy in pavement crack detection and underlining its significant advancements over current methodologies. Full article

(This article belongs to the Section Building Materials, and Repair & Renovation)

► Show Figures

Figure 1

22 pages, 4136 KiB

Open AccessArticle

DepthCrackNet: A Deep Learning Model for Automatic Pavement Crack Detection

by Alireza Saberironaghi and Jing Ren

J. Imaging 2024, 10(5), 100; https://doi.org/10.3390/jimaging10050100 - 26 Apr 2024

Cited by 7 | Viewed by 3836

Abstract

Detecting cracks in the pavement is a vital component of ensuring road safety. Since manual identification of these cracks can be time-consuming, an automated method is needed to speed up this process. However, creating such a system is challenging due to factors including [...] Read more.

Detecting cracks in the pavement is a vital component of ensuring road safety. Since manual identification of these cracks can be time-consuming, an automated method is needed to speed up this process. However, creating such a system is challenging due to factors including crack variability, variations in pavement materials, and the occurrence of miscellaneous objects and anomalies on the pavement. Motivated by the latest progress in deep learning applied to computer vision, we propose an effective U-Net-shaped model named DepthCrackNet. Our model employs the Double Convolution Encoder (DCE), composed of a sequence of convolution layers, for robust feature extraction while keeping parameters optimally efficient. We have incorporated the TriInput Multi-Head Spatial Attention (TMSA) module into our model; in this module, each head operates independently, capturing various spatial relationships and boosting the extraction of rich contextual information. Furthermore, DepthCrackNet employs the Spatial Depth Enhancer (SDE) module, specifically designed to augment the feature extraction capabilities of our segmentation model. The performance of the DepthCrackNet was evaluated on two public crack datasets: Crack500 and DeepCrack. In our experimental studies, the network achieved mIoU scores of 77.0% and 83.9% with the Crack500 and DeepCrack datasets, respectively. Full article

(This article belongs to the Special Issue Deep Learning in Image Analysis: Progress and Challenges)

► Show Figures

Figure 1

17 pages, 5086 KiB

Open AccessArticle

Automated Pavement Condition Index Assessment with Deep Learning and Image Analysis: An End-to-End Approach

by Eldor Ibragimov, Yongsoo Kim, Jung Hee Lee, Junsang Cho and Jong-Jae Lee

Sensors 2024, 24(7), 2333; https://doi.org/10.3390/s24072333 - 6 Apr 2024

Cited by 7 | Viewed by 5123

Abstract

The degradation of road pavements due to environmental factors is a pressing issue in infrastructure maintenance, necessitating precise identification of pavement distresses. The pavement condition index (PCI) serves as a critical metric for evaluating pavement conditions, essential for effective budget allocation and performance [...] Read more.

The degradation of road pavements due to environmental factors is a pressing issue in infrastructure maintenance, necessitating precise identification of pavement distresses. The pavement condition index (PCI) serves as a critical metric for evaluating pavement conditions, essential for effective budget allocation and performance tracking. Traditional manual PCI assessment methods are limited by labor intensity, subjectivity, and susceptibility to human error. Addressing these challenges, this paper presents a novel, end-to-end automated method for PCI calculation, integrating deep learning and image processing technologies. The first stage employs a deep learning algorithm for accurate detection of pavement cracks, followed by the application of a segmentation-based skeleton algorithm in image processing to estimate crack width precisely. This integrated approach enhances the assessment process, providing a more comprehensive evaluation of pavement integrity. The validation results demonstrate a 95% accuracy in crack detection and 90% accuracy in crack width estimation. Leveraging these results, the automated PCI rating is achieved, aligned with standards, showcasing significant improvements in the efficiency and reliability of PCI evaluations. This method offers advancements in pavement maintenance strategies and potential applications in broader road infrastructure management. Full article

(This article belongs to the Special Issue Toward Green and Intelligent Transportation Infrastructure: Road Non-destructive Testing and Structural Health Monitoring Technologies)

► Show Figures

Figure 1

19 pages, 16460 KiB

Open AccessArticle

The Crack Diffusion Model: An Innovative Diffusion-Based Method for Pavement Crack Detection

by Haoyuan Zhang, Ning Chen, Mei Li and Shanjun Mao

Remote Sens. 2024, 16(6), 986; https://doi.org/10.3390/rs16060986 - 11 Mar 2024

Cited by 18 | Viewed by 3288

Abstract

Pavement crack detection is of significant importance in ensuring road safety and smooth traffic flow. However, pavement cracks come in various shapes and forms which exhibit spatial continuity, and algorithms need to adapt to different types of cracks while preserving their continuity. To [...] Read more.

Pavement crack detection is of significant importance in ensuring road safety and smooth traffic flow. However, pavement cracks come in various shapes and forms which exhibit spatial continuity, and algorithms need to adapt to different types of cracks while preserving their continuity. To address these challenges, an innovative crack detection framework, CrackDiff, based on the generative diffusion model, is proposed. It leverages the learning capabilities of the generative diffusion model for the data distribution and latent spatial relationships of cracks across different sample timesteps and generates more accurate and continuous crack segmentation results. CrackDiff uses crack images as guidance for the diffusion model and employs a multi-task UNet architecture to predict mask and noise simultaneously at each sampling step, enhancing the robustness of generations. Compared to other models, CrackDiff generates more accurate and stable results. Through experiments on the Crack500 and DeepCrack pavement datasets, CrackDiff achieves the best performance (F1 = 0.818 and mIoU = 0.841 on Crack500, and F1 = 0.841 and mIoU = 0.862 on DeepCrack). Full article

► Show Figures

Figure 1

21 pages, 3740 KiB

Open AccessArticle

A Multi-Stage Feature Aggregation and Structure Awareness Network for Concrete Bridge Crack Detection

by Erhu Zhang, Tao Jiang and Jinghong Duan

Sensors 2024, 24(5), 1542; https://doi.org/10.3390/s24051542 - 28 Feb 2024

Cited by 8 | Viewed by 1519

Abstract

One of the most significant problems affecting a concrete bridge’s safety is cracks. However, detecting concrete bridge cracks is still challenging due to their slender nature, low contrast, and background interference. The existing convolutional methods with square kernels struggle to capture crack features [...] Read more.

One of the most significant problems affecting a concrete bridge’s safety is cracks. However, detecting concrete bridge cracks is still challenging due to their slender nature, low contrast, and background interference. The existing convolutional methods with square kernels struggle to capture crack features effectively, fail to perceive the long-range dependencies between crack regions, and have weak suppression ability for background noises, leading to low detection precision of bridge cracks. To address this problem, a multi-stage feature aggregation and structure awareness network (MFSA-Net) for pixel-level concrete bridge crack detection is proposed in this paper. Specifically, in the coding stage, a structure-aware convolution block is proposed by combining square convolution with strip convolution to perceive the linear structure of concrete bridge cracks. Square convolution is used to capture detailed local information. In contrast, strip convolution is employed to interact with the local features to establish the long-range dependence relationship between discrete crack regions. Unlike the self-attention mechanism, strip convolution also suppresses background interference near crack regions. Meanwhile, the feature attention fusion block is presented for fusing features from the encoder and decoder at the same stage, which can sharpen the edges of concrete bridge cracks. In order to fully utilize the shallow detail features and deep semantic features, the features from different stages are aggregated to obtain fine-grained segmentation results. The proposed MFSA-Net was trained and evaluated on the publicly available concrete bridge crack dataset and achieved average results of 73.74%, 77.04%, 75.30%, and 60.48% for precision, recall, F1 score, and IoU, respectively, on three typical sub-datasets, thus showing optimal performance in comparison with other existing methods. MFSA-Net also gained optimal performance on two publicly available concrete pavement crack datasets, thereby indicating its adaptability to crack detection across diverse scenarios. Full article

(This article belongs to the Special Issue Advanced Sensing Systems for Structural Monitoring and Damage Identification of Buildings and Bridges)

► Show Figures

Figure 1

Search Results (58)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (58)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI