Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (189)

Search Parameters:
Keywords = automatic crack detection

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 11218 KB  
Article
Image-Assisted Residual Load-Bearing Capacity Assessment of Plain Concrete Beams Using U-Net Crack Segmentation and Phase-Field Simulation
by Simeng Wang, Wen Zhao, Yuanyan Liang and Huiming Wang
Buildings 2026, 16(12), 2334; https://doi.org/10.3390/buildings16122334 - 11 Jun 2026
Viewed by 133
Abstract
Concrete cracks are ubiquitous in practical engineering structures and continuously affect structural safety and durability. Crack images provide important visual evidence of damage evolution; however, crack images alone are insufficient to determine the residual load-bearing capacity of concrete members. Although the development of [...] Read more.
Concrete cracks are ubiquitous in practical engineering structures and continuously affect structural safety and durability. Crack images provide important visual evidence of damage evolution; however, crack images alone are insufficient to determine the residual load-bearing capacity of concrete members. Although the development of deep learning algorithms has significantly improved the automatic detection of concrete surface cracks, most existing methods remain limited to the extraction of crack geometric features and lack a direct connection with mechanical performance. To explore the relationship between image-based crack geometry and mechanical response, this study combines U-Net-based crack segmentation, OpenCV-based crack geometry extraction, and phase-field fracture simulation to establish a preliminary visual–mechanical framework for plain concrete beams. In this framework, surface crack images are first segmented using a U-Net model, and crack length, average width, and propagation path are extracted from the predicted binary masks. The extracted crack length is then used as the primary variable to match the observed crack state with the phase-field crack evolution sequence. Once the corresponding simulation stage is identified, the associated load level and residual load-bearing capacity can be obtained from the simulated load–crack mouth opening displacement (Load–CMOD) response. Through a mixed-mode I–II fracture test, the crack geometric features extracted by deep learning are compared with the phase-field simulation results. The results show that the error in crack length is within 2.5%. Meanwhile, the relative error between the simulated peak load and the experimental value was 1.57%, which preliminarily verified the correlation between image-based crack information and the load-bearing capacity of plain concrete beams. The method is further applied to a Mode I fracture test without recorded load-bearing capacity data. By mapping the crack length identified from the image, namely 36.89 mm, to the phase-field evolution sequence, the load-bearing capacity of the member at this stage is estimated to be 74.4% of the peak load. The results indicate that the crack geometry extracted from images can be correlated with phase-field crack evolution, thereby supporting preliminary residual load-bearing capacity assessment of plain concrete beams. However, the proposed framework should be regarded as a case-level feasibility study rather than a general structural assessment method. Before broader engineering application, further validation using synchronized crack image sequences, crack mouth opening displacement (CMOD) measurements, and load records is required. Full article
(This article belongs to the Section Building Structures)
Show Figures

Figure 1

32 pages, 20674 KB  
Article
MCGC-Net: A Text-Enhanced Geometry-Consistent Network for UAV-Based Road Crack Detection
by Zhoujun Ou, Shicong He, Rongwei Bu, Peng Wang and Gufeng Gong
Sensors 2026, 26(11), 3487; https://doi.org/10.3390/s26113487 - 1 Jun 2026
Viewed by 332
Abstract
With the rapid development of unmanned aerial vehicle (UAV) remote sensing and deep learning, road crack detection has become an important component of road condition assessment and intelligent road maintenance. However, accurately detecting cracks from UAV images remains challenging due to complex background [...] Read more.
With the rapid development of unmanned aerial vehicle (UAV) remote sensing and deep learning, road crack detection has become an important component of road condition assessment and intelligent road maintenance. However, accurately detecting cracks from UAV images remains challenging due to complex background environments, slender crack structures, blurred boundaries, and irregular crack shapes and orientations. Traditional methods that rely solely on visual information often struggle to achieve stable and accurate detection performance under these conditions. To address these challenges, this paper proposes a Multimodal Crack Geometry-Consistent Network (MCGC-Net) for high-precision road crack detection in complex road scenes. First, a UAV-based multimodal road crack dataset with image-text annotations is constructed. Specifically, crack-related textual descriptions are automatically generated from crack annotations using predefined semantic templates, which summarize crack morphology, spatial distribution characteristics, and structural properties. These semantic descriptions provide high-level semantic prior information for crack representation learning. Second, a Multimodal Contrastive Semantic Gating module (MCSG) is introduced to leverage automatically generated crack semantic descriptions and in-batch image-text semantic differences to guide visual feature learning, thereby improving the discrimination between crack and non-crack regions under complex background conditions. Furthermore, a Crack-Aware Slenderness Loss (CASL) is proposed to explicitly constrain slenderness consistency between predicted boxes and ground-truth boxes, improving localization stability for slender crack targets. In addition, a KAN-based Nonlinear Channel Attention mechanism (KAN-CA) is introduced to enhance feature representation capability for complex crack structures. Experimental results demonstrate that the proposed MCGC-Net effectively improves crack detection accuracy and structural representation capability under complex road environments. The proposed method provides a practical and reliable solution for UAV-based intelligent road crack detection. Full article
(This article belongs to the Section Vehicular Sensing)
Show Figures

Figure 1

27 pages, 39300 KB  
Article
Multi-Frame Temporal Integration for 3-D Shape Measurement of Freely Falling Small Objects Using a High-Speed Camera Array
by Hao Duan, Shaopeng Hu, Feiyue Wang, Kohei Shimasaki and Idaku Ishii
Sensors 2026, 26(11), 3457; https://doi.org/10.3390/s26113457 - 30 May 2026
Viewed by 214
Abstract
Dynamic three-dimensional (3-D) reconstruction of small objects moving at high speed is fundamentally limited by the number of viewpoints that a fixed camera array can provide at any single time instant. When the camera count is insufficient, single-frame multi-view stereo produces incomplete or [...] Read more.
Dynamic three-dimensional (3-D) reconstruction of small objects moving at high speed is fundamentally limited by the number of viewpoints that a fixed camera array can provide at any single time instant. When the camera count is insufficient, single-frame multi-view stereo produces incomplete or inaccurate geometry. This paper proposes a multi-frame temporal integration approach that overcomes this limitation by exploiting the rigid-body assumption: because a falling object maintains its shape across consecutive frames, images captured at different time instants can be combined into a single, viewpoint-enriched reconstruction. A three-layer circular array of 32 synchronized RGB cameras captures 1440 × 1080 images at 160 fps, and a free-fall-oriented algorithm automatically detects active frames, selects informative temporal windows, and feeds the accumulated multi-frame images into a structure-from-motion and multi-view stereo (SfM-MVS) pipeline, effectively multiplying the number of viewpoints without additional hardware. The algorithm simultaneously recovers the 6-DOF pose trajectory of each object from the SfM-estimated camera parameters. Progressive accumulation experiments on freely falling soybeans (approximately 9–10 mm diameter) show that a single 32-camera frame already achieves an F-score exceeding 0.97 at a 0.5 mm threshold against an industrial structured-light scanner reference, and that accumulating additional temporal frames reaches a stable convergence plateau with both objects reaching a plateau F-score of 0.984. Beyond approximately one to two accumulated frames, additional frames yield diminishing returns, confirming that a small number of temporal frames is sufficient for convergent sub-millimeter accuracy. Across 30 independent free-fall trials with three objects, the system achieves an overall mean error of 0.146±0.033 mm and an overall F-score of 0.980±0.006—a mean relative error of approximately 1.6% on 8–10 mm targets—and fine surface features such as structural cracks are resolved at a fidelity sufficient for visual defect identification. These results establish rigid-body multi-frame temporal integration as an effective strategy for high-throughput, non-contact 3-D inspection of small objects in motion. Full article
Show Figures

Figure 1

26 pages, 5946 KB  
Article
Intelligent Recognition and Restoration of Mural Damage Based on DeepLabv3 and Stable Diffusion
by Chong Rong, Dashuai Yang, Wenkai Tian, Yi Tao, Qiuwei Wang and Peng Wang
Buildings 2026, 16(10), 2012; https://doi.org/10.3390/buildings16102012 - 20 May 2026
Viewed by 222
Abstract
Murals are not merely independent visual artworks. Rather, they are an integral part of architectural heritage, directly attached to buildings’ structural elements, such as brick walls and vaults. However, murals are susceptible to various building-related types of damage, including structural cracks and moisture-induced [...] Read more.
Murals are not merely independent visual artworks. Rather, they are an integral part of architectural heritage, directly attached to buildings’ structural elements, such as brick walls and vaults. However, murals are susceptible to various building-related types of damage, including structural cracks and moisture-induced peeling, due to long-term exposure to environmental factors and geological changes. As the progressive deterioration of these murals hastens the loss of mural value, professional assessment and restoration are urgently required. To tackle the issues of low efficiency in traditional structural damage detection and the absence of predictable repair plans, this paper presents a semi-automatic building-mural protection solution that integrates morphological assessment of mural deterioration with computer vision technology. This study establishes an image prediction system that integrates intelligent damage identification with virtual restoration. First, employing the PaddleSeg deep learning framework and the DeepLabv3 semantic segmentation model, this study used existing mural damage datasets to build a recognition model. The model allows for intelligent identification and labeling of multiple damage types. Subsequently, relying on the ComfyUI platform, Stable Diffusion was used to construct a virtual restoration model. LoRA (low-rank adaptation) technology was introduced to fine-tune the model specifically for the mural style, thus enhancing the directivity and accuracy of virtual restoration. Finally, by applying the results of the recognition model to the virtual restoration model, this study built an integrated system for mural damage diagnosis and virtual restoration. The results show that the damage recognition model achieved a mean intersection over union (mIoU) of 47.8% and a pixel accuracy of 77.97% on the test set, validating the feasibility of using semantic segmentation for mural damage detection. This study presents an integrated workflow framework integrating automatic damage identification and intelligent repair. As an expert-assisted tool, this framework shows application potential for preliminary exploration of mural disease diagnosis and virtual restoration plans, providing technical references for the digital protection of cultural heritage. Full article
Show Figures

Figure 1

20 pages, 7406 KB  
Article
Convolutional Neural Network for Specimen-Invariant Structural Health Monitoring of FRC Under Flexural Loading
by George M. Sapidis, Ioannis Kansizoglou, Maria C. Naoum, Nikos A. Papadopoulos, Konstantinos A. Tsintotas, Maristella E. Voutetaki and Antonios Gasteratos
Sensors 2026, 26(9), 2788; https://doi.org/10.3390/s26092788 - 29 Apr 2026
Viewed by 729
Abstract
Reinforced Concrete (RC) structures experience progressive degradation over their service life due to mechanical loading and environmental exposure, leading to reduced bearing capacity and compromised structural safety. Incorporating discrete fibers into concrete mitigates crack propagation and enhances ductility, resulting in fiber-reinforced concrete (FRC) [...] Read more.
Reinforced Concrete (RC) structures experience progressive degradation over their service life due to mechanical loading and environmental exposure, leading to reduced bearing capacity and compromised structural safety. Incorporating discrete fibers into concrete mitigates crack propagation and enhances ductility, resulting in fiber-reinforced concrete (FRC) with superior fracture energy, durability, and sustainability characteristics. Despite these advantages, research on Structural Health Monitoring (SHM) techniques for FRC elements remains limited. The Electromechanical Impedance (EMI) method, which exploits piezoelectric transducers as both actuators and sensors, offers high sensitivity for detecting early-stage damage by monitoring variations in local mechanical impedance. This study investigates the effectiveness of a deep learning-enabled EMI framework for assessing the structural condition of FRC beams under flexural loading. A one-dimensional convolutional neural network (1D-CNN) is proposed to automatically extract salient features from high-frequency EMI signatures and classify structural health into three predefined states. The model is rigorously evaluated using specimen-invariant validation to ensure generalization across different FRC specimens, addressing a critical limitation of conventional cross-validation approaches in SHM research. Experimental tests on FRC beams instrumented with surface-bonded PZT transducers provide a dataset of 264 EMI responses for training and validation, enabling direct comparison between common and specimen-invariant validation schemes. The results demonstrate the superior robustness of the specimen-invariant approach and confirm the capability of the proposed 1D-CNN to identify flexural damage progression in FRC elements accurately. An ablation study further highlights the contribution of each architectural component to overall model performance. The findings underscore the potential of integrating EMI-based sensing with advanced deep learning models for reliable, automated, and scalable SHM of next-generation resilient concrete infrastructures. Full article
(This article belongs to the Special Issue Sensor-Based Structural Health Monitoring of Civil Infrastructure)
Show Figures

Figure 1

24 pages, 5736 KB  
Article
Improved Parameter-Driven Automated Three-Class Segmentation for Concrete CT: A Reproducible Pipeline for Large-Scale Dataset Production
by Youxi Wang, Tianqi Zhang and Xinxiao Chen
Buildings 2026, 16(8), 1620; https://doi.org/10.3390/buildings16081620 - 20 Apr 2026
Viewed by 329
Abstract
The automated production of large-scale labeled datasets from concrete X-ray computed tomography (CT) images is a fundamental prerequisite for training and validating deep learning-based segmentation models. However, existing methods either require extensive manual annotation or rely on domain-specific deep learning models that themselves [...] Read more.
The automated production of large-scale labeled datasets from concrete X-ray computed tomography (CT) images is a fundamental prerequisite for training and validating deep learning-based segmentation models. However, existing methods either require extensive manual annotation or rely on domain-specific deep learning models that themselves demand labeled data—a circular dependency. This paper presents a parameter-driven three-class segmentation framework that automatically classifies each pixel in a concrete CT slice into one of three material phases: void (air pores and cracks), coarse aggregate, and mortar matrix, generating annotation masks suitable for large-scale dataset production without manual labeling. The proposed method combines: (1) fixed-threshold void detection calibrated to concrete CT grayscale characteristics; (2) adaptive percentile-based initial segmentation responsive to image-specific statistics; (3) multi-criteria connected component scoring based on area, shape descriptors (circularity, solidity, compactness, extent, aspect ratio), intensity distribution, and boundary gradient; (4) material science-informed size constraints aligned with concrete phase volume fractions; and (5) a material continuity enforcement module that applies topological hole-filling and conditional convex-hull consolidation to eliminate internal contamination within accepted aggregate regions, reducing boundary roughness by 7.6% and recovering misclassified boundary pixels. All parameters are centralized in a configuration file, enabling reproducible batch processing of 224 × 224 pixel CT slices at 0.07–1.12 s per image. Evaluated on 1007 224 × 224 concrete CT patches cropped from 200 representative scan frames, the framework produces three-class segmentation masks with physically consistent void fractions (mean 3.2%), aggregate fractions (mean 32.4%), and mortar fractions (mean 64.4%), all within ranges reported in the concrete CT literature (used as a dataset-scale QC screen, not a validation metric). Primary outputs and the archived image–mask pairs for this work are provided as an 8-bit patch archive. For pixel-wise validation, we report IoU, Dice, and pixel accuracy on an independently labeled subset that can be unambiguously paired with the released predictions: averaged over 57 matched patches, mean pixel accuracy is 88.6%, macro-mean IoU is 74.7%, and macro-mean Dice is 84.9%. The framework provides a fully automated annotation pipeline for dataset production, eliminating manual labeling costs for concrete CT image collections. The generated datasets are suitable for training semantic segmentation networks such as U-Net and its variants. Full article
(This article belongs to the Section Building Materials, and Repair & Renovation)
Show Figures

Figure 1

26 pages, 10629 KB  
Article
LRD-DETR: A Lightweight RT-DETR-Based Model for Road Distress Detection
by Chen Dong and Yunwei Zhang
Sensors 2026, 26(8), 2375; https://doi.org/10.3390/s26082375 - 12 Apr 2026
Viewed by 543
Abstract
Intelligent road distress detection technology has emerged as an important research topic in the field of highway maintenance. However, the accuracy and practicality of pavement distress detection are constrained by multiple factors, primarily including the irregular shapes of distress, the tendency for fine [...] Read more.
Intelligent road distress detection technology has emerged as an important research topic in the field of highway maintenance. However, the accuracy and practicality of pavement distress detection are constrained by multiple factors, primarily including the irregular shapes of distress, the tendency for fine cracks to be overlooked, and the high parameter count of detection models that makes deployment difficult. Therefore, this study proposes a lightweight road distress detection model based on an improved RT-DETR architecture—LRD-DETR. First, this work integrates the C2f-LFEM module with the ADown adaptive down-sampling strategy into the backbone network, significantly reducing the number of model parameters and computational load while effectively enhancing the representation capacity of multi-scale pavement distress features. Second, a frequency-domain spatial attention is embedded in the S4 feature layer, where synergistic integration of frequency-domain filtering and spatial attention enables detail enhancement of distress edges and contours, automatically focuses on the distress regions, and suppresses background interference. The polarity-aware linear attention is incorporated into the S5 feature layer, by explicitly modeling polarity interactions, it effectively captures textural discrepancies between damaged regions and the intact road surface, and a learnable power function dynamically rescales attention weights to strengthen distress-specific feature responses. Finally, a cross-scale spatial feature fusion module (CSF2M) is developed to reconstruct and fuse multi-level spatial featurez, thereby improving detection robustness for pavement distresses with diverse morphologies under complex background conditions. Quantitative experiments indicate that, in contrast with the baseline RT-DETR, the presented framework improves the F1-score by 7.1% and mAP@50 by 9.0%, while reducing computational complexity and parameter quantity by 43.8% and 38.0%, respectively. These advantages enable LRD-DETR to be suitably deployed on resource-limited embedded platforms for real-time road distress detection. Full article
(This article belongs to the Special Issue AI and Smart Sensors for Intelligent Transportation Systems)
Show Figures

Figure 1

50 pages, 7780 KB  
Systematic Review
Intelligent Eyes on Buildings: A Scientometric Mapping and Systematic Review of AI-Based Crack Detection and Predictive Diagnostics of Building Structures
by Mehdi Mohagheghi, Ali Bahadori-Jahromi and Shah Room
Encyclopedia 2026, 6(4), 75; https://doi.org/10.3390/encyclopedia6040075 - 27 Mar 2026
Viewed by 1145
Abstract
Artificial Intelligence (AI)-based crack detection in buildings uses computer vision and deep learning to automatically identify structural cracks from inspection images. In recent years, many studies have explored this topic, but the overall development of the field, its methodological practices, and the remaining [...] Read more.
Artificial Intelligence (AI)-based crack detection in buildings uses computer vision and deep learning to automatically identify structural cracks from inspection images. In recent years, many studies have explored this topic, but the overall development of the field, its methodological practices, and the remaining challenges are still not fully clear. Unlike most previous reviews that focus mainly on technical methods, this study combines a large-scale scientometric mapping of the research field with a focused technical analysis of recent AI-based crack detection methods specifically applied to building structures. This study therefore provides a dual-layer review covering research published between 2015 and 2025. A total of 146 Scopus-indexed publications were analysed using Visualization of Similarities viewer (VOSviewer) to examine publication growth, thematic evolution, collaboration patterns, and citation structures. In addition, a focused technical review of 36 highly relevant studies was carried out to analyse task formulations, model families, datasets, evaluation protocols, and methodological practices. The results show a rapid increase in research activity after 2020, largely driven by advances in deep-learning and Unmanned Aerial Vehicle (UAV)-based inspections. At the same time, collaboration networks remain uneven, and citation influence is concentrated in a limited number of research communities. The technical review further shows that most studies focus on detection-level tasks, particularly You Only Look Once (YOLO)-based models, while predictive diagnostics, automated inspection reporting, and decision-oriented Structural Health Monitoring (SHM) are still rarely addressed. Current datasets and evaluation protocols also remain mostly perception-oriented, which makes it difficult to assess robustness, generalisability and long-term predictive capability. Full article
Show Figures

Figure 1

19 pages, 4237 KB  
Article
Intelligent Measurement of Concrete Crack Width Based on U-Net Deep Learning and Binocular Vision 3D Reconstruction
by Dedong Xiao, Gaoxin Wang, Kai Wang, Shukui Liu, Guangbin Shang, Qi-Ang Wang, Xiaohua Fan, Minghui Hu, Richeng Liu, Guozhao Chen and Zhihao Chen
Appl. Sci. 2026, 16(5), 2355; https://doi.org/10.3390/app16052355 - 28 Feb 2026
Viewed by 551
Abstract
The concrete cracking problem can seriously affect the durability and safety of civil structures. Accurately and quickly measuring the width of concrete cracks can help control defect development in a timely manner. Current research mainly relies on pixel detection of two-dimensional images, which [...] Read more.
The concrete cracking problem can seriously affect the durability and safety of civil structures. Accurately and quickly measuring the width of concrete cracks can help control defect development in a timely manner. Current research mainly relies on pixel detection of two-dimensional images, which lacks real three-dimensional information about crack lesions. Detection results are also obviously affected by various factors, such as shooting distance and posture, resulting in poor accuracy. Therefore, this paper presents an engineering-integrated solution that combines U-Net-based crack segmentation with binocular vision 3D reconstruction. The focus is placed on the practical deployment of the integrated pipeline, the optimization of key parameters under real inspection conditions, and the experimental validation of measurement accuracy on actual concrete cracks. Firstly, the U-Net deep learning algorithm is used to automatically identify and segment the concrete crack region; then, a binocular vision-based 3D reconstruction pipeline is adopted, and a parallax rejection algorithm based on a “double-threshold” decision is proposed to improve the fidelity of crack disparity maps, and the effect of the filter window size on the concrete crack region is analyzed; finally, an intelligent measurement method based on the 3D reconstruction model is proposed, and the measurement results of concrete crack width can be calculated directly from the 3D reconstruction model. The results show that (1) the model can identify the characteristics of the crack, and the detection effect at 4:00 p.m. is the best, because at this time the light is more uniform with less shadow and moderate contrast between the crack and its background; (2) the reconstruction of the 3D point cloud model of the concrete crack with a filtering window of size 9 × 9 is the best; (3) the maximum error between the calculated and measured values of crack width is 0.31mm, the minimum error is 0.07mm, and the average error is 0.15 mm, which indicates that the measurement accuracy reaches the sub-millimetre level and verifies the validity of the proposed method in this paper. Full article
(This article belongs to the Section Civil Engineering)
Show Figures

Figure 1

18 pages, 4072 KB  
Article
Classification and Contour Recognition of Welding Defects in Magneto-Optical Images
by Nvjie Ma, Guoying Zhang, Huazhuo Liang, Shichao Gu, Congyi Wang, Yanxi Zhang and Xiangdong Gao
Metals 2026, 16(3), 267; https://doi.org/10.3390/met16030267 - 28 Feb 2026
Cited by 1 | Viewed by 390
Abstract
In the field of magneto-optical imaging nondestructive testing for welding defects, multi-angle detection of welding defects has already been achieved. However, research on automatic defect recognition and contour extraction remains insufficient. Therefore, to enable automatic detection of welding defects using magneto-optical imaging technology, [...] Read more.
In the field of magneto-optical imaging nondestructive testing for welding defects, multi-angle detection of welding defects has already been achieved. However, research on automatic defect recognition and contour extraction remains insufficient. Therefore, to enable automatic detection of welding defects using magneto-optical imaging technology, it is essential to address the key issues of defect recognition and contour extraction in magneto-optical images. The dataset in this article includes five types of images: defect-free, lack-of-fusion, cracks, pits, and Weld reinforcement. Firstly, the Mask R-CNN detection method is used to perform defect recognition and contour segmentation on the original magneto-optical image dataset. The detection results indicate that the recognition rate of lack-of-fusion and Weld reinforcement in the original magneto-optical image is not high, and the recognition accuracy of pits and cracks is extremely low. Subsequently, the magneto-optical image dataset was preprocessed using the differential level set method, and the mask R-CNN algorithm was used to identify defect types and segment defect contours. Comparing the results of two experiments, it was found that the detection accuracy of the preprocessed dataset was higher, and the overall recognition accuracy increased by 30%. Full article
Show Figures

Figure 1

18 pages, 4195 KB  
Article
WeldSimAM and EnNWD Co-Optimization: Enhancing Lightweight YOLOv11 for Multi-Scale Weld Defect Detection
by Wenquan Huang, Qing Cheng and Jing Zhu
Technologies 2026, 14(3), 140; https://doi.org/10.3390/technologies14030140 - 26 Feb 2026
Viewed by 752
Abstract
In the context of Industry 4.0, reliable automatic inspection of weld surface defects is critical for structural safety, yet current deep learning-based detectors struggle with the extreme scale variation and anisotropic shapes characteristic of weld flaws such as pores, cracks, and lack of [...] Read more.
In the context of Industry 4.0, reliable automatic inspection of weld surface defects is critical for structural safety, yet current deep learning-based detectors struggle with the extreme scale variation and anisotropic shapes characteristic of weld flaws such as pores, cracks, and lack of fusion. Existing YOLO-family models, although effective on general-purpose datasets, often fail to robustly localize tiny defects and long, slender discontinuities while remaining lightweight enough for industrial edge deployment. A critical research gap lies in the lack of task-specific optimization for weld defects: standard attention mechanisms are isotropic and cannot capture linear defect continuity, while existing loss functions ignore scale disparity between tiny pores (area < 100 pixels2) and large incomplete fusion defects (area > 5000 pixels2), leading to unstable regression. Here, we propose a dual-optimized lightweight YOLOv11 framework tailored for weld defect detection that addresses both feature representation and bounding-box regression. Here, we propose a dual-optimized lightweight YOLOv11 framework tailored for weld defect detection that addresses both feature representation and bounding-box regression. First, we introduce WeldSimAM, an enhanced attention module that augments parameter-free SimAM with directional (horizontal/vertical) and channel-wise enhancement to better capture the directional texture of linear weld defects. Second, we develop an Enhanced Normalized Wasserstein Distance (EnNWD) loss, which incorporates scale-disparity penalties and relative-area-based weighting to mitigate sample imbalance and improve regression accuracy for tiny and large-aspect-ratio targets. Validated via 10-fold cross-validation on three datasets (self-built + two public), the method achieves 99.48% mAP@0.5 and 73.29% mAP@0.5:0.95, outperforming YOLOv11 by 0.13 and 3.76 percentage points (p < 0.01, two-tailed t-test), with 5.21 MB and 132 FPS on NVIDIA RTX 4090. It also surpasses non-YOLO SOTA methods (e.g., EfficientDet-Lite3) by 3.8–5.5 percentage points in mAP@0.5 (p < 0.05), offering a practical real-time solution for industrial inspection. Full article
(This article belongs to the Section Manufacturing Technology)
Show Figures

Figure 1

22 pages, 5222 KB  
Article
A Two-Stage Concrete Crack Segmentation Method Based on the Improved YOLOv11 and Segment Anything Model
by Ru Zhang, Chaodong Guan, Yi Fang, Yuanfeng Duan and Xiaodong Sui
Buildings 2026, 16(4), 794; https://doi.org/10.3390/buildings16040794 - 14 Feb 2026
Cited by 1 | Viewed by 878
Abstract
During long-term service, concrete structures are exposed to various adverse factors, which often lead to the formation of numerous surface cracks. These cracks pose serious threats to structural safety and durability. Therefore, accurately identifying crack characteristics is essential for evaluating the service performance [...] Read more.
During long-term service, concrete structures are exposed to various adverse factors, which often lead to the formation of numerous surface cracks. These cracks pose serious threats to structural safety and durability. Therefore, accurately identifying crack characteristics is essential for evaluating the service performance of concrete structures. A two-stage concrete crack segmentation method is presented in this study. The crack is initially located by the improved YOLOv11 that integrates three novel modules, namely Multi-scale Edge Information Enhancement, Efficient-Detection, and P2-Level Feature Integration, to form the MEP-YOLOv11 model. Then, the detected region is taken as input prompts for Segment Anything Model (SAM) to achieve precise crack segmentation. This approach eliminates the need for manual prompting in SAM, enabling automatic crack feature identification. The average Accuracy, precision, and Intersection over Union (IoU) for crack segmentation are 95.98%, 92.60%, and 0.77, respectively. To further enhance the robustness of the two-stage segmentation method under non-uniform illumination conditions, a mask re-input strategy is introduced. The crack mask generated by SAM using bounding-box prompts is fed back into SAM to guide a second round of segmentation. Experimental results demonstrate that the improved method maintains high segmentation performance, with an average Accuracy of 92.38%, precision of 85.70%, and IoU of 0.64. Overall, the proposed method meets engineering requirements for high-precision and efficient crack detection and segmentation, showing strong potential for practical inspection tasks. Full article
Show Figures

Figure 1

25 pages, 9322 KB  
Article
Study on Image Processing Algorithm for Post-Earthquake Bridge Crack Detection Based on Improved Retinex and Wavelet Transform
by Xiaoyan Yang, Changjiang Liu, Shaoping Luo and Zhonglin Li
Buildings 2026, 16(4), 713; https://doi.org/10.3390/buildings16040713 - 9 Feb 2026
Viewed by 462
Abstract
Post-earthquake bridge crack detection is a critical step in assessing structural safety. Traditional manual detection of bridge cracks is time-consuming, labor-intensive, and poses significant risks. This paper focuses on the automatic identification of structural cracks by analyzing their morphology, orientation, and distribution characteristics, [...] Read more.
Post-earthquake bridge crack detection is a critical step in assessing structural safety. Traditional manual detection of bridge cracks is time-consuming, labor-intensive, and poses significant risks. This paper focuses on the automatic identification of structural cracks by analyzing their morphology, orientation, and distribution characteristics, and preliminarily distinguishes them from non-structural damages such as surface stains and coating peeling. Therefore, this paper proposes a bridge crack recognition algorithm based on image processing. First, the input crack image undergoes preprocessing to obtain a binary image, reducing measurement errors caused by environmental factors or uneven illumination, using an improved Retinex algorithm to enhance image brightness. Second, an improved wavelet transform method is employed to remove large-area noise. Then, connected component analysis is used to filter out point-like and patch-like noise, resulting in a complete and clear crack skeleton. Finally, the crack length, width, and other characteristic values are obtained using an image pixel coordinate calculation method, achieving non-contact, non-destructive measurement of concrete surface crack characteristics. The algorithm is based on two-dimensional image processing and does not directly measure crack depth, but the extracted parameters such as length, width, and area ratio provide important surface-based evidence for rapid post-earthquake bridge structural safety assessment. Multiple experimental results show that the proposed algorithm has a maximum width measurement relative error of less than 2.3%, a length measurement relative error within 8%, and an average peak signal-to-noise ratio (PSNR) of the denoised image increased to 74.73 dB. This algorithm provides an effective automated detection tool for rapid post-earthquake bridge safety assessment. Full article
(This article belongs to the Section Building Structures)
Show Figures

Figure 1

22 pages, 1366 KB  
Systematic Review
Inspection and Evaluation of Urban Pavement Deterioration Using Drones: Review of Methods, Challenges, and Future Trends
by Pablo Julián López-González, David Reyes-González, Oscar Moreno-Vázquez, Rodrigo Vivar-Ocampo, Sergio Aurelio Zamora-Castro, Lorena del Carmen Santos Cortés, Brenda Suemy Trujillo-García and Joaquín Sangabriel-Lomelí
Future Transp. 2026, 6(1), 10; https://doi.org/10.3390/futuretransp6010010 - 4 Jan 2026
Cited by 2 | Viewed by 1726
Abstract
The rapid growth of urban areas has increased the need for more efficient methods of pavement inspection and maintenance. However, conventional techniques remain slow, labor-intensive, and limited in spatial coverage, and their performance is strongly affected by traffic, weather conditions, and operational constraints. [...] Read more.
The rapid growth of urban areas has increased the need for more efficient methods of pavement inspection and maintenance. However, conventional techniques remain slow, labor-intensive, and limited in spatial coverage, and their performance is strongly affected by traffic, weather conditions, and operational constraints. In response to these challenges, it is essential to synthesize the technological advances that improve inspection efficiency, coverage, and data quality compared to traditional approaches. Herein, we present a systematic review of the state of the art on the use of unmanned aerial vehicles (UAVs) for monitoring and assessing pavement deterioration, highlighting as a key contribution the comparative integration of sensors (photogrammetry, LiDAR, and thermography) with recent automatic damage-detection algorithms. A structured review methodology was applied, including the search, selection, and critical analysis of specialized studies on UAV-based pavement evaluation. The results indicate that UAV photogrammetry can achieve sub-centimeter accuracy (<1 cm) in 3D reconstructions, LiDAR systems can improve deformation detection by up to 35%, and AI-based algorithms can increase crack-identification accuracy by 10% to 25% compared with manual methods. Finally, the synthesis shows that multi-sensor integration and digital twins offer strong potential to enhance predictive maintenance and support the transition towards smarter and more sustainable urban infrastructure management strategies. Full article
Show Figures

Figure 1

23 pages, 5850 KB  
Article
Durability Assessment of Marine Steel-Reinforced Concrete Using Machine Vision: A Case Study on Corrosion Damage and Geometric Deformation in Shield Tunnels
by Yanzhi Qi, Xipeng Wang, Zhi Ding and Yaozhi Luo
Buildings 2026, 16(1), 107; https://doi.org/10.3390/buildings16010107 - 25 Dec 2025
Cited by 1 | Viewed by 537
Abstract
The rapid urbanization of coastal regions has intensified the demand for durable underground infrastructure like shield tunnels, where reinforced concrete (RC) structures are critical yet susceptible to long-term degradation in marine environments. This study develops an integrated machine vision-based framework for assessing the [...] Read more.
The rapid urbanization of coastal regions has intensified the demand for durable underground infrastructure like shield tunnels, where reinforced concrete (RC) structures are critical yet susceptible to long-term degradation in marine environments. This study develops an integrated machine vision-based framework for assessing the long-term durability of RC in marine shield tunnels by synergistically combining point cloud analysis and deep learning-based damage recognition. The methodology involves preprocessing tunnel point clouds to extract the centerline and cross-sections, enabling the quantification of geometric deformations, including segment misalignment and elliptical distortion. Concurrently, an advanced YOLOv8 model is employed to automatically identify and classify surface corrosion damages—specifically water leakage, cracks, and spalling—from images, achieving high detection accuracies (e.g., 95.6% for leakage). By fusing the geometric indicators with damage metrics, a quantitative risk scoring system is established to evaluate structural durability. Experimental results on a real-world tunnel segment demonstrate the framework’s effectiveness in correlating surface defects with underlying geometric irregularities. This integrated approach offers a data-driven solution for the continuous health monitoring and residual life prediction of RC tunnel linings in marine conditions, bridging the gap between visual inspection and structural performance assessment. Full article
Show Figures

Figure 1

Back to TopTop