Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (816)

Search Parameters:
Keywords = image crack detection

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
33 pages, 42918 KB  
Article
Intelligent Detection and Preventive Conservation of Surface Deterioration for Chaoshan Overseas-Chinese Residences in the Humid Coastal Lingnan Region Under Disaster-Prone Weather Conditions: A Case Study of Yingchuan Shijia
by Tukun Wang, Jingyang Li, Zeyao Kang, Yucheng Ou and Xi Wang
Buildings 2026, 16(12), 2459; https://doi.org/10.3390/buildings16122459 (registering DOI) - 22 Jun 2026
Abstract
The humid coastal Lingnan region of South China, including the Chaoshan area of eastern Guangdong, is frequently exposed to disaster-prone weather conditions such as high humidity, typhoon-related winds, heavy rainfall, and salt-laden coastal air. These long-term environmental exposures may contribute to surface deterioration [...] Read more.
The humid coastal Lingnan region of South China, including the Chaoshan area of eastern Guangdong, is frequently exposed to disaster-prone weather conditions such as high humidity, typhoon-related winds, heavy rainfall, and salt-laden coastal air. These long-term environmental exposures may contribute to surface deterioration risks of architectural heritage. Located in Shantou, Yingchuan Shijia has shown five visible surface deterioration types—cracks, staining, saltpetering, plants, and spalling—under the combined influence of environmental exposure, material aging, previous disturbance, and insufficient maintenance. To address the limitations of manual inspection, this study explores a conservation-oriented intelligent workflow integrating YOLO-based detection, digital documentation, and screening-level conservation interpretation. Digital documentation used UAV imagery, mobile LiDAR scanning, measured drawings, and SketchUp-based three-dimensional modeling. The dataset was built in three stages: a 99-image preliminary dataset, where YOLOv8 showed only basic learning capability with low performance metrics, including Precision of 33.0 ± 3.0%, Recall of 28.0 ± 1.0%, mAP50 of 25.0 ± 1.0%, and mAP50-95 of 11.0 ± 1.0%; a 362-image non-augmented case-study dataset, where YOLOv8 still showed limited performance, with mAP50 of 20.0 ± 1.0% and mAP50-95 of 8.0 ± 1.0%; and a final YOLO-format case-study dataset of 2000 images after training-set-only augmentation using 11 geometric and photometric transformation methods. After augmentation, YOLOv8 mAP50 increased to 62.0 ± 2.0%. Under the same augmented-data condition, YOLOv13 showed Precision of 89.0 ± 1.0%, Recall of 77.0 ± 1.0%, mAP50 of 84.0 ± 1.0%, and mAP50-95 of 65.0 ± 1.0%, indicating relatively higher validation performance than YOLOv8. In the normalized confusion matrix, the background missed-detection values for cracks and saltpetering were 0.29 and 0.22, respectively, indicating that weak-feature and low-contrast deterioration types remained challenging. Based on YOLOv13, a mini program was developed to organize detection outputs and provide field-oriented preliminary conservation hints. Overall, this study provides a preliminary workflow linking digital collection, image-based deterioration detection, Grad-CAM visualization, and assisted field recording for the preventive conservation of Chaoshan overseas-Chinese residences in humid coastal heritage environments. Full article
Show Figures

Figure 1

20 pages, 5609 KB  
Article
Enhanced YOLO11n for UAV-Based Surface Crack Detection in Mining Subsidence Areas
by Mo Wang, Nan Zhao, Chuangchuang Liu, Wanxiang Rao and Zhijun Zhang
Processes 2026, 14(12), 1988; https://doi.org/10.3390/pr14121988 (registering DOI) - 18 Jun 2026
Viewed by 188
Abstract
Mining-subsidence-induced surface cracks pose substantial risks to ecological systems, infrastructure stability, and mining safety. Their thin, elongated, discontinuous, and low-contrast characteristics make accurate detection from unmanned aerial vehicle (UAV) imagery challenging, particularly under complex environmental conditions. This study proposes an enhanced YOLO11n framework [...] Read more.
Mining-subsidence-induced surface cracks pose substantial risks to ecological systems, infrastructure stability, and mining safety. Their thin, elongated, discontinuous, and low-contrast characteristics make accurate detection from unmanned aerial vehicle (UAV) imagery challenging, particularly under complex environmental conditions. This study proposes an enhanced YOLO11n framework for detecting surface cracks in mining subsidence areas. Switchable Atrous Convolution (SAConv) was incorporated to strengthen multi-scale feature extraction, while Cascaded Group Attention (CGA) was introduced to suppress background interference and improve feature discrimination, and Shape-IoU loss was adopted to enhance the localization of slender crack targets. The model was evaluated using 5000 annotated UAV images collected in the Zhungeer mining area. It achieved a precision of 85.6%, a recall of 77.9%, an mAP@0.5 of 84.3%, and an F1-score of 81.6%. Compared with the baseline YOLO11n, precision, recall, and mAP@0.5 increased by 1.4, 4.6, and 3.2 percentage points, respectively. Cross-dataset evaluation on the public Crack500 dataset further demonstrated improved robustness under domain variation. These results indicate that the proposed framework improves the detection and localization of slender and discontinuous cracks in complex mining environments, supporting its application in UAV-based geological hazard monitoring. Full article
Show Figures

Figure 1

20 pages, 3119 KB  
Article
Engineering Structure Crack Detection Method Combining TAPFormer Model and Morphological Mask Reasoning Rules
by Hao Peng, Lintao Zhang, Gang Li, Yu Du and Han Wu
Buildings 2026, 16(12), 2419; https://doi.org/10.3390/buildings16122419 - 17 Jun 2026
Viewed by 170
Abstract
To address challenges such as complex background interference, limited long-range modeling capabilities of CNNs, and poor generalization in steel-concrete cross-material scenarios, this study proposes an enhanced detection framework. This framework integrates a TAPFormer with morphological reasoning rules. The method utilizes TAPFormer as the [...] Read more.
To address challenges such as complex background interference, limited long-range modeling capabilities of CNNs, and poor generalization in steel-concrete cross-material scenarios, this study proposes an enhanced detection framework. This framework integrates a TAPFormer with morphological reasoning rules. The method utilizes TAPFormer as the backbone network. It captures global topological features of cracks through a Task-Aware Query mechanism. This approach compensates for the deficiencies of traditional convolutional operators in modeling the continuity of thin and long cracks. Furthermore, a mask reasoning module based on geometric priors is developed to handle unstructured interferences, such as marker pen marks, welds, and concrete holes. This module defines logical criteria, including edge curvature consistency, axial aspect ratios, and endpoint extension directions. These criteria are used to perform topological repair and filter false positives in the initial segmentation masks. A hybrid dataset containing 4500 cross-material damage images was used for validation. The results show that the proposed method achieves a mean IoU of 86.72% and an F1-score of 90.36%. Notably, the method filters over 91.0% of false positives caused by manual marker pen marks in interference-rich scenarios. Compared to mainstream state-of-the-art models, the IoU improves by at least 5.48%. The results show that the proposed framework improves the robustness and logical self-consistency of crack identification in complex engineering environments. Full article
(This article belongs to the Special Issue Advances in Building Structure Analysis and Health Monitoring)
Show Figures

Figure 1

24 pages, 5867 KB  
Article
Integrated Fault Diagnosis in Grid-Connected PV Systems: Synergizing Infrared Thermography and Advanced Signal Processing
by Filippo Laganà, Danilo Pratticò, Luigi Bibbò, Salvatore A. Pullano and Salvatore Calcagno
Appl. Sci. 2026, 16(12), 6036; https://doi.org/10.3390/app16126036 - 15 Jun 2026
Viewed by 122
Abstract
Early identification of thermal and electrical anomalies in grid-connected photovoltaic (PV) systems is becoming increasingly important to reduce energy losses, limit power quality (PQ) degradation, and avoid excessive operating stress on power electronic converters. Conventional electrical monitoring methods can provide overall performance information, [...] Read more.
Early identification of thermal and electrical anomalies in grid-connected photovoltaic (PV) systems is becoming increasingly important to reduce energy losses, limit power quality (PQ) degradation, and avoid excessive operating stress on power electronic converters. Conventional electrical monitoring methods can provide overall performance information, but they are generally unable to detect and localize early-stage defects occurring at module or cell level. In this context, the present study proposes an integrated diagnostic framework that combines non-destructive infrared thermography (IRT) with advanced electrical signal processing techniques for PV condition monitoring. The proposed approach correlates thermographic information, capable of revealing defects such as hotspots, cell cracks, and bypass diode failures, with high-frequency electrical signal analysis based on frequency-domain and time–frequency methods, together with deep learning-driven thermographic segmentation. By associating thermal acquisitions with electrical PQ indicators, the framework enables the early detection of physical defects linked to inefficient Maximum Power Point Tracking (MPPT) operation and progressive degradation of PV system performance. The methodology was experimentally validated on a grid-connected photovoltaic installation under different fault conditions, including hotspots, bypass diode anomalies, and localized overheating effects, demonstrating the potential of the proposed approach for predictive maintenance and intelligent PV monitoring applications. The obtained results indicate that the proposed framework improves the reliability of photovoltaic fault detection by combining thermographic inspection with advanced electrical signal analysis and AI-based defect interpretation, thus supporting predictive maintenance strategies in smart PV infrastructures. The proposed approach demonstrates image segmentation capabilities, as evidenced by a precision (PA) of 96.88%, a mean IoU (mIoU) of 77.83% and a macro F1-score of 87.47%. The proposed framework maintained reduced computational requirements compatible with real-time monitoring applications. Full article
(This article belongs to the Special Issue Fault Diagnosis and Condition Monitoring of Power Electronics Systems)
Show Figures

Figure 1

30 pages, 11756 KB  
Article
Real-Time Crack Segmentation and Geometric Parameter Calculation of Mandrel Bars Based on an Improved YOLO Framework
by Jianzhao Cao, Zhu Sun, Jingguo Ding and Xu Li
Metals 2026, 16(6), 657; https://doi.org/10.3390/met16060657 - 14 Jun 2026
Viewed by 228
Abstract
Surface cracks on mandrel bars affect product quality and production stability in seamless steel pipe manufacturing. Existing vision-based methods mainly rely on bounding-box detection, which is insufficient for precise crack delineation and geometric characterization. This study proposes a lightweight segmentation framework for online [...] Read more.
Surface cracks on mandrel bars affect product quality and production stability in seamless steel pipe manufacturing. Existing vision-based methods mainly rely on bounding-box detection, which is insufficient for precise crack delineation and geometric characterization. This study proposes a lightweight segmentation framework for online mandrel bar crack inspection using grayscale industrial images. Based on YOLO11n-seg, the framework incorporates single-channel input adaptation, lightweight network reconfiguration, and crack-oriented feature enhancement to improve the extraction of weak, thin, and irregular cracks while reducing computational cost. A dedicated industrial dataset and a sample-balancing strategy are introduced to alleviate severe crack–background imbalance. Based on the predicted pixel-level masks, crack area, projected length, maximum width, and average width are calculated for online evaluation. Experimental results show that the proposed method achieves a mask mAP@0.5 of 88.5%, a false negative rate of 1.72%, and real-time inference at 204 FPS with 3.01 GFLOPs. Field deployment further demonstrates the effectiveness of the proposed framework for online crack inspection and geometric parameter calculation of mandrel bars. Full article
(This article belongs to the Special Issue Recent Progress in Metal Rolling Processes)
Show Figures

Figure 1

22 pages, 11218 KB  
Article
Image-Assisted Residual Load-Bearing Capacity Assessment of Plain Concrete Beams Using U-Net Crack Segmentation and Phase-Field Simulation
by Simeng Wang, Wen Zhao, Yuanyan Liang and Huiming Wang
Buildings 2026, 16(12), 2334; https://doi.org/10.3390/buildings16122334 - 11 Jun 2026
Viewed by 177
Abstract
Concrete cracks are ubiquitous in practical engineering structures and continuously affect structural safety and durability. Crack images provide important visual evidence of damage evolution; however, crack images alone are insufficient to determine the residual load-bearing capacity of concrete members. Although the development of [...] Read more.
Concrete cracks are ubiquitous in practical engineering structures and continuously affect structural safety and durability. Crack images provide important visual evidence of damage evolution; however, crack images alone are insufficient to determine the residual load-bearing capacity of concrete members. Although the development of deep learning algorithms has significantly improved the automatic detection of concrete surface cracks, most existing methods remain limited to the extraction of crack geometric features and lack a direct connection with mechanical performance. To explore the relationship between image-based crack geometry and mechanical response, this study combines U-Net-based crack segmentation, OpenCV-based crack geometry extraction, and phase-field fracture simulation to establish a preliminary visual–mechanical framework for plain concrete beams. In this framework, surface crack images are first segmented using a U-Net model, and crack length, average width, and propagation path are extracted from the predicted binary masks. The extracted crack length is then used as the primary variable to match the observed crack state with the phase-field crack evolution sequence. Once the corresponding simulation stage is identified, the associated load level and residual load-bearing capacity can be obtained from the simulated load–crack mouth opening displacement (Load–CMOD) response. Through a mixed-mode I–II fracture test, the crack geometric features extracted by deep learning are compared with the phase-field simulation results. The results show that the error in crack length is within 2.5%. Meanwhile, the relative error between the simulated peak load and the experimental value was 1.57%, which preliminarily verified the correlation between image-based crack information and the load-bearing capacity of plain concrete beams. The method is further applied to a Mode I fracture test without recorded load-bearing capacity data. By mapping the crack length identified from the image, namely 36.89 mm, to the phase-field evolution sequence, the load-bearing capacity of the member at this stage is estimated to be 74.4% of the peak load. The results indicate that the crack geometry extracted from images can be correlated with phase-field crack evolution, thereby supporting preliminary residual load-bearing capacity assessment of plain concrete beams. However, the proposed framework should be regarded as a case-level feasibility study rather than a general structural assessment method. Before broader engineering application, further validation using synchronized crack image sequences, crack mouth opening displacement (CMOD) measurements, and load records is required. Full article
(This article belongs to the Section Building Structures)
Show Figures

Figure 1

23 pages, 4565 KB  
Article
Application of G–L Fractional-Order Differentiation in Wood Veneer Defect Image Enhancement
by Jun Zhang, Wenqi Ma, Jiagui Wang and Guodong Wu
Fractal Fract. 2026, 10(6), 392; https://doi.org/10.3390/fractalfract10060392 - 6 Jun 2026
Viewed by 233
Abstract
Image enhancement is of pivotal importance in the detection of defects in wood veneers. However, acquired images frequently exhibit signs of blurring, uneven illumination, and insufficient contrast, which can lead to a reduction in the accuracy of defect recognition. In this study, an [...] Read more.
Image enhancement is of pivotal importance in the detection of defects in wood veneers. However, acquired images frequently exhibit signs of blurring, uneven illumination, and insufficient contrast, which can lead to a reduction in the accuracy of defect recognition. In this study, an algorithm based on Grünwald–Letnikov (G–L) fractional-order differentiation is proposed for the enhancement of wood veneer defect images. Initially, the gain characteristics of differential amplitude-frequency responses on high- and low-frequency image components are analyzed, and the feasibility of the method is demonstrated by linking these characteristics with the frequency-domain distributions of live knot, dead knot, and crack defects. Secondly, an eight-direction mask operator is constructed based on the G–L definition, and a DC component preservation factor is introduced to eliminate the luminance drift caused by mask truncation. The application of the mask is performed independently on the R, G, and B channels, and a dynamic blending mechanism is designed to achieve a balance between texture enhancement and structural fidelity. Finally, a set of six evaluation metrics (AG, E, PSNR, RMSE, SSIM, and VIF) is employed to assess the quality of enhanced images. The proposed algorithm is then compared with five existing algorithms (SSR, MSR, MSRCR, CLAHE, and AGC) under both noise-free and additive white Gaussian noise conditions. The findings indicate that the G–L fractional-order differentiation algorithm facilitates a more balanced representation of image features, thereby enhancing contrast, brightness, and textural contours. This approach results in more authentic color reproduction and superior visual quality. Full article
(This article belongs to the Special Issue Applications of Fractional-Order Grey Models, 2nd Edition)
Show Figures

Figure 1

41 pages, 4419 KB  
Review
A Review of UAV-Based Crack Detection in Civil Infrastructure: A Multi-Level Visual Analysis Framework, Scene Adaptability, and Challenges
by Yue Bai, Wei Quan, Xuming Shi, Zeyi Yan and Guoliang Yuan
Remote Sens. 2026, 18(11), 1806; https://doi.org/10.3390/rs18111806 - 2 Jun 2026
Viewed by 401
Abstract
Civil infrastructure plays a critical role in ensuring societal safety and economic development. However, structural damages such as cracks inevitably occur during long-term service. Traditional manual inspection methods are insufficient to meet the demands of large-scale and routine monitoring. Unmanned Aerial Vehicles (UAV) [...] Read more.
Civil infrastructure plays a critical role in ensuring societal safety and economic development. However, structural damages such as cracks inevitably occur during long-term service. Traditional manual inspection methods are insufficient to meet the demands of large-scale and routine monitoring. Unmanned Aerial Vehicles (UAV) remote sensing has become an important approach for Structural Health Monitoring (SHM), owing to its high spatial resolution imaging capability and superior operational flexibility. Nevertheless, existing studies focus on optimizing individual algorithms, lacking a systematic analysis oriented toward multi-scenario engineering applications. Therefore, we present a comprehensive review of UAV-based crack detection techniques for infrastructure using remote sensing imagery. First, publicly available datasets, UAV platforms, and evaluation metrics are systematically summarized. Then a multi-level visual analysis framework for UAV inspection is established. The framework categorizes existing methodologies into five levels: image-level classification, object-level detection, pixel-level segmentation, geometric quantification, and three-dimensional (3D) reconstruction, followed by a systematic evaluation of representative methods. Furthermore, the applicability of different methods across diverse scenarios, including bridges, pavements, dams, building facades and wind turbine blades, is systematically explored. Finally, the key challenges and future research directions are discussed. This review aims to provide a systematic theoretical foundation and methodological reference for advancing UAV-based infrastructure crack inspection from algorithm development toward practical multi-scenario engineering applications. Full article
Show Figures

Figure 1

18 pages, 5508 KB  
Article
EMN-Net: A Lightweight YOLOv8-Based Model for Real-Time Surface Defect Detection of Pharmaceutical Tablets
by Jiaxi An, Lujing Zhou, Dianting Liu, Xinpeng Zheng, Zhiyi Zhou and Heng Wang
Algorithms 2026, 19(6), 438; https://doi.org/10.3390/a19060438 - 1 Jun 2026
Viewed by 275
Abstract
Continuous manufacturing has emerged as the prevailing paradigm in the modern pharmaceutical industry, imposing stringent demands for efficient, real-time inspection methods. Furthermore, deploying high-performance deep learning models on industrial edge devices remains challenging due to computational constraints and the difficulty of detecting micro-defects [...] Read more.
Continuous manufacturing has emerged as the prevailing paradigm in the modern pharmaceutical industry, imposing stringent demands for efficient, real-time inspection methods. Furthermore, deploying high-performance deep learning models on industrial edge devices remains challenging due to computational constraints and the difficulty of detecting micro-defects (e.g., micro-cracks and spots). This paper proposes EMN-net, a lightweight defect detection model built upon the YOLOv8n architecture. The proposed algorithm integrates a MobileNetV3 backbone, the Efficient Local Attention (ELA) mechanism and the Normalized Wasserstein Distance (NWD) loss function to balance computational efficiency with sensitivity toward micro-defects. Evaluated on a self-built industrial tablet dataset expanded to 3086 images, EMN-net achieves an mAP50 of 97.8%, representing a 2.5% improvement over the baseline YOLOv8n. the computational complexity is reduced to 4.4 GFLOPs, while the inference throughput reaches 118 FPS, satisfying the real-time requirements of high-speed production lines. Additionally, the model exhibits improved robustness under simulated motion blur and sensor noise. EMN-net presents a balanced automated visual inspection solution for edge devices in continuous pharmaceutical manufacturing. Full article
(This article belongs to the Special Issue Modern Algorithms for Image Processing and Computer Vision)
Show Figures

Figure 1

32 pages, 20674 KB  
Article
MCGC-Net: A Text-Enhanced Geometry-Consistent Network for UAV-Based Road Crack Detection
by Zhoujun Ou, Shicong He, Rongwei Bu, Peng Wang and Gufeng Gong
Sensors 2026, 26(11), 3487; https://doi.org/10.3390/s26113487 - 1 Jun 2026
Viewed by 370
Abstract
With the rapid development of unmanned aerial vehicle (UAV) remote sensing and deep learning, road crack detection has become an important component of road condition assessment and intelligent road maintenance. However, accurately detecting cracks from UAV images remains challenging due to complex background [...] Read more.
With the rapid development of unmanned aerial vehicle (UAV) remote sensing and deep learning, road crack detection has become an important component of road condition assessment and intelligent road maintenance. However, accurately detecting cracks from UAV images remains challenging due to complex background environments, slender crack structures, blurred boundaries, and irregular crack shapes and orientations. Traditional methods that rely solely on visual information often struggle to achieve stable and accurate detection performance under these conditions. To address these challenges, this paper proposes a Multimodal Crack Geometry-Consistent Network (MCGC-Net) for high-precision road crack detection in complex road scenes. First, a UAV-based multimodal road crack dataset with image-text annotations is constructed. Specifically, crack-related textual descriptions are automatically generated from crack annotations using predefined semantic templates, which summarize crack morphology, spatial distribution characteristics, and structural properties. These semantic descriptions provide high-level semantic prior information for crack representation learning. Second, a Multimodal Contrastive Semantic Gating module (MCSG) is introduced to leverage automatically generated crack semantic descriptions and in-batch image-text semantic differences to guide visual feature learning, thereby improving the discrimination between crack and non-crack regions under complex background conditions. Furthermore, a Crack-Aware Slenderness Loss (CASL) is proposed to explicitly constrain slenderness consistency between predicted boxes and ground-truth boxes, improving localization stability for slender crack targets. In addition, a KAN-based Nonlinear Channel Attention mechanism (KAN-CA) is introduced to enhance feature representation capability for complex crack structures. Experimental results demonstrate that the proposed MCGC-Net effectively improves crack detection accuracy and structural representation capability under complex road environments. The proposed method provides a practical and reliable solution for UAV-based intelligent road crack detection. Full article
(This article belongs to the Section Vehicular Sensing)
Show Figures

Figure 1

27 pages, 39300 KB  
Article
Multi-Frame Temporal Integration for 3-D Shape Measurement of Freely Falling Small Objects Using a High-Speed Camera Array
by Hao Duan, Shaopeng Hu, Feiyue Wang, Kohei Shimasaki and Idaku Ishii
Sensors 2026, 26(11), 3457; https://doi.org/10.3390/s26113457 - 30 May 2026
Viewed by 262
Abstract
Dynamic three-dimensional (3-D) reconstruction of small objects moving at high speed is fundamentally limited by the number of viewpoints that a fixed camera array can provide at any single time instant. When the camera count is insufficient, single-frame multi-view stereo produces incomplete or [...] Read more.
Dynamic three-dimensional (3-D) reconstruction of small objects moving at high speed is fundamentally limited by the number of viewpoints that a fixed camera array can provide at any single time instant. When the camera count is insufficient, single-frame multi-view stereo produces incomplete or inaccurate geometry. This paper proposes a multi-frame temporal integration approach that overcomes this limitation by exploiting the rigid-body assumption: because a falling object maintains its shape across consecutive frames, images captured at different time instants can be combined into a single, viewpoint-enriched reconstruction. A three-layer circular array of 32 synchronized RGB cameras captures 1440 × 1080 images at 160 fps, and a free-fall-oriented algorithm automatically detects active frames, selects informative temporal windows, and feeds the accumulated multi-frame images into a structure-from-motion and multi-view stereo (SfM-MVS) pipeline, effectively multiplying the number of viewpoints without additional hardware. The algorithm simultaneously recovers the 6-DOF pose trajectory of each object from the SfM-estimated camera parameters. Progressive accumulation experiments on freely falling soybeans (approximately 9–10 mm diameter) show that a single 32-camera frame already achieves an F-score exceeding 0.97 at a 0.5 mm threshold against an industrial structured-light scanner reference, and that accumulating additional temporal frames reaches a stable convergence plateau with both objects reaching a plateau F-score of 0.984. Beyond approximately one to two accumulated frames, additional frames yield diminishing returns, confirming that a small number of temporal frames is sufficient for convergent sub-millimeter accuracy. Across 30 independent free-fall trials with three objects, the system achieves an overall mean error of 0.146±0.033 mm and an overall F-score of 0.980±0.006—a mean relative error of approximately 1.6% on 8–10 mm targets—and fine surface features such as structural cracks are resolved at a fidelity sufficient for visual defect identification. These results establish rigid-body multi-frame temporal integration as an effective strategy for high-throughput, non-contact 3-D inspection of small objects in motion. Full article
Show Figures

Figure 1

19 pages, 3100 KB  
Article
Shield Tunnel Crack Detection Based on Improved Unet
by Gang Ming, Xiao-Wei Ye, Da Hang, Jian-She Qin and Jie Li
Sensors 2026, 26(11), 3360; https://doi.org/10.3390/s26113360 - 26 May 2026
Viewed by 290
Abstract
Unet, a deep learning architecture, has become one of the most widely used models for crack detection in the tunneling field. Although it performs well in overall crack image segmentation, it still has issues of limited feature expression capability and inaccurate segmentation. To [...] Read more.
Unet, a deep learning architecture, has become one of the most widely used models for crack detection in the tunneling field. Although it performs well in overall crack image segmentation, it still has issues of limited feature expression capability and inaccurate segmentation. To address these problems, DTA-Unet was proposed based on dynamic convolution decomposition (DCD) and triple attention (TA). Firstly, the model used Unet as the baseline network and replaced traditional convolutions in the encoding-decoding process with DCD to enhance its feature extraction ability. Secondly, TA was combined with attention gate (AG) in the skip connections of the network, eliminating redundant information in spatial and channel dimensions to highlight the crack area. Finally, the proposed model was tested on crack datasets and compared with the conventional Unet model, image processing algorithms, and other deep neural network models in terms of detection performance on the datasets. The results show that it outperforms other advanced methods in crack detection performance. The proposed method is of significance to the maintenance of shield tunnel cracks. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

29 pages, 2568 KB  
Article
Crack Segmentation Model for Low-Quality Crack Images Based on Feature Integration and Triple Attention
by Yonghua Xie and Yuyang Wang
Appl. Sci. 2026, 16(11), 5185; https://doi.org/10.3390/app16115185 - 22 May 2026
Viewed by 191
Abstract
To address the problem of road crack detection in low-quality pavement images, existing semantic segmentation methods still have shortcomings such as missed crack detection and inaccurate localization due to weak crack boundaries, low contrast, and complex pavement texture. To address these limitations, this [...] Read more.
To address the problem of road crack detection in low-quality pavement images, existing semantic segmentation methods still have shortcomings such as missed crack detection and inaccurate localization due to weak crack boundaries, low contrast, and complex pavement texture. To address these limitations, this study proposes a crack segmentation model based on feature integration and a triple attention mechanism. The model uses DeepLabv3+ as the backbone network and introduces the proposed three-dimensional interactive attention module after feature extraction. The attention module enhances the extraction of key features related to the spatial location and morphological details of cracks, thereby improving the ability of crack location. A hierarchical feature integration branch is introduced in the cross-layer connection, and a dimension-aware selective fusion module is used to enhance the saliency of small cracks in complex backgrounds. In addition, the proposed multi-group dilation feature fusion module is introduced to improve the multi-scale modeling of small and slender cracks and reduce background interference. The experimental results on Crack500 and GAPS384 datasets show that the proposed model achieves better overall segmentation performance than the comparison model, especially in reducing the missed detection of weak, small, and discontinuous cracks in low-quality pavement images. Complexity analysis further shows that the proposed model maintains practical inference efficiency rather than relying on too large a model size. These results show that the proposed method provides an effective solution for low-quality road crack segmentation, but it still needs to be further verified in actual detection scenarios. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

19 pages, 3072 KB  
Article
Morphology-Adaptive YOLO for Underwater Crack Detection in Hydraulic Structures
by Zhe Chen, Changning Zhou, Jingkun Guo and Guangjun Yin
Water 2026, 18(10), 1241; https://doi.org/10.3390/w18101241 - 21 May 2026
Viewed by 347
Abstract
Accurate underwater crack detection is essential for the condition monitoring of hydraulic structures. However, reliable detection in underwater inspection imagery remains challenging because of low visibility, complex backgrounds, large-scale variation, and irregular crack morphology. To improve detection under these conditions, we develop MA-YOLO, [...] Read more.
Accurate underwater crack detection is essential for the condition monitoring of hydraulic structures. However, reliable detection in underwater inspection imagery remains challenging because of low visibility, complex backgrounds, large-scale variation, and irregular crack morphology. To improve detection under these conditions, we develop MA-YOLO, a YOLOv11-based detector that adapts feature representation to underwater crack morphology. The proposed method integrates a broader receptive field spatial pyramid pooling module to enhance multi-scale feature extraction, a morphological attention module to improve the representation of irregular crack patterns, and an extra-large detection head to better detect magnified cracks in close-range underwater images. Experiments on the underwater crack dataset (UCD) show that MA-YOLO outperforms both conventional detectors and recent underwater object-specific detectors. Relative to YOLOv11, MA-YOLO increases mAP@0.5 from 91.2% to 92.9% and mAP@0.5:0.95 from 60.0% to 63.0%, while maintaining a lightweight architecture and real-time inference capability. The results demonstrate the effectiveness of morphology-adaptive feature modeling for image-based underwater crack detection and its potential for practical monitoring of submerged hydraulic structures. Full article
(This article belongs to the Section Hydraulics and Hydrodynamics)
Show Figures

Figure 1

26 pages, 5946 KB  
Article
Intelligent Recognition and Restoration of Mural Damage Based on DeepLabv3 and Stable Diffusion
by Chong Rong, Dashuai Yang, Wenkai Tian, Yi Tao, Qiuwei Wang and Peng Wang
Buildings 2026, 16(10), 2012; https://doi.org/10.3390/buildings16102012 - 20 May 2026
Viewed by 251
Abstract
Murals are not merely independent visual artworks. Rather, they are an integral part of architectural heritage, directly attached to buildings’ structural elements, such as brick walls and vaults. However, murals are susceptible to various building-related types of damage, including structural cracks and moisture-induced [...] Read more.
Murals are not merely independent visual artworks. Rather, they are an integral part of architectural heritage, directly attached to buildings’ structural elements, such as brick walls and vaults. However, murals are susceptible to various building-related types of damage, including structural cracks and moisture-induced peeling, due to long-term exposure to environmental factors and geological changes. As the progressive deterioration of these murals hastens the loss of mural value, professional assessment and restoration are urgently required. To tackle the issues of low efficiency in traditional structural damage detection and the absence of predictable repair plans, this paper presents a semi-automatic building-mural protection solution that integrates morphological assessment of mural deterioration with computer vision technology. This study establishes an image prediction system that integrates intelligent damage identification with virtual restoration. First, employing the PaddleSeg deep learning framework and the DeepLabv3 semantic segmentation model, this study used existing mural damage datasets to build a recognition model. The model allows for intelligent identification and labeling of multiple damage types. Subsequently, relying on the ComfyUI platform, Stable Diffusion was used to construct a virtual restoration model. LoRA (low-rank adaptation) technology was introduced to fine-tune the model specifically for the mural style, thus enhancing the directivity and accuracy of virtual restoration. Finally, by applying the results of the recognition model to the virtual restoration model, this study built an integrated system for mural damage diagnosis and virtual restoration. The results show that the damage recognition model achieved a mean intersection over union (mIoU) of 47.8% and a pixel accuracy of 77.97% on the test set, validating the feasibility of using semantic segmentation for mural damage detection. This study presents an integrated workflow framework integrating automatic damage identification and intelligent repair. As an expert-assisted tool, this framework shows application potential for preliminary exploration of mural disease diagnosis and virtual restoration plans, providing technical references for the digital protection of cultural heritage. Full article
Show Figures

Figure 1

Back to TopTop