Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams

Wu, Xiaobian; Zhang, Weibo; Shen, Guangze; Sheng, Jinbao

doi:10.3390/app151910326

Open AccessArticle

Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams^†

¹

Nanjing Hydraulic Research Institute, Nanjing 210029, China

²

NHRI R&D Tech Group Co., Ltd., Nanjing 210029, China

³

Dam Safety Management Center of the Ministry of Water Resources, Nanjing 210029, China

^*

Author to whom correspondence should be addressed.

^†

This paper is an extension of a conference paper. Edge Enhanced Underwater CrackNet for Dam Underwater Crack Detection. In Proceedings of the International Symposium “Common Challenges, Shared Future, Better Dams” ICOLD-CIGB 2025, Chengdu, China, 16–23 May 2025.

Appl. Sci. 2025, 15(19), 10326; https://doi.org/10.3390/app151910326

Submission received: 17 August 2025 / Revised: 8 September 2025 / Accepted: 10 September 2025 / Published: 23 September 2025

(This article belongs to the Special Issue Civil Structural Health Monitoring: Techniques, Systems and Applications)

Download

Browse Figures

Versions Notes

Abstract

Featured Application

This study provides an underwater image enhancement method that effectively improves the detection performance of crack recognition algorithms under complex flow field and low-illumination underwater conditions and provides a new solution for concrete crack detection algorithms applied to dam surfaces under deep-water conditions.

Abstract

Underwater crack detection in dam structures is of significant importance to ensure structural safety, assess operational conditions, and prevent potential disasters. Traditional crack detection methods face various limitations when applied to underwater environments, particularly in high dam underwater environments where image quality is influenced by factors such as water flow disturbances, light diffraction effects, and low contrast, making it difficult for conventional methods to accurately extract crack features. This study proposes a dual-stage underwater crack detection method based on Cycle-GAN and YOLOv11 called Edge-Enhanced Underwater CrackNet (E²UCN) to overcome the limitations of existing image enhancement methods in retaining crack details and improving detection accuracy. First, underwater concrete crack images were collected using an underwater remotely operated vehicle (ROV), and various complex underwater environments were simulated to construct a test dataset. Then, an improved Cycle-GAN image style transfer method was used to enhance the underwater images. Unlike conventional GAN-based underwater image enhancement methods that focus on global visual quality, our model specifically constrains edge preservation and high-frequency crack textures, providing a novel solution tailored for crack detection tasks. Subsequently, the YOLOv11 model was employed to perform object detection on the enhanced underwater crack images, effectively extracting crack features and achieving high-precision crack detection. The experimental results show that the proposed method significantly outperforms traditional methods in terms of crack detection accuracy, edge clarity, and adaptability to complex backgrounds, effectively improving underwater crack detection accuracy (precision = 0.995, F1 = 0.99762, mAP@0.5 = 0.995, and mAP@0.5:0.95 = 0.736) and providing a feasible technological solution for intelligent inspection of high dam underwater cracks.

Keywords:

dam safety; small-object detection; underwater crack detection

1. Introduction

Currently, China has the largest number of high dams, with 588 dams over 70 m, 233 dams over 100 m, and 23 dams over 200 m. Thus, high dam safety is a major challenge for national water safety and public safety. Cracks are one of the most common structural defects in such underwater engineering facilities during long-term operation. They can lead to reduced structural strength, water leakage, and even catastrophic consequences, posing a serious threat to safety. In some cases, the cracks exist underwater, which is even more challenging, as mature methods for above-water detection cannot be directly applied. Therefore, determining methods for early detection and precise evaluation of cracks both above and below water is of critical importance for ensuring the stability of these engineering structures.

Traditional crack detection methods mainly rely on draining reservoirs to clear interference factors combined with manual inspections to clearly observe underwater structural defects [1,2], an approach that is both time-consuming and economically costly and requires a considerable amount of human resources, potentially having a negative impact on the ecological environment in certain cases. It is also not suitable for high dams and large reservoirs. In contrast, modern detection methods primarily rely on divers or portable detection equipment for underwater visual inspections, with saturation diving required for high dams, which is both dangerous and costly. Moreover, divers often lack professional knowledge of hydraulic engineering, lowering the reliability of their detection results. Most recently, with the advancements in ROVs and underwater imaging technologies, the possibility of intelligent inspection for underwater cracks has emerged. However, traditional image-processing methods perform poorly in complex underwater environments, and their detection accuracy and adaptability are severely limited [3]. Therefore, research on crack defect detection models in complex underwater environments is of great significance.

One of the core challenges in underwater crack detection tasks is the low quality of underwater images. Due to the influence of various disturbance fields in the underwater environment (such as flow fields and hydrodynamic fields), underwater images often exhibit significant motion blur and light diffraction effects. Additionally, the absorption and scattering of light in water result in reduced image contrast, color distortion, and detail blurring, which severely affects the performance of crack detection models [4]. Therefore, underwater image enhancement techniques have become a fundamental research direction for improving the accuracy of underwater crack detection. Traditional underwater image data enhancement algorithms, such as histogram equalization [5] and homomorphic filtering [6], are relatively simple, but have poor robustness, making them unsuitable for image enhancement tasks in high dam underwater multi-field disturbance environments. In recent years, with the rapid development of machine learning [7] and deep learning technologies [8], neural network-based image enhancement methods have made significant progress in the field of image processing [9,10]. Recent unpaired enhancement approaches (e.g., WaterGAN [11], Shallow-uwnet [12], UColor [13], PUIE-Net [14]) and datasets show that paired data are not strictly required, but these methods focus mainly on perceptual quality rather than preserving crack features for detection. Meanwhile, modern detectors such as YOLOv8–v11 demonstrate strong robustness, though pipelines that explicitly constrain edge/high-frequency cues for underwater cracks remain scarce. For example, Wang Yue et al. [15] proposed a multi-scale attention and contrast learning-based underwater image enhancement algorithm, which effectively extracts multi-level image features by combining an encoder–decoder structure with multi-scale channel pixel attention modules. The improvements achieved in PSNR and SSIM metrics were 4.4% and 2.8%, respectively, significantly improving image clarity. Additionally, Du Feiyu et al. [16] proposed a domain-adaptive underwater image enhancement method combining convolutional neural networks with a multi-head attention mechanism and adversarial learning to achieve image enhancement under unsupervised conditions. Although these methods achieved good results in restoring visual quality, they focus more on enhancing the overall visual effect of the image rather than preserving and enhancing crack details, such as crack boundaries, which may lead to blurred crack edges or feature loss, thereby affecting subsequent crack detection accuracy. In other words, current underwater enhancement pipelines rarely optimize for crack-salient edges and high-frequency cues, leaving a gap between visually pleasing restoration and useful feature retention for detectors. Furthermore, existing underwater image enhancement models require paired images for training [17,18], but paired underwater crack images are difficult to obtain as they require the use of ROVs for image collection, making it challenging to create paired image datasets for model training.

In the field of underwater crack detection, traditional crack detection methods, such as Markov random fields [19] and Sobel operators [20], rely on edge detection algorithms in their image processing techniques. Although these methods perform well in simple scenarios, they are highly sensitive to noise and easily affected by background interference and optical artifacts in complex underwater environments, making it difficult to effectively extract crack features. In recent years, with the continuous development of object detection technologies, deep learning-based models have achieved significant progress in underwater crack detection. For instance, Shi et al. proposed a method called CrackYOLO [21], based on the YOLOv5 model, which introduces a feature fusion module, a Res2C3 feature extraction module, and a BCAtt attention mechanism, significantly improving crack detection performance. It achieved 94.3% mAP and a detection speed of 151 FPS in underwater crack detection tasks. Moreover, Mao Yingchi et al. [22] proposed a multi-task enhanced crack image detection method (ME-Faster R-CNN) based on Faster R-CNN, which improves the regional proposal network (RPN) and introduces the multi-source adaptive balancing TrAdaBoost method, effectively improving the detection capability for multiple targets and small target cracks. In experiments, it achieved an 82.52% average intersection over union (IoU) and 80.02% average precision (mAP), an improvement of 1.06% and 1.56%, respectively, compared to traditional Faster R-CNN methods. Additionally, Huang et al. [23] tackled the inherent limitations of redundant architectural components and deficient multi-scale feature extraction in the canonical YOLOv5 framework by introducing an enhanced model that synergistically integrates attention mechanisms with the Complete-IoU (CIoU) loss [24], thereby substantially elevating real-time detection accuracy. Concurrently, a refined YOLOv8-derived architecture was developed, which exhibits markedly superior robustness and detection fidelity when confronted with the severe visual degradations characteristic of the underwater domain, thereby advancing the state of the art in marine object detection [25]. These research outcomes show that deep learning-based object detection methods can effectively handle complex underwater crack detection tasks in various fields and achieve a good balance between detection speed and accuracy. However, due to the lack of underwater crack data, research on dam underwater crack detection based on object detection algorithms is relatively limited [26].

To address the deficiencies in the existing research on underwater image enhancement and crack detection, this paper proposes a dual-stage underwater crack detection method based on Cycle-GAN [27] and YOLOv11 [28] called Edge-Enhanced Underwater CrackNet (E²UCN) in order to pay sufficient attention to crack feature details in complex underwater environments and achieve high-precision crack detection. First, during the underwater crack image collection process, we used a P200 underwater remotely operated vehicle (ROV) to capture artificial crack images in an underwater concrete tank, simulating various complex underwater scenarios, including flow field disturbances, optical diffraction, and low-contrast environments, to simulate the real environment of high dams. Next, in the image enhancement stage, we designed a Cycle-GAN-based underwater image style transfer method named the CycleGAN-Based Underwater Image Enhancement (CGBUIE) model to improve underwater image quality and highlight crack detail features. Although hybrid GAN-based approaches such as attention-guided CycleGAN [29,30] and ESRGAN variants [31,32] have achieved impressive results in natural image enhancement, they primarily optimize for perceptual quality rather than structural crack feature preservation. In contrast, our method explicitly constrains edge and frequency information, making it more suitable for downstream crack detection tasks. Furthermore, the CGBUIE model introduces Sobel operators [33] and high-frequency transformations [34,35] in the loss function to constrain the edge information and high-frequency detail retention in the generated image, preventing crack edges from becoming blurred or details from being lost. The Sobel operator extracts prominent edge information from the image, while high-frequency transformations enhance crack texture features, enabling the enhanced image to achieve both visual style transfer to an above-water environment and crack boundary and detail retention at the feature level [36,37,38]. Finally, in the crack detection stage, we first trained the YOLOv11 model on an above-water concrete crack dataset so it could learn key crack features and prominent edge features, then applied the trained model to the enhanced underwater crack images generated by the CGBUIE model for accurate underwater crack detection. During the detection process, YOLOv11, with its optimized network architecture and multi-scale feature extraction capability, is able to better capture the subtle features and irregular edges of cracks, particularly in cases where the cracks are small and complex in shape [39]. Unlike existing underwater image enhancement approaches, which mostly emphasize overall image clarity, our method explicitly integrates the Sobel operator and high-frequency Fourier constraints into CycleGAN, ensuring the preservation of crack edges and the fine textures critical for subsequent detection. The enhanced underwater crack images not only significantly improved the visibility of cracks but also provided high-quality input for the model, enabling fast and accurate crack localization and classification in complex underwater environments. Experimental results showed that the proposed method performs well in terms of crack edge clarity, object localization accuracy, and adaptability to complex backgrounds.

The novelty of E²UCN lies in its dual-stage architecture: (i) an enhanced CycleGAN that incorporates Sobel and Fourier constraints to explicitly preserve crack-specific features during the style transfer process, and (ii) its integration with YOLOv11 for robust crack detection. This was further substantiated by comprehensive ablation studies. The remainder of this paper is organized as follows: Section 2 describes the proposed E²UCN and its edge-/texture-aware enhancement. Section 3 presents the datasets, annotation protocols, and ablation design, discusses the results, uncertainties, and analysis. Section 4 concludes with the study’s contributions and limitations.

2. Proposed Method

The E²UCN framework, the CGBUIE model, and the YOLOv11 model used in this study will be described in detail in this section. Specifically, the E²UCN model is shown in Figure 1.

2.1. CycleGAN-Based Underwater Image Enhancement Model

Based on the style transfer functionality of CycleGAN, this study combines Sobel operators and high-frequency filtering to perform style transfer between underwater crack images and above-water crack images, aiming to enhance the images. The core idea of CycleGAN is to map between different domains through unsupervised learning without paired training data. This enables its widespread application in underwater crack detection, particularly when large annotated datasets are unavailable. CycleGAN achieves this goal by introducing two generators and two discriminators. Generators are used to generate images similar to the target style, while discriminators judge the difference between the generated image and the real image, thereby guiding the generator to optimize its generation effect.

In CycleGAN (https://github.com/junyanz/CycleGAN, accessed on 9 September 2025), generator

G_{x}

maps source domain images to target domain images. Meanwhile, generator

G_{y}

maps target domain images back to the source domain. Discriminators

D_{x}

and

D_{y}

are used to distinguish generated images from real images, thus guiding the generator to optimize its style transfer effect. The aim is to minimize the difference between generated and real images while ensuring that the generated image can restore the original image after being mapped back. This process is achieved by introducing cyclic consistency loss based on the adversarial loss of the original GAN.

Adversarial loss is used to train the generator to produce realistic images, forcing the generated images to “fool” the discriminator. For generator

G_{x}

, the goal is to minimize the following loss function:

L_{G A N} (G_{x}, D_{y}, x, y) = E_{\{y ~ p_{d a t a} (y)\}} [\log D_{y} (y)] + E_{\{x ~ p_{d a t a} (x)\}} [\log (1 - D_{y} (G_{x} (x)))],

(1)

where

x

and

y

are real images from the source and target domains,

G_{x} (x)

is the image generated by generator

G_{x}

, and

D_{y}

is the discriminator.

To ensure that the generated image can still restore the original image after being mapped back, CycleGAN introduces cyclic consistency loss. For generators G_x and G_y, their goals are defined as follows:

L_{C y c l e} (G_{x}, D_{y}, x, y) = E_{\{y ~ p_{d a t a} (x)\}} [G_{y} (G_{x} (x)) - x_{1}] + E_{\{x ~ p_{d a t a} (y)\}} [G_{x} (G_{y} (y)) - y_{1}],

(2)

This loss uses the L1 norm to measure the difference between the generated image and the original image, forcing the generated image to maintain the structural features of the source image. Therefore, the total loss function of the original CycleGAN is as follows:

L_{C y c l e G A N} = L_{G A N} (G_{x}, D_{y}, x, y) + L_{G A N} (G_{y}, D_{x}, y, x) + λ_{c y c l e} L_{C y c l e} (G_{x}, D_{y}, x, y),

(3)

where

λ_{c y c l e}

is the weight balancing adversarial loss and cyclic consistency loss.

However, experiments have shown that the style transfer images generated by the original CycleGAN, focusing mainly on style transfer (e.g., color, contrast of the above-water images), fail to retain the edge information and texture details of the image. To enhance the edge details of the style-transferred images, this paper introduces the E²UCN model.

The Sobel operator extracts edge information from an image by computing the gradients and is effective in retaining the image’s edge features. Here, based on the original CycleGAN, the basic principle of the Sobel operator is applied and further improved to construct Sobel loss. First, the Sobel operator is applied to compute the gradients of the input image, obtaining the gradients in the horizontal (

x

) and vertical (

y

) directions. Specifically, the Sobel operator used in this paper is as follows:

S o b e l_{x} = [\begin{matrix} - 1 & 0 & 1 \\ - 2 & 0 & 2 \\ - 1 & 0 & 1 \end{matrix}], S o b e l_{y} = [\begin{matrix} - 1 & - 2 & - 1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{matrix}],

(4)

where

S o b e l_{x}

and

S o b e l_{y}

are the convolution kernels of the Sobel filter in the

x

and

y

directions, respectively. The gradients calculated by the two operators can be represented as follows:

\nabla_{x, y} (i, j) = \sum_{m = 0}^{2} \sum_{n = 0}^{2} I (i + m, j + n) \cdot s o b e l_{x, y} (m, n),

(5)

where

i

and

j

represent the pixel positions in the image, and

\nabla x (\cdot)

and

\nabla y (\cdot)

represent the gradients in the x and y directions, respectively. Then, the gradient magnitude of the image is calculated to represent the edge information of the restored image. The specific formula is as follows:

\nabla (i, j) = \sqrt{\nabla_{x} {(i, j)}^{2} + \nabla_{y} {(i, j)}^{2} + ε},

(6)

where

ε

is a small constant to avoid numerical instability when the gradient is zero.

The Sobel loss proposed in this paper measures the gradient magnitude difference between the generated image and the real image using the L1 norm, comparing the edge differences in both the x and y directions between the generated image and the original image to simulate the edge enhancement effect of the Sobel operator. The specific formula is as follows:

L_{s o b e l} = E_{x ~ P_{d a t a} (x)} [\nabla G_{x} (x) - \nabla x_{1}],

(7)

where

G_{x} (x)

is the generated image, x is the real image, and

\nabla G_{x} (x)

and

\nabla x

are the gradients of the generated image and the real image, respectively.

By constructing Sobel loss, CycleGAN is forced to preserve the edge information of the real image during the image generation process, better retaining the crack edge features in the image and improving the accuracy of subsequent object detection tasks. However, while the generated image with Sobel loss retains the edge features of the cracks, there is still a significant amount of blurring and loss of the internal texture information of the cracks. Therefore, to further enhance the texture information in the image, and enable the subsequent object detection model to recognize and extract cracks from the image, this paper constructs a high-frequency loss function. Specifically, by using the Fourier transform and other high-frequency transformations, the image can be converted from the spatial domain to the frequency domain, allowing for better extraction and retention of high-frequency information such as textures and edges. Based on this, in the CGBUIE model, a high-frequency loss function is used to compare the high-frequency information between the real image and the generated image, preserving and enhancing the texture information.

In detail, the original image

x

and the generated image

G_{x}

are first transformed from the spatial domain to the frequency domain using the fast Fourier transform. Assuming the discrete image signal is

Ω

, the specific formula is as follows:

F (Ω (h, w)) = \sum_{n = 0}^{N - 1} Ω [n] e^{- i 2 π \frac{h w n}{N}},

(8)

where

N

is the signal length,

n

is the time index, and

k

is the frequency index. After applying the fast Fourier transform, the original image and the generated image are represented as

F_{r e a l} (h, w)

and

F_{f a k e} (h, w)

, respectively. High-frequency components are extracted by applying a high-frequency filter to remove low-frequency parts of the frequency domain signal, and the absolute value is taken to retain the magnitude of the high-frequency components, i.e., the texture details in the original image. The specific rule is as follows:

H (h, w) = \{\begin{matrix} |F (Ω (h, w))|, i f h < c u t o f f o r w < c u t o f f \\ 0, o t h e r w i s e \end{matrix},

(9)

where

h

and

w

represent the height and width in the frequency domain, i.e., the dimensions of the frequency domain, and

c u t o f f

is a manually set threshold for determining the high-frequency filtering threshold. Finally, the L1 norm loss is calculated between the original image and the generated image based on the magnitude of the high-frequency components to further enhance the texture details of the generated image. The formula is as follows:

L_{H F} = E_{x ~ P_{d a t a} (x)} [H (G_{x} (x)) - H {(x)}_{1}] .

(10)

Therefore, the overall loss function of the E²UCN model proposed in this paper, called Sobel-Frequency Hybrid Loss (SFHLoss), can be expressed as follows:

S F H L o s s = L_{G A N} (G_{x}, D_{y}, x, y) + L_{G A N} (G_{y}, D_{x}, y, x) + L_{c y c l e} (G_{x}, D_{y}, x, y) + L_{s o b e l} + L_{H F}

(11)

By adding Sobel loss and high-frequency loss, the edge and texture details of the image are effectively preserved, which helps the subsequent object detection model accurately extract the crack locations.

2.2. YOLOv11 Model

After image enhancement, images with enhanced texture details are obtained. Subsequently, this study uses the YOLOv11 (https://github.com/ultralytics/ultralytics, accessed on 9 September 2025), model for automatic underwater crack detection. YOLOv11, based on the previous generations of the YOLO model, significantly improves detection accuracy and speed through various innovations and optimizations, demonstrating excellent performance, particularly in the detection of fine targets such as cracks. The specific model architecture is shown in Figure 2.

The core structure of YOLOv11 includes the C3k2 module, the SPPF module, the C2PSA module, and a lightweight design. These innovations enable YOLOv11 to efficiently process the enhanced underwater crack images and achieve accurate crack localization and classification.

First, the C3k2 module in YOLOv11 adopts an improved CSP (Cross-Stage Partial) structure, which optimizes the feature extraction process by using smaller convolution kernels (e.g.,

3 \times 3

kernels). The C3k2 module splits the input feature map into two parts, performs convolution on each part, and then merges them. This design effectively reduces the number of parameters while maintaining feature extraction capabilities. Next, the SPPF (Spatial Pyramid Pooling—Fast) module, another key component of YOLOv11, quickly merges feature maps of different scales through multi-scale pooling. This module significantly enhances the network’s ability to detect targets of different sizes, which is particularly relevant as scale differences are common in crack images. By aggregating global features, the SPPF module improves the model’s detection accuracy. It generates multi-level feature maps through pooling operations at different scales and ultimately merges these feature maps into a global feature representation, thus enhancing the network’s sensitivity to multi-scale information in crack images.

Furthermore, the Convolutional block with the Parallel Spatial Attention (C2PSA) module introduced by YOLOv11 further optimizes the extraction of spatial features. The C2PSA module uses parallel spatial attention mechanisms to focus on key areas in the image (such as the edges of cracks or the cracks themselves), effectively improving the model’s recognition ability against complex backgrounds. The C2PSA module combines both channel attention and spatial attention mechanisms, and, through multi-head attention, it further enhances the feature expression capabilities, allowing YOLOv11 to more accurately localize cracks.

To enhance the model’s lightweight design, YOLOv11 introduces the MobileViT backbone network and depthwise separable convolutions (DWConv), significantly reducing the computational load and the number of parameters while maintaining high accuracy. MobileViT combines the advantages of convolutional neural networks (CNN) and Transformers, enabling efficient information encoding and fusion to capture complex features in crack images while maintaining a low computational overhead. This design makes YOLOv11 suitable for resource-constrained devices such as embedded systems and drones, meeting the demand for crack detection on edge devices.

The loss function of YOLOv11 consists of three main components: classification loss (

L_{c l s}

), bounding box regression loss (

L_{b o x}

), and distribution focal loss (

L_{d f l}

).

L_{c l s}

mainly optimizes the prediction of object categories,

L_{b o x}

is used to optimize the prediction of object locations, and

L_{d f l}

optimizes the confidence of the bounding boxes.

Through this multi-task loss function optimization strategy, YOLOv11 can effectively balance accuracy and speed in crack detection tasks, particularly showing high accuracy and robustness in detecting small cracks in complex backgrounds.

3. Experimental Analysis

In this section, the data collection process and experimental setup are introduced. A series of experiments, including quantitative comparisons of image enhancement and crack detection metrics, detection result images, and ablation studies, is used to validate the effectiveness of the proposed E²UCN.

3.1. Data Collection and Experimental Setup

The underwater image dataset used in this study was collected from the physical model pool at the Tangtu Experimental Base of the Nanjing Hydraulic Research Institute. The test pool dimensions are

11.0 m \times 5.9 m \times 4.2 m

(

l e n g t h \times w i d t h \times d e p t h

), with a depth of 3.4 m below the ground surface and a surrounding wall height of 0.8 m, and the pool’s sidewalls are reinforced with carbon-fiber fabric. The pool contains an underwater tunnel and a dam test scenario with concrete of grade C30. The underwater tunnel dimensions are

6 m \times 4 m \times 3.3 m

(

l e n g t h \times w i d t h \times h e i g h t

), with typical defects set inside. The tunnel is 3.0 m wide and 2.08 m high, providing sufficient space for robotic operations, as shown in Figure 3.

During the data collection process, the mini underwater P200 robot “Qianjiao” (Manufacturer: Qianxin Innovation Technology Co., Ltd.; City: Shenzhen; Country: China) was used to capture underwater optical images. The original video data were processed into eight typical images with a size of

256 \times 256

pixels, simulating various lighting and visual conditions, including normal light, low contrast, light scattering, and non-uniform lighting, as shown in Figure 4a.

The dataset used for training YOLOv11 was the Roboflow crack dataset, which was collected by researchers working on transportation and public safety. The dataset contains 4029 different static images of cracks divided into training, testing, and validation sets, each with corresponding labels. For the CycleGAN training, 10 underwater crack images and 26 above-water crack images were collected, among which 8 typical underwater images (with scattering, insufficient illumination, and blur) were selected for testing. In addition, the dataset for validation contained 11 images, covering cracks with diverse widths, depths, and orientations and several real underwater crack images. Although the number of self-collected underwater images was limited, representative cases with scattering, blurring, and color distortion were selected to ensure diversity in crack characteristics. The experiments were conducted on a high-performance personal computer equipped with an NVIDIA RTX 3080ti 12GB GPU and an AMD5800X CPU. All models were constructed and tested using PyTorch version 1.10.0. During network training, the high-frequency filtering parameter

c u t o f f

was set to 10.

3.2. Experimental Results and Analysis

To validate the effectiveness of the CGBUIE model, several ablation experiments were designed [40,41,42]. First, the original CycleGAN was used for the style transfer of underwater images. Then, Sobel operators and high-frequency loss were added, and subjective evaluation of the resulting images compared to the designed CGBUIE model was performed. The enhanced results from different models are shown in Figure 4. The images were then input into the trained YOLOv11 model for detection. During the evaluation process, precision, recall, mAP (mean average precision), F1-score, and other metrics were recorded for each case to comprehensively evaluate the performance of YOLOv11 with different input images. Visual comparisons were also made between the original and enhanced images in terms of crack detection, analyzing the enhancement method’s effect on detection accuracy and feature recognition. The specific methods and corresponding experimental results are shown in the table below.

Table 1 shows the models used for target detection, where the checked boxes represent the models used in this experiment, and the first row, with no checked boxes, represents the use of the original underwater crack images. From the experimental results, it can be seen that the image enhancement methods significantly affect the performance of YOLOv11 in underwater crack detection. Firstly, Sobel operators primarily enhance the edge information of the image, which helps YOLOv11 achieve better detection results in crack edge localization. However, although Sobel operators improve edge clarity, they may lose some texture and detail information when enhancing the edges, leading to relatively poor performance in complex textured regions. Therefore, although the recall is very high, the mAP50-95 shows a certain decline, reflecting the model’s performance in broader detection areas, which may be affected, thus reducing overall detection performance.

Secondly, high-frequency loss focuses on enhancing the details and high-frequency information in the image, particularly the finer parts of cracks. Compared with Sobel operators, the high-frequency loss image enhancement method is better at preserving the fine texture information of cracks, thus improving detection precision and recall. However, high-frequency enhanced images may also introduce some noise, particularly in more complex backgrounds, leading to minor errors in the detection boxes. Nevertheless, the improvements in the F1-score and mAP50 indicate that high-frequency loss has a significant effect on detail restoration.

Overall, the goal of image enhancement is to improve detection performance by enhancing edge clarity and restoring details, and, when Sobel operators and high-frequency loss are combined, their advantages complement each other toward achieving that goal. In other words, Sobel operators effectively enhance edges but may lose details, while high-frequency loss restores details but may introduce noise. Therefore, using both methods together maximizes the retention of crack feature information in the enhanced underwater images, effectively improving YOLOv11’s overall performance in crack detection, particularly for small cracks and complex backgrounds, with precision, recall, and F1-scores reaching near-perfect levels.

The changes in the loss function and performance metrics during YOLOv11 training are shown in Figure 5, with the ground truth illustrated in line (a). The detection results after applying different enhancement models are shown in Figure 6.

It can be observed that YOLOv11 exhibits some deficiencies in the original images (Figure 6b), particularly when the crack details are blurry or the background is complex. In these cases, the detection confidence is generally lower, and crack localization accuracy is affected. Specifically, the original image, due to its blurry details and low contrast, poses challenges for YOLOv11 in accurately detecting cracks.

With the application of the original CycleGAN model for style transfer (Figure 6c), the detection results of YOLOv11 showed significant improvement. CycleGAN enhanced the overall image clarity, particularly improving the representation of edges and textures, which effectively increased the model’s crack localization accuracy. The enhanced image’s improved details and contrast allowed YOLOv11 to more accurately identify cracks, with a significant increase in detection confidence, reflecting the positive impact of image enhancement on detection results.

After further introducing SobelLoss (Figure 6d), YOLOv11’s crack detection performance improved further. The Sobel operator enhanced the image’s edge details, significantly improving the clarity of the crack contours, and the model’s precision in crack localization was improved. However, despite the positive effect of Sobel on edge enhancement, its ability to preserve image texture information is relatively weak, which may lead to the loss of details in small cracks, affecting detection accuracy. Some false positives and missed detections were still observed, suggesting that, while edge enhancement is beneficial, detail restoration remains a challenge for the model.

When high-frequency loss is applied for image enhancement (Figure 6e), focusing on the high-frequency components of the image, the fine details and subtle features of the cracks are more effectively restored, improving YOLOv11’s ability to recognize small cracks. The enhanced image made crack detection more precise, and confidence increased, particularly for low-contrast and complex backgrounds, where the model demonstrated stronger robustness. However, excessive high-frequency enhancement could introduce background noise, causing instability in some areas of the detection results, highlighting the sensitivity of the enhancement method to background interference.

Finally, the combination of SobelLoss and high-frequency loss (Figure 6f) demonstrated the best crack detection performance. This combination not only strengthened the edge details of the image but also effectively restored more texture information, making crack localization more precise, and the image details richer. YOLOv11 performed at its best with these enhanced images, with its overall detection precision and recall being significantly improved. By integrating both edge enhancement and detail restoration, the model’s adaptability to complex environments was significantly improved, and detection confidence was generally higher, further validating the superiority of combining SobelLoss and high-frequency loss for enhancing image detail restoration and model robustness.

In addition to YOLOv11, we conducted comparative experiments with YOLOv5 and YOLOv8 to provide a broader reference for detection performance. As summarized in Table 2 and Figure 7, all three models benefited from the proposed image enhancement strategy. YOLOv5 achieved a precision of 0.91 and a recall of 1.0, resulting in an F1-score of 0.953, with mAP@0.5 = 0.876 and mAP@0.5:0.95 = 0.752. YOLOv8 exhibited higher overall precision and recall (1.0 and 0.995, respectively), yielding an F1-score of 0.997, and the highest mAP@0.5 value of 0.995, although its performance at stricter IoU thresholds (mAP@0.5:0.95 = 0.685) was comparatively lower. YOLOv11 attained balanced and robust performance, with precision = 0.995, recall = 1.0, F1-score = 0.998, and mAP@0.5 = 0.995, while maintaining a competitive mAP@0.5:0.95 of 0.732.

These results show that, although YOLOv8 achieved the highest detection accuracy under lenient IoU criteria, YOLOv11 demonstrated superior robustness in handling small and irregular crack patterns, reflected in its higher F1-score and improved performance at more stringent IoU thresholds compared to YOLOv8. Therefore, YOLOv11 was selected as the primary detection model in this study. More comprehensive comparisons with other detectors, such as Faster R-CNN and Transformer-based architectures, will be explored in future work to further validate the generality of the proposed approach.

4. Conclusions

This study combines the advantages of image style transfer, detail restoration, and edge enhancement, fully leveraging the complementary effects of Sobel operators and high-frequency filtering. A CycleGAN-based underwater image enhancement method (the CGBUIE model) was proposed, which effectively improves the edge and detail information of the generated images by introducing Sobel operators and high-frequency filtering as loss functions. Specifically, by training on underwater crack images and above-water crack images, the style transfer of underwater images to above-water image styles was achieved, enhancing image visibility and detail expression while improving the robustness of the crack detection model. On this basis, YOLOv11 was used to train the model on the Crack-Seg dataset, constructing a detection model capable of effectively recognizing cracks. The experimental results show that the enhanced underwater crack images significantly improved YOLOv11’s detection performance, detection confidence, and accuracy with complex backgrounds and in low-contrast conditions. Specifically, on a real underwater validation set, E²UCN achieved precision = 0.995, F1 = 0.99762, mAP@0.5 = 0.995, and mAP@0.5:0.95 = 0.736.

In summary, the primary contribution of this work lies in the integration of Sobel operators and high-frequency Fourier constraints into the CycleGAN framework, which ensures the preservation of critical crack-related details during the enhancement process. Together with YOLOv11, this creates a powerful and reliable system for underwater crack detection in challenging environments.

However, this study has certain limitations. The underwater dataset used is relatively small, with only three real underwater images supplementing the dataset. This limited dataset size restricts the diversity and robustness of the model, which may affect its performance in more varied scenarios. Future research should focus on expanding this dataset to increase its diversity and improve the model’s generalization capabilities.

Author Contributions

Methodology, X.W. and J.S.; software, W.Z. and X.W.; validation, W.Z.; investigation, X.W. and W.Z.; resources, X.W. and G.S.; writing—original draft preparation, W.Z.; writing—review and editing, X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the the National Key R&D Program of China under Grant (No. 2022YFC3005401), the National Natural Science Foundation of China under Grant (No. 52309159), the National Natural Science Foundation of China—Joint Fund under Grant (No. U23B20150), the Jiangsu Province Youth Science and Technology Talent Support Program under Grant (No. JSTJ-2024-082), the Scientific Research Fund of Nanjing Hydraulic Research Institute under Grant (No. Y724001), the Technology Talent and Platform Program of Yunnan Province under Grant (No. 202405AK340002),and the Science and Technology Projects of China Huaneng Group under Grant HNKJ20-H46.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available from the author on request.

Acknowledgments

This paper is an extension of a conference paper. Edge Enhanced Underwater CrackNet for Dam Underwater Crack Detection. In Proceedings of the International Symposium “Common Challenges, Shared Future, Better Dams” ICOLD-CIGB 2025, Chengdu, China, 16–23 May 2025 [43].

Conflicts of Interest

Author X.W. was employed by the company NHRI R&D Tech Group Co., Ltd. The remaining authors declare that the re-search was con-ducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Gong, X.N.; Zhang, C.S.; Jia, J.S. Dam Danger Assessment and Risk Reinforcement Technology; China Construction Industry Press: Beijing, China, 2021. (In Chinese) [Google Scholar]
Xiang, Y.; Wang, Y.; Chen, Z.; Dai, B.; Chen, S.; Shen, G. Underwater defect detection and diagnosis assessment for high dams: Current status and challenges. Adv. Water Sci. 2024, 35, 153–164. [Google Scholar]
Raveendran, S.; Patil, M.D.; Birajdar, G.K. Underwater image enhancement: A comprehensive review, recent trends, challenges and applications. Artif. Intell. Rev. 2021, 54, 5413–5467. [Google Scholar] [CrossRef]
Akkaynak, D.; Treibitz, T. Sea-thru: A Method for Removing Water from Underwater Images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 1682–1691. [Google Scholar]
Pizer, S.M.; Amburn, E.P.; Austin, J.D.; Cromartie, R.; Geselowitz, A.; Greer, T.; ter Haar Romeny, B.; Zimmerman, J.B.; Zuiderveld, K. Adaptive histogram equalization and its variations. Comput. Vis. Graph. Image Process. 1987, 39, 355–368. [Google Scholar] [CrossRef]
Chen, L.; Liang, X.; Peng, Q.; Liu, W. Adaptive homomorphic filtering algorithm for low illumination image enhancement. Comput. Sci. Appl. 2023, 13, 450–457. (In Chinese) [Google Scholar] [CrossRef]
Li, C.; Zhang, W.; Zhang, Y.; Chen, Z. Adaptively Dictionary Construction for Hyperspectral Target Detection. IEEE Geosci. Remote Sens. Lett. 2023, 20, 5502005. [Google Scholar] [CrossRef]
Wang, Y.; Xiang, Y.; Dai, B.; Li, J. Dam early warning model based on structural anomaly identification and dynamic effect variables selection. Structures 2025, 74, 108507. [Google Scholar] [CrossRef]
Habib, A.; Yildirim, U. Distribution of strong input energy in base-isolated structures with complex nonlinearity: A parametric assessment. Multidiscip. Model. Mater. Struct. 2023, 19, 324–340. [Google Scholar] [CrossRef]
Habib, A.; Barakat, S.; Al-Toubat, S.; Junaid, M.T.; Maalej, M. Developing machine learning models for identifying the failure potential of fire-exposed FRP-strengthened concrete beams. Arab. J. Sci. Eng. 2025, 50, 8475–8490. [Google Scholar] [CrossRef]
Li, J.; Skinner, K.A.; Eustice, R.M.; Johnson-Roberson, M. WaterGAN: Unsupervised generative network to enable real-time color correction of monocular underwater images. IEEE Robot. Autom. Lett. 2017, 3, 387–394. [Google Scholar] [CrossRef]
Naik, A.; Swarnakar, A.; Mittal, K. Shallow-uwnet: Compressed model for underwater image enhancement (student abstract). In Proceedings of the AAAI Conference on Artificial Intelligence, Online, 2–9 February 2021; Volume 35, pp. 15853–15854. [Google Scholar]
Lin, P.; Wang, Y.; Li, Y.; Fan, Z.; Fu, X. Underwater color correction network with knowledge transfer. IEEE Trans. Multimed. 2024, 26, 8088–8103. [Google Scholar] [CrossRef]
Fu, Z.; Wang, W.; Huang, Y.; Ding, X.; Ma, K. Uncertainty inspired underwater image enhancement. In Computer Vision—ECCV 2022, 17th European Conference, Tel Aviv, Israel, 23–27 October 2022; Springer Nature: Cham, Switzerland, 2022; pp. 465–482. [Google Scholar]
Wang, Y.; Fan, H.; Liu, S.; Tang, Y. Underwater Image Enhancement Based on Multi-scale Attention and Contrastive Learning. Laser Optoelectron. Prog. 2024, 61, 0437008. (In Chinese) [Google Scholar]
Du, F.; Wang, H.; Yao, H.; Chen, X. Domain-Adaptive Underwater Image Enhancement Algorithm. Comput. Mod. 2024, 55–60. (In Chinese) [Google Scholar] [CrossRef]
Li, J.; Skinner, K.; Eustice, R.M.; Johnson-Roberson, M. WaterGAN: Unsupervised Generative Network to Enable Real-time Color Correction of Monocular Underwater Images. arXiv 2017, arXiv:1702.07392. [Google Scholar] [CrossRef]
Islam, M.J.; Xia, Y.; Sattar, J. Fast underwater image enhancement for improved visual perception. IEEE Robot. Autom. Lett. 2020, 5, 3227–3234. [Google Scholar] [CrossRef]
Hambon, J. Electronical imaging of structural concrete: Investigation by Bayesian stationary wavelet field. Electron. Imaging 2009, 9, 764–773. [Google Scholar]
Gao, Y.; Wang, Y.; Zhou, D. The Image Recognition, Automatic Measurement and Seam Tracking Technology in Arc Welding Process. In Proceedings of the 2010 8th World Congress on Intelligent Control and Automation, Jinan, China, 7–9 July 2010; pp. 2327–2332. [Google Scholar]
Shi, P.; Shao, S.; Fan, X.; Xin, Y.; Zhou, Z.; Cao, P.; Li, X.; Zhu, S. CrackYOLO: Towards efficient dam crack detection for underwater scenes. Pattern Anal. Appl. 2024, 27, 105. (In Chinese) [Google Scholar] [CrossRef]
Mao, Y.; Ping, P.; Chen, J.; Chen, H. Crack Detection with Multi-task Enhanced Faster R-CNN Model. In Proceedings of the IEEE International Conference on Image Processing, Online, 25–28 October 2020; pp. 1234–1238. [Google Scholar]
Huang, B.; Kang, F.; Tang, Y. Real-Time Crack Detection Method for Concrete Dams Based on Object Detection. J. Tsinghua Univ. (Sci. Ed.) 2023, 63, 1078–1086. [Google Scholar] [CrossRef]
Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. In Proceedings of the the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; pp. 12993–13000. [Google Scholar]
Huang, J.; Fang, C.; Zheng, X.; Liu, J. YOLOv8-UC: An Improved YOLOv8-Based Underwater Object Detection Algorithm. IEEE Access 2024, 12, 172186–172195. [Google Scholar] [CrossRef]
Liu, C.; Li, H.; Wang, S.; Zhu, M.; Wang, D.; Fan, X.; Wang, Z. A dataset and benchmark of underwater object detection for robot picking. In Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), Shenzhen, China, 5–9 July 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6. [Google Scholar]
Zhu, J.-Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2242–2251. [Google Scholar]
Zhang, Y.; Wu, K.; Fang, K. Comprehensive Performance Evaluation of YOLOv11: Advancements, Benchmarks, and Real-World Applications. arXiv 2024, arXiv:2411.18871. [Google Scholar]
Tang, H.; Liu, H.; Xu, D.; Torr, P.H.S.; Sebe, N. AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 1972–1987. [Google Scholar] [CrossRef]
Hou, M.; He, X.; Dou, F.; Zhang, X.; Guo, Z.; Feng, Z. Semi—Supervised image super-resolution with attention CycleGAN. IET Image Process. 2022, 16, 1181–1193. [Google Scholar] [CrossRef]
Wang, X.; Yu, K.; Wu, S.; Gu, J.; Liu, Y.; Dong, C.; Qiao, Y.; Change Loy, C. Esrgan: Enhanced super-resolution generative adversarial networks. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops, Munich, Germany, 8–14 September 2018. [Google Scholar]
Wei, Z.; Huang, Y.; Chen, Y.; Zheng, C.; Gao, J. A-ESRGAN: Training real-world blind super-resolution with attention U-Net Discriminators. In PRICAI 2023: Trends in Artificial Intelligence, 20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023, Jakarta, Indonesia, 15–19 November 2023; Springer Nature: Singapore, 2023; pp. 16–27. [Google Scholar]
Kanopoulos, N.; Vasanthavada, N.; Baker, R.L. Design of an image edge detection filter using the Sobel operator. IEEE J. Solid-State Circuits 1988, 23, 358–367. [Google Scholar] [CrossRef]
Cooley, J.W.; Tukey, J.W. An algorithm for the machine calculation of complex Fourier series. Math. Comput. 1965, 19, 297–301. [Google Scholar] [CrossRef]
Gonzalez, R.C.; Woods, R.E. Digital Image Processing, 3rd ed.; Pearson: London, UK, 2008. [Google Scholar]
Jiang, L.; Dai, B.; Wu, W.; Loy, C.C. Focal Frequency Loss for Image Reconstruction and Synthesis. In Proceedings of the 2021 Conference on Neural Information Processing Systems, Online, 6–14 December 2021. [Google Scholar]
Kim, M.W.; Cho, N.I. Wavelet-Domain High-Frequency Loss for Perceptual Quality. In Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 2–7 January 2023. [Google Scholar]
Wu, J.; Zhang, G.; Fan, Y. LM-CycleGAN: Improving Underwater Image Quality Through Learned Perceptual Image Patch Similarity and Multi-Scale Adaptive Fusion Attention. Sensors 2024, 24, 7425. [Google Scholar] [CrossRef] [PubMed]
Chen, D.; Kang, F.; Li, J.; Zhu, S.; Liang, X. Enhancement of underwater dam crack images using a multi-feature CycleGAN. Autom. Constr. 2024, 167, 105727. [Google Scholar] [CrossRef]
Powers, D.M.W. Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. J. Mach. Learn. Technol. 2011, 2, 37–63. [Google Scholar]
Everingham, M.; Van Gool, L.; Williams, C.K.I.; Winn, J.; Zisserman, A. The PASCAL Visual Object Classes (VOC) Challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef]
Yacouby, R.; Axman, D. Probabilistic extension of precision, recall, and f1 score for more thorough evaluation of classification models. In Proceedings of the First Workshop on Evaluation and Comparison of NLP Systems, Online, November 2020; pp. 79–91. Available online: https://aclanthology.org/2020.eval4nlp-1.9/ (accessed on 9 September 2025).
Zhang, W.; Wu, X.; Xiang, Y. Edge enhanced underwater CrackNet for dam underwater crack detection. In Proceedings of the International Symposium “Common Challenges, Shared Future, Better Dams,” ICOLD-CIGB 2025, Chengdu, China, 16–23 May 2025; International Commission on Large Dams: Paris, France, 2025. [Google Scholar]

Figure 1. The architecture of the proposed model: (a) structural diagram of E²UCN; (b) structural diagram of CGBUIE.

Figure 2. The architecture of YOLOv11.

Figure 3. Schematic diagram of the construction layout of the test platform and the cracked wall. Specially, R1–R6 represent six different types of cracks with varying orientations.

Figure 4. Image enhancement results, where (a) is the original image, (b) is the image generated using the original CycleGAN model, (c) is the image generated after using SobelLoss, (d) is the image obtained using high-frequency loss, and (e) is the image obtained using SobelLoss and high-frequency loss.

Figure 5. Loss function curve and performance index variation curve after training YOLOv11 on the Crack-Seg dataset.

Figure 6. Target detection results, where (a) is the ground truth, (b) is the original image detection results, (c) is the images generated by the original CycleGAN model detection results, (d) is the images generated by SobelLoss detection results, (e) is the images generated by high-frequency loss detection results, and (f) is the images generated by SobelLoss and high-frequency loss detection results.

Figure 7. Target detection results, where (a) is the ground truth, (b) is the YOLOv11 results, (c) is the YOLOv5 results, and (d) is the YOLOv8 results.

Table 1. Ablation experiment module design and evaluation metrics.

CycleGAN	SobelLoss	HFLoss	Detection Evaluation Indicators
CycleGAN	SobelLoss	HFLoss	Precision	Recall	F1-Score	mAP50	mAP50-95
			0.93	0.875	0.902	0.876	0.565
√			0.994	1	0.930	0.982	0.624
√	√		0.869	1	0.996	0.991	0.548
√		√	0.993	1	0.997	0.993	0.72
√	√	√	0.995	1	0.997	0.995	0.732

Table 2. Comparative experiments and corresponding evaluation metrics. The highest score of each evaluation metric is highlighted in bold.

Model Name	Detection Evaluation Indicators
	Precision	Recall	F1-Score	mAP50	mAP50-95
YOLOv5	0.91	1	0.953	0.876	0.752
YOLOv8	1	0.995	0.997	0.995	0.685
YOLOv11	0.995	1	0.998	0.995	0.732

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wu, X.; Zhang, W.; Shen, G.; Sheng, J. Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams. Appl. Sci. 2025, 15, 10326. https://doi.org/10.3390/app151910326

AMA Style

Wu X, Zhang W, Shen G, Sheng J. Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams. Applied Sciences. 2025; 15(19):10326. https://doi.org/10.3390/app151910326

Chicago/Turabian Style

Wu, Xiaobian, Weibo Zhang, Guangze Shen, and Jinbao Sheng. 2025. "Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams" Applied Sciences 15, no. 19: 10326. https://doi.org/10.3390/app151910326

APA Style

Wu, X., Zhang, W., Shen, G., & Sheng, J. (2025). Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams. Applied Sciences, 15(19), 10326. https://doi.org/10.3390/app151910326

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams^†

Abstract

Featured Application

Abstract

1. Introduction

2. Proposed Method

2.1. CycleGAN-Based Underwater Image Enhancement Model

2.2. YOLOv11 Model

3. Experimental Analysis

3.1. Data Collection and Experimental Setup

3.2. Experimental Results and Analysis

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams †

Abstract

Featured Application

Abstract

1. Introduction

2. Proposed Method

2.1. CycleGAN-Based Underwater Image Enhancement Model

2.2. YOLOv11 Model

3. Experimental Analysis

3.1. Data Collection and Experimental Setup

3.2. Experimental Results and Analysis

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Edge-Enhanced CrackNet for Underwater Crack Detection in Concrete Dams^†