DIDW-YOLOv11: The Steel Surface Defect Detection Method Based on Improved YOLOv11 Network
Abstract
1. Introduction
- The backbone network creatively adopts the C3k2-DIMB module as the core feature extraction component, in which the DIMB module employs two different scales of DynamicInceptionDWConv2d, which enhance the adaptive extraction capability of complex defect features;
- The IDWFSPPF module is designed to replace the traditional SPPF module, which uses average pooling to assist max pooling and enhances the model’s ability to fuse local and global feature information;
- An auxiliary detection head (ADH) is used to optimize the original detection head. The ADH is composed of an Anchor Free and an Aux Head. By utilizing the auxiliary head to detect shallow feature information in advance and combining the shallow features with the deep features, the auxiliary detection head is very helpful in effectively mitigating noise interference and the risk of overfitting.
2. Related Works
3. Proposed Model
3.1. DIDW-YOLOv11 Architecture Overview
- Backbone. The backbone performs multi-scale feature extraction, which creatively adopts C3k2-DIMB as the primary feature extraction module. We propose the C3k2-DIMB module by combining the Dynamic Inception Mixer Block (DIMB) constructed with DynamicInceptionDWConv2d and the traditional C3k2 module, which allows the model to perform dynamically adaptive depthwise convolution operations based on the distribution characteristics of input features, while efficiently integrating multi-scale and multi-directional defect feature information. The DynamicInceptionDWConv2d module combines depthwise separable convolutions with dynamic kernel weight adjustment mechanisms, which can reduce the number of parameters and improve computational efficiency compared to traditional full-channel convolutions;
- Neck. The neck adopts a Path Aggregation Network structure, incorporating four C3k2-DIMBs for cross-scale feature fusion. The C3k2-DIMB integrates DynamicInceptionDWConv2d equipped with three parallel depthwise kernels and a dynamic weight adjustment mechanism, addressing the insufficient feature extraction depth of the original C3k2 module. Additionally, the backbone embeds the IDWFSPPF module with mixed pooling strategy to improve the traditional SPPF, enriching multi-scale feature representation while retaining fine-grained information of small targets;
- Head. An auxiliary detection head (ADH) is introduced with multi-loss supervision in order to enhance the model’s anti-interference ability against local noise and its dependence on redundant features, which integrates Anchor Free for deep semantic features, and Aux Head for shallow detail features. The total loss of ADH is computed as a weighted combination of coarse loss and fine loss, optimizing the way the head processes feature maps.
3.2. The C3k2_DIMB Module
3.2.1. The Structure of C3k2_DIMB
3.2.2. The DIMB Module
3.3. IDWFSPPF Module
3.4. Auxiliary Detection Head Module
4. Experiments
4.1. Data Collection and Data Preprocessing
4.2. Implementation Details
4.3. Evaluation Metric
4.4. Ablation Experiments
4.5. Comparative Experiments
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wang, X.; Wang, Z.; Guo, C.; Han, Y.; Zhao, J.; Lu, N.; Tang, H. Application and Prospect of New Steel Corrugated Plate Technology in Infrastructure Fields. IOP Conf. Ser. Mater. Sci. Eng. 2020, 741, 012099. [Google Scholar] [CrossRef]
- Xiong, Z.; Li, Q.; Mao, Q.; Zou, Q. A 3D Laser Profiling System for Rail Surface Defect Detection. Sensors 2017, 17, 1791. [Google Scholar] [CrossRef]
- Amin, D.; Akhter, S. Deep learning-based defect detection system in steel sheet surfaces. In 2020 IEEE Region 10 Symposium (TENSYMP); IEEE: New York, NY, USA, 2020; pp. 444–448. [Google Scholar]
- Luo, S.; Hou, J.; Zheng, B.; Zhong, X.; Liu, P. Research on edge detection algorithm of work piece defect in machine vision detection system. In 2022 IEEE 6th Information Technology and Mechatronics Engineering Conference (ITOEC); IEEE: New York, NY, USA, 2022; pp. 1231–1235. [Google Scholar]
- Gaidhane, V.H.; Rani, A.; Singh, V. An improved edge detection approach and its application in defect detection. IOP Conf. Ser. Mater. Sci. Eng. 2017, 244, 012017. [Google Scholar] [CrossRef]
- Cao, G.; Ruan, S.; Peng, Y.; Huang, S.; Kwok, N. Large-complex-surface defect detection by hybrid gradient threshold segmentation and image registration. IEEE Access 2018, 6, 36235–36246. [Google Scholar] [CrossRef]
- Neogi, N.; Mohanta, D.K.; Dutta, P.K. Defect detection of steel surfaces with global adaptive percentile thresholding of gradient image. J. Inst. Eng. Ser. B 2017, 98, 557–565. [Google Scholar] [CrossRef]
- Wang, H.; Gu, J.; Wang, S. An effective intrusion detection framework based on SVM with feature augmentation. Knowl.-Based Syst. 2017, 136, 130–139. [Google Scholar] [CrossRef]
- Zhiqiang, W.; Jun, L. A review of object detection based on convolutional neural network. In 2017 36th Chinese Control Conference (CCC); IEEE: New York, NY, USA, 2017; pp. 11104–11109. [Google Scholar]
- Pernkopf, F. Detection of surface defects on raw steel blocks using Bayesian network classifiers. Pattern Anal. Appl. 2004, 7, 333–342. [Google Scholar] [CrossRef]
- Zhao, C.; Shu, X.; Yan, X.; Zuo, X.; Zhu, F. RDD-YOLO: A modified YOLO for detection of steel surface defects. Measurement 2023, 214, 112776. [Google Scholar] [CrossRef]
- Girshick, R. Fast r-cnn. In 2015 IEEE International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2015; pp. 1440–1448. [Google Scholar]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask r-cnn. In 2017 IEEE International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2017; pp. 2961–2969. [Google Scholar]
- Akhyar, F.; Lin, C.Y.; Muchtar, K.; Wu, T.Y.; Ng, H.F. High efficient single-stage steel surface defect detection. In 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS); IEEE: New York, NY, USA, 2019; pp. 1–4. [Google Scholar]
- Feng, X.; Gao, X.; Luo, L. X-SDD: A new benchmark for hot rolled steel strip surface defects detection. Symmetry 2021, 13, 706. [Google Scholar] [CrossRef]
- Liu, X.; Gao, J. Surface defect detection method of hot rolling strip based on improved SSD model. In Database Systems for Advanced Applications. DASFAA 2021 International Workshops; Springer International Publishing: Cham, Switzerland, 2021; pp. 209–222. [Google Scholar]
- Yang, Z.; Liu, Y. A steel surface defect detection method based on improved RetinaNet. Sci. Rep. 2025, 15, 6045. [Google Scholar] [CrossRef] [PubMed]
- Sharma, M.; Lim, J.-T.; Chae, Y.-G. Steel Surface Defect Detection Using the RetinaNet Detection Model. Int. J. Internet Broadcast. Commun. 2022, 14, 136–146. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2016; pp. 779–788. [Google Scholar]
- Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar] [CrossRef]
- Bochkovskiy, A.; Wang, C.Y.; Liao, H.Y.M. Yolov4: Optimal speed and accuracy of object detection. arXiv 2020, arXiv:2004.10934. [Google Scholar] [CrossRef]
- Li, M.; Wang, H.; Wan, Z. Surface defect detection of steel strips based on improved YOLOv4. Comput. Electr. Eng. 2022, 102, 108208. [Google Scholar] [CrossRef]
- Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976. [Google Scholar] [CrossRef]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2023; pp. 7464–7475. [Google Scholar]
- Reis, D.; Kupec, J.; Hong, J.; Daoudi, A. Real-time flying object detection with YOLOv8. arXiv 2023, arXiv:2305.09972. [Google Scholar]
- Wang, C.Y.; Yeh, I.H.; Mark Liao, H.Y. Yolov9: Learning what you want to learn using programmable gradient information. In Computer Vision—ECCV 2024; Springer Nature: Cham, Switzerland, 2024; pp. 1–21. [Google Scholar]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. Yolov10: Real-time end-to-end object detection. Adv. Neural Inf. Process. Syst. 2024, 37, 107984–108011. [Google Scholar]
- Khanam, R.; Hussain, M. Yolov11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J. Yolox: Exceeding yolo series in 2021. arXiv 2021, arXiv:2107.08430. [Google Scholar] [CrossRef]
- Wen, X.; Shan, J.; He, Y.; Song, K. Steel surface defect recognition: A survey. Coatings 2022, 13, 17. [Google Scholar] [CrossRef]
- Zhao, W.; Chen, F.; Huang, H.; Li, D.; Cheng, W. A new steel defect detection algorithm based on deep learning. Comput. Intell. Neurosci. 2021, 2021, 5592878. [Google Scholar] [CrossRef]
- Wang, L.; Liu, X.; Ma, J.; Su, W.; Li, H. Real-time steel surface defect detection with improved multi-scale YOLO-v5. Processes 2023, 11, 1357. [Google Scholar] [CrossRef]
- Song, H. RSTD-YOLOv7: A steel surface defect detection based on improved YOLOv7. Sci. Rep. 2025, 15, 19649. [Google Scholar] [CrossRef] [PubMed]
- Song, X.; Cao, S.; Zhang, J.; Hou, Z. Steel surface defect detection algorithm based on YOLOv8. Electronics 2024, 13, 988. [Google Scholar] [CrossRef]
- Zheng, T.; Yu, L.; Shi, Y.; Niu, F. A lightweight steel surface defect detection network based on YOLOv9. AIP Adv. 2025, 15, 055317. [Google Scholar] [CrossRef]
- Haoyan, H.; Jinwu, T.; Haibin, W.; Xinyun, L. Ead-yolov10: Lightweight steel surface defect detection algorithm research based on yolov10 improvement. IEEE Access 2025, 13, 55382–55397. [Google Scholar] [CrossRef]
- Yang, L.; Li, Z.; Hu, X.; Shao, M.; Zhao, Y.; Zhou, C. CTC-YOLO: An improved YOLOv11 algorithm for steel surface defect detection. Eng. Res. Express 2025, 7, 035265. [Google Scholar] [CrossRef]
- Dang, Z.; Wang, X. FD-YQA0L011: A Feature-Enhanced Deep Learning Model for Steel Surface Defect Detection. IEEE Access 2025, 13, 63981–63993. [Google Scholar] [CrossRef]
- Huang, B.; Wang, M. Steel Surface Defect Detection Algorithm Based on DM-YOLOv11. In 2025 5th International Conference on Artificial Intelligence and Industrial Technology Applications (AIITA); IEEE: New York, NY, USA, 2025; pp. 848–853. [Google Scholar]
- Liang, E. Steel surface defect detection algorithm based on improved YOLO11. In ISCCN ‘25: Proceedings of the 2025 4th International Conference on Intelligent Systems, Communications and Computer Networks; Association for Computing Machinery: New York, NY, USA, 2025; pp. 185–191. [Google Scholar]
- Zong, G.; Shan, W. Mcfd: A Multi-Backbone Dynamic Convolution Fusion and Focus Diffusion Pyramid Network for Infrared Small Target Detection. Available online: https://ssrn.com/abstract=5266815 (accessed on 2 June 2026).









| Defect Class | Images | Detects |
|---|---|---|
| Crazing | 300 | 689 |
| Patches | 300 | 881 |
| Inclusion | 300 | 1011 |
| Pitted surface | 300 | 432 |
| Rolled-in scale | 300 | 628 |
| Scratches | 300 | 548 |
| Total | 1800 | 4189 |
| Model | YOLO11s | C3k2-DIMB | IDWFSPPF | ADH | P (%) | R (%) | mAP@0.5 (%) | FPS |
|---|---|---|---|---|---|---|---|---|
| 1 | √ | 71.5 | 70.7 | 76.6 | 124 | |||
| 2 | √ | √ | 76.6 | 71.1 | 77.9 | 93 | ||
| 3 | √ | √ | 70.6 | 73.2 | 78.3 | 127 | ||
| 4 | √ | √ | 80.5 | 71.1 | 79.5 | 151 | ||
| 5 | √ | √ | √ | 70.6 | 73.4 | 78.5 | 93 | |
| 6 | √ | √ | √ | 78.3 | 74.8 | 79.9 | 83 | |
| 7 | √ | √ | √ | 73.1 | 73.2 | 78.0 | 130 | |
| 8 | √ | √ | √ | √ | 77.7 | 75.6 | 81.5 | 118 |
| YOLO11s | C3k2-DIMB | IDWFSPPF | ADH | mAP@0.5 (%) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Cr | In | Pa | Ps | Rs | Sc | ||||
| √ | 40.5 | 85.9 | 91.0 | 80.3 | 67.1 | 94.8 | |||
| √ | √ | 33.9 | 87.3 | 97.4 | 95.4 | 61.5 | 92.2 | ||
| √ | √ | 41.6 | 80.7 | 94.0 | 88.9 | 69.4 | 95.3 | ||
| √ | √ | 40.5 | 86.1 | 96.3 | 93.2 | 64.3 | 96.4 | ||
| √ | √ | √ | √ | 44.2 | 87.2 | 97.6 | 93.9 | 70.2 | 95.7 |
| Experiments | P (%) | R (%) | mAP@0.5 (%) | FPS |
|---|---|---|---|---|
| Faster-RCNN | 70.9 | 71.6 | 76.0 | 18 |
| SSD | 75.6 | 66.3 | 74.1 | 89 |
| YOLOv5s | 66.8 | 71.3 | 73.8 | 116 |
| YOLOv7 | 71.7 | 70.7 | 76.5 | 64 |
| YOLOv8 | 67.7 | 68.8 | 74.2 | 113 |
| YOLOv9 | 68.1 | 73.1 | 75.5 | 109 |
| YOLOv10 | 70.2 | 70.5 | 75.9 | 102 |
| YOLOv11 | 71.8 | 70.5 | 76.6 | 124 |
| FD-YOLO11 | 74.0 | 77.7 | 81.1 | 110 |
| DIDW-YOLOv11 (ours) | 77.7 | 75.6 | 81.5 | 118 |
| Experiments | P (%) | R (%) | mAP@0.5 (%) | FPS |
|---|---|---|---|---|
| Faster-RCNN | 60.2 | 60.6 | 64.8 | 32 |
| SSD | 56.0 | 59.6 | 57.1 | 138 |
| YOLOv5s | 62.1 | 58.0 | 68.6 | 179 |
| YOLOv7 | 66.6 | 42.5 | 63.6 | 86 |
| YOLOv8s | 61.6 | 59.4 | 68.2 | 185 |
| YOLOv9s | 70.9 | 54.3 | 67.4 | 177 |
| YOLOv10s | 61.0 | 59.1 | 64.6 | 165 |
| YOLOv11s | 69.4 | 53.7 | 67.2 | 183 |
| FD-YOLOv11 | 68.9 | 68.7 | 71.3 | 161 |
| DIDW-YOLOv11 (ours) | 68.0 | 69.2 | 72.0 | 159 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Jiang, J.; Zhang, Y.; Xue, Z.; Wang, C. DIDW-YOLOv11: The Steel Surface Defect Detection Method Based on Improved YOLOv11 Network. Electronics 2026, 15, 2593. https://doi.org/10.3390/electronics15122593
Jiang J, Zhang Y, Xue Z, Wang C. DIDW-YOLOv11: The Steel Surface Defect Detection Method Based on Improved YOLOv11 Network. Electronics. 2026; 15(12):2593. https://doi.org/10.3390/electronics15122593
Chicago/Turabian StyleJiang, Jiajun, Yaodan Zhang, Ziyang Xue, and Chuzheng Wang. 2026. "DIDW-YOLOv11: The Steel Surface Defect Detection Method Based on Improved YOLOv11 Network" Electronics 15, no. 12: 2593. https://doi.org/10.3390/electronics15122593
APA StyleJiang, J., Zhang, Y., Xue, Z., & Wang, C. (2026). DIDW-YOLOv11: The Steel Surface Defect Detection Method Based on Improved YOLOv11 Network. Electronics, 15(12), 2593. https://doi.org/10.3390/electronics15122593
